我在应用程序引擎中使用Alchemy API,所以在使用simplejson库来解析响应。问题是响应中包含具有sme名称的条目
{ "status": "OK", "usage": "By accessing AlchemyAPI or using information generated by AlchemyAPI, you are agreeing to be bound by the AlchemyAPI Terms of Use: http://www.alchemyapi.com/company/terms.html", "url": "", "language": "english", "entities": [ { "type": "Person", "relevance": "0.33", "count": "1", "text": "Michael Jordan", "disambiguated": { "name": "Michael Jordan", "subType": "Athlete", "subType": "AwardWinner", "subType": "BasketballPlayer", "subType": "HallOfFameInductee", "subType": "OlympicAthlete", "subType": "SportsLeagueAwardWinner", "subType": "FilmActor", "subType": "TVActor", "dbpedia": "http://dbpedia.org/resource/Michael_Jordan", "freebase": "http://rdf.freebase.com/ns/guid.9202a8c04000641f8000000000029161", "umbel": "http://umbel.org/umbel/ne/wikipedia/Michael_Jordan", "opencyc": "http://sw.opencyc.org/concept/Mx4rvViVq5wpEbGdrcN5Y29ycA", "yago": "http://mpii.de/yago/resource/Michael_Jordan" } } ] }
因此,问题在于重复了“ subType”,因此加载返回的指令只是“ TVActor”,而不是列表。无论如何要解决这个问题?
定义的RFC 4627application/json表示:
application/json
An object is an unordered collection of zero or more name/value pairs
和:
The names within an object SHOULD be unique.
这意味着AlchemyAPI不应"subType"在同一个对象内返回多个名称,并声称它是JSON。
"subType"
您可以尝试以XML格式(outputMode=xml)请求相同的内容,以避免结果出现歧义或将重复的键值转换为列表:
outputMode=xml
import simplejson as json from collections import defaultdict def multidict(ordered_pairs): """Convert duplicate keys values to lists.""" # read all values into lists d = defaultdict(list) for k, v in ordered_pairs: d[k].append(v) # unpack lists that have only 1 item for k, v in d.items(): if len(v) == 1: d[k] = v[0] return dict(d) print json.JSONDecoder(object_pairs_hook=multidict).decode(text)
text = """{ "type": "Person", "subType": "Athlete", "subType": "AwardWinner" }"""
{u'subType': [u'Athlete', u'AwardWinner'], u'type': u'Person'}