how can i decode unicode string in python? -
wikipedia api encodes string unicode format
"golden globe award best motion picture \u2013 drama" how can convert to
"golden globe award best motion picture – drama" 
the wikipedia api returns json data, use json module decode:
json.loads(inputstring) demo:
>>> import json >>> print json.loads('"golden globe award best motion picture \u2013 drama"') golden globe award best motion picture – drama if instead have string starts u'', have python unicode value , looking @ representation of string:
>>> json.loads('"golden globe award best motion picture \u2013 drama"') u'golden globe award best motion picture \u2013 drama' just print value have python encode terminal codec , represent em-dash character in format terminal understand.
you may want read python , unicode , encodings before continue, if not understand difference between unicode value , byte strings:
Comments
Post a Comment