blob: a791259831788681390a446995c11a85b43b1968 [file] [log] [blame]
Christian Heimes90540002008-05-08 14:29:10 +00001:mod:`json` --- JSON encoder and decoder
2========================================
3
4.. module:: json
5 :synopsis: Encode and decode the JSON format.
6.. moduleauthor:: Bob Ippolito <bob@redivi.com>
7.. sectionauthor:: Bob Ippolito <bob@redivi.com>
Christian Heimes90540002008-05-08 14:29:10 +00008
Benjamin Petersonf3352e72011-02-27 15:15:06 +00009`JSON (JavaScript Object Notation) <http://json.org>`_ is a subset of JavaScript
Christian Heimes90540002008-05-08 14:29:10 +000010syntax (ECMA-262 3rd edition) used as a lightweight data interchange format.
11
12:mod:`json` exposes an API familiar to users of the standard library
13:mod:`marshal` and :mod:`pickle` modules.
14
15Encoding basic Python object hierarchies::
Georg Brandl48310cd2009-01-03 21:18:54 +000016
Christian Heimes90540002008-05-08 14:29:10 +000017 >>> import json
18 >>> json.dumps(['foo', {'bar': ('baz', None, 1.0, 2)}])
19 '["foo", {"bar": ["baz", null, 1.0, 2]}]'
Neal Norwitz752abd02008-05-13 04:55:24 +000020 >>> print(json.dumps("\"foo\bar"))
Christian Heimes90540002008-05-08 14:29:10 +000021 "\"foo\bar"
Benjamin Peterson2505bc62008-05-15 02:17:58 +000022 >>> print(json.dumps('\u1234'))
Christian Heimes90540002008-05-08 14:29:10 +000023 "\u1234"
Neal Norwitz752abd02008-05-13 04:55:24 +000024 >>> print(json.dumps('\\'))
Christian Heimes90540002008-05-08 14:29:10 +000025 "\\"
Neal Norwitz752abd02008-05-13 04:55:24 +000026 >>> print(json.dumps({"c": 0, "b": 0, "a": 0}, sort_keys=True))
Christian Heimes90540002008-05-08 14:29:10 +000027 {"a": 0, "b": 0, "c": 0}
Benjamin Peterson2505bc62008-05-15 02:17:58 +000028 >>> from io import StringIO
Christian Heimes90540002008-05-08 14:29:10 +000029 >>> io = StringIO()
30 >>> json.dump(['streaming API'], io)
31 >>> io.getvalue()
32 '["streaming API"]'
33
34Compact encoding::
35
36 >>> import json
Éric Araujode579d42011-04-21 02:37:41 +020037 >>> json.dumps([1,2,3,{'4': 5, '6': 7}], separators=(',', ':'))
Christian Heimes90540002008-05-08 14:29:10 +000038 '[1,2,3,{"4":5,"6":7}]'
39
40Pretty printing::
41
42 >>> import json
Neal Norwitz752abd02008-05-13 04:55:24 +000043 >>> print(json.dumps({'4': 5, '6': 7}, sort_keys=True, indent=4))
Christian Heimes90540002008-05-08 14:29:10 +000044 {
Georg Brandl48310cd2009-01-03 21:18:54 +000045 "4": 5,
Christian Heimes90540002008-05-08 14:29:10 +000046 "6": 7
47 }
48
49Decoding JSON::
Georg Brandl48310cd2009-01-03 21:18:54 +000050
Christian Heimes90540002008-05-08 14:29:10 +000051 >>> import json
52 >>> json.loads('["foo", {"bar":["baz", null, 1.0, 2]}]')
Benjamin Peterson2505bc62008-05-15 02:17:58 +000053 ['foo', {'bar': ['baz', None, 1.0, 2]}]
Christian Heimes90540002008-05-08 14:29:10 +000054 >>> json.loads('"\\"foo\\bar"')
Benjamin Peterson2505bc62008-05-15 02:17:58 +000055 '"foo\x08ar'
56 >>> from io import StringIO
Christian Heimes90540002008-05-08 14:29:10 +000057 >>> io = StringIO('["streaming API"]')
58 >>> json.load(io)
Benjamin Peterson2505bc62008-05-15 02:17:58 +000059 ['streaming API']
Christian Heimes90540002008-05-08 14:29:10 +000060
61Specializing JSON object decoding::
62
63 >>> import json
64 >>> def as_complex(dct):
65 ... if '__complex__' in dct:
66 ... return complex(dct['real'], dct['imag'])
67 ... return dct
Benjamin Peterson2505bc62008-05-15 02:17:58 +000068 ...
Christian Heimes90540002008-05-08 14:29:10 +000069 >>> json.loads('{"__complex__": true, "real": 1, "imag": 2}',
70 ... object_hook=as_complex)
71 (1+2j)
72 >>> import decimal
73 >>> json.loads('1.1', parse_float=decimal.Decimal)
74 Decimal('1.1')
75
76Extending :class:`JSONEncoder`::
Georg Brandl48310cd2009-01-03 21:18:54 +000077
Christian Heimes90540002008-05-08 14:29:10 +000078 >>> import json
79 >>> class ComplexEncoder(json.JSONEncoder):
80 ... def default(self, obj):
81 ... if isinstance(obj, complex):
82 ... return [obj.real, obj.imag]
83 ... return json.JSONEncoder.default(self, obj)
Benjamin Peterson2505bc62008-05-15 02:17:58 +000084 ...
Georg Brandl0bb73b82010-09-03 22:36:22 +000085 >>> json.dumps(2 + 1j, cls=ComplexEncoder)
Christian Heimes90540002008-05-08 14:29:10 +000086 '[2.0, 1.0]'
87 >>> ComplexEncoder().encode(2 + 1j)
88 '[2.0, 1.0]'
89 >>> list(ComplexEncoder().iterencode(2 + 1j))
Georg Brandl0bb73b82010-09-03 22:36:22 +000090 ['[2.0', ', 1.0', ']']
Georg Brandl48310cd2009-01-03 21:18:54 +000091
Christian Heimes90540002008-05-08 14:29:10 +000092
93.. highlight:: none
94
95Using json.tool from the shell to validate and pretty-print::
Georg Brandl48310cd2009-01-03 21:18:54 +000096
Christian Heimes90540002008-05-08 14:29:10 +000097 $ echo '{"json":"obj"}' | python -mjson.tool
98 {
99 "json": "obj"
100 }
101 $ echo '{ 1.2:3.4}' | python -mjson.tool
102 Expecting property name: line 1 column 2 (char 2)
103
104.. highlight:: python
105
Georg Brandl48310cd2009-01-03 21:18:54 +0000106.. note::
Christian Heimes90540002008-05-08 14:29:10 +0000107
108 The JSON produced by this module's default settings is a subset of
109 YAML, so it may be used as a serializer for that as well.
110
111
112Basic Usage
113-----------
114
Georg Brandlcd7f32b2009-06-08 09:13:45 +0000115.. function:: dump(obj, fp, skipkeys=False, ensure_ascii=True, check_circular=True, allow_nan=True, cls=None, indent=None, separators=None, default=None, **kw)
Christian Heimes90540002008-05-08 14:29:10 +0000116
117 Serialize *obj* as a JSON formatted stream to *fp* (a ``.write()``-supporting
118 file-like object).
119
120 If *skipkeys* is ``True`` (default: ``False``), then dict keys that are not
Antoine Pitrou00d650b2011-01-21 21:37:32 +0000121 of a basic type (:class:`str`, :class:`int`, :class:`float`, :class:`bool`,
122 ``None``) will be skipped instead of raising a :exc:`TypeError`.
Christian Heimes90540002008-05-08 14:29:10 +0000123
Benjamin Petersonc6b607d2009-05-02 12:36:44 +0000124 The :mod:`json` module always produces :class:`str` objects, not
125 :class:`bytes` objects. Therefore, ``fp.write()`` must support :class:`str`
126 input.
127
Éric Araujo6f7aa002012-01-16 10:09:20 +0100128 If *ensure_ascii* is ``True`` (the default), the output is guaranteed to
129 have all incoming non-ASCII characters escaped. If *ensure_ascii* is
130 ``False``, these characters will be output as-is.
131
Christian Heimes90540002008-05-08 14:29:10 +0000132 If *check_circular* is ``False`` (default: ``True``), then the circular
133 reference check for container types will be skipped and a circular reference
134 will result in an :exc:`OverflowError` (or worse).
135
136 If *allow_nan* is ``False`` (default: ``True``), then it will be a
137 :exc:`ValueError` to serialize out of range :class:`float` values (``nan``,
138 ``inf``, ``-inf``) in strict compliance of the JSON specification, instead of
139 using the JavaScript equivalents (``NaN``, ``Infinity``, ``-Infinity``).
140
Raymond Hettingerb643ef82010-10-31 08:00:16 +0000141 If *indent* is a non-negative integer or string, then JSON array elements and
142 object members will be pretty-printed with that indent level. An indent level
R David Murrayd5315482011-04-12 21:09:18 -0400143 of 0, negative, or ``""`` will only insert newlines. ``None`` (the default)
144 selects the most compact representation. Using a positive integer indent
145 indents that many spaces per level. If *indent* is a string (such at '\t'),
146 that string is used to indent each level.
Christian Heimes90540002008-05-08 14:29:10 +0000147
148 If *separators* is an ``(item_separator, dict_separator)`` tuple, then it
149 will be used instead of the default ``(', ', ': ')`` separators. ``(',',
150 ':')`` is the most compact JSON representation.
151
Christian Heimes90540002008-05-08 14:29:10 +0000152 *default(obj)* is a function that should return a serializable version of
153 *obj* or raise :exc:`TypeError`. The default simply raises :exc:`TypeError`.
154
Georg Brandl1f01deb2009-01-03 22:47:39 +0000155 To use a custom :class:`JSONEncoder` subclass (e.g. one that overrides the
Christian Heimes90540002008-05-08 14:29:10 +0000156 :meth:`default` method to serialize additional types), specify it with the
Georg Brandld4460aa2010-10-15 17:03:02 +0000157 *cls* kwarg; otherwise :class:`JSONEncoder` is used.
Christian Heimes90540002008-05-08 14:29:10 +0000158
159
Georg Brandlcd7f32b2009-06-08 09:13:45 +0000160.. function:: dumps(obj, skipkeys=False, ensure_ascii=True, check_circular=True, allow_nan=True, cls=None, indent=None, separators=None, default=None, **kw)
Christian Heimes90540002008-05-08 14:29:10 +0000161
Benjamin Petersonc6b607d2009-05-02 12:36:44 +0000162 Serialize *obj* to a JSON formatted :class:`str`. The arguments have the
163 same meaning as in :func:`dump`.
Christian Heimes90540002008-05-08 14:29:10 +0000164
Ezio Melotti60adf952011-04-15 07:37:00 +0300165 .. note::
166
Georg Brandl340d2692011-04-16 16:54:15 +0200167 Unlike :mod:`pickle` and :mod:`marshal`, JSON is not a framed protocol,
168 so trying to serialize multiple objects with repeated calls to
169 :func:`dump` using the same *fp* will result in an invalid JSON file.
170
Christian Heimes90540002008-05-08 14:29:10 +0000171
Georg Brandlcd7f32b2009-06-08 09:13:45 +0000172.. function:: load(fp, cls=None, object_hook=None, parse_float=None, parse_int=None, parse_constant=None, object_pairs_hook=None, **kw)
Christian Heimes90540002008-05-08 14:29:10 +0000173
174 Deserialize *fp* (a ``.read()``-supporting file-like object containing a JSON
175 document) to a Python object.
176
Christian Heimes90540002008-05-08 14:29:10 +0000177 *object_hook* is an optional function that will be called with the result of
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000178 any object literal decoded (a :class:`dict`). The return value of
Christian Heimes90540002008-05-08 14:29:10 +0000179 *object_hook* will be used instead of the :class:`dict`. This feature can be used
180 to implement custom decoders (e.g. JSON-RPC class hinting).
181
Raymond Hettinger9b8d0692009-04-21 03:27:12 +0000182 *object_pairs_hook* is an optional function that will be called with the
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000183 result of any object literal decoded with an ordered list of pairs. The
Raymond Hettinger9b8d0692009-04-21 03:27:12 +0000184 return value of *object_pairs_hook* will be used instead of the
185 :class:`dict`. This feature can be used to implement custom decoders that
186 rely on the order that the key and value pairs are decoded (for example,
187 :func:`collections.OrderedDict` will remember the order of insertion). If
188 *object_hook* is also defined, the *object_pairs_hook* takes priority.
189
190 .. versionchanged:: 3.1
Hirokazu Yamamotoae9eb5c2009-04-26 03:34:06 +0000191 Added support for *object_pairs_hook*.
Raymond Hettinger9b8d0692009-04-21 03:27:12 +0000192
Christian Heimes90540002008-05-08 14:29:10 +0000193 *parse_float*, if specified, will be called with the string of every JSON
194 float to be decoded. By default, this is equivalent to ``float(num_str)``.
195 This can be used to use another datatype or parser for JSON floats
196 (e.g. :class:`decimal.Decimal`).
197
198 *parse_int*, if specified, will be called with the string of every JSON int
199 to be decoded. By default, this is equivalent to ``int(num_str)``. This can
200 be used to use another datatype or parser for JSON integers
201 (e.g. :class:`float`).
202
203 *parse_constant*, if specified, will be called with one of the following
204 strings: ``'-Infinity'``, ``'Infinity'``, ``'NaN'``, ``'null'``, ``'true'``,
205 ``'false'``. This can be used to raise an exception if invalid JSON numbers
206 are encountered.
207
208 To use a custom :class:`JSONDecoder` subclass, specify it with the ``cls``
Georg Brandld4460aa2010-10-15 17:03:02 +0000209 kwarg; otherwise :class:`JSONDecoder` is used. Additional keyword arguments
210 will be passed to the constructor of the class.
Christian Heimes90540002008-05-08 14:29:10 +0000211
212
Georg Brandlcd7f32b2009-06-08 09:13:45 +0000213.. function:: loads(s, encoding=None, cls=None, object_hook=None, parse_float=None, parse_int=None, parse_constant=None, object_pairs_hook=None, **kw)
Christian Heimes90540002008-05-08 14:29:10 +0000214
Antoine Pitrou00d650b2011-01-21 21:37:32 +0000215 Deserialize *s* (a :class:`str` instance containing a JSON document) to a
216 Python object.
Christian Heimes90540002008-05-08 14:29:10 +0000217
Antoine Pitrou00d650b2011-01-21 21:37:32 +0000218 The other arguments have the same meaning as in :func:`load`, except
219 *encoding* which is ignored and deprecated.
Christian Heimes90540002008-05-08 14:29:10 +0000220
221
222Encoders and decoders
223---------------------
224
Georg Brandlcd7f32b2009-06-08 09:13:45 +0000225.. class:: JSONDecoder(object_hook=None, parse_float=None, parse_int=None, parse_constant=None, strict=True, object_pairs_hook=None)
Christian Heimes90540002008-05-08 14:29:10 +0000226
227 Simple JSON decoder.
228
229 Performs the following translations in decoding by default:
230
231 +---------------+-------------------+
232 | JSON | Python |
233 +===============+===================+
234 | object | dict |
235 +---------------+-------------------+
236 | array | list |
237 +---------------+-------------------+
Benjamin Petersonc6b607d2009-05-02 12:36:44 +0000238 | string | str |
Christian Heimes90540002008-05-08 14:29:10 +0000239 +---------------+-------------------+
Georg Brandl639ce962009-04-11 18:18:16 +0000240 | number (int) | int |
Christian Heimes90540002008-05-08 14:29:10 +0000241 +---------------+-------------------+
242 | number (real) | float |
243 +---------------+-------------------+
244 | true | True |
245 +---------------+-------------------+
246 | false | False |
247 +---------------+-------------------+
248 | null | None |
249 +---------------+-------------------+
250
251 It also understands ``NaN``, ``Infinity``, and ``-Infinity`` as their
252 corresponding ``float`` values, which is outside the JSON spec.
253
Christian Heimes90540002008-05-08 14:29:10 +0000254 *object_hook*, if specified, will be called with the result of every JSON
255 object decoded and its return value will be used in place of the given
256 :class:`dict`. This can be used to provide custom deserializations (e.g. to
257 support JSON-RPC class hinting).
258
Raymond Hettinger9b8d0692009-04-21 03:27:12 +0000259 *object_pairs_hook*, if specified will be called with the result of every
260 JSON object decoded with an ordered list of pairs. The return value of
261 *object_pairs_hook* will be used instead of the :class:`dict`. This
262 feature can be used to implement custom decoders that rely on the order
263 that the key and value pairs are decoded (for example,
264 :func:`collections.OrderedDict` will remember the order of insertion). If
265 *object_hook* is also defined, the *object_pairs_hook* takes priority.
266
267 .. versionchanged:: 3.1
Hirokazu Yamamotoae9eb5c2009-04-26 03:34:06 +0000268 Added support for *object_pairs_hook*.
Raymond Hettinger9b8d0692009-04-21 03:27:12 +0000269
Christian Heimes90540002008-05-08 14:29:10 +0000270 *parse_float*, if specified, will be called with the string of every JSON
271 float to be decoded. By default, this is equivalent to ``float(num_str)``.
272 This can be used to use another datatype or parser for JSON floats
273 (e.g. :class:`decimal.Decimal`).
274
275 *parse_int*, if specified, will be called with the string of every JSON int
276 to be decoded. By default, this is equivalent to ``int(num_str)``. This can
277 be used to use another datatype or parser for JSON integers
278 (e.g. :class:`float`).
279
280 *parse_constant*, if specified, will be called with one of the following
281 strings: ``'-Infinity'``, ``'Infinity'``, ``'NaN'``, ``'null'``, ``'true'``,
282 ``'false'``. This can be used to raise an exception if invalid JSON numbers
283 are encountered.
284
Georg Brandld4460aa2010-10-15 17:03:02 +0000285 If *strict* is ``False`` (``True`` is the default), then control characters
286 will be allowed inside strings. Control characters in this context are
287 those with character codes in the 0-31 range, including ``'\t'`` (tab),
288 ``'\n'``, ``'\r'`` and ``'\0'``.
289
Christian Heimes90540002008-05-08 14:29:10 +0000290
291 .. method:: decode(s)
292
Benjamin Petersonc6b607d2009-05-02 12:36:44 +0000293 Return the Python representation of *s* (a :class:`str` instance
294 containing a JSON document)
Christian Heimes90540002008-05-08 14:29:10 +0000295
296 .. method:: raw_decode(s)
297
Benjamin Petersonc6b607d2009-05-02 12:36:44 +0000298 Decode a JSON document from *s* (a :class:`str` beginning with a
299 JSON document) and return a 2-tuple of the Python representation
300 and the index in *s* where the document ended.
Christian Heimes90540002008-05-08 14:29:10 +0000301
302 This can be used to decode a JSON document from a string that may have
303 extraneous data at the end.
304
305
Georg Brandlcd7f32b2009-06-08 09:13:45 +0000306.. class:: JSONEncoder(skipkeys=False, ensure_ascii=True, check_circular=True, allow_nan=True, sort_keys=False, indent=None, separators=None, default=None)
Christian Heimes90540002008-05-08 14:29:10 +0000307
308 Extensible JSON encoder for Python data structures.
309
310 Supports the following objects and types by default:
311
312 +-------------------+---------------+
313 | Python | JSON |
314 +===================+===============+
315 | dict | object |
316 +-------------------+---------------+
317 | list, tuple | array |
318 +-------------------+---------------+
Benjamin Petersonc6b607d2009-05-02 12:36:44 +0000319 | str | string |
Christian Heimes90540002008-05-08 14:29:10 +0000320 +-------------------+---------------+
Georg Brandl639ce962009-04-11 18:18:16 +0000321 | int, float | number |
Christian Heimes90540002008-05-08 14:29:10 +0000322 +-------------------+---------------+
323 | True | true |
324 +-------------------+---------------+
325 | False | false |
326 +-------------------+---------------+
327 | None | null |
328 +-------------------+---------------+
329
330 To extend this to recognize other objects, subclass and implement a
331 :meth:`default` method with another method that returns a serializable object
332 for ``o`` if possible, otherwise it should call the superclass implementation
333 (to raise :exc:`TypeError`).
334
335 If *skipkeys* is ``False`` (the default), then it is a :exc:`TypeError` to
Georg Brandl639ce962009-04-11 18:18:16 +0000336 attempt encoding of keys that are not str, int, float or None. If
Christian Heimes90540002008-05-08 14:29:10 +0000337 *skipkeys* is ``True``, such items are simply skipped.
338
Benjamin Petersonc6b607d2009-05-02 12:36:44 +0000339 If *ensure_ascii* is ``True`` (the default), the output is guaranteed to
340 have all incoming non-ASCII characters escaped. If *ensure_ascii* is
341 ``False``, these characters will be output as-is.
Christian Heimes90540002008-05-08 14:29:10 +0000342
343 If *check_circular* is ``True`` (the default), then lists, dicts, and custom
344 encoded objects will be checked for circular references during encoding to
345 prevent an infinite recursion (which would cause an :exc:`OverflowError`).
346 Otherwise, no such check takes place.
347
348 If *allow_nan* is ``True`` (the default), then ``NaN``, ``Infinity``, and
349 ``-Infinity`` will be encoded as such. This behavior is not JSON
350 specification compliant, but is consistent with most JavaScript based
351 encoders and decoders. Otherwise, it will be a :exc:`ValueError` to encode
352 such floats.
353
Georg Brandl6a74da32010-08-22 20:23:38 +0000354 If *sort_keys* is ``True`` (default ``False``), then the output of dictionaries
Christian Heimes90540002008-05-08 14:29:10 +0000355 will be sorted by key; this is useful for regression tests to ensure that
356 JSON serializations can be compared on a day-to-day basis.
357
358 If *indent* is a non-negative integer (it is ``None`` by default), then JSON
359 array elements and object members will be pretty-printed with that indent
360 level. An indent level of 0 will only insert newlines. ``None`` is the most
361 compact representation.
362
363 If specified, *separators* should be an ``(item_separator, key_separator)``
364 tuple. The default is ``(', ', ': ')``. To get the most compact JSON
365 representation, you should specify ``(',', ':')`` to eliminate whitespace.
366
367 If specified, *default* is a function that gets called for objects that can't
368 otherwise be serialized. It should return a JSON encodable version of the
369 object or raise a :exc:`TypeError`.
370
Christian Heimes90540002008-05-08 14:29:10 +0000371
372 .. method:: default(o)
373
374 Implement this method in a subclass such that it returns a serializable
375 object for *o*, or calls the base implementation (to raise a
376 :exc:`TypeError`).
377
378 For example, to support arbitrary iterators, you could implement default
379 like this::
Georg Brandl48310cd2009-01-03 21:18:54 +0000380
Christian Heimes90540002008-05-08 14:29:10 +0000381 def default(self, o):
382 try:
Benjamin Petersone9bbc8b2008-09-28 02:06:32 +0000383 iterable = iter(o)
Christian Heimes90540002008-05-08 14:29:10 +0000384 except TypeError:
Benjamin Petersone9bbc8b2008-09-28 02:06:32 +0000385 pass
Christian Heimes90540002008-05-08 14:29:10 +0000386 else:
387 return list(iterable)
Georg Brandl0bb73b82010-09-03 22:36:22 +0000388 return json.JSONEncoder.default(self, o)
Christian Heimes90540002008-05-08 14:29:10 +0000389
390
391 .. method:: encode(o)
392
393 Return a JSON string representation of a Python data structure, *o*. For
394 example::
395
Georg Brandl0bb73b82010-09-03 22:36:22 +0000396 >>> json.JSONEncoder().encode({"foo": ["bar", "baz"]})
Christian Heimes90540002008-05-08 14:29:10 +0000397 '{"foo": ["bar", "baz"]}'
398
399
400 .. method:: iterencode(o)
401
402 Encode the given object, *o*, and yield each string representation as
403 available. For example::
Georg Brandl48310cd2009-01-03 21:18:54 +0000404
Georg Brandl0bb73b82010-09-03 22:36:22 +0000405 for chunk in json.JSONEncoder().iterencode(bigobject):
Christian Heimes90540002008-05-08 14:29:10 +0000406 mysocket.write(chunk)