:mod:`json` --- JSON encoder and decoder
========================================

.. module:: json
   :synopsis: Encode and decode the JSON format.
.. moduleauthor:: Bob Ippolito <bob@redivi.com>
.. sectionauthor:: Bob Ippolito <bob@redivi.com>
.. versionadded:: 2.6

`JSON (JavaScript Object Notation) <http://json.org>`_, specified by
:rfc:`4627`, is a lightweight data interchange format based on a subset of
`JavaScript <http://en.wikipedia.org/wiki/JavaScript>`_ syntax (`ECMA-262 3rd
edition <http://www.ecma-international.org/publications/files/ECMA-ST-ARCH/ECMA-262,%203rd%20edition,%20December%201999.pdf>`_).

:mod:`json` exposes an API familiar to users of the standard library
:mod:`marshal` and :mod:`pickle` modules.

Encoding basic Python object hierarchies::

    >>> import json
    >>> json.dumps(['foo', {'bar': ('baz', None, 1.0, 2)}])
    '["foo", {"bar": ["baz", null, 1.0, 2]}]'
    >>> print json.dumps("\"foo\bar")
    "\"foo\bar"
    >>> print json.dumps(u'\u1234')
    "\u1234"
    >>> print json.dumps('\\')
    "\\"
    >>> print json.dumps({"c": 0, "b": 0, "a": 0}, sort_keys=True)
    {"a": 0, "b": 0, "c": 0}
    >>> from StringIO import StringIO
    >>> io = StringIO()
    >>> json.dump(['streaming API'], io)
    >>> io.getvalue()
    '["streaming API"]'

Compact encoding::

    >>> import json
    >>> json.dumps([1,2,3,{'4': 5, '6': 7}], separators=(',',':'))
    '[1,2,3,{"4":5,"6":7}]'

Pretty printing::

    >>> import json
    >>> print json.dumps({'4': 5, '6': 7}, sort_keys=True,
    ...                  indent=4, separators=(',', ': '))
    {
        "4": 5,
        "6": 7
    }

Decoding JSON::

    >>> import json
    >>> json.loads('["foo", {"bar":["baz", null, 1.0, 2]}]')
    [u'foo', {u'bar': [u'baz', None, 1.0, 2]}]
    >>> json.loads('"\\"foo\\bar"')
    u'"foo\x08ar'
    >>> from StringIO import StringIO
    >>> io = StringIO('["streaming API"]')
    >>> json.load(io)
    [u'streaming API']

Specializing JSON object decoding::

    >>> import json
    >>> def as_complex(dct):
    ...     if '__complex__' in dct:
    ...         return complex(dct['real'], dct['imag'])
    ...     return dct
    ...
    >>> json.loads('{"__complex__": true, "real": 1, "imag": 2}',
    ...     object_hook=as_complex)
    (1+2j)
    >>> import decimal
    >>> json.loads('1.1', parse_float=decimal.Decimal)
    Decimal('1.1')

Extending :class:`JSONEncoder`::

    >>> import json
    >>> class ComplexEncoder(json.JSONEncoder):
    ...     def default(self, obj):
    ...         if isinstance(obj, complex):
    ...             return [obj.real, obj.imag]
    ...         # Let the base class default method raise the TypeError
    ...         return json.JSONEncoder.default(self, obj)
    ...
    >>> json.dumps(2 + 1j, cls=ComplexEncoder)
    '[2.0, 1.0]'
    >>> ComplexEncoder().encode(2 + 1j)
    '[2.0, 1.0]'
    >>> list(ComplexEncoder().iterencode(2 + 1j))
    ['[', '2.0', ', ', '1.0', ']']


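The decoding hook and the encoder subclass shown above can be combined for a
round trip. The following is a minimal sketch rather than part of the module:
the class name ``TaggedComplexEncoder`` and the tagged
``{"__complex__": true, ...}`` output layout are assumptions chosen for
illustration (the ``ComplexEncoder`` above emits a plain two-element list
instead).

```python
import json


class TaggedComplexEncoder(json.JSONEncoder):
    """Hypothetical encoder: writes complex numbers as a tagged JSON object."""

    def default(self, obj):
        if isinstance(obj, complex):
            return {"__complex__": True, "real": obj.real, "imag": obj.imag}
        # Let the base class default method raise the TypeError
        return json.JSONEncoder.default(self, obj)


def as_complex(dct):
    """object_hook that rebuilds complex numbers from the tagged form."""
    if "__complex__" in dct:
        return complex(dct["real"], dct["imag"])
    return dct


original = {"values": [1 + 2j, 3.5]}
text = json.dumps(original, cls=TaggedComplexEncoder, sort_keys=True)
restored = json.loads(text, object_hook=as_complex)
```

Because the hook is called for every decoded JSON object, nested tagged
objects are converted before the enclosing container is returned.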
.. highlight:: none

Using json.tool from the shell to validate and pretty-print::

    $ echo '{"json":"obj"}' | python -mjson.tool
    {
        "json": "obj"
    }
    $ echo '{1.2:3.4}' | python -mjson.tool
    Expecting property name enclosed in double quotes: line 1 column 2 (char 1)

.. highlight:: python

.. note::

   JSON is a subset of `YAML <http://yaml.org/>`_ 1.2. The JSON produced by
   this module's default settings (in particular, the default *separators*
   value) is also a subset of YAML 1.0 and 1.1. This module can thus also be
   used as a YAML serializer.


Basic Usage
-----------

.. function:: dump(obj, fp, skipkeys=False, ensure_ascii=True, \
                   check_circular=True, allow_nan=True, cls=None, \
                   indent=None, separators=None, encoding="utf-8", \
                   default=None, sort_keys=False, **kw)

   Serialize *obj* as a JSON formatted stream to *fp* (a ``.write()``-supporting
   :term:`file-like object`) using this :ref:`conversion table
   <py-to-json-table>`.

   If *skipkeys* is ``True`` (default: ``False``), then dict keys that are not
   of a basic type (:class:`str`, :class:`unicode`, :class:`int`, :class:`long`,
   :class:`float`, :class:`bool`, ``None``) will be skipped instead of raising a
   :exc:`TypeError`.

   If *ensure_ascii* is ``True`` (the default), all non-ASCII characters in the
   output are escaped with ``\uXXXX`` sequences, and the result is a
   :class:`str` instance consisting of ASCII characters only. If
   *ensure_ascii* is ``False``, some chunks written to *fp* may be
   :class:`unicode` instances. This usually happens because the input contains
   unicode strings or the *encoding* parameter is used. Unless ``fp.write()``
   explicitly understands :class:`unicode` (as in :func:`codecs.getwriter`)
   this is likely to cause an error.

   If *check_circular* is ``False`` (default: ``True``), then the circular
   reference check for container types will be skipped and a circular reference
   will result in an :exc:`OverflowError` (or worse).

   If *allow_nan* is ``False`` (default: ``True``), then it will be a
   :exc:`ValueError` to serialize out of range :class:`float` values (``nan``,
   ``inf``, ``-inf``) in strict compliance with the JSON specification, instead
   of using the JavaScript equivalents (``NaN``, ``Infinity``, ``-Infinity``).

   If *indent* is a non-negative integer, then JSON array elements and object
   members will be pretty-printed with that indent level. An indent level of 0,
   or negative, will only insert newlines. ``None`` (the default) selects the
   most compact representation.

   .. note::

      Since the default item separator is ``', '``, the output might include
      trailing whitespace when *indent* is specified. You can use
      ``separators=(',', ': ')`` to avoid this.

   If *separators* is an ``(item_separator, key_separator)`` tuple, then it
   will be used instead of the default ``(', ', ': ')`` separators.
   ``(',', ':')`` is the most compact JSON representation.

   *encoding* is the character encoding for :class:`str` instances; the default
   is UTF-8.

   *default(obj)* is a function that should return a serializable version of
   *obj* or raise :exc:`TypeError`. The default simply raises :exc:`TypeError`.

   If *sort_keys* is ``True`` (default: ``False``), then the output of
   dictionaries will be sorted by key.

   To use a custom :class:`JSONEncoder` subclass (e.g. one that overrides the
   :meth:`default` method to serialize additional types), specify it with the
   *cls* kwarg; otherwise :class:`JSONEncoder` is used.

   .. note::

      Unlike :mod:`pickle` and :mod:`marshal`, JSON is not a framed protocol so
      trying to serialize more objects with repeated calls to :func:`dump` and
      the same *fp* will result in an invalid JSON file.

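The framing caveat for :func:`dump` can be illustrated with a short sketch.
The variable names below are illustrative only, and the ``try``/``except``
import is an assumption made so the snippet also runs on interpreters where
the ``StringIO`` module is not available.

```python
import json

try:
    from StringIO import StringIO  # Python 2
except ImportError:
    from io import StringIO  # assumption: fall back for newer interpreters

records = [{"id": 1}, {"id": 2}]

# Repeated dump() calls on one stream simply concatenate the documents,
# which does not form a single valid JSON text.
broken = StringIO()
for record in records:
    json.dump(record, broken)
try:
    json.loads(broken.getvalue())
    concatenation_is_valid = True
except ValueError:
    concatenation_is_valid = False

# Serializing one enclosing list keeps the output a single document.
framed = StringIO()
json.dump(records, framed)
round_tripped = json.loads(framed.getvalue())
```

Wrapping the objects in one list is only one possible workaround; it requires
all objects to be available before the call.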
.. function:: dumps(obj, skipkeys=False, ensure_ascii=True, \
                    check_circular=True, allow_nan=True, cls=None, \
                    indent=None, separators=None, encoding="utf-8", \
                    default=None, sort_keys=False, **kw)

   Serialize *obj* to a JSON formatted :class:`str` using this :ref:`conversion
   table <py-to-json-table>`. If *ensure_ascii* is ``False``, the result may
   contain non-ASCII characters and the return value may be a :class:`unicode`
   instance.

   The arguments have the same meaning as in :func:`dump`.

   .. note::

      Keys in key/value pairs of JSON are always of the type :class:`str`. When
      a dictionary is converted into JSON, all the keys of the dictionary are
      coerced to strings. As a result of this, if a dictionary is converted
      into JSON and then back into a dictionary, the dictionary may not equal
      the original one. That is, ``loads(dumps(x)) != x`` if *x* has non-string
      keys.

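The key coercion described in the note on :func:`dumps` can be observed
directly. This is a small sketch with illustrative variable names; the final
step, converting the keys back with ``int``, is an assumption that only holds
when every original key was an integer.

```python
import json

original = {1: "one", 2: "two"}
restored = json.loads(json.dumps(original))

# Integer keys come back as strings, so the round trip is not an identity.
keys_survived = (restored == original)

# Assumption: all keys were integers, so int() recovers them.
rebuilt = dict((int(key), value) for key, value in restored.items())
```

After this runs, ``keys_survived`` is false while ``rebuilt`` compares equal
to ``original``.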
.. function:: load(fp[, encoding[, cls[, object_hook[, parse_float[, parse_int[, parse_constant[, object_pairs_hook[, **kw]]]]]]]])

   Deserialize *fp* (a ``.read()``-supporting :term:`file-like object`
   containing a JSON document) to a Python object using this :ref:`conversion
   table <json-to-py-table>`.

   If the contents of *fp* are encoded with an ASCII based encoding other than
   UTF-8 (e.g. latin-1), then an appropriate *encoding* name must be specified.
   Encodings that are not ASCII based (such as UCS-2) are not allowed, and
   should be wrapped with ``codecs.getreader(encoding)(fp)``, or simply decoded
   to a :class:`unicode` object and passed to :func:`loads`.

   *object_hook* is an optional function that will be called with the result of
   any object literal decoded (a :class:`dict`). The return value of
   *object_hook* will be used instead of the :class:`dict`. This feature can be
   used to implement custom decoders (e.g. `JSON-RPC <http://www.jsonrpc.org>`_
   class hinting).

   *object_pairs_hook* is an optional function that will be called with the
   result of any object literal decoded with an ordered list of pairs. The
   return value of *object_pairs_hook* will be used instead of the
   :class:`dict`. This feature can be used to implement custom decoders that
   rely on the order in which the key and value pairs are decoded (for example,
   :class:`collections.OrderedDict` will remember the order of insertion). If
   *object_hook* is also defined, the *object_pairs_hook* takes priority.

   .. versionchanged:: 2.7
      Added support for *object_pairs_hook*.

   *parse_float*, if specified, will be called with the string of every JSON
   float to be decoded. By default, this is equivalent to ``float(num_str)``.
   This can be used to use another datatype or parser for JSON floats
   (e.g. :class:`decimal.Decimal`).

   *parse_int*, if specified, will be called with the string of every JSON int
   to be decoded. By default, this is equivalent to ``int(num_str)``. This can
   be used to use another datatype or parser for JSON integers
   (e.g. :class:`float`).

   *parse_constant*, if specified, will be called with one of the following
   strings: ``'-Infinity'``, ``'Infinity'``, ``'NaN'``.
   This can be used to raise an exception if invalid JSON numbers
   are encountered.

   .. versionchanged:: 2.7
      *parse_constant* doesn't get called on 'null', 'true', 'false' anymore.

   To use a custom :class:`JSONDecoder` subclass, specify it with the ``cls``
   kwarg; otherwise :class:`JSONDecoder` is used. Additional keyword arguments
   will be passed to the constructor of the class.


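Several of the :func:`load` hooks can be combined in one call. The sketch
below is illustrative only: the sample document is made up, and the
``try``/``except`` import is an assumption made so the snippet also runs on
interpreters without the ``StringIO`` module.

```python
import collections
import decimal
import json

try:
    from StringIO import StringIO  # Python 2
except ImportError:
    from io import StringIO  # assumption: fall back for newer interpreters

fp = StringIO('{"b": 1.10, "a": 2, "c": [0.1]}')

# object_pairs_hook preserves the order of names in the document, and
# parse_float keeps the exact decimal text instead of a binary float.
data = json.load(fp,
                 object_pairs_hook=collections.OrderedDict,
                 parse_float=decimal.Decimal)

key_order = list(data.keys())
price = data["b"]
```

Here ``key_order`` follows the document order (``b``, ``a``, ``c``) and
``price`` keeps its trailing zero, because *parse_float* receives the original
string ``'1.10'``.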
.. function:: loads(s[, encoding[, cls[, object_hook[, parse_float[, parse_int[, parse_constant[, object_pairs_hook[, **kw]]]]]]]])

   Deserialize *s* (a :class:`str` or :class:`unicode` instance containing a JSON
   document) to a Python object using this :ref:`conversion table
   <json-to-py-table>`.

   If *s* is a :class:`str` instance and is encoded with an ASCII based encoding
   other than UTF-8 (e.g. latin-1), then an appropriate *encoding* name must be
   specified. Encodings that are not ASCII based (such as UCS-2) are not
   allowed and should be decoded to :class:`unicode` first.

   The other arguments have the same meaning as in :func:`load`.


Encoders and Decoders
---------------------

.. class:: JSONDecoder([encoding[, object_hook[, parse_float[, parse_int[, parse_constant[, strict[, object_pairs_hook]]]]]]])

   Simple JSON decoder.

   Performs the following translations in decoding by default:

   .. _json-to-py-table:

   +---------------+-------------------+
   | JSON          | Python            |
   +===============+===================+
   | object        | dict              |
   +---------------+-------------------+
   | array         | list              |
   +---------------+-------------------+
   | string        | unicode           |
   +---------------+-------------------+
   | number (int)  | int, long         |
   +---------------+-------------------+
   | number (real) | float             |
   +---------------+-------------------+
   | true          | True              |
   +---------------+-------------------+
   | false         | False             |
   +---------------+-------------------+
   | null          | None              |
   +---------------+-------------------+

   It also understands ``NaN``, ``Infinity``, and ``-Infinity`` as their
   corresponding ``float`` values, which is outside the JSON spec.

   *encoding* determines the encoding used to interpret any :class:`str` objects
   decoded by this instance (UTF-8 by default). It has no effect when decoding
   :class:`unicode` objects.

   Note that currently only encodings that are a superset of ASCII work; strings
   of other encodings should be passed in as :class:`unicode`.

   *object_hook*, if specified, will be called with the result of every JSON
   object decoded and its return value will be used in place of the given
   :class:`dict`. This can be used to provide custom deserializations (e.g. to
   support JSON-RPC class hinting).

   *object_pairs_hook*, if specified, will be called with the result of every
   JSON object decoded with an ordered list of pairs. The return value of
   *object_pairs_hook* will be used instead of the :class:`dict`. This
   feature can be used to implement custom decoders that rely on the order
   in which the key and value pairs are decoded (for example,
   :class:`collections.OrderedDict` will remember the order of insertion). If
   *object_hook* is also defined, the *object_pairs_hook* takes priority.

   .. versionchanged:: 2.7
      Added support for *object_pairs_hook*.

   *parse_float*, if specified, will be called with the string of every JSON
   float to be decoded. By default, this is equivalent to ``float(num_str)``.
   This can be used to use another datatype or parser for JSON floats
   (e.g. :class:`decimal.Decimal`).

   *parse_int*, if specified, will be called with the string of every JSON int
   to be decoded. By default, this is equivalent to ``int(num_str)``. This can
   be used to use another datatype or parser for JSON integers
   (e.g. :class:`float`).

   *parse_constant*, if specified, will be called with one of the following
   strings: ``'-Infinity'``, ``'Infinity'``, ``'NaN'``. This can be used to
   raise an exception if invalid JSON numbers are encountered.

   If *strict* is ``False`` (``True`` is the default), then control characters
   will be allowed inside strings. Control characters in this context are
   those with character codes in the 0-31 range, including ``'\t'`` (tab),
   ``'\n'``, ``'\r'`` and ``'\0'``.


   .. method:: decode(s)

      Return the Python representation of *s* (a :class:`str` or
      :class:`unicode` instance containing a JSON document).

   .. method:: raw_decode(s)

      Decode a JSON document from *s* (a :class:`str` or :class:`unicode`
      beginning with a JSON document) and return a 2-tuple of the Python
      representation and the index in *s* where the document ended.

      This can be used to decode a JSON document from a string that may have
      extraneous data at the end.


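Because :meth:`JSONDecoder.raw_decode` reports where a document ended, it can
be called repeatedly to read several documents from one string. The following
is a minimal sketch: ``iter_documents`` is a hypothetical helper, and stripping
whitespace between documents is a choice made here, since the string passed to
``raw_decode`` must begin with a JSON document.

```python
import json


def iter_documents(text):
    """Yield each JSON document found back to back in *text* (hypothetical helper)."""
    decoder = json.JSONDecoder()
    remaining = text.lstrip()
    while remaining:
        # raw_decode returns the value and the index where the document ended.
        obj, end = decoder.raw_decode(remaining)
        yield obj
        remaining = remaining[end:].lstrip()


documents = list(iter_documents('{"a": 1} [2, 3]\n"four"'))
```

Re-slicing the string copies the unread tail on every step, which is simple
but not efficient for very large inputs.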
.. class:: JSONEncoder([skipkeys[, ensure_ascii[, check_circular[, allow_nan[, sort_keys[, indent[, separators[, encoding[, default]]]]]]]]])

   Extensible JSON encoder for Python data structures.

   Supports the following objects and types by default:

   .. _py-to-json-table:

   +-------------------+---------------+
   | Python            | JSON          |
   +===================+===============+
   | dict              | object        |
   +-------------------+---------------+
   | list, tuple       | array         |
   +-------------------+---------------+
   | str, unicode      | string        |
   +-------------------+---------------+
   | int, long, float  | number        |
   +-------------------+---------------+
   | True              | true          |
   +-------------------+---------------+
   | False             | false         |
   +-------------------+---------------+
   | None              | null          |
   +-------------------+---------------+

   To extend this to recognize other objects, subclass and implement a
   :meth:`default` method that returns a serializable object for ``o`` if
   possible; otherwise it should call the superclass implementation
   (to raise :exc:`TypeError`).

   If *skipkeys* is ``False`` (the default), then it is a :exc:`TypeError` to
   attempt encoding of keys that are not str, int, long, float or None. If
   *skipkeys* is ``True``, such items are simply skipped.

   If *ensure_ascii* is ``True`` (the default), all non-ASCII characters in the
   output are escaped with ``\uXXXX`` sequences, and the results are
   :class:`str` instances consisting of ASCII characters only. If
   *ensure_ascii* is ``False``, a result may be a :class:`unicode`
   instance. This usually happens if the input contains unicode strings or the
   *encoding* parameter is used.

   If *check_circular* is ``True`` (the default), then lists, dicts, and custom
   encoded objects will be checked for circular references during encoding to
   prevent an infinite recursion (which would cause an :exc:`OverflowError`).
   Otherwise, no such check takes place.

   If *allow_nan* is ``True`` (the default), then ``NaN``, ``Infinity``, and
   ``-Infinity`` will be encoded as such. This behavior is not JSON
   specification compliant, but is consistent with most JavaScript based
   encoders and decoders. Otherwise, it will be a :exc:`ValueError` to encode
   such floats.

   If *sort_keys* is ``True`` (default ``False``), then the output of dictionaries
   will be sorted by key; this is useful for regression tests to ensure that
   JSON serializations can be compared on a day-to-day basis.

   If *indent* is a non-negative integer (it is ``None`` by default), then JSON
   array elements and object members will be pretty-printed with that indent
   level. An indent level of 0 will only insert newlines. ``None`` is the most
   compact representation.

   .. note::

      Since the default item separator is ``', '``, the output might include
      trailing whitespace when *indent* is specified. You can use
      ``separators=(',', ': ')`` to avoid this.

   If specified, *separators* should be an ``(item_separator, key_separator)``
   tuple. The default is ``(', ', ': ')``. To get the most compact JSON
   representation, you should specify ``(',', ':')`` to eliminate whitespace.

   If specified, *default* is a function that gets called for objects that can't
   otherwise be serialized. It should return a JSON encodable version of the
   object or raise a :exc:`TypeError`.

   If *encoding* is not ``None``, then all input strings will be transformed
   into unicode using that encoding prior to JSON-encoding. The default is
   UTF-8.


   .. method:: default(o)

      Implement this method in a subclass such that it returns a serializable
      object for *o*, or calls the base implementation (to raise a
      :exc:`TypeError`).

      For example, to support arbitrary iterators, you could implement default
      like this::

         def default(self, o):
             try:
                 iterable = iter(o)
             except TypeError:
                 pass
             else:
                 return list(iterable)
             # Let the base class default method raise the TypeError
             return JSONEncoder.default(self, o)


   .. method:: encode(o)

      Return a JSON string representation of a Python data structure, *o*. For
      example::

        >>> JSONEncoder().encode({"foo": ["bar", "baz"]})
        '{"foo": ["bar", "baz"]}'


   .. method:: iterencode(o)

      Encode the given object, *o*, and yield each string representation as
      available. For example::

            for chunk in JSONEncoder().iterencode(bigobject):
                mysocket.write(chunk)


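As a runnable counterpart to the :meth:`JSONEncoder.iterencode` fragment, the
sketch below writes the chunks to an in-memory stream instead of a socket. The
variable names are illustrative, and the ``try``/``except`` import is an
assumption made so the snippet also runs on interpreters without the
``StringIO`` module.

```python
import json

try:
    from StringIO import StringIO  # Python 2
except ImportError:
    from io import StringIO  # assumption: fall back for newer interpreters

encoder = json.JSONEncoder(sort_keys=True, separators=(',', ':'))
big_object = {"numbers": list(range(5)), "name": "spam"}

out = StringIO()
for chunk in encoder.iterencode(big_object):
    # Each chunk is a fragment of the final JSON text.
    out.write(chunk)

streamed = out.getvalue()
```

The concatenated chunks are expected to equal the result of
``encoder.encode(big_object)``; how the text is split into chunks is an
implementation detail and should not be relied upon.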
Standard Compliance
-------------------

The JSON format is specified by :rfc:`4627`. This section details this
module's level of compliance with the RFC. For simplicity,
:class:`JSONEncoder` and :class:`JSONDecoder` subclasses, and parameters other
than those explicitly mentioned, are not considered.

This module does not comply with the RFC in a strict fashion, implementing some
extensions that are valid JavaScript but not valid JSON. In particular:

- Top-level non-object, non-array values are accepted and output;
- Infinite and NaN number values are accepted and output;
- Repeated names within an object are accepted, and only the value of the last
  name-value pair is used.

Since the RFC permits RFC-compliant parsers to accept input texts that are not
RFC-compliant, this module's deserializer is technically RFC-compliant under
default settings.

Character Encodings
^^^^^^^^^^^^^^^^^^^

The RFC recommends that JSON be represented using either UTF-8, UTF-16, or
UTF-32, with UTF-8 being the default. Accordingly, this module uses UTF-8 as
the default for its *encoding* parameter.

This module's deserializer only directly works with ASCII-compatible encodings;
UTF-16, UTF-32, and other ASCII-incompatible encodings require the use of
workarounds described in the documentation for the deserializer's *encoding*
parameter.

The RFC also non-normatively describes a limited encoding detection technique
for JSON texts; this module's deserializer does not implement this or any other
kind of encoding detection.

As permitted, though not required, by the RFC, this module's serializer sets
*ensure_ascii=True* by default, thus escaping the output so that the resulting
strings only contain ASCII characters.


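The effect of the *ensure_ascii* default can be seen with a single non-ASCII
character. This is a small sketch with an illustrative sample string, not an
exhaustive description of the serializer's escaping rules.

```python
import json

text = u"caf\u00e9"

# Default: non-ASCII characters are written as \uXXXX escapes.
escaped = json.dumps(text)

# With ensure_ascii=False the character is passed through unescaped.
raw = json.dumps(text, ensure_ascii=False)
```

Both forms decode back to the same string, so the choice only affects the
bytes on the wire, not the decoded value.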
Top-level Non-Object, Non-Array Values
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

The RFC specifies that the top-level value of a JSON text must be either a
JSON object or array (Python :class:`dict` or :class:`list`). This module's
deserializer also accepts input texts consisting solely of a
JSON null, boolean, number, or string value::

   >>> just_a_json_string = '"spam and eggs"'  # Not by itself a valid JSON text
   >>> json.loads(just_a_json_string)
   u'spam and eggs'

This module itself does not include a way to request that such input texts be
regarded as illegal. Likewise, this module's serializer also accepts single
Python :data:`None`, :class:`bool`, numeric, and :class:`str`
values as input and will generate output texts consisting solely of a top-level
JSON null, boolean, number, or string value without raising an exception::

   >>> neither_a_list_nor_a_dict = u"spam and eggs"
   >>> json.dumps(neither_a_list_nor_a_dict)  # The result is not a valid JSON text
   '"spam and eggs"'

This module's serializer does not itself include a way to enforce the
aforementioned constraint.


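Callers that need the RFC's top-level constraint can check the decoded value
themselves. The sketch below is one possible approach, not a feature of this
module: ``loads_strict_toplevel`` is a hypothetical helper, and it assumes no
*object_hook* or *object_pairs_hook* is in use (such hooks may return types
other than :class:`dict`).

```python
import json


def loads_strict_toplevel(text):
    """Decode *text*, rejecting top-level values that are not object/array.

    Hypothetical helper; assumes the default decoding hooks.
    """
    value = json.loads(text)
    if not isinstance(value, (dict, list)):
        raise ValueError("top-level JSON value must be an object or array")
    return value


accepted = loads_strict_toplevel('["spam", "eggs"]')
try:
    loads_strict_toplevel('"spam and eggs"')
    scalar_rejected = False
except ValueError:
    scalar_rejected = True
```

A matching check on the serializer side would test the argument passed to
:func:`dumps` in the same way before encoding it.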
Infinite and NaN Number Values
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

The RFC does not permit the representation of infinite or NaN number values.
Despite that, by default, this module accepts and outputs ``Infinity``,
``-Infinity``, and ``NaN`` as if they were valid JSON number literal values::

   >>> # Neither of these calls raises an exception, but the results are not valid JSON
   >>> json.dumps(float('-inf'))
   '-Infinity'
   >>> json.dumps(float('nan'))
   'NaN'
   >>> # Same when deserializing
   >>> json.loads('-Infinity')
   -inf
   >>> json.loads('NaN')
   nan

In the serializer, the *allow_nan* parameter can be used to alter this
behavior. In the deserializer, the *parse_constant* parameter can be used to
alter this behavior.


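Both parameters mentioned above can be used to make the module refuse these
values. In this minimal sketch, ``reject_constant`` is a hypothetical callback
name and raising :exc:`ValueError` from it is a choice made for illustration.

```python
import json


def reject_constant(name):
    """Hypothetical parse_constant callback that refuses NaN and infinities."""
    raise ValueError("invalid JSON constant: %s" % name)


# Deserializer: parse_constant is called for 'NaN', 'Infinity', '-Infinity'.
try:
    json.loads('[1, NaN]', parse_constant=reject_constant)
    nan_accepted = True
except ValueError:
    nan_accepted = False

# Serializer: allow_nan=False raises ValueError for out of range floats.
try:
    json.dumps(float('inf'), allow_nan=False)
    inf_emitted = True
except ValueError:
    inf_emitted = False
```

With these settings both ``nan_accepted`` and ``inf_emitted`` end up false,
while ordinary numbers are unaffected.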
Repeated Names Within an Object
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

The RFC specifies that the names within a JSON object should be unique, but
does not specify how repeated names in JSON objects should be handled. By
default, this module does not raise an exception; instead, it ignores all but
the last name-value pair for a given name::

   >>> weird_json = '{"x": 1, "x": 2, "x": 3}'
   >>> json.loads(weird_json)
   {u'x': 3}

The *object_pairs_hook* parameter can be used to alter this behavior.
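
One way to use *object_pairs_hook* for this purpose is to reject repeated
names outright. The following is a minimal sketch: ``unique_pairs`` is a
hypothetical hook, and raising :exc:`ValueError` is only one possible policy
(keeping the first value, or collecting all values in a list, would be
alternatives).

```python
import json


def unique_pairs(pairs):
    """Hypothetical object_pairs_hook that refuses repeated names."""
    result = {}
    for key, value in pairs:
        if key in result:
            raise ValueError("duplicate name in JSON object: %r" % (key,))
        result[key] = value
    return result


weird_json = '{"x": 1, "x": 2, "x": 3}'
try:
    json.loads(weird_json, object_pairs_hook=unique_pairs)
    duplicates_accepted = True
except ValueError:
    duplicates_accepted = False

clean = json.loads('{"x": 1, "y": 2}', object_pairs_hook=unique_pairs)
```

The hook receives the pairs of each object in document order, so the check
applies to nested objects as well as to the top-level one.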