blob: 287a08b90d8d4f6041b56ade9f99bb66bd5b3877 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`pickle` --- Python object serialization
2=============================================
3
4.. index::
5 single: persistence
6 pair: persistent; objects
7 pair: serializing; objects
8 pair: marshalling; objects
9 pair: flattening; objects
10 pair: pickling; objects
11
12.. module:: pickle
13 :synopsis: Convert Python objects to streams of bytes and back.
Christian Heimes5b5e81c2007-12-31 16:14:33 +000014.. sectionauthor:: Jim Kerr <jbkerr@sr.hp.com>.
15.. sectionauthor:: Barry Warsaw <barry@zope.com>
Georg Brandl116aa622007-08-15 14:28:22 +000016
Alexandre Vassalotti9d7665d2009-04-03 06:13:29 +000017
Georg Brandl116aa622007-08-15 14:28:22 +000018The :mod:`pickle` module implements a fundamental, but powerful algorithm for
19serializing and de-serializing a Python object structure. "Pickling" is the
20process whereby a Python object hierarchy is converted into a byte stream, and
21"unpickling" is the inverse operation, whereby a byte stream is converted back
22into an object hierarchy. Pickling (and unpickling) is alternatively known as
23"serialization", "marshalling," [#]_ or "flattening", however, to avoid
Benjamin Petersonbe149d02008-06-20 21:03:22 +000024confusion, the terms used here are "pickling" and "unpickling"..
Georg Brandl116aa622007-08-15 14:28:22 +000025
Georg Brandl0036bcf2010-10-17 10:24:54 +000026.. warning::
27
28 The :mod:`pickle` module is not intended to be secure against erroneous or
29 maliciously constructed data. Never unpickle data received from an untrusted
30 or unauthenticated source.
31
Georg Brandl116aa622007-08-15 14:28:22 +000032
33Relationship to other Python modules
34------------------------------------
35
Benjamin Petersonbe149d02008-06-20 21:03:22 +000036The :mod:`pickle` module has an transparent optimizer (:mod:`_pickle`) written
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +000037in C. It is used whenever available. Otherwise the pure Python implementation is
Benjamin Petersonbe149d02008-06-20 21:03:22 +000038used.
Georg Brandl116aa622007-08-15 14:28:22 +000039
40Python has a more primitive serialization module called :mod:`marshal`, but in
41general :mod:`pickle` should always be the preferred way to serialize Python
42objects. :mod:`marshal` exists primarily to support Python's :file:`.pyc`
43files.
44
Georg Brandl5aa580f2010-11-30 14:57:54 +000045The :mod:`pickle` module differs from :mod:`marshal` in several significant ways:
Georg Brandl116aa622007-08-15 14:28:22 +000046
47* The :mod:`pickle` module keeps track of the objects it has already serialized,
48 so that later references to the same object won't be serialized again.
49 :mod:`marshal` doesn't do this.
50
51 This has implications both for recursive objects and object sharing. Recursive
52 objects are objects that contain references to themselves. These are not
53 handled by marshal, and in fact, attempting to marshal recursive objects will
54 crash your Python interpreter. Object sharing happens when there are multiple
55 references to the same object in different places in the object hierarchy being
56 serialized. :mod:`pickle` stores such objects only once, and ensures that all
57 other references point to the master copy. Shared objects remain shared, which
58 can be very important for mutable objects.
59
60* :mod:`marshal` cannot be used to serialize user-defined classes and their
61 instances. :mod:`pickle` can save and restore class instances transparently,
62 however the class definition must be importable and live in the same module as
63 when the object was stored.
64
65* The :mod:`marshal` serialization format is not guaranteed to be portable
66 across Python versions. Because its primary job in life is to support
67 :file:`.pyc` files, the Python implementers reserve the right to change the
68 serialization format in non-backwards compatible ways should the need arise.
69 The :mod:`pickle` serialization format is guaranteed to be backwards compatible
70 across Python releases.
71
Georg Brandl116aa622007-08-15 14:28:22 +000072Note that serialization is a more primitive notion than persistence; although
73:mod:`pickle` reads and writes file objects, it does not handle the issue of
74naming persistent objects, nor the (even more complicated) issue of concurrent
75access to persistent objects. The :mod:`pickle` module can transform a complex
76object into a byte stream and it can transform the byte stream into an object
77with the same internal structure. Perhaps the most obvious thing to do with
78these byte streams is to write them onto a file, but it is also conceivable to
79send them across a network or store them in a database. The module
80:mod:`shelve` provides a simple interface to pickle and unpickle objects on
81DBM-style database files.
82
83
84Data stream format
85------------------
86
87.. index::
Georg Brandl116aa622007-08-15 14:28:22 +000088 single: External Data Representation
89
90The data format used by :mod:`pickle` is Python-specific. This has the
91advantage that there are no restrictions imposed by external standards such as
Antoine Pitroua9494f62012-05-10 15:38:30 +020092JSON or XDR (which can't represent pointer sharing); however it means that
93non-Python programs may not be able to reconstruct pickled Python objects.
Georg Brandl116aa622007-08-15 14:28:22 +000094
Antoine Pitroua9494f62012-05-10 15:38:30 +020095By default, the :mod:`pickle` data format uses a relatively compact binary
96representation. If you need optimal size characteristics, you can efficiently
97:doc:`compress <archiving>` pickled data.
98
Alexandre Vassalotti758bca62008-10-18 19:25:07 +000099The module :mod:`pickletools` contains tools for analyzing data streams
Antoine Pitroua9494f62012-05-10 15:38:30 +0200100generated by :mod:`pickle`. :mod:`pickletools` source code has extensive
101comments about opcodes used by pickle protocols.
Georg Brandl116aa622007-08-15 14:28:22 +0000102
Georg Brandl42f2ae02008-04-06 08:39:37 +0000103There are currently 4 different protocols which can be used for pickling.
Georg Brandl116aa622007-08-15 14:28:22 +0000104
Antoine Pitroua9494f62012-05-10 15:38:30 +0200105* Protocol version 0 is the original "human-readable" protocol and is
Alexandre Vassalottif7d08c72009-01-23 04:50:05 +0000106 backwards compatible with earlier versions of Python.
Georg Brandl116aa622007-08-15 14:28:22 +0000107
Antoine Pitroua9494f62012-05-10 15:38:30 +0200108* Protocol version 1 is an old binary format which is also compatible with
Georg Brandl116aa622007-08-15 14:28:22 +0000109 earlier versions of Python.
110
111* Protocol version 2 was introduced in Python 2.3. It provides much more
Antoine Pitroua9494f62012-05-10 15:38:30 +0200112 efficient pickling of :term:`new-style class`\es. Refer to :pep:`307` for
113 information about improvements brought by protocol 2.
Georg Brandl116aa622007-08-15 14:28:22 +0000114
Antoine Pitroua9494f62012-05-10 15:38:30 +0200115* Protocol version 3 was added in Python 3. It has explicit support for
116 :class:`bytes` objects and cannot be unpickled by Python 2.x. This is
117 the default as well as the current recommended protocol; use it whenever
118 possible.
Georg Brandl116aa622007-08-15 14:28:22 +0000119
Georg Brandl116aa622007-08-15 14:28:22 +0000120
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000121Module Interface
122----------------
Georg Brandl116aa622007-08-15 14:28:22 +0000123
Antoine Pitroua9494f62012-05-10 15:38:30 +0200124To serialize an object hierarchy, you simply call the :func:`dumps` function.
125Similarly, to de-serialize a data stream, you call the :func:`loads` function.
126However, if you want more control over serialization and de-serialization,
127you can create a :class:`Pickler` or an :class:`Unpickler` object, respectively.
128
129The :mod:`pickle` module provides the following constants:
Georg Brandl116aa622007-08-15 14:28:22 +0000130
131
132.. data:: HIGHEST_PROTOCOL
133
134 The highest protocol version available. This value can be passed as a
135 *protocol* value.
136
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000137.. data:: DEFAULT_PROTOCOL
138
139 The default protocol used for pickling. May be less than HIGHEST_PROTOCOL.
Antoine Pitroua9494f62012-05-10 15:38:30 +0200140 Currently the default protocol is 3, a new protocol designed for Python 3.0.
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000141
142
Georg Brandl116aa622007-08-15 14:28:22 +0000143The :mod:`pickle` module provides the following functions to make the pickling
144process more convenient:
145
Georg Brandl18244152009-09-02 20:34:52 +0000146.. function:: dump(obj, file, protocol=None, \*, fix_imports=True)
Georg Brandl116aa622007-08-15 14:28:22 +0000147
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000148 Write a pickled representation of *obj* to the open :term:`file object` *file*.
149 This is equivalent to ``Pickler(file, protocol).dump(obj)``.
Georg Brandl116aa622007-08-15 14:28:22 +0000150
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000151 The optional *protocol* argument tells the pickler to use the given protocol;
152 supported protocols are 0, 1, 2, 3. The default protocol is 3; a
153 backward-incompatible protocol designed for Python 3.0.
Georg Brandl116aa622007-08-15 14:28:22 +0000154
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000155 Specifying a negative protocol version selects the highest protocol version
156 supported. The higher the protocol used, the more recent the version of
157 Python needed to read the pickle produced.
Georg Brandl116aa622007-08-15 14:28:22 +0000158
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000159 The *file* argument must have a write() method that accepts a single bytes
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000160 argument. It can thus be an on-disk file opened for binary writing, a
161 :class:`io.BytesIO` instance, or any other custom object that meets this
162 interface.
Georg Brandl116aa622007-08-15 14:28:22 +0000163
Antoine Pitroud9dfaa92009-06-04 20:32:06 +0000164 If *fix_imports* is True and *protocol* is less than 3, pickle will try to
165 map the new Python 3.x names to the old module names used in Python 2.x,
166 so that the pickle data stream is readable with Python 2.x.
167
Georg Brandl18244152009-09-02 20:34:52 +0000168.. function:: dumps(obj, protocol=None, \*, fix_imports=True)
Georg Brandl116aa622007-08-15 14:28:22 +0000169
Mark Summerfieldb9e23042008-04-21 14:47:45 +0000170 Return the pickled representation of the object as a :class:`bytes`
171 object, instead of writing it to a file.
Georg Brandl116aa622007-08-15 14:28:22 +0000172
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000173 The optional *protocol* argument tells the pickler to use the given protocol;
174 supported protocols are 0, 1, 2, 3. The default protocol is 3; a
175 backward-incompatible protocol designed for Python 3.0.
176
177 Specifying a negative protocol version selects the highest protocol version
178 supported. The higher the protocol used, the more recent the version of
179 Python needed to read the pickle produced.
180
Antoine Pitroud9dfaa92009-06-04 20:32:06 +0000181 If *fix_imports* is True and *protocol* is less than 3, pickle will try to
182 map the new Python 3.x names to the old module names used in Python 2.x,
183 so that the pickle data stream is readable with Python 2.x.
184
Georg Brandl18244152009-09-02 20:34:52 +0000185.. function:: load(file, \*, fix_imports=True, encoding="ASCII", errors="strict")
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000186
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000187 Read a pickled object representation from the open :term:`file object` *file*
188 and return the reconstituted object hierarchy specified therein. This is
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000189 equivalent to ``Unpickler(file).load()``.
190
191 The protocol version of the pickle is detected automatically, so no protocol
192 argument is needed. Bytes past the pickled object's representation are
193 ignored.
194
195 The argument *file* must have two methods, a read() method that takes an
196 integer argument, and a readline() method that requires no arguments. Both
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000197 methods should return bytes. Thus *file* can be an on-disk file opened
198 for binary reading, a :class:`io.BytesIO` object, or any other custom object
199 that meets this interface.
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000200
Antoine Pitroud9dfaa92009-06-04 20:32:06 +0000201 Optional keyword arguments are *fix_imports*, *encoding* and *errors*,
Georg Brandl6faee4e2010-09-21 14:48:28 +0000202 which are used to control compatibility support for pickle stream generated
Antoine Pitroud9dfaa92009-06-04 20:32:06 +0000203 by Python 2.x. If *fix_imports* is True, pickle will try to map the old
204 Python 2.x names to the new names used in Python 3.x. The *encoding* and
205 *errors* tell pickle how to decode 8-bit string instances pickled by Python
206 2.x; these default to 'ASCII' and 'strict', respectively.
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000207
Georg Brandl18244152009-09-02 20:34:52 +0000208.. function:: loads(bytes_object, \*, fix_imports=True, encoding="ASCII", errors="strict")
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000209
210 Read a pickled object hierarchy from a :class:`bytes` object and return the
211 reconstituted object hierarchy specified therein
212
213 The protocol version of the pickle is detected automatically, so no protocol
214 argument is needed. Bytes past the pickled object's representation are
215 ignored.
216
Antoine Pitroud9dfaa92009-06-04 20:32:06 +0000217 Optional keyword arguments are *fix_imports*, *encoding* and *errors*,
Georg Brandl6faee4e2010-09-21 14:48:28 +0000218 which are used to control compatibility support for pickle stream generated
Antoine Pitroud9dfaa92009-06-04 20:32:06 +0000219 by Python 2.x. If *fix_imports* is True, pickle will try to map the old
220 Python 2.x names to the new names used in Python 3.x. The *encoding* and
221 *errors* tell pickle how to decode 8-bit string instances pickled by Python
222 2.x; these default to 'ASCII' and 'strict', respectively.
Georg Brandl116aa622007-08-15 14:28:22 +0000223
Georg Brandl116aa622007-08-15 14:28:22 +0000224
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000225The :mod:`pickle` module defines three exceptions:
Georg Brandl116aa622007-08-15 14:28:22 +0000226
227.. exception:: PickleError
228
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000229 Common base class for the other pickling exceptions. It inherits
Georg Brandl116aa622007-08-15 14:28:22 +0000230 :exc:`Exception`.
231
Georg Brandl116aa622007-08-15 14:28:22 +0000232.. exception:: PicklingError
233
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000234 Error raised when an unpicklable object is encountered by :class:`Pickler`.
235 It inherits :exc:`PickleError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000236
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000237 Refer to :ref:`pickle-picklable` to learn what kinds of objects can be
238 pickled.
239
Georg Brandl116aa622007-08-15 14:28:22 +0000240.. exception:: UnpicklingError
241
Ezio Melottie62aad32011-11-18 13:51:10 +0200242 Error raised when there is a problem unpickling an object, such as a data
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000243 corruption or a security violation. It inherits :exc:`PickleError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000244
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000245 Note that other exceptions may also be raised during unpickling, including
246 (but not necessarily limited to) AttributeError, EOFError, ImportError, and
247 IndexError.
248
249
250The :mod:`pickle` module exports two classes, :class:`Pickler` and
Georg Brandl116aa622007-08-15 14:28:22 +0000251:class:`Unpickler`:
252
Georg Brandl18244152009-09-02 20:34:52 +0000253.. class:: Pickler(file, protocol=None, \*, fix_imports=True)
Georg Brandl116aa622007-08-15 14:28:22 +0000254
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000255 This takes a binary file for writing a pickle data stream.
Georg Brandl116aa622007-08-15 14:28:22 +0000256
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000257 The optional *protocol* argument tells the pickler to use the given protocol;
258 supported protocols are 0, 1, 2, 3. The default protocol is 3; a
259 backward-incompatible protocol designed for Python 3.0.
Georg Brandl116aa622007-08-15 14:28:22 +0000260
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000261 Specifying a negative protocol version selects the highest protocol version
262 supported. The higher the protocol used, the more recent the version of
263 Python needed to read the pickle produced.
Georg Brandl116aa622007-08-15 14:28:22 +0000264
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000265 The *file* argument must have a write() method that accepts a single bytes
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000266 argument. It can thus be an on-disk file opened for binary writing, a
267 :class:`io.BytesIO` instance, or any other custom object that meets this interface.
Georg Brandl116aa622007-08-15 14:28:22 +0000268
Antoine Pitroud9dfaa92009-06-04 20:32:06 +0000269 If *fix_imports* is True and *protocol* is less than 3, pickle will try to
270 map the new Python 3.x names to the old module names used in Python 2.x,
271 so that the pickle data stream is readable with Python 2.x.
272
Benjamin Petersone41251e2008-04-25 01:59:09 +0000273 .. method:: dump(obj)
Georg Brandl116aa622007-08-15 14:28:22 +0000274
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000275 Write a pickled representation of *obj* to the open file object given in
276 the constructor.
Georg Brandl116aa622007-08-15 14:28:22 +0000277
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000278 .. method:: persistent_id(obj)
279
280 Do nothing by default. This exists so a subclass can override it.
281
282 If :meth:`persistent_id` returns ``None``, *obj* is pickled as usual. Any
283 other value causes :class:`Pickler` to emit the returned value as a
284 persistent ID for *obj*. The meaning of this persistent ID should be
285 defined by :meth:`Unpickler.persistent_load`. Note that the value
286 returned by :meth:`persistent_id` cannot itself have a persistent ID.
287
288 See :ref:`pickle-persistent` for details and examples of uses.
Georg Brandl116aa622007-08-15 14:28:22 +0000289
Antoine Pitrou8d3c2902012-03-04 18:31:48 +0100290 .. attribute:: dispatch_table
291
292 A pickler object's dispatch table is a registry of *reduction
293 functions* of the kind which can be declared using
294 :func:`copyreg.pickle`. It is a mapping whose keys are classes
295 and whose values are reduction functions. A reduction function
296 takes a single argument of the associated class and should
297 conform to the same interface as a :meth:`~object.__reduce__`
298 method.
299
300 By default, a pickler object will not have a
301 :attr:`dispatch_table` attribute, and it will instead use the
302 global dispatch table managed by the :mod:`copyreg` module.
303 However, to customize the pickling for a specific pickler object
304 one can set the :attr:`dispatch_table` attribute to a dict-like
305 object. Alternatively, if a subclass of :class:`Pickler` has a
306 :attr:`dispatch_table` attribute then this will be used as the
307 default dispatch table for instances of that class.
308
309 See :ref:`pickle-dispatch` for usage examples.
310
311 .. versionadded:: 3.3
312
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000313 .. attribute:: fast
314
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000315 Deprecated. Enable fast mode if set to a true value. The fast mode
316 disables the usage of memo, therefore speeding the pickling process by not
317 generating superfluous PUT opcodes. It should not be used with
318 self-referential objects, doing otherwise will cause :class:`Pickler` to
319 recurse infinitely.
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000320
321 Use :func:`pickletools.optimize` if you need more compact pickles.
322
Georg Brandl116aa622007-08-15 14:28:22 +0000323
Georg Brandl18244152009-09-02 20:34:52 +0000324.. class:: Unpickler(file, \*, fix_imports=True, encoding="ASCII", errors="strict")
Georg Brandl116aa622007-08-15 14:28:22 +0000325
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000326 This takes a binary file for reading a pickle data stream.
Georg Brandl116aa622007-08-15 14:28:22 +0000327
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000328 The protocol version of the pickle is detected automatically, so no
329 protocol argument is needed.
330
331 The argument *file* must have two methods, a read() method that takes an
332 integer argument, and a readline() method that requires no arguments. Both
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000333 methods should return bytes. Thus *file* can be an on-disk file object opened
334 for binary reading, a :class:`io.BytesIO` object, or any other custom object
335 that meets this interface.
Georg Brandl116aa622007-08-15 14:28:22 +0000336
Antoine Pitroud9dfaa92009-06-04 20:32:06 +0000337 Optional keyword arguments are *fix_imports*, *encoding* and *errors*,
Georg Brandl6faee4e2010-09-21 14:48:28 +0000338 which are used to control compatibility support for pickle stream generated
Antoine Pitroud9dfaa92009-06-04 20:32:06 +0000339 by Python 2.x. If *fix_imports* is True, pickle will try to map the old
340 Python 2.x names to the new names used in Python 3.x. The *encoding* and
341 *errors* tell pickle how to decode 8-bit string instances pickled by Python
342 2.x; these default to 'ASCII' and 'strict', respectively.
Georg Brandl116aa622007-08-15 14:28:22 +0000343
Benjamin Petersone41251e2008-04-25 01:59:09 +0000344 .. method:: load()
Georg Brandl116aa622007-08-15 14:28:22 +0000345
Benjamin Petersone41251e2008-04-25 01:59:09 +0000346 Read a pickled object representation from the open file object given in
347 the constructor, and return the reconstituted object hierarchy specified
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000348 therein. Bytes past the pickled object's representation are ignored.
Georg Brandl116aa622007-08-15 14:28:22 +0000349
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000350 .. method:: persistent_load(pid)
Georg Brandl116aa622007-08-15 14:28:22 +0000351
Ezio Melottie62aad32011-11-18 13:51:10 +0200352 Raise an :exc:`UnpicklingError` by default.
Georg Brandl116aa622007-08-15 14:28:22 +0000353
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000354 If defined, :meth:`persistent_load` should return the object specified by
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000355 the persistent ID *pid*. If an invalid persistent ID is encountered, an
Ezio Melottie62aad32011-11-18 13:51:10 +0200356 :exc:`UnpicklingError` should be raised.
Georg Brandl116aa622007-08-15 14:28:22 +0000357
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000358 See :ref:`pickle-persistent` for details and examples of uses.
359
360 .. method:: find_class(module, name)
361
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000362 Import *module* if necessary and return the object called *name* from it,
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000363 where the *module* and *name* arguments are :class:`str` objects. Note,
364 unlike its name suggests, :meth:`find_class` is also used for finding
365 functions.
Georg Brandl116aa622007-08-15 14:28:22 +0000366
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000367 Subclasses may override this to gain control over what type of objects and
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000368 how they can be loaded, potentially reducing security risks. Refer to
369 :ref:`pickle-restrict` for details.
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000370
371
372.. _pickle-picklable:
Georg Brandl116aa622007-08-15 14:28:22 +0000373
374What can be pickled and unpickled?
375----------------------------------
376
377The following types can be pickled:
378
379* ``None``, ``True``, and ``False``
380
Georg Brandlba956ae2007-11-29 17:24:34 +0000381* integers, floating point numbers, complex numbers
Georg Brandl116aa622007-08-15 14:28:22 +0000382
Georg Brandlf6945182008-02-01 11:56:49 +0000383* strings, bytes, bytearrays
Georg Brandl116aa622007-08-15 14:28:22 +0000384
385* tuples, lists, sets, and dictionaries containing only picklable objects
386
387* functions defined at the top level of a module
388
389* built-in functions defined at the top level of a module
390
391* classes that are defined at the top level of a module
392
393* instances of such classes whose :attr:`__dict__` or :meth:`__setstate__` is
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000394 picklable (see section :ref:`pickle-inst` for details)
Georg Brandl116aa622007-08-15 14:28:22 +0000395
396Attempts to pickle unpicklable objects will raise the :exc:`PicklingError`
397exception; when this happens, an unspecified number of bytes may have already
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000398been written to the underlying file. Trying to pickle a highly recursive data
Georg Brandl116aa622007-08-15 14:28:22 +0000399structure may exceed the maximum recursion depth, a :exc:`RuntimeError` will be
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000400raised in this case. You can carefully raise this limit with
Georg Brandl116aa622007-08-15 14:28:22 +0000401:func:`sys.setrecursionlimit`.
402
403Note that functions (built-in and user-defined) are pickled by "fully qualified"
404name reference, not by value. This means that only the function name is
Ezio Melottie62aad32011-11-18 13:51:10 +0200405pickled, along with the name of the module the function is defined in. Neither the
Georg Brandl116aa622007-08-15 14:28:22 +0000406function's code, nor any of its function attributes are pickled. Thus the
407defining module must be importable in the unpickling environment, and the module
408must contain the named object, otherwise an exception will be raised. [#]_
409
410Similarly, classes are pickled by named reference, so the same restrictions in
411the unpickling environment apply. Note that none of the class's code or data is
412pickled, so in the following example the class attribute ``attr`` is not
413restored in the unpickling environment::
414
415 class Foo:
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000416 attr = 'A class attribute'
Georg Brandl116aa622007-08-15 14:28:22 +0000417
418 picklestring = pickle.dumps(Foo)
419
420These restrictions are why picklable functions and classes must be defined in
421the top level of a module.
422
423Similarly, when class instances are pickled, their class's code and data are not
424pickled along with them. Only the instance data are pickled. This is done on
425purpose, so you can fix bugs in a class or add methods to the class and still
426load objects that were created with an earlier version of the class. If you
427plan to have long-lived objects that will see many versions of a class, it may
428be worthwhile to put a version number in the objects so that suitable
429conversions can be made by the class's :meth:`__setstate__` method.
430
431
Georg Brandl116aa622007-08-15 14:28:22 +0000432.. _pickle-inst:
433
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000434Pickling Class Instances
435------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000436
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000437In this section, we describe the general mechanisms available to you to define,
438customize, and control how class instances are pickled and unpickled.
Georg Brandl116aa622007-08-15 14:28:22 +0000439
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000440In most cases, no additional code is needed to make instances picklable. By
441default, pickle will retrieve the class and the attributes of an instance via
442introspection. When a class instance is unpickled, its :meth:`__init__` method
443is usually *not* invoked. The default behaviour first creates an uninitialized
444instance and then restores the saved attributes. The following code shows an
445implementation of this behaviour::
Georg Brandl85eb8c12007-08-31 16:33:38 +0000446
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000447 def save(obj):
448 return (obj.__class__, obj.__dict__)
449
450 def load(cls, attributes):
451 obj = cls.__new__(cls)
452 obj.__dict__.update(attributes)
453 return obj
Georg Brandl116aa622007-08-15 14:28:22 +0000454
Georg Brandl6faee4e2010-09-21 14:48:28 +0000455Classes can alter the default behaviour by providing one or several special
Georg Brandlc8148262010-10-17 11:13:37 +0000456methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000457
Georg Brandlc8148262010-10-17 11:13:37 +0000458.. method:: object.__getnewargs__()
Georg Brandl116aa622007-08-15 14:28:22 +0000459
Georg Brandlc8148262010-10-17 11:13:37 +0000460 In protocol 2 and newer, classes that implements the :meth:`__getnewargs__`
461 method can dictate the values passed to the :meth:`__new__` method upon
462 unpickling. This is often needed for classes whose :meth:`__new__` method
463 requires arguments.
Georg Brandl116aa622007-08-15 14:28:22 +0000464
Georg Brandl116aa622007-08-15 14:28:22 +0000465
Georg Brandlc8148262010-10-17 11:13:37 +0000466.. method:: object.__getstate__()
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000467
Georg Brandlc8148262010-10-17 11:13:37 +0000468 Classes can further influence how their instances are pickled; if the class
469 defines the method :meth:`__getstate__`, it is called and the returned object
470 is pickled as the contents for the instance, instead of the contents of the
471 instance's dictionary. If the :meth:`__getstate__` method is absent, the
472 instance's :attr:`__dict__` is pickled as usual.
Georg Brandl116aa622007-08-15 14:28:22 +0000473
Georg Brandlc8148262010-10-17 11:13:37 +0000474
475.. method:: object.__setstate__(state)
476
477 Upon unpickling, if the class defines :meth:`__setstate__`, it is called with
478 the unpickled state. In that case, there is no requirement for the state
479 object to be a dictionary. Otherwise, the pickled state must be a dictionary
480 and its items are assigned to the new instance's dictionary.
481
482 .. note::
483
484 If :meth:`__getstate__` returns a false value, the :meth:`__setstate__`
485 method will not be called upon unpickling.
486
Georg Brandl116aa622007-08-15 14:28:22 +0000487
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000488Refer to the section :ref:`pickle-state` for more information about how to use
489the methods :meth:`__getstate__` and :meth:`__setstate__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000490
Benjamin Petersond23f8222009-04-05 19:13:16 +0000491.. note::
Georg Brandle720c0a2009-04-27 16:20:50 +0000492
Benjamin Petersond23f8222009-04-05 19:13:16 +0000493 At unpickling time, some methods like :meth:`__getattr__`,
494 :meth:`__getattribute__`, or :meth:`__setattr__` may be called upon the
Georg Brandlc8148262010-10-17 11:13:37 +0000495 instance. In case those methods rely on some internal invariant being true,
496 the type should implement :meth:`__getnewargs__` to establish such an
497 invariant; otherwise, neither :meth:`__new__` nor :meth:`__init__` will be
498 called.
Benjamin Petersond23f8222009-04-05 19:13:16 +0000499
Georg Brandlc8148262010-10-17 11:13:37 +0000500.. index:: pair: copy; protocol
Christian Heimes05e8be12008-02-23 18:30:17 +0000501
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000502As we shall see, pickle does not use directly the methods described above. In
503fact, these methods are part of the copy protocol which implements the
504:meth:`__reduce__` special method. The copy protocol provides a unified
505interface for retrieving the data necessary for pickling and copying
Georg Brandl48310cd2009-01-03 21:18:54 +0000506objects. [#]_
Georg Brandl116aa622007-08-15 14:28:22 +0000507
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000508Although powerful, implementing :meth:`__reduce__` directly in your classes is
509error prone. For this reason, class designers should use the high-level
510interface (i.e., :meth:`__getnewargs__`, :meth:`__getstate__` and
Georg Brandlc8148262010-10-17 11:13:37 +0000511:meth:`__setstate__`) whenever possible. We will show, however, cases where
512using :meth:`__reduce__` is the only option or leads to more efficient pickling
513or both.
Georg Brandl116aa622007-08-15 14:28:22 +0000514
Georg Brandlc8148262010-10-17 11:13:37 +0000515.. method:: object.__reduce__()
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000516
Georg Brandlc8148262010-10-17 11:13:37 +0000517 The interface is currently defined as follows. The :meth:`__reduce__` method
518 takes no argument and shall return either a string or preferably a tuple (the
519 returned object is often referred to as the "reduce value").
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000520
Georg Brandlc8148262010-10-17 11:13:37 +0000521 If a string is returned, the string should be interpreted as the name of a
522 global variable. It should be the object's local name relative to its
523 module; the pickle module searches the module namespace to determine the
524 object's module. This behaviour is typically useful for singletons.
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000525
Georg Brandlc8148262010-10-17 11:13:37 +0000526 When a tuple is returned, it must be between two and five items long.
527 Optional items can either be omitted, or ``None`` can be provided as their
528 value. The semantics of each item are in order:
Georg Brandl116aa622007-08-15 14:28:22 +0000529
Georg Brandlc8148262010-10-17 11:13:37 +0000530 .. XXX Mention __newobj__ special-case?
Georg Brandl116aa622007-08-15 14:28:22 +0000531
Georg Brandlc8148262010-10-17 11:13:37 +0000532 * A callable object that will be called to create the initial version of the
533 object.
Georg Brandl116aa622007-08-15 14:28:22 +0000534
Georg Brandlc8148262010-10-17 11:13:37 +0000535 * A tuple of arguments for the callable object. An empty tuple must be given
536 if the callable does not accept any argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000537
Georg Brandlc8148262010-10-17 11:13:37 +0000538 * Optionally, the object's state, which will be passed to the object's
539 :meth:`__setstate__` method as previously described. If the object has no
540 such method then, the value must be a dictionary and it will be added to
541 the object's :attr:`__dict__` attribute.
Georg Brandl116aa622007-08-15 14:28:22 +0000542
Georg Brandlc8148262010-10-17 11:13:37 +0000543 * Optionally, an iterator (and not a sequence) yielding successive items.
544 These items will be appended to the object either using
545 ``obj.append(item)`` or, in batch, using ``obj.extend(list_of_items)``.
546 This is primarily used for list subclasses, but may be used by other
547 classes as long as they have :meth:`append` and :meth:`extend` methods with
548 the appropriate signature. (Whether :meth:`append` or :meth:`extend` is
549 used depends on which pickle protocol version is used as well as the number
550 of items to append, so both must be supported.)
Georg Brandl116aa622007-08-15 14:28:22 +0000551
Georg Brandlc8148262010-10-17 11:13:37 +0000552 * Optionally, an iterator (not a sequence) yielding successive key-value
553 pairs. These items will be stored to the object using ``obj[key] =
554 value``. This is primarily used for dictionary subclasses, but may be used
555 by other classes as long as they implement :meth:`__setitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000556
Georg Brandlc8148262010-10-17 11:13:37 +0000557
558.. method:: object.__reduce_ex__(protocol)
559
560 Alternatively, a :meth:`__reduce_ex__` method may be defined. The only
561 difference is this method should take a single integer argument, the protocol
562 version. When defined, pickle will prefer it over the :meth:`__reduce__`
563 method. In addition, :meth:`__reduce__` automatically becomes a synonym for
564 the extended version. The main use for this method is to provide
565 backwards-compatible reduce values for older Python releases.
Georg Brandl116aa622007-08-15 14:28:22 +0000566
Alexandre Vassalotti758bca62008-10-18 19:25:07 +0000567.. _pickle-persistent:
568
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000569Persistence of External Objects
570^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000571
Christian Heimes05e8be12008-02-23 18:30:17 +0000572.. index::
573 single: persistent_id (pickle protocol)
574 single: persistent_load (pickle protocol)
575
Georg Brandl116aa622007-08-15 14:28:22 +0000576For the benefit of object persistence, the :mod:`pickle` module supports the
577notion of a reference to an object outside the pickled data stream. Such
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000578objects are referenced by a persistent ID, which should be either a string of
579alphanumeric characters (for protocol 0) [#]_ or just an arbitrary object (for
580any newer protocol).
Georg Brandl116aa622007-08-15 14:28:22 +0000581
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000582The resolution of such persistent IDs is not defined by the :mod:`pickle`
583module; it will delegate this resolution to the user defined methods on the
584pickler and unpickler, :meth:`persistent_id` and :meth:`persistent_load`
585respectively.
Georg Brandl116aa622007-08-15 14:28:22 +0000586
587To pickle objects that have an external persistent id, the pickler must have a
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000588custom :meth:`persistent_id` method that takes an object as an argument and
Georg Brandl116aa622007-08-15 14:28:22 +0000589returns either ``None`` or the persistent id for that object. When ``None`` is
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000590returned, the pickler simply pickles the object as normal. When a persistent ID
591string is returned, the pickler will pickle that object, along with a marker so
592that the unpickler will recognize it as a persistent ID.
Georg Brandl116aa622007-08-15 14:28:22 +0000593
594To unpickle external objects, the unpickler must have a custom
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000595:meth:`persistent_load` method that takes a persistent ID object and returns the
596referenced object.
Georg Brandl116aa622007-08-15 14:28:22 +0000597
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000598Here is a comprehensive example presenting how persistent ID can be used to
599pickle external objects by reference.
Georg Brandl116aa622007-08-15 14:28:22 +0000600
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000601.. literalinclude:: ../includes/dbpickle.py
Alexandre Vassalottibcd1e3a2009-01-23 05:28:16 +0000602
Antoine Pitrou8d3c2902012-03-04 18:31:48 +0100603.. _pickle-dispatch:
604
605Dispatch Tables
606^^^^^^^^^^^^^^^
607
608If one wants to customize pickling of some classes without disturbing
609any other code which depends on pickling, then one can create a
610pickler with a private dispatch table.
611
612The global dispatch table managed by the :mod:`copyreg` module is
613available as :data:`copyreg.dispatch_table`. Therefore, one may
614choose to use a modified copy of :data:`copyreg.dispatch_table` as a
615private dispatch table.
616
617For example ::
618
619 f = io.BytesIO()
620 p = pickle.Pickler(f)
621 p.dispatch_table = copyreg.dispatch_table.copy()
622 p.dispatch_table[SomeClass] = reduce_SomeClass
623
624creates an instance of :class:`pickle.Pickler` with a private dispatch
625table which handles the ``SomeClass`` class specially. Alternatively,
626the code ::
627
628 class MyPickler(pickle.Pickler):
629 dispatch_table = copyreg.dispatch_table.copy()
630 dispatch_table[SomeClass] = reduce_SomeClass
631 f = io.BytesIO()
632 p = MyPickler(f)
633
634does the same, but all instances of ``MyPickler`` will by default
635share the same dispatch table. The equivalent code using the
636:mod:`copyreg` module is ::
637
638 copyreg.pickle(SomeClass, reduce_SomeClass)
639 f = io.BytesIO()
640 p = pickle.Pickler(f)
Georg Brandl116aa622007-08-15 14:28:22 +0000641
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000642.. _pickle-state:
643
644Handling Stateful Objects
645^^^^^^^^^^^^^^^^^^^^^^^^^
646
647.. index::
648 single: __getstate__() (copy protocol)
649 single: __setstate__() (copy protocol)
650
651Here's an example that shows how to modify pickling behavior for a class.
652The :class:`TextReader` class opens a text file, and returns the line number and
653line contents each time its :meth:`readline` method is called. If a
654:class:`TextReader` instance is pickled, all attributes *except* the file object
655member are saved. When the instance is unpickled, the file is reopened, and
656reading resumes from the last location. The :meth:`__setstate__` and
657:meth:`__getstate__` methods are used to implement this behavior. ::
658
659 class TextReader:
660 """Print and number lines in a text file."""
661
662 def __init__(self, filename):
663 self.filename = filename
664 self.file = open(filename)
665 self.lineno = 0
666
667 def readline(self):
668 self.lineno += 1
669 line = self.file.readline()
670 if not line:
671 return None
Alexandre Vassalotti9d7665d2009-04-03 06:13:29 +0000672 if line.endswith('\n'):
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000673 line = line[:-1]
674 return "%i: %s" % (self.lineno, line)
675
676 def __getstate__(self):
677 # Copy the object's state from self.__dict__ which contains
678 # all our instance attributes. Always use the dict.copy()
679 # method to avoid modifying the original state.
680 state = self.__dict__.copy()
681 # Remove the unpicklable entries.
682 del state['file']
683 return state
684
685 def __setstate__(self, state):
686 # Restore instance attributes (i.e., filename and lineno).
687 self.__dict__.update(state)
688 # Restore the previously opened file's state. To do so, we need to
689 # reopen it and read from it until the line count is restored.
690 file = open(self.filename)
691 for _ in range(self.lineno):
692 file.readline()
693 # Finally, save the file.
694 self.file = file
695
696
697A sample usage might be something like this::
698
699 >>> reader = TextReader("hello.txt")
700 >>> reader.readline()
701 '1: Hello world!'
702 >>> reader.readline()
703 '2: I am line number two.'
704 >>> new_reader = pickle.loads(pickle.dumps(reader))
705 >>> new_reader.readline()
706 '3: Goodbye!'
707
708
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000709.. _pickle-restrict:
Georg Brandl116aa622007-08-15 14:28:22 +0000710
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000711Restricting Globals
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000712-------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000713
Christian Heimes05e8be12008-02-23 18:30:17 +0000714.. index::
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000715 single: find_class() (pickle protocol)
Christian Heimes05e8be12008-02-23 18:30:17 +0000716
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000717By default, unpickling will import any class or function that it finds in the
718pickle data. For many applications, this behaviour is unacceptable as it
719permits the unpickler to import and invoke arbitrary code. Just consider what
720this hand-crafted pickle data stream does when loaded::
Georg Brandl116aa622007-08-15 14:28:22 +0000721
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000722 >>> import pickle
723 >>> pickle.loads(b"cos\nsystem\n(S'echo hello world'\ntR.")
724 hello world
725 0
Georg Brandl116aa622007-08-15 14:28:22 +0000726
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000727In this example, the unpickler imports the :func:`os.system` function and then
728apply the string argument "echo hello world". Although this example is
729inoffensive, it is not difficult to imagine one that could damage your system.
Georg Brandl116aa622007-08-15 14:28:22 +0000730
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000731For this reason, you may want to control what gets unpickled by customizing
732:meth:`Unpickler.find_class`. Unlike its name suggests, :meth:`find_class` is
733called whenever a global (i.e., a class or a function) is requested. Thus it is
Ezio Melottie62aad32011-11-18 13:51:10 +0200734possible to either completely forbid globals or restrict them to a safe subset.
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000735
736Here is an example of an unpickler allowing only few safe classes from the
737:mod:`builtins` module to be loaded::
738
739 import builtins
740 import io
741 import pickle
742
743 safe_builtins = {
744 'range',
745 'complex',
746 'set',
747 'frozenset',
748 'slice',
749 }
750
751 class RestrictedUnpickler(pickle.Unpickler):
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000752
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000753 def find_class(self, module, name):
754 # Only allow safe classes from builtins.
755 if module == "builtins" and name in safe_builtins:
756 return getattr(builtins, name)
757 # Forbid everything else.
758 raise pickle.UnpicklingError("global '%s.%s' is forbidden" %
759 (module, name))
760
761 def restricted_loads(s):
762 """Helper function analogous to pickle.loads()."""
763 return RestrictedUnpickler(io.BytesIO(s)).load()
764
765A sample usage of our unpickler working has intended::
766
767 >>> restricted_loads(pickle.dumps([1, 2, range(15)]))
768 [1, 2, range(0, 15)]
769 >>> restricted_loads(b"cos\nsystem\n(S'echo hello world'\ntR.")
770 Traceback (most recent call last):
771 ...
772 pickle.UnpicklingError: global 'os.system' is forbidden
773 >>> restricted_loads(b'cbuiltins\neval\n'
774 ... b'(S\'getattr(__import__("os"), "system")'
775 ... b'("echo hello world")\'\ntR.')
776 Traceback (most recent call last):
777 ...
778 pickle.UnpicklingError: global 'builtins.eval' is forbidden
779
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000780
781.. XXX Add note about how extension codes could evade our protection
Georg Brandl48310cd2009-01-03 21:18:54 +0000782 mechanism (e.g. cached classes do not invokes find_class()).
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000783
784As our examples shows, you have to be careful with what you allow to be
785unpickled. Therefore if security is a concern, you may want to consider
Alexandre Vassalotti9d7665d2009-04-03 06:13:29 +0000786alternatives such as the marshalling API in :mod:`xmlrpc.client` or
787third-party solutions.
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000788
Georg Brandl116aa622007-08-15 14:28:22 +0000789
790.. _pickle-example:
791
Alexandre Vassalotti9d7665d2009-04-03 06:13:29 +0000792Examples
793--------
Georg Brandl116aa622007-08-15 14:28:22 +0000794
Alexandre Vassalotti9d7665d2009-04-03 06:13:29 +0000795For the simplest code, use the :func:`dump` and :func:`load` functions. ::
Georg Brandl116aa622007-08-15 14:28:22 +0000796
797 import pickle
798
Alexandre Vassalottibcd1e3a2009-01-23 05:28:16 +0000799 # An arbitrary collection of objects supported by pickle.
800 data = {
Alexandre Vassalotti9d7665d2009-04-03 06:13:29 +0000801 'a': [1, 2.0, 3, 4+6j],
802 'b': ("character string", b"byte string"),
803 'c': set([None, True, False])
Alexandre Vassalottibcd1e3a2009-01-23 05:28:16 +0000804 }
Georg Brandl116aa622007-08-15 14:28:22 +0000805
Alexandre Vassalottibcd1e3a2009-01-23 05:28:16 +0000806 with open('data.pickle', 'wb') as f:
807 # Pickle the 'data' dictionary using the highest protocol available.
808 pickle.dump(data, f, pickle.HIGHEST_PROTOCOL)
Georg Brandl116aa622007-08-15 14:28:22 +0000809
Georg Brandl116aa622007-08-15 14:28:22 +0000810
Alexandre Vassalottibcd1e3a2009-01-23 05:28:16 +0000811The following example reads the resulting pickled data. ::
Georg Brandl116aa622007-08-15 14:28:22 +0000812
Alexandre Vassalottibcd1e3a2009-01-23 05:28:16 +0000813 import pickle
Georg Brandl116aa622007-08-15 14:28:22 +0000814
Alexandre Vassalottibcd1e3a2009-01-23 05:28:16 +0000815 with open('data.pickle', 'rb') as f:
816 # The protocol version used is detected automatically, so we do not
817 # have to specify it.
818 data = pickle.load(f)
Georg Brandl116aa622007-08-15 14:28:22 +0000819
Georg Brandl116aa622007-08-15 14:28:22 +0000820
Alexandre Vassalotti9d7665d2009-04-03 06:13:29 +0000821.. XXX: Add examples showing how to optimize pickles for size (like using
822.. pickletools.optimize() or the gzip module).
823
824
Georg Brandl116aa622007-08-15 14:28:22 +0000825.. seealso::
826
Alexandre Vassalottif7fa63d2008-05-11 08:55:36 +0000827 Module :mod:`copyreg`
Georg Brandl116aa622007-08-15 14:28:22 +0000828 Pickle interface constructor registration for extension types.
829
Alexandre Vassalotti9d7665d2009-04-03 06:13:29 +0000830 Module :mod:`pickletools`
831 Tools for working with and analyzing pickled data.
832
Georg Brandl116aa622007-08-15 14:28:22 +0000833 Module :mod:`shelve`
834 Indexed databases of objects; uses :mod:`pickle`.
835
836 Module :mod:`copy`
837 Shallow and deep object copying.
838
839 Module :mod:`marshal`
840 High-performance serialization of built-in types.
841
842
Georg Brandl116aa622007-08-15 14:28:22 +0000843.. rubric:: Footnotes
844
845.. [#] Don't confuse this with the :mod:`marshal` module
846
Georg Brandl116aa622007-08-15 14:28:22 +0000847.. [#] The exception raised will likely be an :exc:`ImportError` or an
848 :exc:`AttributeError` but it could be something else.
849
Alexandre Vassalotti73b90a82008-10-29 23:32:33 +0000850.. [#] The :mod:`copy` module uses this protocol for shallow and deep copying
851 operations.
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000852
Alexandre Vassalottid0392862008-10-24 01:32:40 +0000853.. [#] The limitation on alphanumeric characters is due to the fact
854 the persistent IDs, in protocol 0, are delimited by the newline
855 character. Therefore if any kind of newline characters occurs in
Alexandre Vassalotti5f3b63a2008-10-18 20:47:58 +0000856 persistent IDs, the resulting pickle will become unreadable.