blob: 45e80b82ccb376489c7f0b4a27bee7c2dd77507b [file] [log] [blame]
Barry Warsawf595fd92001-11-15 23:39:07 +00001\section{\module{pickle} --- Python object serialization}
Fred Drakeb91e9341998-07-23 17:59:49 +00002
Fred Drakeffbe6871999-04-22 21:23:22 +00003\declaremodule{standard}{pickle}
Fred Drakeb91e9341998-07-23 17:59:49 +00004\modulesynopsis{Convert Python objects to streams of bytes and back.}
Fred Drake38e5d272000-04-03 20:13:55 +00005% Substantial improvements by Jim Kerr <jbkerr@sr.hp.com>.
Barry Warsawf595fd92001-11-15 23:39:07 +00006% Rewritten by Barry Warsaw <barry@zope.com>
Fred Drakeb91e9341998-07-23 17:59:49 +00007
Thomas Woutersf8316632000-07-16 19:01:10 +00008\index{persistence}
Guido van Rossumd1883581995-02-15 15:53:08 +00009\indexii{persistent}{objects}
10\indexii{serializing}{objects}
11\indexii{marshalling}{objects}
12\indexii{flattening}{objects}
13\indexii{pickling}{objects}
14
Barry Warsawf595fd92001-11-15 23:39:07 +000015The \module{pickle} module implements a fundamental, but powerful
16algorithm for serializing and de-serializing a Python object
17structure. ``Pickling'' is the process whereby a Python object
18hierarchy is converted into a byte stream, and ``unpickling'' is the
19inverse operation, whereby a byte stream is converted back into an
20object hierarchy. Pickling (and unpickling) is alternatively known as
Fred Drake2744f432001-11-26 21:30:36 +000021``serialization'', ``marshalling,''\footnote{Don't confuse this with
22the \refmodule{marshal} module} or ``flattening'',
Raymond Hettinger35fd9262003-06-25 15:07:45 +000023however, to avoid confusion, the terms used here are ``pickling'' and
24``unpickling''.
Guido van Rossum470be141995-03-17 16:07:09 +000025
Barry Warsawf595fd92001-11-15 23:39:07 +000026This documentation describes both the \module{pickle} module and the
Fred Drake2744f432001-11-26 21:30:36 +000027\refmodule{cPickle} module.
Fred Drakeffbe6871999-04-22 21:23:22 +000028
Barry Warsawf595fd92001-11-15 23:39:07 +000029\subsection{Relationship to other Python modules}
Guido van Rossumd1883581995-02-15 15:53:08 +000030
Barry Warsawf595fd92001-11-15 23:39:07 +000031The \module{pickle} module has an optimized cousin called the
32\module{cPickle} module. As its name implies, \module{cPickle} is
33written in C, so it can be up to 1000 times faster than
34\module{pickle}. However it does not support subclassing of the
35\function{Pickler()} and \function{Unpickler()} classes, because in
36\module{cPickle} these are functions, not classes. Most applications
37have no need for this functionality, and can benefit from the improved
38performance of \module{cPickle}. Other than that, the interfaces of
39the two modules are nearly identical; the common interface is
40described in this manual and differences are pointed out where
41necessary. In the following discussions, we use the term ``pickle''
42to collectively describe the \module{pickle} and
43\module{cPickle} modules.
Guido van Rossum736fe5e1997-12-09 20:45:08 +000044
Barry Warsawf595fd92001-11-15 23:39:07 +000045The data streams the two modules produce are guaranteed to be
46interchangeable.
47
48Python has a more primitive serialization module called
Fred Drake2744f432001-11-26 21:30:36 +000049\refmodule{marshal}, but in general
Barry Warsawf595fd92001-11-15 23:39:07 +000050\module{pickle} should always be the preferred way to serialize Python
51objects. \module{marshal} exists primarily to support Python's
52\file{.pyc} files.
53
54The \module{pickle} module differs from \refmodule{marshal} several
55significant ways:
Guido van Rossumd1883581995-02-15 15:53:08 +000056
57\begin{itemize}
58
Barry Warsawf595fd92001-11-15 23:39:07 +000059\item The \module{pickle} module keeps track of the objects it has
60 already serialized, so that later references to the same object
61 won't be serialized again. \module{marshal} doesn't do this.
Guido van Rossumd1883581995-02-15 15:53:08 +000062
Barry Warsawf595fd92001-11-15 23:39:07 +000063 This has implications both for recursive objects and object
64 sharing. Recursive objects are objects that contain references
65 to themselves. These are not handled by marshal, and in fact,
66 attempting to marshal recursive objects will crash your Python
67 interpreter. Object sharing happens when there are multiple
68 references to the same object in different places in the object
69 hierarchy being serialized. \module{pickle} stores such objects
70 only once, and ensures that all other references point to the
71 master copy. Shared objects remain shared, which can be very
72 important for mutable objects.
Guido van Rossumd1883581995-02-15 15:53:08 +000073
Barry Warsawf595fd92001-11-15 23:39:07 +000074\item \module{marshal} cannot be used to serialize user-defined
75 classes and their instances. \module{pickle} can save and
76 restore class instances transparently, however the class
77 definition must be importable and live in the same module as
78 when the object was stored.
79
80\item The \module{marshal} serialization format is not guaranteed to
81 be portable across Python versions. Because its primary job in
82 life is to support \file{.pyc} files, the Python implementers
83 reserve the right to change the serialization format in
84 non-backwards compatible ways should the need arise. The
85 \module{pickle} serialization format is guaranteed to be
86 backwards compatible across Python releases.
87
Guido van Rossumd1883581995-02-15 15:53:08 +000088\end{itemize}
89
Andrew M. Kuchling76963442003-05-14 16:51:46 +000090\begin{notice}[warning]
91The \module{pickle} module is not intended to be secure against
92erroneous or maliciously constructed data. Never unpickle data
93received from an untrusted or unauthenticated source.
94\end{notice}
95
Barry Warsawf595fd92001-11-15 23:39:07 +000096Note that serialization is a more primitive notion than persistence;
97although
98\module{pickle} reads and writes file objects, it does not handle the
99issue of naming persistent objects, nor the (even more complicated)
100issue of concurrent access to persistent objects. The \module{pickle}
101module can transform a complex object into a byte stream and it can
102transform the byte stream into an object with the same internal
103structure. Perhaps the most obvious thing to do with these byte
104streams is to write them onto a file, but it is also conceivable to
105send them across a network or store them in a database. The module
106\refmodule{shelve} provides a simple interface
107to pickle and unpickle objects on DBM-style database files.
108
109\subsection{Data stream format}
110
Fred Drake9b28fe21998-04-04 06:20:28 +0000111The data format used by \module{pickle} is Python-specific. This has
Guido van Rossumd1883581995-02-15 15:53:08 +0000112the advantage that there are no restrictions imposed by external
Barry Warsawf595fd92001-11-15 23:39:07 +0000113standards such as XDR\index{XDR}\index{External Data Representation}
114(which can't represent pointer sharing); however it means that
115non-Python programs may not be able to reconstruct pickled Python
116objects.
Guido van Rossumd1883581995-02-15 15:53:08 +0000117
Fred Drake9b28fe21998-04-04 06:20:28 +0000118By default, the \module{pickle} data format uses a printable \ASCII{}
Guido van Rossum736fe5e1997-12-09 20:45:08 +0000119representation. This is slightly more voluminous than a binary
120representation. The big advantage of using printable \ASCII{} (and of
Fred Drake9b28fe21998-04-04 06:20:28 +0000121some other characteristics of \module{pickle}'s representation) is that
Guido van Rossum736fe5e1997-12-09 20:45:08 +0000122for debugging or recovery purposes it is possible for a human to read
123the pickled file with a standard text editor.
124
Neal Norwitz12d31e22003-02-13 03:12:48 +0000125There are currently 3 different protocols which can be used for pickling.
126
127\begin{itemize}
128
129\item Protocol version 0 is the original ASCII protocol and is backwards
130compatible with earlier versions of Python.
131
132\item Protocol version 1 is the old binary format which is also compatible
133with earlier versions of Python.
134
135\item Protocol version 2 was introduced in Python 2.3. It provides
136much more efficient pickling of new-style classes.
137
138\end{itemize}
139
140Refer to PEP 307 for more information.
141
142If a \var{protocol} is not specified, protocol 0 is used.
Neal Norwitzd08baa92003-02-21 00:26:33 +0000143If \var{protocol} is specified as a negative value
144or \constant{HIGHEST_PROTOCOL},
145the highest protocol version available will be used.
Neal Norwitz12d31e22003-02-13 03:12:48 +0000146
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000147\versionchanged[Introduced the \var{protocol} parameter]{2.3}
Neal Norwitz12d31e22003-02-13 03:12:48 +0000148
Guido van Rossum736fe5e1997-12-09 20:45:08 +0000149A binary format, which is slightly more efficient, can be chosen by
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000150specifying a \var{protocol} version >= 1.
Guido van Rossumd1883581995-02-15 15:53:08 +0000151
Barry Warsawf595fd92001-11-15 23:39:07 +0000152\subsection{Usage}
Guido van Rossumd1883581995-02-15 15:53:08 +0000153
Barry Warsawf595fd92001-11-15 23:39:07 +0000154To serialize an object hierarchy, you first create a pickler, then you
155call the pickler's \method{dump()} method. To de-serialize a data
156stream, you first create an unpickler, then you call the unpickler's
157\method{load()} method. The \module{pickle} module provides the
Neal Norwitzd08baa92003-02-21 00:26:33 +0000158following constant:
159
160\begin{datadesc}{HIGHEST_PROTOCOL}
161The highest protocol version available. This value can be passed
162as a \var{protocol} value.
Fred Drake7c4d8f32003-09-10 20:47:43 +0000163\versionadded{2.3}
Neal Norwitzd08baa92003-02-21 00:26:33 +0000164\end{datadesc}
165
Georg Brandl69774c22006-02-20 13:12:02 +0000166\note{Be sure to always open pickle files created with protocols >= 1 in
167 binary mode. For the old ASCII-based pickle protocol 0 you can use
168 either text mode or binary mode as long as you stay consistent.
169
170 A pickle file written with protocol 0 in binary mode will contain
171 lone linefeeds as line terminators and therefore will look ``funny''
172 when viewed in Notepad or other editors which do not support this
173 format.}
174
Neal Norwitzd08baa92003-02-21 00:26:33 +0000175The \module{pickle} module provides the
Georg Brandl69774c22006-02-20 13:12:02 +0000176following functions to make the pickling process more convenient:
Guido van Rossumd1883581995-02-15 15:53:08 +0000177
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000178\begin{funcdesc}{dump}{obj, file\optional{, protocol}}
Andrew M. Kuchling2ee6a702004-08-07 20:25:55 +0000179Write a pickled representation of \var{obj} to the open file object
Barry Warsawf595fd92001-11-15 23:39:07 +0000180\var{file}. This is equivalent to
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000181\code{Pickler(\var{file}, \var{protocol}).dump(\var{obj})}.
Neal Norwitz12d31e22003-02-13 03:12:48 +0000182
Andrew M. Kuchling2ee6a702004-08-07 20:25:55 +0000183If the \var{protocol} parameter is omitted, protocol 0 is used.
Neal Norwitzd08baa92003-02-21 00:26:33 +0000184If \var{protocol} is specified as a negative value
185or \constant{HIGHEST_PROTOCOL},
Neal Norwitz12d31e22003-02-13 03:12:48 +0000186the highest protocol version will be used.
187
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000188\versionchanged[Introduced the \var{protocol} parameter]{2.3}
Guido van Rossumd1883581995-02-15 15:53:08 +0000189
Barry Warsawf595fd92001-11-15 23:39:07 +0000190\var{file} must have a \method{write()} method that accepts a single
191string argument. It can thus be a file object opened for writing, a
192\refmodule{StringIO} object, or any other custom
193object that meets this interface.
194\end{funcdesc}
Guido van Rossumd1883581995-02-15 15:53:08 +0000195
Barry Warsawf595fd92001-11-15 23:39:07 +0000196\begin{funcdesc}{load}{file}
197Read a string from the open file object \var{file} and interpret it as
198a pickle data stream, reconstructing and returning the original object
199hierarchy. This is equivalent to \code{Unpickler(\var{file}).load()}.
Guido van Rossum470be141995-03-17 16:07:09 +0000200
Barry Warsawf595fd92001-11-15 23:39:07 +0000201\var{file} must have two methods, a \method{read()} method that takes
202an integer argument, and a \method{readline()} method that requires no
203arguments. Both methods should return a string. Thus \var{file} can
204be a file object opened for reading, a
205\module{StringIO} object, or any other custom
206object that meets this interface.
Guido van Rossum736fe5e1997-12-09 20:45:08 +0000207
Barry Warsawf595fd92001-11-15 23:39:07 +0000208This function automatically determines whether the data stream was
209written in binary mode or not.
210\end{funcdesc}
Guido van Rossumd1883581995-02-15 15:53:08 +0000211
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000212\begin{funcdesc}{dumps}{obj\optional{, protocol}}
Barry Warsawf595fd92001-11-15 23:39:07 +0000213Return the pickled representation of the object as a string, instead
Neal Norwitz12d31e22003-02-13 03:12:48 +0000214of writing it to a file.
215
Andrew M. Kuchling2ee6a702004-08-07 20:25:55 +0000216If the \var{protocol} parameter is omitted, protocol 0 is used.
Neal Norwitzd08baa92003-02-21 00:26:33 +0000217If \var{protocol} is specified as a negative value
218or \constant{HIGHEST_PROTOCOL},
Neal Norwitz12d31e22003-02-13 03:12:48 +0000219the highest protocol version will be used.
220
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000221\versionchanged[The \var{protocol} parameter was added]{2.3}
Neal Norwitz12d31e22003-02-13 03:12:48 +0000222
Barry Warsawf595fd92001-11-15 23:39:07 +0000223\end{funcdesc}
Guido van Rossumd1883581995-02-15 15:53:08 +0000224
Barry Warsawf595fd92001-11-15 23:39:07 +0000225\begin{funcdesc}{loads}{string}
226Read a pickled object hierarchy from a string. Characters in the
227string past the pickled object's representation are ignored.
228\end{funcdesc}
Guido van Rossumd1883581995-02-15 15:53:08 +0000229
Barry Warsawf595fd92001-11-15 23:39:07 +0000230The \module{pickle} module also defines three exceptions:
Guido van Rossum470be141995-03-17 16:07:09 +0000231
Barry Warsawf595fd92001-11-15 23:39:07 +0000232\begin{excdesc}{PickleError}
233A common base class for the other exceptions defined below. This
234inherits from \exception{Exception}.
235\end{excdesc}
Guido van Rossum470be141995-03-17 16:07:09 +0000236
Barry Warsawf595fd92001-11-15 23:39:07 +0000237\begin{excdesc}{PicklingError}
238This exception is raised when an unpicklable object is passed to
239the \method{dump()} method.
240\end{excdesc}
Guido van Rossumd1883581995-02-15 15:53:08 +0000241
Barry Warsawf595fd92001-11-15 23:39:07 +0000242\begin{excdesc}{UnpicklingError}
Andrew M. Kuchling76963442003-05-14 16:51:46 +0000243This exception is raised when there is a problem unpickling an object.
244Note that other exceptions may also be raised during unpickling,
245including (but not necessarily limited to) \exception{AttributeError},
246\exception{EOFError}, \exception{ImportError}, and \exception{IndexError}.
Barry Warsawf595fd92001-11-15 23:39:07 +0000247\end{excdesc}
248
249The \module{pickle} module also exports two callables\footnote{In the
250\module{pickle} module these callables are classes, which you could
Fred Drake7c4d8f32003-09-10 20:47:43 +0000251subclass to customize the behavior. However, in the \refmodule{cPickle}
252module these callables are factory functions and so cannot be
253subclassed. One common reason to subclass is to control what
Andrew M. Kuchling76963442003-05-14 16:51:46 +0000254objects can actually be unpickled. See section~\ref{pickle-sub} for
Fred Drake7c4d8f32003-09-10 20:47:43 +0000255more details.}, \class{Pickler} and \class{Unpickler}:
Barry Warsawf595fd92001-11-15 23:39:07 +0000256
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000257\begin{classdesc}{Pickler}{file\optional{, protocol}}
Barry Warsawf595fd92001-11-15 23:39:07 +0000258This takes a file-like object to which it will write a pickle data
Neal Norwitz12d31e22003-02-13 03:12:48 +0000259stream.
260
Andrew M. Kuchling2ee6a702004-08-07 20:25:55 +0000261If the \var{protocol} parameter is omitted, protocol 0 is used.
Neal Norwitz12d31e22003-02-13 03:12:48 +0000262If \var{protocol} is specified as a negative value,
263the highest protocol version will be used.
264
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000265\versionchanged[Introduced the \var{protocol} parameter]{2.3}
Barry Warsawf595fd92001-11-15 23:39:07 +0000266
267\var{file} must have a \method{write()} method that accepts a single
268string argument. It can thus be an open file object, a
269\module{StringIO} object, or any other custom
270object that meets this interface.
271\end{classdesc}
272
273\class{Pickler} objects define one (or two) public methods:
274
Andrew M. Kuchling2ee6a702004-08-07 20:25:55 +0000275\begin{methoddesc}[Pickler]{dump}{obj}
276Write a pickled representation of \var{obj} to the open file object
Barry Warsawf595fd92001-11-15 23:39:07 +0000277given in the constructor. Either the binary or \ASCII{} format will
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000278be used, depending on the value of the \var{protocol} argument passed to the
Barry Warsawf595fd92001-11-15 23:39:07 +0000279constructor.
280\end{methoddesc}
281
282\begin{methoddesc}[Pickler]{clear_memo}{}
283Clears the pickler's ``memo''. The memo is the data structure that
284remembers which objects the pickler has already seen, so that shared
285or recursive objects pickled by reference and not by value. This
286method is useful when re-using picklers.
287
Fred Drake7f781c92002-05-01 20:33:53 +0000288\begin{notice}
289Prior to Python 2.3, \method{clear_memo()} was only available on the
290picklers created by \refmodule{cPickle}. In the \module{pickle} module,
291picklers have an instance variable called \member{memo} which is a
292Python dictionary. So to clear the memo for a \module{pickle} module
Barry Warsawf595fd92001-11-15 23:39:07 +0000293pickler, you could do the following:
Guido van Rossumd1883581995-02-15 15:53:08 +0000294
Fred Drake19479911998-02-13 06:58:54 +0000295\begin{verbatim}
Barry Warsawf595fd92001-11-15 23:39:07 +0000296mypickler.memo.clear()
Fred Drake19479911998-02-13 06:58:54 +0000297\end{verbatim}
Fred Drake7f781c92002-05-01 20:33:53 +0000298
299Code that does not need to support older versions of Python should
300simply use \method{clear_memo()}.
301\end{notice}
Barry Warsawf595fd92001-11-15 23:39:07 +0000302\end{methoddesc}
Fred Drake9b28fe21998-04-04 06:20:28 +0000303
Barry Warsawf595fd92001-11-15 23:39:07 +0000304It is possible to make multiple calls to the \method{dump()} method of
305the same \class{Pickler} instance. These must then be matched to the
306same number of calls to the \method{load()} method of the
307corresponding \class{Unpickler} instance. If the same object is
308pickled by multiple \method{dump()} calls, the \method{load()} will
Fred Drakef5f0c172003-09-09 19:49:18 +0000309all yield references to the same object.\footnote{\emph{Warning}: this
Barry Warsawf595fd92001-11-15 23:39:07 +0000310is intended for pickling multiple objects without intervening
311modifications to the objects or their parts. If you modify an object
312and then pickle it again using the same \class{Pickler} instance, the
313object is not pickled again --- a reference to it is pickled and the
314\class{Unpickler} will return the old value, not the modified one.
315There are two problems here: (1) detecting changes, and (2)
316marshalling a minimal set of changes. Garbage Collection may also
Fred Drakef5f0c172003-09-09 19:49:18 +0000317become a problem here.}
Guido van Rossum470be141995-03-17 16:07:09 +0000318
Barry Warsawf595fd92001-11-15 23:39:07 +0000319\class{Unpickler} objects are defined as:
Fred Drake9b28fe21998-04-04 06:20:28 +0000320
Barry Warsawf595fd92001-11-15 23:39:07 +0000321\begin{classdesc}{Unpickler}{file}
322This takes a file-like object from which it will read a pickle data
323stream. This class automatically determines whether the data stream
324was written in binary mode or not, so it does not need a flag as in
325the \class{Pickler} factory.
Guido van Rossumd1883581995-02-15 15:53:08 +0000326
Barry Warsawf595fd92001-11-15 23:39:07 +0000327\var{file} must have two methods, a \method{read()} method that takes
328an integer argument, and a \method{readline()} method that requires no
329arguments. Both methods should return a string. Thus \var{file} can
330be a file object opened for reading, a
331\module{StringIO} object, or any other custom
332object that meets this interface.
333\end{classdesc}
Fred Drake9b28fe21998-04-04 06:20:28 +0000334
Barry Warsawf595fd92001-11-15 23:39:07 +0000335\class{Unpickler} objects have one (or two) public methods:
Guido van Rossum470be141995-03-17 16:07:09 +0000336
Barry Warsawf595fd92001-11-15 23:39:07 +0000337\begin{methoddesc}[Unpickler]{load}{}
338Read a pickled object representation from the open file object given
339in the constructor, and return the reconstituted object hierarchy
340specified therein.
341\end{methoddesc}
Fred Drake9b28fe21998-04-04 06:20:28 +0000342
Barry Warsawf595fd92001-11-15 23:39:07 +0000343\begin{methoddesc}[Unpickler]{noload}{}
344This is just like \method{load()} except that it doesn't actually
345create any objects. This is useful primarily for finding what's
346called ``persistent ids'' that may be referenced in a pickle data
347stream. See section~\ref{pickle-protocol} below for more details.
Guido van Rossumd1883581995-02-15 15:53:08 +0000348
Barry Warsawf595fd92001-11-15 23:39:07 +0000349\strong{Note:} the \method{noload()} method is currently only
350available on \class{Unpickler} objects created with the
351\module{cPickle} module. \module{pickle} module \class{Unpickler}s do
352not have the \method{noload()} method.
353\end{methoddesc}
354
355\subsection{What can be pickled and unpickled?}
Guido van Rossum736fe5e1997-12-09 20:45:08 +0000356
Guido van Rossumd1883581995-02-15 15:53:08 +0000357The following types can be pickled:
Fred Drake41796911999-07-02 14:25:37 +0000358
Guido van Rossumd1883581995-02-15 15:53:08 +0000359\begin{itemize}
360
Raymond Hettingeracb45d72002-08-05 03:55:36 +0000361\item \code{None}, \code{True}, and \code{False}
Guido van Rossumd1883581995-02-15 15:53:08 +0000362
Barry Warsawf595fd92001-11-15 23:39:07 +0000363\item integers, long integers, floating point numbers, complex numbers
Guido van Rossumd1883581995-02-15 15:53:08 +0000364
Fred Drake56ced2a2000-04-06 15:04:30 +0000365\item normal and Unicode strings
Guido van Rossumd1883581995-02-15 15:53:08 +0000366
Raymond Hettinger621c53e2004-01-01 05:53:51 +0000367\item tuples, lists, sets, and dictionaries containing only picklable objects
Guido van Rossumd1883581995-02-15 15:53:08 +0000368
Barry Warsawf595fd92001-11-15 23:39:07 +0000369\item functions defined at the top level of a module
Fred Drake38e5d272000-04-03 20:13:55 +0000370
Barry Warsawf595fd92001-11-15 23:39:07 +0000371\item built-in functions defined at the top level of a module
Fred Drake38e5d272000-04-03 20:13:55 +0000372
Barry Warsawf595fd92001-11-15 23:39:07 +0000373\item classes that are defined at the top level of a module
Guido van Rossum470be141995-03-17 16:07:09 +0000374
Fred Drake9b28fe21998-04-04 06:20:28 +0000375\item instances of such classes whose \member{__dict__} or
Barry Warsawf595fd92001-11-15 23:39:07 +0000376\method{__setstate__()} is picklable (see
377section~\ref{pickle-protocol} for details)
Guido van Rossumd1883581995-02-15 15:53:08 +0000378
379\end{itemize}
380
Guido van Rossum470be141995-03-17 16:07:09 +0000381Attempts to pickle unpicklable objects will raise the
Fred Drake9b28fe21998-04-04 06:20:28 +0000382\exception{PicklingError} exception; when this happens, an unspecified
Barry Warsawf595fd92001-11-15 23:39:07 +0000383number of bytes may have already been written to the underlying file.
Georg Brandld90d1c12005-12-26 23:27:49 +0000384Trying to pickle a highly recursive data structure may exceed the
385maximum recursion depth, a \exception{RuntimeError} will be raised
386in this case. You can carefully raise this limit with
387\function{sys.setrecursionlimit()}.
Guido van Rossumd1883581995-02-15 15:53:08 +0000388
Barry Warsawf595fd92001-11-15 23:39:07 +0000389Note that functions (built-in and user-defined) are pickled by ``fully
390qualified'' name reference, not by value. This means that only the
391function name is pickled, along with the name of module the function
392is defined in. Neither the function's code, nor any of its function
393attributes are pickled. Thus the defining module must be importable
394in the unpickling environment, and the module must contain the named
Fred Drakef5f0c172003-09-09 19:49:18 +0000395object, otherwise an exception will be raised.\footnote{The exception
Barry Warsawf595fd92001-11-15 23:39:07 +0000396raised will likely be an \exception{ImportError} or an
Fred Drakef5f0c172003-09-09 19:49:18 +0000397\exception{AttributeError} but it could be something else.}
Guido van Rossum470be141995-03-17 16:07:09 +0000398
Barry Warsawf595fd92001-11-15 23:39:07 +0000399Similarly, classes are pickled by named reference, so the same
400restrictions in the unpickling environment apply. Note that none of
401the class's code or data is pickled, so in the following example the
402class attribute \code{attr} is not restored in the unpickling
403environment:
Guido van Rossum470be141995-03-17 16:07:09 +0000404
Barry Warsawf595fd92001-11-15 23:39:07 +0000405\begin{verbatim}
406class Foo:
407 attr = 'a class attr'
Guido van Rossum470be141995-03-17 16:07:09 +0000408
Barry Warsawf595fd92001-11-15 23:39:07 +0000409picklestring = pickle.dumps(Foo)
410\end{verbatim}
Guido van Rossum470be141995-03-17 16:07:09 +0000411
Barry Warsawf595fd92001-11-15 23:39:07 +0000412These restrictions are why picklable functions and classes must be
413defined in the top level of a module.
Guido van Rossum470be141995-03-17 16:07:09 +0000414
Barry Warsawf595fd92001-11-15 23:39:07 +0000415Similarly, when class instances are pickled, their class's code and
416data are not pickled along with them. Only the instance data are
417pickled. This is done on purpose, so you can fix bugs in a class or
418add methods to the class and still load objects that were created with
419an earlier version of the class. If you plan to have long-lived
420objects that will see many versions of a class, it may be worthwhile
421to put a version number in the objects so that suitable conversions
422can be made by the class's \method{__setstate__()} method.
Guido van Rossum470be141995-03-17 16:07:09 +0000423
Barry Warsawf595fd92001-11-15 23:39:07 +0000424\subsection{The pickle protocol
425\label{pickle-protocol}}\setindexsubitem{(pickle protocol)}
Fred Drake40748961998-03-06 21:27:14 +0000426
Barry Warsawf595fd92001-11-15 23:39:07 +0000427This section describes the ``pickling protocol'' that defines the
428interface between the pickler/unpickler and the objects that are being
429serialized. This protocol provides a standard way for you to define,
430customize, and control how your objects are serialized and
431de-serialized. The description in this section doesn't cover specific
432customizations that you can employ to make the unpickling environment
Andrew M. Kuchling76963442003-05-14 16:51:46 +0000433slightly safer from untrusted pickle data streams; see section~\ref{pickle-sub}
Barry Warsawf595fd92001-11-15 23:39:07 +0000434for more details.
Fred Drake40748961998-03-06 21:27:14 +0000435
Barry Warsawf595fd92001-11-15 23:39:07 +0000436\subsubsection{Pickling and unpickling normal class
437 instances\label{pickle-inst}}
Fred Drake9b28fe21998-04-04 06:20:28 +0000438
Barry Warsawf595fd92001-11-15 23:39:07 +0000439When a pickled class instance is unpickled, its \method{__init__()}
440method is normally \emph{not} invoked. If it is desirable that the
Fred Drake0de77d12004-05-05 04:54:37 +0000441\method{__init__()} method be called on unpickling, an old-style class
442can define a method \method{__getinitargs__()}, which should return a
Barry Warsawf595fd92001-11-15 23:39:07 +0000443\emph{tuple} containing the arguments to be passed to the class
Raymond Hettinger3489cad2004-12-05 05:20:42 +0000444constructor (\method{__init__()} for example). The
Barry Warsawf595fd92001-11-15 23:39:07 +0000445\method{__getinitargs__()} method is called at
446pickle time; the tuple it returns is incorporated in the pickle for
447the instance.
448\withsubitem{(copy protocol)}{\ttindex{__getinitargs__()}}
449\withsubitem{(instance constructor)}{\ttindex{__init__()}}
Fred Drake17e56401998-04-11 20:43:51 +0000450
Fred Drake8aa8c842004-05-05 04:56:06 +0000451\withsubitem{(copy protocol)}{\ttindex{__getnewargs__()}}
452
Fred Drake0de77d12004-05-05 04:54:37 +0000453New-style types can provide a \method{__getnewargs__()} method that is
454used for protocol 2. Implementing this method is needed if the type
455establishes some internal invariants when the instance is created, or
456if the memory allocation is affected by the values passed to the
457\method{__new__()} method for the type (as it is for tuples and
458strings). Instances of a new-style type \class{C} are created using
459
460\begin{alltt}
461obj = C.__new__(C, *\var{args})
462\end{alltt}
463
464where \var{args} is the result of calling \method{__getnewargs__()} on
465the original object; if there is no \method{__getnewargs__()}, an
466empty tuple is assumed.
467
Barry Warsawf595fd92001-11-15 23:39:07 +0000468\withsubitem{(copy protocol)}{
469 \ttindex{__getstate__()}\ttindex{__setstate__()}}
470\withsubitem{(instance attribute)}{
471 \ttindex{__dict__}}
Fred Drake17e56401998-04-11 20:43:51 +0000472
Barry Warsawf595fd92001-11-15 23:39:07 +0000473Classes can further influence how their instances are pickled; if the
474class defines the method \method{__getstate__()}, it is called and the
475return state is pickled as the contents for the instance, instead of
476the contents of the instance's dictionary. If there is no
477\method{__getstate__()} method, the instance's \member{__dict__} is
478pickled.
Fred Drake9463de21998-04-11 20:05:43 +0000479
Barry Warsawf595fd92001-11-15 23:39:07 +0000480Upon unpickling, if the class also defines the method
481\method{__setstate__()}, it is called with the unpickled
Fred Drakef5f0c172003-09-09 19:49:18 +0000482state.\footnote{These methods can also be used to implement copying
483class instances.} If there is no \method{__setstate__()} method, the
Fred Drakee9cfcef2002-11-27 05:26:46 +0000484pickled state must be a dictionary and its items are assigned to the
Barry Warsawf595fd92001-11-15 23:39:07 +0000485new instance's dictionary. If a class defines both
486\method{__getstate__()} and \method{__setstate__()}, the state object
487needn't be a dictionary and these methods can do what they
Fred Drakee9cfcef2002-11-27 05:26:46 +0000488want.\footnote{This protocol is also used by the shallow and deep
Barry Warsawf595fd92001-11-15 23:39:07 +0000489copying operations defined in the
Fred Drakee9cfcef2002-11-27 05:26:46 +0000490\refmodule{copy} module.}
491
492\begin{notice}[warning]
493 For new-style classes, if \method{__getstate__()} returns a false
494 value, the \method{__setstate__()} method will not be called.
495\end{notice}
496
Barry Warsawf595fd92001-11-15 23:39:07 +0000497
498\subsubsection{Pickling and unpickling extension types}
499
500When the \class{Pickler} encounters an object of a type it knows
501nothing about --- such as an extension type --- it looks in two places
502for a hint of how to pickle it. One alternative is for the object to
503implement a \method{__reduce__()} method. If provided, at pickling
504time \method{__reduce__()} will be called with no arguments, and it
505must return either a string or a tuple.
506
507If a string is returned, it names a global variable whose contents are
Andrew M. Kuchlingcbbee6f2004-08-07 16:24:18 +0000508pickled as normal. The string returned by \method{__reduce__} should
509be the object's local name relative to its module; the pickle module
510searches the module namespace to determine the object's module.
511
512When a tuple is returned, it must be between two and five elements
513long. Optional elements can either be omitted, or \code{None} can be provided
514as their value. The semantics of each element are:
Barry Warsawf595fd92001-11-15 23:39:07 +0000515
516\begin{itemize}
517
Andrew M. Kuchlingcbbee6f2004-08-07 16:24:18 +0000518\item A callable object that will be called to create the initial
519version of the object. The next element of the tuple will provide
520arguments for this callable, and later elements provide additional
521state information that will subsequently be used to fully reconstruct
522the pickled date.
523
524In the unpickling environment this object must be either a class, a
525callable registered as a ``safe constructor'' (see below), or it must
526have an attribute \member{__safe_for_unpickling__} with a true value.
527Otherwise, an \exception{UnpicklingError} will be raised in the
528unpickling environment. Note that as usual, the callable itself is
529pickled by name.
Barry Warsawf595fd92001-11-15 23:39:07 +0000530
Raymond Hettingera6b45cc2004-12-07 07:05:57 +0000531\item A tuple of arguments for the callable object.
532\versionchanged[Formerly, this argument could also be \code{None}]{2.5}
Andrew M. Kuchlingcbbee6f2004-08-07 16:24:18 +0000533
Barry Warsawf595fd92001-11-15 23:39:07 +0000534\item Optionally, the object's state, which will be passed to
535 the object's \method{__setstate__()} method as described in
536 section~\ref{pickle-inst}. If the object has no
537 \method{__setstate__()} method, then, as above, the value must
538 be a dictionary and it will be added to the object's
539 \member{__dict__}.
540
Andrew M. Kuchling14d535c2004-08-07 15:49:24 +0000541\item Optionally, an iterator (and not a sequence) yielding successive
542list items. These list items will be pickled, and appended to the
543object using either \code{obj.append(\var{item})} or
544\code{obj.extend(\var{list_of_items})}. This is primarily used for
545list subclasses, but may be used by other classes as long as they have
546\method{append()} and \method{extend()} methods with the appropriate
547signature. (Whether \method{append()} or \method{extend()} is used
548depends on which pickle protocol version is used as well as the number
549of items to append, so both must be supported.)
550
551\item Optionally, an iterator (not a sequence)
552yielding successive dictionary items, which should be tuples of the
553form \code{(\var{key}, \var{value})}. These items will be pickled
554and stored to the object using \code{obj[\var{key}] = \var{value}}.
555This is primarily used for dictionary subclasses, but may be used by
556other classes as long as they implement \method{__setitem__}.
557
Barry Warsawf595fd92001-11-15 23:39:07 +0000558\end{itemize}
559
Andrew M. Kuchling14d535c2004-08-07 15:49:24 +0000560It is sometimes useful to know the protocol version when implementing
561\method{__reduce__}. This can be done by implementing a method named
562\method{__reduce_ex__} instead of \method{__reduce__}.
563\method{__reduce_ex__}, when it exists, is called in preference over
564\method{__reduce__} (you may still provide \method{__reduce__} for
565backwards compatibility). The \method{__reduce_ex__} method will be
566called with a single integer argument, the protocol version.
567
568The \class{object} class implements both \method{__reduce__} and
569\method{__reduce_ex__}; however, if a subclass overrides
570\method{__reduce__} but not \method{__reduce_ex__}, the
571\method{__reduce_ex__} implementation detects this and calls
572\method{__reduce__}.
573
Andrew M. Kuchlingcbbee6f2004-08-07 16:24:18 +0000574An alternative to implementing a \method{__reduce__()} method on the
575object to be pickled, is to register the callable with the
576\refmodule[copyreg]{copy_reg} module. This module provides a way
577for programs to register ``reduction functions'' and constructors for
578user-defined types. Reduction functions have the same semantics and
579interface as the \method{__reduce__()} method described above, except
580that they are called with a single argument, the object to be pickled.
581
582The registered constructor is deemed a ``safe constructor'' for purposes
583of unpickling as described above.
Andrew M. Kuchling14d535c2004-08-07 15:49:24 +0000584
Barry Warsawf595fd92001-11-15 23:39:07 +0000585
586\subsubsection{Pickling and unpickling external objects}
587
588For the benefit of object persistence, the \module{pickle} module
589supports the notion of a reference to an object outside the pickled
590data stream. Such objects are referenced by a ``persistent id'',
591which is just an arbitrary string of printable \ASCII{} characters.
592The resolution of such names is not defined by the \module{pickle}
593module; it will delegate this resolution to user defined functions on
Fred Drakef5f0c172003-09-09 19:49:18 +0000594the pickler and unpickler.\footnote{The actual mechanism for
Barry Warsawf595fd92001-11-15 23:39:07 +0000595associating these user defined functions is slightly different for
596\module{pickle} and \module{cPickle}. The description given here
597works the same for both implementations. Users of the \module{pickle}
598module could also use subclassing to effect the same results,
599overriding the \method{persistent_id()} and \method{persistent_load()}
Fred Drakef5f0c172003-09-09 19:49:18 +0000600methods in the derived classes.}
Barry Warsawf595fd92001-11-15 23:39:07 +0000601
602To define external persistent id resolution, you need to set the
603\member{persistent_id} attribute of the pickler object and the
604\member{persistent_load} attribute of the unpickler object.
605
606To pickle objects that have an external persistent id, the pickler
607must have a custom \function{persistent_id()} method that takes an
608object as an argument and returns either \code{None} or the persistent
609id for that object. When \code{None} is returned, the pickler simply
610pickles the object as normal. When a persistent id string is
611returned, the pickler will pickle that string, along with a marker
612so that the unpickler will recognize the string as a persistent id.
613
614To unpickle external objects, the unpickler must have a custom
615\function{persistent_load()} function that takes a persistent id
616string and returns the referenced object.
617
618Here's a silly example that \emph{might} shed more light:
619
620\begin{verbatim}
621import pickle
622from cStringIO import StringIO
623
624src = StringIO()
625p = pickle.Pickler(src)
626
627def persistent_id(obj):
628 if hasattr(obj, 'x'):
629 return 'the value %d' % obj.x
630 else:
631 return None
632
633p.persistent_id = persistent_id
634
635class Integer:
636 def __init__(self, x):
637 self.x = x
638 def __str__(self):
639 return 'My name is integer %d' % self.x
640
641i = Integer(7)
642print i
643p.dump(i)
644
645datastream = src.getvalue()
646print repr(datastream)
647dst = StringIO(datastream)
648
649up = pickle.Unpickler(dst)
650
651class FancyInteger(Integer):
652 def __str__(self):
653 return 'I am the integer %d' % self.x
654
655def persistent_load(persid):
656 if persid.startswith('the value '):
657 value = int(persid.split()[2])
658 return FancyInteger(value)
659 else:
660 raise pickle.UnpicklingError, 'Invalid persistent id'
661
662up.persistent_load = persistent_load
663
664j = up.load()
665print j
666\end{verbatim}
667
668In the \module{cPickle} module, the unpickler's
669\member{persistent_load} attribute can also be set to a Python
670list, in which case, when the unpickler reaches a persistent id, the
671persistent id string will simply be appended to this list. This
672functionality exists so that a pickle data stream can be ``sniffed''
673for object references without actually instantiating all the objects
Fred Drakef5f0c172003-09-09 19:49:18 +0000674in a pickle.\footnote{We'll leave you with the image of Guido and Jim
675sitting around sniffing pickles in their living rooms.} Setting
Barry Warsawf595fd92001-11-15 23:39:07 +0000676\member{persistent_load} to a list is usually used in conjunction with
677the \method{noload()} method on the Unpickler.
678
679% BAW: Both pickle and cPickle support something called
680% inst_persistent_id() which appears to give unknown types a second
681% shot at producing a persistent id. Since Jim Fulton can't remember
682% why it was added or what it's for, I'm leaving it undocumented.
683
Andrew M. Kuchling76963442003-05-14 16:51:46 +0000684\subsection{Subclassing Unpicklers \label{pickle-sub}}
Barry Warsawf595fd92001-11-15 23:39:07 +0000685
Andrew M. Kuchling76963442003-05-14 16:51:46 +0000686By default, unpickling will import any class that it finds in the
687pickle data. You can control exactly what gets unpickled and what
688gets called by customizing your unpickler. Unfortunately, exactly how
689you do this is different depending on whether you're using
690\module{pickle} or \module{cPickle}.\footnote{A word of caution: the
Barry Warsawf595fd92001-11-15 23:39:07 +0000691mechanisms described here use internal attributes and methods, which
692are subject to change in future versions of Python. We intend to
693someday provide a common interface for controlling this behavior,
Fred Drakef5f0c172003-09-09 19:49:18 +0000694which will work in either \module{pickle} or \module{cPickle}.}
Barry Warsawf595fd92001-11-15 23:39:07 +0000695
696In the \module{pickle} module, you need to derive a subclass from
697\class{Unpickler}, overriding the \method{load_global()}
698method. \method{load_global()} should read two lines from the pickle
Raymond Hettingerf17d65d2003-08-12 00:01:16 +0000699data stream where the first line will the name of the module
Barry Warsawf595fd92001-11-15 23:39:07 +0000700containing the class and the second line will be the name of the
Andrew M. Kuchling76963442003-05-14 16:51:46 +0000701instance's class. It then looks up the class, possibly importing the
Barry Warsawf595fd92001-11-15 23:39:07 +0000702module and digging out the attribute, then it appends what it finds to
703the unpickler's stack. Later on, this class will be assigned to the
704\member{__class__} attribute of an empty class, as a way of magically
705creating an instance without calling its class's \method{__init__()}.
Andrew M. Kuchling76963442003-05-14 16:51:46 +0000706Your job (should you choose to accept it), would be to have
Barry Warsawf595fd92001-11-15 23:39:07 +0000707\method{load_global()} push onto the unpickler's stack, a known safe
708version of any class you deem safe to unpickle. It is up to you to
709produce such a class. Or you could raise an error if you want to
710disallow all unpickling of instances. If this sounds like a hack,
Andrew M. Kuchling76963442003-05-14 16:51:46 +0000711you're right. Refer to the source code to make this work.
Barry Warsawf595fd92001-11-15 23:39:07 +0000712
713Things are a little cleaner with \module{cPickle}, but not by much.
714To control what gets unpickled, you can set the unpickler's
715\member{find_global} attribute to a function or \code{None}. If it is
716\code{None} then any attempts to unpickle instances will raise an
717\exception{UnpicklingError}. If it is a function,
718then it should accept a module name and a class name, and return the
719corresponding class object. It is responsible for looking up the
Andrew M. Kuchling76963442003-05-14 16:51:46 +0000720class and performing any necessary imports, and it may raise an
Barry Warsawf595fd92001-11-15 23:39:07 +0000721error to prevent instances of the class from being unpickled.
722
723The moral of the story is that you should be really careful about the
724source of the strings your application unpickles.
Fred Drake9463de21998-04-11 20:05:43 +0000725
Fred Drake38e5d272000-04-03 20:13:55 +0000726\subsection{Example \label{pickle-example}}
727
728Here's a simple example of how to modify pickling behavior for a
729class. The \class{TextReader} class opens a text file, and returns
730the line number and line contents each time its \method{readline()}
731method is called. If a \class{TextReader} instance is pickled, all
732attributes \emph{except} the file object member are saved. When the
733instance is unpickled, the file is reopened, and reading resumes from
734the last location. The \method{__setstate__()} and
735\method{__getstate__()} methods are used to implement this behavior.
736
737\begin{verbatim}
Fred Drake38e5d272000-04-03 20:13:55 +0000738class TextReader:
Fred Drakec8252802001-09-25 16:29:17 +0000739 """Print and number lines in a text file."""
740 def __init__(self, file):
Fred Drake38e5d272000-04-03 20:13:55 +0000741 self.file = file
Fred Drakec8252802001-09-25 16:29:17 +0000742 self.fh = open(file)
Fred Drake38e5d272000-04-03 20:13:55 +0000743 self.lineno = 0
744
745 def readline(self):
746 self.lineno = self.lineno + 1
747 line = self.fh.readline()
748 if not line:
749 return None
Fred Drakec8252802001-09-25 16:29:17 +0000750 if line.endswith("\n"):
751 line = line[:-1]
752 return "%d: %s" % (self.lineno, line)
Fred Drake38e5d272000-04-03 20:13:55 +0000753
Fred Drake38e5d272000-04-03 20:13:55 +0000754 def __getstate__(self):
Fred Drakec8252802001-09-25 16:29:17 +0000755 odict = self.__dict__.copy() # copy the dict since we change it
756 del odict['fh'] # remove filehandle entry
Fred Drake38e5d272000-04-03 20:13:55 +0000757 return odict
758
Fred Drake38e5d272000-04-03 20:13:55 +0000759 def __setstate__(self,dict):
Fred Drakec8252802001-09-25 16:29:17 +0000760 fh = open(dict['file']) # reopen file
761 count = dict['lineno'] # read from file...
762 while count: # until line count is restored
Fred Drake38e5d272000-04-03 20:13:55 +0000763 fh.readline()
764 count = count - 1
Fred Drakec8252802001-09-25 16:29:17 +0000765 self.__dict__.update(dict) # update attributes
766 self.fh = fh # save the file object
Fred Drake38e5d272000-04-03 20:13:55 +0000767\end{verbatim}
768
769A sample usage might be something like this:
770
771\begin{verbatim}
772>>> import TextReader
773>>> obj = TextReader.TextReader("TextReader.py")
774>>> obj.readline()
775'1: #!/usr/local/bin/python'
776>>> # (more invocations of obj.readline() here)
777... obj.readline()
778'7: class TextReader:'
779>>> import pickle
780>>> pickle.dump(obj,open('save.p','w'))
Fred Drakec8252802001-09-25 16:29:17 +0000781\end{verbatim}
Fred Drake38e5d272000-04-03 20:13:55 +0000782
Fred Drakec8252802001-09-25 16:29:17 +0000783If you want to see that \refmodule{pickle} works across Python
784processes, start another Python session, before continuing. What
785follows can happen from either the same process or a new process.
Fred Drake38e5d272000-04-03 20:13:55 +0000786
Fred Drakec8252802001-09-25 16:29:17 +0000787\begin{verbatim}
Fred Drake38e5d272000-04-03 20:13:55 +0000788>>> import pickle
789>>> reader = pickle.load(open('save.p'))
790>>> reader.readline()
791'8: "Print and number lines in a text file."'
792\end{verbatim}
793
794
Barry Warsawf595fd92001-11-15 23:39:07 +0000795\begin{seealso}
796 \seemodule[copyreg]{copy_reg}{Pickle interface constructor
797 registration for extension types.}
798
799 \seemodule{shelve}{Indexed databases of objects; uses \module{pickle}.}
800
801 \seemodule{copy}{Shallow and deep object copying.}
802
803 \seemodule{marshal}{High-performance serialization of built-in types.}
804\end{seealso}
805
806
807\section{\module{cPickle} --- A faster \module{pickle}}
Fred Drakeffbe6871999-04-22 21:23:22 +0000808
Fred Drakeb91e9341998-07-23 17:59:49 +0000809\declaremodule{builtin}{cPickle}
Fred Drake38e5d272000-04-03 20:13:55 +0000810\modulesynopsis{Faster version of \refmodule{pickle}, but not subclassable.}
Andrew M. Kuchlingc62af022004-01-08 15:01:08 +0000811\moduleauthor{Jim Fulton}{jim@zope.com}
Fred Drakeffbe6871999-04-22 21:23:22 +0000812\sectionauthor{Fred L. Drake, Jr.}{fdrake@acm.org}
Fred Drakeb91e9341998-07-23 17:59:49 +0000813
Barry Warsawf595fd92001-11-15 23:39:07 +0000814The \module{cPickle} module supports serialization and
815de-serialization of Python objects, providing an interface and
816functionality nearly identical to the
817\refmodule{pickle}\refstmodindex{pickle} module. There are several
818differences, the most important being performance and subclassability.
Fred Drake9463de21998-04-11 20:05:43 +0000819
Barry Warsawf595fd92001-11-15 23:39:07 +0000820First, \module{cPickle} can be up to 1000 times faster than
821\module{pickle} because the former is implemented in C. Second, in
822the \module{cPickle} module the callables \function{Pickler()} and
823\function{Unpickler()} are functions, not classes. This means that
824you cannot use them to derive custom pickling and unpickling
825subclasses. Most applications have no need for this functionality and
826should benefit from the greatly improved performance of the
827\module{cPickle} module.
Fred Drake9463de21998-04-11 20:05:43 +0000828
Barry Warsawf595fd92001-11-15 23:39:07 +0000829The pickle data stream produced by \module{pickle} and
830\module{cPickle} are identical, so it is possible to use
831\module{pickle} and \module{cPickle} interchangeably with existing
Fred Drakef5f0c172003-09-09 19:49:18 +0000832pickles.\footnote{Since the pickle data format is actually a tiny
Barry Warsawf595fd92001-11-15 23:39:07 +0000833stack-oriented programming language, and some freedom is taken in the
834encodings of certain objects, it is possible that the two modules
835produce different data streams for the same input objects. However it
836is guaranteed that they will always be able to read each other's
Fred Drakef5f0c172003-09-09 19:49:18 +0000837data streams.}
Guido van Rossumcf3ce921999-01-06 23:34:39 +0000838
Barry Warsawf595fd92001-11-15 23:39:07 +0000839There are additional minor differences in API between \module{cPickle}
840and \module{pickle}, however for most applications, they are
Johannes Gijsbersf4a70f32004-12-12 16:52:40 +0000841interchangeable. More documentation is provided in the
Barry Warsawf595fd92001-11-15 23:39:07 +0000842\module{pickle} module documentation, which
843includes a list of the documented differences.
844
845