blob: f4dfc1f04ff7af36564b68bbf07320749e0da2a9 [file] [log] [blame]
Georg Brandl8ec7f652007-08-15 14:28:01 +00001
2:mod:`marshal` --- Internal Python object serialization
3=======================================================
4
5.. module:: marshal
6 :synopsis: Convert Python objects to streams of bytes and back (with different
7 constraints).
8
9
10This module contains functions that can read and write Python values in a binary
11format. The format is specific to Python, but independent of machine
12architecture issues (e.g., you can write a Python value to a file on a PC,
13transport the file to a Sun, and read it back there). Details of the format are
14undocumented on purpose; it may change between Python versions (although it
15rarely does). [#]_
16
17.. index::
18 module: pickle
19 module: shelve
20 object: code
21
22This is not a general "persistence" module. For general persistence and
23transfer of Python objects through RPC calls, see the modules :mod:`pickle` and
24:mod:`shelve`. The :mod:`marshal` module exists mainly to support reading and
25writing the "pseudo-compiled" code for Python modules of :file:`.pyc` files.
26Therefore, the Python maintainers reserve the right to modify the marshal format
27in backward incompatible ways should the need arise. If you're serializing and
Raymond Hettinger84e26b62007-10-31 21:57:58 +000028de-serializing Python objects, use the :mod:`pickle` module instead -- the
29performance is comparable, version independence is guaranteed, and pickle
30supports a substantially wider range of objects than marshal.
Georg Brandl8ec7f652007-08-15 14:28:01 +000031
32.. warning::
33
34 The :mod:`marshal` module is not intended to be secure against erroneous or
35 maliciously constructed data. Never unmarshal data received from an
36 untrusted or unauthenticated source.
37
38Not all Python object types are supported; in general, only objects whose value
39is independent from a particular invocation of Python can be written and read by
Georg Brandlaf795e52009-09-03 12:31:39 +000040this module. The following types are supported: booleans, integers, long
41integers, floating point numbers, complex numbers, strings, Unicode objects,
42tuples, lists, sets, frozensets, dictionaries, and code objects, where it should
43be understood that tuples, lists, sets, frozensets and dictionaries are only
44supported as long as the values contained therein are themselves supported; and
45recursive lists, sets and dictionaries should not be written (they will cause
46infinite loops). The singletons :const:`None`, :const:`Ellipsis` and
47:exc:`StopIteration` can also be marshalled and unmarshalled.
Georg Brandl8ec7f652007-08-15 14:28:01 +000048
Georg Brandlbf863b12007-08-15 19:06:04 +000049.. warning::
Georg Brandlc62ef8b2009-01-03 20:55:06 +000050
Georg Brandlbf863b12007-08-15 19:06:04 +000051 On machines where C's ``long int`` type has more than 32 bits (such as the
52 DEC Alpha), it is possible to create plain Python integers that are longer
53 than 32 bits. If such an integer is marshaled and read back in on a machine
54 where C's ``long int`` type has only 32 bits, a Python long integer object
55 is returned instead. While of a different type, the numeric value is the
56 same. (This behavior is new in Python 2.2. In earlier versions, all but the
57 least-significant 32 bits of the value were lost, and a warning message was
58 printed.)
Georg Brandl8ec7f652007-08-15 14:28:01 +000059
60There are functions that read/write files as well as functions operating on
61strings.
62
63The module defines these functions:
64
65
66.. function:: dump(value, file[, version])
67
68 Write the value on the open file. The value must be a supported type. The
Tim Goldene9864c52014-04-29 16:11:18 +010069 file must be a open file object such as ``sys.stdout`` or returned by
70 :func:`open` or :func:`os.popen`. It may not be a wrapper such as
71 TemporaryFile on Windows. It must be opened in binary mode (``'wb'``
Georg Brandl8ec7f652007-08-15 14:28:01 +000072 or ``'w+b'``).
73
74 If the value has (or contains an object that has) an unsupported type, a
75 :exc:`ValueError` exception is raised --- but garbage data will also be written
76 to the file. The object will not be properly read back by :func:`load`.
77
78 .. versionadded:: 2.4
79 The *version* argument indicates the data format that ``dump`` should use
80 (see below).
81
82
83.. function:: load(file)
84
85 Read one value from the open file and return it. If no valid value is read
86 (e.g. because the data has a different Python version's incompatible marshal
87 format), raise :exc:`EOFError`, :exc:`ValueError` or :exc:`TypeError`. The
88 file must be an open file object opened in binary mode (``'rb'`` or
89 ``'r+b'``).
90
Georg Brandl16a57f62009-04-27 15:29:09 +000091 .. note::
Georg Brandl8ec7f652007-08-15 14:28:01 +000092
93 If an object containing an unsupported type was marshalled with :func:`dump`,
94 :func:`load` will substitute ``None`` for the unmarshallable type.
95
96
97.. function:: dumps(value[, version])
98
99 Return the string that would be written to a file by ``dump(value, file)``. The
100 value must be a supported type. Raise a :exc:`ValueError` exception if value
101 has (or contains an object that has) an unsupported type.
102
103 .. versionadded:: 2.4
104 The *version* argument indicates the data format that ``dumps`` should use
105 (see below).
106
107
108.. function:: loads(string)
109
110 Convert the string to a value. If no valid value is found, raise
111 :exc:`EOFError`, :exc:`ValueError` or :exc:`TypeError`. Extra characters in the
112 string are ignored.
113
114
115In addition, the following constants are defined:
116
117.. data:: version
118
119 Indicates the format that the module uses. Version 0 is the historical format,
120 version 1 (added in Python 2.4) shares interned strings and version 2 (added in
121 Python 2.5) uses a binary format for floating point numbers. The current version
122 is 2.
123
124 .. versionadded:: 2.4
125
126
127.. rubric:: Footnotes
128
129.. [#] The name of this module stems from a bit of terminology used by the designers of
130 Modula-3 (amongst others), who use the term "marshalling" for shipping of data
131 around in a self-contained form. Strictly speaking, "to marshal" means to
132 convert some data from internal to external form (in an RPC buffer for instance)
133 and "unmarshalling" for the reverse process.
134