blob: 00bbab1780e67d3a686a4b6fe8a36972e0ced3d9 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001
2:mod:`marshal` --- Internal Python object serialization
3=======================================================
4
5.. module:: marshal
6 :synopsis: Convert Python objects to streams of bytes and back (with different
7 constraints).
8
9
10This module contains functions that can read and write Python values in a binary
11format. The format is specific to Python, but independent of machine
12architecture issues (e.g., you can write a Python value to a file on a PC,
13transport the file to a Sun, and read it back there). Details of the format are
14undocumented on purpose; it may change between Python versions (although it
15rarely does). [#]_
16
17.. index::
18 module: pickle
19 module: shelve
20 object: code
21
22This is not a general "persistence" module. For general persistence and
23transfer of Python objects through RPC calls, see the modules :mod:`pickle` and
24:mod:`shelve`. The :mod:`marshal` module exists mainly to support reading and
25writing the "pseudo-compiled" code for Python modules of :file:`.pyc` files.
26Therefore, the Python maintainers reserve the right to modify the marshal format
27in backward incompatible ways should the need arise. If you're serializing and
28de-serializing Python objects, use the :mod:`pickle` module instead.
29
30.. warning::
31
32 The :mod:`marshal` module is not intended to be secure against erroneous or
33 maliciously constructed data. Never unmarshal data received from an
34 untrusted or unauthenticated source.
35
36Not all Python object types are supported; in general, only objects whose value
37is independent from a particular invocation of Python can be written and read by
38this module. The following types are supported: ``None``, integers, long
39integers, floating point numbers, strings, Unicode objects, tuples, lists,
40dictionaries, and code objects, where it should be understood that tuples, lists
41and dictionaries are only supported as long as the values contained therein are
42themselves supported; and recursive lists and dictionaries should not be written
43(they will cause infinite loops).
44
Guido van Rossumda27fd22007-08-17 00:24:54 +000045.. warning::
46
47 On machines where C's ``long int`` type has more than 32 bits (such as the
48 DEC Alpha), it is possible to create plain Python integers that are longer
49 than 32 bits. If such an integer is marshaled and read back in on a machine
50 where C's ``long int`` type has only 32 bits, a Python long integer object
51 is returned instead. While of a different type, the numeric value is the
52 same. (This behavior is new in Python 2.2. In earlier versions, all but the
53 least-significant 32 bits of the value were lost, and a warning message was
54 printed.)
Georg Brandl116aa622007-08-15 14:28:22 +000055
56There are functions that read/write files as well as functions operating on
57strings.
58
59The module defines these functions:
60
61
62.. function:: dump(value, file[, version])
63
64 Write the value on the open file. The value must be a supported type. The
65 file must be an open file object such as ``sys.stdout`` or returned by
66 :func:`open` or :func:`os.popen`. It must be opened in binary mode (``'wb'``
67 or ``'w+b'``).
68
69 If the value has (or contains an object that has) an unsupported type, a
70 :exc:`ValueError` exception is raised --- but garbage data will also be written
71 to the file. The object will not be properly read back by :func:`load`.
72
Georg Brandl55ac8f02007-09-01 13:51:09 +000073 The *version* argument indicates the data format that ``dump`` should use
74 (see below).
Georg Brandl116aa622007-08-15 14:28:22 +000075
76
77.. function:: load(file)
78
79 Read one value from the open file and return it. If no valid value is read
80 (e.g. because the data has a different Python version's incompatible marshal
81 format), raise :exc:`EOFError`, :exc:`ValueError` or :exc:`TypeError`. The
82 file must be an open file object opened in binary mode (``'rb'`` or
83 ``'r+b'``).
84
85 .. warning::
86
87 If an object containing an unsupported type was marshalled with :func:`dump`,
88 :func:`load` will substitute ``None`` for the unmarshallable type.
89
90
91.. function:: dumps(value[, version])
92
93 Return the string that would be written to a file by ``dump(value, file)``. The
94 value must be a supported type. Raise a :exc:`ValueError` exception if value
95 has (or contains an object that has) an unsupported type.
96
Georg Brandl55ac8f02007-09-01 13:51:09 +000097 The *version* argument indicates the data format that ``dumps`` should use
98 (see below).
Georg Brandl116aa622007-08-15 14:28:22 +000099
100
101.. function:: loads(string)
102
103 Convert the string to a value. If no valid value is found, raise
104 :exc:`EOFError`, :exc:`ValueError` or :exc:`TypeError`. Extra characters in the
105 string are ignored.
106
107
108In addition, the following constants are defined:
109
110.. data:: version
111
112 Indicates the format that the module uses. Version 0 is the historical format,
113 version 1 (added in Python 2.4) shares interned strings and version 2 (added in
114 Python 2.5) uses a binary format for floating point numbers. The current version
115 is 2.
116
Georg Brandl116aa622007-08-15 14:28:22 +0000117
118.. rubric:: Footnotes
119
120.. [#] The name of this module stems from a bit of terminology used by the designers of
121 Modula-3 (amongst others), who use the term "marshalling" for shipping of data
122 around in a self-contained form. Strictly speaking, "to marshal" means to
123 convert some data from internal to external form (in an RPC buffer for instance)
124 and "unmarshalling" for the reverse process.
125