| Fred Drake | 295da24 | 1998-08-10 19:42:37 +0000 | [diff] [blame] | 1 | \section{\module{marshal} --- | 
| Barry Warsaw | 69b2d75 | 2001-11-15 23:55:12 +0000 | [diff] [blame] | 2 | Internal Python object serialization} | 
| Fred Drake | b91e934 | 1998-07-23 17:59:49 +0000 | [diff] [blame] | 3 |  | 
| Fred Drake | ffbe687 | 1999-04-22 21:23:22 +0000 | [diff] [blame] | 4 | \declaremodule{builtin}{marshal} | 
| Fred Drake | 295da24 | 1998-08-10 19:42:37 +0000 | [diff] [blame] | 5 | \modulesynopsis{Convert Python objects to streams of bytes and back | 
| Fred Drake | ffbe687 | 1999-04-22 21:23:22 +0000 | [diff] [blame] | 6 | (with different constraints).} | 
| Fred Drake | b91e934 | 1998-07-23 17:59:49 +0000 | [diff] [blame] | 7 |  | 
| Fred Drake | 0c2af2b | 1998-03-08 06:28:00 +0000 | [diff] [blame] | 8 |  | 
| Guido van Rossum | 5fdeeea | 1994-01-02 01:22:07 +0000 | [diff] [blame] | 9 | This module contains functions that can read and write Python | 
|  | 10 | values in a binary format.  The format is specific to Python, but | 
|  | 11 | independent of machine architecture issues (e.g., you can write a | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 12 | Python value to a file on a PC, transport the file to a Sun, and read | 
|  | 13 | it back there).  Details of the format are undocumented on purpose; | 
| Fred Drake | ea003fc | 1999-04-05 21:59:15 +0000 | [diff] [blame] | 14 | it may change between Python versions (although it rarely | 
|  | 15 | does).\footnote{The name of this module stems from a bit of | 
|  | 16 | terminology used by the designers of Modula-3 (amongst others), who | 
|  | 17 | use the term ``marshalling'' for shipping of data around in a | 
|  | 18 | self-contained form. Strictly speaking, ``to marshal'' means to | 
|  | 19 | convert some data from internal to external form (in an RPC buffer for | 
|  | 20 | instance) and ``unmarshalling'' for the reverse process.} | 
| Guido van Rossum | 5fdeeea | 1994-01-02 01:22:07 +0000 | [diff] [blame] | 21 |  | 
| Thomas Wouters | f831663 | 2000-07-16 19:01:10 +0000 | [diff] [blame] | 22 | This is not a general ``persistence'' module.  For general persistence | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 23 | and transfer of Python objects through RPC calls, see the modules | 
| Fred Drake | ffbe687 | 1999-04-22 21:23:22 +0000 | [diff] [blame] | 24 | \refmodule{pickle} and \refmodule{shelve}.  The \module{marshal} module exists | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 25 | mainly to support reading and writing the ``pseudo-compiled'' code for | 
| Barry Warsaw | 69b2d75 | 2001-11-15 23:55:12 +0000 | [diff] [blame] | 26 | Python modules of \file{.pyc} files.  Therefore, the Python | 
|  | 27 | maintainers reserve the right to modify the marshal format in backward | 
|  | 28 | incompatible ways should the need arise.  If you're serializing and | 
|  | 29 | de-serializing Python objects, use the \module{pickle} module.  There | 
|  | 30 | may also be unknown security problems with | 
|  | 31 | \module{marshal}\footnote{As opposed to the known security issues in | 
|  | 32 | the \module{pickle} module!}. | 
| Fred Drake | 54820dc | 1997-12-15 21:56:05 +0000 | [diff] [blame] | 33 | \refstmodindex{pickle} | 
|  | 34 | \refstmodindex{shelve} | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 35 | \obindex{code} | 
| Guido van Rossum | 5fdeeea | 1994-01-02 01:22:07 +0000 | [diff] [blame] | 36 |  | 
|  | 37 | Not all Python object types are supported; in general, only objects | 
|  | 38 | whose value is independent from a particular invocation of Python can | 
|  | 39 | be written and read by this module.  The following types are supported: | 
|  | 40 | \code{None}, integers, long integers, floating point numbers, | 
| Fred Drake | 61098f2 | 2000-04-06 14:47:20 +0000 | [diff] [blame] | 41 | strings, Unicode objects, tuples, lists, dictionaries, and code | 
|  | 42 | objects, where it should be understood that tuples, lists and | 
|  | 43 | dictionaries are only supported as long as the values contained | 
|  | 44 | therein are themselves supported; and recursive lists and dictionaries | 
|  | 45 | should not be written (they will cause infinite loops). | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 46 |  | 
| Fred Drake | af8a015 | 1998-01-14 14:51:31 +0000 | [diff] [blame] | 47 | \strong{Caveat:} On machines where C's \code{long int} type has more than | 
| Tim Peters | ad2dc3f | 2001-09-14 20:40:13 +0000 | [diff] [blame] | 48 | 32 bits (such as the DEC Alpha), it is possible to create plain Python | 
|  | 49 | integers that are longer than 32 bits. | 
|  | 50 | If such an integer is marshaled and read back in on a machine where | 
|  | 51 | C's \code{long int} type has only 32 bits, a Python long integer object | 
|  | 52 | is returned instead.  While of a different type, the numeric value is | 
|  | 53 | the same.  (This behavior is new in Python 2.2.  In earlier versions, | 
|  | 54 | all but the least-significant 32 bits of the value were lost, and a | 
|  | 55 | warning message was printed.) | 
| Guido van Rossum | 5fdeeea | 1994-01-02 01:22:07 +0000 | [diff] [blame] | 56 |  | 
|  | 57 | There are functions that read/write files as well as functions | 
|  | 58 | operating on strings. | 
|  | 59 |  | 
|  | 60 | The module defines these functions: | 
|  | 61 |  | 
| Fred Drake | 0c2af2b | 1998-03-08 06:28:00 +0000 | [diff] [blame] | 62 | \begin{funcdesc}{dump}{value, file} | 
| Guido van Rossum | 5fdeeea | 1994-01-02 01:22:07 +0000 | [diff] [blame] | 63 | Write the value on the open file.  The value must be a supported | 
|  | 64 | type.  The file must be an open file object such as | 
| Fred Drake | 7506298 | 1998-02-16 20:40:37 +0000 | [diff] [blame] | 65 | \code{sys.stdout} or returned by \function{open()} or | 
| Fred Drake | 38e5d27 | 2000-04-03 20:13:55 +0000 | [diff] [blame] | 66 | \function{posix.popen()}.  It must be opened in binary mode | 
|  | 67 | (\code{'wb'} or \code{'w+b'}). | 
| Fred Drake | 7506298 | 1998-02-16 20:40:37 +0000 | [diff] [blame] | 68 |  | 
| Guido van Rossum | bbb1e26 | 1996-06-26 20:20:57 +0000 | [diff] [blame] | 69 | If the value has (or contains an object that has) an unsupported type, | 
| Fred Drake | 0c2af2b | 1998-03-08 06:28:00 +0000 | [diff] [blame] | 70 | a \exception{ValueError} exception is raised --- but garbage data | 
| Fred Drake | 7506298 | 1998-02-16 20:40:37 +0000 | [diff] [blame] | 71 | will also be written to the file.  The object will not be properly | 
|  | 72 | read back by \function{load()}. | 
| Guido van Rossum | 5fdeeea | 1994-01-02 01:22:07 +0000 | [diff] [blame] | 73 | \end{funcdesc} | 
|  | 74 |  | 
|  | 75 | \begin{funcdesc}{load}{file} | 
|  | 76 | Read one value from the open file and return it.  If no valid value | 
| Fred Drake | 7506298 | 1998-02-16 20:40:37 +0000 | [diff] [blame] | 77 | is read, raise \exception{EOFError}, \exception{ValueError} or | 
| Fred Drake | 38e5d27 | 2000-04-03 20:13:55 +0000 | [diff] [blame] | 78 | \exception{TypeError}.  The file must be an open file object opened | 
|  | 79 | in binary mode (\code{'rb'} or \code{'r+b'}). | 
| Guido van Rossum | bbb1e26 | 1996-06-26 20:20:57 +0000 | [diff] [blame] | 80 |  | 
| Fred Drake | 0aa811c | 2001-10-20 04:24:09 +0000 | [diff] [blame] | 81 | \warning{If an object containing an unsupported type was | 
| Fred Drake | 0c2af2b | 1998-03-08 06:28:00 +0000 | [diff] [blame] | 82 | marshalled with \function{dump()}, \function{load()} will substitute | 
| Fred Drake | 0aa811c | 2001-10-20 04:24:09 +0000 | [diff] [blame] | 83 | \code{None} for the unmarshallable type.} | 
| Guido van Rossum | 5fdeeea | 1994-01-02 01:22:07 +0000 | [diff] [blame] | 84 | \end{funcdesc} | 
|  | 85 |  | 
|  | 86 | \begin{funcdesc}{dumps}{value} | 
|  | 87 | Return the string that would be written to a file by | 
| Fred Drake | 7506298 | 1998-02-16 20:40:37 +0000 | [diff] [blame] | 88 | \code{dump(\var{value}, \var{file})}.  The value must be a supported | 
|  | 89 | type.  Raise a \exception{ValueError} exception if value has (or | 
|  | 90 | contains an object that has) an unsupported type. | 
| Guido van Rossum | 5fdeeea | 1994-01-02 01:22:07 +0000 | [diff] [blame] | 91 | \end{funcdesc} | 
|  | 92 |  | 
|  | 93 | \begin{funcdesc}{loads}{string} | 
|  | 94 | Convert the string to a value.  If no valid value is found, raise | 
| Fred Drake | 7506298 | 1998-02-16 20:40:37 +0000 | [diff] [blame] | 95 | \exception{EOFError}, \exception{ValueError} or | 
|  | 96 | \exception{TypeError}.  Extra characters in the string are ignored. | 
| Guido van Rossum | 5fdeeea | 1994-01-02 01:22:07 +0000 | [diff] [blame] | 97 | \end{funcdesc} |