Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 1 | .. highlightlang:: c |
| 2 | |
| 3 | .. _marshalling-utils: |
| 4 | |
| 5 | Data marshalling support |
| 6 | ======================== |
| 7 | |
Jeroen Ruigrok van der Werven | 47a7d70 | 2009-04-27 05:43:17 +0000 | [diff] [blame] | 8 | These routines allow C code to work with serialized objects using the same |
| 9 | data format as the :mod:`marshal` module. There are functions to write data |
| 10 | into the serialization format, and additional functions that can be used to |
| 11 | read the data back. Files used to store marshalled data must be opened in |
| 12 | binary mode. |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 13 | |
| 14 | Numeric values are stored with the least significant byte first. |
| 15 | |
Jeroen Ruigrok van der Werven | 47a7d70 | 2009-04-27 05:43:17 +0000 | [diff] [blame] | 16 | The module supports two versions of the data format: version 0 is the |
| 17 | historical version, version 1 shares interned strings in the file, and upon |
| 18 | unmarshalling. Version 2 uses a binary format for floating point numbers. |
Georg Brandl | e6bcc91 | 2008-05-12 18:05:20 +0000 | [diff] [blame] | 19 | *Py_MARSHAL_VERSION* indicates the current file format (currently 2). |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 20 | |
| 21 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 22 | .. c:function:: void PyMarshal_WriteLongToFile(long value, FILE *file, int version) |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 23 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 24 | Marshal a :c:type:`long` integer, *value*, to *file*. This will only write |
Jeroen Ruigrok van der Werven | 47a7d70 | 2009-04-27 05:43:17 +0000 | [diff] [blame] | 25 | the least-significant 32 bits of *value*; regardless of the size of the |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 26 | native :c:type:`long` type. *version* indicates the file format. |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 27 | |
| 28 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 29 | .. c:function:: void PyMarshal_WriteObjectToFile(PyObject *value, FILE *file, int version) |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 30 | |
| 31 | Marshal a Python object, *value*, to *file*. |
| 32 | *version* indicates the file format. |
| 33 | |
| 34 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 35 | .. c:function:: PyObject* PyMarshal_WriteObjectToString(PyObject *value, int version) |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 36 | |
| 37 | Return a string object containing the marshalled representation of *value*. |
| 38 | *version* indicates the file format. |
| 39 | |
| 40 | |
| 41 | The following functions allow marshalled values to be read back in. |
| 42 | |
| 43 | XXX What about error detection? It appears that reading past the end of the |
Jeroen Ruigrok van der Werven | 47a7d70 | 2009-04-27 05:43:17 +0000 | [diff] [blame] | 44 | file will always result in a negative numeric value (where that's relevant), |
| 45 | but it's not clear that negative values won't be handled properly when there's |
| 46 | no error. What's the right way to tell? Should only non-negative values be |
| 47 | written using these routines? |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 48 | |
| 49 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 50 | .. c:function:: long PyMarshal_ReadLongFromFile(FILE *file) |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 51 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 52 | Return a C :c:type:`long` from the data stream in a :c:type:`FILE\*` opened |
Jeroen Ruigrok van der Werven | 47a7d70 | 2009-04-27 05:43:17 +0000 | [diff] [blame] | 53 | for reading. Only a 32-bit value can be read in using this function, |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 54 | regardless of the native size of :c:type:`long`. |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 55 | |
Victor Stinner | 6a318d4 | 2015-03-18 13:58:49 +0100 | [diff] [blame^] | 56 | On error, raise an exception and return ``-1``. |
| 57 | |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 58 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 59 | .. c:function:: int PyMarshal_ReadShortFromFile(FILE *file) |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 60 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 61 | Return a C :c:type:`short` from the data stream in a :c:type:`FILE\*` opened |
Jeroen Ruigrok van der Werven | 47a7d70 | 2009-04-27 05:43:17 +0000 | [diff] [blame] | 62 | for reading. Only a 16-bit value can be read in using this function, |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 63 | regardless of the native size of :c:type:`short`. |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 64 | |
Victor Stinner | 6a318d4 | 2015-03-18 13:58:49 +0100 | [diff] [blame^] | 65 | On error, raise an exception and return ``-1``. |
| 66 | |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 67 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 68 | .. c:function:: PyObject* PyMarshal_ReadObjectFromFile(FILE *file) |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 69 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 70 | Return a Python object from the data stream in a :c:type:`FILE\*` opened for |
Victor Stinner | 6a318d4 | 2015-03-18 13:58:49 +0100 | [diff] [blame^] | 71 | reading. |
| 72 | |
| 73 | On error, sets the appropriate exception (:exc:`EOFError` or |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 74 | :exc:`TypeError`) and returns *NULL*. |
| 75 | |
| 76 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 77 | .. c:function:: PyObject* PyMarshal_ReadLastObjectFromFile(FILE *file) |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 78 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 79 | Return a Python object from the data stream in a :c:type:`FILE\*` opened for |
| 80 | reading. Unlike :c:func:`PyMarshal_ReadObjectFromFile`, this function |
Jeroen Ruigrok van der Werven | 47a7d70 | 2009-04-27 05:43:17 +0000 | [diff] [blame] | 81 | assumes that no further objects will be read from the file, allowing it to |
| 82 | aggressively load file data into memory so that the de-serialization can |
| 83 | operate from data in memory rather than reading a byte at a time from the |
| 84 | file. Only use these variant if you are certain that you won't be reading |
Victor Stinner | 6a318d4 | 2015-03-18 13:58:49 +0100 | [diff] [blame^] | 85 | anything else from the file. |
| 86 | |
| 87 | On error, sets the appropriate exception (:exc:`EOFError` or |
| 88 | :exc:`TypeError`) and returns *NULL*. |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 89 | |
| 90 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 91 | .. c:function:: PyObject* PyMarshal_ReadObjectFromString(char *string, Py_ssize_t len) |
Georg Brandl | 54a3faa | 2008-01-20 09:30:57 +0000 | [diff] [blame] | 92 | |
Jeroen Ruigrok van der Werven | 47a7d70 | 2009-04-27 05:43:17 +0000 | [diff] [blame] | 93 | Return a Python object from the data stream in a character buffer |
Victor Stinner | 6a318d4 | 2015-03-18 13:58:49 +0100 | [diff] [blame^] | 94 | containing *len* bytes pointed to by *string*. |
| 95 | |
| 96 | On error, sets the appropriate exception (:exc:`EOFError` or |
| 97 | :exc:`TypeError`) and returns *NULL*. |
Jeroen Ruigrok van der Werven | 47a7d70 | 2009-04-27 05:43:17 +0000 | [diff] [blame] | 98 | |