Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 1 | |
| 2 | :mod:`mmap` --- Memory-mapped file support |
| 3 | ========================================== |
| 4 | |
| 5 | .. module:: mmap |
| 6 | :synopsis: Interface to memory-mapped files for Unix and Windows. |
| 7 | |
| 8 | |
| 9 | Memory-mapped file objects behave like both strings and like file objects. |
| 10 | Unlike normal string objects, however, these are mutable. You can use mmap |
| 11 | objects in most places where strings are expected; for example, you can use the |
| 12 | :mod:`re` module to search through a memory-mapped file. Since they're mutable, |
| 13 | you can change a single character by doing ``obj[index] = 'a'``, or change a |
| 14 | substring by assigning to a slice: ``obj[i1:i2] = '...'``. You can also read |
| 15 | and write data starting at the current file position, and :meth:`seek` through |
| 16 | the file to different positions. |
| 17 | |
| 18 | A memory-mapped file is created by the :func:`mmap` function, which is different |
| 19 | on Unix and on Windows. In either case you must provide a file descriptor for a |
| 20 | file opened for update. If you wish to map an existing Python file object, use |
| 21 | its :meth:`fileno` method to obtain the correct value for the *fileno* |
| 22 | parameter. Otherwise, you can open the file using the :func:`os.open` function, |
| 23 | which returns a file descriptor directly (the file still needs to be closed when |
| 24 | done). |
| 25 | |
| 26 | For both the Unix and Windows versions of the function, *access* may be |
| 27 | specified as an optional keyword parameter. *access* accepts one of three |
| 28 | values: :const:`ACCESS_READ`, :const:`ACCESS_WRITE`, or :const:`ACCESS_COPY` to |
| 29 | specify readonly, write-through or copy-on-write memory respectively. *access* |
| 30 | can be used on both Unix and Windows. If *access* is not specified, Windows |
| 31 | mmap returns a write-through mapping. The initial memory values for all three |
| 32 | access types are taken from the specified file. Assignment to an |
| 33 | :const:`ACCESS_READ` memory map raises a :exc:`TypeError` exception. Assignment |
| 34 | to an :const:`ACCESS_WRITE` memory map affects both memory and the underlying |
| 35 | file. Assignment to an :const:`ACCESS_COPY` memory map affects memory but does |
| 36 | not update the underlying file. |
| 37 | |
| 38 | .. versionchanged:: 2.5 |
| 39 | To map anonymous memory, -1 should be passed as the fileno along with the |
| 40 | length. |
| 41 | |
| 42 | |
Travis E. Oliphant | 8feafab | 2007-10-23 02:40:56 +0000 | [diff] [blame] | 43 | .. function:: mmap(fileno, length[, tagname[, access[, offset]]]) |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 44 | |
| 45 | **(Windows version)** Maps *length* bytes from the file specified by the file |
| 46 | handle *fileno*, and returns a mmap object. If *length* is larger than the |
| 47 | current size of the file, the file is extended to contain *length* bytes. If |
| 48 | *length* is ``0``, the maximum length of the map is the current size of the |
| 49 | file, except that if the file is empty Windows raises an exception (you cannot |
| 50 | create an empty mapping on Windows). |
| 51 | |
| 52 | *tagname*, if specified and not ``None``, is a string giving a tag name for the |
| 53 | mapping. Windows allows you to have many different mappings against the same |
| 54 | file. If you specify the name of an existing tag, that tag is opened, otherwise |
| 55 | a new tag of this name is created. If this parameter is omitted or ``None``, |
| 56 | the mapping is created without a name. Avoiding the use of the tag parameter |
| 57 | will assist in keeping your code portable between Unix and Windows. |
| 58 | |
Travis E. Oliphant | 8feafab | 2007-10-23 02:40:56 +0000 | [diff] [blame] | 59 | *offset* may be specified as a non-negative integer offset. mmap references will |
| 60 | be relative to the offset from the beginning of the file. *offset* defaults to 0. |
| 61 | *offset* must be a multiple of the ALLOCATIONGRANULARITY. |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 62 | |
Travis E. Oliphant | 8feafab | 2007-10-23 02:40:56 +0000 | [diff] [blame] | 63 | |
| 64 | .. function:: mmap(fileno, length[, flags[, prot[, access[, offset]]]]) |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 65 | :noindex: |
| 66 | |
| 67 | **(Unix version)** Maps *length* bytes from the file specified by the file |
| 68 | descriptor *fileno*, and returns a mmap object. If *length* is ``0``, the |
| 69 | maximum length of the map will be the current size of the file when :func:`mmap` |
| 70 | is called. |
| 71 | |
| 72 | *flags* specifies the nature of the mapping. :const:`MAP_PRIVATE` creates a |
| 73 | private copy-on-write mapping, so changes to the contents of the mmap object |
| 74 | will be private to this process, and :const:`MAP_SHARED` creates a mapping |
| 75 | that's shared with all other processes mapping the same areas of the file. The |
| 76 | default value is :const:`MAP_SHARED`. |
| 77 | |
| 78 | *prot*, if specified, gives the desired memory protection; the two most useful |
| 79 | values are :const:`PROT_READ` and :const:`PROT_WRITE`, to specify that the pages |
| 80 | may be read or written. *prot* defaults to :const:`PROT_READ \| PROT_WRITE`. |
| 81 | |
| 82 | *access* may be specified in lieu of *flags* and *prot* as an optional keyword |
| 83 | parameter. It is an error to specify both *flags*, *prot* and *access*. See |
| 84 | the description of *access* above for information on how to use this parameter. |
| 85 | |
Travis E. Oliphant | 8feafab | 2007-10-23 02:40:56 +0000 | [diff] [blame] | 86 | *offset* may be specified as a non-negative integer offset. mmap references will |
| 87 | be relative to the offset from the beginning of the file. *offset* defaults to 0. |
| 88 | *offset* must be a multiple of the PAGESIZE or ALLOCATIONGRANULARITY. |
Georg Brandl | fefcd4e | 2007-12-02 14:34:34 +0000 | [diff] [blame] | 89 | |
| 90 | This example shows a simple way of using :func:`mmap`:: |
| 91 | |
| 92 | import mmap |
| 93 | |
| 94 | # write a simple example file |
| 95 | with open("hello.txt", "w") as f: |
| 96 | f.write("Hello Python!\n") |
| 97 | |
| 98 | with open("hello.txt", "r+") as f: |
| 99 | # memory-map the file, size 0 means whole file |
| 100 | map = mmap.mmap(f.fileno(), 0) |
| 101 | # read content via standard file methods |
| 102 | print map.readline() # prints "Hello Python!" |
| 103 | # read content via slice notation |
| 104 | print map[:5] # prints "Hello" |
| 105 | # update content using slice notation; |
| 106 | # note that new content must have same size |
| 107 | map[6:] = " world!\n" |
| 108 | # ... and read again using standard file methods |
| 109 | map.seek(0) |
| 110 | print map.readline() # prints "Hello world!" |
| 111 | # close the map |
| 112 | map.close() |
| 113 | |
| 114 | |
| 115 | The next example demonstrates how to create an anonymous map and exchange |
| 116 | data between the parent and child processes:: |
| 117 | |
| 118 | import mmap |
| 119 | import os |
| 120 | |
| 121 | map = mmap.mmap(-1, 13) |
| 122 | map.write("Hello world!") |
| 123 | |
| 124 | pid = os.fork() |
| 125 | |
| 126 | if pid == 0: # In a child process |
| 127 | map.seek(0) |
| 128 | print map.readline() |
| 129 | |
| 130 | map.close() |
| 131 | |
Travis E. Oliphant | 8feafab | 2007-10-23 02:40:56 +0000 | [diff] [blame] | 132 | |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 133 | Memory-mapped file objects support the following methods: |
| 134 | |
| 135 | |
| 136 | .. method:: mmap.close() |
| 137 | |
| 138 | Close the file. Subsequent calls to other methods of the object will result in |
| 139 | an exception being raised. |
| 140 | |
| 141 | |
Andrew M. Kuchling | 5c60bfc | 2008-01-19 18:18:41 +0000 | [diff] [blame^] | 142 | .. method:: mmap.find(string[, start[, end]]) |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 143 | |
Andrew M. Kuchling | 5c60bfc | 2008-01-19 18:18:41 +0000 | [diff] [blame^] | 144 | Returns the lowest index in the object where the substring *string* is found, |
| 145 | such that *string* is contained in the range [*start*, *end*]. Optional |
| 146 | arguments *start* and *end* are interpreted as in slice notation. |
| 147 | Returns ``-1`` on failure. |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 148 | |
| 149 | |
| 150 | .. method:: mmap.flush([offset, size]) |
| 151 | |
| 152 | Flushes changes made to the in-memory copy of a file back to disk. Without use |
| 153 | of this call there is no guarantee that changes are written back before the |
| 154 | object is destroyed. If *offset* and *size* are specified, only changes to the |
| 155 | given range of bytes will be flushed to disk; otherwise, the whole extent of the |
| 156 | mapping is flushed. |
| 157 | |
| 158 | |
| 159 | .. method:: mmap.move(dest, src, count) |
| 160 | |
| 161 | Copy the *count* bytes starting at offset *src* to the destination index *dest*. |
| 162 | If the mmap was created with :const:`ACCESS_READ`, then calls to move will throw |
| 163 | a :exc:`TypeError` exception. |
| 164 | |
| 165 | |
| 166 | .. method:: mmap.read(num) |
| 167 | |
| 168 | Return a string containing up to *num* bytes starting from the current file |
| 169 | position; the file position is updated to point after the bytes that were |
| 170 | returned. |
| 171 | |
| 172 | |
| 173 | .. method:: mmap.read_byte() |
| 174 | |
| 175 | Returns a string of length 1 containing the character at the current file |
| 176 | position, and advances the file position by 1. |
| 177 | |
| 178 | |
| 179 | .. method:: mmap.readline() |
| 180 | |
| 181 | Returns a single line, starting at the current file position and up to the next |
| 182 | newline. |
| 183 | |
| 184 | |
| 185 | .. method:: mmap.resize(newsize) |
| 186 | |
| 187 | Resizes the map and the underlying file, if any. If the mmap was created with |
| 188 | :const:`ACCESS_READ` or :const:`ACCESS_COPY`, resizing the map will throw a |
| 189 | :exc:`TypeError` exception. |
| 190 | |
| 191 | |
Andrew M. Kuchling | 5c60bfc | 2008-01-19 18:18:41 +0000 | [diff] [blame^] | 192 | .. method:: mmap.rfind(string[, start[, end]]) |
| 193 | |
| 194 | Returns the highest index in the object where the substring *string* is |
| 195 | found, such that *string* is contained in the range [*start*, |
| 196 | *end*]. Optional arguments *start* and *end* are interpreted as in slice |
| 197 | notation. Returns ``-1`` on failure. |
| 198 | |
| 199 | |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 200 | .. method:: mmap.seek(pos[, whence]) |
| 201 | |
| 202 | Set the file's current position. *whence* argument is optional and defaults to |
| 203 | ``os.SEEK_SET`` or ``0`` (absolute file positioning); other values are |
| 204 | ``os.SEEK_CUR`` or ``1`` (seek relative to the current position) and |
| 205 | ``os.SEEK_END`` or ``2`` (seek relative to the file's end). |
| 206 | |
| 207 | |
| 208 | .. method:: mmap.size() |
| 209 | |
| 210 | Return the length of the file, which can be larger than the size of the |
| 211 | memory-mapped area. |
| 212 | |
| 213 | |
| 214 | .. method:: mmap.tell() |
| 215 | |
| 216 | Returns the current position of the file pointer. |
| 217 | |
| 218 | |
| 219 | .. method:: mmap.write(string) |
| 220 | |
| 221 | Write the bytes in *string* into memory at the current position of the file |
| 222 | pointer; the file position is updated to point after the bytes that were |
| 223 | written. If the mmap was created with :const:`ACCESS_READ`, then writing to it |
| 224 | will throw a :exc:`TypeError` exception. |
| 225 | |
| 226 | |
| 227 | .. method:: mmap.write_byte(byte) |
| 228 | |
| 229 | Write the single-character string *byte* into memory at the current position of |
| 230 | the file pointer; the file position is advanced by ``1``. If the mmap was |
| 231 | created with :const:`ACCESS_READ`, then writing to it will throw a |
| 232 | :exc:`TypeError` exception. |
| 233 | |
Travis E. Oliphant | 8feafab | 2007-10-23 02:40:56 +0000 | [diff] [blame] | 234 | |