Stéphane Wirtel | cbb6484 | 2019-05-17 11:55:34 +0200 | [diff] [blame] | 1 | .. highlight:: c |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 2 | |
| 3 | |
| 4 | .. _memory: |
| 5 | |
| 6 | ***************** |
| 7 | Memory Management |
| 8 | ***************** |
| 9 | |
| 10 | .. sectionauthor:: Vladimir Marangozov <Vladimir.Marangozov@inrialpes.fr> |
| 11 | |
| 12 | |
| 13 | |
| 14 | .. _memoryoverview: |
| 15 | |
| 16 | Overview |
| 17 | ======== |
| 18 | |
| 19 | Memory management in Python involves a private heap containing all Python |
| 20 | objects and data structures. The management of this private heap is ensured |
| 21 | internally by the *Python memory manager*. The Python memory manager has |
| 22 | different components which deal with various dynamic storage management aspects, |
| 23 | like sharing, segmentation, preallocation or caching. |
| 24 | |
| 25 | At the lowest level, a raw memory allocator ensures that there is enough room in |
| 26 | the private heap for storing all Python-related data by interacting with the |
| 27 | memory manager of the operating system. On top of the raw memory allocator, |
| 28 | several object-specific allocators operate on the same heap and implement |
| 29 | distinct memory management policies adapted to the peculiarities of every object |
| 30 | type. For example, integer objects are managed differently within the heap than |
| 31 | strings, tuples or dictionaries because integers imply different storage |
| 32 | requirements and speed/space tradeoffs. The Python memory manager thus delegates |
| 33 | some of the work to the object-specific allocators, but ensures that the latter |
| 34 | operate within the bounds of the private heap. |
| 35 | |
| 36 | It is important to understand that the management of the Python heap is |
| 37 | performed by the interpreter itself and that the user has no control over it, |
Andrés Delfino | 5092439 | 2018-06-18 01:34:30 -0300 | [diff] [blame] | 38 | even if they regularly manipulate object pointers to memory blocks inside that |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 39 | heap. The allocation of heap space for Python objects and other internal |
| 40 | buffers is performed on demand by the Python memory manager through the Python/C |
| 41 | API functions listed in this document. |
| 42 | |
| 43 | .. index:: |
| 44 | single: malloc() |
| 45 | single: calloc() |
| 46 | single: realloc() |
| 47 | single: free() |
| 48 | |
| 49 | To avoid memory corruption, extension writers should never try to operate on |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 50 | Python objects with the functions exported by the C library: :c:func:`malloc`, |
| 51 | :c:func:`calloc`, :c:func:`realloc` and :c:func:`free`. This will result in mixed |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 52 | calls between the C allocator and the Python memory manager with fatal |
| 53 | consequences, because they implement different algorithms and operate on |
| 54 | different heaps. However, one may safely allocate and release memory blocks |
| 55 | with the C library allocator for individual purposes, as shown in the following |
| 56 | example:: |
| 57 | |
| 58 | PyObject *res; |
| 59 | char *buf = (char *) malloc(BUFSIZ); /* for I/O */ |
| 60 | |
| 61 | if (buf == NULL) |
| 62 | return PyErr_NoMemory(); |
| 63 | ...Do some I/O operation involving buf... |
Gregory P. Smith | 4b52ae8 | 2013-03-22 13:43:30 -0700 | [diff] [blame] | 64 | res = PyBytes_FromString(buf); |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 65 | free(buf); /* malloc'ed */ |
| 66 | return res; |
| 67 | |
| 68 | In this example, the memory request for the I/O buffer is handled by the C |
| 69 | library allocator. The Python memory manager is involved only in the allocation |
Hai Shi | 39a5d17 | 2019-07-05 23:03:13 -0500 | [diff] [blame] | 70 | of the bytes object returned as a result. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 71 | |
| 72 | In most situations, however, it is recommended to allocate memory from the |
| 73 | Python heap specifically because the latter is under control of the Python |
| 74 | memory manager. For example, this is required when the interpreter is extended |
| 75 | with new object types written in C. Another reason for using the Python heap is |
| 76 | the desire to *inform* the Python memory manager about the memory needs of the |
| 77 | extension module. Even when the requested memory is used exclusively for |
| 78 | internal, highly-specific purposes, delegating all memory requests to the Python |
| 79 | memory manager causes the interpreter to have a more accurate image of its |
| 80 | memory footprint as a whole. Consequently, under certain circumstances, the |
| 81 | Python memory manager may or may not trigger appropriate actions, like garbage |
| 82 | collection, memory compaction or other preventive procedures. Note that by using |
| 83 | the C library allocator as shown in the previous example, the allocated memory |
| 84 | for the I/O buffer escapes completely the Python memory manager. |
| 85 | |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 86 | .. seealso:: |
| 87 | |
Victor Stinner | 34be807 | 2016-03-14 12:04:26 +0100 | [diff] [blame] | 88 | The :envvar:`PYTHONMALLOC` environment variable can be used to configure |
| 89 | the memory allocators used by Python. |
| 90 | |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 91 | The :envvar:`PYTHONMALLOCSTATS` environment variable can be used to print |
Victor Stinner | 34be807 | 2016-03-14 12:04:26 +0100 | [diff] [blame] | 92 | statistics of the :ref:`pymalloc memory allocator <pymalloc>` every time a |
| 93 | new pymalloc object arena is created, and on shutdown. |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 94 | |
Pablo Galindo | bc450f9 | 2021-01-18 22:20:57 +0000 | [diff] [blame] | 95 | Allocator Domains |
| 96 | ================= |
| 97 | |
| 98 | All allocating functions belong to one of three different "domains" (see also |
Pablo Galindo | b0478d7 | 2021-02-02 20:43:11 +0000 | [diff] [blame] | 99 | :c:type:`PyMemAllocatorDomain`). These domains represent different allocation |
Pablo Galindo | bc450f9 | 2021-01-18 22:20:57 +0000 | [diff] [blame] | 100 | strategies and are optimized for different purposes. The specific details on |
| 101 | how every domain allocates memory or what internal functions each domain calls |
| 102 | is considered an implementation detail, but for debugging purposes a simplified |
| 103 | table can be found at :ref:`here <default-memory-allocators>`. There is no hard |
| 104 | requirement to use the memory returned by the allocation functions belonging to |
| 105 | a given domain for only the purposes hinted by that domain (although this is the |
| 106 | recommended practice). For example, one could use the memory returned by |
| 107 | :c:func:`PyMem_RawMalloc` for allocating Python objects or the memory returned |
| 108 | by :c:func:`PyObject_Malloc` for allocating memory for buffers. |
| 109 | |
| 110 | The three allocation domains are: |
| 111 | |
| 112 | * Raw domain: intended for allocating memory for general-purpose memory |
| 113 | buffers where the allocation *must* go to the system allocator or where the |
| 114 | allocator can operate without the :term:`GIL`. The memory is requested directly |
| 115 | to the system. |
| 116 | |
| 117 | * "Mem" domain: intended for allocating memory for Python buffers and |
| 118 | general-purpose memory buffers where the allocation must be performed with |
| 119 | the :term:`GIL` held. The memory is taken from the Python private heap. |
| 120 | |
| 121 | * Object domain: intended for allocating memory belonging to Python objects. The |
| 122 | memory is taken from the Python private heap. |
| 123 | |
| 124 | When freeing memory previously allocated by the allocating functions belonging to a |
| 125 | given domain,the matching specific deallocating functions must be used. For example, |
| 126 | :c:func:`PyMem_Free` must be used to free memory allocated using :c:func:`PyMem_Malloc`. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 127 | |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 128 | Raw Memory Interface |
| 129 | ==================== |
| 130 | |
| 131 | The following function sets are wrappers to the system allocator. These |
| 132 | functions are thread-safe, the :term:`GIL <global interpreter lock>` does not |
| 133 | need to be held. |
| 134 | |
Victor Stinner | 5d39e04 | 2017-11-29 17:20:38 +0100 | [diff] [blame] | 135 | The :ref:`default raw memory allocator <default-memory-allocators>` uses |
| 136 | the following functions: :c:func:`malloc`, :c:func:`calloc`, :c:func:`realloc` |
| 137 | and :c:func:`free`; call ``malloc(1)`` (or ``calloc(1, 1)``) when requesting |
| 138 | zero bytes. |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 139 | |
| 140 | .. versionadded:: 3.4 |
| 141 | |
| 142 | .. c:function:: void* PyMem_RawMalloc(size_t n) |
| 143 | |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 144 | Allocates *n* bytes and returns a pointer of type :c:type:`void*` to the |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 145 | allocated memory, or ``NULL`` if the request fails. |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 146 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 147 | Requesting zero bytes returns a distinct non-``NULL`` pointer if possible, as |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 148 | if ``PyMem_RawMalloc(1)`` had been called instead. The memory will not have |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 149 | been initialized in any way. |
| 150 | |
| 151 | |
Victor Stinner | db067af | 2014-05-02 22:31:14 +0200 | [diff] [blame] | 152 | .. c:function:: void* PyMem_RawCalloc(size_t nelem, size_t elsize) |
| 153 | |
| 154 | Allocates *nelem* elements each whose size in bytes is *elsize* and returns |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 155 | a pointer of type :c:type:`void*` to the allocated memory, or ``NULL`` if the |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 156 | request fails. The memory is initialized to zeros. |
| 157 | |
| 158 | Requesting zero elements or elements of size zero bytes returns a distinct |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 159 | non-``NULL`` pointer if possible, as if ``PyMem_RawCalloc(1, 1)`` had been |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 160 | called instead. |
Victor Stinner | db067af | 2014-05-02 22:31:14 +0200 | [diff] [blame] | 161 | |
| 162 | .. versionadded:: 3.5 |
| 163 | |
| 164 | |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 165 | .. c:function:: void* PyMem_RawRealloc(void *p, size_t n) |
| 166 | |
| 167 | Resizes the memory block pointed to by *p* to *n* bytes. The contents will |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 168 | be unchanged to the minimum of the old and the new sizes. |
| 169 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 170 | If *p* is ``NULL``, the call is equivalent to ``PyMem_RawMalloc(n)``; else if |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 171 | *n* is equal to zero, the memory block is resized but is not freed, and the |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 172 | returned pointer is non-``NULL``. |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 173 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 174 | Unless *p* is ``NULL``, it must have been returned by a previous call to |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 175 | :c:func:`PyMem_RawMalloc`, :c:func:`PyMem_RawRealloc` or |
| 176 | :c:func:`PyMem_RawCalloc`. |
| 177 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 178 | If the request fails, :c:func:`PyMem_RawRealloc` returns ``NULL`` and *p* |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 179 | remains a valid pointer to the previous memory area. |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 180 | |
| 181 | |
| 182 | .. c:function:: void PyMem_RawFree(void *p) |
| 183 | |
| 184 | Frees the memory block pointed to by *p*, which must have been returned by a |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 185 | previous call to :c:func:`PyMem_RawMalloc`, :c:func:`PyMem_RawRealloc` or |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 186 | :c:func:`PyMem_RawCalloc`. Otherwise, or if ``PyMem_RawFree(p)`` has been |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 187 | called before, undefined behavior occurs. |
| 188 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 189 | If *p* is ``NULL``, no operation is performed. |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 190 | |
| 191 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 192 | .. _memoryinterface: |
| 193 | |
| 194 | Memory Interface |
| 195 | ================ |
| 196 | |
| 197 | The following function sets, modeled after the ANSI C standard, but specifying |
| 198 | behavior when requesting zero bytes, are available for allocating and releasing |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 199 | memory from the Python heap. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 200 | |
Victor Stinner | 5d39e04 | 2017-11-29 17:20:38 +0100 | [diff] [blame] | 201 | The :ref:`default memory allocator <default-memory-allocators>` uses the |
| 202 | :ref:`pymalloc memory allocator <pymalloc>`. |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 203 | |
| 204 | .. warning:: |
| 205 | |
| 206 | The :term:`GIL <global interpreter lock>` must be held when using these |
| 207 | functions. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 208 | |
Victor Stinner | f5c4b99 | 2016-04-22 16:26:23 +0200 | [diff] [blame] | 209 | .. versionchanged:: 3.6 |
| 210 | |
| 211 | The default allocator is now pymalloc instead of system :c:func:`malloc`. |
| 212 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 213 | .. c:function:: void* PyMem_Malloc(size_t n) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 214 | |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 215 | Allocates *n* bytes and returns a pointer of type :c:type:`void*` to the |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 216 | allocated memory, or ``NULL`` if the request fails. |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 217 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 218 | Requesting zero bytes returns a distinct non-``NULL`` pointer if possible, as |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 219 | if ``PyMem_Malloc(1)`` had been called instead. The memory will not have |
| 220 | been initialized in any way. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 221 | |
| 222 | |
Victor Stinner | db067af | 2014-05-02 22:31:14 +0200 | [diff] [blame] | 223 | .. c:function:: void* PyMem_Calloc(size_t nelem, size_t elsize) |
| 224 | |
| 225 | Allocates *nelem* elements each whose size in bytes is *elsize* and returns |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 226 | a pointer of type :c:type:`void*` to the allocated memory, or ``NULL`` if the |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 227 | request fails. The memory is initialized to zeros. |
| 228 | |
| 229 | Requesting zero elements or elements of size zero bytes returns a distinct |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 230 | non-``NULL`` pointer if possible, as if ``PyMem_Calloc(1, 1)`` had been called |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 231 | instead. |
Victor Stinner | db067af | 2014-05-02 22:31:14 +0200 | [diff] [blame] | 232 | |
| 233 | .. versionadded:: 3.5 |
| 234 | |
| 235 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 236 | .. c:function:: void* PyMem_Realloc(void *p, size_t n) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 237 | |
| 238 | Resizes the memory block pointed to by *p* to *n* bytes. The contents will be |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 239 | unchanged to the minimum of the old and the new sizes. |
| 240 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 241 | If *p* is ``NULL``, the call is equivalent to ``PyMem_Malloc(n)``; else if *n* |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 242 | is equal to zero, the memory block is resized but is not freed, and the |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 243 | returned pointer is non-``NULL``. |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 244 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 245 | Unless *p* is ``NULL``, it must have been returned by a previous call to |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 246 | :c:func:`PyMem_Malloc`, :c:func:`PyMem_Realloc` or :c:func:`PyMem_Calloc`. |
| 247 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 248 | If the request fails, :c:func:`PyMem_Realloc` returns ``NULL`` and *p* remains |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 249 | a valid pointer to the previous memory area. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 250 | |
| 251 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 252 | .. c:function:: void PyMem_Free(void *p) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 253 | |
| 254 | Frees the memory block pointed to by *p*, which must have been returned by a |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 255 | previous call to :c:func:`PyMem_Malloc`, :c:func:`PyMem_Realloc` or |
| 256 | :c:func:`PyMem_Calloc`. Otherwise, or if ``PyMem_Free(p)`` has been called |
| 257 | before, undefined behavior occurs. |
| 258 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 259 | If *p* is ``NULL``, no operation is performed. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 260 | |
| 261 | The following type-oriented macros are provided for convenience. Note that |
| 262 | *TYPE* refers to any C type. |
| 263 | |
| 264 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 265 | .. c:function:: TYPE* PyMem_New(TYPE, size_t n) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 266 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 267 | Same as :c:func:`PyMem_Malloc`, but allocates ``(n * sizeof(TYPE))`` bytes of |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 268 | memory. Returns a pointer cast to :c:type:`TYPE*`. The memory will not have |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 269 | been initialized in any way. |
| 270 | |
| 271 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 272 | .. c:function:: TYPE* PyMem_Resize(void *p, TYPE, size_t n) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 273 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 274 | Same as :c:func:`PyMem_Realloc`, but the memory block is resized to ``(n * |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 275 | sizeof(TYPE))`` bytes. Returns a pointer cast to :c:type:`TYPE*`. On return, |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 276 | *p* will be a pointer to the new memory area, or ``NULL`` in the event of |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 277 | failure. |
| 278 | |
| 279 | This is a C preprocessor macro; *p* is always reassigned. Save the original |
| 280 | value of *p* to avoid losing memory when handling errors. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 281 | |
| 282 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 283 | .. c:function:: void PyMem_Del(void *p) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 284 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 285 | Same as :c:func:`PyMem_Free`. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 286 | |
| 287 | In addition, the following macro sets are provided for calling the Python memory |
| 288 | allocator directly, without involving the C API functions listed above. However, |
| 289 | note that their use does not preserve binary compatibility across Python |
| 290 | versions and is therefore deprecated in extension modules. |
| 291 | |
Victor Stinner | 29bf27f | 2016-03-09 14:49:52 +0100 | [diff] [blame] | 292 | * ``PyMem_MALLOC(size)`` |
| 293 | * ``PyMem_NEW(type, size)`` |
| 294 | * ``PyMem_REALLOC(ptr, size)`` |
| 295 | * ``PyMem_RESIZE(ptr, type, size)`` |
| 296 | * ``PyMem_FREE(ptr)`` |
| 297 | * ``PyMem_DEL(ptr)`` |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 298 | |
| 299 | |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 300 | Object allocators |
| 301 | ================= |
| 302 | |
| 303 | The following function sets, modeled after the ANSI C standard, but specifying |
| 304 | behavior when requesting zero bytes, are available for allocating and releasing |
| 305 | memory from the Python heap. |
| 306 | |
Pablo Galindo | e485be5 | 2021-01-19 13:09:06 +0000 | [diff] [blame] | 307 | .. note:: |
| 308 | There is no guarantee that the memory returned by these allocators can be |
JT | e08c673 | 2021-04-30 15:35:07 -0700 | [diff] [blame] | 309 | successfully casted to a Python object when intercepting the allocating |
Pablo Galindo | e485be5 | 2021-01-19 13:09:06 +0000 | [diff] [blame] | 310 | functions in this domain by the methods described in |
| 311 | the :ref:`Customize Memory Allocators <customize-memory-allocators>` section. |
| 312 | |
Victor Stinner | 5d39e04 | 2017-11-29 17:20:38 +0100 | [diff] [blame] | 313 | The :ref:`default object allocator <default-memory-allocators>` uses the |
| 314 | :ref:`pymalloc memory allocator <pymalloc>`. |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 315 | |
| 316 | .. warning:: |
| 317 | |
| 318 | The :term:`GIL <global interpreter lock>` must be held when using these |
| 319 | functions. |
| 320 | |
| 321 | .. c:function:: void* PyObject_Malloc(size_t n) |
| 322 | |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 323 | Allocates *n* bytes and returns a pointer of type :c:type:`void*` to the |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 324 | allocated memory, or ``NULL`` if the request fails. |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 325 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 326 | Requesting zero bytes returns a distinct non-``NULL`` pointer if possible, as |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 327 | if ``PyObject_Malloc(1)`` had been called instead. The memory will not have |
| 328 | been initialized in any way. |
| 329 | |
| 330 | |
| 331 | .. c:function:: void* PyObject_Calloc(size_t nelem, size_t elsize) |
| 332 | |
| 333 | Allocates *nelem* elements each whose size in bytes is *elsize* and returns |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 334 | a pointer of type :c:type:`void*` to the allocated memory, or ``NULL`` if the |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 335 | request fails. The memory is initialized to zeros. |
| 336 | |
| 337 | Requesting zero elements or elements of size zero bytes returns a distinct |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 338 | non-``NULL`` pointer if possible, as if ``PyObject_Calloc(1, 1)`` had been called |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 339 | instead. |
| 340 | |
| 341 | .. versionadded:: 3.5 |
| 342 | |
| 343 | |
| 344 | .. c:function:: void* PyObject_Realloc(void *p, size_t n) |
| 345 | |
| 346 | Resizes the memory block pointed to by *p* to *n* bytes. The contents will be |
| 347 | unchanged to the minimum of the old and the new sizes. |
| 348 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 349 | If *p* is ``NULL``, the call is equivalent to ``PyObject_Malloc(n)``; else if *n* |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 350 | is equal to zero, the memory block is resized but is not freed, and the |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 351 | returned pointer is non-``NULL``. |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 352 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 353 | Unless *p* is ``NULL``, it must have been returned by a previous call to |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 354 | :c:func:`PyObject_Malloc`, :c:func:`PyObject_Realloc` or :c:func:`PyObject_Calloc`. |
| 355 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 356 | If the request fails, :c:func:`PyObject_Realloc` returns ``NULL`` and *p* remains |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 357 | a valid pointer to the previous memory area. |
| 358 | |
| 359 | |
| 360 | .. c:function:: void PyObject_Free(void *p) |
| 361 | |
| 362 | Frees the memory block pointed to by *p*, which must have been returned by a |
| 363 | previous call to :c:func:`PyObject_Malloc`, :c:func:`PyObject_Realloc` or |
| 364 | :c:func:`PyObject_Calloc`. Otherwise, or if ``PyObject_Free(p)`` has been called |
| 365 | before, undefined behavior occurs. |
| 366 | |
Serhiy Storchaka | 25fc088 | 2019-10-30 12:03:20 +0200 | [diff] [blame] | 367 | If *p* is ``NULL``, no operation is performed. |
Victor Stinner | ec2cbdd | 2017-10-31 09:37:25 -0700 | [diff] [blame] | 368 | |
| 369 | |
Victor Stinner | 5d39e04 | 2017-11-29 17:20:38 +0100 | [diff] [blame] | 370 | .. _default-memory-allocators: |
| 371 | |
| 372 | Default Memory Allocators |
| 373 | ========================= |
| 374 | |
| 375 | Default memory allocators: |
| 376 | |
| 377 | =============================== ==================== ================== ===================== ==================== |
| 378 | Configuration Name PyMem_RawMalloc PyMem_Malloc PyObject_Malloc |
| 379 | =============================== ==================== ================== ===================== ==================== |
| 380 | Release build ``"pymalloc"`` ``malloc`` ``pymalloc`` ``pymalloc`` |
| 381 | Debug build ``"pymalloc_debug"`` ``malloc`` + debug ``pymalloc`` + debug ``pymalloc`` + debug |
| 382 | Release build, without pymalloc ``"malloc"`` ``malloc`` ``malloc`` ``malloc`` |
Kevin Adler | a407004 | 2018-11-30 01:42:47 -0600 | [diff] [blame] | 383 | Debug build, without pymalloc ``"malloc_debug"`` ``malloc`` + debug ``malloc`` + debug ``malloc`` + debug |
Victor Stinner | 5d39e04 | 2017-11-29 17:20:38 +0100 | [diff] [blame] | 384 | =============================== ==================== ================== ===================== ==================== |
| 385 | |
| 386 | Legend: |
| 387 | |
Victor Stinner | a41782c | 2021-04-08 22:32:21 +0200 | [diff] [blame] | 388 | * Name: value for :envvar:`PYTHONMALLOC` environment variable. |
Victor Stinner | 5d39e04 | 2017-11-29 17:20:38 +0100 | [diff] [blame] | 389 | * ``malloc``: system allocators from the standard C library, C functions: |
Victor Stinner | a41782c | 2021-04-08 22:32:21 +0200 | [diff] [blame] | 390 | :c:func:`malloc`, :c:func:`calloc`, :c:func:`realloc` and :c:func:`free`. |
| 391 | * ``pymalloc``: :ref:`pymalloc memory allocator <pymalloc>`. |
Victor Stinner | 645ed62 | 2021-04-29 10:47:47 +0200 | [diff] [blame] | 392 | * "+ debug": with :ref:`debug hooks on the Python memory allocators |
| 393 | <pymem-debug-hooks>`. |
Victor Stinner | a41782c | 2021-04-08 22:32:21 +0200 | [diff] [blame] | 394 | * "Debug build": :ref:`Python build in debug mode <debug-build>`. |
Victor Stinner | 5d39e04 | 2017-11-29 17:20:38 +0100 | [diff] [blame] | 395 | |
Pablo Galindo | e485be5 | 2021-01-19 13:09:06 +0000 | [diff] [blame] | 396 | .. _customize-memory-allocators: |
Victor Stinner | 5d39e04 | 2017-11-29 17:20:38 +0100 | [diff] [blame] | 397 | |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 398 | Customize Memory Allocators |
| 399 | =========================== |
| 400 | |
| 401 | .. versionadded:: 3.4 |
| 402 | |
Victor Stinner | d8f0d92 | 2014-06-02 21:57:10 +0200 | [diff] [blame] | 403 | .. c:type:: PyMemAllocatorEx |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 404 | |
| 405 | Structure used to describe a memory block allocator. The structure has |
| 406 | four fields: |
| 407 | |
| 408 | +----------------------------------------------------------+---------------------------------------+ |
| 409 | | Field | Meaning | |
| 410 | +==========================================================+=======================================+ |
| 411 | | ``void *ctx`` | user context passed as first argument | |
| 412 | +----------------------------------------------------------+---------------------------------------+ |
| 413 | | ``void* malloc(void *ctx, size_t size)`` | allocate a memory block | |
| 414 | +----------------------------------------------------------+---------------------------------------+ |
Victor Stinner | db067af | 2014-05-02 22:31:14 +0200 | [diff] [blame] | 415 | | ``void* calloc(void *ctx, size_t nelem, size_t elsize)`` | allocate a memory block initialized | |
| 416 | | | with zeros | |
| 417 | +----------------------------------------------------------+---------------------------------------+ |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 418 | | ``void* realloc(void *ctx, void *ptr, size_t new_size)`` | allocate or resize a memory block | |
| 419 | +----------------------------------------------------------+---------------------------------------+ |
| 420 | | ``void free(void *ctx, void *ptr)`` | free a memory block | |
| 421 | +----------------------------------------------------------+---------------------------------------+ |
| 422 | |
Victor Stinner | db067af | 2014-05-02 22:31:14 +0200 | [diff] [blame] | 423 | .. versionchanged:: 3.5 |
Victor Stinner | d8f0d92 | 2014-06-02 21:57:10 +0200 | [diff] [blame] | 424 | The :c:type:`PyMemAllocator` structure was renamed to |
| 425 | :c:type:`PyMemAllocatorEx` and a new ``calloc`` field was added. |
| 426 | |
Victor Stinner | db067af | 2014-05-02 22:31:14 +0200 | [diff] [blame] | 427 | |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 428 | .. c:type:: PyMemAllocatorDomain |
| 429 | |
| 430 | Enum used to identify an allocator domain. Domains: |
| 431 | |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 432 | .. c:macro:: PYMEM_DOMAIN_RAW |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 433 | |
Victor Stinner | f5c4b99 | 2016-04-22 16:26:23 +0200 | [diff] [blame] | 434 | Functions: |
| 435 | |
| 436 | * :c:func:`PyMem_RawMalloc` |
| 437 | * :c:func:`PyMem_RawRealloc` |
| 438 | * :c:func:`PyMem_RawCalloc` |
| 439 | * :c:func:`PyMem_RawFree` |
| 440 | |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 441 | .. c:macro:: PYMEM_DOMAIN_MEM |
Victor Stinner | f5c4b99 | 2016-04-22 16:26:23 +0200 | [diff] [blame] | 442 | |
| 443 | Functions: |
| 444 | |
| 445 | * :c:func:`PyMem_Malloc`, |
| 446 | * :c:func:`PyMem_Realloc` |
| 447 | * :c:func:`PyMem_Calloc` |
| 448 | * :c:func:`PyMem_Free` |
| 449 | |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 450 | .. c:macro:: PYMEM_DOMAIN_OBJ |
Victor Stinner | f5c4b99 | 2016-04-22 16:26:23 +0200 | [diff] [blame] | 451 | |
| 452 | Functions: |
| 453 | |
| 454 | * :c:func:`PyObject_Malloc` |
| 455 | * :c:func:`PyObject_Realloc` |
| 456 | * :c:func:`PyObject_Calloc` |
| 457 | * :c:func:`PyObject_Free` |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 458 | |
Victor Stinner | d8f0d92 | 2014-06-02 21:57:10 +0200 | [diff] [blame] | 459 | .. c:function:: void PyMem_GetAllocator(PyMemAllocatorDomain domain, PyMemAllocatorEx *allocator) |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 460 | |
| 461 | Get the memory block allocator of the specified domain. |
| 462 | |
| 463 | |
Victor Stinner | d8f0d92 | 2014-06-02 21:57:10 +0200 | [diff] [blame] | 464 | .. c:function:: void PyMem_SetAllocator(PyMemAllocatorDomain domain, PyMemAllocatorEx *allocator) |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 465 | |
| 466 | Set the memory block allocator of the specified domain. |
| 467 | |
Serhiy Storchaka | e835b31 | 2019-10-30 21:37:16 +0200 | [diff] [blame] | 468 | The new allocator must return a distinct non-``NULL`` pointer when requesting |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 469 | zero bytes. |
| 470 | |
| 471 | For the :c:data:`PYMEM_DOMAIN_RAW` domain, the allocator must be |
| 472 | thread-safe: the :term:`GIL <global interpreter lock>` is not held when the |
| 473 | allocator is called. |
| 474 | |
| 475 | If the new allocator is not a hook (does not call the previous allocator), |
| 476 | the :c:func:`PyMem_SetupDebugHooks` function must be called to reinstall the |
| 477 | debug hooks on top on the new allocator. |
| 478 | |
| 479 | |
| 480 | .. c:function:: void PyMem_SetupDebugHooks(void) |
| 481 | |
Victor Stinner | 645ed62 | 2021-04-29 10:47:47 +0200 | [diff] [blame] | 482 | Setup :ref:`debug hooks in the Python memory allocators <pymem-debug-hooks>` |
| 483 | to detect memory errors. |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 484 | |
Victor Stinner | f5c4b99 | 2016-04-22 16:26:23 +0200 | [diff] [blame] | 485 | |
Victor Stinner | 645ed62 | 2021-04-29 10:47:47 +0200 | [diff] [blame] | 486 | .. _pymem-debug-hooks: |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 487 | |
Victor Stinner | 645ed62 | 2021-04-29 10:47:47 +0200 | [diff] [blame] | 488 | Debug hooks on the Python memory allocators |
| 489 | =========================================== |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 490 | |
Miss Islington (bot) | bea618d | 2021-07-03 11:15:49 -0700 | [diff] [blame] | 491 | When :ref:`Python is built in debug mode <debug-build>`, the |
Victor Stinner | 645ed62 | 2021-04-29 10:47:47 +0200 | [diff] [blame] | 492 | :c:func:`PyMem_SetupDebugHooks` function is called at the :ref:`Python |
| 493 | preinitialization <c-preinit>` to setup debug hooks on Python memory allocators |
| 494 | to detect memory errors. |
Victor Stinner | 0611c26 | 2016-03-15 22:22:13 +0100 | [diff] [blame] | 495 | |
Victor Stinner | 645ed62 | 2021-04-29 10:47:47 +0200 | [diff] [blame] | 496 | The :envvar:`PYTHONMALLOC` environment variable can be used to install debug |
| 497 | hooks on a Python compiled in release mode (ex: ``PYTHONMALLOC=debug``). |
Victor Stinner | 34be807 | 2016-03-14 12:04:26 +0100 | [diff] [blame] | 498 | |
Victor Stinner | 645ed62 | 2021-04-29 10:47:47 +0200 | [diff] [blame] | 499 | The :c:func:`PyMem_SetupDebugHooks` function can be used to set debug hooks |
| 500 | after calling :c:func:`PyMem_SetAllocator`. |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 501 | |
Victor Stinner | 645ed62 | 2021-04-29 10:47:47 +0200 | [diff] [blame] | 502 | These debug hooks fill dynamically allocated memory blocks with special, |
| 503 | recognizable bit patterns. Newly allocated memory is filled with the byte |
| 504 | ``0xCD`` (``PYMEM_CLEANBYTE``), freed memory is filled with the byte ``0xDD`` |
| 505 | (``PYMEM_DEADBYTE``). Memory blocks are surrounded by "forbidden bytes" |
| 506 | filled with the byte ``0xFD`` (``PYMEM_FORBIDDENBYTE``). Strings of these bytes |
| 507 | are unlikely to be valid addresses, floats, or ASCII strings. |
| 508 | |
| 509 | Runtime checks: |
| 510 | |
| 511 | - Detect API violations. For example, detect if :c:func:`PyObject_Free` is |
| 512 | called on a memory block allocated by :c:func:`PyMem_Malloc`. |
| 513 | - Detect write before the start of the buffer (buffer underflow). |
| 514 | - Detect write after the end of the buffer (buffer overflow). |
| 515 | - Check that the :term:`GIL <global interpreter lock>` is held when |
| 516 | allocator functions of :c:data:`PYMEM_DOMAIN_OBJ` (ex: |
| 517 | :c:func:`PyObject_Malloc`) and :c:data:`PYMEM_DOMAIN_MEM` (ex: |
| 518 | :c:func:`PyMem_Malloc`) domains are called. |
| 519 | |
| 520 | On error, the debug hooks use the :mod:`tracemalloc` module to get the |
| 521 | traceback where a memory block was allocated. The traceback is only displayed |
| 522 | if :mod:`tracemalloc` is tracing Python memory allocations and the memory block |
| 523 | was traced. |
| 524 | |
| 525 | Let *S* = ``sizeof(size_t)``. ``2*S`` bytes are added at each end of each block |
| 526 | of *N* bytes requested. The memory layout is like so, where p represents the |
| 527 | address returned by a malloc-like or realloc-like function (``p[i:j]`` means |
| 528 | the slice of bytes from ``*(p+i)`` inclusive up to ``*(p+j)`` exclusive; note |
| 529 | that the treatment of negative indices differs from a Python slice): |
| 530 | |
| 531 | ``p[-2*S:-S]`` |
| 532 | Number of bytes originally asked for. This is a size_t, big-endian (easier |
| 533 | to read in a memory dump). |
| 534 | ``p[-S]`` |
| 535 | API identifier (ASCII character): |
| 536 | |
| 537 | * ``'r'`` for :c:data:`PYMEM_DOMAIN_RAW`. |
| 538 | * ``'m'`` for :c:data:`PYMEM_DOMAIN_MEM`. |
| 539 | * ``'o'`` for :c:data:`PYMEM_DOMAIN_OBJ`. |
| 540 | |
| 541 | ``p[-S+1:0]`` |
| 542 | Copies of PYMEM_FORBIDDENBYTE. Used to catch under- writes and reads. |
| 543 | |
| 544 | ``p[0:N]`` |
| 545 | The requested memory, filled with copies of PYMEM_CLEANBYTE, used to catch |
| 546 | reference to uninitialized memory. When a realloc-like function is called |
| 547 | requesting a larger memory block, the new excess bytes are also filled with |
| 548 | PYMEM_CLEANBYTE. When a free-like function is called, these are |
| 549 | overwritten with PYMEM_DEADBYTE, to catch reference to freed memory. When |
| 550 | a realloc- like function is called requesting a smaller memory block, the |
| 551 | excess old bytes are also filled with PYMEM_DEADBYTE. |
| 552 | |
| 553 | ``p[N:N+S]`` |
| 554 | Copies of PYMEM_FORBIDDENBYTE. Used to catch over- writes and reads. |
| 555 | |
| 556 | ``p[N+S:N+2*S]`` |
| 557 | Only used if the ``PYMEM_DEBUG_SERIALNO`` macro is defined (not defined by |
| 558 | default). |
| 559 | |
| 560 | A serial number, incremented by 1 on each call to a malloc-like or |
| 561 | realloc-like function. Big-endian ``size_t``. If "bad memory" is detected |
| 562 | later, the serial number gives an excellent way to set a breakpoint on the |
| 563 | next run, to capture the instant at which this block was passed out. The |
| 564 | static function bumpserialno() in obmalloc.c is the only place the serial |
| 565 | number is incremented, and exists so you can set such a breakpoint easily. |
| 566 | |
| 567 | A realloc-like or free-like function first checks that the PYMEM_FORBIDDENBYTE |
| 568 | bytes at each end are intact. If they've been altered, diagnostic output is |
| 569 | written to stderr, and the program is aborted via Py_FatalError(). The other |
| 570 | main failure mode is provoking a memory error when a program reads up one of |
| 571 | the special bit patterns and tries to use it as an address. If you get in a |
| 572 | debugger then and look at the object, you're likely to see that it's entirely |
| 573 | filled with PYMEM_DEADBYTE (meaning freed memory is getting used) or |
| 574 | PYMEM_CLEANBYTE (meaning uninitialized memory is getting used). |
| 575 | |
| 576 | .. versionchanged:: 3.6 |
| 577 | The :c:func:`PyMem_SetupDebugHooks` function now also works on Python |
| 578 | compiled in release mode. On error, the debug hooks now use |
| 579 | :mod:`tracemalloc` to get the traceback where a memory block was allocated. |
| 580 | The debug hooks now also check if the GIL is held when functions of |
| 581 | :c:data:`PYMEM_DOMAIN_OBJ` and :c:data:`PYMEM_DOMAIN_MEM` domains are |
| 582 | called. |
| 583 | |
| 584 | .. versionchanged:: 3.8 |
| 585 | Byte patterns ``0xCB`` (``PYMEM_CLEANBYTE``), ``0xDB`` (``PYMEM_DEADBYTE``) |
| 586 | and ``0xFB`` (``PYMEM_FORBIDDENBYTE``) have been replaced with ``0xCD``, |
| 587 | ``0xDD`` and ``0xFD`` to use the same values than Windows CRT debug |
| 588 | ``malloc()`` and ``free()``. |
Victor Stinner | 4c409be | 2019-04-11 13:01:15 +0200 | [diff] [blame] | 589 | |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 590 | |
Victor Stinner | 34be807 | 2016-03-14 12:04:26 +0100 | [diff] [blame] | 591 | .. _pymalloc: |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 592 | |
Victor Stinner | 34be807 | 2016-03-14 12:04:26 +0100 | [diff] [blame] | 593 | The pymalloc allocator |
| 594 | ====================== |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 595 | |
Victor Stinner | 34be807 | 2016-03-14 12:04:26 +0100 | [diff] [blame] | 596 | Python has a *pymalloc* allocator optimized for small objects (smaller or equal |
| 597 | to 512 bytes) with a short lifetime. It uses memory mappings called "arenas" |
Victor Stinner | 8c663fd | 2017-11-08 14:44:44 -0800 | [diff] [blame] | 598 | with a fixed size of 256 KiB. It falls back to :c:func:`PyMem_RawMalloc` and |
Victor Stinner | 34be807 | 2016-03-14 12:04:26 +0100 | [diff] [blame] | 599 | :c:func:`PyMem_RawRealloc` for allocations larger than 512 bytes. |
| 600 | |
Victor Stinner | 5d39e04 | 2017-11-29 17:20:38 +0100 | [diff] [blame] | 601 | *pymalloc* is the :ref:`default allocator <default-memory-allocators>` of the |
| 602 | :c:data:`PYMEM_DOMAIN_MEM` (ex: :c:func:`PyMem_Malloc`) and |
| 603 | :c:data:`PYMEM_DOMAIN_OBJ` (ex: :c:func:`PyObject_Malloc`) domains. |
Victor Stinner | 34be807 | 2016-03-14 12:04:26 +0100 | [diff] [blame] | 604 | |
| 605 | The arena allocator uses the following functions: |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 606 | |
| 607 | * :c:func:`VirtualAlloc` and :c:func:`VirtualFree` on Windows, |
| 608 | * :c:func:`mmap` and :c:func:`munmap` if available, |
| 609 | * :c:func:`malloc` and :c:func:`free` otherwise. |
| 610 | |
Victor Stinner | 645ed62 | 2021-04-29 10:47:47 +0200 | [diff] [blame] | 611 | This allocator is disabled if Python is configured with the |
| 612 | :option:`--without-pymalloc` option. It can also be disabled at runtime using |
| 613 | the :envvar:`PYTHONMALLOC` environment variable (ex: ``PYTHONMALLOC=malloc``). |
| 614 | |
Victor Stinner | 34be807 | 2016-03-14 12:04:26 +0100 | [diff] [blame] | 615 | Customize pymalloc Arena Allocator |
| 616 | ---------------------------------- |
| 617 | |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 618 | .. versionadded:: 3.4 |
| 619 | |
| 620 | .. c:type:: PyObjectArenaAllocator |
| 621 | |
| 622 | Structure used to describe an arena allocator. The structure has |
| 623 | three fields: |
| 624 | |
| 625 | +--------------------------------------------------+---------------------------------------+ |
| 626 | | Field | Meaning | |
| 627 | +==================================================+=======================================+ |
| 628 | | ``void *ctx`` | user context passed as first argument | |
| 629 | +--------------------------------------------------+---------------------------------------+ |
| 630 | | ``void* alloc(void *ctx, size_t size)`` | allocate an arena of size bytes | |
| 631 | +--------------------------------------------------+---------------------------------------+ |
Victor Stinner | 0d6bd1c | 2021-03-09 12:16:42 +0100 | [diff] [blame] | 632 | | ``void free(void *ctx, void *ptr, size_t size)`` | free an arena | |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 633 | +--------------------------------------------------+---------------------------------------+ |
| 634 | |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 635 | .. c:function:: void PyObject_GetArenaAllocator(PyObjectArenaAllocator *allocator) |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 636 | |
| 637 | Get the arena allocator. |
| 638 | |
Victor Stinner | 474652f | 2020-08-13 22:11:50 +0200 | [diff] [blame] | 639 | .. c:function:: void PyObject_SetArenaAllocator(PyObjectArenaAllocator *allocator) |
Victor Stinner | 0507bf5 | 2013-07-07 02:05:46 +0200 | [diff] [blame] | 640 | |
| 641 | Set the arena allocator. |
| 642 | |
| 643 | |
Victor Stinner | 5ea4c06 | 2017-06-20 17:46:36 +0200 | [diff] [blame] | 644 | tracemalloc C API |
| 645 | ================= |
| 646 | |
| 647 | .. versionadded:: 3.7 |
| 648 | |
Julien Danjou | d4d17fd | 2020-02-21 11:47:41 +0100 | [diff] [blame] | 649 | .. c:function:: int PyTraceMalloc_Track(unsigned int domain, uintptr_t ptr, size_t size) |
Victor Stinner | 5ea4c06 | 2017-06-20 17:46:36 +0200 | [diff] [blame] | 650 | |
| 651 | Track an allocated memory block in the :mod:`tracemalloc` module. |
| 652 | |
Serhiy Storchaka | 5bb0005 | 2018-02-09 13:31:19 +0200 | [diff] [blame] | 653 | Return ``0`` on success, return ``-1`` on error (failed to allocate memory to |
Victor Stinner | 5ea4c06 | 2017-06-20 17:46:36 +0200 | [diff] [blame] | 654 | store the trace). Return ``-2`` if tracemalloc is disabled. |
| 655 | |
| 656 | If memory block is already tracked, update the existing trace. |
| 657 | |
Julien Danjou | d4d17fd | 2020-02-21 11:47:41 +0100 | [diff] [blame] | 658 | .. c:function:: int PyTraceMalloc_Untrack(unsigned int domain, uintptr_t ptr) |
Victor Stinner | 5ea4c06 | 2017-06-20 17:46:36 +0200 | [diff] [blame] | 659 | |
| 660 | Untrack an allocated memory block in the :mod:`tracemalloc` module. |
| 661 | Do nothing if the block was not tracked. |
| 662 | |
| 663 | Return ``-2`` if tracemalloc is disabled, otherwise return ``0``. |
| 664 | |
| 665 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 666 | .. _memoryexamples: |
| 667 | |
| 668 | Examples |
| 669 | ======== |
| 670 | |
| 671 | Here is the example from section :ref:`memoryoverview`, rewritten so that the |
| 672 | I/O buffer is allocated from the Python heap by using the first function set:: |
| 673 | |
| 674 | PyObject *res; |
| 675 | char *buf = (char *) PyMem_Malloc(BUFSIZ); /* for I/O */ |
| 676 | |
| 677 | if (buf == NULL) |
| 678 | return PyErr_NoMemory(); |
| 679 | /* ...Do some I/O operation involving buf... */ |
Gregory P. Smith | 4b52ae8 | 2013-03-22 13:43:30 -0700 | [diff] [blame] | 680 | res = PyBytes_FromString(buf); |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 681 | PyMem_Free(buf); /* allocated with PyMem_Malloc */ |
| 682 | return res; |
| 683 | |
| 684 | The same code using the type-oriented function set:: |
| 685 | |
| 686 | PyObject *res; |
| 687 | char *buf = PyMem_New(char, BUFSIZ); /* for I/O */ |
| 688 | |
| 689 | if (buf == NULL) |
| 690 | return PyErr_NoMemory(); |
| 691 | /* ...Do some I/O operation involving buf... */ |
Gregory P. Smith | 4b52ae8 | 2013-03-22 13:43:30 -0700 | [diff] [blame] | 692 | res = PyBytes_FromString(buf); |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 693 | PyMem_Del(buf); /* allocated with PyMem_New */ |
| 694 | return res; |
| 695 | |
| 696 | Note that in the two examples above, the buffer is always manipulated via |
| 697 | functions belonging to the same set. Indeed, it is required to use the same |
| 698 | memory API family for a given memory block, so that the risk of mixing different |
| 699 | allocators is reduced to a minimum. The following code sequence contains two |
| 700 | errors, one of which is labeled as *fatal* because it mixes two different |
| 701 | allocators operating on different heaps. :: |
| 702 | |
| 703 | char *buf1 = PyMem_New(char, BUFSIZ); |
| 704 | char *buf2 = (char *) malloc(BUFSIZ); |
| 705 | char *buf3 = (char *) PyMem_Malloc(BUFSIZ); |
| 706 | ... |
| 707 | PyMem_Del(buf3); /* Wrong -- should be PyMem_Free() */ |
| 708 | free(buf2); /* Right -- allocated via malloc() */ |
| 709 | free(buf1); /* Fatal -- should be PyMem_Del() */ |
| 710 | |
| 711 | In addition to the functions aimed at handling raw memory blocks from the Python |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 712 | heap, objects in Python are allocated and released with :c:func:`PyObject_New`, |
| 713 | :c:func:`PyObject_NewVar` and :c:func:`PyObject_Del`. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 714 | |
| 715 | These will be explained in the next chapter on defining and implementing new |
| 716 | object types in C. |