blob: a82e1c2f2c5058651af1389cd8a91ed9a9ad5ccf [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001.. highlightlang:: c
2
3
4.. _memory:
5
6*****************
7Memory Management
8*****************
9
10.. sectionauthor:: Vladimir Marangozov <Vladimir.Marangozov@inrialpes.fr>
11
12
13
14.. _memoryoverview:
15
16Overview
17========
18
19Memory management in Python involves a private heap containing all Python
20objects and data structures. The management of this private heap is ensured
21internally by the *Python memory manager*. The Python memory manager has
22different components which deal with various dynamic storage management aspects,
23like sharing, segmentation, preallocation or caching.
24
25At the lowest level, a raw memory allocator ensures that there is enough room in
26the private heap for storing all Python-related data by interacting with the
27memory manager of the operating system. On top of the raw memory allocator,
28several object-specific allocators operate on the same heap and implement
29distinct memory management policies adapted to the peculiarities of every object
30type. For example, integer objects are managed differently within the heap than
31strings, tuples or dictionaries because integers imply different storage
32requirements and speed/space tradeoffs. The Python memory manager thus delegates
33some of the work to the object-specific allocators, but ensures that the latter
34operate within the bounds of the private heap.
35
36It is important to understand that the management of the Python heap is
37performed by the interpreter itself and that the user has no control over it,
38even if she regularly manipulates object pointers to memory blocks inside that
39heap. The allocation of heap space for Python objects and other internal
40buffers is performed on demand by the Python memory manager through the Python/C
41API functions listed in this document.
42
43.. index::
44 single: malloc()
45 single: calloc()
46 single: realloc()
47 single: free()
48
49To avoid memory corruption, extension writers should never try to operate on
Georg Brandl60203b42010-10-06 10:11:56 +000050Python objects with the functions exported by the C library: :c:func:`malloc`,
51:c:func:`calloc`, :c:func:`realloc` and :c:func:`free`. This will result in mixed
Georg Brandl116aa622007-08-15 14:28:22 +000052calls between the C allocator and the Python memory manager with fatal
53consequences, because they implement different algorithms and operate on
54different heaps. However, one may safely allocate and release memory blocks
55with the C library allocator for individual purposes, as shown in the following
56example::
57
58 PyObject *res;
59 char *buf = (char *) malloc(BUFSIZ); /* for I/O */
60
61 if (buf == NULL)
62 return PyErr_NoMemory();
63 ...Do some I/O operation involving buf...
Gregory P. Smith4b52ae82013-03-22 13:43:30 -070064 res = PyBytes_FromString(buf);
Georg Brandl116aa622007-08-15 14:28:22 +000065 free(buf); /* malloc'ed */
66 return res;
67
68In this example, the memory request for the I/O buffer is handled by the C
69library allocator. The Python memory manager is involved only in the allocation
70of the string object returned as a result.
71
72In most situations, however, it is recommended to allocate memory from the
73Python heap specifically because the latter is under control of the Python
74memory manager. For example, this is required when the interpreter is extended
75with new object types written in C. Another reason for using the Python heap is
76the desire to *inform* the Python memory manager about the memory needs of the
77extension module. Even when the requested memory is used exclusively for
78internal, highly-specific purposes, delegating all memory requests to the Python
79memory manager causes the interpreter to have a more accurate image of its
80memory footprint as a whole. Consequently, under certain circumstances, the
81Python memory manager may or may not trigger appropriate actions, like garbage
82collection, memory compaction or other preventive procedures. Note that by using
83the C library allocator as shown in the previous example, the allocated memory
84for the I/O buffer escapes completely the Python memory manager.
85
86
Victor Stinner0507bf52013-07-07 02:05:46 +020087Raw Memory Interface
88====================
89
90The following function sets are wrappers to the system allocator. These
91functions are thread-safe, the :term:`GIL <global interpreter lock>` does not
92need to be held.
93
94The default raw memory block allocator uses the following functions:
95:c:func:`malloc`, :c:func:`realloc` and :c:func:`free`; call ``malloc(1)`` when
96requesting zero bytes.
97
98.. versionadded:: 3.4
99
100.. c:function:: void* PyMem_RawMalloc(size_t n)
101
102 Allocates *n* bytes and returns a pointer of type :c:type:`void\*` to the
103 allocated memory, or *NULL* if the request fails. Requesting zero bytes
104 returns a distinct non-*NULL* pointer if possible, as if
105 ``PyMem_RawMalloc(1)`` had been called instead. The memory will not have
106 been initialized in any way.
107
108
109.. c:function:: void* PyMem_RawRealloc(void *p, size_t n)
110
111 Resizes the memory block pointed to by *p* to *n* bytes. The contents will
112 be unchanged to the minimum of the old and the new sizes. If *p* is *NULL*,
113 the call is equivalent to ``PyMem_RawMalloc(n)``; else if *n* is equal to
114 zero, the memory block is resized but is not freed, and the returned pointer
115 is non-*NULL*. Unless *p* is *NULL*, it must have been returned by a
116 previous call to :c:func:`PyMem_RawMalloc` or :c:func:`PyMem_RawRealloc`. If
117 the request fails, :c:func:`PyMem_RawRealloc` returns *NULL* and *p* remains
118 a valid pointer to the previous memory area.
119
120
121.. c:function:: void PyMem_RawFree(void *p)
122
123 Frees the memory block pointed to by *p*, which must have been returned by a
124 previous call to :c:func:`PyMem_RawMalloc` or :c:func:`PyMem_RawRealloc`.
125 Otherwise, or if ``PyMem_Free(p)`` has been called before, undefined
126 behavior occurs. If *p* is *NULL*, no operation is performed.
127
128
Georg Brandl116aa622007-08-15 14:28:22 +0000129.. _memoryinterface:
130
131Memory Interface
132================
133
134The following function sets, modeled after the ANSI C standard, but specifying
135behavior when requesting zero bytes, are available for allocating and releasing
Victor Stinner0507bf52013-07-07 02:05:46 +0200136memory from the Python heap.
Georg Brandl116aa622007-08-15 14:28:22 +0000137
Victor Stinner0507bf52013-07-07 02:05:46 +0200138The default memory block allocator uses the following functions:
139:c:func:`malloc`, :c:func:`realloc` and :c:func:`free`; call ``malloc(1)`` when
140requesting zero bytes.
141
142.. warning::
143
144 The :term:`GIL <global interpreter lock>` must be held when using these
145 functions.
Georg Brandl116aa622007-08-15 14:28:22 +0000146
Georg Brandl60203b42010-10-06 10:11:56 +0000147.. c:function:: void* PyMem_Malloc(size_t n)
Georg Brandl116aa622007-08-15 14:28:22 +0000148
Georg Brandl60203b42010-10-06 10:11:56 +0000149 Allocates *n* bytes and returns a pointer of type :c:type:`void\*` to the
Georg Brandl116aa622007-08-15 14:28:22 +0000150 allocated memory, or *NULL* if the request fails. Requesting zero bytes returns
Andrew Svetlov7dbee382012-08-09 21:26:34 +0300151 a distinct non-*NULL* pointer if possible, as if ``PyMem_Malloc(1)`` had
Georg Brandl116aa622007-08-15 14:28:22 +0000152 been called instead. The memory will not have been initialized in any way.
153
154
Georg Brandl60203b42010-10-06 10:11:56 +0000155.. c:function:: void* PyMem_Realloc(void *p, size_t n)
Georg Brandl116aa622007-08-15 14:28:22 +0000156
157 Resizes the memory block pointed to by *p* to *n* bytes. The contents will be
158 unchanged to the minimum of the old and the new sizes. If *p* is *NULL*, the
Andrew Svetlov7dbee382012-08-09 21:26:34 +0300159 call is equivalent to ``PyMem_Malloc(n)``; else if *n* is equal to zero,
Georg Brandl116aa622007-08-15 14:28:22 +0000160 the memory block is resized but is not freed, and the returned pointer is
161 non-*NULL*. Unless *p* is *NULL*, it must have been returned by a previous call
Georg Brandl60203b42010-10-06 10:11:56 +0000162 to :c:func:`PyMem_Malloc` or :c:func:`PyMem_Realloc`. If the request fails,
163 :c:func:`PyMem_Realloc` returns *NULL* and *p* remains a valid pointer to the
Georg Brandl116aa622007-08-15 14:28:22 +0000164 previous memory area.
165
166
Georg Brandl60203b42010-10-06 10:11:56 +0000167.. c:function:: void PyMem_Free(void *p)
Georg Brandl116aa622007-08-15 14:28:22 +0000168
169 Frees the memory block pointed to by *p*, which must have been returned by a
Georg Brandl60203b42010-10-06 10:11:56 +0000170 previous call to :c:func:`PyMem_Malloc` or :c:func:`PyMem_Realloc`. Otherwise, or
Andrew Svetlov7dbee382012-08-09 21:26:34 +0300171 if ``PyMem_Free(p)`` has been called before, undefined behavior occurs. If
Georg Brandl116aa622007-08-15 14:28:22 +0000172 *p* is *NULL*, no operation is performed.
173
174The following type-oriented macros are provided for convenience. Note that
175*TYPE* refers to any C type.
176
177
Georg Brandl60203b42010-10-06 10:11:56 +0000178.. c:function:: TYPE* PyMem_New(TYPE, size_t n)
Georg Brandl116aa622007-08-15 14:28:22 +0000179
Georg Brandl60203b42010-10-06 10:11:56 +0000180 Same as :c:func:`PyMem_Malloc`, but allocates ``(n * sizeof(TYPE))`` bytes of
181 memory. Returns a pointer cast to :c:type:`TYPE\*`. The memory will not have
Georg Brandl116aa622007-08-15 14:28:22 +0000182 been initialized in any way.
183
184
Georg Brandl60203b42010-10-06 10:11:56 +0000185.. c:function:: TYPE* PyMem_Resize(void *p, TYPE, size_t n)
Georg Brandl116aa622007-08-15 14:28:22 +0000186
Georg Brandl60203b42010-10-06 10:11:56 +0000187 Same as :c:func:`PyMem_Realloc`, but the memory block is resized to ``(n *
188 sizeof(TYPE))`` bytes. Returns a pointer cast to :c:type:`TYPE\*`. On return,
Georg Brandld492ad82008-07-23 16:13:07 +0000189 *p* will be a pointer to the new memory area, or *NULL* in the event of
190 failure. This is a C preprocessor macro; p is always reassigned. Save
191 the original value of p to avoid losing memory when handling errors.
Georg Brandl116aa622007-08-15 14:28:22 +0000192
193
Georg Brandl60203b42010-10-06 10:11:56 +0000194.. c:function:: void PyMem_Del(void *p)
Georg Brandl116aa622007-08-15 14:28:22 +0000195
Georg Brandl60203b42010-10-06 10:11:56 +0000196 Same as :c:func:`PyMem_Free`.
Georg Brandl116aa622007-08-15 14:28:22 +0000197
198In addition, the following macro sets are provided for calling the Python memory
199allocator directly, without involving the C API functions listed above. However,
200note that their use does not preserve binary compatibility across Python
201versions and is therefore deprecated in extension modules.
202
Georg Brandl60203b42010-10-06 10:11:56 +0000203:c:func:`PyMem_MALLOC`, :c:func:`PyMem_REALLOC`, :c:func:`PyMem_FREE`.
Georg Brandl116aa622007-08-15 14:28:22 +0000204
Georg Brandl60203b42010-10-06 10:11:56 +0000205:c:func:`PyMem_NEW`, :c:func:`PyMem_RESIZE`, :c:func:`PyMem_DEL`.
Georg Brandl116aa622007-08-15 14:28:22 +0000206
207
Victor Stinner0507bf52013-07-07 02:05:46 +0200208Customize Memory Allocators
209===========================
210
211.. versionadded:: 3.4
212
213.. c:type:: PyMemAllocator
214
215 Structure used to describe a memory block allocator. The structure has
216 four fields:
217
218 +----------------------------------------------------------+---------------------------------------+
219 | Field | Meaning |
220 +==========================================================+=======================================+
221 | ``void *ctx`` | user context passed as first argument |
222 +----------------------------------------------------------+---------------------------------------+
223 | ``void* malloc(void *ctx, size_t size)`` | allocate a memory block |
224 +----------------------------------------------------------+---------------------------------------+
225 | ``void* realloc(void *ctx, void *ptr, size_t new_size)`` | allocate or resize a memory block |
226 +----------------------------------------------------------+---------------------------------------+
227 | ``void free(void *ctx, void *ptr)`` | free a memory block |
228 +----------------------------------------------------------+---------------------------------------+
229
230.. c:type:: PyMemAllocatorDomain
231
232 Enum used to identify an allocator domain. Domains:
233
234 * :c:data:`PYMEM_DOMAIN_RAW`: functions :c:func:`PyMem_RawMalloc`,
235 :c:func:`PyMem_RawRealloc` and :c:func:`PyMem_RawFree`
236 * :c:data:`PYMEM_DOMAIN_MEM`: functions :c:func:`PyMem_Malloc`,
237 :c:func:`PyMem_Realloc` and :c:func:`PyMem_Free`
238 * :c:data:`PYMEM_DOMAIN_OBJ`: functions :c:func:`PyObject_Malloc`,
239 :c:func:`PyObject_Realloc` and :c:func:`PyObject_Free`
240
241
242.. c:function:: void PyMem_GetAllocator(PyMemAllocatorDomain domain, PyMemAllocator *allocator)
243
244 Get the memory block allocator of the specified domain.
245
246
247.. c:function:: void PyMem_SetAllocator(PyMemAllocatorDomain domain, PyMemAllocator *allocator)
248
249 Set the memory block allocator of the specified domain.
250
251 The new allocator must return a distinct non-NULL pointer when requesting
252 zero bytes.
253
254 For the :c:data:`PYMEM_DOMAIN_RAW` domain, the allocator must be
255 thread-safe: the :term:`GIL <global interpreter lock>` is not held when the
256 allocator is called.
257
258 If the new allocator is not a hook (does not call the previous allocator),
259 the :c:func:`PyMem_SetupDebugHooks` function must be called to reinstall the
260 debug hooks on top on the new allocator.
261
262
263.. c:function:: void PyMem_SetupDebugHooks(void)
264
265 Setup hooks to detect bugs in the following Python memory allocator
266 functions:
267
268 - :c:func:`PyMem_RawMalloc`, :c:func:`PyMem_RawRealloc`,
269 :c:func:`PyMem_RawFree`
270 - :c:func:`PyMem_Malloc`, :c:func:`PyMem_Realloc`, :c:func:`PyMem_Free`
271 - :c:func:`PyObject_Malloc`, :c:func:`PyObject_Realloc`,
272 :c:func:`PyObject_Free`
273
274 Newly allocated memory is filled with the byte ``0xCB``, freed memory is
275 filled with the byte ``0xDB``. Additionnal checks:
276
277 - detect API violations, ex: :c:func:`PyObject_Free` called on a buffer
278 allocated by :c:func:`PyMem_Malloc`
279 - detect write before the start of the buffer (buffer underflow)
280 - detect write after the end of the buffer (buffer overflow)
281
282 The function does nothing if Python is not compiled is debug mode.
283
284
285Customize PyObject Arena Allocator
286==================================
287
288Python has a *pymalloc* allocator for allocations smaller than 512 bytes. This
289allocator is optimized for small objects with a short lifetime. It uses memory
290mappings called "arenas" with a fixed size of 256 KB. It falls back to
Victor Stinner6cf185d2013-10-10 15:58:42 +0200291:c:func:`PyMem_RawMalloc` and :c:func:`PyMem_RawRealloc` for allocations larger
292than 512 bytes. *pymalloc* is the default allocator used by
Victor Stinner0507bf52013-07-07 02:05:46 +0200293:c:func:`PyObject_Malloc`.
294
295The default arena allocator uses the following functions:
296
297* :c:func:`VirtualAlloc` and :c:func:`VirtualFree` on Windows,
298* :c:func:`mmap` and :c:func:`munmap` if available,
299* :c:func:`malloc` and :c:func:`free` otherwise.
300
301.. versionadded:: 3.4
302
303.. c:type:: PyObjectArenaAllocator
304
305 Structure used to describe an arena allocator. The structure has
306 three fields:
307
308 +--------------------------------------------------+---------------------------------------+
309 | Field | Meaning |
310 +==================================================+=======================================+
311 | ``void *ctx`` | user context passed as first argument |
312 +--------------------------------------------------+---------------------------------------+
313 | ``void* alloc(void *ctx, size_t size)`` | allocate an arena of size bytes |
314 +--------------------------------------------------+---------------------------------------+
315 | ``void free(void *ctx, size_t size, void *ptr)`` | free an arena |
316 +--------------------------------------------------+---------------------------------------+
317
318.. c:function:: PyObject_GetArenaAllocator(PyObjectArenaAllocator *allocator)
319
320 Get the arena allocator.
321
322.. c:function:: PyObject_SetArenaAllocator(PyObjectArenaAllocator *allocator)
323
324 Set the arena allocator.
325
326
Georg Brandl116aa622007-08-15 14:28:22 +0000327.. _memoryexamples:
328
329Examples
330========
331
332Here is the example from section :ref:`memoryoverview`, rewritten so that the
333I/O buffer is allocated from the Python heap by using the first function set::
334
335 PyObject *res;
336 char *buf = (char *) PyMem_Malloc(BUFSIZ); /* for I/O */
337
338 if (buf == NULL)
339 return PyErr_NoMemory();
340 /* ...Do some I/O operation involving buf... */
Gregory P. Smith4b52ae82013-03-22 13:43:30 -0700341 res = PyBytes_FromString(buf);
Georg Brandl116aa622007-08-15 14:28:22 +0000342 PyMem_Free(buf); /* allocated with PyMem_Malloc */
343 return res;
344
345The same code using the type-oriented function set::
346
347 PyObject *res;
348 char *buf = PyMem_New(char, BUFSIZ); /* for I/O */
349
350 if (buf == NULL)
351 return PyErr_NoMemory();
352 /* ...Do some I/O operation involving buf... */
Gregory P. Smith4b52ae82013-03-22 13:43:30 -0700353 res = PyBytes_FromString(buf);
Georg Brandl116aa622007-08-15 14:28:22 +0000354 PyMem_Del(buf); /* allocated with PyMem_New */
355 return res;
356
357Note that in the two examples above, the buffer is always manipulated via
358functions belonging to the same set. Indeed, it is required to use the same
359memory API family for a given memory block, so that the risk of mixing different
360allocators is reduced to a minimum. The following code sequence contains two
361errors, one of which is labeled as *fatal* because it mixes two different
362allocators operating on different heaps. ::
363
364 char *buf1 = PyMem_New(char, BUFSIZ);
365 char *buf2 = (char *) malloc(BUFSIZ);
366 char *buf3 = (char *) PyMem_Malloc(BUFSIZ);
367 ...
368 PyMem_Del(buf3); /* Wrong -- should be PyMem_Free() */
369 free(buf2); /* Right -- allocated via malloc() */
370 free(buf1); /* Fatal -- should be PyMem_Del() */
371
372In addition to the functions aimed at handling raw memory blocks from the Python
Georg Brandl60203b42010-10-06 10:11:56 +0000373heap, objects in Python are allocated and released with :c:func:`PyObject_New`,
374:c:func:`PyObject_NewVar` and :c:func:`PyObject_Del`.
Georg Brandl116aa622007-08-15 14:28:22 +0000375
376These will be explained in the next chapter on defining and implementing new
377object types in C.
378