blob: 17cfdeb965d3a6d924a41f4eec66b9155dff92cb [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001.. highlightlang:: c
2
3
4.. _extending-intro:
5
6******************************
7Extending Python with C or C++
8******************************
9
10It is quite easy to add new built-in modules to Python, if you know how to
11program in C. Such :dfn:`extension modules` can do two things that can't be
12done directly in Python: they can implement new built-in object types, and they
13can call C library functions and system calls.
14
15To support extensions, the Python API (Application Programmers Interface)
16defines a set of functions, macros and variables that provide access to most
17aspects of the Python run-time system. The Python API is incorporated in a C
18source file by including the header ``"Python.h"``.
19
20The compilation of an extension module depends on its intended use as well as on
21your system setup; details are given in later chapters.
22
Benjamin Peterson63b55582015-01-05 14:38:46 -060023.. note::
Brett Cannon7f98a6c2009-09-17 03:39:33 +000024
Benjamin Peterson63b55582015-01-05 14:38:46 -060025 The C extension interface is specific to CPython, and extension modules do
26 not work on other Python implementations. In many cases, it is possible to
27 avoid writing C extensions and preserve portability to other implementations.
28 For example, if your use case is calling C library functions or system calls,
29 you should consider using the :mod:`ctypes` module or the `cffi
30 <http://cffi.readthedocs.org>`_ library rather than writing custom C code.
31 These modules let you write Python code to interface with C code and are more
32 portable between implementations of Python than writing and compiling a C
33 extension module.
Brett Cannon7f98a6c2009-09-17 03:39:33 +000034
Georg Brandl116aa622007-08-15 14:28:22 +000035
36.. _extending-simpleexample:
37
38A Simple Example
39================
40
41Let's create an extension module called ``spam`` (the favorite food of Monty
42Python fans...) and let's say we want to create a Python interface to the C
Georg Brandl60203b42010-10-06 10:11:56 +000043library function :c:func:`system`. [#]_ This function takes a null-terminated
Georg Brandl116aa622007-08-15 14:28:22 +000044character string as argument and returns an integer. We want this function to
45be callable from Python as follows::
46
47 >>> import spam
48 >>> status = spam.system("ls -l")
49
50Begin by creating a file :file:`spammodule.c`. (Historically, if a module is
51called ``spam``, the C file containing its implementation is called
52:file:`spammodule.c`; if the module name is very long, like ``spammify``, the
53module name can be just :file:`spammify.c`.)
54
55The first line of our file can be::
56
57 #include <Python.h>
58
59which pulls in the Python API (you can add a comment describing the purpose of
60the module and a copyright notice if you like).
61
Georg Brandle720c0a2009-04-27 16:20:50 +000062.. note::
Georg Brandl116aa622007-08-15 14:28:22 +000063
64 Since Python may define some pre-processor definitions which affect the standard
65 headers on some systems, you *must* include :file:`Python.h` before any standard
66 headers are included.
67
68All user-visible symbols defined by :file:`Python.h` have a prefix of ``Py`` or
69``PY``, except those defined in standard header files. For convenience, and
70since they are used extensively by the Python interpreter, ``"Python.h"``
71includes a few standard header files: ``<stdio.h>``, ``<string.h>``,
72``<errno.h>``, and ``<stdlib.h>``. If the latter header file does not exist on
Georg Brandl60203b42010-10-06 10:11:56 +000073your system, it declares the functions :c:func:`malloc`, :c:func:`free` and
74:c:func:`realloc` directly.
Georg Brandl116aa622007-08-15 14:28:22 +000075
76The next thing we add to our module file is the C function that will be called
77when the Python expression ``spam.system(string)`` is evaluated (we'll see
78shortly how it ends up being called)::
79
80 static PyObject *
81 spam_system(PyObject *self, PyObject *args)
82 {
83 const char *command;
84 int sts;
85
86 if (!PyArg_ParseTuple(args, "s", &command))
87 return NULL;
88 sts = system(command);
Georg Brandlc877a7c2010-11-26 11:55:48 +000089 return PyLong_FromLong(sts);
Georg Brandl116aa622007-08-15 14:28:22 +000090 }
91
92There is a straightforward translation from the argument list in Python (for
93example, the single expression ``"ls -l"``) to the arguments passed to the C
94function. The C function always has two arguments, conventionally named *self*
95and *args*.
96
Georg Brandl21dc5ba2009-07-11 10:43:08 +000097The *self* argument points to the module object for module-level functions;
98for a method it would point to the object instance.
Georg Brandl116aa622007-08-15 14:28:22 +000099
100The *args* argument will be a pointer to a Python tuple object containing the
101arguments. Each item of the tuple corresponds to an argument in the call's
102argument list. The arguments are Python objects --- in order to do anything
103with them in our C function we have to convert them to C values. The function
Georg Brandl60203b42010-10-06 10:11:56 +0000104:c:func:`PyArg_ParseTuple` in the Python API checks the argument types and
Georg Brandl116aa622007-08-15 14:28:22 +0000105converts them to C values. It uses a template string to determine the required
106types of the arguments as well as the types of the C variables into which to
107store the converted values. More about this later.
108
Georg Brandl60203b42010-10-06 10:11:56 +0000109:c:func:`PyArg_ParseTuple` returns true (nonzero) if all arguments have the right
Georg Brandl116aa622007-08-15 14:28:22 +0000110type and its components have been stored in the variables whose addresses are
111passed. It returns false (zero) if an invalid argument list was passed. In the
112latter case it also raises an appropriate exception so the calling function can
113return *NULL* immediately (as we saw in the example).
114
115
116.. _extending-errors:
117
118Intermezzo: Errors and Exceptions
119=================================
120
121An important convention throughout the Python interpreter is the following: when
122a function fails, it should set an exception condition and return an error value
123(usually a *NULL* pointer). Exceptions are stored in a static global variable
124inside the interpreter; if this variable is *NULL* no exception has occurred. A
125second global variable stores the "associated value" of the exception (the
126second argument to :keyword:`raise`). A third variable contains the stack
127traceback in case the error originated in Python code. These three variables
128are the C equivalents of the result in Python of :meth:`sys.exc_info` (see the
129section on module :mod:`sys` in the Python Library Reference). It is important
130to know about them to understand how errors are passed around.
131
132The Python API defines a number of functions to set various types of exceptions.
133
Georg Brandl60203b42010-10-06 10:11:56 +0000134The most common one is :c:func:`PyErr_SetString`. Its arguments are an exception
Georg Brandl116aa622007-08-15 14:28:22 +0000135object and a C string. The exception object is usually a predefined object like
Georg Brandl60203b42010-10-06 10:11:56 +0000136:c:data:`PyExc_ZeroDivisionError`. The C string indicates the cause of the error
Georg Brandl116aa622007-08-15 14:28:22 +0000137and is converted to a Python string object and stored as the "associated value"
138of the exception.
139
Georg Brandl60203b42010-10-06 10:11:56 +0000140Another useful function is :c:func:`PyErr_SetFromErrno`, which only takes an
Georg Brandl116aa622007-08-15 14:28:22 +0000141exception argument and constructs the associated value by inspection of the
Georg Brandl60203b42010-10-06 10:11:56 +0000142global variable :c:data:`errno`. The most general function is
143:c:func:`PyErr_SetObject`, which takes two object arguments, the exception and
144its associated value. You don't need to :c:func:`Py_INCREF` the objects passed
Georg Brandl116aa622007-08-15 14:28:22 +0000145to any of these functions.
146
147You can test non-destructively whether an exception has been set with
Georg Brandl60203b42010-10-06 10:11:56 +0000148:c:func:`PyErr_Occurred`. This returns the current exception object, or *NULL*
Georg Brandl116aa622007-08-15 14:28:22 +0000149if no exception has occurred. You normally don't need to call
Georg Brandl60203b42010-10-06 10:11:56 +0000150:c:func:`PyErr_Occurred` to see whether an error occurred in a function call,
Georg Brandl116aa622007-08-15 14:28:22 +0000151since you should be able to tell from the return value.
152
153When a function *f* that calls another function *g* detects that the latter
154fails, *f* should itself return an error value (usually *NULL* or ``-1``). It
Georg Brandl60203b42010-10-06 10:11:56 +0000155should *not* call one of the :c:func:`PyErr_\*` functions --- one has already
Georg Brandl116aa622007-08-15 14:28:22 +0000156been called by *g*. *f*'s caller is then supposed to also return an error
Georg Brandl60203b42010-10-06 10:11:56 +0000157indication to *its* caller, again *without* calling :c:func:`PyErr_\*`, and so on
Georg Brandl116aa622007-08-15 14:28:22 +0000158--- the most detailed cause of the error was already reported by the function
159that first detected it. Once the error reaches the Python interpreter's main
160loop, this aborts the currently executing Python code and tries to find an
161exception handler specified by the Python programmer.
162
163(There are situations where a module can actually give a more detailed error
Georg Brandl60203b42010-10-06 10:11:56 +0000164message by calling another :c:func:`PyErr_\*` function, and in such cases it is
Georg Brandl116aa622007-08-15 14:28:22 +0000165fine to do so. As a general rule, however, this is not necessary, and can cause
166information about the cause of the error to be lost: most operations can fail
167for a variety of reasons.)
168
169To ignore an exception set by a function call that failed, the exception
Georg Brandl682d7e02010-10-06 10:26:05 +0000170condition must be cleared explicitly by calling :c:func:`PyErr_Clear`. The only
Georg Brandl60203b42010-10-06 10:11:56 +0000171time C code should call :c:func:`PyErr_Clear` is if it doesn't want to pass the
Georg Brandl116aa622007-08-15 14:28:22 +0000172error on to the interpreter but wants to handle it completely by itself
173(possibly by trying something else, or pretending nothing went wrong).
174
Georg Brandl60203b42010-10-06 10:11:56 +0000175Every failing :c:func:`malloc` call must be turned into an exception --- the
176direct caller of :c:func:`malloc` (or :c:func:`realloc`) must call
177:c:func:`PyErr_NoMemory` and return a failure indicator itself. All the
178object-creating functions (for example, :c:func:`PyLong_FromLong`) already do
179this, so this note is only relevant to those who call :c:func:`malloc` directly.
Georg Brandl116aa622007-08-15 14:28:22 +0000180
Georg Brandl60203b42010-10-06 10:11:56 +0000181Also note that, with the important exception of :c:func:`PyArg_ParseTuple` and
Georg Brandl116aa622007-08-15 14:28:22 +0000182friends, functions that return an integer status usually return a positive value
183or zero for success and ``-1`` for failure, like Unix system calls.
184
Georg Brandl60203b42010-10-06 10:11:56 +0000185Finally, be careful to clean up garbage (by making :c:func:`Py_XDECREF` or
186:c:func:`Py_DECREF` calls for objects you have already created) when you return
Georg Brandl116aa622007-08-15 14:28:22 +0000187an error indicator!
188
189The choice of which exception to raise is entirely yours. There are predeclared
190C objects corresponding to all built-in Python exceptions, such as
Georg Brandl60203b42010-10-06 10:11:56 +0000191:c:data:`PyExc_ZeroDivisionError`, which you can use directly. Of course, you
192should choose exceptions wisely --- don't use :c:data:`PyExc_TypeError` to mean
193that a file couldn't be opened (that should probably be :c:data:`PyExc_IOError`).
194If something's wrong with the argument list, the :c:func:`PyArg_ParseTuple`
195function usually raises :c:data:`PyExc_TypeError`. If you have an argument whose
Georg Brandl116aa622007-08-15 14:28:22 +0000196value must be in a particular range or must satisfy other conditions,
Georg Brandl60203b42010-10-06 10:11:56 +0000197:c:data:`PyExc_ValueError` is appropriate.
Georg Brandl116aa622007-08-15 14:28:22 +0000198
199You can also define a new exception that is unique to your module. For this, you
200usually declare a static object variable at the beginning of your file::
201
202 static PyObject *SpamError;
203
Georg Brandl60203b42010-10-06 10:11:56 +0000204and initialize it in your module's initialization function (:c:func:`PyInit_spam`)
Georg Brandl116aa622007-08-15 14:28:22 +0000205with an exception object (leaving out the error checking for now)::
206
207 PyMODINIT_FUNC
Martin v. Löwis1a214512008-06-11 05:26:20 +0000208 PyInit_spam(void)
Georg Brandl116aa622007-08-15 14:28:22 +0000209 {
210 PyObject *m;
211
Martin v. Löwis1a214512008-06-11 05:26:20 +0000212 m = PyModule_Create(&spammodule);
Georg Brandl116aa622007-08-15 14:28:22 +0000213 if (m == NULL)
Martin v. Löwis1a214512008-06-11 05:26:20 +0000214 return NULL;
Georg Brandl116aa622007-08-15 14:28:22 +0000215
216 SpamError = PyErr_NewException("spam.error", NULL, NULL);
217 Py_INCREF(SpamError);
218 PyModule_AddObject(m, "error", SpamError);
Martin v. Löwis1a214512008-06-11 05:26:20 +0000219 return m;
Georg Brandl116aa622007-08-15 14:28:22 +0000220 }
221
222Note that the Python name for the exception object is :exc:`spam.error`. The
Georg Brandl60203b42010-10-06 10:11:56 +0000223:c:func:`PyErr_NewException` function may create a class with the base class
Georg Brandl116aa622007-08-15 14:28:22 +0000224being :exc:`Exception` (unless another class is passed in instead of *NULL*),
225described in :ref:`bltin-exceptions`.
226
Georg Brandl60203b42010-10-06 10:11:56 +0000227Note also that the :c:data:`SpamError` variable retains a reference to the newly
Georg Brandl116aa622007-08-15 14:28:22 +0000228created exception class; this is intentional! Since the exception could be
229removed from the module by external code, an owned reference to the class is
Georg Brandl60203b42010-10-06 10:11:56 +0000230needed to ensure that it will not be discarded, causing :c:data:`SpamError` to
Georg Brandl116aa622007-08-15 14:28:22 +0000231become a dangling pointer. Should it become a dangling pointer, C code which
232raises the exception could cause a core dump or other unintended side effects.
233
Georg Brandl9c491c92010-08-02 20:21:21 +0000234We discuss the use of ``PyMODINIT_FUNC`` as a function return type later in this
Georg Brandl116aa622007-08-15 14:28:22 +0000235sample.
236
Georg Brandl9c491c92010-08-02 20:21:21 +0000237The :exc:`spam.error` exception can be raised in your extension module using a
Georg Brandl60203b42010-10-06 10:11:56 +0000238call to :c:func:`PyErr_SetString` as shown below::
Georg Brandl9c491c92010-08-02 20:21:21 +0000239
240 static PyObject *
241 spam_system(PyObject *self, PyObject *args)
242 {
243 const char *command;
244 int sts;
245
246 if (!PyArg_ParseTuple(args, "s", &command))
247 return NULL;
248 sts = system(command);
249 if (sts < 0) {
250 PyErr_SetString(SpamError, "System command failed");
251 return NULL;
252 }
253 return PyLong_FromLong(sts);
254 }
255
Georg Brandl116aa622007-08-15 14:28:22 +0000256
257.. _backtoexample:
258
259Back to the Example
260===================
261
262Going back to our example function, you should now be able to understand this
263statement::
264
265 if (!PyArg_ParseTuple(args, "s", &command))
266 return NULL;
267
268It returns *NULL* (the error indicator for functions returning object pointers)
269if an error is detected in the argument list, relying on the exception set by
Georg Brandl60203b42010-10-06 10:11:56 +0000270:c:func:`PyArg_ParseTuple`. Otherwise the string value of the argument has been
271copied to the local variable :c:data:`command`. This is a pointer assignment and
Georg Brandl116aa622007-08-15 14:28:22 +0000272you are not supposed to modify the string to which it points (so in Standard C,
Georg Brandl60203b42010-10-06 10:11:56 +0000273the variable :c:data:`command` should properly be declared as ``const char
Georg Brandl116aa622007-08-15 14:28:22 +0000274*command``).
275
Georg Brandl60203b42010-10-06 10:11:56 +0000276The next statement is a call to the Unix function :c:func:`system`, passing it
277the string we just got from :c:func:`PyArg_ParseTuple`::
Georg Brandl116aa622007-08-15 14:28:22 +0000278
279 sts = system(command);
280
Georg Brandl60203b42010-10-06 10:11:56 +0000281Our :func:`spam.system` function must return the value of :c:data:`sts` as a
Georg Brandlc877a7c2010-11-26 11:55:48 +0000282Python object. This is done using the function :c:func:`PyLong_FromLong`. ::
Georg Brandl116aa622007-08-15 14:28:22 +0000283
Georg Brandlc877a7c2010-11-26 11:55:48 +0000284 return PyLong_FromLong(sts);
Georg Brandl116aa622007-08-15 14:28:22 +0000285
286In this case, it will return an integer object. (Yes, even integers are objects
287on the heap in Python!)
288
289If you have a C function that returns no useful argument (a function returning
Georg Brandl60203b42010-10-06 10:11:56 +0000290:c:type:`void`), the corresponding Python function must return ``None``. You
291need this idiom to do so (which is implemented by the :c:macro:`Py_RETURN_NONE`
Georg Brandl116aa622007-08-15 14:28:22 +0000292macro)::
293
294 Py_INCREF(Py_None);
295 return Py_None;
296
Georg Brandl60203b42010-10-06 10:11:56 +0000297:c:data:`Py_None` is the C name for the special Python object ``None``. It is a
Georg Brandl116aa622007-08-15 14:28:22 +0000298genuine Python object rather than a *NULL* pointer, which means "error" in most
299contexts, as we have seen.
300
301
302.. _methodtable:
303
304The Module's Method Table and Initialization Function
305=====================================================
306
Georg Brandl60203b42010-10-06 10:11:56 +0000307I promised to show how :c:func:`spam_system` is called from Python programs.
Georg Brandl116aa622007-08-15 14:28:22 +0000308First, we need to list its name and address in a "method table"::
309
310 static PyMethodDef SpamMethods[] = {
311 ...
312 {"system", spam_system, METH_VARARGS,
313 "Execute a shell command."},
314 ...
315 {NULL, NULL, 0, NULL} /* Sentinel */
316 };
317
318Note the third entry (``METH_VARARGS``). This is a flag telling the interpreter
319the calling convention to be used for the C function. It should normally always
320be ``METH_VARARGS`` or ``METH_VARARGS | METH_KEYWORDS``; a value of ``0`` means
Georg Brandl60203b42010-10-06 10:11:56 +0000321that an obsolete variant of :c:func:`PyArg_ParseTuple` is used.
Georg Brandl116aa622007-08-15 14:28:22 +0000322
323When using only ``METH_VARARGS``, the function should expect the Python-level
324parameters to be passed in as a tuple acceptable for parsing via
Georg Brandl60203b42010-10-06 10:11:56 +0000325:c:func:`PyArg_ParseTuple`; more information on this function is provided below.
Georg Brandl116aa622007-08-15 14:28:22 +0000326
327The :const:`METH_KEYWORDS` bit may be set in the third field if keyword
328arguments should be passed to the function. In this case, the C function should
Eli Bendersky44fb6132012-02-11 10:27:31 +0200329accept a third ``PyObject *`` parameter which will be a dictionary of keywords.
Georg Brandl60203b42010-10-06 10:11:56 +0000330Use :c:func:`PyArg_ParseTupleAndKeywords` to parse the arguments to such a
Georg Brandl116aa622007-08-15 14:28:22 +0000331function.
332
Martin v. Löwis1a214512008-06-11 05:26:20 +0000333The method table must be referenced in the module definition structure::
334
Benjamin Peterson3851d122008-10-20 21:04:06 +0000335 static struct PyModuleDef spammodule = {
Martin v. Löwis1a214512008-06-11 05:26:20 +0000336 PyModuleDef_HEAD_INIT,
337 "spam", /* name of module */
338 spam_doc, /* module documentation, may be NULL */
339 -1, /* size of per-interpreter state of the module,
340 or -1 if the module keeps state in global variables. */
341 SpamMethods
342 };
343
344This structure, in turn, must be passed to the interpreter in the module's
Georg Brandl116aa622007-08-15 14:28:22 +0000345initialization function. The initialization function must be named
Georg Brandl60203b42010-10-06 10:11:56 +0000346:c:func:`PyInit_name`, where *name* is the name of the module, and should be the
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000347only non-\ ``static`` item defined in the module file::
Georg Brandl116aa622007-08-15 14:28:22 +0000348
349 PyMODINIT_FUNC
Martin v. Löwis1a214512008-06-11 05:26:20 +0000350 PyInit_spam(void)
Georg Brandl116aa622007-08-15 14:28:22 +0000351 {
Martin v. Löwis1a214512008-06-11 05:26:20 +0000352 return PyModule_Create(&spammodule);
Georg Brandl116aa622007-08-15 14:28:22 +0000353 }
354
Benjamin Peterson71e30a02008-12-24 16:27:25 +0000355Note that PyMODINIT_FUNC declares the function as ``PyObject *`` return type,
356declares any special linkage declarations required by the platform, and for C++
Georg Brandl116aa622007-08-15 14:28:22 +0000357declares the function as ``extern "C"``.
358
359When the Python program imports module :mod:`spam` for the first time,
Georg Brandl60203b42010-10-06 10:11:56 +0000360:c:func:`PyInit_spam` is called. (See below for comments about embedding Python.)
361It calls :c:func:`PyModule_Create`, which returns a module object, and
Georg Brandl116aa622007-08-15 14:28:22 +0000362inserts built-in function objects into the newly created module based upon the
Georg Brandl60203b42010-10-06 10:11:56 +0000363table (an array of :c:type:`PyMethodDef` structures) found in the module definition.
364:c:func:`PyModule_Create` returns a pointer to the module object
Martin v. Löwis1a214512008-06-11 05:26:20 +0000365that it creates. It may abort with a fatal error for
Georg Brandl116aa622007-08-15 14:28:22 +0000366certain errors, or return *NULL* if the module could not be initialized
Martin v. Löwis1a214512008-06-11 05:26:20 +0000367satisfactorily. The init function must return the module object to its caller,
368so that it then gets inserted into ``sys.modules``.
Georg Brandl116aa622007-08-15 14:28:22 +0000369
Georg Brandl60203b42010-10-06 10:11:56 +0000370When embedding Python, the :c:func:`PyInit_spam` function is not called
371automatically unless there's an entry in the :c:data:`PyImport_Inittab` table.
372To add the module to the initialization table, use :c:func:`PyImport_AppendInittab`,
Martin v. Löwis1a214512008-06-11 05:26:20 +0000373optionally followed by an import of the module::
Georg Brandl116aa622007-08-15 14:28:22 +0000374
375 int
376 main(int argc, char *argv[])
377 {
Victor Stinner25e014b2014-08-01 12:28:49 +0200378 wchar_t *program = Py_DecodeLocale(argv[0], NULL);
379 if (program == NULL) {
380 fprintf(stderr, "Fatal error: cannot decode argv[0]\n");
381 exit(1);
382 }
383
Georg Brandlc4a55fc2010-02-06 18:46:57 +0000384 /* Add a built-in module, before Py_Initialize */
Martin v. Löwis1a214512008-06-11 05:26:20 +0000385 PyImport_AppendInittab("spam", PyInit_spam);
386
Georg Brandl116aa622007-08-15 14:28:22 +0000387 /* Pass argv[0] to the Python interpreter */
Victor Stinner25e014b2014-08-01 12:28:49 +0200388 Py_SetProgramName(program);
Georg Brandl116aa622007-08-15 14:28:22 +0000389
390 /* Initialize the Python interpreter. Required. */
391 Py_Initialize();
392
Martin v. Löwis1a214512008-06-11 05:26:20 +0000393 /* Optionally import the module; alternatively,
394 import can be deferred until the embedded script
395 imports it. */
396 PyImport_ImportModule("spam");
Georg Brandl116aa622007-08-15 14:28:22 +0000397
Georg Brandl49c6fc92013-10-06 13:14:10 +0200398 ...
399
Victor Stinner25e014b2014-08-01 12:28:49 +0200400 PyMem_RawFree(program);
401 return 0;
402 }
403
Georg Brandl116aa622007-08-15 14:28:22 +0000404.. note::
405
406 Removing entries from ``sys.modules`` or importing compiled modules into
Georg Brandl60203b42010-10-06 10:11:56 +0000407 multiple interpreters within a process (or following a :c:func:`fork` without an
408 intervening :c:func:`exec`) can create problems for some extension modules.
Georg Brandl116aa622007-08-15 14:28:22 +0000409 Extension module authors should exercise caution when initializing internal data
410 structures.
411
412A more substantial example module is included in the Python source distribution
413as :file:`Modules/xxmodule.c`. This file may be used as a template or simply
Benjamin Peterson2614cda2010-03-21 22:36:19 +0000414read as an example.
Georg Brandl116aa622007-08-15 14:28:22 +0000415
416
417.. _compilation:
418
419Compilation and Linkage
420=======================
421
422There are two more things to do before you can use your new extension: compiling
423and linking it with the Python system. If you use dynamic loading, the details
424may depend on the style of dynamic loading your system uses; see the chapters
425about building extension modules (chapter :ref:`building`) and additional
426information that pertains only to building on Windows (chapter
427:ref:`building-on-windows`) for more information about this.
428
429If you can't use dynamic loading, or if you want to make your module a permanent
430part of the Python interpreter, you will have to change the configuration setup
431and rebuild the interpreter. Luckily, this is very simple on Unix: just place
432your file (:file:`spammodule.c` for example) in the :file:`Modules/` directory
433of an unpacked source distribution, add a line to the file
434:file:`Modules/Setup.local` describing your file::
435
436 spam spammodule.o
437
438and rebuild the interpreter by running :program:`make` in the toplevel
439directory. You can also run :program:`make` in the :file:`Modules/`
440subdirectory, but then you must first rebuild :file:`Makefile` there by running
441':program:`make` Makefile'. (This is necessary each time you change the
442:file:`Setup` file.)
443
444If your module requires additional libraries to link with, these can be listed
445on the line in the configuration file as well, for instance::
446
447 spam spammodule.o -lX11
448
449
450.. _callingpython:
451
452Calling Python Functions from C
453===============================
454
455So far we have concentrated on making C functions callable from Python. The
456reverse is also useful: calling Python functions from C. This is especially the
457case for libraries that support so-called "callback" functions. If a C
458interface makes use of callbacks, the equivalent Python often needs to provide a
459callback mechanism to the Python programmer; the implementation will require
460calling the Python callback functions from a C callback. Other uses are also
461imaginable.
462
463Fortunately, the Python interpreter is easily called recursively, and there is a
464standard interface to call a Python function. (I won't dwell on how to call the
465Python parser with a particular string as input --- if you're interested, have a
466look at the implementation of the :option:`-c` command line option in
Georg Brandl22291c52007-09-06 14:49:02 +0000467:file:`Modules/main.c` from the Python source code.)
Georg Brandl116aa622007-08-15 14:28:22 +0000468
469Calling a Python function is easy. First, the Python program must somehow pass
470you the Python function object. You should provide a function (or some other
471interface) to do this. When this function is called, save a pointer to the
Georg Brandl60203b42010-10-06 10:11:56 +0000472Python function object (be careful to :c:func:`Py_INCREF` it!) in a global
Georg Brandl116aa622007-08-15 14:28:22 +0000473variable --- or wherever you see fit. For example, the following function might
474be part of a module definition::
475
476 static PyObject *my_callback = NULL;
477
478 static PyObject *
479 my_set_callback(PyObject *dummy, PyObject *args)
480 {
481 PyObject *result = NULL;
482 PyObject *temp;
483
484 if (PyArg_ParseTuple(args, "O:set_callback", &temp)) {
485 if (!PyCallable_Check(temp)) {
486 PyErr_SetString(PyExc_TypeError, "parameter must be callable");
487 return NULL;
488 }
489 Py_XINCREF(temp); /* Add a reference to new callback */
490 Py_XDECREF(my_callback); /* Dispose of previous callback */
491 my_callback = temp; /* Remember new callback */
492 /* Boilerplate to return "None" */
493 Py_INCREF(Py_None);
494 result = Py_None;
495 }
496 return result;
497 }
498
499This function must be registered with the interpreter using the
500:const:`METH_VARARGS` flag; this is described in section :ref:`methodtable`. The
Georg Brandl60203b42010-10-06 10:11:56 +0000501:c:func:`PyArg_ParseTuple` function and its arguments are documented in section
Georg Brandl116aa622007-08-15 14:28:22 +0000502:ref:`parsetuple`.
503
Georg Brandl60203b42010-10-06 10:11:56 +0000504The macros :c:func:`Py_XINCREF` and :c:func:`Py_XDECREF` increment/decrement the
Georg Brandl116aa622007-08-15 14:28:22 +0000505reference count of an object and are safe in the presence of *NULL* pointers
506(but note that *temp* will not be *NULL* in this context). More info on them
507in section :ref:`refcounts`.
508
Benjamin Petersond23f8222009-04-05 19:13:16 +0000509.. index:: single: PyObject_CallObject()
Georg Brandl116aa622007-08-15 14:28:22 +0000510
511Later, when it is time to call the function, you call the C function
Georg Brandl60203b42010-10-06 10:11:56 +0000512:c:func:`PyObject_CallObject`. This function has two arguments, both pointers to
Georg Brandl116aa622007-08-15 14:28:22 +0000513arbitrary Python objects: the Python function, and the argument list. The
514argument list must always be a tuple object, whose length is the number of
Georg Brandl48310cd2009-01-03 21:18:54 +0000515arguments. To call the Python function with no arguments, pass in NULL, or
Christian Heimesd8654cf2007-12-02 15:22:16 +0000516an empty tuple; to call it with one argument, pass a singleton tuple.
Georg Brandl60203b42010-10-06 10:11:56 +0000517:c:func:`Py_BuildValue` returns a tuple when its format string consists of zero
Christian Heimesd8654cf2007-12-02 15:22:16 +0000518or more format codes between parentheses. For example::
Georg Brandl116aa622007-08-15 14:28:22 +0000519
520 int arg;
521 PyObject *arglist;
522 PyObject *result;
523 ...
524 arg = 123;
525 ...
526 /* Time to call the callback */
527 arglist = Py_BuildValue("(i)", arg);
Benjamin Petersond23f8222009-04-05 19:13:16 +0000528 result = PyObject_CallObject(my_callback, arglist);
Georg Brandl116aa622007-08-15 14:28:22 +0000529 Py_DECREF(arglist);
530
Georg Brandl60203b42010-10-06 10:11:56 +0000531:c:func:`PyObject_CallObject` returns a Python object pointer: this is the return
532value of the Python function. :c:func:`PyObject_CallObject` is
Georg Brandl116aa622007-08-15 14:28:22 +0000533"reference-count-neutral" with respect to its arguments. In the example a new
Georg Brandl60203b42010-10-06 10:11:56 +0000534tuple was created to serve as the argument list, which is :c:func:`Py_DECREF`\
Georg Brandl337672b2013-10-06 11:02:38 +0200535-ed immediately after the :c:func:`PyObject_CallObject` call.
Georg Brandl116aa622007-08-15 14:28:22 +0000536
Georg Brandl60203b42010-10-06 10:11:56 +0000537The return value of :c:func:`PyObject_CallObject` is "new": either it is a brand
Georg Brandl116aa622007-08-15 14:28:22 +0000538new object, or it is an existing object whose reference count has been
539incremented. So, unless you want to save it in a global variable, you should
Georg Brandl60203b42010-10-06 10:11:56 +0000540somehow :c:func:`Py_DECREF` the result, even (especially!) if you are not
Georg Brandl116aa622007-08-15 14:28:22 +0000541interested in its value.
542
543Before you do this, however, it is important to check that the return value
544isn't *NULL*. If it is, the Python function terminated by raising an exception.
Georg Brandl60203b42010-10-06 10:11:56 +0000545If the C code that called :c:func:`PyObject_CallObject` is called from Python, it
Georg Brandl116aa622007-08-15 14:28:22 +0000546should now return an error indication to its Python caller, so the interpreter
547can print a stack trace, or the calling Python code can handle the exception.
548If this is not possible or desirable, the exception should be cleared by calling
Georg Brandl60203b42010-10-06 10:11:56 +0000549:c:func:`PyErr_Clear`. For example::
Georg Brandl116aa622007-08-15 14:28:22 +0000550
551 if (result == NULL)
552 return NULL; /* Pass error back */
553 ...use result...
Georg Brandl48310cd2009-01-03 21:18:54 +0000554 Py_DECREF(result);
Georg Brandl116aa622007-08-15 14:28:22 +0000555
556Depending on the desired interface to the Python callback function, you may also
Georg Brandl60203b42010-10-06 10:11:56 +0000557have to provide an argument list to :c:func:`PyObject_CallObject`. In some cases
Georg Brandl116aa622007-08-15 14:28:22 +0000558the argument list is also provided by the Python program, through the same
559interface that specified the callback function. It can then be saved and used
560in the same manner as the function object. In other cases, you may have to
561construct a new tuple to pass as the argument list. The simplest way to do this
Georg Brandl60203b42010-10-06 10:11:56 +0000562is to call :c:func:`Py_BuildValue`. For example, if you want to pass an integral
Georg Brandl116aa622007-08-15 14:28:22 +0000563event code, you might use the following code::
564
565 PyObject *arglist;
566 ...
567 arglist = Py_BuildValue("(l)", eventcode);
Benjamin Petersond23f8222009-04-05 19:13:16 +0000568 result = PyObject_CallObject(my_callback, arglist);
Georg Brandl116aa622007-08-15 14:28:22 +0000569 Py_DECREF(arglist);
570 if (result == NULL)
571 return NULL; /* Pass error back */
572 /* Here maybe use the result */
573 Py_DECREF(result);
574
575Note the placement of ``Py_DECREF(arglist)`` immediately after the call, before
Christian Heimesd8654cf2007-12-02 15:22:16 +0000576the error check! Also note that strictly speaking this code is not complete:
Georg Brandl60203b42010-10-06 10:11:56 +0000577:c:func:`Py_BuildValue` may run out of memory, and this should be checked.
Georg Brandl116aa622007-08-15 14:28:22 +0000578
Georg Brandl48310cd2009-01-03 21:18:54 +0000579You may also call a function with keyword arguments by using
Georg Brandl60203b42010-10-06 10:11:56 +0000580:c:func:`PyObject_Call`, which supports arguments and keyword arguments. As in
581the above example, we use :c:func:`Py_BuildValue` to construct the dictionary. ::
Christian Heimesd8654cf2007-12-02 15:22:16 +0000582
583 PyObject *dict;
584 ...
585 dict = Py_BuildValue("{s:i}", "name", val);
Benjamin Petersond23f8222009-04-05 19:13:16 +0000586 result = PyObject_Call(my_callback, NULL, dict);
Christian Heimesd8654cf2007-12-02 15:22:16 +0000587 Py_DECREF(dict);
588 if (result == NULL)
589 return NULL; /* Pass error back */
590 /* Here maybe use the result */
591 Py_DECREF(result);
Georg Brandl116aa622007-08-15 14:28:22 +0000592
Benjamin Petersond23f8222009-04-05 19:13:16 +0000593
Georg Brandl116aa622007-08-15 14:28:22 +0000594.. _parsetuple:
595
596Extracting Parameters in Extension Functions
597============================================
598
599.. index:: single: PyArg_ParseTuple()
600
Georg Brandl60203b42010-10-06 10:11:56 +0000601The :c:func:`PyArg_ParseTuple` function is declared as follows::
Georg Brandl116aa622007-08-15 14:28:22 +0000602
603 int PyArg_ParseTuple(PyObject *arg, char *format, ...);
604
605The *arg* argument must be a tuple object containing an argument list passed
606from Python to a C function. The *format* argument must be a format string,
607whose syntax is explained in :ref:`arg-parsing` in the Python/C API Reference
608Manual. The remaining arguments must be addresses of variables whose type is
609determined by the format string.
610
Georg Brandl60203b42010-10-06 10:11:56 +0000611Note that while :c:func:`PyArg_ParseTuple` checks that the Python arguments have
Georg Brandl116aa622007-08-15 14:28:22 +0000612the required types, it cannot check the validity of the addresses of C variables
613passed to the call: if you make mistakes there, your code will probably crash or
614at least overwrite random bits in memory. So be careful!
615
616Note that any Python object references which are provided to the caller are
617*borrowed* references; do not decrement their reference count!
618
619Some example calls::
620
Gregory P. Smith02c3b5c2008-11-23 23:49:16 +0000621 #define PY_SSIZE_T_CLEAN /* Make "s#" use Py_ssize_t rather than int. */
622 #include <Python.h>
623
624::
625
Georg Brandl116aa622007-08-15 14:28:22 +0000626 int ok;
627 int i, j;
628 long k, l;
629 const char *s;
Gregory P. Smith02c3b5c2008-11-23 23:49:16 +0000630 Py_ssize_t size;
Georg Brandl116aa622007-08-15 14:28:22 +0000631
632 ok = PyArg_ParseTuple(args, ""); /* No arguments */
633 /* Python call: f() */
634
635::
636
637 ok = PyArg_ParseTuple(args, "s", &s); /* A string */
638 /* Possible Python call: f('whoops!') */
639
640::
641
642 ok = PyArg_ParseTuple(args, "lls", &k, &l, &s); /* Two longs and a string */
643 /* Possible Python call: f(1, 2, 'three') */
644
645::
646
647 ok = PyArg_ParseTuple(args, "(ii)s#", &i, &j, &s, &size);
648 /* A pair of ints and a string, whose size is also returned */
649 /* Possible Python call: f((1, 2), 'three') */
650
651::
652
653 {
654 const char *file;
655 const char *mode = "r";
656 int bufsize = 0;
657 ok = PyArg_ParseTuple(args, "s|si", &file, &mode, &bufsize);
658 /* A string, and optionally another string and an integer */
659 /* Possible Python calls:
660 f('spam')
661 f('spam', 'w')
662 f('spam', 'wb', 100000) */
663 }
664
665::
666
667 {
668 int left, top, right, bottom, h, v;
669 ok = PyArg_ParseTuple(args, "((ii)(ii))(ii)",
670 &left, &top, &right, &bottom, &h, &v);
671 /* A rectangle and a point */
672 /* Possible Python call:
673 f(((0, 0), (400, 300)), (10, 10)) */
674 }
675
676::
677
678 {
679 Py_complex c;
680 ok = PyArg_ParseTuple(args, "D:myfunction", &c);
681 /* a complex, also providing a function name for errors */
682 /* Possible Python call: myfunction(1+2j) */
683 }
684
685
686.. _parsetupleandkeywords:
687
688Keyword Parameters for Extension Functions
689==========================================
690
691.. index:: single: PyArg_ParseTupleAndKeywords()
692
Georg Brandl60203b42010-10-06 10:11:56 +0000693The :c:func:`PyArg_ParseTupleAndKeywords` function is declared as follows::
Georg Brandl116aa622007-08-15 14:28:22 +0000694
695 int PyArg_ParseTupleAndKeywords(PyObject *arg, PyObject *kwdict,
696 char *format, char *kwlist[], ...);
697
698The *arg* and *format* parameters are identical to those of the
Georg Brandl60203b42010-10-06 10:11:56 +0000699:c:func:`PyArg_ParseTuple` function. The *kwdict* parameter is the dictionary of
Georg Brandl116aa622007-08-15 14:28:22 +0000700keywords received as the third parameter from the Python runtime. The *kwlist*
701parameter is a *NULL*-terminated list of strings which identify the parameters;
702the names are matched with the type information from *format* from left to
Georg Brandl60203b42010-10-06 10:11:56 +0000703right. On success, :c:func:`PyArg_ParseTupleAndKeywords` returns true, otherwise
Georg Brandl116aa622007-08-15 14:28:22 +0000704it returns false and raises an appropriate exception.
705
706.. note::
707
708 Nested tuples cannot be parsed when using keyword arguments! Keyword parameters
709 passed in which are not present in the *kwlist* will cause :exc:`TypeError` to
710 be raised.
711
712.. index:: single: Philbrick, Geoff
713
714Here is an example module which uses keywords, based on an example by Geoff
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000715Philbrick (philbrick@hks.com)::
Georg Brandl116aa622007-08-15 14:28:22 +0000716
717 #include "Python.h"
718
719 static PyObject *
720 keywdarg_parrot(PyObject *self, PyObject *args, PyObject *keywds)
Georg Brandl48310cd2009-01-03 21:18:54 +0000721 {
Georg Brandl116aa622007-08-15 14:28:22 +0000722 int voltage;
723 char *state = "a stiff";
724 char *action = "voom";
725 char *type = "Norwegian Blue";
726
727 static char *kwlist[] = {"voltage", "state", "action", "type", NULL};
728
Georg Brandl48310cd2009-01-03 21:18:54 +0000729 if (!PyArg_ParseTupleAndKeywords(args, keywds, "i|sss", kwlist,
Georg Brandl116aa622007-08-15 14:28:22 +0000730 &voltage, &state, &action, &type))
Georg Brandl48310cd2009-01-03 21:18:54 +0000731 return NULL;
Georg Brandl116aa622007-08-15 14:28:22 +0000732
Georg Brandl48310cd2009-01-03 21:18:54 +0000733 printf("-- This parrot wouldn't %s if you put %i Volts through it.\n",
Georg Brandl116aa622007-08-15 14:28:22 +0000734 action, voltage);
735 printf("-- Lovely plumage, the %s -- It's %s!\n", type, state);
736
Georg Brandla072de12013-10-06 20:46:08 +0200737 Py_RETURN_NONE;
Georg Brandl116aa622007-08-15 14:28:22 +0000738 }
739
740 static PyMethodDef keywdarg_methods[] = {
741 /* The cast of the function is necessary since PyCFunction values
742 * only take two PyObject* parameters, and keywdarg_parrot() takes
743 * three.
744 */
745 {"parrot", (PyCFunction)keywdarg_parrot, METH_VARARGS | METH_KEYWORDS,
746 "Print a lovely skit to standard output."},
747 {NULL, NULL, 0, NULL} /* sentinel */
748 };
749
Eli Bendersky8f773492012-08-15 14:49:49 +0300750 static struct PyModuleDef keywdargmodule = {
751 PyModuleDef_HEAD_INIT,
752 "keywdarg",
753 NULL,
754 -1,
755 keywdarg_methods
756 };
Georg Brandl116aa622007-08-15 14:28:22 +0000757
Eli Bendersky8f773492012-08-15 14:49:49 +0300758 PyMODINIT_FUNC
759 PyInit_keywdarg(void)
Georg Brandl116aa622007-08-15 14:28:22 +0000760 {
Eli Bendersky8f773492012-08-15 14:49:49 +0300761 return PyModule_Create(&keywdargmodule);
Georg Brandl116aa622007-08-15 14:28:22 +0000762 }
763
764
765.. _buildvalue:
766
767Building Arbitrary Values
768=========================
769
Georg Brandl60203b42010-10-06 10:11:56 +0000770This function is the counterpart to :c:func:`PyArg_ParseTuple`. It is declared
Georg Brandl116aa622007-08-15 14:28:22 +0000771as follows::
772
773 PyObject *Py_BuildValue(char *format, ...);
774
775It recognizes a set of format units similar to the ones recognized by
Georg Brandl60203b42010-10-06 10:11:56 +0000776:c:func:`PyArg_ParseTuple`, but the arguments (which are input to the function,
Georg Brandl116aa622007-08-15 14:28:22 +0000777not output) must not be pointers, just values. It returns a new Python object,
778suitable for returning from a C function called from Python.
779
Georg Brandl60203b42010-10-06 10:11:56 +0000780One difference with :c:func:`PyArg_ParseTuple`: while the latter requires its
Georg Brandl116aa622007-08-15 14:28:22 +0000781first argument to be a tuple (since Python argument lists are always represented
Georg Brandl60203b42010-10-06 10:11:56 +0000782as tuples internally), :c:func:`Py_BuildValue` does not always build a tuple. It
Georg Brandl116aa622007-08-15 14:28:22 +0000783builds a tuple only if its format string contains two or more format units. If
784the format string is empty, it returns ``None``; if it contains exactly one
785format unit, it returns whatever object is described by that format unit. To
786force it to return a tuple of size 0 or one, parenthesize the format string.
787
788Examples (to the left the call, to the right the resulting Python value)::
789
790 Py_BuildValue("") None
791 Py_BuildValue("i", 123) 123
792 Py_BuildValue("iii", 123, 456, 789) (123, 456, 789)
793 Py_BuildValue("s", "hello") 'hello'
794 Py_BuildValue("y", "hello") b'hello'
795 Py_BuildValue("ss", "hello", "world") ('hello', 'world')
796 Py_BuildValue("s#", "hello", 4) 'hell'
797 Py_BuildValue("y#", "hello", 4) b'hell'
798 Py_BuildValue("()") ()
799 Py_BuildValue("(i)", 123) (123,)
800 Py_BuildValue("(ii)", 123, 456) (123, 456)
801 Py_BuildValue("(i,i)", 123, 456) (123, 456)
802 Py_BuildValue("[i,i]", 123, 456) [123, 456]
803 Py_BuildValue("{s:i,s:i}",
804 "abc", 123, "def", 456) {'abc': 123, 'def': 456}
805 Py_BuildValue("((ii)(ii)) (ii)",
806 1, 2, 3, 4, 5, 6) (((1, 2), (3, 4)), (5, 6))
807
808
809.. _refcounts:
810
811Reference Counts
812================
813
814In languages like C or C++, the programmer is responsible for dynamic allocation
815and deallocation of memory on the heap. In C, this is done using the functions
Georg Brandl60203b42010-10-06 10:11:56 +0000816:c:func:`malloc` and :c:func:`free`. In C++, the operators ``new`` and
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000817``delete`` are used with essentially the same meaning and we'll restrict
Georg Brandl116aa622007-08-15 14:28:22 +0000818the following discussion to the C case.
819
Georg Brandl60203b42010-10-06 10:11:56 +0000820Every block of memory allocated with :c:func:`malloc` should eventually be
821returned to the pool of available memory by exactly one call to :c:func:`free`.
822It is important to call :c:func:`free` at the right time. If a block's address
823is forgotten but :c:func:`free` is not called for it, the memory it occupies
Georg Brandl116aa622007-08-15 14:28:22 +0000824cannot be reused until the program terminates. This is called a :dfn:`memory
Georg Brandl60203b42010-10-06 10:11:56 +0000825leak`. On the other hand, if a program calls :c:func:`free` for a block and then
Georg Brandl116aa622007-08-15 14:28:22 +0000826continues to use the block, it creates a conflict with re-use of the block
Georg Brandl60203b42010-10-06 10:11:56 +0000827through another :c:func:`malloc` call. This is called :dfn:`using freed memory`.
Georg Brandl116aa622007-08-15 14:28:22 +0000828It has the same bad consequences as referencing uninitialized data --- core
829dumps, wrong results, mysterious crashes.
830
831Common causes of memory leaks are unusual paths through the code. For instance,
832a function may allocate a block of memory, do some calculation, and then free
833the block again. Now a change in the requirements for the function may add a
834test to the calculation that detects an error condition and can return
835prematurely from the function. It's easy to forget to free the allocated memory
836block when taking this premature exit, especially when it is added later to the
837code. Such leaks, once introduced, often go undetected for a long time: the
838error exit is taken only in a small fraction of all calls, and most modern
839machines have plenty of virtual memory, so the leak only becomes apparent in a
840long-running process that uses the leaking function frequently. Therefore, it's
841important to prevent leaks from happening by having a coding convention or
842strategy that minimizes this kind of errors.
843
Georg Brandl60203b42010-10-06 10:11:56 +0000844Since Python makes heavy use of :c:func:`malloc` and :c:func:`free`, it needs a
Georg Brandl116aa622007-08-15 14:28:22 +0000845strategy to avoid memory leaks as well as the use of freed memory. The chosen
846method is called :dfn:`reference counting`. The principle is simple: every
847object contains a counter, which is incremented when a reference to the object
848is stored somewhere, and which is decremented when a reference to it is deleted.
849When the counter reaches zero, the last reference to the object has been deleted
850and the object is freed.
851
852An alternative strategy is called :dfn:`automatic garbage collection`.
853(Sometimes, reference counting is also referred to as a garbage collection
854strategy, hence my use of "automatic" to distinguish the two.) The big
855advantage of automatic garbage collection is that the user doesn't need to call
Georg Brandl60203b42010-10-06 10:11:56 +0000856:c:func:`free` explicitly. (Another claimed advantage is an improvement in speed
Georg Brandl116aa622007-08-15 14:28:22 +0000857or memory usage --- this is no hard fact however.) The disadvantage is that for
858C, there is no truly portable automatic garbage collector, while reference
Georg Brandl60203b42010-10-06 10:11:56 +0000859counting can be implemented portably (as long as the functions :c:func:`malloc`
860and :c:func:`free` are available --- which the C Standard guarantees). Maybe some
Georg Brandl116aa622007-08-15 14:28:22 +0000861day a sufficiently portable automatic garbage collector will be available for C.
862Until then, we'll have to live with reference counts.
863
864While Python uses the traditional reference counting implementation, it also
865offers a cycle detector that works to detect reference cycles. This allows
866applications to not worry about creating direct or indirect circular references;
867these are the weakness of garbage collection implemented using only reference
868counting. Reference cycles consist of objects which contain (possibly indirect)
869references to themselves, so that each object in the cycle has a reference count
870which is non-zero. Typical reference counting implementations are not able to
871reclaim the memory belonging to any objects in a reference cycle, or referenced
872from the objects in the cycle, even though there are no further references to
873the cycle itself.
874
Georg Brandla4c8c472014-10-31 10:38:49 +0100875The cycle detector is able to detect garbage cycles and can reclaim them.
876The :mod:`gc` module exposes a way to run the detector (the
Serhiy Storchaka0b68a2d2013-10-09 13:26:17 +0300877:func:`~gc.collect` function), as well as configuration
Georg Brandl116aa622007-08-15 14:28:22 +0000878interfaces and the ability to disable the detector at runtime. The cycle
879detector is considered an optional component; though it is included by default,
880it can be disabled at build time using the :option:`--without-cycle-gc` option
Georg Brandlf6945182008-02-01 11:56:49 +0000881to the :program:`configure` script on Unix platforms (including Mac OS X). If
882the cycle detector is disabled in this way, the :mod:`gc` module will not be
883available.
Georg Brandl116aa622007-08-15 14:28:22 +0000884
885
886.. _refcountsinpython:
887
888Reference Counting in Python
889----------------------------
890
891There are two macros, ``Py_INCREF(x)`` and ``Py_DECREF(x)``, which handle the
Georg Brandl60203b42010-10-06 10:11:56 +0000892incrementing and decrementing of the reference count. :c:func:`Py_DECREF` also
Georg Brandl116aa622007-08-15 14:28:22 +0000893frees the object when the count reaches zero. For flexibility, it doesn't call
Georg Brandl60203b42010-10-06 10:11:56 +0000894:c:func:`free` directly --- rather, it makes a call through a function pointer in
Georg Brandl116aa622007-08-15 14:28:22 +0000895the object's :dfn:`type object`. For this purpose (and others), every object
896also contains a pointer to its type object.
897
898The big question now remains: when to use ``Py_INCREF(x)`` and ``Py_DECREF(x)``?
899Let's first introduce some terms. Nobody "owns" an object; however, you can
900:dfn:`own a reference` to an object. An object's reference count is now defined
901as the number of owned references to it. The owner of a reference is
Georg Brandl60203b42010-10-06 10:11:56 +0000902responsible for calling :c:func:`Py_DECREF` when the reference is no longer
Georg Brandl116aa622007-08-15 14:28:22 +0000903needed. Ownership of a reference can be transferred. There are three ways to
Georg Brandl60203b42010-10-06 10:11:56 +0000904dispose of an owned reference: pass it on, store it, or call :c:func:`Py_DECREF`.
Georg Brandl116aa622007-08-15 14:28:22 +0000905Forgetting to dispose of an owned reference creates a memory leak.
906
907It is also possible to :dfn:`borrow` [#]_ a reference to an object. The
Georg Brandl60203b42010-10-06 10:11:56 +0000908borrower of a reference should not call :c:func:`Py_DECREF`. The borrower must
Georg Brandl116aa622007-08-15 14:28:22 +0000909not hold on to the object longer than the owner from which it was borrowed.
910Using a borrowed reference after the owner has disposed of it risks using freed
911memory and should be avoided completely. [#]_
912
913The advantage of borrowing over owning a reference is that you don't need to
914take care of disposing of the reference on all possible paths through the code
915--- in other words, with a borrowed reference you don't run the risk of leaking
Benjamin Peterson6ebe78f2008-12-21 00:06:59 +0000916when a premature exit is taken. The disadvantage of borrowing over owning is
Georg Brandl116aa622007-08-15 14:28:22 +0000917that there are some subtle situations where in seemingly correct code a borrowed
918reference can be used after the owner from which it was borrowed has in fact
919disposed of it.
920
921A borrowed reference can be changed into an owned reference by calling
Georg Brandl60203b42010-10-06 10:11:56 +0000922:c:func:`Py_INCREF`. This does not affect the status of the owner from which the
Georg Brandl116aa622007-08-15 14:28:22 +0000923reference was borrowed --- it creates a new owned reference, and gives full
924owner responsibilities (the new owner must dispose of the reference properly, as
925well as the previous owner).
926
927
928.. _ownershiprules:
929
930Ownership Rules
931---------------
932
933Whenever an object reference is passed into or out of a function, it is part of
934the function's interface specification whether ownership is transferred with the
935reference or not.
936
937Most functions that return a reference to an object pass on ownership with the
938reference. In particular, all functions whose function it is to create a new
Georg Brandl60203b42010-10-06 10:11:56 +0000939object, such as :c:func:`PyLong_FromLong` and :c:func:`Py_BuildValue`, pass
Georg Brandl116aa622007-08-15 14:28:22 +0000940ownership to the receiver. Even if the object is not actually new, you still
941receive ownership of a new reference to that object. For instance,
Georg Brandl60203b42010-10-06 10:11:56 +0000942:c:func:`PyLong_FromLong` maintains a cache of popular values and can return a
Georg Brandl116aa622007-08-15 14:28:22 +0000943reference to a cached item.
944
945Many functions that extract objects from other objects also transfer ownership
Georg Brandl60203b42010-10-06 10:11:56 +0000946with the reference, for instance :c:func:`PyObject_GetAttrString`. The picture
Georg Brandl116aa622007-08-15 14:28:22 +0000947is less clear, here, however, since a few common routines are exceptions:
Georg Brandl60203b42010-10-06 10:11:56 +0000948:c:func:`PyTuple_GetItem`, :c:func:`PyList_GetItem`, :c:func:`PyDict_GetItem`, and
949:c:func:`PyDict_GetItemString` all return references that you borrow from the
Georg Brandl116aa622007-08-15 14:28:22 +0000950tuple, list or dictionary.
951
Georg Brandl60203b42010-10-06 10:11:56 +0000952The function :c:func:`PyImport_AddModule` also returns a borrowed reference, even
Georg Brandl116aa622007-08-15 14:28:22 +0000953though it may actually create the object it returns: this is possible because an
954owned reference to the object is stored in ``sys.modules``.
955
956When you pass an object reference into another function, in general, the
957function borrows the reference from you --- if it needs to store it, it will use
Georg Brandl60203b42010-10-06 10:11:56 +0000958:c:func:`Py_INCREF` to become an independent owner. There are exactly two
959important exceptions to this rule: :c:func:`PyTuple_SetItem` and
960:c:func:`PyList_SetItem`. These functions take over ownership of the item passed
961to them --- even if they fail! (Note that :c:func:`PyDict_SetItem` and friends
Georg Brandl116aa622007-08-15 14:28:22 +0000962don't take over ownership --- they are "normal.")
963
964When a C function is called from Python, it borrows references to its arguments
965from the caller. The caller owns a reference to the object, so the borrowed
966reference's lifetime is guaranteed until the function returns. Only when such a
967borrowed reference must be stored or passed on, it must be turned into an owned
Georg Brandl60203b42010-10-06 10:11:56 +0000968reference by calling :c:func:`Py_INCREF`.
Georg Brandl116aa622007-08-15 14:28:22 +0000969
970The object reference returned from a C function that is called from Python must
971be an owned reference --- ownership is transferred from the function to its
972caller.
973
974
975.. _thinice:
976
977Thin Ice
978--------
979
980There are a few situations where seemingly harmless use of a borrowed reference
981can lead to problems. These all have to do with implicit invocations of the
982interpreter, which can cause the owner of a reference to dispose of it.
983
Georg Brandl60203b42010-10-06 10:11:56 +0000984The first and most important case to know about is using :c:func:`Py_DECREF` on
Georg Brandl116aa622007-08-15 14:28:22 +0000985an unrelated object while borrowing a reference to a list item. For instance::
986
987 void
988 bug(PyObject *list)
989 {
990 PyObject *item = PyList_GetItem(list, 0);
991
Georg Brandl9914dd32007-12-02 23:08:39 +0000992 PyList_SetItem(list, 1, PyLong_FromLong(0L));
Georg Brandl116aa622007-08-15 14:28:22 +0000993 PyObject_Print(item, stdout, 0); /* BUG! */
994 }
995
996This function first borrows a reference to ``list[0]``, then replaces
997``list[1]`` with the value ``0``, and finally prints the borrowed reference.
998Looks harmless, right? But it's not!
999
Georg Brandl60203b42010-10-06 10:11:56 +00001000Let's follow the control flow into :c:func:`PyList_SetItem`. The list owns
Georg Brandl116aa622007-08-15 14:28:22 +00001001references to all its items, so when item 1 is replaced, it has to dispose of
1002the original item 1. Now let's suppose the original item 1 was an instance of a
1003user-defined class, and let's further suppose that the class defined a
1004:meth:`__del__` method. If this class instance has a reference count of 1,
1005disposing of it will call its :meth:`__del__` method.
1006
1007Since it is written in Python, the :meth:`__del__` method can execute arbitrary
1008Python code. Could it perhaps do something to invalidate the reference to
Georg Brandl60203b42010-10-06 10:11:56 +00001009``item`` in :c:func:`bug`? You bet! Assuming that the list passed into
1010:c:func:`bug` is accessible to the :meth:`__del__` method, it could execute a
Georg Brandl116aa622007-08-15 14:28:22 +00001011statement to the effect of ``del list[0]``, and assuming this was the last
1012reference to that object, it would free the memory associated with it, thereby
1013invalidating ``item``.
1014
1015The solution, once you know the source of the problem, is easy: temporarily
1016increment the reference count. The correct version of the function reads::
1017
1018 void
1019 no_bug(PyObject *list)
1020 {
1021 PyObject *item = PyList_GetItem(list, 0);
1022
1023 Py_INCREF(item);
Georg Brandl9914dd32007-12-02 23:08:39 +00001024 PyList_SetItem(list, 1, PyLong_FromLong(0L));
Georg Brandl116aa622007-08-15 14:28:22 +00001025 PyObject_Print(item, stdout, 0);
1026 Py_DECREF(item);
1027 }
1028
1029This is a true story. An older version of Python contained variants of this bug
1030and someone spent a considerable amount of time in a C debugger to figure out
1031why his :meth:`__del__` methods would fail...
1032
1033The second case of problems with a borrowed reference is a variant involving
1034threads. Normally, multiple threads in the Python interpreter can't get in each
1035other's way, because there is a global lock protecting Python's entire object
1036space. However, it is possible to temporarily release this lock using the macro
Georg Brandl60203b42010-10-06 10:11:56 +00001037:c:macro:`Py_BEGIN_ALLOW_THREADS`, and to re-acquire it using
1038:c:macro:`Py_END_ALLOW_THREADS`. This is common around blocking I/O calls, to
Georg Brandl116aa622007-08-15 14:28:22 +00001039let other threads use the processor while waiting for the I/O to complete.
1040Obviously, the following function has the same problem as the previous one::
1041
1042 void
1043 bug(PyObject *list)
1044 {
1045 PyObject *item = PyList_GetItem(list, 0);
1046 Py_BEGIN_ALLOW_THREADS
1047 ...some blocking I/O call...
1048 Py_END_ALLOW_THREADS
1049 PyObject_Print(item, stdout, 0); /* BUG! */
1050 }
1051
1052
1053.. _nullpointers:
1054
1055NULL Pointers
1056-------------
1057
1058In general, functions that take object references as arguments do not expect you
1059to pass them *NULL* pointers, and will dump core (or cause later core dumps) if
1060you do so. Functions that return object references generally return *NULL* only
1061to indicate that an exception occurred. The reason for not testing for *NULL*
1062arguments is that functions often pass the objects they receive on to other
1063function --- if each function were to test for *NULL*, there would be a lot of
1064redundant tests and the code would run more slowly.
1065
1066It is better to test for *NULL* only at the "source:" when a pointer that may be
Georg Brandl60203b42010-10-06 10:11:56 +00001067*NULL* is received, for example, from :c:func:`malloc` or from a function that
Georg Brandl116aa622007-08-15 14:28:22 +00001068may raise an exception.
1069
Georg Brandl60203b42010-10-06 10:11:56 +00001070The macros :c:func:`Py_INCREF` and :c:func:`Py_DECREF` do not check for *NULL*
1071pointers --- however, their variants :c:func:`Py_XINCREF` and :c:func:`Py_XDECREF`
Georg Brandl116aa622007-08-15 14:28:22 +00001072do.
1073
1074The macros for checking for a particular object type (``Pytype_Check()``) don't
1075check for *NULL* pointers --- again, there is much code that calls several of
1076these in a row to test an object against various different expected types, and
1077this would generate redundant tests. There are no variants with *NULL*
1078checking.
1079
1080The C function calling mechanism guarantees that the argument list passed to C
1081functions (``args`` in the examples) is never *NULL* --- in fact it guarantees
1082that it is always a tuple. [#]_
1083
1084It is a severe error to ever let a *NULL* pointer "escape" to the Python user.
1085
Christian Heimes5b5e81c2007-12-31 16:14:33 +00001086.. Frank Stajano:
1087 A pedagogically buggy example, along the lines of the previous listing, would
1088 be helpful here -- showing in more concrete terms what sort of actions could
1089 cause the problem. I can't very well imagine it from the description.
Georg Brandl116aa622007-08-15 14:28:22 +00001090
1091
1092.. _cplusplus:
1093
1094Writing Extensions in C++
1095=========================
1096
1097It is possible to write extension modules in C++. Some restrictions apply. If
1098the main program (the Python interpreter) is compiled and linked by the C
1099compiler, global or static objects with constructors cannot be used. This is
1100not a problem if the main program is linked by the C++ compiler. Functions that
1101will be called by the Python interpreter (in particular, module initialization
1102functions) have to be declared using ``extern "C"``. It is unnecessary to
1103enclose the Python header files in ``extern "C" {...}`` --- they use this form
1104already if the symbol ``__cplusplus`` is defined (all recent C++ compilers
1105define this symbol).
1106
1107
Benjamin Petersonb173f782009-05-05 22:31:58 +00001108.. _using-capsules:
Georg Brandl116aa622007-08-15 14:28:22 +00001109
1110Providing a C API for an Extension Module
1111=========================================
1112
1113.. sectionauthor:: Konrad Hinsen <hinsen@cnrs-orleans.fr>
1114
1115
1116Many extension modules just provide new functions and types to be used from
1117Python, but sometimes the code in an extension module can be useful for other
1118extension modules. For example, an extension module could implement a type
1119"collection" which works like lists without order. Just like the standard Python
1120list type has a C API which permits extension modules to create and manipulate
1121lists, this new collection type should have a set of C functions for direct
1122manipulation from other extension modules.
1123
1124At first sight this seems easy: just write the functions (without declaring them
Christian Heimes5b5e81c2007-12-31 16:14:33 +00001125``static``, of course), provide an appropriate header file, and document
Georg Brandl116aa622007-08-15 14:28:22 +00001126the C API. And in fact this would work if all extension modules were always
1127linked statically with the Python interpreter. When modules are used as shared
1128libraries, however, the symbols defined in one module may not be visible to
1129another module. The details of visibility depend on the operating system; some
1130systems use one global namespace for the Python interpreter and all extension
1131modules (Windows, for example), whereas others require an explicit list of
1132imported symbols at module link time (AIX is one example), or offer a choice of
1133different strategies (most Unices). And even if symbols are globally visible,
1134the module whose functions one wishes to call might not have been loaded yet!
1135
1136Portability therefore requires not to make any assumptions about symbol
1137visibility. This means that all symbols in extension modules should be declared
Christian Heimes5b5e81c2007-12-31 16:14:33 +00001138``static``, except for the module's initialization function, in order to
Georg Brandl116aa622007-08-15 14:28:22 +00001139avoid name clashes with other extension modules (as discussed in section
1140:ref:`methodtable`). And it means that symbols that *should* be accessible from
1141other extension modules must be exported in a different way.
1142
1143Python provides a special mechanism to pass C-level information (pointers) from
Benjamin Petersonb173f782009-05-05 22:31:58 +00001144one extension module to another one: Capsules. A Capsule is a Python data type
Georg Brandl60203b42010-10-06 10:11:56 +00001145which stores a pointer (:c:type:`void \*`). Capsules can only be created and
Georg Brandl116aa622007-08-15 14:28:22 +00001146accessed via their C API, but they can be passed around like any other Python
1147object. In particular, they can be assigned to a name in an extension module's
1148namespace. Other extension modules can then import this module, retrieve the
Benjamin Petersonb173f782009-05-05 22:31:58 +00001149value of this name, and then retrieve the pointer from the Capsule.
Georg Brandl116aa622007-08-15 14:28:22 +00001150
Benjamin Petersonb173f782009-05-05 22:31:58 +00001151There are many ways in which Capsules can be used to export the C API of an
1152extension module. Each function could get its own Capsule, or all C API pointers
1153could be stored in an array whose address is published in a Capsule. And the
Georg Brandl116aa622007-08-15 14:28:22 +00001154various tasks of storing and retrieving the pointers can be distributed in
1155different ways between the module providing the code and the client modules.
1156
Benjamin Petersonb173f782009-05-05 22:31:58 +00001157Whichever method you choose, it's important to name your Capsules properly.
Georg Brandl60203b42010-10-06 10:11:56 +00001158The function :c:func:`PyCapsule_New` takes a name parameter
1159(:c:type:`const char \*`); you're permitted to pass in a *NULL* name, but
Benjamin Petersonb173f782009-05-05 22:31:58 +00001160we strongly encourage you to specify a name. Properly named Capsules provide
1161a degree of runtime type-safety; there is no feasible way to tell one unnamed
1162Capsule from another.
1163
1164In particular, Capsules used to expose C APIs should be given a name following
1165this convention::
1166
1167 modulename.attributename
1168
Georg Brandl60203b42010-10-06 10:11:56 +00001169The convenience function :c:func:`PyCapsule_Import` makes it easy to
Benjamin Petersonb173f782009-05-05 22:31:58 +00001170load a C API provided via a Capsule, but only if the Capsule's name
1171matches this convention. This behavior gives C API users a high degree
1172of certainty that the Capsule they load contains the correct C API.
1173
Georg Brandl116aa622007-08-15 14:28:22 +00001174The following example demonstrates an approach that puts most of the burden on
1175the writer of the exporting module, which is appropriate for commonly used
1176library modules. It stores all C API pointers (just one in the example!) in an
Georg Brandl60203b42010-10-06 10:11:56 +00001177array of :c:type:`void` pointers which becomes the value of a Capsule. The header
Georg Brandl116aa622007-08-15 14:28:22 +00001178file corresponding to the module provides a macro that takes care of importing
1179the module and retrieving its C API pointers; client modules only have to call
1180this macro before accessing the C API.
1181
1182The exporting module is a modification of the :mod:`spam` module from section
1183:ref:`extending-simpleexample`. The function :func:`spam.system` does not call
Georg Brandl60203b42010-10-06 10:11:56 +00001184the C library function :c:func:`system` directly, but a function
1185:c:func:`PySpam_System`, which would of course do something more complicated in
Georg Brandl116aa622007-08-15 14:28:22 +00001186reality (such as adding "spam" to every command). This function
Georg Brandl60203b42010-10-06 10:11:56 +00001187:c:func:`PySpam_System` is also exported to other extension modules.
Georg Brandl116aa622007-08-15 14:28:22 +00001188
Georg Brandl60203b42010-10-06 10:11:56 +00001189The function :c:func:`PySpam_System` is a plain C function, declared
Christian Heimes5b5e81c2007-12-31 16:14:33 +00001190``static`` like everything else::
Georg Brandl116aa622007-08-15 14:28:22 +00001191
1192 static int
1193 PySpam_System(const char *command)
1194 {
1195 return system(command);
1196 }
1197
Georg Brandl60203b42010-10-06 10:11:56 +00001198The function :c:func:`spam_system` is modified in a trivial way::
Georg Brandl116aa622007-08-15 14:28:22 +00001199
1200 static PyObject *
1201 spam_system(PyObject *self, PyObject *args)
1202 {
1203 const char *command;
1204 int sts;
1205
1206 if (!PyArg_ParseTuple(args, "s", &command))
1207 return NULL;
1208 sts = PySpam_System(command);
Georg Brandlc877a7c2010-11-26 11:55:48 +00001209 return PyLong_FromLong(sts);
Georg Brandl116aa622007-08-15 14:28:22 +00001210 }
1211
1212In the beginning of the module, right after the line ::
1213
1214 #include "Python.h"
1215
1216two more lines must be added::
1217
1218 #define SPAM_MODULE
1219 #include "spammodule.h"
1220
1221The ``#define`` is used to tell the header file that it is being included in the
1222exporting module, not a client module. Finally, the module's initialization
1223function must take care of initializing the C API pointer array::
1224
1225 PyMODINIT_FUNC
Martin v. Löwis1a214512008-06-11 05:26:20 +00001226 PyInit_spam(void)
Georg Brandl116aa622007-08-15 14:28:22 +00001227 {
1228 PyObject *m;
1229 static void *PySpam_API[PySpam_API_pointers];
1230 PyObject *c_api_object;
1231
Martin v. Löwis1a214512008-06-11 05:26:20 +00001232 m = PyModule_Create(&spammodule);
Georg Brandl116aa622007-08-15 14:28:22 +00001233 if (m == NULL)
Martin v. Löwis1a214512008-06-11 05:26:20 +00001234 return NULL;
Georg Brandl116aa622007-08-15 14:28:22 +00001235
1236 /* Initialize the C API pointer array */
1237 PySpam_API[PySpam_System_NUM] = (void *)PySpam_System;
1238
Benjamin Petersonb173f782009-05-05 22:31:58 +00001239 /* Create a Capsule containing the API pointer array's address */
1240 c_api_object = PyCapsule_New((void *)PySpam_API, "spam._C_API", NULL);
Georg Brandl116aa622007-08-15 14:28:22 +00001241
1242 if (c_api_object != NULL)
1243 PyModule_AddObject(m, "_C_API", c_api_object);
Martin v. Löwis1a214512008-06-11 05:26:20 +00001244 return m;
Georg Brandl116aa622007-08-15 14:28:22 +00001245 }
1246
Christian Heimes5b5e81c2007-12-31 16:14:33 +00001247Note that ``PySpam_API`` is declared ``static``; otherwise the pointer
Martin v. Löwis1a214512008-06-11 05:26:20 +00001248array would disappear when :func:`PyInit_spam` terminates!
Georg Brandl116aa622007-08-15 14:28:22 +00001249
1250The bulk of the work is in the header file :file:`spammodule.h`, which looks
1251like this::
1252
1253 #ifndef Py_SPAMMODULE_H
1254 #define Py_SPAMMODULE_H
1255 #ifdef __cplusplus
1256 extern "C" {
1257 #endif
1258
1259 /* Header file for spammodule */
1260
1261 /* C API functions */
1262 #define PySpam_System_NUM 0
1263 #define PySpam_System_RETURN int
1264 #define PySpam_System_PROTO (const char *command)
1265
1266 /* Total number of C API pointers */
1267 #define PySpam_API_pointers 1
1268
1269
1270 #ifdef SPAM_MODULE
1271 /* This section is used when compiling spammodule.c */
1272
1273 static PySpam_System_RETURN PySpam_System PySpam_System_PROTO;
1274
1275 #else
1276 /* This section is used in modules that use spammodule's API */
1277
1278 static void **PySpam_API;
1279
1280 #define PySpam_System \
1281 (*(PySpam_System_RETURN (*)PySpam_System_PROTO) PySpam_API[PySpam_System_NUM])
1282
Benjamin Petersonb173f782009-05-05 22:31:58 +00001283 /* Return -1 on error, 0 on success.
1284 * PyCapsule_Import will set an exception if there's an error.
1285 */
Georg Brandl116aa622007-08-15 14:28:22 +00001286 static int
1287 import_spam(void)
1288 {
Benjamin Petersonb173f782009-05-05 22:31:58 +00001289 PySpam_API = (void **)PyCapsule_Import("spam._C_API", 0);
1290 return (PySpam_API != NULL) ? 0 : -1;
Georg Brandl116aa622007-08-15 14:28:22 +00001291 }
1292
1293 #endif
1294
1295 #ifdef __cplusplus
1296 }
1297 #endif
1298
1299 #endif /* !defined(Py_SPAMMODULE_H) */
1300
1301All that a client module must do in order to have access to the function
Georg Brandl60203b42010-10-06 10:11:56 +00001302:c:func:`PySpam_System` is to call the function (or rather macro)
1303:c:func:`import_spam` in its initialization function::
Georg Brandl116aa622007-08-15 14:28:22 +00001304
1305 PyMODINIT_FUNC
Benjamin Peterson7c435242009-03-24 01:40:39 +00001306 PyInit_client(void)
Georg Brandl116aa622007-08-15 14:28:22 +00001307 {
1308 PyObject *m;
1309
Georg Brandl21151762009-03-31 15:52:41 +00001310 m = PyModule_Create(&clientmodule);
Georg Brandl116aa622007-08-15 14:28:22 +00001311 if (m == NULL)
Georg Brandl21151762009-03-31 15:52:41 +00001312 return NULL;
Georg Brandl116aa622007-08-15 14:28:22 +00001313 if (import_spam() < 0)
Georg Brandl21151762009-03-31 15:52:41 +00001314 return NULL;
Georg Brandl116aa622007-08-15 14:28:22 +00001315 /* additional initialization can happen here */
Georg Brandl21151762009-03-31 15:52:41 +00001316 return m;
Georg Brandl116aa622007-08-15 14:28:22 +00001317 }
1318
1319The main disadvantage of this approach is that the file :file:`spammodule.h` is
1320rather complicated. However, the basic structure is the same for each function
1321that is exported, so it has to be learned only once.
1322
Benjamin Petersonb173f782009-05-05 22:31:58 +00001323Finally it should be mentioned that Capsules offer additional functionality,
Georg Brandl116aa622007-08-15 14:28:22 +00001324which is especially useful for memory allocation and deallocation of the pointer
Benjamin Petersonb173f782009-05-05 22:31:58 +00001325stored in a Capsule. The details are described in the Python/C API Reference
1326Manual in the section :ref:`capsules` and in the implementation of Capsules (files
1327:file:`Include/pycapsule.h` and :file:`Objects/pycapsule.c` in the Python source
Georg Brandl116aa622007-08-15 14:28:22 +00001328code distribution).
1329
1330.. rubric:: Footnotes
1331
1332.. [#] An interface for this function already exists in the standard module :mod:`os`
1333 --- it was chosen as a simple and straightforward example.
1334
1335.. [#] The metaphor of "borrowing" a reference is not completely correct: the owner
1336 still has a copy of the reference.
1337
1338.. [#] Checking that the reference count is at least 1 **does not work** --- the
1339 reference count itself could be in freed memory and may thus be reused for
1340 another object!
1341
1342.. [#] These guarantees don't hold when you use the "old" style calling convention ---
1343 this is still found in much existing code.
1344