blob: 50e52e205812af577fedc9328f14a05dab1f80d6 [file] [log] [blame]
Fred Drake6659c301998-03-03 22:02:19 +00001\documentclass{manual}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003\title{Python/C API Reference Manual}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00004
5\input{boilerplate}
6
Marc-André Lemburga544ea22001-01-17 18:04:31 +00007\makeindex % tell \index to actually write the .idx file
Guido van Rossum9231c8f1997-05-15 21:43:21 +00008
9
10\begin{document}
11
Guido van Rossum9231c8f1997-05-15 21:43:21 +000012\maketitle
13
Fred Drake9f86b661998-07-28 21:55:19 +000014\ifhtml
15\chapter*{Front Matter\label{front}}
16\fi
17
Guido van Rossum9231c8f1997-05-15 21:43:21 +000018\input{copyright}
19
20\begin{abstract}
21
22\noindent
Fred Drake659ebfa2000-04-03 15:42:13 +000023This manual documents the API used by C and \Cpp{} programmers who
Fred Drakee058b4f1998-02-16 06:15:35 +000024want to write extension modules or embed Python. It is a companion to
Fred Drakebe486461999-11-09 17:03:03 +000025\citetitle[../ext/ext.html]{Extending and Embedding the Python
26Interpreter}, which describes the general principles of extension
27writing but does not document the API functions in detail.
Guido van Rossum9231c8f1997-05-15 21:43:21 +000028
Guido van Rossum5b8a5231997-12-30 04:38:44 +000029\strong{Warning:} The current version of this document is incomplete.
30I hope that it is nevertheless useful. I will continue to work on it,
31and release new versions from time to time, independent from Python
32source code releases.
33
Guido van Rossum9231c8f1997-05-15 21:43:21 +000034\end{abstract}
35
Fred Drake4d4f9e71998-01-13 22:25:02 +000036\tableofcontents
Guido van Rossum9231c8f1997-05-15 21:43:21 +000037
Guido van Rossum5060b3b1997-08-17 18:02:23 +000038% XXX Consider moving all this back to ext.tex and giving api.tex
39% XXX a *really* short intro only.
Guido van Rossum9231c8f1997-05-15 21:43:21 +000040
Fred Drakeefd146c1999-02-15 15:30:45 +000041\chapter{Introduction \label{intro}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +000042
Fred Drake659ebfa2000-04-03 15:42:13 +000043The Application Programmer's Interface to Python gives C and
44\Cpp{} programmers access to the Python interpreter at a variety of
45levels. The API is equally usable from \Cpp{}, but for brevity it is
46generally referred to as the Python/C API. There are two
47fundamentally different reasons for using the Python/C API. The first
48reason is to write \emph{extension modules} for specific purposes;
49these are C modules that extend the Python interpreter. This is
50probably the most common use. The second reason is to use Python as a
51component in a larger application; this technique is generally
52referred to as \dfn{embedding} Python in an application.
Guido van Rossum59a61351997-08-14 20:34:33 +000053
Guido van Rossum4a944d71997-08-14 20:35:38 +000054Writing an extension module is a relatively well-understood process,
55where a ``cookbook'' approach works well. There are several tools
56that automate the process to some extent. While people have embedded
57Python in other applications since its early existence, the process of
Fred Drakefc43d002001-05-21 15:03:35 +000058embedding Python is less straightforward than writing an extension.
Guido van Rossum59a61351997-08-14 20:34:33 +000059
Guido van Rossum4a944d71997-08-14 20:35:38 +000060Many API functions are useful independent of whether you're embedding
61or extending Python; moreover, most applications that embed Python
62will need to provide a custom extension as well, so it's probably a
63good idea to become familiar with writing an extension before
Guido van Rossum59a61351997-08-14 20:34:33 +000064attempting to embed Python in a real application.
65
Fred Drakeefd146c1999-02-15 15:30:45 +000066
67\section{Include Files \label{includes}}
Guido van Rossum580aa8d1997-11-25 15:34:51 +000068
69All function, type and macro definitions needed to use the Python/C
70API are included in your code by the following line:
71
Fred Drakee058b4f1998-02-16 06:15:35 +000072\begin{verbatim}
73#include "Python.h"
74\end{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +000075
Fred Drakee058b4f1998-02-16 06:15:35 +000076This implies inclusion of the following standard headers:
Fred Drake0b71cea2000-09-26 05:51:50 +000077\code{<stdio.h>}, \code{<string.h>}, \code{<errno.h>},
78\code{<limits.h>}, and \code{<stdlib.h>} (if available).
Guido van Rossum580aa8d1997-11-25 15:34:51 +000079
80All user visible names defined by Python.h (except those defined by
Fred Drakee058b4f1998-02-16 06:15:35 +000081the included standard headers) have one of the prefixes \samp{Py} or
Fred Drake659ebfa2000-04-03 15:42:13 +000082\samp{_Py}. Names beginning with \samp{_Py} are for internal use by
83the Python implementation and should not be used by extension writers.
84Structure member names do not have a reserved prefix.
Guido van Rossum580aa8d1997-11-25 15:34:51 +000085
Fred Drakee058b4f1998-02-16 06:15:35 +000086\strong{Important:} user code should never define names that begin
87with \samp{Py} or \samp{_Py}. This confuses the reader, and
88jeopardizes the portability of the user code to future Python
89versions, which may define additional names beginning with one of
90these prefixes.
Guido van Rossum580aa8d1997-11-25 15:34:51 +000091
Fred Drake659ebfa2000-04-03 15:42:13 +000092The header files are typically installed with Python. On \UNIX, these
93are located in the directories
94\file{\envvar{prefix}/include/python\var{version}/} and
95\file{\envvar{exec_prefix}/include/python\var{version}/}, where
96\envvar{prefix} and \envvar{exec_prefix} are defined by the
97corresponding parameters to Python's \program{configure} script and
98\var{version} is \code{sys.version[:3]}. On Windows, the headers are
99installed in \file{\envvar{prefix}/include}, where \envvar{prefix} is
100the installation directory specified to the installer.
101
102To include the headers, place both directories (if different) on your
103compiler's search path for includes. Do \emph{not} place the parent
104directories on the search path and then use
Fred Draked5d04352000-09-14 20:24:17 +0000105\samp{\#include <python\shortversion/Python.h>}; this will break on
Fred Drake659ebfa2000-04-03 15:42:13 +0000106multi-platform builds since the platform independent headers under
107\envvar{prefix} include the platform specific headers from
108\envvar{exec_prefix}.
109
Fred Drakebab29652001-07-10 16:10:08 +0000110\Cpp{} users should note that though the API is defined entirely using
111C, the header files do properly declare the entry points to be
112\code{extern "C"}, so there is no need to do anything special to use
113the API from \Cpp.
114
Fred Drakeefd146c1999-02-15 15:30:45 +0000115
116\section{Objects, Types and Reference Counts \label{objects}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000117
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000118Most Python/C API functions have one or more arguments as well as a
Fred Drake659ebfa2000-04-03 15:42:13 +0000119return value of type \ctype{PyObject*}. This type is a pointer
Fred Drakee058b4f1998-02-16 06:15:35 +0000120to an opaque data type representing an arbitrary Python
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000121object. Since all Python object types are treated the same way by the
122Python language in most situations (e.g., assignments, scope rules,
123and argument passing), it is only fitting that they should be
Fred Drake659ebfa2000-04-03 15:42:13 +0000124represented by a single C type. Almost all Python objects live on the
125heap: you never declare an automatic or static variable of type
126\ctype{PyObject}, only pointer variables of type \ctype{PyObject*} can
127be declared. The sole exception are the type objects\obindex{type};
128since these must never be deallocated, they are typically static
129\ctype{PyTypeObject} objects.
Guido van Rossum59a61351997-08-14 20:34:33 +0000130
Fred Drakee058b4f1998-02-16 06:15:35 +0000131All Python objects (even Python integers) have a \dfn{type} and a
132\dfn{reference count}. An object's type determines what kind of object
Guido van Rossum4a944d71997-08-14 20:35:38 +0000133it is (e.g., an integer, a list, or a user-defined function; there are
Fred Drakebe486461999-11-09 17:03:03 +0000134many more as explained in the \citetitle[../ref/ref.html]{Python
135Reference Manual}). For each of the well-known types there is a macro
136to check whether an object is of that type; for instance,
Fred Drake659ebfa2000-04-03 15:42:13 +0000137\samp{PyList_Check(\var{a})} is true if (and only if) the object
138pointed to by \var{a} is a Python list.
Guido van Rossum59a61351997-08-14 20:34:33 +0000139
Fred Drakeefd146c1999-02-15 15:30:45 +0000140
141\subsection{Reference Counts \label{refcounts}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000142
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000143The reference count is important because today's computers have a
Fred Drake003d8da1998-04-13 00:53:42 +0000144finite (and often severely limited) memory size; it counts how many
Guido van Rossum4a944d71997-08-14 20:35:38 +0000145different places there are that have a reference to an object. Such a
Fred Drake659ebfa2000-04-03 15:42:13 +0000146place could be another object, or a global (or static) C variable, or
147a local variable in some C function. When an object's reference count
Guido van Rossum4a944d71997-08-14 20:35:38 +0000148becomes zero, the object is deallocated. If it contains references to
149other objects, their reference count is decremented. Those other
150objects may be deallocated in turn, if this decrement makes their
151reference count become zero, and so on. (There's an obvious problem
152with objects that reference each other here; for now, the solution is
Fred Drake659ebfa2000-04-03 15:42:13 +0000153``don't do that.'')
Guido van Rossum59a61351997-08-14 20:34:33 +0000154
Guido van Rossum4a944d71997-08-14 20:35:38 +0000155Reference counts are always manipulated explicitly. The normal way is
Fred Drake659ebfa2000-04-03 15:42:13 +0000156to use the macro \cfunction{Py_INCREF()}\ttindex{Py_INCREF()} to
157increment an object's reference count by one, and
158\cfunction{Py_DECREF()}\ttindex{Py_DECREF()} to decrement it by
159one. The \cfunction{Py_DECREF()} macro is considerably more complex
160than the incref one, since it must check whether the reference count
161becomes zero and then cause the object's deallocator to be called.
162The deallocator is a function pointer contained in the object's type
163structure. The type-specific deallocator takes care of decrementing
164the reference counts for other objects contained in the object if this
165is a compound object type, such as a list, as well as performing any
166additional finalization that's needed. There's no chance that the
167reference count can overflow; at least as many bits are used to hold
168the reference count as there are distinct memory locations in virtual
169memory (assuming \code{sizeof(long) >= sizeof(char*)}). Thus, the
170reference count increment is a simple operation.
Guido van Rossum59a61351997-08-14 20:34:33 +0000171
Guido van Rossum4a944d71997-08-14 20:35:38 +0000172It is not necessary to increment an object's reference count for every
173local variable that contains a pointer to an object. In theory, the
Fred Drakee058b4f1998-02-16 06:15:35 +0000174object's reference count goes up by one when the variable is made to
Guido van Rossum4a944d71997-08-14 20:35:38 +0000175point to it and it goes down by one when the variable goes out of
176scope. However, these two cancel each other out, so at the end the
177reference count hasn't changed. The only real reason to use the
178reference count is to prevent the object from being deallocated as
179long as our variable is pointing to it. If we know that there is at
180least one other reference to the object that lives at least as long as
181our variable, there is no need to increment the reference count
182temporarily. An important situation where this arises is in objects
Fred Drake659ebfa2000-04-03 15:42:13 +0000183that are passed as arguments to C functions in an extension module
Guido van Rossum4a944d71997-08-14 20:35:38 +0000184that are called from Python; the call mechanism guarantees to hold a
Guido van Rossum59a61351997-08-14 20:34:33 +0000185reference to every argument for the duration of the call.
186
Fred Drakee058b4f1998-02-16 06:15:35 +0000187However, a common pitfall is to extract an object from a list and
188hold on to it for a while without incrementing its reference count.
189Some other operation might conceivably remove the object from the
190list, decrementing its reference count and possible deallocating it.
191The real danger is that innocent-looking operations may invoke
192arbitrary Python code which could do this; there is a code path which
193allows control to flow back to the user from a \cfunction{Py_DECREF()},
194so almost any operation is potentially dangerous.
Guido van Rossum59a61351997-08-14 20:34:33 +0000195
Guido van Rossum4a944d71997-08-14 20:35:38 +0000196A safe approach is to always use the generic operations (functions
Fred Drake659ebfa2000-04-03 15:42:13 +0000197whose name begins with \samp{PyObject_}, \samp{PyNumber_},
198\samp{PySequence_} or \samp{PyMapping_}). These operations always
199increment the reference count of the object they return. This leaves
200the caller with the responsibility to call
201\cfunction{Py_DECREF()} when they are done with the result; this soon
202becomes second nature.
Guido van Rossum59a61351997-08-14 20:34:33 +0000203
Fred Drakeefd146c1999-02-15 15:30:45 +0000204
205\subsubsection{Reference Count Details \label{refcountDetails}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000206
207The reference count behavior of functions in the Python/C API is best
Fred Drake659ebfa2000-04-03 15:42:13 +0000208explained in terms of \emph{ownership of references}. Note that we
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000209talk of owning references, never of owning objects; objects are always
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000210shared! When a function owns a reference, it has to dispose of it
Fred Drakee058b4f1998-02-16 06:15:35 +0000211properly --- either by passing ownership on (usually to its caller) or
212by calling \cfunction{Py_DECREF()} or \cfunction{Py_XDECREF()}. When
213a function passes ownership of a reference on to its caller, the
214caller is said to receive a \emph{new} reference. When no ownership
215is transferred, the caller is said to \emph{borrow} the reference.
216Nothing needs to be done for a borrowed reference.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000217
Fred Drakea8455ab2000-06-16 19:58:42 +0000218Conversely, when a calling function passes it a reference to an
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000219object, there are two possibilities: the function \emph{steals} a
220reference to the object, or it does not. Few functions steal
Fred Drakee058b4f1998-02-16 06:15:35 +0000221references; the two notable exceptions are
Fred Drake659ebfa2000-04-03 15:42:13 +0000222\cfunction{PyList_SetItem()}\ttindex{PyList_SetItem()} and
223\cfunction{PyTuple_SetItem()}\ttindex{PyTuple_SetItem()}, which
Fred Drakee058b4f1998-02-16 06:15:35 +0000224steal a reference to the item (but not to the tuple or list into which
Fred Drake003d8da1998-04-13 00:53:42 +0000225the item is put!). These functions were designed to steal a reference
Fred Drakee058b4f1998-02-16 06:15:35 +0000226because of a common idiom for populating a tuple or list with newly
227created objects; for example, the code to create the tuple \code{(1,
2282, "three")} could look like this (forgetting about error handling for
Fred Drake659ebfa2000-04-03 15:42:13 +0000229the moment; a better way to code this is shown below):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000230
231\begin{verbatim}
232PyObject *t;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000233
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000234t = PyTuple_New(3);
235PyTuple_SetItem(t, 0, PyInt_FromLong(1L));
236PyTuple_SetItem(t, 1, PyInt_FromLong(2L));
237PyTuple_SetItem(t, 2, PyString_FromString("three"));
238\end{verbatim}
239
Fred Drakee058b4f1998-02-16 06:15:35 +0000240Incidentally, \cfunction{PyTuple_SetItem()} is the \emph{only} way to
241set tuple items; \cfunction{PySequence_SetItem()} and
242\cfunction{PyObject_SetItem()} refuse to do this since tuples are an
243immutable data type. You should only use
244\cfunction{PyTuple_SetItem()} for tuples that you are creating
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000245yourself.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000246
247Equivalent code for populating a list can be written using
Fred Drakee058b4f1998-02-16 06:15:35 +0000248\cfunction{PyList_New()} and \cfunction{PyList_SetItem()}. Such code
249can also use \cfunction{PySequence_SetItem()}; this illustrates the
250difference between the two (the extra \cfunction{Py_DECREF()} calls):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000251
252\begin{verbatim}
253PyObject *l, *x;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000254
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000255l = PyList_New(3);
256x = PyInt_FromLong(1L);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000257PySequence_SetItem(l, 0, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000258x = PyInt_FromLong(2L);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000259PySequence_SetItem(l, 1, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000260x = PyString_FromString("three");
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000261PySequence_SetItem(l, 2, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000262\end{verbatim}
263
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000264You might find it strange that the ``recommended'' approach takes more
265code. However, in practice, you will rarely use these ways of
266creating and populating a tuple or list. There's a generic function,
Fred Drakee058b4f1998-02-16 06:15:35 +0000267\cfunction{Py_BuildValue()}, that can create most common objects from
Fred Drake659ebfa2000-04-03 15:42:13 +0000268C values, directed by a \dfn{format string}. For example, the
Fred Drakee058b4f1998-02-16 06:15:35 +0000269above two blocks of code could be replaced by the following (which
270also takes care of the error checking):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000271
272\begin{verbatim}
273PyObject *t, *l;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000274
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000275t = Py_BuildValue("(iis)", 1, 2, "three");
276l = Py_BuildValue("[iis]", 1, 2, "three");
277\end{verbatim}
278
Fred Drakee058b4f1998-02-16 06:15:35 +0000279It is much more common to use \cfunction{PyObject_SetItem()} and
280friends with items whose references you are only borrowing, like
281arguments that were passed in to the function you are writing. In
282that case, their behaviour regarding reference counts is much saner,
283since you don't have to increment a reference count so you can give a
284reference away (``have it be stolen''). For example, this function
285sets all items of a list (actually, any mutable sequence) to a given
286item:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000287
288\begin{verbatim}
289int set_all(PyObject *target, PyObject *item)
290{
291 int i, n;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000292
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000293 n = PyObject_Length(target);
294 if (n < 0)
295 return -1;
296 for (i = 0; i < n; i++) {
297 if (PyObject_SetItem(target, i, item) < 0)
298 return -1;
299 }
300 return 0;
301}
302\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000303\ttindex{set_all()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000304
305The situation is slightly different for function return values.
306While passing a reference to most functions does not change your
307ownership responsibilities for that reference, many functions that
308return a referece to an object give you ownership of the reference.
309The reason is simple: in many cases, the returned object is created
310on the fly, and the reference you get is the only reference to the
Fred Drakee058b4f1998-02-16 06:15:35 +0000311object. Therefore, the generic functions that return object
312references, like \cfunction{PyObject_GetItem()} and
Fred Drakebab29652001-07-10 16:10:08 +0000313\cfunction{PySequence_GetItem()}, always return a new reference (the
314caller becomes the owner of the reference).
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000315
316It is important to realize that whether you own a reference returned
Fred Drakee058b4f1998-02-16 06:15:35 +0000317by a function depends on which function you call only --- \emph{the
Fred Drakebab29652001-07-10 16:10:08 +0000318plumage} (the type of the type of the object passed as an
Fred Drakee058b4f1998-02-16 06:15:35 +0000319argument to the function) \emph{doesn't enter into it!} Thus, if you
320extract an item from a list using \cfunction{PyList_GetItem()}, you
321don't own the reference --- but if you obtain the same item from the
322same list using \cfunction{PySequence_GetItem()} (which happens to
323take exactly the same arguments), you do own a reference to the
324returned object.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000325
Fred Drakee058b4f1998-02-16 06:15:35 +0000326Here is an example of how you could write a function that computes the
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000327sum of the items in a list of integers; once using
Fred Drake659ebfa2000-04-03 15:42:13 +0000328\cfunction{PyList_GetItem()}\ttindex{PyList_GetItem()}, and once using
329\cfunction{PySequence_GetItem()}\ttindex{PySequence_GetItem()}.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000330
331\begin{verbatim}
332long sum_list(PyObject *list)
333{
334 int i, n;
335 long total = 0;
336 PyObject *item;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000337
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000338 n = PyList_Size(list);
339 if (n < 0)
340 return -1; /* Not a list */
341 for (i = 0; i < n; i++) {
342 item = PyList_GetItem(list, i); /* Can't fail */
343 if (!PyInt_Check(item)) continue; /* Skip non-integers */
344 total += PyInt_AsLong(item);
345 }
346 return total;
347}
348\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000349\ttindex{sum_list()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000350
351\begin{verbatim}
352long sum_sequence(PyObject *sequence)
353{
354 int i, n;
355 long total = 0;
356 PyObject *item;
Fred Drake659ebfa2000-04-03 15:42:13 +0000357 n = PySequence_Length(sequence);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000358 if (n < 0)
359 return -1; /* Has no length */
360 for (i = 0; i < n; i++) {
Fred Drake659ebfa2000-04-03 15:42:13 +0000361 item = PySequence_GetItem(sequence, i);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000362 if (item == NULL)
363 return -1; /* Not a sequence, or other failure */
364 if (PyInt_Check(item))
365 total += PyInt_AsLong(item);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000366 Py_DECREF(item); /* Discard reference ownership */
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000367 }
368 return total;
369}
370\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000371\ttindex{sum_sequence()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000372
Fred Drakeefd146c1999-02-15 15:30:45 +0000373
374\subsection{Types \label{types}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000375
376There are few other data types that play a significant role in
Fred Drake659ebfa2000-04-03 15:42:13 +0000377the Python/C API; most are simple C types such as \ctype{int},
378\ctype{long}, \ctype{double} and \ctype{char*}. A few structure types
Guido van Rossum4a944d71997-08-14 20:35:38 +0000379are used to describe static tables used to list the functions exported
Fred Drake659ebfa2000-04-03 15:42:13 +0000380by a module or the data attributes of a new object type, and another
381is used to describe the value of a complex number. These will
Guido van Rossum59a61351997-08-14 20:34:33 +0000382be discussed together with the functions that use them.
383
Fred Drakeefd146c1999-02-15 15:30:45 +0000384
385\section{Exceptions \label{exceptions}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000386
Guido van Rossum4a944d71997-08-14 20:35:38 +0000387The Python programmer only needs to deal with exceptions if specific
388error handling is required; unhandled exceptions are automatically
Fred Drake659ebfa2000-04-03 15:42:13 +0000389propagated to the caller, then to the caller's caller, and so on, until
Guido van Rossum4a944d71997-08-14 20:35:38 +0000390they reach the top-level interpreter, where they are reported to the
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000391user accompanied by a stack traceback.
Guido van Rossum59a61351997-08-14 20:34:33 +0000392
Fred Drake659ebfa2000-04-03 15:42:13 +0000393For C programmers, however, error checking always has to be explicit.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000394All functions in the Python/C API can raise exceptions, unless an
395explicit claim is made otherwise in a function's documentation. In
396general, when a function encounters an error, it sets an exception,
397discards any object references that it owns, and returns an
Fred Drakee058b4f1998-02-16 06:15:35 +0000398error indicator --- usually \NULL{} or \code{-1}. A few functions
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000399return a Boolean true/false result, with false indicating an error.
400Very few functions return no explicit error indicator or have an
401ambiguous return value, and require explicit testing for errors with
Fred Drake659ebfa2000-04-03 15:42:13 +0000402\cfunction{PyErr_Occurred()}\ttindex{PyErr_Occurred()}.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000403
404Exception state is maintained in per-thread storage (this is
405equivalent to using global storage in an unthreaded application). A
Fred Drakec6fa34e1998-04-02 06:47:24 +0000406thread can be in one of two states: an exception has occurred, or not.
Fred Drakee058b4f1998-02-16 06:15:35 +0000407The function \cfunction{PyErr_Occurred()} can be used to check for
408this: it returns a borrowed reference to the exception type object
409when an exception has occurred, and \NULL{} otherwise. There are a
410number of functions to set the exception state:
Fred Drake659ebfa2000-04-03 15:42:13 +0000411\cfunction{PyErr_SetString()}\ttindex{PyErr_SetString()} is the most
412common (though not the most general) function to set the exception
413state, and \cfunction{PyErr_Clear()}\ttindex{PyErr_Clear()} clears the
414exception state.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000415
416The full exception state consists of three objects (all of which can
Fred Drakee058b4f1998-02-16 06:15:35 +0000417be \NULL{}): the exception type, the corresponding exception
Fred Drake659ebfa2000-04-03 15:42:13 +0000418value, and the traceback. These have the same meanings as the Python
419\withsubitem{(in module sys)}{
420 \ttindex{exc_type}\ttindex{exc_value}\ttindex{exc_traceback}}
421objects \code{sys.exc_type}, \code{sys.exc_value}, and
422\code{sys.exc_traceback}; however, they are not the same: the Python
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000423objects represent the last exception being handled by a Python
Fred Drake659ebfa2000-04-03 15:42:13 +0000424\keyword{try} \ldots\ \keyword{except} statement, while the C level
Fred Drakee058b4f1998-02-16 06:15:35 +0000425exception state only exists while an exception is being passed on
Fred Drake659ebfa2000-04-03 15:42:13 +0000426between C functions until it reaches the Python bytecode interpreter's
427main loop, which takes care of transferring it to \code{sys.exc_type}
428and friends.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000429
Fred Drakec6fa34e1998-04-02 06:47:24 +0000430Note that starting with Python 1.5, the preferred, thread-safe way to
Fred Drake659ebfa2000-04-03 15:42:13 +0000431access the exception state from Python code is to call the function
432\withsubitem{(in module sys)}{\ttindex{exc_info()}}
Fred Drakee058b4f1998-02-16 06:15:35 +0000433\function{sys.exc_info()}, which returns the per-thread exception state
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000434for Python code. Also, the semantics of both ways to access the
435exception state have changed so that a function which catches an
436exception will save and restore its thread's exception state so as to
437preserve the exception state of its caller. This prevents common bugs
438in exception handling code caused by an innocent-looking function
439overwriting the exception being handled; it also reduces the often
440unwanted lifetime extension for objects that are referenced by the
Fred Drakec6fa34e1998-04-02 06:47:24 +0000441stack frames in the traceback.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000442
443As a general principle, a function that calls another function to
444perform some task should check whether the called function raised an
445exception, and if so, pass the exception state on to its caller. It
Fred Drake659ebfa2000-04-03 15:42:13 +0000446should discard any object references that it owns, and return an
Fred Drakee058b4f1998-02-16 06:15:35 +0000447error indicator, but it should \emph{not} set another exception ---
448that would overwrite the exception that was just raised, and lose
449important information about the exact cause of the error.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000450
Fred Drake659ebfa2000-04-03 15:42:13 +0000451A simple example of detecting exceptions and passing them on is shown
452in the \cfunction{sum_sequence()}\ttindex{sum_sequence()} example
453above. It so happens that that example doesn't need to clean up any
454owned references when it detects an error. The following example
455function shows some error cleanup. First, to remind you why you like
456Python, we show the equivalent Python code:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000457
458\begin{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000459def incr_item(dict, key):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000460 try:
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000461 item = dict[key]
462 except KeyError:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000463 item = 0
Fred Drake6b3f3f22000-11-29 15:48:22 +0000464 dict[key] = item + 1
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000465\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000466\ttindex{incr_item()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000467
Fred Drake659ebfa2000-04-03 15:42:13 +0000468Here is the corresponding C code, in all its glory:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000469
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000470\begin{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000471int incr_item(PyObject *dict, PyObject *key)
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000472{
473 /* Objects all initialized to NULL for Py_XDECREF */
474 PyObject *item = NULL, *const_one = NULL, *incremented_item = NULL;
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000475 int rv = -1; /* Return value initialized to -1 (failure) */
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000476
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000477 item = PyObject_GetItem(dict, key);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000478 if (item == NULL) {
Fred Drakec6fa34e1998-04-02 06:47:24 +0000479 /* Handle KeyError only: */
Fred Drake6b3f3f22000-11-29 15:48:22 +0000480 if (!PyErr_ExceptionMatches(PyExc_KeyError))
481 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000482
483 /* Clear the error and use zero: */
484 PyErr_Clear();
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000485 item = PyInt_FromLong(0L);
Fred Drake6b3f3f22000-11-29 15:48:22 +0000486 if (item == NULL)
487 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000488 }
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000489 const_one = PyInt_FromLong(1L);
Fred Drake6b3f3f22000-11-29 15:48:22 +0000490 if (const_one == NULL)
491 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000492
493 incremented_item = PyNumber_Add(item, const_one);
Fred Drake6b3f3f22000-11-29 15:48:22 +0000494 if (incremented_item == NULL)
495 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000496
Fred Drake6b3f3f22000-11-29 15:48:22 +0000497 if (PyObject_SetItem(dict, key, incremented_item) < 0)
498 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000499 rv = 0; /* Success */
500 /* Continue with cleanup code */
501
502 error:
503 /* Cleanup code, shared by success and failure path */
504
505 /* Use Py_XDECREF() to ignore NULL references */
506 Py_XDECREF(item);
507 Py_XDECREF(const_one);
508 Py_XDECREF(incremented_item);
509
510 return rv; /* -1 for error, 0 for success */
511}
512\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000513\ttindex{incr_item()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000514
Fred Drakef8830d11998-04-23 14:06:01 +0000515This example represents an endorsed use of the \keyword{goto} statement
Fred Drake659ebfa2000-04-03 15:42:13 +0000516in C! It illustrates the use of
517\cfunction{PyErr_ExceptionMatches()}\ttindex{PyErr_ExceptionMatches()} and
518\cfunction{PyErr_Clear()}\ttindex{PyErr_Clear()} to
519handle specific exceptions, and the use of
520\cfunction{Py_XDECREF()}\ttindex{Py_XDECREF()} to
521dispose of owned references that may be \NULL{} (note the
522\character{X} in the name; \cfunction{Py_DECREF()} would crash when
523confronted with a \NULL{} reference). It is important that the
524variables used to hold owned references are initialized to \NULL{} for
525this to work; likewise, the proposed return value is initialized to
526\code{-1} (failure) and only set to success after the final call made
527is successful.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000528
Guido van Rossum59a61351997-08-14 20:34:33 +0000529
Fred Drakeefd146c1999-02-15 15:30:45 +0000530\section{Embedding Python \label{embedding}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000531
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000532The one important task that only embedders (as opposed to extension
533writers) of the Python interpreter have to worry about is the
534initialization, and possibly the finalization, of the Python
535interpreter. Most functionality of the interpreter can only be used
536after the interpreter has been initialized.
Guido van Rossum59a61351997-08-14 20:34:33 +0000537
Fred Drake659ebfa2000-04-03 15:42:13 +0000538The basic initialization function is
539\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}.
Fred Drakee058b4f1998-02-16 06:15:35 +0000540This initializes the table of loaded modules, and creates the
Fred Drake4de05a91998-02-16 14:25:26 +0000541fundamental modules \module{__builtin__}\refbimodindex{__builtin__},
542\module{__main__}\refbimodindex{__main__} and
543\module{sys}\refbimodindex{sys}. It also initializes the module
Fred Drakec6fa34e1998-04-02 06:47:24 +0000544search path (\code{sys.path}).%
545\indexiii{module}{search}{path}
Fred Drake659ebfa2000-04-03 15:42:13 +0000546\withsubitem{(in module sys)}{\ttindex{path}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000547
Fred Drakee058b4f1998-02-16 06:15:35 +0000548\cfunction{Py_Initialize()} does not set the ``script argument list''
Guido van Rossum4a944d71997-08-14 20:35:38 +0000549(\code{sys.argv}). If this variable is needed by Python code that
550will be executed later, it must be set explicitly with a call to
Fred Drake659ebfa2000-04-03 15:42:13 +0000551\code{PySys_SetArgv(\var{argc},
552\var{argv})}\ttindex{PySys_SetArgv()} subsequent to the call to
553\cfunction{Py_Initialize()}.
Guido van Rossum59a61351997-08-14 20:34:33 +0000554
Fred Drakeb0a78731998-01-13 18:51:10 +0000555On most systems (in particular, on \UNIX{} and Windows, although the
Fred Drake659ebfa2000-04-03 15:42:13 +0000556details are slightly different),
557\cfunction{Py_Initialize()} calculates the module search path based
558upon its best guess for the location of the standard Python
559interpreter executable, assuming that the Python library is found in a
560fixed location relative to the Python interpreter executable. In
561particular, it looks for a directory named
Fred Draked5d04352000-09-14 20:24:17 +0000562\file{lib/python\shortversion} relative to the parent directory where
563the executable named \file{python} is found on the shell command
564search path (the environment variable \envvar{PATH}).
Guido van Rossum42cefd01997-10-05 15:27:29 +0000565
566For instance, if the Python executable is found in
Fred Drakee058b4f1998-02-16 06:15:35 +0000567\file{/usr/local/bin/python}, it will assume that the libraries are in
Fred Draked5d04352000-09-14 20:24:17 +0000568\file{/usr/local/lib/python\shortversion}. (In fact, this particular path
Fred Drakee058b4f1998-02-16 06:15:35 +0000569is also the ``fallback'' location, used when no executable file named
Fred Drakec6fa34e1998-04-02 06:47:24 +0000570\file{python} is found along \envvar{PATH}.) The user can override
571this behavior by setting the environment variable \envvar{PYTHONHOME},
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000572or insert additional directories in front of the standard path by
Fred Drakec6fa34e1998-04-02 06:47:24 +0000573setting \envvar{PYTHONPATH}.
Guido van Rossum59a61351997-08-14 20:34:33 +0000574
Guido van Rossum4a944d71997-08-14 20:35:38 +0000575The embedding application can steer the search by calling
Fred Drake659ebfa2000-04-03 15:42:13 +0000576\code{Py_SetProgramName(\var{file})}\ttindex{Py_SetProgramName()} \emph{before} calling
Fred Drakec6fa34e1998-04-02 06:47:24 +0000577\cfunction{Py_Initialize()}. Note that \envvar{PYTHONHOME} still
578overrides this and \envvar{PYTHONPATH} is still inserted in front of
Fred Drakee058b4f1998-02-16 06:15:35 +0000579the standard path. An application that requires total control has to
Fred Drake659ebfa2000-04-03 15:42:13 +0000580provide its own implementation of
581\cfunction{Py_GetPath()}\ttindex{Py_GetPath()},
582\cfunction{Py_GetPrefix()}\ttindex{Py_GetPrefix()},
583\cfunction{Py_GetExecPrefix()}\ttindex{Py_GetExecPrefix()}, and
584\cfunction{Py_GetProgramFullPath()}\ttindex{Py_GetProgramFullPath()} (all
585defined in \file{Modules/getpath.c}).
Guido van Rossum59a61351997-08-14 20:34:33 +0000586
Guido van Rossum4a944d71997-08-14 20:35:38 +0000587Sometimes, it is desirable to ``uninitialize'' Python. For instance,
588the application may want to start over (make another call to
Fred Drakee058b4f1998-02-16 06:15:35 +0000589\cfunction{Py_Initialize()}) or the application is simply done with its
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000590use of Python and wants to free all memory allocated by Python. This
Fred Drakee058b4f1998-02-16 06:15:35 +0000591can be accomplished by calling \cfunction{Py_Finalize()}. The function
Fred Drake659ebfa2000-04-03 15:42:13 +0000592\cfunction{Py_IsInitialized()}\ttindex{Py_IsInitialized()} returns
593true if Python is currently in the initialized state. More
594information about these functions is given in a later chapter.
Guido van Rossum59a61351997-08-14 20:34:33 +0000595
Guido van Rossum4a944d71997-08-14 20:35:38 +0000596
Fred Drakeefd146c1999-02-15 15:30:45 +0000597\chapter{The Very High Level Layer \label{veryhigh}}
Guido van Rossum4a944d71997-08-14 20:35:38 +0000598
Fred Drakee5bf8b21998-02-12 21:22:28 +0000599The functions in this chapter will let you execute Python source code
600given in a file or a buffer, but they will not let you interact in a
601more detailed way with the interpreter.
Guido van Rossum4a944d71997-08-14 20:35:38 +0000602
Fred Drake659ebfa2000-04-03 15:42:13 +0000603Several of these functions accept a start symbol from the grammar as a
604parameter. The available start symbols are \constant{Py_eval_input},
605\constant{Py_file_input}, and \constant{Py_single_input}. These are
606described following the functions which accept them as parameters.
607
Fred Drake510d08b2000-08-14 02:50:21 +0000608Note also that several of these functions take \ctype{FILE*}
609parameters. On particular issue which needs to be handled carefully
610is that the \ctype{FILE} structure for different C libraries can be
611different and incompatible. Under Windows (at least), it is possible
612for dynamically linked extensions to actually use different libraries,
613so care should be taken that \ctype{FILE*} parameters are only passed
614to these functions if it is certain that they were created by the same
615library that the Python runtime is using.
616
Fred Drake24e62192001-05-21 15:56:55 +0000617\begin{cfuncdesc}{int}{Py_Main}{int argc, char **argv}
618 The main program for the standard interpreter. This is made
619 available for programs which embed Python. The \var{argc} and
620 \var{argv} parameters should be prepared exactly as those which are
621 passed to a C program's \cfunction{main()} function. It is
622 important to note that the argument list may be modified (but the
623 contents of the strings pointed to by the argument list are not).
624 The return value will be the integer passed to the
625 \function{sys.exit()} function, \code{1} if the interpreter exits
626 due to an exception, or \code{2} if the parameter list does not
627 represent a valid Python command line.
628\end{cfuncdesc}
629
Fred Drakec6fa34e1998-04-02 06:47:24 +0000630\begin{cfuncdesc}{int}{PyRun_AnyFile}{FILE *fp, char *filename}
Fred Drake0041a941999-04-29 04:20:46 +0000631 If \var{fp} refers to a file associated with an interactive device
632 (console or terminal input or \UNIX{} pseudo-terminal), return the
633 value of \cfunction{PyRun_InteractiveLoop()}, otherwise return the
634 result of \cfunction{PyRun_SimpleFile()}. If \var{filename} is
Fred Drakea8d73412000-08-11 20:39:29 +0000635 \NULL{}, this function uses \code{"???"} as the filename.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000636\end{cfuncdesc}
637
Fred Drakec6fa34e1998-04-02 06:47:24 +0000638\begin{cfuncdesc}{int}{PyRun_SimpleString}{char *command}
Fred Drake0041a941999-04-29 04:20:46 +0000639 Executes the Python source code from \var{command} in the
640 \module{__main__} module. If \module{__main__} does not already
641 exist, it is created. Returns \code{0} on success or \code{-1} if
642 an exception was raised. If there was an error, there is no way to
643 get the exception information.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000644\end{cfuncdesc}
645
Fred Drakec6fa34e1998-04-02 06:47:24 +0000646\begin{cfuncdesc}{int}{PyRun_SimpleFile}{FILE *fp, char *filename}
Fred Drake0041a941999-04-29 04:20:46 +0000647 Similar to \cfunction{PyRun_SimpleString()}, but the Python source
648 code is read from \var{fp} instead of an in-memory string.
649 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000650\end{cfuncdesc}
651
Fred Drakec6fa34e1998-04-02 06:47:24 +0000652\begin{cfuncdesc}{int}{PyRun_InteractiveOne}{FILE *fp, char *filename}
Fred Drakea8d73412000-08-11 20:39:29 +0000653 Read and execute a single statement from a file associated with an
654 interactive device. If \var{filename} is \NULL, \code{"???"} is
655 used instead. The user will be prompted using \code{sys.ps1} and
656 \code{sys.ps2}. Returns \code{0} when the input was executed
657 successfully, \code{-1} if there was an exception, or an error code
658 from the \file{errcode.h} include file distributed as part of Python
659 in case of a parse error. (Note that \file{errcode.h} is not
660 included by \file{Python.h}, so must be included specifically if
661 needed.)
Fred Drakee5bf8b21998-02-12 21:22:28 +0000662\end{cfuncdesc}
663
Fred Drakec6fa34e1998-04-02 06:47:24 +0000664\begin{cfuncdesc}{int}{PyRun_InteractiveLoop}{FILE *fp, char *filename}
Fred Drakea8d73412000-08-11 20:39:29 +0000665 Read and execute statements from a file associated with an
666 interactive device until \EOF{} is reached. If \var{filename} is
667 \NULL, \code{"???"} is used instead. The user will be prompted
668 using \code{sys.ps1} and \code{sys.ps2}. Returns \code{0} at \EOF.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000669\end{cfuncdesc}
670
Fred Drakec6fa34e1998-04-02 06:47:24 +0000671\begin{cfuncdesc}{struct _node*}{PyParser_SimpleParseString}{char *str,
672 int start}
Fred Drake0041a941999-04-29 04:20:46 +0000673 Parse Python source code from \var{str} using the start token
674 \var{start}. The result can be used to create a code object which
675 can be evaluated efficiently. This is useful if a code fragment
676 must be evaluated many times.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000677\end{cfuncdesc}
678
Fred Drakec6fa34e1998-04-02 06:47:24 +0000679\begin{cfuncdesc}{struct _node*}{PyParser_SimpleParseFile}{FILE *fp,
680 char *filename, int start}
Fred Drake0041a941999-04-29 04:20:46 +0000681 Similar to \cfunction{PyParser_SimpleParseString()}, but the Python
682 source code is read from \var{fp} instead of an in-memory string.
683 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000684\end{cfuncdesc}
685
Fred Drakec6fa34e1998-04-02 06:47:24 +0000686\begin{cfuncdesc}{PyObject*}{PyRun_String}{char *str, int start,
687 PyObject *globals,
688 PyObject *locals}
Fred Drake0041a941999-04-29 04:20:46 +0000689 Execute Python source code from \var{str} in the context specified
690 by the dictionaries \var{globals} and \var{locals}. The parameter
691 \var{start} specifies the start token that should be used to parse
692 the source code.
693
694 Returns the result of executing the code as a Python object, or
695 \NULL{} if an exception was raised.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000696\end{cfuncdesc}
697
Fred Drakec6fa34e1998-04-02 06:47:24 +0000698\begin{cfuncdesc}{PyObject*}{PyRun_File}{FILE *fp, char *filename,
699 int start, PyObject *globals,
700 PyObject *locals}
Fred Drake0041a941999-04-29 04:20:46 +0000701 Similar to \cfunction{PyRun_String()}, but the Python source code is
Fred Drake659ebfa2000-04-03 15:42:13 +0000702 read from \var{fp} instead of an in-memory string.
703 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000704\end{cfuncdesc}
705
Fred Drakec6fa34e1998-04-02 06:47:24 +0000706\begin{cfuncdesc}{PyObject*}{Py_CompileString}{char *str, char *filename,
707 int start}
Fred Drake0041a941999-04-29 04:20:46 +0000708 Parse and compile the Python source code in \var{str}, returning the
709 resulting code object. The start token is given by \var{start};
Fred Drakec924b8d1999-08-23 18:57:25 +0000710 this can be used to constrain the code which can be compiled and should
711 be \constant{Py_eval_input}, \constant{Py_file_input}, or
712 \constant{Py_single_input}. The filename specified by
713 \var{filename} is used to construct the code object and may appear
714 in tracebacks or \exception{SyntaxError} exception messages. This
715 returns \NULL{} if the code cannot be parsed or compiled.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000716\end{cfuncdesc}
717
Fred Drakec924b8d1999-08-23 18:57:25 +0000718\begin{cvardesc}{int}{Py_eval_input}
719 The start symbol from the Python grammar for isolated expressions;
Fred Drake659ebfa2000-04-03 15:42:13 +0000720 for use with \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}.
Fred Drakec924b8d1999-08-23 18:57:25 +0000721\end{cvardesc}
722
723\begin{cvardesc}{int}{Py_file_input}
724 The start symbol from the Python grammar for sequences of statements
725 as read from a file or other source; for use with
Fred Drake659ebfa2000-04-03 15:42:13 +0000726 \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}. This is
727 the symbol to use when compiling arbitrarily long Python source code.
Fred Drakec924b8d1999-08-23 18:57:25 +0000728\end{cvardesc}
729
730\begin{cvardesc}{int}{Py_single_input}
731 The start symbol from the Python grammar for a single statement; for
Fred Drake659ebfa2000-04-03 15:42:13 +0000732 use with \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}.
733 This is the symbol used for the interactive interpreter loop.
Fred Drakec924b8d1999-08-23 18:57:25 +0000734\end{cvardesc}
735
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000736
Fred Drakeefd146c1999-02-15 15:30:45 +0000737\chapter{Reference Counting \label{countingRefs}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000738
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000739The macros in this section are used for managing reference counts
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000740of Python objects.
741
742\begin{cfuncdesc}{void}{Py_INCREF}{PyObject *o}
Fred Drakec6fa34e1998-04-02 06:47:24 +0000743Increment the reference count for object \var{o}. The object must
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000744not be \NULL{}; if you aren't sure that it isn't \NULL{}, use
Fred Drakee058b4f1998-02-16 06:15:35 +0000745\cfunction{Py_XINCREF()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000746\end{cfuncdesc}
747
748\begin{cfuncdesc}{void}{Py_XINCREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000749Increment the reference count for object \var{o}. The object may be
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000750\NULL{}, in which case the macro has no effect.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000751\end{cfuncdesc}
752
753\begin{cfuncdesc}{void}{Py_DECREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000754Decrement the reference count for object \var{o}. The object must
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000755not be \NULL{}; if you aren't sure that it isn't \NULL{}, use
Fred Drakee058b4f1998-02-16 06:15:35 +0000756\cfunction{Py_XDECREF()}. If the reference count reaches zero, the
757object's type's deallocation function (which must not be \NULL{}) is
758invoked.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000759
760\strong{Warning:} The deallocation function can cause arbitrary Python
Fred Drake659ebfa2000-04-03 15:42:13 +0000761code to be invoked (e.g. when a class instance with a
762\method{__del__()} method is deallocated). While exceptions in such
763code are not propagated, the executed code has free access to all
764Python global variables. This means that any object that is reachable
765from a global variable should be in a consistent state before
766\cfunction{Py_DECREF()} is invoked. For example, code to delete an
767object from a list should copy a reference to the deleted object in a
768temporary variable, update the list data structure, and then call
769\cfunction{Py_DECREF()} for the temporary variable.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000770\end{cfuncdesc}
771
772\begin{cfuncdesc}{void}{Py_XDECREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000773Decrement the reference count for object \var{o}. The object may be
774\NULL{}, in which case the macro has no effect; otherwise the effect
775is the same as for \cfunction{Py_DECREF()}, and the same warning
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000776applies.
777\end{cfuncdesc}
778
Fred Drake659ebfa2000-04-03 15:42:13 +0000779The following functions or macros are only for use within the
780interpreter core: \cfunction{_Py_Dealloc()},
781\cfunction{_Py_ForgetReference()}, \cfunction{_Py_NewReference()}, as
782well as the global variable \cdata{_Py_RefTotal}.
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000783
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000784
Fred Drakeefd146c1999-02-15 15:30:45 +0000785\chapter{Exception Handling \label{exceptionHandling}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000786
Fred Drake659ebfa2000-04-03 15:42:13 +0000787The functions described in this chapter will let you handle and raise Python
Guido van Rossumae110af1997-05-22 20:11:52 +0000788exceptions. It is important to understand some of the basics of
Fred Drake659ebfa2000-04-03 15:42:13 +0000789Python exception handling. It works somewhat like the
790\UNIX{} \cdata{errno} variable: there is a global indicator (per
791thread) of the last error that occurred. Most functions don't clear
792this on success, but will set it to indicate the cause of the error on
793failure. Most functions also return an error indicator, usually
794\NULL{} if they are supposed to return a pointer, or \code{-1} if they
795return an integer (exception: the \cfunction{PyArg_Parse*()} functions
796return \code{1} for success and \code{0} for failure). When a
797function must fail because some function it called failed, it
798generally doesn't set the error indicator; the function it called
799already set it.
Guido van Rossumae110af1997-05-22 20:11:52 +0000800
801The error indicator consists of three Python objects corresponding to
Fred Drake659ebfa2000-04-03 15:42:13 +0000802\withsubitem{(in module sys)}{
803 \ttindex{exc_type}\ttindex{exc_value}\ttindex{exc_traceback}}
Guido van Rossumae110af1997-05-22 20:11:52 +0000804the Python variables \code{sys.exc_type}, \code{sys.exc_value} and
805\code{sys.exc_traceback}. API functions exist to interact with the
806error indicator in various ways. There is a separate error indicator
807for each thread.
808
809% XXX Order of these should be more thoughtful.
810% Either alphabetical or some kind of structure.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000811
812\begin{cfuncdesc}{void}{PyErr_Print}{}
Guido van Rossumae110af1997-05-22 20:11:52 +0000813Print a standard traceback to \code{sys.stderr} and clear the error
814indicator. Call this function only when the error indicator is set.
815(Otherwise it will cause a fatal error!)
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000816\end{cfuncdesc}
817
Fred Drakec6fa34e1998-04-02 06:47:24 +0000818\begin{cfuncdesc}{PyObject*}{PyErr_Occurred}{}
Guido van Rossumae110af1997-05-22 20:11:52 +0000819Test whether the error indicator is set. If set, return the exception
Fred Drakee058b4f1998-02-16 06:15:35 +0000820\emph{type} (the first argument to the last call to one of the
Fred Drakef8830d11998-04-23 14:06:01 +0000821\cfunction{PyErr_Set*()} functions or to \cfunction{PyErr_Restore()}). If
Fred Drakee058b4f1998-02-16 06:15:35 +0000822not set, return \NULL{}. You do not own a reference to the return
823value, so you do not need to \cfunction{Py_DECREF()} it.
Fred Drake659ebfa2000-04-03 15:42:13 +0000824\strong{Note:} Do not compare the return value to a specific
Fred Drakee058b4f1998-02-16 06:15:35 +0000825exception; use \cfunction{PyErr_ExceptionMatches()} instead, shown
Fred Drake659ebfa2000-04-03 15:42:13 +0000826below. (The comparison could easily fail since the exception may be
827an instance instead of a class, in the case of a class exception, or
828it may the a subclass of the expected exception.)
Guido van Rossum42cefd01997-10-05 15:27:29 +0000829\end{cfuncdesc}
830
831\begin{cfuncdesc}{int}{PyErr_ExceptionMatches}{PyObject *exc}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000832Equivalent to
Fred Drakee058b4f1998-02-16 06:15:35 +0000833\samp{PyErr_GivenExceptionMatches(PyErr_Occurred(), \var{exc})}.
Fred Drake659ebfa2000-04-03 15:42:13 +0000834This should only be called when an exception is actually set; a memory
835access violation will occur if no exception has been raised.
Guido van Rossum42cefd01997-10-05 15:27:29 +0000836\end{cfuncdesc}
837
838\begin{cfuncdesc}{int}{PyErr_GivenExceptionMatches}{PyObject *given, PyObject *exc}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000839Return true if the \var{given} exception matches the exception in
840\var{exc}. If \var{exc} is a class object, this also returns true
Fred Drake659ebfa2000-04-03 15:42:13 +0000841when \var{given} is an instance of a subclass. If \var{exc} is a tuple, all
Guido van Rossum42cefd01997-10-05 15:27:29 +0000842exceptions in the tuple (and recursively in subtuples) are searched
Fred Drake659ebfa2000-04-03 15:42:13 +0000843for a match. If \var{given} is \NULL, a memory access violation will
844occur.
Guido van Rossum42cefd01997-10-05 15:27:29 +0000845\end{cfuncdesc}
846
847\begin{cfuncdesc}{void}{PyErr_NormalizeException}{PyObject**exc, PyObject**val, PyObject**tb}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000848Under certain circumstances, the values returned by
Fred Drakee058b4f1998-02-16 06:15:35 +0000849\cfunction{PyErr_Fetch()} below can be ``unnormalized'', meaning that
850\code{*\var{exc}} is a class object but \code{*\var{val}} is not an
851instance of the same class. This function can be used to instantiate
852the class in that case. If the values are already normalized, nothing
Fred Drake659ebfa2000-04-03 15:42:13 +0000853happens. The delayed normalization is implemented to improve
854performance.
Guido van Rossumae110af1997-05-22 20:11:52 +0000855\end{cfuncdesc}
856
857\begin{cfuncdesc}{void}{PyErr_Clear}{}
858Clear the error indicator. If the error indicator is not set, there
859is no effect.
860\end{cfuncdesc}
861
Fred Drake659ebfa2000-04-03 15:42:13 +0000862\begin{cfuncdesc}{void}{PyErr_Fetch}{PyObject **ptype, PyObject **pvalue,
863 PyObject **ptraceback}
Guido van Rossumae110af1997-05-22 20:11:52 +0000864Retrieve the error indicator into three variables whose addresses are
865passed. If the error indicator is not set, set all three variables to
866\NULL{}. If it is set, it will be cleared and you own a reference to
Fred Drake659ebfa2000-04-03 15:42:13 +0000867each object retrieved. The value and traceback object may be
868\NULL{} even when the type object is not. \strong{Note:} This
869function is normally only used by code that needs to handle exceptions
870or by code that needs to save and restore the error indicator
871temporarily.
Guido van Rossumae110af1997-05-22 20:11:52 +0000872\end{cfuncdesc}
873
Fred Drake17e63432000-08-31 05:50:40 +0000874\begin{cfuncdesc}{void}{PyErr_Restore}{PyObject *type, PyObject *value,
875 PyObject *traceback}
Guido van Rossumae110af1997-05-22 20:11:52 +0000876Set the error indicator from the three objects. If the error
877indicator is already set, it is cleared first. If the objects are
878\NULL{}, the error indicator is cleared. Do not pass a \NULL{} type
879and non-\NULL{} value or traceback. The exception type should be a
880string or class; if it is a class, the value should be an instance of
881that class. Do not pass an invalid exception type or value.
882(Violating these rules will cause subtle problems later.) This call
Fred Drakebab29652001-07-10 16:10:08 +0000883takes away a reference to each object: you must own a reference
Guido van Rossumae110af1997-05-22 20:11:52 +0000884to each object before the call and after the call you no longer own
885these references. (If you don't understand this, don't use this
Fred Drake659ebfa2000-04-03 15:42:13 +0000886function. I warned you.) \strong{Note:} This function is normally
Guido van Rossumae110af1997-05-22 20:11:52 +0000887only used by code that needs to save and restore the error indicator
888temporarily.
889\end{cfuncdesc}
890
891\begin{cfuncdesc}{void}{PyErr_SetString}{PyObject *type, char *message}
892This is the most common way to set the error indicator. The first
893argument specifies the exception type; it is normally one of the
Fred Drakef8830d11998-04-23 14:06:01 +0000894standard exceptions, e.g. \cdata{PyExc_RuntimeError}. You need not
Guido van Rossumae110af1997-05-22 20:11:52 +0000895increment its reference count. The second argument is an error
896message; it is converted to a string object.
897\end{cfuncdesc}
898
899\begin{cfuncdesc}{void}{PyErr_SetObject}{PyObject *type, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +0000900This function is similar to \cfunction{PyErr_SetString()} but lets you
Guido van Rossumae110af1997-05-22 20:11:52 +0000901specify an arbitrary Python object for the ``value'' of the exception.
902You need not increment its reference count.
903\end{cfuncdesc}
904
Fred Drake73577702000-04-10 18:50:14 +0000905\begin{cfuncdesc}{PyObject*}{PyErr_Format}{PyObject *exception,
Moshe Zadka57a59322000-09-01 09:47:20 +0000906 const char *format, \moreargs}
Fred Drake89fb0352000-10-14 05:49:30 +0000907This function sets the error indicator. \var{exception} should be a
908Python exception (string or class, not an instance).
Fred Drake5566c1c2001-01-19 22:48:33 +0000909\var{format} should be a string, containing format codes, similar to
Moshe Zadka57a59322000-09-01 09:47:20 +0000910\cfunction{printf}. The \code{width.precision} before a format code
911is parsed, but the width part is ignored.
912
913\begin{tableii}{c|l}{character}{Character}{Meaning}
914 \lineii{c}{Character, as an \ctype{int} parameter}
915 \lineii{d}{Number in decimal, as an \ctype{int} parameter}
916 \lineii{x}{Number in hexadecimal, as an \ctype{int} parameter}
917 \lineii{x}{A string, as a \ctype{char *} parameter}
918\end{tableii}
919
920An unrecognized format character causes all the rest of
921the format string to be copied as-is to the result string,
922and any extra arguments discarded.
923
924A new reference is returned, which is owned by the caller.
Jeremy Hylton98605b52000-04-10 18:40:57 +0000925\end{cfuncdesc}
926
Guido van Rossumae110af1997-05-22 20:11:52 +0000927\begin{cfuncdesc}{void}{PyErr_SetNone}{PyObject *type}
Fred Drakee058b4f1998-02-16 06:15:35 +0000928This is a shorthand for \samp{PyErr_SetObject(\var{type}, Py_None)}.
Guido van Rossumae110af1997-05-22 20:11:52 +0000929\end{cfuncdesc}
930
931\begin{cfuncdesc}{int}{PyErr_BadArgument}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000932This is a shorthand for \samp{PyErr_SetString(PyExc_TypeError,
Guido van Rossumae110af1997-05-22 20:11:52 +0000933\var{message})}, where \var{message} indicates that a built-in operation
934was invoked with an illegal argument. It is mostly for internal use.
935\end{cfuncdesc}
936
Fred Drakec6fa34e1998-04-02 06:47:24 +0000937\begin{cfuncdesc}{PyObject*}{PyErr_NoMemory}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000938This is a shorthand for \samp{PyErr_SetNone(PyExc_MemoryError)}; it
Guido van Rossumae110af1997-05-22 20:11:52 +0000939returns \NULL{} so an object allocation function can write
Fred Drakee058b4f1998-02-16 06:15:35 +0000940\samp{return PyErr_NoMemory();} when it runs out of memory.
Guido van Rossumae110af1997-05-22 20:11:52 +0000941\end{cfuncdesc}
942
Fred Drakec6fa34e1998-04-02 06:47:24 +0000943\begin{cfuncdesc}{PyObject*}{PyErr_SetFromErrno}{PyObject *type}
Fred Drake659ebfa2000-04-03 15:42:13 +0000944This is a convenience function to raise an exception when a C library
945function has returned an error and set the C variable \cdata{errno}.
Guido van Rossumae110af1997-05-22 20:11:52 +0000946It constructs a tuple object whose first item is the integer
Fred Drakef8830d11998-04-23 14:06:01 +0000947\cdata{errno} value and whose second item is the corresponding error
Fred Drake659ebfa2000-04-03 15:42:13 +0000948message (gotten from \cfunction{strerror()}\ttindex{strerror()}), and
949then calls
Fred Drakee058b4f1998-02-16 06:15:35 +0000950\samp{PyErr_SetObject(\var{type}, \var{object})}. On \UNIX{}, when
Fred Drakef8830d11998-04-23 14:06:01 +0000951the \cdata{errno} value is \constant{EINTR}, indicating an interrupted
Fred Drakee058b4f1998-02-16 06:15:35 +0000952system call, this calls \cfunction{PyErr_CheckSignals()}, and if that set
Guido van Rossumae110af1997-05-22 20:11:52 +0000953the error indicator, leaves it set to that. The function always
954returns \NULL{}, so a wrapper function around a system call can write
Fred Drakee058b4f1998-02-16 06:15:35 +0000955\samp{return PyErr_SetFromErrno();} when the system call returns an
956error.
Guido van Rossumae110af1997-05-22 20:11:52 +0000957\end{cfuncdesc}
958
Fred Drake490d34d2001-06-20 21:39:12 +0000959\begin{cfuncdesc}{PyObject*}{PyErr_SetFromErrnoWithFilename}{PyObject *type,
960 char *filename}
961Similar to \cfunction{PyErr_SetFromErrno()}, with the additional
962behavior that if \var{filename} is not \NULL, it is passed to the
963constructor of \var{type} as a third parameter. In the case of
964exceptions such as \exception{IOError} and \exception{OSError}, this
965is used to define the \member{filename} attribute of the exception
966instance.
967\end{cfuncdesc}
968
Guido van Rossumae110af1997-05-22 20:11:52 +0000969\begin{cfuncdesc}{void}{PyErr_BadInternalCall}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000970This is a shorthand for \samp{PyErr_SetString(PyExc_TypeError,
Guido van Rossumae110af1997-05-22 20:11:52 +0000971\var{message})}, where \var{message} indicates that an internal
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000972operation (e.g. a Python/C API function) was invoked with an illegal
Guido van Rossumae110af1997-05-22 20:11:52 +0000973argument. It is mostly for internal use.
974\end{cfuncdesc}
975
Guido van Rossum3dbb4062000-12-19 03:53:01 +0000976\begin{cfuncdesc}{int}{PyErr_Warn}{PyObject *category, char *message}
977Issue a warning message. The \var{category} argument is a warning
Fred Drake5566c1c2001-01-19 22:48:33 +0000978category (see below) or \NULL; the \var{message} argument is a message
Guido van Rossum3dbb4062000-12-19 03:53:01 +0000979string.
980
981This function normally prints a warning message to \var{sys.stderr};
982however, it is also possible that the user has specified that warnings
983are to be turned into errors, and in that case this will raise an
984exception. It is also possible that the function raises an exception
985because of a problem with the warning machinery (the implementation
986imports the \module{warnings} module to do the heavy lifting). The
987return value is \code{0} if no exception is raised, or \code{-1} if
988an exception is raised. (It is not possible to determine whether a
989warning message is actually printed, nor what the reason is for the
990exception; this is intentional.) If an exception is raised, the
Fred Drake5566c1c2001-01-19 22:48:33 +0000991caller should do its normal exception handling
992(e.g. \cfunction{Py_DECREF()} owned references and return an error
993value).
Guido van Rossum3dbb4062000-12-19 03:53:01 +0000994
995Warning categories must be subclasses of \cdata{Warning}; the default
996warning category is \cdata{RuntimeWarning}. The standard Python
997warning categories are available as global variables whose names are
998\samp{PyExc_} followed by the Python exception name. These have the
999type \ctype{PyObject*}; they are all class objects. Their names are
1000\cdata{PyExc_Warning}, \cdata{PyExc_UserWarning},
1001\cdata{PyExc_DeprecationWarning}, \cdata{PyExc_SyntaxWarning}, and
1002\cdata{PyExc_RuntimeWarning}. \cdata{PyExc_Warning} is a subclass of
1003\cdata{PyExc_Exception}; the other warning categories are subclasses
1004of \cdata{PyExc_Warning}.
1005
1006For information about warning control, see the documentation for the
Fred Drake316ef7c2001-01-04 05:56:34 +00001007\module{warnings} module and the \programopt{-W} option in the command
1008line documentation. There is no C API for warning control.
Guido van Rossum3dbb4062000-12-19 03:53:01 +00001009\end{cfuncdesc}
1010
Guido van Rossum1874c8f2001-02-28 23:46:44 +00001011\begin{cfuncdesc}{int}{PyErr_WarnExplicit}{PyObject *category, char *message,
1012char *filename, int lineno, char *module, PyObject *registry}
1013Issue a warning message with explicit control over all warning
1014attributes. This is a straightforward wrapper around the Python
1015function \function{warnings.warn_explicit()}, see there for more
1016information. The \var{module} and \var{registry} arguments may be
1017set to \code{NULL} to get the default effect described there.
1018\end{cfuncdesc}
1019
Guido van Rossumae110af1997-05-22 20:11:52 +00001020\begin{cfuncdesc}{int}{PyErr_CheckSignals}{}
1021This function interacts with Python's signal handling. It checks
1022whether a signal has been sent to the processes and if so, invokes the
Fred Drake4de05a91998-02-16 14:25:26 +00001023corresponding signal handler. If the
1024\module{signal}\refbimodindex{signal} module is supported, this can
1025invoke a signal handler written in Python. In all cases, the default
Fred Drake659ebfa2000-04-03 15:42:13 +00001026effect for \constant{SIGINT}\ttindex{SIGINT} is to raise the
1027\withsubitem{(built-in exception)}{\ttindex{KeyboardInterrupt}}
1028\exception{KeyboardInterrupt} exception. If an exception is raised the
Fred Drakee058b4f1998-02-16 06:15:35 +00001029error indicator is set and the function returns \code{1}; otherwise
1030the function returns \code{0}. The error indicator may or may not be
1031cleared if it was previously set.
Guido van Rossumae110af1997-05-22 20:11:52 +00001032\end{cfuncdesc}
1033
1034\begin{cfuncdesc}{void}{PyErr_SetInterrupt}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00001035This function is obsolete. It simulates the effect of a
1036\constant{SIGINT}\ttindex{SIGINT} signal arriving --- the next time
Fred Drakee058b4f1998-02-16 06:15:35 +00001037\cfunction{PyErr_CheckSignals()} is called,
Fred Drake659ebfa2000-04-03 15:42:13 +00001038\withsubitem{(built-in exception)}{\ttindex{KeyboardInterrupt}}
1039\exception{KeyboardInterrupt} will be raised.
1040It may be called without holding the interpreter lock.
Guido van Rossumae110af1997-05-22 20:11:52 +00001041\end{cfuncdesc}
1042
Fred Drakec6fa34e1998-04-02 06:47:24 +00001043\begin{cfuncdesc}{PyObject*}{PyErr_NewException}{char *name,
1044 PyObject *base,
1045 PyObject *dict}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001046This utility function creates and returns a new exception object. The
Fred Drake659ebfa2000-04-03 15:42:13 +00001047\var{name} argument must be the name of the new exception, a C string
1048of the form \code{module.class}. The \var{base} and
Fred Draked04038d2000-06-29 20:15:14 +00001049\var{dict} arguments are normally \NULL{}. This creates a
Fred Drake659ebfa2000-04-03 15:42:13 +00001050class object derived from the root for all exceptions, the built-in
1051name \exception{Exception} (accessible in C as
Fred Draked04038d2000-06-29 20:15:14 +00001052\cdata{PyExc_Exception}). The \member{__module__} attribute of the
1053new class is set to the first part (up to the last dot) of the
1054\var{name} argument, and the class name is set to the last part (after
1055the last dot). The \var{base} argument can be used to specify an
1056alternate base class. The \var{dict} argument can be used to specify
1057a dictionary of class variables and methods.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001058\end{cfuncdesc}
1059
Jeremy Hyltonb709df32000-09-01 02:47:25 +00001060\begin{cfuncdesc}{void}{PyErr_WriteUnraisable}{PyObject *obj}
1061This utility function prints a warning message to \var{sys.stderr}
1062when an exception has been set but it is impossible for the
1063interpreter to actually raise the exception. It is used, for example,
1064when an exception occurs in an \member{__del__} method.
1065
1066The function is called with a single argument \var{obj} that
1067identifies where the context in which the unraisable exception
1068occurred. The repr of \var{obj} will be printed in the warning
1069message.
1070\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001071
Fred Drakeefd146c1999-02-15 15:30:45 +00001072\section{Standard Exceptions \label{standardExceptions}}
Guido van Rossumae110af1997-05-22 20:11:52 +00001073
1074All standard Python exceptions are available as global variables whose
Fred Drake659ebfa2000-04-03 15:42:13 +00001075names are \samp{PyExc_} followed by the Python exception name. These
1076have the type \ctype{PyObject*}; they are all class objects. For
1077completeness, here are all the variables:
1078
1079\begin{tableiii}{l|l|c}{cdata}{C Name}{Python Name}{Notes}
1080 \lineiii{PyExc_Exception}{\exception{Exception}}{(1)}
1081 \lineiii{PyExc_StandardError}{\exception{StandardError}}{(1)}
1082 \lineiii{PyExc_ArithmeticError}{\exception{ArithmeticError}}{(1)}
1083 \lineiii{PyExc_LookupError}{\exception{LookupError}}{(1)}
1084 \lineiii{PyExc_AssertionError}{\exception{AssertionError}}{}
1085 \lineiii{PyExc_AttributeError}{\exception{AttributeError}}{}
1086 \lineiii{PyExc_EOFError}{\exception{EOFError}}{}
1087 \lineiii{PyExc_EnvironmentError}{\exception{EnvironmentError}}{(1)}
1088 \lineiii{PyExc_FloatingPointError}{\exception{FloatingPointError}}{}
1089 \lineiii{PyExc_IOError}{\exception{IOError}}{}
1090 \lineiii{PyExc_ImportError}{\exception{ImportError}}{}
1091 \lineiii{PyExc_IndexError}{\exception{IndexError}}{}
1092 \lineiii{PyExc_KeyError}{\exception{KeyError}}{}
1093 \lineiii{PyExc_KeyboardInterrupt}{\exception{KeyboardInterrupt}}{}
1094 \lineiii{PyExc_MemoryError}{\exception{MemoryError}}{}
1095 \lineiii{PyExc_NameError}{\exception{NameError}}{}
1096 \lineiii{PyExc_NotImplementedError}{\exception{NotImplementedError}}{}
1097 \lineiii{PyExc_OSError}{\exception{OSError}}{}
1098 \lineiii{PyExc_OverflowError}{\exception{OverflowError}}{}
1099 \lineiii{PyExc_RuntimeError}{\exception{RuntimeError}}{}
1100 \lineiii{PyExc_SyntaxError}{\exception{SyntaxError}}{}
1101 \lineiii{PyExc_SystemError}{\exception{SystemError}}{}
1102 \lineiii{PyExc_SystemExit}{\exception{SystemExit}}{}
1103 \lineiii{PyExc_TypeError}{\exception{TypeError}}{}
1104 \lineiii{PyExc_ValueError}{\exception{ValueError}}{}
Fred Drakea8d73412000-08-11 20:39:29 +00001105 \lineiii{PyExc_WindowsError}{\exception{WindowsError}}{(2)}
Fred Drake659ebfa2000-04-03 15:42:13 +00001106 \lineiii{PyExc_ZeroDivisionError}{\exception{ZeroDivisionError}}{}
1107\end{tableiii}
1108
1109\noindent
Fred Drakea8d73412000-08-11 20:39:29 +00001110Notes:
Fred Drake659ebfa2000-04-03 15:42:13 +00001111\begin{description}
1112\item[(1)]
Fred Draked04038d2000-06-29 20:15:14 +00001113 This is a base class for other standard exceptions.
Fred Drakea8d73412000-08-11 20:39:29 +00001114
1115\item[(2)]
1116 Only defined on Windows; protect code that uses this by testing that
1117 the preprocessor macro \code{MS_WINDOWS} is defined.
Fred Drake659ebfa2000-04-03 15:42:13 +00001118\end{description}
1119
1120
1121\section{Deprecation of String Exceptions}
1122
Fred Draked04038d2000-06-29 20:15:14 +00001123All exceptions built into Python or provided in the standard library
1124are derived from \exception{Exception}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001125\withsubitem{(built-in exception)}{\ttindex{Exception}}
Fred Drake659ebfa2000-04-03 15:42:13 +00001126
Fred Draked04038d2000-06-29 20:15:14 +00001127String exceptions are still supported in the interpreter to allow
Fred Drake659ebfa2000-04-03 15:42:13 +00001128existing code to run unmodified, but this will also change in a future
1129release.
Guido van Rossumae110af1997-05-22 20:11:52 +00001130
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001131
Fred Drakeefd146c1999-02-15 15:30:45 +00001132\chapter{Utilities \label{utilities}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001133
1134The functions in this chapter perform various utility tasks, such as
Fred Drake659ebfa2000-04-03 15:42:13 +00001135parsing function arguments and constructing Python values from C
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001136values.
1137
Fred Drake377fb1e2001-07-14 03:01:48 +00001138\section{Operating System Utilities \label{os}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001139
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001140\begin{cfuncdesc}{int}{Py_FdIsInteractive}{FILE *fp, char *filename}
Fred Drakee058b4f1998-02-16 06:15:35 +00001141Return true (nonzero) if the standard I/O file \var{fp} with name
1142\var{filename} is deemed interactive. This is the case for files for
1143which \samp{isatty(fileno(\var{fp}))} is true. If the global flag
Fred Drakef8830d11998-04-23 14:06:01 +00001144\cdata{Py_InteractiveFlag} is true, this function also returns true if
Fred Drake5566c1c2001-01-19 22:48:33 +00001145the \var{filename} pointer is \NULL{} or if the name is equal to one of
Fred Drakea8455ab2000-06-16 19:58:42 +00001146the strings \code{'<stdin>'} or \code{'???'}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001147\end{cfuncdesc}
1148
1149\begin{cfuncdesc}{long}{PyOS_GetLastModificationTime}{char *filename}
Fred Drakee058b4f1998-02-16 06:15:35 +00001150Return the time of last modification of the file \var{filename}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001151The result is encoded in the same way as the timestamp returned by
Fred Drake659ebfa2000-04-03 15:42:13 +00001152the standard C library function \cfunction{time()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001153\end{cfuncdesc}
1154
Fred Drakecabbc3b2000-06-28 15:53:13 +00001155\begin{cfuncdesc}{void}{PyOS_AfterFork}{}
1156Function to update some internal state after a process fork; this
1157should be called in the new process if the Python interpreter will
1158continue to be used. If a new executable is loaded into the new
1159process, this function does not need to be called.
1160\end{cfuncdesc}
1161
Fred Drake17e63432000-08-31 05:50:40 +00001162\begin{cfuncdesc}{int}{PyOS_CheckStack}{}
1163Return true when the interpreter runs out of stack space. This is a
1164reliable check, but is only available when \code{USE_STACKCHECK} is
1165defined (currently on Windows using the Microsoft Visual C++ compiler
1166and on the Macintosh). \code{USE_CHECKSTACK} will be defined
1167automatically; you should never change the definition in your own
1168code.
1169\end{cfuncdesc}
1170
Guido van Rossumc96ec6e2000-09-16 16:30:48 +00001171\begin{cfuncdesc}{PyOS_sighandler_t}{PyOS_getsig}{int i}
1172Return the current signal handler for signal \var{i}.
1173This is a thin wrapper around either \cfunction{sigaction} or
1174\cfunction{signal}. Do not call those functions directly!
1175\ctype{PyOS_sighandler_t} is a typedef alias for \ctype{void (*)(int)}.
1176\end{cfuncdesc}
1177
1178\begin{cfuncdesc}{PyOS_sighandler_t}{PyOS_setsig}{int i, PyOS_sighandler_t h}
1179Set the signal handler for signal \var{i} to be \var{h};
1180return the old signal handler.
1181This is a thin wrapper around either \cfunction{sigaction} or
1182\cfunction{signal}. Do not call those functions directly!
1183\ctype{PyOS_sighandler_t} is a typedef alias for \ctype{void (*)(int)}.
1184\end{cfuncdesc}
1185
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001186
Fred Drakeefd146c1999-02-15 15:30:45 +00001187\section{Process Control \label{processControl}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001188
1189\begin{cfuncdesc}{void}{Py_FatalError}{char *message}
1190Print a fatal error message and kill the process. No cleanup is
1191performed. This function should only be invoked when a condition is
1192detected that would make it dangerous to continue using the Python
1193interpreter; e.g., when the object administration appears to be
Fred Drake659ebfa2000-04-03 15:42:13 +00001194corrupted. On \UNIX{}, the standard C library function
1195\cfunction{abort()}\ttindex{abort()} is called which will attempt to
1196produce a \file{core} file.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001197\end{cfuncdesc}
1198
1199\begin{cfuncdesc}{void}{Py_Exit}{int status}
Fred Drake659ebfa2000-04-03 15:42:13 +00001200Exit the current process. This calls
1201\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} and
1202then calls the standard C library function
1203\code{exit(\var{status})}\ttindex{exit()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001204\end{cfuncdesc}
1205
1206\begin{cfuncdesc}{int}{Py_AtExit}{void (*func) ()}
Fred Drake659ebfa2000-04-03 15:42:13 +00001207Register a cleanup function to be called by
1208\cfunction{Py_Finalize()}\ttindex{Py_Finalize()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001209The cleanup function will be called with no arguments and should
Fred Drake659ebfa2000-04-03 15:42:13 +00001210return no value. At most 32 \index{cleanup functions}cleanup
1211functions can be registered.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001212When the registration is successful, \cfunction{Py_AtExit()} returns
1213\code{0}; on failure, it returns \code{-1}. The cleanup function
1214registered last is called first. Each cleanup function will be called
1215at most once. Since Python's internal finallization will have
1216completed before the cleanup function, no Python APIs should be called
1217by \var{func}.
1218\end{cfuncdesc}
1219
1220
Fred Drakeefd146c1999-02-15 15:30:45 +00001221\section{Importing Modules \label{importing}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001222
Fred Drakec6fa34e1998-04-02 06:47:24 +00001223\begin{cfuncdesc}{PyObject*}{PyImport_ImportModule}{char *name}
Fred Drake659ebfa2000-04-03 15:42:13 +00001224This is a simplified interface to
1225\cfunction{PyImport_ImportModuleEx()} below, leaving the
1226\var{globals} and \var{locals} arguments set to \NULL{}. When the
Fred Drakebab29652001-07-10 16:10:08 +00001227\var{name} argument contains a dot (when it specifies a
Fred Drake659ebfa2000-04-03 15:42:13 +00001228submodule of a package), the \var{fromlist} argument is set to the
1229list \code{['*']} so that the return value is the named module rather
1230than the top-level package containing it as would otherwise be the
1231case. (Unfortunately, this has an additional side effect when
1232\var{name} in fact specifies a subpackage instead of a submodule: the
1233submodules specified in the package's \code{__all__} variable are
1234\index{package variable!\code{__all__}}
1235\withsubitem{(package variable)}{\ttindex{__all__}}loaded.) Return a
1236new reference to the imported module, or
1237\NULL{} with an exception set on failure (the module may still be
1238created in this case --- examine \code{sys.modules} to find out).
1239\withsubitem{(in module sys)}{\ttindex{modules}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001240\end{cfuncdesc}
1241
Fred Drakec6fa34e1998-04-02 06:47:24 +00001242\begin{cfuncdesc}{PyObject*}{PyImport_ImportModuleEx}{char *name, PyObject *globals, PyObject *locals, PyObject *fromlist}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001243Import a module. This is best described by referring to the built-in
Fred Drake53fb7721998-02-16 06:23:20 +00001244Python function \function{__import__()}\bifuncindex{__import__}, as
1245the standard \function{__import__()} function calls this function
1246directly.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001247
Guido van Rossum42cefd01997-10-05 15:27:29 +00001248The return value is a new reference to the imported module or
Guido van Rossum580aa8d1997-11-25 15:34:51 +00001249top-level package, or \NULL{} with an exception set on failure
Guido van Rossumc44d3d61997-10-06 05:10:47 +00001250(the module may still be created in this case). Like for
Fred Drakee058b4f1998-02-16 06:15:35 +00001251\function{__import__()}, the return value when a submodule of a
1252package was requested is normally the top-level package, unless a
1253non-empty \var{fromlist} was given.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001254\end{cfuncdesc}
1255
Fred Drakec6fa34e1998-04-02 06:47:24 +00001256\begin{cfuncdesc}{PyObject*}{PyImport_Import}{PyObject *name}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001257This is a higher-level interface that calls the current ``import hook
Fred Drakee058b4f1998-02-16 06:15:35 +00001258function''. It invokes the \function{__import__()} function from the
Guido van Rossum42cefd01997-10-05 15:27:29 +00001259\code{__builtins__} of the current globals. This means that the
1260import is done using whatever import hooks are installed in the
Fred Drake4de05a91998-02-16 14:25:26 +00001261current environment, e.g. by \module{rexec}\refstmodindex{rexec} or
1262\module{ihooks}\refstmodindex{ihooks}.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001263\end{cfuncdesc}
1264
Fred Drakec6fa34e1998-04-02 06:47:24 +00001265\begin{cfuncdesc}{PyObject*}{PyImport_ReloadModule}{PyObject *m}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001266Reload a module. This is best described by referring to the built-in
Fred Drake53fb7721998-02-16 06:23:20 +00001267Python function \function{reload()}\bifuncindex{reload}, as the standard
Fred Drakee058b4f1998-02-16 06:15:35 +00001268\function{reload()} function calls this function directly. Return a
1269new reference to the reloaded module, or \NULL{} with an exception set
1270on failure (the module still exists in this case).
Guido van Rossum42cefd01997-10-05 15:27:29 +00001271\end{cfuncdesc}
1272
Fred Drakec6fa34e1998-04-02 06:47:24 +00001273\begin{cfuncdesc}{PyObject*}{PyImport_AddModule}{char *name}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001274Return the module object corresponding to a module name. The
1275\var{name} argument may be of the form \code{package.module}). First
1276check the modules dictionary if there's one there, and if not, create
Fred Drake659ebfa2000-04-03 15:42:13 +00001277a new one and insert in in the modules dictionary.
Guido van Rossuma096a2e1998-11-02 17:02:42 +00001278Warning: this function does not load or import the module; if the
1279module wasn't already loaded, you will get an empty module object.
1280Use \cfunction{PyImport_ImportModule()} or one of its variants to
1281import a module.
Fred Drake659ebfa2000-04-03 15:42:13 +00001282Return \NULL{} with an exception set on failure.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001283\end{cfuncdesc}
1284
Fred Drakec6fa34e1998-04-02 06:47:24 +00001285\begin{cfuncdesc}{PyObject*}{PyImport_ExecCodeModule}{char *name, PyObject *co}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001286Given a module name (possibly of the form \code{package.module}) and a
1287code object read from a Python bytecode file or obtained from the
Fred Drake53fb7721998-02-16 06:23:20 +00001288built-in function \function{compile()}\bifuncindex{compile}, load the
1289module. Return a new reference to the module object, or \NULL{} with
1290an exception set if an error occurred (the module may still be created
1291in this case). (This function would reload the module if it was
1292already imported.)
Guido van Rossum42cefd01997-10-05 15:27:29 +00001293\end{cfuncdesc}
1294
1295\begin{cfuncdesc}{long}{PyImport_GetMagicNumber}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00001296Return the magic number for Python bytecode files (a.k.a.
1297\file{.pyc} and \file{.pyo} files). The magic number should be
1298present in the first four bytes of the bytecode file, in little-endian
1299byte order.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001300\end{cfuncdesc}
1301
Fred Drakec6fa34e1998-04-02 06:47:24 +00001302\begin{cfuncdesc}{PyObject*}{PyImport_GetModuleDict}{}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001303Return the dictionary used for the module administration
1304(a.k.a. \code{sys.modules}). Note that this is a per-interpreter
1305variable.
1306\end{cfuncdesc}
1307
1308\begin{cfuncdesc}{void}{_PyImport_Init}{}
1309Initialize the import mechanism. For internal use only.
1310\end{cfuncdesc}
1311
1312\begin{cfuncdesc}{void}{PyImport_Cleanup}{}
1313Empty the module table. For internal use only.
1314\end{cfuncdesc}
1315
1316\begin{cfuncdesc}{void}{_PyImport_Fini}{}
1317Finalize the import mechanism. For internal use only.
1318\end{cfuncdesc}
1319
Fred Drakec6fa34e1998-04-02 06:47:24 +00001320\begin{cfuncdesc}{PyObject*}{_PyImport_FindExtension}{char *, char *}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001321For internal use only.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00001322\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001323
Fred Drakec6fa34e1998-04-02 06:47:24 +00001324\begin{cfuncdesc}{PyObject*}{_PyImport_FixupExtension}{char *, char *}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001325For internal use only.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00001326\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001327
Fred Drake1d158692000-06-18 05:21:21 +00001328\begin{cfuncdesc}{int}{PyImport_ImportFrozenModule}{char *name}
1329Load a frozen module named \var{name}. Return \code{1} for success,
1330\code{0} if the module is not found, and \code{-1} with an exception
1331set if the initialization failed. To access the imported module on a
1332successful load, use \cfunction{PyImport_ImportModule()}.
Fred Drakee058b4f1998-02-16 06:15:35 +00001333(Note the misnomer --- this function would reload the module if it was
Guido van Rossum42cefd01997-10-05 15:27:29 +00001334already imported.)
1335\end{cfuncdesc}
1336
Fred Drake659ebfa2000-04-03 15:42:13 +00001337\begin{ctypedesc}[_frozen]{struct _frozen}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001338This is the structure type definition for frozen module descriptors,
Fred Drakec6fa34e1998-04-02 06:47:24 +00001339as generated by the \program{freeze}\index{freeze utility} utility
1340(see \file{Tools/freeze/} in the Python source distribution). Its
Fred Drakee0d9a832000-09-01 05:30:00 +00001341definition, found in \file{Include/import.h}, is:
Fred Drakec6fa34e1998-04-02 06:47:24 +00001342
Guido van Rossum9faf4c51997-10-07 14:38:54 +00001343\begin{verbatim}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001344struct _frozen {
Fred Drake36fbe761997-10-13 18:18:33 +00001345 char *name;
1346 unsigned char *code;
1347 int size;
Guido van Rossum42cefd01997-10-05 15:27:29 +00001348};
Guido van Rossum9faf4c51997-10-07 14:38:54 +00001349\end{verbatim}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001350\end{ctypedesc}
1351
Fred Drakec6fa34e1998-04-02 06:47:24 +00001352\begin{cvardesc}{struct _frozen*}{PyImport_FrozenModules}
Fred Drakef8830d11998-04-23 14:06:01 +00001353This pointer is initialized to point to an array of \ctype{struct
Fred Drake659ebfa2000-04-03 15:42:13 +00001354_frozen} records, terminated by one whose members are all
1355\NULL{} or zero. When a frozen module is imported, it is searched in
1356this table. Third-party code could play tricks with this to provide a
Guido van Rossum42cefd01997-10-05 15:27:29 +00001357dynamically created collection of frozen modules.
1358\end{cvardesc}
1359
Fred Drakee0d9a832000-09-01 05:30:00 +00001360\begin{cfuncdesc}{int}{PyImport_AppendInittab}{char *name,
1361 void (*initfunc)(void)}
1362Add a single module to the existing table of built-in modules. This
1363is a convenience wrapper around \cfunction{PyImport_ExtendInittab()},
1364returning \code{-1} if the table could not be extended. The new
1365module can be imported by the name \var{name}, and uses the function
1366\var{initfunc} as the initialization function called on the first
1367attempted import. This should be called before
1368\cfunction{Py_Initialize()}.
1369\end{cfuncdesc}
1370
1371\begin{ctypedesc}[_inittab]{struct _inittab}
1372Structure describing a single entry in the list of built-in modules.
1373Each of these structures gives the name and initialization function
1374for a module built into the interpreter. Programs which embed Python
1375may use an array of these structures in conjunction with
1376\cfunction{PyImport_ExtendInittab()} to provide additional built-in
1377modules. The structure is defined in \file{Include/import.h} as:
1378
1379\begin{verbatim}
1380struct _inittab {
1381 char *name;
1382 void (*initfunc)(void);
1383};
1384\end{verbatim}
1385\end{ctypedesc}
1386
1387\begin{cfuncdesc}{int}{PyImport_ExtendInittab}{struct _inittab *newtab}
1388Add a collection of modules to the table of built-in modules. The
1389\var{newtab} array must end with a sentinel entry which contains
1390\NULL{} for the \member{name} field; failure to provide the sentinel
1391value can result in a memory fault. Returns \code{0} on success or
1392\code{-1} if insufficient memory could be allocated to extend the
1393internal table. In the event of failure, no modules are added to the
1394internal table. This should be called before
1395\cfunction{Py_Initialize()}.
1396\end{cfuncdesc}
1397
Guido van Rossum42cefd01997-10-05 15:27:29 +00001398
Fred Drakeefd146c1999-02-15 15:30:45 +00001399\chapter{Abstract Objects Layer \label{abstract}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001400
1401The functions in this chapter interact with Python objects regardless
1402of their type, or with wide classes of object types (e.g. all
1403numerical types, or all sequence types). When used on object types
Fred Drake659ebfa2000-04-03 15:42:13 +00001404for which they do not apply, they will raise a Python exception.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001405
Fred Drakeefd146c1999-02-15 15:30:45 +00001406\section{Object Protocol \label{object}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001407
1408\begin{cfuncdesc}{int}{PyObject_Print}{PyObject *o, FILE *fp, int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00001409Print an object \var{o}, on file \var{fp}. Returns \code{-1} on error.
1410The flags argument is used to enable certain printing options. The
1411only option currently supported is \constant{Py_PRINT_RAW}; if given,
1412the \function{str()} of the object is written instead of the
1413\function{repr()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001414\end{cfuncdesc}
1415
1416\begin{cfuncdesc}{int}{PyObject_HasAttrString}{PyObject *o, char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001417Returns \code{1} if \var{o} has the attribute \var{attr_name}, and
1418\code{0} otherwise. This is equivalent to the Python expression
1419\samp{hasattr(\var{o}, \var{attr_name})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001420This function always succeeds.
1421\end{cfuncdesc}
1422
Fred Drake659ebfa2000-04-03 15:42:13 +00001423\begin{cfuncdesc}{PyObject*}{PyObject_GetAttrString}{PyObject *o,
1424 char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001425Retrieve an attribute named \var{attr_name} from object \var{o}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001426Returns the attribute value on success, or \NULL{} on failure.
Fred Drakee058b4f1998-02-16 06:15:35 +00001427This is the equivalent of the Python expression
1428\samp{\var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001429\end{cfuncdesc}
1430
1431
1432\begin{cfuncdesc}{int}{PyObject_HasAttr}{PyObject *o, PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001433Returns \code{1} if \var{o} has the attribute \var{attr_name}, and
1434\code{0} otherwise. This is equivalent to the Python expression
1435\samp{hasattr(\var{o}, \var{attr_name})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001436This function always succeeds.
1437\end{cfuncdesc}
1438
1439
Fred Drake659ebfa2000-04-03 15:42:13 +00001440\begin{cfuncdesc}{PyObject*}{PyObject_GetAttr}{PyObject *o,
1441 PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001442Retrieve an attribute named \var{attr_name} from object \var{o}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001443Returns the attribute value on success, or \NULL{} on failure.
Fred Drakee058b4f1998-02-16 06:15:35 +00001444This is the equivalent of the Python expression
1445\samp{\var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001446\end{cfuncdesc}
1447
1448
1449\begin{cfuncdesc}{int}{PyObject_SetAttrString}{PyObject *o, char *attr_name, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001450Set the value of the attribute named \var{attr_name}, for object
1451\var{o}, to the value \var{v}. Returns \code{-1} on failure. This is
1452the equivalent of the Python statement \samp{\var{o}.\var{attr_name} =
1453\var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001454\end{cfuncdesc}
1455
1456
1457\begin{cfuncdesc}{int}{PyObject_SetAttr}{PyObject *o, PyObject *attr_name, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001458Set the value of the attribute named \var{attr_name}, for
1459object \var{o},
1460to the value \var{v}. Returns \code{-1} on failure. This is
1461the equivalent of the Python statement \samp{\var{o}.\var{attr_name} =
1462\var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001463\end{cfuncdesc}
1464
1465
1466\begin{cfuncdesc}{int}{PyObject_DelAttrString}{PyObject *o, char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001467Delete attribute named \var{attr_name}, for object \var{o}. Returns
1468\code{-1} on failure. This is the equivalent of the Python
1469statement: \samp{del \var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001470\end{cfuncdesc}
1471
1472
1473\begin{cfuncdesc}{int}{PyObject_DelAttr}{PyObject *o, PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001474Delete attribute named \var{attr_name}, for object \var{o}. Returns
1475\code{-1} on failure. This is the equivalent of the Python
1476statement \samp{del \var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001477\end{cfuncdesc}
1478
1479
1480\begin{cfuncdesc}{int}{PyObject_Cmp}{PyObject *o1, PyObject *o2, int *result}
Fred Drakee058b4f1998-02-16 06:15:35 +00001481Compare the values of \var{o1} and \var{o2} using a routine provided
1482by \var{o1}, if one exists, otherwise with a routine provided by
1483\var{o2}. The result of the comparison is returned in \var{result}.
1484Returns \code{-1} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001485statement\bifuncindex{cmp} \samp{\var{result} = cmp(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001486\end{cfuncdesc}
1487
1488
1489\begin{cfuncdesc}{int}{PyObject_Compare}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001490Compare the values of \var{o1} and \var{o2} using a routine provided
1491by \var{o1}, if one exists, otherwise with a routine provided by
1492\var{o2}. Returns the result of the comparison on success. On error,
1493the value returned is undefined; use \cfunction{PyErr_Occurred()} to
Fred Drake659ebfa2000-04-03 15:42:13 +00001494detect an error. This is equivalent to the Python
1495expression\bifuncindex{cmp} \samp{cmp(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001496\end{cfuncdesc}
1497
1498
1499\begin{cfuncdesc}{PyObject*}{PyObject_Repr}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001500Compute a string representation of object \var{o}. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001501string representation on success, \NULL{} on failure. This is
Fred Drakee058b4f1998-02-16 06:15:35 +00001502the equivalent of the Python expression \samp{repr(\var{o})}.
1503Called by the \function{repr()}\bifuncindex{repr} built-in function
1504and by reverse quotes.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001505\end{cfuncdesc}
1506
1507
1508\begin{cfuncdesc}{PyObject*}{PyObject_Str}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001509Compute a string representation of object \var{o}. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001510string representation on success, \NULL{} on failure. This is
Fred Drakee058b4f1998-02-16 06:15:35 +00001511the equivalent of the Python expression \samp{str(\var{o})}.
1512Called by the \function{str()}\bifuncindex{str} built-in function and
1513by the \keyword{print} statement.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001514\end{cfuncdesc}
1515
1516
Marc-André Lemburgad7c98e2001-01-17 17:09:53 +00001517\begin{cfuncdesc}{PyObject*}{PyObject_Unicode}{PyObject *o}
1518Compute a Unicode string representation of object \var{o}. Returns the
1519Unicode string representation on success, \NULL{} on failure. This is
1520the equivalent of the Python expression \samp{unistr(\var{o})}.
1521Called by the \function{unistr()}\bifuncindex{unistr} built-in function.
1522\end{cfuncdesc}
1523
Fred Drake58c8f9f2001-03-28 21:14:32 +00001524\begin{cfuncdesc}{int}{PyObject_IsInstance}{PyObject *inst, PyObject *cls}
1525Return \code{1} if \var{inst} is an instance of the class \var{cls} or
1526a subclass of \var{cls}. If \var{cls} is a type object rather than a
1527class object, \cfunction{PyObject_IsInstance()} returns \code{1} if
1528\var{inst} is of type \var{cls}. If \var{inst} is not a class
1529instance and \var{cls} is neither a type object or class object,
1530\var{inst} must have a \member{__class__} attribute --- the class
1531relationship of the value of that attribute with \var{cls} will be
1532used to determine the result of this function.
1533\versionadded{2.1}
1534\end{cfuncdesc}
1535
1536Subclass determination is done in a fairly straightforward way, but
1537includes a wrinkle that implementors of extensions to the class system
1538may want to be aware of. If \class{A} and \class{B} are class
1539objects, \class{B} is a subclass of \class{A} if it inherits from
1540\class{A} either directly or indirectly. If either is not a class
1541object, a more general mechanism is used to determine the class
1542relationship of the two objects. When testing if \var{B} is a
1543subclass of \var{A}, if \var{A} is \var{B},
1544\cfunction{PyObject_IsSubclass()} returns true. If \var{A} and
1545\var{B} are different objects, \var{B}'s \member{__bases__} attribute
1546is searched in a depth-first fashion for \var{A} --- the presence of
1547the \member{__bases__} attribute is considered sufficient for this
1548determination.
1549
1550\begin{cfuncdesc}{int}{PyObject_IsSubclass}{PyObject *derived,
1551 PyObject *cls}
1552Returns \code{1} if the class \var{derived} is identical to or derived
1553from the class \var{cls}, otherwise returns \code{0}. In case of an
1554error, returns \code{-1}. If either \var{derived} or \var{cls} is not
1555an actual class object, this function uses the generic algorithm
1556described above.
1557\versionadded{2.1}
1558\end{cfuncdesc}
1559
Marc-André Lemburgad7c98e2001-01-17 17:09:53 +00001560
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001561\begin{cfuncdesc}{int}{PyCallable_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001562Determine if the object \var{o} is callable. Return \code{1} if the
Fred Drakee058b4f1998-02-16 06:15:35 +00001563object is callable and \code{0} otherwise.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001564This function always succeeds.
1565\end{cfuncdesc}
1566
1567
Fred Drake659ebfa2000-04-03 15:42:13 +00001568\begin{cfuncdesc}{PyObject*}{PyObject_CallObject}{PyObject *callable_object,
1569 PyObject *args}
Fred Drakee058b4f1998-02-16 06:15:35 +00001570Call a callable Python object \var{callable_object}, with
1571arguments given by the tuple \var{args}. If no arguments are
Fred Drake659ebfa2000-04-03 15:42:13 +00001572needed, then \var{args} may be \NULL{}. Returns the result of the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001573call on success, or \NULL{} on failure. This is the equivalent
Fred Drake5566c1c2001-01-19 22:48:33 +00001574of the Python expression \samp{apply(\var{callable_object}, \var{args})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001575\bifuncindex{apply}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001576\end{cfuncdesc}
1577
Fred Drake5566c1c2001-01-19 22:48:33 +00001578\begin{cfuncdesc}{PyObject*}{PyObject_CallFunction}{PyObject *callable_object,
1579 char *format, ...}
Fred Drakee058b4f1998-02-16 06:15:35 +00001580Call a callable Python object \var{callable_object}, with a
Fred Drake659ebfa2000-04-03 15:42:13 +00001581variable number of C arguments. The C arguments are described
Fred Drakee058b4f1998-02-16 06:15:35 +00001582using a \cfunction{Py_BuildValue()} style format string. The format may
1583be \NULL{}, indicating that no arguments are provided. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001584result of the call on success, or \NULL{} on failure. This is
Fred Drake5566c1c2001-01-19 22:48:33 +00001585the equivalent of the Python expression \samp{apply(\var{callable_object},
Fred Drake659ebfa2000-04-03 15:42:13 +00001586\var{args})}.\bifuncindex{apply}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001587\end{cfuncdesc}
1588
1589
Fred Drake5566c1c2001-01-19 22:48:33 +00001590\begin{cfuncdesc}{PyObject*}{PyObject_CallMethod}{PyObject *o,
1591 char *method, char *format, ...}
Fred Drakee058b4f1998-02-16 06:15:35 +00001592Call the method named \var{m} of object \var{o} with a variable number
Fred Drake659ebfa2000-04-03 15:42:13 +00001593of C arguments. The C arguments are described by a
Fred Drakee058b4f1998-02-16 06:15:35 +00001594\cfunction{Py_BuildValue()} format string. The format may be \NULL{},
1595indicating that no arguments are provided. Returns the result of the
1596call on success, or \NULL{} on failure. This is the equivalent of the
1597Python expression \samp{\var{o}.\var{method}(\var{args})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001598Note that special method names, such as \method{__add__()},
1599\method{__getitem__()}, and so on are not supported. The specific
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001600abstract-object routines for these must be used.
1601\end{cfuncdesc}
1602
1603
1604\begin{cfuncdesc}{int}{PyObject_Hash}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001605Compute and return the hash value of an object \var{o}. On
1606failure, return \code{-1}. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001607expression \samp{hash(\var{o})}.\bifuncindex{hash}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001608\end{cfuncdesc}
1609
1610
1611\begin{cfuncdesc}{int}{PyObject_IsTrue}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001612Returns \code{1} if the object \var{o} is considered to be true, and
1613\code{0} otherwise. This is equivalent to the Python expression
1614\samp{not not \var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001615This function always succeeds.
1616\end{cfuncdesc}
1617
1618
1619\begin{cfuncdesc}{PyObject*}{PyObject_Type}{PyObject *o}
1620On success, returns a type object corresponding to the object
Fred Drakee058b4f1998-02-16 06:15:35 +00001621type of object \var{o}. On failure, returns \NULL{}. This is
1622equivalent to the Python expression \samp{type(\var{o})}.
Fred Drake53fb7721998-02-16 06:23:20 +00001623\bifuncindex{type}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001624\end{cfuncdesc}
1625
1626\begin{cfuncdesc}{int}{PyObject_Length}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001627Return the length of object \var{o}. If the object \var{o} provides
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001628both sequence and mapping protocols, the sequence length is
Fred Drake659ebfa2000-04-03 15:42:13 +00001629returned. On error, \code{-1} is returned. This is the equivalent
1630to the Python expression \samp{len(\var{o})}.\bifuncindex{len}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001631\end{cfuncdesc}
1632
1633
1634\begin{cfuncdesc}{PyObject*}{PyObject_GetItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001635Return element of \var{o} corresponding to the object \var{key} or
1636\NULL{} on failure. This is the equivalent of the Python expression
1637\samp{\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001638\end{cfuncdesc}
1639
1640
1641\begin{cfuncdesc}{int}{PyObject_SetItem}{PyObject *o, PyObject *key, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001642Map the object \var{key} to the value \var{v}.
1643Returns \code{-1} on failure. This is the equivalent
1644of the Python statement \samp{\var{o}[\var{key}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001645\end{cfuncdesc}
1646
1647
Guido van Rossumd1dbf631999-01-22 20:10:49 +00001648\begin{cfuncdesc}{int}{PyObject_DelItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001649Delete the mapping for \var{key} from \var{o}. Returns \code{-1} on
1650failure. This is the equivalent of the Python statement \samp{del
1651\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001652\end{cfuncdesc}
1653
Andrew M. Kuchling8c46b302000-07-13 23:58:16 +00001654\begin{cfuncdesc}{int}{PyObject_AsFileDescriptor}{PyObject *o}
1655Derives a file-descriptor from a Python object. If the object
1656is an integer or long integer, its value is returned. If not, the
1657object's \method{fileno()} method is called if it exists; the method
1658must return an integer or long integer, which is returned as the file
1659descriptor value. Returns \code{-1} on failure.
1660\end{cfuncdesc}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001661
Fred Drakeefd146c1999-02-15 15:30:45 +00001662\section{Number Protocol \label{number}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001663
1664\begin{cfuncdesc}{int}{PyNumber_Check}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001665Returns \code{1} if the object \var{o} provides numeric protocols, and
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001666false otherwise.
1667This function always succeeds.
1668\end{cfuncdesc}
1669
1670
1671\begin{cfuncdesc}{PyObject*}{PyNumber_Add}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001672Returns the result of adding \var{o1} and \var{o2}, or \NULL{} on
1673failure. This is the equivalent of the Python expression
1674\samp{\var{o1} + \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001675\end{cfuncdesc}
1676
1677
1678\begin{cfuncdesc}{PyObject*}{PyNumber_Subtract}{PyObject *o1, PyObject *o2}
Fred Drake659ebfa2000-04-03 15:42:13 +00001679Returns the result of subtracting \var{o2} from \var{o1}, or
1680\NULL{} on failure. This is the equivalent of the Python expression
Fred Drakee058b4f1998-02-16 06:15:35 +00001681\samp{\var{o1} - \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001682\end{cfuncdesc}
1683
1684
1685\begin{cfuncdesc}{PyObject*}{PyNumber_Multiply}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001686Returns the result of multiplying \var{o1} and \var{o2}, or \NULL{} on
1687failure. This is the equivalent of the Python expression
1688\samp{\var{o1} * \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001689\end{cfuncdesc}
1690
1691
1692\begin{cfuncdesc}{PyObject*}{PyNumber_Divide}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001693Returns the result of dividing \var{o1} by \var{o2}, or \NULL{} on
1694failure.
1695This is the equivalent of the Python expression \samp{\var{o1} /
1696\var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001697\end{cfuncdesc}
1698
1699
1700\begin{cfuncdesc}{PyObject*}{PyNumber_Remainder}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001701Returns the remainder of dividing \var{o1} by \var{o2}, or \NULL{} on
1702failure. This is the equivalent of the Python expression
Fred Drake659ebfa2000-04-03 15:42:13 +00001703\samp{\var{o1} \%\ \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001704\end{cfuncdesc}
1705
1706
1707\begin{cfuncdesc}{PyObject*}{PyNumber_Divmod}{PyObject *o1, PyObject *o2}
Fred Drake53fb7721998-02-16 06:23:20 +00001708See the built-in function \function{divmod()}\bifuncindex{divmod}.
1709Returns \NULL{} on failure. This is the equivalent of the Python
1710expression \samp{divmod(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001711\end{cfuncdesc}
1712
1713
1714\begin{cfuncdesc}{PyObject*}{PyNumber_Power}{PyObject *o1, PyObject *o2, PyObject *o3}
Fred Drake53fb7721998-02-16 06:23:20 +00001715See the built-in function \function{pow()}\bifuncindex{pow}. Returns
1716\NULL{} on failure. This is the equivalent of the Python expression
Fred Drakee058b4f1998-02-16 06:15:35 +00001717\samp{pow(\var{o1}, \var{o2}, \var{o3})}, where \var{o3} is optional.
Fred Drake659ebfa2000-04-03 15:42:13 +00001718If \var{o3} is to be ignored, pass \cdata{Py_None} in its place
1719(passing \NULL{} for \var{o3} would cause an illegal memory access).
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001720\end{cfuncdesc}
1721
1722
1723\begin{cfuncdesc}{PyObject*}{PyNumber_Negative}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001724Returns the negation of \var{o} on success, or \NULL{} on failure.
1725This is the equivalent of the Python expression \samp{-\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001726\end{cfuncdesc}
1727
1728
1729\begin{cfuncdesc}{PyObject*}{PyNumber_Positive}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001730Returns \var{o} on success, or \NULL{} on failure.
1731This is the equivalent of the Python expression \samp{+\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001732\end{cfuncdesc}
1733
1734
1735\begin{cfuncdesc}{PyObject*}{PyNumber_Absolute}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001736Returns the absolute value of \var{o}, or \NULL{} on failure. This is
1737the equivalent of the Python expression \samp{abs(\var{o})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001738\bifuncindex{abs}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001739\end{cfuncdesc}
1740
1741
1742\begin{cfuncdesc}{PyObject*}{PyNumber_Invert}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001743Returns the bitwise negation of \var{o} on success, or \NULL{} on
1744failure. This is the equivalent of the Python expression
1745\samp{\~\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001746\end{cfuncdesc}
1747
1748
1749\begin{cfuncdesc}{PyObject*}{PyNumber_Lshift}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001750Returns the result of left shifting \var{o1} by \var{o2} on success,
1751or \NULL{} on failure. This is the equivalent of the Python
Fred Draked20d8b32001-04-13 14:52:39 +00001752expression \samp{\var{o1} <\code{<} \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001753\end{cfuncdesc}
1754
1755
1756\begin{cfuncdesc}{PyObject*}{PyNumber_Rshift}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001757Returns the result of right shifting \var{o1} by \var{o2} on success,
1758or \NULL{} on failure. This is the equivalent of the Python
Fred Draked20d8b32001-04-13 14:52:39 +00001759expression \samp{\var{o1} >\code{>} \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001760\end{cfuncdesc}
1761
1762
1763\begin{cfuncdesc}{PyObject*}{PyNumber_And}{PyObject *o1, PyObject *o2}
Fred Drake7740a012000-09-12 20:27:05 +00001764Returns the ``bitwise and'' of \var{o2} and \var{o2} on success and
1765\NULL{} on failure. This is the equivalent of the Python expression
Fred Drake5566c1c2001-01-19 22:48:33 +00001766\samp{\var{o1} \&\ \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001767\end{cfuncdesc}
1768
1769
1770\begin{cfuncdesc}{PyObject*}{PyNumber_Xor}{PyObject *o1, PyObject *o2}
Fred Drake7740a012000-09-12 20:27:05 +00001771Returns the ``bitwise exclusive or'' of \var{o1} by \var{o2} on success,
Fred Drakee058b4f1998-02-16 06:15:35 +00001772or \NULL{} on failure. This is the equivalent of the Python
Fred Drake755c23d2001-07-14 03:05:53 +00001773expression \samp{\var{o1} \textasciicircum{} \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001774\end{cfuncdesc}
1775
1776\begin{cfuncdesc}{PyObject*}{PyNumber_Or}{PyObject *o1, PyObject *o2}
Fred Drake7740a012000-09-12 20:27:05 +00001777Returns the ``bitwise or'' of \var{o1} and \var{o2} on success, or
1778\NULL{} on failure. This is the equivalent of the Python expression
1779\samp{\var{o1} | \var{o2}}.
1780\end{cfuncdesc}
1781
1782
1783\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceAdd}{PyObject *o1, PyObject *o2}
1784Returns the result of adding \var{o1} and \var{o2}, or \NULL{} on failure.
1785The operation is done \emph{in-place} when \var{o1} supports it. This is the
1786equivalent of the Python expression \samp{\var{o1} += \var{o2}}.
1787\end{cfuncdesc}
1788
1789
1790\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceSubtract}{PyObject *o1, PyObject *o2}
1791Returns the result of subtracting \var{o2} from \var{o1}, or
1792\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1793supports it. This is the equivalent of the Python expression \samp{\var{o1}
1794-= \var{o2}}.
1795\end{cfuncdesc}
1796
1797
1798\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceMultiply}{PyObject *o1, PyObject *o2}
1799Returns the result of multiplying \var{o1} and \var{o2}, or \NULL{} on
1800failure. The operation is done \emph{in-place} when \var{o1} supports it.
1801This is the equivalent of the Python expression \samp{\var{o1} *= \var{o2}}.
1802\end{cfuncdesc}
1803
1804
1805\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceDivide}{PyObject *o1, PyObject *o2}
1806Returns the result of dividing \var{o1} by \var{o2}, or \NULL{} on failure.
1807The operation is done \emph{in-place} when \var{o1} supports it. This is the
1808equivalent of the Python expression \samp{\var{o1} /= \var{o2}}.
1809\end{cfuncdesc}
1810
1811
1812\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceRemainder}{PyObject *o1, PyObject *o2}
1813Returns the remainder of dividing \var{o1} by \var{o2}, or \NULL{} on
1814failure. The operation is done \emph{in-place} when \var{o1} supports it.
1815This is the equivalent of the Python expression \samp{\var{o1} \%= \var{o2}}.
1816\end{cfuncdesc}
1817
1818
1819\begin{cfuncdesc}{PyObject*}{PyNumber_InPlacePower}{PyObject *o1, PyObject *o2, PyObject *o3}
1820See the built-in function \function{pow()}\bifuncindex{pow}. Returns
1821\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1822supports it. This is the equivalent of the Python expression \samp{\var{o1}
1823**= \var{o2}} when o3 is \cdata{Py_None}, or an in-place variant of
Fred Drake5566c1c2001-01-19 22:48:33 +00001824\samp{pow(\var{o1}, \var{o2}, \var{o3})} otherwise. If \var{o3} is to be
Fred Drake7740a012000-09-12 20:27:05 +00001825ignored, pass \cdata{Py_None} in its place (passing \NULL{} for \var{o3}
1826would cause an illegal memory access).
1827\end{cfuncdesc}
1828
1829\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceLshift}{PyObject *o1, PyObject *o2}
1830Returns the result of left shifting \var{o1} by \var{o2} on success, or
1831\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1832supports it. This is the equivalent of the Python expression \samp{\var{o1}
Fred Draked20d8b32001-04-13 14:52:39 +00001833<\code{<=} \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001834\end{cfuncdesc}
1835
1836
1837\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceRshift}{PyObject *o1, PyObject *o2}
1838Returns the result of right shifting \var{o1} by \var{o2} on success, or
1839\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1840supports it. This is the equivalent of the Python expression \samp{\var{o1}
Fred Draked20d8b32001-04-13 14:52:39 +00001841>\code{>=} \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001842\end{cfuncdesc}
1843
1844
1845\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceAnd}{PyObject *o1, PyObject *o2}
Fred Drake5566c1c2001-01-19 22:48:33 +00001846Returns the ``bitwise and'' of \var{o1} and \var{o2} on success
1847and \NULL{} on failure. The operation is done \emph{in-place} when
1848\var{o1} supports it. This is the equivalent of the Python expression
1849\samp{\var{o1} \&= \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001850\end{cfuncdesc}
1851
1852
1853\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceXor}{PyObject *o1, PyObject *o2}
1854Returns the ``bitwise exclusive or'' of \var{o1} by \var{o2} on success, or
1855\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1856supports it. This is the equivalent of the Python expression \samp{\var{o1}
Fred Drake377fb1e2001-07-14 03:01:48 +00001857\textasciicircum= \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001858\end{cfuncdesc}
1859
1860\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceOr}{PyObject *o1, PyObject *o2}
1861Returns the ``bitwise or'' of \var{o1} and \var{o2} on success, or \NULL{}
1862on failure. The operation is done \emph{in-place} when \var{o1} supports
1863it. This is the equivalent of the Python expression \samp{\var{o1} |=
1864\var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001865\end{cfuncdesc}
1866
Fred Drakec0e6c5b2000-09-22 18:17:49 +00001867\begin{cfuncdesc}{int}{PyNumber_Coerce}{PyObject **p1, PyObject **p2}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001868This function takes the addresses of two variables of type
Fred Drake659ebfa2000-04-03 15:42:13 +00001869\ctype{PyObject*}. If the objects pointed to by \code{*\var{p1}} and
1870\code{*\var{p2}} have the same type, increment their reference count
1871and return \code{0} (success). If the objects can be converted to a
1872common numeric type, replace \code{*p1} and \code{*p2} by their
1873converted value (with 'new' reference counts), and return \code{0}.
1874If no conversion is possible, or if some other error occurs, return
1875\code{-1} (failure) and don't increment the reference counts. The
1876call \code{PyNumber_Coerce(\&o1, \&o2)} is equivalent to the Python
1877statement \samp{\var{o1}, \var{o2} = coerce(\var{o1}, \var{o2})}.
1878\bifuncindex{coerce}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001879\end{cfuncdesc}
1880
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001881\begin{cfuncdesc}{PyObject*}{PyNumber_Int}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001882Returns the \var{o} converted to an integer object on success, or
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001883\NULL{} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001884expression \samp{int(\var{o})}.\bifuncindex{int}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001885\end{cfuncdesc}
1886
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001887\begin{cfuncdesc}{PyObject*}{PyNumber_Long}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001888Returns the \var{o} converted to a long integer object on success,
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001889or \NULL{} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001890expression \samp{long(\var{o})}.\bifuncindex{long}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001891\end{cfuncdesc}
1892
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001893\begin{cfuncdesc}{PyObject*}{PyNumber_Float}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001894Returns the \var{o} converted to a float object on success, or
1895\NULL{} on failure. This is the equivalent of the Python expression
1896\samp{float(\var{o})}.\bifuncindex{float}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001897\end{cfuncdesc}
1898
1899
Fred Drakeefd146c1999-02-15 15:30:45 +00001900\section{Sequence Protocol \label{sequence}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001901
1902\begin{cfuncdesc}{int}{PySequence_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001903Return \code{1} if the object provides sequence protocol, and
1904\code{0} otherwise. This function always succeeds.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001905\end{cfuncdesc}
1906
Fred Drakec6a3cb42001-04-04 01:25:17 +00001907\begin{cfuncdesc}{int}{PySequence_Size}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001908Returns the number of objects in sequence \var{o} on success, and
1909\code{-1} on failure. For objects that do not provide sequence
1910protocol, this is equivalent to the Python expression
1911\samp{len(\var{o})}.\bifuncindex{len}
1912\end{cfuncdesc}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001913
Fred Drakec6a3cb42001-04-04 01:25:17 +00001914\begin{cfuncdesc}{int}{PySequence_Length}{PyObject *o}
1915Alternate name for \cfunction{PySequence_Size()}.
1916\end{cfuncdesc}
1917
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001918\begin{cfuncdesc}{PyObject*}{PySequence_Concat}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001919Return the concatenation of \var{o1} and \var{o2} on success, and \NULL{} on
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001920failure. This is the equivalent of the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00001921expression \samp{\var{o1} + \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001922\end{cfuncdesc}
1923
1924
1925\begin{cfuncdesc}{PyObject*}{PySequence_Repeat}{PyObject *o, int count}
Fred Drake659ebfa2000-04-03 15:42:13 +00001926Return the result of repeating sequence object
1927\var{o} \var{count} times, or \NULL{} on failure. This is the
1928equivalent of the Python expression \samp{\var{o} * \var{count}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001929\end{cfuncdesc}
1930
Fred Drake7740a012000-09-12 20:27:05 +00001931\begin{cfuncdesc}{PyObject*}{PySequence_InPlaceConcat}{PyObject *o1, PyObject *o2}
1932Return the concatenation of \var{o1} and \var{o2} on success, and \NULL{} on
1933failure. The operation is done \emph{in-place} when \var{o1} supports it.
1934This is the equivalent of the Python expression \samp{\var{o1} += \var{o2}}.
1935\end{cfuncdesc}
1936
1937
1938\begin{cfuncdesc}{PyObject*}{PySequence_InPlaceRepeat}{PyObject *o, int count}
1939Return the result of repeating sequence object \var{o} \var{count} times, or
1940\NULL{} on failure. The operation is done \emph{in-place} when \var{o}
1941supports it. This is the equivalent of the Python expression \samp{\var{o}
1942*= \var{count}}.
1943\end{cfuncdesc}
1944
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001945
1946\begin{cfuncdesc}{PyObject*}{PySequence_GetItem}{PyObject *o, int i}
Fred Drakee058b4f1998-02-16 06:15:35 +00001947Return the \var{i}th element of \var{o}, or \NULL{} on failure. This
1948is the equivalent of the Python expression \samp{\var{o}[\var{i}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001949\end{cfuncdesc}
1950
1951
1952\begin{cfuncdesc}{PyObject*}{PySequence_GetSlice}{PyObject *o, int i1, int i2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001953Return the slice of sequence object \var{o} between \var{i1} and
1954\var{i2}, or \NULL{} on failure. This is the equivalent of the Python
1955expression \samp{\var{o}[\var{i1}:\var{i2}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001956\end{cfuncdesc}
1957
1958
1959\begin{cfuncdesc}{int}{PySequence_SetItem}{PyObject *o, int i, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001960Assign object \var{v} to the \var{i}th element of \var{o}.
1961Returns \code{-1} on failure. This is the equivalent of the Python
1962statement \samp{\var{o}[\var{i}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001963\end{cfuncdesc}
1964
1965\begin{cfuncdesc}{int}{PySequence_DelItem}{PyObject *o, int i}
Fred Drake5566c1c2001-01-19 22:48:33 +00001966Delete the \var{i}th element of object \var{o}. Returns
Fred Drakee058b4f1998-02-16 06:15:35 +00001967\code{-1} on failure. This is the equivalent of the Python
1968statement \samp{del \var{o}[\var{i}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001969\end{cfuncdesc}
1970
Fred Drake659ebfa2000-04-03 15:42:13 +00001971\begin{cfuncdesc}{int}{PySequence_SetSlice}{PyObject *o, int i1,
1972 int i2, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001973Assign the sequence object \var{v} to the slice in sequence
1974object \var{o} from \var{i1} to \var{i2}. This is the equivalent of
1975the Python statement \samp{\var{o}[\var{i1}:\var{i2}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001976\end{cfuncdesc}
1977
1978\begin{cfuncdesc}{int}{PySequence_DelSlice}{PyObject *o, int i1, int i2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001979Delete the slice in sequence object \var{o} from \var{i1} to \var{i2}.
1980Returns \code{-1} on failure. This is the equivalent of the Python
1981statement \samp{del \var{o}[\var{i1}:\var{i2}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001982\end{cfuncdesc}
1983
1984\begin{cfuncdesc}{PyObject*}{PySequence_Tuple}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001985Returns the \var{o} as a tuple on success, and \NULL{} on failure.
Fred Drake659ebfa2000-04-03 15:42:13 +00001986This is equivalent to the Python expression \samp{tuple(\var{o})}.
1987\bifuncindex{tuple}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001988\end{cfuncdesc}
1989
1990\begin{cfuncdesc}{int}{PySequence_Count}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00001991Return the number of occurrences of \var{value} in \var{o}, that is,
1992return the number of keys for which \code{\var{o}[\var{key}] ==
1993\var{value}}. On failure, return \code{-1}. This is equivalent to
1994the Python expression \samp{\var{o}.count(\var{value})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001995\end{cfuncdesc}
1996
Fred Drake659ebfa2000-04-03 15:42:13 +00001997\begin{cfuncdesc}{int}{PySequence_Contains}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00001998Determine if \var{o} contains \var{value}. If an item in \var{o} is
1999equal to \var{value}, return \code{1}, otherwise return \code{0}. On
2000error, return \code{-1}. This is equivalent to the Python expression
2001\samp{\var{value} in \var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002002\end{cfuncdesc}
2003
2004\begin{cfuncdesc}{int}{PySequence_Index}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00002005Return the first index \var{i} for which \code{\var{o}[\var{i}] ==
2006\var{value}}. On error, return \code{-1}. This is equivalent to
2007the Python expression \samp{\var{o}.index(\var{value})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002008\end{cfuncdesc}
2009
Fred Drakea8455ab2000-06-16 19:58:42 +00002010\begin{cfuncdesc}{PyObject*}{PySequence_List}{PyObject *o}
2011Return a list object with the same contents as the arbitrary sequence
2012\var{o}. The returned list is guaranteed to be new.
2013\end{cfuncdesc}
2014
2015\begin{cfuncdesc}{PyObject*}{PySequence_Tuple}{PyObject *o}
2016Return a tuple object with the same contents as the arbitrary sequence
2017\var{o}. If \var{o} is a tuple, a new reference will be returned,
2018otherwise a tuple will be constructed with the appropriate contents.
2019\end{cfuncdesc}
2020
Fred Drakef39ed671998-02-26 22:01:23 +00002021
Fred Drake81cccb72000-09-12 15:22:05 +00002022\begin{cfuncdesc}{PyObject*}{PySequence_Fast}{PyObject *o, const char *m}
2023Returns the sequence \var{o} as a tuple, unless it is already a
2024tuple or list, in which case \var{o} is returned. Use
2025\cfunction{PySequence_Fast_GET_ITEM()} to access the members of the
2026result. Returns \NULL{} on failure. If the object is not a sequence,
2027raises \exception{TypeError} with \var{m} as the message text.
2028\end{cfuncdesc}
2029
2030\begin{cfuncdesc}{PyObject*}{PySequence_Fast_GET_ITEM}{PyObject *o, int i}
2031Return the \var{i}th element of \var{o}, assuming that \var{o} was
2032returned by \cfunction{PySequence_Fast()}, and that \var{i} is within
2033bounds. The caller is expected to get the length of the sequence by
Fred Drake96a2a802001-05-29 18:51:41 +00002034calling \cfunction{PySequence_Size()} on \var{o}, since lists and tuples
Fred Drake81cccb72000-09-12 15:22:05 +00002035are guaranteed to always return their true length.
2036\end{cfuncdesc}
2037
2038
Fred Drakeefd146c1999-02-15 15:30:45 +00002039\section{Mapping Protocol \label{mapping}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002040
2041\begin{cfuncdesc}{int}{PyMapping_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00002042Return \code{1} if the object provides mapping protocol, and
2043\code{0} otherwise. This function always succeeds.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002044\end{cfuncdesc}
2045
2046
2047\begin{cfuncdesc}{int}{PyMapping_Length}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00002048Returns the number of keys in object \var{o} on success, and
2049\code{-1} on failure. For objects that do not provide mapping
2050protocol, this is equivalent to the Python expression
2051\samp{len(\var{o})}.\bifuncindex{len}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002052\end{cfuncdesc}
2053
2054
2055\begin{cfuncdesc}{int}{PyMapping_DelItemString}{PyObject *o, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002056Remove the mapping for object \var{key} from the object \var{o}.
2057Return \code{-1} on failure. This is equivalent to
2058the Python statement \samp{del \var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002059\end{cfuncdesc}
2060
2061
2062\begin{cfuncdesc}{int}{PyMapping_DelItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002063Remove the mapping for object \var{key} from the object \var{o}.
2064Return \code{-1} on failure. This is equivalent to
2065the Python statement \samp{del \var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002066\end{cfuncdesc}
2067
2068
2069\begin{cfuncdesc}{int}{PyMapping_HasKeyString}{PyObject *o, char *key}
Fred Drake659ebfa2000-04-03 15:42:13 +00002070On success, return \code{1} if the mapping object has the key
2071\var{key} and \code{0} otherwise. This is equivalent to the Python
2072expression \samp{\var{o}.has_key(\var{key})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002073This function always succeeds.
2074\end{cfuncdesc}
2075
2076
2077\begin{cfuncdesc}{int}{PyMapping_HasKey}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002078Return \code{1} if the mapping object has the key \var{key} and
2079\code{0} otherwise. This is equivalent to the Python expression
2080\samp{\var{o}.has_key(\var{key})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002081This function always succeeds.
2082\end{cfuncdesc}
2083
2084
2085\begin{cfuncdesc}{PyObject*}{PyMapping_Keys}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002086On success, return a list of the keys in object \var{o}. On
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002087failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00002088expression \samp{\var{o}.keys()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002089\end{cfuncdesc}
2090
2091
2092\begin{cfuncdesc}{PyObject*}{PyMapping_Values}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002093On success, return a list of the values in object \var{o}. On
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002094failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00002095expression \samp{\var{o}.values()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002096\end{cfuncdesc}
2097
2098
2099\begin{cfuncdesc}{PyObject*}{PyMapping_Items}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002100On success, return a list of the items in object \var{o}, where
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002101each item is a tuple containing a key-value pair. On
2102failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00002103expression \samp{\var{o}.items()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002104\end{cfuncdesc}
2105
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002106
2107\begin{cfuncdesc}{PyObject*}{PyMapping_GetItemString}{PyObject *o, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002108Return element of \var{o} corresponding to the object \var{key} or
2109\NULL{} on failure. This is the equivalent of the Python expression
2110\samp{\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002111\end{cfuncdesc}
2112
Fred Drakedbcaeda2001-05-07 17:42:18 +00002113\begin{cfuncdesc}{int}{PyMapping_SetItemString}{PyObject *o, char *key,
2114 PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00002115Map the object \var{key} to the value \var{v} in object \var{o}.
2116Returns \code{-1} on failure. This is the equivalent of the Python
2117statement \samp{\var{o}[\var{key}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002118\end{cfuncdesc}
2119
2120
Fred Drakedbcaeda2001-05-07 17:42:18 +00002121\section{Iterator Protocol \label{iterator}}
2122
Fred Drakea8e08272001-05-07 17:47:07 +00002123\versionadded{2.2}
2124
Fred Drakedbcaeda2001-05-07 17:42:18 +00002125There are only a couple of functions specifically for working with
2126iterators.
2127
2128\begin{cfuncdesc}{int}{PyIter_Check}{PyObject *o}
2129 Return true if the object \var{o} supports the iterator protocol.
2130\end{cfuncdesc}
2131
2132\begin{cfuncdesc}{PyObject*}{PyIter_Next}{PyObject *o}
2133 Return the next value from the iteration \var{o}. If the object is
2134 an iterator, this retrieves the next value from the iteration, and
2135 returns \NULL{} with no exception set if there are no remaining
2136 items. If the object is not an iterator, \exception{TypeError} is
2137 raised, or if there is an error in retrieving the item, returns
2138 \NULL{} and passes along the exception.
2139\end{cfuncdesc}
2140
2141To write a loop which iterates over an iterator, the C code should
2142look something like this:
2143
2144\begin{verbatim}
2145PyObject *iterator = ...;
2146PyObject *item;
2147
2148while (item = PyIter_Next(iter)) {
2149 /* do something with item */
2150}
2151if (PyErr_Occurred()) {
2152 /* propogate error */
2153}
2154else {
2155 /* continue doing useful work */
2156}
2157\end{verbatim}
2158
2159
Fred Drakeefd146c1999-02-15 15:30:45 +00002160\chapter{Concrete Objects Layer \label{concrete}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002161
2162The functions in this chapter are specific to certain Python object
2163types. Passing them an object of the wrong type is not a good idea;
2164if you receive an object from a Python program and you are not sure
2165that it has the right type, you must perform a type check first;
Fred Drake5566c1c2001-01-19 22:48:33 +00002166for example, to check that an object is a dictionary, use
Fred Drakee5bf8b21998-02-12 21:22:28 +00002167\cfunction{PyDict_Check()}. The chapter is structured like the
2168``family tree'' of Python object types.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002169
Fred Drake89024422000-10-23 16:00:54 +00002170\strong{Warning:}
2171While the functions described in this chapter carefully check the type
2172of the objects which are passed in, many of them do not check for
2173\NULL{} being passed instead of a valid object. Allowing \NULL{} to
2174be passed in can cause memory access violations and immediate
2175termination of the interpreter.
2176
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002177
Fred Drakeefd146c1999-02-15 15:30:45 +00002178\section{Fundamental Objects \label{fundamental}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002179
Fred Drakee5bf8b21998-02-12 21:22:28 +00002180This section describes Python type objects and the singleton object
2181\code{None}.
2182
2183
Fred Drakeefd146c1999-02-15 15:30:45 +00002184\subsection{Type Objects \label{typeObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002185
Fred Drake659ebfa2000-04-03 15:42:13 +00002186\obindex{type}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002187\begin{ctypedesc}{PyTypeObject}
Fred Drake659ebfa2000-04-03 15:42:13 +00002188The C structure of the objects used to describe built-in types.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002189\end{ctypedesc}
2190
Fred Drake659ebfa2000-04-03 15:42:13 +00002191\begin{cvardesc}{PyObject*}{PyType_Type}
Fred Drakeefd146c1999-02-15 15:30:45 +00002192This is the type object for type objects; it is the same object as
2193\code{types.TypeType} in the Python layer.
Fred Drake659ebfa2000-04-03 15:42:13 +00002194\withsubitem{(in module types)}{\ttindex{TypeType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002195\end{cvardesc}
2196
Fred Drake659ebfa2000-04-03 15:42:13 +00002197\begin{cfuncdesc}{int}{PyType_Check}{PyObject *o}
2198Returns true is the object \var{o} is a type object.
2199\end{cfuncdesc}
2200
2201\begin{cfuncdesc}{int}{PyType_HasFeature}{PyObject *o, int feature}
2202Returns true if the type object \var{o} sets the feature
Fred Drakef0e08ef2001-02-03 01:11:26 +00002203\var{feature}. Type features are denoted by single bit flags.
Fred Drake659ebfa2000-04-03 15:42:13 +00002204\end{cfuncdesc}
2205
Fred Drakee5bf8b21998-02-12 21:22:28 +00002206
Fred Drakeefd146c1999-02-15 15:30:45 +00002207\subsection{The None Object \label{noneObject}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002208
Fred Drake659ebfa2000-04-03 15:42:13 +00002209\obindex{None@\texttt{None}}
2210Note that the \ctype{PyTypeObject} for \code{None} is not directly
2211exposed in the Python/C API. Since \code{None} is a singleton,
2212testing for object identity (using \samp{==} in C) is sufficient.
2213There is no \cfunction{PyNone_Check()} function for the same reason.
2214
2215\begin{cvardesc}{PyObject*}{Py_None}
Guido van Rossum44475131998-04-21 15:30:01 +00002216The Python \code{None} object, denoting lack of value. This object has
2217no methods.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002218\end{cvardesc}
2219
2220
Fred Drakefa774872001-07-11 20:35:37 +00002221\section{Numeric Objects \label{numericObjects}}
2222
2223\obindex{numeric}
2224
2225
2226\subsection{Plain Integer Objects \label{intObjects}}
2227
2228\obindex{integer}
2229\begin{ctypedesc}{PyIntObject}
2230This subtype of \ctype{PyObject} represents a Python integer object.
2231\end{ctypedesc}
2232
2233\begin{cvardesc}{PyTypeObject}{PyInt_Type}
2234This instance of \ctype{PyTypeObject} represents the Python plain
2235integer type. This is the same object as \code{types.IntType}.
2236\withsubitem{(in modules types)}{\ttindex{IntType}}
2237\end{cvardesc}
2238
2239\begin{cfuncdesc}{int}{PyInt_Check}{PyObject* o}
2240Returns true if \var{o} is of type \cdata{PyInt_Type}.
2241\end{cfuncdesc}
2242
2243\begin{cfuncdesc}{PyObject*}{PyInt_FromLong}{long ival}
2244Creates a new integer object with a value of \var{ival}.
2245
2246The current implementation keeps an array of integer objects for all
2247integers between \code{-1} and \code{100}, when you create an int in
2248that range you actually just get back a reference to the existing
2249object. So it should be possible to change the value of \code{1}. I
2250suspect the behaviour of Python in this case is undefined. :-)
2251\end{cfuncdesc}
2252
2253\begin{cfuncdesc}{long}{PyInt_AsLong}{PyObject *io}
2254Will first attempt to cast the object to a \ctype{PyIntObject}, if
2255it is not already one, and then return its value.
2256\end{cfuncdesc}
2257
2258\begin{cfuncdesc}{long}{PyInt_AS_LONG}{PyObject *io}
2259Returns the value of the object \var{io}. No error checking is
2260performed.
2261\end{cfuncdesc}
2262
2263\begin{cfuncdesc}{long}{PyInt_GetMax}{}
2264Returns the system's idea of the largest integer it can handle
2265(\constant{LONG_MAX}\ttindex{LONG_MAX}, as defined in the system
2266header files).
2267\end{cfuncdesc}
2268
2269
2270\subsection{Long Integer Objects \label{longObjects}}
2271
2272\obindex{long integer}
2273\begin{ctypedesc}{PyLongObject}
2274This subtype of \ctype{PyObject} represents a Python long integer
2275object.
2276\end{ctypedesc}
2277
2278\begin{cvardesc}{PyTypeObject}{PyLong_Type}
2279This instance of \ctype{PyTypeObject} represents the Python long
2280integer type. This is the same object as \code{types.LongType}.
2281\withsubitem{(in modules types)}{\ttindex{LongType}}
2282\end{cvardesc}
2283
2284\begin{cfuncdesc}{int}{PyLong_Check}{PyObject *p}
2285Returns true if its argument is a \ctype{PyLongObject}.
2286\end{cfuncdesc}
2287
2288\begin{cfuncdesc}{PyObject*}{PyLong_FromLong}{long v}
2289Returns a new \ctype{PyLongObject} object from \var{v}, or \NULL{} on
2290failure.
2291\end{cfuncdesc}
2292
2293\begin{cfuncdesc}{PyObject*}{PyLong_FromUnsignedLong}{unsigned long v}
2294Returns a new \ctype{PyLongObject} object from a C \ctype{unsigned
2295long}, or \NULL{} on failure.
2296\end{cfuncdesc}
2297
2298\begin{cfuncdesc}{PyObject*}{PyLong_FromDouble}{double v}
2299Returns a new \ctype{PyLongObject} object from the integer part of
2300\var{v}, or \NULL{} on failure.
2301\end{cfuncdesc}
2302
2303\begin{cfuncdesc}{long}{PyLong_AsLong}{PyObject *pylong}
2304Returns a C \ctype{long} representation of the contents of
2305\var{pylong}. If \var{pylong} is greater than
2306\constant{LONG_MAX}\ttindex{LONG_MAX}, an \exception{OverflowError} is
2307raised.\withsubitem{(built-in exception)}{\ttindex{OverflowError}}
2308\end{cfuncdesc}
2309
2310\begin{cfuncdesc}{unsigned long}{PyLong_AsUnsignedLong}{PyObject *pylong}
2311Returns a C \ctype{unsigned long} representation of the contents of
2312\var{pylong}. If \var{pylong} is greater than
2313\constant{ULONG_MAX}\ttindex{ULONG_MAX}, an \exception{OverflowError}
2314is raised.\withsubitem{(built-in exception)}{\ttindex{OverflowError}}
2315\end{cfuncdesc}
2316
2317\begin{cfuncdesc}{double}{PyLong_AsDouble}{PyObject *pylong}
2318Returns a C \ctype{double} representation of the contents of \var{pylong}.
2319\end{cfuncdesc}
2320
2321\begin{cfuncdesc}{PyObject*}{PyLong_FromString}{char *str, char **pend,
2322 int base}
2323Return a new \ctype{PyLongObject} based on the string value in
2324\var{str}, which is interpreted according to the radix in \var{base}.
2325If \var{pend} is non-\NULL, \code{*\var{pend}} will point to the first
2326character in \var{str} which follows the representation of the
2327number. If \var{base} is \code{0}, the radix will be determined base
2328on the leading characters of \var{str}: if \var{str} starts with
2329\code{'0x'} or \code{'0X'}, radix 16 will be used; if \var{str} starts
2330with \code{'0'}, radix 8 will be used; otherwise radix 10 will be
2331used. If \var{base} is not \code{0}, it must be between \code{2} and
2332\code{36}, inclusive. Leading spaces are ignored. If there are no
2333digits, \exception{ValueError} will be raised.
2334\end{cfuncdesc}
2335
2336
2337\subsection{Floating Point Objects \label{floatObjects}}
2338
2339\obindex{floating point}
2340\begin{ctypedesc}{PyFloatObject}
2341This subtype of \ctype{PyObject} represents a Python floating point
2342object.
2343\end{ctypedesc}
2344
2345\begin{cvardesc}{PyTypeObject}{PyFloat_Type}
2346This instance of \ctype{PyTypeObject} represents the Python floating
2347point type. This is the same object as \code{types.FloatType}.
2348\withsubitem{(in modules types)}{\ttindex{FloatType}}
2349\end{cvardesc}
2350
2351\begin{cfuncdesc}{int}{PyFloat_Check}{PyObject *p}
2352Returns true if its argument is a \ctype{PyFloatObject}.
2353\end{cfuncdesc}
2354
2355\begin{cfuncdesc}{PyObject*}{PyFloat_FromDouble}{double v}
2356Creates a \ctype{PyFloatObject} object from \var{v}, or \NULL{} on
2357failure.
2358\end{cfuncdesc}
2359
2360\begin{cfuncdesc}{double}{PyFloat_AsDouble}{PyObject *pyfloat}
2361Returns a C \ctype{double} representation of the contents of \var{pyfloat}.
2362\end{cfuncdesc}
2363
2364\begin{cfuncdesc}{double}{PyFloat_AS_DOUBLE}{PyObject *pyfloat}
2365Returns a C \ctype{double} representation of the contents of
2366\var{pyfloat}, but without error checking.
2367\end{cfuncdesc}
2368
2369
2370\subsection{Complex Number Objects \label{complexObjects}}
2371
2372\obindex{complex number}
2373Python's complex number objects are implemented as two distinct types
2374when viewed from the C API: one is the Python object exposed to
2375Python programs, and the other is a C structure which represents the
2376actual complex number value. The API provides functions for working
2377with both.
2378
2379\subsubsection{Complex Numbers as C Structures}
2380
2381Note that the functions which accept these structures as parameters
2382and return them as results do so \emph{by value} rather than
2383dereferencing them through pointers. This is consistent throughout
2384the API.
2385
2386\begin{ctypedesc}{Py_complex}
2387The C structure which corresponds to the value portion of a Python
2388complex number object. Most of the functions for dealing with complex
2389number objects use structures of this type as input or output values,
2390as appropriate. It is defined as:
2391
2392\begin{verbatim}
2393typedef struct {
2394 double real;
2395 double imag;
2396} Py_complex;
2397\end{verbatim}
2398\end{ctypedesc}
2399
2400\begin{cfuncdesc}{Py_complex}{_Py_c_sum}{Py_complex left, Py_complex right}
2401Return the sum of two complex numbers, using the C
2402\ctype{Py_complex} representation.
2403\end{cfuncdesc}
2404
2405\begin{cfuncdesc}{Py_complex}{_Py_c_diff}{Py_complex left, Py_complex right}
2406Return the difference between two complex numbers, using the C
2407\ctype{Py_complex} representation.
2408\end{cfuncdesc}
2409
2410\begin{cfuncdesc}{Py_complex}{_Py_c_neg}{Py_complex complex}
2411Return the negation of the complex number \var{complex}, using the C
2412\ctype{Py_complex} representation.
2413\end{cfuncdesc}
2414
2415\begin{cfuncdesc}{Py_complex}{_Py_c_prod}{Py_complex left, Py_complex right}
2416Return the product of two complex numbers, using the C
2417\ctype{Py_complex} representation.
2418\end{cfuncdesc}
2419
2420\begin{cfuncdesc}{Py_complex}{_Py_c_quot}{Py_complex dividend,
2421 Py_complex divisor}
2422Return the quotient of two complex numbers, using the C
2423\ctype{Py_complex} representation.
2424\end{cfuncdesc}
2425
2426\begin{cfuncdesc}{Py_complex}{_Py_c_pow}{Py_complex num, Py_complex exp}
2427Return the exponentiation of \var{num} by \var{exp}, using the C
2428\ctype{Py_complex} representation.
2429\end{cfuncdesc}
2430
2431
2432\subsubsection{Complex Numbers as Python Objects}
2433
2434\begin{ctypedesc}{PyComplexObject}
2435This subtype of \ctype{PyObject} represents a Python complex number object.
2436\end{ctypedesc}
2437
2438\begin{cvardesc}{PyTypeObject}{PyComplex_Type}
2439This instance of \ctype{PyTypeObject} represents the Python complex
2440number type.
2441\end{cvardesc}
2442
2443\begin{cfuncdesc}{int}{PyComplex_Check}{PyObject *p}
2444Returns true if its argument is a \ctype{PyComplexObject}.
2445\end{cfuncdesc}
2446
2447\begin{cfuncdesc}{PyObject*}{PyComplex_FromCComplex}{Py_complex v}
2448Create a new Python complex number object from a C
2449\ctype{Py_complex} value.
2450\end{cfuncdesc}
2451
2452\begin{cfuncdesc}{PyObject*}{PyComplex_FromDoubles}{double real, double imag}
2453Returns a new \ctype{PyComplexObject} object from \var{real} and \var{imag}.
2454\end{cfuncdesc}
2455
2456\begin{cfuncdesc}{double}{PyComplex_RealAsDouble}{PyObject *op}
2457Returns the real part of \var{op} as a C \ctype{double}.
2458\end{cfuncdesc}
2459
2460\begin{cfuncdesc}{double}{PyComplex_ImagAsDouble}{PyObject *op}
2461Returns the imaginary part of \var{op} as a C \ctype{double}.
2462\end{cfuncdesc}
2463
2464\begin{cfuncdesc}{Py_complex}{PyComplex_AsCComplex}{PyObject *op}
2465Returns the \ctype{Py_complex} value of the complex number \var{op}.
2466\end{cfuncdesc}
2467
2468
2469
Fred Drakeefd146c1999-02-15 15:30:45 +00002470\section{Sequence Objects \label{sequenceObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002471
Fred Drake659ebfa2000-04-03 15:42:13 +00002472\obindex{sequence}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002473Generic operations on sequence objects were discussed in the previous
2474chapter; this section deals with the specific kinds of sequence
2475objects that are intrinsic to the Python language.
2476
2477
Fred Drakeefd146c1999-02-15 15:30:45 +00002478\subsection{String Objects \label{stringObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002479
Fred Drake89024422000-10-23 16:00:54 +00002480These functions raise \exception{TypeError} when expecting a string
2481parameter and are called with a non-string parameter.
2482
Fred Drake659ebfa2000-04-03 15:42:13 +00002483\obindex{string}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002484\begin{ctypedesc}{PyStringObject}
Fred Drakef8830d11998-04-23 14:06:01 +00002485This subtype of \ctype{PyObject} represents a Python string object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002486\end{ctypedesc}
2487
2488\begin{cvardesc}{PyTypeObject}{PyString_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00002489This instance of \ctype{PyTypeObject} represents the Python string
2490type; it is the same object as \code{types.TypeType} in the Python
2491layer.\withsubitem{(in module types)}{\ttindex{StringType}}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002492\end{cvardesc}
2493
2494\begin{cfuncdesc}{int}{PyString_Check}{PyObject *o}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002495Returns true if the object \var{o} is a string object.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002496\end{cfuncdesc}
2497
Fred Drakec6fa34e1998-04-02 06:47:24 +00002498\begin{cfuncdesc}{PyObject*}{PyString_FromString}{const char *v}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002499Returns a new string object with the value \var{v} on success, and
2500\NULL{} on failure.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002501\end{cfuncdesc}
2502
Fred Drake659ebfa2000-04-03 15:42:13 +00002503\begin{cfuncdesc}{PyObject*}{PyString_FromStringAndSize}{const char *v,
2504 int len}
2505Returns a new string object with the value \var{v} and length
2506\var{len} on success, and \NULL{} on failure. If \var{v} is \NULL{},
2507the contents of the string are uninitialized.
2508\end{cfuncdesc}
2509
Fred Drakec6fa34e1998-04-02 06:47:24 +00002510\begin{cfuncdesc}{int}{PyString_Size}{PyObject *string}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002511Returns the length of the string in string object \var{string}.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002512\end{cfuncdesc}
2513
Fred Drake659ebfa2000-04-03 15:42:13 +00002514\begin{cfuncdesc}{int}{PyString_GET_SIZE}{PyObject *string}
Fred Drake5d644212000-10-07 12:31:50 +00002515Macro form of \cfunction{PyString_Size()} but without error
Fred Drake659ebfa2000-04-03 15:42:13 +00002516checking.
2517\end{cfuncdesc}
2518
Fred Drakec6fa34e1998-04-02 06:47:24 +00002519\begin{cfuncdesc}{char*}{PyString_AsString}{PyObject *string}
Fred Drake659ebfa2000-04-03 15:42:13 +00002520Returns a null-terminated representation of the contents of
2521\var{string}. The pointer refers to the internal buffer of
Fred Drake89024422000-10-23 16:00:54 +00002522\var{string}, not a copy. The data must not be modified in any way,
2523unless the string was just created using
2524\code{PyString_FromStringAndSize(NULL, \var{size})}.
2525It must not be deallocated.
Fred Drake659ebfa2000-04-03 15:42:13 +00002526\end{cfuncdesc}
2527
2528\begin{cfuncdesc}{char*}{PyString_AS_STRING}{PyObject *string}
2529Macro form of \cfunction{PyString_AsString()} but without error
2530checking.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002531\end{cfuncdesc}
2532
Marc-André Lemburgd1ba4432000-09-19 21:04:18 +00002533\begin{cfuncdesc}{int}{PyString_AsStringAndSize}{PyObject *obj,
2534 char **buffer,
2535 int *length}
2536Returns a null-terminated representation of the contents of the object
2537\var{obj} through the output variables \var{buffer} and \var{length}.
2538
2539The function accepts both string and Unicode objects as input. For
2540Unicode objects it returns the default encoded version of the object.
2541If \var{length} is set to \NULL{}, the resulting buffer may not contain
2542null characters; if it does, the function returns -1 and a
2543TypeError is raised.
2544
2545The buffer refers to an internal string buffer of \var{obj}, not a
Fred Drake89024422000-10-23 16:00:54 +00002546copy. The data must not be modified in any way, unless the string was
2547just created using \code{PyString_FromStringAndSize(NULL,
2548\var{size})}. It must not be deallocated.
Marc-André Lemburgd1ba4432000-09-19 21:04:18 +00002549\end{cfuncdesc}
2550
Fred Drakec6fa34e1998-04-02 06:47:24 +00002551\begin{cfuncdesc}{void}{PyString_Concat}{PyObject **string,
2552 PyObject *newpart}
Fred Drake66b989c1999-02-15 20:15:39 +00002553Creates a new string object in \var{*string} containing the
Fred Drakeddc6c272000-03-31 18:22:38 +00002554contents of \var{newpart} appended to \var{string}; the caller will
2555own the new reference. The reference to the old value of \var{string}
2556will be stolen. If the new string
Fred Drake66b989c1999-02-15 20:15:39 +00002557cannot be created, the old reference to \var{string} will still be
2558discarded and the value of \var{*string} will be set to
2559\NULL{}; the appropriate exception will be set.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002560\end{cfuncdesc}
2561
2562\begin{cfuncdesc}{void}{PyString_ConcatAndDel}{PyObject **string,
2563 PyObject *newpart}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002564Creates a new string object in \var{*string} containing the contents
Guido van Rossum44475131998-04-21 15:30:01 +00002565of \var{newpart} appended to \var{string}. This version decrements
2566the reference count of \var{newpart}.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002567\end{cfuncdesc}
2568
2569\begin{cfuncdesc}{int}{_PyString_Resize}{PyObject **string, int newsize}
Guido van Rossum44475131998-04-21 15:30:01 +00002570A way to resize a string object even though it is ``immutable''.
2571Only use this to build up a brand new string object; don't use this if
2572the string may already be known in other parts of the code.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002573\end{cfuncdesc}
2574
2575\begin{cfuncdesc}{PyObject*}{PyString_Format}{PyObject *format,
2576 PyObject *args}
Guido van Rossum44475131998-04-21 15:30:01 +00002577Returns a new string object from \var{format} and \var{args}. Analogous
Fred Drake659ebfa2000-04-03 15:42:13 +00002578to \code{\var{format} \%\ \var{args}}. The \var{args} argument must be
Guido van Rossum44475131998-04-21 15:30:01 +00002579a tuple.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002580\end{cfuncdesc}
2581
2582\begin{cfuncdesc}{void}{PyString_InternInPlace}{PyObject **string}
Guido van Rossum44475131998-04-21 15:30:01 +00002583Intern the argument \var{*string} in place. The argument must be the
2584address of a pointer variable pointing to a Python string object.
2585If there is an existing interned string that is the same as
2586\var{*string}, it sets \var{*string} to it (decrementing the reference
2587count of the old string object and incrementing the reference count of
2588the interned string object), otherwise it leaves \var{*string} alone
2589and interns it (incrementing its reference count). (Clarification:
2590even though there is a lot of talk about reference counts, think of
Fred Drakef8830d11998-04-23 14:06:01 +00002591this function as reference-count-neutral; you own the object after
2592the call if and only if you owned it before the call.)
Fred Drakec6fa34e1998-04-02 06:47:24 +00002593\end{cfuncdesc}
2594
2595\begin{cfuncdesc}{PyObject*}{PyString_InternFromString}{const char *v}
Fred Drakef8830d11998-04-23 14:06:01 +00002596A combination of \cfunction{PyString_FromString()} and
2597\cfunction{PyString_InternInPlace()}, returning either a new string object
Guido van Rossum44475131998-04-21 15:30:01 +00002598that has been interned, or a new (``owned'') reference to an earlier
2599interned string object with the same value.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002600\end{cfuncdesc}
2601
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002602\begin{cfuncdesc}{PyObject*}{PyString_Decode}{const char *s,
2603 int size,
2604 const char *encoding,
2605 const char *errors}
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002606Creates an object by decoding \var{size} bytes of the encoded
2607buffer \var{s} using the codec registered
2608for \var{encoding}. \var{encoding} and \var{errors} have the same meaning
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002609as the parameters of the same name in the unicode() builtin
2610function. The codec to be used is looked up using the Python codec
2611registry. Returns \NULL{} in case an exception was raised by the
2612codec.
2613\end{cfuncdesc}
2614
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002615\begin{cfuncdesc}{PyObject*}{PyString_AsDecodedObject}{PyObject *str,
2616 const char *encoding,
2617 const char *errors}
2618Decodes a string object by passing it to the codec registered
2619for \var{encoding} and returns the result as Python
2620object. \var{encoding} and \var{errors} have the same meaning as the
2621parameters of the same name in the string .encode() method. The codec
2622to be used is looked up using the Python codec registry. Returns
2623\NULL{} in case an exception was raised by the codec.
2624\end{cfuncdesc}
2625
2626\begin{cfuncdesc}{PyObject*}{PyString_Encode}{const char *s,
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002627 int size,
2628 const char *encoding,
2629 const char *errors}
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002630Encodes the \ctype{char} buffer of the given size by passing it to
2631the codec registered for \var{encoding} and returns a Python object.
2632\var{encoding} and \var{errors} have the same
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002633meaning as the parameters of the same name in the string .encode()
2634method. The codec to be used is looked up using the Python codec
2635registry. Returns \NULL{} in case an exception was raised by the
2636codec.
2637\end{cfuncdesc}
2638
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002639\begin{cfuncdesc}{PyObject*}{PyString_AsEncodedObject}{PyObject *str,
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002640 const char *encoding,
2641 const char *errors}
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002642Encodes a string object using the codec registered
2643for \var{encoding} and returns the result as Python
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002644object. \var{encoding} and \var{errors} have the same meaning as the
2645parameters of the same name in the string .encode() method. The codec
2646to be used is looked up using the Python codec registry. Returns
2647\NULL{} in case an exception was raised by the codec.
2648\end{cfuncdesc}
2649
Fred Drakee5bf8b21998-02-12 21:22:28 +00002650
Fred Drakea4cd2612000-04-06 14:10:29 +00002651\subsection{Unicode Objects \label{unicodeObjects}}
2652\sectionauthor{Marc-Andre Lemburg}{mal@lemburg.com}
2653
2654%--- Unicode Type -------------------------------------------------------
2655
2656These are the basic Unicode object types used for the Unicode
2657implementation in Python:
2658
2659\begin{ctypedesc}{Py_UNICODE}
2660This type represents a 16-bit unsigned storage type which is used by
2661Python internally as basis for holding Unicode ordinals. On platforms
2662where \ctype{wchar_t} is available and also has 16-bits,
2663\ctype{Py_UNICODE} is a typedef alias for \ctype{wchar_t} to enhance
2664native platform compatibility. On all other platforms,
2665\ctype{Py_UNICODE} is a typedef alias for \ctype{unsigned short}.
2666\end{ctypedesc}
2667
2668\begin{ctypedesc}{PyUnicodeObject}
2669This subtype of \ctype{PyObject} represents a Python Unicode object.
2670\end{ctypedesc}
2671
2672\begin{cvardesc}{PyTypeObject}{PyUnicode_Type}
2673This instance of \ctype{PyTypeObject} represents the Python Unicode type.
2674\end{cvardesc}
2675
2676%--- These are really C macros... is there a macrodesc TeX macro ?
2677
2678The following APIs are really C macros and can be used to do fast
2679checks and to access internal read-only data of Unicode objects:
2680
2681\begin{cfuncdesc}{int}{PyUnicode_Check}{PyObject *o}
2682Returns true if the object \var{o} is a Unicode object.
2683\end{cfuncdesc}
2684
2685\begin{cfuncdesc}{int}{PyUnicode_GET_SIZE}{PyObject *o}
2686Returns the size of the object. o has to be a
2687PyUnicodeObject (not checked).
2688\end{cfuncdesc}
2689
2690\begin{cfuncdesc}{int}{PyUnicode_GET_DATA_SIZE}{PyObject *o}
2691Returns the size of the object's internal buffer in bytes. o has to be
2692a PyUnicodeObject (not checked).
2693\end{cfuncdesc}
2694
Fred Drake992fe5a2000-06-16 21:04:15 +00002695\begin{cfuncdesc}{Py_UNICODE*}{PyUnicode_AS_UNICODE}{PyObject *o}
Fred Drakea4cd2612000-04-06 14:10:29 +00002696Returns a pointer to the internal Py_UNICODE buffer of the object. o
2697has to be a PyUnicodeObject (not checked).
2698\end{cfuncdesc}
2699
Fred Drake992fe5a2000-06-16 21:04:15 +00002700\begin{cfuncdesc}{const char*}{PyUnicode_AS_DATA}{PyObject *o}
Fred Drakea4cd2612000-04-06 14:10:29 +00002701Returns a (const char *) pointer to the internal buffer of the object.
2702o has to be a PyUnicodeObject (not checked).
2703\end{cfuncdesc}
2704
2705% --- Unicode character properties ---------------------------------------
2706
2707Unicode provides many different character properties. The most often
2708needed ones are available through these macros which are mapped to C
2709functions depending on the Python configuration.
2710
2711\begin{cfuncdesc}{int}{Py_UNICODE_ISSPACE}{Py_UNICODE ch}
2712Returns 1/0 depending on whether \var{ch} is a whitespace character.
2713\end{cfuncdesc}
2714
2715\begin{cfuncdesc}{int}{Py_UNICODE_ISLOWER}{Py_UNICODE ch}
2716Returns 1/0 depending on whether \var{ch} is a lowercase character.
2717\end{cfuncdesc}
2718
2719\begin{cfuncdesc}{int}{Py_UNICODE_ISUPPER}{Py_UNICODE ch}
Fred Drakeae96aab2000-07-03 13:38:10 +00002720Returns 1/0 depending on whether \var{ch} is an uppercase character.
Fred Drakea4cd2612000-04-06 14:10:29 +00002721\end{cfuncdesc}
2722
2723\begin{cfuncdesc}{int}{Py_UNICODE_ISTITLE}{Py_UNICODE ch}
2724Returns 1/0 depending on whether \var{ch} is a titlecase character.
2725\end{cfuncdesc}
2726
2727\begin{cfuncdesc}{int}{Py_UNICODE_ISLINEBREAK}{Py_UNICODE ch}
2728Returns 1/0 depending on whether \var{ch} is a linebreak character.
2729\end{cfuncdesc}
2730
2731\begin{cfuncdesc}{int}{Py_UNICODE_ISDECIMAL}{Py_UNICODE ch}
2732Returns 1/0 depending on whether \var{ch} is a decimal character.
2733\end{cfuncdesc}
2734
2735\begin{cfuncdesc}{int}{Py_UNICODE_ISDIGIT}{Py_UNICODE ch}
2736Returns 1/0 depending on whether \var{ch} is a digit character.
2737\end{cfuncdesc}
2738
2739\begin{cfuncdesc}{int}{Py_UNICODE_ISNUMERIC}{Py_UNICODE ch}
2740Returns 1/0 depending on whether \var{ch} is a numeric character.
2741\end{cfuncdesc}
2742
Fred Drakeae96aab2000-07-03 13:38:10 +00002743\begin{cfuncdesc}{int}{Py_UNICODE_ISALPHA}{Py_UNICODE ch}
2744Returns 1/0 depending on whether \var{ch} is an alphabetic character.
2745\end{cfuncdesc}
2746
2747\begin{cfuncdesc}{int}{Py_UNICODE_ISALNUM}{Py_UNICODE ch}
2748Returns 1/0 depending on whether \var{ch} is an alphanumeric character.
2749\end{cfuncdesc}
2750
Fred Drakea4cd2612000-04-06 14:10:29 +00002751These APIs can be used for fast direct character conversions:
2752
2753\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOLOWER}{Py_UNICODE ch}
2754Returns the character \var{ch} converted to lower case.
2755\end{cfuncdesc}
2756
2757\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOUPPER}{Py_UNICODE ch}
2758Returns the character \var{ch} converted to upper case.
2759\end{cfuncdesc}
2760
2761\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOTITLE}{Py_UNICODE ch}
2762Returns the character \var{ch} converted to title case.
2763\end{cfuncdesc}
2764
2765\begin{cfuncdesc}{int}{Py_UNICODE_TODECIMAL}{Py_UNICODE ch}
2766Returns the character \var{ch} converted to a decimal positive integer.
2767Returns -1 in case this is not possible. Does not raise exceptions.
2768\end{cfuncdesc}
2769
2770\begin{cfuncdesc}{int}{Py_UNICODE_TODIGIT}{Py_UNICODE ch}
2771Returns the character \var{ch} converted to a single digit integer.
2772Returns -1 in case this is not possible. Does not raise exceptions.
2773\end{cfuncdesc}
2774
2775\begin{cfuncdesc}{double}{Py_UNICODE_TONUMERIC}{Py_UNICODE ch}
2776Returns the character \var{ch} converted to a (positive) double.
2777Returns -1.0 in case this is not possible. Does not raise exceptions.
2778\end{cfuncdesc}
2779
2780% --- Plain Py_UNICODE ---------------------------------------------------
2781
2782To create Unicode objects and access their basic sequence properties,
2783use these APIs:
2784
2785\begin{cfuncdesc}{PyObject*}{PyUnicode_FromUnicode}{const Py_UNICODE *u,
2786 int size}
2787
2788Create a Unicode Object from the Py_UNICODE buffer \var{u} of the
2789given size. \var{u} may be \NULL{} which causes the contents to be
2790undefined. It is the user's responsibility to fill in the needed data.
Marc-André Lemburg8155e0e2001-04-23 14:44:21 +00002791The buffer is copied into the new object. If the buffer is not \NULL{},
2792the return value might be a shared object. Therefore, modification of
2793the resulting Unicode Object is only allowed when \var{u} is \NULL{}.
Fred Drakea4cd2612000-04-06 14:10:29 +00002794\end{cfuncdesc}
2795
Fred Drake1d158692000-06-18 05:21:21 +00002796\begin{cfuncdesc}{Py_UNICODE*}{PyUnicode_AsUnicode}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002797Return a read-only pointer to the Unicode object's internal
2798\ctype{Py_UNICODE} buffer.
2799\end{cfuncdesc}
2800
2801\begin{cfuncdesc}{int}{PyUnicode_GetSize}{PyObject *unicode}
2802Return the length of the Unicode object.
2803\end{cfuncdesc}
2804
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002805\begin{cfuncdesc}{PyObject*}{PyUnicode_FromEncodedObject}{PyObject *obj,
2806 const char *encoding,
2807 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002808
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002809Coerce an encoded object obj to an Unicode object and return a
2810reference with incremented refcount.
Fred Drakea4cd2612000-04-06 14:10:29 +00002811
2812Coercion is done in the following way:
2813\begin{enumerate}
2814\item Unicode objects are passed back as-is with incremented
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002815 refcount. Note: these cannot be decoded; passing a non-NULL
2816 value for encoding will result in a TypeError.
Fred Drakea4cd2612000-04-06 14:10:29 +00002817
2818\item String and other char buffer compatible objects are decoded
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002819 according to the given encoding and using the error handling
2820 defined by errors. Both can be NULL to have the interface use
2821 the default values (see the next section for details).
Fred Drakea4cd2612000-04-06 14:10:29 +00002822
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002823\item All other objects cause an exception.
Fred Drakea4cd2612000-04-06 14:10:29 +00002824\end{enumerate}
2825The API returns NULL in case of an error. The caller is responsible
2826for decref'ing the returned objects.
2827\end{cfuncdesc}
2828
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002829\begin{cfuncdesc}{PyObject*}{PyUnicode_FromObject}{PyObject *obj}
2830
2831Shortcut for PyUnicode_FromEncodedObject(obj, NULL, ``strict'')
2832which is used throughout the interpreter whenever coercion to
2833Unicode is needed.
2834\end{cfuncdesc}
2835
Fred Drakea4cd2612000-04-06 14:10:29 +00002836% --- wchar_t support for platforms which support it ---------------------
2837
2838If the platform supports \ctype{wchar_t} and provides a header file
2839wchar.h, Python can interface directly to this type using the
2840following functions. Support is optimized if Python's own
2841\ctype{Py_UNICODE} type is identical to the system's \ctype{wchar_t}.
2842
2843\begin{cfuncdesc}{PyObject*}{PyUnicode_FromWideChar}{const wchar_t *w,
2844 int size}
2845Create a Unicode Object from the \ctype{whcar_t} buffer \var{w} of the
2846given size. Returns \NULL{} on failure.
2847\end{cfuncdesc}
2848
2849\begin{cfuncdesc}{int}{PyUnicode_AsWideChar}{PyUnicodeObject *unicode,
2850 wchar_t *w,
2851 int size}
Fred Drakea4cd2612000-04-06 14:10:29 +00002852Copies the Unicode Object contents into the \ctype{whcar_t} buffer
2853\var{w}. At most \var{size} \ctype{whcar_t} characters are copied.
2854Returns the number of \ctype{whcar_t} characters copied or -1 in case
2855of an error.
2856\end{cfuncdesc}
2857
2858
2859\subsubsection{Builtin Codecs \label{builtinCodecs}}
2860
2861Python provides a set of builtin codecs which are written in C
2862for speed. All of these codecs are directly usable via the
2863following functions.
2864
2865Many of the following APIs take two arguments encoding and
2866errors. These parameters encoding and errors have the same semantics
2867as the ones of the builtin unicode() Unicode object constructor.
2868
2869Setting encoding to NULL causes the default encoding to be used which
2870is UTF-8.
2871
2872Error handling is set by errors which may also be set to NULL meaning
2873to use the default handling defined for the codec. Default error
2874handling for all builtin codecs is ``strict'' (ValueErrors are raised).
2875
2876The codecs all use a similar interface. Only deviation from the
2877following generic ones are documented for simplicity.
2878
2879% --- Generic Codecs -----------------------------------------------------
2880
2881These are the generic codec APIs:
2882
2883\begin{cfuncdesc}{PyObject*}{PyUnicode_Decode}{const char *s,
2884 int size,
2885 const char *encoding,
2886 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002887Create a Unicode object by decoding \var{size} bytes of the encoded
2888string \var{s}. \var{encoding} and \var{errors} have the same meaning
2889as the parameters of the same name in the unicode() builtin
2890function. The codec to be used is looked up using the Python codec
2891registry. Returns \NULL{} in case an exception was raised by the
2892codec.
2893\end{cfuncdesc}
2894
2895\begin{cfuncdesc}{PyObject*}{PyUnicode_Encode}{const Py_UNICODE *s,
2896 int size,
2897 const char *encoding,
2898 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002899Encodes the \ctype{Py_UNICODE} buffer of the given size and returns a
2900Python string object. \var{encoding} and \var{errors} have the same
2901meaning as the parameters of the same name in the Unicode .encode()
2902method. The codec to be used is looked up using the Python codec
2903registry. Returns \NULL{} in case an exception was raised by the
2904codec.
2905\end{cfuncdesc}
2906
2907\begin{cfuncdesc}{PyObject*}{PyUnicode_AsEncodedString}{PyObject *unicode,
2908 const char *encoding,
2909 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002910Encodes a Unicode object and returns the result as Python string
2911object. \var{encoding} and \var{errors} have the same meaning as the
2912parameters of the same name in the Unicode .encode() method. The codec
2913to be used is looked up using the Python codec registry. Returns
2914\NULL{} in case an exception was raised by the codec.
2915\end{cfuncdesc}
2916
2917% --- UTF-8 Codecs -------------------------------------------------------
2918
2919These are the UTF-8 codec APIs:
2920
2921\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUTF8}{const char *s,
2922 int size,
2923 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002924Creates a Unicode object by decoding \var{size} bytes of the UTF-8
2925encoded string \var{s}. Returns \NULL{} in case an exception was
2926raised by the codec.
2927\end{cfuncdesc}
2928
2929\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUTF8}{const Py_UNICODE *s,
2930 int size,
2931 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002932Encodes the \ctype{Py_UNICODE} buffer of the given size using UTF-8
2933and returns a Python string object. Returns \NULL{} in case an
2934exception was raised by the codec.
2935\end{cfuncdesc}
2936
2937\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUTF8String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002938Encodes a Unicode objects using UTF-8 and returns the result as Python
2939string object. Error handling is ``strict''. Returns
2940\NULL{} in case an exception was raised by the codec.
2941\end{cfuncdesc}
2942
2943% --- UTF-16 Codecs ------------------------------------------------------ */
2944
2945These are the UTF-16 codec APIs:
2946
2947\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUTF16}{const char *s,
2948 int size,
2949 const char *errors,
2950 int *byteorder}
Fred Drakea4cd2612000-04-06 14:10:29 +00002951Decodes \var{length} bytes from a UTF-16 encoded buffer string and
2952returns the corresponding Unicode object.
2953
2954\var{errors} (if non-NULL) defines the error handling. It defaults
2955to ``strict''.
2956
2957If \var{byteorder} is non-\NULL{}, the decoder starts decoding using
2958the given byte order:
2959
2960\begin{verbatim}
2961 *byteorder == -1: little endian
2962 *byteorder == 0: native order
2963 *byteorder == 1: big endian
2964\end{verbatim}
2965
2966and then switches according to all byte order marks (BOM) it finds in
2967the input data. BOM marks are not copied into the resulting Unicode
2968string. After completion, \var{*byteorder} is set to the current byte
2969order at the end of input data.
2970
2971If \var{byteorder} is \NULL{}, the codec starts in native order mode.
2972
2973Returns \NULL{} in case an exception was raised by the codec.
2974\end{cfuncdesc}
2975
2976\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUTF16}{const Py_UNICODE *s,
2977 int size,
2978 const char *errors,
2979 int byteorder}
Fred Drakea4cd2612000-04-06 14:10:29 +00002980Returns a Python string object holding the UTF-16 encoded value of the
2981Unicode data in \var{s}.
2982
Fred Drakea8455ab2000-06-16 19:58:42 +00002983If \var{byteorder} is not \code{0}, output is written according to the
Fred Drakea4cd2612000-04-06 14:10:29 +00002984following byte order:
2985
2986\begin{verbatim}
2987 byteorder == -1: little endian
2988 byteorder == 0: native byte order (writes a BOM mark)
2989 byteorder == 1: big endian
2990\end{verbatim}
2991
Fred Drakea8455ab2000-06-16 19:58:42 +00002992If byteorder is \code{0}, the output string will always start with the
Fred Drakea4cd2612000-04-06 14:10:29 +00002993Unicode BOM mark (U+FEFF). In the other two modes, no BOM mark is
2994prepended.
2995
2996Note that \ctype{Py_UNICODE} data is being interpreted as UTF-16
2997reduced to UCS-2. This trick makes it possible to add full UTF-16
2998capabilities at a later point without comprimising the APIs.
2999
3000Returns \NULL{} in case an exception was raised by the codec.
3001\end{cfuncdesc}
3002
3003\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUTF16String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003004Returns a Python string using the UTF-16 encoding in native byte
3005order. The string always starts with a BOM mark. Error handling is
3006``strict''. Returns \NULL{} in case an exception was raised by the
3007codec.
3008\end{cfuncdesc}
3009
3010% --- Unicode-Escape Codecs ----------------------------------------------
3011
3012These are the ``Unicode Esacpe'' codec APIs:
3013
3014\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUnicodeEscape}{const char *s,
3015 int size,
3016 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003017Creates a Unicode object by decoding \var{size} bytes of the Unicode-Esacpe
3018encoded string \var{s}. Returns \NULL{} in case an exception was
3019raised by the codec.
3020\end{cfuncdesc}
3021
3022\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUnicodeEscape}{const Py_UNICODE *s,
3023 int size,
3024 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003025Encodes the \ctype{Py_UNICODE} buffer of the given size using Unicode-Escape
3026and returns a Python string object. Returns \NULL{} in case an
3027exception was raised by the codec.
3028\end{cfuncdesc}
3029
3030\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUnicodeEscapeString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003031Encodes a Unicode objects using Unicode-Escape and returns the result
3032as Python string object. Error handling is ``strict''. Returns
3033\NULL{} in case an exception was raised by the codec.
3034\end{cfuncdesc}
3035
3036% --- Raw-Unicode-Escape Codecs ------------------------------------------
3037
3038These are the ``Raw Unicode Esacpe'' codec APIs:
3039
3040\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeRawUnicodeEscape}{const char *s,
3041 int size,
3042 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003043Creates a Unicode object by decoding \var{size} bytes of the Raw-Unicode-Esacpe
3044encoded string \var{s}. Returns \NULL{} in case an exception was
3045raised by the codec.
3046\end{cfuncdesc}
3047
3048\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeRawUnicodeEscape}{const Py_UNICODE *s,
3049 int size,
3050 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003051Encodes the \ctype{Py_UNICODE} buffer of the given size using Raw-Unicode-Escape
3052and returns a Python string object. Returns \NULL{} in case an
3053exception was raised by the codec.
3054\end{cfuncdesc}
3055
3056\begin{cfuncdesc}{PyObject*}{PyUnicode_AsRawUnicodeEscapeString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003057Encodes a Unicode objects using Raw-Unicode-Escape and returns the result
3058as Python string object. Error handling is ``strict''. Returns
3059\NULL{} in case an exception was raised by the codec.
3060\end{cfuncdesc}
3061
3062% --- Latin-1 Codecs -----------------------------------------------------
3063
3064These are the Latin-1 codec APIs:
3065
3066Latin-1 corresponds to the first 256 Unicode ordinals and only these
3067are accepted by the codecs during encoding.
3068
3069\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeLatin1}{const char *s,
Fred Drake1d158692000-06-18 05:21:21 +00003070 int size,
3071 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003072Creates a Unicode object by decoding \var{size} bytes of the Latin-1
3073encoded string \var{s}. Returns \NULL{} in case an exception was
3074raised by the codec.
3075\end{cfuncdesc}
3076
3077\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeLatin1}{const Py_UNICODE *s,
Fred Drake1d158692000-06-18 05:21:21 +00003078 int size,
3079 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003080Encodes the \ctype{Py_UNICODE} buffer of the given size using Latin-1
3081and returns a Python string object. Returns \NULL{} in case an
3082exception was raised by the codec.
3083\end{cfuncdesc}
3084
3085\begin{cfuncdesc}{PyObject*}{PyUnicode_AsLatin1String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003086Encodes a Unicode objects using Latin-1 and returns the result as
3087Python string object. Error handling is ``strict''. Returns
3088\NULL{} in case an exception was raised by the codec.
3089\end{cfuncdesc}
3090
3091% --- ASCII Codecs -------------------------------------------------------
3092
Fred Drake1d158692000-06-18 05:21:21 +00003093These are the \ASCII{} codec APIs. Only 7-bit \ASCII{} data is
3094accepted. All other codes generate errors.
Fred Drakea4cd2612000-04-06 14:10:29 +00003095
3096\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeASCII}{const char *s,
Fred Drake1d158692000-06-18 05:21:21 +00003097 int size,
3098 const char *errors}
3099Creates a Unicode object by decoding \var{size} bytes of the
3100\ASCII{} encoded string \var{s}. Returns \NULL{} in case an exception
3101was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003102\end{cfuncdesc}
3103
3104\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeASCII}{const Py_UNICODE *s,
Fred Drake1d158692000-06-18 05:21:21 +00003105 int size,
3106 const char *errors}
3107Encodes the \ctype{Py_UNICODE} buffer of the given size using
3108\ASCII{} and returns a Python string object. Returns \NULL{} in case
3109an exception was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003110\end{cfuncdesc}
3111
3112\begin{cfuncdesc}{PyObject*}{PyUnicode_AsASCIIString}{PyObject *unicode}
Fred Drake1d158692000-06-18 05:21:21 +00003113Encodes a Unicode objects using \ASCII{} and returns the result as Python
Fred Drakea4cd2612000-04-06 14:10:29 +00003114string object. Error handling is ``strict''. Returns
3115\NULL{} in case an exception was raised by the codec.
3116\end{cfuncdesc}
3117
3118% --- Character Map Codecs -----------------------------------------------
3119
3120These are the mapping codec APIs:
3121
3122This codec is special in that it can be used to implement many
3123different codecs (and this is in fact what was done to obtain most of
3124the standard codecs included in the \module{encodings} package). The
3125codec uses mapping to encode and decode characters.
3126
3127Decoding mappings must map single string characters to single Unicode
3128characters, integers (which are then interpreted as Unicode ordinals)
3129or None (meaning "undefined mapping" and causing an error).
3130
3131Encoding mappings must map single Unicode characters to single string
3132characters, integers (which are then interpreted as Latin-1 ordinals)
3133or None (meaning "undefined mapping" and causing an error).
3134
3135The mapping objects provided must only support the __getitem__ mapping
3136interface.
3137
3138If a character lookup fails with a LookupError, the character is
3139copied as-is meaning that its ordinal value will be interpreted as
3140Unicode or Latin-1 ordinal resp. Because of this, mappings only need
3141to contain those mappings which map characters to different code
3142points.
3143
3144\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeCharmap}{const char *s,
3145 int size,
3146 PyObject *mapping,
3147 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003148Creates a Unicode object by decoding \var{size} bytes of the encoded
3149string \var{s} using the given \var{mapping} object. Returns \NULL{}
3150in case an exception was raised by the codec.
3151\end{cfuncdesc}
3152
3153\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeCharmap}{const Py_UNICODE *s,
3154 int size,
3155 PyObject *mapping,
3156 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003157Encodes the \ctype{Py_UNICODE} buffer of the given size using the
3158given \var{mapping} object and returns a Python string object.
3159Returns \NULL{} in case an exception was raised by the codec.
3160\end{cfuncdesc}
3161
3162\begin{cfuncdesc}{PyObject*}{PyUnicode_AsCharmapString}{PyObject *unicode,
3163 PyObject *mapping}
Fred Drakea4cd2612000-04-06 14:10:29 +00003164Encodes a Unicode objects using the given \var{mapping} object and
3165returns the result as Python string object. Error handling is
3166``strict''. Returns \NULL{} in case an exception was raised by the
3167codec.
3168\end{cfuncdesc}
3169
3170The following codec API is special in that maps Unicode to Unicode.
3171
3172\begin{cfuncdesc}{PyObject*}{PyUnicode_TranslateCharmap}{const Py_UNICODE *s,
3173 int size,
3174 PyObject *table,
3175 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003176Translates a \ctype{Py_UNICODE} buffer of the given length by applying
3177a character mapping \var{table} to it and returns the resulting
Fred Drake1d158692000-06-18 05:21:21 +00003178Unicode object. Returns \NULL{} when an exception was raised by the
3179codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003180
3181The \var{mapping} table must map Unicode ordinal integers to Unicode
3182ordinal integers or None (causing deletion of the character).
3183
3184Mapping tables must only provide the __getitem__ interface,
3185e.g. dictionaries or sequences. Unmapped character ordinals (ones
3186which cause a LookupError) are left untouched and are copied as-is.
Fred Drakea4cd2612000-04-06 14:10:29 +00003187\end{cfuncdesc}
3188
3189% --- MBCS codecs for Windows --------------------------------------------
3190
Fred Drake1d158692000-06-18 05:21:21 +00003191These are the MBCS codec APIs. They are currently only available on
Fred Drakea4cd2612000-04-06 14:10:29 +00003192Windows and use the Win32 MBCS converters to implement the
Fred Drake1d158692000-06-18 05:21:21 +00003193conversions. Note that MBCS (or DBCS) is a class of encodings, not
3194just one. The target encoding is defined by the user settings on the
3195machine running the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003196
3197\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeMBCS}{const char *s,
3198 int size,
3199 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003200Creates a Unicode object by decoding \var{size} bytes of the MBCS
Fred Drake1d158692000-06-18 05:21:21 +00003201encoded string \var{s}. Returns \NULL{} in case an exception was
Fred Drakea4cd2612000-04-06 14:10:29 +00003202raised by the codec.
3203\end{cfuncdesc}
3204
3205\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeMBCS}{const Py_UNICODE *s,
3206 int size,
3207 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003208Encodes the \ctype{Py_UNICODE} buffer of the given size using MBCS
3209and returns a Python string object. Returns \NULL{} in case an
3210exception was raised by the codec.
3211\end{cfuncdesc}
3212
3213\begin{cfuncdesc}{PyObject*}{PyUnicode_AsMBCSString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003214Encodes a Unicode objects using MBCS and returns the result as Python
Fred Drake1d158692000-06-18 05:21:21 +00003215string object. Error handling is ``strict''. Returns \NULL{} in case
3216an exception was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003217\end{cfuncdesc}
3218
3219% --- Methods & Slots ----------------------------------------------------
3220
3221\subsubsection{Methods and Slot Functions \label{unicodeMethodsAndSlots}}
3222
3223The following APIs are capable of handling Unicode objects and strings
3224on input (we refer to them as strings in the descriptions) and return
3225Unicode objects or integers as apporpriate.
3226
3227They all return \NULL{} or -1 in case an exception occurrs.
3228
3229\begin{cfuncdesc}{PyObject*}{PyUnicode_Concat}{PyObject *left,
3230 PyObject *right}
Fred Drakea4cd2612000-04-06 14:10:29 +00003231Concat two strings giving a new Unicode string.
3232\end{cfuncdesc}
3233
3234\begin{cfuncdesc}{PyObject*}{PyUnicode_Split}{PyObject *s,
3235 PyObject *sep,
3236 int maxsplit}
Fred Drakea4cd2612000-04-06 14:10:29 +00003237Split a string giving a list of Unicode strings.
3238
3239If sep is NULL, splitting will be done at all whitespace
3240substrings. Otherwise, splits occur at the given separator.
3241
3242At most maxsplit splits will be done. If negative, no limit is set.
3243
3244Separators are not included in the resulting list.
3245\end{cfuncdesc}
3246
3247\begin{cfuncdesc}{PyObject*}{PyUnicode_Splitlines}{PyObject *s,
3248 int maxsplit}
Fred Drake1d158692000-06-18 05:21:21 +00003249Split a Unicode string at line breaks, returning a list of Unicode
3250strings. CRLF is considered to be one line break. The Line break
3251characters are not included in the resulting strings.
Fred Drakea4cd2612000-04-06 14:10:29 +00003252\end{cfuncdesc}
3253
3254\begin{cfuncdesc}{PyObject*}{PyUnicode_Translate}{PyObject *str,
3255 PyObject *table,
3256 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003257Translate a string by applying a character mapping table to it and
3258return the resulting Unicode object.
3259
3260The mapping table must map Unicode ordinal integers to Unicode ordinal
3261integers or None (causing deletion of the character).
3262
3263Mapping tables must only provide the __getitem__ interface,
3264e.g. dictionaries or sequences. Unmapped character ordinals (ones
3265which cause a LookupError) are left untouched and are copied as-is.
3266
3267\var{errors} has the usual meaning for codecs. It may be \NULL{}
3268which indicates to use the default error handling.
Fred Drakea4cd2612000-04-06 14:10:29 +00003269\end{cfuncdesc}
3270
3271\begin{cfuncdesc}{PyObject*}{PyUnicode_Join}{PyObject *separator,
3272 PyObject *seq}
Fred Drakea4cd2612000-04-06 14:10:29 +00003273Join a sequence of strings using the given separator and return
3274the resulting Unicode string.
3275\end{cfuncdesc}
3276
3277\begin{cfuncdesc}{PyObject*}{PyUnicode_Tailmatch}{PyObject *str,
3278 PyObject *substr,
3279 int start,
3280 int end,
3281 int direction}
Fred Drakea4cd2612000-04-06 14:10:29 +00003282Return 1 if \var{substr} matches \var{str}[\var{start}:\var{end}] at
3283the given tail end (\var{direction} == -1 means to do a prefix match,
3284\var{direction} == 1 a suffix match), 0 otherwise.
3285\end{cfuncdesc}
3286
3287\begin{cfuncdesc}{PyObject*}{PyUnicode_Find}{PyObject *str,
3288 PyObject *substr,
3289 int start,
3290 int end,
3291 int direction}
Fred Drakea4cd2612000-04-06 14:10:29 +00003292Return the first position of \var{substr} in
3293\var{str}[\var{start}:\var{end}] using the given \var{direction}
3294(\var{direction} == 1 means to do a forward search,
3295\var{direction} == -1 a backward search), 0 otherwise.
3296\end{cfuncdesc}
3297
3298\begin{cfuncdesc}{PyObject*}{PyUnicode_Count}{PyObject *str,
3299 PyObject *substr,
3300 int start,
3301 int end}
Fred Drakea4cd2612000-04-06 14:10:29 +00003302Count the number of occurrences of \var{substr} in
3303\var{str}[\var{start}:\var{end}]
3304\end{cfuncdesc}
3305
3306\begin{cfuncdesc}{PyObject*}{PyUnicode_Replace}{PyObject *str,
3307 PyObject *substr,
3308 PyObject *replstr,
3309 int maxcount}
Fred Drakea4cd2612000-04-06 14:10:29 +00003310Replace at most \var{maxcount} occurrences of \var{substr} in
3311\var{str} with \var{replstr} and return the resulting Unicode object.
3312\var{maxcount} == -1 means: replace all occurrences.
3313\end{cfuncdesc}
3314
Fred Drake1d158692000-06-18 05:21:21 +00003315\begin{cfuncdesc}{int}{PyUnicode_Compare}{PyObject *left, PyObject *right}
Fred Drakea4cd2612000-04-06 14:10:29 +00003316Compare two strings and return -1, 0, 1 for less than, equal,
3317greater than resp.
3318\end{cfuncdesc}
3319
3320\begin{cfuncdesc}{PyObject*}{PyUnicode_Format}{PyObject *format,
3321 PyObject *args}
Fred Drake1d158692000-06-18 05:21:21 +00003322Returns a new string object from \var{format} and \var{args}; this is
3323analogous to \code{\var{format} \%\ \var{args}}. The
3324\var{args} argument must be a tuple.
Fred Drakea4cd2612000-04-06 14:10:29 +00003325\end{cfuncdesc}
3326
3327\begin{cfuncdesc}{int}{PyUnicode_Contains}{PyObject *container,
3328 PyObject *element}
Fred Drakea4cd2612000-04-06 14:10:29 +00003329Checks whether \var{element} is contained in \var{container} and
Fred Drake1d158692000-06-18 05:21:21 +00003330returns true or false accordingly.
Fred Drakea4cd2612000-04-06 14:10:29 +00003331
Fred Drake1d158692000-06-18 05:21:21 +00003332\var{element} has to coerce to a one element Unicode string. \code{-1} is
Fred Drakea4cd2612000-04-06 14:10:29 +00003333returned in case of an error.
3334\end{cfuncdesc}
3335
3336
Fred Drake58c5a2a1999-08-04 13:13:24 +00003337\subsection{Buffer Objects \label{bufferObjects}}
Fred Drake659ebfa2000-04-03 15:42:13 +00003338\sectionauthor{Greg Stein}{gstein@lyra.org}
Fred Drake58c5a2a1999-08-04 13:13:24 +00003339
Fred Drake659ebfa2000-04-03 15:42:13 +00003340\obindex{buffer}
3341Python objects implemented in C can export a group of functions called
3342the ``buffer\index{buffer interface} interface.'' These functions can
3343be used by an object to expose its data in a raw, byte-oriented
3344format. Clients of the object can use the buffer interface to access
3345the object data directly, without needing to copy it first.
3346
3347Two examples of objects that support
3348the buffer interface are strings and arrays. The string object exposes
3349the character contents in the buffer interface's byte-oriented
3350form. An array can also expose its contents, but it should be noted
3351that array elements may be multi-byte values.
3352
3353An example user of the buffer interface is the file object's
3354\method{write()} method. Any object that can export a series of bytes
3355through the buffer interface can be written to a file. There are a
3356number of format codes to \cfunction{PyArgs_ParseTuple()} that operate
3357against an object's buffer interface, returning data from the target
3358object.
3359
3360More information on the buffer interface is provided in the section
3361``Buffer Object Structures'' (section \ref{buffer-structs}), under
3362the description for \ctype{PyBufferProcs}\ttindex{PyBufferProcs}.
3363
3364A ``buffer object'' is defined in the \file{bufferobject.h} header
3365(included by \file{Python.h}). These objects look very similar to
3366string objects at the Python programming level: they support slicing,
3367indexing, concatenation, and some other standard string
3368operations. However, their data can come from one of two sources: from
3369a block of memory, or from another object which exports the buffer
3370interface.
3371
3372Buffer objects are useful as a way to expose the data from another
3373object's buffer interface to the Python programmer. They can also be
3374used as a zero-copy slicing mechanism. Using their ability to
3375reference a block of memory, it is possible to expose any data to the
3376Python programmer quite easily. The memory could be a large, constant
3377array in a C extension, it could be a raw block of memory for
3378manipulation before passing to an operating system library, or it
3379could be used to pass around structured data in its native, in-memory
3380format.
3381
3382\begin{ctypedesc}{PyBufferObject}
3383This subtype of \ctype{PyObject} represents a buffer object.
3384\end{ctypedesc}
Fred Drake58c5a2a1999-08-04 13:13:24 +00003385
3386\begin{cvardesc}{PyTypeObject}{PyBuffer_Type}
3387The instance of \ctype{PyTypeObject} which represents the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00003388buffer type; it is the same object as \code{types.BufferType} in the
3389Python layer.\withsubitem{(in module types)}{\ttindex{BufferType}}.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003390\end{cvardesc}
3391
3392\begin{cvardesc}{int}{Py_END_OF_BUFFER}
Fred Drake659ebfa2000-04-03 15:42:13 +00003393This constant may be passed as the \var{size} parameter to
3394\cfunction{PyBuffer_FromObject()} or
3395\cfunction{PyBuffer_FromReadWriteObject()}. It indicates that the new
3396\ctype{PyBufferObject} should refer to \var{base} object from the
3397specified \var{offset} to the end of its exported buffer. Using this
3398enables the caller to avoid querying the \var{base} object for its
3399length.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003400\end{cvardesc}
3401
3402\begin{cfuncdesc}{int}{PyBuffer_Check}{PyObject *p}
3403Return true if the argument has type \cdata{PyBuffer_Type}.
3404\end{cfuncdesc}
3405
3406\begin{cfuncdesc}{PyObject*}{PyBuffer_FromObject}{PyObject *base,
3407 int offset, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00003408Return a new read-only buffer object. This raises
3409\exception{TypeError} if \var{base} doesn't support the read-only
3410buffer protocol or doesn't provide exactly one buffer segment, or it
3411raises \exception{ValueError} if \var{offset} is less than zero. The
3412buffer will hold a reference to the \var{base} object, and the
3413buffer's contents will refer to the \var{base} object's buffer
3414interface, starting as position \var{offset} and extending for
3415\var{size} bytes. If \var{size} is \constant{Py_END_OF_BUFFER}, then
3416the new buffer's contents extend to the length of the
3417\var{base} object's exported buffer data.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003418\end{cfuncdesc}
3419
3420\begin{cfuncdesc}{PyObject*}{PyBuffer_FromReadWriteObject}{PyObject *base,
3421 int offset,
3422 int size}
3423Return a new writable buffer object. Parameters and exceptions are
3424similar to those for \cfunction{PyBuffer_FromObject()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003425If the \var{base} object does not export the writeable buffer
3426protocol, then \exception{TypeError} is raised.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003427\end{cfuncdesc}
3428
3429\begin{cfuncdesc}{PyObject*}{PyBuffer_FromMemory}{void *ptr, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00003430Return a new read-only buffer object that reads from a specified
3431location in memory, with a specified size.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003432The caller is responsible for ensuring that the memory buffer, passed
3433in as \var{ptr}, is not deallocated while the returned buffer object
3434exists. Raises \exception{ValueError} if \var{size} is less than
Fred Drake659ebfa2000-04-03 15:42:13 +00003435zero. Note that \constant{Py_END_OF_BUFFER} may \emph{not} be passed
3436for the \var{size} parameter; \exception{ValueError} will be raised in
3437that case.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003438\end{cfuncdesc}
3439
3440\begin{cfuncdesc}{PyObject*}{PyBuffer_FromReadWriteMemory}{void *ptr, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00003441Similar to \cfunction{PyBuffer_FromMemory()}, but the returned buffer
3442is writable.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003443\end{cfuncdesc}
3444
3445\begin{cfuncdesc}{PyObject*}{PyBuffer_New}{int size}
3446Returns a new writable buffer object that maintains its own memory
Fred Drake659ebfa2000-04-03 15:42:13 +00003447buffer of \var{size} bytes. \exception{ValueError} is returned if
3448\var{size} is not zero or positive.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003449\end{cfuncdesc}
3450
Guido van Rossum44475131998-04-21 15:30:01 +00003451
Fred Drakeefd146c1999-02-15 15:30:45 +00003452\subsection{Tuple Objects \label{tupleObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003453
Fred Drake659ebfa2000-04-03 15:42:13 +00003454\obindex{tuple}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003455\begin{ctypedesc}{PyTupleObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003456This subtype of \ctype{PyObject} represents a Python tuple object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003457\end{ctypedesc}
3458
3459\begin{cvardesc}{PyTypeObject}{PyTuple_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003460This instance of \ctype{PyTypeObject} represents the Python tuple
3461type; it is the same object as \code{types.TupleType} in the Python
3462layer.\withsubitem{(in module types)}{\ttindex{TupleType}}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003463\end{cvardesc}
3464
3465\begin{cfuncdesc}{int}{PyTuple_Check}{PyObject *p}
3466Return true if the argument is a tuple object.
3467\end{cfuncdesc}
3468
Fred Drake659ebfa2000-04-03 15:42:13 +00003469\begin{cfuncdesc}{PyObject*}{PyTuple_New}{int len}
3470Return a new tuple object of size \var{len}, or \NULL{} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003471\end{cfuncdesc}
3472
Fred Drakea05460c2001-02-12 17:38:18 +00003473\begin{cfuncdesc}{int}{PyTuple_Size}{PyObject *p}
Fred Drakee058b4f1998-02-16 06:15:35 +00003474Takes a pointer to a tuple object, and returns the size
Fred Drakee5bf8b21998-02-12 21:22:28 +00003475of that tuple.
3476\end{cfuncdesc}
3477
Fred Drakea05460c2001-02-12 17:38:18 +00003478\begin{cfuncdesc}{PyObject*}{PyTuple_GetItem}{PyObject *p, int pos}
Fred Drakee058b4f1998-02-16 06:15:35 +00003479Returns the object at position \var{pos} in the tuple pointed
3480to by \var{p}. If \var{pos} is out of bounds, returns \NULL{} and
Fred Drake659ebfa2000-04-03 15:42:13 +00003481sets an \exception{IndexError} exception.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003482\end{cfuncdesc}
3483
Fred Drakea05460c2001-02-12 17:38:18 +00003484\begin{cfuncdesc}{PyObject*}{PyTuple_GET_ITEM}{PyObject *p, int pos}
Fred Drakefac312f2001-05-29 15:13:00 +00003485Like \cfunction{PyTuple_GetItem()}, but does no checking of its
3486arguments.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003487\end{cfuncdesc}
3488
Fred Drakea05460c2001-02-12 17:38:18 +00003489\begin{cfuncdesc}{PyObject*}{PyTuple_GetSlice}{PyObject *p,
3490 int low, int high}
Fred Drakee058b4f1998-02-16 06:15:35 +00003491Takes a slice of the tuple pointed to by \var{p} from
3492\var{low} to \var{high} and returns it as a new tuple.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003493\end{cfuncdesc}
3494
Fred Drake659ebfa2000-04-03 15:42:13 +00003495\begin{cfuncdesc}{int}{PyTuple_SetItem}{PyObject *p,
3496 int pos, PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00003497Inserts a reference to object \var{o} at position \var{pos} of
3498the tuple pointed to by \var{p}. It returns \code{0} on success.
Fred Drake659ebfa2000-04-03 15:42:13 +00003499\strong{Note:} This function ``steals'' a reference to \var{o}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003500\end{cfuncdesc}
3501
Fred Drake659ebfa2000-04-03 15:42:13 +00003502\begin{cfuncdesc}{void}{PyTuple_SET_ITEM}{PyObject *p,
3503 int pos, PyObject *o}
Fred Drakefac312f2001-05-29 15:13:00 +00003504Like \cfunction{PyTuple_SetItem()}, but does no error checking, and
Fred Drakee5bf8b21998-02-12 21:22:28 +00003505should \emph{only} be used to fill in brand new tuples.
Fred Drake659ebfa2000-04-03 15:42:13 +00003506\strong{Note:} This function ``steals'' a reference to \var{o}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003507\end{cfuncdesc}
3508
Fred Drakefac312f2001-05-29 15:13:00 +00003509\begin{cfuncdesc}{int}{_PyTuple_Resize}{PyObject **p, int newsize}
Fred Drake659ebfa2000-04-03 15:42:13 +00003510Can be used to resize a tuple. \var{newsize} will be the new length
3511of the tuple. Because tuples are \emph{supposed} to be immutable,
3512this should only be used if there is only one reference to the object.
3513Do \emph{not} use this if the tuple may already be known to some other
Fred Drakefac312f2001-05-29 15:13:00 +00003514part of the code. The tuple will always grow or shrink at the end.
3515Think of this as destroying the old tuple and creating a new one, only
3516more efficiently. Returns \code{0} on success. Client code should
3517never assume that the resulting value of \code{*\var{p}} will be the
3518same as before calling this function. If the object referenced by
3519\code{*\var{p}} is replaced, the original \code{*\var{p}} is
3520destroyed. On failure, returns \code{-1} and sets \code{*\var{p}} to
3521\NULL, and raises \exception{MemoryError} or \exception{SystemError}.
3522\versionchanged[Removed unused third parameter, \var{last_is_sticky}]{2.2}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003523\end{cfuncdesc}
3524
3525
Fred Drakeefd146c1999-02-15 15:30:45 +00003526\subsection{List Objects \label{listObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003527
Fred Drake659ebfa2000-04-03 15:42:13 +00003528\obindex{list}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003529\begin{ctypedesc}{PyListObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003530This subtype of \ctype{PyObject} represents a Python list object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003531\end{ctypedesc}
3532
3533\begin{cvardesc}{PyTypeObject}{PyList_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003534This instance of \ctype{PyTypeObject} represents the Python list
3535type. This is the same object as \code{types.ListType}.
3536\withsubitem{(in module types)}{\ttindex{ListType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003537\end{cvardesc}
3538
3539\begin{cfuncdesc}{int}{PyList_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003540Returns true if its argument is a \ctype{PyListObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003541\end{cfuncdesc}
3542
Fred Drake659ebfa2000-04-03 15:42:13 +00003543\begin{cfuncdesc}{PyObject*}{PyList_New}{int len}
3544Returns a new list of length \var{len} on success, or \NULL{} on
Guido van Rossum3c4378b1998-04-14 20:21:10 +00003545failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003546\end{cfuncdesc}
3547
Fred Drakec6fa34e1998-04-02 06:47:24 +00003548\begin{cfuncdesc}{int}{PyList_Size}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003549Returns the length of the list object in \var{list}; this is
3550equivalent to \samp{len(\var{list})} on a list object.
3551\bifuncindex{len}
3552\end{cfuncdesc}
3553
3554\begin{cfuncdesc}{int}{PyList_GET_SIZE}{PyObject *list}
Fred Drake5d644212000-10-07 12:31:50 +00003555Macro form of \cfunction{PyList_Size()} without error checking.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003556\end{cfuncdesc}
3557
Fred Drakec6fa34e1998-04-02 06:47:24 +00003558\begin{cfuncdesc}{PyObject*}{PyList_GetItem}{PyObject *list, int index}
Guido van Rossum44475131998-04-21 15:30:01 +00003559Returns the object at position \var{pos} in the list pointed
3560to by \var{p}. If \var{pos} is out of bounds, returns \NULL{} and
Fred Drake659ebfa2000-04-03 15:42:13 +00003561sets an \exception{IndexError} exception.
3562\end{cfuncdesc}
3563
3564\begin{cfuncdesc}{PyObject*}{PyList_GET_ITEM}{PyObject *list, int i}
3565Macro form of \cfunction{PyList_GetItem()} without error checking.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003566\end{cfuncdesc}
3567
Fred Drakec6fa34e1998-04-02 06:47:24 +00003568\begin{cfuncdesc}{int}{PyList_SetItem}{PyObject *list, int index,
3569 PyObject *item}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00003570Sets the item at index \var{index} in list to \var{item}.
Fred Drakebab29652001-07-10 16:10:08 +00003571Returns \code{0} on success or \code{-1} on failure.
Fred Drake00d0cb62001-06-03 03:12:57 +00003572\strong{Note:} This function ``steals'' a reference to \var{item} and
3573discards a reference to an item already in the list at the affected
3574position.
Fred Drake659ebfa2000-04-03 15:42:13 +00003575\end{cfuncdesc}
3576
Fred Drakebab29652001-07-10 16:10:08 +00003577\begin{cfuncdesc}{void}{PyList_SET_ITEM}{PyObject *list, int i,
Fred Drake659ebfa2000-04-03 15:42:13 +00003578 PyObject *o}
3579Macro form of \cfunction{PyList_SetItem()} without error checking.
Fred Drake00d0cb62001-06-03 03:12:57 +00003580\strong{Note:} This function ``steals'' a reference to \var{item},
3581and, unlike \cfunction{PyList_SetItem()}, does \emph{not} discard a
Fred Drakebab29652001-07-10 16:10:08 +00003582reference to any item that it being replaced; any reference in
3583\var{list} at position \var{i} will be leaked. This is normally only
3584used to fill in new lists where there is no previous content.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003585\end{cfuncdesc}
3586
Fred Drakec6fa34e1998-04-02 06:47:24 +00003587\begin{cfuncdesc}{int}{PyList_Insert}{PyObject *list, int index,
Guido van Rossum44475131998-04-21 15:30:01 +00003588 PyObject *item}
3589Inserts the item \var{item} into list \var{list} in front of index
Fred Drake659ebfa2000-04-03 15:42:13 +00003590\var{index}. Returns \code{0} if successful; returns \code{-1} and
3591raises an exception if unsuccessful. Analogous to
3592\code{\var{list}.insert(\var{index}, \var{item})}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003593\end{cfuncdesc}
3594
Fred Drakec6fa34e1998-04-02 06:47:24 +00003595\begin{cfuncdesc}{int}{PyList_Append}{PyObject *list, PyObject *item}
Guido van Rossum44475131998-04-21 15:30:01 +00003596Appends the object \var{item} at the end of list \var{list}. Returns
Fred Drake659ebfa2000-04-03 15:42:13 +00003597\code{0} if successful; returns \code{-1} and sets an exception if
3598unsuccessful. Analogous to \code{\var{list}.append(\var{item})}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003599\end{cfuncdesc}
3600
Fred Drakec6fa34e1998-04-02 06:47:24 +00003601\begin{cfuncdesc}{PyObject*}{PyList_GetSlice}{PyObject *list,
3602 int low, int high}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00003603Returns a list of the objects in \var{list} containing the objects
Guido van Rossum44475131998-04-21 15:30:01 +00003604\emph{between} \var{low} and \var{high}. Returns NULL and sets an
3605exception if unsuccessful.
Fred Drake659ebfa2000-04-03 15:42:13 +00003606Analogous to \code{\var{list}[\var{low}:\var{high}]}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003607\end{cfuncdesc}
3608
Fred Drakec6fa34e1998-04-02 06:47:24 +00003609\begin{cfuncdesc}{int}{PyList_SetSlice}{PyObject *list,
3610 int low, int high,
3611 PyObject *itemlist}
Fred Drake659ebfa2000-04-03 15:42:13 +00003612Sets the slice of \var{list} between \var{low} and \var{high} to the
3613contents of \var{itemlist}. Analogous to
3614\code{\var{list}[\var{low}:\var{high}] = \var{itemlist}}. Returns
3615\code{0} on success, \code{-1} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003616\end{cfuncdesc}
3617
Fred Drakec6fa34e1998-04-02 06:47:24 +00003618\begin{cfuncdesc}{int}{PyList_Sort}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003619Sorts the items of \var{list} in place. Returns \code{0} on success,
3620\code{-1} on failure. This is equivalent to
3621\samp{\var{list}.sort()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003622\end{cfuncdesc}
3623
Fred Drakec6fa34e1998-04-02 06:47:24 +00003624\begin{cfuncdesc}{int}{PyList_Reverse}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003625Reverses the items of \var{list} in place. Returns \code{0} on
3626success, \code{-1} on failure. This is the equivalent of
3627\samp{\var{list}.reverse()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003628\end{cfuncdesc}
3629
Fred Drakec6fa34e1998-04-02 06:47:24 +00003630\begin{cfuncdesc}{PyObject*}{PyList_AsTuple}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003631Returns a new tuple object containing the contents of \var{list};
3632equivalent to \samp{tuple(\var{list})}.\bifuncindex{tuple}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003633\end{cfuncdesc}
3634
3635
Fred Drakeefd146c1999-02-15 15:30:45 +00003636\section{Mapping Objects \label{mapObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003637
Fred Drake659ebfa2000-04-03 15:42:13 +00003638\obindex{mapping}
3639
3640
Fred Drakeefd146c1999-02-15 15:30:45 +00003641\subsection{Dictionary Objects \label{dictObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003642
Fred Drake659ebfa2000-04-03 15:42:13 +00003643\obindex{dictionary}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003644\begin{ctypedesc}{PyDictObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003645This subtype of \ctype{PyObject} represents a Python dictionary object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003646\end{ctypedesc}
3647
3648\begin{cvardesc}{PyTypeObject}{PyDict_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003649This instance of \ctype{PyTypeObject} represents the Python dictionary
3650type. This is exposed to Python programs as \code{types.DictType} and
3651\code{types.DictionaryType}.
3652\withsubitem{(in module types)}{\ttindex{DictType}\ttindex{DictionaryType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003653\end{cvardesc}
3654
3655\begin{cfuncdesc}{int}{PyDict_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003656Returns true if its argument is a \ctype{PyDictObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003657\end{cfuncdesc}
3658
Fred Drakec6fa34e1998-04-02 06:47:24 +00003659\begin{cfuncdesc}{PyObject*}{PyDict_New}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00003660Returns a new empty dictionary, or \NULL{} on failure.
3661\end{cfuncdesc}
3662
3663\begin{cfuncdesc}{void}{PyDict_Clear}{PyObject *p}
3664Empties an existing dictionary of all key-value pairs.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003665\end{cfuncdesc}
3666
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00003667\begin{cfuncdesc}{PyObject*}{PyDict_Copy}{PyObject *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00003668Returns a new dictionary that contains the same key-value pairs as p.
3669Empties an existing dictionary of all key-value pairs.
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00003670\end{cfuncdesc}
3671
Fred Drake659ebfa2000-04-03 15:42:13 +00003672\begin{cfuncdesc}{int}{PyDict_SetItem}{PyObject *p, PyObject *key,
3673 PyObject *val}
Fred Drakebab29652001-07-10 16:10:08 +00003674Inserts \var{value} into the dictionary \var{p} with a key of \var{key}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003675\var{key} must be hashable; if it isn't, \exception{TypeError} will be
3676raised.
Fred Drakebab29652001-07-10 16:10:08 +00003677Returns \code{0} on success or \code{-1} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003678\end{cfuncdesc}
3679
Fred Drake83e01bf2001-03-16 15:41:29 +00003680\begin{cfuncdesc}{int}{PyDict_SetItemString}{PyObject *p,
Fred Drakee5bf8b21998-02-12 21:22:28 +00003681 char *key,
3682 PyObject *val}
Fred Drakebab29652001-07-10 16:10:08 +00003683Inserts \var{value} into the dictionary \var{p} using \var{key}
Fred Drake1d158692000-06-18 05:21:21 +00003684as a key. \var{key} should be a \ctype{char*}. The key object is
Fred Drakee058b4f1998-02-16 06:15:35 +00003685created using \code{PyString_FromString(\var{key})}.
Fred Drakebab29652001-07-10 16:10:08 +00003686Returns \code{0} on success or \code{-1} on failure.
Fred Drake659ebfa2000-04-03 15:42:13 +00003687\ttindex{PyString_FromString()}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003688\end{cfuncdesc}
3689
Fred Drake659ebfa2000-04-03 15:42:13 +00003690\begin{cfuncdesc}{int}{PyDict_DelItem}{PyObject *p, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003691Removes the entry in dictionary \var{p} with key \var{key}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003692\var{key} must be hashable; if it isn't, \exception{TypeError} is
3693raised.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003694\end{cfuncdesc}
3695
Fred Drake659ebfa2000-04-03 15:42:13 +00003696\begin{cfuncdesc}{int}{PyDict_DelItemString}{PyObject *p, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003697Removes the entry in dictionary \var{p} which has a key
Fred Drake659ebfa2000-04-03 15:42:13 +00003698specified by the string \var{key}.
Fred Drakebab29652001-07-10 16:10:08 +00003699Returns \code{0} on success or \code{-1} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003700\end{cfuncdesc}
3701
Fred Drake659ebfa2000-04-03 15:42:13 +00003702\begin{cfuncdesc}{PyObject*}{PyDict_GetItem}{PyObject *p, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003703Returns the object from dictionary \var{p} which has a key
Guido van Rossum44475131998-04-21 15:30:01 +00003704\var{key}. Returns \NULL{} if the key \var{key} is not present, but
Fred Drake659ebfa2000-04-03 15:42:13 +00003705\emph{without} setting an exception.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003706\end{cfuncdesc}
3707
Fred Drake659ebfa2000-04-03 15:42:13 +00003708\begin{cfuncdesc}{PyObject*}{PyDict_GetItemString}{PyObject *p, char *key}
Fred Drakef8830d11998-04-23 14:06:01 +00003709This is the same as \cfunction{PyDict_GetItem()}, but \var{key} is
Fred Drake659ebfa2000-04-03 15:42:13 +00003710specified as a \ctype{char*}, rather than a \ctype{PyObject*}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003711\end{cfuncdesc}
3712
Fred Drake659ebfa2000-04-03 15:42:13 +00003713\begin{cfuncdesc}{PyObject*}{PyDict_Items}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003714Returns a \ctype{PyListObject} containing all the items
Guido van Rossum44475131998-04-21 15:30:01 +00003715from the dictionary, as in the dictinoary method \method{items()} (see
Fred Drakebe486461999-11-09 17:03:03 +00003716the \citetitle[../lib/lib.html]{Python Library Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003717\end{cfuncdesc}
3718
Fred Drake659ebfa2000-04-03 15:42:13 +00003719\begin{cfuncdesc}{PyObject*}{PyDict_Keys}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003720Returns a \ctype{PyListObject} containing all the keys
Guido van Rossum44475131998-04-21 15:30:01 +00003721from the dictionary, as in the dictionary method \method{keys()} (see the
Fred Drakebe486461999-11-09 17:03:03 +00003722\citetitle[../lib/lib.html]{Python Library Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003723\end{cfuncdesc}
3724
Fred Drake659ebfa2000-04-03 15:42:13 +00003725\begin{cfuncdesc}{PyObject*}{PyDict_Values}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003726Returns a \ctype{PyListObject} containing all the values
Guido van Rossum44475131998-04-21 15:30:01 +00003727from the dictionary \var{p}, as in the dictionary method
Fred Drakebe486461999-11-09 17:03:03 +00003728\method{values()} (see the \citetitle[../lib/lib.html]{Python Library
3729Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003730\end{cfuncdesc}
3731
Fred Drake659ebfa2000-04-03 15:42:13 +00003732\begin{cfuncdesc}{int}{PyDict_Size}{PyObject *p}
3733Returns the number of items in the dictionary. This is equivalent to
3734\samp{len(\var{p})} on a dictionary.\bifuncindex{len}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003735\end{cfuncdesc}
3736
Fred Drake83e01bf2001-03-16 15:41:29 +00003737\begin{cfuncdesc}{int}{PyDict_Next}{PyObject *p, int *ppos,
Fred Drake7d45d342000-08-11 17:07:32 +00003738 PyObject **pkey, PyObject **pvalue}
Fred Drake83e01bf2001-03-16 15:41:29 +00003739Iterate over all key-value pairs in the dictionary \var{p}. The
3740\ctype{int} referred to by \var{ppos} must be initialized to \code{0}
3741prior to the first call to this function to start the iteration; the
3742function returns true for each pair in the dictionary, and false once
3743all pairs have been reported. The parameters \var{pkey} and
3744\var{pvalue} should either point to \ctype{PyObject*} variables that
3745will be filled in with each key and value, respectively, or may be
Fred Drake8d00a0f2001-04-13 17:55:02 +00003746\NULL.
3747
Fred Drake83e01bf2001-03-16 15:41:29 +00003748For example:
Fred Drakee5bf8b21998-02-12 21:22:28 +00003749
Fred Drake83e01bf2001-03-16 15:41:29 +00003750\begin{verbatim}
3751PyObject *key, *value;
3752int pos = 0;
3753
3754while (PyDict_Next(self->dict, &pos, &key, &value)) {
3755 /* do something interesting with the values... */
3756 ...
3757}
3758\end{verbatim}
Fred Drake8d00a0f2001-04-13 17:55:02 +00003759
3760The dictionary \var{p} should not be mutated during iteration. It is
3761safe (since Python 2.1) to modify the values of the keys as you
3762iterate over the dictionary, for example:
3763
3764\begin{verbatim}
3765PyObject *key, *value;
3766int pos = 0;
3767
3768while (PyDict_Next(self->dict, &pos, &key, &value)) {
3769 int i = PyInt_AS_LONG(value) + 1;
3770 PyObject *o = PyInt_FromLong(i);
3771 if (o == NULL)
3772 return -1;
3773 if (PyDict_SetItem(self->dict, key, o) < 0) {
3774 Py_DECREF(o);
3775 return -1;
3776 }
3777 Py_DECREF(o);
3778}
3779\end{verbatim}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003780\end{cfuncdesc}
3781
3782
Fred Drakeefd146c1999-02-15 15:30:45 +00003783\section{Other Objects \label{otherObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003784
Fred Drakeefd146c1999-02-15 15:30:45 +00003785\subsection{File Objects \label{fileObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003786
Fred Drake659ebfa2000-04-03 15:42:13 +00003787\obindex{file}
3788Python's built-in file objects are implemented entirely on the
3789\ctype{FILE*} support from the C standard library. This is an
3790implementation detail and may change in future releases of Python.
3791
Fred Drakee5bf8b21998-02-12 21:22:28 +00003792\begin{ctypedesc}{PyFileObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003793This subtype of \ctype{PyObject} represents a Python file object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003794\end{ctypedesc}
3795
3796\begin{cvardesc}{PyTypeObject}{PyFile_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003797This instance of \ctype{PyTypeObject} represents the Python file
3798type. This is exposed to Python programs as \code{types.FileType}.
3799\withsubitem{(in module types)}{\ttindex{FileType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003800\end{cvardesc}
3801
3802\begin{cfuncdesc}{int}{PyFile_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003803Returns true if its argument is a \ctype{PyFileObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003804\end{cfuncdesc}
3805
Fred Drake659ebfa2000-04-03 15:42:13 +00003806\begin{cfuncdesc}{PyObject*}{PyFile_FromString}{char *filename, char *mode}
3807On success, returns a new file object that is opened on the
3808file given by \var{filename}, with a file mode given by \var{mode},
3809where \var{mode} has the same semantics as the standard C routine
3810\cfunction{fopen()}\ttindex{fopen()}. On failure, returns \NULL.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003811\end{cfuncdesc}
3812
Fred Drakec6fa34e1998-04-02 06:47:24 +00003813\begin{cfuncdesc}{PyObject*}{PyFile_FromFile}{FILE *fp,
Fred Drake659ebfa2000-04-03 15:42:13 +00003814 char *name, char *mode,
3815 int (*close)(FILE*)}
3816Creates a new \ctype{PyFileObject} from the already-open standard C
3817file pointer, \var{fp}. The function \var{close} will be called when
3818the file should be closed. Returns \NULL{} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003819\end{cfuncdesc}
3820
Fred Drake659ebfa2000-04-03 15:42:13 +00003821\begin{cfuncdesc}{FILE*}{PyFile_AsFile}{PyFileObject *p}
3822Returns the file object associated with \var{p} as a \ctype{FILE*}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003823\end{cfuncdesc}
3824
Fred Drakec6fa34e1998-04-02 06:47:24 +00003825\begin{cfuncdesc}{PyObject*}{PyFile_GetLine}{PyObject *p, int n}
Fred Drake659ebfa2000-04-03 15:42:13 +00003826Equivalent to \code{\var{p}.readline(\optional{\var{n}})}, this
3827function reads one line from the object \var{p}. \var{p} may be a
3828file object or any object with a \method{readline()} method. If
3829\var{n} is \code{0}, exactly one line is read, regardless of the
3830length of the line. If \var{n} is greater than \code{0}, no more than
3831\var{n} bytes will be read from the file; a partial line can be
3832returned. In both cases, an empty string is returned if the end of
3833the file is reached immediately. If \var{n} is less than \code{0},
3834however, one line is read regardless of length, but
3835\exception{EOFError} is raised if the end of the file is reached
3836immediately.
3837\withsubitem{(built-in exception)}{\ttindex{EOFError}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003838\end{cfuncdesc}
3839
Fred Drakec6fa34e1998-04-02 06:47:24 +00003840\begin{cfuncdesc}{PyObject*}{PyFile_Name}{PyObject *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00003841Returns the name of the file specified by \var{p} as a string object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003842\end{cfuncdesc}
3843
3844\begin{cfuncdesc}{void}{PyFile_SetBufSize}{PyFileObject *p, int n}
Fred Drake659ebfa2000-04-03 15:42:13 +00003845Available on systems with \cfunction{setvbuf()}\ttindex{setvbuf()}
3846only. This should only be called immediately after file object
3847creation.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003848\end{cfuncdesc}
3849
Fred Drake659ebfa2000-04-03 15:42:13 +00003850\begin{cfuncdesc}{int}{PyFile_SoftSpace}{PyObject *p, int newflag}
3851This function exists for internal use by the interpreter.
3852Sets the \member{softspace} attribute of \var{p} to \var{newflag} and
3853\withsubitem{(file attribute)}{\ttindex{softspace}}returns the
3854previous value. \var{p} does not have to be a file object
3855for this function to work properly; any object is supported (thought
3856its only interesting if the \member{softspace} attribute can be set).
3857This function clears any errors, and will return \code{0} as the
3858previous value if the attribute either does not exist or if there were
3859errors in retrieving it. There is no way to detect errors from this
3860function, but doing so should not be needed.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003861\end{cfuncdesc}
3862
Fred Drakec6fa34e1998-04-02 06:47:24 +00003863\begin{cfuncdesc}{int}{PyFile_WriteObject}{PyObject *obj, PyFileObject *p,
3864 int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00003865Writes object \var{obj} to file object \var{p}. The only supported
3866flag for \var{flags} is \constant{Py_PRINT_RAW}\ttindex{Py_PRINT_RAW};
3867if given, the \function{str()} of the object is written instead of the
3868\function{repr()}. Returns \code{0} on success or \code{-1} on
3869failure; the appropriate exception will be set.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003870\end{cfuncdesc}
3871
Fred Drakec6fa34e1998-04-02 06:47:24 +00003872\begin{cfuncdesc}{int}{PyFile_WriteString}{char *s, PyFileObject *p,
3873 int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00003874Writes string \var{s} to file object \var{p}. Returns \code{0} on
3875success or \code{-1} on failure; the appropriate exception will be
3876set.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003877\end{cfuncdesc}
3878
3879
Fred Drake5838d0f2001-01-28 06:39:35 +00003880\subsection{Instance Objects \label{instanceObjects}}
3881
3882\obindex{instance}
3883There are very few functions specific to instance objects.
3884
3885\begin{cvardesc}{PyTypeObject}{PyInstance_Type}
3886 Type object for class instances.
3887\end{cvardesc}
3888
3889\begin{cfuncdesc}{int}{PyInstance_Check}{PyObject *obj}
3890 Returns true if \var{obj} is an instance.
3891\end{cfuncdesc}
3892
3893\begin{cfuncdesc}{PyObject*}{PyInstance_New}{PyObject *class,
3894 PyObject *arg,
3895 PyObject *kw}
3896 Create a new instance of a specific class. The parameters \var{arg}
3897 and \var{kw} are used as the positional and keyword parameters to
3898 the object's constructor.
3899\end{cfuncdesc}
3900
3901\begin{cfuncdesc}{PyObject*}{PyInstance_NewRaw}{PyObject *class,
3902 PyObject *dict}
3903 Create a new instance of a specific class without calling it's
3904 constructor. \var{class} is the class of new object. The
3905 \var{dict} parameter will be used as the object's \member{__dict__};
3906 if \NULL, a new dictionary will be created for the instance.
3907\end{cfuncdesc}
3908
3909
Fred Drakeefd146c1999-02-15 15:30:45 +00003910\subsection{Module Objects \label{moduleObjects}}
3911
3912\obindex{module}
3913There are only a few functions special to module objects.
3914
Fred Drake659ebfa2000-04-03 15:42:13 +00003915\begin{cvardesc}{PyTypeObject}{PyModule_Type}
3916This instance of \ctype{PyTypeObject} represents the Python module
3917type. This is exposed to Python programs as \code{types.ModuleType}.
3918\withsubitem{(in module types)}{\ttindex{ModuleType}}
3919\end{cvardesc}
3920
3921\begin{cfuncdesc}{int}{PyModule_Check}{PyObject *p}
3922Returns true if its argument is a module object.
Fred Drakeefd146c1999-02-15 15:30:45 +00003923\end{cfuncdesc}
3924
Fred Drake659ebfa2000-04-03 15:42:13 +00003925\begin{cfuncdesc}{PyObject*}{PyModule_New}{char *name}
3926Return a new module object with the \member{__name__} attribute set to
3927\var{name}. Only the module's \member{__doc__} and
3928\member{__name__} attributes are filled in; the caller is responsible
3929for providing a \member{__file__} attribute.
3930\withsubitem{(module attribute)}{
3931 \ttindex{__name__}\ttindex{__doc__}\ttindex{__file__}}
3932\end{cfuncdesc}
3933
3934\begin{cfuncdesc}{PyObject*}{PyModule_GetDict}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00003935Return the dictionary object that implements \var{module}'s namespace;
3936this object is the same as the \member{__dict__} attribute of the
3937module object. This function never fails.
Fred Drake659ebfa2000-04-03 15:42:13 +00003938\withsubitem{(module attribute)}{\ttindex{__dict__}}
Fred Drakeefd146c1999-02-15 15:30:45 +00003939\end{cfuncdesc}
3940
Fred Drake659ebfa2000-04-03 15:42:13 +00003941\begin{cfuncdesc}{char*}{PyModule_GetName}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00003942Return \var{module}'s \member{__name__} value. If the module does not
Fred Drake659ebfa2000-04-03 15:42:13 +00003943provide one, or if it is not a string, \exception{SystemError} is
3944raised and \NULL{} is returned.
3945\withsubitem{(module attribute)}{\ttindex{__name__}}
3946\withsubitem{(built-in exception)}{\ttindex{SystemError}}
Fred Drakeefd146c1999-02-15 15:30:45 +00003947\end{cfuncdesc}
3948
Fred Drake659ebfa2000-04-03 15:42:13 +00003949\begin{cfuncdesc}{char*}{PyModule_GetFilename}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00003950Return the name of the file from which \var{module} was loaded using
3951\var{module}'s \member{__file__} attribute. If this is not defined,
Fred Drake659ebfa2000-04-03 15:42:13 +00003952or if it is not a string, raise \exception{SystemError} and return
3953\NULL.
3954\withsubitem{(module attribute)}{\ttindex{__file__}}
3955\withsubitem{(built-in exception)}{\ttindex{SystemError}}
Fred Drakeefd146c1999-02-15 15:30:45 +00003956\end{cfuncdesc}
3957
Fred Drake891150b2000-09-23 03:25:42 +00003958\begin{cfuncdesc}{int}{PyModule_AddObject}{PyObject *module,
3959 char *name, PyObject *value}
3960Add an object to \var{module} as \var{name}. This is a convenience
3961function which can be used from the module's initialization function.
3962This steals a reference to \var{value}. Returns \code{-1} on error,
3963\code{0} on success.
3964\versionadded{2.0}
3965\end{cfuncdesc}
3966
3967\begin{cfuncdesc}{int}{PyModule_AddIntConstant}{PyObject *module,
3968 char *name, int value}
3969Add an integer constant to \var{module} as \var{name}. This convenience
3970function can be used from the module's initialization function.
3971Returns \code{-1} on error, \code{0} on success.
3972\versionadded{2.0}
3973\end{cfuncdesc}
3974
3975\begin{cfuncdesc}{int}{PyModule_AddStringConstant}{PyObject *module,
3976 char *name, char *value}
3977Add a string constant to \var{module} as \var{name}. This convenience
3978function can be used from the module's initialization function. The
3979string \var{value} must be null-terminated. Returns \code{-1} on
3980error, \code{0} on success.
3981\versionadded{2.0}
3982\end{cfuncdesc}
3983
Fred Drakeefd146c1999-02-15 15:30:45 +00003984
3985\subsection{CObjects \label{cObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003986
Fred Drake659ebfa2000-04-03 15:42:13 +00003987\obindex{CObject}
3988Refer to \emph{Extending and Embedding the Python Interpreter},
3989section 1.12 (``Providing a C API for an Extension Module''), for more
3990information on using these objects.
3991
3992
Guido van Rossum44475131998-04-21 15:30:01 +00003993\begin{ctypedesc}{PyCObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003994This subtype of \ctype{PyObject} represents an opaque value, useful for
Fred Drake659ebfa2000-04-03 15:42:13 +00003995C extension modules who need to pass an opaque value (as a
3996\ctype{void*} pointer) through Python code to other C code. It is
Guido van Rossum44475131998-04-21 15:30:01 +00003997often used to make a C function pointer defined in one module
3998available to other modules, so the regular import mechanism can be
3999used to access C APIs defined in dynamically loaded modules.
4000\end{ctypedesc}
Fred Drakee5bf8b21998-02-12 21:22:28 +00004001
Fred Drake659ebfa2000-04-03 15:42:13 +00004002\begin{cfuncdesc}{int}{PyCObject_Check}{PyObject *p}
4003Returns true if its argument is a \ctype{PyCObject}.
4004\end{cfuncdesc}
4005
4006\begin{cfuncdesc}{PyObject*}{PyCObject_FromVoidPtr}{void* cobj,
Marc-André Lemburga544ea22001-01-17 18:04:31 +00004007 void (*destr)(void *)}
Fred Drake1d158692000-06-18 05:21:21 +00004008Creates a \ctype{PyCObject} from the \code{void *}\var{cobj}. The
Fred Drakedab44681999-05-13 18:41:14 +00004009\var{destr} function will be called when the object is reclaimed, unless
4010it is \NULL.
Guido van Rossum44475131998-04-21 15:30:01 +00004011\end{cfuncdesc}
4012
Fred Drake659ebfa2000-04-03 15:42:13 +00004013\begin{cfuncdesc}{PyObject*}{PyCObject_FromVoidPtrAndDesc}{void* cobj,
Marc-André Lemburga544ea22001-01-17 18:04:31 +00004014 void* desc, void (*destr)(void *, void *) }
Fred Drakef8830d11998-04-23 14:06:01 +00004015Creates a \ctype{PyCObject} from the \ctype{void *}\var{cobj}. The
4016\var{destr} function will be called when the object is reclaimed. The
4017\var{desc} argument can be used to pass extra callback data for the
4018destructor function.
Guido van Rossum44475131998-04-21 15:30:01 +00004019\end{cfuncdesc}
4020
Fred Drake659ebfa2000-04-03 15:42:13 +00004021\begin{cfuncdesc}{void*}{PyCObject_AsVoidPtr}{PyObject* self}
4022Returns the object \ctype{void *} that the
4023\ctype{PyCObject} \var{self} was created with.
Guido van Rossum44475131998-04-21 15:30:01 +00004024\end{cfuncdesc}
4025
Fred Drake659ebfa2000-04-03 15:42:13 +00004026\begin{cfuncdesc}{void*}{PyCObject_GetDesc}{PyObject* self}
4027Returns the description \ctype{void *} that the
4028\ctype{PyCObject} \var{self} was created with.
Guido van Rossum44475131998-04-21 15:30:01 +00004029\end{cfuncdesc}
Fred Drakee5bf8b21998-02-12 21:22:28 +00004030
Fred Drake659ebfa2000-04-03 15:42:13 +00004031
Fred Drakeefd146c1999-02-15 15:30:45 +00004032\chapter{Initialization, Finalization, and Threads
4033 \label{initialization}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004034
Guido van Rossum4a944d71997-08-14 20:35:38 +00004035\begin{cfuncdesc}{void}{Py_Initialize}{}
4036Initialize the Python interpreter. In an application embedding
4037Python, this should be called before using any other Python/C API
Fred Drake659ebfa2000-04-03 15:42:13 +00004038functions; with the exception of
4039\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()},
4040\cfunction{PyEval_InitThreads()}\ttindex{PyEval_InitThreads()},
4041\cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()},
4042and \cfunction{PyEval_AcquireLock()}\ttindex{PyEval_AcquireLock()}.
4043This initializes the table of loaded modules (\code{sys.modules}), and
4044\withsubitem{(in module sys)}{\ttindex{modules}\ttindex{path}}creates the
4045fundamental modules \module{__builtin__}\refbimodindex{__builtin__},
Fred Drake4de05a91998-02-16 14:25:26 +00004046\module{__main__}\refbimodindex{__main__} and
4047\module{sys}\refbimodindex{sys}. It also initializes the module
Fred Drake659ebfa2000-04-03 15:42:13 +00004048search\indexiii{module}{search}{path} path (\code{sys.path}).
4049It does not set \code{sys.argv}; use
4050\cfunction{PySys_SetArgv()}\ttindex{PySys_SetArgv()} for that. This
4051is a no-op when called for a second time (without calling
4052\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} first). There is no
4053return value; it is a fatal error if the initialization fails.
Guido van Rossum42cefd01997-10-05 15:27:29 +00004054\end{cfuncdesc}
4055
4056\begin{cfuncdesc}{int}{Py_IsInitialized}{}
Guido van Rossum42cefd01997-10-05 15:27:29 +00004057Return true (nonzero) when the Python interpreter has been
Fred Drakee058b4f1998-02-16 06:15:35 +00004058initialized, false (zero) if not. After \cfunction{Py_Finalize()} is
4059called, this returns false until \cfunction{Py_Initialize()} is called
Guido van Rossum42cefd01997-10-05 15:27:29 +00004060again.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004061\end{cfuncdesc}
4062
4063\begin{cfuncdesc}{void}{Py_Finalize}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00004064Undo all initializations made by \cfunction{Py_Initialize()} and
4065subsequent use of Python/C API functions, and destroy all
4066sub-interpreters (see \cfunction{Py_NewInterpreter()} below) that were
4067created and not yet destroyed since the last call to
4068\cfunction{Py_Initialize()}. Ideally, this frees all memory allocated
4069by the Python interpreter. This is a no-op when called for a second
4070time (without calling \cfunction{Py_Initialize()} again first). There
4071is no return value; errors during finalization are ignored.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004072
4073This function is provided for a number of reasons. An embedding
4074application might want to restart Python without having to restart the
4075application itself. An application that has loaded the Python
4076interpreter from a dynamically loadable library (or DLL) might want to
4077free all memory allocated by Python before unloading the DLL. During a
4078hunt for memory leaks in an application a developer might want to free
4079all memory allocated by Python before exiting from the application.
4080
Fred Drakee058b4f1998-02-16 06:15:35 +00004081\strong{Bugs and caveats:} The destruction of modules and objects in
Guido van Rossum4a944d71997-08-14 20:35:38 +00004082modules is done in random order; this may cause destructors
Fred Drakee058b4f1998-02-16 06:15:35 +00004083(\method{__del__()} methods) to fail when they depend on other objects
Guido van Rossum4a944d71997-08-14 20:35:38 +00004084(even functions) or modules. Dynamically loaded extension modules
4085loaded by Python are not unloaded. Small amounts of memory allocated
4086by the Python interpreter may not be freed (if you find a leak, please
4087report it). Memory tied up in circular references between objects is
4088not freed. Some memory allocated by extension modules may not be
4089freed. Some extension may not work properly if their initialization
4090routine is called more than once; this can happen if an applcation
Fred Drakee058b4f1998-02-16 06:15:35 +00004091calls \cfunction{Py_Initialize()} and \cfunction{Py_Finalize()} more
4092than once.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004093\end{cfuncdesc}
4094
Fred Drakec6fa34e1998-04-02 06:47:24 +00004095\begin{cfuncdesc}{PyThreadState*}{Py_NewInterpreter}{}
Fred Drake4de05a91998-02-16 14:25:26 +00004096Create a new sub-interpreter. This is an (almost) totally separate
4097environment for the execution of Python code. In particular, the new
4098interpreter has separate, independent versions of all imported
4099modules, including the fundamental modules
4100\module{__builtin__}\refbimodindex{__builtin__},
4101\module{__main__}\refbimodindex{__main__} and
4102\module{sys}\refbimodindex{sys}. The table of loaded modules
4103(\code{sys.modules}) and the module search path (\code{sys.path}) are
4104also separate. The new environment has no \code{sys.argv} variable.
4105It has new standard I/O stream file objects \code{sys.stdin},
4106\code{sys.stdout} and \code{sys.stderr} (however these refer to the
Fred Drake659ebfa2000-04-03 15:42:13 +00004107same underlying \ctype{FILE} structures in the C library).
4108\withsubitem{(in module sys)}{
4109 \ttindex{stdout}\ttindex{stderr}\ttindex{stdin}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004110
4111The return value points to the first thread state created in the new
4112sub-interpreter. This thread state is made the current thread state.
4113Note that no actual thread is created; see the discussion of thread
4114states below. If creation of the new interpreter is unsuccessful,
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004115\NULL{} is returned; no exception is set since the exception state
Guido van Rossum4a944d71997-08-14 20:35:38 +00004116is stored in the current thread state and there may not be a current
4117thread state. (Like all other Python/C API functions, the global
4118interpreter lock must be held before calling this function and is
4119still held when it returns; however, unlike most other Python/C API
4120functions, there needn't be a current thread state on entry.)
4121
4122Extension modules are shared between (sub-)interpreters as follows:
4123the first time a particular extension is imported, it is initialized
4124normally, and a (shallow) copy of its module's dictionary is
4125squirreled away. When the same extension is imported by another
4126(sub-)interpreter, a new module is initialized and filled with the
Fred Drakee058b4f1998-02-16 06:15:35 +00004127contents of this copy; the extension's \code{init} function is not
4128called. Note that this is different from what happens when an
4129extension is imported after the interpreter has been completely
Fred Drake659ebfa2000-04-03 15:42:13 +00004130re-initialized by calling
4131\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} and
4132\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}; in that case,
4133the extension's \code{init\var{module}} function \emph{is} called
4134again.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004135
Fred Drakee058b4f1998-02-16 06:15:35 +00004136\strong{Bugs and caveats:} Because sub-interpreters (and the main
Guido van Rossum4a944d71997-08-14 20:35:38 +00004137interpreter) are part of the same process, the insulation between them
Fred Drakee058b4f1998-02-16 06:15:35 +00004138isn't perfect --- for example, using low-level file operations like
Fred Drake659ebfa2000-04-03 15:42:13 +00004139\withsubitem{(in module os)}{\ttindex{close()}}
Fred Drakef8830d11998-04-23 14:06:01 +00004140\function{os.close()} they can (accidentally or maliciously) affect each
Guido van Rossum4a944d71997-08-14 20:35:38 +00004141other's open files. Because of the way extensions are shared between
4142(sub-)interpreters, some extensions may not work properly; this is
4143especially likely when the extension makes use of (static) global
4144variables, or when the extension manipulates its module's dictionary
4145after its initialization. It is possible to insert objects created in
4146one sub-interpreter into a namespace of another sub-interpreter; this
4147should be done with great care to avoid sharing user-defined
4148functions, methods, instances or classes between sub-interpreters,
4149since import operations executed by such objects may affect the
4150wrong (sub-)interpreter's dictionary of loaded modules. (XXX This is
4151a hard-to-fix bug that will be addressed in a future release.)
4152\end{cfuncdesc}
4153
4154\begin{cfuncdesc}{void}{Py_EndInterpreter}{PyThreadState *tstate}
4155Destroy the (sub-)interpreter represented by the given thread state.
4156The given thread state must be the current thread state. See the
4157discussion of thread states below. When the call returns, the current
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004158thread state is \NULL{}. All thread states associated with this
Guido van Rossum4a944d71997-08-14 20:35:38 +00004159interpreted are destroyed. (The global interpreter lock must be held
4160before calling this function and is still held when it returns.)
Fred Drake659ebfa2000-04-03 15:42:13 +00004161\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} will destroy all
4162sub-interpreters that haven't been explicitly destroyed at that point.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004163\end{cfuncdesc}
4164
4165\begin{cfuncdesc}{void}{Py_SetProgramName}{char *name}
Fred Drake659ebfa2000-04-03 15:42:13 +00004166This function should be called before
4167\cfunction{Py_Initialize()}\ttindex{Py_Initialize()} is called
Guido van Rossum4a944d71997-08-14 20:35:38 +00004168for the first time, if it is called at all. It tells the interpreter
Fred Drake659ebfa2000-04-03 15:42:13 +00004169the value of the \code{argv[0]} argument to the
4170\cfunction{main()}\ttindex{main()} function of the program. This is
4171used by \cfunction{Py_GetPath()}\ttindex{Py_GetPath()} and some other
Guido van Rossum4a944d71997-08-14 20:35:38 +00004172functions below to find the Python run-time libraries relative to the
Fred Drakea8455ab2000-06-16 19:58:42 +00004173interpreter executable. The default value is \code{'python'}. The
Guido van Rossum4a944d71997-08-14 20:35:38 +00004174argument should point to a zero-terminated character string in static
4175storage whose contents will not change for the duration of the
4176program's execution. No code in the Python interpreter will change
4177the contents of this storage.
4178\end{cfuncdesc}
4179
Fred Drakec6fa34e1998-04-02 06:47:24 +00004180\begin{cfuncdesc}{char*}{Py_GetProgramName}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00004181Return the program name set with
4182\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()}, or the
Guido van Rossum4a944d71997-08-14 20:35:38 +00004183default. The returned string points into static storage; the caller
4184should not modify its value.
4185\end{cfuncdesc}
4186
Fred Drakec6fa34e1998-04-02 06:47:24 +00004187\begin{cfuncdesc}{char*}{Py_GetPrefix}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00004188Return the \emph{prefix} for installed platform-independent files. This
Guido van Rossum4a944d71997-08-14 20:35:38 +00004189is derived through a number of complicated rules from the program name
Fred Drakee058b4f1998-02-16 06:15:35 +00004190set with \cfunction{Py_SetProgramName()} and some environment variables;
Fred Drakea8455ab2000-06-16 19:58:42 +00004191for example, if the program name is \code{'/usr/local/bin/python'},
4192the prefix is \code{'/usr/local'}. The returned string points into
Guido van Rossum4a944d71997-08-14 20:35:38 +00004193static storage; the caller should not modify its value. This
Fred Drakec94d9341998-04-12 02:39:13 +00004194corresponds to the \makevar{prefix} variable in the top-level
Fred Drakea8455ab2000-06-16 19:58:42 +00004195\file{Makefile} and the \longprogramopt{prefix} argument to the
Fred Drakee058b4f1998-02-16 06:15:35 +00004196\program{configure} script at build time. The value is available to
Fred Drakeb0a78731998-01-13 18:51:10 +00004197Python code as \code{sys.prefix}. It is only useful on \UNIX{}. See
Guido van Rossum4a944d71997-08-14 20:35:38 +00004198also the next function.
4199\end{cfuncdesc}
4200
Fred Drakec6fa34e1998-04-02 06:47:24 +00004201\begin{cfuncdesc}{char*}{Py_GetExecPrefix}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00004202Return the \emph{exec-prefix} for installed platform-\emph{de}pendent
Guido van Rossum4a944d71997-08-14 20:35:38 +00004203files. This is derived through a number of complicated rules from the
Fred Drakee058b4f1998-02-16 06:15:35 +00004204program name set with \cfunction{Py_SetProgramName()} and some environment
Guido van Rossum4a944d71997-08-14 20:35:38 +00004205variables; for example, if the program name is
Fred Drakea8455ab2000-06-16 19:58:42 +00004206\code{'/usr/local/bin/python'}, the exec-prefix is
4207\code{'/usr/local'}. The returned string points into static storage;
Guido van Rossum4a944d71997-08-14 20:35:38 +00004208the caller should not modify its value. This corresponds to the
Fred Drakec94d9341998-04-12 02:39:13 +00004209\makevar{exec_prefix} variable in the top-level \file{Makefile} and the
Fred Drakea8455ab2000-06-16 19:58:42 +00004210\longprogramopt{exec-prefix} argument to the
Fred Drake310ee611999-11-09 17:31:42 +00004211\program{configure} script at build time. The value is available to
4212Python code as \code{sys.exec_prefix}. It is only useful on \UNIX{}.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004213
4214Background: The exec-prefix differs from the prefix when platform
4215dependent files (such as executables and shared libraries) are
4216installed in a different directory tree. In a typical installation,
4217platform dependent files may be installed in the
Fred Drakea8455ab2000-06-16 19:58:42 +00004218\file{/usr/local/plat} subtree while platform independent may be
4219installed in \file{/usr/local}.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004220
4221Generally speaking, a platform is a combination of hardware and
4222software families, e.g. Sparc machines running the Solaris 2.x
4223operating system are considered the same platform, but Intel machines
4224running Solaris 2.x are another platform, and Intel machines running
4225Linux are yet another platform. Different major revisions of the same
Fred Drakeb0a78731998-01-13 18:51:10 +00004226operating system generally also form different platforms. Non-\UNIX{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004227operating systems are a different story; the installation strategies
4228on those systems are so different that the prefix and exec-prefix are
4229meaningless, and set to the empty string. Note that compiled Python
4230bytecode files are platform independent (but not independent from the
4231Python version by which they were compiled!).
4232
Fred Drakee058b4f1998-02-16 06:15:35 +00004233System administrators will know how to configure the \program{mount} or
Fred Drakea8455ab2000-06-16 19:58:42 +00004234\program{automount} programs to share \file{/usr/local} between platforms
4235while having \file{/usr/local/plat} be a different filesystem for each
Guido van Rossum4a944d71997-08-14 20:35:38 +00004236platform.
4237\end{cfuncdesc}
4238
Fred Drakec6fa34e1998-04-02 06:47:24 +00004239\begin{cfuncdesc}{char*}{Py_GetProgramFullPath}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004240Return the full program name of the Python executable; this is
4241computed as a side-effect of deriving the default module search path
Fred Drake659ebfa2000-04-03 15:42:13 +00004242from the program name (set by
4243\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()} above).
4244The returned string points into static storage; the caller should not
Guido van Rossum4a944d71997-08-14 20:35:38 +00004245modify its value. The value is available to Python code as
Guido van Rossum42cefd01997-10-05 15:27:29 +00004246\code{sys.executable}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004247\withsubitem{(in module sys)}{\ttindex{executable}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004248\end{cfuncdesc}
4249
Fred Drakec6fa34e1998-04-02 06:47:24 +00004250\begin{cfuncdesc}{char*}{Py_GetPath}{}
Fred Drake4de05a91998-02-16 14:25:26 +00004251\indexiii{module}{search}{path}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004252Return the default module search path; this is computed from the
Fred Drakee058b4f1998-02-16 06:15:35 +00004253program name (set by \cfunction{Py_SetProgramName()} above) and some
Guido van Rossum4a944d71997-08-14 20:35:38 +00004254environment variables. The returned string consists of a series of
4255directory names separated by a platform dependent delimiter character.
Fred Drakef8830d11998-04-23 14:06:01 +00004256The delimiter character is \character{:} on \UNIX{}, \character{;} on
Fred Drake659ebfa2000-04-03 15:42:13 +00004257DOS/Windows, and \character{\e n} (the \ASCII{} newline character) on
Fred Drakee5bc4971998-02-12 23:36:49 +00004258Macintosh. The returned string points into static storage; the caller
Guido van Rossum4a944d71997-08-14 20:35:38 +00004259should not modify its value. The value is available to Python code
Fred Drake659ebfa2000-04-03 15:42:13 +00004260as the list \code{sys.path}\withsubitem{(in module sys)}{\ttindex{path}},
4261which may be modified to change the future search path for loaded
4262modules.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004263
4264% XXX should give the exact rules
4265\end{cfuncdesc}
4266
Fred Drakec6fa34e1998-04-02 06:47:24 +00004267\begin{cfuncdesc}{const char*}{Py_GetVersion}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004268Return the version of this Python interpreter. This is a string that
4269looks something like
4270
Guido van Rossum09270b51997-08-15 18:57:32 +00004271\begin{verbatim}
Fred Drakee058b4f1998-02-16 06:15:35 +00004272"1.5 (#67, Dec 31 1997, 22:34:28) [GCC 2.7.2.2]"
Guido van Rossum09270b51997-08-15 18:57:32 +00004273\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004274
4275The first word (up to the first space character) is the current Python
4276version; the first three characters are the major and minor version
4277separated by a period. The returned string points into static storage;
4278the caller should not modify its value. The value is available to
4279Python code as the list \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004280\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004281\end{cfuncdesc}
4282
Fred Drakec6fa34e1998-04-02 06:47:24 +00004283\begin{cfuncdesc}{const char*}{Py_GetPlatform}{}
Fred Drakeb0a78731998-01-13 18:51:10 +00004284Return the platform identifier for the current platform. On \UNIX{},
Guido van Rossum4a944d71997-08-14 20:35:38 +00004285this is formed from the ``official'' name of the operating system,
4286converted to lower case, followed by the major revision number; e.g.,
4287for Solaris 2.x, which is also known as SunOS 5.x, the value is
Fred Drakea8455ab2000-06-16 19:58:42 +00004288\code{'sunos5'}. On Macintosh, it is \code{'mac'}. On Windows, it
4289is \code{'win'}. The returned string points into static storage;
Guido van Rossum4a944d71997-08-14 20:35:38 +00004290the caller should not modify its value. The value is available to
4291Python code as \code{sys.platform}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004292\withsubitem{(in module sys)}{\ttindex{platform}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004293\end{cfuncdesc}
4294
Fred Drakec6fa34e1998-04-02 06:47:24 +00004295\begin{cfuncdesc}{const char*}{Py_GetCopyright}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004296Return the official copyright string for the current Python version,
4297for example
4298
Fred Drakea8455ab2000-06-16 19:58:42 +00004299\code{'Copyright 1991-1995 Stichting Mathematisch Centrum, Amsterdam'}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004300
4301The returned string points into static storage; the caller should not
4302modify its value. The value is available to Python code as the list
4303\code{sys.copyright}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004304\withsubitem{(in module sys)}{\ttindex{copyright}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004305\end{cfuncdesc}
4306
Fred Drakec6fa34e1998-04-02 06:47:24 +00004307\begin{cfuncdesc}{const char*}{Py_GetCompiler}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004308Return an indication of the compiler used to build the current Python
Fred Drakee058b4f1998-02-16 06:15:35 +00004309version, in square brackets, for example:
Guido van Rossum4a944d71997-08-14 20:35:38 +00004310
Fred Drakee058b4f1998-02-16 06:15:35 +00004311\begin{verbatim}
4312"[GCC 2.7.2.2]"
4313\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004314
4315The returned string points into static storage; the caller should not
4316modify its value. The value is available to Python code as part of
4317the variable \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004318\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004319\end{cfuncdesc}
4320
Fred Drakec6fa34e1998-04-02 06:47:24 +00004321\begin{cfuncdesc}{const char*}{Py_GetBuildInfo}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004322Return information about the sequence number and build date and time
4323of the current Python interpreter instance, for example
4324
Guido van Rossum09270b51997-08-15 18:57:32 +00004325\begin{verbatim}
4326"#67, Aug 1 1997, 22:34:28"
4327\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004328
4329The returned string points into static storage; the caller should not
4330modify its value. The value is available to Python code as part of
4331the variable \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004332\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004333\end{cfuncdesc}
4334
4335\begin{cfuncdesc}{int}{PySys_SetArgv}{int argc, char **argv}
Fred Drake659ebfa2000-04-03 15:42:13 +00004336Set \code{sys.argv} based on \var{argc} and \var{argv}. These
4337parameters are similar to those passed to the program's
4338\cfunction{main()}\ttindex{main()} function with the difference that
4339the first entry should refer to the script file to be executed rather
4340than the executable hosting the Python interpreter. If there isn't a
4341script that will be run, the first entry in \var{argv} can be an empty
4342string. If this function fails to initialize \code{sys.argv}, a fatal
4343condition is signalled using
4344\cfunction{Py_FatalError()}\ttindex{Py_FatalError()}.
4345\withsubitem{(in module sys)}{\ttindex{argv}}
4346% XXX impl. doesn't seem consistent in allowing 0/NULL for the params;
4347% check w/ Guido.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004348\end{cfuncdesc}
4349
4350% XXX Other PySys thingies (doesn't really belong in this chapter)
4351
Fred Drakeefd146c1999-02-15 15:30:45 +00004352\section{Thread State and the Global Interpreter Lock
4353 \label{threads}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004354
Fred Drake659ebfa2000-04-03 15:42:13 +00004355\index{global interpreter lock}
4356\index{interpreter lock}
4357\index{lock, interpreter}
4358
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004359The Python interpreter is not fully thread safe. In order to support
4360multi-threaded Python programs, there's a global lock that must be
4361held by the current thread before it can safely access Python objects.
4362Without the lock, even the simplest operations could cause problems in
Fred Drake7baf3d41998-02-20 00:45:52 +00004363a multi-threaded program: for example, when two threads simultaneously
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004364increment the reference count of the same object, the reference count
4365could end up being incremented only once instead of twice.
4366
4367Therefore, the rule exists that only the thread that has acquired the
4368global interpreter lock may operate on Python objects or call Python/C
4369API functions. In order to support multi-threaded Python programs,
Fred Drake659ebfa2000-04-03 15:42:13 +00004370the interpreter regularly releases and reacquires the lock --- by
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004371default, every ten bytecode instructions (this can be changed with
Fred Drake659ebfa2000-04-03 15:42:13 +00004372\withsubitem{(in module sys)}{\ttindex{setcheckinterval()}}
Fred Drakee058b4f1998-02-16 06:15:35 +00004373\function{sys.setcheckinterval()}). The lock is also released and
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004374reacquired around potentially blocking I/O operations like reading or
4375writing a file, so that other threads can run while the thread that
4376requests the I/O is waiting for the I/O operation to complete.
4377
4378The Python interpreter needs to keep some bookkeeping information
Fred Drakee058b4f1998-02-16 06:15:35 +00004379separate per thread --- for this it uses a data structure called
Fred Drake659ebfa2000-04-03 15:42:13 +00004380\ctype{PyThreadState}\ttindex{PyThreadState}. This is new in Python
43811.5; in earlier versions, such state was stored in global variables,
4382and switching threads could cause problems. In particular, exception
4383handling is now thread safe, when the application uses
4384\withsubitem{(in module sys)}{\ttindex{exc_info()}}
4385\function{sys.exc_info()} to access the exception last raised in the
4386current thread.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004387
4388There's one global variable left, however: the pointer to the current
Fred Drake659ebfa2000-04-03 15:42:13 +00004389\ctype{PyThreadState}\ttindex{PyThreadState} structure. While most
4390thread packages have a way to store ``per-thread global data,''
4391Python's internal platform independent thread abstraction doesn't
4392support this yet. Therefore, the current thread state must be
4393manipulated explicitly.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004394
4395This is easy enough in most cases. Most code manipulating the global
4396interpreter lock has the following simple structure:
4397
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004398\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004399Save the thread state in a local variable.
4400Release the interpreter lock.
4401...Do some blocking I/O operation...
4402Reacquire the interpreter lock.
4403Restore the thread state from the local variable.
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004404\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004405
4406This is so common that a pair of macros exists to simplify it:
4407
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004408\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004409Py_BEGIN_ALLOW_THREADS
4410...Do some blocking I/O operation...
4411Py_END_ALLOW_THREADS
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004412\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004413
Fred Drake659ebfa2000-04-03 15:42:13 +00004414The \code{Py_BEGIN_ALLOW_THREADS}\ttindex{Py_BEGIN_ALLOW_THREADS} macro
4415opens a new block and declares a hidden local variable; the
4416\code{Py_END_ALLOW_THREADS}\ttindex{Py_END_ALLOW_THREADS} macro closes
Fred Drakee058b4f1998-02-16 06:15:35 +00004417the block. Another advantage of using these two macros is that when
4418Python is compiled without thread support, they are defined empty,
4419thus saving the thread state and lock manipulations.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004420
4421When thread support is enabled, the block above expands to the
4422following code:
4423
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004424\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004425 PyThreadState *_save;
Fred Drake659ebfa2000-04-03 15:42:13 +00004426
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004427 _save = PyEval_SaveThread();
4428 ...Do some blocking I/O operation...
4429 PyEval_RestoreThread(_save);
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004430\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004431
4432Using even lower level primitives, we can get roughly the same effect
4433as follows:
4434
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004435\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004436 PyThreadState *_save;
Fred Drake659ebfa2000-04-03 15:42:13 +00004437
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004438 _save = PyThreadState_Swap(NULL);
4439 PyEval_ReleaseLock();
4440 ...Do some blocking I/O operation...
4441 PyEval_AcquireLock();
4442 PyThreadState_Swap(_save);
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004443\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004444
4445There are some subtle differences; in particular,
Fred Drake659ebfa2000-04-03 15:42:13 +00004446\cfunction{PyEval_RestoreThread()}\ttindex{PyEval_RestoreThread()} saves
4447and restores the value of the global variable
4448\cdata{errno}\ttindex{errno}, since the lock manipulation does not
Fred Drakef8830d11998-04-23 14:06:01 +00004449guarantee that \cdata{errno} is left alone. Also, when thread support
Fred Drake659ebfa2000-04-03 15:42:13 +00004450is disabled,
4451\cfunction{PyEval_SaveThread()}\ttindex{PyEval_SaveThread()} and
Fred Drakee058b4f1998-02-16 06:15:35 +00004452\cfunction{PyEval_RestoreThread()} don't manipulate the lock; in this
Fred Drake659ebfa2000-04-03 15:42:13 +00004453case, \cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()} and
4454\cfunction{PyEval_AcquireLock()}\ttindex{PyEval_AcquireLock()} are not
4455available. This is done so that dynamically loaded extensions
4456compiled with thread support enabled can be loaded by an interpreter
4457that was compiled with disabled thread support.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004458
4459The global interpreter lock is used to protect the pointer to the
4460current thread state. When releasing the lock and saving the thread
4461state, the current thread state pointer must be retrieved before the
4462lock is released (since another thread could immediately acquire the
4463lock and store its own thread state in the global variable).
Fred Drakeffe58ca2000-09-29 17:31:54 +00004464Conversely, when acquiring the lock and restoring the thread state,
4465the lock must be acquired before storing the thread state pointer.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004466
4467Why am I going on with so much detail about this? Because when
Fred Drake659ebfa2000-04-03 15:42:13 +00004468threads are created from C, they don't have the global interpreter
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004469lock, nor is there a thread state data structure for them. Such
4470threads must bootstrap themselves into existence, by first creating a
4471thread state data structure, then acquiring the lock, and finally
4472storing their thread state pointer, before they can start using the
4473Python/C API. When they are done, they should reset the thread state
4474pointer, release the lock, and finally free their thread state data
4475structure.
4476
4477When creating a thread data structure, you need to provide an
4478interpreter state data structure. The interpreter state data
4479structure hold global data that is shared by all threads in an
4480interpreter, for example the module administration
4481(\code{sys.modules}). Depending on your needs, you can either create
4482a new interpreter state data structure, or share the interpreter state
4483data structure used by the Python main thread (to access the latter,
Fred Drakef8830d11998-04-23 14:06:01 +00004484you must obtain the thread state and access its \member{interp} member;
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004485this must be done by a thread that is created by Python or by the main
4486thread after Python is initialized).
4487
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004488
4489\begin{ctypedesc}{PyInterpreterState}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004490This data structure represents the state shared by a number of
4491cooperating threads. Threads belonging to the same interpreter
4492share their module administration and a few other internal items.
4493There are no public members in this structure.
4494
4495Threads belonging to different interpreters initially share nothing,
4496except process state like available memory, open file descriptors and
4497such. The global interpreter lock is also shared by all threads,
4498regardless of to which interpreter they belong.
4499\end{ctypedesc}
4500
4501\begin{ctypedesc}{PyThreadState}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004502This data structure represents the state of a single thread. The only
Fred Drakef8830d11998-04-23 14:06:01 +00004503public data member is \ctype{PyInterpreterState *}\member{interp},
4504which points to this thread's interpreter state.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004505\end{ctypedesc}
4506
4507\begin{cfuncdesc}{void}{PyEval_InitThreads}{}
4508Initialize and acquire the global interpreter lock. It should be
4509called in the main thread before creating a second thread or engaging
Fred Drakee058b4f1998-02-16 06:15:35 +00004510in any other thread operations such as
Fred Drake659ebfa2000-04-03 15:42:13 +00004511\cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()} or
4512\code{PyEval_ReleaseThread(\var{tstate})}\ttindex{PyEval_ReleaseThread()}.
4513It is not needed before calling
4514\cfunction{PyEval_SaveThread()}\ttindex{PyEval_SaveThread()} or
4515\cfunction{PyEval_RestoreThread()}\ttindex{PyEval_RestoreThread()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004516
4517This is a no-op when called for a second time. It is safe to call
Fred Drake659ebfa2000-04-03 15:42:13 +00004518this function before calling
4519\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004520
4521When only the main thread exists, no lock operations are needed. This
4522is a common situation (most Python programs do not use threads), and
4523the lock operations slow the interpreter down a bit. Therefore, the
4524lock is not created initially. This situation is equivalent to having
4525acquired the lock: when there is only a single thread, all object
4526accesses are safe. Therefore, when this function initializes the
Fred Drake4de05a91998-02-16 14:25:26 +00004527lock, it also acquires it. Before the Python
4528\module{thread}\refbimodindex{thread} module creates a new thread,
4529knowing that either it has the lock or the lock hasn't been created
4530yet, it calls \cfunction{PyEval_InitThreads()}. When this call
4531returns, it is guaranteed that the lock has been created and that it
4532has acquired it.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004533
4534It is \strong{not} safe to call this function when it is unknown which
4535thread (if any) currently has the global interpreter lock.
4536
4537This function is not available when thread support is disabled at
4538compile time.
4539\end{cfuncdesc}
4540
Guido van Rossum4a944d71997-08-14 20:35:38 +00004541\begin{cfuncdesc}{void}{PyEval_AcquireLock}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004542Acquire the global interpreter lock. The lock must have been created
4543earlier. If this thread already has the lock, a deadlock ensues.
4544This function is not available when thread support is disabled at
4545compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004546\end{cfuncdesc}
4547
4548\begin{cfuncdesc}{void}{PyEval_ReleaseLock}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004549Release the global interpreter lock. The lock must have been created
4550earlier. This function is not available when thread support is
Fred Drakee058b4f1998-02-16 06:15:35 +00004551disabled at compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004552\end{cfuncdesc}
4553
4554\begin{cfuncdesc}{void}{PyEval_AcquireThread}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004555Acquire the global interpreter lock and then set the current thread
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004556state to \var{tstate}, which should not be \NULL{}. The lock must
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004557have been created earlier. If this thread already has the lock,
4558deadlock ensues. This function is not available when thread support
Fred Drakee058b4f1998-02-16 06:15:35 +00004559is disabled at compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004560\end{cfuncdesc}
4561
4562\begin{cfuncdesc}{void}{PyEval_ReleaseThread}{PyThreadState *tstate}
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004563Reset the current thread state to \NULL{} and release the global
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004564interpreter lock. The lock must have been created earlier and must be
4565held by the current thread. The \var{tstate} argument, which must not
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004566be \NULL{}, is only used to check that it represents the current
Fred Drakee058b4f1998-02-16 06:15:35 +00004567thread state --- if it isn't, a fatal error is reported. This
4568function is not available when thread support is disabled at compile
4569time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004570\end{cfuncdesc}
4571
Fred Drakec6fa34e1998-04-02 06:47:24 +00004572\begin{cfuncdesc}{PyThreadState*}{PyEval_SaveThread}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004573Release the interpreter lock (if it has been created and thread
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004574support is enabled) and reset the thread state to \NULL{},
4575returning the previous thread state (which is not \NULL{}). If
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004576the lock has been created, the current thread must have acquired it.
4577(This function is available even when thread support is disabled at
4578compile time.)
Guido van Rossum4a944d71997-08-14 20:35:38 +00004579\end{cfuncdesc}
4580
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004581\begin{cfuncdesc}{void}{PyEval_RestoreThread}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004582Acquire the interpreter lock (if it has been created and thread
4583support is enabled) and set the thread state to \var{tstate}, which
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004584must not be \NULL{}. If the lock has been created, the current
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004585thread must not have acquired it, otherwise deadlock ensues. (This
4586function is available even when thread support is disabled at compile
4587time.)
Guido van Rossum4a944d71997-08-14 20:35:38 +00004588\end{cfuncdesc}
4589
Fred Drake659ebfa2000-04-03 15:42:13 +00004590The following macros are normally used without a trailing semicolon;
4591look for example usage in the Python source distribution.
4592
4593\begin{csimplemacrodesc}{Py_BEGIN_ALLOW_THREADS}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004594This macro expands to
Fred Drakee058b4f1998-02-16 06:15:35 +00004595\samp{\{ PyThreadState *_save; _save = PyEval_SaveThread();}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004596Note that it contains an opening brace; it must be matched with a
4597following \code{Py_END_ALLOW_THREADS} macro. See above for further
4598discussion of this macro. It is a no-op when thread support is
4599disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004600\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004601
Fred Drake659ebfa2000-04-03 15:42:13 +00004602\begin{csimplemacrodesc}{Py_END_ALLOW_THREADS}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004603This macro expands to
Fred Drakee058b4f1998-02-16 06:15:35 +00004604\samp{PyEval_RestoreThread(_save); \}}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004605Note that it contains a closing brace; it must be matched with an
4606earlier \code{Py_BEGIN_ALLOW_THREADS} macro. See above for further
4607discussion of this macro. It is a no-op when thread support is
4608disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004609\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004610
Thomas Wouterse30ac572001-07-09 14:35:01 +00004611\begin{csimplemacrodesc}{Py_BLOCK_THREADS}
Fred Drakebab29652001-07-10 16:10:08 +00004612This macro expands to \samp{PyEval_RestoreThread(_save);}: it
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004613is equivalent to \code{Py_END_ALLOW_THREADS} without the closing
4614brace. It is a no-op when thread support is disabled at compile
4615time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004616\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004617
Thomas Wouterse30ac572001-07-09 14:35:01 +00004618\begin{csimplemacrodesc}{Py_UNBLOCK_THREADS}
Fred Drakebab29652001-07-10 16:10:08 +00004619This macro expands to \samp{_save = PyEval_SaveThread();}: it is
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004620equivalent to \code{Py_BEGIN_ALLOW_THREADS} without the opening brace
4621and variable declaration. It is a no-op when thread support is
4622disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004623\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004624
4625All of the following functions are only available when thread support
4626is enabled at compile time, and must be called only when the
Fred Drake9d20ac31998-02-16 15:27:08 +00004627interpreter lock has been created.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004628
Fred Drakec6fa34e1998-04-02 06:47:24 +00004629\begin{cfuncdesc}{PyInterpreterState*}{PyInterpreterState_New}{}
Guido van Rossumed9dcc11998-08-07 18:28:03 +00004630Create a new interpreter state object. The interpreter lock need not
4631be held, but may be held if it is necessary to serialize calls to this
4632function.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004633\end{cfuncdesc}
4634
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004635\begin{cfuncdesc}{void}{PyInterpreterState_Clear}{PyInterpreterState *interp}
4636Reset all information in an interpreter state object. The interpreter
4637lock must be held.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004638\end{cfuncdesc}
4639
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004640\begin{cfuncdesc}{void}{PyInterpreterState_Delete}{PyInterpreterState *interp}
4641Destroy an interpreter state object. The interpreter lock need not be
4642held. The interpreter state must have been reset with a previous
Fred Drakee058b4f1998-02-16 06:15:35 +00004643call to \cfunction{PyInterpreterState_Clear()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004644\end{cfuncdesc}
4645
Fred Drakec6fa34e1998-04-02 06:47:24 +00004646\begin{cfuncdesc}{PyThreadState*}{PyThreadState_New}{PyInterpreterState *interp}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004647Create a new thread state object belonging to the given interpreter
Guido van Rossumed9dcc11998-08-07 18:28:03 +00004648object. The interpreter lock need not be held, but may be held if it
4649is necessary to serialize calls to this function.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004650\end{cfuncdesc}
4651
4652\begin{cfuncdesc}{void}{PyThreadState_Clear}{PyThreadState *tstate}
4653Reset all information in a thread state object. The interpreter lock
4654must be held.
4655\end{cfuncdesc}
4656
4657\begin{cfuncdesc}{void}{PyThreadState_Delete}{PyThreadState *tstate}
4658Destroy a thread state object. The interpreter lock need not be
4659held. The thread state must have been reset with a previous
Fred Drakee058b4f1998-02-16 06:15:35 +00004660call to \cfunction{PyThreadState_Clear()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004661\end{cfuncdesc}
4662
Fred Drakec6fa34e1998-04-02 06:47:24 +00004663\begin{cfuncdesc}{PyThreadState*}{PyThreadState_Get}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004664Return the current thread state. The interpreter lock must be held.
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004665When the current thread state is \NULL{}, this issues a fatal
Guido van Rossum5b8a5231997-12-30 04:38:44 +00004666error (so that the caller needn't check for \NULL{}).
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004667\end{cfuncdesc}
4668
Fred Drakec6fa34e1998-04-02 06:47:24 +00004669\begin{cfuncdesc}{PyThreadState*}{PyThreadState_Swap}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004670Swap the current thread state with the thread state given by the
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004671argument \var{tstate}, which may be \NULL{}. The interpreter lock
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004672must be held.
4673\end{cfuncdesc}
4674
Fred Drake24e62192001-05-21 15:56:55 +00004675\begin{cfuncdesc}{PyObject*}{PyThreadState_GetDict}{}
4676Return a dictionary in which extensions can store thread-specific
4677state information. Each extension should use a unique key to use to
4678store state in the dictionary. If this function returns \NULL, an
4679exception has been raised and the caller should allow it to
4680propogate.
4681\end{cfuncdesc}
4682
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004683
Fred Drake659ebfa2000-04-03 15:42:13 +00004684\chapter{Memory Management \label{memory}}
4685\sectionauthor{Vladimir Marangozov}{Vladimir.Marangozov@inrialpes.fr}
4686
4687
4688\section{Overview \label{memoryOverview}}
4689
4690Memory management in Python involves a private heap containing all
4691Python objects and data structures. The management of this private
4692heap is ensured internally by the \emph{Python memory manager}. The
4693Python memory manager has different components which deal with various
4694dynamic storage management aspects, like sharing, segmentation,
4695preallocation or caching.
4696
4697At the lowest level, a raw memory allocator ensures that there is
4698enough room in the private heap for storing all Python-related data
4699by interacting with the memory manager of the operating system. On top
4700of the raw memory allocator, several object-specific allocators
4701operate on the same heap and implement distinct memory management
4702policies adapted to the peculiarities of every object type. For
4703example, integer objects are managed differently within the heap than
4704strings, tuples or dictionaries because integers imply different
4705storage requirements and speed/space tradeoffs. The Python memory
4706manager thus delegates some of the work to the object-specific
4707allocators, but ensures that the latter operate within the bounds of
4708the private heap.
4709
4710It is important to understand that the management of the Python heap
4711is performed by the interpreter itself and that the user has no
4712control on it, even if she regularly manipulates object pointers to
4713memory blocks inside that heap. The allocation of heap space for
4714Python objects and other internal buffers is performed on demand by
4715the Python memory manager through the Python/C API functions listed in
4716this document.
4717
4718To avoid memory corruption, extension writers should never try to
4719operate on Python objects with the functions exported by the C
4720library: \cfunction{malloc()}\ttindex{malloc()},
4721\cfunction{calloc()}\ttindex{calloc()},
4722\cfunction{realloc()}\ttindex{realloc()} and
4723\cfunction{free()}\ttindex{free()}. This will result in
4724mixed calls between the C allocator and the Python memory manager
4725with fatal consequences, because they implement different algorithms
4726and operate on different heaps. However, one may safely allocate and
4727release memory blocks with the C library allocator for individual
4728purposes, as shown in the following example:
4729
4730\begin{verbatim}
4731 PyObject *res;
4732 char *buf = (char *) malloc(BUFSIZ); /* for I/O */
4733
4734 if (buf == NULL)
4735 return PyErr_NoMemory();
4736 ...Do some I/O operation involving buf...
4737 res = PyString_FromString(buf);
4738 free(buf); /* malloc'ed */
4739 return res;
4740\end{verbatim}
4741
4742In this example, the memory request for the I/O buffer is handled by
4743the C library allocator. The Python memory manager is involved only
4744in the allocation of the string object returned as a result.
4745
4746In most situations, however, it is recommended to allocate memory from
4747the Python heap specifically because the latter is under control of
4748the Python memory manager. For example, this is required when the
4749interpreter is extended with new object types written in C. Another
4750reason for using the Python heap is the desire to \emph{inform} the
4751Python memory manager about the memory needs of the extension module.
4752Even when the requested memory is used exclusively for internal,
4753highly-specific purposes, delegating all memory requests to the Python
4754memory manager causes the interpreter to have a more accurate image of
4755its memory footprint as a whole. Consequently, under certain
4756circumstances, the Python memory manager may or may not trigger
4757appropriate actions, like garbage collection, memory compaction or
4758other preventive procedures. Note that by using the C library
4759allocator as shown in the previous example, the allocated memory for
4760the I/O buffer escapes completely the Python memory manager.
4761
4762
4763\section{Memory Interface \label{memoryInterface}}
4764
4765The following function sets, modeled after the ANSI C standard, are
4766available for allocating and releasing memory from the Python heap:
4767
4768
Fred Drake7d45d342000-08-11 17:07:32 +00004769\begin{cfuncdesc}{void*}{PyMem_Malloc}{size_t n}
4770Allocates \var{n} bytes and returns a pointer of type \ctype{void*} to
Fred Drakebab29652001-07-10 16:10:08 +00004771the allocated memory, or \NULL{} if the request fails. Requesting zero
Fred Drake659ebfa2000-04-03 15:42:13 +00004772bytes returns a non-\NULL{} pointer.
Fred Drakebab29652001-07-10 16:10:08 +00004773The memory will not have been initialized in any way.
Fred Drake659ebfa2000-04-03 15:42:13 +00004774\end{cfuncdesc}
4775
Fred Drake7d45d342000-08-11 17:07:32 +00004776\begin{cfuncdesc}{void*}{PyMem_Realloc}{void *p, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004777Resizes the memory block pointed to by \var{p} to \var{n} bytes. The
4778contents will be unchanged to the minimum of the old and the new
4779sizes. If \var{p} is \NULL{}, the call is equivalent to
Fred Drakebab29652001-07-10 16:10:08 +00004780\cfunction{PyMem_Malloc(\var{n})}; if \var{n} is equal to zero, the
4781memory block is resized but is not freed, and the returned pointer is
4782non-\NULL{}. Unless \var{p} is \NULL{}, it must have been returned by
4783a previous call to \cfunction{PyMem_Malloc()} or
4784\cfunction{PyMem_Realloc()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004785\end{cfuncdesc}
4786
Fred Drake7d45d342000-08-11 17:07:32 +00004787\begin{cfuncdesc}{void}{PyMem_Free}{void *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00004788Frees the memory block pointed to by \var{p}, which must have been
4789returned by a previous call to \cfunction{PyMem_Malloc()} or
4790\cfunction{PyMem_Realloc()}. Otherwise, or if
4791\cfunction{PyMem_Free(p)} has been called before, undefined behaviour
4792occurs. If \var{p} is \NULL{}, no operation is performed.
4793\end{cfuncdesc}
4794
Fred Drake659ebfa2000-04-03 15:42:13 +00004795The following type-oriented macros are provided for convenience. Note
4796that \var{TYPE} refers to any C type.
4797
Fred Drakef913e542000-09-12 20:17:17 +00004798\begin{cfuncdesc}{\var{TYPE}*}{PyMem_New}{TYPE, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004799Same as \cfunction{PyMem_Malloc()}, but allocates \code{(\var{n} *
4800sizeof(\var{TYPE}))} bytes of memory. Returns a pointer cast to
4801\ctype{\var{TYPE}*}.
Fred Drakebab29652001-07-10 16:10:08 +00004802The memory will not have been initialized in any way.
Fred Drake659ebfa2000-04-03 15:42:13 +00004803\end{cfuncdesc}
4804
Fred Drakef913e542000-09-12 20:17:17 +00004805\begin{cfuncdesc}{\var{TYPE}*}{PyMem_Resize}{void *p, TYPE, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004806Same as \cfunction{PyMem_Realloc()}, but the memory block is resized
4807to \code{(\var{n} * sizeof(\var{TYPE}))} bytes. Returns a pointer
4808cast to \ctype{\var{TYPE}*}.
4809\end{cfuncdesc}
4810
Fred Drakef913e542000-09-12 20:17:17 +00004811\begin{cfuncdesc}{void}{PyMem_Del}{void *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00004812Same as \cfunction{PyMem_Free()}.
4813\end{cfuncdesc}
4814
Fred Drakef913e542000-09-12 20:17:17 +00004815In addition, the following macro sets are provided for calling the
4816Python memory allocator directly, without involving the C API functions
4817listed above. However, note that their use does not preserve binary
4818compatibility accross Python versions and is therefore deprecated in
4819extension modules.
4820
4821\cfunction{PyMem_MALLOC()}, \cfunction{PyMem_REALLOC()}, \cfunction{PyMem_FREE()}.
4822
4823\cfunction{PyMem_NEW()}, \cfunction{PyMem_RESIZE()}, \cfunction{PyMem_DEL()}.
4824
Fred Drake659ebfa2000-04-03 15:42:13 +00004825
4826\section{Examples \label{memoryExamples}}
4827
4828Here is the example from section \ref{memoryOverview}, rewritten so
4829that the I/O buffer is allocated from the Python heap by using the
4830first function set:
4831
4832\begin{verbatim}
4833 PyObject *res;
4834 char *buf = (char *) PyMem_Malloc(BUFSIZ); /* for I/O */
4835
4836 if (buf == NULL)
4837 return PyErr_NoMemory();
4838 /* ...Do some I/O operation involving buf... */
4839 res = PyString_FromString(buf);
4840 PyMem_Free(buf); /* allocated with PyMem_Malloc */
4841 return res;
4842\end{verbatim}
4843
Fred Drakef913e542000-09-12 20:17:17 +00004844The same code using the type-oriented function set:
Fred Drake659ebfa2000-04-03 15:42:13 +00004845
4846\begin{verbatim}
4847 PyObject *res;
Fred Drakef913e542000-09-12 20:17:17 +00004848 char *buf = PyMem_New(char, BUFSIZ); /* for I/O */
Fred Drake659ebfa2000-04-03 15:42:13 +00004849
4850 if (buf == NULL)
4851 return PyErr_NoMemory();
4852 /* ...Do some I/O operation involving buf... */
4853 res = PyString_FromString(buf);
Fred Drakef913e542000-09-12 20:17:17 +00004854 PyMem_Del(buf); /* allocated with PyMem_New */
Fred Drake659ebfa2000-04-03 15:42:13 +00004855 return res;
4856\end{verbatim}
4857
Fred Drakef913e542000-09-12 20:17:17 +00004858Note that in the two examples above, the buffer is always
4859manipulated via functions belonging to the same set. Indeed, it
Fred Drake659ebfa2000-04-03 15:42:13 +00004860is required to use the same memory API family for a given
4861memory block, so that the risk of mixing different allocators is
4862reduced to a minimum. The following code sequence contains two errors,
4863one of which is labeled as \emph{fatal} because it mixes two different
4864allocators operating on different heaps.
4865
4866\begin{verbatim}
Fred Drakef913e542000-09-12 20:17:17 +00004867char *buf1 = PyMem_New(char, BUFSIZ);
Fred Drake659ebfa2000-04-03 15:42:13 +00004868char *buf2 = (char *) malloc(BUFSIZ);
4869char *buf3 = (char *) PyMem_Malloc(BUFSIZ);
4870...
Fred Drakef913e542000-09-12 20:17:17 +00004871PyMem_Del(buf3); /* Wrong -- should be PyMem_Free() */
Fred Drake659ebfa2000-04-03 15:42:13 +00004872free(buf2); /* Right -- allocated via malloc() */
Fred Drakef913e542000-09-12 20:17:17 +00004873free(buf1); /* Fatal -- should be PyMem_Del() */
Fred Drake659ebfa2000-04-03 15:42:13 +00004874\end{verbatim}
4875
4876In addition to the functions aimed at handling raw memory blocks from
4877the Python heap, objects in Python are allocated and released with
Fred Drakef913e542000-09-12 20:17:17 +00004878\cfunction{PyObject_New()}, \cfunction{PyObject_NewVar()} and
4879\cfunction{PyObject_Del()}, or with their corresponding macros
4880\cfunction{PyObject_NEW()}, \cfunction{PyObject_NEW_VAR()} and
Fred Drakee06f0f92000-06-30 15:52:39 +00004881\cfunction{PyObject_DEL()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004882
Fred Drakee06f0f92000-06-30 15:52:39 +00004883These will be explained in the next chapter on defining and
4884implementing new object types in C.
Fred Drake659ebfa2000-04-03 15:42:13 +00004885
4886
Fred Drakeefd146c1999-02-15 15:30:45 +00004887\chapter{Defining New Object Types \label{newTypes}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004888
Fred Drakec6fa34e1998-04-02 06:47:24 +00004889\begin{cfuncdesc}{PyObject*}{_PyObject_New}{PyTypeObject *type}
Fred Drakee058b4f1998-02-16 06:15:35 +00004890\end{cfuncdesc}
4891
Fred Drakef913e542000-09-12 20:17:17 +00004892\begin{cfuncdesc}{PyVarObject*}{_PyObject_NewVar}{PyTypeObject *type, int size}
Fred Drakee058b4f1998-02-16 06:15:35 +00004893\end{cfuncdesc}
4894
Fred Drakef913e542000-09-12 20:17:17 +00004895\begin{cfuncdesc}{void}{_PyObject_Del}{PyObject *op}
Fred Drakee058b4f1998-02-16 06:15:35 +00004896\end{cfuncdesc}
4897
Fred Drakef913e542000-09-12 20:17:17 +00004898\begin{cfuncdesc}{PyObject*}{PyObject_Init}{PyObject *op,
Fred Drakebab29652001-07-10 16:10:08 +00004899 PyTypeObject *type}
4900 Initialize a newly-allocated object \var{op} with its type and
4901 initial reference. Returns the initialized object. If \var{type}
4902 indicates that the object participates in the cyclic garbage
4903 detector, it it added to the detector's set of observed objects.
4904 Other fields of the object are not affected.
Fred Drakef913e542000-09-12 20:17:17 +00004905\end{cfuncdesc}
4906
4907\begin{cfuncdesc}{PyVarObject*}{PyObject_InitVar}{PyVarObject *op,
Fred Drakebab29652001-07-10 16:10:08 +00004908 PyTypeObject *type, int size}
4909 This does everything \cfunction{PyObject_Init()} does, and also
4910 initializes the length information for a variable-size object.
Fred Drakef913e542000-09-12 20:17:17 +00004911\end{cfuncdesc}
4912
4913\begin{cfuncdesc}{\var{TYPE}*}{PyObject_New}{TYPE, PyTypeObject *type}
Fred Drakebab29652001-07-10 16:10:08 +00004914 Allocate a new Python object using the C structure type \var{TYPE}
4915 and the Python type object \var{type}. Fields not defined by the
4916 Python object header are not initialized; the object's reference
4917 count will be one. The size of the memory
4918 allocation is determined from the \member{tp_basicsize} field of the
4919 type object.
Fred Drakef913e542000-09-12 20:17:17 +00004920\end{cfuncdesc}
4921
4922\begin{cfuncdesc}{\var{TYPE}*}{PyObject_NewVar}{TYPE, PyTypeObject *type,
4923 int size}
Fred Drakebab29652001-07-10 16:10:08 +00004924 Allocate a new Python object using the C structure type \var{TYPE}
4925 and the Python type object \var{type}. Fields not defined by the
4926 Python object header are not initialized. The allocated memory
4927 allows for the \var{TYPE} structure plus \var{size} fields of the
4928 size given by the \member{tp_itemsize} field of \var{type}. This is
4929 useful for implementing objects like tuples, which are able to
4930 determine their size at construction time. Embedding the array of
4931 fields into the same allocation decreases the number of allocations,
4932 improving the memory management efficiency.
Fred Drakef913e542000-09-12 20:17:17 +00004933\end{cfuncdesc}
4934
4935\begin{cfuncdesc}{void}{PyObject_Del}{PyObject *op}
Fred Drakebab29652001-07-10 16:10:08 +00004936 Releases memory allocated to an object using
4937 \cfunction{PyObject_New()} or \cfunction{PyObject_NewVar()}. This
4938 is normally called from the \member{tp_dealloc} handler specified in
4939 the object's type. The fields of the object should not be accessed
4940 after this call as the memory is no longer a valid Python object.
Fred Drakef913e542000-09-12 20:17:17 +00004941\end{cfuncdesc}
4942
4943\begin{cfuncdesc}{\var{TYPE}*}{PyObject_NEW}{TYPE, PyTypeObject *type}
Fred Drakebab29652001-07-10 16:10:08 +00004944 Macro version of \cfunction{PyObject_New()}, to gain performance at
4945 the expense of safety. This does not check \var{type} for a \NULL{}
4946 value.
Fred Drakef913e542000-09-12 20:17:17 +00004947\end{cfuncdesc}
4948
4949\begin{cfuncdesc}{\var{TYPE}*}{PyObject_NEW_VAR}{TYPE, PyTypeObject *type,
4950 int size}
Fred Drakebab29652001-07-10 16:10:08 +00004951 Macro version of \cfunction{PyObject_NewVar()}, to gain performance
4952 at the expense of safety. This does not check \var{type} for a
4953 \NULL{} value.
Fred Drakef913e542000-09-12 20:17:17 +00004954\end{cfuncdesc}
4955
4956\begin{cfuncdesc}{void}{PyObject_DEL}{PyObject *op}
Fred Drakebab29652001-07-10 16:10:08 +00004957 Macro version of \cfunction{PyObject_Del()}.
Fred Drakee058b4f1998-02-16 06:15:35 +00004958\end{cfuncdesc}
4959
Fred Drakeee814bf2000-11-28 22:34:32 +00004960\begin{cfuncdesc}{PyObject*}{Py_InitModule}{char *name,
4961 PyMethodDef *methods}
4962 Create a new module object based on a name and table of functions,
4963 returning the new module object.
4964\end{cfuncdesc}
4965
4966\begin{cfuncdesc}{PyObject*}{Py_InitModule3}{char *name,
4967 PyMethodDef *methods,
4968 char *doc}
4969 Create a new module object based on a name and table of functions,
4970 returning the new module object. If \var{doc} is non-\NULL, it will
4971 be used to define the docstring for the module.
4972\end{cfuncdesc}
4973
4974\begin{cfuncdesc}{PyObject*}{Py_InitModule4}{char *name,
4975 PyMethodDef *methods,
4976 char *doc, PyObject *self,
4977 int apiver}
4978 Create a new module object based on a name and table of functions,
4979 returning the new module object. If \var{doc} is non-\NULL, it will
4980 be used to define the docstring for the module. If \var{self} is
4981 non-\NULL, it will passed to the functions of the module as their
4982 (otherwise \NULL) first parameter. (This was added as an
4983 experimental feature, and there are no known uses in the current
4984 version of Python.) For \var{apiver}, the only value which should
4985 be passed is defined by the constant \constant{PYTHON_API_VERSION}.
4986
4987 \strong{Note:} Most uses of this function should probably be using
4988 the \cfunction{Py_InitModule3()} instead; only use this if you are
4989 sure you need it.
4990\end{cfuncdesc}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00004991
Fred Drakebab29652001-07-10 16:10:08 +00004992\begin{cfuncdesc}{int}{PyArg_ParseTuple}{PyObject *args, char *format,
4993 \moreargs}
4994 Parse the parameters of a function that takes only positional
4995 parameters into local variables. See
4996 \citetitle[../ext/parseTuple.html]{Extending and Embedding the
4997 Python Interpreter} for more information.
4998\end{cfuncdesc}
4999
5000\begin{cfuncdesc}{int}{PyArg_ParseTupleAndKeywords}{PyObject *args,
5001 PyObject *kw, char *format, char *keywords[], \moreargs}
5002 Parse the parameters of a function that takes both positional and
5003 keyword parameters into local variables. See
5004 \citetitle[../ext/parseTupleAndKeywords.html]{Extending and
5005 Embedding the Python Interpreter} for more information.
5006\end{cfuncdesc}
5007
5008\begin{cfuncdesc}{int}{PyArg_Parse}{PyObject *args, char *format, \moreargs}
5009 Function used to deconstruct the argument lists of ``old-style''
5010 functions --- these are functions which use the
5011 \constant{METH_OLDARGS} parameter parsing method. This is not
5012 recommended for new code, and most code in the standard interpreter
5013 has been modified to no longer use this.
5014\end{cfuncdesc}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00005015
5016Py_BuildValue
Guido van Rossumae110af1997-05-22 20:11:52 +00005017
Fred Drake659ebfa2000-04-03 15:42:13 +00005018DL_IMPORT
5019
Fred Drakebab29652001-07-10 16:10:08 +00005020\begin{cvardesc}{PyObject}{_Py_NoneStruct}
5021 Object which is visible in Python as \code{None}. This should only
5022 be accessed using the \code{Py_None} macro, which evaluates to a
5023 pointer to this object.
5024\end{cvardesc}
Fred Drake659ebfa2000-04-03 15:42:13 +00005025
5026
5027\section{Common Object Structures \label{common-structs}}
5028
Guido van Rossumae110af1997-05-22 20:11:52 +00005029PyObject, PyVarObject
5030
5031PyObject_HEAD, PyObject_HEAD_INIT, PyObject_VAR_HEAD
5032
5033Typedefs:
5034unaryfunc, binaryfunc, ternaryfunc, inquiry, coercion, intargfunc,
5035intintargfunc, intobjargproc, intintobjargproc, objobjargproc,
Guido van Rossumae110af1997-05-22 20:11:52 +00005036destructor, printfunc, getattrfunc, getattrofunc, setattrfunc,
5037setattrofunc, cmpfunc, reprfunc, hashfunc
5038
Fred Drakea8455ab2000-06-16 19:58:42 +00005039\begin{ctypedesc}{PyCFunction}
5040Type of the functions used to implement most Python callables in C.
5041\end{ctypedesc}
5042
5043\begin{ctypedesc}{PyMethodDef}
5044Structure used to describe a method of an extension type. This
5045structure has four fields:
5046
5047\begin{tableiii}{l|l|l}{member}{Field}{C Type}{Meaning}
5048 \lineiii{ml_name}{char *}{name of the method}
5049 \lineiii{ml_meth}{PyCFunction}{pointer to the C implementation}
5050 \lineiii{ml_flags}{int}{flag bits indicating how the call should be
5051 constructed}
5052 \lineiii{ml_doc}{char *}{points to the contents of the docstring}
5053\end{tableiii}
5054\end{ctypedesc}
5055
5056\begin{cfuncdesc}{PyObject*}{Py_FindMethod}{PyMethodDef[] table,
5057 PyObject *ob, char *name}
5058Return a bound method object for an extension type implemented in C.
5059This function also handles the special attribute \member{__methods__},
5060returning a list of all the method names defined in \var{table}.
5061\end{cfuncdesc}
5062
Fred Drake659ebfa2000-04-03 15:42:13 +00005063
5064\section{Mapping Object Structures \label{mapping-structs}}
5065
5066\begin{ctypedesc}{PyMappingMethods}
5067Structure used to hold pointers to the functions used to implement the
5068mapping protocol for an extension type.
5069\end{ctypedesc}
5070
5071
5072\section{Number Object Structures \label{number-structs}}
5073
5074\begin{ctypedesc}{PyNumberMethods}
5075Structure used to hold pointers to the functions an extension type
5076uses to implement the number protocol.
5077\end{ctypedesc}
5078
5079
5080\section{Sequence Object Structures \label{sequence-structs}}
5081
5082\begin{ctypedesc}{PySequenceMethods}
5083Structure used to hold pointers to the functions which an object uses
5084to implement the sequence protocol.
5085\end{ctypedesc}
5086
5087
5088\section{Buffer Object Structures \label{buffer-structs}}
5089\sectionauthor{Greg J. Stein}{greg@lyra.org}
5090
5091The buffer interface exports a model where an object can expose its
5092internal data as a set of chunks of data, where each chunk is
5093specified as a pointer/length pair. These chunks are called
5094\dfn{segments} and are presumed to be non-contiguous in memory.
5095
5096If an object does not export the buffer interface, then its
5097\member{tp_as_buffer} member in the \ctype{PyTypeObject} structure
5098should be \NULL{}. Otherwise, the \member{tp_as_buffer} will point to
5099a \ctype{PyBufferProcs} structure.
5100
5101\strong{Note:} It is very important that your
Fred Drakec392b572001-03-21 22:15:01 +00005102\ctype{PyTypeObject} structure uses \constant{Py_TPFLAGS_DEFAULT} for
5103the value of the \member{tp_flags} member rather than \code{0}. This
Fred Drake659ebfa2000-04-03 15:42:13 +00005104tells the Python runtime that your \ctype{PyBufferProcs} structure
5105contains the \member{bf_getcharbuffer} slot. Older versions of Python
5106did not have this member, so a new Python interpreter using an old
5107extension needs to be able to test for its presence before using it.
5108
5109\begin{ctypedesc}{PyBufferProcs}
5110Structure used to hold the function pointers which define an
5111implementation of the buffer protocol.
5112
5113The first slot is \member{bf_getreadbuffer}, of type
5114\ctype{getreadbufferproc}. If this slot is \NULL{}, then the object
5115does not support reading from the internal data. This is
5116non-sensical, so implementors should fill this in, but callers should
5117test that the slot contains a non-\NULL{} value.
5118
5119The next slot is \member{bf_getwritebuffer} having type
5120\ctype{getwritebufferproc}. This slot may be \NULL{} if the object
5121does not allow writing into its returned buffers.
5122
5123The third slot is \member{bf_getsegcount}, with type
5124\ctype{getsegcountproc}. This slot must not be \NULL{} and is used to
5125inform the caller how many segments the object contains. Simple
5126objects such as \ctype{PyString_Type} and
5127\ctype{PyBuffer_Type} objects contain a single segment.
5128
5129The last slot is \member{bf_getcharbuffer}, of type
5130\ctype{getcharbufferproc}. This slot will only be present if the
Fred Drakec392b572001-03-21 22:15:01 +00005131\constant{Py_TPFLAGS_HAVE_GETCHARBUFFER} flag is present in the
Fred Drake659ebfa2000-04-03 15:42:13 +00005132\member{tp_flags} field of the object's \ctype{PyTypeObject}. Before using
5133this slot, the caller should test whether it is present by using the
5134\cfunction{PyType_HasFeature()}\ttindex{PyType_HasFeature()} function.
5135If present, it may be \NULL, indicating that the object's contents
5136cannot be used as \emph{8-bit characters}.
5137The slot function may also raise an error if the object's contents
5138cannot be interpreted as 8-bit characters. For example, if the object
5139is an array which is configured to hold floating point values, an
5140exception may be raised if a caller attempts to use
5141\member{bf_getcharbuffer} to fetch a sequence of 8-bit characters.
5142This notion of exporting the internal buffers as ``text'' is used to
5143distinguish between objects that are binary in nature, and those which
5144have character-based content.
5145
5146\strong{Note:} The current policy seems to state that these characters
5147may be multi-byte characters. This implies that a buffer size of
5148\var{N} does not mean there are \var{N} characters present.
5149\end{ctypedesc}
5150
5151\begin{datadesc}{Py_TPFLAGS_HAVE_GETCHARBUFFER}
5152Flag bit set in the type structure to indicate that the
5153\member{bf_getcharbuffer} slot is known. This being set does not
5154indicate that the object supports the buffer interface or that the
5155\member{bf_getcharbuffer} slot is non-\NULL.
5156\end{datadesc}
5157
5158\begin{ctypedesc}[getreadbufferproc]{int (*getreadbufferproc)
5159 (PyObject *self, int segment, void **ptrptr)}
5160Return a pointer to a readable segment of the buffer. This function
5161is allowed to raise an exception, in which case it must return
5162\code{-1}. The \var{segment} which is passed must be zero or
5163positive, and strictly less than the number of segments returned by
Greg Stein4d4d0032001-04-07 16:14:49 +00005164the \member{bf_getsegcount} slot function. On success, it returns the
5165length of the buffer memory, and sets \code{*\var{ptrptr}} to a
5166pointer to that memory.
Fred Drake659ebfa2000-04-03 15:42:13 +00005167\end{ctypedesc}
5168
5169\begin{ctypedesc}[getwritebufferproc]{int (*getwritebufferproc)
5170 (PyObject *self, int segment, void **ptrptr)}
Greg Stein4d4d0032001-04-07 16:14:49 +00005171Return a pointer to a writable memory buffer in \code{*\var{ptrptr}},
5172and the length of that segment as the function return value.
5173The memory buffer must correspond to buffer segment \var{segment}.
Fred Drake58c5a2a1999-08-04 13:13:24 +00005174Must return \code{-1} and set an exception on error.
5175\exception{TypeError} should be raised if the object only supports
5176read-only buffers, and \exception{SystemError} should be raised when
5177\var{segment} specifies a segment that doesn't exist.
5178% Why doesn't it raise ValueError for this one?
Fred Drake659ebfa2000-04-03 15:42:13 +00005179% GJS: because you shouldn't be calling it with an invalid
5180% segment. That indicates a blatant programming error in the C
5181% code.
Fred Drake58c5a2a1999-08-04 13:13:24 +00005182\end{ctypedesc}
5183
Fred Drake659ebfa2000-04-03 15:42:13 +00005184\begin{ctypedesc}[getsegcountproc]{int (*getsegcountproc)
5185 (PyObject *self, int *lenp)}
5186Return the number of memory segments which comprise the buffer. If
5187\var{lenp} is not \NULL, the implementation must report the sum of the
5188sizes (in bytes) of all segments in \code{*\var{lenp}}.
5189The function cannot fail.
5190\end{ctypedesc}
Guido van Rossumae110af1997-05-22 20:11:52 +00005191
Fred Drake659ebfa2000-04-03 15:42:13 +00005192\begin{ctypedesc}[getcharbufferproc]{int (*getcharbufferproc)
5193 (PyObject *self, int segment, const char **ptrptr)}
5194\end{ctypedesc}
Guido van Rossumae110af1997-05-22 20:11:52 +00005195
Guido van Rossumae110af1997-05-22 20:11:52 +00005196
Fred Drakec392b572001-03-21 22:15:01 +00005197\section{Supporting Cyclic Garbarge Collection
5198 \label{supporting-cycle-detection}}
5199
5200Python's support for detecting and collecting garbage which involves
5201circular references requires support from object types which are
5202``containers'' for other objects which may also be containers. Types
5203which do not store references to other objects, or which only store
5204references to atomic types (such as numbers or strings), do not need
5205to provide any explicit support for garbage collection.
5206
5207To create a container type, the \member{tp_flags} field of the type
5208object must include the \constant{Py_TPFLAGS_GC} and provide an
Fred Drakee28d8ae2001-03-22 16:30:17 +00005209implementation of the \member{tp_traverse} handler. The computed
5210value of the \member{tp_basicsize} field must include
5211\constant{PyGC_HEAD_SIZE} as well. If instances of the type are
5212mutable, a \member{tp_clear} implementation must also be provided.
Fred Drakec392b572001-03-21 22:15:01 +00005213
5214\begin{datadesc}{Py_TPFLAGS_GC}
5215 Objects with a type with this flag set must conform with the rules
5216 documented here. For convenience these objects will be referred to
5217 as container objects.
5218\end{datadesc}
5219
5220\begin{datadesc}{PyGC_HEAD_SIZE}
5221 Extra memory needed for the garbage collector. Container objects
5222 must include this in the calculation of their tp_basicsize. If the
5223 collector is disabled at compile time then this is \code{0}.
5224\end{datadesc}
5225
Fred Drakee28d8ae2001-03-22 16:30:17 +00005226Constructors for container types must conform to two rules:
5227
5228\begin{enumerate}
5229\item The memory for the object must be allocated using
5230 \cfunction{PyObject_New()} or \cfunction{PyObject_VarNew()}.
5231
5232\item Once all the fields which may contain references to other
5233 containers are initialized, it must call
5234 \cfunction{PyObject_GC_Init()}.
5235\end{enumerate}
5236
Fred Drakec392b572001-03-21 22:15:01 +00005237\begin{cfuncdesc}{void}{PyObject_GC_Init}{PyObject *op}
5238 Adds the object \var{op} to the set of container objects tracked by
5239 the collector. The collector can run at unexpected times so objects
5240 must be valid while being tracked. This should be called once all
5241 the fields followed by the \member{tp_traverse} handler become valid,
5242 usually near the end of the constructor.
5243\end{cfuncdesc}
5244
Fred Drakee28d8ae2001-03-22 16:30:17 +00005245Similarly, the deallocator for the object must conform to a similar
5246pair of rules:
5247
5248\begin{enumerate}
5249\item Before fields which refer to other containers are invalidated,
5250 \cfunction{PyObject_GC_Fini()} must be called.
5251
5252\item The object's memory must be deallocated using
5253 \cfunction{PyObject_Del()}.
5254\end{enumerate}
5255
Fred Drakec392b572001-03-21 22:15:01 +00005256\begin{cfuncdesc}{void}{PyObject_GC_Fini}{PyObject *op}
5257 Remove the object \var{op} from the set of container objects tracked
5258 by the collector. Note that \cfunction{PyObject_GC_Init()} can be
5259 called again on this object to add it back to the set of tracked
5260 objects. The deallocator (\member{tp_dealloc} handler) should call
5261 this for the object before any of the fields used by the
5262 \member{tp_traverse} handler become invalid.
Fred Drake8f6df462001-03-23 17:42:09 +00005263
5264 \strong{Note:} Any container which may be referenced from another
5265 object reachable by the collector must itself be tracked by the
5266 collector, so it is generally not safe to call this function
5267 anywhere but in the object's deallocator.
Fred Drakec392b572001-03-21 22:15:01 +00005268\end{cfuncdesc}
5269
5270The \member{tp_traverse} handler accepts a function parameter of this
5271type:
5272
5273\begin{ctypedesc}[visitproc]{int (*visitproc)(PyObject *object, void *arg)}
5274 Type of the visitor function passed to the \member{tp_traverse}
5275 handler. The function should be called with an object to traverse
5276 as \var{object} and the third parameter to the \member{tp_traverse}
5277 handler as \var{arg}.
5278\end{ctypedesc}
5279
5280The \member{tp_traverse} handler must have the following type:
5281
5282\begin{ctypedesc}[traverseproc]{int (*traverseproc)(PyObject *self,
5283 visitproc visit, void *arg)}
5284 Traversal function for a container object. Implementations must
5285 call the \var{visit} function for each object directly contained by
5286 \var{self}, with the parameters to \var{visit} being the contained
5287 object and the \var{arg} value passed to the handler. If
5288 \var{visit} returns a non-zero value then an error has occurred and
5289 that value should be returned immediately.
5290\end{ctypedesc}
5291
5292The \member{tp_clear} handler must be of the \ctype{inquiry} type, or
5293\NULL{} if the object is immutable.
5294
5295\begin{ctypedesc}[inquiry]{int (*inquiry)(PyObject *self)}
5296 Drop references that may have created reference cycles. Immutable
5297 objects do not have to define this method since they can never
5298 directly create reference cycles. Note that the object must still
Fred Drakebab29652001-07-10 16:10:08 +00005299 be valid after calling this method (don't just call
Fred Drakec392b572001-03-21 22:15:01 +00005300 \cfunction{Py_DECREF()} on a reference). The collector will call
5301 this method if it detects that this object is involved in a
5302 reference cycle.
5303\end{ctypedesc}
5304
5305
Fred Drakee28d8ae2001-03-22 16:30:17 +00005306\subsection{Example Cycle Collector Support
5307 \label{example-cycle-support}}
5308
5309This example shows only enough of the implementation of an extension
5310type to show how the garbage collector support needs to be added. It
5311shows the definition of the object structure, the
5312\member{tp_traverse}, \member{tp_clear} and \member{tp_dealloc}
5313implementations, the type structure, and a constructor --- the module
5314initialization needed to export the constructor to Python is not shown
5315as there are no special considerations there for the collector. To
5316make this interesting, assume that the module exposes ways for the
5317\member{container} field of the object to be modified. Note that
5318since no checks are made on the type of the object used to initialize
5319\member{container}, we have to assume that it may be a container.
5320
5321\begin{verbatim}
5322#include "Python.h"
5323
5324typedef struct {
5325 PyObject_HEAD
5326 PyObject *container;
5327} MyObject;
5328
5329static int
5330my_traverse(MyObject *self, visitproc visit, void *arg)
5331{
5332 if (self->container != NULL)
5333 return visit(self->container, arg);
5334 else
5335 return 0;
5336}
5337
5338static int
5339my_clear(MyObject *self)
5340{
5341 Py_XDECREF(self->container);
5342 self->container = NULL;
5343
5344 return 0;
5345}
5346
5347static void
5348my_dealloc(MyObject *self)
5349{
5350 PyObject_GC_Fini((PyObject *) self);
5351 Py_XDECREF(self->container);
5352 PyObject_Del(self);
5353}
5354\end{verbatim}
5355
5356\begin{verbatim}
5357statichere PyTypeObject
5358MyObject_Type = {
5359 PyObject_HEAD_INIT(NULL)
5360 0,
5361 "MyObject",
5362 sizeof(MyObject) + PyGC_HEAD_SIZE,
5363 0,
5364 (destructor)my_dealloc, /* tp_dealloc */
5365 0, /* tp_print */
5366 0, /* tp_getattr */
5367 0, /* tp_setattr */
5368 0, /* tp_compare */
5369 0, /* tp_repr */
5370 0, /* tp_as_number */
5371 0, /* tp_as_sequence */
5372 0, /* tp_as_mapping */
5373 0, /* tp_hash */
5374 0, /* tp_call */
5375 0, /* tp_str */
5376 0, /* tp_getattro */
5377 0, /* tp_setattro */
5378 0, /* tp_as_buffer */
5379 Py_TPFLAGS_DEFAULT | Py_TPFLAGS_GC,
5380 0, /* tp_doc */
5381 (traverseproc)my_traverse, /* tp_traverse */
5382 (inquiry)my_clear, /* tp_clear */
5383 0, /* tp_richcompare */
5384 0, /* tp_weaklistoffset */
5385};
5386
5387/* This constructor should be made accessible from Python. */
5388static PyObject *
5389new_object(PyObject *unused, PyObject *args)
5390{
5391 PyObject *container = NULL;
5392 MyObject *result = NULL;
5393
5394 if (PyArg_ParseTuple(args, "|O:new_object", &container)) {
5395 result = PyObject_New(MyObject, &MyObject_Type);
5396 if (result != NULL) {
5397 result->container = container;
5398 PyObject_GC_Init();
5399 }
5400 }
5401 return (PyObject *) result;
5402}
5403\end{verbatim}
5404
5405
Fred Drake659ebfa2000-04-03 15:42:13 +00005406% \chapter{Debugging \label{debugging}}
5407%
5408% XXX Explain Py_DEBUG, Py_TRACE_REFS, Py_REF_DEBUG.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00005409
5410
Fred Drakeed773ef2000-09-21 21:35:22 +00005411\appendix
5412\chapter{Reporting Bugs}
5413\input{reportingbugs}
5414
Fred Drake490d34d2001-06-20 21:39:12 +00005415\chapter{History and License}
5416\input{license}
5417
Marc-André Lemburga544ea22001-01-17 18:04:31 +00005418\input{api.ind} % Index -- must be last
Guido van Rossum9231c8f1997-05-15 21:43:21 +00005419
5420\end{document}