blob: 5d2820e31ce1f9bd93c9c3f1ac428eda5f2ddb79 [file] [log] [blame]
Fred Drake6659c301998-03-03 22:02:19 +00001\documentclass{manual}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003\title{Python/C API Reference Manual}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00004
5\input{boilerplate}
6
Marc-André Lemburga544ea22001-01-17 18:04:31 +00007\makeindex % tell \index to actually write the .idx file
Guido van Rossum9231c8f1997-05-15 21:43:21 +00008
9
10\begin{document}
11
Guido van Rossum9231c8f1997-05-15 21:43:21 +000012\maketitle
13
Fred Drake9f86b661998-07-28 21:55:19 +000014\ifhtml
15\chapter*{Front Matter\label{front}}
16\fi
17
Guido van Rossum9231c8f1997-05-15 21:43:21 +000018\input{copyright}
19
20\begin{abstract}
21
22\noindent
Fred Drake659ebfa2000-04-03 15:42:13 +000023This manual documents the API used by C and \Cpp{} programmers who
Fred Drakee058b4f1998-02-16 06:15:35 +000024want to write extension modules or embed Python. It is a companion to
Fred Drakebe486461999-11-09 17:03:03 +000025\citetitle[../ext/ext.html]{Extending and Embedding the Python
26Interpreter}, which describes the general principles of extension
27writing but does not document the API functions in detail.
Guido van Rossum9231c8f1997-05-15 21:43:21 +000028
Guido van Rossum5b8a5231997-12-30 04:38:44 +000029\strong{Warning:} The current version of this document is incomplete.
30I hope that it is nevertheless useful. I will continue to work on it,
31and release new versions from time to time, independent from Python
32source code releases.
33
Guido van Rossum9231c8f1997-05-15 21:43:21 +000034\end{abstract}
35
Fred Drake4d4f9e71998-01-13 22:25:02 +000036\tableofcontents
Guido van Rossum9231c8f1997-05-15 21:43:21 +000037
Guido van Rossum5060b3b1997-08-17 18:02:23 +000038% XXX Consider moving all this back to ext.tex and giving api.tex
39% XXX a *really* short intro only.
Guido van Rossum9231c8f1997-05-15 21:43:21 +000040
Fred Drakeefd146c1999-02-15 15:30:45 +000041\chapter{Introduction \label{intro}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +000042
Fred Drake659ebfa2000-04-03 15:42:13 +000043The Application Programmer's Interface to Python gives C and
44\Cpp{} programmers access to the Python interpreter at a variety of
45levels. The API is equally usable from \Cpp{}, but for brevity it is
46generally referred to as the Python/C API. There are two
47fundamentally different reasons for using the Python/C API. The first
48reason is to write \emph{extension modules} for specific purposes;
49these are C modules that extend the Python interpreter. This is
50probably the most common use. The second reason is to use Python as a
51component in a larger application; this technique is generally
52referred to as \dfn{embedding} Python in an application.
Guido van Rossum59a61351997-08-14 20:34:33 +000053
Guido van Rossum4a944d71997-08-14 20:35:38 +000054Writing an extension module is a relatively well-understood process,
55where a ``cookbook'' approach works well. There are several tools
56that automate the process to some extent. While people have embedded
57Python in other applications since its early existence, the process of
Fred Drakefc43d002001-05-21 15:03:35 +000058embedding Python is less straightforward than writing an extension.
Guido van Rossum59a61351997-08-14 20:34:33 +000059
Guido van Rossum4a944d71997-08-14 20:35:38 +000060Many API functions are useful independent of whether you're embedding
61or extending Python; moreover, most applications that embed Python
62will need to provide a custom extension as well, so it's probably a
63good idea to become familiar with writing an extension before
Guido van Rossum59a61351997-08-14 20:34:33 +000064attempting to embed Python in a real application.
65
Fred Drakeefd146c1999-02-15 15:30:45 +000066
67\section{Include Files \label{includes}}
Guido van Rossum580aa8d1997-11-25 15:34:51 +000068
69All function, type and macro definitions needed to use the Python/C
70API are included in your code by the following line:
71
Fred Drakee058b4f1998-02-16 06:15:35 +000072\begin{verbatim}
73#include "Python.h"
74\end{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +000075
Fred Drakee058b4f1998-02-16 06:15:35 +000076This implies inclusion of the following standard headers:
Fred Drake0b71cea2000-09-26 05:51:50 +000077\code{<stdio.h>}, \code{<string.h>}, \code{<errno.h>},
78\code{<limits.h>}, and \code{<stdlib.h>} (if available).
Guido van Rossum580aa8d1997-11-25 15:34:51 +000079
80All user visible names defined by Python.h (except those defined by
Fred Drakee058b4f1998-02-16 06:15:35 +000081the included standard headers) have one of the prefixes \samp{Py} or
Fred Drake659ebfa2000-04-03 15:42:13 +000082\samp{_Py}. Names beginning with \samp{_Py} are for internal use by
83the Python implementation and should not be used by extension writers.
84Structure member names do not have a reserved prefix.
Guido van Rossum580aa8d1997-11-25 15:34:51 +000085
Fred Drakee058b4f1998-02-16 06:15:35 +000086\strong{Important:} user code should never define names that begin
87with \samp{Py} or \samp{_Py}. This confuses the reader, and
88jeopardizes the portability of the user code to future Python
89versions, which may define additional names beginning with one of
90these prefixes.
Guido van Rossum580aa8d1997-11-25 15:34:51 +000091
Fred Drake659ebfa2000-04-03 15:42:13 +000092The header files are typically installed with Python. On \UNIX, these
93are located in the directories
94\file{\envvar{prefix}/include/python\var{version}/} and
95\file{\envvar{exec_prefix}/include/python\var{version}/}, where
96\envvar{prefix} and \envvar{exec_prefix} are defined by the
97corresponding parameters to Python's \program{configure} script and
98\var{version} is \code{sys.version[:3]}. On Windows, the headers are
99installed in \file{\envvar{prefix}/include}, where \envvar{prefix} is
100the installation directory specified to the installer.
101
102To include the headers, place both directories (if different) on your
103compiler's search path for includes. Do \emph{not} place the parent
104directories on the search path and then use
Fred Draked5d04352000-09-14 20:24:17 +0000105\samp{\#include <python\shortversion/Python.h>}; this will break on
Fred Drake659ebfa2000-04-03 15:42:13 +0000106multi-platform builds since the platform independent headers under
107\envvar{prefix} include the platform specific headers from
108\envvar{exec_prefix}.
109
Fred Drakebab29652001-07-10 16:10:08 +0000110\Cpp{} users should note that though the API is defined entirely using
111C, the header files do properly declare the entry points to be
112\code{extern "C"}, so there is no need to do anything special to use
113the API from \Cpp.
114
Fred Drakeefd146c1999-02-15 15:30:45 +0000115
116\section{Objects, Types and Reference Counts \label{objects}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000117
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000118Most Python/C API functions have one or more arguments as well as a
Fred Drake659ebfa2000-04-03 15:42:13 +0000119return value of type \ctype{PyObject*}. This type is a pointer
Fred Drakee058b4f1998-02-16 06:15:35 +0000120to an opaque data type representing an arbitrary Python
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000121object. Since all Python object types are treated the same way by the
122Python language in most situations (e.g., assignments, scope rules,
123and argument passing), it is only fitting that they should be
Fred Drake659ebfa2000-04-03 15:42:13 +0000124represented by a single C type. Almost all Python objects live on the
125heap: you never declare an automatic or static variable of type
126\ctype{PyObject}, only pointer variables of type \ctype{PyObject*} can
127be declared. The sole exception are the type objects\obindex{type};
128since these must never be deallocated, they are typically static
129\ctype{PyTypeObject} objects.
Guido van Rossum59a61351997-08-14 20:34:33 +0000130
Fred Drakee058b4f1998-02-16 06:15:35 +0000131All Python objects (even Python integers) have a \dfn{type} and a
132\dfn{reference count}. An object's type determines what kind of object
Guido van Rossum4a944d71997-08-14 20:35:38 +0000133it is (e.g., an integer, a list, or a user-defined function; there are
Fred Drakebe486461999-11-09 17:03:03 +0000134many more as explained in the \citetitle[../ref/ref.html]{Python
135Reference Manual}). For each of the well-known types there is a macro
136to check whether an object is of that type; for instance,
Fred Drake659ebfa2000-04-03 15:42:13 +0000137\samp{PyList_Check(\var{a})} is true if (and only if) the object
138pointed to by \var{a} is a Python list.
Guido van Rossum59a61351997-08-14 20:34:33 +0000139
Fred Drakeefd146c1999-02-15 15:30:45 +0000140
141\subsection{Reference Counts \label{refcounts}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000142
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000143The reference count is important because today's computers have a
Fred Drake003d8da1998-04-13 00:53:42 +0000144finite (and often severely limited) memory size; it counts how many
Guido van Rossum4a944d71997-08-14 20:35:38 +0000145different places there are that have a reference to an object. Such a
Fred Drake659ebfa2000-04-03 15:42:13 +0000146place could be another object, or a global (or static) C variable, or
147a local variable in some C function. When an object's reference count
Guido van Rossum4a944d71997-08-14 20:35:38 +0000148becomes zero, the object is deallocated. If it contains references to
149other objects, their reference count is decremented. Those other
150objects may be deallocated in turn, if this decrement makes their
151reference count become zero, and so on. (There's an obvious problem
152with objects that reference each other here; for now, the solution is
Fred Drake659ebfa2000-04-03 15:42:13 +0000153``don't do that.'')
Guido van Rossum59a61351997-08-14 20:34:33 +0000154
Guido van Rossum4a944d71997-08-14 20:35:38 +0000155Reference counts are always manipulated explicitly. The normal way is
Fred Drake659ebfa2000-04-03 15:42:13 +0000156to use the macro \cfunction{Py_INCREF()}\ttindex{Py_INCREF()} to
157increment an object's reference count by one, and
158\cfunction{Py_DECREF()}\ttindex{Py_DECREF()} to decrement it by
159one. The \cfunction{Py_DECREF()} macro is considerably more complex
160than the incref one, since it must check whether the reference count
161becomes zero and then cause the object's deallocator to be called.
162The deallocator is a function pointer contained in the object's type
163structure. The type-specific deallocator takes care of decrementing
164the reference counts for other objects contained in the object if this
165is a compound object type, such as a list, as well as performing any
166additional finalization that's needed. There's no chance that the
167reference count can overflow; at least as many bits are used to hold
168the reference count as there are distinct memory locations in virtual
169memory (assuming \code{sizeof(long) >= sizeof(char*)}). Thus, the
170reference count increment is a simple operation.
Guido van Rossum59a61351997-08-14 20:34:33 +0000171
Guido van Rossum4a944d71997-08-14 20:35:38 +0000172It is not necessary to increment an object's reference count for every
173local variable that contains a pointer to an object. In theory, the
Fred Drakee058b4f1998-02-16 06:15:35 +0000174object's reference count goes up by one when the variable is made to
Guido van Rossum4a944d71997-08-14 20:35:38 +0000175point to it and it goes down by one when the variable goes out of
176scope. However, these two cancel each other out, so at the end the
177reference count hasn't changed. The only real reason to use the
178reference count is to prevent the object from being deallocated as
179long as our variable is pointing to it. If we know that there is at
180least one other reference to the object that lives at least as long as
181our variable, there is no need to increment the reference count
182temporarily. An important situation where this arises is in objects
Fred Drake659ebfa2000-04-03 15:42:13 +0000183that are passed as arguments to C functions in an extension module
Guido van Rossum4a944d71997-08-14 20:35:38 +0000184that are called from Python; the call mechanism guarantees to hold a
Guido van Rossum59a61351997-08-14 20:34:33 +0000185reference to every argument for the duration of the call.
186
Fred Drakee058b4f1998-02-16 06:15:35 +0000187However, a common pitfall is to extract an object from a list and
188hold on to it for a while without incrementing its reference count.
189Some other operation might conceivably remove the object from the
190list, decrementing its reference count and possible deallocating it.
191The real danger is that innocent-looking operations may invoke
192arbitrary Python code which could do this; there is a code path which
193allows control to flow back to the user from a \cfunction{Py_DECREF()},
194so almost any operation is potentially dangerous.
Guido van Rossum59a61351997-08-14 20:34:33 +0000195
Guido van Rossum4a944d71997-08-14 20:35:38 +0000196A safe approach is to always use the generic operations (functions
Fred Drake659ebfa2000-04-03 15:42:13 +0000197whose name begins with \samp{PyObject_}, \samp{PyNumber_},
198\samp{PySequence_} or \samp{PyMapping_}). These operations always
199increment the reference count of the object they return. This leaves
200the caller with the responsibility to call
201\cfunction{Py_DECREF()} when they are done with the result; this soon
202becomes second nature.
Guido van Rossum59a61351997-08-14 20:34:33 +0000203
Fred Drakeefd146c1999-02-15 15:30:45 +0000204
205\subsubsection{Reference Count Details \label{refcountDetails}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000206
207The reference count behavior of functions in the Python/C API is best
Fred Drake659ebfa2000-04-03 15:42:13 +0000208explained in terms of \emph{ownership of references}. Note that we
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000209talk of owning references, never of owning objects; objects are always
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000210shared! When a function owns a reference, it has to dispose of it
Fred Drakee058b4f1998-02-16 06:15:35 +0000211properly --- either by passing ownership on (usually to its caller) or
212by calling \cfunction{Py_DECREF()} or \cfunction{Py_XDECREF()}. When
213a function passes ownership of a reference on to its caller, the
214caller is said to receive a \emph{new} reference. When no ownership
215is transferred, the caller is said to \emph{borrow} the reference.
216Nothing needs to be done for a borrowed reference.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000217
Fred Drakea8455ab2000-06-16 19:58:42 +0000218Conversely, when a calling function passes it a reference to an
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000219object, there are two possibilities: the function \emph{steals} a
220reference to the object, or it does not. Few functions steal
Fred Drakee058b4f1998-02-16 06:15:35 +0000221references; the two notable exceptions are
Fred Drake659ebfa2000-04-03 15:42:13 +0000222\cfunction{PyList_SetItem()}\ttindex{PyList_SetItem()} and
223\cfunction{PyTuple_SetItem()}\ttindex{PyTuple_SetItem()}, which
Fred Drakee058b4f1998-02-16 06:15:35 +0000224steal a reference to the item (but not to the tuple or list into which
Fred Drake003d8da1998-04-13 00:53:42 +0000225the item is put!). These functions were designed to steal a reference
Fred Drakee058b4f1998-02-16 06:15:35 +0000226because of a common idiom for populating a tuple or list with newly
227created objects; for example, the code to create the tuple \code{(1,
2282, "three")} could look like this (forgetting about error handling for
Fred Drake659ebfa2000-04-03 15:42:13 +0000229the moment; a better way to code this is shown below):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000230
231\begin{verbatim}
232PyObject *t;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000233
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000234t = PyTuple_New(3);
235PyTuple_SetItem(t, 0, PyInt_FromLong(1L));
236PyTuple_SetItem(t, 1, PyInt_FromLong(2L));
237PyTuple_SetItem(t, 2, PyString_FromString("three"));
238\end{verbatim}
239
Fred Drakee058b4f1998-02-16 06:15:35 +0000240Incidentally, \cfunction{PyTuple_SetItem()} is the \emph{only} way to
241set tuple items; \cfunction{PySequence_SetItem()} and
242\cfunction{PyObject_SetItem()} refuse to do this since tuples are an
243immutable data type. You should only use
244\cfunction{PyTuple_SetItem()} for tuples that you are creating
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000245yourself.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000246
247Equivalent code for populating a list can be written using
Fred Drakee058b4f1998-02-16 06:15:35 +0000248\cfunction{PyList_New()} and \cfunction{PyList_SetItem()}. Such code
249can also use \cfunction{PySequence_SetItem()}; this illustrates the
250difference between the two (the extra \cfunction{Py_DECREF()} calls):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000251
252\begin{verbatim}
253PyObject *l, *x;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000254
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000255l = PyList_New(3);
256x = PyInt_FromLong(1L);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000257PySequence_SetItem(l, 0, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000258x = PyInt_FromLong(2L);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000259PySequence_SetItem(l, 1, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000260x = PyString_FromString("three");
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000261PySequence_SetItem(l, 2, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000262\end{verbatim}
263
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000264You might find it strange that the ``recommended'' approach takes more
265code. However, in practice, you will rarely use these ways of
266creating and populating a tuple or list. There's a generic function,
Fred Drakee058b4f1998-02-16 06:15:35 +0000267\cfunction{Py_BuildValue()}, that can create most common objects from
Fred Drake659ebfa2000-04-03 15:42:13 +0000268C values, directed by a \dfn{format string}. For example, the
Fred Drakee058b4f1998-02-16 06:15:35 +0000269above two blocks of code could be replaced by the following (which
270also takes care of the error checking):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000271
272\begin{verbatim}
273PyObject *t, *l;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000274
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000275t = Py_BuildValue("(iis)", 1, 2, "three");
276l = Py_BuildValue("[iis]", 1, 2, "three");
277\end{verbatim}
278
Fred Drakee058b4f1998-02-16 06:15:35 +0000279It is much more common to use \cfunction{PyObject_SetItem()} and
280friends with items whose references you are only borrowing, like
281arguments that were passed in to the function you are writing. In
282that case, their behaviour regarding reference counts is much saner,
283since you don't have to increment a reference count so you can give a
284reference away (``have it be stolen''). For example, this function
285sets all items of a list (actually, any mutable sequence) to a given
286item:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000287
288\begin{verbatim}
289int set_all(PyObject *target, PyObject *item)
290{
291 int i, n;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000292
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000293 n = PyObject_Length(target);
294 if (n < 0)
295 return -1;
296 for (i = 0; i < n; i++) {
297 if (PyObject_SetItem(target, i, item) < 0)
298 return -1;
299 }
300 return 0;
301}
302\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000303\ttindex{set_all()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000304
305The situation is slightly different for function return values.
306While passing a reference to most functions does not change your
307ownership responsibilities for that reference, many functions that
308return a referece to an object give you ownership of the reference.
309The reason is simple: in many cases, the returned object is created
310on the fly, and the reference you get is the only reference to the
Fred Drakee058b4f1998-02-16 06:15:35 +0000311object. Therefore, the generic functions that return object
312references, like \cfunction{PyObject_GetItem()} and
Fred Drakebab29652001-07-10 16:10:08 +0000313\cfunction{PySequence_GetItem()}, always return a new reference (the
314caller becomes the owner of the reference).
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000315
316It is important to realize that whether you own a reference returned
Fred Drakee058b4f1998-02-16 06:15:35 +0000317by a function depends on which function you call only --- \emph{the
Fred Drakebab29652001-07-10 16:10:08 +0000318plumage} (the type of the type of the object passed as an
Fred Drakee058b4f1998-02-16 06:15:35 +0000319argument to the function) \emph{doesn't enter into it!} Thus, if you
320extract an item from a list using \cfunction{PyList_GetItem()}, you
321don't own the reference --- but if you obtain the same item from the
322same list using \cfunction{PySequence_GetItem()} (which happens to
323take exactly the same arguments), you do own a reference to the
324returned object.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000325
Fred Drakee058b4f1998-02-16 06:15:35 +0000326Here is an example of how you could write a function that computes the
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000327sum of the items in a list of integers; once using
Fred Drake659ebfa2000-04-03 15:42:13 +0000328\cfunction{PyList_GetItem()}\ttindex{PyList_GetItem()}, and once using
329\cfunction{PySequence_GetItem()}\ttindex{PySequence_GetItem()}.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000330
331\begin{verbatim}
332long sum_list(PyObject *list)
333{
334 int i, n;
335 long total = 0;
336 PyObject *item;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000337
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000338 n = PyList_Size(list);
339 if (n < 0)
340 return -1; /* Not a list */
341 for (i = 0; i < n; i++) {
342 item = PyList_GetItem(list, i); /* Can't fail */
343 if (!PyInt_Check(item)) continue; /* Skip non-integers */
344 total += PyInt_AsLong(item);
345 }
346 return total;
347}
348\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000349\ttindex{sum_list()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000350
351\begin{verbatim}
352long sum_sequence(PyObject *sequence)
353{
354 int i, n;
355 long total = 0;
356 PyObject *item;
Fred Drake659ebfa2000-04-03 15:42:13 +0000357 n = PySequence_Length(sequence);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000358 if (n < 0)
359 return -1; /* Has no length */
360 for (i = 0; i < n; i++) {
Fred Drake659ebfa2000-04-03 15:42:13 +0000361 item = PySequence_GetItem(sequence, i);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000362 if (item == NULL)
363 return -1; /* Not a sequence, or other failure */
364 if (PyInt_Check(item))
365 total += PyInt_AsLong(item);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000366 Py_DECREF(item); /* Discard reference ownership */
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000367 }
368 return total;
369}
370\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000371\ttindex{sum_sequence()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000372
Fred Drakeefd146c1999-02-15 15:30:45 +0000373
374\subsection{Types \label{types}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000375
376There are few other data types that play a significant role in
Fred Drake659ebfa2000-04-03 15:42:13 +0000377the Python/C API; most are simple C types such as \ctype{int},
378\ctype{long}, \ctype{double} and \ctype{char*}. A few structure types
Guido van Rossum4a944d71997-08-14 20:35:38 +0000379are used to describe static tables used to list the functions exported
Fred Drake659ebfa2000-04-03 15:42:13 +0000380by a module or the data attributes of a new object type, and another
381is used to describe the value of a complex number. These will
Guido van Rossum59a61351997-08-14 20:34:33 +0000382be discussed together with the functions that use them.
383
Fred Drakeefd146c1999-02-15 15:30:45 +0000384
385\section{Exceptions \label{exceptions}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000386
Guido van Rossum4a944d71997-08-14 20:35:38 +0000387The Python programmer only needs to deal with exceptions if specific
388error handling is required; unhandled exceptions are automatically
Fred Drake659ebfa2000-04-03 15:42:13 +0000389propagated to the caller, then to the caller's caller, and so on, until
Guido van Rossum4a944d71997-08-14 20:35:38 +0000390they reach the top-level interpreter, where they are reported to the
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000391user accompanied by a stack traceback.
Guido van Rossum59a61351997-08-14 20:34:33 +0000392
Fred Drake659ebfa2000-04-03 15:42:13 +0000393For C programmers, however, error checking always has to be explicit.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000394All functions in the Python/C API can raise exceptions, unless an
395explicit claim is made otherwise in a function's documentation. In
396general, when a function encounters an error, it sets an exception,
397discards any object references that it owns, and returns an
Fred Drakee058b4f1998-02-16 06:15:35 +0000398error indicator --- usually \NULL{} or \code{-1}. A few functions
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000399return a Boolean true/false result, with false indicating an error.
400Very few functions return no explicit error indicator or have an
401ambiguous return value, and require explicit testing for errors with
Fred Drake659ebfa2000-04-03 15:42:13 +0000402\cfunction{PyErr_Occurred()}\ttindex{PyErr_Occurred()}.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000403
404Exception state is maintained in per-thread storage (this is
405equivalent to using global storage in an unthreaded application). A
Fred Drakec6fa34e1998-04-02 06:47:24 +0000406thread can be in one of two states: an exception has occurred, or not.
Fred Drakee058b4f1998-02-16 06:15:35 +0000407The function \cfunction{PyErr_Occurred()} can be used to check for
408this: it returns a borrowed reference to the exception type object
409when an exception has occurred, and \NULL{} otherwise. There are a
410number of functions to set the exception state:
Fred Drake659ebfa2000-04-03 15:42:13 +0000411\cfunction{PyErr_SetString()}\ttindex{PyErr_SetString()} is the most
412common (though not the most general) function to set the exception
413state, and \cfunction{PyErr_Clear()}\ttindex{PyErr_Clear()} clears the
414exception state.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000415
416The full exception state consists of three objects (all of which can
Fred Drakee058b4f1998-02-16 06:15:35 +0000417be \NULL{}): the exception type, the corresponding exception
Fred Drake659ebfa2000-04-03 15:42:13 +0000418value, and the traceback. These have the same meanings as the Python
419\withsubitem{(in module sys)}{
420 \ttindex{exc_type}\ttindex{exc_value}\ttindex{exc_traceback}}
421objects \code{sys.exc_type}, \code{sys.exc_value}, and
422\code{sys.exc_traceback}; however, they are not the same: the Python
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000423objects represent the last exception being handled by a Python
Fred Drake659ebfa2000-04-03 15:42:13 +0000424\keyword{try} \ldots\ \keyword{except} statement, while the C level
Fred Drakee058b4f1998-02-16 06:15:35 +0000425exception state only exists while an exception is being passed on
Fred Drake659ebfa2000-04-03 15:42:13 +0000426between C functions until it reaches the Python bytecode interpreter's
427main loop, which takes care of transferring it to \code{sys.exc_type}
428and friends.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000429
Fred Drakec6fa34e1998-04-02 06:47:24 +0000430Note that starting with Python 1.5, the preferred, thread-safe way to
Fred Drake659ebfa2000-04-03 15:42:13 +0000431access the exception state from Python code is to call the function
432\withsubitem{(in module sys)}{\ttindex{exc_info()}}
Fred Drakee058b4f1998-02-16 06:15:35 +0000433\function{sys.exc_info()}, which returns the per-thread exception state
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000434for Python code. Also, the semantics of both ways to access the
435exception state have changed so that a function which catches an
436exception will save and restore its thread's exception state so as to
437preserve the exception state of its caller. This prevents common bugs
438in exception handling code caused by an innocent-looking function
439overwriting the exception being handled; it also reduces the often
440unwanted lifetime extension for objects that are referenced by the
Fred Drakec6fa34e1998-04-02 06:47:24 +0000441stack frames in the traceback.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000442
443As a general principle, a function that calls another function to
444perform some task should check whether the called function raised an
445exception, and if so, pass the exception state on to its caller. It
Fred Drake659ebfa2000-04-03 15:42:13 +0000446should discard any object references that it owns, and return an
Fred Drakee058b4f1998-02-16 06:15:35 +0000447error indicator, but it should \emph{not} set another exception ---
448that would overwrite the exception that was just raised, and lose
449important information about the exact cause of the error.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000450
Fred Drake659ebfa2000-04-03 15:42:13 +0000451A simple example of detecting exceptions and passing them on is shown
452in the \cfunction{sum_sequence()}\ttindex{sum_sequence()} example
453above. It so happens that that example doesn't need to clean up any
454owned references when it detects an error. The following example
455function shows some error cleanup. First, to remind you why you like
456Python, we show the equivalent Python code:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000457
458\begin{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000459def incr_item(dict, key):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000460 try:
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000461 item = dict[key]
462 except KeyError:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000463 item = 0
Fred Drake6b3f3f22000-11-29 15:48:22 +0000464 dict[key] = item + 1
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000465\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000466\ttindex{incr_item()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000467
Fred Drake659ebfa2000-04-03 15:42:13 +0000468Here is the corresponding C code, in all its glory:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000469
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000470\begin{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000471int incr_item(PyObject *dict, PyObject *key)
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000472{
473 /* Objects all initialized to NULL for Py_XDECREF */
474 PyObject *item = NULL, *const_one = NULL, *incremented_item = NULL;
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000475 int rv = -1; /* Return value initialized to -1 (failure) */
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000476
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000477 item = PyObject_GetItem(dict, key);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000478 if (item == NULL) {
Fred Drakec6fa34e1998-04-02 06:47:24 +0000479 /* Handle KeyError only: */
Fred Drake6b3f3f22000-11-29 15:48:22 +0000480 if (!PyErr_ExceptionMatches(PyExc_KeyError))
481 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000482
483 /* Clear the error and use zero: */
484 PyErr_Clear();
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000485 item = PyInt_FromLong(0L);
Fred Drake6b3f3f22000-11-29 15:48:22 +0000486 if (item == NULL)
487 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000488 }
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000489 const_one = PyInt_FromLong(1L);
Fred Drake6b3f3f22000-11-29 15:48:22 +0000490 if (const_one == NULL)
491 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000492
493 incremented_item = PyNumber_Add(item, const_one);
Fred Drake6b3f3f22000-11-29 15:48:22 +0000494 if (incremented_item == NULL)
495 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000496
Fred Drake6b3f3f22000-11-29 15:48:22 +0000497 if (PyObject_SetItem(dict, key, incremented_item) < 0)
498 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000499 rv = 0; /* Success */
500 /* Continue with cleanup code */
501
502 error:
503 /* Cleanup code, shared by success and failure path */
504
505 /* Use Py_XDECREF() to ignore NULL references */
506 Py_XDECREF(item);
507 Py_XDECREF(const_one);
508 Py_XDECREF(incremented_item);
509
510 return rv; /* -1 for error, 0 for success */
511}
512\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000513\ttindex{incr_item()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000514
Fred Drakef8830d11998-04-23 14:06:01 +0000515This example represents an endorsed use of the \keyword{goto} statement
Fred Drake659ebfa2000-04-03 15:42:13 +0000516in C! It illustrates the use of
517\cfunction{PyErr_ExceptionMatches()}\ttindex{PyErr_ExceptionMatches()} and
518\cfunction{PyErr_Clear()}\ttindex{PyErr_Clear()} to
519handle specific exceptions, and the use of
520\cfunction{Py_XDECREF()}\ttindex{Py_XDECREF()} to
521dispose of owned references that may be \NULL{} (note the
522\character{X} in the name; \cfunction{Py_DECREF()} would crash when
523confronted with a \NULL{} reference). It is important that the
524variables used to hold owned references are initialized to \NULL{} for
525this to work; likewise, the proposed return value is initialized to
526\code{-1} (failure) and only set to success after the final call made
527is successful.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000528
Guido van Rossum59a61351997-08-14 20:34:33 +0000529
Fred Drakeefd146c1999-02-15 15:30:45 +0000530\section{Embedding Python \label{embedding}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000531
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000532The one important task that only embedders (as opposed to extension
533writers) of the Python interpreter have to worry about is the
534initialization, and possibly the finalization, of the Python
535interpreter. Most functionality of the interpreter can only be used
536after the interpreter has been initialized.
Guido van Rossum59a61351997-08-14 20:34:33 +0000537
Fred Drake659ebfa2000-04-03 15:42:13 +0000538The basic initialization function is
539\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}.
Fred Drakee058b4f1998-02-16 06:15:35 +0000540This initializes the table of loaded modules, and creates the
Fred Drake4de05a91998-02-16 14:25:26 +0000541fundamental modules \module{__builtin__}\refbimodindex{__builtin__},
542\module{__main__}\refbimodindex{__main__} and
543\module{sys}\refbimodindex{sys}. It also initializes the module
Fred Drakec6fa34e1998-04-02 06:47:24 +0000544search path (\code{sys.path}).%
545\indexiii{module}{search}{path}
Fred Drake659ebfa2000-04-03 15:42:13 +0000546\withsubitem{(in module sys)}{\ttindex{path}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000547
Fred Drakee058b4f1998-02-16 06:15:35 +0000548\cfunction{Py_Initialize()} does not set the ``script argument list''
Guido van Rossum4a944d71997-08-14 20:35:38 +0000549(\code{sys.argv}). If this variable is needed by Python code that
550will be executed later, it must be set explicitly with a call to
Fred Drake659ebfa2000-04-03 15:42:13 +0000551\code{PySys_SetArgv(\var{argc},
552\var{argv})}\ttindex{PySys_SetArgv()} subsequent to the call to
553\cfunction{Py_Initialize()}.
Guido van Rossum59a61351997-08-14 20:34:33 +0000554
Fred Drakeb0a78731998-01-13 18:51:10 +0000555On most systems (in particular, on \UNIX{} and Windows, although the
Fred Drake659ebfa2000-04-03 15:42:13 +0000556details are slightly different),
557\cfunction{Py_Initialize()} calculates the module search path based
558upon its best guess for the location of the standard Python
559interpreter executable, assuming that the Python library is found in a
560fixed location relative to the Python interpreter executable. In
561particular, it looks for a directory named
Fred Draked5d04352000-09-14 20:24:17 +0000562\file{lib/python\shortversion} relative to the parent directory where
563the executable named \file{python} is found on the shell command
564search path (the environment variable \envvar{PATH}).
Guido van Rossum42cefd01997-10-05 15:27:29 +0000565
566For instance, if the Python executable is found in
Fred Drakee058b4f1998-02-16 06:15:35 +0000567\file{/usr/local/bin/python}, it will assume that the libraries are in
Fred Draked5d04352000-09-14 20:24:17 +0000568\file{/usr/local/lib/python\shortversion}. (In fact, this particular path
Fred Drakee058b4f1998-02-16 06:15:35 +0000569is also the ``fallback'' location, used when no executable file named
Fred Drakec6fa34e1998-04-02 06:47:24 +0000570\file{python} is found along \envvar{PATH}.) The user can override
571this behavior by setting the environment variable \envvar{PYTHONHOME},
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000572or insert additional directories in front of the standard path by
Fred Drakec6fa34e1998-04-02 06:47:24 +0000573setting \envvar{PYTHONPATH}.
Guido van Rossum59a61351997-08-14 20:34:33 +0000574
Guido van Rossum4a944d71997-08-14 20:35:38 +0000575The embedding application can steer the search by calling
Fred Drake659ebfa2000-04-03 15:42:13 +0000576\code{Py_SetProgramName(\var{file})}\ttindex{Py_SetProgramName()} \emph{before} calling
Fred Drakec6fa34e1998-04-02 06:47:24 +0000577\cfunction{Py_Initialize()}. Note that \envvar{PYTHONHOME} still
578overrides this and \envvar{PYTHONPATH} is still inserted in front of
Fred Drakee058b4f1998-02-16 06:15:35 +0000579the standard path. An application that requires total control has to
Fred Drake659ebfa2000-04-03 15:42:13 +0000580provide its own implementation of
581\cfunction{Py_GetPath()}\ttindex{Py_GetPath()},
582\cfunction{Py_GetPrefix()}\ttindex{Py_GetPrefix()},
583\cfunction{Py_GetExecPrefix()}\ttindex{Py_GetExecPrefix()}, and
584\cfunction{Py_GetProgramFullPath()}\ttindex{Py_GetProgramFullPath()} (all
585defined in \file{Modules/getpath.c}).
Guido van Rossum59a61351997-08-14 20:34:33 +0000586
Guido van Rossum4a944d71997-08-14 20:35:38 +0000587Sometimes, it is desirable to ``uninitialize'' Python. For instance,
588the application may want to start over (make another call to
Fred Drakee058b4f1998-02-16 06:15:35 +0000589\cfunction{Py_Initialize()}) or the application is simply done with its
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000590use of Python and wants to free all memory allocated by Python. This
Fred Drakee058b4f1998-02-16 06:15:35 +0000591can be accomplished by calling \cfunction{Py_Finalize()}. The function
Fred Drake659ebfa2000-04-03 15:42:13 +0000592\cfunction{Py_IsInitialized()}\ttindex{Py_IsInitialized()} returns
593true if Python is currently in the initialized state. More
594information about these functions is given in a later chapter.
Guido van Rossum59a61351997-08-14 20:34:33 +0000595
Guido van Rossum4a944d71997-08-14 20:35:38 +0000596
Fred Drakeefd146c1999-02-15 15:30:45 +0000597\chapter{The Very High Level Layer \label{veryhigh}}
Guido van Rossum4a944d71997-08-14 20:35:38 +0000598
Fred Drakee5bf8b21998-02-12 21:22:28 +0000599The functions in this chapter will let you execute Python source code
600given in a file or a buffer, but they will not let you interact in a
601more detailed way with the interpreter.
Guido van Rossum4a944d71997-08-14 20:35:38 +0000602
Fred Drake659ebfa2000-04-03 15:42:13 +0000603Several of these functions accept a start symbol from the grammar as a
604parameter. The available start symbols are \constant{Py_eval_input},
605\constant{Py_file_input}, and \constant{Py_single_input}. These are
606described following the functions which accept them as parameters.
607
Fred Drake510d08b2000-08-14 02:50:21 +0000608Note also that several of these functions take \ctype{FILE*}
609parameters. On particular issue which needs to be handled carefully
610is that the \ctype{FILE} structure for different C libraries can be
611different and incompatible. Under Windows (at least), it is possible
612for dynamically linked extensions to actually use different libraries,
613so care should be taken that \ctype{FILE*} parameters are only passed
614to these functions if it is certain that they were created by the same
615library that the Python runtime is using.
616
Fred Drake24e62192001-05-21 15:56:55 +0000617\begin{cfuncdesc}{int}{Py_Main}{int argc, char **argv}
618 The main program for the standard interpreter. This is made
619 available for programs which embed Python. The \var{argc} and
620 \var{argv} parameters should be prepared exactly as those which are
621 passed to a C program's \cfunction{main()} function. It is
622 important to note that the argument list may be modified (but the
623 contents of the strings pointed to by the argument list are not).
624 The return value will be the integer passed to the
625 \function{sys.exit()} function, \code{1} if the interpreter exits
626 due to an exception, or \code{2} if the parameter list does not
627 represent a valid Python command line.
628\end{cfuncdesc}
629
Fred Drakec6fa34e1998-04-02 06:47:24 +0000630\begin{cfuncdesc}{int}{PyRun_AnyFile}{FILE *fp, char *filename}
Fred Drake0041a941999-04-29 04:20:46 +0000631 If \var{fp} refers to a file associated with an interactive device
632 (console or terminal input or \UNIX{} pseudo-terminal), return the
633 value of \cfunction{PyRun_InteractiveLoop()}, otherwise return the
634 result of \cfunction{PyRun_SimpleFile()}. If \var{filename} is
Fred Drakea8d73412000-08-11 20:39:29 +0000635 \NULL{}, this function uses \code{"???"} as the filename.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000636\end{cfuncdesc}
637
Fred Drakec6fa34e1998-04-02 06:47:24 +0000638\begin{cfuncdesc}{int}{PyRun_SimpleString}{char *command}
Fred Drake0041a941999-04-29 04:20:46 +0000639 Executes the Python source code from \var{command} in the
640 \module{__main__} module. If \module{__main__} does not already
641 exist, it is created. Returns \code{0} on success or \code{-1} if
642 an exception was raised. If there was an error, there is no way to
643 get the exception information.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000644\end{cfuncdesc}
645
Fred Drakec6fa34e1998-04-02 06:47:24 +0000646\begin{cfuncdesc}{int}{PyRun_SimpleFile}{FILE *fp, char *filename}
Fred Drake0041a941999-04-29 04:20:46 +0000647 Similar to \cfunction{PyRun_SimpleString()}, but the Python source
648 code is read from \var{fp} instead of an in-memory string.
649 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000650\end{cfuncdesc}
651
Fred Drakec6fa34e1998-04-02 06:47:24 +0000652\begin{cfuncdesc}{int}{PyRun_InteractiveOne}{FILE *fp, char *filename}
Fred Drakea8d73412000-08-11 20:39:29 +0000653 Read and execute a single statement from a file associated with an
654 interactive device. If \var{filename} is \NULL, \code{"???"} is
655 used instead. The user will be prompted using \code{sys.ps1} and
656 \code{sys.ps2}. Returns \code{0} when the input was executed
657 successfully, \code{-1} if there was an exception, or an error code
658 from the \file{errcode.h} include file distributed as part of Python
659 in case of a parse error. (Note that \file{errcode.h} is not
660 included by \file{Python.h}, so must be included specifically if
661 needed.)
Fred Drakee5bf8b21998-02-12 21:22:28 +0000662\end{cfuncdesc}
663
Fred Drakec6fa34e1998-04-02 06:47:24 +0000664\begin{cfuncdesc}{int}{PyRun_InteractiveLoop}{FILE *fp, char *filename}
Fred Drakea8d73412000-08-11 20:39:29 +0000665 Read and execute statements from a file associated with an
666 interactive device until \EOF{} is reached. If \var{filename} is
667 \NULL, \code{"???"} is used instead. The user will be prompted
668 using \code{sys.ps1} and \code{sys.ps2}. Returns \code{0} at \EOF.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000669\end{cfuncdesc}
670
Fred Drakec6fa34e1998-04-02 06:47:24 +0000671\begin{cfuncdesc}{struct _node*}{PyParser_SimpleParseString}{char *str,
672 int start}
Fred Drake0041a941999-04-29 04:20:46 +0000673 Parse Python source code from \var{str} using the start token
674 \var{start}. The result can be used to create a code object which
675 can be evaluated efficiently. This is useful if a code fragment
676 must be evaluated many times.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000677\end{cfuncdesc}
678
Fred Drakec6fa34e1998-04-02 06:47:24 +0000679\begin{cfuncdesc}{struct _node*}{PyParser_SimpleParseFile}{FILE *fp,
680 char *filename, int start}
Fred Drake0041a941999-04-29 04:20:46 +0000681 Similar to \cfunction{PyParser_SimpleParseString()}, but the Python
682 source code is read from \var{fp} instead of an in-memory string.
683 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000684\end{cfuncdesc}
685
Fred Drakec6fa34e1998-04-02 06:47:24 +0000686\begin{cfuncdesc}{PyObject*}{PyRun_String}{char *str, int start,
687 PyObject *globals,
688 PyObject *locals}
Fred Drake0041a941999-04-29 04:20:46 +0000689 Execute Python source code from \var{str} in the context specified
690 by the dictionaries \var{globals} and \var{locals}. The parameter
691 \var{start} specifies the start token that should be used to parse
692 the source code.
693
694 Returns the result of executing the code as a Python object, or
695 \NULL{} if an exception was raised.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000696\end{cfuncdesc}
697
Fred Drakec6fa34e1998-04-02 06:47:24 +0000698\begin{cfuncdesc}{PyObject*}{PyRun_File}{FILE *fp, char *filename,
699 int start, PyObject *globals,
700 PyObject *locals}
Fred Drake0041a941999-04-29 04:20:46 +0000701 Similar to \cfunction{PyRun_String()}, but the Python source code is
Fred Drake659ebfa2000-04-03 15:42:13 +0000702 read from \var{fp} instead of an in-memory string.
703 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000704\end{cfuncdesc}
705
Fred Drakec6fa34e1998-04-02 06:47:24 +0000706\begin{cfuncdesc}{PyObject*}{Py_CompileString}{char *str, char *filename,
707 int start}
Fred Drake0041a941999-04-29 04:20:46 +0000708 Parse and compile the Python source code in \var{str}, returning the
709 resulting code object. The start token is given by \var{start};
Fred Drakec924b8d1999-08-23 18:57:25 +0000710 this can be used to constrain the code which can be compiled and should
711 be \constant{Py_eval_input}, \constant{Py_file_input}, or
712 \constant{Py_single_input}. The filename specified by
713 \var{filename} is used to construct the code object and may appear
714 in tracebacks or \exception{SyntaxError} exception messages. This
715 returns \NULL{} if the code cannot be parsed or compiled.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000716\end{cfuncdesc}
717
Fred Drakec924b8d1999-08-23 18:57:25 +0000718\begin{cvardesc}{int}{Py_eval_input}
719 The start symbol from the Python grammar for isolated expressions;
Fred Drake659ebfa2000-04-03 15:42:13 +0000720 for use with \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}.
Fred Drakec924b8d1999-08-23 18:57:25 +0000721\end{cvardesc}
722
723\begin{cvardesc}{int}{Py_file_input}
724 The start symbol from the Python grammar for sequences of statements
725 as read from a file or other source; for use with
Fred Drake659ebfa2000-04-03 15:42:13 +0000726 \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}. This is
727 the symbol to use when compiling arbitrarily long Python source code.
Fred Drakec924b8d1999-08-23 18:57:25 +0000728\end{cvardesc}
729
730\begin{cvardesc}{int}{Py_single_input}
731 The start symbol from the Python grammar for a single statement; for
Fred Drake659ebfa2000-04-03 15:42:13 +0000732 use with \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}.
733 This is the symbol used for the interactive interpreter loop.
Fred Drakec924b8d1999-08-23 18:57:25 +0000734\end{cvardesc}
735
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000736
Fred Drakeefd146c1999-02-15 15:30:45 +0000737\chapter{Reference Counting \label{countingRefs}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000738
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000739The macros in this section are used for managing reference counts
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000740of Python objects.
741
742\begin{cfuncdesc}{void}{Py_INCREF}{PyObject *o}
Fred Drakec6fa34e1998-04-02 06:47:24 +0000743Increment the reference count for object \var{o}. The object must
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000744not be \NULL{}; if you aren't sure that it isn't \NULL{}, use
Fred Drakee058b4f1998-02-16 06:15:35 +0000745\cfunction{Py_XINCREF()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000746\end{cfuncdesc}
747
748\begin{cfuncdesc}{void}{Py_XINCREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000749Increment the reference count for object \var{o}. The object may be
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000750\NULL{}, in which case the macro has no effect.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000751\end{cfuncdesc}
752
753\begin{cfuncdesc}{void}{Py_DECREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000754Decrement the reference count for object \var{o}. The object must
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000755not be \NULL{}; if you aren't sure that it isn't \NULL{}, use
Fred Drakee058b4f1998-02-16 06:15:35 +0000756\cfunction{Py_XDECREF()}. If the reference count reaches zero, the
757object's type's deallocation function (which must not be \NULL{}) is
758invoked.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000759
760\strong{Warning:} The deallocation function can cause arbitrary Python
Fred Drake659ebfa2000-04-03 15:42:13 +0000761code to be invoked (e.g. when a class instance with a
762\method{__del__()} method is deallocated). While exceptions in such
763code are not propagated, the executed code has free access to all
764Python global variables. This means that any object that is reachable
765from a global variable should be in a consistent state before
766\cfunction{Py_DECREF()} is invoked. For example, code to delete an
767object from a list should copy a reference to the deleted object in a
768temporary variable, update the list data structure, and then call
769\cfunction{Py_DECREF()} for the temporary variable.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000770\end{cfuncdesc}
771
772\begin{cfuncdesc}{void}{Py_XDECREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000773Decrement the reference count for object \var{o}. The object may be
774\NULL{}, in which case the macro has no effect; otherwise the effect
775is the same as for \cfunction{Py_DECREF()}, and the same warning
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000776applies.
777\end{cfuncdesc}
778
Fred Drake659ebfa2000-04-03 15:42:13 +0000779The following functions or macros are only for use within the
780interpreter core: \cfunction{_Py_Dealloc()},
781\cfunction{_Py_ForgetReference()}, \cfunction{_Py_NewReference()}, as
782well as the global variable \cdata{_Py_RefTotal}.
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000783
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000784
Fred Drakeefd146c1999-02-15 15:30:45 +0000785\chapter{Exception Handling \label{exceptionHandling}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000786
Fred Drake659ebfa2000-04-03 15:42:13 +0000787The functions described in this chapter will let you handle and raise Python
Guido van Rossumae110af1997-05-22 20:11:52 +0000788exceptions. It is important to understand some of the basics of
Fred Drake659ebfa2000-04-03 15:42:13 +0000789Python exception handling. It works somewhat like the
790\UNIX{} \cdata{errno} variable: there is a global indicator (per
791thread) of the last error that occurred. Most functions don't clear
792this on success, but will set it to indicate the cause of the error on
793failure. Most functions also return an error indicator, usually
794\NULL{} if they are supposed to return a pointer, or \code{-1} if they
795return an integer (exception: the \cfunction{PyArg_Parse*()} functions
796return \code{1} for success and \code{0} for failure). When a
797function must fail because some function it called failed, it
798generally doesn't set the error indicator; the function it called
799already set it.
Guido van Rossumae110af1997-05-22 20:11:52 +0000800
801The error indicator consists of three Python objects corresponding to
Fred Drake659ebfa2000-04-03 15:42:13 +0000802\withsubitem{(in module sys)}{
803 \ttindex{exc_type}\ttindex{exc_value}\ttindex{exc_traceback}}
Guido van Rossumae110af1997-05-22 20:11:52 +0000804the Python variables \code{sys.exc_type}, \code{sys.exc_value} and
805\code{sys.exc_traceback}. API functions exist to interact with the
806error indicator in various ways. There is a separate error indicator
807for each thread.
808
809% XXX Order of these should be more thoughtful.
810% Either alphabetical or some kind of structure.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000811
812\begin{cfuncdesc}{void}{PyErr_Print}{}
Guido van Rossumae110af1997-05-22 20:11:52 +0000813Print a standard traceback to \code{sys.stderr} and clear the error
814indicator. Call this function only when the error indicator is set.
815(Otherwise it will cause a fatal error!)
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000816\end{cfuncdesc}
817
Fred Drakec6fa34e1998-04-02 06:47:24 +0000818\begin{cfuncdesc}{PyObject*}{PyErr_Occurred}{}
Guido van Rossumae110af1997-05-22 20:11:52 +0000819Test whether the error indicator is set. If set, return the exception
Fred Drakee058b4f1998-02-16 06:15:35 +0000820\emph{type} (the first argument to the last call to one of the
Fred Drakef8830d11998-04-23 14:06:01 +0000821\cfunction{PyErr_Set*()} functions or to \cfunction{PyErr_Restore()}). If
Fred Drakee058b4f1998-02-16 06:15:35 +0000822not set, return \NULL{}. You do not own a reference to the return
823value, so you do not need to \cfunction{Py_DECREF()} it.
Fred Drake659ebfa2000-04-03 15:42:13 +0000824\strong{Note:} Do not compare the return value to a specific
Fred Drakee058b4f1998-02-16 06:15:35 +0000825exception; use \cfunction{PyErr_ExceptionMatches()} instead, shown
Fred Drake659ebfa2000-04-03 15:42:13 +0000826below. (The comparison could easily fail since the exception may be
827an instance instead of a class, in the case of a class exception, or
828it may the a subclass of the expected exception.)
Guido van Rossum42cefd01997-10-05 15:27:29 +0000829\end{cfuncdesc}
830
831\begin{cfuncdesc}{int}{PyErr_ExceptionMatches}{PyObject *exc}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000832Equivalent to
Fred Drakee058b4f1998-02-16 06:15:35 +0000833\samp{PyErr_GivenExceptionMatches(PyErr_Occurred(), \var{exc})}.
Fred Drake659ebfa2000-04-03 15:42:13 +0000834This should only be called when an exception is actually set; a memory
835access violation will occur if no exception has been raised.
Guido van Rossum42cefd01997-10-05 15:27:29 +0000836\end{cfuncdesc}
837
838\begin{cfuncdesc}{int}{PyErr_GivenExceptionMatches}{PyObject *given, PyObject *exc}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000839Return true if the \var{given} exception matches the exception in
840\var{exc}. If \var{exc} is a class object, this also returns true
Fred Drake659ebfa2000-04-03 15:42:13 +0000841when \var{given} is an instance of a subclass. If \var{exc} is a tuple, all
Guido van Rossum42cefd01997-10-05 15:27:29 +0000842exceptions in the tuple (and recursively in subtuples) are searched
Fred Drake659ebfa2000-04-03 15:42:13 +0000843for a match. If \var{given} is \NULL, a memory access violation will
844occur.
Guido van Rossum42cefd01997-10-05 15:27:29 +0000845\end{cfuncdesc}
846
847\begin{cfuncdesc}{void}{PyErr_NormalizeException}{PyObject**exc, PyObject**val, PyObject**tb}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000848Under certain circumstances, the values returned by
Fred Drakee058b4f1998-02-16 06:15:35 +0000849\cfunction{PyErr_Fetch()} below can be ``unnormalized'', meaning that
850\code{*\var{exc}} is a class object but \code{*\var{val}} is not an
851instance of the same class. This function can be used to instantiate
852the class in that case. If the values are already normalized, nothing
Fred Drake659ebfa2000-04-03 15:42:13 +0000853happens. The delayed normalization is implemented to improve
854performance.
Guido van Rossumae110af1997-05-22 20:11:52 +0000855\end{cfuncdesc}
856
857\begin{cfuncdesc}{void}{PyErr_Clear}{}
858Clear the error indicator. If the error indicator is not set, there
859is no effect.
860\end{cfuncdesc}
861
Fred Drake659ebfa2000-04-03 15:42:13 +0000862\begin{cfuncdesc}{void}{PyErr_Fetch}{PyObject **ptype, PyObject **pvalue,
863 PyObject **ptraceback}
Guido van Rossumae110af1997-05-22 20:11:52 +0000864Retrieve the error indicator into three variables whose addresses are
865passed. If the error indicator is not set, set all three variables to
866\NULL{}. If it is set, it will be cleared and you own a reference to
Fred Drake659ebfa2000-04-03 15:42:13 +0000867each object retrieved. The value and traceback object may be
868\NULL{} even when the type object is not. \strong{Note:} This
869function is normally only used by code that needs to handle exceptions
870or by code that needs to save and restore the error indicator
871temporarily.
Guido van Rossumae110af1997-05-22 20:11:52 +0000872\end{cfuncdesc}
873
Fred Drake17e63432000-08-31 05:50:40 +0000874\begin{cfuncdesc}{void}{PyErr_Restore}{PyObject *type, PyObject *value,
875 PyObject *traceback}
Guido van Rossumae110af1997-05-22 20:11:52 +0000876Set the error indicator from the three objects. If the error
877indicator is already set, it is cleared first. If the objects are
878\NULL{}, the error indicator is cleared. Do not pass a \NULL{} type
879and non-\NULL{} value or traceback. The exception type should be a
880string or class; if it is a class, the value should be an instance of
881that class. Do not pass an invalid exception type or value.
882(Violating these rules will cause subtle problems later.) This call
Fred Drakebab29652001-07-10 16:10:08 +0000883takes away a reference to each object: you must own a reference
Guido van Rossumae110af1997-05-22 20:11:52 +0000884to each object before the call and after the call you no longer own
885these references. (If you don't understand this, don't use this
Fred Drake659ebfa2000-04-03 15:42:13 +0000886function. I warned you.) \strong{Note:} This function is normally
Guido van Rossumae110af1997-05-22 20:11:52 +0000887only used by code that needs to save and restore the error indicator
888temporarily.
889\end{cfuncdesc}
890
891\begin{cfuncdesc}{void}{PyErr_SetString}{PyObject *type, char *message}
892This is the most common way to set the error indicator. The first
893argument specifies the exception type; it is normally one of the
Fred Drakef8830d11998-04-23 14:06:01 +0000894standard exceptions, e.g. \cdata{PyExc_RuntimeError}. You need not
Guido van Rossumae110af1997-05-22 20:11:52 +0000895increment its reference count. The second argument is an error
896message; it is converted to a string object.
897\end{cfuncdesc}
898
899\begin{cfuncdesc}{void}{PyErr_SetObject}{PyObject *type, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +0000900This function is similar to \cfunction{PyErr_SetString()} but lets you
Guido van Rossumae110af1997-05-22 20:11:52 +0000901specify an arbitrary Python object for the ``value'' of the exception.
902You need not increment its reference count.
903\end{cfuncdesc}
904
Fred Drake73577702000-04-10 18:50:14 +0000905\begin{cfuncdesc}{PyObject*}{PyErr_Format}{PyObject *exception,
Moshe Zadka57a59322000-09-01 09:47:20 +0000906 const char *format, \moreargs}
Fred Drake89fb0352000-10-14 05:49:30 +0000907This function sets the error indicator. \var{exception} should be a
908Python exception (string or class, not an instance).
Fred Drake5566c1c2001-01-19 22:48:33 +0000909\var{format} should be a string, containing format codes, similar to
Moshe Zadka57a59322000-09-01 09:47:20 +0000910\cfunction{printf}. The \code{width.precision} before a format code
911is parsed, but the width part is ignored.
912
913\begin{tableii}{c|l}{character}{Character}{Meaning}
914 \lineii{c}{Character, as an \ctype{int} parameter}
915 \lineii{d}{Number in decimal, as an \ctype{int} parameter}
916 \lineii{x}{Number in hexadecimal, as an \ctype{int} parameter}
917 \lineii{x}{A string, as a \ctype{char *} parameter}
918\end{tableii}
919
920An unrecognized format character causes all the rest of
921the format string to be copied as-is to the result string,
922and any extra arguments discarded.
923
924A new reference is returned, which is owned by the caller.
Jeremy Hylton98605b52000-04-10 18:40:57 +0000925\end{cfuncdesc}
926
Guido van Rossumae110af1997-05-22 20:11:52 +0000927\begin{cfuncdesc}{void}{PyErr_SetNone}{PyObject *type}
Fred Drakee058b4f1998-02-16 06:15:35 +0000928This is a shorthand for \samp{PyErr_SetObject(\var{type}, Py_None)}.
Guido van Rossumae110af1997-05-22 20:11:52 +0000929\end{cfuncdesc}
930
931\begin{cfuncdesc}{int}{PyErr_BadArgument}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000932This is a shorthand for \samp{PyErr_SetString(PyExc_TypeError,
Guido van Rossumae110af1997-05-22 20:11:52 +0000933\var{message})}, where \var{message} indicates that a built-in operation
934was invoked with an illegal argument. It is mostly for internal use.
935\end{cfuncdesc}
936
Fred Drakec6fa34e1998-04-02 06:47:24 +0000937\begin{cfuncdesc}{PyObject*}{PyErr_NoMemory}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000938This is a shorthand for \samp{PyErr_SetNone(PyExc_MemoryError)}; it
Guido van Rossumae110af1997-05-22 20:11:52 +0000939returns \NULL{} so an object allocation function can write
Fred Drakee058b4f1998-02-16 06:15:35 +0000940\samp{return PyErr_NoMemory();} when it runs out of memory.
Guido van Rossumae110af1997-05-22 20:11:52 +0000941\end{cfuncdesc}
942
Fred Drakec6fa34e1998-04-02 06:47:24 +0000943\begin{cfuncdesc}{PyObject*}{PyErr_SetFromErrno}{PyObject *type}
Fred Drake659ebfa2000-04-03 15:42:13 +0000944This is a convenience function to raise an exception when a C library
945function has returned an error and set the C variable \cdata{errno}.
Guido van Rossumae110af1997-05-22 20:11:52 +0000946It constructs a tuple object whose first item is the integer
Fred Drakef8830d11998-04-23 14:06:01 +0000947\cdata{errno} value and whose second item is the corresponding error
Fred Drake659ebfa2000-04-03 15:42:13 +0000948message (gotten from \cfunction{strerror()}\ttindex{strerror()}), and
949then calls
Fred Drakee058b4f1998-02-16 06:15:35 +0000950\samp{PyErr_SetObject(\var{type}, \var{object})}. On \UNIX{}, when
Fred Drakef8830d11998-04-23 14:06:01 +0000951the \cdata{errno} value is \constant{EINTR}, indicating an interrupted
Fred Drakee058b4f1998-02-16 06:15:35 +0000952system call, this calls \cfunction{PyErr_CheckSignals()}, and if that set
Guido van Rossumae110af1997-05-22 20:11:52 +0000953the error indicator, leaves it set to that. The function always
954returns \NULL{}, so a wrapper function around a system call can write
Fred Drakee058b4f1998-02-16 06:15:35 +0000955\samp{return PyErr_SetFromErrno();} when the system call returns an
956error.
Guido van Rossumae110af1997-05-22 20:11:52 +0000957\end{cfuncdesc}
958
Fred Drake490d34d2001-06-20 21:39:12 +0000959\begin{cfuncdesc}{PyObject*}{PyErr_SetFromErrnoWithFilename}{PyObject *type,
960 char *filename}
961Similar to \cfunction{PyErr_SetFromErrno()}, with the additional
962behavior that if \var{filename} is not \NULL, it is passed to the
963constructor of \var{type} as a third parameter. In the case of
964exceptions such as \exception{IOError} and \exception{OSError}, this
965is used to define the \member{filename} attribute of the exception
966instance.
967\end{cfuncdesc}
968
Guido van Rossumae110af1997-05-22 20:11:52 +0000969\begin{cfuncdesc}{void}{PyErr_BadInternalCall}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000970This is a shorthand for \samp{PyErr_SetString(PyExc_TypeError,
Guido van Rossumae110af1997-05-22 20:11:52 +0000971\var{message})}, where \var{message} indicates that an internal
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000972operation (e.g. a Python/C API function) was invoked with an illegal
Guido van Rossumae110af1997-05-22 20:11:52 +0000973argument. It is mostly for internal use.
974\end{cfuncdesc}
975
Guido van Rossum3dbb4062000-12-19 03:53:01 +0000976\begin{cfuncdesc}{int}{PyErr_Warn}{PyObject *category, char *message}
977Issue a warning message. The \var{category} argument is a warning
Fred Drake5566c1c2001-01-19 22:48:33 +0000978category (see below) or \NULL; the \var{message} argument is a message
Guido van Rossum3dbb4062000-12-19 03:53:01 +0000979string.
980
981This function normally prints a warning message to \var{sys.stderr};
982however, it is also possible that the user has specified that warnings
983are to be turned into errors, and in that case this will raise an
984exception. It is also possible that the function raises an exception
985because of a problem with the warning machinery (the implementation
986imports the \module{warnings} module to do the heavy lifting). The
987return value is \code{0} if no exception is raised, or \code{-1} if
988an exception is raised. (It is not possible to determine whether a
989warning message is actually printed, nor what the reason is for the
990exception; this is intentional.) If an exception is raised, the
Fred Drake5566c1c2001-01-19 22:48:33 +0000991caller should do its normal exception handling
992(e.g. \cfunction{Py_DECREF()} owned references and return an error
993value).
Guido van Rossum3dbb4062000-12-19 03:53:01 +0000994
995Warning categories must be subclasses of \cdata{Warning}; the default
996warning category is \cdata{RuntimeWarning}. The standard Python
997warning categories are available as global variables whose names are
998\samp{PyExc_} followed by the Python exception name. These have the
999type \ctype{PyObject*}; they are all class objects. Their names are
1000\cdata{PyExc_Warning}, \cdata{PyExc_UserWarning},
1001\cdata{PyExc_DeprecationWarning}, \cdata{PyExc_SyntaxWarning}, and
1002\cdata{PyExc_RuntimeWarning}. \cdata{PyExc_Warning} is a subclass of
1003\cdata{PyExc_Exception}; the other warning categories are subclasses
1004of \cdata{PyExc_Warning}.
1005
1006For information about warning control, see the documentation for the
Fred Drake316ef7c2001-01-04 05:56:34 +00001007\module{warnings} module and the \programopt{-W} option in the command
1008line documentation. There is no C API for warning control.
Guido van Rossum3dbb4062000-12-19 03:53:01 +00001009\end{cfuncdesc}
1010
Guido van Rossum1874c8f2001-02-28 23:46:44 +00001011\begin{cfuncdesc}{int}{PyErr_WarnExplicit}{PyObject *category, char *message,
1012char *filename, int lineno, char *module, PyObject *registry}
1013Issue a warning message with explicit control over all warning
1014attributes. This is a straightforward wrapper around the Python
1015function \function{warnings.warn_explicit()}, see there for more
1016information. The \var{module} and \var{registry} arguments may be
1017set to \code{NULL} to get the default effect described there.
1018\end{cfuncdesc}
1019
Guido van Rossumae110af1997-05-22 20:11:52 +00001020\begin{cfuncdesc}{int}{PyErr_CheckSignals}{}
1021This function interacts with Python's signal handling. It checks
1022whether a signal has been sent to the processes and if so, invokes the
Fred Drake4de05a91998-02-16 14:25:26 +00001023corresponding signal handler. If the
1024\module{signal}\refbimodindex{signal} module is supported, this can
1025invoke a signal handler written in Python. In all cases, the default
Fred Drake659ebfa2000-04-03 15:42:13 +00001026effect for \constant{SIGINT}\ttindex{SIGINT} is to raise the
1027\withsubitem{(built-in exception)}{\ttindex{KeyboardInterrupt}}
1028\exception{KeyboardInterrupt} exception. If an exception is raised the
Fred Drakee058b4f1998-02-16 06:15:35 +00001029error indicator is set and the function returns \code{1}; otherwise
1030the function returns \code{0}. The error indicator may or may not be
1031cleared if it was previously set.
Guido van Rossumae110af1997-05-22 20:11:52 +00001032\end{cfuncdesc}
1033
1034\begin{cfuncdesc}{void}{PyErr_SetInterrupt}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00001035This function is obsolete. It simulates the effect of a
1036\constant{SIGINT}\ttindex{SIGINT} signal arriving --- the next time
Fred Drakee058b4f1998-02-16 06:15:35 +00001037\cfunction{PyErr_CheckSignals()} is called,
Fred Drake659ebfa2000-04-03 15:42:13 +00001038\withsubitem{(built-in exception)}{\ttindex{KeyboardInterrupt}}
1039\exception{KeyboardInterrupt} will be raised.
1040It may be called without holding the interpreter lock.
Guido van Rossumae110af1997-05-22 20:11:52 +00001041\end{cfuncdesc}
1042
Fred Drakec6fa34e1998-04-02 06:47:24 +00001043\begin{cfuncdesc}{PyObject*}{PyErr_NewException}{char *name,
1044 PyObject *base,
1045 PyObject *dict}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001046This utility function creates and returns a new exception object. The
Fred Drake659ebfa2000-04-03 15:42:13 +00001047\var{name} argument must be the name of the new exception, a C string
1048of the form \code{module.class}. The \var{base} and
Fred Draked04038d2000-06-29 20:15:14 +00001049\var{dict} arguments are normally \NULL{}. This creates a
Fred Drake659ebfa2000-04-03 15:42:13 +00001050class object derived from the root for all exceptions, the built-in
1051name \exception{Exception} (accessible in C as
Fred Draked04038d2000-06-29 20:15:14 +00001052\cdata{PyExc_Exception}). The \member{__module__} attribute of the
1053new class is set to the first part (up to the last dot) of the
1054\var{name} argument, and the class name is set to the last part (after
1055the last dot). The \var{base} argument can be used to specify an
1056alternate base class. The \var{dict} argument can be used to specify
1057a dictionary of class variables and methods.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001058\end{cfuncdesc}
1059
Jeremy Hyltonb709df32000-09-01 02:47:25 +00001060\begin{cfuncdesc}{void}{PyErr_WriteUnraisable}{PyObject *obj}
1061This utility function prints a warning message to \var{sys.stderr}
1062when an exception has been set but it is impossible for the
1063interpreter to actually raise the exception. It is used, for example,
1064when an exception occurs in an \member{__del__} method.
1065
1066The function is called with a single argument \var{obj} that
1067identifies where the context in which the unraisable exception
1068occurred. The repr of \var{obj} will be printed in the warning
1069message.
1070\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001071
Fred Drakeefd146c1999-02-15 15:30:45 +00001072\section{Standard Exceptions \label{standardExceptions}}
Guido van Rossumae110af1997-05-22 20:11:52 +00001073
1074All standard Python exceptions are available as global variables whose
Fred Drake659ebfa2000-04-03 15:42:13 +00001075names are \samp{PyExc_} followed by the Python exception name. These
1076have the type \ctype{PyObject*}; they are all class objects. For
1077completeness, here are all the variables:
1078
1079\begin{tableiii}{l|l|c}{cdata}{C Name}{Python Name}{Notes}
1080 \lineiii{PyExc_Exception}{\exception{Exception}}{(1)}
1081 \lineiii{PyExc_StandardError}{\exception{StandardError}}{(1)}
1082 \lineiii{PyExc_ArithmeticError}{\exception{ArithmeticError}}{(1)}
1083 \lineiii{PyExc_LookupError}{\exception{LookupError}}{(1)}
1084 \lineiii{PyExc_AssertionError}{\exception{AssertionError}}{}
1085 \lineiii{PyExc_AttributeError}{\exception{AttributeError}}{}
1086 \lineiii{PyExc_EOFError}{\exception{EOFError}}{}
1087 \lineiii{PyExc_EnvironmentError}{\exception{EnvironmentError}}{(1)}
1088 \lineiii{PyExc_FloatingPointError}{\exception{FloatingPointError}}{}
1089 \lineiii{PyExc_IOError}{\exception{IOError}}{}
1090 \lineiii{PyExc_ImportError}{\exception{ImportError}}{}
1091 \lineiii{PyExc_IndexError}{\exception{IndexError}}{}
1092 \lineiii{PyExc_KeyError}{\exception{KeyError}}{}
1093 \lineiii{PyExc_KeyboardInterrupt}{\exception{KeyboardInterrupt}}{}
1094 \lineiii{PyExc_MemoryError}{\exception{MemoryError}}{}
1095 \lineiii{PyExc_NameError}{\exception{NameError}}{}
1096 \lineiii{PyExc_NotImplementedError}{\exception{NotImplementedError}}{}
1097 \lineiii{PyExc_OSError}{\exception{OSError}}{}
1098 \lineiii{PyExc_OverflowError}{\exception{OverflowError}}{}
1099 \lineiii{PyExc_RuntimeError}{\exception{RuntimeError}}{}
1100 \lineiii{PyExc_SyntaxError}{\exception{SyntaxError}}{}
1101 \lineiii{PyExc_SystemError}{\exception{SystemError}}{}
1102 \lineiii{PyExc_SystemExit}{\exception{SystemExit}}{}
1103 \lineiii{PyExc_TypeError}{\exception{TypeError}}{}
1104 \lineiii{PyExc_ValueError}{\exception{ValueError}}{}
Fred Drakea8d73412000-08-11 20:39:29 +00001105 \lineiii{PyExc_WindowsError}{\exception{WindowsError}}{(2)}
Fred Drake659ebfa2000-04-03 15:42:13 +00001106 \lineiii{PyExc_ZeroDivisionError}{\exception{ZeroDivisionError}}{}
1107\end{tableiii}
1108
1109\noindent
Fred Drakea8d73412000-08-11 20:39:29 +00001110Notes:
Fred Drake659ebfa2000-04-03 15:42:13 +00001111\begin{description}
1112\item[(1)]
Fred Draked04038d2000-06-29 20:15:14 +00001113 This is a base class for other standard exceptions.
Fred Drakea8d73412000-08-11 20:39:29 +00001114
1115\item[(2)]
1116 Only defined on Windows; protect code that uses this by testing that
1117 the preprocessor macro \code{MS_WINDOWS} is defined.
Fred Drake659ebfa2000-04-03 15:42:13 +00001118\end{description}
1119
1120
1121\section{Deprecation of String Exceptions}
1122
Fred Draked04038d2000-06-29 20:15:14 +00001123All exceptions built into Python or provided in the standard library
1124are derived from \exception{Exception}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001125\withsubitem{(built-in exception)}{\ttindex{Exception}}
Fred Drake659ebfa2000-04-03 15:42:13 +00001126
Fred Draked04038d2000-06-29 20:15:14 +00001127String exceptions are still supported in the interpreter to allow
Fred Drake659ebfa2000-04-03 15:42:13 +00001128existing code to run unmodified, but this will also change in a future
1129release.
Guido van Rossumae110af1997-05-22 20:11:52 +00001130
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001131
Fred Drakeefd146c1999-02-15 15:30:45 +00001132\chapter{Utilities \label{utilities}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001133
Fred Drake88fdaa72001-07-20 20:56:11 +00001134The functions in this chapter perform various utility tasks, ranging
1135from helping C code be more portable across platforms, using Python
1136modules from C, and parsing function arguments and constructing Python
1137values from C values.
1138
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001139
Fred Drake377fb1e2001-07-14 03:01:48 +00001140\section{Operating System Utilities \label{os}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001141
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001142\begin{cfuncdesc}{int}{Py_FdIsInteractive}{FILE *fp, char *filename}
Fred Drakee058b4f1998-02-16 06:15:35 +00001143Return true (nonzero) if the standard I/O file \var{fp} with name
1144\var{filename} is deemed interactive. This is the case for files for
1145which \samp{isatty(fileno(\var{fp}))} is true. If the global flag
Fred Drakef8830d11998-04-23 14:06:01 +00001146\cdata{Py_InteractiveFlag} is true, this function also returns true if
Fred Drake5566c1c2001-01-19 22:48:33 +00001147the \var{filename} pointer is \NULL{} or if the name is equal to one of
Fred Drakea8455ab2000-06-16 19:58:42 +00001148the strings \code{'<stdin>'} or \code{'???'}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001149\end{cfuncdesc}
1150
1151\begin{cfuncdesc}{long}{PyOS_GetLastModificationTime}{char *filename}
Fred Drakee058b4f1998-02-16 06:15:35 +00001152Return the time of last modification of the file \var{filename}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001153The result is encoded in the same way as the timestamp returned by
Fred Drake659ebfa2000-04-03 15:42:13 +00001154the standard C library function \cfunction{time()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001155\end{cfuncdesc}
1156
Fred Drakecabbc3b2000-06-28 15:53:13 +00001157\begin{cfuncdesc}{void}{PyOS_AfterFork}{}
1158Function to update some internal state after a process fork; this
1159should be called in the new process if the Python interpreter will
1160continue to be used. If a new executable is loaded into the new
1161process, this function does not need to be called.
1162\end{cfuncdesc}
1163
Fred Drake17e63432000-08-31 05:50:40 +00001164\begin{cfuncdesc}{int}{PyOS_CheckStack}{}
1165Return true when the interpreter runs out of stack space. This is a
1166reliable check, but is only available when \code{USE_STACKCHECK} is
1167defined (currently on Windows using the Microsoft Visual C++ compiler
1168and on the Macintosh). \code{USE_CHECKSTACK} will be defined
1169automatically; you should never change the definition in your own
1170code.
1171\end{cfuncdesc}
1172
Guido van Rossumc96ec6e2000-09-16 16:30:48 +00001173\begin{cfuncdesc}{PyOS_sighandler_t}{PyOS_getsig}{int i}
1174Return the current signal handler for signal \var{i}.
1175This is a thin wrapper around either \cfunction{sigaction} or
1176\cfunction{signal}. Do not call those functions directly!
1177\ctype{PyOS_sighandler_t} is a typedef alias for \ctype{void (*)(int)}.
1178\end{cfuncdesc}
1179
1180\begin{cfuncdesc}{PyOS_sighandler_t}{PyOS_setsig}{int i, PyOS_sighandler_t h}
1181Set the signal handler for signal \var{i} to be \var{h};
1182return the old signal handler.
1183This is a thin wrapper around either \cfunction{sigaction} or
1184\cfunction{signal}. Do not call those functions directly!
1185\ctype{PyOS_sighandler_t} is a typedef alias for \ctype{void (*)(int)}.
1186\end{cfuncdesc}
1187
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001188
Fred Drakeefd146c1999-02-15 15:30:45 +00001189\section{Process Control \label{processControl}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001190
1191\begin{cfuncdesc}{void}{Py_FatalError}{char *message}
1192Print a fatal error message and kill the process. No cleanup is
1193performed. This function should only be invoked when a condition is
1194detected that would make it dangerous to continue using the Python
1195interpreter; e.g., when the object administration appears to be
Fred Drake659ebfa2000-04-03 15:42:13 +00001196corrupted. On \UNIX{}, the standard C library function
1197\cfunction{abort()}\ttindex{abort()} is called which will attempt to
1198produce a \file{core} file.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001199\end{cfuncdesc}
1200
1201\begin{cfuncdesc}{void}{Py_Exit}{int status}
Fred Drake659ebfa2000-04-03 15:42:13 +00001202Exit the current process. This calls
1203\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} and
1204then calls the standard C library function
1205\code{exit(\var{status})}\ttindex{exit()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001206\end{cfuncdesc}
1207
1208\begin{cfuncdesc}{int}{Py_AtExit}{void (*func) ()}
Fred Drake659ebfa2000-04-03 15:42:13 +00001209Register a cleanup function to be called by
1210\cfunction{Py_Finalize()}\ttindex{Py_Finalize()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001211The cleanup function will be called with no arguments and should
Fred Drake659ebfa2000-04-03 15:42:13 +00001212return no value. At most 32 \index{cleanup functions}cleanup
1213functions can be registered.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001214When the registration is successful, \cfunction{Py_AtExit()} returns
1215\code{0}; on failure, it returns \code{-1}. The cleanup function
1216registered last is called first. Each cleanup function will be called
1217at most once. Since Python's internal finallization will have
1218completed before the cleanup function, no Python APIs should be called
1219by \var{func}.
1220\end{cfuncdesc}
1221
1222
Fred Drakeefd146c1999-02-15 15:30:45 +00001223\section{Importing Modules \label{importing}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001224
Fred Drakec6fa34e1998-04-02 06:47:24 +00001225\begin{cfuncdesc}{PyObject*}{PyImport_ImportModule}{char *name}
Fred Drake659ebfa2000-04-03 15:42:13 +00001226This is a simplified interface to
1227\cfunction{PyImport_ImportModuleEx()} below, leaving the
1228\var{globals} and \var{locals} arguments set to \NULL{}. When the
Fred Drakebab29652001-07-10 16:10:08 +00001229\var{name} argument contains a dot (when it specifies a
Fred Drake659ebfa2000-04-03 15:42:13 +00001230submodule of a package), the \var{fromlist} argument is set to the
1231list \code{['*']} so that the return value is the named module rather
1232than the top-level package containing it as would otherwise be the
1233case. (Unfortunately, this has an additional side effect when
1234\var{name} in fact specifies a subpackage instead of a submodule: the
1235submodules specified in the package's \code{__all__} variable are
1236\index{package variable!\code{__all__}}
1237\withsubitem{(package variable)}{\ttindex{__all__}}loaded.) Return a
1238new reference to the imported module, or
1239\NULL{} with an exception set on failure (the module may still be
1240created in this case --- examine \code{sys.modules} to find out).
1241\withsubitem{(in module sys)}{\ttindex{modules}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001242\end{cfuncdesc}
1243
Fred Drakec6fa34e1998-04-02 06:47:24 +00001244\begin{cfuncdesc}{PyObject*}{PyImport_ImportModuleEx}{char *name, PyObject *globals, PyObject *locals, PyObject *fromlist}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001245Import a module. This is best described by referring to the built-in
Fred Drake53fb7721998-02-16 06:23:20 +00001246Python function \function{__import__()}\bifuncindex{__import__}, as
1247the standard \function{__import__()} function calls this function
1248directly.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001249
Guido van Rossum42cefd01997-10-05 15:27:29 +00001250The return value is a new reference to the imported module or
Guido van Rossum580aa8d1997-11-25 15:34:51 +00001251top-level package, or \NULL{} with an exception set on failure
Guido van Rossumc44d3d61997-10-06 05:10:47 +00001252(the module may still be created in this case). Like for
Fred Drakee058b4f1998-02-16 06:15:35 +00001253\function{__import__()}, the return value when a submodule of a
1254package was requested is normally the top-level package, unless a
1255non-empty \var{fromlist} was given.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001256\end{cfuncdesc}
1257
Fred Drakec6fa34e1998-04-02 06:47:24 +00001258\begin{cfuncdesc}{PyObject*}{PyImport_Import}{PyObject *name}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001259This is a higher-level interface that calls the current ``import hook
Fred Drakee058b4f1998-02-16 06:15:35 +00001260function''. It invokes the \function{__import__()} function from the
Guido van Rossum42cefd01997-10-05 15:27:29 +00001261\code{__builtins__} of the current globals. This means that the
1262import is done using whatever import hooks are installed in the
Fred Drake4de05a91998-02-16 14:25:26 +00001263current environment, e.g. by \module{rexec}\refstmodindex{rexec} or
1264\module{ihooks}\refstmodindex{ihooks}.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001265\end{cfuncdesc}
1266
Fred Drakec6fa34e1998-04-02 06:47:24 +00001267\begin{cfuncdesc}{PyObject*}{PyImport_ReloadModule}{PyObject *m}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001268Reload a module. This is best described by referring to the built-in
Fred Drake53fb7721998-02-16 06:23:20 +00001269Python function \function{reload()}\bifuncindex{reload}, as the standard
Fred Drakee058b4f1998-02-16 06:15:35 +00001270\function{reload()} function calls this function directly. Return a
1271new reference to the reloaded module, or \NULL{} with an exception set
1272on failure (the module still exists in this case).
Guido van Rossum42cefd01997-10-05 15:27:29 +00001273\end{cfuncdesc}
1274
Fred Drakec6fa34e1998-04-02 06:47:24 +00001275\begin{cfuncdesc}{PyObject*}{PyImport_AddModule}{char *name}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001276Return the module object corresponding to a module name. The
1277\var{name} argument may be of the form \code{package.module}). First
1278check the modules dictionary if there's one there, and if not, create
Fred Drake659ebfa2000-04-03 15:42:13 +00001279a new one and insert in in the modules dictionary.
Guido van Rossuma096a2e1998-11-02 17:02:42 +00001280Warning: this function does not load or import the module; if the
1281module wasn't already loaded, you will get an empty module object.
1282Use \cfunction{PyImport_ImportModule()} or one of its variants to
1283import a module.
Fred Drake659ebfa2000-04-03 15:42:13 +00001284Return \NULL{} with an exception set on failure.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001285\end{cfuncdesc}
1286
Fred Drakec6fa34e1998-04-02 06:47:24 +00001287\begin{cfuncdesc}{PyObject*}{PyImport_ExecCodeModule}{char *name, PyObject *co}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001288Given a module name (possibly of the form \code{package.module}) and a
1289code object read from a Python bytecode file or obtained from the
Fred Drake53fb7721998-02-16 06:23:20 +00001290built-in function \function{compile()}\bifuncindex{compile}, load the
1291module. Return a new reference to the module object, or \NULL{} with
1292an exception set if an error occurred (the module may still be created
1293in this case). (This function would reload the module if it was
1294already imported.)
Guido van Rossum42cefd01997-10-05 15:27:29 +00001295\end{cfuncdesc}
1296
1297\begin{cfuncdesc}{long}{PyImport_GetMagicNumber}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00001298Return the magic number for Python bytecode files (a.k.a.
1299\file{.pyc} and \file{.pyo} files). The magic number should be
1300present in the first four bytes of the bytecode file, in little-endian
1301byte order.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001302\end{cfuncdesc}
1303
Fred Drakec6fa34e1998-04-02 06:47:24 +00001304\begin{cfuncdesc}{PyObject*}{PyImport_GetModuleDict}{}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001305Return the dictionary used for the module administration
1306(a.k.a. \code{sys.modules}). Note that this is a per-interpreter
1307variable.
1308\end{cfuncdesc}
1309
1310\begin{cfuncdesc}{void}{_PyImport_Init}{}
1311Initialize the import mechanism. For internal use only.
1312\end{cfuncdesc}
1313
1314\begin{cfuncdesc}{void}{PyImport_Cleanup}{}
1315Empty the module table. For internal use only.
1316\end{cfuncdesc}
1317
1318\begin{cfuncdesc}{void}{_PyImport_Fini}{}
1319Finalize the import mechanism. For internal use only.
1320\end{cfuncdesc}
1321
Fred Drakec6fa34e1998-04-02 06:47:24 +00001322\begin{cfuncdesc}{PyObject*}{_PyImport_FindExtension}{char *, char *}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001323For internal use only.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00001324\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001325
Fred Drakec6fa34e1998-04-02 06:47:24 +00001326\begin{cfuncdesc}{PyObject*}{_PyImport_FixupExtension}{char *, char *}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001327For internal use only.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00001328\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001329
Fred Drake1d158692000-06-18 05:21:21 +00001330\begin{cfuncdesc}{int}{PyImport_ImportFrozenModule}{char *name}
1331Load a frozen module named \var{name}. Return \code{1} for success,
1332\code{0} if the module is not found, and \code{-1} with an exception
1333set if the initialization failed. To access the imported module on a
1334successful load, use \cfunction{PyImport_ImportModule()}.
Fred Drakee058b4f1998-02-16 06:15:35 +00001335(Note the misnomer --- this function would reload the module if it was
Guido van Rossum42cefd01997-10-05 15:27:29 +00001336already imported.)
1337\end{cfuncdesc}
1338
Fred Drake659ebfa2000-04-03 15:42:13 +00001339\begin{ctypedesc}[_frozen]{struct _frozen}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001340This is the structure type definition for frozen module descriptors,
Fred Drakec6fa34e1998-04-02 06:47:24 +00001341as generated by the \program{freeze}\index{freeze utility} utility
1342(see \file{Tools/freeze/} in the Python source distribution). Its
Fred Drakee0d9a832000-09-01 05:30:00 +00001343definition, found in \file{Include/import.h}, is:
Fred Drakec6fa34e1998-04-02 06:47:24 +00001344
Guido van Rossum9faf4c51997-10-07 14:38:54 +00001345\begin{verbatim}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001346struct _frozen {
Fred Drake36fbe761997-10-13 18:18:33 +00001347 char *name;
1348 unsigned char *code;
1349 int size;
Guido van Rossum42cefd01997-10-05 15:27:29 +00001350};
Guido van Rossum9faf4c51997-10-07 14:38:54 +00001351\end{verbatim}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001352\end{ctypedesc}
1353
Fred Drakec6fa34e1998-04-02 06:47:24 +00001354\begin{cvardesc}{struct _frozen*}{PyImport_FrozenModules}
Fred Drakef8830d11998-04-23 14:06:01 +00001355This pointer is initialized to point to an array of \ctype{struct
Fred Drake659ebfa2000-04-03 15:42:13 +00001356_frozen} records, terminated by one whose members are all
1357\NULL{} or zero. When a frozen module is imported, it is searched in
1358this table. Third-party code could play tricks with this to provide a
Guido van Rossum42cefd01997-10-05 15:27:29 +00001359dynamically created collection of frozen modules.
1360\end{cvardesc}
1361
Fred Drakee0d9a832000-09-01 05:30:00 +00001362\begin{cfuncdesc}{int}{PyImport_AppendInittab}{char *name,
1363 void (*initfunc)(void)}
1364Add a single module to the existing table of built-in modules. This
1365is a convenience wrapper around \cfunction{PyImport_ExtendInittab()},
1366returning \code{-1} if the table could not be extended. The new
1367module can be imported by the name \var{name}, and uses the function
1368\var{initfunc} as the initialization function called on the first
1369attempted import. This should be called before
1370\cfunction{Py_Initialize()}.
1371\end{cfuncdesc}
1372
1373\begin{ctypedesc}[_inittab]{struct _inittab}
1374Structure describing a single entry in the list of built-in modules.
1375Each of these structures gives the name and initialization function
1376for a module built into the interpreter. Programs which embed Python
1377may use an array of these structures in conjunction with
1378\cfunction{PyImport_ExtendInittab()} to provide additional built-in
1379modules. The structure is defined in \file{Include/import.h} as:
1380
1381\begin{verbatim}
1382struct _inittab {
1383 char *name;
1384 void (*initfunc)(void);
1385};
1386\end{verbatim}
1387\end{ctypedesc}
1388
1389\begin{cfuncdesc}{int}{PyImport_ExtendInittab}{struct _inittab *newtab}
1390Add a collection of modules to the table of built-in modules. The
1391\var{newtab} array must end with a sentinel entry which contains
1392\NULL{} for the \member{name} field; failure to provide the sentinel
1393value can result in a memory fault. Returns \code{0} on success or
1394\code{-1} if insufficient memory could be allocated to extend the
1395internal table. In the event of failure, no modules are added to the
1396internal table. This should be called before
1397\cfunction{Py_Initialize()}.
1398\end{cfuncdesc}
1399
Guido van Rossum42cefd01997-10-05 15:27:29 +00001400
Fred Drake88fdaa72001-07-20 20:56:11 +00001401\section{Parsing arguements and building values
1402 \label{arg-parsing}}
1403
1404These functions are useful when creating your own extensions functions
1405and methods. Additional information and examples are available in
1406\citetitle[../ext/ext.html]{Extending and Embedding the Python
1407Interpreter}.
1408
1409\begin{cfuncdesc}{int}{PyArg_ParseTuple}{PyObject *args, char *format,
1410 \moreargs}
1411 Parse the parameters of a function that takes only positional
1412 parameters into local variables. Returns true on success; on
1413 failure, it returns false and raises the appropriate exception. See
1414 \citetitle[../ext/parseTuple.html]{Extending and Embedding the
1415 Python Interpreter} for more information.
1416\end{cfuncdesc}
1417
1418\begin{cfuncdesc}{int}{PyArg_ParseTupleAndKeywords}{PyObject *args,
1419 PyObject *kw, char *format, char *keywords[],
1420 \moreargs}
1421 Parse the parameters of a function that takes both positional and
1422 keyword parameters into local variables. Returns true on success;
1423 on failure, it returns false and raises the appropriate exception.
1424 See \citetitle[../ext/parseTupleAndKeywords.html]{Extending and
1425 Embedding the Python Interpreter} for more information.
1426\end{cfuncdesc}
1427
1428\begin{cfuncdesc}{int}{PyArg_Parse}{PyObject *args, char *format,
1429 \moreargs}
1430 Function used to deconstruct the argument lists of ``old-style''
1431 functions --- these are functions which use the
1432 \constant{METH_OLDARGS} parameter parsing method. This is not
1433 recommended for use in parameter parsing in new code, and most code
1434 in the standard interpreter has been modified to no longer use this
1435 for that purpose. It does remain a convenient way to decompose
1436 other tuples, however, and may continue to be used for that
1437 purpose.
1438\end{cfuncdesc}
1439
1440\begin{cfuncdesc}{PyObject*}{Py_BuildValue}{char *format,
1441 \moreargs}
1442 Create a new value based on a format string similar to those
1443 accepted by the \cfunction{PyArg_Parse*()} family of functions and a
1444 sequence of values. Returns the value or \NULL{} in the case of an
1445 error; an exception will be raised if \NULL{} is returned. For more
1446 information on the format string and additional parameters, see
1447 \citetitle[../ext/buildValue.html]{Extending and Embedding the
1448 Python Interpreter}.
1449\end{cfuncdesc}
1450
1451
1452
Fred Drakeefd146c1999-02-15 15:30:45 +00001453\chapter{Abstract Objects Layer \label{abstract}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001454
1455The functions in this chapter interact with Python objects regardless
1456of their type, or with wide classes of object types (e.g. all
1457numerical types, or all sequence types). When used on object types
Fred Drake659ebfa2000-04-03 15:42:13 +00001458for which they do not apply, they will raise a Python exception.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001459
Fred Drakeefd146c1999-02-15 15:30:45 +00001460\section{Object Protocol \label{object}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001461
1462\begin{cfuncdesc}{int}{PyObject_Print}{PyObject *o, FILE *fp, int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00001463Print an object \var{o}, on file \var{fp}. Returns \code{-1} on error.
1464The flags argument is used to enable certain printing options. The
1465only option currently supported is \constant{Py_PRINT_RAW}; if given,
1466the \function{str()} of the object is written instead of the
1467\function{repr()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001468\end{cfuncdesc}
1469
1470\begin{cfuncdesc}{int}{PyObject_HasAttrString}{PyObject *o, char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001471Returns \code{1} if \var{o} has the attribute \var{attr_name}, and
1472\code{0} otherwise. This is equivalent to the Python expression
1473\samp{hasattr(\var{o}, \var{attr_name})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001474This function always succeeds.
1475\end{cfuncdesc}
1476
Fred Drake659ebfa2000-04-03 15:42:13 +00001477\begin{cfuncdesc}{PyObject*}{PyObject_GetAttrString}{PyObject *o,
1478 char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001479Retrieve an attribute named \var{attr_name} from object \var{o}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001480Returns the attribute value on success, or \NULL{} on failure.
Fred Drakee058b4f1998-02-16 06:15:35 +00001481This is the equivalent of the Python expression
1482\samp{\var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001483\end{cfuncdesc}
1484
1485
1486\begin{cfuncdesc}{int}{PyObject_HasAttr}{PyObject *o, PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001487Returns \code{1} if \var{o} has the attribute \var{attr_name}, and
1488\code{0} otherwise. This is equivalent to the Python expression
1489\samp{hasattr(\var{o}, \var{attr_name})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001490This function always succeeds.
1491\end{cfuncdesc}
1492
1493
Fred Drake659ebfa2000-04-03 15:42:13 +00001494\begin{cfuncdesc}{PyObject*}{PyObject_GetAttr}{PyObject *o,
1495 PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001496Retrieve an attribute named \var{attr_name} from object \var{o}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001497Returns the attribute value on success, or \NULL{} on failure.
Fred Drakee058b4f1998-02-16 06:15:35 +00001498This is the equivalent of the Python expression
1499\samp{\var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001500\end{cfuncdesc}
1501
1502
1503\begin{cfuncdesc}{int}{PyObject_SetAttrString}{PyObject *o, char *attr_name, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001504Set the value of the attribute named \var{attr_name}, for object
1505\var{o}, to the value \var{v}. Returns \code{-1} on failure. This is
1506the equivalent of the Python statement \samp{\var{o}.\var{attr_name} =
1507\var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001508\end{cfuncdesc}
1509
1510
1511\begin{cfuncdesc}{int}{PyObject_SetAttr}{PyObject *o, PyObject *attr_name, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001512Set the value of the attribute named \var{attr_name}, for
1513object \var{o},
1514to the value \var{v}. Returns \code{-1} on failure. This is
1515the equivalent of the Python statement \samp{\var{o}.\var{attr_name} =
1516\var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001517\end{cfuncdesc}
1518
1519
1520\begin{cfuncdesc}{int}{PyObject_DelAttrString}{PyObject *o, char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001521Delete attribute named \var{attr_name}, for object \var{o}. Returns
1522\code{-1} on failure. This is the equivalent of the Python
1523statement: \samp{del \var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001524\end{cfuncdesc}
1525
1526
1527\begin{cfuncdesc}{int}{PyObject_DelAttr}{PyObject *o, PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001528Delete attribute named \var{attr_name}, for object \var{o}. Returns
1529\code{-1} on failure. This is the equivalent of the Python
1530statement \samp{del \var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001531\end{cfuncdesc}
1532
1533
1534\begin{cfuncdesc}{int}{PyObject_Cmp}{PyObject *o1, PyObject *o2, int *result}
Fred Drakee058b4f1998-02-16 06:15:35 +00001535Compare the values of \var{o1} and \var{o2} using a routine provided
1536by \var{o1}, if one exists, otherwise with a routine provided by
1537\var{o2}. The result of the comparison is returned in \var{result}.
1538Returns \code{-1} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001539statement\bifuncindex{cmp} \samp{\var{result} = cmp(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001540\end{cfuncdesc}
1541
1542
1543\begin{cfuncdesc}{int}{PyObject_Compare}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001544Compare the values of \var{o1} and \var{o2} using a routine provided
1545by \var{o1}, if one exists, otherwise with a routine provided by
1546\var{o2}. Returns the result of the comparison on success. On error,
1547the value returned is undefined; use \cfunction{PyErr_Occurred()} to
Fred Drake659ebfa2000-04-03 15:42:13 +00001548detect an error. This is equivalent to the Python
1549expression\bifuncindex{cmp} \samp{cmp(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001550\end{cfuncdesc}
1551
1552
1553\begin{cfuncdesc}{PyObject*}{PyObject_Repr}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001554Compute a string representation of object \var{o}. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001555string representation on success, \NULL{} on failure. This is
Fred Drakee058b4f1998-02-16 06:15:35 +00001556the equivalent of the Python expression \samp{repr(\var{o})}.
1557Called by the \function{repr()}\bifuncindex{repr} built-in function
1558and by reverse quotes.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001559\end{cfuncdesc}
1560
1561
1562\begin{cfuncdesc}{PyObject*}{PyObject_Str}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001563Compute a string representation of object \var{o}. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001564string representation on success, \NULL{} on failure. This is
Fred Drakee058b4f1998-02-16 06:15:35 +00001565the equivalent of the Python expression \samp{str(\var{o})}.
1566Called by the \function{str()}\bifuncindex{str} built-in function and
1567by the \keyword{print} statement.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001568\end{cfuncdesc}
1569
1570
Marc-André Lemburgad7c98e2001-01-17 17:09:53 +00001571\begin{cfuncdesc}{PyObject*}{PyObject_Unicode}{PyObject *o}
1572Compute a Unicode string representation of object \var{o}. Returns the
1573Unicode string representation on success, \NULL{} on failure. This is
1574the equivalent of the Python expression \samp{unistr(\var{o})}.
1575Called by the \function{unistr()}\bifuncindex{unistr} built-in function.
1576\end{cfuncdesc}
1577
Fred Drake58c8f9f2001-03-28 21:14:32 +00001578\begin{cfuncdesc}{int}{PyObject_IsInstance}{PyObject *inst, PyObject *cls}
1579Return \code{1} if \var{inst} is an instance of the class \var{cls} or
1580a subclass of \var{cls}. If \var{cls} is a type object rather than a
1581class object, \cfunction{PyObject_IsInstance()} returns \code{1} if
1582\var{inst} is of type \var{cls}. If \var{inst} is not a class
1583instance and \var{cls} is neither a type object or class object,
1584\var{inst} must have a \member{__class__} attribute --- the class
1585relationship of the value of that attribute with \var{cls} will be
1586used to determine the result of this function.
1587\versionadded{2.1}
1588\end{cfuncdesc}
1589
1590Subclass determination is done in a fairly straightforward way, but
1591includes a wrinkle that implementors of extensions to the class system
1592may want to be aware of. If \class{A} and \class{B} are class
1593objects, \class{B} is a subclass of \class{A} if it inherits from
1594\class{A} either directly or indirectly. If either is not a class
1595object, a more general mechanism is used to determine the class
1596relationship of the two objects. When testing if \var{B} is a
1597subclass of \var{A}, if \var{A} is \var{B},
1598\cfunction{PyObject_IsSubclass()} returns true. If \var{A} and
1599\var{B} are different objects, \var{B}'s \member{__bases__} attribute
1600is searched in a depth-first fashion for \var{A} --- the presence of
1601the \member{__bases__} attribute is considered sufficient for this
1602determination.
1603
1604\begin{cfuncdesc}{int}{PyObject_IsSubclass}{PyObject *derived,
1605 PyObject *cls}
1606Returns \code{1} if the class \var{derived} is identical to or derived
1607from the class \var{cls}, otherwise returns \code{0}. In case of an
1608error, returns \code{-1}. If either \var{derived} or \var{cls} is not
1609an actual class object, this function uses the generic algorithm
1610described above.
1611\versionadded{2.1}
1612\end{cfuncdesc}
1613
Marc-André Lemburgad7c98e2001-01-17 17:09:53 +00001614
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001615\begin{cfuncdesc}{int}{PyCallable_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001616Determine if the object \var{o} is callable. Return \code{1} if the
Fred Drakee058b4f1998-02-16 06:15:35 +00001617object is callable and \code{0} otherwise.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001618This function always succeeds.
1619\end{cfuncdesc}
1620
1621
Fred Drake659ebfa2000-04-03 15:42:13 +00001622\begin{cfuncdesc}{PyObject*}{PyObject_CallObject}{PyObject *callable_object,
1623 PyObject *args}
Fred Drakee058b4f1998-02-16 06:15:35 +00001624Call a callable Python object \var{callable_object}, with
1625arguments given by the tuple \var{args}. If no arguments are
Fred Drake659ebfa2000-04-03 15:42:13 +00001626needed, then \var{args} may be \NULL{}. Returns the result of the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001627call on success, or \NULL{} on failure. This is the equivalent
Fred Drake5566c1c2001-01-19 22:48:33 +00001628of the Python expression \samp{apply(\var{callable_object}, \var{args})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001629\bifuncindex{apply}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001630\end{cfuncdesc}
1631
Fred Drake5566c1c2001-01-19 22:48:33 +00001632\begin{cfuncdesc}{PyObject*}{PyObject_CallFunction}{PyObject *callable_object,
1633 char *format, ...}
Fred Drakee058b4f1998-02-16 06:15:35 +00001634Call a callable Python object \var{callable_object}, with a
Fred Drake659ebfa2000-04-03 15:42:13 +00001635variable number of C arguments. The C arguments are described
Fred Drakee058b4f1998-02-16 06:15:35 +00001636using a \cfunction{Py_BuildValue()} style format string. The format may
1637be \NULL{}, indicating that no arguments are provided. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001638result of the call on success, or \NULL{} on failure. This is
Fred Drake5566c1c2001-01-19 22:48:33 +00001639the equivalent of the Python expression \samp{apply(\var{callable_object},
Fred Drake659ebfa2000-04-03 15:42:13 +00001640\var{args})}.\bifuncindex{apply}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001641\end{cfuncdesc}
1642
1643
Fred Drake5566c1c2001-01-19 22:48:33 +00001644\begin{cfuncdesc}{PyObject*}{PyObject_CallMethod}{PyObject *o,
1645 char *method, char *format, ...}
Fred Drakee058b4f1998-02-16 06:15:35 +00001646Call the method named \var{m} of object \var{o} with a variable number
Fred Drake659ebfa2000-04-03 15:42:13 +00001647of C arguments. The C arguments are described by a
Fred Drakee058b4f1998-02-16 06:15:35 +00001648\cfunction{Py_BuildValue()} format string. The format may be \NULL{},
1649indicating that no arguments are provided. Returns the result of the
1650call on success, or \NULL{} on failure. This is the equivalent of the
1651Python expression \samp{\var{o}.\var{method}(\var{args})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001652Note that special method names, such as \method{__add__()},
1653\method{__getitem__()}, and so on are not supported. The specific
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001654abstract-object routines for these must be used.
1655\end{cfuncdesc}
1656
1657
1658\begin{cfuncdesc}{int}{PyObject_Hash}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001659Compute and return the hash value of an object \var{o}. On
1660failure, return \code{-1}. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001661expression \samp{hash(\var{o})}.\bifuncindex{hash}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001662\end{cfuncdesc}
1663
1664
1665\begin{cfuncdesc}{int}{PyObject_IsTrue}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001666Returns \code{1} if the object \var{o} is considered to be true, and
1667\code{0} otherwise. This is equivalent to the Python expression
1668\samp{not not \var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001669This function always succeeds.
1670\end{cfuncdesc}
1671
1672
1673\begin{cfuncdesc}{PyObject*}{PyObject_Type}{PyObject *o}
1674On success, returns a type object corresponding to the object
Fred Drakee058b4f1998-02-16 06:15:35 +00001675type of object \var{o}. On failure, returns \NULL{}. This is
1676equivalent to the Python expression \samp{type(\var{o})}.
Fred Drake53fb7721998-02-16 06:23:20 +00001677\bifuncindex{type}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001678\end{cfuncdesc}
1679
1680\begin{cfuncdesc}{int}{PyObject_Length}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001681Return the length of object \var{o}. If the object \var{o} provides
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001682both sequence and mapping protocols, the sequence length is
Fred Drake659ebfa2000-04-03 15:42:13 +00001683returned. On error, \code{-1} is returned. This is the equivalent
1684to the Python expression \samp{len(\var{o})}.\bifuncindex{len}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001685\end{cfuncdesc}
1686
1687
1688\begin{cfuncdesc}{PyObject*}{PyObject_GetItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001689Return element of \var{o} corresponding to the object \var{key} or
1690\NULL{} on failure. This is the equivalent of the Python expression
1691\samp{\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001692\end{cfuncdesc}
1693
1694
1695\begin{cfuncdesc}{int}{PyObject_SetItem}{PyObject *o, PyObject *key, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001696Map the object \var{key} to the value \var{v}.
1697Returns \code{-1} on failure. This is the equivalent
1698of the Python statement \samp{\var{o}[\var{key}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001699\end{cfuncdesc}
1700
1701
Guido van Rossumd1dbf631999-01-22 20:10:49 +00001702\begin{cfuncdesc}{int}{PyObject_DelItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001703Delete the mapping for \var{key} from \var{o}. Returns \code{-1} on
1704failure. This is the equivalent of the Python statement \samp{del
1705\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001706\end{cfuncdesc}
1707
Andrew M. Kuchling8c46b302000-07-13 23:58:16 +00001708\begin{cfuncdesc}{int}{PyObject_AsFileDescriptor}{PyObject *o}
1709Derives a file-descriptor from a Python object. If the object
1710is an integer or long integer, its value is returned. If not, the
1711object's \method{fileno()} method is called if it exists; the method
1712must return an integer or long integer, which is returned as the file
1713descriptor value. Returns \code{-1} on failure.
1714\end{cfuncdesc}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001715
Fred Drakeefd146c1999-02-15 15:30:45 +00001716\section{Number Protocol \label{number}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001717
1718\begin{cfuncdesc}{int}{PyNumber_Check}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001719Returns \code{1} if the object \var{o} provides numeric protocols, and
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001720false otherwise.
1721This function always succeeds.
1722\end{cfuncdesc}
1723
1724
1725\begin{cfuncdesc}{PyObject*}{PyNumber_Add}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001726Returns the result of adding \var{o1} and \var{o2}, or \NULL{} on
1727failure. This is the equivalent of the Python expression
1728\samp{\var{o1} + \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001729\end{cfuncdesc}
1730
1731
1732\begin{cfuncdesc}{PyObject*}{PyNumber_Subtract}{PyObject *o1, PyObject *o2}
Fred Drake659ebfa2000-04-03 15:42:13 +00001733Returns the result of subtracting \var{o2} from \var{o1}, or
1734\NULL{} on failure. This is the equivalent of the Python expression
Fred Drakee058b4f1998-02-16 06:15:35 +00001735\samp{\var{o1} - \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001736\end{cfuncdesc}
1737
1738
1739\begin{cfuncdesc}{PyObject*}{PyNumber_Multiply}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001740Returns the result of multiplying \var{o1} and \var{o2}, or \NULL{} on
1741failure. This is the equivalent of the Python expression
1742\samp{\var{o1} * \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001743\end{cfuncdesc}
1744
1745
1746\begin{cfuncdesc}{PyObject*}{PyNumber_Divide}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001747Returns the result of dividing \var{o1} by \var{o2}, or \NULL{} on
1748failure.
1749This is the equivalent of the Python expression \samp{\var{o1} /
1750\var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001751\end{cfuncdesc}
1752
1753
1754\begin{cfuncdesc}{PyObject*}{PyNumber_Remainder}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001755Returns the remainder of dividing \var{o1} by \var{o2}, or \NULL{} on
1756failure. This is the equivalent of the Python expression
Fred Drake659ebfa2000-04-03 15:42:13 +00001757\samp{\var{o1} \%\ \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001758\end{cfuncdesc}
1759
1760
1761\begin{cfuncdesc}{PyObject*}{PyNumber_Divmod}{PyObject *o1, PyObject *o2}
Fred Drake53fb7721998-02-16 06:23:20 +00001762See the built-in function \function{divmod()}\bifuncindex{divmod}.
1763Returns \NULL{} on failure. This is the equivalent of the Python
1764expression \samp{divmod(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001765\end{cfuncdesc}
1766
1767
1768\begin{cfuncdesc}{PyObject*}{PyNumber_Power}{PyObject *o1, PyObject *o2, PyObject *o3}
Fred Drake53fb7721998-02-16 06:23:20 +00001769See the built-in function \function{pow()}\bifuncindex{pow}. Returns
1770\NULL{} on failure. This is the equivalent of the Python expression
Fred Drakee058b4f1998-02-16 06:15:35 +00001771\samp{pow(\var{o1}, \var{o2}, \var{o3})}, where \var{o3} is optional.
Fred Drake659ebfa2000-04-03 15:42:13 +00001772If \var{o3} is to be ignored, pass \cdata{Py_None} in its place
1773(passing \NULL{} for \var{o3} would cause an illegal memory access).
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001774\end{cfuncdesc}
1775
1776
1777\begin{cfuncdesc}{PyObject*}{PyNumber_Negative}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001778Returns the negation of \var{o} on success, or \NULL{} on failure.
1779This is the equivalent of the Python expression \samp{-\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001780\end{cfuncdesc}
1781
1782
1783\begin{cfuncdesc}{PyObject*}{PyNumber_Positive}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001784Returns \var{o} on success, or \NULL{} on failure.
1785This is the equivalent of the Python expression \samp{+\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001786\end{cfuncdesc}
1787
1788
1789\begin{cfuncdesc}{PyObject*}{PyNumber_Absolute}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001790Returns the absolute value of \var{o}, or \NULL{} on failure. This is
1791the equivalent of the Python expression \samp{abs(\var{o})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001792\bifuncindex{abs}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001793\end{cfuncdesc}
1794
1795
1796\begin{cfuncdesc}{PyObject*}{PyNumber_Invert}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001797Returns the bitwise negation of \var{o} on success, or \NULL{} on
1798failure. This is the equivalent of the Python expression
1799\samp{\~\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001800\end{cfuncdesc}
1801
1802
1803\begin{cfuncdesc}{PyObject*}{PyNumber_Lshift}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001804Returns the result of left shifting \var{o1} by \var{o2} on success,
1805or \NULL{} on failure. This is the equivalent of the Python
Fred Draked20d8b32001-04-13 14:52:39 +00001806expression \samp{\var{o1} <\code{<} \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001807\end{cfuncdesc}
1808
1809
1810\begin{cfuncdesc}{PyObject*}{PyNumber_Rshift}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001811Returns the result of right shifting \var{o1} by \var{o2} on success,
1812or \NULL{} on failure. This is the equivalent of the Python
Fred Draked20d8b32001-04-13 14:52:39 +00001813expression \samp{\var{o1} >\code{>} \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001814\end{cfuncdesc}
1815
1816
1817\begin{cfuncdesc}{PyObject*}{PyNumber_And}{PyObject *o1, PyObject *o2}
Fred Drake7740a012000-09-12 20:27:05 +00001818Returns the ``bitwise and'' of \var{o2} and \var{o2} on success and
1819\NULL{} on failure. This is the equivalent of the Python expression
Fred Drake5566c1c2001-01-19 22:48:33 +00001820\samp{\var{o1} \&\ \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001821\end{cfuncdesc}
1822
1823
1824\begin{cfuncdesc}{PyObject*}{PyNumber_Xor}{PyObject *o1, PyObject *o2}
Fred Drake7740a012000-09-12 20:27:05 +00001825Returns the ``bitwise exclusive or'' of \var{o1} by \var{o2} on success,
Fred Drakee058b4f1998-02-16 06:15:35 +00001826or \NULL{} on failure. This is the equivalent of the Python
Fred Drake755c23d2001-07-14 03:05:53 +00001827expression \samp{\var{o1} \textasciicircum{} \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001828\end{cfuncdesc}
1829
1830\begin{cfuncdesc}{PyObject*}{PyNumber_Or}{PyObject *o1, PyObject *o2}
Fred Drake7740a012000-09-12 20:27:05 +00001831Returns the ``bitwise or'' of \var{o1} and \var{o2} on success, or
1832\NULL{} on failure. This is the equivalent of the Python expression
1833\samp{\var{o1} | \var{o2}}.
1834\end{cfuncdesc}
1835
1836
1837\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceAdd}{PyObject *o1, PyObject *o2}
1838Returns the result of adding \var{o1} and \var{o2}, or \NULL{} on failure.
1839The operation is done \emph{in-place} when \var{o1} supports it. This is the
1840equivalent of the Python expression \samp{\var{o1} += \var{o2}}.
1841\end{cfuncdesc}
1842
1843
1844\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceSubtract}{PyObject *o1, PyObject *o2}
1845Returns the result of subtracting \var{o2} from \var{o1}, or
1846\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1847supports it. This is the equivalent of the Python expression \samp{\var{o1}
1848-= \var{o2}}.
1849\end{cfuncdesc}
1850
1851
1852\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceMultiply}{PyObject *o1, PyObject *o2}
1853Returns the result of multiplying \var{o1} and \var{o2}, or \NULL{} on
1854failure. The operation is done \emph{in-place} when \var{o1} supports it.
1855This is the equivalent of the Python expression \samp{\var{o1} *= \var{o2}}.
1856\end{cfuncdesc}
1857
1858
1859\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceDivide}{PyObject *o1, PyObject *o2}
1860Returns the result of dividing \var{o1} by \var{o2}, or \NULL{} on failure.
1861The operation is done \emph{in-place} when \var{o1} supports it. This is the
1862equivalent of the Python expression \samp{\var{o1} /= \var{o2}}.
1863\end{cfuncdesc}
1864
1865
1866\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceRemainder}{PyObject *o1, PyObject *o2}
1867Returns the remainder of dividing \var{o1} by \var{o2}, or \NULL{} on
1868failure. The operation is done \emph{in-place} when \var{o1} supports it.
1869This is the equivalent of the Python expression \samp{\var{o1} \%= \var{o2}}.
1870\end{cfuncdesc}
1871
1872
1873\begin{cfuncdesc}{PyObject*}{PyNumber_InPlacePower}{PyObject *o1, PyObject *o2, PyObject *o3}
1874See the built-in function \function{pow()}\bifuncindex{pow}. Returns
1875\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1876supports it. This is the equivalent of the Python expression \samp{\var{o1}
1877**= \var{o2}} when o3 is \cdata{Py_None}, or an in-place variant of
Fred Drake5566c1c2001-01-19 22:48:33 +00001878\samp{pow(\var{o1}, \var{o2}, \var{o3})} otherwise. If \var{o3} is to be
Fred Drake7740a012000-09-12 20:27:05 +00001879ignored, pass \cdata{Py_None} in its place (passing \NULL{} for \var{o3}
1880would cause an illegal memory access).
1881\end{cfuncdesc}
1882
1883\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceLshift}{PyObject *o1, PyObject *o2}
1884Returns the result of left shifting \var{o1} by \var{o2} on success, or
1885\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1886supports it. This is the equivalent of the Python expression \samp{\var{o1}
Fred Draked20d8b32001-04-13 14:52:39 +00001887<\code{<=} \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001888\end{cfuncdesc}
1889
1890
1891\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceRshift}{PyObject *o1, PyObject *o2}
1892Returns the result of right shifting \var{o1} by \var{o2} on success, or
1893\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1894supports it. This is the equivalent of the Python expression \samp{\var{o1}
Fred Draked20d8b32001-04-13 14:52:39 +00001895>\code{>=} \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001896\end{cfuncdesc}
1897
1898
1899\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceAnd}{PyObject *o1, PyObject *o2}
Fred Drake5566c1c2001-01-19 22:48:33 +00001900Returns the ``bitwise and'' of \var{o1} and \var{o2} on success
1901and \NULL{} on failure. The operation is done \emph{in-place} when
1902\var{o1} supports it. This is the equivalent of the Python expression
1903\samp{\var{o1} \&= \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001904\end{cfuncdesc}
1905
1906
1907\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceXor}{PyObject *o1, PyObject *o2}
1908Returns the ``bitwise exclusive or'' of \var{o1} by \var{o2} on success, or
1909\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1910supports it. This is the equivalent of the Python expression \samp{\var{o1}
Fred Drake377fb1e2001-07-14 03:01:48 +00001911\textasciicircum= \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001912\end{cfuncdesc}
1913
1914\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceOr}{PyObject *o1, PyObject *o2}
1915Returns the ``bitwise or'' of \var{o1} and \var{o2} on success, or \NULL{}
1916on failure. The operation is done \emph{in-place} when \var{o1} supports
1917it. This is the equivalent of the Python expression \samp{\var{o1} |=
1918\var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001919\end{cfuncdesc}
1920
Fred Drakec0e6c5b2000-09-22 18:17:49 +00001921\begin{cfuncdesc}{int}{PyNumber_Coerce}{PyObject **p1, PyObject **p2}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001922This function takes the addresses of two variables of type
Fred Drake659ebfa2000-04-03 15:42:13 +00001923\ctype{PyObject*}. If the objects pointed to by \code{*\var{p1}} and
1924\code{*\var{p2}} have the same type, increment their reference count
1925and return \code{0} (success). If the objects can be converted to a
1926common numeric type, replace \code{*p1} and \code{*p2} by their
1927converted value (with 'new' reference counts), and return \code{0}.
1928If no conversion is possible, or if some other error occurs, return
1929\code{-1} (failure) and don't increment the reference counts. The
1930call \code{PyNumber_Coerce(\&o1, \&o2)} is equivalent to the Python
1931statement \samp{\var{o1}, \var{o2} = coerce(\var{o1}, \var{o2})}.
1932\bifuncindex{coerce}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001933\end{cfuncdesc}
1934
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001935\begin{cfuncdesc}{PyObject*}{PyNumber_Int}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001936Returns the \var{o} converted to an integer object on success, or
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001937\NULL{} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001938expression \samp{int(\var{o})}.\bifuncindex{int}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001939\end{cfuncdesc}
1940
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001941\begin{cfuncdesc}{PyObject*}{PyNumber_Long}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001942Returns the \var{o} converted to a long integer object on success,
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001943or \NULL{} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001944expression \samp{long(\var{o})}.\bifuncindex{long}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001945\end{cfuncdesc}
1946
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001947\begin{cfuncdesc}{PyObject*}{PyNumber_Float}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001948Returns the \var{o} converted to a float object on success, or
1949\NULL{} on failure. This is the equivalent of the Python expression
1950\samp{float(\var{o})}.\bifuncindex{float}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001951\end{cfuncdesc}
1952
1953
Fred Drakeefd146c1999-02-15 15:30:45 +00001954\section{Sequence Protocol \label{sequence}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001955
1956\begin{cfuncdesc}{int}{PySequence_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001957Return \code{1} if the object provides sequence protocol, and
1958\code{0} otherwise. This function always succeeds.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001959\end{cfuncdesc}
1960
Fred Drakec6a3cb42001-04-04 01:25:17 +00001961\begin{cfuncdesc}{int}{PySequence_Size}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001962Returns the number of objects in sequence \var{o} on success, and
1963\code{-1} on failure. For objects that do not provide sequence
1964protocol, this is equivalent to the Python expression
1965\samp{len(\var{o})}.\bifuncindex{len}
1966\end{cfuncdesc}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001967
Fred Drakec6a3cb42001-04-04 01:25:17 +00001968\begin{cfuncdesc}{int}{PySequence_Length}{PyObject *o}
1969Alternate name for \cfunction{PySequence_Size()}.
1970\end{cfuncdesc}
1971
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001972\begin{cfuncdesc}{PyObject*}{PySequence_Concat}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001973Return the concatenation of \var{o1} and \var{o2} on success, and \NULL{} on
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001974failure. This is the equivalent of the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00001975expression \samp{\var{o1} + \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001976\end{cfuncdesc}
1977
1978
1979\begin{cfuncdesc}{PyObject*}{PySequence_Repeat}{PyObject *o, int count}
Fred Drake659ebfa2000-04-03 15:42:13 +00001980Return the result of repeating sequence object
1981\var{o} \var{count} times, or \NULL{} on failure. This is the
1982equivalent of the Python expression \samp{\var{o} * \var{count}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001983\end{cfuncdesc}
1984
Fred Drake7740a012000-09-12 20:27:05 +00001985\begin{cfuncdesc}{PyObject*}{PySequence_InPlaceConcat}{PyObject *o1, PyObject *o2}
1986Return the concatenation of \var{o1} and \var{o2} on success, and \NULL{} on
1987failure. The operation is done \emph{in-place} when \var{o1} supports it.
1988This is the equivalent of the Python expression \samp{\var{o1} += \var{o2}}.
1989\end{cfuncdesc}
1990
1991
1992\begin{cfuncdesc}{PyObject*}{PySequence_InPlaceRepeat}{PyObject *o, int count}
1993Return the result of repeating sequence object \var{o} \var{count} times, or
1994\NULL{} on failure. The operation is done \emph{in-place} when \var{o}
1995supports it. This is the equivalent of the Python expression \samp{\var{o}
1996*= \var{count}}.
1997\end{cfuncdesc}
1998
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001999
2000\begin{cfuncdesc}{PyObject*}{PySequence_GetItem}{PyObject *o, int i}
Fred Drakee058b4f1998-02-16 06:15:35 +00002001Return the \var{i}th element of \var{o}, or \NULL{} on failure. This
2002is the equivalent of the Python expression \samp{\var{o}[\var{i}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002003\end{cfuncdesc}
2004
2005
2006\begin{cfuncdesc}{PyObject*}{PySequence_GetSlice}{PyObject *o, int i1, int i2}
Fred Drakee058b4f1998-02-16 06:15:35 +00002007Return the slice of sequence object \var{o} between \var{i1} and
2008\var{i2}, or \NULL{} on failure. This is the equivalent of the Python
2009expression \samp{\var{o}[\var{i1}:\var{i2}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002010\end{cfuncdesc}
2011
2012
2013\begin{cfuncdesc}{int}{PySequence_SetItem}{PyObject *o, int i, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00002014Assign object \var{v} to the \var{i}th element of \var{o}.
2015Returns \code{-1} on failure. This is the equivalent of the Python
2016statement \samp{\var{o}[\var{i}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002017\end{cfuncdesc}
2018
2019\begin{cfuncdesc}{int}{PySequence_DelItem}{PyObject *o, int i}
Fred Drake5566c1c2001-01-19 22:48:33 +00002020Delete the \var{i}th element of object \var{o}. Returns
Fred Drakee058b4f1998-02-16 06:15:35 +00002021\code{-1} on failure. This is the equivalent of the Python
2022statement \samp{del \var{o}[\var{i}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002023\end{cfuncdesc}
2024
Fred Drake659ebfa2000-04-03 15:42:13 +00002025\begin{cfuncdesc}{int}{PySequence_SetSlice}{PyObject *o, int i1,
2026 int i2, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00002027Assign the sequence object \var{v} to the slice in sequence
2028object \var{o} from \var{i1} to \var{i2}. This is the equivalent of
2029the Python statement \samp{\var{o}[\var{i1}:\var{i2}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002030\end{cfuncdesc}
2031
2032\begin{cfuncdesc}{int}{PySequence_DelSlice}{PyObject *o, int i1, int i2}
Fred Drakee058b4f1998-02-16 06:15:35 +00002033Delete the slice in sequence object \var{o} from \var{i1} to \var{i2}.
2034Returns \code{-1} on failure. This is the equivalent of the Python
2035statement \samp{del \var{o}[\var{i1}:\var{i2}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002036\end{cfuncdesc}
2037
2038\begin{cfuncdesc}{PyObject*}{PySequence_Tuple}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002039Returns the \var{o} as a tuple on success, and \NULL{} on failure.
Fred Drake659ebfa2000-04-03 15:42:13 +00002040This is equivalent to the Python expression \samp{tuple(\var{o})}.
2041\bifuncindex{tuple}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002042\end{cfuncdesc}
2043
2044\begin{cfuncdesc}{int}{PySequence_Count}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00002045Return the number of occurrences of \var{value} in \var{o}, that is,
2046return the number of keys for which \code{\var{o}[\var{key}] ==
2047\var{value}}. On failure, return \code{-1}. This is equivalent to
2048the Python expression \samp{\var{o}.count(\var{value})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002049\end{cfuncdesc}
2050
Fred Drake659ebfa2000-04-03 15:42:13 +00002051\begin{cfuncdesc}{int}{PySequence_Contains}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00002052Determine if \var{o} contains \var{value}. If an item in \var{o} is
2053equal to \var{value}, return \code{1}, otherwise return \code{0}. On
2054error, return \code{-1}. This is equivalent to the Python expression
2055\samp{\var{value} in \var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002056\end{cfuncdesc}
2057
2058\begin{cfuncdesc}{int}{PySequence_Index}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00002059Return the first index \var{i} for which \code{\var{o}[\var{i}] ==
2060\var{value}}. On error, return \code{-1}. This is equivalent to
2061the Python expression \samp{\var{o}.index(\var{value})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002062\end{cfuncdesc}
2063
Fred Drakea8455ab2000-06-16 19:58:42 +00002064\begin{cfuncdesc}{PyObject*}{PySequence_List}{PyObject *o}
2065Return a list object with the same contents as the arbitrary sequence
2066\var{o}. The returned list is guaranteed to be new.
2067\end{cfuncdesc}
2068
2069\begin{cfuncdesc}{PyObject*}{PySequence_Tuple}{PyObject *o}
2070Return a tuple object with the same contents as the arbitrary sequence
2071\var{o}. If \var{o} is a tuple, a new reference will be returned,
2072otherwise a tuple will be constructed with the appropriate contents.
2073\end{cfuncdesc}
2074
Fred Drakef39ed671998-02-26 22:01:23 +00002075
Fred Drake81cccb72000-09-12 15:22:05 +00002076\begin{cfuncdesc}{PyObject*}{PySequence_Fast}{PyObject *o, const char *m}
2077Returns the sequence \var{o} as a tuple, unless it is already a
2078tuple or list, in which case \var{o} is returned. Use
2079\cfunction{PySequence_Fast_GET_ITEM()} to access the members of the
2080result. Returns \NULL{} on failure. If the object is not a sequence,
2081raises \exception{TypeError} with \var{m} as the message text.
2082\end{cfuncdesc}
2083
2084\begin{cfuncdesc}{PyObject*}{PySequence_Fast_GET_ITEM}{PyObject *o, int i}
2085Return the \var{i}th element of \var{o}, assuming that \var{o} was
2086returned by \cfunction{PySequence_Fast()}, and that \var{i} is within
2087bounds. The caller is expected to get the length of the sequence by
Fred Drake96a2a802001-05-29 18:51:41 +00002088calling \cfunction{PySequence_Size()} on \var{o}, since lists and tuples
Fred Drake81cccb72000-09-12 15:22:05 +00002089are guaranteed to always return their true length.
2090\end{cfuncdesc}
2091
2092
Fred Drakeefd146c1999-02-15 15:30:45 +00002093\section{Mapping Protocol \label{mapping}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002094
2095\begin{cfuncdesc}{int}{PyMapping_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00002096Return \code{1} if the object provides mapping protocol, and
2097\code{0} otherwise. This function always succeeds.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002098\end{cfuncdesc}
2099
2100
2101\begin{cfuncdesc}{int}{PyMapping_Length}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00002102Returns the number of keys in object \var{o} on success, and
2103\code{-1} on failure. For objects that do not provide mapping
2104protocol, this is equivalent to the Python expression
2105\samp{len(\var{o})}.\bifuncindex{len}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002106\end{cfuncdesc}
2107
2108
2109\begin{cfuncdesc}{int}{PyMapping_DelItemString}{PyObject *o, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002110Remove the mapping for object \var{key} from the object \var{o}.
2111Return \code{-1} on failure. This is equivalent to
2112the Python statement \samp{del \var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002113\end{cfuncdesc}
2114
2115
2116\begin{cfuncdesc}{int}{PyMapping_DelItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002117Remove the mapping for object \var{key} from the object \var{o}.
2118Return \code{-1} on failure. This is equivalent to
2119the Python statement \samp{del \var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002120\end{cfuncdesc}
2121
2122
2123\begin{cfuncdesc}{int}{PyMapping_HasKeyString}{PyObject *o, char *key}
Fred Drake659ebfa2000-04-03 15:42:13 +00002124On success, return \code{1} if the mapping object has the key
2125\var{key} and \code{0} otherwise. This is equivalent to the Python
2126expression \samp{\var{o}.has_key(\var{key})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002127This function always succeeds.
2128\end{cfuncdesc}
2129
2130
2131\begin{cfuncdesc}{int}{PyMapping_HasKey}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002132Return \code{1} if the mapping object has the key \var{key} and
2133\code{0} otherwise. This is equivalent to the Python expression
2134\samp{\var{o}.has_key(\var{key})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002135This function always succeeds.
2136\end{cfuncdesc}
2137
2138
2139\begin{cfuncdesc}{PyObject*}{PyMapping_Keys}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002140On success, return a list of the keys in object \var{o}. On
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002141failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00002142expression \samp{\var{o}.keys()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002143\end{cfuncdesc}
2144
2145
2146\begin{cfuncdesc}{PyObject*}{PyMapping_Values}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002147On success, return a list of the values in object \var{o}. On
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002148failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00002149expression \samp{\var{o}.values()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002150\end{cfuncdesc}
2151
2152
2153\begin{cfuncdesc}{PyObject*}{PyMapping_Items}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002154On success, return a list of the items in object \var{o}, where
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002155each item is a tuple containing a key-value pair. On
2156failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00002157expression \samp{\var{o}.items()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002158\end{cfuncdesc}
2159
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002160
2161\begin{cfuncdesc}{PyObject*}{PyMapping_GetItemString}{PyObject *o, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002162Return element of \var{o} corresponding to the object \var{key} or
2163\NULL{} on failure. This is the equivalent of the Python expression
2164\samp{\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002165\end{cfuncdesc}
2166
Fred Drakedbcaeda2001-05-07 17:42:18 +00002167\begin{cfuncdesc}{int}{PyMapping_SetItemString}{PyObject *o, char *key,
2168 PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00002169Map the object \var{key} to the value \var{v} in object \var{o}.
2170Returns \code{-1} on failure. This is the equivalent of the Python
2171statement \samp{\var{o}[\var{key}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002172\end{cfuncdesc}
2173
2174
Fred Drakedbcaeda2001-05-07 17:42:18 +00002175\section{Iterator Protocol \label{iterator}}
2176
Fred Drakea8e08272001-05-07 17:47:07 +00002177\versionadded{2.2}
2178
Fred Drakedbcaeda2001-05-07 17:42:18 +00002179There are only a couple of functions specifically for working with
2180iterators.
2181
2182\begin{cfuncdesc}{int}{PyIter_Check}{PyObject *o}
2183 Return true if the object \var{o} supports the iterator protocol.
2184\end{cfuncdesc}
2185
2186\begin{cfuncdesc}{PyObject*}{PyIter_Next}{PyObject *o}
2187 Return the next value from the iteration \var{o}. If the object is
2188 an iterator, this retrieves the next value from the iteration, and
2189 returns \NULL{} with no exception set if there are no remaining
2190 items. If the object is not an iterator, \exception{TypeError} is
2191 raised, or if there is an error in retrieving the item, returns
2192 \NULL{} and passes along the exception.
2193\end{cfuncdesc}
2194
2195To write a loop which iterates over an iterator, the C code should
2196look something like this:
2197
2198\begin{verbatim}
2199PyObject *iterator = ...;
2200PyObject *item;
2201
2202while (item = PyIter_Next(iter)) {
2203 /* do something with item */
2204}
2205if (PyErr_Occurred()) {
2206 /* propogate error */
2207}
2208else {
2209 /* continue doing useful work */
2210}
2211\end{verbatim}
2212
2213
Fred Drakeefd146c1999-02-15 15:30:45 +00002214\chapter{Concrete Objects Layer \label{concrete}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002215
2216The functions in this chapter are specific to certain Python object
2217types. Passing them an object of the wrong type is not a good idea;
2218if you receive an object from a Python program and you are not sure
2219that it has the right type, you must perform a type check first;
Fred Drake5566c1c2001-01-19 22:48:33 +00002220for example, to check that an object is a dictionary, use
Fred Drakee5bf8b21998-02-12 21:22:28 +00002221\cfunction{PyDict_Check()}. The chapter is structured like the
2222``family tree'' of Python object types.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002223
Fred Drake89024422000-10-23 16:00:54 +00002224\strong{Warning:}
2225While the functions described in this chapter carefully check the type
2226of the objects which are passed in, many of them do not check for
2227\NULL{} being passed instead of a valid object. Allowing \NULL{} to
2228be passed in can cause memory access violations and immediate
2229termination of the interpreter.
2230
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002231
Fred Drakeefd146c1999-02-15 15:30:45 +00002232\section{Fundamental Objects \label{fundamental}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002233
Fred Drakee5bf8b21998-02-12 21:22:28 +00002234This section describes Python type objects and the singleton object
2235\code{None}.
2236
2237
Fred Drakeefd146c1999-02-15 15:30:45 +00002238\subsection{Type Objects \label{typeObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002239
Fred Drake659ebfa2000-04-03 15:42:13 +00002240\obindex{type}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002241\begin{ctypedesc}{PyTypeObject}
Fred Drake659ebfa2000-04-03 15:42:13 +00002242The C structure of the objects used to describe built-in types.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002243\end{ctypedesc}
2244
Fred Drake659ebfa2000-04-03 15:42:13 +00002245\begin{cvardesc}{PyObject*}{PyType_Type}
Fred Drakeefd146c1999-02-15 15:30:45 +00002246This is the type object for type objects; it is the same object as
2247\code{types.TypeType} in the Python layer.
Fred Drake659ebfa2000-04-03 15:42:13 +00002248\withsubitem{(in module types)}{\ttindex{TypeType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002249\end{cvardesc}
2250
Fred Drake659ebfa2000-04-03 15:42:13 +00002251\begin{cfuncdesc}{int}{PyType_Check}{PyObject *o}
2252Returns true is the object \var{o} is a type object.
2253\end{cfuncdesc}
2254
2255\begin{cfuncdesc}{int}{PyType_HasFeature}{PyObject *o, int feature}
2256Returns true if the type object \var{o} sets the feature
Fred Drakef0e08ef2001-02-03 01:11:26 +00002257\var{feature}. Type features are denoted by single bit flags.
Fred Drake659ebfa2000-04-03 15:42:13 +00002258\end{cfuncdesc}
2259
Fred Drakee5bf8b21998-02-12 21:22:28 +00002260
Fred Drakeefd146c1999-02-15 15:30:45 +00002261\subsection{The None Object \label{noneObject}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002262
Fred Drake659ebfa2000-04-03 15:42:13 +00002263\obindex{None@\texttt{None}}
2264Note that the \ctype{PyTypeObject} for \code{None} is not directly
2265exposed in the Python/C API. Since \code{None} is a singleton,
2266testing for object identity (using \samp{==} in C) is sufficient.
2267There is no \cfunction{PyNone_Check()} function for the same reason.
2268
2269\begin{cvardesc}{PyObject*}{Py_None}
Guido van Rossum44475131998-04-21 15:30:01 +00002270The Python \code{None} object, denoting lack of value. This object has
2271no methods.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002272\end{cvardesc}
2273
2274
Fred Drakefa774872001-07-11 20:35:37 +00002275\section{Numeric Objects \label{numericObjects}}
2276
2277\obindex{numeric}
2278
2279
2280\subsection{Plain Integer Objects \label{intObjects}}
2281
2282\obindex{integer}
2283\begin{ctypedesc}{PyIntObject}
2284This subtype of \ctype{PyObject} represents a Python integer object.
2285\end{ctypedesc}
2286
2287\begin{cvardesc}{PyTypeObject}{PyInt_Type}
2288This instance of \ctype{PyTypeObject} represents the Python plain
2289integer type. This is the same object as \code{types.IntType}.
2290\withsubitem{(in modules types)}{\ttindex{IntType}}
2291\end{cvardesc}
2292
2293\begin{cfuncdesc}{int}{PyInt_Check}{PyObject* o}
2294Returns true if \var{o} is of type \cdata{PyInt_Type}.
2295\end{cfuncdesc}
2296
2297\begin{cfuncdesc}{PyObject*}{PyInt_FromLong}{long ival}
2298Creates a new integer object with a value of \var{ival}.
2299
2300The current implementation keeps an array of integer objects for all
2301integers between \code{-1} and \code{100}, when you create an int in
2302that range you actually just get back a reference to the existing
2303object. So it should be possible to change the value of \code{1}. I
2304suspect the behaviour of Python in this case is undefined. :-)
2305\end{cfuncdesc}
2306
2307\begin{cfuncdesc}{long}{PyInt_AsLong}{PyObject *io}
2308Will first attempt to cast the object to a \ctype{PyIntObject}, if
2309it is not already one, and then return its value.
2310\end{cfuncdesc}
2311
2312\begin{cfuncdesc}{long}{PyInt_AS_LONG}{PyObject *io}
2313Returns the value of the object \var{io}. No error checking is
2314performed.
2315\end{cfuncdesc}
2316
2317\begin{cfuncdesc}{long}{PyInt_GetMax}{}
2318Returns the system's idea of the largest integer it can handle
2319(\constant{LONG_MAX}\ttindex{LONG_MAX}, as defined in the system
2320header files).
2321\end{cfuncdesc}
2322
2323
2324\subsection{Long Integer Objects \label{longObjects}}
2325
2326\obindex{long integer}
2327\begin{ctypedesc}{PyLongObject}
2328This subtype of \ctype{PyObject} represents a Python long integer
2329object.
2330\end{ctypedesc}
2331
2332\begin{cvardesc}{PyTypeObject}{PyLong_Type}
2333This instance of \ctype{PyTypeObject} represents the Python long
2334integer type. This is the same object as \code{types.LongType}.
2335\withsubitem{(in modules types)}{\ttindex{LongType}}
2336\end{cvardesc}
2337
2338\begin{cfuncdesc}{int}{PyLong_Check}{PyObject *p}
2339Returns true if its argument is a \ctype{PyLongObject}.
2340\end{cfuncdesc}
2341
2342\begin{cfuncdesc}{PyObject*}{PyLong_FromLong}{long v}
2343Returns a new \ctype{PyLongObject} object from \var{v}, or \NULL{} on
2344failure.
2345\end{cfuncdesc}
2346
2347\begin{cfuncdesc}{PyObject*}{PyLong_FromUnsignedLong}{unsigned long v}
2348Returns a new \ctype{PyLongObject} object from a C \ctype{unsigned
2349long}, or \NULL{} on failure.
2350\end{cfuncdesc}
2351
2352\begin{cfuncdesc}{PyObject*}{PyLong_FromDouble}{double v}
2353Returns a new \ctype{PyLongObject} object from the integer part of
2354\var{v}, or \NULL{} on failure.
2355\end{cfuncdesc}
2356
2357\begin{cfuncdesc}{long}{PyLong_AsLong}{PyObject *pylong}
2358Returns a C \ctype{long} representation of the contents of
2359\var{pylong}. If \var{pylong} is greater than
2360\constant{LONG_MAX}\ttindex{LONG_MAX}, an \exception{OverflowError} is
2361raised.\withsubitem{(built-in exception)}{\ttindex{OverflowError}}
2362\end{cfuncdesc}
2363
2364\begin{cfuncdesc}{unsigned long}{PyLong_AsUnsignedLong}{PyObject *pylong}
2365Returns a C \ctype{unsigned long} representation of the contents of
2366\var{pylong}. If \var{pylong} is greater than
2367\constant{ULONG_MAX}\ttindex{ULONG_MAX}, an \exception{OverflowError}
2368is raised.\withsubitem{(built-in exception)}{\ttindex{OverflowError}}
2369\end{cfuncdesc}
2370
2371\begin{cfuncdesc}{double}{PyLong_AsDouble}{PyObject *pylong}
2372Returns a C \ctype{double} representation of the contents of \var{pylong}.
2373\end{cfuncdesc}
2374
2375\begin{cfuncdesc}{PyObject*}{PyLong_FromString}{char *str, char **pend,
2376 int base}
2377Return a new \ctype{PyLongObject} based on the string value in
2378\var{str}, which is interpreted according to the radix in \var{base}.
2379If \var{pend} is non-\NULL, \code{*\var{pend}} will point to the first
2380character in \var{str} which follows the representation of the
2381number. If \var{base} is \code{0}, the radix will be determined base
2382on the leading characters of \var{str}: if \var{str} starts with
2383\code{'0x'} or \code{'0X'}, radix 16 will be used; if \var{str} starts
2384with \code{'0'}, radix 8 will be used; otherwise radix 10 will be
2385used. If \var{base} is not \code{0}, it must be between \code{2} and
2386\code{36}, inclusive. Leading spaces are ignored. If there are no
2387digits, \exception{ValueError} will be raised.
2388\end{cfuncdesc}
2389
2390
2391\subsection{Floating Point Objects \label{floatObjects}}
2392
2393\obindex{floating point}
2394\begin{ctypedesc}{PyFloatObject}
2395This subtype of \ctype{PyObject} represents a Python floating point
2396object.
2397\end{ctypedesc}
2398
2399\begin{cvardesc}{PyTypeObject}{PyFloat_Type}
2400This instance of \ctype{PyTypeObject} represents the Python floating
2401point type. This is the same object as \code{types.FloatType}.
2402\withsubitem{(in modules types)}{\ttindex{FloatType}}
2403\end{cvardesc}
2404
2405\begin{cfuncdesc}{int}{PyFloat_Check}{PyObject *p}
2406Returns true if its argument is a \ctype{PyFloatObject}.
2407\end{cfuncdesc}
2408
2409\begin{cfuncdesc}{PyObject*}{PyFloat_FromDouble}{double v}
2410Creates a \ctype{PyFloatObject} object from \var{v}, or \NULL{} on
2411failure.
2412\end{cfuncdesc}
2413
2414\begin{cfuncdesc}{double}{PyFloat_AsDouble}{PyObject *pyfloat}
2415Returns a C \ctype{double} representation of the contents of \var{pyfloat}.
2416\end{cfuncdesc}
2417
2418\begin{cfuncdesc}{double}{PyFloat_AS_DOUBLE}{PyObject *pyfloat}
2419Returns a C \ctype{double} representation of the contents of
2420\var{pyfloat}, but without error checking.
2421\end{cfuncdesc}
2422
2423
2424\subsection{Complex Number Objects \label{complexObjects}}
2425
2426\obindex{complex number}
2427Python's complex number objects are implemented as two distinct types
2428when viewed from the C API: one is the Python object exposed to
2429Python programs, and the other is a C structure which represents the
2430actual complex number value. The API provides functions for working
2431with both.
2432
2433\subsubsection{Complex Numbers as C Structures}
2434
2435Note that the functions which accept these structures as parameters
2436and return them as results do so \emph{by value} rather than
2437dereferencing them through pointers. This is consistent throughout
2438the API.
2439
2440\begin{ctypedesc}{Py_complex}
2441The C structure which corresponds to the value portion of a Python
2442complex number object. Most of the functions for dealing with complex
2443number objects use structures of this type as input or output values,
2444as appropriate. It is defined as:
2445
2446\begin{verbatim}
2447typedef struct {
2448 double real;
2449 double imag;
2450} Py_complex;
2451\end{verbatim}
2452\end{ctypedesc}
2453
2454\begin{cfuncdesc}{Py_complex}{_Py_c_sum}{Py_complex left, Py_complex right}
2455Return the sum of two complex numbers, using the C
2456\ctype{Py_complex} representation.
2457\end{cfuncdesc}
2458
2459\begin{cfuncdesc}{Py_complex}{_Py_c_diff}{Py_complex left, Py_complex right}
2460Return the difference between two complex numbers, using the C
2461\ctype{Py_complex} representation.
2462\end{cfuncdesc}
2463
2464\begin{cfuncdesc}{Py_complex}{_Py_c_neg}{Py_complex complex}
2465Return the negation of the complex number \var{complex}, using the C
2466\ctype{Py_complex} representation.
2467\end{cfuncdesc}
2468
2469\begin{cfuncdesc}{Py_complex}{_Py_c_prod}{Py_complex left, Py_complex right}
2470Return the product of two complex numbers, using the C
2471\ctype{Py_complex} representation.
2472\end{cfuncdesc}
2473
2474\begin{cfuncdesc}{Py_complex}{_Py_c_quot}{Py_complex dividend,
2475 Py_complex divisor}
2476Return the quotient of two complex numbers, using the C
2477\ctype{Py_complex} representation.
2478\end{cfuncdesc}
2479
2480\begin{cfuncdesc}{Py_complex}{_Py_c_pow}{Py_complex num, Py_complex exp}
2481Return the exponentiation of \var{num} by \var{exp}, using the C
2482\ctype{Py_complex} representation.
2483\end{cfuncdesc}
2484
2485
2486\subsubsection{Complex Numbers as Python Objects}
2487
2488\begin{ctypedesc}{PyComplexObject}
2489This subtype of \ctype{PyObject} represents a Python complex number object.
2490\end{ctypedesc}
2491
2492\begin{cvardesc}{PyTypeObject}{PyComplex_Type}
2493This instance of \ctype{PyTypeObject} represents the Python complex
2494number type.
2495\end{cvardesc}
2496
2497\begin{cfuncdesc}{int}{PyComplex_Check}{PyObject *p}
2498Returns true if its argument is a \ctype{PyComplexObject}.
2499\end{cfuncdesc}
2500
2501\begin{cfuncdesc}{PyObject*}{PyComplex_FromCComplex}{Py_complex v}
2502Create a new Python complex number object from a C
2503\ctype{Py_complex} value.
2504\end{cfuncdesc}
2505
2506\begin{cfuncdesc}{PyObject*}{PyComplex_FromDoubles}{double real, double imag}
2507Returns a new \ctype{PyComplexObject} object from \var{real} and \var{imag}.
2508\end{cfuncdesc}
2509
2510\begin{cfuncdesc}{double}{PyComplex_RealAsDouble}{PyObject *op}
2511Returns the real part of \var{op} as a C \ctype{double}.
2512\end{cfuncdesc}
2513
2514\begin{cfuncdesc}{double}{PyComplex_ImagAsDouble}{PyObject *op}
2515Returns the imaginary part of \var{op} as a C \ctype{double}.
2516\end{cfuncdesc}
2517
2518\begin{cfuncdesc}{Py_complex}{PyComplex_AsCComplex}{PyObject *op}
2519Returns the \ctype{Py_complex} value of the complex number \var{op}.
2520\end{cfuncdesc}
2521
2522
2523
Fred Drakeefd146c1999-02-15 15:30:45 +00002524\section{Sequence Objects \label{sequenceObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002525
Fred Drake659ebfa2000-04-03 15:42:13 +00002526\obindex{sequence}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002527Generic operations on sequence objects were discussed in the previous
2528chapter; this section deals with the specific kinds of sequence
2529objects that are intrinsic to the Python language.
2530
2531
Fred Drakeefd146c1999-02-15 15:30:45 +00002532\subsection{String Objects \label{stringObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002533
Fred Drake89024422000-10-23 16:00:54 +00002534These functions raise \exception{TypeError} when expecting a string
2535parameter and are called with a non-string parameter.
2536
Fred Drake659ebfa2000-04-03 15:42:13 +00002537\obindex{string}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002538\begin{ctypedesc}{PyStringObject}
Fred Drakef8830d11998-04-23 14:06:01 +00002539This subtype of \ctype{PyObject} represents a Python string object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002540\end{ctypedesc}
2541
2542\begin{cvardesc}{PyTypeObject}{PyString_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00002543This instance of \ctype{PyTypeObject} represents the Python string
2544type; it is the same object as \code{types.TypeType} in the Python
2545layer.\withsubitem{(in module types)}{\ttindex{StringType}}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002546\end{cvardesc}
2547
2548\begin{cfuncdesc}{int}{PyString_Check}{PyObject *o}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002549Returns true if the object \var{o} is a string object.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002550\end{cfuncdesc}
2551
Fred Drakec6fa34e1998-04-02 06:47:24 +00002552\begin{cfuncdesc}{PyObject*}{PyString_FromString}{const char *v}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002553Returns a new string object with the value \var{v} on success, and
2554\NULL{} on failure.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002555\end{cfuncdesc}
2556
Fred Drake659ebfa2000-04-03 15:42:13 +00002557\begin{cfuncdesc}{PyObject*}{PyString_FromStringAndSize}{const char *v,
2558 int len}
2559Returns a new string object with the value \var{v} and length
2560\var{len} on success, and \NULL{} on failure. If \var{v} is \NULL{},
2561the contents of the string are uninitialized.
2562\end{cfuncdesc}
2563
Fred Drakec6fa34e1998-04-02 06:47:24 +00002564\begin{cfuncdesc}{int}{PyString_Size}{PyObject *string}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002565Returns the length of the string in string object \var{string}.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002566\end{cfuncdesc}
2567
Fred Drake659ebfa2000-04-03 15:42:13 +00002568\begin{cfuncdesc}{int}{PyString_GET_SIZE}{PyObject *string}
Fred Drake5d644212000-10-07 12:31:50 +00002569Macro form of \cfunction{PyString_Size()} but without error
Fred Drake659ebfa2000-04-03 15:42:13 +00002570checking.
2571\end{cfuncdesc}
2572
Fred Drakec6fa34e1998-04-02 06:47:24 +00002573\begin{cfuncdesc}{char*}{PyString_AsString}{PyObject *string}
Fred Drake659ebfa2000-04-03 15:42:13 +00002574Returns a null-terminated representation of the contents of
2575\var{string}. The pointer refers to the internal buffer of
Fred Drake89024422000-10-23 16:00:54 +00002576\var{string}, not a copy. The data must not be modified in any way,
2577unless the string was just created using
2578\code{PyString_FromStringAndSize(NULL, \var{size})}.
2579It must not be deallocated.
Fred Drake659ebfa2000-04-03 15:42:13 +00002580\end{cfuncdesc}
2581
2582\begin{cfuncdesc}{char*}{PyString_AS_STRING}{PyObject *string}
2583Macro form of \cfunction{PyString_AsString()} but without error
2584checking.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002585\end{cfuncdesc}
2586
Marc-André Lemburgd1ba4432000-09-19 21:04:18 +00002587\begin{cfuncdesc}{int}{PyString_AsStringAndSize}{PyObject *obj,
2588 char **buffer,
2589 int *length}
2590Returns a null-terminated representation of the contents of the object
2591\var{obj} through the output variables \var{buffer} and \var{length}.
2592
2593The function accepts both string and Unicode objects as input. For
2594Unicode objects it returns the default encoded version of the object.
2595If \var{length} is set to \NULL{}, the resulting buffer may not contain
2596null characters; if it does, the function returns -1 and a
2597TypeError is raised.
2598
2599The buffer refers to an internal string buffer of \var{obj}, not a
Fred Drake89024422000-10-23 16:00:54 +00002600copy. The data must not be modified in any way, unless the string was
2601just created using \code{PyString_FromStringAndSize(NULL,
2602\var{size})}. It must not be deallocated.
Marc-André Lemburgd1ba4432000-09-19 21:04:18 +00002603\end{cfuncdesc}
2604
Fred Drakec6fa34e1998-04-02 06:47:24 +00002605\begin{cfuncdesc}{void}{PyString_Concat}{PyObject **string,
2606 PyObject *newpart}
Fred Drake66b989c1999-02-15 20:15:39 +00002607Creates a new string object in \var{*string} containing the
Fred Drakeddc6c272000-03-31 18:22:38 +00002608contents of \var{newpart} appended to \var{string}; the caller will
2609own the new reference. The reference to the old value of \var{string}
2610will be stolen. If the new string
Fred Drake66b989c1999-02-15 20:15:39 +00002611cannot be created, the old reference to \var{string} will still be
2612discarded and the value of \var{*string} will be set to
2613\NULL{}; the appropriate exception will be set.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002614\end{cfuncdesc}
2615
2616\begin{cfuncdesc}{void}{PyString_ConcatAndDel}{PyObject **string,
2617 PyObject *newpart}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002618Creates a new string object in \var{*string} containing the contents
Guido van Rossum44475131998-04-21 15:30:01 +00002619of \var{newpart} appended to \var{string}. This version decrements
2620the reference count of \var{newpart}.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002621\end{cfuncdesc}
2622
2623\begin{cfuncdesc}{int}{_PyString_Resize}{PyObject **string, int newsize}
Guido van Rossum44475131998-04-21 15:30:01 +00002624A way to resize a string object even though it is ``immutable''.
2625Only use this to build up a brand new string object; don't use this if
2626the string may already be known in other parts of the code.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002627\end{cfuncdesc}
2628
2629\begin{cfuncdesc}{PyObject*}{PyString_Format}{PyObject *format,
2630 PyObject *args}
Guido van Rossum44475131998-04-21 15:30:01 +00002631Returns a new string object from \var{format} and \var{args}. Analogous
Fred Drake659ebfa2000-04-03 15:42:13 +00002632to \code{\var{format} \%\ \var{args}}. The \var{args} argument must be
Guido van Rossum44475131998-04-21 15:30:01 +00002633a tuple.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002634\end{cfuncdesc}
2635
2636\begin{cfuncdesc}{void}{PyString_InternInPlace}{PyObject **string}
Guido van Rossum44475131998-04-21 15:30:01 +00002637Intern the argument \var{*string} in place. The argument must be the
2638address of a pointer variable pointing to a Python string object.
2639If there is an existing interned string that is the same as
2640\var{*string}, it sets \var{*string} to it (decrementing the reference
2641count of the old string object and incrementing the reference count of
2642the interned string object), otherwise it leaves \var{*string} alone
2643and interns it (incrementing its reference count). (Clarification:
2644even though there is a lot of talk about reference counts, think of
Fred Drakef8830d11998-04-23 14:06:01 +00002645this function as reference-count-neutral; you own the object after
2646the call if and only if you owned it before the call.)
Fred Drakec6fa34e1998-04-02 06:47:24 +00002647\end{cfuncdesc}
2648
2649\begin{cfuncdesc}{PyObject*}{PyString_InternFromString}{const char *v}
Fred Drakef8830d11998-04-23 14:06:01 +00002650A combination of \cfunction{PyString_FromString()} and
2651\cfunction{PyString_InternInPlace()}, returning either a new string object
Guido van Rossum44475131998-04-21 15:30:01 +00002652that has been interned, or a new (``owned'') reference to an earlier
2653interned string object with the same value.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002654\end{cfuncdesc}
2655
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002656\begin{cfuncdesc}{PyObject*}{PyString_Decode}{const char *s,
2657 int size,
2658 const char *encoding,
2659 const char *errors}
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002660Creates an object by decoding \var{size} bytes of the encoded
2661buffer \var{s} using the codec registered
2662for \var{encoding}. \var{encoding} and \var{errors} have the same meaning
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002663as the parameters of the same name in the unicode() builtin
2664function. The codec to be used is looked up using the Python codec
2665registry. Returns \NULL{} in case an exception was raised by the
2666codec.
2667\end{cfuncdesc}
2668
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002669\begin{cfuncdesc}{PyObject*}{PyString_AsDecodedObject}{PyObject *str,
2670 const char *encoding,
2671 const char *errors}
2672Decodes a string object by passing it to the codec registered
2673for \var{encoding} and returns the result as Python
2674object. \var{encoding} and \var{errors} have the same meaning as the
2675parameters of the same name in the string .encode() method. The codec
2676to be used is looked up using the Python codec registry. Returns
2677\NULL{} in case an exception was raised by the codec.
2678\end{cfuncdesc}
2679
2680\begin{cfuncdesc}{PyObject*}{PyString_Encode}{const char *s,
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002681 int size,
2682 const char *encoding,
2683 const char *errors}
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002684Encodes the \ctype{char} buffer of the given size by passing it to
2685the codec registered for \var{encoding} and returns a Python object.
2686\var{encoding} and \var{errors} have the same
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002687meaning as the parameters of the same name in the string .encode()
2688method. The codec to be used is looked up using the Python codec
2689registry. Returns \NULL{} in case an exception was raised by the
2690codec.
2691\end{cfuncdesc}
2692
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002693\begin{cfuncdesc}{PyObject*}{PyString_AsEncodedObject}{PyObject *str,
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002694 const char *encoding,
2695 const char *errors}
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002696Encodes a string object using the codec registered
2697for \var{encoding} and returns the result as Python
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002698object. \var{encoding} and \var{errors} have the same meaning as the
2699parameters of the same name in the string .encode() method. The codec
2700to be used is looked up using the Python codec registry. Returns
2701\NULL{} in case an exception was raised by the codec.
2702\end{cfuncdesc}
2703
Fred Drakee5bf8b21998-02-12 21:22:28 +00002704
Fred Drakea4cd2612000-04-06 14:10:29 +00002705\subsection{Unicode Objects \label{unicodeObjects}}
2706\sectionauthor{Marc-Andre Lemburg}{mal@lemburg.com}
2707
2708%--- Unicode Type -------------------------------------------------------
2709
2710These are the basic Unicode object types used for the Unicode
2711implementation in Python:
2712
2713\begin{ctypedesc}{Py_UNICODE}
2714This type represents a 16-bit unsigned storage type which is used by
2715Python internally as basis for holding Unicode ordinals. On platforms
2716where \ctype{wchar_t} is available and also has 16-bits,
2717\ctype{Py_UNICODE} is a typedef alias for \ctype{wchar_t} to enhance
2718native platform compatibility. On all other platforms,
2719\ctype{Py_UNICODE} is a typedef alias for \ctype{unsigned short}.
2720\end{ctypedesc}
2721
2722\begin{ctypedesc}{PyUnicodeObject}
2723This subtype of \ctype{PyObject} represents a Python Unicode object.
2724\end{ctypedesc}
2725
2726\begin{cvardesc}{PyTypeObject}{PyUnicode_Type}
2727This instance of \ctype{PyTypeObject} represents the Python Unicode type.
2728\end{cvardesc}
2729
2730%--- These are really C macros... is there a macrodesc TeX macro ?
2731
2732The following APIs are really C macros and can be used to do fast
2733checks and to access internal read-only data of Unicode objects:
2734
2735\begin{cfuncdesc}{int}{PyUnicode_Check}{PyObject *o}
2736Returns true if the object \var{o} is a Unicode object.
2737\end{cfuncdesc}
2738
2739\begin{cfuncdesc}{int}{PyUnicode_GET_SIZE}{PyObject *o}
2740Returns the size of the object. o has to be a
2741PyUnicodeObject (not checked).
2742\end{cfuncdesc}
2743
2744\begin{cfuncdesc}{int}{PyUnicode_GET_DATA_SIZE}{PyObject *o}
2745Returns the size of the object's internal buffer in bytes. o has to be
2746a PyUnicodeObject (not checked).
2747\end{cfuncdesc}
2748
Fred Drake992fe5a2000-06-16 21:04:15 +00002749\begin{cfuncdesc}{Py_UNICODE*}{PyUnicode_AS_UNICODE}{PyObject *o}
Fred Drakea4cd2612000-04-06 14:10:29 +00002750Returns a pointer to the internal Py_UNICODE buffer of the object. o
2751has to be a PyUnicodeObject (not checked).
2752\end{cfuncdesc}
2753
Fred Drake992fe5a2000-06-16 21:04:15 +00002754\begin{cfuncdesc}{const char*}{PyUnicode_AS_DATA}{PyObject *o}
Fred Drakea4cd2612000-04-06 14:10:29 +00002755Returns a (const char *) pointer to the internal buffer of the object.
2756o has to be a PyUnicodeObject (not checked).
2757\end{cfuncdesc}
2758
2759% --- Unicode character properties ---------------------------------------
2760
2761Unicode provides many different character properties. The most often
2762needed ones are available through these macros which are mapped to C
2763functions depending on the Python configuration.
2764
2765\begin{cfuncdesc}{int}{Py_UNICODE_ISSPACE}{Py_UNICODE ch}
2766Returns 1/0 depending on whether \var{ch} is a whitespace character.
2767\end{cfuncdesc}
2768
2769\begin{cfuncdesc}{int}{Py_UNICODE_ISLOWER}{Py_UNICODE ch}
2770Returns 1/0 depending on whether \var{ch} is a lowercase character.
2771\end{cfuncdesc}
2772
2773\begin{cfuncdesc}{int}{Py_UNICODE_ISUPPER}{Py_UNICODE ch}
Fred Drakeae96aab2000-07-03 13:38:10 +00002774Returns 1/0 depending on whether \var{ch} is an uppercase character.
Fred Drakea4cd2612000-04-06 14:10:29 +00002775\end{cfuncdesc}
2776
2777\begin{cfuncdesc}{int}{Py_UNICODE_ISTITLE}{Py_UNICODE ch}
2778Returns 1/0 depending on whether \var{ch} is a titlecase character.
2779\end{cfuncdesc}
2780
2781\begin{cfuncdesc}{int}{Py_UNICODE_ISLINEBREAK}{Py_UNICODE ch}
2782Returns 1/0 depending on whether \var{ch} is a linebreak character.
2783\end{cfuncdesc}
2784
2785\begin{cfuncdesc}{int}{Py_UNICODE_ISDECIMAL}{Py_UNICODE ch}
2786Returns 1/0 depending on whether \var{ch} is a decimal character.
2787\end{cfuncdesc}
2788
2789\begin{cfuncdesc}{int}{Py_UNICODE_ISDIGIT}{Py_UNICODE ch}
2790Returns 1/0 depending on whether \var{ch} is a digit character.
2791\end{cfuncdesc}
2792
2793\begin{cfuncdesc}{int}{Py_UNICODE_ISNUMERIC}{Py_UNICODE ch}
2794Returns 1/0 depending on whether \var{ch} is a numeric character.
2795\end{cfuncdesc}
2796
Fred Drakeae96aab2000-07-03 13:38:10 +00002797\begin{cfuncdesc}{int}{Py_UNICODE_ISALPHA}{Py_UNICODE ch}
2798Returns 1/0 depending on whether \var{ch} is an alphabetic character.
2799\end{cfuncdesc}
2800
2801\begin{cfuncdesc}{int}{Py_UNICODE_ISALNUM}{Py_UNICODE ch}
2802Returns 1/0 depending on whether \var{ch} is an alphanumeric character.
2803\end{cfuncdesc}
2804
Fred Drakea4cd2612000-04-06 14:10:29 +00002805These APIs can be used for fast direct character conversions:
2806
2807\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOLOWER}{Py_UNICODE ch}
2808Returns the character \var{ch} converted to lower case.
2809\end{cfuncdesc}
2810
2811\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOUPPER}{Py_UNICODE ch}
2812Returns the character \var{ch} converted to upper case.
2813\end{cfuncdesc}
2814
2815\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOTITLE}{Py_UNICODE ch}
2816Returns the character \var{ch} converted to title case.
2817\end{cfuncdesc}
2818
2819\begin{cfuncdesc}{int}{Py_UNICODE_TODECIMAL}{Py_UNICODE ch}
2820Returns the character \var{ch} converted to a decimal positive integer.
2821Returns -1 in case this is not possible. Does not raise exceptions.
2822\end{cfuncdesc}
2823
2824\begin{cfuncdesc}{int}{Py_UNICODE_TODIGIT}{Py_UNICODE ch}
2825Returns the character \var{ch} converted to a single digit integer.
2826Returns -1 in case this is not possible. Does not raise exceptions.
2827\end{cfuncdesc}
2828
2829\begin{cfuncdesc}{double}{Py_UNICODE_TONUMERIC}{Py_UNICODE ch}
2830Returns the character \var{ch} converted to a (positive) double.
2831Returns -1.0 in case this is not possible. Does not raise exceptions.
2832\end{cfuncdesc}
2833
2834% --- Plain Py_UNICODE ---------------------------------------------------
2835
2836To create Unicode objects and access their basic sequence properties,
2837use these APIs:
2838
2839\begin{cfuncdesc}{PyObject*}{PyUnicode_FromUnicode}{const Py_UNICODE *u,
2840 int size}
2841
2842Create a Unicode Object from the Py_UNICODE buffer \var{u} of the
2843given size. \var{u} may be \NULL{} which causes the contents to be
2844undefined. It is the user's responsibility to fill in the needed data.
Marc-André Lemburg8155e0e2001-04-23 14:44:21 +00002845The buffer is copied into the new object. If the buffer is not \NULL{},
2846the return value might be a shared object. Therefore, modification of
2847the resulting Unicode Object is only allowed when \var{u} is \NULL{}.
Fred Drakea4cd2612000-04-06 14:10:29 +00002848\end{cfuncdesc}
2849
Fred Drake1d158692000-06-18 05:21:21 +00002850\begin{cfuncdesc}{Py_UNICODE*}{PyUnicode_AsUnicode}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002851Return a read-only pointer to the Unicode object's internal
2852\ctype{Py_UNICODE} buffer.
2853\end{cfuncdesc}
2854
2855\begin{cfuncdesc}{int}{PyUnicode_GetSize}{PyObject *unicode}
2856Return the length of the Unicode object.
2857\end{cfuncdesc}
2858
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002859\begin{cfuncdesc}{PyObject*}{PyUnicode_FromEncodedObject}{PyObject *obj,
2860 const char *encoding,
2861 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002862
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002863Coerce an encoded object obj to an Unicode object and return a
2864reference with incremented refcount.
Fred Drakea4cd2612000-04-06 14:10:29 +00002865
2866Coercion is done in the following way:
2867\begin{enumerate}
2868\item Unicode objects are passed back as-is with incremented
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002869 refcount. Note: these cannot be decoded; passing a non-NULL
2870 value for encoding will result in a TypeError.
Fred Drakea4cd2612000-04-06 14:10:29 +00002871
2872\item String and other char buffer compatible objects are decoded
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002873 according to the given encoding and using the error handling
2874 defined by errors. Both can be NULL to have the interface use
2875 the default values (see the next section for details).
Fred Drakea4cd2612000-04-06 14:10:29 +00002876
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002877\item All other objects cause an exception.
Fred Drakea4cd2612000-04-06 14:10:29 +00002878\end{enumerate}
2879The API returns NULL in case of an error. The caller is responsible
2880for decref'ing the returned objects.
2881\end{cfuncdesc}
2882
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002883\begin{cfuncdesc}{PyObject*}{PyUnicode_FromObject}{PyObject *obj}
2884
2885Shortcut for PyUnicode_FromEncodedObject(obj, NULL, ``strict'')
2886which is used throughout the interpreter whenever coercion to
2887Unicode is needed.
2888\end{cfuncdesc}
2889
Fred Drakea4cd2612000-04-06 14:10:29 +00002890% --- wchar_t support for platforms which support it ---------------------
2891
2892If the platform supports \ctype{wchar_t} and provides a header file
2893wchar.h, Python can interface directly to this type using the
2894following functions. Support is optimized if Python's own
2895\ctype{Py_UNICODE} type is identical to the system's \ctype{wchar_t}.
2896
2897\begin{cfuncdesc}{PyObject*}{PyUnicode_FromWideChar}{const wchar_t *w,
2898 int size}
2899Create a Unicode Object from the \ctype{whcar_t} buffer \var{w} of the
2900given size. Returns \NULL{} on failure.
2901\end{cfuncdesc}
2902
2903\begin{cfuncdesc}{int}{PyUnicode_AsWideChar}{PyUnicodeObject *unicode,
2904 wchar_t *w,
2905 int size}
Fred Drakea4cd2612000-04-06 14:10:29 +00002906Copies the Unicode Object contents into the \ctype{whcar_t} buffer
2907\var{w}. At most \var{size} \ctype{whcar_t} characters are copied.
2908Returns the number of \ctype{whcar_t} characters copied or -1 in case
2909of an error.
2910\end{cfuncdesc}
2911
2912
2913\subsubsection{Builtin Codecs \label{builtinCodecs}}
2914
2915Python provides a set of builtin codecs which are written in C
2916for speed. All of these codecs are directly usable via the
2917following functions.
2918
2919Many of the following APIs take two arguments encoding and
2920errors. These parameters encoding and errors have the same semantics
2921as the ones of the builtin unicode() Unicode object constructor.
2922
2923Setting encoding to NULL causes the default encoding to be used which
2924is UTF-8.
2925
2926Error handling is set by errors which may also be set to NULL meaning
2927to use the default handling defined for the codec. Default error
2928handling for all builtin codecs is ``strict'' (ValueErrors are raised).
2929
2930The codecs all use a similar interface. Only deviation from the
2931following generic ones are documented for simplicity.
2932
2933% --- Generic Codecs -----------------------------------------------------
2934
2935These are the generic codec APIs:
2936
2937\begin{cfuncdesc}{PyObject*}{PyUnicode_Decode}{const char *s,
2938 int size,
2939 const char *encoding,
2940 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002941Create a Unicode object by decoding \var{size} bytes of the encoded
2942string \var{s}. \var{encoding} and \var{errors} have the same meaning
2943as the parameters of the same name in the unicode() builtin
2944function. The codec to be used is looked up using the Python codec
2945registry. Returns \NULL{} in case an exception was raised by the
2946codec.
2947\end{cfuncdesc}
2948
2949\begin{cfuncdesc}{PyObject*}{PyUnicode_Encode}{const Py_UNICODE *s,
2950 int size,
2951 const char *encoding,
2952 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002953Encodes the \ctype{Py_UNICODE} buffer of the given size and returns a
2954Python string object. \var{encoding} and \var{errors} have the same
2955meaning as the parameters of the same name in the Unicode .encode()
2956method. The codec to be used is looked up using the Python codec
2957registry. Returns \NULL{} in case an exception was raised by the
2958codec.
2959\end{cfuncdesc}
2960
2961\begin{cfuncdesc}{PyObject*}{PyUnicode_AsEncodedString}{PyObject *unicode,
2962 const char *encoding,
2963 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002964Encodes a Unicode object and returns the result as Python string
2965object. \var{encoding} and \var{errors} have the same meaning as the
2966parameters of the same name in the Unicode .encode() method. The codec
2967to be used is looked up using the Python codec registry. Returns
2968\NULL{} in case an exception was raised by the codec.
2969\end{cfuncdesc}
2970
2971% --- UTF-8 Codecs -------------------------------------------------------
2972
2973These are the UTF-8 codec APIs:
2974
2975\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUTF8}{const char *s,
2976 int size,
2977 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002978Creates a Unicode object by decoding \var{size} bytes of the UTF-8
2979encoded string \var{s}. Returns \NULL{} in case an exception was
2980raised by the codec.
2981\end{cfuncdesc}
2982
2983\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUTF8}{const Py_UNICODE *s,
2984 int size,
2985 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002986Encodes the \ctype{Py_UNICODE} buffer of the given size using UTF-8
2987and returns a Python string object. Returns \NULL{} in case an
2988exception was raised by the codec.
2989\end{cfuncdesc}
2990
2991\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUTF8String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002992Encodes a Unicode objects using UTF-8 and returns the result as Python
2993string object. Error handling is ``strict''. Returns
2994\NULL{} in case an exception was raised by the codec.
2995\end{cfuncdesc}
2996
2997% --- UTF-16 Codecs ------------------------------------------------------ */
2998
2999These are the UTF-16 codec APIs:
3000
3001\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUTF16}{const char *s,
3002 int size,
3003 const char *errors,
3004 int *byteorder}
Fred Drakea4cd2612000-04-06 14:10:29 +00003005Decodes \var{length} bytes from a UTF-16 encoded buffer string and
3006returns the corresponding Unicode object.
3007
3008\var{errors} (if non-NULL) defines the error handling. It defaults
3009to ``strict''.
3010
3011If \var{byteorder} is non-\NULL{}, the decoder starts decoding using
3012the given byte order:
3013
3014\begin{verbatim}
3015 *byteorder == -1: little endian
3016 *byteorder == 0: native order
3017 *byteorder == 1: big endian
3018\end{verbatim}
3019
3020and then switches according to all byte order marks (BOM) it finds in
3021the input data. BOM marks are not copied into the resulting Unicode
3022string. After completion, \var{*byteorder} is set to the current byte
3023order at the end of input data.
3024
3025If \var{byteorder} is \NULL{}, the codec starts in native order mode.
3026
3027Returns \NULL{} in case an exception was raised by the codec.
3028\end{cfuncdesc}
3029
3030\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUTF16}{const Py_UNICODE *s,
3031 int size,
3032 const char *errors,
3033 int byteorder}
Fred Drakea4cd2612000-04-06 14:10:29 +00003034Returns a Python string object holding the UTF-16 encoded value of the
3035Unicode data in \var{s}.
3036
Fred Drakea8455ab2000-06-16 19:58:42 +00003037If \var{byteorder} is not \code{0}, output is written according to the
Fred Drakea4cd2612000-04-06 14:10:29 +00003038following byte order:
3039
3040\begin{verbatim}
3041 byteorder == -1: little endian
3042 byteorder == 0: native byte order (writes a BOM mark)
3043 byteorder == 1: big endian
3044\end{verbatim}
3045
Fred Drakea8455ab2000-06-16 19:58:42 +00003046If byteorder is \code{0}, the output string will always start with the
Fred Drakea4cd2612000-04-06 14:10:29 +00003047Unicode BOM mark (U+FEFF). In the other two modes, no BOM mark is
3048prepended.
3049
3050Note that \ctype{Py_UNICODE} data is being interpreted as UTF-16
3051reduced to UCS-2. This trick makes it possible to add full UTF-16
3052capabilities at a later point without comprimising the APIs.
3053
3054Returns \NULL{} in case an exception was raised by the codec.
3055\end{cfuncdesc}
3056
3057\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUTF16String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003058Returns a Python string using the UTF-16 encoding in native byte
3059order. The string always starts with a BOM mark. Error handling is
3060``strict''. Returns \NULL{} in case an exception was raised by the
3061codec.
3062\end{cfuncdesc}
3063
3064% --- Unicode-Escape Codecs ----------------------------------------------
3065
3066These are the ``Unicode Esacpe'' codec APIs:
3067
3068\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUnicodeEscape}{const char *s,
3069 int size,
3070 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003071Creates a Unicode object by decoding \var{size} bytes of the Unicode-Esacpe
3072encoded string \var{s}. Returns \NULL{} in case an exception was
3073raised by the codec.
3074\end{cfuncdesc}
3075
3076\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUnicodeEscape}{const Py_UNICODE *s,
3077 int size,
3078 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003079Encodes the \ctype{Py_UNICODE} buffer of the given size using Unicode-Escape
3080and returns a Python string object. Returns \NULL{} in case an
3081exception was raised by the codec.
3082\end{cfuncdesc}
3083
3084\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUnicodeEscapeString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003085Encodes a Unicode objects using Unicode-Escape and returns the result
3086as Python string object. Error handling is ``strict''. Returns
3087\NULL{} in case an exception was raised by the codec.
3088\end{cfuncdesc}
3089
3090% --- Raw-Unicode-Escape Codecs ------------------------------------------
3091
3092These are the ``Raw Unicode Esacpe'' codec APIs:
3093
3094\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeRawUnicodeEscape}{const char *s,
3095 int size,
3096 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003097Creates a Unicode object by decoding \var{size} bytes of the Raw-Unicode-Esacpe
3098encoded string \var{s}. Returns \NULL{} in case an exception was
3099raised by the codec.
3100\end{cfuncdesc}
3101
3102\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeRawUnicodeEscape}{const Py_UNICODE *s,
3103 int size,
3104 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003105Encodes the \ctype{Py_UNICODE} buffer of the given size using Raw-Unicode-Escape
3106and returns a Python string object. Returns \NULL{} in case an
3107exception was raised by the codec.
3108\end{cfuncdesc}
3109
3110\begin{cfuncdesc}{PyObject*}{PyUnicode_AsRawUnicodeEscapeString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003111Encodes a Unicode objects using Raw-Unicode-Escape and returns the result
3112as Python string object. Error handling is ``strict''. Returns
3113\NULL{} in case an exception was raised by the codec.
3114\end{cfuncdesc}
3115
3116% --- Latin-1 Codecs -----------------------------------------------------
3117
3118These are the Latin-1 codec APIs:
3119
3120Latin-1 corresponds to the first 256 Unicode ordinals and only these
3121are accepted by the codecs during encoding.
3122
3123\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeLatin1}{const char *s,
Fred Drake1d158692000-06-18 05:21:21 +00003124 int size,
3125 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003126Creates a Unicode object by decoding \var{size} bytes of the Latin-1
3127encoded string \var{s}. Returns \NULL{} in case an exception was
3128raised by the codec.
3129\end{cfuncdesc}
3130
3131\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeLatin1}{const Py_UNICODE *s,
Fred Drake1d158692000-06-18 05:21:21 +00003132 int size,
3133 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003134Encodes the \ctype{Py_UNICODE} buffer of the given size using Latin-1
3135and returns a Python string object. Returns \NULL{} in case an
3136exception was raised by the codec.
3137\end{cfuncdesc}
3138
3139\begin{cfuncdesc}{PyObject*}{PyUnicode_AsLatin1String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003140Encodes a Unicode objects using Latin-1 and returns the result as
3141Python string object. Error handling is ``strict''. Returns
3142\NULL{} in case an exception was raised by the codec.
3143\end{cfuncdesc}
3144
3145% --- ASCII Codecs -------------------------------------------------------
3146
Fred Drake1d158692000-06-18 05:21:21 +00003147These are the \ASCII{} codec APIs. Only 7-bit \ASCII{} data is
3148accepted. All other codes generate errors.
Fred Drakea4cd2612000-04-06 14:10:29 +00003149
3150\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeASCII}{const char *s,
Fred Drake1d158692000-06-18 05:21:21 +00003151 int size,
3152 const char *errors}
3153Creates a Unicode object by decoding \var{size} bytes of the
3154\ASCII{} encoded string \var{s}. Returns \NULL{} in case an exception
3155was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003156\end{cfuncdesc}
3157
3158\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeASCII}{const Py_UNICODE *s,
Fred Drake1d158692000-06-18 05:21:21 +00003159 int size,
3160 const char *errors}
3161Encodes the \ctype{Py_UNICODE} buffer of the given size using
3162\ASCII{} and returns a Python string object. Returns \NULL{} in case
3163an exception was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003164\end{cfuncdesc}
3165
3166\begin{cfuncdesc}{PyObject*}{PyUnicode_AsASCIIString}{PyObject *unicode}
Fred Drake1d158692000-06-18 05:21:21 +00003167Encodes a Unicode objects using \ASCII{} and returns the result as Python
Fred Drakea4cd2612000-04-06 14:10:29 +00003168string object. Error handling is ``strict''. Returns
3169\NULL{} in case an exception was raised by the codec.
3170\end{cfuncdesc}
3171
3172% --- Character Map Codecs -----------------------------------------------
3173
3174These are the mapping codec APIs:
3175
3176This codec is special in that it can be used to implement many
3177different codecs (and this is in fact what was done to obtain most of
3178the standard codecs included in the \module{encodings} package). The
3179codec uses mapping to encode and decode characters.
3180
3181Decoding mappings must map single string characters to single Unicode
3182characters, integers (which are then interpreted as Unicode ordinals)
3183or None (meaning "undefined mapping" and causing an error).
3184
3185Encoding mappings must map single Unicode characters to single string
3186characters, integers (which are then interpreted as Latin-1 ordinals)
3187or None (meaning "undefined mapping" and causing an error).
3188
3189The mapping objects provided must only support the __getitem__ mapping
3190interface.
3191
3192If a character lookup fails with a LookupError, the character is
3193copied as-is meaning that its ordinal value will be interpreted as
3194Unicode or Latin-1 ordinal resp. Because of this, mappings only need
3195to contain those mappings which map characters to different code
3196points.
3197
3198\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeCharmap}{const char *s,
3199 int size,
3200 PyObject *mapping,
3201 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003202Creates a Unicode object by decoding \var{size} bytes of the encoded
3203string \var{s} using the given \var{mapping} object. Returns \NULL{}
3204in case an exception was raised by the codec.
3205\end{cfuncdesc}
3206
3207\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeCharmap}{const Py_UNICODE *s,
3208 int size,
3209 PyObject *mapping,
3210 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003211Encodes the \ctype{Py_UNICODE} buffer of the given size using the
3212given \var{mapping} object and returns a Python string object.
3213Returns \NULL{} in case an exception was raised by the codec.
3214\end{cfuncdesc}
3215
3216\begin{cfuncdesc}{PyObject*}{PyUnicode_AsCharmapString}{PyObject *unicode,
3217 PyObject *mapping}
Fred Drakea4cd2612000-04-06 14:10:29 +00003218Encodes a Unicode objects using the given \var{mapping} object and
3219returns the result as Python string object. Error handling is
3220``strict''. Returns \NULL{} in case an exception was raised by the
3221codec.
3222\end{cfuncdesc}
3223
3224The following codec API is special in that maps Unicode to Unicode.
3225
3226\begin{cfuncdesc}{PyObject*}{PyUnicode_TranslateCharmap}{const Py_UNICODE *s,
3227 int size,
3228 PyObject *table,
3229 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003230Translates a \ctype{Py_UNICODE} buffer of the given length by applying
3231a character mapping \var{table} to it and returns the resulting
Fred Drake1d158692000-06-18 05:21:21 +00003232Unicode object. Returns \NULL{} when an exception was raised by the
3233codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003234
3235The \var{mapping} table must map Unicode ordinal integers to Unicode
3236ordinal integers or None (causing deletion of the character).
3237
3238Mapping tables must only provide the __getitem__ interface,
3239e.g. dictionaries or sequences. Unmapped character ordinals (ones
3240which cause a LookupError) are left untouched and are copied as-is.
Fred Drakea4cd2612000-04-06 14:10:29 +00003241\end{cfuncdesc}
3242
3243% --- MBCS codecs for Windows --------------------------------------------
3244
Fred Drake1d158692000-06-18 05:21:21 +00003245These are the MBCS codec APIs. They are currently only available on
Fred Drakea4cd2612000-04-06 14:10:29 +00003246Windows and use the Win32 MBCS converters to implement the
Fred Drake1d158692000-06-18 05:21:21 +00003247conversions. Note that MBCS (or DBCS) is a class of encodings, not
3248just one. The target encoding is defined by the user settings on the
3249machine running the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003250
3251\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeMBCS}{const char *s,
3252 int size,
3253 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003254Creates a Unicode object by decoding \var{size} bytes of the MBCS
Fred Drake1d158692000-06-18 05:21:21 +00003255encoded string \var{s}. Returns \NULL{} in case an exception was
Fred Drakea4cd2612000-04-06 14:10:29 +00003256raised by the codec.
3257\end{cfuncdesc}
3258
3259\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeMBCS}{const Py_UNICODE *s,
3260 int size,
3261 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003262Encodes the \ctype{Py_UNICODE} buffer of the given size using MBCS
3263and returns a Python string object. Returns \NULL{} in case an
3264exception was raised by the codec.
3265\end{cfuncdesc}
3266
3267\begin{cfuncdesc}{PyObject*}{PyUnicode_AsMBCSString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003268Encodes a Unicode objects using MBCS and returns the result as Python
Fred Drake1d158692000-06-18 05:21:21 +00003269string object. Error handling is ``strict''. Returns \NULL{} in case
3270an exception was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003271\end{cfuncdesc}
3272
3273% --- Methods & Slots ----------------------------------------------------
3274
3275\subsubsection{Methods and Slot Functions \label{unicodeMethodsAndSlots}}
3276
3277The following APIs are capable of handling Unicode objects and strings
3278on input (we refer to them as strings in the descriptions) and return
3279Unicode objects or integers as apporpriate.
3280
3281They all return \NULL{} or -1 in case an exception occurrs.
3282
3283\begin{cfuncdesc}{PyObject*}{PyUnicode_Concat}{PyObject *left,
3284 PyObject *right}
Fred Drakea4cd2612000-04-06 14:10:29 +00003285Concat two strings giving a new Unicode string.
3286\end{cfuncdesc}
3287
3288\begin{cfuncdesc}{PyObject*}{PyUnicode_Split}{PyObject *s,
3289 PyObject *sep,
3290 int maxsplit}
Fred Drakea4cd2612000-04-06 14:10:29 +00003291Split a string giving a list of Unicode strings.
3292
3293If sep is NULL, splitting will be done at all whitespace
3294substrings. Otherwise, splits occur at the given separator.
3295
3296At most maxsplit splits will be done. If negative, no limit is set.
3297
3298Separators are not included in the resulting list.
3299\end{cfuncdesc}
3300
3301\begin{cfuncdesc}{PyObject*}{PyUnicode_Splitlines}{PyObject *s,
3302 int maxsplit}
Fred Drake1d158692000-06-18 05:21:21 +00003303Split a Unicode string at line breaks, returning a list of Unicode
3304strings. CRLF is considered to be one line break. The Line break
3305characters are not included in the resulting strings.
Fred Drakea4cd2612000-04-06 14:10:29 +00003306\end{cfuncdesc}
3307
3308\begin{cfuncdesc}{PyObject*}{PyUnicode_Translate}{PyObject *str,
3309 PyObject *table,
3310 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003311Translate a string by applying a character mapping table to it and
3312return the resulting Unicode object.
3313
3314The mapping table must map Unicode ordinal integers to Unicode ordinal
3315integers or None (causing deletion of the character).
3316
3317Mapping tables must only provide the __getitem__ interface,
3318e.g. dictionaries or sequences. Unmapped character ordinals (ones
3319which cause a LookupError) are left untouched and are copied as-is.
3320
3321\var{errors} has the usual meaning for codecs. It may be \NULL{}
3322which indicates to use the default error handling.
Fred Drakea4cd2612000-04-06 14:10:29 +00003323\end{cfuncdesc}
3324
3325\begin{cfuncdesc}{PyObject*}{PyUnicode_Join}{PyObject *separator,
3326 PyObject *seq}
Fred Drakea4cd2612000-04-06 14:10:29 +00003327Join a sequence of strings using the given separator and return
3328the resulting Unicode string.
3329\end{cfuncdesc}
3330
3331\begin{cfuncdesc}{PyObject*}{PyUnicode_Tailmatch}{PyObject *str,
3332 PyObject *substr,
3333 int start,
3334 int end,
3335 int direction}
Fred Drakea4cd2612000-04-06 14:10:29 +00003336Return 1 if \var{substr} matches \var{str}[\var{start}:\var{end}] at
3337the given tail end (\var{direction} == -1 means to do a prefix match,
3338\var{direction} == 1 a suffix match), 0 otherwise.
3339\end{cfuncdesc}
3340
3341\begin{cfuncdesc}{PyObject*}{PyUnicode_Find}{PyObject *str,
3342 PyObject *substr,
3343 int start,
3344 int end,
3345 int direction}
Fred Drakea4cd2612000-04-06 14:10:29 +00003346Return the first position of \var{substr} in
3347\var{str}[\var{start}:\var{end}] using the given \var{direction}
3348(\var{direction} == 1 means to do a forward search,
3349\var{direction} == -1 a backward search), 0 otherwise.
3350\end{cfuncdesc}
3351
3352\begin{cfuncdesc}{PyObject*}{PyUnicode_Count}{PyObject *str,
3353 PyObject *substr,
3354 int start,
3355 int end}
Fred Drakea4cd2612000-04-06 14:10:29 +00003356Count the number of occurrences of \var{substr} in
3357\var{str}[\var{start}:\var{end}]
3358\end{cfuncdesc}
3359
3360\begin{cfuncdesc}{PyObject*}{PyUnicode_Replace}{PyObject *str,
3361 PyObject *substr,
3362 PyObject *replstr,
3363 int maxcount}
Fred Drakea4cd2612000-04-06 14:10:29 +00003364Replace at most \var{maxcount} occurrences of \var{substr} in
3365\var{str} with \var{replstr} and return the resulting Unicode object.
3366\var{maxcount} == -1 means: replace all occurrences.
3367\end{cfuncdesc}
3368
Fred Drake1d158692000-06-18 05:21:21 +00003369\begin{cfuncdesc}{int}{PyUnicode_Compare}{PyObject *left, PyObject *right}
Fred Drakea4cd2612000-04-06 14:10:29 +00003370Compare two strings and return -1, 0, 1 for less than, equal,
3371greater than resp.
3372\end{cfuncdesc}
3373
3374\begin{cfuncdesc}{PyObject*}{PyUnicode_Format}{PyObject *format,
3375 PyObject *args}
Fred Drake1d158692000-06-18 05:21:21 +00003376Returns a new string object from \var{format} and \var{args}; this is
3377analogous to \code{\var{format} \%\ \var{args}}. The
3378\var{args} argument must be a tuple.
Fred Drakea4cd2612000-04-06 14:10:29 +00003379\end{cfuncdesc}
3380
3381\begin{cfuncdesc}{int}{PyUnicode_Contains}{PyObject *container,
3382 PyObject *element}
Fred Drakea4cd2612000-04-06 14:10:29 +00003383Checks whether \var{element} is contained in \var{container} and
Fred Drake1d158692000-06-18 05:21:21 +00003384returns true or false accordingly.
Fred Drakea4cd2612000-04-06 14:10:29 +00003385
Fred Drake1d158692000-06-18 05:21:21 +00003386\var{element} has to coerce to a one element Unicode string. \code{-1} is
Fred Drakea4cd2612000-04-06 14:10:29 +00003387returned in case of an error.
3388\end{cfuncdesc}
3389
3390
Fred Drake58c5a2a1999-08-04 13:13:24 +00003391\subsection{Buffer Objects \label{bufferObjects}}
Fred Drake659ebfa2000-04-03 15:42:13 +00003392\sectionauthor{Greg Stein}{gstein@lyra.org}
Fred Drake58c5a2a1999-08-04 13:13:24 +00003393
Fred Drake659ebfa2000-04-03 15:42:13 +00003394\obindex{buffer}
3395Python objects implemented in C can export a group of functions called
3396the ``buffer\index{buffer interface} interface.'' These functions can
3397be used by an object to expose its data in a raw, byte-oriented
3398format. Clients of the object can use the buffer interface to access
3399the object data directly, without needing to copy it first.
3400
3401Two examples of objects that support
3402the buffer interface are strings and arrays. The string object exposes
3403the character contents in the buffer interface's byte-oriented
3404form. An array can also expose its contents, but it should be noted
3405that array elements may be multi-byte values.
3406
3407An example user of the buffer interface is the file object's
3408\method{write()} method. Any object that can export a series of bytes
3409through the buffer interface can be written to a file. There are a
Fred Drake88fdaa72001-07-20 20:56:11 +00003410number of format codes to \cfunction{PyArg_ParseTuple()} that operate
Fred Drake659ebfa2000-04-03 15:42:13 +00003411against an object's buffer interface, returning data from the target
3412object.
3413
3414More information on the buffer interface is provided in the section
3415``Buffer Object Structures'' (section \ref{buffer-structs}), under
3416the description for \ctype{PyBufferProcs}\ttindex{PyBufferProcs}.
3417
3418A ``buffer object'' is defined in the \file{bufferobject.h} header
3419(included by \file{Python.h}). These objects look very similar to
3420string objects at the Python programming level: they support slicing,
3421indexing, concatenation, and some other standard string
3422operations. However, their data can come from one of two sources: from
3423a block of memory, or from another object which exports the buffer
3424interface.
3425
3426Buffer objects are useful as a way to expose the data from another
3427object's buffer interface to the Python programmer. They can also be
3428used as a zero-copy slicing mechanism. Using their ability to
3429reference a block of memory, it is possible to expose any data to the
3430Python programmer quite easily. The memory could be a large, constant
3431array in a C extension, it could be a raw block of memory for
3432manipulation before passing to an operating system library, or it
3433could be used to pass around structured data in its native, in-memory
3434format.
3435
3436\begin{ctypedesc}{PyBufferObject}
3437This subtype of \ctype{PyObject} represents a buffer object.
3438\end{ctypedesc}
Fred Drake58c5a2a1999-08-04 13:13:24 +00003439
3440\begin{cvardesc}{PyTypeObject}{PyBuffer_Type}
3441The instance of \ctype{PyTypeObject} which represents the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00003442buffer type; it is the same object as \code{types.BufferType} in the
3443Python layer.\withsubitem{(in module types)}{\ttindex{BufferType}}.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003444\end{cvardesc}
3445
3446\begin{cvardesc}{int}{Py_END_OF_BUFFER}
Fred Drake659ebfa2000-04-03 15:42:13 +00003447This constant may be passed as the \var{size} parameter to
3448\cfunction{PyBuffer_FromObject()} or
3449\cfunction{PyBuffer_FromReadWriteObject()}. It indicates that the new
3450\ctype{PyBufferObject} should refer to \var{base} object from the
3451specified \var{offset} to the end of its exported buffer. Using this
3452enables the caller to avoid querying the \var{base} object for its
3453length.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003454\end{cvardesc}
3455
3456\begin{cfuncdesc}{int}{PyBuffer_Check}{PyObject *p}
3457Return true if the argument has type \cdata{PyBuffer_Type}.
3458\end{cfuncdesc}
3459
3460\begin{cfuncdesc}{PyObject*}{PyBuffer_FromObject}{PyObject *base,
3461 int offset, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00003462Return a new read-only buffer object. This raises
3463\exception{TypeError} if \var{base} doesn't support the read-only
3464buffer protocol or doesn't provide exactly one buffer segment, or it
3465raises \exception{ValueError} if \var{offset} is less than zero. The
3466buffer will hold a reference to the \var{base} object, and the
3467buffer's contents will refer to the \var{base} object's buffer
3468interface, starting as position \var{offset} and extending for
3469\var{size} bytes. If \var{size} is \constant{Py_END_OF_BUFFER}, then
3470the new buffer's contents extend to the length of the
3471\var{base} object's exported buffer data.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003472\end{cfuncdesc}
3473
3474\begin{cfuncdesc}{PyObject*}{PyBuffer_FromReadWriteObject}{PyObject *base,
3475 int offset,
3476 int size}
3477Return a new writable buffer object. Parameters and exceptions are
3478similar to those for \cfunction{PyBuffer_FromObject()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003479If the \var{base} object does not export the writeable buffer
3480protocol, then \exception{TypeError} is raised.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003481\end{cfuncdesc}
3482
3483\begin{cfuncdesc}{PyObject*}{PyBuffer_FromMemory}{void *ptr, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00003484Return a new read-only buffer object that reads from a specified
3485location in memory, with a specified size.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003486The caller is responsible for ensuring that the memory buffer, passed
3487in as \var{ptr}, is not deallocated while the returned buffer object
3488exists. Raises \exception{ValueError} if \var{size} is less than
Fred Drake659ebfa2000-04-03 15:42:13 +00003489zero. Note that \constant{Py_END_OF_BUFFER} may \emph{not} be passed
3490for the \var{size} parameter; \exception{ValueError} will be raised in
3491that case.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003492\end{cfuncdesc}
3493
3494\begin{cfuncdesc}{PyObject*}{PyBuffer_FromReadWriteMemory}{void *ptr, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00003495Similar to \cfunction{PyBuffer_FromMemory()}, but the returned buffer
3496is writable.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003497\end{cfuncdesc}
3498
3499\begin{cfuncdesc}{PyObject*}{PyBuffer_New}{int size}
3500Returns a new writable buffer object that maintains its own memory
Fred Drake659ebfa2000-04-03 15:42:13 +00003501buffer of \var{size} bytes. \exception{ValueError} is returned if
3502\var{size} is not zero or positive.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003503\end{cfuncdesc}
3504
Guido van Rossum44475131998-04-21 15:30:01 +00003505
Fred Drakeefd146c1999-02-15 15:30:45 +00003506\subsection{Tuple Objects \label{tupleObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003507
Fred Drake659ebfa2000-04-03 15:42:13 +00003508\obindex{tuple}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003509\begin{ctypedesc}{PyTupleObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003510This subtype of \ctype{PyObject} represents a Python tuple object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003511\end{ctypedesc}
3512
3513\begin{cvardesc}{PyTypeObject}{PyTuple_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003514This instance of \ctype{PyTypeObject} represents the Python tuple
3515type; it is the same object as \code{types.TupleType} in the Python
3516layer.\withsubitem{(in module types)}{\ttindex{TupleType}}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003517\end{cvardesc}
3518
3519\begin{cfuncdesc}{int}{PyTuple_Check}{PyObject *p}
3520Return true if the argument is a tuple object.
3521\end{cfuncdesc}
3522
Fred Drake659ebfa2000-04-03 15:42:13 +00003523\begin{cfuncdesc}{PyObject*}{PyTuple_New}{int len}
3524Return a new tuple object of size \var{len}, or \NULL{} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003525\end{cfuncdesc}
3526
Fred Drakea05460c2001-02-12 17:38:18 +00003527\begin{cfuncdesc}{int}{PyTuple_Size}{PyObject *p}
Fred Drakee058b4f1998-02-16 06:15:35 +00003528Takes a pointer to a tuple object, and returns the size
Fred Drakee5bf8b21998-02-12 21:22:28 +00003529of that tuple.
3530\end{cfuncdesc}
3531
Fred Drakea05460c2001-02-12 17:38:18 +00003532\begin{cfuncdesc}{PyObject*}{PyTuple_GetItem}{PyObject *p, int pos}
Fred Drakee058b4f1998-02-16 06:15:35 +00003533Returns the object at position \var{pos} in the tuple pointed
3534to by \var{p}. If \var{pos} is out of bounds, returns \NULL{} and
Fred Drake659ebfa2000-04-03 15:42:13 +00003535sets an \exception{IndexError} exception.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003536\end{cfuncdesc}
3537
Fred Drakea05460c2001-02-12 17:38:18 +00003538\begin{cfuncdesc}{PyObject*}{PyTuple_GET_ITEM}{PyObject *p, int pos}
Fred Drakefac312f2001-05-29 15:13:00 +00003539Like \cfunction{PyTuple_GetItem()}, but does no checking of its
3540arguments.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003541\end{cfuncdesc}
3542
Fred Drakea05460c2001-02-12 17:38:18 +00003543\begin{cfuncdesc}{PyObject*}{PyTuple_GetSlice}{PyObject *p,
3544 int low, int high}
Fred Drakee058b4f1998-02-16 06:15:35 +00003545Takes a slice of the tuple pointed to by \var{p} from
3546\var{low} to \var{high} and returns it as a new tuple.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003547\end{cfuncdesc}
3548
Fred Drake659ebfa2000-04-03 15:42:13 +00003549\begin{cfuncdesc}{int}{PyTuple_SetItem}{PyObject *p,
3550 int pos, PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00003551Inserts a reference to object \var{o} at position \var{pos} of
3552the tuple pointed to by \var{p}. It returns \code{0} on success.
Fred Drake659ebfa2000-04-03 15:42:13 +00003553\strong{Note:} This function ``steals'' a reference to \var{o}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003554\end{cfuncdesc}
3555
Fred Drake659ebfa2000-04-03 15:42:13 +00003556\begin{cfuncdesc}{void}{PyTuple_SET_ITEM}{PyObject *p,
3557 int pos, PyObject *o}
Fred Drakefac312f2001-05-29 15:13:00 +00003558Like \cfunction{PyTuple_SetItem()}, but does no error checking, and
Fred Drakee5bf8b21998-02-12 21:22:28 +00003559should \emph{only} be used to fill in brand new tuples.
Fred Drake659ebfa2000-04-03 15:42:13 +00003560\strong{Note:} This function ``steals'' a reference to \var{o}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003561\end{cfuncdesc}
3562
Fred Drakefac312f2001-05-29 15:13:00 +00003563\begin{cfuncdesc}{int}{_PyTuple_Resize}{PyObject **p, int newsize}
Fred Drake659ebfa2000-04-03 15:42:13 +00003564Can be used to resize a tuple. \var{newsize} will be the new length
3565of the tuple. Because tuples are \emph{supposed} to be immutable,
3566this should only be used if there is only one reference to the object.
3567Do \emph{not} use this if the tuple may already be known to some other
Fred Drakefac312f2001-05-29 15:13:00 +00003568part of the code. The tuple will always grow or shrink at the end.
3569Think of this as destroying the old tuple and creating a new one, only
3570more efficiently. Returns \code{0} on success. Client code should
3571never assume that the resulting value of \code{*\var{p}} will be the
3572same as before calling this function. If the object referenced by
3573\code{*\var{p}} is replaced, the original \code{*\var{p}} is
3574destroyed. On failure, returns \code{-1} and sets \code{*\var{p}} to
3575\NULL, and raises \exception{MemoryError} or \exception{SystemError}.
3576\versionchanged[Removed unused third parameter, \var{last_is_sticky}]{2.2}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003577\end{cfuncdesc}
3578
3579
Fred Drakeefd146c1999-02-15 15:30:45 +00003580\subsection{List Objects \label{listObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003581
Fred Drake659ebfa2000-04-03 15:42:13 +00003582\obindex{list}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003583\begin{ctypedesc}{PyListObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003584This subtype of \ctype{PyObject} represents a Python list object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003585\end{ctypedesc}
3586
3587\begin{cvardesc}{PyTypeObject}{PyList_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003588This instance of \ctype{PyTypeObject} represents the Python list
3589type. This is the same object as \code{types.ListType}.
3590\withsubitem{(in module types)}{\ttindex{ListType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003591\end{cvardesc}
3592
3593\begin{cfuncdesc}{int}{PyList_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003594Returns true if its argument is a \ctype{PyListObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003595\end{cfuncdesc}
3596
Fred Drake659ebfa2000-04-03 15:42:13 +00003597\begin{cfuncdesc}{PyObject*}{PyList_New}{int len}
3598Returns a new list of length \var{len} on success, or \NULL{} on
Guido van Rossum3c4378b1998-04-14 20:21:10 +00003599failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003600\end{cfuncdesc}
3601
Fred Drakec6fa34e1998-04-02 06:47:24 +00003602\begin{cfuncdesc}{int}{PyList_Size}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003603Returns the length of the list object in \var{list}; this is
3604equivalent to \samp{len(\var{list})} on a list object.
3605\bifuncindex{len}
3606\end{cfuncdesc}
3607
3608\begin{cfuncdesc}{int}{PyList_GET_SIZE}{PyObject *list}
Fred Drake5d644212000-10-07 12:31:50 +00003609Macro form of \cfunction{PyList_Size()} without error checking.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003610\end{cfuncdesc}
3611
Fred Drakec6fa34e1998-04-02 06:47:24 +00003612\begin{cfuncdesc}{PyObject*}{PyList_GetItem}{PyObject *list, int index}
Guido van Rossum44475131998-04-21 15:30:01 +00003613Returns the object at position \var{pos} in the list pointed
3614to by \var{p}. If \var{pos} is out of bounds, returns \NULL{} and
Fred Drake659ebfa2000-04-03 15:42:13 +00003615sets an \exception{IndexError} exception.
3616\end{cfuncdesc}
3617
3618\begin{cfuncdesc}{PyObject*}{PyList_GET_ITEM}{PyObject *list, int i}
3619Macro form of \cfunction{PyList_GetItem()} without error checking.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003620\end{cfuncdesc}
3621
Fred Drakec6fa34e1998-04-02 06:47:24 +00003622\begin{cfuncdesc}{int}{PyList_SetItem}{PyObject *list, int index,
3623 PyObject *item}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00003624Sets the item at index \var{index} in list to \var{item}.
Fred Drakebab29652001-07-10 16:10:08 +00003625Returns \code{0} on success or \code{-1} on failure.
Fred Drake00d0cb62001-06-03 03:12:57 +00003626\strong{Note:} This function ``steals'' a reference to \var{item} and
3627discards a reference to an item already in the list at the affected
3628position.
Fred Drake659ebfa2000-04-03 15:42:13 +00003629\end{cfuncdesc}
3630
Fred Drakebab29652001-07-10 16:10:08 +00003631\begin{cfuncdesc}{void}{PyList_SET_ITEM}{PyObject *list, int i,
Fred Drake659ebfa2000-04-03 15:42:13 +00003632 PyObject *o}
3633Macro form of \cfunction{PyList_SetItem()} without error checking.
Fred Drake00d0cb62001-06-03 03:12:57 +00003634\strong{Note:} This function ``steals'' a reference to \var{item},
3635and, unlike \cfunction{PyList_SetItem()}, does \emph{not} discard a
Fred Drakebab29652001-07-10 16:10:08 +00003636reference to any item that it being replaced; any reference in
3637\var{list} at position \var{i} will be leaked. This is normally only
3638used to fill in new lists where there is no previous content.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003639\end{cfuncdesc}
3640
Fred Drakec6fa34e1998-04-02 06:47:24 +00003641\begin{cfuncdesc}{int}{PyList_Insert}{PyObject *list, int index,
Guido van Rossum44475131998-04-21 15:30:01 +00003642 PyObject *item}
3643Inserts the item \var{item} into list \var{list} in front of index
Fred Drake659ebfa2000-04-03 15:42:13 +00003644\var{index}. Returns \code{0} if successful; returns \code{-1} and
3645raises an exception if unsuccessful. Analogous to
3646\code{\var{list}.insert(\var{index}, \var{item})}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003647\end{cfuncdesc}
3648
Fred Drakec6fa34e1998-04-02 06:47:24 +00003649\begin{cfuncdesc}{int}{PyList_Append}{PyObject *list, PyObject *item}
Guido van Rossum44475131998-04-21 15:30:01 +00003650Appends the object \var{item} at the end of list \var{list}. Returns
Fred Drake659ebfa2000-04-03 15:42:13 +00003651\code{0} if successful; returns \code{-1} and sets an exception if
3652unsuccessful. Analogous to \code{\var{list}.append(\var{item})}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003653\end{cfuncdesc}
3654
Fred Drakec6fa34e1998-04-02 06:47:24 +00003655\begin{cfuncdesc}{PyObject*}{PyList_GetSlice}{PyObject *list,
3656 int low, int high}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00003657Returns a list of the objects in \var{list} containing the objects
Guido van Rossum44475131998-04-21 15:30:01 +00003658\emph{between} \var{low} and \var{high}. Returns NULL and sets an
3659exception if unsuccessful.
Fred Drake659ebfa2000-04-03 15:42:13 +00003660Analogous to \code{\var{list}[\var{low}:\var{high}]}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003661\end{cfuncdesc}
3662
Fred Drakec6fa34e1998-04-02 06:47:24 +00003663\begin{cfuncdesc}{int}{PyList_SetSlice}{PyObject *list,
3664 int low, int high,
3665 PyObject *itemlist}
Fred Drake659ebfa2000-04-03 15:42:13 +00003666Sets the slice of \var{list} between \var{low} and \var{high} to the
3667contents of \var{itemlist}. Analogous to
3668\code{\var{list}[\var{low}:\var{high}] = \var{itemlist}}. Returns
3669\code{0} on success, \code{-1} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003670\end{cfuncdesc}
3671
Fred Drakec6fa34e1998-04-02 06:47:24 +00003672\begin{cfuncdesc}{int}{PyList_Sort}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003673Sorts the items of \var{list} in place. Returns \code{0} on success,
3674\code{-1} on failure. This is equivalent to
3675\samp{\var{list}.sort()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003676\end{cfuncdesc}
3677
Fred Drakec6fa34e1998-04-02 06:47:24 +00003678\begin{cfuncdesc}{int}{PyList_Reverse}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003679Reverses the items of \var{list} in place. Returns \code{0} on
3680success, \code{-1} on failure. This is the equivalent of
3681\samp{\var{list}.reverse()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003682\end{cfuncdesc}
3683
Fred Drakec6fa34e1998-04-02 06:47:24 +00003684\begin{cfuncdesc}{PyObject*}{PyList_AsTuple}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003685Returns a new tuple object containing the contents of \var{list};
3686equivalent to \samp{tuple(\var{list})}.\bifuncindex{tuple}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003687\end{cfuncdesc}
3688
3689
Fred Drakeefd146c1999-02-15 15:30:45 +00003690\section{Mapping Objects \label{mapObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003691
Fred Drake659ebfa2000-04-03 15:42:13 +00003692\obindex{mapping}
3693
3694
Fred Drakeefd146c1999-02-15 15:30:45 +00003695\subsection{Dictionary Objects \label{dictObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003696
Fred Drake659ebfa2000-04-03 15:42:13 +00003697\obindex{dictionary}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003698\begin{ctypedesc}{PyDictObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003699This subtype of \ctype{PyObject} represents a Python dictionary object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003700\end{ctypedesc}
3701
3702\begin{cvardesc}{PyTypeObject}{PyDict_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003703This instance of \ctype{PyTypeObject} represents the Python dictionary
3704type. This is exposed to Python programs as \code{types.DictType} and
3705\code{types.DictionaryType}.
3706\withsubitem{(in module types)}{\ttindex{DictType}\ttindex{DictionaryType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003707\end{cvardesc}
3708
3709\begin{cfuncdesc}{int}{PyDict_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003710Returns true if its argument is a \ctype{PyDictObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003711\end{cfuncdesc}
3712
Fred Drakec6fa34e1998-04-02 06:47:24 +00003713\begin{cfuncdesc}{PyObject*}{PyDict_New}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00003714Returns a new empty dictionary, or \NULL{} on failure.
3715\end{cfuncdesc}
3716
3717\begin{cfuncdesc}{void}{PyDict_Clear}{PyObject *p}
3718Empties an existing dictionary of all key-value pairs.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003719\end{cfuncdesc}
3720
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00003721\begin{cfuncdesc}{PyObject*}{PyDict_Copy}{PyObject *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00003722Returns a new dictionary that contains the same key-value pairs as p.
3723Empties an existing dictionary of all key-value pairs.
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00003724\end{cfuncdesc}
3725
Fred Drake659ebfa2000-04-03 15:42:13 +00003726\begin{cfuncdesc}{int}{PyDict_SetItem}{PyObject *p, PyObject *key,
3727 PyObject *val}
Fred Drakebab29652001-07-10 16:10:08 +00003728Inserts \var{value} into the dictionary \var{p} with a key of \var{key}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003729\var{key} must be hashable; if it isn't, \exception{TypeError} will be
3730raised.
Fred Drakebab29652001-07-10 16:10:08 +00003731Returns \code{0} on success or \code{-1} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003732\end{cfuncdesc}
3733
Fred Drake83e01bf2001-03-16 15:41:29 +00003734\begin{cfuncdesc}{int}{PyDict_SetItemString}{PyObject *p,
Fred Drakee5bf8b21998-02-12 21:22:28 +00003735 char *key,
3736 PyObject *val}
Fred Drakebab29652001-07-10 16:10:08 +00003737Inserts \var{value} into the dictionary \var{p} using \var{key}
Fred Drake1d158692000-06-18 05:21:21 +00003738as a key. \var{key} should be a \ctype{char*}. The key object is
Fred Drakee058b4f1998-02-16 06:15:35 +00003739created using \code{PyString_FromString(\var{key})}.
Fred Drakebab29652001-07-10 16:10:08 +00003740Returns \code{0} on success or \code{-1} on failure.
Fred Drake659ebfa2000-04-03 15:42:13 +00003741\ttindex{PyString_FromString()}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003742\end{cfuncdesc}
3743
Fred Drake659ebfa2000-04-03 15:42:13 +00003744\begin{cfuncdesc}{int}{PyDict_DelItem}{PyObject *p, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003745Removes the entry in dictionary \var{p} with key \var{key}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003746\var{key} must be hashable; if it isn't, \exception{TypeError} is
3747raised.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003748\end{cfuncdesc}
3749
Fred Drake659ebfa2000-04-03 15:42:13 +00003750\begin{cfuncdesc}{int}{PyDict_DelItemString}{PyObject *p, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003751Removes the entry in dictionary \var{p} which has a key
Fred Drake659ebfa2000-04-03 15:42:13 +00003752specified by the string \var{key}.
Fred Drakebab29652001-07-10 16:10:08 +00003753Returns \code{0} on success or \code{-1} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003754\end{cfuncdesc}
3755
Fred Drake659ebfa2000-04-03 15:42:13 +00003756\begin{cfuncdesc}{PyObject*}{PyDict_GetItem}{PyObject *p, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003757Returns the object from dictionary \var{p} which has a key
Guido van Rossum44475131998-04-21 15:30:01 +00003758\var{key}. Returns \NULL{} if the key \var{key} is not present, but
Fred Drake659ebfa2000-04-03 15:42:13 +00003759\emph{without} setting an exception.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003760\end{cfuncdesc}
3761
Fred Drake659ebfa2000-04-03 15:42:13 +00003762\begin{cfuncdesc}{PyObject*}{PyDict_GetItemString}{PyObject *p, char *key}
Fred Drakef8830d11998-04-23 14:06:01 +00003763This is the same as \cfunction{PyDict_GetItem()}, but \var{key} is
Fred Drake659ebfa2000-04-03 15:42:13 +00003764specified as a \ctype{char*}, rather than a \ctype{PyObject*}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003765\end{cfuncdesc}
3766
Fred Drake659ebfa2000-04-03 15:42:13 +00003767\begin{cfuncdesc}{PyObject*}{PyDict_Items}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003768Returns a \ctype{PyListObject} containing all the items
Guido van Rossum44475131998-04-21 15:30:01 +00003769from the dictionary, as in the dictinoary method \method{items()} (see
Fred Drakebe486461999-11-09 17:03:03 +00003770the \citetitle[../lib/lib.html]{Python Library Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003771\end{cfuncdesc}
3772
Fred Drake659ebfa2000-04-03 15:42:13 +00003773\begin{cfuncdesc}{PyObject*}{PyDict_Keys}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003774Returns a \ctype{PyListObject} containing all the keys
Guido van Rossum44475131998-04-21 15:30:01 +00003775from the dictionary, as in the dictionary method \method{keys()} (see the
Fred Drakebe486461999-11-09 17:03:03 +00003776\citetitle[../lib/lib.html]{Python Library Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003777\end{cfuncdesc}
3778
Fred Drake659ebfa2000-04-03 15:42:13 +00003779\begin{cfuncdesc}{PyObject*}{PyDict_Values}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003780Returns a \ctype{PyListObject} containing all the values
Guido van Rossum44475131998-04-21 15:30:01 +00003781from the dictionary \var{p}, as in the dictionary method
Fred Drakebe486461999-11-09 17:03:03 +00003782\method{values()} (see the \citetitle[../lib/lib.html]{Python Library
3783Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003784\end{cfuncdesc}
3785
Fred Drake659ebfa2000-04-03 15:42:13 +00003786\begin{cfuncdesc}{int}{PyDict_Size}{PyObject *p}
3787Returns the number of items in the dictionary. This is equivalent to
3788\samp{len(\var{p})} on a dictionary.\bifuncindex{len}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003789\end{cfuncdesc}
3790
Fred Drake83e01bf2001-03-16 15:41:29 +00003791\begin{cfuncdesc}{int}{PyDict_Next}{PyObject *p, int *ppos,
Fred Drake7d45d342000-08-11 17:07:32 +00003792 PyObject **pkey, PyObject **pvalue}
Fred Drake83e01bf2001-03-16 15:41:29 +00003793Iterate over all key-value pairs in the dictionary \var{p}. The
3794\ctype{int} referred to by \var{ppos} must be initialized to \code{0}
3795prior to the first call to this function to start the iteration; the
3796function returns true for each pair in the dictionary, and false once
3797all pairs have been reported. The parameters \var{pkey} and
3798\var{pvalue} should either point to \ctype{PyObject*} variables that
3799will be filled in with each key and value, respectively, or may be
Fred Drake8d00a0f2001-04-13 17:55:02 +00003800\NULL.
3801
Fred Drake83e01bf2001-03-16 15:41:29 +00003802For example:
Fred Drakee5bf8b21998-02-12 21:22:28 +00003803
Fred Drake83e01bf2001-03-16 15:41:29 +00003804\begin{verbatim}
3805PyObject *key, *value;
3806int pos = 0;
3807
3808while (PyDict_Next(self->dict, &pos, &key, &value)) {
3809 /* do something interesting with the values... */
3810 ...
3811}
3812\end{verbatim}
Fred Drake8d00a0f2001-04-13 17:55:02 +00003813
3814The dictionary \var{p} should not be mutated during iteration. It is
3815safe (since Python 2.1) to modify the values of the keys as you
3816iterate over the dictionary, for example:
3817
3818\begin{verbatim}
3819PyObject *key, *value;
3820int pos = 0;
3821
3822while (PyDict_Next(self->dict, &pos, &key, &value)) {
3823 int i = PyInt_AS_LONG(value) + 1;
3824 PyObject *o = PyInt_FromLong(i);
3825 if (o == NULL)
3826 return -1;
3827 if (PyDict_SetItem(self->dict, key, o) < 0) {
3828 Py_DECREF(o);
3829 return -1;
3830 }
3831 Py_DECREF(o);
3832}
3833\end{verbatim}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003834\end{cfuncdesc}
3835
3836
Fred Drakeefd146c1999-02-15 15:30:45 +00003837\section{Other Objects \label{otherObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003838
Fred Drakeefd146c1999-02-15 15:30:45 +00003839\subsection{File Objects \label{fileObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003840
Fred Drake659ebfa2000-04-03 15:42:13 +00003841\obindex{file}
3842Python's built-in file objects are implemented entirely on the
3843\ctype{FILE*} support from the C standard library. This is an
3844implementation detail and may change in future releases of Python.
3845
Fred Drakee5bf8b21998-02-12 21:22:28 +00003846\begin{ctypedesc}{PyFileObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003847This subtype of \ctype{PyObject} represents a Python file object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003848\end{ctypedesc}
3849
3850\begin{cvardesc}{PyTypeObject}{PyFile_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003851This instance of \ctype{PyTypeObject} represents the Python file
3852type. This is exposed to Python programs as \code{types.FileType}.
3853\withsubitem{(in module types)}{\ttindex{FileType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003854\end{cvardesc}
3855
3856\begin{cfuncdesc}{int}{PyFile_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003857Returns true if its argument is a \ctype{PyFileObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003858\end{cfuncdesc}
3859
Fred Drake659ebfa2000-04-03 15:42:13 +00003860\begin{cfuncdesc}{PyObject*}{PyFile_FromString}{char *filename, char *mode}
3861On success, returns a new file object that is opened on the
3862file given by \var{filename}, with a file mode given by \var{mode},
3863where \var{mode} has the same semantics as the standard C routine
3864\cfunction{fopen()}\ttindex{fopen()}. On failure, returns \NULL.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003865\end{cfuncdesc}
3866
Fred Drakec6fa34e1998-04-02 06:47:24 +00003867\begin{cfuncdesc}{PyObject*}{PyFile_FromFile}{FILE *fp,
Fred Drake659ebfa2000-04-03 15:42:13 +00003868 char *name, char *mode,
3869 int (*close)(FILE*)}
3870Creates a new \ctype{PyFileObject} from the already-open standard C
3871file pointer, \var{fp}. The function \var{close} will be called when
3872the file should be closed. Returns \NULL{} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003873\end{cfuncdesc}
3874
Fred Drake659ebfa2000-04-03 15:42:13 +00003875\begin{cfuncdesc}{FILE*}{PyFile_AsFile}{PyFileObject *p}
3876Returns the file object associated with \var{p} as a \ctype{FILE*}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003877\end{cfuncdesc}
3878
Fred Drakec6fa34e1998-04-02 06:47:24 +00003879\begin{cfuncdesc}{PyObject*}{PyFile_GetLine}{PyObject *p, int n}
Fred Drake659ebfa2000-04-03 15:42:13 +00003880Equivalent to \code{\var{p}.readline(\optional{\var{n}})}, this
3881function reads one line from the object \var{p}. \var{p} may be a
3882file object or any object with a \method{readline()} method. If
3883\var{n} is \code{0}, exactly one line is read, regardless of the
3884length of the line. If \var{n} is greater than \code{0}, no more than
3885\var{n} bytes will be read from the file; a partial line can be
3886returned. In both cases, an empty string is returned if the end of
3887the file is reached immediately. If \var{n} is less than \code{0},
3888however, one line is read regardless of length, but
3889\exception{EOFError} is raised if the end of the file is reached
3890immediately.
3891\withsubitem{(built-in exception)}{\ttindex{EOFError}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003892\end{cfuncdesc}
3893
Fred Drakec6fa34e1998-04-02 06:47:24 +00003894\begin{cfuncdesc}{PyObject*}{PyFile_Name}{PyObject *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00003895Returns the name of the file specified by \var{p} as a string object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003896\end{cfuncdesc}
3897
3898\begin{cfuncdesc}{void}{PyFile_SetBufSize}{PyFileObject *p, int n}
Fred Drake659ebfa2000-04-03 15:42:13 +00003899Available on systems with \cfunction{setvbuf()}\ttindex{setvbuf()}
3900only. This should only be called immediately after file object
3901creation.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003902\end{cfuncdesc}
3903
Fred Drake659ebfa2000-04-03 15:42:13 +00003904\begin{cfuncdesc}{int}{PyFile_SoftSpace}{PyObject *p, int newflag}
3905This function exists for internal use by the interpreter.
3906Sets the \member{softspace} attribute of \var{p} to \var{newflag} and
3907\withsubitem{(file attribute)}{\ttindex{softspace}}returns the
3908previous value. \var{p} does not have to be a file object
3909for this function to work properly; any object is supported (thought
3910its only interesting if the \member{softspace} attribute can be set).
3911This function clears any errors, and will return \code{0} as the
3912previous value if the attribute either does not exist or if there were
3913errors in retrieving it. There is no way to detect errors from this
3914function, but doing so should not be needed.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003915\end{cfuncdesc}
3916
Fred Drakec6fa34e1998-04-02 06:47:24 +00003917\begin{cfuncdesc}{int}{PyFile_WriteObject}{PyObject *obj, PyFileObject *p,
3918 int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00003919Writes object \var{obj} to file object \var{p}. The only supported
3920flag for \var{flags} is \constant{Py_PRINT_RAW}\ttindex{Py_PRINT_RAW};
3921if given, the \function{str()} of the object is written instead of the
3922\function{repr()}. Returns \code{0} on success or \code{-1} on
3923failure; the appropriate exception will be set.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003924\end{cfuncdesc}
3925
Fred Drakec6fa34e1998-04-02 06:47:24 +00003926\begin{cfuncdesc}{int}{PyFile_WriteString}{char *s, PyFileObject *p,
3927 int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00003928Writes string \var{s} to file object \var{p}. Returns \code{0} on
3929success or \code{-1} on failure; the appropriate exception will be
3930set.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003931\end{cfuncdesc}
3932
3933
Fred Drake5838d0f2001-01-28 06:39:35 +00003934\subsection{Instance Objects \label{instanceObjects}}
3935
3936\obindex{instance}
3937There are very few functions specific to instance objects.
3938
3939\begin{cvardesc}{PyTypeObject}{PyInstance_Type}
3940 Type object for class instances.
3941\end{cvardesc}
3942
3943\begin{cfuncdesc}{int}{PyInstance_Check}{PyObject *obj}
3944 Returns true if \var{obj} is an instance.
3945\end{cfuncdesc}
3946
3947\begin{cfuncdesc}{PyObject*}{PyInstance_New}{PyObject *class,
3948 PyObject *arg,
3949 PyObject *kw}
3950 Create a new instance of a specific class. The parameters \var{arg}
3951 and \var{kw} are used as the positional and keyword parameters to
3952 the object's constructor.
3953\end{cfuncdesc}
3954
3955\begin{cfuncdesc}{PyObject*}{PyInstance_NewRaw}{PyObject *class,
3956 PyObject *dict}
3957 Create a new instance of a specific class without calling it's
3958 constructor. \var{class} is the class of new object. The
3959 \var{dict} parameter will be used as the object's \member{__dict__};
3960 if \NULL, a new dictionary will be created for the instance.
3961\end{cfuncdesc}
3962
3963
Fred Drakeefd146c1999-02-15 15:30:45 +00003964\subsection{Module Objects \label{moduleObjects}}
3965
3966\obindex{module}
3967There are only a few functions special to module objects.
3968
Fred Drake659ebfa2000-04-03 15:42:13 +00003969\begin{cvardesc}{PyTypeObject}{PyModule_Type}
3970This instance of \ctype{PyTypeObject} represents the Python module
3971type. This is exposed to Python programs as \code{types.ModuleType}.
3972\withsubitem{(in module types)}{\ttindex{ModuleType}}
3973\end{cvardesc}
3974
3975\begin{cfuncdesc}{int}{PyModule_Check}{PyObject *p}
3976Returns true if its argument is a module object.
Fred Drakeefd146c1999-02-15 15:30:45 +00003977\end{cfuncdesc}
3978
Fred Drake659ebfa2000-04-03 15:42:13 +00003979\begin{cfuncdesc}{PyObject*}{PyModule_New}{char *name}
3980Return a new module object with the \member{__name__} attribute set to
3981\var{name}. Only the module's \member{__doc__} and
3982\member{__name__} attributes are filled in; the caller is responsible
3983for providing a \member{__file__} attribute.
3984\withsubitem{(module attribute)}{
3985 \ttindex{__name__}\ttindex{__doc__}\ttindex{__file__}}
3986\end{cfuncdesc}
3987
3988\begin{cfuncdesc}{PyObject*}{PyModule_GetDict}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00003989Return the dictionary object that implements \var{module}'s namespace;
3990this object is the same as the \member{__dict__} attribute of the
3991module object. This function never fails.
Fred Drake659ebfa2000-04-03 15:42:13 +00003992\withsubitem{(module attribute)}{\ttindex{__dict__}}
Fred Drakeefd146c1999-02-15 15:30:45 +00003993\end{cfuncdesc}
3994
Fred Drake659ebfa2000-04-03 15:42:13 +00003995\begin{cfuncdesc}{char*}{PyModule_GetName}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00003996Return \var{module}'s \member{__name__} value. If the module does not
Fred Drake659ebfa2000-04-03 15:42:13 +00003997provide one, or if it is not a string, \exception{SystemError} is
3998raised and \NULL{} is returned.
3999\withsubitem{(module attribute)}{\ttindex{__name__}}
4000\withsubitem{(built-in exception)}{\ttindex{SystemError}}
Fred Drakeefd146c1999-02-15 15:30:45 +00004001\end{cfuncdesc}
4002
Fred Drake659ebfa2000-04-03 15:42:13 +00004003\begin{cfuncdesc}{char*}{PyModule_GetFilename}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00004004Return the name of the file from which \var{module} was loaded using
4005\var{module}'s \member{__file__} attribute. If this is not defined,
Fred Drake659ebfa2000-04-03 15:42:13 +00004006or if it is not a string, raise \exception{SystemError} and return
4007\NULL.
4008\withsubitem{(module attribute)}{\ttindex{__file__}}
4009\withsubitem{(built-in exception)}{\ttindex{SystemError}}
Fred Drakeefd146c1999-02-15 15:30:45 +00004010\end{cfuncdesc}
4011
Fred Drake891150b2000-09-23 03:25:42 +00004012\begin{cfuncdesc}{int}{PyModule_AddObject}{PyObject *module,
4013 char *name, PyObject *value}
4014Add an object to \var{module} as \var{name}. This is a convenience
4015function which can be used from the module's initialization function.
4016This steals a reference to \var{value}. Returns \code{-1} on error,
4017\code{0} on success.
4018\versionadded{2.0}
4019\end{cfuncdesc}
4020
4021\begin{cfuncdesc}{int}{PyModule_AddIntConstant}{PyObject *module,
4022 char *name, int value}
4023Add an integer constant to \var{module} as \var{name}. This convenience
4024function can be used from the module's initialization function.
4025Returns \code{-1} on error, \code{0} on success.
4026\versionadded{2.0}
4027\end{cfuncdesc}
4028
4029\begin{cfuncdesc}{int}{PyModule_AddStringConstant}{PyObject *module,
4030 char *name, char *value}
4031Add a string constant to \var{module} as \var{name}. This convenience
4032function can be used from the module's initialization function. The
4033string \var{value} must be null-terminated. Returns \code{-1} on
4034error, \code{0} on success.
4035\versionadded{2.0}
4036\end{cfuncdesc}
4037
Fred Drakeefd146c1999-02-15 15:30:45 +00004038
4039\subsection{CObjects \label{cObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00004040
Fred Drake659ebfa2000-04-03 15:42:13 +00004041\obindex{CObject}
4042Refer to \emph{Extending and Embedding the Python Interpreter},
4043section 1.12 (``Providing a C API for an Extension Module''), for more
4044information on using these objects.
4045
4046
Guido van Rossum44475131998-04-21 15:30:01 +00004047\begin{ctypedesc}{PyCObject}
Fred Drakef8830d11998-04-23 14:06:01 +00004048This subtype of \ctype{PyObject} represents an opaque value, useful for
Fred Drake659ebfa2000-04-03 15:42:13 +00004049C extension modules who need to pass an opaque value (as a
4050\ctype{void*} pointer) through Python code to other C code. It is
Guido van Rossum44475131998-04-21 15:30:01 +00004051often used to make a C function pointer defined in one module
4052available to other modules, so the regular import mechanism can be
4053used to access C APIs defined in dynamically loaded modules.
4054\end{ctypedesc}
Fred Drakee5bf8b21998-02-12 21:22:28 +00004055
Fred Drake659ebfa2000-04-03 15:42:13 +00004056\begin{cfuncdesc}{int}{PyCObject_Check}{PyObject *p}
4057Returns true if its argument is a \ctype{PyCObject}.
4058\end{cfuncdesc}
4059
4060\begin{cfuncdesc}{PyObject*}{PyCObject_FromVoidPtr}{void* cobj,
Marc-André Lemburga544ea22001-01-17 18:04:31 +00004061 void (*destr)(void *)}
Fred Drake1d158692000-06-18 05:21:21 +00004062Creates a \ctype{PyCObject} from the \code{void *}\var{cobj}. The
Fred Drakedab44681999-05-13 18:41:14 +00004063\var{destr} function will be called when the object is reclaimed, unless
4064it is \NULL.
Guido van Rossum44475131998-04-21 15:30:01 +00004065\end{cfuncdesc}
4066
Fred Drake659ebfa2000-04-03 15:42:13 +00004067\begin{cfuncdesc}{PyObject*}{PyCObject_FromVoidPtrAndDesc}{void* cobj,
Marc-André Lemburga544ea22001-01-17 18:04:31 +00004068 void* desc, void (*destr)(void *, void *) }
Fred Drakef8830d11998-04-23 14:06:01 +00004069Creates a \ctype{PyCObject} from the \ctype{void *}\var{cobj}. The
4070\var{destr} function will be called when the object is reclaimed. The
4071\var{desc} argument can be used to pass extra callback data for the
4072destructor function.
Guido van Rossum44475131998-04-21 15:30:01 +00004073\end{cfuncdesc}
4074
Fred Drake659ebfa2000-04-03 15:42:13 +00004075\begin{cfuncdesc}{void*}{PyCObject_AsVoidPtr}{PyObject* self}
4076Returns the object \ctype{void *} that the
4077\ctype{PyCObject} \var{self} was created with.
Guido van Rossum44475131998-04-21 15:30:01 +00004078\end{cfuncdesc}
4079
Fred Drake659ebfa2000-04-03 15:42:13 +00004080\begin{cfuncdesc}{void*}{PyCObject_GetDesc}{PyObject* self}
4081Returns the description \ctype{void *} that the
4082\ctype{PyCObject} \var{self} was created with.
Guido van Rossum44475131998-04-21 15:30:01 +00004083\end{cfuncdesc}
Fred Drakee5bf8b21998-02-12 21:22:28 +00004084
Fred Drake659ebfa2000-04-03 15:42:13 +00004085
Fred Drakeefd146c1999-02-15 15:30:45 +00004086\chapter{Initialization, Finalization, and Threads
4087 \label{initialization}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004088
Guido van Rossum4a944d71997-08-14 20:35:38 +00004089\begin{cfuncdesc}{void}{Py_Initialize}{}
4090Initialize the Python interpreter. In an application embedding
4091Python, this should be called before using any other Python/C API
Fred Drake659ebfa2000-04-03 15:42:13 +00004092functions; with the exception of
4093\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()},
4094\cfunction{PyEval_InitThreads()}\ttindex{PyEval_InitThreads()},
4095\cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()},
4096and \cfunction{PyEval_AcquireLock()}\ttindex{PyEval_AcquireLock()}.
4097This initializes the table of loaded modules (\code{sys.modules}), and
4098\withsubitem{(in module sys)}{\ttindex{modules}\ttindex{path}}creates the
4099fundamental modules \module{__builtin__}\refbimodindex{__builtin__},
Fred Drake4de05a91998-02-16 14:25:26 +00004100\module{__main__}\refbimodindex{__main__} and
4101\module{sys}\refbimodindex{sys}. It also initializes the module
Fred Drake659ebfa2000-04-03 15:42:13 +00004102search\indexiii{module}{search}{path} path (\code{sys.path}).
4103It does not set \code{sys.argv}; use
4104\cfunction{PySys_SetArgv()}\ttindex{PySys_SetArgv()} for that. This
4105is a no-op when called for a second time (without calling
4106\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} first). There is no
4107return value; it is a fatal error if the initialization fails.
Guido van Rossum42cefd01997-10-05 15:27:29 +00004108\end{cfuncdesc}
4109
4110\begin{cfuncdesc}{int}{Py_IsInitialized}{}
Guido van Rossum42cefd01997-10-05 15:27:29 +00004111Return true (nonzero) when the Python interpreter has been
Fred Drakee058b4f1998-02-16 06:15:35 +00004112initialized, false (zero) if not. After \cfunction{Py_Finalize()} is
4113called, this returns false until \cfunction{Py_Initialize()} is called
Guido van Rossum42cefd01997-10-05 15:27:29 +00004114again.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004115\end{cfuncdesc}
4116
4117\begin{cfuncdesc}{void}{Py_Finalize}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00004118Undo all initializations made by \cfunction{Py_Initialize()} and
4119subsequent use of Python/C API functions, and destroy all
4120sub-interpreters (see \cfunction{Py_NewInterpreter()} below) that were
4121created and not yet destroyed since the last call to
4122\cfunction{Py_Initialize()}. Ideally, this frees all memory allocated
4123by the Python interpreter. This is a no-op when called for a second
4124time (without calling \cfunction{Py_Initialize()} again first). There
4125is no return value; errors during finalization are ignored.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004126
4127This function is provided for a number of reasons. An embedding
4128application might want to restart Python without having to restart the
4129application itself. An application that has loaded the Python
4130interpreter from a dynamically loadable library (or DLL) might want to
4131free all memory allocated by Python before unloading the DLL. During a
4132hunt for memory leaks in an application a developer might want to free
4133all memory allocated by Python before exiting from the application.
4134
Fred Drakee058b4f1998-02-16 06:15:35 +00004135\strong{Bugs and caveats:} The destruction of modules and objects in
Guido van Rossum4a944d71997-08-14 20:35:38 +00004136modules is done in random order; this may cause destructors
Fred Drakee058b4f1998-02-16 06:15:35 +00004137(\method{__del__()} methods) to fail when they depend on other objects
Guido van Rossum4a944d71997-08-14 20:35:38 +00004138(even functions) or modules. Dynamically loaded extension modules
4139loaded by Python are not unloaded. Small amounts of memory allocated
4140by the Python interpreter may not be freed (if you find a leak, please
4141report it). Memory tied up in circular references between objects is
4142not freed. Some memory allocated by extension modules may not be
4143freed. Some extension may not work properly if their initialization
4144routine is called more than once; this can happen if an applcation
Fred Drakee058b4f1998-02-16 06:15:35 +00004145calls \cfunction{Py_Initialize()} and \cfunction{Py_Finalize()} more
4146than once.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004147\end{cfuncdesc}
4148
Fred Drakec6fa34e1998-04-02 06:47:24 +00004149\begin{cfuncdesc}{PyThreadState*}{Py_NewInterpreter}{}
Fred Drake4de05a91998-02-16 14:25:26 +00004150Create a new sub-interpreter. This is an (almost) totally separate
4151environment for the execution of Python code. In particular, the new
4152interpreter has separate, independent versions of all imported
4153modules, including the fundamental modules
4154\module{__builtin__}\refbimodindex{__builtin__},
4155\module{__main__}\refbimodindex{__main__} and
4156\module{sys}\refbimodindex{sys}. The table of loaded modules
4157(\code{sys.modules}) and the module search path (\code{sys.path}) are
4158also separate. The new environment has no \code{sys.argv} variable.
4159It has new standard I/O stream file objects \code{sys.stdin},
4160\code{sys.stdout} and \code{sys.stderr} (however these refer to the
Fred Drake659ebfa2000-04-03 15:42:13 +00004161same underlying \ctype{FILE} structures in the C library).
4162\withsubitem{(in module sys)}{
4163 \ttindex{stdout}\ttindex{stderr}\ttindex{stdin}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004164
4165The return value points to the first thread state created in the new
4166sub-interpreter. This thread state is made the current thread state.
4167Note that no actual thread is created; see the discussion of thread
4168states below. If creation of the new interpreter is unsuccessful,
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004169\NULL{} is returned; no exception is set since the exception state
Guido van Rossum4a944d71997-08-14 20:35:38 +00004170is stored in the current thread state and there may not be a current
4171thread state. (Like all other Python/C API functions, the global
4172interpreter lock must be held before calling this function and is
4173still held when it returns; however, unlike most other Python/C API
4174functions, there needn't be a current thread state on entry.)
4175
4176Extension modules are shared between (sub-)interpreters as follows:
4177the first time a particular extension is imported, it is initialized
4178normally, and a (shallow) copy of its module's dictionary is
4179squirreled away. When the same extension is imported by another
4180(sub-)interpreter, a new module is initialized and filled with the
Fred Drakee058b4f1998-02-16 06:15:35 +00004181contents of this copy; the extension's \code{init} function is not
4182called. Note that this is different from what happens when an
4183extension is imported after the interpreter has been completely
Fred Drake659ebfa2000-04-03 15:42:13 +00004184re-initialized by calling
4185\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} and
4186\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}; in that case,
4187the extension's \code{init\var{module}} function \emph{is} called
4188again.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004189
Fred Drakee058b4f1998-02-16 06:15:35 +00004190\strong{Bugs and caveats:} Because sub-interpreters (and the main
Guido van Rossum4a944d71997-08-14 20:35:38 +00004191interpreter) are part of the same process, the insulation between them
Fred Drakee058b4f1998-02-16 06:15:35 +00004192isn't perfect --- for example, using low-level file operations like
Fred Drake659ebfa2000-04-03 15:42:13 +00004193\withsubitem{(in module os)}{\ttindex{close()}}
Fred Drakef8830d11998-04-23 14:06:01 +00004194\function{os.close()} they can (accidentally or maliciously) affect each
Guido van Rossum4a944d71997-08-14 20:35:38 +00004195other's open files. Because of the way extensions are shared between
4196(sub-)interpreters, some extensions may not work properly; this is
4197especially likely when the extension makes use of (static) global
4198variables, or when the extension manipulates its module's dictionary
4199after its initialization. It is possible to insert objects created in
4200one sub-interpreter into a namespace of another sub-interpreter; this
4201should be done with great care to avoid sharing user-defined
4202functions, methods, instances or classes between sub-interpreters,
4203since import operations executed by such objects may affect the
4204wrong (sub-)interpreter's dictionary of loaded modules. (XXX This is
4205a hard-to-fix bug that will be addressed in a future release.)
4206\end{cfuncdesc}
4207
4208\begin{cfuncdesc}{void}{Py_EndInterpreter}{PyThreadState *tstate}
4209Destroy the (sub-)interpreter represented by the given thread state.
4210The given thread state must be the current thread state. See the
4211discussion of thread states below. When the call returns, the current
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004212thread state is \NULL{}. All thread states associated with this
Guido van Rossum4a944d71997-08-14 20:35:38 +00004213interpreted are destroyed. (The global interpreter lock must be held
4214before calling this function and is still held when it returns.)
Fred Drake659ebfa2000-04-03 15:42:13 +00004215\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} will destroy all
4216sub-interpreters that haven't been explicitly destroyed at that point.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004217\end{cfuncdesc}
4218
4219\begin{cfuncdesc}{void}{Py_SetProgramName}{char *name}
Fred Drake659ebfa2000-04-03 15:42:13 +00004220This function should be called before
4221\cfunction{Py_Initialize()}\ttindex{Py_Initialize()} is called
Guido van Rossum4a944d71997-08-14 20:35:38 +00004222for the first time, if it is called at all. It tells the interpreter
Fred Drake659ebfa2000-04-03 15:42:13 +00004223the value of the \code{argv[0]} argument to the
4224\cfunction{main()}\ttindex{main()} function of the program. This is
4225used by \cfunction{Py_GetPath()}\ttindex{Py_GetPath()} and some other
Guido van Rossum4a944d71997-08-14 20:35:38 +00004226functions below to find the Python run-time libraries relative to the
Fred Drakea8455ab2000-06-16 19:58:42 +00004227interpreter executable. The default value is \code{'python'}. The
Guido van Rossum4a944d71997-08-14 20:35:38 +00004228argument should point to a zero-terminated character string in static
4229storage whose contents will not change for the duration of the
4230program's execution. No code in the Python interpreter will change
4231the contents of this storage.
4232\end{cfuncdesc}
4233
Fred Drakec6fa34e1998-04-02 06:47:24 +00004234\begin{cfuncdesc}{char*}{Py_GetProgramName}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00004235Return the program name set with
4236\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()}, or the
Guido van Rossum4a944d71997-08-14 20:35:38 +00004237default. The returned string points into static storage; the caller
4238should not modify its value.
4239\end{cfuncdesc}
4240
Fred Drakec6fa34e1998-04-02 06:47:24 +00004241\begin{cfuncdesc}{char*}{Py_GetPrefix}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00004242Return the \emph{prefix} for installed platform-independent files. This
Guido van Rossum4a944d71997-08-14 20:35:38 +00004243is derived through a number of complicated rules from the program name
Fred Drakee058b4f1998-02-16 06:15:35 +00004244set with \cfunction{Py_SetProgramName()} and some environment variables;
Fred Drakea8455ab2000-06-16 19:58:42 +00004245for example, if the program name is \code{'/usr/local/bin/python'},
4246the prefix is \code{'/usr/local'}. The returned string points into
Guido van Rossum4a944d71997-08-14 20:35:38 +00004247static storage; the caller should not modify its value. This
Fred Drakec94d9341998-04-12 02:39:13 +00004248corresponds to the \makevar{prefix} variable in the top-level
Fred Drakea8455ab2000-06-16 19:58:42 +00004249\file{Makefile} and the \longprogramopt{prefix} argument to the
Fred Drakee058b4f1998-02-16 06:15:35 +00004250\program{configure} script at build time. The value is available to
Fred Drakeb0a78731998-01-13 18:51:10 +00004251Python code as \code{sys.prefix}. It is only useful on \UNIX{}. See
Guido van Rossum4a944d71997-08-14 20:35:38 +00004252also the next function.
4253\end{cfuncdesc}
4254
Fred Drakec6fa34e1998-04-02 06:47:24 +00004255\begin{cfuncdesc}{char*}{Py_GetExecPrefix}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00004256Return the \emph{exec-prefix} for installed platform-\emph{de}pendent
Guido van Rossum4a944d71997-08-14 20:35:38 +00004257files. This is derived through a number of complicated rules from the
Fred Drakee058b4f1998-02-16 06:15:35 +00004258program name set with \cfunction{Py_SetProgramName()} and some environment
Guido van Rossum4a944d71997-08-14 20:35:38 +00004259variables; for example, if the program name is
Fred Drakea8455ab2000-06-16 19:58:42 +00004260\code{'/usr/local/bin/python'}, the exec-prefix is
4261\code{'/usr/local'}. The returned string points into static storage;
Guido van Rossum4a944d71997-08-14 20:35:38 +00004262the caller should not modify its value. This corresponds to the
Fred Drakec94d9341998-04-12 02:39:13 +00004263\makevar{exec_prefix} variable in the top-level \file{Makefile} and the
Fred Drakea8455ab2000-06-16 19:58:42 +00004264\longprogramopt{exec-prefix} argument to the
Fred Drake310ee611999-11-09 17:31:42 +00004265\program{configure} script at build time. The value is available to
4266Python code as \code{sys.exec_prefix}. It is only useful on \UNIX{}.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004267
4268Background: The exec-prefix differs from the prefix when platform
4269dependent files (such as executables and shared libraries) are
4270installed in a different directory tree. In a typical installation,
4271platform dependent files may be installed in the
Fred Drakea8455ab2000-06-16 19:58:42 +00004272\file{/usr/local/plat} subtree while platform independent may be
4273installed in \file{/usr/local}.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004274
4275Generally speaking, a platform is a combination of hardware and
4276software families, e.g. Sparc machines running the Solaris 2.x
4277operating system are considered the same platform, but Intel machines
4278running Solaris 2.x are another platform, and Intel machines running
4279Linux are yet another platform. Different major revisions of the same
Fred Drakeb0a78731998-01-13 18:51:10 +00004280operating system generally also form different platforms. Non-\UNIX{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004281operating systems are a different story; the installation strategies
4282on those systems are so different that the prefix and exec-prefix are
4283meaningless, and set to the empty string. Note that compiled Python
4284bytecode files are platform independent (but not independent from the
4285Python version by which they were compiled!).
4286
Fred Drakee058b4f1998-02-16 06:15:35 +00004287System administrators will know how to configure the \program{mount} or
Fred Drakea8455ab2000-06-16 19:58:42 +00004288\program{automount} programs to share \file{/usr/local} between platforms
4289while having \file{/usr/local/plat} be a different filesystem for each
Guido van Rossum4a944d71997-08-14 20:35:38 +00004290platform.
4291\end{cfuncdesc}
4292
Fred Drakec6fa34e1998-04-02 06:47:24 +00004293\begin{cfuncdesc}{char*}{Py_GetProgramFullPath}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004294Return the full program name of the Python executable; this is
4295computed as a side-effect of deriving the default module search path
Fred Drake659ebfa2000-04-03 15:42:13 +00004296from the program name (set by
4297\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()} above).
4298The returned string points into static storage; the caller should not
Guido van Rossum4a944d71997-08-14 20:35:38 +00004299modify its value. The value is available to Python code as
Guido van Rossum42cefd01997-10-05 15:27:29 +00004300\code{sys.executable}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004301\withsubitem{(in module sys)}{\ttindex{executable}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004302\end{cfuncdesc}
4303
Fred Drakec6fa34e1998-04-02 06:47:24 +00004304\begin{cfuncdesc}{char*}{Py_GetPath}{}
Fred Drake4de05a91998-02-16 14:25:26 +00004305\indexiii{module}{search}{path}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004306Return the default module search path; this is computed from the
Fred Drakee058b4f1998-02-16 06:15:35 +00004307program name (set by \cfunction{Py_SetProgramName()} above) and some
Guido van Rossum4a944d71997-08-14 20:35:38 +00004308environment variables. The returned string consists of a series of
4309directory names separated by a platform dependent delimiter character.
Fred Drakef8830d11998-04-23 14:06:01 +00004310The delimiter character is \character{:} on \UNIX{}, \character{;} on
Fred Drake659ebfa2000-04-03 15:42:13 +00004311DOS/Windows, and \character{\e n} (the \ASCII{} newline character) on
Fred Drakee5bc4971998-02-12 23:36:49 +00004312Macintosh. The returned string points into static storage; the caller
Guido van Rossum4a944d71997-08-14 20:35:38 +00004313should not modify its value. The value is available to Python code
Fred Drake659ebfa2000-04-03 15:42:13 +00004314as the list \code{sys.path}\withsubitem{(in module sys)}{\ttindex{path}},
4315which may be modified to change the future search path for loaded
4316modules.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004317
4318% XXX should give the exact rules
4319\end{cfuncdesc}
4320
Fred Drakec6fa34e1998-04-02 06:47:24 +00004321\begin{cfuncdesc}{const char*}{Py_GetVersion}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004322Return the version of this Python interpreter. This is a string that
4323looks something like
4324
Guido van Rossum09270b51997-08-15 18:57:32 +00004325\begin{verbatim}
Fred Drakee058b4f1998-02-16 06:15:35 +00004326"1.5 (#67, Dec 31 1997, 22:34:28) [GCC 2.7.2.2]"
Guido van Rossum09270b51997-08-15 18:57:32 +00004327\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004328
4329The first word (up to the first space character) is the current Python
4330version; the first three characters are the major and minor version
4331separated by a period. The returned string points into static storage;
4332the caller should not modify its value. The value is available to
4333Python code as the list \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004334\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004335\end{cfuncdesc}
4336
Fred Drakec6fa34e1998-04-02 06:47:24 +00004337\begin{cfuncdesc}{const char*}{Py_GetPlatform}{}
Fred Drakeb0a78731998-01-13 18:51:10 +00004338Return the platform identifier for the current platform. On \UNIX{},
Guido van Rossum4a944d71997-08-14 20:35:38 +00004339this is formed from the ``official'' name of the operating system,
4340converted to lower case, followed by the major revision number; e.g.,
4341for Solaris 2.x, which is also known as SunOS 5.x, the value is
Fred Drakea8455ab2000-06-16 19:58:42 +00004342\code{'sunos5'}. On Macintosh, it is \code{'mac'}. On Windows, it
4343is \code{'win'}. The returned string points into static storage;
Guido van Rossum4a944d71997-08-14 20:35:38 +00004344the caller should not modify its value. The value is available to
4345Python code as \code{sys.platform}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004346\withsubitem{(in module sys)}{\ttindex{platform}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004347\end{cfuncdesc}
4348
Fred Drakec6fa34e1998-04-02 06:47:24 +00004349\begin{cfuncdesc}{const char*}{Py_GetCopyright}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004350Return the official copyright string for the current Python version,
4351for example
4352
Fred Drakea8455ab2000-06-16 19:58:42 +00004353\code{'Copyright 1991-1995 Stichting Mathematisch Centrum, Amsterdam'}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004354
4355The returned string points into static storage; the caller should not
4356modify its value. The value is available to Python code as the list
4357\code{sys.copyright}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004358\withsubitem{(in module sys)}{\ttindex{copyright}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004359\end{cfuncdesc}
4360
Fred Drakec6fa34e1998-04-02 06:47:24 +00004361\begin{cfuncdesc}{const char*}{Py_GetCompiler}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004362Return an indication of the compiler used to build the current Python
Fred Drakee058b4f1998-02-16 06:15:35 +00004363version, in square brackets, for example:
Guido van Rossum4a944d71997-08-14 20:35:38 +00004364
Fred Drakee058b4f1998-02-16 06:15:35 +00004365\begin{verbatim}
4366"[GCC 2.7.2.2]"
4367\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004368
4369The returned string points into static storage; the caller should not
4370modify its value. The value is available to Python code as part of
4371the variable \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004372\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004373\end{cfuncdesc}
4374
Fred Drakec6fa34e1998-04-02 06:47:24 +00004375\begin{cfuncdesc}{const char*}{Py_GetBuildInfo}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004376Return information about the sequence number and build date and time
4377of the current Python interpreter instance, for example
4378
Guido van Rossum09270b51997-08-15 18:57:32 +00004379\begin{verbatim}
4380"#67, Aug 1 1997, 22:34:28"
4381\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004382
4383The returned string points into static storage; the caller should not
4384modify its value. The value is available to Python code as part of
4385the variable \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004386\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004387\end{cfuncdesc}
4388
4389\begin{cfuncdesc}{int}{PySys_SetArgv}{int argc, char **argv}
Fred Drake659ebfa2000-04-03 15:42:13 +00004390Set \code{sys.argv} based on \var{argc} and \var{argv}. These
4391parameters are similar to those passed to the program's
4392\cfunction{main()}\ttindex{main()} function with the difference that
4393the first entry should refer to the script file to be executed rather
4394than the executable hosting the Python interpreter. If there isn't a
4395script that will be run, the first entry in \var{argv} can be an empty
4396string. If this function fails to initialize \code{sys.argv}, a fatal
4397condition is signalled using
4398\cfunction{Py_FatalError()}\ttindex{Py_FatalError()}.
4399\withsubitem{(in module sys)}{\ttindex{argv}}
4400% XXX impl. doesn't seem consistent in allowing 0/NULL for the params;
4401% check w/ Guido.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004402\end{cfuncdesc}
4403
4404% XXX Other PySys thingies (doesn't really belong in this chapter)
4405
Fred Drakeefd146c1999-02-15 15:30:45 +00004406\section{Thread State and the Global Interpreter Lock
4407 \label{threads}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004408
Fred Drake659ebfa2000-04-03 15:42:13 +00004409\index{global interpreter lock}
4410\index{interpreter lock}
4411\index{lock, interpreter}
4412
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004413The Python interpreter is not fully thread safe. In order to support
4414multi-threaded Python programs, there's a global lock that must be
4415held by the current thread before it can safely access Python objects.
4416Without the lock, even the simplest operations could cause problems in
Fred Drake7baf3d41998-02-20 00:45:52 +00004417a multi-threaded program: for example, when two threads simultaneously
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004418increment the reference count of the same object, the reference count
4419could end up being incremented only once instead of twice.
4420
4421Therefore, the rule exists that only the thread that has acquired the
4422global interpreter lock may operate on Python objects or call Python/C
4423API functions. In order to support multi-threaded Python programs,
Fred Drake659ebfa2000-04-03 15:42:13 +00004424the interpreter regularly releases and reacquires the lock --- by
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004425default, every ten bytecode instructions (this can be changed with
Fred Drake659ebfa2000-04-03 15:42:13 +00004426\withsubitem{(in module sys)}{\ttindex{setcheckinterval()}}
Fred Drakee058b4f1998-02-16 06:15:35 +00004427\function{sys.setcheckinterval()}). The lock is also released and
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004428reacquired around potentially blocking I/O operations like reading or
4429writing a file, so that other threads can run while the thread that
4430requests the I/O is waiting for the I/O operation to complete.
4431
4432The Python interpreter needs to keep some bookkeeping information
Fred Drakee058b4f1998-02-16 06:15:35 +00004433separate per thread --- for this it uses a data structure called
Fred Drake659ebfa2000-04-03 15:42:13 +00004434\ctype{PyThreadState}\ttindex{PyThreadState}. This is new in Python
44351.5; in earlier versions, such state was stored in global variables,
4436and switching threads could cause problems. In particular, exception
4437handling is now thread safe, when the application uses
4438\withsubitem{(in module sys)}{\ttindex{exc_info()}}
4439\function{sys.exc_info()} to access the exception last raised in the
4440current thread.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004441
4442There's one global variable left, however: the pointer to the current
Fred Drake659ebfa2000-04-03 15:42:13 +00004443\ctype{PyThreadState}\ttindex{PyThreadState} structure. While most
4444thread packages have a way to store ``per-thread global data,''
4445Python's internal platform independent thread abstraction doesn't
4446support this yet. Therefore, the current thread state must be
4447manipulated explicitly.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004448
4449This is easy enough in most cases. Most code manipulating the global
4450interpreter lock has the following simple structure:
4451
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004452\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004453Save the thread state in a local variable.
4454Release the interpreter lock.
4455...Do some blocking I/O operation...
4456Reacquire the interpreter lock.
4457Restore the thread state from the local variable.
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004458\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004459
4460This is so common that a pair of macros exists to simplify it:
4461
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004462\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004463Py_BEGIN_ALLOW_THREADS
4464...Do some blocking I/O operation...
4465Py_END_ALLOW_THREADS
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004466\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004467
Fred Drake659ebfa2000-04-03 15:42:13 +00004468The \code{Py_BEGIN_ALLOW_THREADS}\ttindex{Py_BEGIN_ALLOW_THREADS} macro
4469opens a new block and declares a hidden local variable; the
4470\code{Py_END_ALLOW_THREADS}\ttindex{Py_END_ALLOW_THREADS} macro closes
Fred Drakee058b4f1998-02-16 06:15:35 +00004471the block. Another advantage of using these two macros is that when
4472Python is compiled without thread support, they are defined empty,
4473thus saving the thread state and lock manipulations.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004474
4475When thread support is enabled, the block above expands to the
4476following code:
4477
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004478\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004479 PyThreadState *_save;
Fred Drake659ebfa2000-04-03 15:42:13 +00004480
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004481 _save = PyEval_SaveThread();
4482 ...Do some blocking I/O operation...
4483 PyEval_RestoreThread(_save);
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004484\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004485
4486Using even lower level primitives, we can get roughly the same effect
4487as follows:
4488
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004489\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004490 PyThreadState *_save;
Fred Drake659ebfa2000-04-03 15:42:13 +00004491
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004492 _save = PyThreadState_Swap(NULL);
4493 PyEval_ReleaseLock();
4494 ...Do some blocking I/O operation...
4495 PyEval_AcquireLock();
4496 PyThreadState_Swap(_save);
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004497\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004498
4499There are some subtle differences; in particular,
Fred Drake659ebfa2000-04-03 15:42:13 +00004500\cfunction{PyEval_RestoreThread()}\ttindex{PyEval_RestoreThread()} saves
4501and restores the value of the global variable
4502\cdata{errno}\ttindex{errno}, since the lock manipulation does not
Fred Drakef8830d11998-04-23 14:06:01 +00004503guarantee that \cdata{errno} is left alone. Also, when thread support
Fred Drake659ebfa2000-04-03 15:42:13 +00004504is disabled,
4505\cfunction{PyEval_SaveThread()}\ttindex{PyEval_SaveThread()} and
Fred Drakee058b4f1998-02-16 06:15:35 +00004506\cfunction{PyEval_RestoreThread()} don't manipulate the lock; in this
Fred Drake659ebfa2000-04-03 15:42:13 +00004507case, \cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()} and
4508\cfunction{PyEval_AcquireLock()}\ttindex{PyEval_AcquireLock()} are not
4509available. This is done so that dynamically loaded extensions
4510compiled with thread support enabled can be loaded by an interpreter
4511that was compiled with disabled thread support.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004512
4513The global interpreter lock is used to protect the pointer to the
4514current thread state. When releasing the lock and saving the thread
4515state, the current thread state pointer must be retrieved before the
4516lock is released (since another thread could immediately acquire the
4517lock and store its own thread state in the global variable).
Fred Drakeffe58ca2000-09-29 17:31:54 +00004518Conversely, when acquiring the lock and restoring the thread state,
4519the lock must be acquired before storing the thread state pointer.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004520
4521Why am I going on with so much detail about this? Because when
Fred Drake659ebfa2000-04-03 15:42:13 +00004522threads are created from C, they don't have the global interpreter
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004523lock, nor is there a thread state data structure for them. Such
4524threads must bootstrap themselves into existence, by first creating a
4525thread state data structure, then acquiring the lock, and finally
4526storing their thread state pointer, before they can start using the
4527Python/C API. When they are done, they should reset the thread state
4528pointer, release the lock, and finally free their thread state data
4529structure.
4530
4531When creating a thread data structure, you need to provide an
4532interpreter state data structure. The interpreter state data
4533structure hold global data that is shared by all threads in an
4534interpreter, for example the module administration
4535(\code{sys.modules}). Depending on your needs, you can either create
4536a new interpreter state data structure, or share the interpreter state
4537data structure used by the Python main thread (to access the latter,
Fred Drakef8830d11998-04-23 14:06:01 +00004538you must obtain the thread state and access its \member{interp} member;
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004539this must be done by a thread that is created by Python or by the main
4540thread after Python is initialized).
4541
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004542
4543\begin{ctypedesc}{PyInterpreterState}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004544This data structure represents the state shared by a number of
4545cooperating threads. Threads belonging to the same interpreter
4546share their module administration and a few other internal items.
4547There are no public members in this structure.
4548
4549Threads belonging to different interpreters initially share nothing,
4550except process state like available memory, open file descriptors and
4551such. The global interpreter lock is also shared by all threads,
4552regardless of to which interpreter they belong.
4553\end{ctypedesc}
4554
4555\begin{ctypedesc}{PyThreadState}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004556This data structure represents the state of a single thread. The only
Fred Drakef8830d11998-04-23 14:06:01 +00004557public data member is \ctype{PyInterpreterState *}\member{interp},
4558which points to this thread's interpreter state.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004559\end{ctypedesc}
4560
4561\begin{cfuncdesc}{void}{PyEval_InitThreads}{}
4562Initialize and acquire the global interpreter lock. It should be
4563called in the main thread before creating a second thread or engaging
Fred Drakee058b4f1998-02-16 06:15:35 +00004564in any other thread operations such as
Fred Drake659ebfa2000-04-03 15:42:13 +00004565\cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()} or
4566\code{PyEval_ReleaseThread(\var{tstate})}\ttindex{PyEval_ReleaseThread()}.
4567It is not needed before calling
4568\cfunction{PyEval_SaveThread()}\ttindex{PyEval_SaveThread()} or
4569\cfunction{PyEval_RestoreThread()}\ttindex{PyEval_RestoreThread()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004570
4571This is a no-op when called for a second time. It is safe to call
Fred Drake659ebfa2000-04-03 15:42:13 +00004572this function before calling
4573\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004574
4575When only the main thread exists, no lock operations are needed. This
4576is a common situation (most Python programs do not use threads), and
4577the lock operations slow the interpreter down a bit. Therefore, the
4578lock is not created initially. This situation is equivalent to having
4579acquired the lock: when there is only a single thread, all object
4580accesses are safe. Therefore, when this function initializes the
Fred Drake4de05a91998-02-16 14:25:26 +00004581lock, it also acquires it. Before the Python
4582\module{thread}\refbimodindex{thread} module creates a new thread,
4583knowing that either it has the lock or the lock hasn't been created
4584yet, it calls \cfunction{PyEval_InitThreads()}. When this call
4585returns, it is guaranteed that the lock has been created and that it
4586has acquired it.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004587
4588It is \strong{not} safe to call this function when it is unknown which
4589thread (if any) currently has the global interpreter lock.
4590
4591This function is not available when thread support is disabled at
4592compile time.
4593\end{cfuncdesc}
4594
Guido van Rossum4a944d71997-08-14 20:35:38 +00004595\begin{cfuncdesc}{void}{PyEval_AcquireLock}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004596Acquire the global interpreter lock. The lock must have been created
4597earlier. If this thread already has the lock, a deadlock ensues.
4598This function is not available when thread support is disabled at
4599compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004600\end{cfuncdesc}
4601
4602\begin{cfuncdesc}{void}{PyEval_ReleaseLock}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004603Release the global interpreter lock. The lock must have been created
4604earlier. This function is not available when thread support is
Fred Drakee058b4f1998-02-16 06:15:35 +00004605disabled at compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004606\end{cfuncdesc}
4607
4608\begin{cfuncdesc}{void}{PyEval_AcquireThread}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004609Acquire the global interpreter lock and then set the current thread
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004610state to \var{tstate}, which should not be \NULL{}. The lock must
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004611have been created earlier. If this thread already has the lock,
4612deadlock ensues. This function is not available when thread support
Fred Drakee058b4f1998-02-16 06:15:35 +00004613is disabled at compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004614\end{cfuncdesc}
4615
4616\begin{cfuncdesc}{void}{PyEval_ReleaseThread}{PyThreadState *tstate}
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004617Reset the current thread state to \NULL{} and release the global
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004618interpreter lock. The lock must have been created earlier and must be
4619held by the current thread. The \var{tstate} argument, which must not
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004620be \NULL{}, is only used to check that it represents the current
Fred Drakee058b4f1998-02-16 06:15:35 +00004621thread state --- if it isn't, a fatal error is reported. This
4622function is not available when thread support is disabled at compile
4623time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004624\end{cfuncdesc}
4625
Fred Drakec6fa34e1998-04-02 06:47:24 +00004626\begin{cfuncdesc}{PyThreadState*}{PyEval_SaveThread}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004627Release the interpreter lock (if it has been created and thread
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004628support is enabled) and reset the thread state to \NULL{},
4629returning the previous thread state (which is not \NULL{}). If
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004630the lock has been created, the current thread must have acquired it.
4631(This function is available even when thread support is disabled at
4632compile time.)
Guido van Rossum4a944d71997-08-14 20:35:38 +00004633\end{cfuncdesc}
4634
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004635\begin{cfuncdesc}{void}{PyEval_RestoreThread}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004636Acquire the interpreter lock (if it has been created and thread
4637support is enabled) and set the thread state to \var{tstate}, which
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004638must not be \NULL{}. If the lock has been created, the current
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004639thread must not have acquired it, otherwise deadlock ensues. (This
4640function is available even when thread support is disabled at compile
4641time.)
Guido van Rossum4a944d71997-08-14 20:35:38 +00004642\end{cfuncdesc}
4643
Fred Drake659ebfa2000-04-03 15:42:13 +00004644The following macros are normally used without a trailing semicolon;
4645look for example usage in the Python source distribution.
4646
4647\begin{csimplemacrodesc}{Py_BEGIN_ALLOW_THREADS}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004648This macro expands to
Fred Drakee058b4f1998-02-16 06:15:35 +00004649\samp{\{ PyThreadState *_save; _save = PyEval_SaveThread();}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004650Note that it contains an opening brace; it must be matched with a
4651following \code{Py_END_ALLOW_THREADS} macro. See above for further
4652discussion of this macro. It is a no-op when thread support is
4653disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004654\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004655
Fred Drake659ebfa2000-04-03 15:42:13 +00004656\begin{csimplemacrodesc}{Py_END_ALLOW_THREADS}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004657This macro expands to
Fred Drakee058b4f1998-02-16 06:15:35 +00004658\samp{PyEval_RestoreThread(_save); \}}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004659Note that it contains a closing brace; it must be matched with an
4660earlier \code{Py_BEGIN_ALLOW_THREADS} macro. See above for further
4661discussion of this macro. It is a no-op when thread support is
4662disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004663\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004664
Thomas Wouterse30ac572001-07-09 14:35:01 +00004665\begin{csimplemacrodesc}{Py_BLOCK_THREADS}
Fred Drakebab29652001-07-10 16:10:08 +00004666This macro expands to \samp{PyEval_RestoreThread(_save);}: it
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004667is equivalent to \code{Py_END_ALLOW_THREADS} without the closing
4668brace. It is a no-op when thread support is disabled at compile
4669time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004670\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004671
Thomas Wouterse30ac572001-07-09 14:35:01 +00004672\begin{csimplemacrodesc}{Py_UNBLOCK_THREADS}
Fred Drakebab29652001-07-10 16:10:08 +00004673This macro expands to \samp{_save = PyEval_SaveThread();}: it is
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004674equivalent to \code{Py_BEGIN_ALLOW_THREADS} without the opening brace
4675and variable declaration. It is a no-op when thread support is
4676disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004677\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004678
4679All of the following functions are only available when thread support
4680is enabled at compile time, and must be called only when the
Fred Drake9d20ac31998-02-16 15:27:08 +00004681interpreter lock has been created.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004682
Fred Drakec6fa34e1998-04-02 06:47:24 +00004683\begin{cfuncdesc}{PyInterpreterState*}{PyInterpreterState_New}{}
Guido van Rossumed9dcc11998-08-07 18:28:03 +00004684Create a new interpreter state object. The interpreter lock need not
4685be held, but may be held if it is necessary to serialize calls to this
4686function.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004687\end{cfuncdesc}
4688
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004689\begin{cfuncdesc}{void}{PyInterpreterState_Clear}{PyInterpreterState *interp}
4690Reset all information in an interpreter state object. The interpreter
4691lock must be held.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004692\end{cfuncdesc}
4693
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004694\begin{cfuncdesc}{void}{PyInterpreterState_Delete}{PyInterpreterState *interp}
4695Destroy an interpreter state object. The interpreter lock need not be
4696held. The interpreter state must have been reset with a previous
Fred Drakee058b4f1998-02-16 06:15:35 +00004697call to \cfunction{PyInterpreterState_Clear()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004698\end{cfuncdesc}
4699
Fred Drakec6fa34e1998-04-02 06:47:24 +00004700\begin{cfuncdesc}{PyThreadState*}{PyThreadState_New}{PyInterpreterState *interp}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004701Create a new thread state object belonging to the given interpreter
Guido van Rossumed9dcc11998-08-07 18:28:03 +00004702object. The interpreter lock need not be held, but may be held if it
4703is necessary to serialize calls to this function.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004704\end{cfuncdesc}
4705
4706\begin{cfuncdesc}{void}{PyThreadState_Clear}{PyThreadState *tstate}
4707Reset all information in a thread state object. The interpreter lock
4708must be held.
4709\end{cfuncdesc}
4710
4711\begin{cfuncdesc}{void}{PyThreadState_Delete}{PyThreadState *tstate}
4712Destroy a thread state object. The interpreter lock need not be
4713held. The thread state must have been reset with a previous
Fred Drakee058b4f1998-02-16 06:15:35 +00004714call to \cfunction{PyThreadState_Clear()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004715\end{cfuncdesc}
4716
Fred Drakec6fa34e1998-04-02 06:47:24 +00004717\begin{cfuncdesc}{PyThreadState*}{PyThreadState_Get}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004718Return the current thread state. The interpreter lock must be held.
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004719When the current thread state is \NULL{}, this issues a fatal
Guido van Rossum5b8a5231997-12-30 04:38:44 +00004720error (so that the caller needn't check for \NULL{}).
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004721\end{cfuncdesc}
4722
Fred Drakec6fa34e1998-04-02 06:47:24 +00004723\begin{cfuncdesc}{PyThreadState*}{PyThreadState_Swap}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004724Swap the current thread state with the thread state given by the
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004725argument \var{tstate}, which may be \NULL{}. The interpreter lock
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004726must be held.
4727\end{cfuncdesc}
4728
Fred Drake24e62192001-05-21 15:56:55 +00004729\begin{cfuncdesc}{PyObject*}{PyThreadState_GetDict}{}
4730Return a dictionary in which extensions can store thread-specific
4731state information. Each extension should use a unique key to use to
4732store state in the dictionary. If this function returns \NULL, an
4733exception has been raised and the caller should allow it to
4734propogate.
4735\end{cfuncdesc}
4736
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004737
Fred Drake68db7302001-07-17 19:48:30 +00004738\section{Profiling and Tracing \label{profiling}}
4739
4740\sectionauthor{Fred L. Drake, Jr.}{fdrake@acm.org}
4741
4742The Python interpreter provides some low-level support for attaching
4743profiling and execution tracing facilities. These are used for
4744profiling, debugging, and coverage analysis tools.
4745
4746Starting with Python 2.2, the implementation of this facility was
4747substantially revised, and an interface from C was added. This C
4748interface allows the profiling or tracing code to avoid the overhead
4749of calling through Python-level callable objects, making a direct C
4750function call instead. The essential attributes of the facility have
4751not changed; the interface allows trace functions to be installed
4752per-thread, and the basic events reported to the trace function are
4753the same as had been reported to the Python-level trace functions in
4754previous versions.
4755
4756\begin{ctypedesc}[Py_tracefunc]{int (*Py_tracefunc)(PyObject *obj,
4757 PyFrameObject *frame, int what,
4758 PyObject *arg)}
4759 The type of the trace function registered using
4760 \cfunction{PyEval_SetProfile()} and \cfunction{PyEval_SetTrace()}.
4761 The first parameter is the object passed to the registration
4762 function,
4763\end{ctypedesc}
4764
4765\begin{cvardesc}{int}{PyTrace_CALL}
4766 The value of the \var{what} parameter to a \ctype{Py_tracefunc}
4767 function when a new function or method call is being reported.
4768\end{cvardesc}
4769
4770\begin{cvardesc}{int}{PyTrace_EXCEPT}
4771\end{cvardesc}
4772
4773\begin{cvardesc}{int}{PyTrace_LINE}
4774 The value passed as the \var{what} parameter to a trace function
4775 (but not a profiling function) when a line-number event is being
4776 reported.
4777\end{cvardesc}
4778
4779\begin{cvardesc}{int}{PyTrace_RETURN}
4780 The value for the \var{what} parameter to \ctype{Py_tracefunc}
4781 functions when a call is returning without propogating an exception.
4782\end{cvardesc}
4783
4784\begin{cfuncdesc}{void}{PyEval_SetProfile}{Py_tracefunc func, PyObject *obj}
4785\end{cfuncdesc}
4786
4787\begin{cfuncdesc}{void}{PyEval_SetTrace}{Py_tracefunc func, PyObject *obj}
4788\end{cfuncdesc}
4789
4790
Fred Drake659ebfa2000-04-03 15:42:13 +00004791\chapter{Memory Management \label{memory}}
4792\sectionauthor{Vladimir Marangozov}{Vladimir.Marangozov@inrialpes.fr}
4793
4794
4795\section{Overview \label{memoryOverview}}
4796
4797Memory management in Python involves a private heap containing all
4798Python objects and data structures. The management of this private
4799heap is ensured internally by the \emph{Python memory manager}. The
4800Python memory manager has different components which deal with various
4801dynamic storage management aspects, like sharing, segmentation,
4802preallocation or caching.
4803
4804At the lowest level, a raw memory allocator ensures that there is
4805enough room in the private heap for storing all Python-related data
4806by interacting with the memory manager of the operating system. On top
4807of the raw memory allocator, several object-specific allocators
4808operate on the same heap and implement distinct memory management
4809policies adapted to the peculiarities of every object type. For
4810example, integer objects are managed differently within the heap than
4811strings, tuples or dictionaries because integers imply different
4812storage requirements and speed/space tradeoffs. The Python memory
4813manager thus delegates some of the work to the object-specific
4814allocators, but ensures that the latter operate within the bounds of
4815the private heap.
4816
4817It is important to understand that the management of the Python heap
4818is performed by the interpreter itself and that the user has no
4819control on it, even if she regularly manipulates object pointers to
4820memory blocks inside that heap. The allocation of heap space for
4821Python objects and other internal buffers is performed on demand by
4822the Python memory manager through the Python/C API functions listed in
4823this document.
4824
4825To avoid memory corruption, extension writers should never try to
4826operate on Python objects with the functions exported by the C
4827library: \cfunction{malloc()}\ttindex{malloc()},
4828\cfunction{calloc()}\ttindex{calloc()},
4829\cfunction{realloc()}\ttindex{realloc()} and
4830\cfunction{free()}\ttindex{free()}. This will result in
4831mixed calls between the C allocator and the Python memory manager
4832with fatal consequences, because they implement different algorithms
4833and operate on different heaps. However, one may safely allocate and
4834release memory blocks with the C library allocator for individual
4835purposes, as shown in the following example:
4836
4837\begin{verbatim}
4838 PyObject *res;
4839 char *buf = (char *) malloc(BUFSIZ); /* for I/O */
4840
4841 if (buf == NULL)
4842 return PyErr_NoMemory();
4843 ...Do some I/O operation involving buf...
4844 res = PyString_FromString(buf);
4845 free(buf); /* malloc'ed */
4846 return res;
4847\end{verbatim}
4848
4849In this example, the memory request for the I/O buffer is handled by
4850the C library allocator. The Python memory manager is involved only
4851in the allocation of the string object returned as a result.
4852
4853In most situations, however, it is recommended to allocate memory from
4854the Python heap specifically because the latter is under control of
4855the Python memory manager. For example, this is required when the
4856interpreter is extended with new object types written in C. Another
4857reason for using the Python heap is the desire to \emph{inform} the
4858Python memory manager about the memory needs of the extension module.
4859Even when the requested memory is used exclusively for internal,
4860highly-specific purposes, delegating all memory requests to the Python
4861memory manager causes the interpreter to have a more accurate image of
4862its memory footprint as a whole. Consequently, under certain
4863circumstances, the Python memory manager may or may not trigger
4864appropriate actions, like garbage collection, memory compaction or
4865other preventive procedures. Note that by using the C library
4866allocator as shown in the previous example, the allocated memory for
4867the I/O buffer escapes completely the Python memory manager.
4868
4869
4870\section{Memory Interface \label{memoryInterface}}
4871
4872The following function sets, modeled after the ANSI C standard, are
4873available for allocating and releasing memory from the Python heap:
4874
4875
Fred Drake7d45d342000-08-11 17:07:32 +00004876\begin{cfuncdesc}{void*}{PyMem_Malloc}{size_t n}
4877Allocates \var{n} bytes and returns a pointer of type \ctype{void*} to
Fred Drakebab29652001-07-10 16:10:08 +00004878the allocated memory, or \NULL{} if the request fails. Requesting zero
Fred Drake659ebfa2000-04-03 15:42:13 +00004879bytes returns a non-\NULL{} pointer.
Fred Drakebab29652001-07-10 16:10:08 +00004880The memory will not have been initialized in any way.
Fred Drake659ebfa2000-04-03 15:42:13 +00004881\end{cfuncdesc}
4882
Fred Drake7d45d342000-08-11 17:07:32 +00004883\begin{cfuncdesc}{void*}{PyMem_Realloc}{void *p, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004884Resizes the memory block pointed to by \var{p} to \var{n} bytes. The
4885contents will be unchanged to the minimum of the old and the new
4886sizes. If \var{p} is \NULL{}, the call is equivalent to
Fred Drakebab29652001-07-10 16:10:08 +00004887\cfunction{PyMem_Malloc(\var{n})}; if \var{n} is equal to zero, the
4888memory block is resized but is not freed, and the returned pointer is
4889non-\NULL{}. Unless \var{p} is \NULL{}, it must have been returned by
4890a previous call to \cfunction{PyMem_Malloc()} or
4891\cfunction{PyMem_Realloc()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004892\end{cfuncdesc}
4893
Fred Drake7d45d342000-08-11 17:07:32 +00004894\begin{cfuncdesc}{void}{PyMem_Free}{void *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00004895Frees the memory block pointed to by \var{p}, which must have been
4896returned by a previous call to \cfunction{PyMem_Malloc()} or
4897\cfunction{PyMem_Realloc()}. Otherwise, or if
4898\cfunction{PyMem_Free(p)} has been called before, undefined behaviour
4899occurs. If \var{p} is \NULL{}, no operation is performed.
4900\end{cfuncdesc}
4901
Fred Drake659ebfa2000-04-03 15:42:13 +00004902The following type-oriented macros are provided for convenience. Note
4903that \var{TYPE} refers to any C type.
4904
Fred Drakef913e542000-09-12 20:17:17 +00004905\begin{cfuncdesc}{\var{TYPE}*}{PyMem_New}{TYPE, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004906Same as \cfunction{PyMem_Malloc()}, but allocates \code{(\var{n} *
4907sizeof(\var{TYPE}))} bytes of memory. Returns a pointer cast to
4908\ctype{\var{TYPE}*}.
Fred Drakebab29652001-07-10 16:10:08 +00004909The memory will not have been initialized in any way.
Fred Drake659ebfa2000-04-03 15:42:13 +00004910\end{cfuncdesc}
4911
Fred Drakef913e542000-09-12 20:17:17 +00004912\begin{cfuncdesc}{\var{TYPE}*}{PyMem_Resize}{void *p, TYPE, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004913Same as \cfunction{PyMem_Realloc()}, but the memory block is resized
4914to \code{(\var{n} * sizeof(\var{TYPE}))} bytes. Returns a pointer
4915cast to \ctype{\var{TYPE}*}.
4916\end{cfuncdesc}
4917
Fred Drakef913e542000-09-12 20:17:17 +00004918\begin{cfuncdesc}{void}{PyMem_Del}{void *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00004919Same as \cfunction{PyMem_Free()}.
4920\end{cfuncdesc}
4921
Fred Drakef913e542000-09-12 20:17:17 +00004922In addition, the following macro sets are provided for calling the
4923Python memory allocator directly, without involving the C API functions
4924listed above. However, note that their use does not preserve binary
4925compatibility accross Python versions and is therefore deprecated in
4926extension modules.
4927
4928\cfunction{PyMem_MALLOC()}, \cfunction{PyMem_REALLOC()}, \cfunction{PyMem_FREE()}.
4929
4930\cfunction{PyMem_NEW()}, \cfunction{PyMem_RESIZE()}, \cfunction{PyMem_DEL()}.
4931
Fred Drake659ebfa2000-04-03 15:42:13 +00004932
4933\section{Examples \label{memoryExamples}}
4934
4935Here is the example from section \ref{memoryOverview}, rewritten so
4936that the I/O buffer is allocated from the Python heap by using the
4937first function set:
4938
4939\begin{verbatim}
4940 PyObject *res;
4941 char *buf = (char *) PyMem_Malloc(BUFSIZ); /* for I/O */
4942
4943 if (buf == NULL)
4944 return PyErr_NoMemory();
4945 /* ...Do some I/O operation involving buf... */
4946 res = PyString_FromString(buf);
4947 PyMem_Free(buf); /* allocated with PyMem_Malloc */
4948 return res;
4949\end{verbatim}
4950
Fred Drakef913e542000-09-12 20:17:17 +00004951The same code using the type-oriented function set:
Fred Drake659ebfa2000-04-03 15:42:13 +00004952
4953\begin{verbatim}
4954 PyObject *res;
Fred Drakef913e542000-09-12 20:17:17 +00004955 char *buf = PyMem_New(char, BUFSIZ); /* for I/O */
Fred Drake659ebfa2000-04-03 15:42:13 +00004956
4957 if (buf == NULL)
4958 return PyErr_NoMemory();
4959 /* ...Do some I/O operation involving buf... */
4960 res = PyString_FromString(buf);
Fred Drakef913e542000-09-12 20:17:17 +00004961 PyMem_Del(buf); /* allocated with PyMem_New */
Fred Drake659ebfa2000-04-03 15:42:13 +00004962 return res;
4963\end{verbatim}
4964
Fred Drakef913e542000-09-12 20:17:17 +00004965Note that in the two examples above, the buffer is always
4966manipulated via functions belonging to the same set. Indeed, it
Fred Drake659ebfa2000-04-03 15:42:13 +00004967is required to use the same memory API family for a given
4968memory block, so that the risk of mixing different allocators is
4969reduced to a minimum. The following code sequence contains two errors,
4970one of which is labeled as \emph{fatal} because it mixes two different
4971allocators operating on different heaps.
4972
4973\begin{verbatim}
Fred Drakef913e542000-09-12 20:17:17 +00004974char *buf1 = PyMem_New(char, BUFSIZ);
Fred Drake659ebfa2000-04-03 15:42:13 +00004975char *buf2 = (char *) malloc(BUFSIZ);
4976char *buf3 = (char *) PyMem_Malloc(BUFSIZ);
4977...
Fred Drakef913e542000-09-12 20:17:17 +00004978PyMem_Del(buf3); /* Wrong -- should be PyMem_Free() */
Fred Drake659ebfa2000-04-03 15:42:13 +00004979free(buf2); /* Right -- allocated via malloc() */
Fred Drakef913e542000-09-12 20:17:17 +00004980free(buf1); /* Fatal -- should be PyMem_Del() */
Fred Drake659ebfa2000-04-03 15:42:13 +00004981\end{verbatim}
4982
4983In addition to the functions aimed at handling raw memory blocks from
4984the Python heap, objects in Python are allocated and released with
Fred Drakef913e542000-09-12 20:17:17 +00004985\cfunction{PyObject_New()}, \cfunction{PyObject_NewVar()} and
4986\cfunction{PyObject_Del()}, or with their corresponding macros
4987\cfunction{PyObject_NEW()}, \cfunction{PyObject_NEW_VAR()} and
Fred Drakee06f0f92000-06-30 15:52:39 +00004988\cfunction{PyObject_DEL()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004989
Fred Drakee06f0f92000-06-30 15:52:39 +00004990These will be explained in the next chapter on defining and
4991implementing new object types in C.
Fred Drake659ebfa2000-04-03 15:42:13 +00004992
4993
Fred Drakeefd146c1999-02-15 15:30:45 +00004994\chapter{Defining New Object Types \label{newTypes}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004995
Fred Drake88fdaa72001-07-20 20:56:11 +00004996
4997\section{Allocating Objects on the Heap
4998 \label{allocating-objects}}
4999
Fred Drakec6fa34e1998-04-02 06:47:24 +00005000\begin{cfuncdesc}{PyObject*}{_PyObject_New}{PyTypeObject *type}
Fred Drakee058b4f1998-02-16 06:15:35 +00005001\end{cfuncdesc}
5002
Fred Drakef913e542000-09-12 20:17:17 +00005003\begin{cfuncdesc}{PyVarObject*}{_PyObject_NewVar}{PyTypeObject *type, int size}
Fred Drakee058b4f1998-02-16 06:15:35 +00005004\end{cfuncdesc}
5005
Fred Drakef913e542000-09-12 20:17:17 +00005006\begin{cfuncdesc}{void}{_PyObject_Del}{PyObject *op}
Fred Drakee058b4f1998-02-16 06:15:35 +00005007\end{cfuncdesc}
5008
Fred Drakef913e542000-09-12 20:17:17 +00005009\begin{cfuncdesc}{PyObject*}{PyObject_Init}{PyObject *op,
Fred Drakebab29652001-07-10 16:10:08 +00005010 PyTypeObject *type}
5011 Initialize a newly-allocated object \var{op} with its type and
5012 initial reference. Returns the initialized object. If \var{type}
5013 indicates that the object participates in the cyclic garbage
5014 detector, it it added to the detector's set of observed objects.
5015 Other fields of the object are not affected.
Fred Drakef913e542000-09-12 20:17:17 +00005016\end{cfuncdesc}
5017
5018\begin{cfuncdesc}{PyVarObject*}{PyObject_InitVar}{PyVarObject *op,
Fred Drakebab29652001-07-10 16:10:08 +00005019 PyTypeObject *type, int size}
5020 This does everything \cfunction{PyObject_Init()} does, and also
5021 initializes the length information for a variable-size object.
Fred Drakef913e542000-09-12 20:17:17 +00005022\end{cfuncdesc}
5023
5024\begin{cfuncdesc}{\var{TYPE}*}{PyObject_New}{TYPE, PyTypeObject *type}
Fred Drakebab29652001-07-10 16:10:08 +00005025 Allocate a new Python object using the C structure type \var{TYPE}
5026 and the Python type object \var{type}. Fields not defined by the
5027 Python object header are not initialized; the object's reference
5028 count will be one. The size of the memory
5029 allocation is determined from the \member{tp_basicsize} field of the
5030 type object.
Fred Drakef913e542000-09-12 20:17:17 +00005031\end{cfuncdesc}
5032
5033\begin{cfuncdesc}{\var{TYPE}*}{PyObject_NewVar}{TYPE, PyTypeObject *type,
5034 int size}
Fred Drakebab29652001-07-10 16:10:08 +00005035 Allocate a new Python object using the C structure type \var{TYPE}
5036 and the Python type object \var{type}. Fields not defined by the
5037 Python object header are not initialized. The allocated memory
5038 allows for the \var{TYPE} structure plus \var{size} fields of the
5039 size given by the \member{tp_itemsize} field of \var{type}. This is
5040 useful for implementing objects like tuples, which are able to
5041 determine their size at construction time. Embedding the array of
5042 fields into the same allocation decreases the number of allocations,
5043 improving the memory management efficiency.
Fred Drakef913e542000-09-12 20:17:17 +00005044\end{cfuncdesc}
5045
5046\begin{cfuncdesc}{void}{PyObject_Del}{PyObject *op}
Fred Drakebab29652001-07-10 16:10:08 +00005047 Releases memory allocated to an object using
5048 \cfunction{PyObject_New()} or \cfunction{PyObject_NewVar()}. This
5049 is normally called from the \member{tp_dealloc} handler specified in
5050 the object's type. The fields of the object should not be accessed
5051 after this call as the memory is no longer a valid Python object.
Fred Drakef913e542000-09-12 20:17:17 +00005052\end{cfuncdesc}
5053
5054\begin{cfuncdesc}{\var{TYPE}*}{PyObject_NEW}{TYPE, PyTypeObject *type}
Fred Drakebab29652001-07-10 16:10:08 +00005055 Macro version of \cfunction{PyObject_New()}, to gain performance at
5056 the expense of safety. This does not check \var{type} for a \NULL{}
5057 value.
Fred Drakef913e542000-09-12 20:17:17 +00005058\end{cfuncdesc}
5059
5060\begin{cfuncdesc}{\var{TYPE}*}{PyObject_NEW_VAR}{TYPE, PyTypeObject *type,
5061 int size}
Fred Drakebab29652001-07-10 16:10:08 +00005062 Macro version of \cfunction{PyObject_NewVar()}, to gain performance
5063 at the expense of safety. This does not check \var{type} for a
5064 \NULL{} value.
Fred Drakef913e542000-09-12 20:17:17 +00005065\end{cfuncdesc}
5066
5067\begin{cfuncdesc}{void}{PyObject_DEL}{PyObject *op}
Fred Drakebab29652001-07-10 16:10:08 +00005068 Macro version of \cfunction{PyObject_Del()}.
Fred Drakee058b4f1998-02-16 06:15:35 +00005069\end{cfuncdesc}
5070
Fred Drakeee814bf2000-11-28 22:34:32 +00005071\begin{cfuncdesc}{PyObject*}{Py_InitModule}{char *name,
5072 PyMethodDef *methods}
5073 Create a new module object based on a name and table of functions,
5074 returning the new module object.
5075\end{cfuncdesc}
5076
5077\begin{cfuncdesc}{PyObject*}{Py_InitModule3}{char *name,
5078 PyMethodDef *methods,
5079 char *doc}
5080 Create a new module object based on a name and table of functions,
5081 returning the new module object. If \var{doc} is non-\NULL, it will
5082 be used to define the docstring for the module.
5083\end{cfuncdesc}
5084
5085\begin{cfuncdesc}{PyObject*}{Py_InitModule4}{char *name,
5086 PyMethodDef *methods,
5087 char *doc, PyObject *self,
5088 int apiver}
5089 Create a new module object based on a name and table of functions,
5090 returning the new module object. If \var{doc} is non-\NULL, it will
5091 be used to define the docstring for the module. If \var{self} is
5092 non-\NULL, it will passed to the functions of the module as their
5093 (otherwise \NULL) first parameter. (This was added as an
5094 experimental feature, and there are no known uses in the current
5095 version of Python.) For \var{apiver}, the only value which should
5096 be passed is defined by the constant \constant{PYTHON_API_VERSION}.
5097
5098 \strong{Note:} Most uses of this function should probably be using
5099 the \cfunction{Py_InitModule3()} instead; only use this if you are
5100 sure you need it.
5101\end{cfuncdesc}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00005102
Fred Drake659ebfa2000-04-03 15:42:13 +00005103DL_IMPORT
5104
Fred Drakebab29652001-07-10 16:10:08 +00005105\begin{cvardesc}{PyObject}{_Py_NoneStruct}
5106 Object which is visible in Python as \code{None}. This should only
5107 be accessed using the \code{Py_None} macro, which evaluates to a
5108 pointer to this object.
5109\end{cvardesc}
Fred Drake659ebfa2000-04-03 15:42:13 +00005110
5111
5112\section{Common Object Structures \label{common-structs}}
5113
Guido van Rossumae110af1997-05-22 20:11:52 +00005114PyObject, PyVarObject
5115
5116PyObject_HEAD, PyObject_HEAD_INIT, PyObject_VAR_HEAD
5117
5118Typedefs:
5119unaryfunc, binaryfunc, ternaryfunc, inquiry, coercion, intargfunc,
5120intintargfunc, intobjargproc, intintobjargproc, objobjargproc,
Guido van Rossumae110af1997-05-22 20:11:52 +00005121destructor, printfunc, getattrfunc, getattrofunc, setattrfunc,
5122setattrofunc, cmpfunc, reprfunc, hashfunc
5123
Fred Drakea8455ab2000-06-16 19:58:42 +00005124\begin{ctypedesc}{PyCFunction}
5125Type of the functions used to implement most Python callables in C.
5126\end{ctypedesc}
5127
5128\begin{ctypedesc}{PyMethodDef}
5129Structure used to describe a method of an extension type. This
5130structure has four fields:
5131
5132\begin{tableiii}{l|l|l}{member}{Field}{C Type}{Meaning}
5133 \lineiii{ml_name}{char *}{name of the method}
5134 \lineiii{ml_meth}{PyCFunction}{pointer to the C implementation}
5135 \lineiii{ml_flags}{int}{flag bits indicating how the call should be
5136 constructed}
5137 \lineiii{ml_doc}{char *}{points to the contents of the docstring}
5138\end{tableiii}
5139\end{ctypedesc}
5140
5141\begin{cfuncdesc}{PyObject*}{Py_FindMethod}{PyMethodDef[] table,
5142 PyObject *ob, char *name}
5143Return a bound method object for an extension type implemented in C.
5144This function also handles the special attribute \member{__methods__},
5145returning a list of all the method names defined in \var{table}.
5146\end{cfuncdesc}
5147
Fred Drake659ebfa2000-04-03 15:42:13 +00005148
5149\section{Mapping Object Structures \label{mapping-structs}}
5150
5151\begin{ctypedesc}{PyMappingMethods}
5152Structure used to hold pointers to the functions used to implement the
5153mapping protocol for an extension type.
5154\end{ctypedesc}
5155
5156
5157\section{Number Object Structures \label{number-structs}}
5158
5159\begin{ctypedesc}{PyNumberMethods}
5160Structure used to hold pointers to the functions an extension type
5161uses to implement the number protocol.
5162\end{ctypedesc}
5163
5164
5165\section{Sequence Object Structures \label{sequence-structs}}
5166
5167\begin{ctypedesc}{PySequenceMethods}
5168Structure used to hold pointers to the functions which an object uses
5169to implement the sequence protocol.
5170\end{ctypedesc}
5171
5172
5173\section{Buffer Object Structures \label{buffer-structs}}
5174\sectionauthor{Greg J. Stein}{greg@lyra.org}
5175
5176The buffer interface exports a model where an object can expose its
5177internal data as a set of chunks of data, where each chunk is
5178specified as a pointer/length pair. These chunks are called
5179\dfn{segments} and are presumed to be non-contiguous in memory.
5180
5181If an object does not export the buffer interface, then its
5182\member{tp_as_buffer} member in the \ctype{PyTypeObject} structure
5183should be \NULL{}. Otherwise, the \member{tp_as_buffer} will point to
5184a \ctype{PyBufferProcs} structure.
5185
5186\strong{Note:} It is very important that your
Fred Drakec392b572001-03-21 22:15:01 +00005187\ctype{PyTypeObject} structure uses \constant{Py_TPFLAGS_DEFAULT} for
5188the value of the \member{tp_flags} member rather than \code{0}. This
Fred Drake659ebfa2000-04-03 15:42:13 +00005189tells the Python runtime that your \ctype{PyBufferProcs} structure
5190contains the \member{bf_getcharbuffer} slot. Older versions of Python
5191did not have this member, so a new Python interpreter using an old
5192extension needs to be able to test for its presence before using it.
5193
5194\begin{ctypedesc}{PyBufferProcs}
5195Structure used to hold the function pointers which define an
5196implementation of the buffer protocol.
5197
5198The first slot is \member{bf_getreadbuffer}, of type
5199\ctype{getreadbufferproc}. If this slot is \NULL{}, then the object
5200does not support reading from the internal data. This is
5201non-sensical, so implementors should fill this in, but callers should
5202test that the slot contains a non-\NULL{} value.
5203
5204The next slot is \member{bf_getwritebuffer} having type
5205\ctype{getwritebufferproc}. This slot may be \NULL{} if the object
5206does not allow writing into its returned buffers.
5207
5208The third slot is \member{bf_getsegcount}, with type
5209\ctype{getsegcountproc}. This slot must not be \NULL{} and is used to
5210inform the caller how many segments the object contains. Simple
5211objects such as \ctype{PyString_Type} and
5212\ctype{PyBuffer_Type} objects contain a single segment.
5213
5214The last slot is \member{bf_getcharbuffer}, of type
5215\ctype{getcharbufferproc}. This slot will only be present if the
Fred Drakec392b572001-03-21 22:15:01 +00005216\constant{Py_TPFLAGS_HAVE_GETCHARBUFFER} flag is present in the
Fred Drake659ebfa2000-04-03 15:42:13 +00005217\member{tp_flags} field of the object's \ctype{PyTypeObject}. Before using
5218this slot, the caller should test whether it is present by using the
5219\cfunction{PyType_HasFeature()}\ttindex{PyType_HasFeature()} function.
5220If present, it may be \NULL, indicating that the object's contents
5221cannot be used as \emph{8-bit characters}.
5222The slot function may also raise an error if the object's contents
5223cannot be interpreted as 8-bit characters. For example, if the object
5224is an array which is configured to hold floating point values, an
5225exception may be raised if a caller attempts to use
5226\member{bf_getcharbuffer} to fetch a sequence of 8-bit characters.
5227This notion of exporting the internal buffers as ``text'' is used to
5228distinguish between objects that are binary in nature, and those which
5229have character-based content.
5230
5231\strong{Note:} The current policy seems to state that these characters
5232may be multi-byte characters. This implies that a buffer size of
5233\var{N} does not mean there are \var{N} characters present.
5234\end{ctypedesc}
5235
5236\begin{datadesc}{Py_TPFLAGS_HAVE_GETCHARBUFFER}
5237Flag bit set in the type structure to indicate that the
5238\member{bf_getcharbuffer} slot is known. This being set does not
5239indicate that the object supports the buffer interface or that the
5240\member{bf_getcharbuffer} slot is non-\NULL.
5241\end{datadesc}
5242
5243\begin{ctypedesc}[getreadbufferproc]{int (*getreadbufferproc)
5244 (PyObject *self, int segment, void **ptrptr)}
5245Return a pointer to a readable segment of the buffer. This function
5246is allowed to raise an exception, in which case it must return
5247\code{-1}. The \var{segment} which is passed must be zero or
5248positive, and strictly less than the number of segments returned by
Greg Stein4d4d0032001-04-07 16:14:49 +00005249the \member{bf_getsegcount} slot function. On success, it returns the
5250length of the buffer memory, and sets \code{*\var{ptrptr}} to a
5251pointer to that memory.
Fred Drake659ebfa2000-04-03 15:42:13 +00005252\end{ctypedesc}
5253
5254\begin{ctypedesc}[getwritebufferproc]{int (*getwritebufferproc)
5255 (PyObject *self, int segment, void **ptrptr)}
Greg Stein4d4d0032001-04-07 16:14:49 +00005256Return a pointer to a writable memory buffer in \code{*\var{ptrptr}},
5257and the length of that segment as the function return value.
5258The memory buffer must correspond to buffer segment \var{segment}.
Fred Drake58c5a2a1999-08-04 13:13:24 +00005259Must return \code{-1} and set an exception on error.
5260\exception{TypeError} should be raised if the object only supports
5261read-only buffers, and \exception{SystemError} should be raised when
5262\var{segment} specifies a segment that doesn't exist.
5263% Why doesn't it raise ValueError for this one?
Fred Drake659ebfa2000-04-03 15:42:13 +00005264% GJS: because you shouldn't be calling it with an invalid
5265% segment. That indicates a blatant programming error in the C
5266% code.
Fred Drake58c5a2a1999-08-04 13:13:24 +00005267\end{ctypedesc}
5268
Fred Drake659ebfa2000-04-03 15:42:13 +00005269\begin{ctypedesc}[getsegcountproc]{int (*getsegcountproc)
5270 (PyObject *self, int *lenp)}
5271Return the number of memory segments which comprise the buffer. If
5272\var{lenp} is not \NULL, the implementation must report the sum of the
5273sizes (in bytes) of all segments in \code{*\var{lenp}}.
5274The function cannot fail.
5275\end{ctypedesc}
Guido van Rossumae110af1997-05-22 20:11:52 +00005276
Fred Drake659ebfa2000-04-03 15:42:13 +00005277\begin{ctypedesc}[getcharbufferproc]{int (*getcharbufferproc)
5278 (PyObject *self, int segment, const char **ptrptr)}
5279\end{ctypedesc}
Guido van Rossumae110af1997-05-22 20:11:52 +00005280
Guido van Rossumae110af1997-05-22 20:11:52 +00005281
Fred Drakec392b572001-03-21 22:15:01 +00005282\section{Supporting Cyclic Garbarge Collection
5283 \label{supporting-cycle-detection}}
5284
5285Python's support for detecting and collecting garbage which involves
5286circular references requires support from object types which are
5287``containers'' for other objects which may also be containers. Types
5288which do not store references to other objects, or which only store
5289references to atomic types (such as numbers or strings), do not need
5290to provide any explicit support for garbage collection.
5291
5292To create a container type, the \member{tp_flags} field of the type
5293object must include the \constant{Py_TPFLAGS_GC} and provide an
Fred Drakee28d8ae2001-03-22 16:30:17 +00005294implementation of the \member{tp_traverse} handler. The computed
5295value of the \member{tp_basicsize} field must include
5296\constant{PyGC_HEAD_SIZE} as well. If instances of the type are
5297mutable, a \member{tp_clear} implementation must also be provided.
Fred Drakec392b572001-03-21 22:15:01 +00005298
5299\begin{datadesc}{Py_TPFLAGS_GC}
5300 Objects with a type with this flag set must conform with the rules
5301 documented here. For convenience these objects will be referred to
5302 as container objects.
5303\end{datadesc}
5304
5305\begin{datadesc}{PyGC_HEAD_SIZE}
5306 Extra memory needed for the garbage collector. Container objects
5307 must include this in the calculation of their tp_basicsize. If the
5308 collector is disabled at compile time then this is \code{0}.
5309\end{datadesc}
5310
Fred Drakee28d8ae2001-03-22 16:30:17 +00005311Constructors for container types must conform to two rules:
5312
5313\begin{enumerate}
5314\item The memory for the object must be allocated using
5315 \cfunction{PyObject_New()} or \cfunction{PyObject_VarNew()}.
5316
5317\item Once all the fields which may contain references to other
5318 containers are initialized, it must call
5319 \cfunction{PyObject_GC_Init()}.
5320\end{enumerate}
5321
Fred Drakec392b572001-03-21 22:15:01 +00005322\begin{cfuncdesc}{void}{PyObject_GC_Init}{PyObject *op}
5323 Adds the object \var{op} to the set of container objects tracked by
5324 the collector. The collector can run at unexpected times so objects
5325 must be valid while being tracked. This should be called once all
5326 the fields followed by the \member{tp_traverse} handler become valid,
5327 usually near the end of the constructor.
5328\end{cfuncdesc}
5329
Fred Drakee28d8ae2001-03-22 16:30:17 +00005330Similarly, the deallocator for the object must conform to a similar
5331pair of rules:
5332
5333\begin{enumerate}
5334\item Before fields which refer to other containers are invalidated,
5335 \cfunction{PyObject_GC_Fini()} must be called.
5336
5337\item The object's memory must be deallocated using
5338 \cfunction{PyObject_Del()}.
5339\end{enumerate}
5340
Fred Drakec392b572001-03-21 22:15:01 +00005341\begin{cfuncdesc}{void}{PyObject_GC_Fini}{PyObject *op}
5342 Remove the object \var{op} from the set of container objects tracked
5343 by the collector. Note that \cfunction{PyObject_GC_Init()} can be
5344 called again on this object to add it back to the set of tracked
5345 objects. The deallocator (\member{tp_dealloc} handler) should call
5346 this for the object before any of the fields used by the
5347 \member{tp_traverse} handler become invalid.
Fred Drake8f6df462001-03-23 17:42:09 +00005348
5349 \strong{Note:} Any container which may be referenced from another
5350 object reachable by the collector must itself be tracked by the
5351 collector, so it is generally not safe to call this function
5352 anywhere but in the object's deallocator.
Fred Drakec392b572001-03-21 22:15:01 +00005353\end{cfuncdesc}
5354
5355The \member{tp_traverse} handler accepts a function parameter of this
5356type:
5357
5358\begin{ctypedesc}[visitproc]{int (*visitproc)(PyObject *object, void *arg)}
5359 Type of the visitor function passed to the \member{tp_traverse}
5360 handler. The function should be called with an object to traverse
5361 as \var{object} and the third parameter to the \member{tp_traverse}
5362 handler as \var{arg}.
5363\end{ctypedesc}
5364
5365The \member{tp_traverse} handler must have the following type:
5366
5367\begin{ctypedesc}[traverseproc]{int (*traverseproc)(PyObject *self,
5368 visitproc visit, void *arg)}
5369 Traversal function for a container object. Implementations must
5370 call the \var{visit} function for each object directly contained by
5371 \var{self}, with the parameters to \var{visit} being the contained
5372 object and the \var{arg} value passed to the handler. If
5373 \var{visit} returns a non-zero value then an error has occurred and
5374 that value should be returned immediately.
5375\end{ctypedesc}
5376
5377The \member{tp_clear} handler must be of the \ctype{inquiry} type, or
5378\NULL{} if the object is immutable.
5379
5380\begin{ctypedesc}[inquiry]{int (*inquiry)(PyObject *self)}
5381 Drop references that may have created reference cycles. Immutable
5382 objects do not have to define this method since they can never
5383 directly create reference cycles. Note that the object must still
Fred Drakebab29652001-07-10 16:10:08 +00005384 be valid after calling this method (don't just call
Fred Drakec392b572001-03-21 22:15:01 +00005385 \cfunction{Py_DECREF()} on a reference). The collector will call
5386 this method if it detects that this object is involved in a
5387 reference cycle.
5388\end{ctypedesc}
5389
5390
Fred Drakee28d8ae2001-03-22 16:30:17 +00005391\subsection{Example Cycle Collector Support
5392 \label{example-cycle-support}}
5393
5394This example shows only enough of the implementation of an extension
5395type to show how the garbage collector support needs to be added. It
5396shows the definition of the object structure, the
5397\member{tp_traverse}, \member{tp_clear} and \member{tp_dealloc}
5398implementations, the type structure, and a constructor --- the module
5399initialization needed to export the constructor to Python is not shown
5400as there are no special considerations there for the collector. To
5401make this interesting, assume that the module exposes ways for the
5402\member{container} field of the object to be modified. Note that
5403since no checks are made on the type of the object used to initialize
5404\member{container}, we have to assume that it may be a container.
5405
5406\begin{verbatim}
5407#include "Python.h"
5408
5409typedef struct {
5410 PyObject_HEAD
5411 PyObject *container;
5412} MyObject;
5413
5414static int
5415my_traverse(MyObject *self, visitproc visit, void *arg)
5416{
5417 if (self->container != NULL)
5418 return visit(self->container, arg);
5419 else
5420 return 0;
5421}
5422
5423static int
5424my_clear(MyObject *self)
5425{
5426 Py_XDECREF(self->container);
5427 self->container = NULL;
5428
5429 return 0;
5430}
5431
5432static void
5433my_dealloc(MyObject *self)
5434{
5435 PyObject_GC_Fini((PyObject *) self);
5436 Py_XDECREF(self->container);
5437 PyObject_Del(self);
5438}
5439\end{verbatim}
5440
5441\begin{verbatim}
5442statichere PyTypeObject
5443MyObject_Type = {
5444 PyObject_HEAD_INIT(NULL)
5445 0,
5446 "MyObject",
5447 sizeof(MyObject) + PyGC_HEAD_SIZE,
5448 0,
5449 (destructor)my_dealloc, /* tp_dealloc */
5450 0, /* tp_print */
5451 0, /* tp_getattr */
5452 0, /* tp_setattr */
5453 0, /* tp_compare */
5454 0, /* tp_repr */
5455 0, /* tp_as_number */
5456 0, /* tp_as_sequence */
5457 0, /* tp_as_mapping */
5458 0, /* tp_hash */
5459 0, /* tp_call */
5460 0, /* tp_str */
5461 0, /* tp_getattro */
5462 0, /* tp_setattro */
5463 0, /* tp_as_buffer */
5464 Py_TPFLAGS_DEFAULT | Py_TPFLAGS_GC,
5465 0, /* tp_doc */
5466 (traverseproc)my_traverse, /* tp_traverse */
5467 (inquiry)my_clear, /* tp_clear */
5468 0, /* tp_richcompare */
5469 0, /* tp_weaklistoffset */
5470};
5471
5472/* This constructor should be made accessible from Python. */
5473static PyObject *
5474new_object(PyObject *unused, PyObject *args)
5475{
5476 PyObject *container = NULL;
5477 MyObject *result = NULL;
5478
5479 if (PyArg_ParseTuple(args, "|O:new_object", &container)) {
5480 result = PyObject_New(MyObject, &MyObject_Type);
5481 if (result != NULL) {
5482 result->container = container;
5483 PyObject_GC_Init();
5484 }
5485 }
5486 return (PyObject *) result;
5487}
5488\end{verbatim}
5489
5490
Fred Drake659ebfa2000-04-03 15:42:13 +00005491% \chapter{Debugging \label{debugging}}
5492%
5493% XXX Explain Py_DEBUG, Py_TRACE_REFS, Py_REF_DEBUG.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00005494
5495
Fred Drakeed773ef2000-09-21 21:35:22 +00005496\appendix
5497\chapter{Reporting Bugs}
5498\input{reportingbugs}
5499
Fred Drake490d34d2001-06-20 21:39:12 +00005500\chapter{History and License}
5501\input{license}
5502
Marc-André Lemburga544ea22001-01-17 18:04:31 +00005503\input{api.ind} % Index -- must be last
Guido van Rossum9231c8f1997-05-15 21:43:21 +00005504
5505\end{document}