blob: 2620ad3d117dc192c78b1be8ee8abdacba500dc2 [file] [log] [blame]
Fred Drake6659c301998-03-03 22:02:19 +00001\documentclass{manual}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003\title{Python/C API Reference Manual}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00004
5\input{boilerplate}
6
Marc-André Lemburga544ea22001-01-17 18:04:31 +00007\makeindex % tell \index to actually write the .idx file
Guido van Rossum9231c8f1997-05-15 21:43:21 +00008
9
10\begin{document}
11
Guido van Rossum9231c8f1997-05-15 21:43:21 +000012\maketitle
13
Fred Drake9f86b661998-07-28 21:55:19 +000014\ifhtml
15\chapter*{Front Matter\label{front}}
16\fi
17
Guido van Rossum9231c8f1997-05-15 21:43:21 +000018\input{copyright}
19
20\begin{abstract}
21
22\noindent
Fred Drake659ebfa2000-04-03 15:42:13 +000023This manual documents the API used by C and \Cpp{} programmers who
Fred Drakee058b4f1998-02-16 06:15:35 +000024want to write extension modules or embed Python. It is a companion to
Fred Drakebe486461999-11-09 17:03:03 +000025\citetitle[../ext/ext.html]{Extending and Embedding the Python
26Interpreter}, which describes the general principles of extension
27writing but does not document the API functions in detail.
Guido van Rossum9231c8f1997-05-15 21:43:21 +000028
Guido van Rossum5b8a5231997-12-30 04:38:44 +000029\strong{Warning:} The current version of this document is incomplete.
30I hope that it is nevertheless useful. I will continue to work on it,
31and release new versions from time to time, independent from Python
32source code releases.
33
Guido van Rossum9231c8f1997-05-15 21:43:21 +000034\end{abstract}
35
Fred Drake4d4f9e71998-01-13 22:25:02 +000036\tableofcontents
Guido van Rossum9231c8f1997-05-15 21:43:21 +000037
Guido van Rossum5060b3b1997-08-17 18:02:23 +000038% XXX Consider moving all this back to ext.tex and giving api.tex
39% XXX a *really* short intro only.
Guido van Rossum9231c8f1997-05-15 21:43:21 +000040
Fred Drakeefd146c1999-02-15 15:30:45 +000041\chapter{Introduction \label{intro}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +000042
Fred Drake659ebfa2000-04-03 15:42:13 +000043The Application Programmer's Interface to Python gives C and
44\Cpp{} programmers access to the Python interpreter at a variety of
45levels. The API is equally usable from \Cpp{}, but for brevity it is
46generally referred to as the Python/C API. There are two
47fundamentally different reasons for using the Python/C API. The first
48reason is to write \emph{extension modules} for specific purposes;
49these are C modules that extend the Python interpreter. This is
50probably the most common use. The second reason is to use Python as a
51component in a larger application; this technique is generally
52referred to as \dfn{embedding} Python in an application.
Guido van Rossum59a61351997-08-14 20:34:33 +000053
Guido van Rossum4a944d71997-08-14 20:35:38 +000054Writing an extension module is a relatively well-understood process,
55where a ``cookbook'' approach works well. There are several tools
56that automate the process to some extent. While people have embedded
57Python in other applications since its early existence, the process of
Fred Drakefc43d002001-05-21 15:03:35 +000058embedding Python is less straightforward than writing an extension.
Guido van Rossum59a61351997-08-14 20:34:33 +000059
Guido van Rossum4a944d71997-08-14 20:35:38 +000060Many API functions are useful independent of whether you're embedding
61or extending Python; moreover, most applications that embed Python
62will need to provide a custom extension as well, so it's probably a
63good idea to become familiar with writing an extension before
Guido van Rossum59a61351997-08-14 20:34:33 +000064attempting to embed Python in a real application.
65
Fred Drakeefd146c1999-02-15 15:30:45 +000066
67\section{Include Files \label{includes}}
Guido van Rossum580aa8d1997-11-25 15:34:51 +000068
69All function, type and macro definitions needed to use the Python/C
70API are included in your code by the following line:
71
Fred Drakee058b4f1998-02-16 06:15:35 +000072\begin{verbatim}
73#include "Python.h"
74\end{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +000075
Fred Drakee058b4f1998-02-16 06:15:35 +000076This implies inclusion of the following standard headers:
Fred Drake0b71cea2000-09-26 05:51:50 +000077\code{<stdio.h>}, \code{<string.h>}, \code{<errno.h>},
78\code{<limits.h>}, and \code{<stdlib.h>} (if available).
Guido van Rossum580aa8d1997-11-25 15:34:51 +000079
80All user visible names defined by Python.h (except those defined by
Fred Drakee058b4f1998-02-16 06:15:35 +000081the included standard headers) have one of the prefixes \samp{Py} or
Fred Drake659ebfa2000-04-03 15:42:13 +000082\samp{_Py}. Names beginning with \samp{_Py} are for internal use by
83the Python implementation and should not be used by extension writers.
84Structure member names do not have a reserved prefix.
Guido van Rossum580aa8d1997-11-25 15:34:51 +000085
Fred Drakee058b4f1998-02-16 06:15:35 +000086\strong{Important:} user code should never define names that begin
87with \samp{Py} or \samp{_Py}. This confuses the reader, and
88jeopardizes the portability of the user code to future Python
89versions, which may define additional names beginning with one of
90these prefixes.
Guido van Rossum580aa8d1997-11-25 15:34:51 +000091
Fred Drake659ebfa2000-04-03 15:42:13 +000092The header files are typically installed with Python. On \UNIX, these
93are located in the directories
94\file{\envvar{prefix}/include/python\var{version}/} and
95\file{\envvar{exec_prefix}/include/python\var{version}/}, where
96\envvar{prefix} and \envvar{exec_prefix} are defined by the
97corresponding parameters to Python's \program{configure} script and
98\var{version} is \code{sys.version[:3]}. On Windows, the headers are
99installed in \file{\envvar{prefix}/include}, where \envvar{prefix} is
100the installation directory specified to the installer.
101
102To include the headers, place both directories (if different) on your
103compiler's search path for includes. Do \emph{not} place the parent
104directories on the search path and then use
Fred Draked5d04352000-09-14 20:24:17 +0000105\samp{\#include <python\shortversion/Python.h>}; this will break on
Fred Drake659ebfa2000-04-03 15:42:13 +0000106multi-platform builds since the platform independent headers under
107\envvar{prefix} include the platform specific headers from
108\envvar{exec_prefix}.
109
Fred Drakebab29652001-07-10 16:10:08 +0000110\Cpp{} users should note that though the API is defined entirely using
111C, the header files do properly declare the entry points to be
112\code{extern "C"}, so there is no need to do anything special to use
113the API from \Cpp.
114
Fred Drakeefd146c1999-02-15 15:30:45 +0000115
116\section{Objects, Types and Reference Counts \label{objects}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000117
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000118Most Python/C API functions have one or more arguments as well as a
Fred Drake659ebfa2000-04-03 15:42:13 +0000119return value of type \ctype{PyObject*}. This type is a pointer
Fred Drakee058b4f1998-02-16 06:15:35 +0000120to an opaque data type representing an arbitrary Python
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000121object. Since all Python object types are treated the same way by the
122Python language in most situations (e.g., assignments, scope rules,
123and argument passing), it is only fitting that they should be
Fred Drake659ebfa2000-04-03 15:42:13 +0000124represented by a single C type. Almost all Python objects live on the
125heap: you never declare an automatic or static variable of type
126\ctype{PyObject}, only pointer variables of type \ctype{PyObject*} can
127be declared. The sole exception are the type objects\obindex{type};
128since these must never be deallocated, they are typically static
129\ctype{PyTypeObject} objects.
Guido van Rossum59a61351997-08-14 20:34:33 +0000130
Fred Drakee058b4f1998-02-16 06:15:35 +0000131All Python objects (even Python integers) have a \dfn{type} and a
132\dfn{reference count}. An object's type determines what kind of object
Guido van Rossum4a944d71997-08-14 20:35:38 +0000133it is (e.g., an integer, a list, or a user-defined function; there are
Fred Drakebe486461999-11-09 17:03:03 +0000134many more as explained in the \citetitle[../ref/ref.html]{Python
135Reference Manual}). For each of the well-known types there is a macro
136to check whether an object is of that type; for instance,
Fred Drake659ebfa2000-04-03 15:42:13 +0000137\samp{PyList_Check(\var{a})} is true if (and only if) the object
138pointed to by \var{a} is a Python list.
Guido van Rossum59a61351997-08-14 20:34:33 +0000139
Fred Drakeefd146c1999-02-15 15:30:45 +0000140
141\subsection{Reference Counts \label{refcounts}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000142
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000143The reference count is important because today's computers have a
Fred Drake003d8da1998-04-13 00:53:42 +0000144finite (and often severely limited) memory size; it counts how many
Guido van Rossum4a944d71997-08-14 20:35:38 +0000145different places there are that have a reference to an object. Such a
Fred Drake659ebfa2000-04-03 15:42:13 +0000146place could be another object, or a global (or static) C variable, or
147a local variable in some C function. When an object's reference count
Guido van Rossum4a944d71997-08-14 20:35:38 +0000148becomes zero, the object is deallocated. If it contains references to
149other objects, their reference count is decremented. Those other
150objects may be deallocated in turn, if this decrement makes their
151reference count become zero, and so on. (There's an obvious problem
152with objects that reference each other here; for now, the solution is
Fred Drake659ebfa2000-04-03 15:42:13 +0000153``don't do that.'')
Guido van Rossum59a61351997-08-14 20:34:33 +0000154
Guido van Rossum4a944d71997-08-14 20:35:38 +0000155Reference counts are always manipulated explicitly. The normal way is
Fred Drake659ebfa2000-04-03 15:42:13 +0000156to use the macro \cfunction{Py_INCREF()}\ttindex{Py_INCREF()} to
157increment an object's reference count by one, and
158\cfunction{Py_DECREF()}\ttindex{Py_DECREF()} to decrement it by
159one. The \cfunction{Py_DECREF()} macro is considerably more complex
160than the incref one, since it must check whether the reference count
161becomes zero and then cause the object's deallocator to be called.
162The deallocator is a function pointer contained in the object's type
163structure. The type-specific deallocator takes care of decrementing
164the reference counts for other objects contained in the object if this
165is a compound object type, such as a list, as well as performing any
166additional finalization that's needed. There's no chance that the
167reference count can overflow; at least as many bits are used to hold
168the reference count as there are distinct memory locations in virtual
169memory (assuming \code{sizeof(long) >= sizeof(char*)}). Thus, the
170reference count increment is a simple operation.
Guido van Rossum59a61351997-08-14 20:34:33 +0000171
Guido van Rossum4a944d71997-08-14 20:35:38 +0000172It is not necessary to increment an object's reference count for every
173local variable that contains a pointer to an object. In theory, the
Fred Drakee058b4f1998-02-16 06:15:35 +0000174object's reference count goes up by one when the variable is made to
Guido van Rossum4a944d71997-08-14 20:35:38 +0000175point to it and it goes down by one when the variable goes out of
176scope. However, these two cancel each other out, so at the end the
177reference count hasn't changed. The only real reason to use the
178reference count is to prevent the object from being deallocated as
179long as our variable is pointing to it. If we know that there is at
180least one other reference to the object that lives at least as long as
181our variable, there is no need to increment the reference count
182temporarily. An important situation where this arises is in objects
Fred Drake659ebfa2000-04-03 15:42:13 +0000183that are passed as arguments to C functions in an extension module
Guido van Rossum4a944d71997-08-14 20:35:38 +0000184that are called from Python; the call mechanism guarantees to hold a
Guido van Rossum59a61351997-08-14 20:34:33 +0000185reference to every argument for the duration of the call.
186
Fred Drakee058b4f1998-02-16 06:15:35 +0000187However, a common pitfall is to extract an object from a list and
188hold on to it for a while without incrementing its reference count.
189Some other operation might conceivably remove the object from the
190list, decrementing its reference count and possible deallocating it.
191The real danger is that innocent-looking operations may invoke
192arbitrary Python code which could do this; there is a code path which
193allows control to flow back to the user from a \cfunction{Py_DECREF()},
194so almost any operation is potentially dangerous.
Guido van Rossum59a61351997-08-14 20:34:33 +0000195
Guido van Rossum4a944d71997-08-14 20:35:38 +0000196A safe approach is to always use the generic operations (functions
Fred Drake659ebfa2000-04-03 15:42:13 +0000197whose name begins with \samp{PyObject_}, \samp{PyNumber_},
198\samp{PySequence_} or \samp{PyMapping_}). These operations always
199increment the reference count of the object they return. This leaves
200the caller with the responsibility to call
201\cfunction{Py_DECREF()} when they are done with the result; this soon
202becomes second nature.
Guido van Rossum59a61351997-08-14 20:34:33 +0000203
Fred Drakeefd146c1999-02-15 15:30:45 +0000204
205\subsubsection{Reference Count Details \label{refcountDetails}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000206
207The reference count behavior of functions in the Python/C API is best
Fred Drake659ebfa2000-04-03 15:42:13 +0000208explained in terms of \emph{ownership of references}. Note that we
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000209talk of owning references, never of owning objects; objects are always
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000210shared! When a function owns a reference, it has to dispose of it
Fred Drakee058b4f1998-02-16 06:15:35 +0000211properly --- either by passing ownership on (usually to its caller) or
212by calling \cfunction{Py_DECREF()} or \cfunction{Py_XDECREF()}. When
213a function passes ownership of a reference on to its caller, the
214caller is said to receive a \emph{new} reference. When no ownership
215is transferred, the caller is said to \emph{borrow} the reference.
216Nothing needs to be done for a borrowed reference.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000217
Fred Drakea8455ab2000-06-16 19:58:42 +0000218Conversely, when a calling function passes it a reference to an
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000219object, there are two possibilities: the function \emph{steals} a
220reference to the object, or it does not. Few functions steal
Fred Drakee058b4f1998-02-16 06:15:35 +0000221references; the two notable exceptions are
Fred Drake659ebfa2000-04-03 15:42:13 +0000222\cfunction{PyList_SetItem()}\ttindex{PyList_SetItem()} and
223\cfunction{PyTuple_SetItem()}\ttindex{PyTuple_SetItem()}, which
Fred Drakee058b4f1998-02-16 06:15:35 +0000224steal a reference to the item (but not to the tuple or list into which
Fred Drake003d8da1998-04-13 00:53:42 +0000225the item is put!). These functions were designed to steal a reference
Fred Drakee058b4f1998-02-16 06:15:35 +0000226because of a common idiom for populating a tuple or list with newly
227created objects; for example, the code to create the tuple \code{(1,
2282, "three")} could look like this (forgetting about error handling for
Fred Drake659ebfa2000-04-03 15:42:13 +0000229the moment; a better way to code this is shown below):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000230
231\begin{verbatim}
232PyObject *t;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000233
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000234t = PyTuple_New(3);
235PyTuple_SetItem(t, 0, PyInt_FromLong(1L));
236PyTuple_SetItem(t, 1, PyInt_FromLong(2L));
237PyTuple_SetItem(t, 2, PyString_FromString("three"));
238\end{verbatim}
239
Fred Drakee058b4f1998-02-16 06:15:35 +0000240Incidentally, \cfunction{PyTuple_SetItem()} is the \emph{only} way to
241set tuple items; \cfunction{PySequence_SetItem()} and
242\cfunction{PyObject_SetItem()} refuse to do this since tuples are an
243immutable data type. You should only use
244\cfunction{PyTuple_SetItem()} for tuples that you are creating
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000245yourself.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000246
247Equivalent code for populating a list can be written using
Fred Drakee058b4f1998-02-16 06:15:35 +0000248\cfunction{PyList_New()} and \cfunction{PyList_SetItem()}. Such code
249can also use \cfunction{PySequence_SetItem()}; this illustrates the
250difference between the two (the extra \cfunction{Py_DECREF()} calls):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000251
252\begin{verbatim}
253PyObject *l, *x;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000254
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000255l = PyList_New(3);
256x = PyInt_FromLong(1L);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000257PySequence_SetItem(l, 0, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000258x = PyInt_FromLong(2L);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000259PySequence_SetItem(l, 1, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000260x = PyString_FromString("three");
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000261PySequence_SetItem(l, 2, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000262\end{verbatim}
263
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000264You might find it strange that the ``recommended'' approach takes more
265code. However, in practice, you will rarely use these ways of
266creating and populating a tuple or list. There's a generic function,
Fred Drakee058b4f1998-02-16 06:15:35 +0000267\cfunction{Py_BuildValue()}, that can create most common objects from
Fred Drake659ebfa2000-04-03 15:42:13 +0000268C values, directed by a \dfn{format string}. For example, the
Fred Drakee058b4f1998-02-16 06:15:35 +0000269above two blocks of code could be replaced by the following (which
270also takes care of the error checking):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000271
272\begin{verbatim}
273PyObject *t, *l;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000274
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000275t = Py_BuildValue("(iis)", 1, 2, "three");
276l = Py_BuildValue("[iis]", 1, 2, "three");
277\end{verbatim}
278
Fred Drakee058b4f1998-02-16 06:15:35 +0000279It is much more common to use \cfunction{PyObject_SetItem()} and
280friends with items whose references you are only borrowing, like
281arguments that were passed in to the function you are writing. In
282that case, their behaviour regarding reference counts is much saner,
283since you don't have to increment a reference count so you can give a
284reference away (``have it be stolen''). For example, this function
285sets all items of a list (actually, any mutable sequence) to a given
286item:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000287
288\begin{verbatim}
289int set_all(PyObject *target, PyObject *item)
290{
291 int i, n;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000292
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000293 n = PyObject_Length(target);
294 if (n < 0)
295 return -1;
296 for (i = 0; i < n; i++) {
297 if (PyObject_SetItem(target, i, item) < 0)
298 return -1;
299 }
300 return 0;
301}
302\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000303\ttindex{set_all()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000304
305The situation is slightly different for function return values.
306While passing a reference to most functions does not change your
307ownership responsibilities for that reference, many functions that
308return a referece to an object give you ownership of the reference.
309The reason is simple: in many cases, the returned object is created
310on the fly, and the reference you get is the only reference to the
Fred Drakee058b4f1998-02-16 06:15:35 +0000311object. Therefore, the generic functions that return object
312references, like \cfunction{PyObject_GetItem()} and
Fred Drakebab29652001-07-10 16:10:08 +0000313\cfunction{PySequence_GetItem()}, always return a new reference (the
314caller becomes the owner of the reference).
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000315
316It is important to realize that whether you own a reference returned
Fred Drakee058b4f1998-02-16 06:15:35 +0000317by a function depends on which function you call only --- \emph{the
Fred Drakebab29652001-07-10 16:10:08 +0000318plumage} (the type of the type of the object passed as an
Fred Drakee058b4f1998-02-16 06:15:35 +0000319argument to the function) \emph{doesn't enter into it!} Thus, if you
320extract an item from a list using \cfunction{PyList_GetItem()}, you
321don't own the reference --- but if you obtain the same item from the
322same list using \cfunction{PySequence_GetItem()} (which happens to
323take exactly the same arguments), you do own a reference to the
324returned object.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000325
Fred Drakee058b4f1998-02-16 06:15:35 +0000326Here is an example of how you could write a function that computes the
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000327sum of the items in a list of integers; once using
Fred Drake659ebfa2000-04-03 15:42:13 +0000328\cfunction{PyList_GetItem()}\ttindex{PyList_GetItem()}, and once using
329\cfunction{PySequence_GetItem()}\ttindex{PySequence_GetItem()}.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000330
331\begin{verbatim}
332long sum_list(PyObject *list)
333{
334 int i, n;
335 long total = 0;
336 PyObject *item;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000337
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000338 n = PyList_Size(list);
339 if (n < 0)
340 return -1; /* Not a list */
341 for (i = 0; i < n; i++) {
342 item = PyList_GetItem(list, i); /* Can't fail */
343 if (!PyInt_Check(item)) continue; /* Skip non-integers */
344 total += PyInt_AsLong(item);
345 }
346 return total;
347}
348\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000349\ttindex{sum_list()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000350
351\begin{verbatim}
352long sum_sequence(PyObject *sequence)
353{
354 int i, n;
355 long total = 0;
356 PyObject *item;
Fred Drake659ebfa2000-04-03 15:42:13 +0000357 n = PySequence_Length(sequence);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000358 if (n < 0)
359 return -1; /* Has no length */
360 for (i = 0; i < n; i++) {
Fred Drake659ebfa2000-04-03 15:42:13 +0000361 item = PySequence_GetItem(sequence, i);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000362 if (item == NULL)
363 return -1; /* Not a sequence, or other failure */
364 if (PyInt_Check(item))
365 total += PyInt_AsLong(item);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000366 Py_DECREF(item); /* Discard reference ownership */
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000367 }
368 return total;
369}
370\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000371\ttindex{sum_sequence()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000372
Fred Drakeefd146c1999-02-15 15:30:45 +0000373
374\subsection{Types \label{types}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000375
376There are few other data types that play a significant role in
Fred Drake659ebfa2000-04-03 15:42:13 +0000377the Python/C API; most are simple C types such as \ctype{int},
378\ctype{long}, \ctype{double} and \ctype{char*}. A few structure types
Guido van Rossum4a944d71997-08-14 20:35:38 +0000379are used to describe static tables used to list the functions exported
Fred Drake659ebfa2000-04-03 15:42:13 +0000380by a module or the data attributes of a new object type, and another
381is used to describe the value of a complex number. These will
Guido van Rossum59a61351997-08-14 20:34:33 +0000382be discussed together with the functions that use them.
383
Fred Drakeefd146c1999-02-15 15:30:45 +0000384
385\section{Exceptions \label{exceptions}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000386
Guido van Rossum4a944d71997-08-14 20:35:38 +0000387The Python programmer only needs to deal with exceptions if specific
388error handling is required; unhandled exceptions are automatically
Fred Drake659ebfa2000-04-03 15:42:13 +0000389propagated to the caller, then to the caller's caller, and so on, until
Guido van Rossum4a944d71997-08-14 20:35:38 +0000390they reach the top-level interpreter, where they are reported to the
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000391user accompanied by a stack traceback.
Guido van Rossum59a61351997-08-14 20:34:33 +0000392
Fred Drake659ebfa2000-04-03 15:42:13 +0000393For C programmers, however, error checking always has to be explicit.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000394All functions in the Python/C API can raise exceptions, unless an
395explicit claim is made otherwise in a function's documentation. In
396general, when a function encounters an error, it sets an exception,
397discards any object references that it owns, and returns an
Fred Drakee058b4f1998-02-16 06:15:35 +0000398error indicator --- usually \NULL{} or \code{-1}. A few functions
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000399return a Boolean true/false result, with false indicating an error.
400Very few functions return no explicit error indicator or have an
401ambiguous return value, and require explicit testing for errors with
Fred Drake659ebfa2000-04-03 15:42:13 +0000402\cfunction{PyErr_Occurred()}\ttindex{PyErr_Occurred()}.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000403
404Exception state is maintained in per-thread storage (this is
405equivalent to using global storage in an unthreaded application). A
Fred Drakec6fa34e1998-04-02 06:47:24 +0000406thread can be in one of two states: an exception has occurred, or not.
Fred Drakee058b4f1998-02-16 06:15:35 +0000407The function \cfunction{PyErr_Occurred()} can be used to check for
408this: it returns a borrowed reference to the exception type object
409when an exception has occurred, and \NULL{} otherwise. There are a
410number of functions to set the exception state:
Fred Drake659ebfa2000-04-03 15:42:13 +0000411\cfunction{PyErr_SetString()}\ttindex{PyErr_SetString()} is the most
412common (though not the most general) function to set the exception
413state, and \cfunction{PyErr_Clear()}\ttindex{PyErr_Clear()} clears the
414exception state.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000415
416The full exception state consists of three objects (all of which can
Fred Drakee058b4f1998-02-16 06:15:35 +0000417be \NULL{}): the exception type, the corresponding exception
Fred Drake659ebfa2000-04-03 15:42:13 +0000418value, and the traceback. These have the same meanings as the Python
419\withsubitem{(in module sys)}{
420 \ttindex{exc_type}\ttindex{exc_value}\ttindex{exc_traceback}}
421objects \code{sys.exc_type}, \code{sys.exc_value}, and
422\code{sys.exc_traceback}; however, they are not the same: the Python
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000423objects represent the last exception being handled by a Python
Fred Drake659ebfa2000-04-03 15:42:13 +0000424\keyword{try} \ldots\ \keyword{except} statement, while the C level
Fred Drakee058b4f1998-02-16 06:15:35 +0000425exception state only exists while an exception is being passed on
Fred Drake659ebfa2000-04-03 15:42:13 +0000426between C functions until it reaches the Python bytecode interpreter's
427main loop, which takes care of transferring it to \code{sys.exc_type}
428and friends.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000429
Fred Drakec6fa34e1998-04-02 06:47:24 +0000430Note that starting with Python 1.5, the preferred, thread-safe way to
Fred Drake659ebfa2000-04-03 15:42:13 +0000431access the exception state from Python code is to call the function
432\withsubitem{(in module sys)}{\ttindex{exc_info()}}
Fred Drakee058b4f1998-02-16 06:15:35 +0000433\function{sys.exc_info()}, which returns the per-thread exception state
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000434for Python code. Also, the semantics of both ways to access the
435exception state have changed so that a function which catches an
436exception will save and restore its thread's exception state so as to
437preserve the exception state of its caller. This prevents common bugs
438in exception handling code caused by an innocent-looking function
439overwriting the exception being handled; it also reduces the often
440unwanted lifetime extension for objects that are referenced by the
Fred Drakec6fa34e1998-04-02 06:47:24 +0000441stack frames in the traceback.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000442
443As a general principle, a function that calls another function to
444perform some task should check whether the called function raised an
445exception, and if so, pass the exception state on to its caller. It
Fred Drake659ebfa2000-04-03 15:42:13 +0000446should discard any object references that it owns, and return an
Fred Drakee058b4f1998-02-16 06:15:35 +0000447error indicator, but it should \emph{not} set another exception ---
448that would overwrite the exception that was just raised, and lose
449important information about the exact cause of the error.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000450
Fred Drake659ebfa2000-04-03 15:42:13 +0000451A simple example of detecting exceptions and passing them on is shown
452in the \cfunction{sum_sequence()}\ttindex{sum_sequence()} example
453above. It so happens that that example doesn't need to clean up any
454owned references when it detects an error. The following example
455function shows some error cleanup. First, to remind you why you like
456Python, we show the equivalent Python code:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000457
458\begin{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000459def incr_item(dict, key):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000460 try:
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000461 item = dict[key]
462 except KeyError:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000463 item = 0
Fred Drake6b3f3f22000-11-29 15:48:22 +0000464 dict[key] = item + 1
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000465\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000466\ttindex{incr_item()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000467
Fred Drake659ebfa2000-04-03 15:42:13 +0000468Here is the corresponding C code, in all its glory:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000469
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000470\begin{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000471int incr_item(PyObject *dict, PyObject *key)
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000472{
473 /* Objects all initialized to NULL for Py_XDECREF */
474 PyObject *item = NULL, *const_one = NULL, *incremented_item = NULL;
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000475 int rv = -1; /* Return value initialized to -1 (failure) */
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000476
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000477 item = PyObject_GetItem(dict, key);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000478 if (item == NULL) {
Fred Drakec6fa34e1998-04-02 06:47:24 +0000479 /* Handle KeyError only: */
Fred Drake6b3f3f22000-11-29 15:48:22 +0000480 if (!PyErr_ExceptionMatches(PyExc_KeyError))
481 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000482
483 /* Clear the error and use zero: */
484 PyErr_Clear();
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000485 item = PyInt_FromLong(0L);
Fred Drake6b3f3f22000-11-29 15:48:22 +0000486 if (item == NULL)
487 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000488 }
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000489 const_one = PyInt_FromLong(1L);
Fred Drake6b3f3f22000-11-29 15:48:22 +0000490 if (const_one == NULL)
491 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000492
493 incremented_item = PyNumber_Add(item, const_one);
Fred Drake6b3f3f22000-11-29 15:48:22 +0000494 if (incremented_item == NULL)
495 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000496
Fred Drake6b3f3f22000-11-29 15:48:22 +0000497 if (PyObject_SetItem(dict, key, incremented_item) < 0)
498 goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000499 rv = 0; /* Success */
500 /* Continue with cleanup code */
501
502 error:
503 /* Cleanup code, shared by success and failure path */
504
505 /* Use Py_XDECREF() to ignore NULL references */
506 Py_XDECREF(item);
507 Py_XDECREF(const_one);
508 Py_XDECREF(incremented_item);
509
510 return rv; /* -1 for error, 0 for success */
511}
512\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000513\ttindex{incr_item()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000514
Fred Drakef8830d11998-04-23 14:06:01 +0000515This example represents an endorsed use of the \keyword{goto} statement
Fred Drake659ebfa2000-04-03 15:42:13 +0000516in C! It illustrates the use of
517\cfunction{PyErr_ExceptionMatches()}\ttindex{PyErr_ExceptionMatches()} and
518\cfunction{PyErr_Clear()}\ttindex{PyErr_Clear()} to
519handle specific exceptions, and the use of
520\cfunction{Py_XDECREF()}\ttindex{Py_XDECREF()} to
521dispose of owned references that may be \NULL{} (note the
522\character{X} in the name; \cfunction{Py_DECREF()} would crash when
523confronted with a \NULL{} reference). It is important that the
524variables used to hold owned references are initialized to \NULL{} for
525this to work; likewise, the proposed return value is initialized to
526\code{-1} (failure) and only set to success after the final call made
527is successful.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000528
Guido van Rossum59a61351997-08-14 20:34:33 +0000529
Fred Drakeefd146c1999-02-15 15:30:45 +0000530\section{Embedding Python \label{embedding}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000531
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000532The one important task that only embedders (as opposed to extension
533writers) of the Python interpreter have to worry about is the
534initialization, and possibly the finalization, of the Python
535interpreter. Most functionality of the interpreter can only be used
536after the interpreter has been initialized.
Guido van Rossum59a61351997-08-14 20:34:33 +0000537
Fred Drake659ebfa2000-04-03 15:42:13 +0000538The basic initialization function is
539\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}.
Fred Drakee058b4f1998-02-16 06:15:35 +0000540This initializes the table of loaded modules, and creates the
Fred Drake4de05a91998-02-16 14:25:26 +0000541fundamental modules \module{__builtin__}\refbimodindex{__builtin__},
542\module{__main__}\refbimodindex{__main__} and
543\module{sys}\refbimodindex{sys}. It also initializes the module
Fred Drakec6fa34e1998-04-02 06:47:24 +0000544search path (\code{sys.path}).%
545\indexiii{module}{search}{path}
Fred Drake659ebfa2000-04-03 15:42:13 +0000546\withsubitem{(in module sys)}{\ttindex{path}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000547
Fred Drakee058b4f1998-02-16 06:15:35 +0000548\cfunction{Py_Initialize()} does not set the ``script argument list''
Guido van Rossum4a944d71997-08-14 20:35:38 +0000549(\code{sys.argv}). If this variable is needed by Python code that
550will be executed later, it must be set explicitly with a call to
Fred Drake659ebfa2000-04-03 15:42:13 +0000551\code{PySys_SetArgv(\var{argc},
552\var{argv})}\ttindex{PySys_SetArgv()} subsequent to the call to
553\cfunction{Py_Initialize()}.
Guido van Rossum59a61351997-08-14 20:34:33 +0000554
Fred Drakeb0a78731998-01-13 18:51:10 +0000555On most systems (in particular, on \UNIX{} and Windows, although the
Fred Drake659ebfa2000-04-03 15:42:13 +0000556details are slightly different),
557\cfunction{Py_Initialize()} calculates the module search path based
558upon its best guess for the location of the standard Python
559interpreter executable, assuming that the Python library is found in a
560fixed location relative to the Python interpreter executable. In
561particular, it looks for a directory named
Fred Draked5d04352000-09-14 20:24:17 +0000562\file{lib/python\shortversion} relative to the parent directory where
563the executable named \file{python} is found on the shell command
564search path (the environment variable \envvar{PATH}).
Guido van Rossum42cefd01997-10-05 15:27:29 +0000565
566For instance, if the Python executable is found in
Fred Drakee058b4f1998-02-16 06:15:35 +0000567\file{/usr/local/bin/python}, it will assume that the libraries are in
Fred Draked5d04352000-09-14 20:24:17 +0000568\file{/usr/local/lib/python\shortversion}. (In fact, this particular path
Fred Drakee058b4f1998-02-16 06:15:35 +0000569is also the ``fallback'' location, used when no executable file named
Fred Drakec6fa34e1998-04-02 06:47:24 +0000570\file{python} is found along \envvar{PATH}.) The user can override
571this behavior by setting the environment variable \envvar{PYTHONHOME},
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000572or insert additional directories in front of the standard path by
Fred Drakec6fa34e1998-04-02 06:47:24 +0000573setting \envvar{PYTHONPATH}.
Guido van Rossum59a61351997-08-14 20:34:33 +0000574
Guido van Rossum4a944d71997-08-14 20:35:38 +0000575The embedding application can steer the search by calling
Fred Drake659ebfa2000-04-03 15:42:13 +0000576\code{Py_SetProgramName(\var{file})}\ttindex{Py_SetProgramName()} \emph{before} calling
Fred Drakec6fa34e1998-04-02 06:47:24 +0000577\cfunction{Py_Initialize()}. Note that \envvar{PYTHONHOME} still
578overrides this and \envvar{PYTHONPATH} is still inserted in front of
Fred Drakee058b4f1998-02-16 06:15:35 +0000579the standard path. An application that requires total control has to
Fred Drake659ebfa2000-04-03 15:42:13 +0000580provide its own implementation of
581\cfunction{Py_GetPath()}\ttindex{Py_GetPath()},
582\cfunction{Py_GetPrefix()}\ttindex{Py_GetPrefix()},
583\cfunction{Py_GetExecPrefix()}\ttindex{Py_GetExecPrefix()}, and
584\cfunction{Py_GetProgramFullPath()}\ttindex{Py_GetProgramFullPath()} (all
585defined in \file{Modules/getpath.c}).
Guido van Rossum59a61351997-08-14 20:34:33 +0000586
Guido van Rossum4a944d71997-08-14 20:35:38 +0000587Sometimes, it is desirable to ``uninitialize'' Python. For instance,
588the application may want to start over (make another call to
Fred Drakee058b4f1998-02-16 06:15:35 +0000589\cfunction{Py_Initialize()}) or the application is simply done with its
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000590use of Python and wants to free all memory allocated by Python. This
Fred Drakee058b4f1998-02-16 06:15:35 +0000591can be accomplished by calling \cfunction{Py_Finalize()}. The function
Fred Drake659ebfa2000-04-03 15:42:13 +0000592\cfunction{Py_IsInitialized()}\ttindex{Py_IsInitialized()} returns
593true if Python is currently in the initialized state. More
594information about these functions is given in a later chapter.
Guido van Rossum59a61351997-08-14 20:34:33 +0000595
Guido van Rossum4a944d71997-08-14 20:35:38 +0000596
Fred Drakeefd146c1999-02-15 15:30:45 +0000597\chapter{The Very High Level Layer \label{veryhigh}}
Guido van Rossum4a944d71997-08-14 20:35:38 +0000598
Fred Drakee5bf8b21998-02-12 21:22:28 +0000599The functions in this chapter will let you execute Python source code
600given in a file or a buffer, but they will not let you interact in a
601more detailed way with the interpreter.
Guido van Rossum4a944d71997-08-14 20:35:38 +0000602
Fred Drake659ebfa2000-04-03 15:42:13 +0000603Several of these functions accept a start symbol from the grammar as a
604parameter. The available start symbols are \constant{Py_eval_input},
605\constant{Py_file_input}, and \constant{Py_single_input}. These are
606described following the functions which accept them as parameters.
607
Fred Drake510d08b2000-08-14 02:50:21 +0000608Note also that several of these functions take \ctype{FILE*}
609parameters. On particular issue which needs to be handled carefully
610is that the \ctype{FILE} structure for different C libraries can be
611different and incompatible. Under Windows (at least), it is possible
612for dynamically linked extensions to actually use different libraries,
613so care should be taken that \ctype{FILE*} parameters are only passed
614to these functions if it is certain that they were created by the same
615library that the Python runtime is using.
616
Fred Drake24e62192001-05-21 15:56:55 +0000617\begin{cfuncdesc}{int}{Py_Main}{int argc, char **argv}
618 The main program for the standard interpreter. This is made
619 available for programs which embed Python. The \var{argc} and
620 \var{argv} parameters should be prepared exactly as those which are
621 passed to a C program's \cfunction{main()} function. It is
622 important to note that the argument list may be modified (but the
623 contents of the strings pointed to by the argument list are not).
624 The return value will be the integer passed to the
625 \function{sys.exit()} function, \code{1} if the interpreter exits
626 due to an exception, or \code{2} if the parameter list does not
627 represent a valid Python command line.
628\end{cfuncdesc}
629
Fred Drakec6fa34e1998-04-02 06:47:24 +0000630\begin{cfuncdesc}{int}{PyRun_AnyFile}{FILE *fp, char *filename}
Fred Drake0041a941999-04-29 04:20:46 +0000631 If \var{fp} refers to a file associated with an interactive device
632 (console or terminal input or \UNIX{} pseudo-terminal), return the
633 value of \cfunction{PyRun_InteractiveLoop()}, otherwise return the
634 result of \cfunction{PyRun_SimpleFile()}. If \var{filename} is
Fred Drakea8d73412000-08-11 20:39:29 +0000635 \NULL{}, this function uses \code{"???"} as the filename.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000636\end{cfuncdesc}
637
Fred Drakec6fa34e1998-04-02 06:47:24 +0000638\begin{cfuncdesc}{int}{PyRun_SimpleString}{char *command}
Fred Drake0041a941999-04-29 04:20:46 +0000639 Executes the Python source code from \var{command} in the
640 \module{__main__} module. If \module{__main__} does not already
641 exist, it is created. Returns \code{0} on success or \code{-1} if
642 an exception was raised. If there was an error, there is no way to
643 get the exception information.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000644\end{cfuncdesc}
645
Fred Drakec6fa34e1998-04-02 06:47:24 +0000646\begin{cfuncdesc}{int}{PyRun_SimpleFile}{FILE *fp, char *filename}
Fred Drake0041a941999-04-29 04:20:46 +0000647 Similar to \cfunction{PyRun_SimpleString()}, but the Python source
648 code is read from \var{fp} instead of an in-memory string.
649 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000650\end{cfuncdesc}
651
Fred Drakec6fa34e1998-04-02 06:47:24 +0000652\begin{cfuncdesc}{int}{PyRun_InteractiveOne}{FILE *fp, char *filename}
Fred Drakea8d73412000-08-11 20:39:29 +0000653 Read and execute a single statement from a file associated with an
654 interactive device. If \var{filename} is \NULL, \code{"???"} is
655 used instead. The user will be prompted using \code{sys.ps1} and
656 \code{sys.ps2}. Returns \code{0} when the input was executed
657 successfully, \code{-1} if there was an exception, or an error code
658 from the \file{errcode.h} include file distributed as part of Python
659 in case of a parse error. (Note that \file{errcode.h} is not
660 included by \file{Python.h}, so must be included specifically if
661 needed.)
Fred Drakee5bf8b21998-02-12 21:22:28 +0000662\end{cfuncdesc}
663
Fred Drakec6fa34e1998-04-02 06:47:24 +0000664\begin{cfuncdesc}{int}{PyRun_InteractiveLoop}{FILE *fp, char *filename}
Fred Drakea8d73412000-08-11 20:39:29 +0000665 Read and execute statements from a file associated with an
666 interactive device until \EOF{} is reached. If \var{filename} is
667 \NULL, \code{"???"} is used instead. The user will be prompted
668 using \code{sys.ps1} and \code{sys.ps2}. Returns \code{0} at \EOF.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000669\end{cfuncdesc}
670
Fred Drakec6fa34e1998-04-02 06:47:24 +0000671\begin{cfuncdesc}{struct _node*}{PyParser_SimpleParseString}{char *str,
672 int start}
Fred Drake0041a941999-04-29 04:20:46 +0000673 Parse Python source code from \var{str} using the start token
674 \var{start}. The result can be used to create a code object which
675 can be evaluated efficiently. This is useful if a code fragment
676 must be evaluated many times.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000677\end{cfuncdesc}
678
Fred Drakec6fa34e1998-04-02 06:47:24 +0000679\begin{cfuncdesc}{struct _node*}{PyParser_SimpleParseFile}{FILE *fp,
680 char *filename, int start}
Fred Drake0041a941999-04-29 04:20:46 +0000681 Similar to \cfunction{PyParser_SimpleParseString()}, but the Python
682 source code is read from \var{fp} instead of an in-memory string.
683 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000684\end{cfuncdesc}
685
Fred Drakec6fa34e1998-04-02 06:47:24 +0000686\begin{cfuncdesc}{PyObject*}{PyRun_String}{char *str, int start,
687 PyObject *globals,
688 PyObject *locals}
Fred Drake0041a941999-04-29 04:20:46 +0000689 Execute Python source code from \var{str} in the context specified
690 by the dictionaries \var{globals} and \var{locals}. The parameter
691 \var{start} specifies the start token that should be used to parse
692 the source code.
693
694 Returns the result of executing the code as a Python object, or
695 \NULL{} if an exception was raised.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000696\end{cfuncdesc}
697
Fred Drakec6fa34e1998-04-02 06:47:24 +0000698\begin{cfuncdesc}{PyObject*}{PyRun_File}{FILE *fp, char *filename,
699 int start, PyObject *globals,
700 PyObject *locals}
Fred Drake0041a941999-04-29 04:20:46 +0000701 Similar to \cfunction{PyRun_String()}, but the Python source code is
Fred Drake659ebfa2000-04-03 15:42:13 +0000702 read from \var{fp} instead of an in-memory string.
703 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000704\end{cfuncdesc}
705
Fred Drakec6fa34e1998-04-02 06:47:24 +0000706\begin{cfuncdesc}{PyObject*}{Py_CompileString}{char *str, char *filename,
707 int start}
Fred Drake0041a941999-04-29 04:20:46 +0000708 Parse and compile the Python source code in \var{str}, returning the
709 resulting code object. The start token is given by \var{start};
Fred Drakec924b8d1999-08-23 18:57:25 +0000710 this can be used to constrain the code which can be compiled and should
711 be \constant{Py_eval_input}, \constant{Py_file_input}, or
712 \constant{Py_single_input}. The filename specified by
713 \var{filename} is used to construct the code object and may appear
714 in tracebacks or \exception{SyntaxError} exception messages. This
715 returns \NULL{} if the code cannot be parsed or compiled.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000716\end{cfuncdesc}
717
Fred Drakec924b8d1999-08-23 18:57:25 +0000718\begin{cvardesc}{int}{Py_eval_input}
719 The start symbol from the Python grammar for isolated expressions;
Fred Drake659ebfa2000-04-03 15:42:13 +0000720 for use with \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}.
Fred Drakec924b8d1999-08-23 18:57:25 +0000721\end{cvardesc}
722
723\begin{cvardesc}{int}{Py_file_input}
724 The start symbol from the Python grammar for sequences of statements
725 as read from a file or other source; for use with
Fred Drake659ebfa2000-04-03 15:42:13 +0000726 \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}. This is
727 the symbol to use when compiling arbitrarily long Python source code.
Fred Drakec924b8d1999-08-23 18:57:25 +0000728\end{cvardesc}
729
730\begin{cvardesc}{int}{Py_single_input}
731 The start symbol from the Python grammar for a single statement; for
Fred Drake659ebfa2000-04-03 15:42:13 +0000732 use with \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}.
733 This is the symbol used for the interactive interpreter loop.
Fred Drakec924b8d1999-08-23 18:57:25 +0000734\end{cvardesc}
735
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000736
Fred Drakeefd146c1999-02-15 15:30:45 +0000737\chapter{Reference Counting \label{countingRefs}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000738
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000739The macros in this section are used for managing reference counts
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000740of Python objects.
741
742\begin{cfuncdesc}{void}{Py_INCREF}{PyObject *o}
Fred Drakec6fa34e1998-04-02 06:47:24 +0000743Increment the reference count for object \var{o}. The object must
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000744not be \NULL{}; if you aren't sure that it isn't \NULL{}, use
Fred Drakee058b4f1998-02-16 06:15:35 +0000745\cfunction{Py_XINCREF()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000746\end{cfuncdesc}
747
748\begin{cfuncdesc}{void}{Py_XINCREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000749Increment the reference count for object \var{o}. The object may be
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000750\NULL{}, in which case the macro has no effect.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000751\end{cfuncdesc}
752
753\begin{cfuncdesc}{void}{Py_DECREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000754Decrement the reference count for object \var{o}. The object must
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000755not be \NULL{}; if you aren't sure that it isn't \NULL{}, use
Fred Drakee058b4f1998-02-16 06:15:35 +0000756\cfunction{Py_XDECREF()}. If the reference count reaches zero, the
757object's type's deallocation function (which must not be \NULL{}) is
758invoked.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000759
760\strong{Warning:} The deallocation function can cause arbitrary Python
Fred Drake659ebfa2000-04-03 15:42:13 +0000761code to be invoked (e.g. when a class instance with a
762\method{__del__()} method is deallocated). While exceptions in such
763code are not propagated, the executed code has free access to all
764Python global variables. This means that any object that is reachable
765from a global variable should be in a consistent state before
766\cfunction{Py_DECREF()} is invoked. For example, code to delete an
767object from a list should copy a reference to the deleted object in a
768temporary variable, update the list data structure, and then call
769\cfunction{Py_DECREF()} for the temporary variable.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000770\end{cfuncdesc}
771
772\begin{cfuncdesc}{void}{Py_XDECREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000773Decrement the reference count for object \var{o}. The object may be
774\NULL{}, in which case the macro has no effect; otherwise the effect
775is the same as for \cfunction{Py_DECREF()}, and the same warning
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000776applies.
777\end{cfuncdesc}
778
Fred Drake659ebfa2000-04-03 15:42:13 +0000779The following functions or macros are only for use within the
780interpreter core: \cfunction{_Py_Dealloc()},
781\cfunction{_Py_ForgetReference()}, \cfunction{_Py_NewReference()}, as
782well as the global variable \cdata{_Py_RefTotal}.
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000783
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000784
Fred Drakeefd146c1999-02-15 15:30:45 +0000785\chapter{Exception Handling \label{exceptionHandling}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000786
Fred Drake659ebfa2000-04-03 15:42:13 +0000787The functions described in this chapter will let you handle and raise Python
Guido van Rossumae110af1997-05-22 20:11:52 +0000788exceptions. It is important to understand some of the basics of
Fred Drake659ebfa2000-04-03 15:42:13 +0000789Python exception handling. It works somewhat like the
790\UNIX{} \cdata{errno} variable: there is a global indicator (per
791thread) of the last error that occurred. Most functions don't clear
792this on success, but will set it to indicate the cause of the error on
793failure. Most functions also return an error indicator, usually
794\NULL{} if they are supposed to return a pointer, or \code{-1} if they
795return an integer (exception: the \cfunction{PyArg_Parse*()} functions
796return \code{1} for success and \code{0} for failure). When a
797function must fail because some function it called failed, it
798generally doesn't set the error indicator; the function it called
799already set it.
Guido van Rossumae110af1997-05-22 20:11:52 +0000800
801The error indicator consists of three Python objects corresponding to
Fred Drake659ebfa2000-04-03 15:42:13 +0000802\withsubitem{(in module sys)}{
803 \ttindex{exc_type}\ttindex{exc_value}\ttindex{exc_traceback}}
Guido van Rossumae110af1997-05-22 20:11:52 +0000804the Python variables \code{sys.exc_type}, \code{sys.exc_value} and
805\code{sys.exc_traceback}. API functions exist to interact with the
806error indicator in various ways. There is a separate error indicator
807for each thread.
808
809% XXX Order of these should be more thoughtful.
810% Either alphabetical or some kind of structure.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000811
812\begin{cfuncdesc}{void}{PyErr_Print}{}
Guido van Rossumae110af1997-05-22 20:11:52 +0000813Print a standard traceback to \code{sys.stderr} and clear the error
814indicator. Call this function only when the error indicator is set.
815(Otherwise it will cause a fatal error!)
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000816\end{cfuncdesc}
817
Fred Drakec6fa34e1998-04-02 06:47:24 +0000818\begin{cfuncdesc}{PyObject*}{PyErr_Occurred}{}
Guido van Rossumae110af1997-05-22 20:11:52 +0000819Test whether the error indicator is set. If set, return the exception
Fred Drakee058b4f1998-02-16 06:15:35 +0000820\emph{type} (the first argument to the last call to one of the
Fred Drakef8830d11998-04-23 14:06:01 +0000821\cfunction{PyErr_Set*()} functions or to \cfunction{PyErr_Restore()}). If
Fred Drakee058b4f1998-02-16 06:15:35 +0000822not set, return \NULL{}. You do not own a reference to the return
823value, so you do not need to \cfunction{Py_DECREF()} it.
Fred Drake659ebfa2000-04-03 15:42:13 +0000824\strong{Note:} Do not compare the return value to a specific
Fred Drakee058b4f1998-02-16 06:15:35 +0000825exception; use \cfunction{PyErr_ExceptionMatches()} instead, shown
Fred Drake659ebfa2000-04-03 15:42:13 +0000826below. (The comparison could easily fail since the exception may be
827an instance instead of a class, in the case of a class exception, or
828it may the a subclass of the expected exception.)
Guido van Rossum42cefd01997-10-05 15:27:29 +0000829\end{cfuncdesc}
830
831\begin{cfuncdesc}{int}{PyErr_ExceptionMatches}{PyObject *exc}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000832Equivalent to
Fred Drakee058b4f1998-02-16 06:15:35 +0000833\samp{PyErr_GivenExceptionMatches(PyErr_Occurred(), \var{exc})}.
Fred Drake659ebfa2000-04-03 15:42:13 +0000834This should only be called when an exception is actually set; a memory
835access violation will occur if no exception has been raised.
Guido van Rossum42cefd01997-10-05 15:27:29 +0000836\end{cfuncdesc}
837
838\begin{cfuncdesc}{int}{PyErr_GivenExceptionMatches}{PyObject *given, PyObject *exc}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000839Return true if the \var{given} exception matches the exception in
840\var{exc}. If \var{exc} is a class object, this also returns true
Fred Drake659ebfa2000-04-03 15:42:13 +0000841when \var{given} is an instance of a subclass. If \var{exc} is a tuple, all
Guido van Rossum42cefd01997-10-05 15:27:29 +0000842exceptions in the tuple (and recursively in subtuples) are searched
Fred Drake659ebfa2000-04-03 15:42:13 +0000843for a match. If \var{given} is \NULL, a memory access violation will
844occur.
Guido van Rossum42cefd01997-10-05 15:27:29 +0000845\end{cfuncdesc}
846
847\begin{cfuncdesc}{void}{PyErr_NormalizeException}{PyObject**exc, PyObject**val, PyObject**tb}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000848Under certain circumstances, the values returned by
Fred Drakee058b4f1998-02-16 06:15:35 +0000849\cfunction{PyErr_Fetch()} below can be ``unnormalized'', meaning that
850\code{*\var{exc}} is a class object but \code{*\var{val}} is not an
851instance of the same class. This function can be used to instantiate
852the class in that case. If the values are already normalized, nothing
Fred Drake659ebfa2000-04-03 15:42:13 +0000853happens. The delayed normalization is implemented to improve
854performance.
Guido van Rossumae110af1997-05-22 20:11:52 +0000855\end{cfuncdesc}
856
857\begin{cfuncdesc}{void}{PyErr_Clear}{}
858Clear the error indicator. If the error indicator is not set, there
859is no effect.
860\end{cfuncdesc}
861
Fred Drake659ebfa2000-04-03 15:42:13 +0000862\begin{cfuncdesc}{void}{PyErr_Fetch}{PyObject **ptype, PyObject **pvalue,
863 PyObject **ptraceback}
Guido van Rossumae110af1997-05-22 20:11:52 +0000864Retrieve the error indicator into three variables whose addresses are
865passed. If the error indicator is not set, set all three variables to
866\NULL{}. If it is set, it will be cleared and you own a reference to
Fred Drake659ebfa2000-04-03 15:42:13 +0000867each object retrieved. The value and traceback object may be
868\NULL{} even when the type object is not. \strong{Note:} This
869function is normally only used by code that needs to handle exceptions
870or by code that needs to save and restore the error indicator
871temporarily.
Guido van Rossumae110af1997-05-22 20:11:52 +0000872\end{cfuncdesc}
873
Fred Drake17e63432000-08-31 05:50:40 +0000874\begin{cfuncdesc}{void}{PyErr_Restore}{PyObject *type, PyObject *value,
875 PyObject *traceback}
Guido van Rossumae110af1997-05-22 20:11:52 +0000876Set the error indicator from the three objects. If the error
877indicator is already set, it is cleared first. If the objects are
878\NULL{}, the error indicator is cleared. Do not pass a \NULL{} type
879and non-\NULL{} value or traceback. The exception type should be a
880string or class; if it is a class, the value should be an instance of
881that class. Do not pass an invalid exception type or value.
882(Violating these rules will cause subtle problems later.) This call
Fred Drakebab29652001-07-10 16:10:08 +0000883takes away a reference to each object: you must own a reference
Guido van Rossumae110af1997-05-22 20:11:52 +0000884to each object before the call and after the call you no longer own
885these references. (If you don't understand this, don't use this
Fred Drake659ebfa2000-04-03 15:42:13 +0000886function. I warned you.) \strong{Note:} This function is normally
Guido van Rossumae110af1997-05-22 20:11:52 +0000887only used by code that needs to save and restore the error indicator
888temporarily.
889\end{cfuncdesc}
890
891\begin{cfuncdesc}{void}{PyErr_SetString}{PyObject *type, char *message}
892This is the most common way to set the error indicator. The first
893argument specifies the exception type; it is normally one of the
Fred Drakef8830d11998-04-23 14:06:01 +0000894standard exceptions, e.g. \cdata{PyExc_RuntimeError}. You need not
Guido van Rossumae110af1997-05-22 20:11:52 +0000895increment its reference count. The second argument is an error
896message; it is converted to a string object.
897\end{cfuncdesc}
898
899\begin{cfuncdesc}{void}{PyErr_SetObject}{PyObject *type, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +0000900This function is similar to \cfunction{PyErr_SetString()} but lets you
Guido van Rossumae110af1997-05-22 20:11:52 +0000901specify an arbitrary Python object for the ``value'' of the exception.
902You need not increment its reference count.
903\end{cfuncdesc}
904
Fred Drake73577702000-04-10 18:50:14 +0000905\begin{cfuncdesc}{PyObject*}{PyErr_Format}{PyObject *exception,
Moshe Zadka57a59322000-09-01 09:47:20 +0000906 const char *format, \moreargs}
Fred Drake89fb0352000-10-14 05:49:30 +0000907This function sets the error indicator. \var{exception} should be a
908Python exception (string or class, not an instance).
Fred Drake5566c1c2001-01-19 22:48:33 +0000909\var{format} should be a string, containing format codes, similar to
Moshe Zadka57a59322000-09-01 09:47:20 +0000910\cfunction{printf}. The \code{width.precision} before a format code
911is parsed, but the width part is ignored.
912
913\begin{tableii}{c|l}{character}{Character}{Meaning}
914 \lineii{c}{Character, as an \ctype{int} parameter}
915 \lineii{d}{Number in decimal, as an \ctype{int} parameter}
916 \lineii{x}{Number in hexadecimal, as an \ctype{int} parameter}
917 \lineii{x}{A string, as a \ctype{char *} parameter}
918\end{tableii}
919
920An unrecognized format character causes all the rest of
921the format string to be copied as-is to the result string,
922and any extra arguments discarded.
923
924A new reference is returned, which is owned by the caller.
Jeremy Hylton98605b52000-04-10 18:40:57 +0000925\end{cfuncdesc}
926
Guido van Rossumae110af1997-05-22 20:11:52 +0000927\begin{cfuncdesc}{void}{PyErr_SetNone}{PyObject *type}
Fred Drakee058b4f1998-02-16 06:15:35 +0000928This is a shorthand for \samp{PyErr_SetObject(\var{type}, Py_None)}.
Guido van Rossumae110af1997-05-22 20:11:52 +0000929\end{cfuncdesc}
930
931\begin{cfuncdesc}{int}{PyErr_BadArgument}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000932This is a shorthand for \samp{PyErr_SetString(PyExc_TypeError,
Guido van Rossumae110af1997-05-22 20:11:52 +0000933\var{message})}, where \var{message} indicates that a built-in operation
934was invoked with an illegal argument. It is mostly for internal use.
935\end{cfuncdesc}
936
Fred Drakec6fa34e1998-04-02 06:47:24 +0000937\begin{cfuncdesc}{PyObject*}{PyErr_NoMemory}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000938This is a shorthand for \samp{PyErr_SetNone(PyExc_MemoryError)}; it
Guido van Rossumae110af1997-05-22 20:11:52 +0000939returns \NULL{} so an object allocation function can write
Fred Drakee058b4f1998-02-16 06:15:35 +0000940\samp{return PyErr_NoMemory();} when it runs out of memory.
Guido van Rossumae110af1997-05-22 20:11:52 +0000941\end{cfuncdesc}
942
Fred Drakec6fa34e1998-04-02 06:47:24 +0000943\begin{cfuncdesc}{PyObject*}{PyErr_SetFromErrno}{PyObject *type}
Fred Drake659ebfa2000-04-03 15:42:13 +0000944This is a convenience function to raise an exception when a C library
945function has returned an error and set the C variable \cdata{errno}.
Guido van Rossumae110af1997-05-22 20:11:52 +0000946It constructs a tuple object whose first item is the integer
Fred Drakef8830d11998-04-23 14:06:01 +0000947\cdata{errno} value and whose second item is the corresponding error
Fred Drake659ebfa2000-04-03 15:42:13 +0000948message (gotten from \cfunction{strerror()}\ttindex{strerror()}), and
949then calls
Fred Drakee058b4f1998-02-16 06:15:35 +0000950\samp{PyErr_SetObject(\var{type}, \var{object})}. On \UNIX{}, when
Fred Drakef8830d11998-04-23 14:06:01 +0000951the \cdata{errno} value is \constant{EINTR}, indicating an interrupted
Fred Drakee058b4f1998-02-16 06:15:35 +0000952system call, this calls \cfunction{PyErr_CheckSignals()}, and if that set
Guido van Rossumae110af1997-05-22 20:11:52 +0000953the error indicator, leaves it set to that. The function always
954returns \NULL{}, so a wrapper function around a system call can write
Fred Drakee058b4f1998-02-16 06:15:35 +0000955\samp{return PyErr_SetFromErrno();} when the system call returns an
956error.
Guido van Rossumae110af1997-05-22 20:11:52 +0000957\end{cfuncdesc}
958
Fred Drake490d34d2001-06-20 21:39:12 +0000959\begin{cfuncdesc}{PyObject*}{PyErr_SetFromErrnoWithFilename}{PyObject *type,
960 char *filename}
961Similar to \cfunction{PyErr_SetFromErrno()}, with the additional
962behavior that if \var{filename} is not \NULL, it is passed to the
963constructor of \var{type} as a third parameter. In the case of
964exceptions such as \exception{IOError} and \exception{OSError}, this
965is used to define the \member{filename} attribute of the exception
966instance.
967\end{cfuncdesc}
968
Guido van Rossumae110af1997-05-22 20:11:52 +0000969\begin{cfuncdesc}{void}{PyErr_BadInternalCall}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000970This is a shorthand for \samp{PyErr_SetString(PyExc_TypeError,
Guido van Rossumae110af1997-05-22 20:11:52 +0000971\var{message})}, where \var{message} indicates that an internal
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000972operation (e.g. a Python/C API function) was invoked with an illegal
Guido van Rossumae110af1997-05-22 20:11:52 +0000973argument. It is mostly for internal use.
974\end{cfuncdesc}
975
Guido van Rossum3dbb4062000-12-19 03:53:01 +0000976\begin{cfuncdesc}{int}{PyErr_Warn}{PyObject *category, char *message}
977Issue a warning message. The \var{category} argument is a warning
Fred Drake5566c1c2001-01-19 22:48:33 +0000978category (see below) or \NULL; the \var{message} argument is a message
Guido van Rossum3dbb4062000-12-19 03:53:01 +0000979string.
980
981This function normally prints a warning message to \var{sys.stderr};
982however, it is also possible that the user has specified that warnings
983are to be turned into errors, and in that case this will raise an
984exception. It is also possible that the function raises an exception
985because of a problem with the warning machinery (the implementation
986imports the \module{warnings} module to do the heavy lifting). The
987return value is \code{0} if no exception is raised, or \code{-1} if
988an exception is raised. (It is not possible to determine whether a
989warning message is actually printed, nor what the reason is for the
990exception; this is intentional.) If an exception is raised, the
Fred Drake5566c1c2001-01-19 22:48:33 +0000991caller should do its normal exception handling
992(e.g. \cfunction{Py_DECREF()} owned references and return an error
993value).
Guido van Rossum3dbb4062000-12-19 03:53:01 +0000994
995Warning categories must be subclasses of \cdata{Warning}; the default
996warning category is \cdata{RuntimeWarning}. The standard Python
997warning categories are available as global variables whose names are
998\samp{PyExc_} followed by the Python exception name. These have the
999type \ctype{PyObject*}; they are all class objects. Their names are
1000\cdata{PyExc_Warning}, \cdata{PyExc_UserWarning},
1001\cdata{PyExc_DeprecationWarning}, \cdata{PyExc_SyntaxWarning}, and
1002\cdata{PyExc_RuntimeWarning}. \cdata{PyExc_Warning} is a subclass of
1003\cdata{PyExc_Exception}; the other warning categories are subclasses
1004of \cdata{PyExc_Warning}.
1005
1006For information about warning control, see the documentation for the
Fred Drake316ef7c2001-01-04 05:56:34 +00001007\module{warnings} module and the \programopt{-W} option in the command
1008line documentation. There is no C API for warning control.
Guido van Rossum3dbb4062000-12-19 03:53:01 +00001009\end{cfuncdesc}
1010
Guido van Rossum1874c8f2001-02-28 23:46:44 +00001011\begin{cfuncdesc}{int}{PyErr_WarnExplicit}{PyObject *category, char *message,
1012char *filename, int lineno, char *module, PyObject *registry}
1013Issue a warning message with explicit control over all warning
1014attributes. This is a straightforward wrapper around the Python
1015function \function{warnings.warn_explicit()}, see there for more
1016information. The \var{module} and \var{registry} arguments may be
1017set to \code{NULL} to get the default effect described there.
1018\end{cfuncdesc}
1019
Guido van Rossumae110af1997-05-22 20:11:52 +00001020\begin{cfuncdesc}{int}{PyErr_CheckSignals}{}
1021This function interacts with Python's signal handling. It checks
1022whether a signal has been sent to the processes and if so, invokes the
Fred Drake4de05a91998-02-16 14:25:26 +00001023corresponding signal handler. If the
1024\module{signal}\refbimodindex{signal} module is supported, this can
1025invoke a signal handler written in Python. In all cases, the default
Fred Drake659ebfa2000-04-03 15:42:13 +00001026effect for \constant{SIGINT}\ttindex{SIGINT} is to raise the
1027\withsubitem{(built-in exception)}{\ttindex{KeyboardInterrupt}}
1028\exception{KeyboardInterrupt} exception. If an exception is raised the
Fred Drakee058b4f1998-02-16 06:15:35 +00001029error indicator is set and the function returns \code{1}; otherwise
1030the function returns \code{0}. The error indicator may or may not be
1031cleared if it was previously set.
Guido van Rossumae110af1997-05-22 20:11:52 +00001032\end{cfuncdesc}
1033
1034\begin{cfuncdesc}{void}{PyErr_SetInterrupt}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00001035This function is obsolete. It simulates the effect of a
1036\constant{SIGINT}\ttindex{SIGINT} signal arriving --- the next time
Fred Drakee058b4f1998-02-16 06:15:35 +00001037\cfunction{PyErr_CheckSignals()} is called,
Fred Drake659ebfa2000-04-03 15:42:13 +00001038\withsubitem{(built-in exception)}{\ttindex{KeyboardInterrupt}}
1039\exception{KeyboardInterrupt} will be raised.
1040It may be called without holding the interpreter lock.
Guido van Rossumae110af1997-05-22 20:11:52 +00001041\end{cfuncdesc}
1042
Fred Drakec6fa34e1998-04-02 06:47:24 +00001043\begin{cfuncdesc}{PyObject*}{PyErr_NewException}{char *name,
1044 PyObject *base,
1045 PyObject *dict}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001046This utility function creates and returns a new exception object. The
Fred Drake659ebfa2000-04-03 15:42:13 +00001047\var{name} argument must be the name of the new exception, a C string
1048of the form \code{module.class}. The \var{base} and
Fred Draked04038d2000-06-29 20:15:14 +00001049\var{dict} arguments are normally \NULL{}. This creates a
Fred Drake659ebfa2000-04-03 15:42:13 +00001050class object derived from the root for all exceptions, the built-in
1051name \exception{Exception} (accessible in C as
Fred Draked04038d2000-06-29 20:15:14 +00001052\cdata{PyExc_Exception}). The \member{__module__} attribute of the
1053new class is set to the first part (up to the last dot) of the
1054\var{name} argument, and the class name is set to the last part (after
1055the last dot). The \var{base} argument can be used to specify an
1056alternate base class. The \var{dict} argument can be used to specify
1057a dictionary of class variables and methods.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001058\end{cfuncdesc}
1059
Jeremy Hyltonb709df32000-09-01 02:47:25 +00001060\begin{cfuncdesc}{void}{PyErr_WriteUnraisable}{PyObject *obj}
1061This utility function prints a warning message to \var{sys.stderr}
1062when an exception has been set but it is impossible for the
1063interpreter to actually raise the exception. It is used, for example,
1064when an exception occurs in an \member{__del__} method.
1065
1066The function is called with a single argument \var{obj} that
1067identifies where the context in which the unraisable exception
1068occurred. The repr of \var{obj} will be printed in the warning
1069message.
1070\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001071
Fred Drakeefd146c1999-02-15 15:30:45 +00001072\section{Standard Exceptions \label{standardExceptions}}
Guido van Rossumae110af1997-05-22 20:11:52 +00001073
1074All standard Python exceptions are available as global variables whose
Fred Drake659ebfa2000-04-03 15:42:13 +00001075names are \samp{PyExc_} followed by the Python exception name. These
1076have the type \ctype{PyObject*}; they are all class objects. For
1077completeness, here are all the variables:
1078
1079\begin{tableiii}{l|l|c}{cdata}{C Name}{Python Name}{Notes}
1080 \lineiii{PyExc_Exception}{\exception{Exception}}{(1)}
1081 \lineiii{PyExc_StandardError}{\exception{StandardError}}{(1)}
1082 \lineiii{PyExc_ArithmeticError}{\exception{ArithmeticError}}{(1)}
1083 \lineiii{PyExc_LookupError}{\exception{LookupError}}{(1)}
1084 \lineiii{PyExc_AssertionError}{\exception{AssertionError}}{}
1085 \lineiii{PyExc_AttributeError}{\exception{AttributeError}}{}
1086 \lineiii{PyExc_EOFError}{\exception{EOFError}}{}
1087 \lineiii{PyExc_EnvironmentError}{\exception{EnvironmentError}}{(1)}
1088 \lineiii{PyExc_FloatingPointError}{\exception{FloatingPointError}}{}
1089 \lineiii{PyExc_IOError}{\exception{IOError}}{}
1090 \lineiii{PyExc_ImportError}{\exception{ImportError}}{}
1091 \lineiii{PyExc_IndexError}{\exception{IndexError}}{}
1092 \lineiii{PyExc_KeyError}{\exception{KeyError}}{}
1093 \lineiii{PyExc_KeyboardInterrupt}{\exception{KeyboardInterrupt}}{}
1094 \lineiii{PyExc_MemoryError}{\exception{MemoryError}}{}
1095 \lineiii{PyExc_NameError}{\exception{NameError}}{}
1096 \lineiii{PyExc_NotImplementedError}{\exception{NotImplementedError}}{}
1097 \lineiii{PyExc_OSError}{\exception{OSError}}{}
1098 \lineiii{PyExc_OverflowError}{\exception{OverflowError}}{}
1099 \lineiii{PyExc_RuntimeError}{\exception{RuntimeError}}{}
1100 \lineiii{PyExc_SyntaxError}{\exception{SyntaxError}}{}
1101 \lineiii{PyExc_SystemError}{\exception{SystemError}}{}
1102 \lineiii{PyExc_SystemExit}{\exception{SystemExit}}{}
1103 \lineiii{PyExc_TypeError}{\exception{TypeError}}{}
1104 \lineiii{PyExc_ValueError}{\exception{ValueError}}{}
Fred Drakea8d73412000-08-11 20:39:29 +00001105 \lineiii{PyExc_WindowsError}{\exception{WindowsError}}{(2)}
Fred Drake659ebfa2000-04-03 15:42:13 +00001106 \lineiii{PyExc_ZeroDivisionError}{\exception{ZeroDivisionError}}{}
1107\end{tableiii}
1108
1109\noindent
Fred Drakea8d73412000-08-11 20:39:29 +00001110Notes:
Fred Drake659ebfa2000-04-03 15:42:13 +00001111\begin{description}
1112\item[(1)]
Fred Draked04038d2000-06-29 20:15:14 +00001113 This is a base class for other standard exceptions.
Fred Drakea8d73412000-08-11 20:39:29 +00001114
1115\item[(2)]
1116 Only defined on Windows; protect code that uses this by testing that
1117 the preprocessor macro \code{MS_WINDOWS} is defined.
Fred Drake659ebfa2000-04-03 15:42:13 +00001118\end{description}
1119
1120
1121\section{Deprecation of String Exceptions}
1122
Fred Draked04038d2000-06-29 20:15:14 +00001123All exceptions built into Python or provided in the standard library
1124are derived from \exception{Exception}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001125\withsubitem{(built-in exception)}{\ttindex{Exception}}
Fred Drake659ebfa2000-04-03 15:42:13 +00001126
Fred Draked04038d2000-06-29 20:15:14 +00001127String exceptions are still supported in the interpreter to allow
Fred Drake659ebfa2000-04-03 15:42:13 +00001128existing code to run unmodified, but this will also change in a future
1129release.
Guido van Rossumae110af1997-05-22 20:11:52 +00001130
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001131
Fred Drakeefd146c1999-02-15 15:30:45 +00001132\chapter{Utilities \label{utilities}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001133
Fred Drake88fdaa72001-07-20 20:56:11 +00001134The functions in this chapter perform various utility tasks, ranging
1135from helping C code be more portable across platforms, using Python
1136modules from C, and parsing function arguments and constructing Python
1137values from C values.
1138
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001139
Fred Drake377fb1e2001-07-14 03:01:48 +00001140\section{Operating System Utilities \label{os}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001141
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001142\begin{cfuncdesc}{int}{Py_FdIsInteractive}{FILE *fp, char *filename}
Fred Drakee058b4f1998-02-16 06:15:35 +00001143Return true (nonzero) if the standard I/O file \var{fp} with name
1144\var{filename} is deemed interactive. This is the case for files for
1145which \samp{isatty(fileno(\var{fp}))} is true. If the global flag
Fred Drakef8830d11998-04-23 14:06:01 +00001146\cdata{Py_InteractiveFlag} is true, this function also returns true if
Fred Drake5566c1c2001-01-19 22:48:33 +00001147the \var{filename} pointer is \NULL{} or if the name is equal to one of
Fred Drakea8455ab2000-06-16 19:58:42 +00001148the strings \code{'<stdin>'} or \code{'???'}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001149\end{cfuncdesc}
1150
1151\begin{cfuncdesc}{long}{PyOS_GetLastModificationTime}{char *filename}
Fred Drakee058b4f1998-02-16 06:15:35 +00001152Return the time of last modification of the file \var{filename}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001153The result is encoded in the same way as the timestamp returned by
Fred Drake659ebfa2000-04-03 15:42:13 +00001154the standard C library function \cfunction{time()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001155\end{cfuncdesc}
1156
Fred Drakecabbc3b2000-06-28 15:53:13 +00001157\begin{cfuncdesc}{void}{PyOS_AfterFork}{}
1158Function to update some internal state after a process fork; this
1159should be called in the new process if the Python interpreter will
1160continue to be used. If a new executable is loaded into the new
1161process, this function does not need to be called.
1162\end{cfuncdesc}
1163
Fred Drake17e63432000-08-31 05:50:40 +00001164\begin{cfuncdesc}{int}{PyOS_CheckStack}{}
1165Return true when the interpreter runs out of stack space. This is a
1166reliable check, but is only available when \code{USE_STACKCHECK} is
1167defined (currently on Windows using the Microsoft Visual C++ compiler
1168and on the Macintosh). \code{USE_CHECKSTACK} will be defined
1169automatically; you should never change the definition in your own
1170code.
1171\end{cfuncdesc}
1172
Guido van Rossumc96ec6e2000-09-16 16:30:48 +00001173\begin{cfuncdesc}{PyOS_sighandler_t}{PyOS_getsig}{int i}
1174Return the current signal handler for signal \var{i}.
1175This is a thin wrapper around either \cfunction{sigaction} or
1176\cfunction{signal}. Do not call those functions directly!
1177\ctype{PyOS_sighandler_t} is a typedef alias for \ctype{void (*)(int)}.
1178\end{cfuncdesc}
1179
1180\begin{cfuncdesc}{PyOS_sighandler_t}{PyOS_setsig}{int i, PyOS_sighandler_t h}
1181Set the signal handler for signal \var{i} to be \var{h};
1182return the old signal handler.
1183This is a thin wrapper around either \cfunction{sigaction} or
1184\cfunction{signal}. Do not call those functions directly!
1185\ctype{PyOS_sighandler_t} is a typedef alias for \ctype{void (*)(int)}.
1186\end{cfuncdesc}
1187
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001188
Fred Drakeefd146c1999-02-15 15:30:45 +00001189\section{Process Control \label{processControl}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001190
1191\begin{cfuncdesc}{void}{Py_FatalError}{char *message}
1192Print a fatal error message and kill the process. No cleanup is
1193performed. This function should only be invoked when a condition is
1194detected that would make it dangerous to continue using the Python
1195interpreter; e.g., when the object administration appears to be
Fred Drake659ebfa2000-04-03 15:42:13 +00001196corrupted. On \UNIX{}, the standard C library function
1197\cfunction{abort()}\ttindex{abort()} is called which will attempt to
1198produce a \file{core} file.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001199\end{cfuncdesc}
1200
1201\begin{cfuncdesc}{void}{Py_Exit}{int status}
Fred Drake659ebfa2000-04-03 15:42:13 +00001202Exit the current process. This calls
1203\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} and
1204then calls the standard C library function
1205\code{exit(\var{status})}\ttindex{exit()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001206\end{cfuncdesc}
1207
1208\begin{cfuncdesc}{int}{Py_AtExit}{void (*func) ()}
Fred Drake659ebfa2000-04-03 15:42:13 +00001209Register a cleanup function to be called by
1210\cfunction{Py_Finalize()}\ttindex{Py_Finalize()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001211The cleanup function will be called with no arguments and should
Fred Drake659ebfa2000-04-03 15:42:13 +00001212return no value. At most 32 \index{cleanup functions}cleanup
1213functions can be registered.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001214When the registration is successful, \cfunction{Py_AtExit()} returns
1215\code{0}; on failure, it returns \code{-1}. The cleanup function
1216registered last is called first. Each cleanup function will be called
1217at most once. Since Python's internal finallization will have
1218completed before the cleanup function, no Python APIs should be called
1219by \var{func}.
1220\end{cfuncdesc}
1221
1222
Fred Drakeefd146c1999-02-15 15:30:45 +00001223\section{Importing Modules \label{importing}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001224
Fred Drakec6fa34e1998-04-02 06:47:24 +00001225\begin{cfuncdesc}{PyObject*}{PyImport_ImportModule}{char *name}
Fred Drake659ebfa2000-04-03 15:42:13 +00001226This is a simplified interface to
1227\cfunction{PyImport_ImportModuleEx()} below, leaving the
1228\var{globals} and \var{locals} arguments set to \NULL{}. When the
Fred Drakebab29652001-07-10 16:10:08 +00001229\var{name} argument contains a dot (when it specifies a
Fred Drake659ebfa2000-04-03 15:42:13 +00001230submodule of a package), the \var{fromlist} argument is set to the
1231list \code{['*']} so that the return value is the named module rather
1232than the top-level package containing it as would otherwise be the
1233case. (Unfortunately, this has an additional side effect when
1234\var{name} in fact specifies a subpackage instead of a submodule: the
1235submodules specified in the package's \code{__all__} variable are
1236\index{package variable!\code{__all__}}
1237\withsubitem{(package variable)}{\ttindex{__all__}}loaded.) Return a
1238new reference to the imported module, or
1239\NULL{} with an exception set on failure (the module may still be
1240created in this case --- examine \code{sys.modules} to find out).
1241\withsubitem{(in module sys)}{\ttindex{modules}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001242\end{cfuncdesc}
1243
Fred Drakec6fa34e1998-04-02 06:47:24 +00001244\begin{cfuncdesc}{PyObject*}{PyImport_ImportModuleEx}{char *name, PyObject *globals, PyObject *locals, PyObject *fromlist}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001245Import a module. This is best described by referring to the built-in
Fred Drake53fb7721998-02-16 06:23:20 +00001246Python function \function{__import__()}\bifuncindex{__import__}, as
1247the standard \function{__import__()} function calls this function
1248directly.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001249
Guido van Rossum42cefd01997-10-05 15:27:29 +00001250The return value is a new reference to the imported module or
Guido van Rossum580aa8d1997-11-25 15:34:51 +00001251top-level package, or \NULL{} with an exception set on failure
Guido van Rossumc44d3d61997-10-06 05:10:47 +00001252(the module may still be created in this case). Like for
Fred Drakee058b4f1998-02-16 06:15:35 +00001253\function{__import__()}, the return value when a submodule of a
1254package was requested is normally the top-level package, unless a
1255non-empty \var{fromlist} was given.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001256\end{cfuncdesc}
1257
Fred Drakec6fa34e1998-04-02 06:47:24 +00001258\begin{cfuncdesc}{PyObject*}{PyImport_Import}{PyObject *name}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001259This is a higher-level interface that calls the current ``import hook
Fred Drakee058b4f1998-02-16 06:15:35 +00001260function''. It invokes the \function{__import__()} function from the
Guido van Rossum42cefd01997-10-05 15:27:29 +00001261\code{__builtins__} of the current globals. This means that the
1262import is done using whatever import hooks are installed in the
Fred Drake4de05a91998-02-16 14:25:26 +00001263current environment, e.g. by \module{rexec}\refstmodindex{rexec} or
1264\module{ihooks}\refstmodindex{ihooks}.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001265\end{cfuncdesc}
1266
Fred Drakec6fa34e1998-04-02 06:47:24 +00001267\begin{cfuncdesc}{PyObject*}{PyImport_ReloadModule}{PyObject *m}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001268Reload a module. This is best described by referring to the built-in
Fred Drake53fb7721998-02-16 06:23:20 +00001269Python function \function{reload()}\bifuncindex{reload}, as the standard
Fred Drakee058b4f1998-02-16 06:15:35 +00001270\function{reload()} function calls this function directly. Return a
1271new reference to the reloaded module, or \NULL{} with an exception set
1272on failure (the module still exists in this case).
Guido van Rossum42cefd01997-10-05 15:27:29 +00001273\end{cfuncdesc}
1274
Fred Drakec6fa34e1998-04-02 06:47:24 +00001275\begin{cfuncdesc}{PyObject*}{PyImport_AddModule}{char *name}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001276Return the module object corresponding to a module name. The
1277\var{name} argument may be of the form \code{package.module}). First
1278check the modules dictionary if there's one there, and if not, create
Fred Drake659ebfa2000-04-03 15:42:13 +00001279a new one and insert in in the modules dictionary.
Guido van Rossuma096a2e1998-11-02 17:02:42 +00001280Warning: this function does not load or import the module; if the
1281module wasn't already loaded, you will get an empty module object.
1282Use \cfunction{PyImport_ImportModule()} or one of its variants to
1283import a module.
Fred Drake659ebfa2000-04-03 15:42:13 +00001284Return \NULL{} with an exception set on failure.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001285\end{cfuncdesc}
1286
Fred Drakec6fa34e1998-04-02 06:47:24 +00001287\begin{cfuncdesc}{PyObject*}{PyImport_ExecCodeModule}{char *name, PyObject *co}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001288Given a module name (possibly of the form \code{package.module}) and a
1289code object read from a Python bytecode file or obtained from the
Fred Drake53fb7721998-02-16 06:23:20 +00001290built-in function \function{compile()}\bifuncindex{compile}, load the
1291module. Return a new reference to the module object, or \NULL{} with
1292an exception set if an error occurred (the module may still be created
1293in this case). (This function would reload the module if it was
1294already imported.)
Guido van Rossum42cefd01997-10-05 15:27:29 +00001295\end{cfuncdesc}
1296
1297\begin{cfuncdesc}{long}{PyImport_GetMagicNumber}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00001298Return the magic number for Python bytecode files (a.k.a.
1299\file{.pyc} and \file{.pyo} files). The magic number should be
1300present in the first four bytes of the bytecode file, in little-endian
1301byte order.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001302\end{cfuncdesc}
1303
Fred Drakec6fa34e1998-04-02 06:47:24 +00001304\begin{cfuncdesc}{PyObject*}{PyImport_GetModuleDict}{}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001305Return the dictionary used for the module administration
1306(a.k.a. \code{sys.modules}). Note that this is a per-interpreter
1307variable.
1308\end{cfuncdesc}
1309
1310\begin{cfuncdesc}{void}{_PyImport_Init}{}
1311Initialize the import mechanism. For internal use only.
1312\end{cfuncdesc}
1313
1314\begin{cfuncdesc}{void}{PyImport_Cleanup}{}
1315Empty the module table. For internal use only.
1316\end{cfuncdesc}
1317
1318\begin{cfuncdesc}{void}{_PyImport_Fini}{}
1319Finalize the import mechanism. For internal use only.
1320\end{cfuncdesc}
1321
Fred Drakec6fa34e1998-04-02 06:47:24 +00001322\begin{cfuncdesc}{PyObject*}{_PyImport_FindExtension}{char *, char *}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001323For internal use only.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00001324\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001325
Fred Drakec6fa34e1998-04-02 06:47:24 +00001326\begin{cfuncdesc}{PyObject*}{_PyImport_FixupExtension}{char *, char *}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001327For internal use only.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00001328\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001329
Fred Drake1d158692000-06-18 05:21:21 +00001330\begin{cfuncdesc}{int}{PyImport_ImportFrozenModule}{char *name}
1331Load a frozen module named \var{name}. Return \code{1} for success,
1332\code{0} if the module is not found, and \code{-1} with an exception
1333set if the initialization failed. To access the imported module on a
1334successful load, use \cfunction{PyImport_ImportModule()}.
Fred Drakee058b4f1998-02-16 06:15:35 +00001335(Note the misnomer --- this function would reload the module if it was
Guido van Rossum42cefd01997-10-05 15:27:29 +00001336already imported.)
1337\end{cfuncdesc}
1338
Fred Drake659ebfa2000-04-03 15:42:13 +00001339\begin{ctypedesc}[_frozen]{struct _frozen}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001340This is the structure type definition for frozen module descriptors,
Fred Drakec6fa34e1998-04-02 06:47:24 +00001341as generated by the \program{freeze}\index{freeze utility} utility
1342(see \file{Tools/freeze/} in the Python source distribution). Its
Fred Drakee0d9a832000-09-01 05:30:00 +00001343definition, found in \file{Include/import.h}, is:
Fred Drakec6fa34e1998-04-02 06:47:24 +00001344
Guido van Rossum9faf4c51997-10-07 14:38:54 +00001345\begin{verbatim}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001346struct _frozen {
Fred Drake36fbe761997-10-13 18:18:33 +00001347 char *name;
1348 unsigned char *code;
1349 int size;
Guido van Rossum42cefd01997-10-05 15:27:29 +00001350};
Guido van Rossum9faf4c51997-10-07 14:38:54 +00001351\end{verbatim}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001352\end{ctypedesc}
1353
Fred Drakec6fa34e1998-04-02 06:47:24 +00001354\begin{cvardesc}{struct _frozen*}{PyImport_FrozenModules}
Fred Drakef8830d11998-04-23 14:06:01 +00001355This pointer is initialized to point to an array of \ctype{struct
Fred Drake659ebfa2000-04-03 15:42:13 +00001356_frozen} records, terminated by one whose members are all
1357\NULL{} or zero. When a frozen module is imported, it is searched in
1358this table. Third-party code could play tricks with this to provide a
Guido van Rossum42cefd01997-10-05 15:27:29 +00001359dynamically created collection of frozen modules.
1360\end{cvardesc}
1361
Fred Drakee0d9a832000-09-01 05:30:00 +00001362\begin{cfuncdesc}{int}{PyImport_AppendInittab}{char *name,
1363 void (*initfunc)(void)}
1364Add a single module to the existing table of built-in modules. This
1365is a convenience wrapper around \cfunction{PyImport_ExtendInittab()},
1366returning \code{-1} if the table could not be extended. The new
1367module can be imported by the name \var{name}, and uses the function
1368\var{initfunc} as the initialization function called on the first
1369attempted import. This should be called before
1370\cfunction{Py_Initialize()}.
1371\end{cfuncdesc}
1372
1373\begin{ctypedesc}[_inittab]{struct _inittab}
1374Structure describing a single entry in the list of built-in modules.
1375Each of these structures gives the name and initialization function
1376for a module built into the interpreter. Programs which embed Python
1377may use an array of these structures in conjunction with
1378\cfunction{PyImport_ExtendInittab()} to provide additional built-in
1379modules. The structure is defined in \file{Include/import.h} as:
1380
1381\begin{verbatim}
1382struct _inittab {
1383 char *name;
1384 void (*initfunc)(void);
1385};
1386\end{verbatim}
1387\end{ctypedesc}
1388
1389\begin{cfuncdesc}{int}{PyImport_ExtendInittab}{struct _inittab *newtab}
1390Add a collection of modules to the table of built-in modules. The
1391\var{newtab} array must end with a sentinel entry which contains
1392\NULL{} for the \member{name} field; failure to provide the sentinel
1393value can result in a memory fault. Returns \code{0} on success or
1394\code{-1} if insufficient memory could be allocated to extend the
1395internal table. In the event of failure, no modules are added to the
1396internal table. This should be called before
1397\cfunction{Py_Initialize()}.
1398\end{cfuncdesc}
1399
Guido van Rossum42cefd01997-10-05 15:27:29 +00001400
Fred Drake88fdaa72001-07-20 20:56:11 +00001401\section{Parsing arguements and building values
1402 \label{arg-parsing}}
1403
1404These functions are useful when creating your own extensions functions
1405and methods. Additional information and examples are available in
1406\citetitle[../ext/ext.html]{Extending and Embedding the Python
1407Interpreter}.
1408
1409\begin{cfuncdesc}{int}{PyArg_ParseTuple}{PyObject *args, char *format,
1410 \moreargs}
1411 Parse the parameters of a function that takes only positional
1412 parameters into local variables. Returns true on success; on
1413 failure, it returns false and raises the appropriate exception. See
1414 \citetitle[../ext/parseTuple.html]{Extending and Embedding the
1415 Python Interpreter} for more information.
1416\end{cfuncdesc}
1417
1418\begin{cfuncdesc}{int}{PyArg_ParseTupleAndKeywords}{PyObject *args,
1419 PyObject *kw, char *format, char *keywords[],
1420 \moreargs}
1421 Parse the parameters of a function that takes both positional and
1422 keyword parameters into local variables. Returns true on success;
1423 on failure, it returns false and raises the appropriate exception.
1424 See \citetitle[../ext/parseTupleAndKeywords.html]{Extending and
1425 Embedding the Python Interpreter} for more information.
1426\end{cfuncdesc}
1427
1428\begin{cfuncdesc}{int}{PyArg_Parse}{PyObject *args, char *format,
1429 \moreargs}
1430 Function used to deconstruct the argument lists of ``old-style''
1431 functions --- these are functions which use the
1432 \constant{METH_OLDARGS} parameter parsing method. This is not
1433 recommended for use in parameter parsing in new code, and most code
1434 in the standard interpreter has been modified to no longer use this
1435 for that purpose. It does remain a convenient way to decompose
1436 other tuples, however, and may continue to be used for that
1437 purpose.
1438\end{cfuncdesc}
1439
1440\begin{cfuncdesc}{PyObject*}{Py_BuildValue}{char *format,
1441 \moreargs}
1442 Create a new value based on a format string similar to those
1443 accepted by the \cfunction{PyArg_Parse*()} family of functions and a
1444 sequence of values. Returns the value or \NULL{} in the case of an
1445 error; an exception will be raised if \NULL{} is returned. For more
1446 information on the format string and additional parameters, see
1447 \citetitle[../ext/buildValue.html]{Extending and Embedding the
1448 Python Interpreter}.
1449\end{cfuncdesc}
1450
1451
1452
Fred Drakeefd146c1999-02-15 15:30:45 +00001453\chapter{Abstract Objects Layer \label{abstract}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001454
1455The functions in this chapter interact with Python objects regardless
1456of their type, or with wide classes of object types (e.g. all
1457numerical types, or all sequence types). When used on object types
Fred Drake659ebfa2000-04-03 15:42:13 +00001458for which they do not apply, they will raise a Python exception.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001459
Fred Drakeefd146c1999-02-15 15:30:45 +00001460\section{Object Protocol \label{object}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001461
1462\begin{cfuncdesc}{int}{PyObject_Print}{PyObject *o, FILE *fp, int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00001463Print an object \var{o}, on file \var{fp}. Returns \code{-1} on error.
1464The flags argument is used to enable certain printing options. The
1465only option currently supported is \constant{Py_PRINT_RAW}; if given,
1466the \function{str()} of the object is written instead of the
1467\function{repr()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001468\end{cfuncdesc}
1469
1470\begin{cfuncdesc}{int}{PyObject_HasAttrString}{PyObject *o, char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001471Returns \code{1} if \var{o} has the attribute \var{attr_name}, and
1472\code{0} otherwise. This is equivalent to the Python expression
1473\samp{hasattr(\var{o}, \var{attr_name})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001474This function always succeeds.
1475\end{cfuncdesc}
1476
Fred Drake659ebfa2000-04-03 15:42:13 +00001477\begin{cfuncdesc}{PyObject*}{PyObject_GetAttrString}{PyObject *o,
1478 char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001479Retrieve an attribute named \var{attr_name} from object \var{o}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001480Returns the attribute value on success, or \NULL{} on failure.
Fred Drakee058b4f1998-02-16 06:15:35 +00001481This is the equivalent of the Python expression
1482\samp{\var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001483\end{cfuncdesc}
1484
1485
1486\begin{cfuncdesc}{int}{PyObject_HasAttr}{PyObject *o, PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001487Returns \code{1} if \var{o} has the attribute \var{attr_name}, and
1488\code{0} otherwise. This is equivalent to the Python expression
1489\samp{hasattr(\var{o}, \var{attr_name})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001490This function always succeeds.
1491\end{cfuncdesc}
1492
1493
Fred Drake659ebfa2000-04-03 15:42:13 +00001494\begin{cfuncdesc}{PyObject*}{PyObject_GetAttr}{PyObject *o,
1495 PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001496Retrieve an attribute named \var{attr_name} from object \var{o}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001497Returns the attribute value on success, or \NULL{} on failure.
Fred Drakee058b4f1998-02-16 06:15:35 +00001498This is the equivalent of the Python expression
1499\samp{\var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001500\end{cfuncdesc}
1501
1502
1503\begin{cfuncdesc}{int}{PyObject_SetAttrString}{PyObject *o, char *attr_name, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001504Set the value of the attribute named \var{attr_name}, for object
1505\var{o}, to the value \var{v}. Returns \code{-1} on failure. This is
1506the equivalent of the Python statement \samp{\var{o}.\var{attr_name} =
1507\var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001508\end{cfuncdesc}
1509
1510
1511\begin{cfuncdesc}{int}{PyObject_SetAttr}{PyObject *o, PyObject *attr_name, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001512Set the value of the attribute named \var{attr_name}, for
1513object \var{o},
1514to the value \var{v}. Returns \code{-1} on failure. This is
1515the equivalent of the Python statement \samp{\var{o}.\var{attr_name} =
1516\var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001517\end{cfuncdesc}
1518
1519
1520\begin{cfuncdesc}{int}{PyObject_DelAttrString}{PyObject *o, char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001521Delete attribute named \var{attr_name}, for object \var{o}. Returns
1522\code{-1} on failure. This is the equivalent of the Python
1523statement: \samp{del \var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001524\end{cfuncdesc}
1525
1526
1527\begin{cfuncdesc}{int}{PyObject_DelAttr}{PyObject *o, PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001528Delete attribute named \var{attr_name}, for object \var{o}. Returns
1529\code{-1} on failure. This is the equivalent of the Python
1530statement \samp{del \var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001531\end{cfuncdesc}
1532
1533
1534\begin{cfuncdesc}{int}{PyObject_Cmp}{PyObject *o1, PyObject *o2, int *result}
Fred Drakee058b4f1998-02-16 06:15:35 +00001535Compare the values of \var{o1} and \var{o2} using a routine provided
1536by \var{o1}, if one exists, otherwise with a routine provided by
1537\var{o2}. The result of the comparison is returned in \var{result}.
1538Returns \code{-1} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001539statement\bifuncindex{cmp} \samp{\var{result} = cmp(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001540\end{cfuncdesc}
1541
1542
1543\begin{cfuncdesc}{int}{PyObject_Compare}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001544Compare the values of \var{o1} and \var{o2} using a routine provided
1545by \var{o1}, if one exists, otherwise with a routine provided by
1546\var{o2}. Returns the result of the comparison on success. On error,
1547the value returned is undefined; use \cfunction{PyErr_Occurred()} to
Fred Drake659ebfa2000-04-03 15:42:13 +00001548detect an error. This is equivalent to the Python
1549expression\bifuncindex{cmp} \samp{cmp(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001550\end{cfuncdesc}
1551
1552
1553\begin{cfuncdesc}{PyObject*}{PyObject_Repr}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001554Compute a string representation of object \var{o}. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001555string representation on success, \NULL{} on failure. This is
Fred Drakee058b4f1998-02-16 06:15:35 +00001556the equivalent of the Python expression \samp{repr(\var{o})}.
1557Called by the \function{repr()}\bifuncindex{repr} built-in function
1558and by reverse quotes.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001559\end{cfuncdesc}
1560
1561
1562\begin{cfuncdesc}{PyObject*}{PyObject_Str}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001563Compute a string representation of object \var{o}. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001564string representation on success, \NULL{} on failure. This is
Fred Drakee058b4f1998-02-16 06:15:35 +00001565the equivalent of the Python expression \samp{str(\var{o})}.
1566Called by the \function{str()}\bifuncindex{str} built-in function and
1567by the \keyword{print} statement.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001568\end{cfuncdesc}
1569
1570
Marc-André Lemburgad7c98e2001-01-17 17:09:53 +00001571\begin{cfuncdesc}{PyObject*}{PyObject_Unicode}{PyObject *o}
1572Compute a Unicode string representation of object \var{o}. Returns the
1573Unicode string representation on success, \NULL{} on failure. This is
1574the equivalent of the Python expression \samp{unistr(\var{o})}.
1575Called by the \function{unistr()}\bifuncindex{unistr} built-in function.
1576\end{cfuncdesc}
1577
Fred Drake58c8f9f2001-03-28 21:14:32 +00001578\begin{cfuncdesc}{int}{PyObject_IsInstance}{PyObject *inst, PyObject *cls}
1579Return \code{1} if \var{inst} is an instance of the class \var{cls} or
1580a subclass of \var{cls}. If \var{cls} is a type object rather than a
1581class object, \cfunction{PyObject_IsInstance()} returns \code{1} if
1582\var{inst} is of type \var{cls}. If \var{inst} is not a class
1583instance and \var{cls} is neither a type object or class object,
1584\var{inst} must have a \member{__class__} attribute --- the class
1585relationship of the value of that attribute with \var{cls} will be
1586used to determine the result of this function.
1587\versionadded{2.1}
1588\end{cfuncdesc}
1589
1590Subclass determination is done in a fairly straightforward way, but
1591includes a wrinkle that implementors of extensions to the class system
1592may want to be aware of. If \class{A} and \class{B} are class
1593objects, \class{B} is a subclass of \class{A} if it inherits from
1594\class{A} either directly or indirectly. If either is not a class
1595object, a more general mechanism is used to determine the class
1596relationship of the two objects. When testing if \var{B} is a
1597subclass of \var{A}, if \var{A} is \var{B},
1598\cfunction{PyObject_IsSubclass()} returns true. If \var{A} and
1599\var{B} are different objects, \var{B}'s \member{__bases__} attribute
1600is searched in a depth-first fashion for \var{A} --- the presence of
1601the \member{__bases__} attribute is considered sufficient for this
1602determination.
1603
1604\begin{cfuncdesc}{int}{PyObject_IsSubclass}{PyObject *derived,
1605 PyObject *cls}
1606Returns \code{1} if the class \var{derived} is identical to or derived
1607from the class \var{cls}, otherwise returns \code{0}. In case of an
1608error, returns \code{-1}. If either \var{derived} or \var{cls} is not
1609an actual class object, this function uses the generic algorithm
1610described above.
1611\versionadded{2.1}
1612\end{cfuncdesc}
1613
Marc-André Lemburgad7c98e2001-01-17 17:09:53 +00001614
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001615\begin{cfuncdesc}{int}{PyCallable_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001616Determine if the object \var{o} is callable. Return \code{1} if the
Fred Drakee058b4f1998-02-16 06:15:35 +00001617object is callable and \code{0} otherwise.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001618This function always succeeds.
1619\end{cfuncdesc}
1620
1621
Fred Drake659ebfa2000-04-03 15:42:13 +00001622\begin{cfuncdesc}{PyObject*}{PyObject_CallObject}{PyObject *callable_object,
1623 PyObject *args}
Fred Drakee058b4f1998-02-16 06:15:35 +00001624Call a callable Python object \var{callable_object}, with
1625arguments given by the tuple \var{args}. If no arguments are
Fred Drake659ebfa2000-04-03 15:42:13 +00001626needed, then \var{args} may be \NULL{}. Returns the result of the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001627call on success, or \NULL{} on failure. This is the equivalent
Fred Drakef90490e2001-08-02 18:00:28 +00001628of the Python expression \samp{apply(\var{callable_object},
1629\var{args})} or \samp{\var{callable_object}(*\var{args})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001630\bifuncindex{apply}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001631\end{cfuncdesc}
1632
Fred Drake5566c1c2001-01-19 22:48:33 +00001633\begin{cfuncdesc}{PyObject*}{PyObject_CallFunction}{PyObject *callable_object,
1634 char *format, ...}
Fred Drakee058b4f1998-02-16 06:15:35 +00001635Call a callable Python object \var{callable_object}, with a
Fred Drake659ebfa2000-04-03 15:42:13 +00001636variable number of C arguments. The C arguments are described
Fred Drakee058b4f1998-02-16 06:15:35 +00001637using a \cfunction{Py_BuildValue()} style format string. The format may
1638be \NULL{}, indicating that no arguments are provided. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001639result of the call on success, or \NULL{} on failure. This is
Fred Drakef90490e2001-08-02 18:00:28 +00001640the equivalent of the Python expression
1641\samp{apply(\var{callable_object}\var{args})} or
1642\samp{\var{callable_object}(*\var{args})}.
1643\bifuncindex{apply}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001644\end{cfuncdesc}
1645
1646
Fred Drake5566c1c2001-01-19 22:48:33 +00001647\begin{cfuncdesc}{PyObject*}{PyObject_CallMethod}{PyObject *o,
1648 char *method, char *format, ...}
Fred Drakee058b4f1998-02-16 06:15:35 +00001649Call the method named \var{m} of object \var{o} with a variable number
Fred Drake659ebfa2000-04-03 15:42:13 +00001650of C arguments. The C arguments are described by a
Fred Drakee058b4f1998-02-16 06:15:35 +00001651\cfunction{Py_BuildValue()} format string. The format may be \NULL{},
1652indicating that no arguments are provided. Returns the result of the
1653call on success, or \NULL{} on failure. This is the equivalent of the
1654Python expression \samp{\var{o}.\var{method}(\var{args})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001655Note that special method names, such as \method{__add__()},
1656\method{__getitem__()}, and so on are not supported. The specific
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001657abstract-object routines for these must be used.
1658\end{cfuncdesc}
1659
1660
1661\begin{cfuncdesc}{int}{PyObject_Hash}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001662Compute and return the hash value of an object \var{o}. On
1663failure, return \code{-1}. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001664expression \samp{hash(\var{o})}.\bifuncindex{hash}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001665\end{cfuncdesc}
1666
1667
1668\begin{cfuncdesc}{int}{PyObject_IsTrue}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001669Returns \code{1} if the object \var{o} is considered to be true, and
1670\code{0} otherwise. This is equivalent to the Python expression
1671\samp{not not \var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001672This function always succeeds.
1673\end{cfuncdesc}
1674
1675
1676\begin{cfuncdesc}{PyObject*}{PyObject_Type}{PyObject *o}
1677On success, returns a type object corresponding to the object
Fred Drakee058b4f1998-02-16 06:15:35 +00001678type of object \var{o}. On failure, returns \NULL{}. This is
1679equivalent to the Python expression \samp{type(\var{o})}.
Fred Drake53fb7721998-02-16 06:23:20 +00001680\bifuncindex{type}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001681\end{cfuncdesc}
1682
1683\begin{cfuncdesc}{int}{PyObject_Length}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001684Return the length of object \var{o}. If the object \var{o} provides
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001685both sequence and mapping protocols, the sequence length is
Fred Drake659ebfa2000-04-03 15:42:13 +00001686returned. On error, \code{-1} is returned. This is the equivalent
1687to the Python expression \samp{len(\var{o})}.\bifuncindex{len}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001688\end{cfuncdesc}
1689
1690
1691\begin{cfuncdesc}{PyObject*}{PyObject_GetItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001692Return element of \var{o} corresponding to the object \var{key} or
1693\NULL{} on failure. This is the equivalent of the Python expression
1694\samp{\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001695\end{cfuncdesc}
1696
1697
1698\begin{cfuncdesc}{int}{PyObject_SetItem}{PyObject *o, PyObject *key, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001699Map the object \var{key} to the value \var{v}.
1700Returns \code{-1} on failure. This is the equivalent
1701of the Python statement \samp{\var{o}[\var{key}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001702\end{cfuncdesc}
1703
1704
Guido van Rossumd1dbf631999-01-22 20:10:49 +00001705\begin{cfuncdesc}{int}{PyObject_DelItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001706Delete the mapping for \var{key} from \var{o}. Returns \code{-1} on
1707failure. This is the equivalent of the Python statement \samp{del
1708\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001709\end{cfuncdesc}
1710
Andrew M. Kuchling8c46b302000-07-13 23:58:16 +00001711\begin{cfuncdesc}{int}{PyObject_AsFileDescriptor}{PyObject *o}
1712Derives a file-descriptor from a Python object. If the object
1713is an integer or long integer, its value is returned. If not, the
1714object's \method{fileno()} method is called if it exists; the method
1715must return an integer or long integer, which is returned as the file
1716descriptor value. Returns \code{-1} on failure.
1717\end{cfuncdesc}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001718
Fred Drakeefd146c1999-02-15 15:30:45 +00001719\section{Number Protocol \label{number}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001720
1721\begin{cfuncdesc}{int}{PyNumber_Check}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001722Returns \code{1} if the object \var{o} provides numeric protocols, and
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001723false otherwise.
1724This function always succeeds.
1725\end{cfuncdesc}
1726
1727
1728\begin{cfuncdesc}{PyObject*}{PyNumber_Add}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001729Returns the result of adding \var{o1} and \var{o2}, or \NULL{} on
1730failure. This is the equivalent of the Python expression
1731\samp{\var{o1} + \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001732\end{cfuncdesc}
1733
1734
1735\begin{cfuncdesc}{PyObject*}{PyNumber_Subtract}{PyObject *o1, PyObject *o2}
Fred Drake659ebfa2000-04-03 15:42:13 +00001736Returns the result of subtracting \var{o2} from \var{o1}, or
1737\NULL{} on failure. This is the equivalent of the Python expression
Fred Drakee058b4f1998-02-16 06:15:35 +00001738\samp{\var{o1} - \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001739\end{cfuncdesc}
1740
1741
1742\begin{cfuncdesc}{PyObject*}{PyNumber_Multiply}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001743Returns the result of multiplying \var{o1} and \var{o2}, or \NULL{} on
1744failure. This is the equivalent of the Python expression
1745\samp{\var{o1} * \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001746\end{cfuncdesc}
1747
1748
1749\begin{cfuncdesc}{PyObject*}{PyNumber_Divide}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001750Returns the result of dividing \var{o1} by \var{o2}, or \NULL{} on
1751failure.
1752This is the equivalent of the Python expression \samp{\var{o1} /
1753\var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001754\end{cfuncdesc}
1755
1756
1757\begin{cfuncdesc}{PyObject*}{PyNumber_Remainder}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001758Returns the remainder of dividing \var{o1} by \var{o2}, or \NULL{} on
1759failure. This is the equivalent of the Python expression
Fred Drake659ebfa2000-04-03 15:42:13 +00001760\samp{\var{o1} \%\ \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001761\end{cfuncdesc}
1762
1763
1764\begin{cfuncdesc}{PyObject*}{PyNumber_Divmod}{PyObject *o1, PyObject *o2}
Fred Drake53fb7721998-02-16 06:23:20 +00001765See the built-in function \function{divmod()}\bifuncindex{divmod}.
1766Returns \NULL{} on failure. This is the equivalent of the Python
1767expression \samp{divmod(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001768\end{cfuncdesc}
1769
1770
1771\begin{cfuncdesc}{PyObject*}{PyNumber_Power}{PyObject *o1, PyObject *o2, PyObject *o3}
Fred Drake53fb7721998-02-16 06:23:20 +00001772See the built-in function \function{pow()}\bifuncindex{pow}. Returns
1773\NULL{} on failure. This is the equivalent of the Python expression
Fred Drakee058b4f1998-02-16 06:15:35 +00001774\samp{pow(\var{o1}, \var{o2}, \var{o3})}, where \var{o3} is optional.
Fred Drake659ebfa2000-04-03 15:42:13 +00001775If \var{o3} is to be ignored, pass \cdata{Py_None} in its place
1776(passing \NULL{} for \var{o3} would cause an illegal memory access).
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001777\end{cfuncdesc}
1778
1779
1780\begin{cfuncdesc}{PyObject*}{PyNumber_Negative}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001781Returns the negation of \var{o} on success, or \NULL{} on failure.
1782This is the equivalent of the Python expression \samp{-\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001783\end{cfuncdesc}
1784
1785
1786\begin{cfuncdesc}{PyObject*}{PyNumber_Positive}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001787Returns \var{o} on success, or \NULL{} on failure.
1788This is the equivalent of the Python expression \samp{+\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001789\end{cfuncdesc}
1790
1791
1792\begin{cfuncdesc}{PyObject*}{PyNumber_Absolute}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001793Returns the absolute value of \var{o}, or \NULL{} on failure. This is
1794the equivalent of the Python expression \samp{abs(\var{o})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001795\bifuncindex{abs}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001796\end{cfuncdesc}
1797
1798
1799\begin{cfuncdesc}{PyObject*}{PyNumber_Invert}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001800Returns the bitwise negation of \var{o} on success, or \NULL{} on
1801failure. This is the equivalent of the Python expression
1802\samp{\~\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001803\end{cfuncdesc}
1804
1805
1806\begin{cfuncdesc}{PyObject*}{PyNumber_Lshift}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001807Returns the result of left shifting \var{o1} by \var{o2} on success,
1808or \NULL{} on failure. This is the equivalent of the Python
Fred Draked20d8b32001-04-13 14:52:39 +00001809expression \samp{\var{o1} <\code{<} \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001810\end{cfuncdesc}
1811
1812
1813\begin{cfuncdesc}{PyObject*}{PyNumber_Rshift}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001814Returns the result of right shifting \var{o1} by \var{o2} on success,
1815or \NULL{} on failure. This is the equivalent of the Python
Fred Draked20d8b32001-04-13 14:52:39 +00001816expression \samp{\var{o1} >\code{>} \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001817\end{cfuncdesc}
1818
1819
1820\begin{cfuncdesc}{PyObject*}{PyNumber_And}{PyObject *o1, PyObject *o2}
Fred Drake7740a012000-09-12 20:27:05 +00001821Returns the ``bitwise and'' of \var{o2} and \var{o2} on success and
1822\NULL{} on failure. This is the equivalent of the Python expression
Fred Drake5566c1c2001-01-19 22:48:33 +00001823\samp{\var{o1} \&\ \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001824\end{cfuncdesc}
1825
1826
1827\begin{cfuncdesc}{PyObject*}{PyNumber_Xor}{PyObject *o1, PyObject *o2}
Fred Drake7740a012000-09-12 20:27:05 +00001828Returns the ``bitwise exclusive or'' of \var{o1} by \var{o2} on success,
Fred Drakee058b4f1998-02-16 06:15:35 +00001829or \NULL{} on failure. This is the equivalent of the Python
Fred Drake755c23d2001-07-14 03:05:53 +00001830expression \samp{\var{o1} \textasciicircum{} \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001831\end{cfuncdesc}
1832
1833\begin{cfuncdesc}{PyObject*}{PyNumber_Or}{PyObject *o1, PyObject *o2}
Fred Drake7740a012000-09-12 20:27:05 +00001834Returns the ``bitwise or'' of \var{o1} and \var{o2} on success, or
1835\NULL{} on failure. This is the equivalent of the Python expression
1836\samp{\var{o1} | \var{o2}}.
1837\end{cfuncdesc}
1838
1839
1840\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceAdd}{PyObject *o1, PyObject *o2}
1841Returns the result of adding \var{o1} and \var{o2}, or \NULL{} on failure.
1842The operation is done \emph{in-place} when \var{o1} supports it. This is the
1843equivalent of the Python expression \samp{\var{o1} += \var{o2}}.
1844\end{cfuncdesc}
1845
1846
1847\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceSubtract}{PyObject *o1, PyObject *o2}
1848Returns the result of subtracting \var{o2} from \var{o1}, or
1849\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1850supports it. This is the equivalent of the Python expression \samp{\var{o1}
1851-= \var{o2}}.
1852\end{cfuncdesc}
1853
1854
1855\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceMultiply}{PyObject *o1, PyObject *o2}
1856Returns the result of multiplying \var{o1} and \var{o2}, or \NULL{} on
1857failure. The operation is done \emph{in-place} when \var{o1} supports it.
1858This is the equivalent of the Python expression \samp{\var{o1} *= \var{o2}}.
1859\end{cfuncdesc}
1860
1861
1862\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceDivide}{PyObject *o1, PyObject *o2}
1863Returns the result of dividing \var{o1} by \var{o2}, or \NULL{} on failure.
1864The operation is done \emph{in-place} when \var{o1} supports it. This is the
1865equivalent of the Python expression \samp{\var{o1} /= \var{o2}}.
1866\end{cfuncdesc}
1867
1868
1869\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceRemainder}{PyObject *o1, PyObject *o2}
1870Returns the remainder of dividing \var{o1} by \var{o2}, or \NULL{} on
1871failure. The operation is done \emph{in-place} when \var{o1} supports it.
1872This is the equivalent of the Python expression \samp{\var{o1} \%= \var{o2}}.
1873\end{cfuncdesc}
1874
1875
1876\begin{cfuncdesc}{PyObject*}{PyNumber_InPlacePower}{PyObject *o1, PyObject *o2, PyObject *o3}
1877See the built-in function \function{pow()}\bifuncindex{pow}. Returns
1878\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1879supports it. This is the equivalent of the Python expression \samp{\var{o1}
1880**= \var{o2}} when o3 is \cdata{Py_None}, or an in-place variant of
Fred Drake5566c1c2001-01-19 22:48:33 +00001881\samp{pow(\var{o1}, \var{o2}, \var{o3})} otherwise. If \var{o3} is to be
Fred Drake7740a012000-09-12 20:27:05 +00001882ignored, pass \cdata{Py_None} in its place (passing \NULL{} for \var{o3}
1883would cause an illegal memory access).
1884\end{cfuncdesc}
1885
1886\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceLshift}{PyObject *o1, PyObject *o2}
1887Returns the result of left shifting \var{o1} by \var{o2} on success, or
1888\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1889supports it. This is the equivalent of the Python expression \samp{\var{o1}
Fred Draked20d8b32001-04-13 14:52:39 +00001890<\code{<=} \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001891\end{cfuncdesc}
1892
1893
1894\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceRshift}{PyObject *o1, PyObject *o2}
1895Returns the result of right shifting \var{o1} by \var{o2} on success, or
1896\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1897supports it. This is the equivalent of the Python expression \samp{\var{o1}
Fred Draked20d8b32001-04-13 14:52:39 +00001898>\code{>=} \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001899\end{cfuncdesc}
1900
1901
1902\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceAnd}{PyObject *o1, PyObject *o2}
Fred Drake5566c1c2001-01-19 22:48:33 +00001903Returns the ``bitwise and'' of \var{o1} and \var{o2} on success
1904and \NULL{} on failure. The operation is done \emph{in-place} when
1905\var{o1} supports it. This is the equivalent of the Python expression
1906\samp{\var{o1} \&= \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001907\end{cfuncdesc}
1908
1909
1910\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceXor}{PyObject *o1, PyObject *o2}
1911Returns the ``bitwise exclusive or'' of \var{o1} by \var{o2} on success, or
1912\NULL{} on failure. The operation is done \emph{in-place} when \var{o1}
1913supports it. This is the equivalent of the Python expression \samp{\var{o1}
Fred Drake377fb1e2001-07-14 03:01:48 +00001914\textasciicircum= \var{o2}}.
Fred Drake7740a012000-09-12 20:27:05 +00001915\end{cfuncdesc}
1916
1917\begin{cfuncdesc}{PyObject*}{PyNumber_InPlaceOr}{PyObject *o1, PyObject *o2}
1918Returns the ``bitwise or'' of \var{o1} and \var{o2} on success, or \NULL{}
1919on failure. The operation is done \emph{in-place} when \var{o1} supports
1920it. This is the equivalent of the Python expression \samp{\var{o1} |=
1921\var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001922\end{cfuncdesc}
1923
Fred Drakec0e6c5b2000-09-22 18:17:49 +00001924\begin{cfuncdesc}{int}{PyNumber_Coerce}{PyObject **p1, PyObject **p2}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001925This function takes the addresses of two variables of type
Fred Drake659ebfa2000-04-03 15:42:13 +00001926\ctype{PyObject*}. If the objects pointed to by \code{*\var{p1}} and
1927\code{*\var{p2}} have the same type, increment their reference count
1928and return \code{0} (success). If the objects can be converted to a
1929common numeric type, replace \code{*p1} and \code{*p2} by their
1930converted value (with 'new' reference counts), and return \code{0}.
1931If no conversion is possible, or if some other error occurs, return
1932\code{-1} (failure) and don't increment the reference counts. The
1933call \code{PyNumber_Coerce(\&o1, \&o2)} is equivalent to the Python
1934statement \samp{\var{o1}, \var{o2} = coerce(\var{o1}, \var{o2})}.
1935\bifuncindex{coerce}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001936\end{cfuncdesc}
1937
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001938\begin{cfuncdesc}{PyObject*}{PyNumber_Int}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001939Returns the \var{o} converted to an integer object on success, or
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001940\NULL{} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001941expression \samp{int(\var{o})}.\bifuncindex{int}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001942\end{cfuncdesc}
1943
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001944\begin{cfuncdesc}{PyObject*}{PyNumber_Long}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001945Returns the \var{o} converted to a long integer object on success,
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001946or \NULL{} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001947expression \samp{long(\var{o})}.\bifuncindex{long}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001948\end{cfuncdesc}
1949
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001950\begin{cfuncdesc}{PyObject*}{PyNumber_Float}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001951Returns the \var{o} converted to a float object on success, or
1952\NULL{} on failure. This is the equivalent of the Python expression
1953\samp{float(\var{o})}.\bifuncindex{float}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001954\end{cfuncdesc}
1955
1956
Fred Drakeefd146c1999-02-15 15:30:45 +00001957\section{Sequence Protocol \label{sequence}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001958
1959\begin{cfuncdesc}{int}{PySequence_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001960Return \code{1} if the object provides sequence protocol, and
1961\code{0} otherwise. This function always succeeds.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001962\end{cfuncdesc}
1963
Fred Drakec6a3cb42001-04-04 01:25:17 +00001964\begin{cfuncdesc}{int}{PySequence_Size}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001965Returns the number of objects in sequence \var{o} on success, and
1966\code{-1} on failure. For objects that do not provide sequence
1967protocol, this is equivalent to the Python expression
1968\samp{len(\var{o})}.\bifuncindex{len}
1969\end{cfuncdesc}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001970
Fred Drakec6a3cb42001-04-04 01:25:17 +00001971\begin{cfuncdesc}{int}{PySequence_Length}{PyObject *o}
1972Alternate name for \cfunction{PySequence_Size()}.
1973\end{cfuncdesc}
1974
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001975\begin{cfuncdesc}{PyObject*}{PySequence_Concat}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001976Return the concatenation of \var{o1} and \var{o2} on success, and \NULL{} on
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001977failure. This is the equivalent of the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00001978expression \samp{\var{o1} + \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001979\end{cfuncdesc}
1980
1981
1982\begin{cfuncdesc}{PyObject*}{PySequence_Repeat}{PyObject *o, int count}
Fred Drake659ebfa2000-04-03 15:42:13 +00001983Return the result of repeating sequence object
1984\var{o} \var{count} times, or \NULL{} on failure. This is the
1985equivalent of the Python expression \samp{\var{o} * \var{count}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001986\end{cfuncdesc}
1987
Fred Drake7740a012000-09-12 20:27:05 +00001988\begin{cfuncdesc}{PyObject*}{PySequence_InPlaceConcat}{PyObject *o1, PyObject *o2}
1989Return the concatenation of \var{o1} and \var{o2} on success, and \NULL{} on
1990failure. The operation is done \emph{in-place} when \var{o1} supports it.
1991This is the equivalent of the Python expression \samp{\var{o1} += \var{o2}}.
1992\end{cfuncdesc}
1993
1994
1995\begin{cfuncdesc}{PyObject*}{PySequence_InPlaceRepeat}{PyObject *o, int count}
1996Return the result of repeating sequence object \var{o} \var{count} times, or
1997\NULL{} on failure. The operation is done \emph{in-place} when \var{o}
1998supports it. This is the equivalent of the Python expression \samp{\var{o}
1999*= \var{count}}.
2000\end{cfuncdesc}
2001
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002002
2003\begin{cfuncdesc}{PyObject*}{PySequence_GetItem}{PyObject *o, int i}
Fred Drakee058b4f1998-02-16 06:15:35 +00002004Return the \var{i}th element of \var{o}, or \NULL{} on failure. This
2005is the equivalent of the Python expression \samp{\var{o}[\var{i}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002006\end{cfuncdesc}
2007
2008
2009\begin{cfuncdesc}{PyObject*}{PySequence_GetSlice}{PyObject *o, int i1, int i2}
Fred Drakee058b4f1998-02-16 06:15:35 +00002010Return the slice of sequence object \var{o} between \var{i1} and
2011\var{i2}, or \NULL{} on failure. This is the equivalent of the Python
2012expression \samp{\var{o}[\var{i1}:\var{i2}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002013\end{cfuncdesc}
2014
2015
2016\begin{cfuncdesc}{int}{PySequence_SetItem}{PyObject *o, int i, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00002017Assign object \var{v} to the \var{i}th element of \var{o}.
2018Returns \code{-1} on failure. This is the equivalent of the Python
2019statement \samp{\var{o}[\var{i}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002020\end{cfuncdesc}
2021
2022\begin{cfuncdesc}{int}{PySequence_DelItem}{PyObject *o, int i}
Fred Drake5566c1c2001-01-19 22:48:33 +00002023Delete the \var{i}th element of object \var{o}. Returns
Fred Drakee058b4f1998-02-16 06:15:35 +00002024\code{-1} on failure. This is the equivalent of the Python
2025statement \samp{del \var{o}[\var{i}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002026\end{cfuncdesc}
2027
Fred Drake659ebfa2000-04-03 15:42:13 +00002028\begin{cfuncdesc}{int}{PySequence_SetSlice}{PyObject *o, int i1,
2029 int i2, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00002030Assign the sequence object \var{v} to the slice in sequence
2031object \var{o} from \var{i1} to \var{i2}. This is the equivalent of
2032the Python statement \samp{\var{o}[\var{i1}:\var{i2}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002033\end{cfuncdesc}
2034
2035\begin{cfuncdesc}{int}{PySequence_DelSlice}{PyObject *o, int i1, int i2}
Fred Drakee058b4f1998-02-16 06:15:35 +00002036Delete the slice in sequence object \var{o} from \var{i1} to \var{i2}.
2037Returns \code{-1} on failure. This is the equivalent of the Python
2038statement \samp{del \var{o}[\var{i1}:\var{i2}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002039\end{cfuncdesc}
2040
2041\begin{cfuncdesc}{PyObject*}{PySequence_Tuple}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002042Returns the \var{o} as a tuple on success, and \NULL{} on failure.
Fred Drake659ebfa2000-04-03 15:42:13 +00002043This is equivalent to the Python expression \samp{tuple(\var{o})}.
2044\bifuncindex{tuple}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002045\end{cfuncdesc}
2046
2047\begin{cfuncdesc}{int}{PySequence_Count}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00002048Return the number of occurrences of \var{value} in \var{o}, that is,
2049return the number of keys for which \code{\var{o}[\var{key}] ==
2050\var{value}}. On failure, return \code{-1}. This is equivalent to
2051the Python expression \samp{\var{o}.count(\var{value})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002052\end{cfuncdesc}
2053
Fred Drake659ebfa2000-04-03 15:42:13 +00002054\begin{cfuncdesc}{int}{PySequence_Contains}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00002055Determine if \var{o} contains \var{value}. If an item in \var{o} is
2056equal to \var{value}, return \code{1}, otherwise return \code{0}. On
2057error, return \code{-1}. This is equivalent to the Python expression
2058\samp{\var{value} in \var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002059\end{cfuncdesc}
2060
2061\begin{cfuncdesc}{int}{PySequence_Index}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00002062Return the first index \var{i} for which \code{\var{o}[\var{i}] ==
2063\var{value}}. On error, return \code{-1}. This is equivalent to
2064the Python expression \samp{\var{o}.index(\var{value})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002065\end{cfuncdesc}
2066
Fred Drakea8455ab2000-06-16 19:58:42 +00002067\begin{cfuncdesc}{PyObject*}{PySequence_List}{PyObject *o}
2068Return a list object with the same contents as the arbitrary sequence
2069\var{o}. The returned list is guaranteed to be new.
2070\end{cfuncdesc}
2071
2072\begin{cfuncdesc}{PyObject*}{PySequence_Tuple}{PyObject *o}
2073Return a tuple object with the same contents as the arbitrary sequence
2074\var{o}. If \var{o} is a tuple, a new reference will be returned,
2075otherwise a tuple will be constructed with the appropriate contents.
2076\end{cfuncdesc}
2077
Fred Drakef39ed671998-02-26 22:01:23 +00002078
Fred Drake81cccb72000-09-12 15:22:05 +00002079\begin{cfuncdesc}{PyObject*}{PySequence_Fast}{PyObject *o, const char *m}
2080Returns the sequence \var{o} as a tuple, unless it is already a
2081tuple or list, in which case \var{o} is returned. Use
2082\cfunction{PySequence_Fast_GET_ITEM()} to access the members of the
2083result. Returns \NULL{} on failure. If the object is not a sequence,
2084raises \exception{TypeError} with \var{m} as the message text.
2085\end{cfuncdesc}
2086
2087\begin{cfuncdesc}{PyObject*}{PySequence_Fast_GET_ITEM}{PyObject *o, int i}
2088Return the \var{i}th element of \var{o}, assuming that \var{o} was
2089returned by \cfunction{PySequence_Fast()}, and that \var{i} is within
2090bounds. The caller is expected to get the length of the sequence by
Fred Drake96a2a802001-05-29 18:51:41 +00002091calling \cfunction{PySequence_Size()} on \var{o}, since lists and tuples
Fred Drake81cccb72000-09-12 15:22:05 +00002092are guaranteed to always return their true length.
2093\end{cfuncdesc}
2094
2095
Fred Drakeefd146c1999-02-15 15:30:45 +00002096\section{Mapping Protocol \label{mapping}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002097
2098\begin{cfuncdesc}{int}{PyMapping_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00002099Return \code{1} if the object provides mapping protocol, and
2100\code{0} otherwise. This function always succeeds.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002101\end{cfuncdesc}
2102
2103
2104\begin{cfuncdesc}{int}{PyMapping_Length}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00002105Returns the number of keys in object \var{o} on success, and
2106\code{-1} on failure. For objects that do not provide mapping
2107protocol, this is equivalent to the Python expression
2108\samp{len(\var{o})}.\bifuncindex{len}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002109\end{cfuncdesc}
2110
2111
2112\begin{cfuncdesc}{int}{PyMapping_DelItemString}{PyObject *o, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002113Remove the mapping for object \var{key} from the object \var{o}.
2114Return \code{-1} on failure. This is equivalent to
2115the Python statement \samp{del \var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002116\end{cfuncdesc}
2117
2118
2119\begin{cfuncdesc}{int}{PyMapping_DelItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002120Remove the mapping for object \var{key} from the object \var{o}.
2121Return \code{-1} on failure. This is equivalent to
2122the Python statement \samp{del \var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002123\end{cfuncdesc}
2124
2125
2126\begin{cfuncdesc}{int}{PyMapping_HasKeyString}{PyObject *o, char *key}
Fred Drake659ebfa2000-04-03 15:42:13 +00002127On success, return \code{1} if the mapping object has the key
2128\var{key} and \code{0} otherwise. This is equivalent to the Python
2129expression \samp{\var{o}.has_key(\var{key})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002130This function always succeeds.
2131\end{cfuncdesc}
2132
2133
2134\begin{cfuncdesc}{int}{PyMapping_HasKey}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002135Return \code{1} if the mapping object has the key \var{key} and
2136\code{0} otherwise. This is equivalent to the Python expression
2137\samp{\var{o}.has_key(\var{key})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002138This function always succeeds.
2139\end{cfuncdesc}
2140
2141
2142\begin{cfuncdesc}{PyObject*}{PyMapping_Keys}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002143On success, return a list of the keys in object \var{o}. On
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002144failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00002145expression \samp{\var{o}.keys()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002146\end{cfuncdesc}
2147
2148
2149\begin{cfuncdesc}{PyObject*}{PyMapping_Values}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002150On success, return a list of the values in object \var{o}. On
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002151failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00002152expression \samp{\var{o}.values()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002153\end{cfuncdesc}
2154
2155
2156\begin{cfuncdesc}{PyObject*}{PyMapping_Items}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002157On success, return a list of the items in object \var{o}, where
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002158each item is a tuple containing a key-value pair. On
2159failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00002160expression \samp{\var{o}.items()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002161\end{cfuncdesc}
2162
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002163
2164\begin{cfuncdesc}{PyObject*}{PyMapping_GetItemString}{PyObject *o, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00002165Return element of \var{o} corresponding to the object \var{key} or
2166\NULL{} on failure. This is the equivalent of the Python expression
2167\samp{\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002168\end{cfuncdesc}
2169
Fred Drakedbcaeda2001-05-07 17:42:18 +00002170\begin{cfuncdesc}{int}{PyMapping_SetItemString}{PyObject *o, char *key,
2171 PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00002172Map the object \var{key} to the value \var{v} in object \var{o}.
2173Returns \code{-1} on failure. This is the equivalent of the Python
2174statement \samp{\var{o}[\var{key}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002175\end{cfuncdesc}
2176
2177
Fred Drakedbcaeda2001-05-07 17:42:18 +00002178\section{Iterator Protocol \label{iterator}}
2179
Fred Drakea8e08272001-05-07 17:47:07 +00002180\versionadded{2.2}
2181
Fred Drakedbcaeda2001-05-07 17:42:18 +00002182There are only a couple of functions specifically for working with
2183iterators.
2184
2185\begin{cfuncdesc}{int}{PyIter_Check}{PyObject *o}
2186 Return true if the object \var{o} supports the iterator protocol.
2187\end{cfuncdesc}
2188
2189\begin{cfuncdesc}{PyObject*}{PyIter_Next}{PyObject *o}
2190 Return the next value from the iteration \var{o}. If the object is
2191 an iterator, this retrieves the next value from the iteration, and
2192 returns \NULL{} with no exception set if there are no remaining
2193 items. If the object is not an iterator, \exception{TypeError} is
2194 raised, or if there is an error in retrieving the item, returns
2195 \NULL{} and passes along the exception.
2196\end{cfuncdesc}
2197
2198To write a loop which iterates over an iterator, the C code should
2199look something like this:
2200
2201\begin{verbatim}
2202PyObject *iterator = ...;
2203PyObject *item;
2204
2205while (item = PyIter_Next(iter)) {
2206 /* do something with item */
2207}
2208if (PyErr_Occurred()) {
2209 /* propogate error */
2210}
2211else {
2212 /* continue doing useful work */
2213}
2214\end{verbatim}
2215
2216
Fred Drakeefd146c1999-02-15 15:30:45 +00002217\chapter{Concrete Objects Layer \label{concrete}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002218
2219The functions in this chapter are specific to certain Python object
2220types. Passing them an object of the wrong type is not a good idea;
2221if you receive an object from a Python program and you are not sure
2222that it has the right type, you must perform a type check first;
Fred Drake5566c1c2001-01-19 22:48:33 +00002223for example, to check that an object is a dictionary, use
Fred Drakee5bf8b21998-02-12 21:22:28 +00002224\cfunction{PyDict_Check()}. The chapter is structured like the
2225``family tree'' of Python object types.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002226
Fred Drake89024422000-10-23 16:00:54 +00002227\strong{Warning:}
2228While the functions described in this chapter carefully check the type
2229of the objects which are passed in, many of them do not check for
2230\NULL{} being passed instead of a valid object. Allowing \NULL{} to
2231be passed in can cause memory access violations and immediate
2232termination of the interpreter.
2233
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002234
Fred Drakeefd146c1999-02-15 15:30:45 +00002235\section{Fundamental Objects \label{fundamental}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002236
Fred Drakee5bf8b21998-02-12 21:22:28 +00002237This section describes Python type objects and the singleton object
2238\code{None}.
2239
2240
Fred Drakeefd146c1999-02-15 15:30:45 +00002241\subsection{Type Objects \label{typeObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002242
Fred Drake659ebfa2000-04-03 15:42:13 +00002243\obindex{type}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002244\begin{ctypedesc}{PyTypeObject}
Fred Drake659ebfa2000-04-03 15:42:13 +00002245The C structure of the objects used to describe built-in types.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002246\end{ctypedesc}
2247
Fred Drake659ebfa2000-04-03 15:42:13 +00002248\begin{cvardesc}{PyObject*}{PyType_Type}
Fred Drakeefd146c1999-02-15 15:30:45 +00002249This is the type object for type objects; it is the same object as
2250\code{types.TypeType} in the Python layer.
Fred Drake659ebfa2000-04-03 15:42:13 +00002251\withsubitem{(in module types)}{\ttindex{TypeType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002252\end{cvardesc}
2253
Fred Drake659ebfa2000-04-03 15:42:13 +00002254\begin{cfuncdesc}{int}{PyType_Check}{PyObject *o}
2255Returns true is the object \var{o} is a type object.
2256\end{cfuncdesc}
2257
2258\begin{cfuncdesc}{int}{PyType_HasFeature}{PyObject *o, int feature}
2259Returns true if the type object \var{o} sets the feature
Fred Drakef0e08ef2001-02-03 01:11:26 +00002260\var{feature}. Type features are denoted by single bit flags.
Fred Drake659ebfa2000-04-03 15:42:13 +00002261\end{cfuncdesc}
2262
Fred Drakee5bf8b21998-02-12 21:22:28 +00002263
Fred Drakeefd146c1999-02-15 15:30:45 +00002264\subsection{The None Object \label{noneObject}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002265
Fred Drake659ebfa2000-04-03 15:42:13 +00002266\obindex{None@\texttt{None}}
2267Note that the \ctype{PyTypeObject} for \code{None} is not directly
2268exposed in the Python/C API. Since \code{None} is a singleton,
2269testing for object identity (using \samp{==} in C) is sufficient.
2270There is no \cfunction{PyNone_Check()} function for the same reason.
2271
2272\begin{cvardesc}{PyObject*}{Py_None}
Guido van Rossum44475131998-04-21 15:30:01 +00002273The Python \code{None} object, denoting lack of value. This object has
2274no methods.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002275\end{cvardesc}
2276
2277
Fred Drakefa774872001-07-11 20:35:37 +00002278\section{Numeric Objects \label{numericObjects}}
2279
2280\obindex{numeric}
2281
2282
2283\subsection{Plain Integer Objects \label{intObjects}}
2284
2285\obindex{integer}
2286\begin{ctypedesc}{PyIntObject}
2287This subtype of \ctype{PyObject} represents a Python integer object.
2288\end{ctypedesc}
2289
2290\begin{cvardesc}{PyTypeObject}{PyInt_Type}
2291This instance of \ctype{PyTypeObject} represents the Python plain
2292integer type. This is the same object as \code{types.IntType}.
2293\withsubitem{(in modules types)}{\ttindex{IntType}}
2294\end{cvardesc}
2295
2296\begin{cfuncdesc}{int}{PyInt_Check}{PyObject* o}
2297Returns true if \var{o} is of type \cdata{PyInt_Type}.
2298\end{cfuncdesc}
2299
2300\begin{cfuncdesc}{PyObject*}{PyInt_FromLong}{long ival}
2301Creates a new integer object with a value of \var{ival}.
2302
2303The current implementation keeps an array of integer objects for all
2304integers between \code{-1} and \code{100}, when you create an int in
2305that range you actually just get back a reference to the existing
2306object. So it should be possible to change the value of \code{1}. I
2307suspect the behaviour of Python in this case is undefined. :-)
2308\end{cfuncdesc}
2309
2310\begin{cfuncdesc}{long}{PyInt_AsLong}{PyObject *io}
2311Will first attempt to cast the object to a \ctype{PyIntObject}, if
2312it is not already one, and then return its value.
2313\end{cfuncdesc}
2314
2315\begin{cfuncdesc}{long}{PyInt_AS_LONG}{PyObject *io}
2316Returns the value of the object \var{io}. No error checking is
2317performed.
2318\end{cfuncdesc}
2319
2320\begin{cfuncdesc}{long}{PyInt_GetMax}{}
2321Returns the system's idea of the largest integer it can handle
2322(\constant{LONG_MAX}\ttindex{LONG_MAX}, as defined in the system
2323header files).
2324\end{cfuncdesc}
2325
2326
2327\subsection{Long Integer Objects \label{longObjects}}
2328
2329\obindex{long integer}
2330\begin{ctypedesc}{PyLongObject}
2331This subtype of \ctype{PyObject} represents a Python long integer
2332object.
2333\end{ctypedesc}
2334
2335\begin{cvardesc}{PyTypeObject}{PyLong_Type}
2336This instance of \ctype{PyTypeObject} represents the Python long
2337integer type. This is the same object as \code{types.LongType}.
2338\withsubitem{(in modules types)}{\ttindex{LongType}}
2339\end{cvardesc}
2340
2341\begin{cfuncdesc}{int}{PyLong_Check}{PyObject *p}
2342Returns true if its argument is a \ctype{PyLongObject}.
2343\end{cfuncdesc}
2344
2345\begin{cfuncdesc}{PyObject*}{PyLong_FromLong}{long v}
2346Returns a new \ctype{PyLongObject} object from \var{v}, or \NULL{} on
2347failure.
2348\end{cfuncdesc}
2349
2350\begin{cfuncdesc}{PyObject*}{PyLong_FromUnsignedLong}{unsigned long v}
2351Returns a new \ctype{PyLongObject} object from a C \ctype{unsigned
2352long}, or \NULL{} on failure.
2353\end{cfuncdesc}
2354
2355\begin{cfuncdesc}{PyObject*}{PyLong_FromDouble}{double v}
2356Returns a new \ctype{PyLongObject} object from the integer part of
2357\var{v}, or \NULL{} on failure.
2358\end{cfuncdesc}
2359
2360\begin{cfuncdesc}{long}{PyLong_AsLong}{PyObject *pylong}
2361Returns a C \ctype{long} representation of the contents of
2362\var{pylong}. If \var{pylong} is greater than
2363\constant{LONG_MAX}\ttindex{LONG_MAX}, an \exception{OverflowError} is
2364raised.\withsubitem{(built-in exception)}{\ttindex{OverflowError}}
2365\end{cfuncdesc}
2366
2367\begin{cfuncdesc}{unsigned long}{PyLong_AsUnsignedLong}{PyObject *pylong}
2368Returns a C \ctype{unsigned long} representation of the contents of
2369\var{pylong}. If \var{pylong} is greater than
2370\constant{ULONG_MAX}\ttindex{ULONG_MAX}, an \exception{OverflowError}
2371is raised.\withsubitem{(built-in exception)}{\ttindex{OverflowError}}
2372\end{cfuncdesc}
2373
2374\begin{cfuncdesc}{double}{PyLong_AsDouble}{PyObject *pylong}
2375Returns a C \ctype{double} representation of the contents of \var{pylong}.
2376\end{cfuncdesc}
2377
2378\begin{cfuncdesc}{PyObject*}{PyLong_FromString}{char *str, char **pend,
2379 int base}
2380Return a new \ctype{PyLongObject} based on the string value in
2381\var{str}, which is interpreted according to the radix in \var{base}.
2382If \var{pend} is non-\NULL, \code{*\var{pend}} will point to the first
2383character in \var{str} which follows the representation of the
2384number. If \var{base} is \code{0}, the radix will be determined base
2385on the leading characters of \var{str}: if \var{str} starts with
2386\code{'0x'} or \code{'0X'}, radix 16 will be used; if \var{str} starts
2387with \code{'0'}, radix 8 will be used; otherwise radix 10 will be
2388used. If \var{base} is not \code{0}, it must be between \code{2} and
2389\code{36}, inclusive. Leading spaces are ignored. If there are no
2390digits, \exception{ValueError} will be raised.
2391\end{cfuncdesc}
2392
2393
2394\subsection{Floating Point Objects \label{floatObjects}}
2395
2396\obindex{floating point}
2397\begin{ctypedesc}{PyFloatObject}
2398This subtype of \ctype{PyObject} represents a Python floating point
2399object.
2400\end{ctypedesc}
2401
2402\begin{cvardesc}{PyTypeObject}{PyFloat_Type}
2403This instance of \ctype{PyTypeObject} represents the Python floating
2404point type. This is the same object as \code{types.FloatType}.
2405\withsubitem{(in modules types)}{\ttindex{FloatType}}
2406\end{cvardesc}
2407
2408\begin{cfuncdesc}{int}{PyFloat_Check}{PyObject *p}
2409Returns true if its argument is a \ctype{PyFloatObject}.
2410\end{cfuncdesc}
2411
2412\begin{cfuncdesc}{PyObject*}{PyFloat_FromDouble}{double v}
2413Creates a \ctype{PyFloatObject} object from \var{v}, or \NULL{} on
2414failure.
2415\end{cfuncdesc}
2416
2417\begin{cfuncdesc}{double}{PyFloat_AsDouble}{PyObject *pyfloat}
2418Returns a C \ctype{double} representation of the contents of \var{pyfloat}.
2419\end{cfuncdesc}
2420
2421\begin{cfuncdesc}{double}{PyFloat_AS_DOUBLE}{PyObject *pyfloat}
2422Returns a C \ctype{double} representation of the contents of
2423\var{pyfloat}, but without error checking.
2424\end{cfuncdesc}
2425
2426
2427\subsection{Complex Number Objects \label{complexObjects}}
2428
2429\obindex{complex number}
2430Python's complex number objects are implemented as two distinct types
2431when viewed from the C API: one is the Python object exposed to
2432Python programs, and the other is a C structure which represents the
2433actual complex number value. The API provides functions for working
2434with both.
2435
2436\subsubsection{Complex Numbers as C Structures}
2437
2438Note that the functions which accept these structures as parameters
2439and return them as results do so \emph{by value} rather than
2440dereferencing them through pointers. This is consistent throughout
2441the API.
2442
2443\begin{ctypedesc}{Py_complex}
2444The C structure which corresponds to the value portion of a Python
2445complex number object. Most of the functions for dealing with complex
2446number objects use structures of this type as input or output values,
2447as appropriate. It is defined as:
2448
2449\begin{verbatim}
2450typedef struct {
2451 double real;
2452 double imag;
2453} Py_complex;
2454\end{verbatim}
2455\end{ctypedesc}
2456
2457\begin{cfuncdesc}{Py_complex}{_Py_c_sum}{Py_complex left, Py_complex right}
2458Return the sum of two complex numbers, using the C
2459\ctype{Py_complex} representation.
2460\end{cfuncdesc}
2461
2462\begin{cfuncdesc}{Py_complex}{_Py_c_diff}{Py_complex left, Py_complex right}
2463Return the difference between two complex numbers, using the C
2464\ctype{Py_complex} representation.
2465\end{cfuncdesc}
2466
2467\begin{cfuncdesc}{Py_complex}{_Py_c_neg}{Py_complex complex}
2468Return the negation of the complex number \var{complex}, using the C
2469\ctype{Py_complex} representation.
2470\end{cfuncdesc}
2471
2472\begin{cfuncdesc}{Py_complex}{_Py_c_prod}{Py_complex left, Py_complex right}
2473Return the product of two complex numbers, using the C
2474\ctype{Py_complex} representation.
2475\end{cfuncdesc}
2476
2477\begin{cfuncdesc}{Py_complex}{_Py_c_quot}{Py_complex dividend,
2478 Py_complex divisor}
2479Return the quotient of two complex numbers, using the C
2480\ctype{Py_complex} representation.
2481\end{cfuncdesc}
2482
2483\begin{cfuncdesc}{Py_complex}{_Py_c_pow}{Py_complex num, Py_complex exp}
2484Return the exponentiation of \var{num} by \var{exp}, using the C
2485\ctype{Py_complex} representation.
2486\end{cfuncdesc}
2487
2488
2489\subsubsection{Complex Numbers as Python Objects}
2490
2491\begin{ctypedesc}{PyComplexObject}
2492This subtype of \ctype{PyObject} represents a Python complex number object.
2493\end{ctypedesc}
2494
2495\begin{cvardesc}{PyTypeObject}{PyComplex_Type}
2496This instance of \ctype{PyTypeObject} represents the Python complex
2497number type.
2498\end{cvardesc}
2499
2500\begin{cfuncdesc}{int}{PyComplex_Check}{PyObject *p}
2501Returns true if its argument is a \ctype{PyComplexObject}.
2502\end{cfuncdesc}
2503
2504\begin{cfuncdesc}{PyObject*}{PyComplex_FromCComplex}{Py_complex v}
2505Create a new Python complex number object from a C
2506\ctype{Py_complex} value.
2507\end{cfuncdesc}
2508
2509\begin{cfuncdesc}{PyObject*}{PyComplex_FromDoubles}{double real, double imag}
2510Returns a new \ctype{PyComplexObject} object from \var{real} and \var{imag}.
2511\end{cfuncdesc}
2512
2513\begin{cfuncdesc}{double}{PyComplex_RealAsDouble}{PyObject *op}
2514Returns the real part of \var{op} as a C \ctype{double}.
2515\end{cfuncdesc}
2516
2517\begin{cfuncdesc}{double}{PyComplex_ImagAsDouble}{PyObject *op}
2518Returns the imaginary part of \var{op} as a C \ctype{double}.
2519\end{cfuncdesc}
2520
2521\begin{cfuncdesc}{Py_complex}{PyComplex_AsCComplex}{PyObject *op}
2522Returns the \ctype{Py_complex} value of the complex number \var{op}.
2523\end{cfuncdesc}
2524
2525
2526
Fred Drakeefd146c1999-02-15 15:30:45 +00002527\section{Sequence Objects \label{sequenceObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002528
Fred Drake659ebfa2000-04-03 15:42:13 +00002529\obindex{sequence}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002530Generic operations on sequence objects were discussed in the previous
2531chapter; this section deals with the specific kinds of sequence
2532objects that are intrinsic to the Python language.
2533
2534
Fred Drakeefd146c1999-02-15 15:30:45 +00002535\subsection{String Objects \label{stringObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002536
Fred Drake89024422000-10-23 16:00:54 +00002537These functions raise \exception{TypeError} when expecting a string
2538parameter and are called with a non-string parameter.
2539
Fred Drake659ebfa2000-04-03 15:42:13 +00002540\obindex{string}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002541\begin{ctypedesc}{PyStringObject}
Fred Drakef8830d11998-04-23 14:06:01 +00002542This subtype of \ctype{PyObject} represents a Python string object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002543\end{ctypedesc}
2544
2545\begin{cvardesc}{PyTypeObject}{PyString_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00002546This instance of \ctype{PyTypeObject} represents the Python string
2547type; it is the same object as \code{types.TypeType} in the Python
2548layer.\withsubitem{(in module types)}{\ttindex{StringType}}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002549\end{cvardesc}
2550
2551\begin{cfuncdesc}{int}{PyString_Check}{PyObject *o}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002552Returns true if the object \var{o} is a string object.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002553\end{cfuncdesc}
2554
Fred Drakec6fa34e1998-04-02 06:47:24 +00002555\begin{cfuncdesc}{PyObject*}{PyString_FromString}{const char *v}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002556Returns a new string object with the value \var{v} on success, and
2557\NULL{} on failure.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002558\end{cfuncdesc}
2559
Fred Drake659ebfa2000-04-03 15:42:13 +00002560\begin{cfuncdesc}{PyObject*}{PyString_FromStringAndSize}{const char *v,
2561 int len}
2562Returns a new string object with the value \var{v} and length
2563\var{len} on success, and \NULL{} on failure. If \var{v} is \NULL{},
2564the contents of the string are uninitialized.
2565\end{cfuncdesc}
2566
Fred Drakec6fa34e1998-04-02 06:47:24 +00002567\begin{cfuncdesc}{int}{PyString_Size}{PyObject *string}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002568Returns the length of the string in string object \var{string}.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002569\end{cfuncdesc}
2570
Fred Drake659ebfa2000-04-03 15:42:13 +00002571\begin{cfuncdesc}{int}{PyString_GET_SIZE}{PyObject *string}
Fred Drake5d644212000-10-07 12:31:50 +00002572Macro form of \cfunction{PyString_Size()} but without error
Fred Drake659ebfa2000-04-03 15:42:13 +00002573checking.
2574\end{cfuncdesc}
2575
Fred Drakec6fa34e1998-04-02 06:47:24 +00002576\begin{cfuncdesc}{char*}{PyString_AsString}{PyObject *string}
Fred Drake659ebfa2000-04-03 15:42:13 +00002577Returns a null-terminated representation of the contents of
2578\var{string}. The pointer refers to the internal buffer of
Fred Drake89024422000-10-23 16:00:54 +00002579\var{string}, not a copy. The data must not be modified in any way,
2580unless the string was just created using
2581\code{PyString_FromStringAndSize(NULL, \var{size})}.
2582It must not be deallocated.
Fred Drake659ebfa2000-04-03 15:42:13 +00002583\end{cfuncdesc}
2584
2585\begin{cfuncdesc}{char*}{PyString_AS_STRING}{PyObject *string}
2586Macro form of \cfunction{PyString_AsString()} but without error
2587checking.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002588\end{cfuncdesc}
2589
Marc-André Lemburgd1ba4432000-09-19 21:04:18 +00002590\begin{cfuncdesc}{int}{PyString_AsStringAndSize}{PyObject *obj,
2591 char **buffer,
2592 int *length}
2593Returns a null-terminated representation of the contents of the object
2594\var{obj} through the output variables \var{buffer} and \var{length}.
2595
2596The function accepts both string and Unicode objects as input. For
2597Unicode objects it returns the default encoded version of the object.
2598If \var{length} is set to \NULL{}, the resulting buffer may not contain
2599null characters; if it does, the function returns -1 and a
2600TypeError is raised.
2601
2602The buffer refers to an internal string buffer of \var{obj}, not a
Fred Drake89024422000-10-23 16:00:54 +00002603copy. The data must not be modified in any way, unless the string was
2604just created using \code{PyString_FromStringAndSize(NULL,
2605\var{size})}. It must not be deallocated.
Marc-André Lemburgd1ba4432000-09-19 21:04:18 +00002606\end{cfuncdesc}
2607
Fred Drakec6fa34e1998-04-02 06:47:24 +00002608\begin{cfuncdesc}{void}{PyString_Concat}{PyObject **string,
2609 PyObject *newpart}
Fred Drake66b989c1999-02-15 20:15:39 +00002610Creates a new string object in \var{*string} containing the
Fred Drakeddc6c272000-03-31 18:22:38 +00002611contents of \var{newpart} appended to \var{string}; the caller will
2612own the new reference. The reference to the old value of \var{string}
2613will be stolen. If the new string
Fred Drake66b989c1999-02-15 20:15:39 +00002614cannot be created, the old reference to \var{string} will still be
2615discarded and the value of \var{*string} will be set to
2616\NULL{}; the appropriate exception will be set.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002617\end{cfuncdesc}
2618
2619\begin{cfuncdesc}{void}{PyString_ConcatAndDel}{PyObject **string,
2620 PyObject *newpart}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002621Creates a new string object in \var{*string} containing the contents
Guido van Rossum44475131998-04-21 15:30:01 +00002622of \var{newpart} appended to \var{string}. This version decrements
2623the reference count of \var{newpart}.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002624\end{cfuncdesc}
2625
2626\begin{cfuncdesc}{int}{_PyString_Resize}{PyObject **string, int newsize}
Guido van Rossum44475131998-04-21 15:30:01 +00002627A way to resize a string object even though it is ``immutable''.
2628Only use this to build up a brand new string object; don't use this if
2629the string may already be known in other parts of the code.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002630\end{cfuncdesc}
2631
2632\begin{cfuncdesc}{PyObject*}{PyString_Format}{PyObject *format,
2633 PyObject *args}
Guido van Rossum44475131998-04-21 15:30:01 +00002634Returns a new string object from \var{format} and \var{args}. Analogous
Fred Drake659ebfa2000-04-03 15:42:13 +00002635to \code{\var{format} \%\ \var{args}}. The \var{args} argument must be
Guido van Rossum44475131998-04-21 15:30:01 +00002636a tuple.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002637\end{cfuncdesc}
2638
2639\begin{cfuncdesc}{void}{PyString_InternInPlace}{PyObject **string}
Guido van Rossum44475131998-04-21 15:30:01 +00002640Intern the argument \var{*string} in place. The argument must be the
2641address of a pointer variable pointing to a Python string object.
2642If there is an existing interned string that is the same as
2643\var{*string}, it sets \var{*string} to it (decrementing the reference
2644count of the old string object and incrementing the reference count of
2645the interned string object), otherwise it leaves \var{*string} alone
2646and interns it (incrementing its reference count). (Clarification:
2647even though there is a lot of talk about reference counts, think of
Fred Drakef8830d11998-04-23 14:06:01 +00002648this function as reference-count-neutral; you own the object after
2649the call if and only if you owned it before the call.)
Fred Drakec6fa34e1998-04-02 06:47:24 +00002650\end{cfuncdesc}
2651
2652\begin{cfuncdesc}{PyObject*}{PyString_InternFromString}{const char *v}
Fred Drakef8830d11998-04-23 14:06:01 +00002653A combination of \cfunction{PyString_FromString()} and
2654\cfunction{PyString_InternInPlace()}, returning either a new string object
Guido van Rossum44475131998-04-21 15:30:01 +00002655that has been interned, or a new (``owned'') reference to an earlier
2656interned string object with the same value.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002657\end{cfuncdesc}
2658
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002659\begin{cfuncdesc}{PyObject*}{PyString_Decode}{const char *s,
2660 int size,
2661 const char *encoding,
2662 const char *errors}
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002663Creates an object by decoding \var{size} bytes of the encoded
2664buffer \var{s} using the codec registered
2665for \var{encoding}. \var{encoding} and \var{errors} have the same meaning
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002666as the parameters of the same name in the unicode() builtin
2667function. The codec to be used is looked up using the Python codec
2668registry. Returns \NULL{} in case an exception was raised by the
2669codec.
2670\end{cfuncdesc}
2671
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002672\begin{cfuncdesc}{PyObject*}{PyString_AsDecodedObject}{PyObject *str,
2673 const char *encoding,
2674 const char *errors}
2675Decodes a string object by passing it to the codec registered
2676for \var{encoding} and returns the result as Python
2677object. \var{encoding} and \var{errors} have the same meaning as the
2678parameters of the same name in the string .encode() method. The codec
2679to be used is looked up using the Python codec registry. Returns
2680\NULL{} in case an exception was raised by the codec.
2681\end{cfuncdesc}
2682
2683\begin{cfuncdesc}{PyObject*}{PyString_Encode}{const char *s,
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002684 int size,
2685 const char *encoding,
2686 const char *errors}
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002687Encodes the \ctype{char} buffer of the given size by passing it to
2688the codec registered for \var{encoding} and returns a Python object.
2689\var{encoding} and \var{errors} have the same
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002690meaning as the parameters of the same name in the string .encode()
2691method. The codec to be used is looked up using the Python codec
2692registry. Returns \NULL{} in case an exception was raised by the
2693codec.
2694\end{cfuncdesc}
2695
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002696\begin{cfuncdesc}{PyObject*}{PyString_AsEncodedObject}{PyObject *str,
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002697 const char *encoding,
2698 const char *errors}
Marc-André Lemburg2d920412001-05-15 12:00:02 +00002699Encodes a string object using the codec registered
2700for \var{encoding} and returns the result as Python
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002701object. \var{encoding} and \var{errors} have the same meaning as the
2702parameters of the same name in the string .encode() method. The codec
2703to be used is looked up using the Python codec registry. Returns
2704\NULL{} in case an exception was raised by the codec.
2705\end{cfuncdesc}
2706
Fred Drakee5bf8b21998-02-12 21:22:28 +00002707
Fred Drakea4cd2612000-04-06 14:10:29 +00002708\subsection{Unicode Objects \label{unicodeObjects}}
2709\sectionauthor{Marc-Andre Lemburg}{mal@lemburg.com}
2710
2711%--- Unicode Type -------------------------------------------------------
2712
2713These are the basic Unicode object types used for the Unicode
2714implementation in Python:
2715
2716\begin{ctypedesc}{Py_UNICODE}
2717This type represents a 16-bit unsigned storage type which is used by
2718Python internally as basis for holding Unicode ordinals. On platforms
2719where \ctype{wchar_t} is available and also has 16-bits,
2720\ctype{Py_UNICODE} is a typedef alias for \ctype{wchar_t} to enhance
2721native platform compatibility. On all other platforms,
2722\ctype{Py_UNICODE} is a typedef alias for \ctype{unsigned short}.
2723\end{ctypedesc}
2724
2725\begin{ctypedesc}{PyUnicodeObject}
2726This subtype of \ctype{PyObject} represents a Python Unicode object.
2727\end{ctypedesc}
2728
2729\begin{cvardesc}{PyTypeObject}{PyUnicode_Type}
2730This instance of \ctype{PyTypeObject} represents the Python Unicode type.
2731\end{cvardesc}
2732
2733%--- These are really C macros... is there a macrodesc TeX macro ?
2734
2735The following APIs are really C macros and can be used to do fast
2736checks and to access internal read-only data of Unicode objects:
2737
2738\begin{cfuncdesc}{int}{PyUnicode_Check}{PyObject *o}
2739Returns true if the object \var{o} is a Unicode object.
2740\end{cfuncdesc}
2741
2742\begin{cfuncdesc}{int}{PyUnicode_GET_SIZE}{PyObject *o}
2743Returns the size of the object. o has to be a
2744PyUnicodeObject (not checked).
2745\end{cfuncdesc}
2746
2747\begin{cfuncdesc}{int}{PyUnicode_GET_DATA_SIZE}{PyObject *o}
2748Returns the size of the object's internal buffer in bytes. o has to be
2749a PyUnicodeObject (not checked).
2750\end{cfuncdesc}
2751
Fred Drake992fe5a2000-06-16 21:04:15 +00002752\begin{cfuncdesc}{Py_UNICODE*}{PyUnicode_AS_UNICODE}{PyObject *o}
Fred Drakea4cd2612000-04-06 14:10:29 +00002753Returns a pointer to the internal Py_UNICODE buffer of the object. o
2754has to be a PyUnicodeObject (not checked).
2755\end{cfuncdesc}
2756
Fred Drake992fe5a2000-06-16 21:04:15 +00002757\begin{cfuncdesc}{const char*}{PyUnicode_AS_DATA}{PyObject *o}
Fred Drakea4cd2612000-04-06 14:10:29 +00002758Returns a (const char *) pointer to the internal buffer of the object.
2759o has to be a PyUnicodeObject (not checked).
2760\end{cfuncdesc}
2761
2762% --- Unicode character properties ---------------------------------------
2763
2764Unicode provides many different character properties. The most often
2765needed ones are available through these macros which are mapped to C
2766functions depending on the Python configuration.
2767
2768\begin{cfuncdesc}{int}{Py_UNICODE_ISSPACE}{Py_UNICODE ch}
2769Returns 1/0 depending on whether \var{ch} is a whitespace character.
2770\end{cfuncdesc}
2771
2772\begin{cfuncdesc}{int}{Py_UNICODE_ISLOWER}{Py_UNICODE ch}
2773Returns 1/0 depending on whether \var{ch} is a lowercase character.
2774\end{cfuncdesc}
2775
2776\begin{cfuncdesc}{int}{Py_UNICODE_ISUPPER}{Py_UNICODE ch}
Fred Drakeae96aab2000-07-03 13:38:10 +00002777Returns 1/0 depending on whether \var{ch} is an uppercase character.
Fred Drakea4cd2612000-04-06 14:10:29 +00002778\end{cfuncdesc}
2779
2780\begin{cfuncdesc}{int}{Py_UNICODE_ISTITLE}{Py_UNICODE ch}
2781Returns 1/0 depending on whether \var{ch} is a titlecase character.
2782\end{cfuncdesc}
2783
2784\begin{cfuncdesc}{int}{Py_UNICODE_ISLINEBREAK}{Py_UNICODE ch}
2785Returns 1/0 depending on whether \var{ch} is a linebreak character.
2786\end{cfuncdesc}
2787
2788\begin{cfuncdesc}{int}{Py_UNICODE_ISDECIMAL}{Py_UNICODE ch}
2789Returns 1/0 depending on whether \var{ch} is a decimal character.
2790\end{cfuncdesc}
2791
2792\begin{cfuncdesc}{int}{Py_UNICODE_ISDIGIT}{Py_UNICODE ch}
2793Returns 1/0 depending on whether \var{ch} is a digit character.
2794\end{cfuncdesc}
2795
2796\begin{cfuncdesc}{int}{Py_UNICODE_ISNUMERIC}{Py_UNICODE ch}
2797Returns 1/0 depending on whether \var{ch} is a numeric character.
2798\end{cfuncdesc}
2799
Fred Drakeae96aab2000-07-03 13:38:10 +00002800\begin{cfuncdesc}{int}{Py_UNICODE_ISALPHA}{Py_UNICODE ch}
2801Returns 1/0 depending on whether \var{ch} is an alphabetic character.
2802\end{cfuncdesc}
2803
2804\begin{cfuncdesc}{int}{Py_UNICODE_ISALNUM}{Py_UNICODE ch}
2805Returns 1/0 depending on whether \var{ch} is an alphanumeric character.
2806\end{cfuncdesc}
2807
Fred Drakea4cd2612000-04-06 14:10:29 +00002808These APIs can be used for fast direct character conversions:
2809
2810\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOLOWER}{Py_UNICODE ch}
2811Returns the character \var{ch} converted to lower case.
2812\end{cfuncdesc}
2813
2814\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOUPPER}{Py_UNICODE ch}
2815Returns the character \var{ch} converted to upper case.
2816\end{cfuncdesc}
2817
2818\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOTITLE}{Py_UNICODE ch}
2819Returns the character \var{ch} converted to title case.
2820\end{cfuncdesc}
2821
2822\begin{cfuncdesc}{int}{Py_UNICODE_TODECIMAL}{Py_UNICODE ch}
2823Returns the character \var{ch} converted to a decimal positive integer.
2824Returns -1 in case this is not possible. Does not raise exceptions.
2825\end{cfuncdesc}
2826
2827\begin{cfuncdesc}{int}{Py_UNICODE_TODIGIT}{Py_UNICODE ch}
2828Returns the character \var{ch} converted to a single digit integer.
2829Returns -1 in case this is not possible. Does not raise exceptions.
2830\end{cfuncdesc}
2831
2832\begin{cfuncdesc}{double}{Py_UNICODE_TONUMERIC}{Py_UNICODE ch}
2833Returns the character \var{ch} converted to a (positive) double.
2834Returns -1.0 in case this is not possible. Does not raise exceptions.
2835\end{cfuncdesc}
2836
2837% --- Plain Py_UNICODE ---------------------------------------------------
2838
2839To create Unicode objects and access their basic sequence properties,
2840use these APIs:
2841
2842\begin{cfuncdesc}{PyObject*}{PyUnicode_FromUnicode}{const Py_UNICODE *u,
2843 int size}
2844
2845Create a Unicode Object from the Py_UNICODE buffer \var{u} of the
2846given size. \var{u} may be \NULL{} which causes the contents to be
2847undefined. It is the user's responsibility to fill in the needed data.
Marc-André Lemburg8155e0e2001-04-23 14:44:21 +00002848The buffer is copied into the new object. If the buffer is not \NULL{},
2849the return value might be a shared object. Therefore, modification of
2850the resulting Unicode Object is only allowed when \var{u} is \NULL{}.
Fred Drakea4cd2612000-04-06 14:10:29 +00002851\end{cfuncdesc}
2852
Fred Drake1d158692000-06-18 05:21:21 +00002853\begin{cfuncdesc}{Py_UNICODE*}{PyUnicode_AsUnicode}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002854Return a read-only pointer to the Unicode object's internal
2855\ctype{Py_UNICODE} buffer.
2856\end{cfuncdesc}
2857
2858\begin{cfuncdesc}{int}{PyUnicode_GetSize}{PyObject *unicode}
2859Return the length of the Unicode object.
2860\end{cfuncdesc}
2861
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002862\begin{cfuncdesc}{PyObject*}{PyUnicode_FromEncodedObject}{PyObject *obj,
2863 const char *encoding,
2864 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002865
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002866Coerce an encoded object obj to an Unicode object and return a
2867reference with incremented refcount.
Fred Drakea4cd2612000-04-06 14:10:29 +00002868
2869Coercion is done in the following way:
2870\begin{enumerate}
2871\item Unicode objects are passed back as-is with incremented
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002872 refcount. Note: these cannot be decoded; passing a non-NULL
2873 value for encoding will result in a TypeError.
Fred Drakea4cd2612000-04-06 14:10:29 +00002874
2875\item String and other char buffer compatible objects are decoded
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002876 according to the given encoding and using the error handling
2877 defined by errors. Both can be NULL to have the interface use
2878 the default values (see the next section for details).
Fred Drakea4cd2612000-04-06 14:10:29 +00002879
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002880\item All other objects cause an exception.
Fred Drakea4cd2612000-04-06 14:10:29 +00002881\end{enumerate}
2882The API returns NULL in case of an error. The caller is responsible
2883for decref'ing the returned objects.
2884\end{cfuncdesc}
2885
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002886\begin{cfuncdesc}{PyObject*}{PyUnicode_FromObject}{PyObject *obj}
2887
2888Shortcut for PyUnicode_FromEncodedObject(obj, NULL, ``strict'')
2889which is used throughout the interpreter whenever coercion to
2890Unicode is needed.
2891\end{cfuncdesc}
2892
Fred Drakea4cd2612000-04-06 14:10:29 +00002893% --- wchar_t support for platforms which support it ---------------------
2894
2895If the platform supports \ctype{wchar_t} and provides a header file
2896wchar.h, Python can interface directly to this type using the
2897following functions. Support is optimized if Python's own
2898\ctype{Py_UNICODE} type is identical to the system's \ctype{wchar_t}.
2899
2900\begin{cfuncdesc}{PyObject*}{PyUnicode_FromWideChar}{const wchar_t *w,
2901 int size}
2902Create a Unicode Object from the \ctype{whcar_t} buffer \var{w} of the
2903given size. Returns \NULL{} on failure.
2904\end{cfuncdesc}
2905
2906\begin{cfuncdesc}{int}{PyUnicode_AsWideChar}{PyUnicodeObject *unicode,
2907 wchar_t *w,
2908 int size}
Fred Drakea4cd2612000-04-06 14:10:29 +00002909Copies the Unicode Object contents into the \ctype{whcar_t} buffer
2910\var{w}. At most \var{size} \ctype{whcar_t} characters are copied.
2911Returns the number of \ctype{whcar_t} characters copied or -1 in case
2912of an error.
2913\end{cfuncdesc}
2914
2915
2916\subsubsection{Builtin Codecs \label{builtinCodecs}}
2917
2918Python provides a set of builtin codecs which are written in C
2919for speed. All of these codecs are directly usable via the
2920following functions.
2921
2922Many of the following APIs take two arguments encoding and
2923errors. These parameters encoding and errors have the same semantics
2924as the ones of the builtin unicode() Unicode object constructor.
2925
2926Setting encoding to NULL causes the default encoding to be used which
2927is UTF-8.
2928
2929Error handling is set by errors which may also be set to NULL meaning
2930to use the default handling defined for the codec. Default error
2931handling for all builtin codecs is ``strict'' (ValueErrors are raised).
2932
2933The codecs all use a similar interface. Only deviation from the
2934following generic ones are documented for simplicity.
2935
2936% --- Generic Codecs -----------------------------------------------------
2937
2938These are the generic codec APIs:
2939
2940\begin{cfuncdesc}{PyObject*}{PyUnicode_Decode}{const char *s,
2941 int size,
2942 const char *encoding,
2943 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002944Create a Unicode object by decoding \var{size} bytes of the encoded
2945string \var{s}. \var{encoding} and \var{errors} have the same meaning
2946as the parameters of the same name in the unicode() builtin
2947function. The codec to be used is looked up using the Python codec
2948registry. Returns \NULL{} in case an exception was raised by the
2949codec.
2950\end{cfuncdesc}
2951
2952\begin{cfuncdesc}{PyObject*}{PyUnicode_Encode}{const Py_UNICODE *s,
2953 int size,
2954 const char *encoding,
2955 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002956Encodes the \ctype{Py_UNICODE} buffer of the given size and returns a
2957Python string object. \var{encoding} and \var{errors} have the same
2958meaning as the parameters of the same name in the Unicode .encode()
2959method. The codec to be used is looked up using the Python codec
2960registry. Returns \NULL{} in case an exception was raised by the
2961codec.
2962\end{cfuncdesc}
2963
2964\begin{cfuncdesc}{PyObject*}{PyUnicode_AsEncodedString}{PyObject *unicode,
2965 const char *encoding,
2966 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002967Encodes a Unicode object and returns the result as Python string
2968object. \var{encoding} and \var{errors} have the same meaning as the
2969parameters of the same name in the Unicode .encode() method. The codec
2970to be used is looked up using the Python codec registry. Returns
2971\NULL{} in case an exception was raised by the codec.
2972\end{cfuncdesc}
2973
2974% --- UTF-8 Codecs -------------------------------------------------------
2975
2976These are the UTF-8 codec APIs:
2977
2978\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUTF8}{const char *s,
2979 int size,
2980 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002981Creates a Unicode object by decoding \var{size} bytes of the UTF-8
2982encoded string \var{s}. Returns \NULL{} in case an exception was
2983raised by the codec.
2984\end{cfuncdesc}
2985
2986\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUTF8}{const Py_UNICODE *s,
2987 int size,
2988 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002989Encodes the \ctype{Py_UNICODE} buffer of the given size using UTF-8
2990and returns a Python string object. Returns \NULL{} in case an
2991exception was raised by the codec.
2992\end{cfuncdesc}
2993
2994\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUTF8String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002995Encodes a Unicode objects using UTF-8 and returns the result as Python
2996string object. Error handling is ``strict''. Returns
2997\NULL{} in case an exception was raised by the codec.
2998\end{cfuncdesc}
2999
3000% --- UTF-16 Codecs ------------------------------------------------------ */
3001
3002These are the UTF-16 codec APIs:
3003
3004\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUTF16}{const char *s,
3005 int size,
3006 const char *errors,
3007 int *byteorder}
Fred Drakea4cd2612000-04-06 14:10:29 +00003008Decodes \var{length} bytes from a UTF-16 encoded buffer string and
3009returns the corresponding Unicode object.
3010
3011\var{errors} (if non-NULL) defines the error handling. It defaults
3012to ``strict''.
3013
3014If \var{byteorder} is non-\NULL{}, the decoder starts decoding using
3015the given byte order:
3016
3017\begin{verbatim}
3018 *byteorder == -1: little endian
3019 *byteorder == 0: native order
3020 *byteorder == 1: big endian
3021\end{verbatim}
3022
3023and then switches according to all byte order marks (BOM) it finds in
3024the input data. BOM marks are not copied into the resulting Unicode
3025string. After completion, \var{*byteorder} is set to the current byte
3026order at the end of input data.
3027
3028If \var{byteorder} is \NULL{}, the codec starts in native order mode.
3029
3030Returns \NULL{} in case an exception was raised by the codec.
3031\end{cfuncdesc}
3032
3033\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUTF16}{const Py_UNICODE *s,
3034 int size,
3035 const char *errors,
3036 int byteorder}
Fred Drakea4cd2612000-04-06 14:10:29 +00003037Returns a Python string object holding the UTF-16 encoded value of the
3038Unicode data in \var{s}.
3039
Fred Drakea8455ab2000-06-16 19:58:42 +00003040If \var{byteorder} is not \code{0}, output is written according to the
Fred Drakea4cd2612000-04-06 14:10:29 +00003041following byte order:
3042
3043\begin{verbatim}
3044 byteorder == -1: little endian
3045 byteorder == 0: native byte order (writes a BOM mark)
3046 byteorder == 1: big endian
3047\end{verbatim}
3048
Fred Drakea8455ab2000-06-16 19:58:42 +00003049If byteorder is \code{0}, the output string will always start with the
Fred Drakea4cd2612000-04-06 14:10:29 +00003050Unicode BOM mark (U+FEFF). In the other two modes, no BOM mark is
3051prepended.
3052
3053Note that \ctype{Py_UNICODE} data is being interpreted as UTF-16
3054reduced to UCS-2. This trick makes it possible to add full UTF-16
3055capabilities at a later point without comprimising the APIs.
3056
3057Returns \NULL{} in case an exception was raised by the codec.
3058\end{cfuncdesc}
3059
3060\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUTF16String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003061Returns a Python string using the UTF-16 encoding in native byte
3062order. The string always starts with a BOM mark. Error handling is
3063``strict''. Returns \NULL{} in case an exception was raised by the
3064codec.
3065\end{cfuncdesc}
3066
3067% --- Unicode-Escape Codecs ----------------------------------------------
3068
3069These are the ``Unicode Esacpe'' codec APIs:
3070
3071\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUnicodeEscape}{const char *s,
3072 int size,
3073 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003074Creates a Unicode object by decoding \var{size} bytes of the Unicode-Esacpe
3075encoded string \var{s}. Returns \NULL{} in case an exception was
3076raised by the codec.
3077\end{cfuncdesc}
3078
3079\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUnicodeEscape}{const Py_UNICODE *s,
3080 int size,
3081 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003082Encodes the \ctype{Py_UNICODE} buffer of the given size using Unicode-Escape
3083and returns a Python string object. Returns \NULL{} in case an
3084exception was raised by the codec.
3085\end{cfuncdesc}
3086
3087\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUnicodeEscapeString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003088Encodes a Unicode objects using Unicode-Escape and returns the result
3089as Python string object. Error handling is ``strict''. Returns
3090\NULL{} in case an exception was raised by the codec.
3091\end{cfuncdesc}
3092
3093% --- Raw-Unicode-Escape Codecs ------------------------------------------
3094
3095These are the ``Raw Unicode Esacpe'' codec APIs:
3096
3097\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeRawUnicodeEscape}{const char *s,
3098 int size,
3099 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003100Creates a Unicode object by decoding \var{size} bytes of the Raw-Unicode-Esacpe
3101encoded string \var{s}. Returns \NULL{} in case an exception was
3102raised by the codec.
3103\end{cfuncdesc}
3104
3105\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeRawUnicodeEscape}{const Py_UNICODE *s,
3106 int size,
3107 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003108Encodes the \ctype{Py_UNICODE} buffer of the given size using Raw-Unicode-Escape
3109and returns a Python string object. Returns \NULL{} in case an
3110exception was raised by the codec.
3111\end{cfuncdesc}
3112
3113\begin{cfuncdesc}{PyObject*}{PyUnicode_AsRawUnicodeEscapeString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003114Encodes a Unicode objects using Raw-Unicode-Escape and returns the result
3115as Python string object. Error handling is ``strict''. Returns
3116\NULL{} in case an exception was raised by the codec.
3117\end{cfuncdesc}
3118
3119% --- Latin-1 Codecs -----------------------------------------------------
3120
3121These are the Latin-1 codec APIs:
3122
3123Latin-1 corresponds to the first 256 Unicode ordinals and only these
3124are accepted by the codecs during encoding.
3125
3126\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeLatin1}{const char *s,
Fred Drake1d158692000-06-18 05:21:21 +00003127 int size,
3128 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003129Creates a Unicode object by decoding \var{size} bytes of the Latin-1
3130encoded string \var{s}. Returns \NULL{} in case an exception was
3131raised by the codec.
3132\end{cfuncdesc}
3133
3134\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeLatin1}{const Py_UNICODE *s,
Fred Drake1d158692000-06-18 05:21:21 +00003135 int size,
3136 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003137Encodes the \ctype{Py_UNICODE} buffer of the given size using Latin-1
3138and returns a Python string object. Returns \NULL{} in case an
3139exception was raised by the codec.
3140\end{cfuncdesc}
3141
3142\begin{cfuncdesc}{PyObject*}{PyUnicode_AsLatin1String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003143Encodes a Unicode objects using Latin-1 and returns the result as
3144Python string object. Error handling is ``strict''. Returns
3145\NULL{} in case an exception was raised by the codec.
3146\end{cfuncdesc}
3147
3148% --- ASCII Codecs -------------------------------------------------------
3149
Fred Drake1d158692000-06-18 05:21:21 +00003150These are the \ASCII{} codec APIs. Only 7-bit \ASCII{} data is
3151accepted. All other codes generate errors.
Fred Drakea4cd2612000-04-06 14:10:29 +00003152
3153\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeASCII}{const char *s,
Fred Drake1d158692000-06-18 05:21:21 +00003154 int size,
3155 const char *errors}
3156Creates a Unicode object by decoding \var{size} bytes of the
3157\ASCII{} encoded string \var{s}. Returns \NULL{} in case an exception
3158was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003159\end{cfuncdesc}
3160
3161\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeASCII}{const Py_UNICODE *s,
Fred Drake1d158692000-06-18 05:21:21 +00003162 int size,
3163 const char *errors}
3164Encodes the \ctype{Py_UNICODE} buffer of the given size using
3165\ASCII{} and returns a Python string object. Returns \NULL{} in case
3166an exception was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003167\end{cfuncdesc}
3168
3169\begin{cfuncdesc}{PyObject*}{PyUnicode_AsASCIIString}{PyObject *unicode}
Fred Drake1d158692000-06-18 05:21:21 +00003170Encodes a Unicode objects using \ASCII{} and returns the result as Python
Fred Drakea4cd2612000-04-06 14:10:29 +00003171string object. Error handling is ``strict''. Returns
3172\NULL{} in case an exception was raised by the codec.
3173\end{cfuncdesc}
3174
3175% --- Character Map Codecs -----------------------------------------------
3176
3177These are the mapping codec APIs:
3178
3179This codec is special in that it can be used to implement many
3180different codecs (and this is in fact what was done to obtain most of
3181the standard codecs included in the \module{encodings} package). The
3182codec uses mapping to encode and decode characters.
3183
3184Decoding mappings must map single string characters to single Unicode
3185characters, integers (which are then interpreted as Unicode ordinals)
3186or None (meaning "undefined mapping" and causing an error).
3187
3188Encoding mappings must map single Unicode characters to single string
3189characters, integers (which are then interpreted as Latin-1 ordinals)
3190or None (meaning "undefined mapping" and causing an error).
3191
3192The mapping objects provided must only support the __getitem__ mapping
3193interface.
3194
3195If a character lookup fails with a LookupError, the character is
3196copied as-is meaning that its ordinal value will be interpreted as
3197Unicode or Latin-1 ordinal resp. Because of this, mappings only need
3198to contain those mappings which map characters to different code
3199points.
3200
3201\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeCharmap}{const char *s,
3202 int size,
3203 PyObject *mapping,
3204 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003205Creates a Unicode object by decoding \var{size} bytes of the encoded
3206string \var{s} using the given \var{mapping} object. Returns \NULL{}
3207in case an exception was raised by the codec.
3208\end{cfuncdesc}
3209
3210\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeCharmap}{const Py_UNICODE *s,
3211 int size,
3212 PyObject *mapping,
3213 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003214Encodes the \ctype{Py_UNICODE} buffer of the given size using the
3215given \var{mapping} object and returns a Python string object.
3216Returns \NULL{} in case an exception was raised by the codec.
3217\end{cfuncdesc}
3218
3219\begin{cfuncdesc}{PyObject*}{PyUnicode_AsCharmapString}{PyObject *unicode,
3220 PyObject *mapping}
Fred Drakea4cd2612000-04-06 14:10:29 +00003221Encodes a Unicode objects using the given \var{mapping} object and
3222returns the result as Python string object. Error handling is
3223``strict''. Returns \NULL{} in case an exception was raised by the
3224codec.
3225\end{cfuncdesc}
3226
3227The following codec API is special in that maps Unicode to Unicode.
3228
3229\begin{cfuncdesc}{PyObject*}{PyUnicode_TranslateCharmap}{const Py_UNICODE *s,
3230 int size,
3231 PyObject *table,
3232 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003233Translates a \ctype{Py_UNICODE} buffer of the given length by applying
3234a character mapping \var{table} to it and returns the resulting
Fred Drake1d158692000-06-18 05:21:21 +00003235Unicode object. Returns \NULL{} when an exception was raised by the
3236codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003237
3238The \var{mapping} table must map Unicode ordinal integers to Unicode
3239ordinal integers or None (causing deletion of the character).
3240
3241Mapping tables must only provide the __getitem__ interface,
3242e.g. dictionaries or sequences. Unmapped character ordinals (ones
3243which cause a LookupError) are left untouched and are copied as-is.
Fred Drakea4cd2612000-04-06 14:10:29 +00003244\end{cfuncdesc}
3245
3246% --- MBCS codecs for Windows --------------------------------------------
3247
Fred Drake1d158692000-06-18 05:21:21 +00003248These are the MBCS codec APIs. They are currently only available on
Fred Drakea4cd2612000-04-06 14:10:29 +00003249Windows and use the Win32 MBCS converters to implement the
Fred Drake1d158692000-06-18 05:21:21 +00003250conversions. Note that MBCS (or DBCS) is a class of encodings, not
3251just one. The target encoding is defined by the user settings on the
3252machine running the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003253
3254\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeMBCS}{const char *s,
3255 int size,
3256 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003257Creates a Unicode object by decoding \var{size} bytes of the MBCS
Fred Drake1d158692000-06-18 05:21:21 +00003258encoded string \var{s}. Returns \NULL{} in case an exception was
Fred Drakea4cd2612000-04-06 14:10:29 +00003259raised by the codec.
3260\end{cfuncdesc}
3261
3262\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeMBCS}{const Py_UNICODE *s,
3263 int size,
3264 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003265Encodes the \ctype{Py_UNICODE} buffer of the given size using MBCS
3266and returns a Python string object. Returns \NULL{} in case an
3267exception was raised by the codec.
3268\end{cfuncdesc}
3269
3270\begin{cfuncdesc}{PyObject*}{PyUnicode_AsMBCSString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00003271Encodes a Unicode objects using MBCS and returns the result as Python
Fred Drake1d158692000-06-18 05:21:21 +00003272string object. Error handling is ``strict''. Returns \NULL{} in case
3273an exception was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00003274\end{cfuncdesc}
3275
3276% --- Methods & Slots ----------------------------------------------------
3277
3278\subsubsection{Methods and Slot Functions \label{unicodeMethodsAndSlots}}
3279
3280The following APIs are capable of handling Unicode objects and strings
3281on input (we refer to them as strings in the descriptions) and return
3282Unicode objects or integers as apporpriate.
3283
3284They all return \NULL{} or -1 in case an exception occurrs.
3285
3286\begin{cfuncdesc}{PyObject*}{PyUnicode_Concat}{PyObject *left,
3287 PyObject *right}
Fred Drakea4cd2612000-04-06 14:10:29 +00003288Concat two strings giving a new Unicode string.
3289\end{cfuncdesc}
3290
3291\begin{cfuncdesc}{PyObject*}{PyUnicode_Split}{PyObject *s,
3292 PyObject *sep,
3293 int maxsplit}
Fred Drakea4cd2612000-04-06 14:10:29 +00003294Split a string giving a list of Unicode strings.
3295
3296If sep is NULL, splitting will be done at all whitespace
3297substrings. Otherwise, splits occur at the given separator.
3298
3299At most maxsplit splits will be done. If negative, no limit is set.
3300
3301Separators are not included in the resulting list.
3302\end{cfuncdesc}
3303
3304\begin{cfuncdesc}{PyObject*}{PyUnicode_Splitlines}{PyObject *s,
3305 int maxsplit}
Fred Drake1d158692000-06-18 05:21:21 +00003306Split a Unicode string at line breaks, returning a list of Unicode
3307strings. CRLF is considered to be one line break. The Line break
3308characters are not included in the resulting strings.
Fred Drakea4cd2612000-04-06 14:10:29 +00003309\end{cfuncdesc}
3310
3311\begin{cfuncdesc}{PyObject*}{PyUnicode_Translate}{PyObject *str,
3312 PyObject *table,
3313 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00003314Translate a string by applying a character mapping table to it and
3315return the resulting Unicode object.
3316
3317The mapping table must map Unicode ordinal integers to Unicode ordinal
3318integers or None (causing deletion of the character).
3319
3320Mapping tables must only provide the __getitem__ interface,
3321e.g. dictionaries or sequences. Unmapped character ordinals (ones
3322which cause a LookupError) are left untouched and are copied as-is.
3323
3324\var{errors} has the usual meaning for codecs. It may be \NULL{}
3325which indicates to use the default error handling.
Fred Drakea4cd2612000-04-06 14:10:29 +00003326\end{cfuncdesc}
3327
3328\begin{cfuncdesc}{PyObject*}{PyUnicode_Join}{PyObject *separator,
3329 PyObject *seq}
Fred Drakea4cd2612000-04-06 14:10:29 +00003330Join a sequence of strings using the given separator and return
3331the resulting Unicode string.
3332\end{cfuncdesc}
3333
3334\begin{cfuncdesc}{PyObject*}{PyUnicode_Tailmatch}{PyObject *str,
3335 PyObject *substr,
3336 int start,
3337 int end,
3338 int direction}
Fred Drakea4cd2612000-04-06 14:10:29 +00003339Return 1 if \var{substr} matches \var{str}[\var{start}:\var{end}] at
3340the given tail end (\var{direction} == -1 means to do a prefix match,
3341\var{direction} == 1 a suffix match), 0 otherwise.
3342\end{cfuncdesc}
3343
3344\begin{cfuncdesc}{PyObject*}{PyUnicode_Find}{PyObject *str,
3345 PyObject *substr,
3346 int start,
3347 int end,
3348 int direction}
Fred Drakea4cd2612000-04-06 14:10:29 +00003349Return the first position of \var{substr} in
3350\var{str}[\var{start}:\var{end}] using the given \var{direction}
3351(\var{direction} == 1 means to do a forward search,
3352\var{direction} == -1 a backward search), 0 otherwise.
3353\end{cfuncdesc}
3354
3355\begin{cfuncdesc}{PyObject*}{PyUnicode_Count}{PyObject *str,
3356 PyObject *substr,
3357 int start,
3358 int end}
Fred Drakea4cd2612000-04-06 14:10:29 +00003359Count the number of occurrences of \var{substr} in
3360\var{str}[\var{start}:\var{end}]
3361\end{cfuncdesc}
3362
3363\begin{cfuncdesc}{PyObject*}{PyUnicode_Replace}{PyObject *str,
3364 PyObject *substr,
3365 PyObject *replstr,
3366 int maxcount}
Fred Drakea4cd2612000-04-06 14:10:29 +00003367Replace at most \var{maxcount} occurrences of \var{substr} in
3368\var{str} with \var{replstr} and return the resulting Unicode object.
3369\var{maxcount} == -1 means: replace all occurrences.
3370\end{cfuncdesc}
3371
Fred Drake1d158692000-06-18 05:21:21 +00003372\begin{cfuncdesc}{int}{PyUnicode_Compare}{PyObject *left, PyObject *right}
Fred Drakea4cd2612000-04-06 14:10:29 +00003373Compare two strings and return -1, 0, 1 for less than, equal,
3374greater than resp.
3375\end{cfuncdesc}
3376
3377\begin{cfuncdesc}{PyObject*}{PyUnicode_Format}{PyObject *format,
3378 PyObject *args}
Fred Drake1d158692000-06-18 05:21:21 +00003379Returns a new string object from \var{format} and \var{args}; this is
3380analogous to \code{\var{format} \%\ \var{args}}. The
3381\var{args} argument must be a tuple.
Fred Drakea4cd2612000-04-06 14:10:29 +00003382\end{cfuncdesc}
3383
3384\begin{cfuncdesc}{int}{PyUnicode_Contains}{PyObject *container,
3385 PyObject *element}
Fred Drakea4cd2612000-04-06 14:10:29 +00003386Checks whether \var{element} is contained in \var{container} and
Fred Drake1d158692000-06-18 05:21:21 +00003387returns true or false accordingly.
Fred Drakea4cd2612000-04-06 14:10:29 +00003388
Fred Drake1d158692000-06-18 05:21:21 +00003389\var{element} has to coerce to a one element Unicode string. \code{-1} is
Fred Drakea4cd2612000-04-06 14:10:29 +00003390returned in case of an error.
3391\end{cfuncdesc}
3392
3393
Fred Drake58c5a2a1999-08-04 13:13:24 +00003394\subsection{Buffer Objects \label{bufferObjects}}
Fred Drake659ebfa2000-04-03 15:42:13 +00003395\sectionauthor{Greg Stein}{gstein@lyra.org}
Fred Drake58c5a2a1999-08-04 13:13:24 +00003396
Fred Drake659ebfa2000-04-03 15:42:13 +00003397\obindex{buffer}
3398Python objects implemented in C can export a group of functions called
3399the ``buffer\index{buffer interface} interface.'' These functions can
3400be used by an object to expose its data in a raw, byte-oriented
3401format. Clients of the object can use the buffer interface to access
3402the object data directly, without needing to copy it first.
3403
3404Two examples of objects that support
3405the buffer interface are strings and arrays. The string object exposes
3406the character contents in the buffer interface's byte-oriented
3407form. An array can also expose its contents, but it should be noted
3408that array elements may be multi-byte values.
3409
3410An example user of the buffer interface is the file object's
3411\method{write()} method. Any object that can export a series of bytes
3412through the buffer interface can be written to a file. There are a
Fred Drake88fdaa72001-07-20 20:56:11 +00003413number of format codes to \cfunction{PyArg_ParseTuple()} that operate
Fred Drake659ebfa2000-04-03 15:42:13 +00003414against an object's buffer interface, returning data from the target
3415object.
3416
3417More information on the buffer interface is provided in the section
3418``Buffer Object Structures'' (section \ref{buffer-structs}), under
3419the description for \ctype{PyBufferProcs}\ttindex{PyBufferProcs}.
3420
3421A ``buffer object'' is defined in the \file{bufferobject.h} header
3422(included by \file{Python.h}). These objects look very similar to
3423string objects at the Python programming level: they support slicing,
3424indexing, concatenation, and some other standard string
3425operations. However, their data can come from one of two sources: from
3426a block of memory, or from another object which exports the buffer
3427interface.
3428
3429Buffer objects are useful as a way to expose the data from another
3430object's buffer interface to the Python programmer. They can also be
3431used as a zero-copy slicing mechanism. Using their ability to
3432reference a block of memory, it is possible to expose any data to the
3433Python programmer quite easily. The memory could be a large, constant
3434array in a C extension, it could be a raw block of memory for
3435manipulation before passing to an operating system library, or it
3436could be used to pass around structured data in its native, in-memory
3437format.
3438
3439\begin{ctypedesc}{PyBufferObject}
3440This subtype of \ctype{PyObject} represents a buffer object.
3441\end{ctypedesc}
Fred Drake58c5a2a1999-08-04 13:13:24 +00003442
3443\begin{cvardesc}{PyTypeObject}{PyBuffer_Type}
3444The instance of \ctype{PyTypeObject} which represents the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00003445buffer type; it is the same object as \code{types.BufferType} in the
3446Python layer.\withsubitem{(in module types)}{\ttindex{BufferType}}.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003447\end{cvardesc}
3448
3449\begin{cvardesc}{int}{Py_END_OF_BUFFER}
Fred Drake659ebfa2000-04-03 15:42:13 +00003450This constant may be passed as the \var{size} parameter to
3451\cfunction{PyBuffer_FromObject()} or
3452\cfunction{PyBuffer_FromReadWriteObject()}. It indicates that the new
3453\ctype{PyBufferObject} should refer to \var{base} object from the
3454specified \var{offset} to the end of its exported buffer. Using this
3455enables the caller to avoid querying the \var{base} object for its
3456length.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003457\end{cvardesc}
3458
3459\begin{cfuncdesc}{int}{PyBuffer_Check}{PyObject *p}
3460Return true if the argument has type \cdata{PyBuffer_Type}.
3461\end{cfuncdesc}
3462
3463\begin{cfuncdesc}{PyObject*}{PyBuffer_FromObject}{PyObject *base,
3464 int offset, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00003465Return a new read-only buffer object. This raises
3466\exception{TypeError} if \var{base} doesn't support the read-only
3467buffer protocol or doesn't provide exactly one buffer segment, or it
3468raises \exception{ValueError} if \var{offset} is less than zero. The
3469buffer will hold a reference to the \var{base} object, and the
3470buffer's contents will refer to the \var{base} object's buffer
3471interface, starting as position \var{offset} and extending for
3472\var{size} bytes. If \var{size} is \constant{Py_END_OF_BUFFER}, then
3473the new buffer's contents extend to the length of the
3474\var{base} object's exported buffer data.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003475\end{cfuncdesc}
3476
3477\begin{cfuncdesc}{PyObject*}{PyBuffer_FromReadWriteObject}{PyObject *base,
3478 int offset,
3479 int size}
3480Return a new writable buffer object. Parameters and exceptions are
3481similar to those for \cfunction{PyBuffer_FromObject()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003482If the \var{base} object does not export the writeable buffer
3483protocol, then \exception{TypeError} is raised.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003484\end{cfuncdesc}
3485
3486\begin{cfuncdesc}{PyObject*}{PyBuffer_FromMemory}{void *ptr, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00003487Return a new read-only buffer object that reads from a specified
3488location in memory, with a specified size.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003489The caller is responsible for ensuring that the memory buffer, passed
3490in as \var{ptr}, is not deallocated while the returned buffer object
3491exists. Raises \exception{ValueError} if \var{size} is less than
Fred Drake659ebfa2000-04-03 15:42:13 +00003492zero. Note that \constant{Py_END_OF_BUFFER} may \emph{not} be passed
3493for the \var{size} parameter; \exception{ValueError} will be raised in
3494that case.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003495\end{cfuncdesc}
3496
3497\begin{cfuncdesc}{PyObject*}{PyBuffer_FromReadWriteMemory}{void *ptr, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00003498Similar to \cfunction{PyBuffer_FromMemory()}, but the returned buffer
3499is writable.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003500\end{cfuncdesc}
3501
3502\begin{cfuncdesc}{PyObject*}{PyBuffer_New}{int size}
3503Returns a new writable buffer object that maintains its own memory
Fred Drake659ebfa2000-04-03 15:42:13 +00003504buffer of \var{size} bytes. \exception{ValueError} is returned if
3505\var{size} is not zero or positive.
Fred Drake58c5a2a1999-08-04 13:13:24 +00003506\end{cfuncdesc}
3507
Guido van Rossum44475131998-04-21 15:30:01 +00003508
Fred Drakeefd146c1999-02-15 15:30:45 +00003509\subsection{Tuple Objects \label{tupleObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003510
Fred Drake659ebfa2000-04-03 15:42:13 +00003511\obindex{tuple}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003512\begin{ctypedesc}{PyTupleObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003513This subtype of \ctype{PyObject} represents a Python tuple object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003514\end{ctypedesc}
3515
3516\begin{cvardesc}{PyTypeObject}{PyTuple_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003517This instance of \ctype{PyTypeObject} represents the Python tuple
3518type; it is the same object as \code{types.TupleType} in the Python
3519layer.\withsubitem{(in module types)}{\ttindex{TupleType}}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003520\end{cvardesc}
3521
3522\begin{cfuncdesc}{int}{PyTuple_Check}{PyObject *p}
3523Return true if the argument is a tuple object.
3524\end{cfuncdesc}
3525
Fred Drake659ebfa2000-04-03 15:42:13 +00003526\begin{cfuncdesc}{PyObject*}{PyTuple_New}{int len}
3527Return a new tuple object of size \var{len}, or \NULL{} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003528\end{cfuncdesc}
3529
Fred Drakea05460c2001-02-12 17:38:18 +00003530\begin{cfuncdesc}{int}{PyTuple_Size}{PyObject *p}
Fred Drakee058b4f1998-02-16 06:15:35 +00003531Takes a pointer to a tuple object, and returns the size
Fred Drakee5bf8b21998-02-12 21:22:28 +00003532of that tuple.
3533\end{cfuncdesc}
3534
Fred Drakea05460c2001-02-12 17:38:18 +00003535\begin{cfuncdesc}{PyObject*}{PyTuple_GetItem}{PyObject *p, int pos}
Fred Drakee058b4f1998-02-16 06:15:35 +00003536Returns the object at position \var{pos} in the tuple pointed
3537to by \var{p}. If \var{pos} is out of bounds, returns \NULL{} and
Fred Drake659ebfa2000-04-03 15:42:13 +00003538sets an \exception{IndexError} exception.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003539\end{cfuncdesc}
3540
Fred Drakea05460c2001-02-12 17:38:18 +00003541\begin{cfuncdesc}{PyObject*}{PyTuple_GET_ITEM}{PyObject *p, int pos}
Fred Drakefac312f2001-05-29 15:13:00 +00003542Like \cfunction{PyTuple_GetItem()}, but does no checking of its
3543arguments.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003544\end{cfuncdesc}
3545
Fred Drakea05460c2001-02-12 17:38:18 +00003546\begin{cfuncdesc}{PyObject*}{PyTuple_GetSlice}{PyObject *p,
3547 int low, int high}
Fred Drakee058b4f1998-02-16 06:15:35 +00003548Takes a slice of the tuple pointed to by \var{p} from
3549\var{low} to \var{high} and returns it as a new tuple.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003550\end{cfuncdesc}
3551
Fred Drake659ebfa2000-04-03 15:42:13 +00003552\begin{cfuncdesc}{int}{PyTuple_SetItem}{PyObject *p,
3553 int pos, PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00003554Inserts a reference to object \var{o} at position \var{pos} of
3555the tuple pointed to by \var{p}. It returns \code{0} on success.
Fred Drake659ebfa2000-04-03 15:42:13 +00003556\strong{Note:} This function ``steals'' a reference to \var{o}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003557\end{cfuncdesc}
3558
Fred Drake659ebfa2000-04-03 15:42:13 +00003559\begin{cfuncdesc}{void}{PyTuple_SET_ITEM}{PyObject *p,
3560 int pos, PyObject *o}
Fred Drakefac312f2001-05-29 15:13:00 +00003561Like \cfunction{PyTuple_SetItem()}, but does no error checking, and
Fred Drakee5bf8b21998-02-12 21:22:28 +00003562should \emph{only} be used to fill in brand new tuples.
Fred Drake659ebfa2000-04-03 15:42:13 +00003563\strong{Note:} This function ``steals'' a reference to \var{o}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003564\end{cfuncdesc}
3565
Fred Drakefac312f2001-05-29 15:13:00 +00003566\begin{cfuncdesc}{int}{_PyTuple_Resize}{PyObject **p, int newsize}
Fred Drake659ebfa2000-04-03 15:42:13 +00003567Can be used to resize a tuple. \var{newsize} will be the new length
3568of the tuple. Because tuples are \emph{supposed} to be immutable,
3569this should only be used if there is only one reference to the object.
3570Do \emph{not} use this if the tuple may already be known to some other
Fred Drakefac312f2001-05-29 15:13:00 +00003571part of the code. The tuple will always grow or shrink at the end.
3572Think of this as destroying the old tuple and creating a new one, only
3573more efficiently. Returns \code{0} on success. Client code should
3574never assume that the resulting value of \code{*\var{p}} will be the
3575same as before calling this function. If the object referenced by
3576\code{*\var{p}} is replaced, the original \code{*\var{p}} is
3577destroyed. On failure, returns \code{-1} and sets \code{*\var{p}} to
3578\NULL, and raises \exception{MemoryError} or \exception{SystemError}.
3579\versionchanged[Removed unused third parameter, \var{last_is_sticky}]{2.2}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003580\end{cfuncdesc}
3581
3582
Fred Drakeefd146c1999-02-15 15:30:45 +00003583\subsection{List Objects \label{listObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003584
Fred Drake659ebfa2000-04-03 15:42:13 +00003585\obindex{list}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003586\begin{ctypedesc}{PyListObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003587This subtype of \ctype{PyObject} represents a Python list object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003588\end{ctypedesc}
3589
3590\begin{cvardesc}{PyTypeObject}{PyList_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003591This instance of \ctype{PyTypeObject} represents the Python list
3592type. This is the same object as \code{types.ListType}.
3593\withsubitem{(in module types)}{\ttindex{ListType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003594\end{cvardesc}
3595
3596\begin{cfuncdesc}{int}{PyList_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003597Returns true if its argument is a \ctype{PyListObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003598\end{cfuncdesc}
3599
Fred Drake659ebfa2000-04-03 15:42:13 +00003600\begin{cfuncdesc}{PyObject*}{PyList_New}{int len}
3601Returns a new list of length \var{len} on success, or \NULL{} on
Guido van Rossum3c4378b1998-04-14 20:21:10 +00003602failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003603\end{cfuncdesc}
3604
Fred Drakec6fa34e1998-04-02 06:47:24 +00003605\begin{cfuncdesc}{int}{PyList_Size}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003606Returns the length of the list object in \var{list}; this is
3607equivalent to \samp{len(\var{list})} on a list object.
3608\bifuncindex{len}
3609\end{cfuncdesc}
3610
3611\begin{cfuncdesc}{int}{PyList_GET_SIZE}{PyObject *list}
Fred Drake5d644212000-10-07 12:31:50 +00003612Macro form of \cfunction{PyList_Size()} without error checking.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003613\end{cfuncdesc}
3614
Fred Drakec6fa34e1998-04-02 06:47:24 +00003615\begin{cfuncdesc}{PyObject*}{PyList_GetItem}{PyObject *list, int index}
Guido van Rossum44475131998-04-21 15:30:01 +00003616Returns the object at position \var{pos} in the list pointed
3617to by \var{p}. If \var{pos} is out of bounds, returns \NULL{} and
Fred Drake659ebfa2000-04-03 15:42:13 +00003618sets an \exception{IndexError} exception.
3619\end{cfuncdesc}
3620
3621\begin{cfuncdesc}{PyObject*}{PyList_GET_ITEM}{PyObject *list, int i}
3622Macro form of \cfunction{PyList_GetItem()} without error checking.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003623\end{cfuncdesc}
3624
Fred Drakec6fa34e1998-04-02 06:47:24 +00003625\begin{cfuncdesc}{int}{PyList_SetItem}{PyObject *list, int index,
3626 PyObject *item}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00003627Sets the item at index \var{index} in list to \var{item}.
Fred Drakebab29652001-07-10 16:10:08 +00003628Returns \code{0} on success or \code{-1} on failure.
Fred Drake00d0cb62001-06-03 03:12:57 +00003629\strong{Note:} This function ``steals'' a reference to \var{item} and
3630discards a reference to an item already in the list at the affected
3631position.
Fred Drake659ebfa2000-04-03 15:42:13 +00003632\end{cfuncdesc}
3633
Fred Drakebab29652001-07-10 16:10:08 +00003634\begin{cfuncdesc}{void}{PyList_SET_ITEM}{PyObject *list, int i,
Fred Drake659ebfa2000-04-03 15:42:13 +00003635 PyObject *o}
3636Macro form of \cfunction{PyList_SetItem()} without error checking.
Fred Drake00d0cb62001-06-03 03:12:57 +00003637\strong{Note:} This function ``steals'' a reference to \var{item},
3638and, unlike \cfunction{PyList_SetItem()}, does \emph{not} discard a
Fred Drakebab29652001-07-10 16:10:08 +00003639reference to any item that it being replaced; any reference in
3640\var{list} at position \var{i} will be leaked. This is normally only
3641used to fill in new lists where there is no previous content.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003642\end{cfuncdesc}
3643
Fred Drakec6fa34e1998-04-02 06:47:24 +00003644\begin{cfuncdesc}{int}{PyList_Insert}{PyObject *list, int index,
Guido van Rossum44475131998-04-21 15:30:01 +00003645 PyObject *item}
3646Inserts the item \var{item} into list \var{list} in front of index
Fred Drake659ebfa2000-04-03 15:42:13 +00003647\var{index}. Returns \code{0} if successful; returns \code{-1} and
3648raises an exception if unsuccessful. Analogous to
3649\code{\var{list}.insert(\var{index}, \var{item})}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003650\end{cfuncdesc}
3651
Fred Drakec6fa34e1998-04-02 06:47:24 +00003652\begin{cfuncdesc}{int}{PyList_Append}{PyObject *list, PyObject *item}
Guido van Rossum44475131998-04-21 15:30:01 +00003653Appends the object \var{item} at the end of list \var{list}. Returns
Fred Drake659ebfa2000-04-03 15:42:13 +00003654\code{0} if successful; returns \code{-1} and sets an exception if
3655unsuccessful. Analogous to \code{\var{list}.append(\var{item})}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003656\end{cfuncdesc}
3657
Fred Drakec6fa34e1998-04-02 06:47:24 +00003658\begin{cfuncdesc}{PyObject*}{PyList_GetSlice}{PyObject *list,
3659 int low, int high}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00003660Returns a list of the objects in \var{list} containing the objects
Guido van Rossum44475131998-04-21 15:30:01 +00003661\emph{between} \var{low} and \var{high}. Returns NULL and sets an
3662exception if unsuccessful.
Fred Drake659ebfa2000-04-03 15:42:13 +00003663Analogous to \code{\var{list}[\var{low}:\var{high}]}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003664\end{cfuncdesc}
3665
Fred Drakec6fa34e1998-04-02 06:47:24 +00003666\begin{cfuncdesc}{int}{PyList_SetSlice}{PyObject *list,
3667 int low, int high,
3668 PyObject *itemlist}
Fred Drake659ebfa2000-04-03 15:42:13 +00003669Sets the slice of \var{list} between \var{low} and \var{high} to the
3670contents of \var{itemlist}. Analogous to
3671\code{\var{list}[\var{low}:\var{high}] = \var{itemlist}}. Returns
3672\code{0} on success, \code{-1} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003673\end{cfuncdesc}
3674
Fred Drakec6fa34e1998-04-02 06:47:24 +00003675\begin{cfuncdesc}{int}{PyList_Sort}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003676Sorts the items of \var{list} in place. Returns \code{0} on success,
3677\code{-1} on failure. This is equivalent to
3678\samp{\var{list}.sort()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003679\end{cfuncdesc}
3680
Fred Drakec6fa34e1998-04-02 06:47:24 +00003681\begin{cfuncdesc}{int}{PyList_Reverse}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003682Reverses the items of \var{list} in place. Returns \code{0} on
3683success, \code{-1} on failure. This is the equivalent of
3684\samp{\var{list}.reverse()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003685\end{cfuncdesc}
3686
Fred Drakec6fa34e1998-04-02 06:47:24 +00003687\begin{cfuncdesc}{PyObject*}{PyList_AsTuple}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003688Returns a new tuple object containing the contents of \var{list};
3689equivalent to \samp{tuple(\var{list})}.\bifuncindex{tuple}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003690\end{cfuncdesc}
3691
3692
Fred Drakeefd146c1999-02-15 15:30:45 +00003693\section{Mapping Objects \label{mapObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003694
Fred Drake659ebfa2000-04-03 15:42:13 +00003695\obindex{mapping}
3696
3697
Fred Drakeefd146c1999-02-15 15:30:45 +00003698\subsection{Dictionary Objects \label{dictObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003699
Fred Drake659ebfa2000-04-03 15:42:13 +00003700\obindex{dictionary}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003701\begin{ctypedesc}{PyDictObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003702This subtype of \ctype{PyObject} represents a Python dictionary object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003703\end{ctypedesc}
3704
3705\begin{cvardesc}{PyTypeObject}{PyDict_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003706This instance of \ctype{PyTypeObject} represents the Python dictionary
3707type. This is exposed to Python programs as \code{types.DictType} and
3708\code{types.DictionaryType}.
3709\withsubitem{(in module types)}{\ttindex{DictType}\ttindex{DictionaryType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003710\end{cvardesc}
3711
3712\begin{cfuncdesc}{int}{PyDict_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003713Returns true if its argument is a \ctype{PyDictObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003714\end{cfuncdesc}
3715
Fred Drakec6fa34e1998-04-02 06:47:24 +00003716\begin{cfuncdesc}{PyObject*}{PyDict_New}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00003717Returns a new empty dictionary, or \NULL{} on failure.
3718\end{cfuncdesc}
3719
3720\begin{cfuncdesc}{void}{PyDict_Clear}{PyObject *p}
3721Empties an existing dictionary of all key-value pairs.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003722\end{cfuncdesc}
3723
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00003724\begin{cfuncdesc}{PyObject*}{PyDict_Copy}{PyObject *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00003725Returns a new dictionary that contains the same key-value pairs as p.
3726Empties an existing dictionary of all key-value pairs.
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00003727\end{cfuncdesc}
3728
Fred Drake659ebfa2000-04-03 15:42:13 +00003729\begin{cfuncdesc}{int}{PyDict_SetItem}{PyObject *p, PyObject *key,
3730 PyObject *val}
Fred Drakebab29652001-07-10 16:10:08 +00003731Inserts \var{value} into the dictionary \var{p} with a key of \var{key}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003732\var{key} must be hashable; if it isn't, \exception{TypeError} will be
3733raised.
Fred Drakebab29652001-07-10 16:10:08 +00003734Returns \code{0} on success or \code{-1} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003735\end{cfuncdesc}
3736
Fred Drake83e01bf2001-03-16 15:41:29 +00003737\begin{cfuncdesc}{int}{PyDict_SetItemString}{PyObject *p,
Fred Drakee5bf8b21998-02-12 21:22:28 +00003738 char *key,
3739 PyObject *val}
Fred Drakebab29652001-07-10 16:10:08 +00003740Inserts \var{value} into the dictionary \var{p} using \var{key}
Fred Drake1d158692000-06-18 05:21:21 +00003741as a key. \var{key} should be a \ctype{char*}. The key object is
Fred Drakee058b4f1998-02-16 06:15:35 +00003742created using \code{PyString_FromString(\var{key})}.
Fred Drakebab29652001-07-10 16:10:08 +00003743Returns \code{0} on success or \code{-1} on failure.
Fred Drake659ebfa2000-04-03 15:42:13 +00003744\ttindex{PyString_FromString()}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003745\end{cfuncdesc}
3746
Fred Drake659ebfa2000-04-03 15:42:13 +00003747\begin{cfuncdesc}{int}{PyDict_DelItem}{PyObject *p, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003748Removes the entry in dictionary \var{p} with key \var{key}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003749\var{key} must be hashable; if it isn't, \exception{TypeError} is
3750raised.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003751\end{cfuncdesc}
3752
Fred Drake659ebfa2000-04-03 15:42:13 +00003753\begin{cfuncdesc}{int}{PyDict_DelItemString}{PyObject *p, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003754Removes the entry in dictionary \var{p} which has a key
Fred Drake659ebfa2000-04-03 15:42:13 +00003755specified by the string \var{key}.
Fred Drakebab29652001-07-10 16:10:08 +00003756Returns \code{0} on success or \code{-1} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003757\end{cfuncdesc}
3758
Fred Drake659ebfa2000-04-03 15:42:13 +00003759\begin{cfuncdesc}{PyObject*}{PyDict_GetItem}{PyObject *p, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003760Returns the object from dictionary \var{p} which has a key
Guido van Rossum44475131998-04-21 15:30:01 +00003761\var{key}. Returns \NULL{} if the key \var{key} is not present, but
Fred Drake659ebfa2000-04-03 15:42:13 +00003762\emph{without} setting an exception.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003763\end{cfuncdesc}
3764
Fred Drake659ebfa2000-04-03 15:42:13 +00003765\begin{cfuncdesc}{PyObject*}{PyDict_GetItemString}{PyObject *p, char *key}
Fred Drakef8830d11998-04-23 14:06:01 +00003766This is the same as \cfunction{PyDict_GetItem()}, but \var{key} is
Fred Drake659ebfa2000-04-03 15:42:13 +00003767specified as a \ctype{char*}, rather than a \ctype{PyObject*}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003768\end{cfuncdesc}
3769
Fred Drake659ebfa2000-04-03 15:42:13 +00003770\begin{cfuncdesc}{PyObject*}{PyDict_Items}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003771Returns a \ctype{PyListObject} containing all the items
Guido van Rossum44475131998-04-21 15:30:01 +00003772from the dictionary, as in the dictinoary method \method{items()} (see
Fred Drakebe486461999-11-09 17:03:03 +00003773the \citetitle[../lib/lib.html]{Python Library Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003774\end{cfuncdesc}
3775
Fred Drake659ebfa2000-04-03 15:42:13 +00003776\begin{cfuncdesc}{PyObject*}{PyDict_Keys}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003777Returns a \ctype{PyListObject} containing all the keys
Guido van Rossum44475131998-04-21 15:30:01 +00003778from the dictionary, as in the dictionary method \method{keys()} (see the
Fred Drakebe486461999-11-09 17:03:03 +00003779\citetitle[../lib/lib.html]{Python Library Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003780\end{cfuncdesc}
3781
Fred Drake659ebfa2000-04-03 15:42:13 +00003782\begin{cfuncdesc}{PyObject*}{PyDict_Values}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003783Returns a \ctype{PyListObject} containing all the values
Guido van Rossum44475131998-04-21 15:30:01 +00003784from the dictionary \var{p}, as in the dictionary method
Fred Drakebe486461999-11-09 17:03:03 +00003785\method{values()} (see the \citetitle[../lib/lib.html]{Python Library
3786Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003787\end{cfuncdesc}
3788
Fred Drake659ebfa2000-04-03 15:42:13 +00003789\begin{cfuncdesc}{int}{PyDict_Size}{PyObject *p}
3790Returns the number of items in the dictionary. This is equivalent to
3791\samp{len(\var{p})} on a dictionary.\bifuncindex{len}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003792\end{cfuncdesc}
3793
Fred Drake83e01bf2001-03-16 15:41:29 +00003794\begin{cfuncdesc}{int}{PyDict_Next}{PyObject *p, int *ppos,
Fred Drake7d45d342000-08-11 17:07:32 +00003795 PyObject **pkey, PyObject **pvalue}
Fred Drake83e01bf2001-03-16 15:41:29 +00003796Iterate over all key-value pairs in the dictionary \var{p}. The
3797\ctype{int} referred to by \var{ppos} must be initialized to \code{0}
3798prior to the first call to this function to start the iteration; the
3799function returns true for each pair in the dictionary, and false once
3800all pairs have been reported. The parameters \var{pkey} and
3801\var{pvalue} should either point to \ctype{PyObject*} variables that
3802will be filled in with each key and value, respectively, or may be
Fred Drake8d00a0f2001-04-13 17:55:02 +00003803\NULL.
3804
Fred Drake83e01bf2001-03-16 15:41:29 +00003805For example:
Fred Drakee5bf8b21998-02-12 21:22:28 +00003806
Fred Drake83e01bf2001-03-16 15:41:29 +00003807\begin{verbatim}
3808PyObject *key, *value;
3809int pos = 0;
3810
3811while (PyDict_Next(self->dict, &pos, &key, &value)) {
3812 /* do something interesting with the values... */
3813 ...
3814}
3815\end{verbatim}
Fred Drake8d00a0f2001-04-13 17:55:02 +00003816
3817The dictionary \var{p} should not be mutated during iteration. It is
3818safe (since Python 2.1) to modify the values of the keys as you
3819iterate over the dictionary, for example:
3820
3821\begin{verbatim}
3822PyObject *key, *value;
3823int pos = 0;
3824
3825while (PyDict_Next(self->dict, &pos, &key, &value)) {
3826 int i = PyInt_AS_LONG(value) + 1;
3827 PyObject *o = PyInt_FromLong(i);
3828 if (o == NULL)
3829 return -1;
3830 if (PyDict_SetItem(self->dict, key, o) < 0) {
3831 Py_DECREF(o);
3832 return -1;
3833 }
3834 Py_DECREF(o);
3835}
3836\end{verbatim}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003837\end{cfuncdesc}
3838
3839
Fred Drakeefd146c1999-02-15 15:30:45 +00003840\section{Other Objects \label{otherObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003841
Fred Drakeefd146c1999-02-15 15:30:45 +00003842\subsection{File Objects \label{fileObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003843
Fred Drake659ebfa2000-04-03 15:42:13 +00003844\obindex{file}
3845Python's built-in file objects are implemented entirely on the
3846\ctype{FILE*} support from the C standard library. This is an
3847implementation detail and may change in future releases of Python.
3848
Fred Drakee5bf8b21998-02-12 21:22:28 +00003849\begin{ctypedesc}{PyFileObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003850This subtype of \ctype{PyObject} represents a Python file object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003851\end{ctypedesc}
3852
3853\begin{cvardesc}{PyTypeObject}{PyFile_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003854This instance of \ctype{PyTypeObject} represents the Python file
3855type. This is exposed to Python programs as \code{types.FileType}.
3856\withsubitem{(in module types)}{\ttindex{FileType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003857\end{cvardesc}
3858
3859\begin{cfuncdesc}{int}{PyFile_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003860Returns true if its argument is a \ctype{PyFileObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003861\end{cfuncdesc}
3862
Fred Drake659ebfa2000-04-03 15:42:13 +00003863\begin{cfuncdesc}{PyObject*}{PyFile_FromString}{char *filename, char *mode}
3864On success, returns a new file object that is opened on the
3865file given by \var{filename}, with a file mode given by \var{mode},
3866where \var{mode} has the same semantics as the standard C routine
3867\cfunction{fopen()}\ttindex{fopen()}. On failure, returns \NULL.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003868\end{cfuncdesc}
3869
Fred Drakec6fa34e1998-04-02 06:47:24 +00003870\begin{cfuncdesc}{PyObject*}{PyFile_FromFile}{FILE *fp,
Fred Drake659ebfa2000-04-03 15:42:13 +00003871 char *name, char *mode,
3872 int (*close)(FILE*)}
3873Creates a new \ctype{PyFileObject} from the already-open standard C
3874file pointer, \var{fp}. The function \var{close} will be called when
3875the file should be closed. Returns \NULL{} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003876\end{cfuncdesc}
3877
Fred Drake659ebfa2000-04-03 15:42:13 +00003878\begin{cfuncdesc}{FILE*}{PyFile_AsFile}{PyFileObject *p}
3879Returns the file object associated with \var{p} as a \ctype{FILE*}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003880\end{cfuncdesc}
3881
Fred Drakec6fa34e1998-04-02 06:47:24 +00003882\begin{cfuncdesc}{PyObject*}{PyFile_GetLine}{PyObject *p, int n}
Fred Drake659ebfa2000-04-03 15:42:13 +00003883Equivalent to \code{\var{p}.readline(\optional{\var{n}})}, this
3884function reads one line from the object \var{p}. \var{p} may be a
3885file object or any object with a \method{readline()} method. If
3886\var{n} is \code{0}, exactly one line is read, regardless of the
3887length of the line. If \var{n} is greater than \code{0}, no more than
3888\var{n} bytes will be read from the file; a partial line can be
3889returned. In both cases, an empty string is returned if the end of
3890the file is reached immediately. If \var{n} is less than \code{0},
3891however, one line is read regardless of length, but
3892\exception{EOFError} is raised if the end of the file is reached
3893immediately.
3894\withsubitem{(built-in exception)}{\ttindex{EOFError}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003895\end{cfuncdesc}
3896
Fred Drakec6fa34e1998-04-02 06:47:24 +00003897\begin{cfuncdesc}{PyObject*}{PyFile_Name}{PyObject *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00003898Returns the name of the file specified by \var{p} as a string object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003899\end{cfuncdesc}
3900
3901\begin{cfuncdesc}{void}{PyFile_SetBufSize}{PyFileObject *p, int n}
Fred Drake659ebfa2000-04-03 15:42:13 +00003902Available on systems with \cfunction{setvbuf()}\ttindex{setvbuf()}
3903only. This should only be called immediately after file object
3904creation.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003905\end{cfuncdesc}
3906
Fred Drake659ebfa2000-04-03 15:42:13 +00003907\begin{cfuncdesc}{int}{PyFile_SoftSpace}{PyObject *p, int newflag}
3908This function exists for internal use by the interpreter.
3909Sets the \member{softspace} attribute of \var{p} to \var{newflag} and
3910\withsubitem{(file attribute)}{\ttindex{softspace}}returns the
3911previous value. \var{p} does not have to be a file object
3912for this function to work properly; any object is supported (thought
3913its only interesting if the \member{softspace} attribute can be set).
3914This function clears any errors, and will return \code{0} as the
3915previous value if the attribute either does not exist or if there were
3916errors in retrieving it. There is no way to detect errors from this
3917function, but doing so should not be needed.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003918\end{cfuncdesc}
3919
Fred Drakec6fa34e1998-04-02 06:47:24 +00003920\begin{cfuncdesc}{int}{PyFile_WriteObject}{PyObject *obj, PyFileObject *p,
3921 int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00003922Writes object \var{obj} to file object \var{p}. The only supported
3923flag for \var{flags} is \constant{Py_PRINT_RAW}\ttindex{Py_PRINT_RAW};
3924if given, the \function{str()} of the object is written instead of the
3925\function{repr()}. Returns \code{0} on success or \code{-1} on
3926failure; the appropriate exception will be set.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003927\end{cfuncdesc}
3928
Fred Drakec6fa34e1998-04-02 06:47:24 +00003929\begin{cfuncdesc}{int}{PyFile_WriteString}{char *s, PyFileObject *p,
3930 int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00003931Writes string \var{s} to file object \var{p}. Returns \code{0} on
3932success or \code{-1} on failure; the appropriate exception will be
3933set.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003934\end{cfuncdesc}
3935
3936
Fred Drake5838d0f2001-01-28 06:39:35 +00003937\subsection{Instance Objects \label{instanceObjects}}
3938
3939\obindex{instance}
3940There are very few functions specific to instance objects.
3941
3942\begin{cvardesc}{PyTypeObject}{PyInstance_Type}
3943 Type object for class instances.
3944\end{cvardesc}
3945
3946\begin{cfuncdesc}{int}{PyInstance_Check}{PyObject *obj}
3947 Returns true if \var{obj} is an instance.
3948\end{cfuncdesc}
3949
3950\begin{cfuncdesc}{PyObject*}{PyInstance_New}{PyObject *class,
3951 PyObject *arg,
3952 PyObject *kw}
3953 Create a new instance of a specific class. The parameters \var{arg}
3954 and \var{kw} are used as the positional and keyword parameters to
3955 the object's constructor.
3956\end{cfuncdesc}
3957
3958\begin{cfuncdesc}{PyObject*}{PyInstance_NewRaw}{PyObject *class,
3959 PyObject *dict}
3960 Create a new instance of a specific class without calling it's
3961 constructor. \var{class} is the class of new object. The
3962 \var{dict} parameter will be used as the object's \member{__dict__};
3963 if \NULL, a new dictionary will be created for the instance.
3964\end{cfuncdesc}
3965
3966
Fred Drakeefd146c1999-02-15 15:30:45 +00003967\subsection{Module Objects \label{moduleObjects}}
3968
3969\obindex{module}
3970There are only a few functions special to module objects.
3971
Fred Drake659ebfa2000-04-03 15:42:13 +00003972\begin{cvardesc}{PyTypeObject}{PyModule_Type}
3973This instance of \ctype{PyTypeObject} represents the Python module
3974type. This is exposed to Python programs as \code{types.ModuleType}.
3975\withsubitem{(in module types)}{\ttindex{ModuleType}}
3976\end{cvardesc}
3977
3978\begin{cfuncdesc}{int}{PyModule_Check}{PyObject *p}
3979Returns true if its argument is a module object.
Fred Drakeefd146c1999-02-15 15:30:45 +00003980\end{cfuncdesc}
3981
Fred Drake659ebfa2000-04-03 15:42:13 +00003982\begin{cfuncdesc}{PyObject*}{PyModule_New}{char *name}
3983Return a new module object with the \member{__name__} attribute set to
3984\var{name}. Only the module's \member{__doc__} and
3985\member{__name__} attributes are filled in; the caller is responsible
3986for providing a \member{__file__} attribute.
3987\withsubitem{(module attribute)}{
3988 \ttindex{__name__}\ttindex{__doc__}\ttindex{__file__}}
3989\end{cfuncdesc}
3990
3991\begin{cfuncdesc}{PyObject*}{PyModule_GetDict}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00003992Return the dictionary object that implements \var{module}'s namespace;
3993this object is the same as the \member{__dict__} attribute of the
3994module object. This function never fails.
Fred Drake659ebfa2000-04-03 15:42:13 +00003995\withsubitem{(module attribute)}{\ttindex{__dict__}}
Fred Drakeefd146c1999-02-15 15:30:45 +00003996\end{cfuncdesc}
3997
Fred Drake659ebfa2000-04-03 15:42:13 +00003998\begin{cfuncdesc}{char*}{PyModule_GetName}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00003999Return \var{module}'s \member{__name__} value. If the module does not
Fred Drake659ebfa2000-04-03 15:42:13 +00004000provide one, or if it is not a string, \exception{SystemError} is
4001raised and \NULL{} is returned.
4002\withsubitem{(module attribute)}{\ttindex{__name__}}
4003\withsubitem{(built-in exception)}{\ttindex{SystemError}}
Fred Drakeefd146c1999-02-15 15:30:45 +00004004\end{cfuncdesc}
4005
Fred Drake659ebfa2000-04-03 15:42:13 +00004006\begin{cfuncdesc}{char*}{PyModule_GetFilename}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00004007Return the name of the file from which \var{module} was loaded using
4008\var{module}'s \member{__file__} attribute. If this is not defined,
Fred Drake659ebfa2000-04-03 15:42:13 +00004009or if it is not a string, raise \exception{SystemError} and return
4010\NULL.
4011\withsubitem{(module attribute)}{\ttindex{__file__}}
4012\withsubitem{(built-in exception)}{\ttindex{SystemError}}
Fred Drakeefd146c1999-02-15 15:30:45 +00004013\end{cfuncdesc}
4014
Fred Drake891150b2000-09-23 03:25:42 +00004015\begin{cfuncdesc}{int}{PyModule_AddObject}{PyObject *module,
4016 char *name, PyObject *value}
4017Add an object to \var{module} as \var{name}. This is a convenience
4018function which can be used from the module's initialization function.
4019This steals a reference to \var{value}. Returns \code{-1} on error,
4020\code{0} on success.
4021\versionadded{2.0}
4022\end{cfuncdesc}
4023
4024\begin{cfuncdesc}{int}{PyModule_AddIntConstant}{PyObject *module,
4025 char *name, int value}
4026Add an integer constant to \var{module} as \var{name}. This convenience
4027function can be used from the module's initialization function.
4028Returns \code{-1} on error, \code{0} on success.
4029\versionadded{2.0}
4030\end{cfuncdesc}
4031
4032\begin{cfuncdesc}{int}{PyModule_AddStringConstant}{PyObject *module,
4033 char *name, char *value}
4034Add a string constant to \var{module} as \var{name}. This convenience
4035function can be used from the module's initialization function. The
4036string \var{value} must be null-terminated. Returns \code{-1} on
4037error, \code{0} on success.
4038\versionadded{2.0}
4039\end{cfuncdesc}
4040
Fred Drakeefd146c1999-02-15 15:30:45 +00004041
4042\subsection{CObjects \label{cObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00004043
Fred Drake659ebfa2000-04-03 15:42:13 +00004044\obindex{CObject}
4045Refer to \emph{Extending and Embedding the Python Interpreter},
4046section 1.12 (``Providing a C API for an Extension Module''), for more
4047information on using these objects.
4048
4049
Guido van Rossum44475131998-04-21 15:30:01 +00004050\begin{ctypedesc}{PyCObject}
Fred Drakef8830d11998-04-23 14:06:01 +00004051This subtype of \ctype{PyObject} represents an opaque value, useful for
Fred Drake659ebfa2000-04-03 15:42:13 +00004052C extension modules who need to pass an opaque value (as a
4053\ctype{void*} pointer) through Python code to other C code. It is
Guido van Rossum44475131998-04-21 15:30:01 +00004054often used to make a C function pointer defined in one module
4055available to other modules, so the regular import mechanism can be
4056used to access C APIs defined in dynamically loaded modules.
4057\end{ctypedesc}
Fred Drakee5bf8b21998-02-12 21:22:28 +00004058
Fred Drake659ebfa2000-04-03 15:42:13 +00004059\begin{cfuncdesc}{int}{PyCObject_Check}{PyObject *p}
4060Returns true if its argument is a \ctype{PyCObject}.
4061\end{cfuncdesc}
4062
4063\begin{cfuncdesc}{PyObject*}{PyCObject_FromVoidPtr}{void* cobj,
Marc-André Lemburga544ea22001-01-17 18:04:31 +00004064 void (*destr)(void *)}
Fred Drake1d158692000-06-18 05:21:21 +00004065Creates a \ctype{PyCObject} from the \code{void *}\var{cobj}. The
Fred Drakedab44681999-05-13 18:41:14 +00004066\var{destr} function will be called when the object is reclaimed, unless
4067it is \NULL.
Guido van Rossum44475131998-04-21 15:30:01 +00004068\end{cfuncdesc}
4069
Fred Drake659ebfa2000-04-03 15:42:13 +00004070\begin{cfuncdesc}{PyObject*}{PyCObject_FromVoidPtrAndDesc}{void* cobj,
Marc-André Lemburga544ea22001-01-17 18:04:31 +00004071 void* desc, void (*destr)(void *, void *) }
Fred Drakef8830d11998-04-23 14:06:01 +00004072Creates a \ctype{PyCObject} from the \ctype{void *}\var{cobj}. The
4073\var{destr} function will be called when the object is reclaimed. The
4074\var{desc} argument can be used to pass extra callback data for the
4075destructor function.
Guido van Rossum44475131998-04-21 15:30:01 +00004076\end{cfuncdesc}
4077
Fred Drake659ebfa2000-04-03 15:42:13 +00004078\begin{cfuncdesc}{void*}{PyCObject_AsVoidPtr}{PyObject* self}
4079Returns the object \ctype{void *} that the
4080\ctype{PyCObject} \var{self} was created with.
Guido van Rossum44475131998-04-21 15:30:01 +00004081\end{cfuncdesc}
4082
Fred Drake659ebfa2000-04-03 15:42:13 +00004083\begin{cfuncdesc}{void*}{PyCObject_GetDesc}{PyObject* self}
4084Returns the description \ctype{void *} that the
4085\ctype{PyCObject} \var{self} was created with.
Guido van Rossum44475131998-04-21 15:30:01 +00004086\end{cfuncdesc}
Fred Drakee5bf8b21998-02-12 21:22:28 +00004087
Fred Drake659ebfa2000-04-03 15:42:13 +00004088
Fred Drakeefd146c1999-02-15 15:30:45 +00004089\chapter{Initialization, Finalization, and Threads
4090 \label{initialization}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004091
Guido van Rossum4a944d71997-08-14 20:35:38 +00004092\begin{cfuncdesc}{void}{Py_Initialize}{}
4093Initialize the Python interpreter. In an application embedding
4094Python, this should be called before using any other Python/C API
Fred Drake659ebfa2000-04-03 15:42:13 +00004095functions; with the exception of
4096\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()},
4097\cfunction{PyEval_InitThreads()}\ttindex{PyEval_InitThreads()},
4098\cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()},
4099and \cfunction{PyEval_AcquireLock()}\ttindex{PyEval_AcquireLock()}.
4100This initializes the table of loaded modules (\code{sys.modules}), and
4101\withsubitem{(in module sys)}{\ttindex{modules}\ttindex{path}}creates the
4102fundamental modules \module{__builtin__}\refbimodindex{__builtin__},
Fred Drake4de05a91998-02-16 14:25:26 +00004103\module{__main__}\refbimodindex{__main__} and
4104\module{sys}\refbimodindex{sys}. It also initializes the module
Fred Drake659ebfa2000-04-03 15:42:13 +00004105search\indexiii{module}{search}{path} path (\code{sys.path}).
4106It does not set \code{sys.argv}; use
4107\cfunction{PySys_SetArgv()}\ttindex{PySys_SetArgv()} for that. This
4108is a no-op when called for a second time (without calling
4109\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} first). There is no
4110return value; it is a fatal error if the initialization fails.
Guido van Rossum42cefd01997-10-05 15:27:29 +00004111\end{cfuncdesc}
4112
4113\begin{cfuncdesc}{int}{Py_IsInitialized}{}
Guido van Rossum42cefd01997-10-05 15:27:29 +00004114Return true (nonzero) when the Python interpreter has been
Fred Drakee058b4f1998-02-16 06:15:35 +00004115initialized, false (zero) if not. After \cfunction{Py_Finalize()} is
4116called, this returns false until \cfunction{Py_Initialize()} is called
Guido van Rossum42cefd01997-10-05 15:27:29 +00004117again.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004118\end{cfuncdesc}
4119
4120\begin{cfuncdesc}{void}{Py_Finalize}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00004121Undo all initializations made by \cfunction{Py_Initialize()} and
4122subsequent use of Python/C API functions, and destroy all
4123sub-interpreters (see \cfunction{Py_NewInterpreter()} below) that were
4124created and not yet destroyed since the last call to
4125\cfunction{Py_Initialize()}. Ideally, this frees all memory allocated
4126by the Python interpreter. This is a no-op when called for a second
4127time (without calling \cfunction{Py_Initialize()} again first). There
4128is no return value; errors during finalization are ignored.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004129
4130This function is provided for a number of reasons. An embedding
4131application might want to restart Python without having to restart the
4132application itself. An application that has loaded the Python
4133interpreter from a dynamically loadable library (or DLL) might want to
4134free all memory allocated by Python before unloading the DLL. During a
4135hunt for memory leaks in an application a developer might want to free
4136all memory allocated by Python before exiting from the application.
4137
Fred Drakee058b4f1998-02-16 06:15:35 +00004138\strong{Bugs and caveats:} The destruction of modules and objects in
Guido van Rossum4a944d71997-08-14 20:35:38 +00004139modules is done in random order; this may cause destructors
Fred Drakee058b4f1998-02-16 06:15:35 +00004140(\method{__del__()} methods) to fail when they depend on other objects
Guido van Rossum4a944d71997-08-14 20:35:38 +00004141(even functions) or modules. Dynamically loaded extension modules
4142loaded by Python are not unloaded. Small amounts of memory allocated
4143by the Python interpreter may not be freed (if you find a leak, please
4144report it). Memory tied up in circular references between objects is
4145not freed. Some memory allocated by extension modules may not be
4146freed. Some extension may not work properly if their initialization
4147routine is called more than once; this can happen if an applcation
Fred Drakee058b4f1998-02-16 06:15:35 +00004148calls \cfunction{Py_Initialize()} and \cfunction{Py_Finalize()} more
4149than once.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004150\end{cfuncdesc}
4151
Fred Drakec6fa34e1998-04-02 06:47:24 +00004152\begin{cfuncdesc}{PyThreadState*}{Py_NewInterpreter}{}
Fred Drake4de05a91998-02-16 14:25:26 +00004153Create a new sub-interpreter. This is an (almost) totally separate
4154environment for the execution of Python code. In particular, the new
4155interpreter has separate, independent versions of all imported
4156modules, including the fundamental modules
4157\module{__builtin__}\refbimodindex{__builtin__},
4158\module{__main__}\refbimodindex{__main__} and
4159\module{sys}\refbimodindex{sys}. The table of loaded modules
4160(\code{sys.modules}) and the module search path (\code{sys.path}) are
4161also separate. The new environment has no \code{sys.argv} variable.
4162It has new standard I/O stream file objects \code{sys.stdin},
4163\code{sys.stdout} and \code{sys.stderr} (however these refer to the
Fred Drake659ebfa2000-04-03 15:42:13 +00004164same underlying \ctype{FILE} structures in the C library).
4165\withsubitem{(in module sys)}{
4166 \ttindex{stdout}\ttindex{stderr}\ttindex{stdin}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004167
4168The return value points to the first thread state created in the new
4169sub-interpreter. This thread state is made the current thread state.
4170Note that no actual thread is created; see the discussion of thread
4171states below. If creation of the new interpreter is unsuccessful,
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004172\NULL{} is returned; no exception is set since the exception state
Guido van Rossum4a944d71997-08-14 20:35:38 +00004173is stored in the current thread state and there may not be a current
4174thread state. (Like all other Python/C API functions, the global
4175interpreter lock must be held before calling this function and is
4176still held when it returns; however, unlike most other Python/C API
4177functions, there needn't be a current thread state on entry.)
4178
4179Extension modules are shared between (sub-)interpreters as follows:
4180the first time a particular extension is imported, it is initialized
4181normally, and a (shallow) copy of its module's dictionary is
4182squirreled away. When the same extension is imported by another
4183(sub-)interpreter, a new module is initialized and filled with the
Fred Drakee058b4f1998-02-16 06:15:35 +00004184contents of this copy; the extension's \code{init} function is not
4185called. Note that this is different from what happens when an
4186extension is imported after the interpreter has been completely
Fred Drake659ebfa2000-04-03 15:42:13 +00004187re-initialized by calling
4188\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} and
4189\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}; in that case,
4190the extension's \code{init\var{module}} function \emph{is} called
4191again.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004192
Fred Drakee058b4f1998-02-16 06:15:35 +00004193\strong{Bugs and caveats:} Because sub-interpreters (and the main
Guido van Rossum4a944d71997-08-14 20:35:38 +00004194interpreter) are part of the same process, the insulation between them
Fred Drakee058b4f1998-02-16 06:15:35 +00004195isn't perfect --- for example, using low-level file operations like
Fred Drake659ebfa2000-04-03 15:42:13 +00004196\withsubitem{(in module os)}{\ttindex{close()}}
Fred Drakef8830d11998-04-23 14:06:01 +00004197\function{os.close()} they can (accidentally or maliciously) affect each
Guido van Rossum4a944d71997-08-14 20:35:38 +00004198other's open files. Because of the way extensions are shared between
4199(sub-)interpreters, some extensions may not work properly; this is
4200especially likely when the extension makes use of (static) global
4201variables, or when the extension manipulates its module's dictionary
4202after its initialization. It is possible to insert objects created in
4203one sub-interpreter into a namespace of another sub-interpreter; this
4204should be done with great care to avoid sharing user-defined
4205functions, methods, instances or classes between sub-interpreters,
4206since import operations executed by such objects may affect the
4207wrong (sub-)interpreter's dictionary of loaded modules. (XXX This is
4208a hard-to-fix bug that will be addressed in a future release.)
4209\end{cfuncdesc}
4210
4211\begin{cfuncdesc}{void}{Py_EndInterpreter}{PyThreadState *tstate}
4212Destroy the (sub-)interpreter represented by the given thread state.
4213The given thread state must be the current thread state. See the
4214discussion of thread states below. When the call returns, the current
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004215thread state is \NULL{}. All thread states associated with this
Guido van Rossum4a944d71997-08-14 20:35:38 +00004216interpreted are destroyed. (The global interpreter lock must be held
4217before calling this function and is still held when it returns.)
Fred Drake659ebfa2000-04-03 15:42:13 +00004218\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} will destroy all
4219sub-interpreters that haven't been explicitly destroyed at that point.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004220\end{cfuncdesc}
4221
4222\begin{cfuncdesc}{void}{Py_SetProgramName}{char *name}
Fred Drake659ebfa2000-04-03 15:42:13 +00004223This function should be called before
4224\cfunction{Py_Initialize()}\ttindex{Py_Initialize()} is called
Guido van Rossum4a944d71997-08-14 20:35:38 +00004225for the first time, if it is called at all. It tells the interpreter
Fred Drake659ebfa2000-04-03 15:42:13 +00004226the value of the \code{argv[0]} argument to the
4227\cfunction{main()}\ttindex{main()} function of the program. This is
4228used by \cfunction{Py_GetPath()}\ttindex{Py_GetPath()} and some other
Guido van Rossum4a944d71997-08-14 20:35:38 +00004229functions below to find the Python run-time libraries relative to the
Fred Drakea8455ab2000-06-16 19:58:42 +00004230interpreter executable. The default value is \code{'python'}. The
Guido van Rossum4a944d71997-08-14 20:35:38 +00004231argument should point to a zero-terminated character string in static
4232storage whose contents will not change for the duration of the
4233program's execution. No code in the Python interpreter will change
4234the contents of this storage.
4235\end{cfuncdesc}
4236
Fred Drakec6fa34e1998-04-02 06:47:24 +00004237\begin{cfuncdesc}{char*}{Py_GetProgramName}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00004238Return the program name set with
4239\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()}, or the
Guido van Rossum4a944d71997-08-14 20:35:38 +00004240default. The returned string points into static storage; the caller
4241should not modify its value.
4242\end{cfuncdesc}
4243
Fred Drakec6fa34e1998-04-02 06:47:24 +00004244\begin{cfuncdesc}{char*}{Py_GetPrefix}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00004245Return the \emph{prefix} for installed platform-independent files. This
Guido van Rossum4a944d71997-08-14 20:35:38 +00004246is derived through a number of complicated rules from the program name
Fred Drakee058b4f1998-02-16 06:15:35 +00004247set with \cfunction{Py_SetProgramName()} and some environment variables;
Fred Drakea8455ab2000-06-16 19:58:42 +00004248for example, if the program name is \code{'/usr/local/bin/python'},
4249the prefix is \code{'/usr/local'}. The returned string points into
Guido van Rossum4a944d71997-08-14 20:35:38 +00004250static storage; the caller should not modify its value. This
Fred Drakec94d9341998-04-12 02:39:13 +00004251corresponds to the \makevar{prefix} variable in the top-level
Fred Drakea8455ab2000-06-16 19:58:42 +00004252\file{Makefile} and the \longprogramopt{prefix} argument to the
Fred Drakee058b4f1998-02-16 06:15:35 +00004253\program{configure} script at build time. The value is available to
Fred Drakeb0a78731998-01-13 18:51:10 +00004254Python code as \code{sys.prefix}. It is only useful on \UNIX{}. See
Guido van Rossum4a944d71997-08-14 20:35:38 +00004255also the next function.
4256\end{cfuncdesc}
4257
Fred Drakec6fa34e1998-04-02 06:47:24 +00004258\begin{cfuncdesc}{char*}{Py_GetExecPrefix}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00004259Return the \emph{exec-prefix} for installed platform-\emph{de}pendent
Guido van Rossum4a944d71997-08-14 20:35:38 +00004260files. This is derived through a number of complicated rules from the
Fred Drakee058b4f1998-02-16 06:15:35 +00004261program name set with \cfunction{Py_SetProgramName()} and some environment
Guido van Rossum4a944d71997-08-14 20:35:38 +00004262variables; for example, if the program name is
Fred Drakea8455ab2000-06-16 19:58:42 +00004263\code{'/usr/local/bin/python'}, the exec-prefix is
4264\code{'/usr/local'}. The returned string points into static storage;
Guido van Rossum4a944d71997-08-14 20:35:38 +00004265the caller should not modify its value. This corresponds to the
Fred Drakec94d9341998-04-12 02:39:13 +00004266\makevar{exec_prefix} variable in the top-level \file{Makefile} and the
Fred Drakea8455ab2000-06-16 19:58:42 +00004267\longprogramopt{exec-prefix} argument to the
Fred Drake310ee611999-11-09 17:31:42 +00004268\program{configure} script at build time. The value is available to
4269Python code as \code{sys.exec_prefix}. It is only useful on \UNIX{}.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004270
4271Background: The exec-prefix differs from the prefix when platform
4272dependent files (such as executables and shared libraries) are
4273installed in a different directory tree. In a typical installation,
4274platform dependent files may be installed in the
Fred Drakea8455ab2000-06-16 19:58:42 +00004275\file{/usr/local/plat} subtree while platform independent may be
4276installed in \file{/usr/local}.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004277
4278Generally speaking, a platform is a combination of hardware and
4279software families, e.g. Sparc machines running the Solaris 2.x
4280operating system are considered the same platform, but Intel machines
4281running Solaris 2.x are another platform, and Intel machines running
4282Linux are yet another platform. Different major revisions of the same
Fred Drakeb0a78731998-01-13 18:51:10 +00004283operating system generally also form different platforms. Non-\UNIX{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004284operating systems are a different story; the installation strategies
4285on those systems are so different that the prefix and exec-prefix are
4286meaningless, and set to the empty string. Note that compiled Python
4287bytecode files are platform independent (but not independent from the
4288Python version by which they were compiled!).
4289
Fred Drakee058b4f1998-02-16 06:15:35 +00004290System administrators will know how to configure the \program{mount} or
Fred Drakea8455ab2000-06-16 19:58:42 +00004291\program{automount} programs to share \file{/usr/local} between platforms
4292while having \file{/usr/local/plat} be a different filesystem for each
Guido van Rossum4a944d71997-08-14 20:35:38 +00004293platform.
4294\end{cfuncdesc}
4295
Fred Drakec6fa34e1998-04-02 06:47:24 +00004296\begin{cfuncdesc}{char*}{Py_GetProgramFullPath}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004297Return the full program name of the Python executable; this is
4298computed as a side-effect of deriving the default module search path
Fred Drake659ebfa2000-04-03 15:42:13 +00004299from the program name (set by
4300\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()} above).
4301The returned string points into static storage; the caller should not
Guido van Rossum4a944d71997-08-14 20:35:38 +00004302modify its value. The value is available to Python code as
Guido van Rossum42cefd01997-10-05 15:27:29 +00004303\code{sys.executable}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004304\withsubitem{(in module sys)}{\ttindex{executable}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004305\end{cfuncdesc}
4306
Fred Drakec6fa34e1998-04-02 06:47:24 +00004307\begin{cfuncdesc}{char*}{Py_GetPath}{}
Fred Drake4de05a91998-02-16 14:25:26 +00004308\indexiii{module}{search}{path}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004309Return the default module search path; this is computed from the
Fred Drakee058b4f1998-02-16 06:15:35 +00004310program name (set by \cfunction{Py_SetProgramName()} above) and some
Guido van Rossum4a944d71997-08-14 20:35:38 +00004311environment variables. The returned string consists of a series of
4312directory names separated by a platform dependent delimiter character.
Fred Drakef8830d11998-04-23 14:06:01 +00004313The delimiter character is \character{:} on \UNIX{}, \character{;} on
Fred Drake659ebfa2000-04-03 15:42:13 +00004314DOS/Windows, and \character{\e n} (the \ASCII{} newline character) on
Fred Drakee5bc4971998-02-12 23:36:49 +00004315Macintosh. The returned string points into static storage; the caller
Guido van Rossum4a944d71997-08-14 20:35:38 +00004316should not modify its value. The value is available to Python code
Fred Drake659ebfa2000-04-03 15:42:13 +00004317as the list \code{sys.path}\withsubitem{(in module sys)}{\ttindex{path}},
4318which may be modified to change the future search path for loaded
4319modules.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004320
4321% XXX should give the exact rules
4322\end{cfuncdesc}
4323
Fred Drakec6fa34e1998-04-02 06:47:24 +00004324\begin{cfuncdesc}{const char*}{Py_GetVersion}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004325Return the version of this Python interpreter. This is a string that
4326looks something like
4327
Guido van Rossum09270b51997-08-15 18:57:32 +00004328\begin{verbatim}
Fred Drakee058b4f1998-02-16 06:15:35 +00004329"1.5 (#67, Dec 31 1997, 22:34:28) [GCC 2.7.2.2]"
Guido van Rossum09270b51997-08-15 18:57:32 +00004330\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004331
4332The first word (up to the first space character) is the current Python
4333version; the first three characters are the major and minor version
4334separated by a period. The returned string points into static storage;
4335the caller should not modify its value. The value is available to
4336Python code as the list \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004337\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004338\end{cfuncdesc}
4339
Fred Drakec6fa34e1998-04-02 06:47:24 +00004340\begin{cfuncdesc}{const char*}{Py_GetPlatform}{}
Fred Drakeb0a78731998-01-13 18:51:10 +00004341Return the platform identifier for the current platform. On \UNIX{},
Guido van Rossum4a944d71997-08-14 20:35:38 +00004342this is formed from the ``official'' name of the operating system,
4343converted to lower case, followed by the major revision number; e.g.,
4344for Solaris 2.x, which is also known as SunOS 5.x, the value is
Fred Drakea8455ab2000-06-16 19:58:42 +00004345\code{'sunos5'}. On Macintosh, it is \code{'mac'}. On Windows, it
4346is \code{'win'}. The returned string points into static storage;
Guido van Rossum4a944d71997-08-14 20:35:38 +00004347the caller should not modify its value. The value is available to
4348Python code as \code{sys.platform}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004349\withsubitem{(in module sys)}{\ttindex{platform}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004350\end{cfuncdesc}
4351
Fred Drakec6fa34e1998-04-02 06:47:24 +00004352\begin{cfuncdesc}{const char*}{Py_GetCopyright}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004353Return the official copyright string for the current Python version,
4354for example
4355
Fred Drakea8455ab2000-06-16 19:58:42 +00004356\code{'Copyright 1991-1995 Stichting Mathematisch Centrum, Amsterdam'}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004357
4358The returned string points into static storage; the caller should not
4359modify its value. The value is available to Python code as the list
4360\code{sys.copyright}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004361\withsubitem{(in module sys)}{\ttindex{copyright}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004362\end{cfuncdesc}
4363
Fred Drakec6fa34e1998-04-02 06:47:24 +00004364\begin{cfuncdesc}{const char*}{Py_GetCompiler}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004365Return an indication of the compiler used to build the current Python
Fred Drakee058b4f1998-02-16 06:15:35 +00004366version, in square brackets, for example:
Guido van Rossum4a944d71997-08-14 20:35:38 +00004367
Fred Drakee058b4f1998-02-16 06:15:35 +00004368\begin{verbatim}
4369"[GCC 2.7.2.2]"
4370\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004371
4372The returned string points into static storage; the caller should not
4373modify its value. The value is available to Python code as part of
4374the variable \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004375\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004376\end{cfuncdesc}
4377
Fred Drakec6fa34e1998-04-02 06:47:24 +00004378\begin{cfuncdesc}{const char*}{Py_GetBuildInfo}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004379Return information about the sequence number and build date and time
4380of the current Python interpreter instance, for example
4381
Guido van Rossum09270b51997-08-15 18:57:32 +00004382\begin{verbatim}
4383"#67, Aug 1 1997, 22:34:28"
4384\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004385
4386The returned string points into static storage; the caller should not
4387modify its value. The value is available to Python code as part of
4388the variable \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004389\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004390\end{cfuncdesc}
4391
4392\begin{cfuncdesc}{int}{PySys_SetArgv}{int argc, char **argv}
Fred Drake659ebfa2000-04-03 15:42:13 +00004393Set \code{sys.argv} based on \var{argc} and \var{argv}. These
4394parameters are similar to those passed to the program's
4395\cfunction{main()}\ttindex{main()} function with the difference that
4396the first entry should refer to the script file to be executed rather
4397than the executable hosting the Python interpreter. If there isn't a
4398script that will be run, the first entry in \var{argv} can be an empty
4399string. If this function fails to initialize \code{sys.argv}, a fatal
4400condition is signalled using
4401\cfunction{Py_FatalError()}\ttindex{Py_FatalError()}.
4402\withsubitem{(in module sys)}{\ttindex{argv}}
4403% XXX impl. doesn't seem consistent in allowing 0/NULL for the params;
4404% check w/ Guido.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004405\end{cfuncdesc}
4406
4407% XXX Other PySys thingies (doesn't really belong in this chapter)
4408
Fred Drakeefd146c1999-02-15 15:30:45 +00004409\section{Thread State and the Global Interpreter Lock
4410 \label{threads}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004411
Fred Drake659ebfa2000-04-03 15:42:13 +00004412\index{global interpreter lock}
4413\index{interpreter lock}
4414\index{lock, interpreter}
4415
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004416The Python interpreter is not fully thread safe. In order to support
4417multi-threaded Python programs, there's a global lock that must be
4418held by the current thread before it can safely access Python objects.
4419Without the lock, even the simplest operations could cause problems in
Fred Drake7baf3d41998-02-20 00:45:52 +00004420a multi-threaded program: for example, when two threads simultaneously
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004421increment the reference count of the same object, the reference count
4422could end up being incremented only once instead of twice.
4423
4424Therefore, the rule exists that only the thread that has acquired the
4425global interpreter lock may operate on Python objects or call Python/C
4426API functions. In order to support multi-threaded Python programs,
Fred Drake659ebfa2000-04-03 15:42:13 +00004427the interpreter regularly releases and reacquires the lock --- by
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004428default, every ten bytecode instructions (this can be changed with
Fred Drake659ebfa2000-04-03 15:42:13 +00004429\withsubitem{(in module sys)}{\ttindex{setcheckinterval()}}
Fred Drakee058b4f1998-02-16 06:15:35 +00004430\function{sys.setcheckinterval()}). The lock is also released and
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004431reacquired around potentially blocking I/O operations like reading or
4432writing a file, so that other threads can run while the thread that
4433requests the I/O is waiting for the I/O operation to complete.
4434
4435The Python interpreter needs to keep some bookkeeping information
Fred Drakee058b4f1998-02-16 06:15:35 +00004436separate per thread --- for this it uses a data structure called
Fred Drake659ebfa2000-04-03 15:42:13 +00004437\ctype{PyThreadState}\ttindex{PyThreadState}. This is new in Python
44381.5; in earlier versions, such state was stored in global variables,
4439and switching threads could cause problems. In particular, exception
4440handling is now thread safe, when the application uses
4441\withsubitem{(in module sys)}{\ttindex{exc_info()}}
4442\function{sys.exc_info()} to access the exception last raised in the
4443current thread.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004444
4445There's one global variable left, however: the pointer to the current
Fred Drake659ebfa2000-04-03 15:42:13 +00004446\ctype{PyThreadState}\ttindex{PyThreadState} structure. While most
4447thread packages have a way to store ``per-thread global data,''
4448Python's internal platform independent thread abstraction doesn't
4449support this yet. Therefore, the current thread state must be
4450manipulated explicitly.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004451
4452This is easy enough in most cases. Most code manipulating the global
4453interpreter lock has the following simple structure:
4454
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004455\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004456Save the thread state in a local variable.
4457Release the interpreter lock.
4458...Do some blocking I/O operation...
4459Reacquire the interpreter lock.
4460Restore the thread state from the local variable.
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004461\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004462
4463This is so common that a pair of macros exists to simplify it:
4464
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004465\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004466Py_BEGIN_ALLOW_THREADS
4467...Do some blocking I/O operation...
4468Py_END_ALLOW_THREADS
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004469\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004470
Fred Drake659ebfa2000-04-03 15:42:13 +00004471The \code{Py_BEGIN_ALLOW_THREADS}\ttindex{Py_BEGIN_ALLOW_THREADS} macro
4472opens a new block and declares a hidden local variable; the
4473\code{Py_END_ALLOW_THREADS}\ttindex{Py_END_ALLOW_THREADS} macro closes
Fred Drakee058b4f1998-02-16 06:15:35 +00004474the block. Another advantage of using these two macros is that when
4475Python is compiled without thread support, they are defined empty,
4476thus saving the thread state and lock manipulations.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004477
4478When thread support is enabled, the block above expands to the
4479following code:
4480
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004481\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004482 PyThreadState *_save;
Fred Drake659ebfa2000-04-03 15:42:13 +00004483
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004484 _save = PyEval_SaveThread();
4485 ...Do some blocking I/O operation...
4486 PyEval_RestoreThread(_save);
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004487\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004488
4489Using even lower level primitives, we can get roughly the same effect
4490as follows:
4491
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004492\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004493 PyThreadState *_save;
Fred Drake659ebfa2000-04-03 15:42:13 +00004494
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004495 _save = PyThreadState_Swap(NULL);
4496 PyEval_ReleaseLock();
4497 ...Do some blocking I/O operation...
4498 PyEval_AcquireLock();
4499 PyThreadState_Swap(_save);
Guido van Rossum9faf4c51997-10-07 14:38:54 +00004500\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004501
4502There are some subtle differences; in particular,
Fred Drake659ebfa2000-04-03 15:42:13 +00004503\cfunction{PyEval_RestoreThread()}\ttindex{PyEval_RestoreThread()} saves
4504and restores the value of the global variable
4505\cdata{errno}\ttindex{errno}, since the lock manipulation does not
Fred Drakef8830d11998-04-23 14:06:01 +00004506guarantee that \cdata{errno} is left alone. Also, when thread support
Fred Drake659ebfa2000-04-03 15:42:13 +00004507is disabled,
4508\cfunction{PyEval_SaveThread()}\ttindex{PyEval_SaveThread()} and
Fred Drakee058b4f1998-02-16 06:15:35 +00004509\cfunction{PyEval_RestoreThread()} don't manipulate the lock; in this
Fred Drake659ebfa2000-04-03 15:42:13 +00004510case, \cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()} and
4511\cfunction{PyEval_AcquireLock()}\ttindex{PyEval_AcquireLock()} are not
4512available. This is done so that dynamically loaded extensions
4513compiled with thread support enabled can be loaded by an interpreter
4514that was compiled with disabled thread support.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004515
4516The global interpreter lock is used to protect the pointer to the
4517current thread state. When releasing the lock and saving the thread
4518state, the current thread state pointer must be retrieved before the
4519lock is released (since another thread could immediately acquire the
4520lock and store its own thread state in the global variable).
Fred Drakeffe58ca2000-09-29 17:31:54 +00004521Conversely, when acquiring the lock and restoring the thread state,
4522the lock must be acquired before storing the thread state pointer.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004523
4524Why am I going on with so much detail about this? Because when
Fred Drake659ebfa2000-04-03 15:42:13 +00004525threads are created from C, they don't have the global interpreter
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004526lock, nor is there a thread state data structure for them. Such
4527threads must bootstrap themselves into existence, by first creating a
4528thread state data structure, then acquiring the lock, and finally
4529storing their thread state pointer, before they can start using the
4530Python/C API. When they are done, they should reset the thread state
4531pointer, release the lock, and finally free their thread state data
4532structure.
4533
4534When creating a thread data structure, you need to provide an
4535interpreter state data structure. The interpreter state data
4536structure hold global data that is shared by all threads in an
4537interpreter, for example the module administration
4538(\code{sys.modules}). Depending on your needs, you can either create
4539a new interpreter state data structure, or share the interpreter state
4540data structure used by the Python main thread (to access the latter,
Fred Drakef8830d11998-04-23 14:06:01 +00004541you must obtain the thread state and access its \member{interp} member;
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004542this must be done by a thread that is created by Python or by the main
4543thread after Python is initialized).
4544
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004545
4546\begin{ctypedesc}{PyInterpreterState}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004547This data structure represents the state shared by a number of
4548cooperating threads. Threads belonging to the same interpreter
4549share their module administration and a few other internal items.
4550There are no public members in this structure.
4551
4552Threads belonging to different interpreters initially share nothing,
4553except process state like available memory, open file descriptors and
4554such. The global interpreter lock is also shared by all threads,
4555regardless of to which interpreter they belong.
4556\end{ctypedesc}
4557
4558\begin{ctypedesc}{PyThreadState}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004559This data structure represents the state of a single thread. The only
Fred Drakef8830d11998-04-23 14:06:01 +00004560public data member is \ctype{PyInterpreterState *}\member{interp},
4561which points to this thread's interpreter state.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004562\end{ctypedesc}
4563
4564\begin{cfuncdesc}{void}{PyEval_InitThreads}{}
4565Initialize and acquire the global interpreter lock. It should be
4566called in the main thread before creating a second thread or engaging
Fred Drakee058b4f1998-02-16 06:15:35 +00004567in any other thread operations such as
Fred Drake659ebfa2000-04-03 15:42:13 +00004568\cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()} or
4569\code{PyEval_ReleaseThread(\var{tstate})}\ttindex{PyEval_ReleaseThread()}.
4570It is not needed before calling
4571\cfunction{PyEval_SaveThread()}\ttindex{PyEval_SaveThread()} or
4572\cfunction{PyEval_RestoreThread()}\ttindex{PyEval_RestoreThread()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004573
4574This is a no-op when called for a second time. It is safe to call
Fred Drake659ebfa2000-04-03 15:42:13 +00004575this function before calling
4576\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004577
4578When only the main thread exists, no lock operations are needed. This
4579is a common situation (most Python programs do not use threads), and
4580the lock operations slow the interpreter down a bit. Therefore, the
4581lock is not created initially. This situation is equivalent to having
4582acquired the lock: when there is only a single thread, all object
4583accesses are safe. Therefore, when this function initializes the
Fred Drake4de05a91998-02-16 14:25:26 +00004584lock, it also acquires it. Before the Python
4585\module{thread}\refbimodindex{thread} module creates a new thread,
4586knowing that either it has the lock or the lock hasn't been created
4587yet, it calls \cfunction{PyEval_InitThreads()}. When this call
4588returns, it is guaranteed that the lock has been created and that it
4589has acquired it.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004590
4591It is \strong{not} safe to call this function when it is unknown which
4592thread (if any) currently has the global interpreter lock.
4593
4594This function is not available when thread support is disabled at
4595compile time.
4596\end{cfuncdesc}
4597
Guido van Rossum4a944d71997-08-14 20:35:38 +00004598\begin{cfuncdesc}{void}{PyEval_AcquireLock}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004599Acquire the global interpreter lock. The lock must have been created
4600earlier. If this thread already has the lock, a deadlock ensues.
4601This function is not available when thread support is disabled at
4602compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004603\end{cfuncdesc}
4604
4605\begin{cfuncdesc}{void}{PyEval_ReleaseLock}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004606Release the global interpreter lock. The lock must have been created
4607earlier. This function is not available when thread support is
Fred Drakee058b4f1998-02-16 06:15:35 +00004608disabled at compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004609\end{cfuncdesc}
4610
4611\begin{cfuncdesc}{void}{PyEval_AcquireThread}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004612Acquire the global interpreter lock and then set the current thread
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004613state to \var{tstate}, which should not be \NULL{}. The lock must
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004614have been created earlier. If this thread already has the lock,
4615deadlock ensues. This function is not available when thread support
Fred Drakee058b4f1998-02-16 06:15:35 +00004616is disabled at compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004617\end{cfuncdesc}
4618
4619\begin{cfuncdesc}{void}{PyEval_ReleaseThread}{PyThreadState *tstate}
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004620Reset the current thread state to \NULL{} and release the global
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004621interpreter lock. The lock must have been created earlier and must be
4622held by the current thread. The \var{tstate} argument, which must not
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004623be \NULL{}, is only used to check that it represents the current
Fred Drakee058b4f1998-02-16 06:15:35 +00004624thread state --- if it isn't, a fatal error is reported. This
4625function is not available when thread support is disabled at compile
4626time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004627\end{cfuncdesc}
4628
Fred Drakec6fa34e1998-04-02 06:47:24 +00004629\begin{cfuncdesc}{PyThreadState*}{PyEval_SaveThread}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004630Release the interpreter lock (if it has been created and thread
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004631support is enabled) and reset the thread state to \NULL{},
4632returning the previous thread state (which is not \NULL{}). If
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004633the lock has been created, the current thread must have acquired it.
4634(This function is available even when thread support is disabled at
4635compile time.)
Guido van Rossum4a944d71997-08-14 20:35:38 +00004636\end{cfuncdesc}
4637
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004638\begin{cfuncdesc}{void}{PyEval_RestoreThread}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004639Acquire the interpreter lock (if it has been created and thread
4640support is enabled) and set the thread state to \var{tstate}, which
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004641must not be \NULL{}. If the lock has been created, the current
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004642thread must not have acquired it, otherwise deadlock ensues. (This
4643function is available even when thread support is disabled at compile
4644time.)
Guido van Rossum4a944d71997-08-14 20:35:38 +00004645\end{cfuncdesc}
4646
Fred Drake659ebfa2000-04-03 15:42:13 +00004647The following macros are normally used without a trailing semicolon;
4648look for example usage in the Python source distribution.
4649
4650\begin{csimplemacrodesc}{Py_BEGIN_ALLOW_THREADS}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004651This macro expands to
Fred Drakee058b4f1998-02-16 06:15:35 +00004652\samp{\{ PyThreadState *_save; _save = PyEval_SaveThread();}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004653Note that it contains an opening brace; it must be matched with a
4654following \code{Py_END_ALLOW_THREADS} macro. See above for further
4655discussion of this macro. It is a no-op when thread support is
4656disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004657\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004658
Fred Drake659ebfa2000-04-03 15:42:13 +00004659\begin{csimplemacrodesc}{Py_END_ALLOW_THREADS}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004660This macro expands to
Fred Drakee058b4f1998-02-16 06:15:35 +00004661\samp{PyEval_RestoreThread(_save); \}}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004662Note that it contains a closing brace; it must be matched with an
4663earlier \code{Py_BEGIN_ALLOW_THREADS} macro. See above for further
4664discussion of this macro. It is a no-op when thread support is
4665disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004666\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004667
Thomas Wouterse30ac572001-07-09 14:35:01 +00004668\begin{csimplemacrodesc}{Py_BLOCK_THREADS}
Fred Drakebab29652001-07-10 16:10:08 +00004669This macro expands to \samp{PyEval_RestoreThread(_save);}: it
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004670is equivalent to \code{Py_END_ALLOW_THREADS} without the closing
4671brace. It is a no-op when thread support is disabled at compile
4672time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004673\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004674
Thomas Wouterse30ac572001-07-09 14:35:01 +00004675\begin{csimplemacrodesc}{Py_UNBLOCK_THREADS}
Fred Drakebab29652001-07-10 16:10:08 +00004676This macro expands to \samp{_save = PyEval_SaveThread();}: it is
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004677equivalent to \code{Py_BEGIN_ALLOW_THREADS} without the opening brace
4678and variable declaration. It is a no-op when thread support is
4679disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004680\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004681
4682All of the following functions are only available when thread support
4683is enabled at compile time, and must be called only when the
Fred Drake9d20ac31998-02-16 15:27:08 +00004684interpreter lock has been created.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004685
Fred Drakec6fa34e1998-04-02 06:47:24 +00004686\begin{cfuncdesc}{PyInterpreterState*}{PyInterpreterState_New}{}
Guido van Rossumed9dcc11998-08-07 18:28:03 +00004687Create a new interpreter state object. The interpreter lock need not
4688be held, but may be held if it is necessary to serialize calls to this
4689function.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004690\end{cfuncdesc}
4691
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004692\begin{cfuncdesc}{void}{PyInterpreterState_Clear}{PyInterpreterState *interp}
4693Reset all information in an interpreter state object. The interpreter
4694lock must be held.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004695\end{cfuncdesc}
4696
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004697\begin{cfuncdesc}{void}{PyInterpreterState_Delete}{PyInterpreterState *interp}
4698Destroy an interpreter state object. The interpreter lock need not be
4699held. The interpreter state must have been reset with a previous
Fred Drakee058b4f1998-02-16 06:15:35 +00004700call to \cfunction{PyInterpreterState_Clear()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004701\end{cfuncdesc}
4702
Fred Drakec6fa34e1998-04-02 06:47:24 +00004703\begin{cfuncdesc}{PyThreadState*}{PyThreadState_New}{PyInterpreterState *interp}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004704Create a new thread state object belonging to the given interpreter
Guido van Rossumed9dcc11998-08-07 18:28:03 +00004705object. The interpreter lock need not be held, but may be held if it
4706is necessary to serialize calls to this function.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004707\end{cfuncdesc}
4708
4709\begin{cfuncdesc}{void}{PyThreadState_Clear}{PyThreadState *tstate}
4710Reset all information in a thread state object. The interpreter lock
4711must be held.
4712\end{cfuncdesc}
4713
4714\begin{cfuncdesc}{void}{PyThreadState_Delete}{PyThreadState *tstate}
4715Destroy a thread state object. The interpreter lock need not be
4716held. The thread state must have been reset with a previous
Fred Drakee058b4f1998-02-16 06:15:35 +00004717call to \cfunction{PyThreadState_Clear()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004718\end{cfuncdesc}
4719
Fred Drakec6fa34e1998-04-02 06:47:24 +00004720\begin{cfuncdesc}{PyThreadState*}{PyThreadState_Get}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004721Return the current thread state. The interpreter lock must be held.
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004722When the current thread state is \NULL{}, this issues a fatal
Guido van Rossum5b8a5231997-12-30 04:38:44 +00004723error (so that the caller needn't check for \NULL{}).
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004724\end{cfuncdesc}
4725
Fred Drakec6fa34e1998-04-02 06:47:24 +00004726\begin{cfuncdesc}{PyThreadState*}{PyThreadState_Swap}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004727Swap the current thread state with the thread state given by the
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004728argument \var{tstate}, which may be \NULL{}. The interpreter lock
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004729must be held.
4730\end{cfuncdesc}
4731
Fred Drake24e62192001-05-21 15:56:55 +00004732\begin{cfuncdesc}{PyObject*}{PyThreadState_GetDict}{}
4733Return a dictionary in which extensions can store thread-specific
4734state information. Each extension should use a unique key to use to
4735store state in the dictionary. If this function returns \NULL, an
4736exception has been raised and the caller should allow it to
4737propogate.
4738\end{cfuncdesc}
4739
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004740
Fred Drake68db7302001-07-17 19:48:30 +00004741\section{Profiling and Tracing \label{profiling}}
4742
4743\sectionauthor{Fred L. Drake, Jr.}{fdrake@acm.org}
4744
4745The Python interpreter provides some low-level support for attaching
4746profiling and execution tracing facilities. These are used for
4747profiling, debugging, and coverage analysis tools.
4748
4749Starting with Python 2.2, the implementation of this facility was
4750substantially revised, and an interface from C was added. This C
4751interface allows the profiling or tracing code to avoid the overhead
4752of calling through Python-level callable objects, making a direct C
4753function call instead. The essential attributes of the facility have
4754not changed; the interface allows trace functions to be installed
4755per-thread, and the basic events reported to the trace function are
4756the same as had been reported to the Python-level trace functions in
4757previous versions.
4758
4759\begin{ctypedesc}[Py_tracefunc]{int (*Py_tracefunc)(PyObject *obj,
4760 PyFrameObject *frame, int what,
4761 PyObject *arg)}
4762 The type of the trace function registered using
4763 \cfunction{PyEval_SetProfile()} and \cfunction{PyEval_SetTrace()}.
4764 The first parameter is the object passed to the registration
4765 function,
4766\end{ctypedesc}
4767
4768\begin{cvardesc}{int}{PyTrace_CALL}
4769 The value of the \var{what} parameter to a \ctype{Py_tracefunc}
4770 function when a new function or method call is being reported.
4771\end{cvardesc}
4772
4773\begin{cvardesc}{int}{PyTrace_EXCEPT}
4774\end{cvardesc}
4775
4776\begin{cvardesc}{int}{PyTrace_LINE}
4777 The value passed as the \var{what} parameter to a trace function
4778 (but not a profiling function) when a line-number event is being
4779 reported.
4780\end{cvardesc}
4781
4782\begin{cvardesc}{int}{PyTrace_RETURN}
4783 The value for the \var{what} parameter to \ctype{Py_tracefunc}
4784 functions when a call is returning without propogating an exception.
4785\end{cvardesc}
4786
4787\begin{cfuncdesc}{void}{PyEval_SetProfile}{Py_tracefunc func, PyObject *obj}
Fred Drakef90490e2001-08-02 18:00:28 +00004788 Set the profiler function to \var{func}. The \var{obj} parameter is
4789 passed to the function as its first parameter, and may be any Python
4790 object, or \NULL. If the profile function needs to maintain state,
4791 using a different value for \var{obj} for each thread provides a
4792 convenient and thread-safe place to store it. The profile function
4793 is called for all monitored events except the line-number events.
Fred Drake68db7302001-07-17 19:48:30 +00004794\end{cfuncdesc}
4795
4796\begin{cfuncdesc}{void}{PyEval_SetTrace}{Py_tracefunc func, PyObject *obj}
Fred Drakef90490e2001-08-02 18:00:28 +00004797 Set the the tracing function to \var{func}. This is similar to
4798 \cfunction{PyEval_SetProfile()}, except the tracing function does
4799 receive line-number events.
Fred Drake68db7302001-07-17 19:48:30 +00004800\end{cfuncdesc}
4801
4802
Fred Drake659ebfa2000-04-03 15:42:13 +00004803\chapter{Memory Management \label{memory}}
4804\sectionauthor{Vladimir Marangozov}{Vladimir.Marangozov@inrialpes.fr}
4805
4806
4807\section{Overview \label{memoryOverview}}
4808
4809Memory management in Python involves a private heap containing all
4810Python objects and data structures. The management of this private
4811heap is ensured internally by the \emph{Python memory manager}. The
4812Python memory manager has different components which deal with various
4813dynamic storage management aspects, like sharing, segmentation,
4814preallocation or caching.
4815
4816At the lowest level, a raw memory allocator ensures that there is
4817enough room in the private heap for storing all Python-related data
4818by interacting with the memory manager of the operating system. On top
4819of the raw memory allocator, several object-specific allocators
4820operate on the same heap and implement distinct memory management
4821policies adapted to the peculiarities of every object type. For
4822example, integer objects are managed differently within the heap than
4823strings, tuples or dictionaries because integers imply different
4824storage requirements and speed/space tradeoffs. The Python memory
4825manager thus delegates some of the work to the object-specific
4826allocators, but ensures that the latter operate within the bounds of
4827the private heap.
4828
4829It is important to understand that the management of the Python heap
4830is performed by the interpreter itself and that the user has no
4831control on it, even if she regularly manipulates object pointers to
4832memory blocks inside that heap. The allocation of heap space for
4833Python objects and other internal buffers is performed on demand by
4834the Python memory manager through the Python/C API functions listed in
4835this document.
4836
4837To avoid memory corruption, extension writers should never try to
4838operate on Python objects with the functions exported by the C
4839library: \cfunction{malloc()}\ttindex{malloc()},
4840\cfunction{calloc()}\ttindex{calloc()},
4841\cfunction{realloc()}\ttindex{realloc()} and
4842\cfunction{free()}\ttindex{free()}. This will result in
4843mixed calls between the C allocator and the Python memory manager
4844with fatal consequences, because they implement different algorithms
4845and operate on different heaps. However, one may safely allocate and
4846release memory blocks with the C library allocator for individual
4847purposes, as shown in the following example:
4848
4849\begin{verbatim}
4850 PyObject *res;
4851 char *buf = (char *) malloc(BUFSIZ); /* for I/O */
4852
4853 if (buf == NULL)
4854 return PyErr_NoMemory();
4855 ...Do some I/O operation involving buf...
4856 res = PyString_FromString(buf);
4857 free(buf); /* malloc'ed */
4858 return res;
4859\end{verbatim}
4860
4861In this example, the memory request for the I/O buffer is handled by
4862the C library allocator. The Python memory manager is involved only
4863in the allocation of the string object returned as a result.
4864
4865In most situations, however, it is recommended to allocate memory from
4866the Python heap specifically because the latter is under control of
4867the Python memory manager. For example, this is required when the
4868interpreter is extended with new object types written in C. Another
4869reason for using the Python heap is the desire to \emph{inform} the
4870Python memory manager about the memory needs of the extension module.
4871Even when the requested memory is used exclusively for internal,
4872highly-specific purposes, delegating all memory requests to the Python
4873memory manager causes the interpreter to have a more accurate image of
4874its memory footprint as a whole. Consequently, under certain
4875circumstances, the Python memory manager may or may not trigger
4876appropriate actions, like garbage collection, memory compaction or
4877other preventive procedures. Note that by using the C library
4878allocator as shown in the previous example, the allocated memory for
4879the I/O buffer escapes completely the Python memory manager.
4880
4881
4882\section{Memory Interface \label{memoryInterface}}
4883
4884The following function sets, modeled after the ANSI C standard, are
4885available for allocating and releasing memory from the Python heap:
4886
4887
Fred Drake7d45d342000-08-11 17:07:32 +00004888\begin{cfuncdesc}{void*}{PyMem_Malloc}{size_t n}
4889Allocates \var{n} bytes and returns a pointer of type \ctype{void*} to
Fred Drakebab29652001-07-10 16:10:08 +00004890the allocated memory, or \NULL{} if the request fails. Requesting zero
Fred Drake659ebfa2000-04-03 15:42:13 +00004891bytes returns a non-\NULL{} pointer.
Fred Drakebab29652001-07-10 16:10:08 +00004892The memory will not have been initialized in any way.
Fred Drake659ebfa2000-04-03 15:42:13 +00004893\end{cfuncdesc}
4894
Fred Drake7d45d342000-08-11 17:07:32 +00004895\begin{cfuncdesc}{void*}{PyMem_Realloc}{void *p, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004896Resizes the memory block pointed to by \var{p} to \var{n} bytes. The
4897contents will be unchanged to the minimum of the old and the new
4898sizes. If \var{p} is \NULL{}, the call is equivalent to
Fred Drakebab29652001-07-10 16:10:08 +00004899\cfunction{PyMem_Malloc(\var{n})}; if \var{n} is equal to zero, the
4900memory block is resized but is not freed, and the returned pointer is
4901non-\NULL{}. Unless \var{p} is \NULL{}, it must have been returned by
4902a previous call to \cfunction{PyMem_Malloc()} or
4903\cfunction{PyMem_Realloc()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004904\end{cfuncdesc}
4905
Fred Drake7d45d342000-08-11 17:07:32 +00004906\begin{cfuncdesc}{void}{PyMem_Free}{void *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00004907Frees the memory block pointed to by \var{p}, which must have been
4908returned by a previous call to \cfunction{PyMem_Malloc()} or
4909\cfunction{PyMem_Realloc()}. Otherwise, or if
4910\cfunction{PyMem_Free(p)} has been called before, undefined behaviour
4911occurs. If \var{p} is \NULL{}, no operation is performed.
4912\end{cfuncdesc}
4913
Fred Drake659ebfa2000-04-03 15:42:13 +00004914The following type-oriented macros are provided for convenience. Note
4915that \var{TYPE} refers to any C type.
4916
Fred Drakef913e542000-09-12 20:17:17 +00004917\begin{cfuncdesc}{\var{TYPE}*}{PyMem_New}{TYPE, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004918Same as \cfunction{PyMem_Malloc()}, but allocates \code{(\var{n} *
4919sizeof(\var{TYPE}))} bytes of memory. Returns a pointer cast to
4920\ctype{\var{TYPE}*}.
Fred Drakebab29652001-07-10 16:10:08 +00004921The memory will not have been initialized in any way.
Fred Drake659ebfa2000-04-03 15:42:13 +00004922\end{cfuncdesc}
4923
Fred Drakef913e542000-09-12 20:17:17 +00004924\begin{cfuncdesc}{\var{TYPE}*}{PyMem_Resize}{void *p, TYPE, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004925Same as \cfunction{PyMem_Realloc()}, but the memory block is resized
4926to \code{(\var{n} * sizeof(\var{TYPE}))} bytes. Returns a pointer
4927cast to \ctype{\var{TYPE}*}.
4928\end{cfuncdesc}
4929
Fred Drakef913e542000-09-12 20:17:17 +00004930\begin{cfuncdesc}{void}{PyMem_Del}{void *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00004931Same as \cfunction{PyMem_Free()}.
4932\end{cfuncdesc}
4933
Fred Drakef913e542000-09-12 20:17:17 +00004934In addition, the following macro sets are provided for calling the
4935Python memory allocator directly, without involving the C API functions
4936listed above. However, note that their use does not preserve binary
4937compatibility accross Python versions and is therefore deprecated in
4938extension modules.
4939
4940\cfunction{PyMem_MALLOC()}, \cfunction{PyMem_REALLOC()}, \cfunction{PyMem_FREE()}.
4941
4942\cfunction{PyMem_NEW()}, \cfunction{PyMem_RESIZE()}, \cfunction{PyMem_DEL()}.
4943
Fred Drake659ebfa2000-04-03 15:42:13 +00004944
4945\section{Examples \label{memoryExamples}}
4946
4947Here is the example from section \ref{memoryOverview}, rewritten so
4948that the I/O buffer is allocated from the Python heap by using the
4949first function set:
4950
4951\begin{verbatim}
4952 PyObject *res;
4953 char *buf = (char *) PyMem_Malloc(BUFSIZ); /* for I/O */
4954
4955 if (buf == NULL)
4956 return PyErr_NoMemory();
4957 /* ...Do some I/O operation involving buf... */
4958 res = PyString_FromString(buf);
4959 PyMem_Free(buf); /* allocated with PyMem_Malloc */
4960 return res;
4961\end{verbatim}
4962
Fred Drakef913e542000-09-12 20:17:17 +00004963The same code using the type-oriented function set:
Fred Drake659ebfa2000-04-03 15:42:13 +00004964
4965\begin{verbatim}
4966 PyObject *res;
Fred Drakef913e542000-09-12 20:17:17 +00004967 char *buf = PyMem_New(char, BUFSIZ); /* for I/O */
Fred Drake659ebfa2000-04-03 15:42:13 +00004968
4969 if (buf == NULL)
4970 return PyErr_NoMemory();
4971 /* ...Do some I/O operation involving buf... */
4972 res = PyString_FromString(buf);
Fred Drakef913e542000-09-12 20:17:17 +00004973 PyMem_Del(buf); /* allocated with PyMem_New */
Fred Drake659ebfa2000-04-03 15:42:13 +00004974 return res;
4975\end{verbatim}
4976
Fred Drakef913e542000-09-12 20:17:17 +00004977Note that in the two examples above, the buffer is always
4978manipulated via functions belonging to the same set. Indeed, it
Fred Drake659ebfa2000-04-03 15:42:13 +00004979is required to use the same memory API family for a given
4980memory block, so that the risk of mixing different allocators is
4981reduced to a minimum. The following code sequence contains two errors,
4982one of which is labeled as \emph{fatal} because it mixes two different
4983allocators operating on different heaps.
4984
4985\begin{verbatim}
Fred Drakef913e542000-09-12 20:17:17 +00004986char *buf1 = PyMem_New(char, BUFSIZ);
Fred Drake659ebfa2000-04-03 15:42:13 +00004987char *buf2 = (char *) malloc(BUFSIZ);
4988char *buf3 = (char *) PyMem_Malloc(BUFSIZ);
4989...
Fred Drakef913e542000-09-12 20:17:17 +00004990PyMem_Del(buf3); /* Wrong -- should be PyMem_Free() */
Fred Drake659ebfa2000-04-03 15:42:13 +00004991free(buf2); /* Right -- allocated via malloc() */
Fred Drakef913e542000-09-12 20:17:17 +00004992free(buf1); /* Fatal -- should be PyMem_Del() */
Fred Drake659ebfa2000-04-03 15:42:13 +00004993\end{verbatim}
4994
4995In addition to the functions aimed at handling raw memory blocks from
4996the Python heap, objects in Python are allocated and released with
Fred Drakef913e542000-09-12 20:17:17 +00004997\cfunction{PyObject_New()}, \cfunction{PyObject_NewVar()} and
4998\cfunction{PyObject_Del()}, or with their corresponding macros
4999\cfunction{PyObject_NEW()}, \cfunction{PyObject_NEW_VAR()} and
Fred Drakee06f0f92000-06-30 15:52:39 +00005000\cfunction{PyObject_DEL()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00005001
Fred Drakee06f0f92000-06-30 15:52:39 +00005002These will be explained in the next chapter on defining and
5003implementing new object types in C.
Fred Drake659ebfa2000-04-03 15:42:13 +00005004
5005
Fred Drakeefd146c1999-02-15 15:30:45 +00005006\chapter{Defining New Object Types \label{newTypes}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00005007
Fred Drake88fdaa72001-07-20 20:56:11 +00005008
5009\section{Allocating Objects on the Heap
5010 \label{allocating-objects}}
5011
Fred Drakec6fa34e1998-04-02 06:47:24 +00005012\begin{cfuncdesc}{PyObject*}{_PyObject_New}{PyTypeObject *type}
Fred Drakee058b4f1998-02-16 06:15:35 +00005013\end{cfuncdesc}
5014
Fred Drakef913e542000-09-12 20:17:17 +00005015\begin{cfuncdesc}{PyVarObject*}{_PyObject_NewVar}{PyTypeObject *type, int size}
Fred Drakee058b4f1998-02-16 06:15:35 +00005016\end{cfuncdesc}
5017
Fred Drakef913e542000-09-12 20:17:17 +00005018\begin{cfuncdesc}{void}{_PyObject_Del}{PyObject *op}
Fred Drakee058b4f1998-02-16 06:15:35 +00005019\end{cfuncdesc}
5020
Fred Drakef913e542000-09-12 20:17:17 +00005021\begin{cfuncdesc}{PyObject*}{PyObject_Init}{PyObject *op,
Fred Drakebab29652001-07-10 16:10:08 +00005022 PyTypeObject *type}
5023 Initialize a newly-allocated object \var{op} with its type and
5024 initial reference. Returns the initialized object. If \var{type}
5025 indicates that the object participates in the cyclic garbage
5026 detector, it it added to the detector's set of observed objects.
5027 Other fields of the object are not affected.
Fred Drakef913e542000-09-12 20:17:17 +00005028\end{cfuncdesc}
5029
5030\begin{cfuncdesc}{PyVarObject*}{PyObject_InitVar}{PyVarObject *op,
Fred Drakebab29652001-07-10 16:10:08 +00005031 PyTypeObject *type, int size}
5032 This does everything \cfunction{PyObject_Init()} does, and also
5033 initializes the length information for a variable-size object.
Fred Drakef913e542000-09-12 20:17:17 +00005034\end{cfuncdesc}
5035
5036\begin{cfuncdesc}{\var{TYPE}*}{PyObject_New}{TYPE, PyTypeObject *type}
Fred Drakebab29652001-07-10 16:10:08 +00005037 Allocate a new Python object using the C structure type \var{TYPE}
5038 and the Python type object \var{type}. Fields not defined by the
5039 Python object header are not initialized; the object's reference
5040 count will be one. The size of the memory
5041 allocation is determined from the \member{tp_basicsize} field of the
5042 type object.
Fred Drakef913e542000-09-12 20:17:17 +00005043\end{cfuncdesc}
5044
5045\begin{cfuncdesc}{\var{TYPE}*}{PyObject_NewVar}{TYPE, PyTypeObject *type,
5046 int size}
Fred Drakebab29652001-07-10 16:10:08 +00005047 Allocate a new Python object using the C structure type \var{TYPE}
5048 and the Python type object \var{type}. Fields not defined by the
5049 Python object header are not initialized. The allocated memory
5050 allows for the \var{TYPE} structure plus \var{size} fields of the
5051 size given by the \member{tp_itemsize} field of \var{type}. This is
5052 useful for implementing objects like tuples, which are able to
5053 determine their size at construction time. Embedding the array of
5054 fields into the same allocation decreases the number of allocations,
5055 improving the memory management efficiency.
Fred Drakef913e542000-09-12 20:17:17 +00005056\end{cfuncdesc}
5057
5058\begin{cfuncdesc}{void}{PyObject_Del}{PyObject *op}
Fred Drakebab29652001-07-10 16:10:08 +00005059 Releases memory allocated to an object using
5060 \cfunction{PyObject_New()} or \cfunction{PyObject_NewVar()}. This
5061 is normally called from the \member{tp_dealloc} handler specified in
5062 the object's type. The fields of the object should not be accessed
5063 after this call as the memory is no longer a valid Python object.
Fred Drakef913e542000-09-12 20:17:17 +00005064\end{cfuncdesc}
5065
5066\begin{cfuncdesc}{\var{TYPE}*}{PyObject_NEW}{TYPE, PyTypeObject *type}
Fred Drakebab29652001-07-10 16:10:08 +00005067 Macro version of \cfunction{PyObject_New()}, to gain performance at
5068 the expense of safety. This does not check \var{type} for a \NULL{}
5069 value.
Fred Drakef913e542000-09-12 20:17:17 +00005070\end{cfuncdesc}
5071
5072\begin{cfuncdesc}{\var{TYPE}*}{PyObject_NEW_VAR}{TYPE, PyTypeObject *type,
5073 int size}
Fred Drakebab29652001-07-10 16:10:08 +00005074 Macro version of \cfunction{PyObject_NewVar()}, to gain performance
5075 at the expense of safety. This does not check \var{type} for a
5076 \NULL{} value.
Fred Drakef913e542000-09-12 20:17:17 +00005077\end{cfuncdesc}
5078
5079\begin{cfuncdesc}{void}{PyObject_DEL}{PyObject *op}
Fred Drakebab29652001-07-10 16:10:08 +00005080 Macro version of \cfunction{PyObject_Del()}.
Fred Drakee058b4f1998-02-16 06:15:35 +00005081\end{cfuncdesc}
5082
Fred Drakeee814bf2000-11-28 22:34:32 +00005083\begin{cfuncdesc}{PyObject*}{Py_InitModule}{char *name,
5084 PyMethodDef *methods}
5085 Create a new module object based on a name and table of functions,
5086 returning the new module object.
5087\end{cfuncdesc}
5088
5089\begin{cfuncdesc}{PyObject*}{Py_InitModule3}{char *name,
5090 PyMethodDef *methods,
5091 char *doc}
5092 Create a new module object based on a name and table of functions,
5093 returning the new module object. If \var{doc} is non-\NULL, it will
5094 be used to define the docstring for the module.
5095\end{cfuncdesc}
5096
5097\begin{cfuncdesc}{PyObject*}{Py_InitModule4}{char *name,
5098 PyMethodDef *methods,
5099 char *doc, PyObject *self,
5100 int apiver}
5101 Create a new module object based on a name and table of functions,
5102 returning the new module object. If \var{doc} is non-\NULL, it will
5103 be used to define the docstring for the module. If \var{self} is
5104 non-\NULL, it will passed to the functions of the module as their
5105 (otherwise \NULL) first parameter. (This was added as an
5106 experimental feature, and there are no known uses in the current
5107 version of Python.) For \var{apiver}, the only value which should
5108 be passed is defined by the constant \constant{PYTHON_API_VERSION}.
5109
5110 \strong{Note:} Most uses of this function should probably be using
5111 the \cfunction{Py_InitModule3()} instead; only use this if you are
5112 sure you need it.
5113\end{cfuncdesc}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00005114
Fred Drake659ebfa2000-04-03 15:42:13 +00005115DL_IMPORT
5116
Fred Drakebab29652001-07-10 16:10:08 +00005117\begin{cvardesc}{PyObject}{_Py_NoneStruct}
5118 Object which is visible in Python as \code{None}. This should only
5119 be accessed using the \code{Py_None} macro, which evaluates to a
5120 pointer to this object.
5121\end{cvardesc}
Fred Drake659ebfa2000-04-03 15:42:13 +00005122
5123
5124\section{Common Object Structures \label{common-structs}}
5125
Guido van Rossumae110af1997-05-22 20:11:52 +00005126PyObject, PyVarObject
5127
5128PyObject_HEAD, PyObject_HEAD_INIT, PyObject_VAR_HEAD
5129
5130Typedefs:
5131unaryfunc, binaryfunc, ternaryfunc, inquiry, coercion, intargfunc,
5132intintargfunc, intobjargproc, intintobjargproc, objobjargproc,
Guido van Rossumae110af1997-05-22 20:11:52 +00005133destructor, printfunc, getattrfunc, getattrofunc, setattrfunc,
5134setattrofunc, cmpfunc, reprfunc, hashfunc
5135
Fred Drakea8455ab2000-06-16 19:58:42 +00005136\begin{ctypedesc}{PyCFunction}
5137Type of the functions used to implement most Python callables in C.
5138\end{ctypedesc}
5139
5140\begin{ctypedesc}{PyMethodDef}
5141Structure used to describe a method of an extension type. This
5142structure has four fields:
5143
5144\begin{tableiii}{l|l|l}{member}{Field}{C Type}{Meaning}
5145 \lineiii{ml_name}{char *}{name of the method}
5146 \lineiii{ml_meth}{PyCFunction}{pointer to the C implementation}
5147 \lineiii{ml_flags}{int}{flag bits indicating how the call should be
5148 constructed}
5149 \lineiii{ml_doc}{char *}{points to the contents of the docstring}
5150\end{tableiii}
5151\end{ctypedesc}
5152
5153\begin{cfuncdesc}{PyObject*}{Py_FindMethod}{PyMethodDef[] table,
5154 PyObject *ob, char *name}
5155Return a bound method object for an extension type implemented in C.
5156This function also handles the special attribute \member{__methods__},
5157returning a list of all the method names defined in \var{table}.
5158\end{cfuncdesc}
5159
Fred Drake659ebfa2000-04-03 15:42:13 +00005160
5161\section{Mapping Object Structures \label{mapping-structs}}
5162
5163\begin{ctypedesc}{PyMappingMethods}
5164Structure used to hold pointers to the functions used to implement the
5165mapping protocol for an extension type.
5166\end{ctypedesc}
5167
5168
5169\section{Number Object Structures \label{number-structs}}
5170
5171\begin{ctypedesc}{PyNumberMethods}
5172Structure used to hold pointers to the functions an extension type
5173uses to implement the number protocol.
5174\end{ctypedesc}
5175
5176
5177\section{Sequence Object Structures \label{sequence-structs}}
5178
5179\begin{ctypedesc}{PySequenceMethods}
5180Structure used to hold pointers to the functions which an object uses
5181to implement the sequence protocol.
5182\end{ctypedesc}
5183
5184
5185\section{Buffer Object Structures \label{buffer-structs}}
5186\sectionauthor{Greg J. Stein}{greg@lyra.org}
5187
5188The buffer interface exports a model where an object can expose its
5189internal data as a set of chunks of data, where each chunk is
5190specified as a pointer/length pair. These chunks are called
5191\dfn{segments} and are presumed to be non-contiguous in memory.
5192
5193If an object does not export the buffer interface, then its
5194\member{tp_as_buffer} member in the \ctype{PyTypeObject} structure
5195should be \NULL{}. Otherwise, the \member{tp_as_buffer} will point to
5196a \ctype{PyBufferProcs} structure.
5197
5198\strong{Note:} It is very important that your
Fred Drakec392b572001-03-21 22:15:01 +00005199\ctype{PyTypeObject} structure uses \constant{Py_TPFLAGS_DEFAULT} for
5200the value of the \member{tp_flags} member rather than \code{0}. This
Fred Drake659ebfa2000-04-03 15:42:13 +00005201tells the Python runtime that your \ctype{PyBufferProcs} structure
5202contains the \member{bf_getcharbuffer} slot. Older versions of Python
5203did not have this member, so a new Python interpreter using an old
5204extension needs to be able to test for its presence before using it.
5205
5206\begin{ctypedesc}{PyBufferProcs}
5207Structure used to hold the function pointers which define an
5208implementation of the buffer protocol.
5209
5210The first slot is \member{bf_getreadbuffer}, of type
5211\ctype{getreadbufferproc}. If this slot is \NULL{}, then the object
5212does not support reading from the internal data. This is
5213non-sensical, so implementors should fill this in, but callers should
5214test that the slot contains a non-\NULL{} value.
5215
5216The next slot is \member{bf_getwritebuffer} having type
5217\ctype{getwritebufferproc}. This slot may be \NULL{} if the object
5218does not allow writing into its returned buffers.
5219
5220The third slot is \member{bf_getsegcount}, with type
5221\ctype{getsegcountproc}. This slot must not be \NULL{} and is used to
5222inform the caller how many segments the object contains. Simple
5223objects such as \ctype{PyString_Type} and
5224\ctype{PyBuffer_Type} objects contain a single segment.
5225
5226The last slot is \member{bf_getcharbuffer}, of type
5227\ctype{getcharbufferproc}. This slot will only be present if the
Fred Drakec392b572001-03-21 22:15:01 +00005228\constant{Py_TPFLAGS_HAVE_GETCHARBUFFER} flag is present in the
Fred Drake659ebfa2000-04-03 15:42:13 +00005229\member{tp_flags} field of the object's \ctype{PyTypeObject}. Before using
5230this slot, the caller should test whether it is present by using the
5231\cfunction{PyType_HasFeature()}\ttindex{PyType_HasFeature()} function.
5232If present, it may be \NULL, indicating that the object's contents
5233cannot be used as \emph{8-bit characters}.
5234The slot function may also raise an error if the object's contents
5235cannot be interpreted as 8-bit characters. For example, if the object
5236is an array which is configured to hold floating point values, an
5237exception may be raised if a caller attempts to use
5238\member{bf_getcharbuffer} to fetch a sequence of 8-bit characters.
5239This notion of exporting the internal buffers as ``text'' is used to
5240distinguish between objects that are binary in nature, and those which
5241have character-based content.
5242
5243\strong{Note:} The current policy seems to state that these characters
5244may be multi-byte characters. This implies that a buffer size of
5245\var{N} does not mean there are \var{N} characters present.
5246\end{ctypedesc}
5247
5248\begin{datadesc}{Py_TPFLAGS_HAVE_GETCHARBUFFER}
5249Flag bit set in the type structure to indicate that the
5250\member{bf_getcharbuffer} slot is known. This being set does not
5251indicate that the object supports the buffer interface or that the
5252\member{bf_getcharbuffer} slot is non-\NULL.
5253\end{datadesc}
5254
5255\begin{ctypedesc}[getreadbufferproc]{int (*getreadbufferproc)
5256 (PyObject *self, int segment, void **ptrptr)}
5257Return a pointer to a readable segment of the buffer. This function
5258is allowed to raise an exception, in which case it must return
5259\code{-1}. The \var{segment} which is passed must be zero or
5260positive, and strictly less than the number of segments returned by
Greg Stein4d4d0032001-04-07 16:14:49 +00005261the \member{bf_getsegcount} slot function. On success, it returns the
5262length of the buffer memory, and sets \code{*\var{ptrptr}} to a
5263pointer to that memory.
Fred Drake659ebfa2000-04-03 15:42:13 +00005264\end{ctypedesc}
5265
5266\begin{ctypedesc}[getwritebufferproc]{int (*getwritebufferproc)
5267 (PyObject *self, int segment, void **ptrptr)}
Greg Stein4d4d0032001-04-07 16:14:49 +00005268Return a pointer to a writable memory buffer in \code{*\var{ptrptr}},
5269and the length of that segment as the function return value.
5270The memory buffer must correspond to buffer segment \var{segment}.
Fred Drake58c5a2a1999-08-04 13:13:24 +00005271Must return \code{-1} and set an exception on error.
5272\exception{TypeError} should be raised if the object only supports
5273read-only buffers, and \exception{SystemError} should be raised when
5274\var{segment} specifies a segment that doesn't exist.
5275% Why doesn't it raise ValueError for this one?
Fred Drake659ebfa2000-04-03 15:42:13 +00005276% GJS: because you shouldn't be calling it with an invalid
5277% segment. That indicates a blatant programming error in the C
5278% code.
Fred Drake58c5a2a1999-08-04 13:13:24 +00005279\end{ctypedesc}
5280
Fred Drake659ebfa2000-04-03 15:42:13 +00005281\begin{ctypedesc}[getsegcountproc]{int (*getsegcountproc)
5282 (PyObject *self, int *lenp)}
5283Return the number of memory segments which comprise the buffer. If
5284\var{lenp} is not \NULL, the implementation must report the sum of the
5285sizes (in bytes) of all segments in \code{*\var{lenp}}.
5286The function cannot fail.
5287\end{ctypedesc}
Guido van Rossumae110af1997-05-22 20:11:52 +00005288
Fred Drake659ebfa2000-04-03 15:42:13 +00005289\begin{ctypedesc}[getcharbufferproc]{int (*getcharbufferproc)
5290 (PyObject *self, int segment, const char **ptrptr)}
5291\end{ctypedesc}
Guido van Rossumae110af1997-05-22 20:11:52 +00005292
Guido van Rossumae110af1997-05-22 20:11:52 +00005293
Fred Drakef90490e2001-08-02 18:00:28 +00005294\section{Supporting the Iterator Protocol
5295 \label{supporting-iteration}}
5296
5297
Fred Drakec392b572001-03-21 22:15:01 +00005298\section{Supporting Cyclic Garbarge Collection
5299 \label{supporting-cycle-detection}}
5300
5301Python's support for detecting and collecting garbage which involves
5302circular references requires support from object types which are
5303``containers'' for other objects which may also be containers. Types
5304which do not store references to other objects, or which only store
5305references to atomic types (such as numbers or strings), do not need
5306to provide any explicit support for garbage collection.
5307
5308To create a container type, the \member{tp_flags} field of the type
5309object must include the \constant{Py_TPFLAGS_GC} and provide an
Fred Drakee28d8ae2001-03-22 16:30:17 +00005310implementation of the \member{tp_traverse} handler. The computed
5311value of the \member{tp_basicsize} field must include
5312\constant{PyGC_HEAD_SIZE} as well. If instances of the type are
5313mutable, a \member{tp_clear} implementation must also be provided.
Fred Drakec392b572001-03-21 22:15:01 +00005314
5315\begin{datadesc}{Py_TPFLAGS_GC}
5316 Objects with a type with this flag set must conform with the rules
5317 documented here. For convenience these objects will be referred to
5318 as container objects.
5319\end{datadesc}
5320
5321\begin{datadesc}{PyGC_HEAD_SIZE}
5322 Extra memory needed for the garbage collector. Container objects
5323 must include this in the calculation of their tp_basicsize. If the
5324 collector is disabled at compile time then this is \code{0}.
5325\end{datadesc}
5326
Fred Drakee28d8ae2001-03-22 16:30:17 +00005327Constructors for container types must conform to two rules:
5328
5329\begin{enumerate}
5330\item The memory for the object must be allocated using
5331 \cfunction{PyObject_New()} or \cfunction{PyObject_VarNew()}.
5332
5333\item Once all the fields which may contain references to other
5334 containers are initialized, it must call
5335 \cfunction{PyObject_GC_Init()}.
5336\end{enumerate}
5337
Fred Drakec392b572001-03-21 22:15:01 +00005338\begin{cfuncdesc}{void}{PyObject_GC_Init}{PyObject *op}
5339 Adds the object \var{op} to the set of container objects tracked by
5340 the collector. The collector can run at unexpected times so objects
5341 must be valid while being tracked. This should be called once all
5342 the fields followed by the \member{tp_traverse} handler become valid,
5343 usually near the end of the constructor.
5344\end{cfuncdesc}
5345
Fred Drakee28d8ae2001-03-22 16:30:17 +00005346Similarly, the deallocator for the object must conform to a similar
5347pair of rules:
5348
5349\begin{enumerate}
5350\item Before fields which refer to other containers are invalidated,
5351 \cfunction{PyObject_GC_Fini()} must be called.
5352
5353\item The object's memory must be deallocated using
5354 \cfunction{PyObject_Del()}.
5355\end{enumerate}
5356
Fred Drakec392b572001-03-21 22:15:01 +00005357\begin{cfuncdesc}{void}{PyObject_GC_Fini}{PyObject *op}
5358 Remove the object \var{op} from the set of container objects tracked
5359 by the collector. Note that \cfunction{PyObject_GC_Init()} can be
5360 called again on this object to add it back to the set of tracked
5361 objects. The deallocator (\member{tp_dealloc} handler) should call
5362 this for the object before any of the fields used by the
5363 \member{tp_traverse} handler become invalid.
Fred Drake8f6df462001-03-23 17:42:09 +00005364
5365 \strong{Note:} Any container which may be referenced from another
5366 object reachable by the collector must itself be tracked by the
5367 collector, so it is generally not safe to call this function
5368 anywhere but in the object's deallocator.
Fred Drakec392b572001-03-21 22:15:01 +00005369\end{cfuncdesc}
5370
5371The \member{tp_traverse} handler accepts a function parameter of this
5372type:
5373
5374\begin{ctypedesc}[visitproc]{int (*visitproc)(PyObject *object, void *arg)}
5375 Type of the visitor function passed to the \member{tp_traverse}
5376 handler. The function should be called with an object to traverse
5377 as \var{object} and the third parameter to the \member{tp_traverse}
5378 handler as \var{arg}.
5379\end{ctypedesc}
5380
5381The \member{tp_traverse} handler must have the following type:
5382
5383\begin{ctypedesc}[traverseproc]{int (*traverseproc)(PyObject *self,
5384 visitproc visit, void *arg)}
5385 Traversal function for a container object. Implementations must
5386 call the \var{visit} function for each object directly contained by
5387 \var{self}, with the parameters to \var{visit} being the contained
5388 object and the \var{arg} value passed to the handler. If
5389 \var{visit} returns a non-zero value then an error has occurred and
5390 that value should be returned immediately.
5391\end{ctypedesc}
5392
5393The \member{tp_clear} handler must be of the \ctype{inquiry} type, or
5394\NULL{} if the object is immutable.
5395
5396\begin{ctypedesc}[inquiry]{int (*inquiry)(PyObject *self)}
5397 Drop references that may have created reference cycles. Immutable
5398 objects do not have to define this method since they can never
5399 directly create reference cycles. Note that the object must still
Fred Drakebab29652001-07-10 16:10:08 +00005400 be valid after calling this method (don't just call
Fred Drakec392b572001-03-21 22:15:01 +00005401 \cfunction{Py_DECREF()} on a reference). The collector will call
5402 this method if it detects that this object is involved in a
5403 reference cycle.
5404\end{ctypedesc}
5405
5406
Fred Drakee28d8ae2001-03-22 16:30:17 +00005407\subsection{Example Cycle Collector Support
5408 \label{example-cycle-support}}
5409
5410This example shows only enough of the implementation of an extension
5411type to show how the garbage collector support needs to be added. It
5412shows the definition of the object structure, the
5413\member{tp_traverse}, \member{tp_clear} and \member{tp_dealloc}
5414implementations, the type structure, and a constructor --- the module
5415initialization needed to export the constructor to Python is not shown
5416as there are no special considerations there for the collector. To
5417make this interesting, assume that the module exposes ways for the
5418\member{container} field of the object to be modified. Note that
5419since no checks are made on the type of the object used to initialize
5420\member{container}, we have to assume that it may be a container.
5421
5422\begin{verbatim}
5423#include "Python.h"
5424
5425typedef struct {
5426 PyObject_HEAD
5427 PyObject *container;
5428} MyObject;
5429
5430static int
5431my_traverse(MyObject *self, visitproc visit, void *arg)
5432{
5433 if (self->container != NULL)
5434 return visit(self->container, arg);
5435 else
5436 return 0;
5437}
5438
5439static int
5440my_clear(MyObject *self)
5441{
5442 Py_XDECREF(self->container);
5443 self->container = NULL;
5444
5445 return 0;
5446}
5447
5448static void
5449my_dealloc(MyObject *self)
5450{
5451 PyObject_GC_Fini((PyObject *) self);
5452 Py_XDECREF(self->container);
5453 PyObject_Del(self);
5454}
5455\end{verbatim}
5456
5457\begin{verbatim}
5458statichere PyTypeObject
5459MyObject_Type = {
5460 PyObject_HEAD_INIT(NULL)
5461 0,
5462 "MyObject",
5463 sizeof(MyObject) + PyGC_HEAD_SIZE,
5464 0,
5465 (destructor)my_dealloc, /* tp_dealloc */
5466 0, /* tp_print */
5467 0, /* tp_getattr */
5468 0, /* tp_setattr */
5469 0, /* tp_compare */
5470 0, /* tp_repr */
5471 0, /* tp_as_number */
5472 0, /* tp_as_sequence */
5473 0, /* tp_as_mapping */
5474 0, /* tp_hash */
5475 0, /* tp_call */
5476 0, /* tp_str */
5477 0, /* tp_getattro */
5478 0, /* tp_setattro */
5479 0, /* tp_as_buffer */
5480 Py_TPFLAGS_DEFAULT | Py_TPFLAGS_GC,
5481 0, /* tp_doc */
5482 (traverseproc)my_traverse, /* tp_traverse */
5483 (inquiry)my_clear, /* tp_clear */
5484 0, /* tp_richcompare */
5485 0, /* tp_weaklistoffset */
5486};
5487
5488/* This constructor should be made accessible from Python. */
5489static PyObject *
5490new_object(PyObject *unused, PyObject *args)
5491{
5492 PyObject *container = NULL;
5493 MyObject *result = NULL;
5494
5495 if (PyArg_ParseTuple(args, "|O:new_object", &container)) {
5496 result = PyObject_New(MyObject, &MyObject_Type);
5497 if (result != NULL) {
5498 result->container = container;
5499 PyObject_GC_Init();
5500 }
5501 }
5502 return (PyObject *) result;
5503}
5504\end{verbatim}
5505
5506
Fred Drake659ebfa2000-04-03 15:42:13 +00005507% \chapter{Debugging \label{debugging}}
5508%
5509% XXX Explain Py_DEBUG, Py_TRACE_REFS, Py_REF_DEBUG.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00005510
5511
Fred Drakeed773ef2000-09-21 21:35:22 +00005512\appendix
5513\chapter{Reporting Bugs}
5514\input{reportingbugs}
5515
Fred Drake490d34d2001-06-20 21:39:12 +00005516\chapter{History and License}
5517\input{license}
5518
Marc-André Lemburga544ea22001-01-17 18:04:31 +00005519\input{api.ind} % Index -- must be last
Guido van Rossum9231c8f1997-05-15 21:43:21 +00005520
5521\end{document}