blob: 836d677e4d96ecaa7044b5d1bc7ddf4210f800f3 [file] [log] [blame]
Fred Drake6659c301998-03-03 22:02:19 +00001\documentclass{manual}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00002
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003\title{Python/C API Reference Manual}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00004
5\input{boilerplate}
6
7\makeindex % tell \index to actually write the .idx file
8
9
10\begin{document}
11
Guido van Rossum9231c8f1997-05-15 21:43:21 +000012\maketitle
13
Fred Drake9f86b661998-07-28 21:55:19 +000014\ifhtml
15\chapter*{Front Matter\label{front}}
16\fi
17
Guido van Rossum9231c8f1997-05-15 21:43:21 +000018\input{copyright}
19
20\begin{abstract}
21
22\noindent
Fred Drake659ebfa2000-04-03 15:42:13 +000023This manual documents the API used by C and \Cpp{} programmers who
Fred Drakee058b4f1998-02-16 06:15:35 +000024want to write extension modules or embed Python. It is a companion to
Fred Drakebe486461999-11-09 17:03:03 +000025\citetitle[../ext/ext.html]{Extending and Embedding the Python
26Interpreter}, which describes the general principles of extension
27writing but does not document the API functions in detail.
Guido van Rossum9231c8f1997-05-15 21:43:21 +000028
Guido van Rossum5b8a5231997-12-30 04:38:44 +000029\strong{Warning:} The current version of this document is incomplete.
30I hope that it is nevertheless useful. I will continue to work on it,
31and release new versions from time to time, independent from Python
32source code releases.
33
Guido van Rossum9231c8f1997-05-15 21:43:21 +000034\end{abstract}
35
Fred Drake4d4f9e71998-01-13 22:25:02 +000036\tableofcontents
Guido van Rossum9231c8f1997-05-15 21:43:21 +000037
Guido van Rossum5060b3b1997-08-17 18:02:23 +000038% XXX Consider moving all this back to ext.tex and giving api.tex
39% XXX a *really* short intro only.
Guido van Rossum9231c8f1997-05-15 21:43:21 +000040
Fred Drakeefd146c1999-02-15 15:30:45 +000041\chapter{Introduction \label{intro}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +000042
Fred Drake659ebfa2000-04-03 15:42:13 +000043The Application Programmer's Interface to Python gives C and
44\Cpp{} programmers access to the Python interpreter at a variety of
45levels. The API is equally usable from \Cpp{}, but for brevity it is
46generally referred to as the Python/C API. There are two
47fundamentally different reasons for using the Python/C API. The first
48reason is to write \emph{extension modules} for specific purposes;
49these are C modules that extend the Python interpreter. This is
50probably the most common use. The second reason is to use Python as a
51component in a larger application; this technique is generally
52referred to as \dfn{embedding} Python in an application.
Guido van Rossum59a61351997-08-14 20:34:33 +000053
Guido van Rossum4a944d71997-08-14 20:35:38 +000054Writing an extension module is a relatively well-understood process,
55where a ``cookbook'' approach works well. There are several tools
56that automate the process to some extent. While people have embedded
57Python in other applications since its early existence, the process of
58embedding Python is less straightforward that writing an extension.
Guido van Rossum59a61351997-08-14 20:34:33 +000059
Guido van Rossum4a944d71997-08-14 20:35:38 +000060Many API functions are useful independent of whether you're embedding
61or extending Python; moreover, most applications that embed Python
62will need to provide a custom extension as well, so it's probably a
63good idea to become familiar with writing an extension before
Guido van Rossum59a61351997-08-14 20:34:33 +000064attempting to embed Python in a real application.
65
Fred Drakeefd146c1999-02-15 15:30:45 +000066
67\section{Include Files \label{includes}}
Guido van Rossum580aa8d1997-11-25 15:34:51 +000068
69All function, type and macro definitions needed to use the Python/C
70API are included in your code by the following line:
71
Fred Drakee058b4f1998-02-16 06:15:35 +000072\begin{verbatim}
73#include "Python.h"
74\end{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +000075
Fred Drakee058b4f1998-02-16 06:15:35 +000076This implies inclusion of the following standard headers:
77\code{<stdio.h>}, \code{<string.h>}, \code{<errno.h>}, and
78\code{<stdlib.h>} (if available).
Guido van Rossum580aa8d1997-11-25 15:34:51 +000079
80All user visible names defined by Python.h (except those defined by
Fred Drakee058b4f1998-02-16 06:15:35 +000081the included standard headers) have one of the prefixes \samp{Py} or
Fred Drake659ebfa2000-04-03 15:42:13 +000082\samp{_Py}. Names beginning with \samp{_Py} are for internal use by
83the Python implementation and should not be used by extension writers.
84Structure member names do not have a reserved prefix.
Guido van Rossum580aa8d1997-11-25 15:34:51 +000085
Fred Drakee058b4f1998-02-16 06:15:35 +000086\strong{Important:} user code should never define names that begin
87with \samp{Py} or \samp{_Py}. This confuses the reader, and
88jeopardizes the portability of the user code to future Python
89versions, which may define additional names beginning with one of
90these prefixes.
Guido van Rossum580aa8d1997-11-25 15:34:51 +000091
Fred Drake659ebfa2000-04-03 15:42:13 +000092The header files are typically installed with Python. On \UNIX, these
93are located in the directories
94\file{\envvar{prefix}/include/python\var{version}/} and
95\file{\envvar{exec_prefix}/include/python\var{version}/}, where
96\envvar{prefix} and \envvar{exec_prefix} are defined by the
97corresponding parameters to Python's \program{configure} script and
98\var{version} is \code{sys.version[:3]}. On Windows, the headers are
99installed in \file{\envvar{prefix}/include}, where \envvar{prefix} is
100the installation directory specified to the installer.
101
102To include the headers, place both directories (if different) on your
103compiler's search path for includes. Do \emph{not} place the parent
104directories on the search path and then use
105\samp{\#include <python1.5/Python.h>}; this will break on
106multi-platform builds since the platform independent headers under
107\envvar{prefix} include the platform specific headers from
108\envvar{exec_prefix}.
109
Fred Drakeefd146c1999-02-15 15:30:45 +0000110
111\section{Objects, Types and Reference Counts \label{objects}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000112
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000113Most Python/C API functions have one or more arguments as well as a
Fred Drake659ebfa2000-04-03 15:42:13 +0000114return value of type \ctype{PyObject*}. This type is a pointer
Fred Drakee058b4f1998-02-16 06:15:35 +0000115to an opaque data type representing an arbitrary Python
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000116object. Since all Python object types are treated the same way by the
117Python language in most situations (e.g., assignments, scope rules,
118and argument passing), it is only fitting that they should be
Fred Drake659ebfa2000-04-03 15:42:13 +0000119represented by a single C type. Almost all Python objects live on the
120heap: you never declare an automatic or static variable of type
121\ctype{PyObject}, only pointer variables of type \ctype{PyObject*} can
122be declared. The sole exception are the type objects\obindex{type};
123since these must never be deallocated, they are typically static
124\ctype{PyTypeObject} objects.
Guido van Rossum59a61351997-08-14 20:34:33 +0000125
Fred Drakee058b4f1998-02-16 06:15:35 +0000126All Python objects (even Python integers) have a \dfn{type} and a
127\dfn{reference count}. An object's type determines what kind of object
Guido van Rossum4a944d71997-08-14 20:35:38 +0000128it is (e.g., an integer, a list, or a user-defined function; there are
Fred Drakebe486461999-11-09 17:03:03 +0000129many more as explained in the \citetitle[../ref/ref.html]{Python
130Reference Manual}). For each of the well-known types there is a macro
131to check whether an object is of that type; for instance,
Fred Drake659ebfa2000-04-03 15:42:13 +0000132\samp{PyList_Check(\var{a})} is true if (and only if) the object
133pointed to by \var{a} is a Python list.
Guido van Rossum59a61351997-08-14 20:34:33 +0000134
Fred Drakeefd146c1999-02-15 15:30:45 +0000135
136\subsection{Reference Counts \label{refcounts}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000137
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000138The reference count is important because today's computers have a
Fred Drake003d8da1998-04-13 00:53:42 +0000139finite (and often severely limited) memory size; it counts how many
Guido van Rossum4a944d71997-08-14 20:35:38 +0000140different places there are that have a reference to an object. Such a
Fred Drake659ebfa2000-04-03 15:42:13 +0000141place could be another object, or a global (or static) C variable, or
142a local variable in some C function. When an object's reference count
Guido van Rossum4a944d71997-08-14 20:35:38 +0000143becomes zero, the object is deallocated. If it contains references to
144other objects, their reference count is decremented. Those other
145objects may be deallocated in turn, if this decrement makes their
146reference count become zero, and so on. (There's an obvious problem
147with objects that reference each other here; for now, the solution is
Fred Drake659ebfa2000-04-03 15:42:13 +0000148``don't do that.'')
Guido van Rossum59a61351997-08-14 20:34:33 +0000149
Guido van Rossum4a944d71997-08-14 20:35:38 +0000150Reference counts are always manipulated explicitly. The normal way is
Fred Drake659ebfa2000-04-03 15:42:13 +0000151to use the macro \cfunction{Py_INCREF()}\ttindex{Py_INCREF()} to
152increment an object's reference count by one, and
153\cfunction{Py_DECREF()}\ttindex{Py_DECREF()} to decrement it by
154one. The \cfunction{Py_DECREF()} macro is considerably more complex
155than the incref one, since it must check whether the reference count
156becomes zero and then cause the object's deallocator to be called.
157The deallocator is a function pointer contained in the object's type
158structure. The type-specific deallocator takes care of decrementing
159the reference counts for other objects contained in the object if this
160is a compound object type, such as a list, as well as performing any
161additional finalization that's needed. There's no chance that the
162reference count can overflow; at least as many bits are used to hold
163the reference count as there are distinct memory locations in virtual
164memory (assuming \code{sizeof(long) >= sizeof(char*)}). Thus, the
165reference count increment is a simple operation.
Guido van Rossum59a61351997-08-14 20:34:33 +0000166
Guido van Rossum4a944d71997-08-14 20:35:38 +0000167It is not necessary to increment an object's reference count for every
168local variable that contains a pointer to an object. In theory, the
Fred Drakee058b4f1998-02-16 06:15:35 +0000169object's reference count goes up by one when the variable is made to
Guido van Rossum4a944d71997-08-14 20:35:38 +0000170point to it and it goes down by one when the variable goes out of
171scope. However, these two cancel each other out, so at the end the
172reference count hasn't changed. The only real reason to use the
173reference count is to prevent the object from being deallocated as
174long as our variable is pointing to it. If we know that there is at
175least one other reference to the object that lives at least as long as
176our variable, there is no need to increment the reference count
177temporarily. An important situation where this arises is in objects
Fred Drake659ebfa2000-04-03 15:42:13 +0000178that are passed as arguments to C functions in an extension module
Guido van Rossum4a944d71997-08-14 20:35:38 +0000179that are called from Python; the call mechanism guarantees to hold a
Guido van Rossum59a61351997-08-14 20:34:33 +0000180reference to every argument for the duration of the call.
181
Fred Drakee058b4f1998-02-16 06:15:35 +0000182However, a common pitfall is to extract an object from a list and
183hold on to it for a while without incrementing its reference count.
184Some other operation might conceivably remove the object from the
185list, decrementing its reference count and possible deallocating it.
186The real danger is that innocent-looking operations may invoke
187arbitrary Python code which could do this; there is a code path which
188allows control to flow back to the user from a \cfunction{Py_DECREF()},
189so almost any operation is potentially dangerous.
Guido van Rossum59a61351997-08-14 20:34:33 +0000190
Guido van Rossum4a944d71997-08-14 20:35:38 +0000191A safe approach is to always use the generic operations (functions
Fred Drake659ebfa2000-04-03 15:42:13 +0000192whose name begins with \samp{PyObject_}, \samp{PyNumber_},
193\samp{PySequence_} or \samp{PyMapping_}). These operations always
194increment the reference count of the object they return. This leaves
195the caller with the responsibility to call
196\cfunction{Py_DECREF()} when they are done with the result; this soon
197becomes second nature.
Guido van Rossum59a61351997-08-14 20:34:33 +0000198
Fred Drakeefd146c1999-02-15 15:30:45 +0000199
200\subsubsection{Reference Count Details \label{refcountDetails}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000201
202The reference count behavior of functions in the Python/C API is best
Fred Drake659ebfa2000-04-03 15:42:13 +0000203explained in terms of \emph{ownership of references}. Note that we
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000204talk of owning references, never of owning objects; objects are always
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000205shared! When a function owns a reference, it has to dispose of it
Fred Drakee058b4f1998-02-16 06:15:35 +0000206properly --- either by passing ownership on (usually to its caller) or
207by calling \cfunction{Py_DECREF()} or \cfunction{Py_XDECREF()}. When
208a function passes ownership of a reference on to its caller, the
209caller is said to receive a \emph{new} reference. When no ownership
210is transferred, the caller is said to \emph{borrow} the reference.
211Nothing needs to be done for a borrowed reference.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000212
Fred Drakea8455ab2000-06-16 19:58:42 +0000213Conversely, when a calling function passes it a reference to an
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000214object, there are two possibilities: the function \emph{steals} a
215reference to the object, or it does not. Few functions steal
Fred Drakee058b4f1998-02-16 06:15:35 +0000216references; the two notable exceptions are
Fred Drake659ebfa2000-04-03 15:42:13 +0000217\cfunction{PyList_SetItem()}\ttindex{PyList_SetItem()} and
218\cfunction{PyTuple_SetItem()}\ttindex{PyTuple_SetItem()}, which
Fred Drakee058b4f1998-02-16 06:15:35 +0000219steal a reference to the item (but not to the tuple or list into which
Fred Drake003d8da1998-04-13 00:53:42 +0000220the item is put!). These functions were designed to steal a reference
Fred Drakee058b4f1998-02-16 06:15:35 +0000221because of a common idiom for populating a tuple or list with newly
222created objects; for example, the code to create the tuple \code{(1,
2232, "three")} could look like this (forgetting about error handling for
Fred Drake659ebfa2000-04-03 15:42:13 +0000224the moment; a better way to code this is shown below):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000225
226\begin{verbatim}
227PyObject *t;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000228
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000229t = PyTuple_New(3);
230PyTuple_SetItem(t, 0, PyInt_FromLong(1L));
231PyTuple_SetItem(t, 1, PyInt_FromLong(2L));
232PyTuple_SetItem(t, 2, PyString_FromString("three"));
233\end{verbatim}
234
Fred Drakee058b4f1998-02-16 06:15:35 +0000235Incidentally, \cfunction{PyTuple_SetItem()} is the \emph{only} way to
236set tuple items; \cfunction{PySequence_SetItem()} and
237\cfunction{PyObject_SetItem()} refuse to do this since tuples are an
238immutable data type. You should only use
239\cfunction{PyTuple_SetItem()} for tuples that you are creating
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000240yourself.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000241
242Equivalent code for populating a list can be written using
Fred Drakee058b4f1998-02-16 06:15:35 +0000243\cfunction{PyList_New()} and \cfunction{PyList_SetItem()}. Such code
244can also use \cfunction{PySequence_SetItem()}; this illustrates the
245difference between the two (the extra \cfunction{Py_DECREF()} calls):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000246
247\begin{verbatim}
248PyObject *l, *x;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000249
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000250l = PyList_New(3);
251x = PyInt_FromLong(1L);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000252PySequence_SetItem(l, 0, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000253x = PyInt_FromLong(2L);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000254PySequence_SetItem(l, 1, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000255x = PyString_FromString("three");
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000256PySequence_SetItem(l, 2, x); Py_DECREF(x);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000257\end{verbatim}
258
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000259You might find it strange that the ``recommended'' approach takes more
260code. However, in practice, you will rarely use these ways of
261creating and populating a tuple or list. There's a generic function,
Fred Drakee058b4f1998-02-16 06:15:35 +0000262\cfunction{Py_BuildValue()}, that can create most common objects from
Fred Drake659ebfa2000-04-03 15:42:13 +0000263C values, directed by a \dfn{format string}. For example, the
Fred Drakee058b4f1998-02-16 06:15:35 +0000264above two blocks of code could be replaced by the following (which
265also takes care of the error checking):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000266
267\begin{verbatim}
268PyObject *t, *l;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000269
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000270t = Py_BuildValue("(iis)", 1, 2, "three");
271l = Py_BuildValue("[iis]", 1, 2, "three");
272\end{verbatim}
273
Fred Drakee058b4f1998-02-16 06:15:35 +0000274It is much more common to use \cfunction{PyObject_SetItem()} and
275friends with items whose references you are only borrowing, like
276arguments that were passed in to the function you are writing. In
277that case, their behaviour regarding reference counts is much saner,
278since you don't have to increment a reference count so you can give a
279reference away (``have it be stolen''). For example, this function
280sets all items of a list (actually, any mutable sequence) to a given
281item:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000282
283\begin{verbatim}
284int set_all(PyObject *target, PyObject *item)
285{
286 int i, n;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000287
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000288 n = PyObject_Length(target);
289 if (n < 0)
290 return -1;
291 for (i = 0; i < n; i++) {
292 if (PyObject_SetItem(target, i, item) < 0)
293 return -1;
294 }
295 return 0;
296}
297\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000298\ttindex{set_all()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000299
300The situation is slightly different for function return values.
301While passing a reference to most functions does not change your
302ownership responsibilities for that reference, many functions that
303return a referece to an object give you ownership of the reference.
304The reason is simple: in many cases, the returned object is created
305on the fly, and the reference you get is the only reference to the
Fred Drakee058b4f1998-02-16 06:15:35 +0000306object. Therefore, the generic functions that return object
307references, like \cfunction{PyObject_GetItem()} and
308\cfunction{PySequence_GetItem()}, always return a new reference (i.e.,
309the caller becomes the owner of the reference).
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000310
311It is important to realize that whether you own a reference returned
Fred Drakee058b4f1998-02-16 06:15:35 +0000312by a function depends on which function you call only --- \emph{the
313plumage} (i.e., the type of the type of the object passed as an
314argument to the function) \emph{doesn't enter into it!} Thus, if you
315extract an item from a list using \cfunction{PyList_GetItem()}, you
316don't own the reference --- but if you obtain the same item from the
317same list using \cfunction{PySequence_GetItem()} (which happens to
318take exactly the same arguments), you do own a reference to the
319returned object.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000320
Fred Drakee058b4f1998-02-16 06:15:35 +0000321Here is an example of how you could write a function that computes the
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000322sum of the items in a list of integers; once using
Fred Drake659ebfa2000-04-03 15:42:13 +0000323\cfunction{PyList_GetItem()}\ttindex{PyList_GetItem()}, and once using
324\cfunction{PySequence_GetItem()}\ttindex{PySequence_GetItem()}.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000325
326\begin{verbatim}
327long sum_list(PyObject *list)
328{
329 int i, n;
330 long total = 0;
331 PyObject *item;
Fred Drakec6fa34e1998-04-02 06:47:24 +0000332
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000333 n = PyList_Size(list);
334 if (n < 0)
335 return -1; /* Not a list */
336 for (i = 0; i < n; i++) {
337 item = PyList_GetItem(list, i); /* Can't fail */
338 if (!PyInt_Check(item)) continue; /* Skip non-integers */
339 total += PyInt_AsLong(item);
340 }
341 return total;
342}
343\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000344\ttindex{sum_list()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000345
346\begin{verbatim}
347long sum_sequence(PyObject *sequence)
348{
349 int i, n;
350 long total = 0;
351 PyObject *item;
Fred Drake659ebfa2000-04-03 15:42:13 +0000352 n = PySequence_Length(sequence);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000353 if (n < 0)
354 return -1; /* Has no length */
355 for (i = 0; i < n; i++) {
Fred Drake659ebfa2000-04-03 15:42:13 +0000356 item = PySequence_GetItem(sequence, i);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000357 if (item == NULL)
358 return -1; /* Not a sequence, or other failure */
359 if (PyInt_Check(item))
360 total += PyInt_AsLong(item);
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000361 Py_DECREF(item); /* Discard reference ownership */
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000362 }
363 return total;
364}
365\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000366\ttindex{sum_sequence()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000367
Fred Drakeefd146c1999-02-15 15:30:45 +0000368
369\subsection{Types \label{types}}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000370
371There are few other data types that play a significant role in
Fred Drake659ebfa2000-04-03 15:42:13 +0000372the Python/C API; most are simple C types such as \ctype{int},
373\ctype{long}, \ctype{double} and \ctype{char*}. A few structure types
Guido van Rossum4a944d71997-08-14 20:35:38 +0000374are used to describe static tables used to list the functions exported
Fred Drake659ebfa2000-04-03 15:42:13 +0000375by a module or the data attributes of a new object type, and another
376is used to describe the value of a complex number. These will
Guido van Rossum59a61351997-08-14 20:34:33 +0000377be discussed together with the functions that use them.
378
Fred Drakeefd146c1999-02-15 15:30:45 +0000379
380\section{Exceptions \label{exceptions}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000381
Guido van Rossum4a944d71997-08-14 20:35:38 +0000382The Python programmer only needs to deal with exceptions if specific
383error handling is required; unhandled exceptions are automatically
Fred Drake659ebfa2000-04-03 15:42:13 +0000384propagated to the caller, then to the caller's caller, and so on, until
Guido van Rossum4a944d71997-08-14 20:35:38 +0000385they reach the top-level interpreter, where they are reported to the
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000386user accompanied by a stack traceback.
Guido van Rossum59a61351997-08-14 20:34:33 +0000387
Fred Drake659ebfa2000-04-03 15:42:13 +0000388For C programmers, however, error checking always has to be explicit.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000389All functions in the Python/C API can raise exceptions, unless an
390explicit claim is made otherwise in a function's documentation. In
391general, when a function encounters an error, it sets an exception,
392discards any object references that it owns, and returns an
Fred Drakee058b4f1998-02-16 06:15:35 +0000393error indicator --- usually \NULL{} or \code{-1}. A few functions
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000394return a Boolean true/false result, with false indicating an error.
395Very few functions return no explicit error indicator or have an
396ambiguous return value, and require explicit testing for errors with
Fred Drake659ebfa2000-04-03 15:42:13 +0000397\cfunction{PyErr_Occurred()}\ttindex{PyErr_Occurred()}.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000398
399Exception state is maintained in per-thread storage (this is
400equivalent to using global storage in an unthreaded application). A
Fred Drakec6fa34e1998-04-02 06:47:24 +0000401thread can be in one of two states: an exception has occurred, or not.
Fred Drakee058b4f1998-02-16 06:15:35 +0000402The function \cfunction{PyErr_Occurred()} can be used to check for
403this: it returns a borrowed reference to the exception type object
404when an exception has occurred, and \NULL{} otherwise. There are a
405number of functions to set the exception state:
Fred Drake659ebfa2000-04-03 15:42:13 +0000406\cfunction{PyErr_SetString()}\ttindex{PyErr_SetString()} is the most
407common (though not the most general) function to set the exception
408state, and \cfunction{PyErr_Clear()}\ttindex{PyErr_Clear()} clears the
409exception state.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000410
411The full exception state consists of three objects (all of which can
Fred Drakee058b4f1998-02-16 06:15:35 +0000412be \NULL{}): the exception type, the corresponding exception
Fred Drake659ebfa2000-04-03 15:42:13 +0000413value, and the traceback. These have the same meanings as the Python
414\withsubitem{(in module sys)}{
415 \ttindex{exc_type}\ttindex{exc_value}\ttindex{exc_traceback}}
416objects \code{sys.exc_type}, \code{sys.exc_value}, and
417\code{sys.exc_traceback}; however, they are not the same: the Python
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000418objects represent the last exception being handled by a Python
Fred Drake659ebfa2000-04-03 15:42:13 +0000419\keyword{try} \ldots\ \keyword{except} statement, while the C level
Fred Drakee058b4f1998-02-16 06:15:35 +0000420exception state only exists while an exception is being passed on
Fred Drake659ebfa2000-04-03 15:42:13 +0000421between C functions until it reaches the Python bytecode interpreter's
422main loop, which takes care of transferring it to \code{sys.exc_type}
423and friends.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000424
Fred Drakec6fa34e1998-04-02 06:47:24 +0000425Note that starting with Python 1.5, the preferred, thread-safe way to
Fred Drake659ebfa2000-04-03 15:42:13 +0000426access the exception state from Python code is to call the function
427\withsubitem{(in module sys)}{\ttindex{exc_info()}}
Fred Drakee058b4f1998-02-16 06:15:35 +0000428\function{sys.exc_info()}, which returns the per-thread exception state
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000429for Python code. Also, the semantics of both ways to access the
430exception state have changed so that a function which catches an
431exception will save and restore its thread's exception state so as to
432preserve the exception state of its caller. This prevents common bugs
433in exception handling code caused by an innocent-looking function
434overwriting the exception being handled; it also reduces the often
435unwanted lifetime extension for objects that are referenced by the
Fred Drakec6fa34e1998-04-02 06:47:24 +0000436stack frames in the traceback.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000437
438As a general principle, a function that calls another function to
439perform some task should check whether the called function raised an
440exception, and if so, pass the exception state on to its caller. It
Fred Drake659ebfa2000-04-03 15:42:13 +0000441should discard any object references that it owns, and return an
Fred Drakee058b4f1998-02-16 06:15:35 +0000442error indicator, but it should \emph{not} set another exception ---
443that would overwrite the exception that was just raised, and lose
444important information about the exact cause of the error.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000445
Fred Drake659ebfa2000-04-03 15:42:13 +0000446A simple example of detecting exceptions and passing them on is shown
447in the \cfunction{sum_sequence()}\ttindex{sum_sequence()} example
448above. It so happens that that example doesn't need to clean up any
449owned references when it detects an error. The following example
450function shows some error cleanup. First, to remind you why you like
451Python, we show the equivalent Python code:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000452
453\begin{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000454def incr_item(dict, key):
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000455 try:
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000456 item = dict[key]
457 except KeyError:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000458 item = 0
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000459 return item + 1
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000460\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000461\ttindex{incr_item()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000462
Fred Drake659ebfa2000-04-03 15:42:13 +0000463Here is the corresponding C code, in all its glory:
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000464
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000465\begin{verbatim}
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000466int incr_item(PyObject *dict, PyObject *key)
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000467{
468 /* Objects all initialized to NULL for Py_XDECREF */
469 PyObject *item = NULL, *const_one = NULL, *incremented_item = NULL;
Guido van Rossum5b8a5231997-12-30 04:38:44 +0000470 int rv = -1; /* Return value initialized to -1 (failure) */
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000471
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000472 item = PyObject_GetItem(dict, key);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000473 if (item == NULL) {
Fred Drakec6fa34e1998-04-02 06:47:24 +0000474 /* Handle KeyError only: */
475 if (!PyErr_ExceptionMatches(PyExc_KeyError)) goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000476
477 /* Clear the error and use zero: */
478 PyErr_Clear();
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000479 item = PyInt_FromLong(0L);
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000480 if (item == NULL) goto error;
481 }
482
483 const_one = PyInt_FromLong(1L);
484 if (const_one == NULL) goto error;
485
486 incremented_item = PyNumber_Add(item, const_one);
487 if (incremented_item == NULL) goto error;
488
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000489 if (PyObject_SetItem(dict, key, incremented_item) < 0) goto error;
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000490 rv = 0; /* Success */
491 /* Continue with cleanup code */
492
493 error:
494 /* Cleanup code, shared by success and failure path */
495
496 /* Use Py_XDECREF() to ignore NULL references */
497 Py_XDECREF(item);
498 Py_XDECREF(const_one);
499 Py_XDECREF(incremented_item);
500
501 return rv; /* -1 for error, 0 for success */
502}
503\end{verbatim}
Fred Drake659ebfa2000-04-03 15:42:13 +0000504\ttindex{incr_item()}
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000505
Fred Drakef8830d11998-04-23 14:06:01 +0000506This example represents an endorsed use of the \keyword{goto} statement
Fred Drake659ebfa2000-04-03 15:42:13 +0000507in C! It illustrates the use of
508\cfunction{PyErr_ExceptionMatches()}\ttindex{PyErr_ExceptionMatches()} and
509\cfunction{PyErr_Clear()}\ttindex{PyErr_Clear()} to
510handle specific exceptions, and the use of
511\cfunction{Py_XDECREF()}\ttindex{Py_XDECREF()} to
512dispose of owned references that may be \NULL{} (note the
513\character{X} in the name; \cfunction{Py_DECREF()} would crash when
514confronted with a \NULL{} reference). It is important that the
515variables used to hold owned references are initialized to \NULL{} for
516this to work; likewise, the proposed return value is initialized to
517\code{-1} (failure) and only set to success after the final call made
518is successful.
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000519
Guido van Rossum59a61351997-08-14 20:34:33 +0000520
Fred Drakeefd146c1999-02-15 15:30:45 +0000521\section{Embedding Python \label{embedding}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000522
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000523The one important task that only embedders (as opposed to extension
524writers) of the Python interpreter have to worry about is the
525initialization, and possibly the finalization, of the Python
526interpreter. Most functionality of the interpreter can only be used
527after the interpreter has been initialized.
Guido van Rossum59a61351997-08-14 20:34:33 +0000528
Fred Drake659ebfa2000-04-03 15:42:13 +0000529The basic initialization function is
530\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}.
Fred Drakee058b4f1998-02-16 06:15:35 +0000531This initializes the table of loaded modules, and creates the
Fred Drake4de05a91998-02-16 14:25:26 +0000532fundamental modules \module{__builtin__}\refbimodindex{__builtin__},
533\module{__main__}\refbimodindex{__main__} and
534\module{sys}\refbimodindex{sys}. It also initializes the module
Fred Drakec6fa34e1998-04-02 06:47:24 +0000535search path (\code{sys.path}).%
536\indexiii{module}{search}{path}
Fred Drake659ebfa2000-04-03 15:42:13 +0000537\withsubitem{(in module sys)}{\ttindex{path}}
Guido van Rossum59a61351997-08-14 20:34:33 +0000538
Fred Drakee058b4f1998-02-16 06:15:35 +0000539\cfunction{Py_Initialize()} does not set the ``script argument list''
Guido van Rossum4a944d71997-08-14 20:35:38 +0000540(\code{sys.argv}). If this variable is needed by Python code that
541will be executed later, it must be set explicitly with a call to
Fred Drake659ebfa2000-04-03 15:42:13 +0000542\code{PySys_SetArgv(\var{argc},
543\var{argv})}\ttindex{PySys_SetArgv()} subsequent to the call to
544\cfunction{Py_Initialize()}.
Guido van Rossum59a61351997-08-14 20:34:33 +0000545
Fred Drakeb0a78731998-01-13 18:51:10 +0000546On most systems (in particular, on \UNIX{} and Windows, although the
Fred Drake659ebfa2000-04-03 15:42:13 +0000547details are slightly different),
548\cfunction{Py_Initialize()} calculates the module search path based
549upon its best guess for the location of the standard Python
550interpreter executable, assuming that the Python library is found in a
551fixed location relative to the Python interpreter executable. In
552particular, it looks for a directory named
Fred Drake2de75ec1998-04-09 14:12:11 +0000553\file{lib/python1.5} (replacing \file{1.5} with the current
Guido van Rossum42cefd01997-10-05 15:27:29 +0000554interpreter version) relative to the parent directory where the
Fred Drakee058b4f1998-02-16 06:15:35 +0000555executable named \file{python} is found on the shell command search
Fred Drakec6fa34e1998-04-02 06:47:24 +0000556path (the environment variable \envvar{PATH}).
Guido van Rossum42cefd01997-10-05 15:27:29 +0000557
558For instance, if the Python executable is found in
Fred Drakee058b4f1998-02-16 06:15:35 +0000559\file{/usr/local/bin/python}, it will assume that the libraries are in
Fred Drake2de75ec1998-04-09 14:12:11 +0000560\file{/usr/local/lib/python1.5}. (In fact, this particular path
Fred Drakee058b4f1998-02-16 06:15:35 +0000561is also the ``fallback'' location, used when no executable file named
Fred Drakec6fa34e1998-04-02 06:47:24 +0000562\file{python} is found along \envvar{PATH}.) The user can override
563this behavior by setting the environment variable \envvar{PYTHONHOME},
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000564or insert additional directories in front of the standard path by
Fred Drakec6fa34e1998-04-02 06:47:24 +0000565setting \envvar{PYTHONPATH}.
Guido van Rossum59a61351997-08-14 20:34:33 +0000566
Guido van Rossum4a944d71997-08-14 20:35:38 +0000567The embedding application can steer the search by calling
Fred Drake659ebfa2000-04-03 15:42:13 +0000568\code{Py_SetProgramName(\var{file})}\ttindex{Py_SetProgramName()} \emph{before} calling
Fred Drakec6fa34e1998-04-02 06:47:24 +0000569\cfunction{Py_Initialize()}. Note that \envvar{PYTHONHOME} still
570overrides this and \envvar{PYTHONPATH} is still inserted in front of
Fred Drakee058b4f1998-02-16 06:15:35 +0000571the standard path. An application that requires total control has to
Fred Drake659ebfa2000-04-03 15:42:13 +0000572provide its own implementation of
573\cfunction{Py_GetPath()}\ttindex{Py_GetPath()},
574\cfunction{Py_GetPrefix()}\ttindex{Py_GetPrefix()},
575\cfunction{Py_GetExecPrefix()}\ttindex{Py_GetExecPrefix()}, and
576\cfunction{Py_GetProgramFullPath()}\ttindex{Py_GetProgramFullPath()} (all
577defined in \file{Modules/getpath.c}).
Guido van Rossum59a61351997-08-14 20:34:33 +0000578
Guido van Rossum4a944d71997-08-14 20:35:38 +0000579Sometimes, it is desirable to ``uninitialize'' Python. For instance,
580the application may want to start over (make another call to
Fred Drakee058b4f1998-02-16 06:15:35 +0000581\cfunction{Py_Initialize()}) or the application is simply done with its
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000582use of Python and wants to free all memory allocated by Python. This
Fred Drakee058b4f1998-02-16 06:15:35 +0000583can be accomplished by calling \cfunction{Py_Finalize()}. The function
Fred Drake659ebfa2000-04-03 15:42:13 +0000584\cfunction{Py_IsInitialized()}\ttindex{Py_IsInitialized()} returns
585true if Python is currently in the initialized state. More
586information about these functions is given in a later chapter.
Guido van Rossum59a61351997-08-14 20:34:33 +0000587
Guido van Rossum4a944d71997-08-14 20:35:38 +0000588
Fred Drakeefd146c1999-02-15 15:30:45 +0000589\chapter{The Very High Level Layer \label{veryhigh}}
Guido van Rossum4a944d71997-08-14 20:35:38 +0000590
Fred Drakee5bf8b21998-02-12 21:22:28 +0000591The functions in this chapter will let you execute Python source code
592given in a file or a buffer, but they will not let you interact in a
593more detailed way with the interpreter.
Guido van Rossum4a944d71997-08-14 20:35:38 +0000594
Fred Drake659ebfa2000-04-03 15:42:13 +0000595Several of these functions accept a start symbol from the grammar as a
596parameter. The available start symbols are \constant{Py_eval_input},
597\constant{Py_file_input}, and \constant{Py_single_input}. These are
598described following the functions which accept them as parameters.
599
Fred Drake510d08b2000-08-14 02:50:21 +0000600Note also that several of these functions take \ctype{FILE*}
601parameters. On particular issue which needs to be handled carefully
602is that the \ctype{FILE} structure for different C libraries can be
603different and incompatible. Under Windows (at least), it is possible
604for dynamically linked extensions to actually use different libraries,
605so care should be taken that \ctype{FILE*} parameters are only passed
606to these functions if it is certain that they were created by the same
607library that the Python runtime is using.
608
Fred Drakec6fa34e1998-04-02 06:47:24 +0000609\begin{cfuncdesc}{int}{PyRun_AnyFile}{FILE *fp, char *filename}
Fred Drake0041a941999-04-29 04:20:46 +0000610 If \var{fp} refers to a file associated with an interactive device
611 (console or terminal input or \UNIX{} pseudo-terminal), return the
612 value of \cfunction{PyRun_InteractiveLoop()}, otherwise return the
613 result of \cfunction{PyRun_SimpleFile()}. If \var{filename} is
Fred Drakea8d73412000-08-11 20:39:29 +0000614 \NULL{}, this function uses \code{"???"} as the filename.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000615\end{cfuncdesc}
616
Fred Drakec6fa34e1998-04-02 06:47:24 +0000617\begin{cfuncdesc}{int}{PyRun_SimpleString}{char *command}
Fred Drake0041a941999-04-29 04:20:46 +0000618 Executes the Python source code from \var{command} in the
619 \module{__main__} module. If \module{__main__} does not already
620 exist, it is created. Returns \code{0} on success or \code{-1} if
621 an exception was raised. If there was an error, there is no way to
622 get the exception information.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000623\end{cfuncdesc}
624
Fred Drakec6fa34e1998-04-02 06:47:24 +0000625\begin{cfuncdesc}{int}{PyRun_SimpleFile}{FILE *fp, char *filename}
Fred Drake0041a941999-04-29 04:20:46 +0000626 Similar to \cfunction{PyRun_SimpleString()}, but the Python source
627 code is read from \var{fp} instead of an in-memory string.
628 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000629\end{cfuncdesc}
630
Fred Drakec6fa34e1998-04-02 06:47:24 +0000631\begin{cfuncdesc}{int}{PyRun_InteractiveOne}{FILE *fp, char *filename}
Fred Drakea8d73412000-08-11 20:39:29 +0000632 Read and execute a single statement from a file associated with an
633 interactive device. If \var{filename} is \NULL, \code{"???"} is
634 used instead. The user will be prompted using \code{sys.ps1} and
635 \code{sys.ps2}. Returns \code{0} when the input was executed
636 successfully, \code{-1} if there was an exception, or an error code
637 from the \file{errcode.h} include file distributed as part of Python
638 in case of a parse error. (Note that \file{errcode.h} is not
639 included by \file{Python.h}, so must be included specifically if
640 needed.)
Fred Drakee5bf8b21998-02-12 21:22:28 +0000641\end{cfuncdesc}
642
Fred Drakec6fa34e1998-04-02 06:47:24 +0000643\begin{cfuncdesc}{int}{PyRun_InteractiveLoop}{FILE *fp, char *filename}
Fred Drakea8d73412000-08-11 20:39:29 +0000644 Read and execute statements from a file associated with an
645 interactive device until \EOF{} is reached. If \var{filename} is
646 \NULL, \code{"???"} is used instead. The user will be prompted
647 using \code{sys.ps1} and \code{sys.ps2}. Returns \code{0} at \EOF.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000648\end{cfuncdesc}
649
Fred Drakec6fa34e1998-04-02 06:47:24 +0000650\begin{cfuncdesc}{struct _node*}{PyParser_SimpleParseString}{char *str,
651 int start}
Fred Drake0041a941999-04-29 04:20:46 +0000652 Parse Python source code from \var{str} using the start token
653 \var{start}. The result can be used to create a code object which
654 can be evaluated efficiently. This is useful if a code fragment
655 must be evaluated many times.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000656\end{cfuncdesc}
657
Fred Drakec6fa34e1998-04-02 06:47:24 +0000658\begin{cfuncdesc}{struct _node*}{PyParser_SimpleParseFile}{FILE *fp,
659 char *filename, int start}
Fred Drake0041a941999-04-29 04:20:46 +0000660 Similar to \cfunction{PyParser_SimpleParseString()}, but the Python
661 source code is read from \var{fp} instead of an in-memory string.
662 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000663\end{cfuncdesc}
664
Fred Drakec6fa34e1998-04-02 06:47:24 +0000665\begin{cfuncdesc}{PyObject*}{PyRun_String}{char *str, int start,
666 PyObject *globals,
667 PyObject *locals}
Fred Drake0041a941999-04-29 04:20:46 +0000668 Execute Python source code from \var{str} in the context specified
669 by the dictionaries \var{globals} and \var{locals}. The parameter
670 \var{start} specifies the start token that should be used to parse
671 the source code.
672
673 Returns the result of executing the code as a Python object, or
674 \NULL{} if an exception was raised.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000675\end{cfuncdesc}
676
Fred Drakec6fa34e1998-04-02 06:47:24 +0000677\begin{cfuncdesc}{PyObject*}{PyRun_File}{FILE *fp, char *filename,
678 int start, PyObject *globals,
679 PyObject *locals}
Fred Drake0041a941999-04-29 04:20:46 +0000680 Similar to \cfunction{PyRun_String()}, but the Python source code is
Fred Drake659ebfa2000-04-03 15:42:13 +0000681 read from \var{fp} instead of an in-memory string.
682 \var{filename} should be the name of the file.
Fred Drakee5bf8b21998-02-12 21:22:28 +0000683\end{cfuncdesc}
684
Fred Drakec6fa34e1998-04-02 06:47:24 +0000685\begin{cfuncdesc}{PyObject*}{Py_CompileString}{char *str, char *filename,
686 int start}
Fred Drake0041a941999-04-29 04:20:46 +0000687 Parse and compile the Python source code in \var{str}, returning the
688 resulting code object. The start token is given by \var{start};
Fred Drakec924b8d1999-08-23 18:57:25 +0000689 this can be used to constrain the code which can be compiled and should
690 be \constant{Py_eval_input}, \constant{Py_file_input}, or
691 \constant{Py_single_input}. The filename specified by
692 \var{filename} is used to construct the code object and may appear
693 in tracebacks or \exception{SyntaxError} exception messages. This
694 returns \NULL{} if the code cannot be parsed or compiled.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000695\end{cfuncdesc}
696
Fred Drakec924b8d1999-08-23 18:57:25 +0000697\begin{cvardesc}{int}{Py_eval_input}
698 The start symbol from the Python grammar for isolated expressions;
Fred Drake659ebfa2000-04-03 15:42:13 +0000699 for use with \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}.
Fred Drakec924b8d1999-08-23 18:57:25 +0000700\end{cvardesc}
701
702\begin{cvardesc}{int}{Py_file_input}
703 The start symbol from the Python grammar for sequences of statements
704 as read from a file or other source; for use with
Fred Drake659ebfa2000-04-03 15:42:13 +0000705 \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}. This is
706 the symbol to use when compiling arbitrarily long Python source code.
Fred Drakec924b8d1999-08-23 18:57:25 +0000707\end{cvardesc}
708
709\begin{cvardesc}{int}{Py_single_input}
710 The start symbol from the Python grammar for a single statement; for
Fred Drake659ebfa2000-04-03 15:42:13 +0000711 use with \cfunction{Py_CompileString()}\ttindex{Py_CompileString()}.
712 This is the symbol used for the interactive interpreter loop.
Fred Drakec924b8d1999-08-23 18:57:25 +0000713\end{cvardesc}
714
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000715
Fred Drakeefd146c1999-02-15 15:30:45 +0000716\chapter{Reference Counting \label{countingRefs}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000717
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000718The macros in this section are used for managing reference counts
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000719of Python objects.
720
721\begin{cfuncdesc}{void}{Py_INCREF}{PyObject *o}
Fred Drakec6fa34e1998-04-02 06:47:24 +0000722Increment the reference count for object \var{o}. The object must
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000723not be \NULL{}; if you aren't sure that it isn't \NULL{}, use
Fred Drakee058b4f1998-02-16 06:15:35 +0000724\cfunction{Py_XINCREF()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000725\end{cfuncdesc}
726
727\begin{cfuncdesc}{void}{Py_XINCREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000728Increment the reference count for object \var{o}. The object may be
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000729\NULL{}, in which case the macro has no effect.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000730\end{cfuncdesc}
731
732\begin{cfuncdesc}{void}{Py_DECREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000733Decrement the reference count for object \var{o}. The object must
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000734not be \NULL{}; if you aren't sure that it isn't \NULL{}, use
Fred Drakee058b4f1998-02-16 06:15:35 +0000735\cfunction{Py_XDECREF()}. If the reference count reaches zero, the
736object's type's deallocation function (which must not be \NULL{}) is
737invoked.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000738
739\strong{Warning:} The deallocation function can cause arbitrary Python
Fred Drake659ebfa2000-04-03 15:42:13 +0000740code to be invoked (e.g. when a class instance with a
741\method{__del__()} method is deallocated). While exceptions in such
742code are not propagated, the executed code has free access to all
743Python global variables. This means that any object that is reachable
744from a global variable should be in a consistent state before
745\cfunction{Py_DECREF()} is invoked. For example, code to delete an
746object from a list should copy a reference to the deleted object in a
747temporary variable, update the list data structure, and then call
748\cfunction{Py_DECREF()} for the temporary variable.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000749\end{cfuncdesc}
750
751\begin{cfuncdesc}{void}{Py_XDECREF}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +0000752Decrement the reference count for object \var{o}. The object may be
753\NULL{}, in which case the macro has no effect; otherwise the effect
754is the same as for \cfunction{Py_DECREF()}, and the same warning
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000755applies.
756\end{cfuncdesc}
757
Fred Drake659ebfa2000-04-03 15:42:13 +0000758The following functions or macros are only for use within the
759interpreter core: \cfunction{_Py_Dealloc()},
760\cfunction{_Py_ForgetReference()}, \cfunction{_Py_NewReference()}, as
761well as the global variable \cdata{_Py_RefTotal}.
Guido van Rossum580aa8d1997-11-25 15:34:51 +0000762
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000763
Fred Drakeefd146c1999-02-15 15:30:45 +0000764\chapter{Exception Handling \label{exceptionHandling}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000765
Fred Drake659ebfa2000-04-03 15:42:13 +0000766The functions described in this chapter will let you handle and raise Python
Guido van Rossumae110af1997-05-22 20:11:52 +0000767exceptions. It is important to understand some of the basics of
Fred Drake659ebfa2000-04-03 15:42:13 +0000768Python exception handling. It works somewhat like the
769\UNIX{} \cdata{errno} variable: there is a global indicator (per
770thread) of the last error that occurred. Most functions don't clear
771this on success, but will set it to indicate the cause of the error on
772failure. Most functions also return an error indicator, usually
773\NULL{} if they are supposed to return a pointer, or \code{-1} if they
774return an integer (exception: the \cfunction{PyArg_Parse*()} functions
775return \code{1} for success and \code{0} for failure). When a
776function must fail because some function it called failed, it
777generally doesn't set the error indicator; the function it called
778already set it.
Guido van Rossumae110af1997-05-22 20:11:52 +0000779
780The error indicator consists of three Python objects corresponding to
Fred Drake659ebfa2000-04-03 15:42:13 +0000781\withsubitem{(in module sys)}{
782 \ttindex{exc_type}\ttindex{exc_value}\ttindex{exc_traceback}}
Guido van Rossumae110af1997-05-22 20:11:52 +0000783the Python variables \code{sys.exc_type}, \code{sys.exc_value} and
784\code{sys.exc_traceback}. API functions exist to interact with the
785error indicator in various ways. There is a separate error indicator
786for each thread.
787
788% XXX Order of these should be more thoughtful.
789% Either alphabetical or some kind of structure.
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000790
791\begin{cfuncdesc}{void}{PyErr_Print}{}
Guido van Rossumae110af1997-05-22 20:11:52 +0000792Print a standard traceback to \code{sys.stderr} and clear the error
793indicator. Call this function only when the error indicator is set.
794(Otherwise it will cause a fatal error!)
Guido van Rossum9231c8f1997-05-15 21:43:21 +0000795\end{cfuncdesc}
796
Fred Drakec6fa34e1998-04-02 06:47:24 +0000797\begin{cfuncdesc}{PyObject*}{PyErr_Occurred}{}
Guido van Rossumae110af1997-05-22 20:11:52 +0000798Test whether the error indicator is set. If set, return the exception
Fred Drakee058b4f1998-02-16 06:15:35 +0000799\emph{type} (the first argument to the last call to one of the
Fred Drakef8830d11998-04-23 14:06:01 +0000800\cfunction{PyErr_Set*()} functions or to \cfunction{PyErr_Restore()}). If
Fred Drakee058b4f1998-02-16 06:15:35 +0000801not set, return \NULL{}. You do not own a reference to the return
802value, so you do not need to \cfunction{Py_DECREF()} it.
Fred Drake659ebfa2000-04-03 15:42:13 +0000803\strong{Note:} Do not compare the return value to a specific
Fred Drakee058b4f1998-02-16 06:15:35 +0000804exception; use \cfunction{PyErr_ExceptionMatches()} instead, shown
Fred Drake659ebfa2000-04-03 15:42:13 +0000805below. (The comparison could easily fail since the exception may be
806an instance instead of a class, in the case of a class exception, or
807it may the a subclass of the expected exception.)
Guido van Rossum42cefd01997-10-05 15:27:29 +0000808\end{cfuncdesc}
809
810\begin{cfuncdesc}{int}{PyErr_ExceptionMatches}{PyObject *exc}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000811Equivalent to
Fred Drakee058b4f1998-02-16 06:15:35 +0000812\samp{PyErr_GivenExceptionMatches(PyErr_Occurred(), \var{exc})}.
Fred Drake659ebfa2000-04-03 15:42:13 +0000813This should only be called when an exception is actually set; a memory
814access violation will occur if no exception has been raised.
Guido van Rossum42cefd01997-10-05 15:27:29 +0000815\end{cfuncdesc}
816
817\begin{cfuncdesc}{int}{PyErr_GivenExceptionMatches}{PyObject *given, PyObject *exc}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000818Return true if the \var{given} exception matches the exception in
819\var{exc}. If \var{exc} is a class object, this also returns true
Fred Drake659ebfa2000-04-03 15:42:13 +0000820when \var{given} is an instance of a subclass. If \var{exc} is a tuple, all
Guido van Rossum42cefd01997-10-05 15:27:29 +0000821exceptions in the tuple (and recursively in subtuples) are searched
Fred Drake659ebfa2000-04-03 15:42:13 +0000822for a match. If \var{given} is \NULL, a memory access violation will
823occur.
Guido van Rossum42cefd01997-10-05 15:27:29 +0000824\end{cfuncdesc}
825
826\begin{cfuncdesc}{void}{PyErr_NormalizeException}{PyObject**exc, PyObject**val, PyObject**tb}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000827Under certain circumstances, the values returned by
Fred Drakee058b4f1998-02-16 06:15:35 +0000828\cfunction{PyErr_Fetch()} below can be ``unnormalized'', meaning that
829\code{*\var{exc}} is a class object but \code{*\var{val}} is not an
830instance of the same class. This function can be used to instantiate
831the class in that case. If the values are already normalized, nothing
Fred Drake659ebfa2000-04-03 15:42:13 +0000832happens. The delayed normalization is implemented to improve
833performance.
Guido van Rossumae110af1997-05-22 20:11:52 +0000834\end{cfuncdesc}
835
836\begin{cfuncdesc}{void}{PyErr_Clear}{}
837Clear the error indicator. If the error indicator is not set, there
838is no effect.
839\end{cfuncdesc}
840
Fred Drake659ebfa2000-04-03 15:42:13 +0000841\begin{cfuncdesc}{void}{PyErr_Fetch}{PyObject **ptype, PyObject **pvalue,
842 PyObject **ptraceback}
Guido van Rossumae110af1997-05-22 20:11:52 +0000843Retrieve the error indicator into three variables whose addresses are
844passed. If the error indicator is not set, set all three variables to
845\NULL{}. If it is set, it will be cleared and you own a reference to
Fred Drake659ebfa2000-04-03 15:42:13 +0000846each object retrieved. The value and traceback object may be
847\NULL{} even when the type object is not. \strong{Note:} This
848function is normally only used by code that needs to handle exceptions
849or by code that needs to save and restore the error indicator
850temporarily.
Guido van Rossumae110af1997-05-22 20:11:52 +0000851\end{cfuncdesc}
852
Fred Drake17e63432000-08-31 05:50:40 +0000853\begin{cfuncdesc}{void}{PyErr_Restore}{PyObject *type, PyObject *value,
854 PyObject *traceback}
Guido van Rossumae110af1997-05-22 20:11:52 +0000855Set the error indicator from the three objects. If the error
856indicator is already set, it is cleared first. If the objects are
857\NULL{}, the error indicator is cleared. Do not pass a \NULL{} type
858and non-\NULL{} value or traceback. The exception type should be a
859string or class; if it is a class, the value should be an instance of
860that class. Do not pass an invalid exception type or value.
861(Violating these rules will cause subtle problems later.) This call
Fred Drake17e63432000-08-31 05:50:40 +0000862takes away a reference to each object, i.e.\ you must own a reference
Guido van Rossumae110af1997-05-22 20:11:52 +0000863to each object before the call and after the call you no longer own
864these references. (If you don't understand this, don't use this
Fred Drake659ebfa2000-04-03 15:42:13 +0000865function. I warned you.) \strong{Note:} This function is normally
Guido van Rossumae110af1997-05-22 20:11:52 +0000866only used by code that needs to save and restore the error indicator
867temporarily.
868\end{cfuncdesc}
869
870\begin{cfuncdesc}{void}{PyErr_SetString}{PyObject *type, char *message}
871This is the most common way to set the error indicator. The first
872argument specifies the exception type; it is normally one of the
Fred Drakef8830d11998-04-23 14:06:01 +0000873standard exceptions, e.g. \cdata{PyExc_RuntimeError}. You need not
Guido van Rossumae110af1997-05-22 20:11:52 +0000874increment its reference count. The second argument is an error
875message; it is converted to a string object.
876\end{cfuncdesc}
877
878\begin{cfuncdesc}{void}{PyErr_SetObject}{PyObject *type, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +0000879This function is similar to \cfunction{PyErr_SetString()} but lets you
Guido van Rossumae110af1997-05-22 20:11:52 +0000880specify an arbitrary Python object for the ``value'' of the exception.
881You need not increment its reference count.
882\end{cfuncdesc}
883
Fred Drake73577702000-04-10 18:50:14 +0000884\begin{cfuncdesc}{PyObject*}{PyErr_Format}{PyObject *exception,
Moshe Zadka57a59322000-09-01 09:47:20 +0000885 const char *format, \moreargs}
886This function sets the error indicator.
887\var{exception} should be a Python object.
888\var{fmt} should be a string, containing format codes, similar to
889\cfunction{printf}. The \code{width.precision} before a format code
890is parsed, but the width part is ignored.
891
892\begin{tableii}{c|l}{character}{Character}{Meaning}
893 \lineii{c}{Character, as an \ctype{int} parameter}
894 \lineii{d}{Number in decimal, as an \ctype{int} parameter}
895 \lineii{x}{Number in hexadecimal, as an \ctype{int} parameter}
896 \lineii{x}{A string, as a \ctype{char *} parameter}
897\end{tableii}
898
899An unrecognized format character causes all the rest of
900the format string to be copied as-is to the result string,
901and any extra arguments discarded.
902
903A new reference is returned, which is owned by the caller.
Jeremy Hylton98605b52000-04-10 18:40:57 +0000904\end{cfuncdesc}
905
Guido van Rossumae110af1997-05-22 20:11:52 +0000906\begin{cfuncdesc}{void}{PyErr_SetNone}{PyObject *type}
Fred Drakee058b4f1998-02-16 06:15:35 +0000907This is a shorthand for \samp{PyErr_SetObject(\var{type}, Py_None)}.
Guido van Rossumae110af1997-05-22 20:11:52 +0000908\end{cfuncdesc}
909
910\begin{cfuncdesc}{int}{PyErr_BadArgument}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000911This is a shorthand for \samp{PyErr_SetString(PyExc_TypeError,
Guido van Rossumae110af1997-05-22 20:11:52 +0000912\var{message})}, where \var{message} indicates that a built-in operation
913was invoked with an illegal argument. It is mostly for internal use.
914\end{cfuncdesc}
915
Fred Drakec6fa34e1998-04-02 06:47:24 +0000916\begin{cfuncdesc}{PyObject*}{PyErr_NoMemory}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000917This is a shorthand for \samp{PyErr_SetNone(PyExc_MemoryError)}; it
Guido van Rossumae110af1997-05-22 20:11:52 +0000918returns \NULL{} so an object allocation function can write
Fred Drakee058b4f1998-02-16 06:15:35 +0000919\samp{return PyErr_NoMemory();} when it runs out of memory.
Guido van Rossumae110af1997-05-22 20:11:52 +0000920\end{cfuncdesc}
921
Fred Drakec6fa34e1998-04-02 06:47:24 +0000922\begin{cfuncdesc}{PyObject*}{PyErr_SetFromErrno}{PyObject *type}
Fred Drake659ebfa2000-04-03 15:42:13 +0000923This is a convenience function to raise an exception when a C library
924function has returned an error and set the C variable \cdata{errno}.
Guido van Rossumae110af1997-05-22 20:11:52 +0000925It constructs a tuple object whose first item is the integer
Fred Drakef8830d11998-04-23 14:06:01 +0000926\cdata{errno} value and whose second item is the corresponding error
Fred Drake659ebfa2000-04-03 15:42:13 +0000927message (gotten from \cfunction{strerror()}\ttindex{strerror()}), and
928then calls
Fred Drakee058b4f1998-02-16 06:15:35 +0000929\samp{PyErr_SetObject(\var{type}, \var{object})}. On \UNIX{}, when
Fred Drakef8830d11998-04-23 14:06:01 +0000930the \cdata{errno} value is \constant{EINTR}, indicating an interrupted
Fred Drakee058b4f1998-02-16 06:15:35 +0000931system call, this calls \cfunction{PyErr_CheckSignals()}, and if that set
Guido van Rossumae110af1997-05-22 20:11:52 +0000932the error indicator, leaves it set to that. The function always
933returns \NULL{}, so a wrapper function around a system call can write
Fred Drakee058b4f1998-02-16 06:15:35 +0000934\samp{return PyErr_SetFromErrno();} when the system call returns an
935error.
Guido van Rossumae110af1997-05-22 20:11:52 +0000936\end{cfuncdesc}
937
938\begin{cfuncdesc}{void}{PyErr_BadInternalCall}{}
Fred Drakee058b4f1998-02-16 06:15:35 +0000939This is a shorthand for \samp{PyErr_SetString(PyExc_TypeError,
Guido van Rossumae110af1997-05-22 20:11:52 +0000940\var{message})}, where \var{message} indicates that an internal
Guido van Rossum5060b3b1997-08-17 18:02:23 +0000941operation (e.g. a Python/C API function) was invoked with an illegal
Guido van Rossumae110af1997-05-22 20:11:52 +0000942argument. It is mostly for internal use.
943\end{cfuncdesc}
944
945\begin{cfuncdesc}{int}{PyErr_CheckSignals}{}
946This function interacts with Python's signal handling. It checks
947whether a signal has been sent to the processes and if so, invokes the
Fred Drake4de05a91998-02-16 14:25:26 +0000948corresponding signal handler. If the
949\module{signal}\refbimodindex{signal} module is supported, this can
950invoke a signal handler written in Python. In all cases, the default
Fred Drake659ebfa2000-04-03 15:42:13 +0000951effect for \constant{SIGINT}\ttindex{SIGINT} is to raise the
952\withsubitem{(built-in exception)}{\ttindex{KeyboardInterrupt}}
953\exception{KeyboardInterrupt} exception. If an exception is raised the
Fred Drakee058b4f1998-02-16 06:15:35 +0000954error indicator is set and the function returns \code{1}; otherwise
955the function returns \code{0}. The error indicator may or may not be
956cleared if it was previously set.
Guido van Rossumae110af1997-05-22 20:11:52 +0000957\end{cfuncdesc}
958
959\begin{cfuncdesc}{void}{PyErr_SetInterrupt}{}
Fred Drake659ebfa2000-04-03 15:42:13 +0000960This function is obsolete. It simulates the effect of a
961\constant{SIGINT}\ttindex{SIGINT} signal arriving --- the next time
Fred Drakee058b4f1998-02-16 06:15:35 +0000962\cfunction{PyErr_CheckSignals()} is called,
Fred Drake659ebfa2000-04-03 15:42:13 +0000963\withsubitem{(built-in exception)}{\ttindex{KeyboardInterrupt}}
964\exception{KeyboardInterrupt} will be raised.
965It may be called without holding the interpreter lock.
Guido van Rossumae110af1997-05-22 20:11:52 +0000966\end{cfuncdesc}
967
Fred Drakec6fa34e1998-04-02 06:47:24 +0000968\begin{cfuncdesc}{PyObject*}{PyErr_NewException}{char *name,
969 PyObject *base,
970 PyObject *dict}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000971This utility function creates and returns a new exception object. The
Fred Drake659ebfa2000-04-03 15:42:13 +0000972\var{name} argument must be the name of the new exception, a C string
973of the form \code{module.class}. The \var{base} and
Fred Draked04038d2000-06-29 20:15:14 +0000974\var{dict} arguments are normally \NULL{}. This creates a
Fred Drake659ebfa2000-04-03 15:42:13 +0000975class object derived from the root for all exceptions, the built-in
976name \exception{Exception} (accessible in C as
Fred Draked04038d2000-06-29 20:15:14 +0000977\cdata{PyExc_Exception}). The \member{__module__} attribute of the
978new class is set to the first part (up to the last dot) of the
979\var{name} argument, and the class name is set to the last part (after
980the last dot). The \var{base} argument can be used to specify an
981alternate base class. The \var{dict} argument can be used to specify
982a dictionary of class variables and methods.
Guido van Rossum42cefd01997-10-05 15:27:29 +0000983\end{cfuncdesc}
984
Jeremy Hyltonb709df32000-09-01 02:47:25 +0000985\begin{cfuncdesc}{void}{PyErr_WriteUnraisable}{PyObject *obj}
986This utility function prints a warning message to \var{sys.stderr}
987when an exception has been set but it is impossible for the
988interpreter to actually raise the exception. It is used, for example,
989when an exception occurs in an \member{__del__} method.
990
991The function is called with a single argument \var{obj} that
992identifies where the context in which the unraisable exception
993occurred. The repr of \var{obj} will be printed in the warning
994message.
995\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +0000996
Fred Drakeefd146c1999-02-15 15:30:45 +0000997\section{Standard Exceptions \label{standardExceptions}}
Guido van Rossumae110af1997-05-22 20:11:52 +0000998
999All standard Python exceptions are available as global variables whose
Fred Drake659ebfa2000-04-03 15:42:13 +00001000names are \samp{PyExc_} followed by the Python exception name. These
1001have the type \ctype{PyObject*}; they are all class objects. For
1002completeness, here are all the variables:
1003
1004\begin{tableiii}{l|l|c}{cdata}{C Name}{Python Name}{Notes}
1005 \lineiii{PyExc_Exception}{\exception{Exception}}{(1)}
1006 \lineiii{PyExc_StandardError}{\exception{StandardError}}{(1)}
1007 \lineiii{PyExc_ArithmeticError}{\exception{ArithmeticError}}{(1)}
1008 \lineiii{PyExc_LookupError}{\exception{LookupError}}{(1)}
1009 \lineiii{PyExc_AssertionError}{\exception{AssertionError}}{}
1010 \lineiii{PyExc_AttributeError}{\exception{AttributeError}}{}
1011 \lineiii{PyExc_EOFError}{\exception{EOFError}}{}
1012 \lineiii{PyExc_EnvironmentError}{\exception{EnvironmentError}}{(1)}
1013 \lineiii{PyExc_FloatingPointError}{\exception{FloatingPointError}}{}
1014 \lineiii{PyExc_IOError}{\exception{IOError}}{}
1015 \lineiii{PyExc_ImportError}{\exception{ImportError}}{}
1016 \lineiii{PyExc_IndexError}{\exception{IndexError}}{}
1017 \lineiii{PyExc_KeyError}{\exception{KeyError}}{}
1018 \lineiii{PyExc_KeyboardInterrupt}{\exception{KeyboardInterrupt}}{}
1019 \lineiii{PyExc_MemoryError}{\exception{MemoryError}}{}
1020 \lineiii{PyExc_NameError}{\exception{NameError}}{}
1021 \lineiii{PyExc_NotImplementedError}{\exception{NotImplementedError}}{}
1022 \lineiii{PyExc_OSError}{\exception{OSError}}{}
1023 \lineiii{PyExc_OverflowError}{\exception{OverflowError}}{}
1024 \lineiii{PyExc_RuntimeError}{\exception{RuntimeError}}{}
1025 \lineiii{PyExc_SyntaxError}{\exception{SyntaxError}}{}
1026 \lineiii{PyExc_SystemError}{\exception{SystemError}}{}
1027 \lineiii{PyExc_SystemExit}{\exception{SystemExit}}{}
1028 \lineiii{PyExc_TypeError}{\exception{TypeError}}{}
1029 \lineiii{PyExc_ValueError}{\exception{ValueError}}{}
Fred Drakea8d73412000-08-11 20:39:29 +00001030 \lineiii{PyExc_WindowsError}{\exception{WindowsError}}{(2)}
Fred Drake659ebfa2000-04-03 15:42:13 +00001031 \lineiii{PyExc_ZeroDivisionError}{\exception{ZeroDivisionError}}{}
1032\end{tableiii}
1033
1034\noindent
Fred Drakea8d73412000-08-11 20:39:29 +00001035Notes:
Fred Drake659ebfa2000-04-03 15:42:13 +00001036\begin{description}
1037\item[(1)]
Fred Draked04038d2000-06-29 20:15:14 +00001038 This is a base class for other standard exceptions.
Fred Drakea8d73412000-08-11 20:39:29 +00001039
1040\item[(2)]
1041 Only defined on Windows; protect code that uses this by testing that
1042 the preprocessor macro \code{MS_WINDOWS} is defined.
Fred Drake659ebfa2000-04-03 15:42:13 +00001043\end{description}
1044
1045
1046\section{Deprecation of String Exceptions}
1047
Fred Draked04038d2000-06-29 20:15:14 +00001048All exceptions built into Python or provided in the standard library
1049are derived from \exception{Exception}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001050\withsubitem{(built-in exception)}{\ttindex{Exception}}
Fred Drake659ebfa2000-04-03 15:42:13 +00001051
Fred Draked04038d2000-06-29 20:15:14 +00001052String exceptions are still supported in the interpreter to allow
Fred Drake659ebfa2000-04-03 15:42:13 +00001053existing code to run unmodified, but this will also change in a future
1054release.
Guido van Rossumae110af1997-05-22 20:11:52 +00001055
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001056
Fred Drakeefd146c1999-02-15 15:30:45 +00001057\chapter{Utilities \label{utilities}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001058
1059The functions in this chapter perform various utility tasks, such as
Fred Drake659ebfa2000-04-03 15:42:13 +00001060parsing function arguments and constructing Python values from C
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001061values.
1062
Fred Drakeefd146c1999-02-15 15:30:45 +00001063\section{OS Utilities \label{os}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001064
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001065\begin{cfuncdesc}{int}{Py_FdIsInteractive}{FILE *fp, char *filename}
Fred Drakee058b4f1998-02-16 06:15:35 +00001066Return true (nonzero) if the standard I/O file \var{fp} with name
1067\var{filename} is deemed interactive. This is the case for files for
1068which \samp{isatty(fileno(\var{fp}))} is true. If the global flag
Fred Drakef8830d11998-04-23 14:06:01 +00001069\cdata{Py_InteractiveFlag} is true, this function also returns true if
Fred Drakee058b4f1998-02-16 06:15:35 +00001070the \var{name} pointer is \NULL{} or if the name is equal to one of
Fred Drakea8455ab2000-06-16 19:58:42 +00001071the strings \code{'<stdin>'} or \code{'???'}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001072\end{cfuncdesc}
1073
1074\begin{cfuncdesc}{long}{PyOS_GetLastModificationTime}{char *filename}
Fred Drakee058b4f1998-02-16 06:15:35 +00001075Return the time of last modification of the file \var{filename}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001076The result is encoded in the same way as the timestamp returned by
Fred Drake659ebfa2000-04-03 15:42:13 +00001077the standard C library function \cfunction{time()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001078\end{cfuncdesc}
1079
Fred Drakecabbc3b2000-06-28 15:53:13 +00001080\begin{cfuncdesc}{void}{PyOS_AfterFork}{}
1081Function to update some internal state after a process fork; this
1082should be called in the new process if the Python interpreter will
1083continue to be used. If a new executable is loaded into the new
1084process, this function does not need to be called.
1085\end{cfuncdesc}
1086
Fred Drake17e63432000-08-31 05:50:40 +00001087\begin{cfuncdesc}{int}{PyOS_CheckStack}{}
1088Return true when the interpreter runs out of stack space. This is a
1089reliable check, but is only available when \code{USE_STACKCHECK} is
1090defined (currently on Windows using the Microsoft Visual C++ compiler
1091and on the Macintosh). \code{USE_CHECKSTACK} will be defined
1092automatically; you should never change the definition in your own
1093code.
1094\end{cfuncdesc}
1095
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001096
Fred Drakeefd146c1999-02-15 15:30:45 +00001097\section{Process Control \label{processControl}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001098
1099\begin{cfuncdesc}{void}{Py_FatalError}{char *message}
1100Print a fatal error message and kill the process. No cleanup is
1101performed. This function should only be invoked when a condition is
1102detected that would make it dangerous to continue using the Python
1103interpreter; e.g., when the object administration appears to be
Fred Drake659ebfa2000-04-03 15:42:13 +00001104corrupted. On \UNIX{}, the standard C library function
1105\cfunction{abort()}\ttindex{abort()} is called which will attempt to
1106produce a \file{core} file.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001107\end{cfuncdesc}
1108
1109\begin{cfuncdesc}{void}{Py_Exit}{int status}
Fred Drake659ebfa2000-04-03 15:42:13 +00001110Exit the current process. This calls
1111\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} and
1112then calls the standard C library function
1113\code{exit(\var{status})}\ttindex{exit()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001114\end{cfuncdesc}
1115
1116\begin{cfuncdesc}{int}{Py_AtExit}{void (*func) ()}
Fred Drake659ebfa2000-04-03 15:42:13 +00001117Register a cleanup function to be called by
1118\cfunction{Py_Finalize()}\ttindex{Py_Finalize()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001119The cleanup function will be called with no arguments and should
Fred Drake659ebfa2000-04-03 15:42:13 +00001120return no value. At most 32 \index{cleanup functions}cleanup
1121functions can be registered.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001122When the registration is successful, \cfunction{Py_AtExit()} returns
1123\code{0}; on failure, it returns \code{-1}. The cleanup function
1124registered last is called first. Each cleanup function will be called
1125at most once. Since Python's internal finallization will have
1126completed before the cleanup function, no Python APIs should be called
1127by \var{func}.
1128\end{cfuncdesc}
1129
1130
Fred Drakeefd146c1999-02-15 15:30:45 +00001131\section{Importing Modules \label{importing}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001132
Fred Drakec6fa34e1998-04-02 06:47:24 +00001133\begin{cfuncdesc}{PyObject*}{PyImport_ImportModule}{char *name}
Fred Drake659ebfa2000-04-03 15:42:13 +00001134This is a simplified interface to
1135\cfunction{PyImport_ImportModuleEx()} below, leaving the
1136\var{globals} and \var{locals} arguments set to \NULL{}. When the
1137\var{name} argument contains a dot (i.e., when it specifies a
1138submodule of a package), the \var{fromlist} argument is set to the
1139list \code{['*']} so that the return value is the named module rather
1140than the top-level package containing it as would otherwise be the
1141case. (Unfortunately, this has an additional side effect when
1142\var{name} in fact specifies a subpackage instead of a submodule: the
1143submodules specified in the package's \code{__all__} variable are
1144\index{package variable!\code{__all__}}
1145\withsubitem{(package variable)}{\ttindex{__all__}}loaded.) Return a
1146new reference to the imported module, or
1147\NULL{} with an exception set on failure (the module may still be
1148created in this case --- examine \code{sys.modules} to find out).
1149\withsubitem{(in module sys)}{\ttindex{modules}}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001150\end{cfuncdesc}
1151
Fred Drakec6fa34e1998-04-02 06:47:24 +00001152\begin{cfuncdesc}{PyObject*}{PyImport_ImportModuleEx}{char *name, PyObject *globals, PyObject *locals, PyObject *fromlist}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001153Import a module. This is best described by referring to the built-in
Fred Drake53fb7721998-02-16 06:23:20 +00001154Python function \function{__import__()}\bifuncindex{__import__}, as
1155the standard \function{__import__()} function calls this function
1156directly.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001157
Guido van Rossum42cefd01997-10-05 15:27:29 +00001158The return value is a new reference to the imported module or
Guido van Rossum580aa8d1997-11-25 15:34:51 +00001159top-level package, or \NULL{} with an exception set on failure
Guido van Rossumc44d3d61997-10-06 05:10:47 +00001160(the module may still be created in this case). Like for
Fred Drakee058b4f1998-02-16 06:15:35 +00001161\function{__import__()}, the return value when a submodule of a
1162package was requested is normally the top-level package, unless a
1163non-empty \var{fromlist} was given.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001164\end{cfuncdesc}
1165
Fred Drakec6fa34e1998-04-02 06:47:24 +00001166\begin{cfuncdesc}{PyObject*}{PyImport_Import}{PyObject *name}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001167This is a higher-level interface that calls the current ``import hook
Fred Drakee058b4f1998-02-16 06:15:35 +00001168function''. It invokes the \function{__import__()} function from the
Guido van Rossum42cefd01997-10-05 15:27:29 +00001169\code{__builtins__} of the current globals. This means that the
1170import is done using whatever import hooks are installed in the
Fred Drake4de05a91998-02-16 14:25:26 +00001171current environment, e.g. by \module{rexec}\refstmodindex{rexec} or
1172\module{ihooks}\refstmodindex{ihooks}.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001173\end{cfuncdesc}
1174
Fred Drakec6fa34e1998-04-02 06:47:24 +00001175\begin{cfuncdesc}{PyObject*}{PyImport_ReloadModule}{PyObject *m}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001176Reload a module. This is best described by referring to the built-in
Fred Drake53fb7721998-02-16 06:23:20 +00001177Python function \function{reload()}\bifuncindex{reload}, as the standard
Fred Drakee058b4f1998-02-16 06:15:35 +00001178\function{reload()} function calls this function directly. Return a
1179new reference to the reloaded module, or \NULL{} with an exception set
1180on failure (the module still exists in this case).
Guido van Rossum42cefd01997-10-05 15:27:29 +00001181\end{cfuncdesc}
1182
Fred Drakec6fa34e1998-04-02 06:47:24 +00001183\begin{cfuncdesc}{PyObject*}{PyImport_AddModule}{char *name}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001184Return the module object corresponding to a module name. The
1185\var{name} argument may be of the form \code{package.module}). First
1186check the modules dictionary if there's one there, and if not, create
Fred Drake659ebfa2000-04-03 15:42:13 +00001187a new one and insert in in the modules dictionary.
Guido van Rossuma096a2e1998-11-02 17:02:42 +00001188Warning: this function does not load or import the module; if the
1189module wasn't already loaded, you will get an empty module object.
1190Use \cfunction{PyImport_ImportModule()} or one of its variants to
1191import a module.
Fred Drake659ebfa2000-04-03 15:42:13 +00001192Return \NULL{} with an exception set on failure.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001193\end{cfuncdesc}
1194
Fred Drakec6fa34e1998-04-02 06:47:24 +00001195\begin{cfuncdesc}{PyObject*}{PyImport_ExecCodeModule}{char *name, PyObject *co}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001196Given a module name (possibly of the form \code{package.module}) and a
1197code object read from a Python bytecode file or obtained from the
Fred Drake53fb7721998-02-16 06:23:20 +00001198built-in function \function{compile()}\bifuncindex{compile}, load the
1199module. Return a new reference to the module object, or \NULL{} with
1200an exception set if an error occurred (the module may still be created
1201in this case). (This function would reload the module if it was
1202already imported.)
Guido van Rossum42cefd01997-10-05 15:27:29 +00001203\end{cfuncdesc}
1204
1205\begin{cfuncdesc}{long}{PyImport_GetMagicNumber}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00001206Return the magic number for Python bytecode files (a.k.a.
1207\file{.pyc} and \file{.pyo} files). The magic number should be
1208present in the first four bytes of the bytecode file, in little-endian
1209byte order.
Guido van Rossum42cefd01997-10-05 15:27:29 +00001210\end{cfuncdesc}
1211
Fred Drakec6fa34e1998-04-02 06:47:24 +00001212\begin{cfuncdesc}{PyObject*}{PyImport_GetModuleDict}{}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001213Return the dictionary used for the module administration
1214(a.k.a. \code{sys.modules}). Note that this is a per-interpreter
1215variable.
1216\end{cfuncdesc}
1217
1218\begin{cfuncdesc}{void}{_PyImport_Init}{}
1219Initialize the import mechanism. For internal use only.
1220\end{cfuncdesc}
1221
1222\begin{cfuncdesc}{void}{PyImport_Cleanup}{}
1223Empty the module table. For internal use only.
1224\end{cfuncdesc}
1225
1226\begin{cfuncdesc}{void}{_PyImport_Fini}{}
1227Finalize the import mechanism. For internal use only.
1228\end{cfuncdesc}
1229
Fred Drakec6fa34e1998-04-02 06:47:24 +00001230\begin{cfuncdesc}{PyObject*}{_PyImport_FindExtension}{char *, char *}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001231For internal use only.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00001232\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001233
Fred Drakec6fa34e1998-04-02 06:47:24 +00001234\begin{cfuncdesc}{PyObject*}{_PyImport_FixupExtension}{char *, char *}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001235For internal use only.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00001236\end{cfuncdesc}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001237
Fred Drake1d158692000-06-18 05:21:21 +00001238\begin{cfuncdesc}{int}{PyImport_ImportFrozenModule}{char *name}
1239Load a frozen module named \var{name}. Return \code{1} for success,
1240\code{0} if the module is not found, and \code{-1} with an exception
1241set if the initialization failed. To access the imported module on a
1242successful load, use \cfunction{PyImport_ImportModule()}.
Fred Drakee058b4f1998-02-16 06:15:35 +00001243(Note the misnomer --- this function would reload the module if it was
Guido van Rossum42cefd01997-10-05 15:27:29 +00001244already imported.)
1245\end{cfuncdesc}
1246
Fred Drake659ebfa2000-04-03 15:42:13 +00001247\begin{ctypedesc}[_frozen]{struct _frozen}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001248This is the structure type definition for frozen module descriptors,
Fred Drakec6fa34e1998-04-02 06:47:24 +00001249as generated by the \program{freeze}\index{freeze utility} utility
1250(see \file{Tools/freeze/} in the Python source distribution). Its
Fred Drakee0d9a832000-09-01 05:30:00 +00001251definition, found in \file{Include/import.h}, is:
Fred Drakec6fa34e1998-04-02 06:47:24 +00001252
Guido van Rossum9faf4c51997-10-07 14:38:54 +00001253\begin{verbatim}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001254struct _frozen {
Fred Drake36fbe761997-10-13 18:18:33 +00001255 char *name;
1256 unsigned char *code;
1257 int size;
Guido van Rossum42cefd01997-10-05 15:27:29 +00001258};
Guido van Rossum9faf4c51997-10-07 14:38:54 +00001259\end{verbatim}
Guido van Rossum42cefd01997-10-05 15:27:29 +00001260\end{ctypedesc}
1261
Fred Drakec6fa34e1998-04-02 06:47:24 +00001262\begin{cvardesc}{struct _frozen*}{PyImport_FrozenModules}
Fred Drakef8830d11998-04-23 14:06:01 +00001263This pointer is initialized to point to an array of \ctype{struct
Fred Drake659ebfa2000-04-03 15:42:13 +00001264_frozen} records, terminated by one whose members are all
1265\NULL{} or zero. When a frozen module is imported, it is searched in
1266this table. Third-party code could play tricks with this to provide a
Guido van Rossum42cefd01997-10-05 15:27:29 +00001267dynamically created collection of frozen modules.
1268\end{cvardesc}
1269
Fred Drakee0d9a832000-09-01 05:30:00 +00001270\begin{cfuncdesc}{int}{PyImport_AppendInittab}{char *name,
1271 void (*initfunc)(void)}
1272Add a single module to the existing table of built-in modules. This
1273is a convenience wrapper around \cfunction{PyImport_ExtendInittab()},
1274returning \code{-1} if the table could not be extended. The new
1275module can be imported by the name \var{name}, and uses the function
1276\var{initfunc} as the initialization function called on the first
1277attempted import. This should be called before
1278\cfunction{Py_Initialize()}.
1279\end{cfuncdesc}
1280
1281\begin{ctypedesc}[_inittab]{struct _inittab}
1282Structure describing a single entry in the list of built-in modules.
1283Each of these structures gives the name and initialization function
1284for a module built into the interpreter. Programs which embed Python
1285may use an array of these structures in conjunction with
1286\cfunction{PyImport_ExtendInittab()} to provide additional built-in
1287modules. The structure is defined in \file{Include/import.h} as:
1288
1289\begin{verbatim}
1290struct _inittab {
1291 char *name;
1292 void (*initfunc)(void);
1293};
1294\end{verbatim}
1295\end{ctypedesc}
1296
1297\begin{cfuncdesc}{int}{PyImport_ExtendInittab}{struct _inittab *newtab}
1298Add a collection of modules to the table of built-in modules. The
1299\var{newtab} array must end with a sentinel entry which contains
1300\NULL{} for the \member{name} field; failure to provide the sentinel
1301value can result in a memory fault. Returns \code{0} on success or
1302\code{-1} if insufficient memory could be allocated to extend the
1303internal table. In the event of failure, no modules are added to the
1304internal table. This should be called before
1305\cfunction{Py_Initialize()}.
1306\end{cfuncdesc}
1307
Guido van Rossum42cefd01997-10-05 15:27:29 +00001308
Fred Drakeefd146c1999-02-15 15:30:45 +00001309\chapter{Abstract Objects Layer \label{abstract}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001310
1311The functions in this chapter interact with Python objects regardless
1312of their type, or with wide classes of object types (e.g. all
1313numerical types, or all sequence types). When used on object types
Fred Drake659ebfa2000-04-03 15:42:13 +00001314for which they do not apply, they will raise a Python exception.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001315
Fred Drakeefd146c1999-02-15 15:30:45 +00001316\section{Object Protocol \label{object}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001317
1318\begin{cfuncdesc}{int}{PyObject_Print}{PyObject *o, FILE *fp, int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00001319Print an object \var{o}, on file \var{fp}. Returns \code{-1} on error.
1320The flags argument is used to enable certain printing options. The
1321only option currently supported is \constant{Py_PRINT_RAW}; if given,
1322the \function{str()} of the object is written instead of the
1323\function{repr()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001324\end{cfuncdesc}
1325
1326\begin{cfuncdesc}{int}{PyObject_HasAttrString}{PyObject *o, char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001327Returns \code{1} if \var{o} has the attribute \var{attr_name}, and
1328\code{0} otherwise. This is equivalent to the Python expression
1329\samp{hasattr(\var{o}, \var{attr_name})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001330This function always succeeds.
1331\end{cfuncdesc}
1332
Fred Drake659ebfa2000-04-03 15:42:13 +00001333\begin{cfuncdesc}{PyObject*}{PyObject_GetAttrString}{PyObject *o,
1334 char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001335Retrieve an attribute named \var{attr_name} from object \var{o}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001336Returns the attribute value on success, or \NULL{} on failure.
Fred Drakee058b4f1998-02-16 06:15:35 +00001337This is the equivalent of the Python expression
1338\samp{\var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001339\end{cfuncdesc}
1340
1341
1342\begin{cfuncdesc}{int}{PyObject_HasAttr}{PyObject *o, PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001343Returns \code{1} if \var{o} has the attribute \var{attr_name}, and
1344\code{0} otherwise. This is equivalent to the Python expression
1345\samp{hasattr(\var{o}, \var{attr_name})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001346This function always succeeds.
1347\end{cfuncdesc}
1348
1349
Fred Drake659ebfa2000-04-03 15:42:13 +00001350\begin{cfuncdesc}{PyObject*}{PyObject_GetAttr}{PyObject *o,
1351 PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001352Retrieve an attribute named \var{attr_name} from object \var{o}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001353Returns the attribute value on success, or \NULL{} on failure.
Fred Drakee058b4f1998-02-16 06:15:35 +00001354This is the equivalent of the Python expression
1355\samp{\var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001356\end{cfuncdesc}
1357
1358
1359\begin{cfuncdesc}{int}{PyObject_SetAttrString}{PyObject *o, char *attr_name, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001360Set the value of the attribute named \var{attr_name}, for object
1361\var{o}, to the value \var{v}. Returns \code{-1} on failure. This is
1362the equivalent of the Python statement \samp{\var{o}.\var{attr_name} =
1363\var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001364\end{cfuncdesc}
1365
1366
1367\begin{cfuncdesc}{int}{PyObject_SetAttr}{PyObject *o, PyObject *attr_name, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001368Set the value of the attribute named \var{attr_name}, for
1369object \var{o},
1370to the value \var{v}. Returns \code{-1} on failure. This is
1371the equivalent of the Python statement \samp{\var{o}.\var{attr_name} =
1372\var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001373\end{cfuncdesc}
1374
1375
1376\begin{cfuncdesc}{int}{PyObject_DelAttrString}{PyObject *o, char *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001377Delete attribute named \var{attr_name}, for object \var{o}. Returns
1378\code{-1} on failure. This is the equivalent of the Python
1379statement: \samp{del \var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001380\end{cfuncdesc}
1381
1382
1383\begin{cfuncdesc}{int}{PyObject_DelAttr}{PyObject *o, PyObject *attr_name}
Fred Drakee058b4f1998-02-16 06:15:35 +00001384Delete attribute named \var{attr_name}, for object \var{o}. Returns
1385\code{-1} on failure. This is the equivalent of the Python
1386statement \samp{del \var{o}.\var{attr_name}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001387\end{cfuncdesc}
1388
1389
1390\begin{cfuncdesc}{int}{PyObject_Cmp}{PyObject *o1, PyObject *o2, int *result}
Fred Drakee058b4f1998-02-16 06:15:35 +00001391Compare the values of \var{o1} and \var{o2} using a routine provided
1392by \var{o1}, if one exists, otherwise with a routine provided by
1393\var{o2}. The result of the comparison is returned in \var{result}.
1394Returns \code{-1} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001395statement\bifuncindex{cmp} \samp{\var{result} = cmp(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001396\end{cfuncdesc}
1397
1398
1399\begin{cfuncdesc}{int}{PyObject_Compare}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001400Compare the values of \var{o1} and \var{o2} using a routine provided
1401by \var{o1}, if one exists, otherwise with a routine provided by
1402\var{o2}. Returns the result of the comparison on success. On error,
1403the value returned is undefined; use \cfunction{PyErr_Occurred()} to
Fred Drake659ebfa2000-04-03 15:42:13 +00001404detect an error. This is equivalent to the Python
1405expression\bifuncindex{cmp} \samp{cmp(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001406\end{cfuncdesc}
1407
1408
1409\begin{cfuncdesc}{PyObject*}{PyObject_Repr}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001410Compute a string representation of object \var{o}. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001411string representation on success, \NULL{} on failure. This is
Fred Drakee058b4f1998-02-16 06:15:35 +00001412the equivalent of the Python expression \samp{repr(\var{o})}.
1413Called by the \function{repr()}\bifuncindex{repr} built-in function
1414and by reverse quotes.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001415\end{cfuncdesc}
1416
1417
1418\begin{cfuncdesc}{PyObject*}{PyObject_Str}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001419Compute a string representation of object \var{o}. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001420string representation on success, \NULL{} on failure. This is
Fred Drakee058b4f1998-02-16 06:15:35 +00001421the equivalent of the Python expression \samp{str(\var{o})}.
1422Called by the \function{str()}\bifuncindex{str} built-in function and
1423by the \keyword{print} statement.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001424\end{cfuncdesc}
1425
1426
1427\begin{cfuncdesc}{int}{PyCallable_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001428Determine if the object \var{o} is callable. Return \code{1} if the
Fred Drakee058b4f1998-02-16 06:15:35 +00001429object is callable and \code{0} otherwise.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001430This function always succeeds.
1431\end{cfuncdesc}
1432
1433
Fred Drake659ebfa2000-04-03 15:42:13 +00001434\begin{cfuncdesc}{PyObject*}{PyObject_CallObject}{PyObject *callable_object,
1435 PyObject *args}
Fred Drakee058b4f1998-02-16 06:15:35 +00001436Call a callable Python object \var{callable_object}, with
1437arguments given by the tuple \var{args}. If no arguments are
Fred Drake659ebfa2000-04-03 15:42:13 +00001438needed, then \var{args} may be \NULL{}. Returns the result of the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001439call on success, or \NULL{} on failure. This is the equivalent
Fred Drakee058b4f1998-02-16 06:15:35 +00001440of the Python expression \samp{apply(\var{o}, \var{args})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001441\bifuncindex{apply}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001442\end{cfuncdesc}
1443
1444\begin{cfuncdesc}{PyObject*}{PyObject_CallFunction}{PyObject *callable_object, char *format, ...}
Fred Drakee058b4f1998-02-16 06:15:35 +00001445Call a callable Python object \var{callable_object}, with a
Fred Drake659ebfa2000-04-03 15:42:13 +00001446variable number of C arguments. The C arguments are described
Fred Drakee058b4f1998-02-16 06:15:35 +00001447using a \cfunction{Py_BuildValue()} style format string. The format may
1448be \NULL{}, indicating that no arguments are provided. Returns the
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001449result of the call on success, or \NULL{} on failure. This is
Fred Drakee058b4f1998-02-16 06:15:35 +00001450the equivalent of the Python expression \samp{apply(\var{o},
Fred Drake659ebfa2000-04-03 15:42:13 +00001451\var{args})}.\bifuncindex{apply}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001452\end{cfuncdesc}
1453
1454
1455\begin{cfuncdesc}{PyObject*}{PyObject_CallMethod}{PyObject *o, char *m, char *format, ...}
Fred Drakee058b4f1998-02-16 06:15:35 +00001456Call the method named \var{m} of object \var{o} with a variable number
Fred Drake659ebfa2000-04-03 15:42:13 +00001457of C arguments. The C arguments are described by a
Fred Drakee058b4f1998-02-16 06:15:35 +00001458\cfunction{Py_BuildValue()} format string. The format may be \NULL{},
1459indicating that no arguments are provided. Returns the result of the
1460call on success, or \NULL{} on failure. This is the equivalent of the
1461Python expression \samp{\var{o}.\var{method}(\var{args})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001462Note that special method names, such as \method{__add__()},
1463\method{__getitem__()}, and so on are not supported. The specific
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001464abstract-object routines for these must be used.
1465\end{cfuncdesc}
1466
1467
1468\begin{cfuncdesc}{int}{PyObject_Hash}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001469Compute and return the hash value of an object \var{o}. On
1470failure, return \code{-1}. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001471expression \samp{hash(\var{o})}.\bifuncindex{hash}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001472\end{cfuncdesc}
1473
1474
1475\begin{cfuncdesc}{int}{PyObject_IsTrue}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001476Returns \code{1} if the object \var{o} is considered to be true, and
1477\code{0} otherwise. This is equivalent to the Python expression
1478\samp{not not \var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001479This function always succeeds.
1480\end{cfuncdesc}
1481
1482
1483\begin{cfuncdesc}{PyObject*}{PyObject_Type}{PyObject *o}
1484On success, returns a type object corresponding to the object
Fred Drakee058b4f1998-02-16 06:15:35 +00001485type of object \var{o}. On failure, returns \NULL{}. This is
1486equivalent to the Python expression \samp{type(\var{o})}.
Fred Drake53fb7721998-02-16 06:23:20 +00001487\bifuncindex{type}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001488\end{cfuncdesc}
1489
1490\begin{cfuncdesc}{int}{PyObject_Length}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001491Return the length of object \var{o}. If the object \var{o} provides
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001492both sequence and mapping protocols, the sequence length is
Fred Drake659ebfa2000-04-03 15:42:13 +00001493returned. On error, \code{-1} is returned. This is the equivalent
1494to the Python expression \samp{len(\var{o})}.\bifuncindex{len}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001495\end{cfuncdesc}
1496
1497
1498\begin{cfuncdesc}{PyObject*}{PyObject_GetItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001499Return element of \var{o} corresponding to the object \var{key} or
1500\NULL{} on failure. This is the equivalent of the Python expression
1501\samp{\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001502\end{cfuncdesc}
1503
1504
1505\begin{cfuncdesc}{int}{PyObject_SetItem}{PyObject *o, PyObject *key, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001506Map the object \var{key} to the value \var{v}.
1507Returns \code{-1} on failure. This is the equivalent
1508of the Python statement \samp{\var{o}[\var{key}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001509\end{cfuncdesc}
1510
1511
Guido van Rossumd1dbf631999-01-22 20:10:49 +00001512\begin{cfuncdesc}{int}{PyObject_DelItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001513Delete the mapping for \var{key} from \var{o}. Returns \code{-1} on
1514failure. This is the equivalent of the Python statement \samp{del
1515\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001516\end{cfuncdesc}
1517
Andrew M. Kuchling8c46b302000-07-13 23:58:16 +00001518\begin{cfuncdesc}{int}{PyObject_AsFileDescriptor}{PyObject *o}
1519Derives a file-descriptor from a Python object. If the object
1520is an integer or long integer, its value is returned. If not, the
1521object's \method{fileno()} method is called if it exists; the method
1522must return an integer or long integer, which is returned as the file
1523descriptor value. Returns \code{-1} on failure.
1524\end{cfuncdesc}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001525
Fred Drakeefd146c1999-02-15 15:30:45 +00001526\section{Number Protocol \label{number}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001527
1528\begin{cfuncdesc}{int}{PyNumber_Check}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001529Returns \code{1} if the object \var{o} provides numeric protocols, and
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001530false otherwise.
1531This function always succeeds.
1532\end{cfuncdesc}
1533
1534
1535\begin{cfuncdesc}{PyObject*}{PyNumber_Add}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001536Returns the result of adding \var{o1} and \var{o2}, or \NULL{} on
1537failure. This is the equivalent of the Python expression
1538\samp{\var{o1} + \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001539\end{cfuncdesc}
1540
1541
1542\begin{cfuncdesc}{PyObject*}{PyNumber_Subtract}{PyObject *o1, PyObject *o2}
Fred Drake659ebfa2000-04-03 15:42:13 +00001543Returns the result of subtracting \var{o2} from \var{o1}, or
1544\NULL{} on failure. This is the equivalent of the Python expression
Fred Drakee058b4f1998-02-16 06:15:35 +00001545\samp{\var{o1} - \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001546\end{cfuncdesc}
1547
1548
1549\begin{cfuncdesc}{PyObject*}{PyNumber_Multiply}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001550Returns the result of multiplying \var{o1} and \var{o2}, or \NULL{} on
1551failure. This is the equivalent of the Python expression
1552\samp{\var{o1} * \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001553\end{cfuncdesc}
1554
1555
1556\begin{cfuncdesc}{PyObject*}{PyNumber_Divide}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001557Returns the result of dividing \var{o1} by \var{o2}, or \NULL{} on
1558failure.
1559This is the equivalent of the Python expression \samp{\var{o1} /
1560\var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001561\end{cfuncdesc}
1562
1563
1564\begin{cfuncdesc}{PyObject*}{PyNumber_Remainder}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001565Returns the remainder of dividing \var{o1} by \var{o2}, or \NULL{} on
1566failure. This is the equivalent of the Python expression
Fred Drake659ebfa2000-04-03 15:42:13 +00001567\samp{\var{o1} \%\ \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001568\end{cfuncdesc}
1569
1570
1571\begin{cfuncdesc}{PyObject*}{PyNumber_Divmod}{PyObject *o1, PyObject *o2}
Fred Drake53fb7721998-02-16 06:23:20 +00001572See the built-in function \function{divmod()}\bifuncindex{divmod}.
1573Returns \NULL{} on failure. This is the equivalent of the Python
1574expression \samp{divmod(\var{o1}, \var{o2})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001575\end{cfuncdesc}
1576
1577
1578\begin{cfuncdesc}{PyObject*}{PyNumber_Power}{PyObject *o1, PyObject *o2, PyObject *o3}
Fred Drake53fb7721998-02-16 06:23:20 +00001579See the built-in function \function{pow()}\bifuncindex{pow}. Returns
1580\NULL{} on failure. This is the equivalent of the Python expression
Fred Drakee058b4f1998-02-16 06:15:35 +00001581\samp{pow(\var{o1}, \var{o2}, \var{o3})}, where \var{o3} is optional.
Fred Drake659ebfa2000-04-03 15:42:13 +00001582If \var{o3} is to be ignored, pass \cdata{Py_None} in its place
1583(passing \NULL{} for \var{o3} would cause an illegal memory access).
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001584\end{cfuncdesc}
1585
1586
1587\begin{cfuncdesc}{PyObject*}{PyNumber_Negative}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001588Returns the negation of \var{o} on success, or \NULL{} on failure.
1589This is the equivalent of the Python expression \samp{-\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001590\end{cfuncdesc}
1591
1592
1593\begin{cfuncdesc}{PyObject*}{PyNumber_Positive}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001594Returns \var{o} on success, or \NULL{} on failure.
1595This is the equivalent of the Python expression \samp{+\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001596\end{cfuncdesc}
1597
1598
1599\begin{cfuncdesc}{PyObject*}{PyNumber_Absolute}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001600Returns the absolute value of \var{o}, or \NULL{} on failure. This is
1601the equivalent of the Python expression \samp{abs(\var{o})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00001602\bifuncindex{abs}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001603\end{cfuncdesc}
1604
1605
1606\begin{cfuncdesc}{PyObject*}{PyNumber_Invert}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001607Returns the bitwise negation of \var{o} on success, or \NULL{} on
1608failure. This is the equivalent of the Python expression
1609\samp{\~\var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001610\end{cfuncdesc}
1611
1612
1613\begin{cfuncdesc}{PyObject*}{PyNumber_Lshift}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001614Returns the result of left shifting \var{o1} by \var{o2} on success,
1615or \NULL{} on failure. This is the equivalent of the Python
1616expression \samp{\var{o1} << \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001617\end{cfuncdesc}
1618
1619
1620\begin{cfuncdesc}{PyObject*}{PyNumber_Rshift}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001621Returns the result of right shifting \var{o1} by \var{o2} on success,
1622or \NULL{} on failure. This is the equivalent of the Python
1623expression \samp{\var{o1} >> \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001624\end{cfuncdesc}
1625
1626
1627\begin{cfuncdesc}{PyObject*}{PyNumber_And}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001628Returns the result of ``anding'' \var{o2} and \var{o2} on success and
1629\NULL{} on failure. This is the equivalent of the Python
1630expression \samp{\var{o1} and \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001631\end{cfuncdesc}
1632
1633
1634\begin{cfuncdesc}{PyObject*}{PyNumber_Xor}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001635Returns the bitwise exclusive or of \var{o1} by \var{o2} on success,
1636or \NULL{} on failure. This is the equivalent of the Python
1637expression \samp{\var{o1} \^{ }\var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001638\end{cfuncdesc}
1639
1640\begin{cfuncdesc}{PyObject*}{PyNumber_Or}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001641Returns the result of \var{o1} and \var{o2} on success, or \NULL{} on
1642failure. This is the equivalent of the Python expression
1643\samp{\var{o1} or \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001644\end{cfuncdesc}
1645
Fred Drakee058b4f1998-02-16 06:15:35 +00001646\begin{cfuncdesc}{PyObject*}{PyNumber_Coerce}{PyObject **p1, PyObject **p2}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001647This function takes the addresses of two variables of type
Fred Drake659ebfa2000-04-03 15:42:13 +00001648\ctype{PyObject*}. If the objects pointed to by \code{*\var{p1}} and
1649\code{*\var{p2}} have the same type, increment their reference count
1650and return \code{0} (success). If the objects can be converted to a
1651common numeric type, replace \code{*p1} and \code{*p2} by their
1652converted value (with 'new' reference counts), and return \code{0}.
1653If no conversion is possible, or if some other error occurs, return
1654\code{-1} (failure) and don't increment the reference counts. The
1655call \code{PyNumber_Coerce(\&o1, \&o2)} is equivalent to the Python
1656statement \samp{\var{o1}, \var{o2} = coerce(\var{o1}, \var{o2})}.
1657\bifuncindex{coerce}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001658\end{cfuncdesc}
1659
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001660\begin{cfuncdesc}{PyObject*}{PyNumber_Int}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001661Returns the \var{o} converted to an integer object on success, or
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001662\NULL{} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001663expression \samp{int(\var{o})}.\bifuncindex{int}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001664\end{cfuncdesc}
1665
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001666\begin{cfuncdesc}{PyObject*}{PyNumber_Long}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001667Returns the \var{o} converted to a long integer object on success,
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001668or \NULL{} on failure. This is the equivalent of the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00001669expression \samp{long(\var{o})}.\bifuncindex{long}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001670\end{cfuncdesc}
1671
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001672\begin{cfuncdesc}{PyObject*}{PyNumber_Float}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001673Returns the \var{o} converted to a float object on success, or
1674\NULL{} on failure. This is the equivalent of the Python expression
1675\samp{float(\var{o})}.\bifuncindex{float}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001676\end{cfuncdesc}
1677
1678
Fred Drakeefd146c1999-02-15 15:30:45 +00001679\section{Sequence Protocol \label{sequence}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001680
1681\begin{cfuncdesc}{int}{PySequence_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001682Return \code{1} if the object provides sequence protocol, and
1683\code{0} otherwise. This function always succeeds.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001684\end{cfuncdesc}
1685
Fred Drake659ebfa2000-04-03 15:42:13 +00001686\begin{cfuncdesc}{int}{PySequence_Length}{PyObject *o}
1687Returns the number of objects in sequence \var{o} on success, and
1688\code{-1} on failure. For objects that do not provide sequence
1689protocol, this is equivalent to the Python expression
1690\samp{len(\var{o})}.\bifuncindex{len}
1691\end{cfuncdesc}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001692
1693\begin{cfuncdesc}{PyObject*}{PySequence_Concat}{PyObject *o1, PyObject *o2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001694Return the concatenation of \var{o1} and \var{o2} on success, and \NULL{} on
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001695failure. This is the equivalent of the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00001696expression \samp{\var{o1} + \var{o2}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001697\end{cfuncdesc}
1698
1699
1700\begin{cfuncdesc}{PyObject*}{PySequence_Repeat}{PyObject *o, int count}
Fred Drake659ebfa2000-04-03 15:42:13 +00001701Return the result of repeating sequence object
1702\var{o} \var{count} times, or \NULL{} on failure. This is the
1703equivalent of the Python expression \samp{\var{o} * \var{count}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001704\end{cfuncdesc}
1705
1706
1707\begin{cfuncdesc}{PyObject*}{PySequence_GetItem}{PyObject *o, int i}
Fred Drakee058b4f1998-02-16 06:15:35 +00001708Return the \var{i}th element of \var{o}, or \NULL{} on failure. This
1709is the equivalent of the Python expression \samp{\var{o}[\var{i}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001710\end{cfuncdesc}
1711
1712
1713\begin{cfuncdesc}{PyObject*}{PySequence_GetSlice}{PyObject *o, int i1, int i2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001714Return the slice of sequence object \var{o} between \var{i1} and
1715\var{i2}, or \NULL{} on failure. This is the equivalent of the Python
1716expression \samp{\var{o}[\var{i1}:\var{i2}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001717\end{cfuncdesc}
1718
1719
1720\begin{cfuncdesc}{int}{PySequence_SetItem}{PyObject *o, int i, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001721Assign object \var{v} to the \var{i}th element of \var{o}.
1722Returns \code{-1} on failure. This is the equivalent of the Python
1723statement \samp{\var{o}[\var{i}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001724\end{cfuncdesc}
1725
1726\begin{cfuncdesc}{int}{PySequence_DelItem}{PyObject *o, int i}
Fred Drakee058b4f1998-02-16 06:15:35 +00001727Delete the \var{i}th element of object \var{v}. Returns
1728\code{-1} on failure. This is the equivalent of the Python
1729statement \samp{del \var{o}[\var{i}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001730\end{cfuncdesc}
1731
Fred Drake659ebfa2000-04-03 15:42:13 +00001732\begin{cfuncdesc}{int}{PySequence_SetSlice}{PyObject *o, int i1,
1733 int i2, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001734Assign the sequence object \var{v} to the slice in sequence
1735object \var{o} from \var{i1} to \var{i2}. This is the equivalent of
1736the Python statement \samp{\var{o}[\var{i1}:\var{i2}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001737\end{cfuncdesc}
1738
1739\begin{cfuncdesc}{int}{PySequence_DelSlice}{PyObject *o, int i1, int i2}
Fred Drakee058b4f1998-02-16 06:15:35 +00001740Delete the slice in sequence object \var{o} from \var{i1} to \var{i2}.
1741Returns \code{-1} on failure. This is the equivalent of the Python
1742statement \samp{del \var{o}[\var{i1}:\var{i2}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001743\end{cfuncdesc}
1744
1745\begin{cfuncdesc}{PyObject*}{PySequence_Tuple}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001746Returns the \var{o} as a tuple on success, and \NULL{} on failure.
Fred Drake659ebfa2000-04-03 15:42:13 +00001747This is equivalent to the Python expression \samp{tuple(\var{o})}.
1748\bifuncindex{tuple}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001749\end{cfuncdesc}
1750
1751\begin{cfuncdesc}{int}{PySequence_Count}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00001752Return the number of occurrences of \var{value} in \var{o}, that is,
1753return the number of keys for which \code{\var{o}[\var{key}] ==
1754\var{value}}. On failure, return \code{-1}. This is equivalent to
1755the Python expression \samp{\var{o}.count(\var{value})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001756\end{cfuncdesc}
1757
Fred Drake659ebfa2000-04-03 15:42:13 +00001758\begin{cfuncdesc}{int}{PySequence_Contains}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00001759Determine if \var{o} contains \var{value}. If an item in \var{o} is
1760equal to \var{value}, return \code{1}, otherwise return \code{0}. On
1761error, return \code{-1}. This is equivalent to the Python expression
1762\samp{\var{value} in \var{o}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001763\end{cfuncdesc}
1764
1765\begin{cfuncdesc}{int}{PySequence_Index}{PyObject *o, PyObject *value}
Fred Drakee058b4f1998-02-16 06:15:35 +00001766Return the first index \var{i} for which \code{\var{o}[\var{i}] ==
1767\var{value}}. On error, return \code{-1}. This is equivalent to
1768the Python expression \samp{\var{o}.index(\var{value})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001769\end{cfuncdesc}
1770
Fred Drakea8455ab2000-06-16 19:58:42 +00001771\begin{cfuncdesc}{PyObject*}{PySequence_List}{PyObject *o}
1772Return a list object with the same contents as the arbitrary sequence
1773\var{o}. The returned list is guaranteed to be new.
1774\end{cfuncdesc}
1775
1776\begin{cfuncdesc}{PyObject*}{PySequence_Tuple}{PyObject *o}
1777Return a tuple object with the same contents as the arbitrary sequence
1778\var{o}. If \var{o} is a tuple, a new reference will be returned,
1779otherwise a tuple will be constructed with the appropriate contents.
1780\end{cfuncdesc}
1781
Fred Drakef39ed671998-02-26 22:01:23 +00001782
Fred Drakeefd146c1999-02-15 15:30:45 +00001783\section{Mapping Protocol \label{mapping}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001784
1785\begin{cfuncdesc}{int}{PyMapping_Check}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001786Return \code{1} if the object provides mapping protocol, and
1787\code{0} otherwise. This function always succeeds.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001788\end{cfuncdesc}
1789
1790
1791\begin{cfuncdesc}{int}{PyMapping_Length}{PyObject *o}
Fred Drake659ebfa2000-04-03 15:42:13 +00001792Returns the number of keys in object \var{o} on success, and
1793\code{-1} on failure. For objects that do not provide mapping
1794protocol, this is equivalent to the Python expression
1795\samp{len(\var{o})}.\bifuncindex{len}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001796\end{cfuncdesc}
1797
1798
1799\begin{cfuncdesc}{int}{PyMapping_DelItemString}{PyObject *o, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001800Remove the mapping for object \var{key} from the object \var{o}.
1801Return \code{-1} on failure. This is equivalent to
1802the Python statement \samp{del \var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001803\end{cfuncdesc}
1804
1805
1806\begin{cfuncdesc}{int}{PyMapping_DelItem}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001807Remove the mapping for object \var{key} from the object \var{o}.
1808Return \code{-1} on failure. This is equivalent to
1809the Python statement \samp{del \var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001810\end{cfuncdesc}
1811
1812
1813\begin{cfuncdesc}{int}{PyMapping_HasKeyString}{PyObject *o, char *key}
Fred Drake659ebfa2000-04-03 15:42:13 +00001814On success, return \code{1} if the mapping object has the key
1815\var{key} and \code{0} otherwise. This is equivalent to the Python
1816expression \samp{\var{o}.has_key(\var{key})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001817This function always succeeds.
1818\end{cfuncdesc}
1819
1820
1821\begin{cfuncdesc}{int}{PyMapping_HasKey}{PyObject *o, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001822Return \code{1} if the mapping object has the key \var{key} and
1823\code{0} otherwise. This is equivalent to the Python expression
1824\samp{\var{o}.has_key(\var{key})}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001825This function always succeeds.
1826\end{cfuncdesc}
1827
1828
1829\begin{cfuncdesc}{PyObject*}{PyMapping_Keys}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001830On success, return a list of the keys in object \var{o}. On
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001831failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00001832expression \samp{\var{o}.keys()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001833\end{cfuncdesc}
1834
1835
1836\begin{cfuncdesc}{PyObject*}{PyMapping_Values}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001837On success, return a list of the values in object \var{o}. On
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001838failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00001839expression \samp{\var{o}.values()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001840\end{cfuncdesc}
1841
1842
1843\begin{cfuncdesc}{PyObject*}{PyMapping_Items}{PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00001844On success, return a list of the items in object \var{o}, where
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001845each item is a tuple containing a key-value pair. On
1846failure, return \NULL{}. This is equivalent to the Python
Fred Drakee058b4f1998-02-16 06:15:35 +00001847expression \samp{\var{o}.items()}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001848\end{cfuncdesc}
1849
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001850
1851\begin{cfuncdesc}{PyObject*}{PyMapping_GetItemString}{PyObject *o, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00001852Return element of \var{o} corresponding to the object \var{key} or
1853\NULL{} on failure. This is the equivalent of the Python expression
1854\samp{\var{o}[\var{key}]}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001855\end{cfuncdesc}
1856
Guido van Rossum0a0f11b1998-10-16 17:43:53 +00001857\begin{cfuncdesc}{int}{PyMapping_SetItemString}{PyObject *o, char *key, PyObject *v}
Fred Drakee058b4f1998-02-16 06:15:35 +00001858Map the object \var{key} to the value \var{v} in object \var{o}.
1859Returns \code{-1} on failure. This is the equivalent of the Python
1860statement \samp{\var{o}[\var{key}] = \var{v}}.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001861\end{cfuncdesc}
1862
1863
Fred Drakeefd146c1999-02-15 15:30:45 +00001864\chapter{Concrete Objects Layer \label{concrete}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001865
1866The functions in this chapter are specific to certain Python object
1867types. Passing them an object of the wrong type is not a good idea;
1868if you receive an object from a Python program and you are not sure
1869that it has the right type, you must perform a type check first;
Fred Drake659ebfa2000-04-03 15:42:13 +00001870for example. to check that an object is a dictionary, use
Fred Drakee5bf8b21998-02-12 21:22:28 +00001871\cfunction{PyDict_Check()}. The chapter is structured like the
1872``family tree'' of Python object types.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001873
1874
Fred Drakeefd146c1999-02-15 15:30:45 +00001875\section{Fundamental Objects \label{fundamental}}
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001876
Fred Drakee5bf8b21998-02-12 21:22:28 +00001877This section describes Python type objects and the singleton object
1878\code{None}.
1879
1880
Fred Drakeefd146c1999-02-15 15:30:45 +00001881\subsection{Type Objects \label{typeObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001882
Fred Drake659ebfa2000-04-03 15:42:13 +00001883\obindex{type}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001884\begin{ctypedesc}{PyTypeObject}
Fred Drake659ebfa2000-04-03 15:42:13 +00001885The C structure of the objects used to describe built-in types.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001886\end{ctypedesc}
1887
Fred Drake659ebfa2000-04-03 15:42:13 +00001888\begin{cvardesc}{PyObject*}{PyType_Type}
Fred Drakeefd146c1999-02-15 15:30:45 +00001889This is the type object for type objects; it is the same object as
1890\code{types.TypeType} in the Python layer.
Fred Drake659ebfa2000-04-03 15:42:13 +00001891\withsubitem{(in module types)}{\ttindex{TypeType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001892\end{cvardesc}
1893
Fred Drake659ebfa2000-04-03 15:42:13 +00001894\begin{cfuncdesc}{int}{PyType_Check}{PyObject *o}
1895Returns true is the object \var{o} is a type object.
1896\end{cfuncdesc}
1897
1898\begin{cfuncdesc}{int}{PyType_HasFeature}{PyObject *o, int feature}
1899Returns true if the type object \var{o} sets the feature
1900\var{feature}. Type features are denoted by single bit flags. The
1901only defined feature flag is \constant{Py_TPFLAGS_HAVE_GETCHARBUFFER},
1902described in section \ref{buffer-structs}.
1903\end{cfuncdesc}
1904
Fred Drakee5bf8b21998-02-12 21:22:28 +00001905
Fred Drakeefd146c1999-02-15 15:30:45 +00001906\subsection{The None Object \label{noneObject}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001907
Fred Drake659ebfa2000-04-03 15:42:13 +00001908\obindex{None@\texttt{None}}
1909Note that the \ctype{PyTypeObject} for \code{None} is not directly
1910exposed in the Python/C API. Since \code{None} is a singleton,
1911testing for object identity (using \samp{==} in C) is sufficient.
1912There is no \cfunction{PyNone_Check()} function for the same reason.
1913
1914\begin{cvardesc}{PyObject*}{Py_None}
Guido van Rossum44475131998-04-21 15:30:01 +00001915The Python \code{None} object, denoting lack of value. This object has
1916no methods.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001917\end{cvardesc}
1918
1919
Fred Drakeefd146c1999-02-15 15:30:45 +00001920\section{Sequence Objects \label{sequenceObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001921
Fred Drake659ebfa2000-04-03 15:42:13 +00001922\obindex{sequence}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001923Generic operations on sequence objects were discussed in the previous
1924chapter; this section deals with the specific kinds of sequence
1925objects that are intrinsic to the Python language.
1926
1927
Fred Drakeefd146c1999-02-15 15:30:45 +00001928\subsection{String Objects \label{stringObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001929
Fred Drake659ebfa2000-04-03 15:42:13 +00001930\obindex{string}
Fred Drakee5bf8b21998-02-12 21:22:28 +00001931\begin{ctypedesc}{PyStringObject}
Fred Drakef8830d11998-04-23 14:06:01 +00001932This subtype of \ctype{PyObject} represents a Python string object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001933\end{ctypedesc}
1934
1935\begin{cvardesc}{PyTypeObject}{PyString_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00001936This instance of \ctype{PyTypeObject} represents the Python string
1937type; it is the same object as \code{types.TypeType} in the Python
1938layer.\withsubitem{(in module types)}{\ttindex{StringType}}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00001939\end{cvardesc}
1940
1941\begin{cfuncdesc}{int}{PyString_Check}{PyObject *o}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00001942Returns true if the object \var{o} is a string object.
Guido van Rossum9231c8f1997-05-15 21:43:21 +00001943\end{cfuncdesc}
1944
Fred Drakec6fa34e1998-04-02 06:47:24 +00001945\begin{cfuncdesc}{PyObject*}{PyString_FromString}{const char *v}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00001946Returns a new string object with the value \var{v} on success, and
1947\NULL{} on failure.
Fred Drakec6fa34e1998-04-02 06:47:24 +00001948\end{cfuncdesc}
1949
Fred Drake659ebfa2000-04-03 15:42:13 +00001950\begin{cfuncdesc}{PyObject*}{PyString_FromStringAndSize}{const char *v,
1951 int len}
1952Returns a new string object with the value \var{v} and length
1953\var{len} on success, and \NULL{} on failure. If \var{v} is \NULL{},
1954the contents of the string are uninitialized.
1955\end{cfuncdesc}
1956
Fred Drakec6fa34e1998-04-02 06:47:24 +00001957\begin{cfuncdesc}{int}{PyString_Size}{PyObject *string}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00001958Returns the length of the string in string object \var{string}.
Fred Drakec6fa34e1998-04-02 06:47:24 +00001959\end{cfuncdesc}
1960
Fred Drake659ebfa2000-04-03 15:42:13 +00001961\begin{cfuncdesc}{int}{PyString_GET_SIZE}{PyObject *string}
1962Macro form of \cfunction{PyString_GetSize()} but without error
1963checking.
1964\end{cfuncdesc}
1965
Fred Drakec6fa34e1998-04-02 06:47:24 +00001966\begin{cfuncdesc}{char*}{PyString_AsString}{PyObject *string}
Fred Drake659ebfa2000-04-03 15:42:13 +00001967Returns a null-terminated representation of the contents of
1968\var{string}. The pointer refers to the internal buffer of
1969\var{string}, not a copy. The data must not be modified in any way.
1970It must not be de-allocated.
1971\end{cfuncdesc}
1972
1973\begin{cfuncdesc}{char*}{PyString_AS_STRING}{PyObject *string}
1974Macro form of \cfunction{PyString_AsString()} but without error
1975checking.
Fred Drakec6fa34e1998-04-02 06:47:24 +00001976\end{cfuncdesc}
1977
1978\begin{cfuncdesc}{void}{PyString_Concat}{PyObject **string,
1979 PyObject *newpart}
Fred Drake66b989c1999-02-15 20:15:39 +00001980Creates a new string object in \var{*string} containing the
Fred Drakeddc6c272000-03-31 18:22:38 +00001981contents of \var{newpart} appended to \var{string}; the caller will
1982own the new reference. The reference to the old value of \var{string}
1983will be stolen. If the new string
Fred Drake66b989c1999-02-15 20:15:39 +00001984cannot be created, the old reference to \var{string} will still be
1985discarded and the value of \var{*string} will be set to
1986\NULL{}; the appropriate exception will be set.
Fred Drakec6fa34e1998-04-02 06:47:24 +00001987\end{cfuncdesc}
1988
1989\begin{cfuncdesc}{void}{PyString_ConcatAndDel}{PyObject **string,
1990 PyObject *newpart}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00001991Creates a new string object in \var{*string} containing the contents
Guido van Rossum44475131998-04-21 15:30:01 +00001992of \var{newpart} appended to \var{string}. This version decrements
1993the reference count of \var{newpart}.
Fred Drakec6fa34e1998-04-02 06:47:24 +00001994\end{cfuncdesc}
1995
1996\begin{cfuncdesc}{int}{_PyString_Resize}{PyObject **string, int newsize}
Guido van Rossum44475131998-04-21 15:30:01 +00001997A way to resize a string object even though it is ``immutable''.
1998Only use this to build up a brand new string object; don't use this if
1999the string may already be known in other parts of the code.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002000\end{cfuncdesc}
2001
2002\begin{cfuncdesc}{PyObject*}{PyString_Format}{PyObject *format,
2003 PyObject *args}
Guido van Rossum44475131998-04-21 15:30:01 +00002004Returns a new string object from \var{format} and \var{args}. Analogous
Fred Drake659ebfa2000-04-03 15:42:13 +00002005to \code{\var{format} \%\ \var{args}}. The \var{args} argument must be
Guido van Rossum44475131998-04-21 15:30:01 +00002006a tuple.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002007\end{cfuncdesc}
2008
2009\begin{cfuncdesc}{void}{PyString_InternInPlace}{PyObject **string}
Guido van Rossum44475131998-04-21 15:30:01 +00002010Intern the argument \var{*string} in place. The argument must be the
2011address of a pointer variable pointing to a Python string object.
2012If there is an existing interned string that is the same as
2013\var{*string}, it sets \var{*string} to it (decrementing the reference
2014count of the old string object and incrementing the reference count of
2015the interned string object), otherwise it leaves \var{*string} alone
2016and interns it (incrementing its reference count). (Clarification:
2017even though there is a lot of talk about reference counts, think of
Fred Drakef8830d11998-04-23 14:06:01 +00002018this function as reference-count-neutral; you own the object after
2019the call if and only if you owned it before the call.)
Fred Drakec6fa34e1998-04-02 06:47:24 +00002020\end{cfuncdesc}
2021
2022\begin{cfuncdesc}{PyObject*}{PyString_InternFromString}{const char *v}
Fred Drakef8830d11998-04-23 14:06:01 +00002023A combination of \cfunction{PyString_FromString()} and
2024\cfunction{PyString_InternInPlace()}, returning either a new string object
Guido van Rossum44475131998-04-21 15:30:01 +00002025that has been interned, or a new (``owned'') reference to an earlier
2026interned string object with the same value.
Fred Drakec6fa34e1998-04-02 06:47:24 +00002027\end{cfuncdesc}
2028
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002029\begin{cfuncdesc}{PyObject*}{PyString_Decode}{const char *s,
2030 int size,
2031 const char *encoding,
2032 const char *errors}
2033Create a string object by decoding \var{size} bytes of the encoded
2034buffer \var{s}. \var{encoding} and \var{errors} have the same meaning
2035as the parameters of the same name in the unicode() builtin
2036function. The codec to be used is looked up using the Python codec
2037registry. Returns \NULL{} in case an exception was raised by the
2038codec.
2039\end{cfuncdesc}
2040
2041\begin{cfuncdesc}{PyObject*}{PyString_Encode}{const Py_UNICODE *s,
2042 int size,
2043 const char *encoding,
2044 const char *errors}
2045Encodes the \ctype{Py_UNICODE} buffer of the given size and returns a
2046Python string object. \var{encoding} and \var{errors} have the same
2047meaning as the parameters of the same name in the string .encode()
2048method. The codec to be used is looked up using the Python codec
2049registry. Returns \NULL{} in case an exception was raised by the
2050codec.
2051\end{cfuncdesc}
2052
2053\begin{cfuncdesc}{PyObject*}{PyString_AsEncodedString}{PyObject *unicode,
2054 const char *encoding,
2055 const char *errors}
2056Encodes a string object and returns the result as Python string
2057object. \var{encoding} and \var{errors} have the same meaning as the
2058parameters of the same name in the string .encode() method. The codec
2059to be used is looked up using the Python codec registry. Returns
2060\NULL{} in case an exception was raised by the codec.
2061\end{cfuncdesc}
2062
Fred Drakee5bf8b21998-02-12 21:22:28 +00002063
Fred Drakea4cd2612000-04-06 14:10:29 +00002064\subsection{Unicode Objects \label{unicodeObjects}}
2065\sectionauthor{Marc-Andre Lemburg}{mal@lemburg.com}
2066
2067%--- Unicode Type -------------------------------------------------------
2068
2069These are the basic Unicode object types used for the Unicode
2070implementation in Python:
2071
2072\begin{ctypedesc}{Py_UNICODE}
2073This type represents a 16-bit unsigned storage type which is used by
2074Python internally as basis for holding Unicode ordinals. On platforms
2075where \ctype{wchar_t} is available and also has 16-bits,
2076\ctype{Py_UNICODE} is a typedef alias for \ctype{wchar_t} to enhance
2077native platform compatibility. On all other platforms,
2078\ctype{Py_UNICODE} is a typedef alias for \ctype{unsigned short}.
2079\end{ctypedesc}
2080
2081\begin{ctypedesc}{PyUnicodeObject}
2082This subtype of \ctype{PyObject} represents a Python Unicode object.
2083\end{ctypedesc}
2084
2085\begin{cvardesc}{PyTypeObject}{PyUnicode_Type}
2086This instance of \ctype{PyTypeObject} represents the Python Unicode type.
2087\end{cvardesc}
2088
2089%--- These are really C macros... is there a macrodesc TeX macro ?
2090
2091The following APIs are really C macros and can be used to do fast
2092checks and to access internal read-only data of Unicode objects:
2093
2094\begin{cfuncdesc}{int}{PyUnicode_Check}{PyObject *o}
2095Returns true if the object \var{o} is a Unicode object.
2096\end{cfuncdesc}
2097
2098\begin{cfuncdesc}{int}{PyUnicode_GET_SIZE}{PyObject *o}
2099Returns the size of the object. o has to be a
2100PyUnicodeObject (not checked).
2101\end{cfuncdesc}
2102
2103\begin{cfuncdesc}{int}{PyUnicode_GET_DATA_SIZE}{PyObject *o}
2104Returns the size of the object's internal buffer in bytes. o has to be
2105a PyUnicodeObject (not checked).
2106\end{cfuncdesc}
2107
Fred Drake992fe5a2000-06-16 21:04:15 +00002108\begin{cfuncdesc}{Py_UNICODE*}{PyUnicode_AS_UNICODE}{PyObject *o}
Fred Drakea4cd2612000-04-06 14:10:29 +00002109Returns a pointer to the internal Py_UNICODE buffer of the object. o
2110has to be a PyUnicodeObject (not checked).
2111\end{cfuncdesc}
2112
Fred Drake992fe5a2000-06-16 21:04:15 +00002113\begin{cfuncdesc}{const char*}{PyUnicode_AS_DATA}{PyObject *o}
Fred Drakea4cd2612000-04-06 14:10:29 +00002114Returns a (const char *) pointer to the internal buffer of the object.
2115o has to be a PyUnicodeObject (not checked).
2116\end{cfuncdesc}
2117
2118% --- Unicode character properties ---------------------------------------
2119
2120Unicode provides many different character properties. The most often
2121needed ones are available through these macros which are mapped to C
2122functions depending on the Python configuration.
2123
2124\begin{cfuncdesc}{int}{Py_UNICODE_ISSPACE}{Py_UNICODE ch}
2125Returns 1/0 depending on whether \var{ch} is a whitespace character.
2126\end{cfuncdesc}
2127
2128\begin{cfuncdesc}{int}{Py_UNICODE_ISLOWER}{Py_UNICODE ch}
2129Returns 1/0 depending on whether \var{ch} is a lowercase character.
2130\end{cfuncdesc}
2131
2132\begin{cfuncdesc}{int}{Py_UNICODE_ISUPPER}{Py_UNICODE ch}
Fred Drakeae96aab2000-07-03 13:38:10 +00002133Returns 1/0 depending on whether \var{ch} is an uppercase character.
Fred Drakea4cd2612000-04-06 14:10:29 +00002134\end{cfuncdesc}
2135
2136\begin{cfuncdesc}{int}{Py_UNICODE_ISTITLE}{Py_UNICODE ch}
2137Returns 1/0 depending on whether \var{ch} is a titlecase character.
2138\end{cfuncdesc}
2139
2140\begin{cfuncdesc}{int}{Py_UNICODE_ISLINEBREAK}{Py_UNICODE ch}
2141Returns 1/0 depending on whether \var{ch} is a linebreak character.
2142\end{cfuncdesc}
2143
2144\begin{cfuncdesc}{int}{Py_UNICODE_ISDECIMAL}{Py_UNICODE ch}
2145Returns 1/0 depending on whether \var{ch} is a decimal character.
2146\end{cfuncdesc}
2147
2148\begin{cfuncdesc}{int}{Py_UNICODE_ISDIGIT}{Py_UNICODE ch}
2149Returns 1/0 depending on whether \var{ch} is a digit character.
2150\end{cfuncdesc}
2151
2152\begin{cfuncdesc}{int}{Py_UNICODE_ISNUMERIC}{Py_UNICODE ch}
2153Returns 1/0 depending on whether \var{ch} is a numeric character.
2154\end{cfuncdesc}
2155
Fred Drakeae96aab2000-07-03 13:38:10 +00002156\begin{cfuncdesc}{int}{Py_UNICODE_ISALPHA}{Py_UNICODE ch}
2157Returns 1/0 depending on whether \var{ch} is an alphabetic character.
2158\end{cfuncdesc}
2159
2160\begin{cfuncdesc}{int}{Py_UNICODE_ISALNUM}{Py_UNICODE ch}
2161Returns 1/0 depending on whether \var{ch} is an alphanumeric character.
2162\end{cfuncdesc}
2163
Fred Drakea4cd2612000-04-06 14:10:29 +00002164These APIs can be used for fast direct character conversions:
2165
2166\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOLOWER}{Py_UNICODE ch}
2167Returns the character \var{ch} converted to lower case.
2168\end{cfuncdesc}
2169
2170\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOUPPER}{Py_UNICODE ch}
2171Returns the character \var{ch} converted to upper case.
2172\end{cfuncdesc}
2173
2174\begin{cfuncdesc}{Py_UNICODE}{Py_UNICODE_TOTITLE}{Py_UNICODE ch}
2175Returns the character \var{ch} converted to title case.
2176\end{cfuncdesc}
2177
2178\begin{cfuncdesc}{int}{Py_UNICODE_TODECIMAL}{Py_UNICODE ch}
2179Returns the character \var{ch} converted to a decimal positive integer.
2180Returns -1 in case this is not possible. Does not raise exceptions.
2181\end{cfuncdesc}
2182
2183\begin{cfuncdesc}{int}{Py_UNICODE_TODIGIT}{Py_UNICODE ch}
2184Returns the character \var{ch} converted to a single digit integer.
2185Returns -1 in case this is not possible. Does not raise exceptions.
2186\end{cfuncdesc}
2187
2188\begin{cfuncdesc}{double}{Py_UNICODE_TONUMERIC}{Py_UNICODE ch}
2189Returns the character \var{ch} converted to a (positive) double.
2190Returns -1.0 in case this is not possible. Does not raise exceptions.
2191\end{cfuncdesc}
2192
2193% --- Plain Py_UNICODE ---------------------------------------------------
2194
2195To create Unicode objects and access their basic sequence properties,
2196use these APIs:
2197
2198\begin{cfuncdesc}{PyObject*}{PyUnicode_FromUnicode}{const Py_UNICODE *u,
2199 int size}
2200
2201Create a Unicode Object from the Py_UNICODE buffer \var{u} of the
2202given size. \var{u} may be \NULL{} which causes the contents to be
2203undefined. It is the user's responsibility to fill in the needed data.
2204The buffer is copied into the new object.
2205\end{cfuncdesc}
2206
Fred Drake1d158692000-06-18 05:21:21 +00002207\begin{cfuncdesc}{Py_UNICODE*}{PyUnicode_AsUnicode}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002208Return a read-only pointer to the Unicode object's internal
2209\ctype{Py_UNICODE} buffer.
2210\end{cfuncdesc}
2211
2212\begin{cfuncdesc}{int}{PyUnicode_GetSize}{PyObject *unicode}
2213Return the length of the Unicode object.
2214\end{cfuncdesc}
2215
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002216\begin{cfuncdesc}{PyObject*}{PyUnicode_FromEncodedObject}{PyObject *obj,
2217 const char *encoding,
2218 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002219
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002220Coerce an encoded object obj to an Unicode object and return a
2221reference with incremented refcount.
Fred Drakea4cd2612000-04-06 14:10:29 +00002222
2223Coercion is done in the following way:
2224\begin{enumerate}
2225\item Unicode objects are passed back as-is with incremented
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002226 refcount. Note: these cannot be decoded; passing a non-NULL
2227 value for encoding will result in a TypeError.
Fred Drakea4cd2612000-04-06 14:10:29 +00002228
2229\item String and other char buffer compatible objects are decoded
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002230 according to the given encoding and using the error handling
2231 defined by errors. Both can be NULL to have the interface use
2232 the default values (see the next section for details).
Fred Drakea4cd2612000-04-06 14:10:29 +00002233
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002234\item All other objects cause an exception.
Fred Drakea4cd2612000-04-06 14:10:29 +00002235\end{enumerate}
2236The API returns NULL in case of an error. The caller is responsible
2237for decref'ing the returned objects.
2238\end{cfuncdesc}
2239
Marc-André Lemburg5a20b212000-07-07 15:47:06 +00002240\begin{cfuncdesc}{PyObject*}{PyUnicode_FromObject}{PyObject *obj}
2241
2242Shortcut for PyUnicode_FromEncodedObject(obj, NULL, ``strict'')
2243which is used throughout the interpreter whenever coercion to
2244Unicode is needed.
2245\end{cfuncdesc}
2246
Fred Drakea4cd2612000-04-06 14:10:29 +00002247% --- wchar_t support for platforms which support it ---------------------
2248
2249If the platform supports \ctype{wchar_t} and provides a header file
2250wchar.h, Python can interface directly to this type using the
2251following functions. Support is optimized if Python's own
2252\ctype{Py_UNICODE} type is identical to the system's \ctype{wchar_t}.
2253
2254\begin{cfuncdesc}{PyObject*}{PyUnicode_FromWideChar}{const wchar_t *w,
2255 int size}
2256Create a Unicode Object from the \ctype{whcar_t} buffer \var{w} of the
2257given size. Returns \NULL{} on failure.
2258\end{cfuncdesc}
2259
2260\begin{cfuncdesc}{int}{PyUnicode_AsWideChar}{PyUnicodeObject *unicode,
2261 wchar_t *w,
2262 int size}
Fred Drakea4cd2612000-04-06 14:10:29 +00002263Copies the Unicode Object contents into the \ctype{whcar_t} buffer
2264\var{w}. At most \var{size} \ctype{whcar_t} characters are copied.
2265Returns the number of \ctype{whcar_t} characters copied or -1 in case
2266of an error.
2267\end{cfuncdesc}
2268
2269
2270\subsubsection{Builtin Codecs \label{builtinCodecs}}
2271
2272Python provides a set of builtin codecs which are written in C
2273for speed. All of these codecs are directly usable via the
2274following functions.
2275
2276Many of the following APIs take two arguments encoding and
2277errors. These parameters encoding and errors have the same semantics
2278as the ones of the builtin unicode() Unicode object constructor.
2279
2280Setting encoding to NULL causes the default encoding to be used which
2281is UTF-8.
2282
2283Error handling is set by errors which may also be set to NULL meaning
2284to use the default handling defined for the codec. Default error
2285handling for all builtin codecs is ``strict'' (ValueErrors are raised).
2286
2287The codecs all use a similar interface. Only deviation from the
2288following generic ones are documented for simplicity.
2289
2290% --- Generic Codecs -----------------------------------------------------
2291
2292These are the generic codec APIs:
2293
2294\begin{cfuncdesc}{PyObject*}{PyUnicode_Decode}{const char *s,
2295 int size,
2296 const char *encoding,
2297 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002298Create a Unicode object by decoding \var{size} bytes of the encoded
2299string \var{s}. \var{encoding} and \var{errors} have the same meaning
2300as the parameters of the same name in the unicode() builtin
2301function. The codec to be used is looked up using the Python codec
2302registry. Returns \NULL{} in case an exception was raised by the
2303codec.
2304\end{cfuncdesc}
2305
2306\begin{cfuncdesc}{PyObject*}{PyUnicode_Encode}{const Py_UNICODE *s,
2307 int size,
2308 const char *encoding,
2309 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002310Encodes the \ctype{Py_UNICODE} buffer of the given size and returns a
2311Python string object. \var{encoding} and \var{errors} have the same
2312meaning as the parameters of the same name in the Unicode .encode()
2313method. The codec to be used is looked up using the Python codec
2314registry. Returns \NULL{} in case an exception was raised by the
2315codec.
2316\end{cfuncdesc}
2317
2318\begin{cfuncdesc}{PyObject*}{PyUnicode_AsEncodedString}{PyObject *unicode,
2319 const char *encoding,
2320 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002321Encodes a Unicode object and returns the result as Python string
2322object. \var{encoding} and \var{errors} have the same meaning as the
2323parameters of the same name in the Unicode .encode() method. The codec
2324to be used is looked up using the Python codec registry. Returns
2325\NULL{} in case an exception was raised by the codec.
2326\end{cfuncdesc}
2327
2328% --- UTF-8 Codecs -------------------------------------------------------
2329
2330These are the UTF-8 codec APIs:
2331
2332\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUTF8}{const char *s,
2333 int size,
2334 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002335Creates a Unicode object by decoding \var{size} bytes of the UTF-8
2336encoded string \var{s}. Returns \NULL{} in case an exception was
2337raised by the codec.
2338\end{cfuncdesc}
2339
2340\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUTF8}{const Py_UNICODE *s,
2341 int size,
2342 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002343Encodes the \ctype{Py_UNICODE} buffer of the given size using UTF-8
2344and returns a Python string object. Returns \NULL{} in case an
2345exception was raised by the codec.
2346\end{cfuncdesc}
2347
2348\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUTF8String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002349Encodes a Unicode objects using UTF-8 and returns the result as Python
2350string object. Error handling is ``strict''. Returns
2351\NULL{} in case an exception was raised by the codec.
2352\end{cfuncdesc}
2353
2354% --- UTF-16 Codecs ------------------------------------------------------ */
2355
2356These are the UTF-16 codec APIs:
2357
2358\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUTF16}{const char *s,
2359 int size,
2360 const char *errors,
2361 int *byteorder}
Fred Drakea4cd2612000-04-06 14:10:29 +00002362Decodes \var{length} bytes from a UTF-16 encoded buffer string and
2363returns the corresponding Unicode object.
2364
2365\var{errors} (if non-NULL) defines the error handling. It defaults
2366to ``strict''.
2367
2368If \var{byteorder} is non-\NULL{}, the decoder starts decoding using
2369the given byte order:
2370
2371\begin{verbatim}
2372 *byteorder == -1: little endian
2373 *byteorder == 0: native order
2374 *byteorder == 1: big endian
2375\end{verbatim}
2376
2377and then switches according to all byte order marks (BOM) it finds in
2378the input data. BOM marks are not copied into the resulting Unicode
2379string. After completion, \var{*byteorder} is set to the current byte
2380order at the end of input data.
2381
2382If \var{byteorder} is \NULL{}, the codec starts in native order mode.
2383
2384Returns \NULL{} in case an exception was raised by the codec.
2385\end{cfuncdesc}
2386
2387\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUTF16}{const Py_UNICODE *s,
2388 int size,
2389 const char *errors,
2390 int byteorder}
Fred Drakea4cd2612000-04-06 14:10:29 +00002391Returns a Python string object holding the UTF-16 encoded value of the
2392Unicode data in \var{s}.
2393
Fred Drakea8455ab2000-06-16 19:58:42 +00002394If \var{byteorder} is not \code{0}, output is written according to the
Fred Drakea4cd2612000-04-06 14:10:29 +00002395following byte order:
2396
2397\begin{verbatim}
2398 byteorder == -1: little endian
2399 byteorder == 0: native byte order (writes a BOM mark)
2400 byteorder == 1: big endian
2401\end{verbatim}
2402
Fred Drakea8455ab2000-06-16 19:58:42 +00002403If byteorder is \code{0}, the output string will always start with the
Fred Drakea4cd2612000-04-06 14:10:29 +00002404Unicode BOM mark (U+FEFF). In the other two modes, no BOM mark is
2405prepended.
2406
2407Note that \ctype{Py_UNICODE} data is being interpreted as UTF-16
2408reduced to UCS-2. This trick makes it possible to add full UTF-16
2409capabilities at a later point without comprimising the APIs.
2410
2411Returns \NULL{} in case an exception was raised by the codec.
2412\end{cfuncdesc}
2413
2414\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUTF16String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002415Returns a Python string using the UTF-16 encoding in native byte
2416order. The string always starts with a BOM mark. Error handling is
2417``strict''. Returns \NULL{} in case an exception was raised by the
2418codec.
2419\end{cfuncdesc}
2420
2421% --- Unicode-Escape Codecs ----------------------------------------------
2422
2423These are the ``Unicode Esacpe'' codec APIs:
2424
2425\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeUnicodeEscape}{const char *s,
2426 int size,
2427 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002428Creates a Unicode object by decoding \var{size} bytes of the Unicode-Esacpe
2429encoded string \var{s}. Returns \NULL{} in case an exception was
2430raised by the codec.
2431\end{cfuncdesc}
2432
2433\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeUnicodeEscape}{const Py_UNICODE *s,
2434 int size,
2435 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002436Encodes the \ctype{Py_UNICODE} buffer of the given size using Unicode-Escape
2437and returns a Python string object. Returns \NULL{} in case an
2438exception was raised by the codec.
2439\end{cfuncdesc}
2440
2441\begin{cfuncdesc}{PyObject*}{PyUnicode_AsUnicodeEscapeString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002442Encodes a Unicode objects using Unicode-Escape and returns the result
2443as Python string object. Error handling is ``strict''. Returns
2444\NULL{} in case an exception was raised by the codec.
2445\end{cfuncdesc}
2446
2447% --- Raw-Unicode-Escape Codecs ------------------------------------------
2448
2449These are the ``Raw Unicode Esacpe'' codec APIs:
2450
2451\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeRawUnicodeEscape}{const char *s,
2452 int size,
2453 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002454Creates a Unicode object by decoding \var{size} bytes of the Raw-Unicode-Esacpe
2455encoded string \var{s}. Returns \NULL{} in case an exception was
2456raised by the codec.
2457\end{cfuncdesc}
2458
2459\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeRawUnicodeEscape}{const Py_UNICODE *s,
2460 int size,
2461 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002462Encodes the \ctype{Py_UNICODE} buffer of the given size using Raw-Unicode-Escape
2463and returns a Python string object. Returns \NULL{} in case an
2464exception was raised by the codec.
2465\end{cfuncdesc}
2466
2467\begin{cfuncdesc}{PyObject*}{PyUnicode_AsRawUnicodeEscapeString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002468Encodes a Unicode objects using Raw-Unicode-Escape and returns the result
2469as Python string object. Error handling is ``strict''. Returns
2470\NULL{} in case an exception was raised by the codec.
2471\end{cfuncdesc}
2472
2473% --- Latin-1 Codecs -----------------------------------------------------
2474
2475These are the Latin-1 codec APIs:
2476
2477Latin-1 corresponds to the first 256 Unicode ordinals and only these
2478are accepted by the codecs during encoding.
2479
2480\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeLatin1}{const char *s,
Fred Drake1d158692000-06-18 05:21:21 +00002481 int size,
2482 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002483Creates a Unicode object by decoding \var{size} bytes of the Latin-1
2484encoded string \var{s}. Returns \NULL{} in case an exception was
2485raised by the codec.
2486\end{cfuncdesc}
2487
2488\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeLatin1}{const Py_UNICODE *s,
Fred Drake1d158692000-06-18 05:21:21 +00002489 int size,
2490 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002491Encodes the \ctype{Py_UNICODE} buffer of the given size using Latin-1
2492and returns a Python string object. Returns \NULL{} in case an
2493exception was raised by the codec.
2494\end{cfuncdesc}
2495
2496\begin{cfuncdesc}{PyObject*}{PyUnicode_AsLatin1String}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002497Encodes a Unicode objects using Latin-1 and returns the result as
2498Python string object. Error handling is ``strict''. Returns
2499\NULL{} in case an exception was raised by the codec.
2500\end{cfuncdesc}
2501
2502% --- ASCII Codecs -------------------------------------------------------
2503
Fred Drake1d158692000-06-18 05:21:21 +00002504These are the \ASCII{} codec APIs. Only 7-bit \ASCII{} data is
2505accepted. All other codes generate errors.
Fred Drakea4cd2612000-04-06 14:10:29 +00002506
2507\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeASCII}{const char *s,
Fred Drake1d158692000-06-18 05:21:21 +00002508 int size,
2509 const char *errors}
2510Creates a Unicode object by decoding \var{size} bytes of the
2511\ASCII{} encoded string \var{s}. Returns \NULL{} in case an exception
2512was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00002513\end{cfuncdesc}
2514
2515\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeASCII}{const Py_UNICODE *s,
Fred Drake1d158692000-06-18 05:21:21 +00002516 int size,
2517 const char *errors}
2518Encodes the \ctype{Py_UNICODE} buffer of the given size using
2519\ASCII{} and returns a Python string object. Returns \NULL{} in case
2520an exception was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00002521\end{cfuncdesc}
2522
2523\begin{cfuncdesc}{PyObject*}{PyUnicode_AsASCIIString}{PyObject *unicode}
Fred Drake1d158692000-06-18 05:21:21 +00002524Encodes a Unicode objects using \ASCII{} and returns the result as Python
Fred Drakea4cd2612000-04-06 14:10:29 +00002525string object. Error handling is ``strict''. Returns
2526\NULL{} in case an exception was raised by the codec.
2527\end{cfuncdesc}
2528
2529% --- Character Map Codecs -----------------------------------------------
2530
2531These are the mapping codec APIs:
2532
2533This codec is special in that it can be used to implement many
2534different codecs (and this is in fact what was done to obtain most of
2535the standard codecs included in the \module{encodings} package). The
2536codec uses mapping to encode and decode characters.
2537
2538Decoding mappings must map single string characters to single Unicode
2539characters, integers (which are then interpreted as Unicode ordinals)
2540or None (meaning "undefined mapping" and causing an error).
2541
2542Encoding mappings must map single Unicode characters to single string
2543characters, integers (which are then interpreted as Latin-1 ordinals)
2544or None (meaning "undefined mapping" and causing an error).
2545
2546The mapping objects provided must only support the __getitem__ mapping
2547interface.
2548
2549If a character lookup fails with a LookupError, the character is
2550copied as-is meaning that its ordinal value will be interpreted as
2551Unicode or Latin-1 ordinal resp. Because of this, mappings only need
2552to contain those mappings which map characters to different code
2553points.
2554
2555\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeCharmap}{const char *s,
2556 int size,
2557 PyObject *mapping,
2558 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002559Creates a Unicode object by decoding \var{size} bytes of the encoded
2560string \var{s} using the given \var{mapping} object. Returns \NULL{}
2561in case an exception was raised by the codec.
2562\end{cfuncdesc}
2563
2564\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeCharmap}{const Py_UNICODE *s,
2565 int size,
2566 PyObject *mapping,
2567 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002568Encodes the \ctype{Py_UNICODE} buffer of the given size using the
2569given \var{mapping} object and returns a Python string object.
2570Returns \NULL{} in case an exception was raised by the codec.
2571\end{cfuncdesc}
2572
2573\begin{cfuncdesc}{PyObject*}{PyUnicode_AsCharmapString}{PyObject *unicode,
2574 PyObject *mapping}
Fred Drakea4cd2612000-04-06 14:10:29 +00002575Encodes a Unicode objects using the given \var{mapping} object and
2576returns the result as Python string object. Error handling is
2577``strict''. Returns \NULL{} in case an exception was raised by the
2578codec.
2579\end{cfuncdesc}
2580
2581The following codec API is special in that maps Unicode to Unicode.
2582
2583\begin{cfuncdesc}{PyObject*}{PyUnicode_TranslateCharmap}{const Py_UNICODE *s,
2584 int size,
2585 PyObject *table,
2586 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002587Translates a \ctype{Py_UNICODE} buffer of the given length by applying
2588a character mapping \var{table} to it and returns the resulting
Fred Drake1d158692000-06-18 05:21:21 +00002589Unicode object. Returns \NULL{} when an exception was raised by the
2590codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00002591
2592The \var{mapping} table must map Unicode ordinal integers to Unicode
2593ordinal integers or None (causing deletion of the character).
2594
2595Mapping tables must only provide the __getitem__ interface,
2596e.g. dictionaries or sequences. Unmapped character ordinals (ones
2597which cause a LookupError) are left untouched and are copied as-is.
Fred Drakea4cd2612000-04-06 14:10:29 +00002598\end{cfuncdesc}
2599
2600% --- MBCS codecs for Windows --------------------------------------------
2601
Fred Drake1d158692000-06-18 05:21:21 +00002602These are the MBCS codec APIs. They are currently only available on
Fred Drakea4cd2612000-04-06 14:10:29 +00002603Windows and use the Win32 MBCS converters to implement the
Fred Drake1d158692000-06-18 05:21:21 +00002604conversions. Note that MBCS (or DBCS) is a class of encodings, not
2605just one. The target encoding is defined by the user settings on the
2606machine running the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00002607
2608\begin{cfuncdesc}{PyObject*}{PyUnicode_DecodeMBCS}{const char *s,
2609 int size,
2610 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002611Creates a Unicode object by decoding \var{size} bytes of the MBCS
Fred Drake1d158692000-06-18 05:21:21 +00002612encoded string \var{s}. Returns \NULL{} in case an exception was
Fred Drakea4cd2612000-04-06 14:10:29 +00002613raised by the codec.
2614\end{cfuncdesc}
2615
2616\begin{cfuncdesc}{PyObject*}{PyUnicode_EncodeMBCS}{const Py_UNICODE *s,
2617 int size,
2618 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002619Encodes the \ctype{Py_UNICODE} buffer of the given size using MBCS
2620and returns a Python string object. Returns \NULL{} in case an
2621exception was raised by the codec.
2622\end{cfuncdesc}
2623
2624\begin{cfuncdesc}{PyObject*}{PyUnicode_AsMBCSString}{PyObject *unicode}
Fred Drakea4cd2612000-04-06 14:10:29 +00002625Encodes a Unicode objects using MBCS and returns the result as Python
Fred Drake1d158692000-06-18 05:21:21 +00002626string object. Error handling is ``strict''. Returns \NULL{} in case
2627an exception was raised by the codec.
Fred Drakea4cd2612000-04-06 14:10:29 +00002628\end{cfuncdesc}
2629
2630% --- Methods & Slots ----------------------------------------------------
2631
2632\subsubsection{Methods and Slot Functions \label{unicodeMethodsAndSlots}}
2633
2634The following APIs are capable of handling Unicode objects and strings
2635on input (we refer to them as strings in the descriptions) and return
2636Unicode objects or integers as apporpriate.
2637
2638They all return \NULL{} or -1 in case an exception occurrs.
2639
2640\begin{cfuncdesc}{PyObject*}{PyUnicode_Concat}{PyObject *left,
2641 PyObject *right}
Fred Drakea4cd2612000-04-06 14:10:29 +00002642Concat two strings giving a new Unicode string.
2643\end{cfuncdesc}
2644
2645\begin{cfuncdesc}{PyObject*}{PyUnicode_Split}{PyObject *s,
2646 PyObject *sep,
2647 int maxsplit}
Fred Drakea4cd2612000-04-06 14:10:29 +00002648Split a string giving a list of Unicode strings.
2649
2650If sep is NULL, splitting will be done at all whitespace
2651substrings. Otherwise, splits occur at the given separator.
2652
2653At most maxsplit splits will be done. If negative, no limit is set.
2654
2655Separators are not included in the resulting list.
2656\end{cfuncdesc}
2657
2658\begin{cfuncdesc}{PyObject*}{PyUnicode_Splitlines}{PyObject *s,
2659 int maxsplit}
Fred Drake1d158692000-06-18 05:21:21 +00002660Split a Unicode string at line breaks, returning a list of Unicode
2661strings. CRLF is considered to be one line break. The Line break
2662characters are not included in the resulting strings.
Fred Drakea4cd2612000-04-06 14:10:29 +00002663\end{cfuncdesc}
2664
2665\begin{cfuncdesc}{PyObject*}{PyUnicode_Translate}{PyObject *str,
2666 PyObject *table,
2667 const char *errors}
Fred Drakea4cd2612000-04-06 14:10:29 +00002668Translate a string by applying a character mapping table to it and
2669return the resulting Unicode object.
2670
2671The mapping table must map Unicode ordinal integers to Unicode ordinal
2672integers or None (causing deletion of the character).
2673
2674Mapping tables must only provide the __getitem__ interface,
2675e.g. dictionaries or sequences. Unmapped character ordinals (ones
2676which cause a LookupError) are left untouched and are copied as-is.
2677
2678\var{errors} has the usual meaning for codecs. It may be \NULL{}
2679which indicates to use the default error handling.
Fred Drakea4cd2612000-04-06 14:10:29 +00002680\end{cfuncdesc}
2681
2682\begin{cfuncdesc}{PyObject*}{PyUnicode_Join}{PyObject *separator,
2683 PyObject *seq}
Fred Drakea4cd2612000-04-06 14:10:29 +00002684Join a sequence of strings using the given separator and return
2685the resulting Unicode string.
2686\end{cfuncdesc}
2687
2688\begin{cfuncdesc}{PyObject*}{PyUnicode_Tailmatch}{PyObject *str,
2689 PyObject *substr,
2690 int start,
2691 int end,
2692 int direction}
Fred Drakea4cd2612000-04-06 14:10:29 +00002693Return 1 if \var{substr} matches \var{str}[\var{start}:\var{end}] at
2694the given tail end (\var{direction} == -1 means to do a prefix match,
2695\var{direction} == 1 a suffix match), 0 otherwise.
2696\end{cfuncdesc}
2697
2698\begin{cfuncdesc}{PyObject*}{PyUnicode_Find}{PyObject *str,
2699 PyObject *substr,
2700 int start,
2701 int end,
2702 int direction}
Fred Drakea4cd2612000-04-06 14:10:29 +00002703Return the first position of \var{substr} in
2704\var{str}[\var{start}:\var{end}] using the given \var{direction}
2705(\var{direction} == 1 means to do a forward search,
2706\var{direction} == -1 a backward search), 0 otherwise.
2707\end{cfuncdesc}
2708
2709\begin{cfuncdesc}{PyObject*}{PyUnicode_Count}{PyObject *str,
2710 PyObject *substr,
2711 int start,
2712 int end}
Fred Drakea4cd2612000-04-06 14:10:29 +00002713Count the number of occurrences of \var{substr} in
2714\var{str}[\var{start}:\var{end}]
2715\end{cfuncdesc}
2716
2717\begin{cfuncdesc}{PyObject*}{PyUnicode_Replace}{PyObject *str,
2718 PyObject *substr,
2719 PyObject *replstr,
2720 int maxcount}
Fred Drakea4cd2612000-04-06 14:10:29 +00002721Replace at most \var{maxcount} occurrences of \var{substr} in
2722\var{str} with \var{replstr} and return the resulting Unicode object.
2723\var{maxcount} == -1 means: replace all occurrences.
2724\end{cfuncdesc}
2725
Fred Drake1d158692000-06-18 05:21:21 +00002726\begin{cfuncdesc}{int}{PyUnicode_Compare}{PyObject *left, PyObject *right}
Fred Drakea4cd2612000-04-06 14:10:29 +00002727Compare two strings and return -1, 0, 1 for less than, equal,
2728greater than resp.
2729\end{cfuncdesc}
2730
2731\begin{cfuncdesc}{PyObject*}{PyUnicode_Format}{PyObject *format,
2732 PyObject *args}
Fred Drake1d158692000-06-18 05:21:21 +00002733Returns a new string object from \var{format} and \var{args}; this is
2734analogous to \code{\var{format} \%\ \var{args}}. The
2735\var{args} argument must be a tuple.
Fred Drakea4cd2612000-04-06 14:10:29 +00002736\end{cfuncdesc}
2737
2738\begin{cfuncdesc}{int}{PyUnicode_Contains}{PyObject *container,
2739 PyObject *element}
Fred Drakea4cd2612000-04-06 14:10:29 +00002740Checks whether \var{element} is contained in \var{container} and
Fred Drake1d158692000-06-18 05:21:21 +00002741returns true or false accordingly.
Fred Drakea4cd2612000-04-06 14:10:29 +00002742
Fred Drake1d158692000-06-18 05:21:21 +00002743\var{element} has to coerce to a one element Unicode string. \code{-1} is
Fred Drakea4cd2612000-04-06 14:10:29 +00002744returned in case of an error.
2745\end{cfuncdesc}
2746
2747
Fred Drake58c5a2a1999-08-04 13:13:24 +00002748\subsection{Buffer Objects \label{bufferObjects}}
Fred Drake659ebfa2000-04-03 15:42:13 +00002749\sectionauthor{Greg Stein}{gstein@lyra.org}
Fred Drake58c5a2a1999-08-04 13:13:24 +00002750
Fred Drake659ebfa2000-04-03 15:42:13 +00002751\obindex{buffer}
2752Python objects implemented in C can export a group of functions called
2753the ``buffer\index{buffer interface} interface.'' These functions can
2754be used by an object to expose its data in a raw, byte-oriented
2755format. Clients of the object can use the buffer interface to access
2756the object data directly, without needing to copy it first.
2757
2758Two examples of objects that support
2759the buffer interface are strings and arrays. The string object exposes
2760the character contents in the buffer interface's byte-oriented
2761form. An array can also expose its contents, but it should be noted
2762that array elements may be multi-byte values.
2763
2764An example user of the buffer interface is the file object's
2765\method{write()} method. Any object that can export a series of bytes
2766through the buffer interface can be written to a file. There are a
2767number of format codes to \cfunction{PyArgs_ParseTuple()} that operate
2768against an object's buffer interface, returning data from the target
2769object.
2770
2771More information on the buffer interface is provided in the section
2772``Buffer Object Structures'' (section \ref{buffer-structs}), under
2773the description for \ctype{PyBufferProcs}\ttindex{PyBufferProcs}.
2774
2775A ``buffer object'' is defined in the \file{bufferobject.h} header
2776(included by \file{Python.h}). These objects look very similar to
2777string objects at the Python programming level: they support slicing,
2778indexing, concatenation, and some other standard string
2779operations. However, their data can come from one of two sources: from
2780a block of memory, or from another object which exports the buffer
2781interface.
2782
2783Buffer objects are useful as a way to expose the data from another
2784object's buffer interface to the Python programmer. They can also be
2785used as a zero-copy slicing mechanism. Using their ability to
2786reference a block of memory, it is possible to expose any data to the
2787Python programmer quite easily. The memory could be a large, constant
2788array in a C extension, it could be a raw block of memory for
2789manipulation before passing to an operating system library, or it
2790could be used to pass around structured data in its native, in-memory
2791format.
2792
2793\begin{ctypedesc}{PyBufferObject}
2794This subtype of \ctype{PyObject} represents a buffer object.
2795\end{ctypedesc}
Fred Drake58c5a2a1999-08-04 13:13:24 +00002796
2797\begin{cvardesc}{PyTypeObject}{PyBuffer_Type}
2798The instance of \ctype{PyTypeObject} which represents the Python
Fred Drake659ebfa2000-04-03 15:42:13 +00002799buffer type; it is the same object as \code{types.BufferType} in the
2800Python layer.\withsubitem{(in module types)}{\ttindex{BufferType}}.
Fred Drake58c5a2a1999-08-04 13:13:24 +00002801\end{cvardesc}
2802
2803\begin{cvardesc}{int}{Py_END_OF_BUFFER}
Fred Drake659ebfa2000-04-03 15:42:13 +00002804This constant may be passed as the \var{size} parameter to
2805\cfunction{PyBuffer_FromObject()} or
2806\cfunction{PyBuffer_FromReadWriteObject()}. It indicates that the new
2807\ctype{PyBufferObject} should refer to \var{base} object from the
2808specified \var{offset} to the end of its exported buffer. Using this
2809enables the caller to avoid querying the \var{base} object for its
2810length.
Fred Drake58c5a2a1999-08-04 13:13:24 +00002811\end{cvardesc}
2812
2813\begin{cfuncdesc}{int}{PyBuffer_Check}{PyObject *p}
2814Return true if the argument has type \cdata{PyBuffer_Type}.
2815\end{cfuncdesc}
2816
2817\begin{cfuncdesc}{PyObject*}{PyBuffer_FromObject}{PyObject *base,
2818 int offset, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00002819Return a new read-only buffer object. This raises
2820\exception{TypeError} if \var{base} doesn't support the read-only
2821buffer protocol or doesn't provide exactly one buffer segment, or it
2822raises \exception{ValueError} if \var{offset} is less than zero. The
2823buffer will hold a reference to the \var{base} object, and the
2824buffer's contents will refer to the \var{base} object's buffer
2825interface, starting as position \var{offset} and extending for
2826\var{size} bytes. If \var{size} is \constant{Py_END_OF_BUFFER}, then
2827the new buffer's contents extend to the length of the
2828\var{base} object's exported buffer data.
Fred Drake58c5a2a1999-08-04 13:13:24 +00002829\end{cfuncdesc}
2830
2831\begin{cfuncdesc}{PyObject*}{PyBuffer_FromReadWriteObject}{PyObject *base,
2832 int offset,
2833 int size}
2834Return a new writable buffer object. Parameters and exceptions are
2835similar to those for \cfunction{PyBuffer_FromObject()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00002836If the \var{base} object does not export the writeable buffer
2837protocol, then \exception{TypeError} is raised.
Fred Drake58c5a2a1999-08-04 13:13:24 +00002838\end{cfuncdesc}
2839
2840\begin{cfuncdesc}{PyObject*}{PyBuffer_FromMemory}{void *ptr, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00002841Return a new read-only buffer object that reads from a specified
2842location in memory, with a specified size.
Fred Drake58c5a2a1999-08-04 13:13:24 +00002843The caller is responsible for ensuring that the memory buffer, passed
2844in as \var{ptr}, is not deallocated while the returned buffer object
2845exists. Raises \exception{ValueError} if \var{size} is less than
Fred Drake659ebfa2000-04-03 15:42:13 +00002846zero. Note that \constant{Py_END_OF_BUFFER} may \emph{not} be passed
2847for the \var{size} parameter; \exception{ValueError} will be raised in
2848that case.
Fred Drake58c5a2a1999-08-04 13:13:24 +00002849\end{cfuncdesc}
2850
2851\begin{cfuncdesc}{PyObject*}{PyBuffer_FromReadWriteMemory}{void *ptr, int size}
Fred Drake659ebfa2000-04-03 15:42:13 +00002852Similar to \cfunction{PyBuffer_FromMemory()}, but the returned buffer
2853is writable.
Fred Drake58c5a2a1999-08-04 13:13:24 +00002854\end{cfuncdesc}
2855
2856\begin{cfuncdesc}{PyObject*}{PyBuffer_New}{int size}
2857Returns a new writable buffer object that maintains its own memory
Fred Drake659ebfa2000-04-03 15:42:13 +00002858buffer of \var{size} bytes. \exception{ValueError} is returned if
2859\var{size} is not zero or positive.
Fred Drake58c5a2a1999-08-04 13:13:24 +00002860\end{cfuncdesc}
2861
Guido van Rossum44475131998-04-21 15:30:01 +00002862
Fred Drakeefd146c1999-02-15 15:30:45 +00002863\subsection{Tuple Objects \label{tupleObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002864
Fred Drake659ebfa2000-04-03 15:42:13 +00002865\obindex{tuple}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002866\begin{ctypedesc}{PyTupleObject}
Fred Drakef8830d11998-04-23 14:06:01 +00002867This subtype of \ctype{PyObject} represents a Python tuple object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002868\end{ctypedesc}
2869
2870\begin{cvardesc}{PyTypeObject}{PyTuple_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00002871This instance of \ctype{PyTypeObject} represents the Python tuple
2872type; it is the same object as \code{types.TupleType} in the Python
2873layer.\withsubitem{(in module types)}{\ttindex{TupleType}}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002874\end{cvardesc}
2875
2876\begin{cfuncdesc}{int}{PyTuple_Check}{PyObject *p}
2877Return true if the argument is a tuple object.
2878\end{cfuncdesc}
2879
Fred Drake659ebfa2000-04-03 15:42:13 +00002880\begin{cfuncdesc}{PyObject*}{PyTuple_New}{int len}
2881Return a new tuple object of size \var{len}, or \NULL{} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002882\end{cfuncdesc}
2883
2884\begin{cfuncdesc}{int}{PyTuple_Size}{PyTupleObject *p}
Fred Drakee058b4f1998-02-16 06:15:35 +00002885Takes a pointer to a tuple object, and returns the size
Fred Drakee5bf8b21998-02-12 21:22:28 +00002886of that tuple.
2887\end{cfuncdesc}
2888
Fred Drakec6fa34e1998-04-02 06:47:24 +00002889\begin{cfuncdesc}{PyObject*}{PyTuple_GetItem}{PyTupleObject *p, int pos}
Fred Drakee058b4f1998-02-16 06:15:35 +00002890Returns the object at position \var{pos} in the tuple pointed
2891to by \var{p}. If \var{pos} is out of bounds, returns \NULL{} and
Fred Drake659ebfa2000-04-03 15:42:13 +00002892sets an \exception{IndexError} exception.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002893\end{cfuncdesc}
2894
Fred Drakec6fa34e1998-04-02 06:47:24 +00002895\begin{cfuncdesc}{PyObject*}{PyTuple_GET_ITEM}{PyTupleObject *p, int pos}
Fred Drakee058b4f1998-02-16 06:15:35 +00002896Does the same, but does no checking of its arguments.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002897\end{cfuncdesc}
2898
Fred Drakec6fa34e1998-04-02 06:47:24 +00002899\begin{cfuncdesc}{PyObject*}{PyTuple_GetSlice}{PyTupleObject *p,
Fred Drakee5bf8b21998-02-12 21:22:28 +00002900 int low,
2901 int high}
Fred Drakee058b4f1998-02-16 06:15:35 +00002902Takes a slice of the tuple pointed to by \var{p} from
2903\var{low} to \var{high} and returns it as a new tuple.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002904\end{cfuncdesc}
2905
Fred Drake659ebfa2000-04-03 15:42:13 +00002906\begin{cfuncdesc}{int}{PyTuple_SetItem}{PyObject *p,
2907 int pos, PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002908Inserts a reference to object \var{o} at position \var{pos} of
2909the tuple pointed to by \var{p}. It returns \code{0} on success.
Fred Drake659ebfa2000-04-03 15:42:13 +00002910\strong{Note:} This function ``steals'' a reference to \var{o}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002911\end{cfuncdesc}
2912
Fred Drake659ebfa2000-04-03 15:42:13 +00002913\begin{cfuncdesc}{void}{PyTuple_SET_ITEM}{PyObject *p,
2914 int pos, PyObject *o}
Fred Drakee058b4f1998-02-16 06:15:35 +00002915Does the same, but does no error checking, and
Fred Drakee5bf8b21998-02-12 21:22:28 +00002916should \emph{only} be used to fill in brand new tuples.
Fred Drake659ebfa2000-04-03 15:42:13 +00002917\strong{Note:} This function ``steals'' a reference to \var{o}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002918\end{cfuncdesc}
2919
Fred Drakec6fa34e1998-04-02 06:47:24 +00002920\begin{cfuncdesc}{int}{_PyTuple_Resize}{PyTupleObject *p,
Fred Drake659ebfa2000-04-03 15:42:13 +00002921 int newsize, int last_is_sticky}
2922Can be used to resize a tuple. \var{newsize} will be the new length
2923of the tuple. Because tuples are \emph{supposed} to be immutable,
2924this should only be used if there is only one reference to the object.
2925Do \emph{not} use this if the tuple may already be known to some other
2926part of the code. \var{last_is_sticky} is a flag --- if true, the
2927tuple will grow or shrink at the front, otherwise it will grow or
2928shrink at the end. Think of this as destroying the old tuple and
2929creating a new one, only more efficiently. Returns \code{0} on
2930success and \code{-1} on failure (in which case a
2931\exception{MemoryError} or \exception{SystemError} will be raised).
Fred Drakee5bf8b21998-02-12 21:22:28 +00002932\end{cfuncdesc}
2933
2934
Fred Drakeefd146c1999-02-15 15:30:45 +00002935\subsection{List Objects \label{listObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002936
Fred Drake659ebfa2000-04-03 15:42:13 +00002937\obindex{list}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002938\begin{ctypedesc}{PyListObject}
Fred Drakef8830d11998-04-23 14:06:01 +00002939This subtype of \ctype{PyObject} represents a Python list object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002940\end{ctypedesc}
2941
2942\begin{cvardesc}{PyTypeObject}{PyList_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00002943This instance of \ctype{PyTypeObject} represents the Python list
2944type. This is the same object as \code{types.ListType}.
2945\withsubitem{(in module types)}{\ttindex{ListType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00002946\end{cvardesc}
2947
2948\begin{cfuncdesc}{int}{PyList_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00002949Returns true if its argument is a \ctype{PyListObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002950\end{cfuncdesc}
2951
Fred Drake659ebfa2000-04-03 15:42:13 +00002952\begin{cfuncdesc}{PyObject*}{PyList_New}{int len}
2953Returns a new list of length \var{len} on success, or \NULL{} on
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002954failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002955\end{cfuncdesc}
2956
Fred Drakec6fa34e1998-04-02 06:47:24 +00002957\begin{cfuncdesc}{int}{PyList_Size}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00002958Returns the length of the list object in \var{list}; this is
2959equivalent to \samp{len(\var{list})} on a list object.
2960\bifuncindex{len}
2961\end{cfuncdesc}
2962
2963\begin{cfuncdesc}{int}{PyList_GET_SIZE}{PyObject *list}
2964Macro form of \cfunction{PyList_GetSize()} without error checking.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002965\end{cfuncdesc}
2966
Fred Drakec6fa34e1998-04-02 06:47:24 +00002967\begin{cfuncdesc}{PyObject*}{PyList_GetItem}{PyObject *list, int index}
Guido van Rossum44475131998-04-21 15:30:01 +00002968Returns the object at position \var{pos} in the list pointed
2969to by \var{p}. If \var{pos} is out of bounds, returns \NULL{} and
Fred Drake659ebfa2000-04-03 15:42:13 +00002970sets an \exception{IndexError} exception.
2971\end{cfuncdesc}
2972
2973\begin{cfuncdesc}{PyObject*}{PyList_GET_ITEM}{PyObject *list, int i}
2974Macro form of \cfunction{PyList_GetItem()} without error checking.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002975\end{cfuncdesc}
2976
Fred Drakec6fa34e1998-04-02 06:47:24 +00002977\begin{cfuncdesc}{int}{PyList_SetItem}{PyObject *list, int index,
2978 PyObject *item}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00002979Sets the item at index \var{index} in list to \var{item}.
Fred Drake659ebfa2000-04-03 15:42:13 +00002980\strong{Note:} This function ``steals'' a reference to \var{item}.
2981\end{cfuncdesc}
2982
2983\begin{cfuncdesc}{PyObject*}{PyList_SET_ITEM}{PyObject *list, int i,
2984 PyObject *o}
2985Macro form of \cfunction{PyList_SetItem()} without error checking.
2986\strong{Note:} This function ``steals'' a reference to \var{item}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002987\end{cfuncdesc}
2988
Fred Drakec6fa34e1998-04-02 06:47:24 +00002989\begin{cfuncdesc}{int}{PyList_Insert}{PyObject *list, int index,
Guido van Rossum44475131998-04-21 15:30:01 +00002990 PyObject *item}
2991Inserts the item \var{item} into list \var{list} in front of index
Fred Drake659ebfa2000-04-03 15:42:13 +00002992\var{index}. Returns \code{0} if successful; returns \code{-1} and
2993raises an exception if unsuccessful. Analogous to
2994\code{\var{list}.insert(\var{index}, \var{item})}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00002995\end{cfuncdesc}
2996
Fred Drakec6fa34e1998-04-02 06:47:24 +00002997\begin{cfuncdesc}{int}{PyList_Append}{PyObject *list, PyObject *item}
Guido van Rossum44475131998-04-21 15:30:01 +00002998Appends the object \var{item} at the end of list \var{list}. Returns
Fred Drake659ebfa2000-04-03 15:42:13 +00002999\code{0} if successful; returns \code{-1} and sets an exception if
3000unsuccessful. Analogous to \code{\var{list}.append(\var{item})}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003001\end{cfuncdesc}
3002
Fred Drakec6fa34e1998-04-02 06:47:24 +00003003\begin{cfuncdesc}{PyObject*}{PyList_GetSlice}{PyObject *list,
3004 int low, int high}
Guido van Rossum3c4378b1998-04-14 20:21:10 +00003005Returns a list of the objects in \var{list} containing the objects
Guido van Rossum44475131998-04-21 15:30:01 +00003006\emph{between} \var{low} and \var{high}. Returns NULL and sets an
3007exception if unsuccessful.
Fred Drake659ebfa2000-04-03 15:42:13 +00003008Analogous to \code{\var{list}[\var{low}:\var{high}]}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003009\end{cfuncdesc}
3010
Fred Drakec6fa34e1998-04-02 06:47:24 +00003011\begin{cfuncdesc}{int}{PyList_SetSlice}{PyObject *list,
3012 int low, int high,
3013 PyObject *itemlist}
Fred Drake659ebfa2000-04-03 15:42:13 +00003014Sets the slice of \var{list} between \var{low} and \var{high} to the
3015contents of \var{itemlist}. Analogous to
3016\code{\var{list}[\var{low}:\var{high}] = \var{itemlist}}. Returns
3017\code{0} on success, \code{-1} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003018\end{cfuncdesc}
3019
Fred Drakec6fa34e1998-04-02 06:47:24 +00003020\begin{cfuncdesc}{int}{PyList_Sort}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003021Sorts the items of \var{list} in place. Returns \code{0} on success,
3022\code{-1} on failure. This is equivalent to
3023\samp{\var{list}.sort()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003024\end{cfuncdesc}
3025
Fred Drakec6fa34e1998-04-02 06:47:24 +00003026\begin{cfuncdesc}{int}{PyList_Reverse}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003027Reverses the items of \var{list} in place. Returns \code{0} on
3028success, \code{-1} on failure. This is the equivalent of
3029\samp{\var{list}.reverse()}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003030\end{cfuncdesc}
3031
Fred Drakec6fa34e1998-04-02 06:47:24 +00003032\begin{cfuncdesc}{PyObject*}{PyList_AsTuple}{PyObject *list}
Fred Drake659ebfa2000-04-03 15:42:13 +00003033Returns a new tuple object containing the contents of \var{list};
3034equivalent to \samp{tuple(\var{list})}.\bifuncindex{tuple}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003035\end{cfuncdesc}
3036
3037
Fred Drakeefd146c1999-02-15 15:30:45 +00003038\section{Mapping Objects \label{mapObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003039
Fred Drake659ebfa2000-04-03 15:42:13 +00003040\obindex{mapping}
3041
3042
Fred Drakeefd146c1999-02-15 15:30:45 +00003043\subsection{Dictionary Objects \label{dictObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003044
Fred Drake659ebfa2000-04-03 15:42:13 +00003045\obindex{dictionary}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003046\begin{ctypedesc}{PyDictObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003047This subtype of \ctype{PyObject} represents a Python dictionary object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003048\end{ctypedesc}
3049
3050\begin{cvardesc}{PyTypeObject}{PyDict_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003051This instance of \ctype{PyTypeObject} represents the Python dictionary
3052type. This is exposed to Python programs as \code{types.DictType} and
3053\code{types.DictionaryType}.
3054\withsubitem{(in module types)}{\ttindex{DictType}\ttindex{DictionaryType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003055\end{cvardesc}
3056
3057\begin{cfuncdesc}{int}{PyDict_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003058Returns true if its argument is a \ctype{PyDictObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003059\end{cfuncdesc}
3060
Fred Drakec6fa34e1998-04-02 06:47:24 +00003061\begin{cfuncdesc}{PyObject*}{PyDict_New}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00003062Returns a new empty dictionary, or \NULL{} on failure.
3063\end{cfuncdesc}
3064
3065\begin{cfuncdesc}{void}{PyDict_Clear}{PyObject *p}
3066Empties an existing dictionary of all key-value pairs.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003067\end{cfuncdesc}
3068
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00003069\begin{cfuncdesc}{PyObject*}{PyDict_Copy}{PyObject *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00003070Returns a new dictionary that contains the same key-value pairs as p.
3071Empties an existing dictionary of all key-value pairs.
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00003072\end{cfuncdesc}
3073
Fred Drake659ebfa2000-04-03 15:42:13 +00003074\begin{cfuncdesc}{int}{PyDict_SetItem}{PyObject *p, PyObject *key,
3075 PyObject *val}
3076Inserts \var{value} into the dictionary with a key of \var{key}.
3077\var{key} must be hashable; if it isn't, \exception{TypeError} will be
3078raised.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003079\end{cfuncdesc}
3080
3081\begin{cfuncdesc}{int}{PyDict_SetItemString}{PyDictObject *p,
3082 char *key,
3083 PyObject *val}
Fred Drakee058b4f1998-02-16 06:15:35 +00003084Inserts \var{value} into the dictionary using \var{key}
Fred Drake1d158692000-06-18 05:21:21 +00003085as a key. \var{key} should be a \ctype{char*}. The key object is
Fred Drakee058b4f1998-02-16 06:15:35 +00003086created using \code{PyString_FromString(\var{key})}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003087\ttindex{PyString_FromString()}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003088\end{cfuncdesc}
3089
Fred Drake659ebfa2000-04-03 15:42:13 +00003090\begin{cfuncdesc}{int}{PyDict_DelItem}{PyObject *p, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003091Removes the entry in dictionary \var{p} with key \var{key}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003092\var{key} must be hashable; if it isn't, \exception{TypeError} is
3093raised.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003094\end{cfuncdesc}
3095
Fred Drake659ebfa2000-04-03 15:42:13 +00003096\begin{cfuncdesc}{int}{PyDict_DelItemString}{PyObject *p, char *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003097Removes the entry in dictionary \var{p} which has a key
Fred Drake659ebfa2000-04-03 15:42:13 +00003098specified by the string \var{key}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003099\end{cfuncdesc}
3100
Fred Drake659ebfa2000-04-03 15:42:13 +00003101\begin{cfuncdesc}{PyObject*}{PyDict_GetItem}{PyObject *p, PyObject *key}
Fred Drakee058b4f1998-02-16 06:15:35 +00003102Returns the object from dictionary \var{p} which has a key
Guido van Rossum44475131998-04-21 15:30:01 +00003103\var{key}. Returns \NULL{} if the key \var{key} is not present, but
Fred Drake659ebfa2000-04-03 15:42:13 +00003104\emph{without} setting an exception.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003105\end{cfuncdesc}
3106
Fred Drake659ebfa2000-04-03 15:42:13 +00003107\begin{cfuncdesc}{PyObject*}{PyDict_GetItemString}{PyObject *p, char *key}
Fred Drakef8830d11998-04-23 14:06:01 +00003108This is the same as \cfunction{PyDict_GetItem()}, but \var{key} is
Fred Drake659ebfa2000-04-03 15:42:13 +00003109specified as a \ctype{char*}, rather than a \ctype{PyObject*}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003110\end{cfuncdesc}
3111
Fred Drake659ebfa2000-04-03 15:42:13 +00003112\begin{cfuncdesc}{PyObject*}{PyDict_Items}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003113Returns a \ctype{PyListObject} containing all the items
Guido van Rossum44475131998-04-21 15:30:01 +00003114from the dictionary, as in the dictinoary method \method{items()} (see
Fred Drakebe486461999-11-09 17:03:03 +00003115the \citetitle[../lib/lib.html]{Python Library Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003116\end{cfuncdesc}
3117
Fred Drake659ebfa2000-04-03 15:42:13 +00003118\begin{cfuncdesc}{PyObject*}{PyDict_Keys}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003119Returns a \ctype{PyListObject} containing all the keys
Guido van Rossum44475131998-04-21 15:30:01 +00003120from the dictionary, as in the dictionary method \method{keys()} (see the
Fred Drakebe486461999-11-09 17:03:03 +00003121\citetitle[../lib/lib.html]{Python Library Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003122\end{cfuncdesc}
3123
Fred Drake659ebfa2000-04-03 15:42:13 +00003124\begin{cfuncdesc}{PyObject*}{PyDict_Values}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003125Returns a \ctype{PyListObject} containing all the values
Guido van Rossum44475131998-04-21 15:30:01 +00003126from the dictionary \var{p}, as in the dictionary method
Fred Drakebe486461999-11-09 17:03:03 +00003127\method{values()} (see the \citetitle[../lib/lib.html]{Python Library
3128Reference}).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003129\end{cfuncdesc}
3130
Fred Drake659ebfa2000-04-03 15:42:13 +00003131\begin{cfuncdesc}{int}{PyDict_Size}{PyObject *p}
3132Returns the number of items in the dictionary. This is equivalent to
3133\samp{len(\var{p})} on a dictionary.\bifuncindex{len}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003134\end{cfuncdesc}
3135
Fred Drake7d45d342000-08-11 17:07:32 +00003136\begin{cfuncdesc}{int}{PyDict_Next}{PyDictObject *p, int *ppos,
3137 PyObject **pkey, PyObject **pvalue}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003138
3139\end{cfuncdesc}
3140
3141
Fred Drakeefd146c1999-02-15 15:30:45 +00003142\section{Numeric Objects \label{numericObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003143
Fred Drake659ebfa2000-04-03 15:42:13 +00003144\obindex{numeric}
3145
3146
Fred Drakeefd146c1999-02-15 15:30:45 +00003147\subsection{Plain Integer Objects \label{intObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003148
Fred Drake659ebfa2000-04-03 15:42:13 +00003149\obindex{integer}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003150\begin{ctypedesc}{PyIntObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003151This subtype of \ctype{PyObject} represents a Python integer object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003152\end{ctypedesc}
3153
3154\begin{cvardesc}{PyTypeObject}{PyInt_Type}
Fred Drakef8830d11998-04-23 14:06:01 +00003155This instance of \ctype{PyTypeObject} represents the Python plain
Fred Drake659ebfa2000-04-03 15:42:13 +00003156integer type. This is the same object as \code{types.IntType}.
3157\withsubitem{(in modules types)}{\ttindex{IntType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003158\end{cvardesc}
3159
Fred Drake659ebfa2000-04-03 15:42:13 +00003160\begin{cfuncdesc}{int}{PyInt_Check}{PyObject* o}
3161Returns true if \var{o} is of type \cdata{PyInt_Type}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003162\end{cfuncdesc}
3163
Fred Drakec6fa34e1998-04-02 06:47:24 +00003164\begin{cfuncdesc}{PyObject*}{PyInt_FromLong}{long ival}
Fred Drakee058b4f1998-02-16 06:15:35 +00003165Creates a new integer object with a value of \var{ival}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003166
3167The current implementation keeps an array of integer objects for all
Fred Drakee058b4f1998-02-16 06:15:35 +00003168integers between \code{-1} and \code{100}, when you create an int in
3169that range you actually just get back a reference to the existing
3170object. So it should be possible to change the value of \code{1}. I
Fred Drake7e9d3141998-04-03 05:02:28 +00003171suspect the behaviour of Python in this case is undefined. :-)
Fred Drakee5bf8b21998-02-12 21:22:28 +00003172\end{cfuncdesc}
3173
Fred Drakee5bf8b21998-02-12 21:22:28 +00003174\begin{cfuncdesc}{long}{PyInt_AsLong}{PyObject *io}
Fred Drakef8830d11998-04-23 14:06:01 +00003175Will first attempt to cast the object to a \ctype{PyIntObject}, if
Fred Drakee058b4f1998-02-16 06:15:35 +00003176it is not already one, and then return its value.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003177\end{cfuncdesc}
3178
Fred Drake659ebfa2000-04-03 15:42:13 +00003179\begin{cfuncdesc}{long}{PyInt_AS_LONG}{PyObject *io}
3180Returns the value of the object \var{io}. No error checking is
3181performed.
3182\end{cfuncdesc}
3183
Fred Drakee5bf8b21998-02-12 21:22:28 +00003184\begin{cfuncdesc}{long}{PyInt_GetMax}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00003185Returns the system's idea of the largest integer it can handle
3186(\constant{LONG_MAX}\ttindex{LONG_MAX}, as defined in the system
3187header files).
Fred Drakee5bf8b21998-02-12 21:22:28 +00003188\end{cfuncdesc}
3189
3190
Fred Drakeefd146c1999-02-15 15:30:45 +00003191\subsection{Long Integer Objects \label{longObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003192
Fred Drake659ebfa2000-04-03 15:42:13 +00003193\obindex{long integer}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003194\begin{ctypedesc}{PyLongObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003195This subtype of \ctype{PyObject} represents a Python long integer
Fred Drakee058b4f1998-02-16 06:15:35 +00003196object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003197\end{ctypedesc}
3198
3199\begin{cvardesc}{PyTypeObject}{PyLong_Type}
Fred Drakef8830d11998-04-23 14:06:01 +00003200This instance of \ctype{PyTypeObject} represents the Python long
Fred Drake659ebfa2000-04-03 15:42:13 +00003201integer type. This is the same object as \code{types.LongType}.
3202\withsubitem{(in modules types)}{\ttindex{LongType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003203\end{cvardesc}
3204
3205\begin{cfuncdesc}{int}{PyLong_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003206Returns true if its argument is a \ctype{PyLongObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003207\end{cfuncdesc}
3208
Fred Drakec6fa34e1998-04-02 06:47:24 +00003209\begin{cfuncdesc}{PyObject*}{PyLong_FromLong}{long v}
Fred Drake659ebfa2000-04-03 15:42:13 +00003210Returns a new \ctype{PyLongObject} object from \var{v}, or \NULL{} on
3211failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003212\end{cfuncdesc}
3213
Fred Drakec6fa34e1998-04-02 06:47:24 +00003214\begin{cfuncdesc}{PyObject*}{PyLong_FromUnsignedLong}{unsigned long v}
Fred Drake659ebfa2000-04-03 15:42:13 +00003215Returns a new \ctype{PyLongObject} object from a C \ctype{unsigned
3216long}, or \NULL{} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003217\end{cfuncdesc}
3218
Fred Drakec6fa34e1998-04-02 06:47:24 +00003219\begin{cfuncdesc}{PyObject*}{PyLong_FromDouble}{double v}
Fred Drake659ebfa2000-04-03 15:42:13 +00003220Returns a new \ctype{PyLongObject} object from the integer part of
3221\var{v}, or \NULL{} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003222\end{cfuncdesc}
3223
Fred Drakec6fa34e1998-04-02 06:47:24 +00003224\begin{cfuncdesc}{long}{PyLong_AsLong}{PyObject *pylong}
Fred Drake659ebfa2000-04-03 15:42:13 +00003225Returns a C \ctype{long} representation of the contents of
3226\var{pylong}. If \var{pylong} is greater than
3227\constant{LONG_MAX}\ttindex{LONG_MAX}, an \exception{OverflowError} is
3228raised.\withsubitem{(built-in exception)}{OverflowError}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003229\end{cfuncdesc}
3230
Fred Drakec6fa34e1998-04-02 06:47:24 +00003231\begin{cfuncdesc}{unsigned long}{PyLong_AsUnsignedLong}{PyObject *pylong}
Fred Drake659ebfa2000-04-03 15:42:13 +00003232Returns a C \ctype{unsigned long} representation of the contents of
3233\var{pylong}. If \var{pylong} is greater than
3234\constant{ULONG_MAX}\ttindex{ULONG_MAX}, an \exception{OverflowError}
3235is raised.\withsubitem{(built-in exception)}{OverflowError}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003236\end{cfuncdesc}
3237
Fred Drakec6fa34e1998-04-02 06:47:24 +00003238\begin{cfuncdesc}{double}{PyLong_AsDouble}{PyObject *pylong}
Fred Drake659ebfa2000-04-03 15:42:13 +00003239Returns a C \ctype{double} representation of the contents of \var{pylong}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003240\end{cfuncdesc}
3241
Fred Drakec6fa34e1998-04-02 06:47:24 +00003242\begin{cfuncdesc}{PyObject*}{PyLong_FromString}{char *str, char **pend,
3243 int base}
Fred Drake659ebfa2000-04-03 15:42:13 +00003244Return a new \ctype{PyLongObject} based on the string value in
3245\var{str}, which is interpreted according to the radix in \var{base}.
3246If \var{pend} is non-\NULL, \code{*\var{pend}} will point to the first
3247character in \var{str} which follows the representation of the
3248number. If \var{base} is \code{0}, the radix will be determined base
3249on the leading characters of \var{str}: if \var{str} starts with
3250\code{'0x'} or \code{'0X'}, radix 16 will be used; if \var{str} starts
3251with \code{'0'}, radix 8 will be used; otherwise radix 10 will be
3252used. If \var{base} is not \code{0}, it must be between \code{2} and
3253\code{36}, inclusive. Leading spaces are ignored. If there are no
3254digits, \exception{ValueError} will be raised.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003255\end{cfuncdesc}
3256
3257
Fred Drakeefd146c1999-02-15 15:30:45 +00003258\subsection{Floating Point Objects \label{floatObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003259
Fred Drake659ebfa2000-04-03 15:42:13 +00003260\obindex{floating point}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003261\begin{ctypedesc}{PyFloatObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003262This subtype of \ctype{PyObject} represents a Python floating point
Fred Drakee058b4f1998-02-16 06:15:35 +00003263object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003264\end{ctypedesc}
3265
3266\begin{cvardesc}{PyTypeObject}{PyFloat_Type}
Fred Drakef8830d11998-04-23 14:06:01 +00003267This instance of \ctype{PyTypeObject} represents the Python floating
Fred Drake659ebfa2000-04-03 15:42:13 +00003268point type. This is the same object as \code{types.FloatType}.
3269\withsubitem{(in modules types)}{\ttindex{FloatType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003270\end{cvardesc}
3271
3272\begin{cfuncdesc}{int}{PyFloat_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003273Returns true if its argument is a \ctype{PyFloatObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003274\end{cfuncdesc}
3275
Fred Drakec6fa34e1998-04-02 06:47:24 +00003276\begin{cfuncdesc}{PyObject*}{PyFloat_FromDouble}{double v}
Fred Drake659ebfa2000-04-03 15:42:13 +00003277Creates a \ctype{PyFloatObject} object from \var{v}, or \NULL{} on
3278failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003279\end{cfuncdesc}
3280
Fred Drakec6fa34e1998-04-02 06:47:24 +00003281\begin{cfuncdesc}{double}{PyFloat_AsDouble}{PyObject *pyfloat}
Fred Drake659ebfa2000-04-03 15:42:13 +00003282Returns a C \ctype{double} representation of the contents of \var{pyfloat}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003283\end{cfuncdesc}
3284
Fred Drakec6fa34e1998-04-02 06:47:24 +00003285\begin{cfuncdesc}{double}{PyFloat_AS_DOUBLE}{PyObject *pyfloat}
Fred Drake659ebfa2000-04-03 15:42:13 +00003286Returns a C \ctype{double} representation of the contents of
Fred Drakef8830d11998-04-23 14:06:01 +00003287\var{pyfloat}, but without error checking.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003288\end{cfuncdesc}
3289
3290
Fred Drakeefd146c1999-02-15 15:30:45 +00003291\subsection{Complex Number Objects \label{complexObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003292
Fred Drake659ebfa2000-04-03 15:42:13 +00003293\obindex{complex number}
3294Python's complex number objects are implemented as two distinct types
3295when viewed from the C API: one is the Python object exposed to
3296Python programs, and the other is a C structure which represents the
3297actual complex number value. The API provides functions for working
3298with both.
3299
3300\subsubsection{Complex Numbers as C Structures}
3301
3302Note that the functions which accept these structures as parameters
3303and return them as results do so \emph{by value} rather than
3304dereferencing them through pointers. This is consistent throughout
3305the API.
3306
Fred Drakee5bf8b21998-02-12 21:22:28 +00003307\begin{ctypedesc}{Py_complex}
Fred Drake659ebfa2000-04-03 15:42:13 +00003308The C structure which corresponds to the value portion of a Python
Fred Drake4de05a91998-02-16 14:25:26 +00003309complex number object. Most of the functions for dealing with complex
3310number objects use structures of this type as input or output values,
3311as appropriate. It is defined as:
3312
Fred Drakee058b4f1998-02-16 06:15:35 +00003313\begin{verbatim}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003314typedef struct {
3315 double real;
3316 double imag;
Fred Drake4de05a91998-02-16 14:25:26 +00003317} Py_complex;
Fred Drakee058b4f1998-02-16 06:15:35 +00003318\end{verbatim}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003319\end{ctypedesc}
3320
Fred Drake659ebfa2000-04-03 15:42:13 +00003321\begin{cfuncdesc}{Py_complex}{_Py_c_sum}{Py_complex left, Py_complex right}
3322Return the sum of two complex numbers, using the C
3323\ctype{Py_complex} representation.
3324\end{cfuncdesc}
3325
3326\begin{cfuncdesc}{Py_complex}{_Py_c_diff}{Py_complex left, Py_complex right}
3327Return the difference between two complex numbers, using the C
3328\ctype{Py_complex} representation.
3329\end{cfuncdesc}
3330
3331\begin{cfuncdesc}{Py_complex}{_Py_c_neg}{Py_complex complex}
3332Return the negation of the complex number \var{complex}, using the C
3333\ctype{Py_complex} representation.
3334\end{cfuncdesc}
3335
3336\begin{cfuncdesc}{Py_complex}{_Py_c_prod}{Py_complex left, Py_complex right}
3337Return the product of two complex numbers, using the C
3338\ctype{Py_complex} representation.
3339\end{cfuncdesc}
3340
3341\begin{cfuncdesc}{Py_complex}{_Py_c_quot}{Py_complex dividend,
3342 Py_complex divisor}
3343Return the quotient of two complex numbers, using the C
3344\ctype{Py_complex} representation.
3345\end{cfuncdesc}
3346
3347\begin{cfuncdesc}{Py_complex}{_Py_c_pow}{Py_complex num, Py_complex exp}
3348Return the exponentiation of \var{num} by \var{exp}, using the C
3349\ctype{Py_complex} representation.
3350\end{cfuncdesc}
3351
3352
3353\subsubsection{Complex Numbers as Python Objects}
3354
Fred Drakee5bf8b21998-02-12 21:22:28 +00003355\begin{ctypedesc}{PyComplexObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003356This subtype of \ctype{PyObject} represents a Python complex number object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003357\end{ctypedesc}
3358
3359\begin{cvardesc}{PyTypeObject}{PyComplex_Type}
Fred Drakef8830d11998-04-23 14:06:01 +00003360This instance of \ctype{PyTypeObject} represents the Python complex
Fred Drakee5bf8b21998-02-12 21:22:28 +00003361number type.
3362\end{cvardesc}
3363
3364\begin{cfuncdesc}{int}{PyComplex_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003365Returns true if its argument is a \ctype{PyComplexObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003366\end{cfuncdesc}
3367
Fred Drakec6fa34e1998-04-02 06:47:24 +00003368\begin{cfuncdesc}{PyObject*}{PyComplex_FromCComplex}{Py_complex v}
Fred Drake659ebfa2000-04-03 15:42:13 +00003369Create a new Python complex number object from a C
3370\ctype{Py_complex} value.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003371\end{cfuncdesc}
3372
Fred Drakec6fa34e1998-04-02 06:47:24 +00003373\begin{cfuncdesc}{PyObject*}{PyComplex_FromDoubles}{double real, double imag}
Fred Drakef8830d11998-04-23 14:06:01 +00003374Returns a new \ctype{PyComplexObject} object from \var{real} and \var{imag}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003375\end{cfuncdesc}
3376
3377\begin{cfuncdesc}{double}{PyComplex_RealAsDouble}{PyObject *op}
Fred Drake659ebfa2000-04-03 15:42:13 +00003378Returns the real part of \var{op} as a C \ctype{double}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003379\end{cfuncdesc}
3380
3381\begin{cfuncdesc}{double}{PyComplex_ImagAsDouble}{PyObject *op}
Fred Drake659ebfa2000-04-03 15:42:13 +00003382Returns the imaginary part of \var{op} as a C \ctype{double}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003383\end{cfuncdesc}
3384
3385\begin{cfuncdesc}{Py_complex}{PyComplex_AsCComplex}{PyObject *op}
Fred Drake659ebfa2000-04-03 15:42:13 +00003386Returns the \ctype{Py_complex} value of the complex number \var{op}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003387\end{cfuncdesc}
3388
3389
3390
Fred Drakeefd146c1999-02-15 15:30:45 +00003391\section{Other Objects \label{otherObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003392
Fred Drakeefd146c1999-02-15 15:30:45 +00003393\subsection{File Objects \label{fileObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003394
Fred Drake659ebfa2000-04-03 15:42:13 +00003395\obindex{file}
3396Python's built-in file objects are implemented entirely on the
3397\ctype{FILE*} support from the C standard library. This is an
3398implementation detail and may change in future releases of Python.
3399
Fred Drakee5bf8b21998-02-12 21:22:28 +00003400\begin{ctypedesc}{PyFileObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003401This subtype of \ctype{PyObject} represents a Python file object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003402\end{ctypedesc}
3403
3404\begin{cvardesc}{PyTypeObject}{PyFile_Type}
Fred Drake659ebfa2000-04-03 15:42:13 +00003405This instance of \ctype{PyTypeObject} represents the Python file
3406type. This is exposed to Python programs as \code{types.FileType}.
3407\withsubitem{(in module types)}{\ttindex{FileType}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003408\end{cvardesc}
3409
3410\begin{cfuncdesc}{int}{PyFile_Check}{PyObject *p}
Fred Drakef8830d11998-04-23 14:06:01 +00003411Returns true if its argument is a \ctype{PyFileObject}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003412\end{cfuncdesc}
3413
Fred Drake659ebfa2000-04-03 15:42:13 +00003414\begin{cfuncdesc}{PyObject*}{PyFile_FromString}{char *filename, char *mode}
3415On success, returns a new file object that is opened on the
3416file given by \var{filename}, with a file mode given by \var{mode},
3417where \var{mode} has the same semantics as the standard C routine
3418\cfunction{fopen()}\ttindex{fopen()}. On failure, returns \NULL.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003419\end{cfuncdesc}
3420
Fred Drakec6fa34e1998-04-02 06:47:24 +00003421\begin{cfuncdesc}{PyObject*}{PyFile_FromFile}{FILE *fp,
Fred Drake659ebfa2000-04-03 15:42:13 +00003422 char *name, char *mode,
3423 int (*close)(FILE*)}
3424Creates a new \ctype{PyFileObject} from the already-open standard C
3425file pointer, \var{fp}. The function \var{close} will be called when
3426the file should be closed. Returns \NULL{} on failure.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003427\end{cfuncdesc}
3428
Fred Drake659ebfa2000-04-03 15:42:13 +00003429\begin{cfuncdesc}{FILE*}{PyFile_AsFile}{PyFileObject *p}
3430Returns the file object associated with \var{p} as a \ctype{FILE*}.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003431\end{cfuncdesc}
3432
Fred Drakec6fa34e1998-04-02 06:47:24 +00003433\begin{cfuncdesc}{PyObject*}{PyFile_GetLine}{PyObject *p, int n}
Fred Drake659ebfa2000-04-03 15:42:13 +00003434Equivalent to \code{\var{p}.readline(\optional{\var{n}})}, this
3435function reads one line from the object \var{p}. \var{p} may be a
3436file object or any object with a \method{readline()} method. If
3437\var{n} is \code{0}, exactly one line is read, regardless of the
3438length of the line. If \var{n} is greater than \code{0}, no more than
3439\var{n} bytes will be read from the file; a partial line can be
3440returned. In both cases, an empty string is returned if the end of
3441the file is reached immediately. If \var{n} is less than \code{0},
3442however, one line is read regardless of length, but
3443\exception{EOFError} is raised if the end of the file is reached
3444immediately.
3445\withsubitem{(built-in exception)}{\ttindex{EOFError}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003446\end{cfuncdesc}
3447
Fred Drakec6fa34e1998-04-02 06:47:24 +00003448\begin{cfuncdesc}{PyObject*}{PyFile_Name}{PyObject *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00003449Returns the name of the file specified by \var{p} as a string object.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003450\end{cfuncdesc}
3451
3452\begin{cfuncdesc}{void}{PyFile_SetBufSize}{PyFileObject *p, int n}
Fred Drake659ebfa2000-04-03 15:42:13 +00003453Available on systems with \cfunction{setvbuf()}\ttindex{setvbuf()}
3454only. This should only be called immediately after file object
3455creation.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003456\end{cfuncdesc}
3457
Fred Drake659ebfa2000-04-03 15:42:13 +00003458\begin{cfuncdesc}{int}{PyFile_SoftSpace}{PyObject *p, int newflag}
3459This function exists for internal use by the interpreter.
3460Sets the \member{softspace} attribute of \var{p} to \var{newflag} and
3461\withsubitem{(file attribute)}{\ttindex{softspace}}returns the
3462previous value. \var{p} does not have to be a file object
3463for this function to work properly; any object is supported (thought
3464its only interesting if the \member{softspace} attribute can be set).
3465This function clears any errors, and will return \code{0} as the
3466previous value if the attribute either does not exist or if there were
3467errors in retrieving it. There is no way to detect errors from this
3468function, but doing so should not be needed.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003469\end{cfuncdesc}
3470
Fred Drakec6fa34e1998-04-02 06:47:24 +00003471\begin{cfuncdesc}{int}{PyFile_WriteObject}{PyObject *obj, PyFileObject *p,
3472 int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00003473Writes object \var{obj} to file object \var{p}. The only supported
3474flag for \var{flags} is \constant{Py_PRINT_RAW}\ttindex{Py_PRINT_RAW};
3475if given, the \function{str()} of the object is written instead of the
3476\function{repr()}. Returns \code{0} on success or \code{-1} on
3477failure; the appropriate exception will be set.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003478\end{cfuncdesc}
3479
Fred Drakec6fa34e1998-04-02 06:47:24 +00003480\begin{cfuncdesc}{int}{PyFile_WriteString}{char *s, PyFileObject *p,
3481 int flags}
Fred Drake659ebfa2000-04-03 15:42:13 +00003482Writes string \var{s} to file object \var{p}. Returns \code{0} on
3483success or \code{-1} on failure; the appropriate exception will be
3484set.
Fred Drakee5bf8b21998-02-12 21:22:28 +00003485\end{cfuncdesc}
3486
3487
Fred Drakeefd146c1999-02-15 15:30:45 +00003488\subsection{Module Objects \label{moduleObjects}}
3489
3490\obindex{module}
3491There are only a few functions special to module objects.
3492
Fred Drake659ebfa2000-04-03 15:42:13 +00003493\begin{cvardesc}{PyTypeObject}{PyModule_Type}
3494This instance of \ctype{PyTypeObject} represents the Python module
3495type. This is exposed to Python programs as \code{types.ModuleType}.
3496\withsubitem{(in module types)}{\ttindex{ModuleType}}
3497\end{cvardesc}
3498
3499\begin{cfuncdesc}{int}{PyModule_Check}{PyObject *p}
3500Returns true if its argument is a module object.
Fred Drakeefd146c1999-02-15 15:30:45 +00003501\end{cfuncdesc}
3502
Fred Drake659ebfa2000-04-03 15:42:13 +00003503\begin{cfuncdesc}{PyObject*}{PyModule_New}{char *name}
3504Return a new module object with the \member{__name__} attribute set to
3505\var{name}. Only the module's \member{__doc__} and
3506\member{__name__} attributes are filled in; the caller is responsible
3507for providing a \member{__file__} attribute.
3508\withsubitem{(module attribute)}{
3509 \ttindex{__name__}\ttindex{__doc__}\ttindex{__file__}}
3510\end{cfuncdesc}
3511
3512\begin{cfuncdesc}{PyObject*}{PyModule_GetDict}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00003513Return the dictionary object that implements \var{module}'s namespace;
3514this object is the same as the \member{__dict__} attribute of the
3515module object. This function never fails.
Fred Drake659ebfa2000-04-03 15:42:13 +00003516\withsubitem{(module attribute)}{\ttindex{__dict__}}
Fred Drakeefd146c1999-02-15 15:30:45 +00003517\end{cfuncdesc}
3518
Fred Drake659ebfa2000-04-03 15:42:13 +00003519\begin{cfuncdesc}{char*}{PyModule_GetName}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00003520Return \var{module}'s \member{__name__} value. If the module does not
Fred Drake659ebfa2000-04-03 15:42:13 +00003521provide one, or if it is not a string, \exception{SystemError} is
3522raised and \NULL{} is returned.
3523\withsubitem{(module attribute)}{\ttindex{__name__}}
3524\withsubitem{(built-in exception)}{\ttindex{SystemError}}
Fred Drakeefd146c1999-02-15 15:30:45 +00003525\end{cfuncdesc}
3526
Fred Drake659ebfa2000-04-03 15:42:13 +00003527\begin{cfuncdesc}{char*}{PyModule_GetFilename}{PyObject *module}
Fred Drakeefd146c1999-02-15 15:30:45 +00003528Return the name of the file from which \var{module} was loaded using
3529\var{module}'s \member{__file__} attribute. If this is not defined,
Fred Drake659ebfa2000-04-03 15:42:13 +00003530or if it is not a string, raise \exception{SystemError} and return
3531\NULL.
3532\withsubitem{(module attribute)}{\ttindex{__file__}}
3533\withsubitem{(built-in exception)}{\ttindex{SystemError}}
Fred Drakeefd146c1999-02-15 15:30:45 +00003534\end{cfuncdesc}
3535
3536
3537\subsection{CObjects \label{cObjects}}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003538
Fred Drake659ebfa2000-04-03 15:42:13 +00003539\obindex{CObject}
3540Refer to \emph{Extending and Embedding the Python Interpreter},
3541section 1.12 (``Providing a C API for an Extension Module''), for more
3542information on using these objects.
3543
3544
Guido van Rossum44475131998-04-21 15:30:01 +00003545\begin{ctypedesc}{PyCObject}
Fred Drakef8830d11998-04-23 14:06:01 +00003546This subtype of \ctype{PyObject} represents an opaque value, useful for
Fred Drake659ebfa2000-04-03 15:42:13 +00003547C extension modules who need to pass an opaque value (as a
3548\ctype{void*} pointer) through Python code to other C code. It is
Guido van Rossum44475131998-04-21 15:30:01 +00003549often used to make a C function pointer defined in one module
3550available to other modules, so the regular import mechanism can be
3551used to access C APIs defined in dynamically loaded modules.
3552\end{ctypedesc}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003553
Fred Drake659ebfa2000-04-03 15:42:13 +00003554\begin{cfuncdesc}{int}{PyCObject_Check}{PyObject *p}
3555Returns true if its argument is a \ctype{PyCObject}.
3556\end{cfuncdesc}
3557
3558\begin{cfuncdesc}{PyObject*}{PyCObject_FromVoidPtr}{void* cobj,
Guido van Rossum44475131998-04-21 15:30:01 +00003559 void (*destr)(void *)}
Fred Drake1d158692000-06-18 05:21:21 +00003560Creates a \ctype{PyCObject} from the \code{void *}\var{cobj}. The
Fred Drakedab44681999-05-13 18:41:14 +00003561\var{destr} function will be called when the object is reclaimed, unless
3562it is \NULL.
Guido van Rossum44475131998-04-21 15:30:01 +00003563\end{cfuncdesc}
3564
Fred Drake659ebfa2000-04-03 15:42:13 +00003565\begin{cfuncdesc}{PyObject*}{PyCObject_FromVoidPtrAndDesc}{void* cobj,
Guido van Rossum44475131998-04-21 15:30:01 +00003566 void* desc, void (*destr)(void *, void *) }
Fred Drakef8830d11998-04-23 14:06:01 +00003567Creates a \ctype{PyCObject} from the \ctype{void *}\var{cobj}. The
3568\var{destr} function will be called when the object is reclaimed. The
3569\var{desc} argument can be used to pass extra callback data for the
3570destructor function.
Guido van Rossum44475131998-04-21 15:30:01 +00003571\end{cfuncdesc}
3572
Fred Drake659ebfa2000-04-03 15:42:13 +00003573\begin{cfuncdesc}{void*}{PyCObject_AsVoidPtr}{PyObject* self}
3574Returns the object \ctype{void *} that the
3575\ctype{PyCObject} \var{self} was created with.
Guido van Rossum44475131998-04-21 15:30:01 +00003576\end{cfuncdesc}
3577
Fred Drake659ebfa2000-04-03 15:42:13 +00003578\begin{cfuncdesc}{void*}{PyCObject_GetDesc}{PyObject* self}
3579Returns the description \ctype{void *} that the
3580\ctype{PyCObject} \var{self} was created with.
Guido van Rossum44475131998-04-21 15:30:01 +00003581\end{cfuncdesc}
Fred Drakee5bf8b21998-02-12 21:22:28 +00003582
Fred Drake659ebfa2000-04-03 15:42:13 +00003583
Fred Drakeefd146c1999-02-15 15:30:45 +00003584\chapter{Initialization, Finalization, and Threads
3585 \label{initialization}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003586
Guido van Rossum4a944d71997-08-14 20:35:38 +00003587\begin{cfuncdesc}{void}{Py_Initialize}{}
3588Initialize the Python interpreter. In an application embedding
3589Python, this should be called before using any other Python/C API
Fred Drake659ebfa2000-04-03 15:42:13 +00003590functions; with the exception of
3591\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()},
3592\cfunction{PyEval_InitThreads()}\ttindex{PyEval_InitThreads()},
3593\cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()},
3594and \cfunction{PyEval_AcquireLock()}\ttindex{PyEval_AcquireLock()}.
3595This initializes the table of loaded modules (\code{sys.modules}), and
3596\withsubitem{(in module sys)}{\ttindex{modules}\ttindex{path}}creates the
3597fundamental modules \module{__builtin__}\refbimodindex{__builtin__},
Fred Drake4de05a91998-02-16 14:25:26 +00003598\module{__main__}\refbimodindex{__main__} and
3599\module{sys}\refbimodindex{sys}. It also initializes the module
Fred Drake659ebfa2000-04-03 15:42:13 +00003600search\indexiii{module}{search}{path} path (\code{sys.path}).
3601It does not set \code{sys.argv}; use
3602\cfunction{PySys_SetArgv()}\ttindex{PySys_SetArgv()} for that. This
3603is a no-op when called for a second time (without calling
3604\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} first). There is no
3605return value; it is a fatal error if the initialization fails.
Guido van Rossum42cefd01997-10-05 15:27:29 +00003606\end{cfuncdesc}
3607
3608\begin{cfuncdesc}{int}{Py_IsInitialized}{}
Guido van Rossum42cefd01997-10-05 15:27:29 +00003609Return true (nonzero) when the Python interpreter has been
Fred Drakee058b4f1998-02-16 06:15:35 +00003610initialized, false (zero) if not. After \cfunction{Py_Finalize()} is
3611called, this returns false until \cfunction{Py_Initialize()} is called
Guido van Rossum42cefd01997-10-05 15:27:29 +00003612again.
Guido van Rossum4a944d71997-08-14 20:35:38 +00003613\end{cfuncdesc}
3614
3615\begin{cfuncdesc}{void}{Py_Finalize}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00003616Undo all initializations made by \cfunction{Py_Initialize()} and
3617subsequent use of Python/C API functions, and destroy all
3618sub-interpreters (see \cfunction{Py_NewInterpreter()} below) that were
3619created and not yet destroyed since the last call to
3620\cfunction{Py_Initialize()}. Ideally, this frees all memory allocated
3621by the Python interpreter. This is a no-op when called for a second
3622time (without calling \cfunction{Py_Initialize()} again first). There
3623is no return value; errors during finalization are ignored.
Guido van Rossum4a944d71997-08-14 20:35:38 +00003624
3625This function is provided for a number of reasons. An embedding
3626application might want to restart Python without having to restart the
3627application itself. An application that has loaded the Python
3628interpreter from a dynamically loadable library (or DLL) might want to
3629free all memory allocated by Python before unloading the DLL. During a
3630hunt for memory leaks in an application a developer might want to free
3631all memory allocated by Python before exiting from the application.
3632
Fred Drakee058b4f1998-02-16 06:15:35 +00003633\strong{Bugs and caveats:} The destruction of modules and objects in
Guido van Rossum4a944d71997-08-14 20:35:38 +00003634modules is done in random order; this may cause destructors
Fred Drakee058b4f1998-02-16 06:15:35 +00003635(\method{__del__()} methods) to fail when they depend on other objects
Guido van Rossum4a944d71997-08-14 20:35:38 +00003636(even functions) or modules. Dynamically loaded extension modules
3637loaded by Python are not unloaded. Small amounts of memory allocated
3638by the Python interpreter may not be freed (if you find a leak, please
3639report it). Memory tied up in circular references between objects is
3640not freed. Some memory allocated by extension modules may not be
3641freed. Some extension may not work properly if their initialization
3642routine is called more than once; this can happen if an applcation
Fred Drakee058b4f1998-02-16 06:15:35 +00003643calls \cfunction{Py_Initialize()} and \cfunction{Py_Finalize()} more
3644than once.
Guido van Rossum4a944d71997-08-14 20:35:38 +00003645\end{cfuncdesc}
3646
Fred Drakec6fa34e1998-04-02 06:47:24 +00003647\begin{cfuncdesc}{PyThreadState*}{Py_NewInterpreter}{}
Fred Drake4de05a91998-02-16 14:25:26 +00003648Create a new sub-interpreter. This is an (almost) totally separate
3649environment for the execution of Python code. In particular, the new
3650interpreter has separate, independent versions of all imported
3651modules, including the fundamental modules
3652\module{__builtin__}\refbimodindex{__builtin__},
3653\module{__main__}\refbimodindex{__main__} and
3654\module{sys}\refbimodindex{sys}. The table of loaded modules
3655(\code{sys.modules}) and the module search path (\code{sys.path}) are
3656also separate. The new environment has no \code{sys.argv} variable.
3657It has new standard I/O stream file objects \code{sys.stdin},
3658\code{sys.stdout} and \code{sys.stderr} (however these refer to the
Fred Drake659ebfa2000-04-03 15:42:13 +00003659same underlying \ctype{FILE} structures in the C library).
3660\withsubitem{(in module sys)}{
3661 \ttindex{stdout}\ttindex{stderr}\ttindex{stdin}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003662
3663The return value points to the first thread state created in the new
3664sub-interpreter. This thread state is made the current thread state.
3665Note that no actual thread is created; see the discussion of thread
3666states below. If creation of the new interpreter is unsuccessful,
Guido van Rossum580aa8d1997-11-25 15:34:51 +00003667\NULL{} is returned; no exception is set since the exception state
Guido van Rossum4a944d71997-08-14 20:35:38 +00003668is stored in the current thread state and there may not be a current
3669thread state. (Like all other Python/C API functions, the global
3670interpreter lock must be held before calling this function and is
3671still held when it returns; however, unlike most other Python/C API
3672functions, there needn't be a current thread state on entry.)
3673
3674Extension modules are shared between (sub-)interpreters as follows:
3675the first time a particular extension is imported, it is initialized
3676normally, and a (shallow) copy of its module's dictionary is
3677squirreled away. When the same extension is imported by another
3678(sub-)interpreter, a new module is initialized and filled with the
Fred Drakee058b4f1998-02-16 06:15:35 +00003679contents of this copy; the extension's \code{init} function is not
3680called. Note that this is different from what happens when an
3681extension is imported after the interpreter has been completely
Fred Drake659ebfa2000-04-03 15:42:13 +00003682re-initialized by calling
3683\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} and
3684\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}; in that case,
3685the extension's \code{init\var{module}} function \emph{is} called
3686again.
Guido van Rossum4a944d71997-08-14 20:35:38 +00003687
Fred Drakee058b4f1998-02-16 06:15:35 +00003688\strong{Bugs and caveats:} Because sub-interpreters (and the main
Guido van Rossum4a944d71997-08-14 20:35:38 +00003689interpreter) are part of the same process, the insulation between them
Fred Drakee058b4f1998-02-16 06:15:35 +00003690isn't perfect --- for example, using low-level file operations like
Fred Drake659ebfa2000-04-03 15:42:13 +00003691\withsubitem{(in module os)}{\ttindex{close()}}
Fred Drakef8830d11998-04-23 14:06:01 +00003692\function{os.close()} they can (accidentally or maliciously) affect each
Guido van Rossum4a944d71997-08-14 20:35:38 +00003693other's open files. Because of the way extensions are shared between
3694(sub-)interpreters, some extensions may not work properly; this is
3695especially likely when the extension makes use of (static) global
3696variables, or when the extension manipulates its module's dictionary
3697after its initialization. It is possible to insert objects created in
3698one sub-interpreter into a namespace of another sub-interpreter; this
3699should be done with great care to avoid sharing user-defined
3700functions, methods, instances or classes between sub-interpreters,
3701since import operations executed by such objects may affect the
3702wrong (sub-)interpreter's dictionary of loaded modules. (XXX This is
3703a hard-to-fix bug that will be addressed in a future release.)
3704\end{cfuncdesc}
3705
3706\begin{cfuncdesc}{void}{Py_EndInterpreter}{PyThreadState *tstate}
3707Destroy the (sub-)interpreter represented by the given thread state.
3708The given thread state must be the current thread state. See the
3709discussion of thread states below. When the call returns, the current
Guido van Rossum580aa8d1997-11-25 15:34:51 +00003710thread state is \NULL{}. All thread states associated with this
Guido van Rossum4a944d71997-08-14 20:35:38 +00003711interpreted are destroyed. (The global interpreter lock must be held
3712before calling this function and is still held when it returns.)
Fred Drake659ebfa2000-04-03 15:42:13 +00003713\cfunction{Py_Finalize()}\ttindex{Py_Finalize()} will destroy all
3714sub-interpreters that haven't been explicitly destroyed at that point.
Guido van Rossum4a944d71997-08-14 20:35:38 +00003715\end{cfuncdesc}
3716
3717\begin{cfuncdesc}{void}{Py_SetProgramName}{char *name}
Fred Drake659ebfa2000-04-03 15:42:13 +00003718This function should be called before
3719\cfunction{Py_Initialize()}\ttindex{Py_Initialize()} is called
Guido van Rossum4a944d71997-08-14 20:35:38 +00003720for the first time, if it is called at all. It tells the interpreter
Fred Drake659ebfa2000-04-03 15:42:13 +00003721the value of the \code{argv[0]} argument to the
3722\cfunction{main()}\ttindex{main()} function of the program. This is
3723used by \cfunction{Py_GetPath()}\ttindex{Py_GetPath()} and some other
Guido van Rossum4a944d71997-08-14 20:35:38 +00003724functions below to find the Python run-time libraries relative to the
Fred Drakea8455ab2000-06-16 19:58:42 +00003725interpreter executable. The default value is \code{'python'}. The
Guido van Rossum4a944d71997-08-14 20:35:38 +00003726argument should point to a zero-terminated character string in static
3727storage whose contents will not change for the duration of the
3728program's execution. No code in the Python interpreter will change
3729the contents of this storage.
3730\end{cfuncdesc}
3731
Fred Drakec6fa34e1998-04-02 06:47:24 +00003732\begin{cfuncdesc}{char*}{Py_GetProgramName}{}
Fred Drake659ebfa2000-04-03 15:42:13 +00003733Return the program name set with
3734\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()}, or the
Guido van Rossum4a944d71997-08-14 20:35:38 +00003735default. The returned string points into static storage; the caller
3736should not modify its value.
3737\end{cfuncdesc}
3738
Fred Drakec6fa34e1998-04-02 06:47:24 +00003739\begin{cfuncdesc}{char*}{Py_GetPrefix}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00003740Return the \emph{prefix} for installed platform-independent files. This
Guido van Rossum4a944d71997-08-14 20:35:38 +00003741is derived through a number of complicated rules from the program name
Fred Drakee058b4f1998-02-16 06:15:35 +00003742set with \cfunction{Py_SetProgramName()} and some environment variables;
Fred Drakea8455ab2000-06-16 19:58:42 +00003743for example, if the program name is \code{'/usr/local/bin/python'},
3744the prefix is \code{'/usr/local'}. The returned string points into
Guido van Rossum4a944d71997-08-14 20:35:38 +00003745static storage; the caller should not modify its value. This
Fred Drakec94d9341998-04-12 02:39:13 +00003746corresponds to the \makevar{prefix} variable in the top-level
Fred Drakea8455ab2000-06-16 19:58:42 +00003747\file{Makefile} and the \longprogramopt{prefix} argument to the
Fred Drakee058b4f1998-02-16 06:15:35 +00003748\program{configure} script at build time. The value is available to
Fred Drakeb0a78731998-01-13 18:51:10 +00003749Python code as \code{sys.prefix}. It is only useful on \UNIX{}. See
Guido van Rossum4a944d71997-08-14 20:35:38 +00003750also the next function.
3751\end{cfuncdesc}
3752
Fred Drakec6fa34e1998-04-02 06:47:24 +00003753\begin{cfuncdesc}{char*}{Py_GetExecPrefix}{}
Fred Drakee058b4f1998-02-16 06:15:35 +00003754Return the \emph{exec-prefix} for installed platform-\emph{de}pendent
Guido van Rossum4a944d71997-08-14 20:35:38 +00003755files. This is derived through a number of complicated rules from the
Fred Drakee058b4f1998-02-16 06:15:35 +00003756program name set with \cfunction{Py_SetProgramName()} and some environment
Guido van Rossum4a944d71997-08-14 20:35:38 +00003757variables; for example, if the program name is
Fred Drakea8455ab2000-06-16 19:58:42 +00003758\code{'/usr/local/bin/python'}, the exec-prefix is
3759\code{'/usr/local'}. The returned string points into static storage;
Guido van Rossum4a944d71997-08-14 20:35:38 +00003760the caller should not modify its value. This corresponds to the
Fred Drakec94d9341998-04-12 02:39:13 +00003761\makevar{exec_prefix} variable in the top-level \file{Makefile} and the
Fred Drakea8455ab2000-06-16 19:58:42 +00003762\longprogramopt{exec-prefix} argument to the
Fred Drake310ee611999-11-09 17:31:42 +00003763\program{configure} script at build time. The value is available to
3764Python code as \code{sys.exec_prefix}. It is only useful on \UNIX{}.
Guido van Rossum4a944d71997-08-14 20:35:38 +00003765
3766Background: The exec-prefix differs from the prefix when platform
3767dependent files (such as executables and shared libraries) are
3768installed in a different directory tree. In a typical installation,
3769platform dependent files may be installed in the
Fred Drakea8455ab2000-06-16 19:58:42 +00003770\file{/usr/local/plat} subtree while platform independent may be
3771installed in \file{/usr/local}.
Guido van Rossum4a944d71997-08-14 20:35:38 +00003772
3773Generally speaking, a platform is a combination of hardware and
3774software families, e.g. Sparc machines running the Solaris 2.x
3775operating system are considered the same platform, but Intel machines
3776running Solaris 2.x are another platform, and Intel machines running
3777Linux are yet another platform. Different major revisions of the same
Fred Drakeb0a78731998-01-13 18:51:10 +00003778operating system generally also form different platforms. Non-\UNIX{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003779operating systems are a different story; the installation strategies
3780on those systems are so different that the prefix and exec-prefix are
3781meaningless, and set to the empty string. Note that compiled Python
3782bytecode files are platform independent (but not independent from the
3783Python version by which they were compiled!).
3784
Fred Drakee058b4f1998-02-16 06:15:35 +00003785System administrators will know how to configure the \program{mount} or
Fred Drakea8455ab2000-06-16 19:58:42 +00003786\program{automount} programs to share \file{/usr/local} between platforms
3787while having \file{/usr/local/plat} be a different filesystem for each
Guido van Rossum4a944d71997-08-14 20:35:38 +00003788platform.
3789\end{cfuncdesc}
3790
Fred Drakec6fa34e1998-04-02 06:47:24 +00003791\begin{cfuncdesc}{char*}{Py_GetProgramFullPath}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003792Return the full program name of the Python executable; this is
3793computed as a side-effect of deriving the default module search path
Fred Drake659ebfa2000-04-03 15:42:13 +00003794from the program name (set by
3795\cfunction{Py_SetProgramName()}\ttindex{Py_SetProgramName()} above).
3796The returned string points into static storage; the caller should not
Guido van Rossum4a944d71997-08-14 20:35:38 +00003797modify its value. The value is available to Python code as
Guido van Rossum42cefd01997-10-05 15:27:29 +00003798\code{sys.executable}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003799\withsubitem{(in module sys)}{\ttindex{executable}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003800\end{cfuncdesc}
3801
Fred Drakec6fa34e1998-04-02 06:47:24 +00003802\begin{cfuncdesc}{char*}{Py_GetPath}{}
Fred Drake4de05a91998-02-16 14:25:26 +00003803\indexiii{module}{search}{path}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003804Return the default module search path; this is computed from the
Fred Drakee058b4f1998-02-16 06:15:35 +00003805program name (set by \cfunction{Py_SetProgramName()} above) and some
Guido van Rossum4a944d71997-08-14 20:35:38 +00003806environment variables. The returned string consists of a series of
3807directory names separated by a platform dependent delimiter character.
Fred Drakef8830d11998-04-23 14:06:01 +00003808The delimiter character is \character{:} on \UNIX{}, \character{;} on
Fred Drake659ebfa2000-04-03 15:42:13 +00003809DOS/Windows, and \character{\e n} (the \ASCII{} newline character) on
Fred Drakee5bc4971998-02-12 23:36:49 +00003810Macintosh. The returned string points into static storage; the caller
Guido van Rossum4a944d71997-08-14 20:35:38 +00003811should not modify its value. The value is available to Python code
Fred Drake659ebfa2000-04-03 15:42:13 +00003812as the list \code{sys.path}\withsubitem{(in module sys)}{\ttindex{path}},
3813which may be modified to change the future search path for loaded
3814modules.
Guido van Rossum4a944d71997-08-14 20:35:38 +00003815
3816% XXX should give the exact rules
3817\end{cfuncdesc}
3818
Fred Drakec6fa34e1998-04-02 06:47:24 +00003819\begin{cfuncdesc}{const char*}{Py_GetVersion}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003820Return the version of this Python interpreter. This is a string that
3821looks something like
3822
Guido van Rossum09270b51997-08-15 18:57:32 +00003823\begin{verbatim}
Fred Drakee058b4f1998-02-16 06:15:35 +00003824"1.5 (#67, Dec 31 1997, 22:34:28) [GCC 2.7.2.2]"
Guido van Rossum09270b51997-08-15 18:57:32 +00003825\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003826
3827The first word (up to the first space character) is the current Python
3828version; the first three characters are the major and minor version
3829separated by a period. The returned string points into static storage;
3830the caller should not modify its value. The value is available to
3831Python code as the list \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003832\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003833\end{cfuncdesc}
3834
Fred Drakec6fa34e1998-04-02 06:47:24 +00003835\begin{cfuncdesc}{const char*}{Py_GetPlatform}{}
Fred Drakeb0a78731998-01-13 18:51:10 +00003836Return the platform identifier for the current platform. On \UNIX{},
Guido van Rossum4a944d71997-08-14 20:35:38 +00003837this is formed from the ``official'' name of the operating system,
3838converted to lower case, followed by the major revision number; e.g.,
3839for Solaris 2.x, which is also known as SunOS 5.x, the value is
Fred Drakea8455ab2000-06-16 19:58:42 +00003840\code{'sunos5'}. On Macintosh, it is \code{'mac'}. On Windows, it
3841is \code{'win'}. The returned string points into static storage;
Guido van Rossum4a944d71997-08-14 20:35:38 +00003842the caller should not modify its value. The value is available to
3843Python code as \code{sys.platform}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003844\withsubitem{(in module sys)}{\ttindex{platform}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003845\end{cfuncdesc}
3846
Fred Drakec6fa34e1998-04-02 06:47:24 +00003847\begin{cfuncdesc}{const char*}{Py_GetCopyright}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003848Return the official copyright string for the current Python version,
3849for example
3850
Fred Drakea8455ab2000-06-16 19:58:42 +00003851\code{'Copyright 1991-1995 Stichting Mathematisch Centrum, Amsterdam'}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003852
3853The returned string points into static storage; the caller should not
3854modify its value. The value is available to Python code as the list
3855\code{sys.copyright}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003856\withsubitem{(in module sys)}{\ttindex{copyright}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003857\end{cfuncdesc}
3858
Fred Drakec6fa34e1998-04-02 06:47:24 +00003859\begin{cfuncdesc}{const char*}{Py_GetCompiler}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003860Return an indication of the compiler used to build the current Python
Fred Drakee058b4f1998-02-16 06:15:35 +00003861version, in square brackets, for example:
Guido van Rossum4a944d71997-08-14 20:35:38 +00003862
Fred Drakee058b4f1998-02-16 06:15:35 +00003863\begin{verbatim}
3864"[GCC 2.7.2.2]"
3865\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003866
3867The returned string points into static storage; the caller should not
3868modify its value. The value is available to Python code as part of
3869the variable \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003870\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003871\end{cfuncdesc}
3872
Fred Drakec6fa34e1998-04-02 06:47:24 +00003873\begin{cfuncdesc}{const char*}{Py_GetBuildInfo}{}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003874Return information about the sequence number and build date and time
3875of the current Python interpreter instance, for example
3876
Guido van Rossum09270b51997-08-15 18:57:32 +00003877\begin{verbatim}
3878"#67, Aug 1 1997, 22:34:28"
3879\end{verbatim}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003880
3881The returned string points into static storage; the caller should not
3882modify its value. The value is available to Python code as part of
3883the variable \code{sys.version}.
Fred Drake659ebfa2000-04-03 15:42:13 +00003884\withsubitem{(in module sys)}{\ttindex{version}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003885\end{cfuncdesc}
3886
3887\begin{cfuncdesc}{int}{PySys_SetArgv}{int argc, char **argv}
Fred Drake659ebfa2000-04-03 15:42:13 +00003888Set \code{sys.argv} based on \var{argc} and \var{argv}. These
3889parameters are similar to those passed to the program's
3890\cfunction{main()}\ttindex{main()} function with the difference that
3891the first entry should refer to the script file to be executed rather
3892than the executable hosting the Python interpreter. If there isn't a
3893script that will be run, the first entry in \var{argv} can be an empty
3894string. If this function fails to initialize \code{sys.argv}, a fatal
3895condition is signalled using
3896\cfunction{Py_FatalError()}\ttindex{Py_FatalError()}.
3897\withsubitem{(in module sys)}{\ttindex{argv}}
3898% XXX impl. doesn't seem consistent in allowing 0/NULL for the params;
3899% check w/ Guido.
Guido van Rossum4a944d71997-08-14 20:35:38 +00003900\end{cfuncdesc}
3901
3902% XXX Other PySys thingies (doesn't really belong in this chapter)
3903
Fred Drakeefd146c1999-02-15 15:30:45 +00003904\section{Thread State and the Global Interpreter Lock
3905 \label{threads}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00003906
Fred Drake659ebfa2000-04-03 15:42:13 +00003907\index{global interpreter lock}
3908\index{interpreter lock}
3909\index{lock, interpreter}
3910
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003911The Python interpreter is not fully thread safe. In order to support
3912multi-threaded Python programs, there's a global lock that must be
3913held by the current thread before it can safely access Python objects.
3914Without the lock, even the simplest operations could cause problems in
Fred Drake7baf3d41998-02-20 00:45:52 +00003915a multi-threaded program: for example, when two threads simultaneously
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003916increment the reference count of the same object, the reference count
3917could end up being incremented only once instead of twice.
3918
3919Therefore, the rule exists that only the thread that has acquired the
3920global interpreter lock may operate on Python objects or call Python/C
3921API functions. In order to support multi-threaded Python programs,
Fred Drake659ebfa2000-04-03 15:42:13 +00003922the interpreter regularly releases and reacquires the lock --- by
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003923default, every ten bytecode instructions (this can be changed with
Fred Drake659ebfa2000-04-03 15:42:13 +00003924\withsubitem{(in module sys)}{\ttindex{setcheckinterval()}}
Fred Drakee058b4f1998-02-16 06:15:35 +00003925\function{sys.setcheckinterval()}). The lock is also released and
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003926reacquired around potentially blocking I/O operations like reading or
3927writing a file, so that other threads can run while the thread that
3928requests the I/O is waiting for the I/O operation to complete.
3929
3930The Python interpreter needs to keep some bookkeeping information
Fred Drakee058b4f1998-02-16 06:15:35 +00003931separate per thread --- for this it uses a data structure called
Fred Drake659ebfa2000-04-03 15:42:13 +00003932\ctype{PyThreadState}\ttindex{PyThreadState}. This is new in Python
39331.5; in earlier versions, such state was stored in global variables,
3934and switching threads could cause problems. In particular, exception
3935handling is now thread safe, when the application uses
3936\withsubitem{(in module sys)}{\ttindex{exc_info()}}
3937\function{sys.exc_info()} to access the exception last raised in the
3938current thread.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003939
3940There's one global variable left, however: the pointer to the current
Fred Drake659ebfa2000-04-03 15:42:13 +00003941\ctype{PyThreadState}\ttindex{PyThreadState} structure. While most
3942thread packages have a way to store ``per-thread global data,''
3943Python's internal platform independent thread abstraction doesn't
3944support this yet. Therefore, the current thread state must be
3945manipulated explicitly.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003946
3947This is easy enough in most cases. Most code manipulating the global
3948interpreter lock has the following simple structure:
3949
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003950\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003951Save the thread state in a local variable.
3952Release the interpreter lock.
3953...Do some blocking I/O operation...
3954Reacquire the interpreter lock.
3955Restore the thread state from the local variable.
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003956\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003957
3958This is so common that a pair of macros exists to simplify it:
3959
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003960\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003961Py_BEGIN_ALLOW_THREADS
3962...Do some blocking I/O operation...
3963Py_END_ALLOW_THREADS
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003964\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003965
Fred Drake659ebfa2000-04-03 15:42:13 +00003966The \code{Py_BEGIN_ALLOW_THREADS}\ttindex{Py_BEGIN_ALLOW_THREADS} macro
3967opens a new block and declares a hidden local variable; the
3968\code{Py_END_ALLOW_THREADS}\ttindex{Py_END_ALLOW_THREADS} macro closes
Fred Drakee058b4f1998-02-16 06:15:35 +00003969the block. Another advantage of using these two macros is that when
3970Python is compiled without thread support, they are defined empty,
3971thus saving the thread state and lock manipulations.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003972
3973When thread support is enabled, the block above expands to the
3974following code:
3975
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003976\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003977 PyThreadState *_save;
Fred Drake659ebfa2000-04-03 15:42:13 +00003978
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003979 _save = PyEval_SaveThread();
3980 ...Do some blocking I/O operation...
3981 PyEval_RestoreThread(_save);
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003982\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003983
3984Using even lower level primitives, we can get roughly the same effect
3985as follows:
3986
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003987\begin{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003988 PyThreadState *_save;
Fred Drake659ebfa2000-04-03 15:42:13 +00003989
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003990 _save = PyThreadState_Swap(NULL);
3991 PyEval_ReleaseLock();
3992 ...Do some blocking I/O operation...
3993 PyEval_AcquireLock();
3994 PyThreadState_Swap(_save);
Guido van Rossum9faf4c51997-10-07 14:38:54 +00003995\end{verbatim}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00003996
3997There are some subtle differences; in particular,
Fred Drake659ebfa2000-04-03 15:42:13 +00003998\cfunction{PyEval_RestoreThread()}\ttindex{PyEval_RestoreThread()} saves
3999and restores the value of the global variable
4000\cdata{errno}\ttindex{errno}, since the lock manipulation does not
Fred Drakef8830d11998-04-23 14:06:01 +00004001guarantee that \cdata{errno} is left alone. Also, when thread support
Fred Drake659ebfa2000-04-03 15:42:13 +00004002is disabled,
4003\cfunction{PyEval_SaveThread()}\ttindex{PyEval_SaveThread()} and
Fred Drakee058b4f1998-02-16 06:15:35 +00004004\cfunction{PyEval_RestoreThread()} don't manipulate the lock; in this
Fred Drake659ebfa2000-04-03 15:42:13 +00004005case, \cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()} and
4006\cfunction{PyEval_AcquireLock()}\ttindex{PyEval_AcquireLock()} are not
4007available. This is done so that dynamically loaded extensions
4008compiled with thread support enabled can be loaded by an interpreter
4009that was compiled with disabled thread support.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004010
4011The global interpreter lock is used to protect the pointer to the
4012current thread state. When releasing the lock and saving the thread
4013state, the current thread state pointer must be retrieved before the
4014lock is released (since another thread could immediately acquire the
4015lock and store its own thread state in the global variable).
4016Reversely, when acquiring the lock and restoring the thread state, the
4017lock must be acquired before storing the thread state pointer.
4018
4019Why am I going on with so much detail about this? Because when
Fred Drake659ebfa2000-04-03 15:42:13 +00004020threads are created from C, they don't have the global interpreter
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004021lock, nor is there a thread state data structure for them. Such
4022threads must bootstrap themselves into existence, by first creating a
4023thread state data structure, then acquiring the lock, and finally
4024storing their thread state pointer, before they can start using the
4025Python/C API. When they are done, they should reset the thread state
4026pointer, release the lock, and finally free their thread state data
4027structure.
4028
4029When creating a thread data structure, you need to provide an
4030interpreter state data structure. The interpreter state data
4031structure hold global data that is shared by all threads in an
4032interpreter, for example the module administration
4033(\code{sys.modules}). Depending on your needs, you can either create
4034a new interpreter state data structure, or share the interpreter state
4035data structure used by the Python main thread (to access the latter,
Fred Drakef8830d11998-04-23 14:06:01 +00004036you must obtain the thread state and access its \member{interp} member;
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004037this must be done by a thread that is created by Python or by the main
4038thread after Python is initialized).
4039
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004040
4041\begin{ctypedesc}{PyInterpreterState}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004042This data structure represents the state shared by a number of
4043cooperating threads. Threads belonging to the same interpreter
4044share their module administration and a few other internal items.
4045There are no public members in this structure.
4046
4047Threads belonging to different interpreters initially share nothing,
4048except process state like available memory, open file descriptors and
4049such. The global interpreter lock is also shared by all threads,
4050regardless of to which interpreter they belong.
4051\end{ctypedesc}
4052
4053\begin{ctypedesc}{PyThreadState}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004054This data structure represents the state of a single thread. The only
Fred Drakef8830d11998-04-23 14:06:01 +00004055public data member is \ctype{PyInterpreterState *}\member{interp},
4056which points to this thread's interpreter state.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004057\end{ctypedesc}
4058
4059\begin{cfuncdesc}{void}{PyEval_InitThreads}{}
4060Initialize and acquire the global interpreter lock. It should be
4061called in the main thread before creating a second thread or engaging
Fred Drakee058b4f1998-02-16 06:15:35 +00004062in any other thread operations such as
Fred Drake659ebfa2000-04-03 15:42:13 +00004063\cfunction{PyEval_ReleaseLock()}\ttindex{PyEval_ReleaseLock()} or
4064\code{PyEval_ReleaseThread(\var{tstate})}\ttindex{PyEval_ReleaseThread()}.
4065It is not needed before calling
4066\cfunction{PyEval_SaveThread()}\ttindex{PyEval_SaveThread()} or
4067\cfunction{PyEval_RestoreThread()}\ttindex{PyEval_RestoreThread()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004068
4069This is a no-op when called for a second time. It is safe to call
Fred Drake659ebfa2000-04-03 15:42:13 +00004070this function before calling
4071\cfunction{Py_Initialize()}\ttindex{Py_Initialize()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004072
4073When only the main thread exists, no lock operations are needed. This
4074is a common situation (most Python programs do not use threads), and
4075the lock operations slow the interpreter down a bit. Therefore, the
4076lock is not created initially. This situation is equivalent to having
4077acquired the lock: when there is only a single thread, all object
4078accesses are safe. Therefore, when this function initializes the
Fred Drake4de05a91998-02-16 14:25:26 +00004079lock, it also acquires it. Before the Python
4080\module{thread}\refbimodindex{thread} module creates a new thread,
4081knowing that either it has the lock or the lock hasn't been created
4082yet, it calls \cfunction{PyEval_InitThreads()}. When this call
4083returns, it is guaranteed that the lock has been created and that it
4084has acquired it.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004085
4086It is \strong{not} safe to call this function when it is unknown which
4087thread (if any) currently has the global interpreter lock.
4088
4089This function is not available when thread support is disabled at
4090compile time.
4091\end{cfuncdesc}
4092
Guido van Rossum4a944d71997-08-14 20:35:38 +00004093\begin{cfuncdesc}{void}{PyEval_AcquireLock}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004094Acquire the global interpreter lock. The lock must have been created
4095earlier. If this thread already has the lock, a deadlock ensues.
4096This function is not available when thread support is disabled at
4097compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004098\end{cfuncdesc}
4099
4100\begin{cfuncdesc}{void}{PyEval_ReleaseLock}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004101Release the global interpreter lock. The lock must have been created
4102earlier. This function is not available when thread support is
Fred Drakee058b4f1998-02-16 06:15:35 +00004103disabled at compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004104\end{cfuncdesc}
4105
4106\begin{cfuncdesc}{void}{PyEval_AcquireThread}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004107Acquire the global interpreter lock and then set the current thread
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004108state to \var{tstate}, which should not be \NULL{}. The lock must
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004109have been created earlier. If this thread already has the lock,
4110deadlock ensues. This function is not available when thread support
Fred Drakee058b4f1998-02-16 06:15:35 +00004111is disabled at compile time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004112\end{cfuncdesc}
4113
4114\begin{cfuncdesc}{void}{PyEval_ReleaseThread}{PyThreadState *tstate}
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004115Reset the current thread state to \NULL{} and release the global
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004116interpreter lock. The lock must have been created earlier and must be
4117held by the current thread. The \var{tstate} argument, which must not
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004118be \NULL{}, is only used to check that it represents the current
Fred Drakee058b4f1998-02-16 06:15:35 +00004119thread state --- if it isn't, a fatal error is reported. This
4120function is not available when thread support is disabled at compile
4121time.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004122\end{cfuncdesc}
4123
Fred Drakec6fa34e1998-04-02 06:47:24 +00004124\begin{cfuncdesc}{PyThreadState*}{PyEval_SaveThread}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004125Release the interpreter lock (if it has been created and thread
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004126support is enabled) and reset the thread state to \NULL{},
4127returning the previous thread state (which is not \NULL{}). If
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004128the lock has been created, the current thread must have acquired it.
4129(This function is available even when thread support is disabled at
4130compile time.)
Guido van Rossum4a944d71997-08-14 20:35:38 +00004131\end{cfuncdesc}
4132
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004133\begin{cfuncdesc}{void}{PyEval_RestoreThread}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004134Acquire the interpreter lock (if it has been created and thread
4135support is enabled) and set the thread state to \var{tstate}, which
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004136must not be \NULL{}. If the lock has been created, the current
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004137thread must not have acquired it, otherwise deadlock ensues. (This
4138function is available even when thread support is disabled at compile
4139time.)
Guido van Rossum4a944d71997-08-14 20:35:38 +00004140\end{cfuncdesc}
4141
Fred Drake659ebfa2000-04-03 15:42:13 +00004142The following macros are normally used without a trailing semicolon;
4143look for example usage in the Python source distribution.
4144
4145\begin{csimplemacrodesc}{Py_BEGIN_ALLOW_THREADS}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004146This macro expands to
Fred Drakee058b4f1998-02-16 06:15:35 +00004147\samp{\{ PyThreadState *_save; _save = PyEval_SaveThread();}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004148Note that it contains an opening brace; it must be matched with a
4149following \code{Py_END_ALLOW_THREADS} macro. See above for further
4150discussion of this macro. It is a no-op when thread support is
4151disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004152\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004153
Fred Drake659ebfa2000-04-03 15:42:13 +00004154\begin{csimplemacrodesc}{Py_END_ALLOW_THREADS}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004155This macro expands to
Fred Drakee058b4f1998-02-16 06:15:35 +00004156\samp{PyEval_RestoreThread(_save); \}}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004157Note that it contains a closing brace; it must be matched with an
4158earlier \code{Py_BEGIN_ALLOW_THREADS} macro. See above for further
4159discussion of this macro. It is a no-op when thread support is
4160disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004161\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004162
Fred Drake659ebfa2000-04-03 15:42:13 +00004163\begin{csimplemacrodesc}{Py_BEGIN_BLOCK_THREADS}
Fred Drakee058b4f1998-02-16 06:15:35 +00004164This macro expands to \samp{PyEval_RestoreThread(_save);} i.e. it
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004165is equivalent to \code{Py_END_ALLOW_THREADS} without the closing
4166brace. It is a no-op when thread support is disabled at compile
4167time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004168\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004169
Fred Drake659ebfa2000-04-03 15:42:13 +00004170\begin{csimplemacrodesc}{Py_BEGIN_UNBLOCK_THREADS}
Fred Drakee058b4f1998-02-16 06:15:35 +00004171This macro expands to \samp{_save = PyEval_SaveThread();} i.e. it is
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004172equivalent to \code{Py_BEGIN_ALLOW_THREADS} without the opening brace
4173and variable declaration. It is a no-op when thread support is
4174disabled at compile time.
Fred Drake659ebfa2000-04-03 15:42:13 +00004175\end{csimplemacrodesc}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004176
4177All of the following functions are only available when thread support
4178is enabled at compile time, and must be called only when the
Fred Drake9d20ac31998-02-16 15:27:08 +00004179interpreter lock has been created.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004180
Fred Drakec6fa34e1998-04-02 06:47:24 +00004181\begin{cfuncdesc}{PyInterpreterState*}{PyInterpreterState_New}{}
Guido van Rossumed9dcc11998-08-07 18:28:03 +00004182Create a new interpreter state object. The interpreter lock need not
4183be held, but may be held if it is necessary to serialize calls to this
4184function.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004185\end{cfuncdesc}
4186
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004187\begin{cfuncdesc}{void}{PyInterpreterState_Clear}{PyInterpreterState *interp}
4188Reset all information in an interpreter state object. The interpreter
4189lock must be held.
Guido van Rossum4a944d71997-08-14 20:35:38 +00004190\end{cfuncdesc}
4191
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004192\begin{cfuncdesc}{void}{PyInterpreterState_Delete}{PyInterpreterState *interp}
4193Destroy an interpreter state object. The interpreter lock need not be
4194held. The interpreter state must have been reset with a previous
Fred Drakee058b4f1998-02-16 06:15:35 +00004195call to \cfunction{PyInterpreterState_Clear()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004196\end{cfuncdesc}
4197
Fred Drakec6fa34e1998-04-02 06:47:24 +00004198\begin{cfuncdesc}{PyThreadState*}{PyThreadState_New}{PyInterpreterState *interp}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004199Create a new thread state object belonging to the given interpreter
Guido van Rossumed9dcc11998-08-07 18:28:03 +00004200object. The interpreter lock need not be held, but may be held if it
4201is necessary to serialize calls to this function.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004202\end{cfuncdesc}
4203
4204\begin{cfuncdesc}{void}{PyThreadState_Clear}{PyThreadState *tstate}
4205Reset all information in a thread state object. The interpreter lock
4206must be held.
4207\end{cfuncdesc}
4208
4209\begin{cfuncdesc}{void}{PyThreadState_Delete}{PyThreadState *tstate}
4210Destroy a thread state object. The interpreter lock need not be
4211held. The thread state must have been reset with a previous
Fred Drakee058b4f1998-02-16 06:15:35 +00004212call to \cfunction{PyThreadState_Clear()}.
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004213\end{cfuncdesc}
4214
Fred Drakec6fa34e1998-04-02 06:47:24 +00004215\begin{cfuncdesc}{PyThreadState*}{PyThreadState_Get}{}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004216Return the current thread state. The interpreter lock must be held.
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004217When the current thread state is \NULL{}, this issues a fatal
Guido van Rossum5b8a5231997-12-30 04:38:44 +00004218error (so that the caller needn't check for \NULL{}).
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004219\end{cfuncdesc}
4220
Fred Drakec6fa34e1998-04-02 06:47:24 +00004221\begin{cfuncdesc}{PyThreadState*}{PyThreadState_Swap}{PyThreadState *tstate}
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004222Swap the current thread state with the thread state given by the
Guido van Rossum580aa8d1997-11-25 15:34:51 +00004223argument \var{tstate}, which may be \NULL{}. The interpreter lock
Guido van Rossumc44d3d61997-10-06 05:10:47 +00004224must be held.
4225\end{cfuncdesc}
4226
4227
Fred Drake659ebfa2000-04-03 15:42:13 +00004228\chapter{Memory Management \label{memory}}
4229\sectionauthor{Vladimir Marangozov}{Vladimir.Marangozov@inrialpes.fr}
4230
4231
4232\section{Overview \label{memoryOverview}}
4233
4234Memory management in Python involves a private heap containing all
4235Python objects and data structures. The management of this private
4236heap is ensured internally by the \emph{Python memory manager}. The
4237Python memory manager has different components which deal with various
4238dynamic storage management aspects, like sharing, segmentation,
4239preallocation or caching.
4240
4241At the lowest level, a raw memory allocator ensures that there is
4242enough room in the private heap for storing all Python-related data
4243by interacting with the memory manager of the operating system. On top
4244of the raw memory allocator, several object-specific allocators
4245operate on the same heap and implement distinct memory management
4246policies adapted to the peculiarities of every object type. For
4247example, integer objects are managed differently within the heap than
4248strings, tuples or dictionaries because integers imply different
4249storage requirements and speed/space tradeoffs. The Python memory
4250manager thus delegates some of the work to the object-specific
4251allocators, but ensures that the latter operate within the bounds of
4252the private heap.
4253
4254It is important to understand that the management of the Python heap
4255is performed by the interpreter itself and that the user has no
4256control on it, even if she regularly manipulates object pointers to
4257memory blocks inside that heap. The allocation of heap space for
4258Python objects and other internal buffers is performed on demand by
4259the Python memory manager through the Python/C API functions listed in
4260this document.
4261
4262To avoid memory corruption, extension writers should never try to
4263operate on Python objects with the functions exported by the C
4264library: \cfunction{malloc()}\ttindex{malloc()},
4265\cfunction{calloc()}\ttindex{calloc()},
4266\cfunction{realloc()}\ttindex{realloc()} and
4267\cfunction{free()}\ttindex{free()}. This will result in
4268mixed calls between the C allocator and the Python memory manager
4269with fatal consequences, because they implement different algorithms
4270and operate on different heaps. However, one may safely allocate and
4271release memory blocks with the C library allocator for individual
4272purposes, as shown in the following example:
4273
4274\begin{verbatim}
4275 PyObject *res;
4276 char *buf = (char *) malloc(BUFSIZ); /* for I/O */
4277
4278 if (buf == NULL)
4279 return PyErr_NoMemory();
4280 ...Do some I/O operation involving buf...
4281 res = PyString_FromString(buf);
4282 free(buf); /* malloc'ed */
4283 return res;
4284\end{verbatim}
4285
4286In this example, the memory request for the I/O buffer is handled by
4287the C library allocator. The Python memory manager is involved only
4288in the allocation of the string object returned as a result.
4289
4290In most situations, however, it is recommended to allocate memory from
4291the Python heap specifically because the latter is under control of
4292the Python memory manager. For example, this is required when the
4293interpreter is extended with new object types written in C. Another
4294reason for using the Python heap is the desire to \emph{inform} the
4295Python memory manager about the memory needs of the extension module.
4296Even when the requested memory is used exclusively for internal,
4297highly-specific purposes, delegating all memory requests to the Python
4298memory manager causes the interpreter to have a more accurate image of
4299its memory footprint as a whole. Consequently, under certain
4300circumstances, the Python memory manager may or may not trigger
4301appropriate actions, like garbage collection, memory compaction or
4302other preventive procedures. Note that by using the C library
4303allocator as shown in the previous example, the allocated memory for
4304the I/O buffer escapes completely the Python memory manager.
4305
4306
4307\section{Memory Interface \label{memoryInterface}}
4308
4309The following function sets, modeled after the ANSI C standard, are
4310available for allocating and releasing memory from the Python heap:
4311
4312
Fred Drake7d45d342000-08-11 17:07:32 +00004313\begin{cfuncdesc}{void*}{PyMem_Malloc}{size_t n}
4314Allocates \var{n} bytes and returns a pointer of type \ctype{void*} to
Fred Drake659ebfa2000-04-03 15:42:13 +00004315the allocated memory, or \NULL{} if the request fails. Requesting zero
4316bytes returns a non-\NULL{} pointer.
4317\end{cfuncdesc}
4318
Fred Drake7d45d342000-08-11 17:07:32 +00004319\begin{cfuncdesc}{void*}{PyMem_Realloc}{void *p, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004320Resizes the memory block pointed to by \var{p} to \var{n} bytes. The
4321contents will be unchanged to the minimum of the old and the new
4322sizes. If \var{p} is \NULL{}, the call is equivalent to
4323\cfunction{PyMem_Malloc(\var{n})}; if \var{n} is equal to zero, the memory block
4324is resized but is not freed, and the returned pointer is non-\NULL{}.
4325Unless \var{p} is \NULL{}, it must have been returned by a previous
4326call to \cfunction{PyMem_Malloc()} or \cfunction{PyMem_Realloc()}.
4327\end{cfuncdesc}
4328
Fred Drake7d45d342000-08-11 17:07:32 +00004329\begin{cfuncdesc}{void}{PyMem_Free}{void *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00004330Frees the memory block pointed to by \var{p}, which must have been
4331returned by a previous call to \cfunction{PyMem_Malloc()} or
4332\cfunction{PyMem_Realloc()}. Otherwise, or if
4333\cfunction{PyMem_Free(p)} has been called before, undefined behaviour
4334occurs. If \var{p} is \NULL{}, no operation is performed.
4335\end{cfuncdesc}
4336
Fred Drake7d45d342000-08-11 17:07:32 +00004337\begin{cfuncdesc}{void*}{Py_Malloc}{size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004338Same as \cfunction{PyMem_Malloc()}, but calls
4339\cfunction{PyErr_NoMemory()} on failure.
4340\end{cfuncdesc}
4341
Fred Drake7d45d342000-08-11 17:07:32 +00004342\begin{cfuncdesc}{void*}{Py_Realloc}{void *p, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004343Same as \cfunction{PyMem_Realloc()}, but calls
4344\cfunction{PyErr_NoMemory()} on failure.
4345\end{cfuncdesc}
4346
Fred Drake7d45d342000-08-11 17:07:32 +00004347\begin{cfuncdesc}{void}{Py_Free}{void *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00004348Same as \cfunction{PyMem_Free()}.
4349\end{cfuncdesc}
4350
4351The following type-oriented macros are provided for convenience. Note
4352that \var{TYPE} refers to any C type.
4353
4354\begin{cfuncdesc}{\var{TYPE}*}{PyMem_NEW}{TYPE, size_t n}
4355Same as \cfunction{PyMem_Malloc()}, but allocates \code{(\var{n} *
4356sizeof(\var{TYPE}))} bytes of memory. Returns a pointer cast to
4357\ctype{\var{TYPE}*}.
4358\end{cfuncdesc}
4359
Fred Drake7d45d342000-08-11 17:07:32 +00004360\begin{cfuncdesc}{\var{TYPE}*}{PyMem_RESIZE}{void *p, TYPE, size_t n}
Fred Drake659ebfa2000-04-03 15:42:13 +00004361Same as \cfunction{PyMem_Realloc()}, but the memory block is resized
4362to \code{(\var{n} * sizeof(\var{TYPE}))} bytes. Returns a pointer
4363cast to \ctype{\var{TYPE}*}.
4364\end{cfuncdesc}
4365
Fred Drake7d45d342000-08-11 17:07:32 +00004366\begin{cfuncdesc}{void}{PyMem_DEL}{void *p}
Fred Drake659ebfa2000-04-03 15:42:13 +00004367Same as \cfunction{PyMem_Free()}.
4368\end{cfuncdesc}
4369
4370
4371\section{Examples \label{memoryExamples}}
4372
4373Here is the example from section \ref{memoryOverview}, rewritten so
4374that the I/O buffer is allocated from the Python heap by using the
4375first function set:
4376
4377\begin{verbatim}
4378 PyObject *res;
4379 char *buf = (char *) PyMem_Malloc(BUFSIZ); /* for I/O */
4380
4381 if (buf == NULL)
4382 return PyErr_NoMemory();
4383 /* ...Do some I/O operation involving buf... */
4384 res = PyString_FromString(buf);
4385 PyMem_Free(buf); /* allocated with PyMem_Malloc */
4386 return res;
4387\end{verbatim}
4388
4389With the second function set, the need to call
4390\cfunction{PyErr_NoMemory()} is obviated:
4391
4392\begin{verbatim}
4393 PyObject *res;
4394 char *buf = (char *) Py_Malloc(BUFSIZ); /* for I/O */
4395
4396 if (buf == NULL)
4397 return NULL;
4398 /* ...Do some I/O operation involving buf... */
4399 res = PyString_FromString(buf);
4400 Py_Free(buf); /* allocated with Py_Malloc */
4401 return res;
4402\end{verbatim}
4403
4404The same code using the macro set:
4405
4406\begin{verbatim}
4407 PyObject *res;
4408 char *buf = PyMem_NEW(char, BUFSIZ); /* for I/O */
4409
4410 if (buf == NULL)
4411 return PyErr_NoMemory();
4412 /* ...Do some I/O operation involving buf... */
4413 res = PyString_FromString(buf);
4414 PyMem_DEL(buf); /* allocated with PyMem_NEW */
4415 return res;
4416\end{verbatim}
4417
4418Note that in the three examples above, the buffer is always
4419manipulated via functions/macros belonging to the same set. Indeed, it
4420is required to use the same memory API family for a given
4421memory block, so that the risk of mixing different allocators is
4422reduced to a minimum. The following code sequence contains two errors,
4423one of which is labeled as \emph{fatal} because it mixes two different
4424allocators operating on different heaps.
4425
4426\begin{verbatim}
4427char *buf1 = PyMem_NEW(char, BUFSIZ);
4428char *buf2 = (char *) malloc(BUFSIZ);
4429char *buf3 = (char *) PyMem_Malloc(BUFSIZ);
4430...
4431PyMem_DEL(buf3); /* Wrong -- should be PyMem_Free() */
4432free(buf2); /* Right -- allocated via malloc() */
4433free(buf1); /* Fatal -- should be PyMem_DEL() */
4434\end{verbatim}
4435
4436In addition to the functions aimed at handling raw memory blocks from
4437the Python heap, objects in Python are allocated and released with
4438\cfunction{_PyObject_New()}\ttindex{_PyObject_New()} and
4439\cfunction{_PyObject_NewVar()}\ttindex{_PyObject_NewVar()}, or with
4440their corresponding macros
4441\cfunction{PyObject_NEW()}\ttindex{PyObject_NEW()} and
4442\cfunction{PyObject_NEW_VAR()}\ttindex{PyObject_NEW_VAR()}.
4443
Fred Drakee06f0f92000-06-30 15:52:39 +00004444\cfunction{_PyObject_New()}, \cfunction{_PyObject_NewVar()},
4445\cfunction{_PyObject_Del()}, or with their corresponding macros
4446\cfunction{PyObject_NEW()}, \cfunction{PyObject_NEW_VAR()},
4447\cfunction{PyObject_DEL()}.
Fred Drake659ebfa2000-04-03 15:42:13 +00004448
Fred Drakee06f0f92000-06-30 15:52:39 +00004449These will be explained in the next chapter on defining and
4450implementing new object types in C.
Fred Drake659ebfa2000-04-03 15:42:13 +00004451
4452
Fred Drakeefd146c1999-02-15 15:30:45 +00004453\chapter{Defining New Object Types \label{newTypes}}
Guido van Rossum4a944d71997-08-14 20:35:38 +00004454
Fred Drakec6fa34e1998-04-02 06:47:24 +00004455\begin{cfuncdesc}{PyObject*}{_PyObject_New}{PyTypeObject *type}
Fred Drakee058b4f1998-02-16 06:15:35 +00004456\end{cfuncdesc}
4457
Fred Drakec6fa34e1998-04-02 06:47:24 +00004458\begin{cfuncdesc}{PyObject*}{_PyObject_NewVar}{PyTypeObject *type, int size}
Fred Drakee058b4f1998-02-16 06:15:35 +00004459\end{cfuncdesc}
4460
Fred Drake659ebfa2000-04-03 15:42:13 +00004461\begin{cfuncdesc}{\var{TYPE}}{_PyObject_NEW}{TYPE, PyTypeObject *type}
Fred Drakee058b4f1998-02-16 06:15:35 +00004462\end{cfuncdesc}
4463
Fred Drake659ebfa2000-04-03 15:42:13 +00004464\begin{cfuncdesc}{\var{TYPE}}{_PyObject_NEW_VAR}{TYPE, PyTypeObject *type,
4465 int size}
Fred Drakee058b4f1998-02-16 06:15:35 +00004466\end{cfuncdesc}
4467
Guido van Rossum3c4378b1998-04-14 20:21:10 +00004468Py_InitModule (!!!)
4469
4470PyArg_ParseTupleAndKeywords, PyArg_ParseTuple, PyArg_Parse
4471
4472Py_BuildValue
Guido van Rossumae110af1997-05-22 20:11:52 +00004473
Fred Drake659ebfa2000-04-03 15:42:13 +00004474DL_IMPORT
4475
4476Py*_Check
4477
4478_Py_NoneStruct
4479
4480
4481\section{Common Object Structures \label{common-structs}}
4482
Guido van Rossumae110af1997-05-22 20:11:52 +00004483PyObject, PyVarObject
4484
4485PyObject_HEAD, PyObject_HEAD_INIT, PyObject_VAR_HEAD
4486
4487Typedefs:
4488unaryfunc, binaryfunc, ternaryfunc, inquiry, coercion, intargfunc,
4489intintargfunc, intobjargproc, intintobjargproc, objobjargproc,
Guido van Rossumae110af1997-05-22 20:11:52 +00004490destructor, printfunc, getattrfunc, getattrofunc, setattrfunc,
4491setattrofunc, cmpfunc, reprfunc, hashfunc
4492
Fred Drakea8455ab2000-06-16 19:58:42 +00004493\begin{ctypedesc}{PyCFunction}
4494Type of the functions used to implement most Python callables in C.
4495\end{ctypedesc}
4496
4497\begin{ctypedesc}{PyMethodDef}
4498Structure used to describe a method of an extension type. This
4499structure has four fields:
4500
4501\begin{tableiii}{l|l|l}{member}{Field}{C Type}{Meaning}
4502 \lineiii{ml_name}{char *}{name of the method}
4503 \lineiii{ml_meth}{PyCFunction}{pointer to the C implementation}
4504 \lineiii{ml_flags}{int}{flag bits indicating how the call should be
4505 constructed}
4506 \lineiii{ml_doc}{char *}{points to the contents of the docstring}
4507\end{tableiii}
4508\end{ctypedesc}
4509
4510\begin{cfuncdesc}{PyObject*}{Py_FindMethod}{PyMethodDef[] table,
4511 PyObject *ob, char *name}
4512Return a bound method object for an extension type implemented in C.
4513This function also handles the special attribute \member{__methods__},
4514returning a list of all the method names defined in \var{table}.
4515\end{cfuncdesc}
4516
Fred Drake659ebfa2000-04-03 15:42:13 +00004517
4518\section{Mapping Object Structures \label{mapping-structs}}
4519
4520\begin{ctypedesc}{PyMappingMethods}
4521Structure used to hold pointers to the functions used to implement the
4522mapping protocol for an extension type.
4523\end{ctypedesc}
4524
4525
4526\section{Number Object Structures \label{number-structs}}
4527
4528\begin{ctypedesc}{PyNumberMethods}
4529Structure used to hold pointers to the functions an extension type
4530uses to implement the number protocol.
4531\end{ctypedesc}
4532
4533
4534\section{Sequence Object Structures \label{sequence-structs}}
4535
4536\begin{ctypedesc}{PySequenceMethods}
4537Structure used to hold pointers to the functions which an object uses
4538to implement the sequence protocol.
4539\end{ctypedesc}
4540
4541
4542\section{Buffer Object Structures \label{buffer-structs}}
4543\sectionauthor{Greg J. Stein}{greg@lyra.org}
4544
4545The buffer interface exports a model where an object can expose its
4546internal data as a set of chunks of data, where each chunk is
4547specified as a pointer/length pair. These chunks are called
4548\dfn{segments} and are presumed to be non-contiguous in memory.
4549
4550If an object does not export the buffer interface, then its
4551\member{tp_as_buffer} member in the \ctype{PyTypeObject} structure
4552should be \NULL{}. Otherwise, the \member{tp_as_buffer} will point to
4553a \ctype{PyBufferProcs} structure.
4554
4555\strong{Note:} It is very important that your
4556\ctype{PyTypeObject} structure uses \code{Py_TPFLAGS_DEFAULT} for the
4557value of the \member{tp_flags} member rather than \code{0}. This
4558tells the Python runtime that your \ctype{PyBufferProcs} structure
4559contains the \member{bf_getcharbuffer} slot. Older versions of Python
4560did not have this member, so a new Python interpreter using an old
4561extension needs to be able to test for its presence before using it.
4562
4563\begin{ctypedesc}{PyBufferProcs}
4564Structure used to hold the function pointers which define an
4565implementation of the buffer protocol.
4566
4567The first slot is \member{bf_getreadbuffer}, of type
4568\ctype{getreadbufferproc}. If this slot is \NULL{}, then the object
4569does not support reading from the internal data. This is
4570non-sensical, so implementors should fill this in, but callers should
4571test that the slot contains a non-\NULL{} value.
4572
4573The next slot is \member{bf_getwritebuffer} having type
4574\ctype{getwritebufferproc}. This slot may be \NULL{} if the object
4575does not allow writing into its returned buffers.
4576
4577The third slot is \member{bf_getsegcount}, with type
4578\ctype{getsegcountproc}. This slot must not be \NULL{} and is used to
4579inform the caller how many segments the object contains. Simple
4580objects such as \ctype{PyString_Type} and
4581\ctype{PyBuffer_Type} objects contain a single segment.
4582
4583The last slot is \member{bf_getcharbuffer}, of type
4584\ctype{getcharbufferproc}. This slot will only be present if the
4585\code{Py_TPFLAGS_HAVE_GETCHARBUFFER} flag is present in the
4586\member{tp_flags} field of the object's \ctype{PyTypeObject}. Before using
4587this slot, the caller should test whether it is present by using the
4588\cfunction{PyType_HasFeature()}\ttindex{PyType_HasFeature()} function.
4589If present, it may be \NULL, indicating that the object's contents
4590cannot be used as \emph{8-bit characters}.
4591The slot function may also raise an error if the object's contents
4592cannot be interpreted as 8-bit characters. For example, if the object
4593is an array which is configured to hold floating point values, an
4594exception may be raised if a caller attempts to use
4595\member{bf_getcharbuffer} to fetch a sequence of 8-bit characters.
4596This notion of exporting the internal buffers as ``text'' is used to
4597distinguish between objects that are binary in nature, and those which
4598have character-based content.
4599
4600\strong{Note:} The current policy seems to state that these characters
4601may be multi-byte characters. This implies that a buffer size of
4602\var{N} does not mean there are \var{N} characters present.
4603\end{ctypedesc}
4604
4605\begin{datadesc}{Py_TPFLAGS_HAVE_GETCHARBUFFER}
4606Flag bit set in the type structure to indicate that the
4607\member{bf_getcharbuffer} slot is known. This being set does not
4608indicate that the object supports the buffer interface or that the
4609\member{bf_getcharbuffer} slot is non-\NULL.
4610\end{datadesc}
4611
4612\begin{ctypedesc}[getreadbufferproc]{int (*getreadbufferproc)
4613 (PyObject *self, int segment, void **ptrptr)}
4614Return a pointer to a readable segment of the buffer. This function
4615is allowed to raise an exception, in which case it must return
4616\code{-1}. The \var{segment} which is passed must be zero or
4617positive, and strictly less than the number of segments returned by
4618the \member{bf_getsegcount} slot function. On success, returns
4619\code{0} and sets \code{*\var{ptrptr}} to a pointer to the buffer
4620memory.
4621\end{ctypedesc}
4622
4623\begin{ctypedesc}[getwritebufferproc]{int (*getwritebufferproc)
4624 (PyObject *self, int segment, void **ptrptr)}
Fred Drake58c5a2a1999-08-04 13:13:24 +00004625Return a pointer to a writable memory buffer in \code{*\var{ptrptr}};
4626the memory buffer must correspond to buffer segment \var{segment}.
4627Must return \code{-1} and set an exception on error.
4628\exception{TypeError} should be raised if the object only supports
4629read-only buffers, and \exception{SystemError} should be raised when
4630\var{segment} specifies a segment that doesn't exist.
4631% Why doesn't it raise ValueError for this one?
Fred Drake659ebfa2000-04-03 15:42:13 +00004632% GJS: because you shouldn't be calling it with an invalid
4633% segment. That indicates a blatant programming error in the C
4634% code.
Fred Drake58c5a2a1999-08-04 13:13:24 +00004635\end{ctypedesc}
4636
Fred Drake659ebfa2000-04-03 15:42:13 +00004637\begin{ctypedesc}[getsegcountproc]{int (*getsegcountproc)
4638 (PyObject *self, int *lenp)}
4639Return the number of memory segments which comprise the buffer. If
4640\var{lenp} is not \NULL, the implementation must report the sum of the
4641sizes (in bytes) of all segments in \code{*\var{lenp}}.
4642The function cannot fail.
4643\end{ctypedesc}
Guido van Rossumae110af1997-05-22 20:11:52 +00004644
Fred Drake659ebfa2000-04-03 15:42:13 +00004645\begin{ctypedesc}[getcharbufferproc]{int (*getcharbufferproc)
4646 (PyObject *self, int segment, const char **ptrptr)}
4647\end{ctypedesc}
Guido van Rossumae110af1997-05-22 20:11:52 +00004648
Guido van Rossumae110af1997-05-22 20:11:52 +00004649
Fred Drake659ebfa2000-04-03 15:42:13 +00004650% \chapter{Debugging \label{debugging}}
4651%
4652% XXX Explain Py_DEBUG, Py_TRACE_REFS, Py_REF_DEBUG.
Guido van Rossum5b8a5231997-12-30 04:38:44 +00004653
4654
Fred Drakef3aa0e01998-03-17 06:23:13 +00004655\input{api.ind} % Index -- must be last
Guido van Rossum9231c8f1997-05-15 21:43:21 +00004656
4657\end{document}