blob: 36c8c3871a4d5cf66a7b4f9fb79772ff1bbd52fb [file] [log] [blame]
Fred Drake61c77281998-07-28 19:34:22 +00001\chapter{Data model\label{datamodel}}
Fred Drakef6669171998-05-06 19:52:49 +00002
Fred Drake2829f1c2001-06-23 05:27:20 +00003
Fred Drake61c77281998-07-28 19:34:22 +00004\section{Objects, values and types\label{objects}}
Fred Drakef6669171998-05-06 19:52:49 +00005
6\dfn{Objects} are Python's abstraction for data. All data in a Python
7program is represented by objects or by relations between objects.
8(In a sense, and in conformance to Von Neumann's model of a
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00009``stored program computer,'' code is also represented by objects.)
Fred Drakef6669171998-05-06 19:52:49 +000010\index{object}
11\index{data}
12
13Every object has an identity, a type and a value. An object's
14\emph{identity} never changes once it has been created; you may think
Fred Drake293dd4b2002-06-04 16:25:57 +000015of it as the object's address in memory. The `\keyword{is}' operator
Fred Drake82385871998-10-01 20:40:43 +000016compares the identity of two objects; the
17\function{id()}\bifuncindex{id} function returns an integer
18representing its identity (currently implemented as its address).
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000019An object's \dfn{type} is
Raymond Hettingere701dcb2003-01-19 13:08:18 +000020also unchangeable.\footnote{Since Python 2.2, a gradual merging of
21types and classes has been started that makes this and a few other
22assertions made in this manual not 100\% accurate and complete:
23for example, it \emph{is} now possible in some cases to change an
24object's type, under certain controlled conditions. Until this manual
25undergoes extensive revision, it must now be taken as authoritative
26only regarding ``classic classes'', that are still the default, for
27compatibility purposes, in Python 2.2 and 2.3.}
28An object's type determines the operations that the object
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000029supports (e.g., ``does it have a length?'') and also defines the
Fred Drake82385871998-10-01 20:40:43 +000030possible values for objects of that type. The
31\function{type()}\bifuncindex{type} function returns an object's type
32(which is an object itself). The \emph{value} of some
Fred Drakef6669171998-05-06 19:52:49 +000033objects can change. Objects whose value can change are said to be
34\emph{mutable}; objects whose value is unchangeable once they are
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000035created are called \emph{immutable}.
Guido van Rossum264bd591999-02-23 16:40:55 +000036(The value of an immutable container object that contains a reference
37to a mutable object can change when the latter's value is changed;
38however the container is still considered immutable, because the
39collection of objects it contains cannot be changed. So, immutability
40is not strictly the same as having an unchangeable value, it is more
41subtle.)
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000042An object's mutability is determined by its type; for instance,
43numbers, strings and tuples are immutable, while dictionaries and
44lists are mutable.
Fred Drakef6669171998-05-06 19:52:49 +000045\index{identity of an object}
46\index{value of an object}
47\index{type of an object}
48\index{mutable object}
49\index{immutable object}
50
51Objects are never explicitly destroyed; however, when they become
52unreachable they may be garbage-collected. An implementation is
Barry Warsaw92a6ed91998-08-07 16:33:51 +000053allowed to postpone garbage collection or omit it altogether --- it is
54a matter of implementation quality how garbage collection is
Fred Drakef6669171998-05-06 19:52:49 +000055implemented, as long as no objects are collected that are still
56reachable. (Implementation note: the current implementation uses a
Fred Drakec8e82812001-01-22 17:46:18 +000057reference-counting scheme with (optional) delayed detection of
Raymond Hettingere701dcb2003-01-19 13:08:18 +000058cyclically linked garbage, which collects most objects as soon as they
Fred Drakec8e82812001-01-22 17:46:18 +000059become unreachable, but is not guaranteed to collect garbage
60containing circular references. See the
61\citetitle[../lib/module-gc.html]{Python Library Reference} for
62information on controlling the collection of cyclic garbage.)
Fred Drakef6669171998-05-06 19:52:49 +000063\index{garbage collection}
64\index{reference counting}
65\index{unreachable object}
66
67Note that the use of the implementation's tracing or debugging
68facilities may keep objects alive that would normally be collectable.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000069Also note that catching an exception with a
Fred Drake4856d011999-01-12 04:15:20 +000070`\keyword{try}...\keyword{except}' statement may keep objects alive.
Fred Drakef6669171998-05-06 19:52:49 +000071
72Some objects contain references to ``external'' resources such as open
73files or windows. It is understood that these resources are freed
74when the object is garbage-collected, but since garbage collection is
75not guaranteed to happen, such objects also provide an explicit way to
76release the external resource, usually a \method{close()} method.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000077Programs are strongly recommended to explicitly close such
Fred Drake4856d011999-01-12 04:15:20 +000078objects. The `\keyword{try}...\keyword{finally}' statement provides
79a convenient way to do this.
Fred Drakef6669171998-05-06 19:52:49 +000080
81Some objects contain references to other objects; these are called
82\emph{containers}. Examples of containers are tuples, lists and
83dictionaries. The references are part of a container's value. In
84most cases, when we talk about the value of a container, we imply the
85values, not the identities of the contained objects; however, when we
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000086talk about the mutability of a container, only the identities of
87the immediately contained objects are implied. So, if an immutable
88container (like a tuple)
89contains a reference to a mutable object, its value changes
90if that mutable object is changed.
Fred Drakef6669171998-05-06 19:52:49 +000091\index{container}
92
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000093Types affect almost all aspects of object behavior. Even the importance
Fred Drakef6669171998-05-06 19:52:49 +000094of object identity is affected in some sense: for immutable types,
95operations that compute new values may actually return a reference to
96any existing object with the same type and value, while for mutable
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000097objects this is not allowed. E.g., after
Fred Drake82385871998-10-01 20:40:43 +000098\samp{a = 1; b = 1},
Fred Drakef6669171998-05-06 19:52:49 +000099\code{a} and \code{b} may or may not refer to the same object with the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000100value one, depending on the implementation, but after
Fred Drake82385871998-10-01 20:40:43 +0000101\samp{c = []; d = []}, \code{c} and \code{d}
Fred Drakef6669171998-05-06 19:52:49 +0000102are guaranteed to refer to two different, unique, newly created empty
103lists.
Fred Drake82385871998-10-01 20:40:43 +0000104(Note that \samp{c = d = []} assigns the same object to both
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000105\code{c} and \code{d}.)
Fred Drakef6669171998-05-06 19:52:49 +0000106
Fred Drake2829f1c2001-06-23 05:27:20 +0000107
Fred Drake61c77281998-07-28 19:34:22 +0000108\section{The standard type hierarchy\label{types}}
Fred Drakef6669171998-05-06 19:52:49 +0000109
110Below is a list of the types that are built into Python. Extension
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000111modules (written in C, Java, or other languages, depending on
112the implementation) can define additional types. Future versions of
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000113Python may add types to the type hierarchy (e.g., rational
Fred Drakef6669171998-05-06 19:52:49 +0000114numbers, efficiently stored arrays of integers, etc.).
115\index{type}
116\indexii{data}{type}
117\indexii{type}{hierarchy}
118\indexii{extension}{module}
119\indexii{C}{language}
120
121Some of the type descriptions below contain a paragraph listing
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000122`special attributes.' These are attributes that provide access to the
Fred Drakef6669171998-05-06 19:52:49 +0000123implementation and are not intended for general use. Their definition
Fred Drake35705512001-12-03 17:32:27 +0000124may change in the future.
Fred Drakef6669171998-05-06 19:52:49 +0000125\index{attribute}
126\indexii{special}{attribute}
127\indexiii{generic}{special}{attribute}
Fred Drakef6669171998-05-06 19:52:49 +0000128
129\begin{description}
130
131\item[None]
132This type has a single value. There is a single object with this value.
133This object is accessed through the built-in name \code{None}.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000134It is used to signify the absence of a value in many situations, e.g.,
135it is returned from functions that don't explicitly return anything.
136Its truth value is false.
Fred Drakef6669171998-05-06 19:52:49 +0000137\ttindex{None}
Fred Drake78eebfd1998-11-25 19:09:24 +0000138\obindex{None@{\texttt{None}}}
Fred Drakef6669171998-05-06 19:52:49 +0000139
Neil Schemenauer48c2eb92001-01-04 01:25:50 +0000140\item[NotImplemented]
141This type has a single value. There is a single object with this value.
142This object is accessed through the built-in name \code{NotImplemented}.
Guido van Rossumab782dd2001-01-18 15:17:06 +0000143Numeric methods and rich comparison methods may return this value if
144they do not implement the operation for the operands provided. (The
145interpreter will then try the reflected operation, or some other
146fallback, depending on the operator.) Its truth value is true.
Neil Schemenauer48c2eb92001-01-04 01:25:50 +0000147\ttindex{NotImplemented}
148\obindex{NotImplemented@{\texttt{NotImplemented}}}
149
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000150\item[Ellipsis]
151This type has a single value. There is a single object with this value.
152This object is accessed through the built-in name \code{Ellipsis}.
Fred Drake82385871998-10-01 20:40:43 +0000153It is used to indicate the presence of the \samp{...} syntax in a
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000154slice. Its truth value is true.
Fred Drakec0a02c02002-04-16 02:03:05 +0000155\obindex{Ellipsis}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000156
Fred Drakef6669171998-05-06 19:52:49 +0000157\item[Numbers]
158These are created by numeric literals and returned as results by
159arithmetic operators and arithmetic built-in functions. Numeric
160objects are immutable; once created their value never changes. Python
161numbers are of course strongly related to mathematical numbers, but
162subject to the limitations of numerical representation in computers.
Fred Drakef6669171998-05-06 19:52:49 +0000163\obindex{numeric}
164
Fred Drakeb3384d32001-05-14 16:04:22 +0000165Python distinguishes between integers, floating point numbers, and
166complex numbers:
Fred Drakef6669171998-05-06 19:52:49 +0000167
168\begin{description}
169\item[Integers]
170These represent elements from the mathematical set of whole numbers.
171\obindex{integer}
172
Guido van Rossum77f6a652002-04-03 22:41:51 +0000173There are three types of integers:
Fred Drakef6669171998-05-06 19:52:49 +0000174
175\begin{description}
176
177\item[Plain integers]
178These represent numbers in the range -2147483648 through 2147483647.
179(The range may be larger on machines with a larger natural word
180size, but not smaller.)
Fred Drakee15956b2000-04-03 04:51:13 +0000181When the result of an operation would fall outside this range, the
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000182result is normally returned as a long integer (in some cases, the
183exception \exception{OverflowError} is raised instead).
Fred Drakef6669171998-05-06 19:52:49 +0000184For the purpose of shift and mask operations, integers are assumed to
185have a binary, 2's complement notation using 32 or more bits, and
186hiding no bits from the user (i.e., all 4294967296 different bit
187patterns correspond to different values).
188\obindex{plain integer}
189\withsubitem{(built-in exception)}{\ttindex{OverflowError}}
190
191\item[Long integers]
192These represent numbers in an unlimited range, subject to available
193(virtual) memory only. For the purpose of shift and mask operations,
194a binary representation is assumed, and negative numbers are
195represented in a variant of 2's complement which gives the illusion of
196an infinite string of sign bits extending to the left.
197\obindex{long integer}
198
Guido van Rossum77f6a652002-04-03 22:41:51 +0000199\item[Booleans]
200These represent the truth values False and True. The two objects
201representing the values False and True are the only Boolean objects.
202The Boolean type is a subtype of plain integers, and Boolean values
203behave like the values 0 and 1, respectively, in almost all contexts,
204the exception being that when converted to a string, the strings
205\code{"False"} or \code{"True"} are returned, respectively.
206\obindex{Boolean}
207\ttindex{False}
208\ttindex{True}
209
Fred Drakef6669171998-05-06 19:52:49 +0000210\end{description} % Integers
211
212The rules for integer representation are intended to give the most
213meaningful interpretation of shift and mask operations involving
214negative integers and the least surprises when switching between the
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000215plain and long integer domains. Any operation except left shift,
Fred Drakef6669171998-05-06 19:52:49 +0000216if it yields a result in the plain integer domain without causing
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000217overflow, will yield the same result in the long integer domain or
Fred Drakef6669171998-05-06 19:52:49 +0000218when using mixed operands.
219\indexii{integer}{representation}
220
221\item[Floating point numbers]
222These represent machine-level double precision floating point numbers.
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000223You are at the mercy of the underlying machine architecture (and
224C or Java implementation) for the accepted range and handling of overflow.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000225Python does not support single-precision floating point numbers; the
Fred Drake6e5e1d92001-07-14 02:12:27 +0000226savings in processor and memory usage that are usually the reason for using
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000227these is dwarfed by the overhead of using objects in Python, so there
228is no reason to complicate the language with two kinds of floating
229point numbers.
Fred Drakef6669171998-05-06 19:52:49 +0000230\obindex{floating point}
231\indexii{floating point}{number}
232\indexii{C}{language}
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000233\indexii{Java}{language}
Fred Drakef6669171998-05-06 19:52:49 +0000234
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000235\item[Complex numbers]
236These represent complex numbers as a pair of machine-level double
237precision floating point numbers. The same caveats apply as for
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000238floating point numbers. The real and imaginary parts of a complex
239number \code{z} can be retrieved through the read-only attributes
240\code{z.real} and \code{z.imag}.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000241\obindex{complex}
242\indexii{complex}{number}
243
Fred Drakef6669171998-05-06 19:52:49 +0000244\end{description} % Numbers
245
Guido van Rossum77f6a652002-04-03 22:41:51 +0000246
Fred Drakef6669171998-05-06 19:52:49 +0000247\item[Sequences]
Fred Drake230d17d2001-02-22 21:28:04 +0000248These represent finite ordered sets indexed by non-negative numbers.
Fred Drakef6669171998-05-06 19:52:49 +0000249The built-in function \function{len()}\bifuncindex{len} returns the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000250number of items of a sequence.
Thomas Woutersf9b526d2000-07-16 19:05:38 +0000251When the length of a sequence is \var{n}, the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000252index set contains the numbers 0, 1, \ldots, \var{n}-1. Item
Fred Drakef6669171998-05-06 19:52:49 +0000253\var{i} of sequence \var{a} is selected by \code{\var{a}[\var{i}]}.
Fred Drakee15956b2000-04-03 04:51:13 +0000254\obindex{sequence}
Fred Drakef6669171998-05-06 19:52:49 +0000255\index{index operation}
256\index{item selection}
257\index{subscription}
258
259Sequences also support slicing: \code{\var{a}[\var{i}:\var{j}]}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000260selects all items with index \var{k} such that \var{i} \code{<=}
Fred Drakef6669171998-05-06 19:52:49 +0000261\var{k} \code{<} \var{j}. When used as an expression, a slice is a
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000262sequence of the same type. This implies that the index set is
263renumbered so that it starts at 0.
Fred Drakef6669171998-05-06 19:52:49 +0000264\index{slicing}
265
Michael W. Hudson5efaf7e2002-06-11 10:55:12 +0000266Some sequences also support ``extended slicing'' with a third ``step''
267parameter: \code{\var{a}[\var{i}:\var{j}:\var{k}]} selects all items
268of \var{a} with index \var{x} where \code{\var{x} = \var{i} +
269\var{n}*\var{k}}, \var{n} \code{>=} \code{0} and \var{i} \code{<=}
270\var{x} \code{<} \var{j}.
271\index{extended slicing}
272
Fred Drakef6669171998-05-06 19:52:49 +0000273Sequences are distinguished according to their mutability:
274
275\begin{description}
Fred Drake4856d011999-01-12 04:15:20 +0000276
Fred Drakef6669171998-05-06 19:52:49 +0000277\item[Immutable sequences]
278An object of an immutable sequence type cannot change once it is
279created. (If the object contains references to other objects,
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000280these other objects may be mutable and may be changed; however,
Fred Drakef6669171998-05-06 19:52:49 +0000281the collection of objects directly referenced by an immutable object
282cannot change.)
283\obindex{immutable sequence}
284\obindex{immutable}
285
286The following types are immutable sequences:
287
288\begin{description}
289
290\item[Strings]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000291The items of a string are characters. There is no separate
292character type; a character is represented by a string of one item.
Fred Drakef6669171998-05-06 19:52:49 +0000293Characters represent (at least) 8-bit bytes. The built-in
294functions \function{chr()}\bifuncindex{chr} and
295\function{ord()}\bifuncindex{ord} convert between characters and
296nonnegative integers representing the byte values. Bytes with the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000297values 0-127 usually represent the corresponding \ASCII{} values, but
298the interpretation of values is up to the program. The string
299data type is also used to represent arrays of bytes, e.g., to hold data
Fred Drakef6669171998-05-06 19:52:49 +0000300read from a file.
301\obindex{string}
302\index{character}
303\index{byte}
Fred Drakec37b65e2001-11-28 07:26:15 +0000304\index{ASCII@\ASCII}
Fred Drakef6669171998-05-06 19:52:49 +0000305
Fred Drakec37b65e2001-11-28 07:26:15 +0000306(On systems whose native character set is not \ASCII, strings may use
Fred Drakef6669171998-05-06 19:52:49 +0000307EBCDIC in their internal representation, provided the functions
308\function{chr()} and \function{ord()} implement a mapping between \ASCII{} and
309EBCDIC, and string comparison preserves the \ASCII{} order.
310Or perhaps someone can propose a better rule?)
Fred Drakec37b65e2001-11-28 07:26:15 +0000311\index{ASCII@\ASCII}
Fred Drakef6669171998-05-06 19:52:49 +0000312\index{EBCDIC}
313\index{character set}
314\indexii{string}{comparison}
315\bifuncindex{chr}
316\bifuncindex{ord}
317
Fred Drakef0aff8e2000-04-06 13:57:21 +0000318\item[Unicode]
Fred Drake5ec22f22002-09-24 21:09:13 +0000319The items of a Unicode object are Unicode code units. A Unicode code
320unit is represented by a Unicode object of one item and can hold
321either a 16-bit or 32-bit value representing a Unicode ordinal (the
322maximum value for the ordinal is given in \code{sys.maxunicode}, and
323depends on how Python is configured at compile time). Surrogate pairs
324may be present in the Unicode object, and will be reported as two
325separate items. The built-in functions
Fred Drakef0aff8e2000-04-06 13:57:21 +0000326\function{unichr()}\bifuncindex{unichr} and
Fred Drake5ec22f22002-09-24 21:09:13 +0000327\function{ord()}\bifuncindex{ord} convert between code units and
Fred Drakef0aff8e2000-04-06 13:57:21 +0000328nonnegative integers representing the Unicode ordinals as defined in
329the Unicode Standard 3.0. Conversion from and to other encodings are
330possible through the Unicode method \method{encode} and the built-in
Fred Drake5ec22f22002-09-24 21:09:13 +0000331function \function{unicode()}.\bifuncindex{unicode}
Fred Drakef0aff8e2000-04-06 13:57:21 +0000332\obindex{unicode}
333\index{character}
334\index{integer}
Fred Drake8b3ce9e2000-04-06 14:00:14 +0000335\index{Unicode}
Fred Drakef0aff8e2000-04-06 13:57:21 +0000336
Fred Drakef6669171998-05-06 19:52:49 +0000337\item[Tuples]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000338The items of a tuple are arbitrary Python objects.
339Tuples of two or more items are formed by comma-separated lists
340of expressions. A tuple of one item (a `singleton') can be formed
Fred Drakef6669171998-05-06 19:52:49 +0000341by affixing a comma to an expression (an expression by itself does
342not create a tuple, since parentheses must be usable for grouping of
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000343expressions). An empty tuple can be formed by an empty pair of
Fred Drakef6669171998-05-06 19:52:49 +0000344parentheses.
345\obindex{tuple}
346\indexii{singleton}{tuple}
347\indexii{empty}{tuple}
348
349\end{description} % Immutable sequences
350
351\item[Mutable sequences]
352Mutable sequences can be changed after they are created. The
353subscription and slicing notations can be used as the target of
354assignment and \keyword{del} (delete) statements.
Thomas Woutersf9b526d2000-07-16 19:05:38 +0000355\obindex{mutable sequence}
Fred Drakef6669171998-05-06 19:52:49 +0000356\obindex{mutable}
357\indexii{assignment}{statement}
358\index{delete}
359\stindex{del}
360\index{subscription}
361\index{slicing}
362
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000363There is currently a single intrinsic mutable sequence type:
Fred Drakef6669171998-05-06 19:52:49 +0000364
365\begin{description}
366
367\item[Lists]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000368The items of a list are arbitrary Python objects. Lists are formed
Fred Drakef6669171998-05-06 19:52:49 +0000369by placing a comma-separated list of expressions in square brackets.
370(Note that there are no special cases needed to form lists of length 0
371or 1.)
372\obindex{list}
373
374\end{description} % Mutable sequences
375
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000376The extension module \module{array}\refstmodindex{array} provides an
377additional example of a mutable sequence type.
378
379
Fred Drakef6669171998-05-06 19:52:49 +0000380\end{description} % Sequences
381
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000382\item[Mappings]
Fred Drakef6669171998-05-06 19:52:49 +0000383These represent finite sets of objects indexed by arbitrary index sets.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000384The subscript notation \code{a[k]} selects the item indexed
Fred Drakef6669171998-05-06 19:52:49 +0000385by \code{k} from the mapping \code{a}; this can be used in
386expressions and as the target of assignments or \keyword{del} statements.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000387The built-in function \function{len()} returns the number of items
Fred Drakef6669171998-05-06 19:52:49 +0000388in a mapping.
389\bifuncindex{len}
390\index{subscription}
391\obindex{mapping}
392
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000393There is currently a single intrinsic mapping type:
Fred Drakef6669171998-05-06 19:52:49 +0000394
395\begin{description}
396
397\item[Dictionaries]
Fred Drake8cdee961999-02-23 18:50:38 +0000398These\obindex{dictionary} represent finite sets of objects indexed by
399nearly arbitrary values. The only types of values not acceptable as
400keys are values containing lists or dictionaries or other mutable
401types that are compared by value rather than by object identity, the
402reason being that the efficient implementation of dictionaries
403requires a key's hash value to remain constant.
Fred Drakef6669171998-05-06 19:52:49 +0000404Numeric types used for keys obey the normal rules for numeric
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000405comparison: if two numbers compare equal (e.g., \code{1} and
Fred Drakef6669171998-05-06 19:52:49 +0000406\code{1.0}) then they can be used interchangeably to index the same
407dictionary entry.
408
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000409Dictionaries are mutable; they can be created by the
Fred Drake62364ff2003-03-20 18:17:16 +0000410\code{\{...\}} notation (see section~\ref{dict}, ``Dictionary
Fred Drake8cdee961999-02-23 18:50:38 +0000411Displays'').
Fred Drakef6669171998-05-06 19:52:49 +0000412
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000413The extension modules \module{dbm}\refstmodindex{dbm},
414\module{gdbm}\refstmodindex{gdbm}, \module{bsddb}\refstmodindex{bsddb}
415provide additional examples of mapping types.
416
Fred Drakef6669171998-05-06 19:52:49 +0000417\end{description} % Mapping types
418
419\item[Callable types]
Fred Drake8cdee961999-02-23 18:50:38 +0000420These\obindex{callable} are the types to which the function call
Fred Drake62364ff2003-03-20 18:17:16 +0000421operation (see section~\ref{calls}, ``Calls'') can be applied:
Fred Drakef6669171998-05-06 19:52:49 +0000422\indexii{function}{call}
423\index{invocation}
424\indexii{function}{argument}
Fred Drakef6669171998-05-06 19:52:49 +0000425
426\begin{description}
427
428\item[User-defined functions]
429A user-defined function object is created by a function definition
Fred Drake62364ff2003-03-20 18:17:16 +0000430(see section~\ref{function}, ``Function definitions''). It should be
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000431called with an argument
Fred Drakef6669171998-05-06 19:52:49 +0000432list containing the same number of items as the function's formal
433parameter list.
434\indexii{user-defined}{function}
435\obindex{function}
436\obindex{user-defined function}
437
Guido van Rossum264bd591999-02-23 16:40:55 +0000438Special attributes: \member{func_doc} or \member{__doc__} is the
Jeremy Hyltonf9b0cc72003-01-31 18:52:45 +0000439function's documentation string, or \code{None} if unavailable;
Fred Drake82385871998-10-01 20:40:43 +0000440\member{func_name} or \member{__name__} is the function's name;
Jeremy Hyltonf9b0cc72003-01-31 18:52:45 +0000441\member{__module__} is the name of the module the function was defined
442in, or \code{None} if unavailable;
Fred Drake82385871998-10-01 20:40:43 +0000443\member{func_defaults} is a tuple containing default argument values for
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000444those arguments that have defaults, or \code{None} if no arguments
Fred Drake82385871998-10-01 20:40:43 +0000445have a default value; \member{func_code} is the code object representing
446the compiled function body; \member{func_globals} is (a reference to)
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000447the dictionary that holds the function's global variables --- it
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000448defines the global namespace of the module in which the function was
Barry Warsaw7a5e80e2001-02-27 03:36:30 +0000449defined; \member{func_dict} or \member{__dict__} contains the
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000450namespace supporting arbitrary function attributes;
451\member{func_closure} is \code{None} or a tuple of cells that contain
Jeremy Hylton26c49b62002-04-01 17:58:39 +0000452bindings for the function's free variables.
Barry Warsaw7a5e80e2001-02-27 03:36:30 +0000453
Jeremy Hylton26c49b62002-04-01 17:58:39 +0000454Of these, \member{func_code}, \member{func_defaults},
Barry Warsaw7a5e80e2001-02-27 03:36:30 +0000455\member{func_doc}/\member{__doc__}, and
456\member{func_dict}/\member{__dict__} may be writable; the
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000457others can never be changed. Additional information about a
458function's definition can be retrieved from its code object; see the
459description of internal types below.
460
Fred Drake4856d011999-01-12 04:15:20 +0000461\withsubitem{(function attribute)}{
462 \ttindex{func_doc}
463 \ttindex{__doc__}
464 \ttindex{__name__}
Jeremy Hyltonf9b0cc72003-01-31 18:52:45 +0000465 \ttindex{__module__}
Barry Warsaw7a5e80e2001-02-27 03:36:30 +0000466 \ttindex{__dict__}
Fred Drake4856d011999-01-12 04:15:20 +0000467 \ttindex{func_defaults}
Jeremy Hylton26c49b62002-04-01 17:58:39 +0000468 \ttindex{func_closure}
Fred Drake4856d011999-01-12 04:15:20 +0000469 \ttindex{func_code}
Barry Warsaw7a5e80e2001-02-27 03:36:30 +0000470 \ttindex{func_globals}
471 \ttindex{func_dict}}
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000472\indexii{global}{namespace}
Fred Drakef6669171998-05-06 19:52:49 +0000473
474\item[User-defined methods]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000475A user-defined method object combines a class, a class instance (or
Fred Drake8dd6ffd2001-08-02 21:34:53 +0000476\code{None}) and any callable object (normally a user-defined
477function).
Fred Drakef6669171998-05-06 19:52:49 +0000478\obindex{method}
479\obindex{user-defined method}
480\indexii{user-defined}{method}
Fred Drakef6669171998-05-06 19:52:49 +0000481
482Special read-only attributes: \member{im_self} is the class instance
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000483object, \member{im_func} is the function object;
Raymond Hettinger03ec6d52003-06-25 18:29:36 +0000484\member{im_class} is the class of \member{im_self} for bound methods
485or the class that asked for the method for unbound methods;
Fred Drake82385871998-10-01 20:40:43 +0000486\member{__doc__} is the method's documentation (same as
487\code{im_func.__doc__}); \member{__name__} is the method name (same as
Jeremy Hyltonf9b0cc72003-01-31 18:52:45 +0000488\code{im_func.__name__}); \member{__module__} is the name of the
489module the method was defined in, or \code{None} if unavailable.
Fred Drakef9d58032001-12-07 23:13:53 +0000490\versionchanged[\member{im_self} used to refer to the class that
491 defined the method]{2.2}
Fred Drake4856d011999-01-12 04:15:20 +0000492\withsubitem{(method attribute)}{
Jeremy Hyltonf9b0cc72003-01-31 18:52:45 +0000493 \ttindex{__doc__}
494 \ttindex{__name__}
495 \ttindex{__module__}
Fred Drake4856d011999-01-12 04:15:20 +0000496 \ttindex{im_func}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000497 \ttindex{im_self}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000498
Barry Warsaw7a5e80e2001-02-27 03:36:30 +0000499Methods also support accessing (but not setting) the arbitrary
500function attributes on the underlying function object.
501
Fred Drakeea690c42003-07-17 05:26:53 +0000502User-defined method objects may be created when getting an attribute
503of a class (perhaps via an instance of that class), if that attribute
504is a user-defined function object, an unbound user-defined method object,
505or a class method object.
506When the attribute is a user-defined method object, a new
507method object is only created if the class from which it is being
508retrieved is the same as, or a derived class of, the class stored
509in the original method object; otherwise, the original method object
510is used as it is.
511
512When a user-defined method object is created by retrieving
513a user-defined function object from a class, its \member{im_self}
514attribute is \code{None} and the method object is said to be unbound.
515When one is created by retrieving a user-defined function object
516from a class via one of its instances, its \member{im_self} attribute
517is the instance, and the method object is said to be bound.
518In either case, the new method's \member{im_class} attribute
519is the class from which the retrieval takes place, and
520its \member{im_func} attribute is the original function object.
521\withsubitem{(method attribute)}{
522 \ttindex{im_class}\ttindex{im_func}\ttindex{im_self}}
523
524When a user-defined method object is created by retrieving another
525method object from a class or instance, the behaviour is the same
526as for a function object, except that the \member{im_func} attribute
527of the new instance is not the original method object but its
528\member{im_func} attribute.
529\withsubitem{(method attribute)}{
530 \ttindex{im_func}}
531
532When a user-defined method object is created by retrieving a
533class method object from a class or instance, its \member{im_self}
534attribute is the class itself (the same as the \member{im_class}
535attribute), and its \member{im_func} attribute is the function
536object underlying the class method.
Fred Drake4856d011999-01-12 04:15:20 +0000537\withsubitem{(method attribute)}{
Fred Drake35c09f22000-06-28 20:15:47 +0000538 \ttindex{im_class}\ttindex{im_func}\ttindex{im_self}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000539
540When an unbound user-defined method object is called, the underlying
Fred Drake82385871998-10-01 20:40:43 +0000541function (\member{im_func}) is called, with the restriction that the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000542first argument must be an instance of the proper class
Fred Drake82385871998-10-01 20:40:43 +0000543(\member{im_class}) or of a derived class thereof.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000544
545When a bound user-defined method object is called, the underlying
Fred Drake82385871998-10-01 20:40:43 +0000546function (\member{im_func}) is called, inserting the class instance
547(\member{im_self}) in front of the argument list. For instance, when
548\class{C} is a class which contains a definition for a function
549\method{f()}, and \code{x} is an instance of \class{C}, calling
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000550\code{x.f(1)} is equivalent to calling \code{C.f(x, 1)}.
551
Fred Drakeea690c42003-07-17 05:26:53 +0000552When a user-defined method object is derived from a class method object,
553the ``class instance'' stored in \member{im_self} will actually be the
554class itself, so that calling either \code{x.f(1)} or \code{C.f(1)} is
555equivalent to calling \code{f(C,1)} where \code{f} is the underlying
556function.
557
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000558Note that the transformation from function object to (unbound or
559bound) method object happens each time the attribute is retrieved from
560the class or instance. In some cases, a fruitful optimization is to
561assign the attribute to a local variable and call that local variable.
562Also notice that this transformation only happens for user-defined
563functions; other callable objects (and all non-callable objects) are
Fred Drake35c09f22000-06-28 20:15:47 +0000564retrieved without transformation. It is also important to note that
565user-defined functions which are attributes of a class instance are
566not converted to bound methods; this \emph{only} happens when the
567function is an attribute of the class.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000568
Fred Drakee31e9ce2001-12-11 21:10:08 +0000569\item[Generator functions\index{generator!function}\index{generator!iterator}]
570A function or method which uses the \keyword{yield} statement (see
571section~\ref{yield}, ``The \keyword{yield} statement'') is called a
572\dfn{generator function}. Such a function, when called, always
573returns an iterator object which can be used to execute the body of
574the function: calling the iterator's \method{next()} method will
575cause the function to execute until it provides a value using the
576\keyword{yield} statement. When the function executes a
577\keyword{return} statement or falls off the end, a
578\exception{StopIteration} exception is raised and the iterator will
579have reached the end of the set of values to be returned.
580
Fred Drakef6669171998-05-06 19:52:49 +0000581\item[Built-in functions]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000582A built-in function object is a wrapper around a \C{} function. Examples
583of built-in functions are \function{len()} and \function{math.sin()}
584(\module{math} is a standard built-in module).
585The number and type of the arguments are
Fred Drakef6669171998-05-06 19:52:49 +0000586determined by the C function.
Fred Drake82385871998-10-01 20:40:43 +0000587Special read-only attributes: \member{__doc__} is the function's
588documentation string, or \code{None} if unavailable; \member{__name__}
589is the function's name; \member{__self__} is set to \code{None} (but see
Jeremy Hyltonf9b0cc72003-01-31 18:52:45 +0000590the next item); \member{__module__} is the name of the module the
591function was defined in or \code{None} if unavailable.
Fred Drakef6669171998-05-06 19:52:49 +0000592\obindex{built-in function}
593\obindex{function}
594\indexii{C}{language}
595
596\item[Built-in methods]
597This is really a different disguise of a built-in function, this time
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000598containing an object passed to the C function as an implicit extra
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000599argument. An example of a built-in method is
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000600\code{\var{alist}.append()}, assuming
601\var{alist} is a list object.
Fred Drake82385871998-10-01 20:40:43 +0000602In this case, the special read-only attribute \member{__self__} is set
Fred Drakee31e9ce2001-12-11 21:10:08 +0000603to the object denoted by \var{list}.
Fred Drakef6669171998-05-06 19:52:49 +0000604\obindex{built-in method}
605\obindex{method}
606\indexii{built-in}{method}
607
Fred Drakee37b4ed2003-07-15 20:45:16 +0000608\item[Class Types]
609Class types, or ``new-style classes,'' are callable. These objects
610normally act as factories for new instances of themselves, but
611variations are possible for class types that override
612\method{__new__()}. The arguments of the call are passed to
613\method{__new__()} and, in the typical case, to \method{__init__()} to
614initialize the new instance.
615
616\item[Classic Classes]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000617Class objects are described below. When a class object is called,
618a new class instance (also described below) is created and
Fred Drakef6669171998-05-06 19:52:49 +0000619returned. This implies a call to the class's \method{__init__()} method
620if it has one. Any arguments are passed on to the \method{__init__()}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000621method. If there is no \method{__init__()} method, the class must be called
Fred Drakef6669171998-05-06 19:52:49 +0000622without arguments.
Fred Drake1e42d8a1998-11-25 17:58:50 +0000623\withsubitem{(object method)}{\ttindex{__init__()}}
Fred Drakef6669171998-05-06 19:52:49 +0000624\obindex{class}
625\obindex{class instance}
626\obindex{instance}
627\indexii{class object}{call}
628
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000629\item[Class instances]
630Class instances are described below. Class instances are callable
Fred Drake82385871998-10-01 20:40:43 +0000631only when the class has a \method{__call__()} method; \code{x(arguments)}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000632is a shorthand for \code{x.__call__(arguments)}.
633
Fred Drakef6669171998-05-06 19:52:49 +0000634\end{description}
635
636\item[Modules]
Fred Drake62364ff2003-03-20 18:17:16 +0000637Modules are imported by the \keyword{import} statement (see
Fred Draked51ce7d2003-07-15 22:03:00 +0000638section~\ref{import}, ``The \keyword{import} statement'').%
639\stindex{import}\obindex{module}
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000640A module object has a namespace implemented by a dictionary object
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000641(this is the dictionary referenced by the func_globals attribute of
642functions defined in the module). Attribute references are translated
643to lookups in this dictionary, e.g., \code{m.x} is equivalent to
644\code{m.__dict__["x"]}.
645A module object does not contain the code object used to
Fred Drakef6669171998-05-06 19:52:49 +0000646initialize the module (since it isn't needed once the initialization
647is done).
Fred Drakef6669171998-05-06 19:52:49 +0000648
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000649Attribute assignment updates the module's namespace dictionary,
Fred Drake82385871998-10-01 20:40:43 +0000650e.g., \samp{m.x = 1} is equivalent to \samp{m.__dict__["x"] = 1}.
Fred Drakef6669171998-05-06 19:52:49 +0000651
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000652Special read-only attribute: \member{__dict__} is the module's
653namespace as a dictionary object.
Fred Drake1e42d8a1998-11-25 17:58:50 +0000654\withsubitem{(module attribute)}{\ttindex{__dict__}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000655
656Predefined (writable) attributes: \member{__name__}
657is the module's name; \member{__doc__} is the
658module's documentation string, or
Fred Drake82385871998-10-01 20:40:43 +0000659\code{None} if unavailable; \member{__file__} is the pathname of the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000660file from which the module was loaded, if it was loaded from a file.
Fred Drake82385871998-10-01 20:40:43 +0000661The \member{__file__} attribute is not present for C{} modules that are
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000662statically linked into the interpreter; for extension modules loaded
663dynamically from a shared library, it is the pathname of the shared
664library file.
Fred Drake4856d011999-01-12 04:15:20 +0000665\withsubitem{(module attribute)}{
666 \ttindex{__name__}
667 \ttindex{__doc__}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000668 \ttindex{__file__}}
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000669\indexii{module}{namespace}
Fred Drakef6669171998-05-06 19:52:49 +0000670
671\item[Classes]
Fred Drake62364ff2003-03-20 18:17:16 +0000672Class objects are created by class definitions (see
673section~\ref{class}, ``Class definitions'').
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000674A class has a namespace implemented by a dictionary object.
675Class attribute references are translated to
676lookups in this dictionary,
Fred Drake82385871998-10-01 20:40:43 +0000677e.g., \samp{C.x} is translated to \samp{C.__dict__["x"]}.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000678When the attribute name is not found
Fred Drakef6669171998-05-06 19:52:49 +0000679there, the attribute search continues in the base classes. The search
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000680is depth-first, left-to-right in the order of occurrence in the
Fred Drakef6669171998-05-06 19:52:49 +0000681base class list.
Fred Drakeea690c42003-07-17 05:26:53 +0000682
683When a class attribute reference (for class \class{C}, say)
684would yield a user-defined function object or
685an unbound user-defined method object whose associated class is either
686\class{C} or one of its base classes, it is transformed into an unbound
687user-defined method object whose \member{im_class} attribute is~\class{C}.
688When it would yield a class method object, it is transformed into
689a bound user-defined method object whose \member{im_class} and
690\member{im_self} attributes are both~\class{C}. When it would yield
691a static method object, it is transformed into the object wrapped
692by the static method object. See section~\ref{descriptors} for another
693way in which attributes retrieved from a class may differ from those
694actually contained in its \member{__dict__}.
Fred Drakef6669171998-05-06 19:52:49 +0000695\obindex{class}
696\obindex{class instance}
697\obindex{instance}
698\indexii{class object}{call}
699\index{container}
700\obindex{dictionary}
701\indexii{class}{attribute}
702
703Class attribute assignments update the class's dictionary, never the
704dictionary of a base class.
705\indexiii{class}{attribute}{assignment}
706
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000707A class object can be called (see above) to yield a class instance (see
708below).
Fred Drakef6669171998-05-06 19:52:49 +0000709\indexii{class object}{call}
710
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000711Special attributes: \member{__name__} is the class name;
712\member{__module__} is the module name in which the class was defined;
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000713\member{__dict__} is the dictionary containing the class's namespace;
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000714\member{__bases__} is a tuple (possibly empty or a singleton)
715containing the base classes, in the order of their occurrence in the
Fred Drake82385871998-10-01 20:40:43 +0000716base class list; \member{__doc__} is the class's documentation string,
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000717or None if undefined.
Fred Drake4856d011999-01-12 04:15:20 +0000718\withsubitem{(class attribute)}{
719 \ttindex{__name__}
720 \ttindex{__module__}
721 \ttindex{__dict__}
722 \ttindex{__bases__}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000723 \ttindex{__doc__}}
Fred Drakef6669171998-05-06 19:52:49 +0000724
725\item[Class instances]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000726A class instance is created by calling a class object (see above).
727A class instance has a namespace implemented as a dictionary which
728is the first place in which
Fred Drakef6669171998-05-06 19:52:49 +0000729attribute references are searched. When an attribute is not found
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000730there, and the instance's class has an attribute by that name,
731the search continues with the class attributes. If a class attribute
Fred Drakeea690c42003-07-17 05:26:53 +0000732is found that is a user-defined function object or an unbound
733user-defined method object whose associated class is the class
734(call it~\class{C}) of the instance for which the attribute reference
735was initiated or one of its bases,
736it is transformed into a bound user-defined method object whose
737\member{im_class} attribute is~\class{C} whose \member{im_self} attribute
738is the instance. Static method and class method objects are also
739transformed, as if they had been retrieved from class~\class{C};
740see above under ``Classes''. See section~\ref{descriptors} for
741another way in which attributes of a class retrieved via its
742instances may differ from the objects actually stored in the
743class's \member{__dict__}.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000744If no class attribute is found, and the object's class has a
Fred Drake82385871998-10-01 20:40:43 +0000745\method{__getattr__()} method, that is called to satisfy the lookup.
Fred Drakef6669171998-05-06 19:52:49 +0000746\obindex{class instance}
747\obindex{instance}
748\indexii{class}{instance}
749\indexii{class instance}{attribute}
750
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000751Attribute assignments and deletions update the instance's dictionary,
Fred Drake82385871998-10-01 20:40:43 +0000752never a class's dictionary. If the class has a \method{__setattr__()} or
753\method{__delattr__()} method, this is called instead of updating the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000754instance dictionary directly.
Fred Drakef6669171998-05-06 19:52:49 +0000755\indexiii{class instance}{attribute}{assignment}
756
757Class instances can pretend to be numbers, sequences, or mappings if
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000758they have methods with certain special names. See
Fred Drake62364ff2003-03-20 18:17:16 +0000759section~\ref{specialnames}, ``Special method names.''
Fred Drakee15956b2000-04-03 04:51:13 +0000760\obindex{numeric}
Fred Drakef6669171998-05-06 19:52:49 +0000761\obindex{sequence}
762\obindex{mapping}
763
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000764Special attributes: \member{__dict__} is the attribute
765dictionary; \member{__class__} is the instance's class.
Fred Drake4856d011999-01-12 04:15:20 +0000766\withsubitem{(instance attribute)}{
767 \ttindex{__dict__}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000768 \ttindex{__class__}}
Fred Drakef6669171998-05-06 19:52:49 +0000769
770\item[Files]
Fred Drakee15eb351999-11-10 16:13:25 +0000771A file\obindex{file} object represents an open file. File objects are
772created by the \function{open()}\bifuncindex{open} built-in function,
773and also by
774\withsubitem{(in module os)}{\ttindex{popen()}}\function{os.popen()},
775\function{os.fdopen()}, and the
776\method{makefile()}\withsubitem{(socket method)}{\ttindex{makefile()}}
777method of socket objects (and perhaps by other functions or methods
778provided by extension modules). The objects
779\ttindex{sys.stdin}\code{sys.stdin},
780\ttindex{sys.stdout}\code{sys.stdout} and
781\ttindex{sys.stderr}\code{sys.stderr} are initialized to file objects
782corresponding to the interpreter's standard\index{stdio} input, output
783and error streams. See the \citetitle[../lib/lib.html]{Python Library
784Reference} for complete documentation of file objects.
Fred Drake4856d011999-01-12 04:15:20 +0000785\withsubitem{(in module sys)}{
786 \ttindex{stdin}
787 \ttindex{stdout}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000788 \ttindex{stderr}}
Fred Drakee15eb351999-11-10 16:13:25 +0000789
Fred Drakef6669171998-05-06 19:52:49 +0000790
791\item[Internal types]
792A few types used internally by the interpreter are exposed to the user.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000793Their definitions may change with future versions of the interpreter,
Fred Drakef6669171998-05-06 19:52:49 +0000794but they are mentioned here for completeness.
795\index{internal type}
796\index{types, internal}
797
798\begin{description}
799
800\item[Code objects]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000801Code objects represent \emph{byte-compiled} executable Python code, or
802\emph{bytecode}.
Fred Drakef6669171998-05-06 19:52:49 +0000803The difference between a code
804object and a function object is that the function object contains an
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000805explicit reference to the function's globals (the module in which it
806was defined), while a code object contains no context;
807also the default argument values are stored in the function object,
808not in the code object (because they represent values calculated at
809run-time). Unlike function objects, code objects are immutable and
810contain no references (directly or indirectly) to mutable objects.
811\index{bytecode}
Fred Drakef6669171998-05-06 19:52:49 +0000812\obindex{code}
813
Fred Drake1e42d8a1998-11-25 17:58:50 +0000814Special read-only attributes: \member{co_name} gives the function
815name; \member{co_argcount} is the number of positional arguments
816(including arguments with default values); \member{co_nlocals} is the
817number of local variables used by the function (including arguments);
818\member{co_varnames} is a tuple containing the names of the local
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000819variables (starting with the argument names); \member{co_cellvars} is
820a tuple containing the names of local variables that are referenced by
821nested functions; \member{co_freevars} is a tuple containing the names
Jeremy Hylton8392f362002-04-01 18:53:36 +0000822of free variables; \member{co_code} is a string representing the
823sequence of bytecode instructions;
Fred Drake1e42d8a1998-11-25 17:58:50 +0000824\member{co_consts} is a tuple containing the literals used by the
825bytecode; \member{co_names} is a tuple containing the names used by
826the bytecode; \member{co_filename} is the filename from which the code
827was compiled; \member{co_firstlineno} is the first line number of the
828function; \member{co_lnotab} is a string encoding the mapping from
Thomas Woutersf9b526d2000-07-16 19:05:38 +0000829byte code offsets to line numbers (for details see the source code of
Fred Drake1e42d8a1998-11-25 17:58:50 +0000830the interpreter); \member{co_stacksize} is the required stack size
831(including local variables); \member{co_flags} is an integer encoding
832a number of flags for the interpreter.
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000833
Fred Drake4856d011999-01-12 04:15:20 +0000834\withsubitem{(code object attribute)}{
835 \ttindex{co_argcount}
836 \ttindex{co_code}
837 \ttindex{co_consts}
838 \ttindex{co_filename}
839 \ttindex{co_firstlineno}
840 \ttindex{co_flags}
841 \ttindex{co_lnotab}
842 \ttindex{co_name}
843 \ttindex{co_names}
844 \ttindex{co_nlocals}
845 \ttindex{co_stacksize}
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000846 \ttindex{co_varnames}
847 \ttindex{co_cellvars}
848 \ttindex{co_freevars}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000849
Fred Drakee15956b2000-04-03 04:51:13 +0000850The following flag bits are defined for \member{co_flags}: bit
851\code{0x04} is set if the function uses the \samp{*arguments} syntax
852to accept an arbitrary number of positional arguments; bit
853\code{0x08} is set if the function uses the \samp{**keywords} syntax
Jeremy Hylton8392f362002-04-01 18:53:36 +0000854to accept arbitrary keyword arguments; bit \code{0x20} is set if the
Brett Cannon9e6fedd2003-06-15 22:57:44 +0000855function is a generator.
856\obindex{generator}
Jeremy Hylton8392f362002-04-01 18:53:36 +0000857
858Future feature declarations (\samp{from __future__ import division})
859also use bits in \member{co_flags} to indicate whether a code object
860was compiled with a particular feature enabled: bit \code{0x2000} is
861set if the function was compiled with future division enabled; bits
862\code{0x10} and \code{0x1000} were used in earlier versions of Python.
863
864Other bits in \member{co_flags} are reserved for internal use.
865
866If\index{documentation string} a code object represents a function,
867the first item in
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000868\member{co_consts} is the documentation string of the function, or
869\code{None} if undefined.
Fred Drakef6669171998-05-06 19:52:49 +0000870
871\item[Frame objects]
872Frame objects represent execution frames. They may occur in traceback
873objects (see below).
874\obindex{frame}
875
876Special read-only attributes: \member{f_back} is to the previous
877stack frame (towards the caller), or \code{None} if this is the bottom
878stack frame; \member{f_code} is the code object being executed in this
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000879frame; \member{f_locals} is the dictionary used to look up local
880variables; \member{f_globals} is used for global variables;
Fred Drake82385871998-10-01 20:40:43 +0000881\member{f_builtins} is used for built-in (intrinsic) names;
882\member{f_restricted} is a flag indicating whether the function is
Michael W. Hudsoncfd38842002-12-17 16:15:34 +0000883executing in restricted execution mode; \member{f_lasti} gives the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000884precise instruction (this is an index into the bytecode string of
Fred Drakef6669171998-05-06 19:52:49 +0000885the code object).
Fred Drake4856d011999-01-12 04:15:20 +0000886\withsubitem{(frame attribute)}{
887 \ttindex{f_back}
888 \ttindex{f_code}
889 \ttindex{f_globals}
890 \ttindex{f_locals}
Fred Drake4856d011999-01-12 04:15:20 +0000891 \ttindex{f_lasti}
892 \ttindex{f_builtins}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000893 \ttindex{f_restricted}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000894
Fred Drake82385871998-10-01 20:40:43 +0000895Special writable attributes: \member{f_trace}, if not \code{None}, is a
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000896function called at the start of each source code line (this is used by
Fred Drake82385871998-10-01 20:40:43 +0000897the debugger); \member{f_exc_type}, \member{f_exc_value},
898\member{f_exc_traceback} represent the most recent exception caught in
Michael W. Hudsoncfd38842002-12-17 16:15:34 +0000899this frame; \member{f_lineno} is the current line number of the frame
900--- writing to this from within a trace function jumps to the given line
901(only for the bottom-most frame). A debugger can implement a Jump
902command (aka Set Next Statement) by writing to f_lineno.
Fred Drake4856d011999-01-12 04:15:20 +0000903\withsubitem{(frame attribute)}{
904 \ttindex{f_trace}
905 \ttindex{f_exc_type}
906 \ttindex{f_exc_value}
Michael W. Hudsoncfd38842002-12-17 16:15:34 +0000907 \ttindex{f_exc_traceback}
908 \ttindex{f_lineno}}
Fred Drakef6669171998-05-06 19:52:49 +0000909
910\item[Traceback objects] \label{traceback}
911Traceback objects represent a stack trace of an exception. A
912traceback object is created when an exception occurs. When the search
913for an exception handler unwinds the execution stack, at each unwound
914level a traceback object is inserted in front of the current
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000915traceback. When an exception handler is entered, the stack trace is
916made available to the program.
Fred Drake62364ff2003-03-20 18:17:16 +0000917(See section~\ref{try}, ``The \code{try} statement.'')
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000918It is accessible as \code{sys.exc_traceback}, and also as the third
919item of the tuple returned by \code{sys.exc_info()}. The latter is
920the preferred interface, since it works correctly when the program is
921using multiple threads.
922When the program contains no suitable handler, the stack trace is written
Fred Drakef6669171998-05-06 19:52:49 +0000923(nicely formatted) to the standard error stream; if the interpreter is
924interactive, it is also made available to the user as
925\code{sys.last_traceback}.
926\obindex{traceback}
927\indexii{stack}{trace}
928\indexii{exception}{handler}
929\indexii{execution}{stack}
Fred Drake4856d011999-01-12 04:15:20 +0000930\withsubitem{(in module sys)}{
931 \ttindex{exc_info}
932 \ttindex{exc_traceback}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000933 \ttindex{last_traceback}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000934\ttindex{sys.exc_info}
Fred Drakef6669171998-05-06 19:52:49 +0000935\ttindex{sys.exc_traceback}
936\ttindex{sys.last_traceback}
937
938Special read-only attributes: \member{tb_next} is the next level in the
939stack trace (towards the frame where the exception occurred), or
940\code{None} if there is no next level; \member{tb_frame} points to the
941execution frame of the current level; \member{tb_lineno} gives the line
942number where the exception occurred; \member{tb_lasti} indicates the
943precise instruction. The line number and last instruction in the
944traceback may differ from the line number of its frame object if the
945exception occurred in a \keyword{try} statement with no matching
946except clause or with a finally clause.
Fred Drake4856d011999-01-12 04:15:20 +0000947\withsubitem{(traceback attribute)}{
948 \ttindex{tb_next}
949 \ttindex{tb_frame}
950 \ttindex{tb_lineno}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000951 \ttindex{tb_lasti}}
Fred Drakef6669171998-05-06 19:52:49 +0000952\stindex{try}
953
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000954\item[Slice objects]
955Slice objects are used to represent slices when \emph{extended slice
956syntax} is used. This is a slice using two colons, or multiple slices
957or ellipses separated by commas, e.g., \code{a[i:j:step]}, \code{a[i:j,
Raymond Hettinger03ec6d52003-06-25 18:29:36 +0000958k:l]}, or \code{a[..., i:j]}. They are also created by the built-in
Fred Drake1e42d8a1998-11-25 17:58:50 +0000959\function{slice()}\bifuncindex{slice} function.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000960
Thomas Woutersf9b526d2000-07-16 19:05:38 +0000961Special read-only attributes: \member{start} is the lower bound;
962\member{stop} is the upper bound; \member{step} is the step value; each is
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000963\code{None} if omitted. These attributes can have any type.
Fred Drake4856d011999-01-12 04:15:20 +0000964\withsubitem{(slice object attribute)}{
965 \ttindex{start}
966 \ttindex{stop}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000967 \ttindex{step}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000968
Fred Drake5ec22f22002-09-24 21:09:13 +0000969Slice objects support one method:
970
971\begin{methoddesc}[slice]{indices}{self, length}
972This method takes a single integer argument \var{length} and computes
973information about the extended slice that the slice object would
974describe if applied to a sequence of \var{length} items. It returns a
975tuple of three integers; respectively these are the \var{start} and
976\var{stop} indices and the \var{step} or stride length of the slice.
977Missing or out-of-bounds indices are handled in a manner consistent
978with regular slices.
Michael W. Hudsonf0d777c2002-07-19 15:47:06 +0000979\versionadded{2.3}
Fred Drake5ec22f22002-09-24 21:09:13 +0000980\end{methoddesc}
Michael W. Hudsonf0d777c2002-07-19 15:47:06 +0000981
Fred Drakeea690c42003-07-17 05:26:53 +0000982\item[Static method objects]
983Static method objects provide a way of defeating the transformation
984of function objects to method objects described above. A static method
985object is a wrapper around any other object, usually a user-defined
986method object. When a static method object is retrieved from a class
987or a class instance, the object actually returned is the wrapped object,
988which is not subject to any further transformation. Static method
989objects are not themselves callable, although the objects they
990wrap usually are. Static method objects are created by the built-in
991\function{staticmethod()} constructor.
992
993\item[Class method objects]
994A class method object, like a static method object, is a wrapper
995around another object that alters the way in which that object
996is retrieved from classes and class instances. The behaviour of
997class method objects upon such retrieval is described above,
998under ``User-defined methods''. Class method objects are created
999by the built-in \function{classmethod()} constructor.
1000
Fred Drakef6669171998-05-06 19:52:49 +00001001\end{description} % Internal types
1002
1003\end{description} % Types
1004
1005
Fred Drake61c77281998-07-28 19:34:22 +00001006\section{Special method names\label{specialnames}}
Fred Drakef6669171998-05-06 19:52:49 +00001007
1008A class can implement certain operations that are invoked by special
Fred Draked82575d1998-08-28 20:03:12 +00001009syntax (such as arithmetic operations or subscripting and slicing) by
Fred Drake7af9f4d2003-05-12 13:50:11 +00001010defining methods with special names.\indexii{operator}{overloading}
1011This is Python's approach to \dfn{operator overloading}, allowing
1012classes to define their own behavior with respect to language
1013operators. For instance, if a class defines
Fred Draked82575d1998-08-28 20:03:12 +00001014a method named \method{__getitem__()}, and \code{x} is an instance of
1015this class, then \code{x[i]} is equivalent to
Raymond Hettinger94153092002-05-12 03:09:25 +00001016\code{x.__getitem__(i)}. Except where mentioned, attempts to execute
1017an operation raise an exception when no appropriate method is defined.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001018\withsubitem{(mapping object method)}{\ttindex{__getitem__()}}
Fred Drakef6669171998-05-06 19:52:49 +00001019
Fred Drake0c475592000-12-07 04:49:34 +00001020When implementing a class that emulates any built-in type, it is
1021important that the emulation only be implemented to the degree that it
1022makes sense for the object being modelled. For example, some
1023sequences may work well with retrieval of individual elements, but
1024extracting a slice may not make sense. (One example of this is the
1025\class{NodeList} interface in the W3C's Document Object Model.)
1026
Fred Drakef6669171998-05-06 19:52:49 +00001027
Fred Drake61c77281998-07-28 19:34:22 +00001028\subsection{Basic customization\label{customization}}
Fred Drakef6669171998-05-06 19:52:49 +00001029
Fred Drake044bb4d2001-08-02 15:53:05 +00001030\begin{methoddesc}[object]{__init__}{self\optional{, \moreargs}}
1031Called\indexii{class}{constructor} when the instance is created. The
1032arguments are those passed to the class constructor expression. If a
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001033base class has an \method{__init__()} method, the derived class's
1034\method{__init__()} method, if any, must explicitly call it to ensure proper
Fred Drake044bb4d2001-08-02 15:53:05 +00001035initialization of the base class part of the instance; for example:
1036\samp{BaseClass.__init__(\var{self}, [\var{args}...])}. As a special
1037contraint on constructors, no value may be returned; doing so will
1038cause a \exception{TypeError} to be raised at runtime.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001039\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001040
1041
Fred Drake1e42d8a1998-11-25 17:58:50 +00001042\begin{methoddesc}[object]{__del__}{self}
Guido van Rossum7c0240f1998-07-24 15:36:43 +00001043Called when the instance is about to be destroyed. This is also
1044called a destructor\index{destructor}. If a base class
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001045has a \method{__del__()} method, the derived class's \method{__del__()}
1046method, if any,
Fred Drakef6669171998-05-06 19:52:49 +00001047must explicitly call it to ensure proper deletion of the base class
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001048part of the instance. Note that it is possible (though not recommended!)
1049for the \method{__del__()}
Fred Drakef6669171998-05-06 19:52:49 +00001050method to postpone destruction of the instance by creating a new
1051reference to it. It may then be called at a later time when this new
1052reference is deleted. It is not guaranteed that
1053\method{__del__()} methods are called for objects that still exist when
1054the interpreter exits.
Fred Drakef6669171998-05-06 19:52:49 +00001055\stindex{del}
1056
Fred Drake591dd8f2001-12-14 22:52:41 +00001057\begin{notice}
1058\samp{del x} doesn't directly call
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001059\code{x.__del__()} --- the former decrements the reference count for
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001060\code{x} by one, and the latter is only called when \code{x}'s reference
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001061count reaches zero. Some common situations that may prevent the
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001062reference count of an object from going to zero include: circular
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001063references between objects (e.g., a doubly-linked list or a tree data
1064structure with parent and child pointers); a reference to the object
1065on the stack frame of a function that caught an exception (the
1066traceback stored in \code{sys.exc_traceback} keeps the stack frame
1067alive); or a reference to the object on the stack frame that raised an
1068unhandled exception in interactive mode (the traceback stored in
1069\code{sys.last_traceback} keeps the stack frame alive). The first
1070situation can only be remedied by explicitly breaking the cycles; the
Fred Drake591dd8f2001-12-14 22:52:41 +00001071latter two situations can be resolved by storing \code{None} in
1072\code{sys.exc_traceback} or \code{sys.last_traceback}. Circular
1073references which are garbage are detected when the option cycle
1074detector is enabled (it's on by default), but can only be cleaned up
1075if there are no Python-level \method{__del__()} methods involved.
1076Refer to the documentation for the \ulink{\module{gc}
1077module}{../lib/module-gc.html} for more information about how
1078\method{__del__()} methods are handled by the cycle detector,
1079particularly the description of the \code{garbage} value.
1080\end{notice}
Fred Drakef6669171998-05-06 19:52:49 +00001081
Fred Drake591dd8f2001-12-14 22:52:41 +00001082\begin{notice}[warning]
1083Due to the precarious circumstances under which
Fred Draked82575d1998-08-28 20:03:12 +00001084\method{__del__()} methods are invoked, exceptions that occur during their
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001085execution are ignored, and a warning is printed to \code{sys.stderr}
Fred Drake591dd8f2001-12-14 22:52:41 +00001086instead. Also, when \method{__del__()} is invoked in response to a module
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001087being deleted (e.g., when execution of the program is done), other
Fred Draked82575d1998-08-28 20:03:12 +00001088globals referenced by the \method{__del__()} method may already have been
1089deleted. For this reason, \method{__del__()} methods should do the
Raymond Hettingera0e4d6c2002-09-08 21:10:54 +00001090absolute minimum needed to maintain external invariants. Starting with
1091version 1.5, Python guarantees that globals whose name begins with a single
1092underscore are deleted from their module before other globals are deleted;
1093if no other references to such globals exist, this may help in assuring that
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001094imported modules are still available at the time when the
Fred Drake591dd8f2001-12-14 22:52:41 +00001095\method{__del__()} method is called.
1096\end{notice}
Fred Drake1e42d8a1998-11-25 17:58:50 +00001097\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001098
Fred Drake1e42d8a1998-11-25 17:58:50 +00001099\begin{methoddesc}[object]{__repr__}{self}
Fred Drake82385871998-10-01 20:40:43 +00001100Called by the \function{repr()}\bifuncindex{repr} built-in function
1101and by string conversions (reverse quotes) to compute the ``official''
Andrew M. Kuchling68abe832000-12-19 14:09:21 +00001102string representation of an object. If at all possible, this should
Guido van Rossum035f7e82000-12-19 04:18:13 +00001103look like a valid Python expression that could be used to recreate an
1104object with the same value (given an appropriate environment). If
1105this is not possible, a string of the form \samp{<\var{...some useful
1106description...}>} should be returned. The return value must be a
1107string object.
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001108If a class defines \method{__repr__()} but not \method{__str__()},
1109then \method{__repr__()} is also used when an ``informal'' string
1110representation of instances of that class is required.
Guido van Rossum035f7e82000-12-19 04:18:13 +00001111
1112This is typically used for debugging, so it is important that the
1113representation is information-rich and unambiguous.
Fred Drakef6669171998-05-06 19:52:49 +00001114\indexii{string}{conversion}
1115\indexii{reverse}{quotes}
1116\indexii{backward}{quotes}
1117\index{back-quotes}
Fred Drake1e42d8a1998-11-25 17:58:50 +00001118\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001119
Fred Drake1e42d8a1998-11-25 17:58:50 +00001120\begin{methoddesc}[object]{__str__}{self}
Fred Draked82575d1998-08-28 20:03:12 +00001121Called by the \function{str()}\bifuncindex{str} built-in function and
1122by the \keyword{print}\stindex{print} statement to compute the
Fred Drake82385871998-10-01 20:40:43 +00001123``informal'' string representation of an object. This differs from
1124\method{__repr__()} in that it does not have to be a valid Python
1125expression: a more convenient or concise representation may be used
Guido van Rossum035f7e82000-12-19 04:18:13 +00001126instead. The return value must be a string object.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001127\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001128
Guido van Rossumab782dd2001-01-18 15:17:06 +00001129\begin{methoddesc}[object]{__lt__}{self, other}
1130\methodline[object]{__le__}{self, other}
1131\methodline[object]{__eq__}{self, other}
1132\methodline[object]{__ne__}{self, other}
1133\methodline[object]{__gt__}{self, other}
1134\methodline[object]{__ge__}{self, other}
1135\versionadded{2.1}
1136These are the so-called ``rich comparison'' methods, and are called
1137for comparison operators in preference to \method{__cmp__()} below.
1138The correspondence between operator symbols and method names is as
1139follows:
1140\code{\var{x}<\var{y}} calls \code{\var{x}.__lt__(\var{y})},
1141\code{\var{x}<=\var{y}} calls \code{\var{x}.__le__(\var{y})},
1142\code{\var{x}==\var{y}} calls \code{\var{x}.__eq__(\var{y})},
1143\code{\var{x}!=\var{y}} and \code{\var{x}<>\var{y}} call
1144\code{\var{x}.__ne__(\var{y})},
1145\code{\var{x}>\var{y}} calls \code{\var{x}.__gt__(\var{y})}, and
1146\code{\var{x}>=\var{y}} calls \code{\var{x}.__ge__(\var{y})}.
1147These methods can return any value, but if the comparison operator is
1148used in a Boolean context, the return value should be interpretable as
1149a Boolean value, else a \exception{TypeError} will be raised.
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001150By convention, \code{False} is used for false and \code{True} for true.
Guido van Rossumab782dd2001-01-18 15:17:06 +00001151
Raymond Hettinger4d6e8fe2003-07-16 19:40:23 +00001152There are no implied relationships among the comparison operators.
Raymond Hettinger943277e2003-07-17 14:47:12 +00001153The truth of \code{\var{x}==\var{y}} does not imply that \code{\var{x}!=\var{y}}
Raymond Hettinger4d6e8fe2003-07-16 19:40:23 +00001154is false. Accordingly, when defining \method{__eq__}, one should also
1155define \method{__ne__} so that the operators will behave as expected.
1156
Guido van Rossumab782dd2001-01-18 15:17:06 +00001157There are no reflected (swapped-argument) versions of these methods
1158(to be used when the left argument does not support the operation but
1159the right argument does); rather, \method{__lt__()} and
1160\method{__gt__()} are each other's reflection, \method{__le__()} and
1161\method{__ge__()} are each other's reflection, and \method{__eq__()}
1162and \method{__ne__()} are their own reflection.
1163
1164Arguments to rich comparison methods are never coerced. A rich
1165comparison method may return \code{NotImplemented} if it does not
1166implement the operation for a given pair of arguments.
1167\end{methoddesc}
1168
Fred Drake1e42d8a1998-11-25 17:58:50 +00001169\begin{methoddesc}[object]{__cmp__}{self, other}
Guido van Rossumab782dd2001-01-18 15:17:06 +00001170Called by comparison operations if rich comparison (see above) is not
Fred Drake597bc1d2001-05-29 16:02:35 +00001171defined. Should return a negative integer if \code{self < other},
1172zero if \code{self == other}, a positive integer if \code{self >
1173other}. If no \method{__cmp__()}, \method{__eq__()} or
1174\method{__ne__()} operation is defined, class instances are compared
1175by object identity (``address''). See also the description of
1176\method{__hash__()} for some important notes on creating objects which
1177support custom comparison operations and are usable as dictionary
1178keys.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001179(Note: the restriction that exceptions are not propagated by
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001180\method{__cmp__()} has been removed since Python 1.5.)
Fred Drakef6669171998-05-06 19:52:49 +00001181\bifuncindex{cmp}
1182\index{comparisons}
Fred Drake1e42d8a1998-11-25 17:58:50 +00001183\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001184
Fred Drakee57a1142000-06-15 20:07:25 +00001185\begin{methoddesc}[object]{__rcmp__}{self, other}
Fred Drake445f8322001-01-04 15:11:48 +00001186 \versionchanged[No longer supported]{2.1}
Fred Drakee57a1142000-06-15 20:07:25 +00001187\end{methoddesc}
1188
Fred Drake1e42d8a1998-11-25 17:58:50 +00001189\begin{methoddesc}[object]{__hash__}{self}
Fred Draked82575d1998-08-28 20:03:12 +00001190Called for the key object for dictionary\obindex{dictionary}
1191operations, and by the built-in function
Fred Drakef6669171998-05-06 19:52:49 +00001192\function{hash()}\bifuncindex{hash}. Should return a 32-bit integer
1193usable as a hash value
1194for dictionary operations. The only required property is that objects
1195which compare equal have the same hash value; it is advised to somehow
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001196mix together (e.g., using exclusive or) the hash values for the
Fred Drakef6669171998-05-06 19:52:49 +00001197components of the object that also play a part in comparison of
1198objects. If a class does not define a \method{__cmp__()} method it should
1199not define a \method{__hash__()} operation either; if it defines
Fred Drake597bc1d2001-05-29 16:02:35 +00001200\method{__cmp__()} or \method{__eq__()} but not \method{__hash__()},
1201its instances will not be usable as dictionary keys. If a class
1202defines mutable objects and implements a \method{__cmp__()} or
1203\method{__eq__()} method, it should not implement \method{__hash__()},
1204since the dictionary implementation requires that a key's hash value
1205is immutable (if the object's hash value changes, it will be in the
1206wrong hash bucket).
Fred Drake1e42d8a1998-11-25 17:58:50 +00001207\withsubitem{(object method)}{\ttindex{__cmp__()}}
1208\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001209
Fred Drake1e42d8a1998-11-25 17:58:50 +00001210\begin{methoddesc}[object]{__nonzero__}{self}
Guido van Rossum77f6a652002-04-03 22:41:51 +00001211Called to implement truth value testing, and the built-in operation
1212\code{bool()}; should return \code{False} or \code{True}, or their
1213integer equivalents \code{0} or \code{1}.
1214When this method is not defined, \method{__len__()} is
Fred Draked82575d1998-08-28 20:03:12 +00001215called, if it is defined (see below). If a class defines neither
1216\method{__len__()} nor \method{__nonzero__()}, all its instances are
1217considered true.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001218\withsubitem{(mapping object method)}{\ttindex{__len__()}}
1219\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001220
Martin v. Löwis2a519f82002-04-11 12:39:35 +00001221\begin{methoddesc}[object]{__unicode__}{self}
1222Called to implement \function{unicode()}\bifuncindex{unicode} builtin;
1223should return a Unicode object. When this method is not defined, string
1224conversion is attempted, and the result of string conversion is converted
1225to Unicode using the system default encoding.
1226\end{methoddesc}
1227
Fred Drakef6669171998-05-06 19:52:49 +00001228
Fred Drake61c77281998-07-28 19:34:22 +00001229\subsection{Customizing attribute access\label{attribute-access}}
Fred Drakef6669171998-05-06 19:52:49 +00001230
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001231The following methods can be defined to customize the meaning of
1232attribute access (use of, assignment to, or deletion of \code{x.name})
1233for class instances.
Fred Drakef6669171998-05-06 19:52:49 +00001234
Fred Drake1e42d8a1998-11-25 17:58:50 +00001235\begin{methoddesc}[object]{__getattr__}{self, name}
Fred Drakef6669171998-05-06 19:52:49 +00001236Called when an attribute lookup has not found the attribute in the
1237usual places (i.e. it is not an instance attribute nor is it found in
1238the class tree for \code{self}). \code{name} is the attribute name.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001239This method should return the (computed) attribute value or raise an
Fred Draked82575d1998-08-28 20:03:12 +00001240\exception{AttributeError} exception.
Fred Drakef6669171998-05-06 19:52:49 +00001241
1242Note that if the attribute is found through the normal mechanism,
Fred Draked82575d1998-08-28 20:03:12 +00001243\method{__getattr__()} is not called. (This is an intentional
1244asymmetry between \method{__getattr__()} and \method{__setattr__()}.)
Fred Drakef6669171998-05-06 19:52:49 +00001245This is done both for efficiency reasons and because otherwise
Fred Draked82575d1998-08-28 20:03:12 +00001246\method{__setattr__()} would have no way to access other attributes of
Guido van Rossumd41eea02003-02-28 14:11:45 +00001247the instance. Note that at least for instance variables, you can fake
1248total control by not inserting any values in the instance attribute
1249dictionary (but instead inserting them in another object). See the
1250\method{__getattribute__()} method below for a way to actually get
1251total control in new-style classes.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001252\withsubitem{(object method)}{\ttindex{__setattr__()}}
1253\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001254
Fred Drake1e42d8a1998-11-25 17:58:50 +00001255\begin{methoddesc}[object]{__setattr__}{self, name, value}
Fred Drakef6669171998-05-06 19:52:49 +00001256Called when an attribute assignment is attempted. This is called
Fred Draked82575d1998-08-28 20:03:12 +00001257instead of the normal mechanism (i.e.\ store the value in the instance
1258dictionary). \var{name} is the attribute name, \var{value} is the
Fred Drakef6669171998-05-06 19:52:49 +00001259value to be assigned to it.
Fred Drakef6669171998-05-06 19:52:49 +00001260
Fred Draked82575d1998-08-28 20:03:12 +00001261If \method{__setattr__()} wants to assign to an instance attribute, it
1262should not simply execute \samp{self.\var{name} = value} --- this
1263would cause a recursive call to itself. Instead, it should insert the
1264value in the dictionary of instance attributes, e.g.,
Guido van Rossumd41eea02003-02-28 14:11:45 +00001265\samp{self.__dict__[\var{name}] = value}. For new-style classes,
1266rather than accessing the instance dictionary, it should call the base
1267class method with the same name, for example,
1268\samp{object.__setattr__(self, name, value)}.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001269\withsubitem{(instance attribute)}{\ttindex{__dict__}}
1270\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001271
Fred Drake1e42d8a1998-11-25 17:58:50 +00001272\begin{methoddesc}[object]{__delattr__}{self, name}
Fred Draked82575d1998-08-28 20:03:12 +00001273Like \method{__setattr__()} but for attribute deletion instead of
Fred Drake1e42d8a1998-11-25 17:58:50 +00001274assignment. This should only be implemented if \samp{del
1275obj.\var{name}} is meaningful for the object.
1276\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001277
Michael W. Hudson2ab1d082003-03-05 14:20:58 +00001278\subsubsection{More attribute access for new-style classes \label{new-style-attribute-access}}
Guido van Rossumd41eea02003-02-28 14:11:45 +00001279
1280The following methods only apply to new-style classes.
1281
1282\begin{methoddesc}[object]{__getattribute__}{self, name}
1283Called unconditionally to implement attribute accesses for instances
1284of the class. If the class also defines \method{__getattr__}, it will
1285never be called (unless called explicitly).
1286This method should return the (computed) attribute
1287value or raise an \exception{AttributeError} exception.
1288In order to avoid infinite recursion in this method, its
1289implementation should always call the base class method with the same
Fred Drake62364ff2003-03-20 18:17:16 +00001290name to access any attributes it needs, for example,
Guido van Rossumd41eea02003-02-28 14:11:45 +00001291\samp{object.__getattribute__(self, name)}.
1292\end{methoddesc}
1293
Michael W. Hudson2ab1d082003-03-05 14:20:58 +00001294\subsubsection{Implementing Descriptors \label{descriptors}}
Guido van Rossumd41eea02003-02-28 14:11:45 +00001295
1296The following methods only apply when an instance of the class
Fred Drake62364ff2003-03-20 18:17:16 +00001297containing the method (a so-called \emph{descriptor} class) appears in
Guido van Rossumd41eea02003-02-28 14:11:45 +00001298the class dictionary of another new-style class, known as the
1299\emph{owner} class. In the examples below, ``the attribute'' refers to
Fred Drake62364ff2003-03-20 18:17:16 +00001300the attribute whose name is the key of the property in the owner
Guido van Rossumd41eea02003-02-28 14:11:45 +00001301class' \code{__dict__}.
1302
1303\begin{methoddesc}[object]{__get__}{self, instance, owner}
Fred Drake62364ff2003-03-20 18:17:16 +00001304Called to get the attribute of the owner class (class attribute access)
Guido van Rossumd41eea02003-02-28 14:11:45 +00001305or of an instance of that class (instance attribute acces).
1306\var{owner} is always the owner class, while \var{instance} is the
1307instance that the attribute was accessed through, or \code{None} when
1308the attribute is accessed through the \var{owner}. This method should
1309return the (computed) attribute value or raise an
1310\exception{AttributeError} exception.
1311\end{methoddesc}
1312
1313\begin{methoddesc}[object]{__set__}{self, instance, value}
Michael W. Hudson2ab1d082003-03-05 14:20:58 +00001314Called to set the attribute on an instance \var{instance} of the owner
Guido van Rossumd41eea02003-02-28 14:11:45 +00001315class to a new value, \var{value}.
1316\end{methoddesc}
1317
1318\begin{methoddesc}[object]{__delete__}{self, instance}
Michael W. Hudson2ab1d082003-03-05 14:20:58 +00001319Called to delete the attribute on an instance \var{instance} of the
1320owner class.
Guido van Rossumd41eea02003-02-28 14:11:45 +00001321\end{methoddesc}
1322
Fred Drakef6669171998-05-06 19:52:49 +00001323
Fred Drake4db36612003-06-26 03:11:20 +00001324\subsubsection{Invoking Descriptors \label{descriptor-invocation}}
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001325
1326In general, a descriptor is an object attribute with ``binding behavior'',
1327one whose attribute access has been overridden by methods in the descriptor
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001328protocol: \method{__get__()}, \method{__set__()}, and \method{__delete__()}.
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001329If any of those methods are defined for an object, it is said to be a
1330descriptor.
1331
1332The default behavior for attribute access is to get, set, or delete the
1333attribute from an object's dictionary. For instance, \code{a.x} has a
1334lookup chain starting with \code{a.__dict__['x']}, then
1335\code{type(a).__dict__['x']}, and continuing
1336through the base classes of \code{type(a)} excluding metaclasses.
1337
1338However, if the looked-up value is an object defining one of the descriptor
1339methods, then Python may override the default behavior and invoke the
1340descriptor method instead. Where this occurs in the precedence chain depends
1341on which descriptor methods were defined and how they were called. Note that
1342descriptors are only invoked for new style objects or classes
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001343(ones that subclass \class{object()} or \class{type()}).
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001344
1345The starting point for descriptor invocation is a binding, \code{a.x}.
1346How the arguments are assembled depends on \code{a}:
1347
1348\begin{itemize}
1349
1350 \item[Direct Call] The simplest and least common call is when user code
1351 directly invokes a descriptor method: \code{x.__get__(a)}.
1352
1353 \item[Instance Binding] If binding to a new-style object instance,
1354 \code{a.x} is transformed into the call:
1355 \code{type(a).__dict__['x'].__get__(a, type(a))}.
1356
1357 \item[Class Binding] If binding to a new-style class, \code{A.x}
1358 is transformed into the call: \code{A.__dict__['x'].__get__(None, A)}.
1359
1360 \item[Super Binding] If \code{a} is an instance of \class{super},
1361 then the binding \code{super(B, obj).m()} searches
1362 \code{obj.__class__.__mro__} for the base class \code{A} immediately
1363 preceding \code{B} and then invokes the descriptor with the call:
1364 \code{A.__dict__['m'].__get__(obj, A)}.
1365
1366\end{itemize}
1367
1368For instance bindings, the precedence of descriptor invocation depends
1369on the which descriptor methods are defined. Data descriptors define
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001370both \method{__get__()} and \method{__set__()}. Non-data descriptors have
1371just the \method{__get__()} method. Data descriptors always override
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001372a redefinition in an instance dictionary. In contrast, non-data
1373descriptors can be overridden by instances.
1374
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001375Python methods (including \function{staticmethod()} and \function{classmethod()})
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001376are implemented as non-data descriptors. Accordingly, instances can
1377redefine and override methods. This allows individual instances to acquire
1378behaviors that differ from other instances of the same class.
1379
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001380The \function{property()} function is implemented as a data descriptor.
1381Accordingly, instances cannot override the behavior of a property.
1382
1383
1384\subsubsection{__slots__\label{slots}}
1385
1386By default, instances of both old and new-style classes have a dictionary
1387for attribute storage. This wastes space for objects having very few instance
1388variables. The space consumption can become acute when creating large numbers
1389of instances.
1390
1391The default can be overridden by defining \var{__slots__} in a new-style class
1392definition. The \var{__slots__} declaration takes a sequence of instance
1393variables and reserves just enough space in each instance to hold a value
1394for each variable. Space is saved because \var{__dict__} is not created for
1395each instance.
1396
1397\begin{datadesc}{__slots__}
1398This class variable can be assigned a string, iterable, or sequence of strings
Raymond Hettinger5918f8d2003-06-29 04:53:23 +00001399with variable names used by instances. If defined in a new-style class,
1400\var{__slots__} reserves space for the declared variables
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001401and prevents the automatic creation of \var{__dict__} and \var{__weakref__}
1402for each instance.
1403\versionadded{2.2}
1404\end{datadesc}
1405
1406\noindent
Raymond Hettinger5918f8d2003-06-29 04:53:23 +00001407Notes on using \var{__slots__}
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001408
1409\begin{itemize}
1410
1411\item Without a \var{__dict__} variable, instances cannot be assigned new
1412variables not listed in the \var{__slots__} definition. Attempts to assign
1413to an unlisted variable name raises \exception{AttributeError}. If dynamic
1414assignment of new variables is desired, then add \code{'__dict__'} to the
1415sequence of strings in the \var{__slots__} declaration.
1416\versionchanged[Previously, adding \code{'__dict__'} to the \var{__slots__}
1417declaration would not enable the assignment of new attributes not
1418specifically listed in the sequence of instance variable names]{2.3}
1419
1420\item Without a \var{__weakref__} variable for each instance, classes
1421defining \var{__slots__} do not support weak references to its instances.
1422If weak reference support is needed, then add \code{'__weakref__'} to the
1423sequence of strings in the \var{__slots__} declaration.
Raymond Hettinger5918f8d2003-06-29 04:53:23 +00001424\versionchanged[Previously, adding \code{'__weakref__'} to the \var{__slots__}
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001425declaration would not enable support for weak references]{2.3}
1426
1427\item \var{__slots__} are implemented at the class level by creating
1428descriptors (\ref{descriptors}) for each variable name. As a result,
1429class attributes cannot be used to set default values for instance
1430variables defined by \var{__slots__}; otherwise, the class attribute would
1431overwrite the descriptor assignment.
1432
1433\item If a class defines a slot also defined in a base class, the instance
1434variable defined by the base class slot is inaccessible (except by retrieving
1435its descriptor directly from the base class). This renders the meaning of the
1436program undefined. In the future, a check may be added to prevent this.
1437
1438\item The action of a \var{__slots__} declaration is limited to the class
1439where it is defined. As a result, subclasses will have a \var{__dict__}
1440unless they also define \var{__slots__}.
1441
1442\item \var{__slots__} do not work for classes derived from ``variable-length''
1443built-in types such as \class{long}, \class{str} and \class{tuple}.
1444
Raymond Hettinger5918f8d2003-06-29 04:53:23 +00001445\item Any non-string iterable may be assigned to \var{__slots__}.
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001446Mappings may also be used; however, in the future, special meaning may
1447be assigned to the values corresponding to each key.
1448
1449\end{itemize}
1450
1451
1452\subsection{Customizing class creation\label{metaclasses}}
1453
1454By default, new-style classes are constructed using \function{type()}.
1455A class definition is read into a separate namespace and the value
1456of class name is bound to the result of \code{type(name, bases, dict)}.
1457
1458When the class definition is read, if \var{__metaclass__} is defined
1459then the callable assigned to it will be called instead of \function{type()}.
1460The allows classes or functions to be written which monitor or alter the class
1461creation process:
1462
1463\begin{itemize}
1464\item Modifying the class dictionary prior to the class being created.
1465\item Returning an instance of another class -- essentially performing
1466the role of a factory function.
1467\end{itemize}
1468
1469\begin{datadesc}{__metaclass__}
1470This variable can be any callable accepting arguments for \code{name},
1471\code{bases}, and \code{dict}. Upon class creation, the callable is
1472used instead of the built-in \function{type()}.
1473\versionadded{2.2}
1474\end{datadesc}
1475
1476The appropriate metaclass is determined by the following precedence rules:
1477
1478\begin{itemize}
1479
1480\item If \code{dict['__metaclass__']} exists, it is used.
1481
1482\item Otherwise, if there is at least one base class, its metaclass is used
1483(this looks for a \var{__class__} attribute first and if not found, uses its
1484type).
1485
1486\item Otherwise, if a global variable named __metaclass__ exists, it is used.
1487
1488\item Otherwise, the old-style, classic metaclass (types.ClassType) is used.
1489
1490\end{itemize}
1491
1492The potential uses for metaclasses are boundless. Some ideas that have
1493been explored including logging, interface checking, automatic delegation,
1494automatic property creation, proxies, frameworks, and automatic resource
1495locking/synchronization.
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001496
1497
Fred Drake61c77281998-07-28 19:34:22 +00001498\subsection{Emulating callable objects\label{callable-types}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001499
Fred Drake1e42d8a1998-11-25 17:58:50 +00001500\begin{methoddesc}[object]{__call__}{self\optional{, args...}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001501Called when the instance is ``called'' as a function; if this method
Fred Draked82575d1998-08-28 20:03:12 +00001502is defined, \code{\var{x}(arg1, arg2, ...)} is a shorthand for
1503\code{\var{x}.__call__(arg1, arg2, ...)}.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001504\indexii{call}{instance}
Fred Drake1e42d8a1998-11-25 17:58:50 +00001505\end{methoddesc}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001506
1507
Fred Drake73921b02001-10-01 16:32:13 +00001508\subsection{Emulating container types\label{sequence-types}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001509
Fred Drake73921b02001-10-01 16:32:13 +00001510The following methods can be defined to implement container
1511objects. Containers usually are sequences (such as lists or tuples)
1512or mappings (like dictionaries), but can represent other containers as
1513well. The first set of methods is used either to emulate a
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001514sequence or to emulate a mapping; the difference is that for a
1515sequence, the allowable keys should be the integers \var{k} for which
1516\code{0 <= \var{k} < \var{N}} where \var{N} is the length of the
Thomas Wouters1d75a792000-08-17 22:37:32 +00001517sequence, or slice objects, which define a range of items. (For backwards
1518compatibility, the method \method{__getslice__()} (see below) can also be
1519defined to handle simple, but not extended slices.) It is also recommended
Fred Drakea0073822000-08-18 02:42:14 +00001520that mappings provide the methods \method{keys()}, \method{values()},
Thomas Wouters1d75a792000-08-17 22:37:32 +00001521\method{items()}, \method{has_key()}, \method{get()}, \method{clear()},
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001522\method{setdefault()}, \method{iterkeys()}, \method{itervalues()},
Raymond Hettingerf4ca5a22003-01-19 14:57:12 +00001523\method{iteritems()}, \method{pop()}, \method{popitem()},
Thomas Wouters1d75a792000-08-17 22:37:32 +00001524\method{copy()}, and \method{update()} behaving similar to those for
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001525Python's standard dictionary objects. The \module{UserDict} module
1526provides a \class{DictMixin} class to help create those methods
1527from a base set of \method{__getitem__()}, \method{__setitem__()},
1528\method{__delitem__()}, and \method{keys()}.
1529Mutable sequences should provide
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001530methods \method{append()}, \method{count()}, \method{index()},
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001531\method{extend()},
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001532\method{insert()}, \method{pop()}, \method{remove()}, \method{reverse()}
1533and \method{sort()}, like Python standard list objects. Finally,
1534sequence types should implement addition (meaning concatenation) and
1535multiplication (meaning repetition) by defining the methods
Thomas Wouters12bba852000-08-24 20:06:04 +00001536\method{__add__()}, \method{__radd__()}, \method{__iadd__()},
1537\method{__mul__()}, \method{__rmul__()} and \method{__imul__()} described
1538below; they should not define \method{__coerce__()} or other numerical
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00001539operators. It is recommended that both mappings and sequences
Fred Drake18d8d5a2001-09-18 17:58:20 +00001540implement the \method{__contains__()} method to allow efficient use of
1541the \code{in} operator; for mappings, \code{in} should be equivalent
1542of \method{has_key()}; for sequences, it should search through the
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001543values. It is further recommended that both mappings and sequences
1544implement the \method{__iter__()} method to allow efficient iteration
1545through the container; for mappings, \method{__iter__()} should be
1546the same as \method{iterkeys()}; for sequences, it should iterate
1547through the values.
Fred Drake4856d011999-01-12 04:15:20 +00001548\withsubitem{(mapping object method)}{
1549 \ttindex{keys()}
1550 \ttindex{values()}
1551 \ttindex{items()}
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001552 \ttindex{iterkeys()}
1553 \ttindex{itervalues()}
1554 \ttindex{iteritems()}
Fred Drake4856d011999-01-12 04:15:20 +00001555 \ttindex{has_key()}
1556 \ttindex{get()}
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001557 \ttindex{setdefault()}
1558 \ttindex{pop()}
1559 \ttindex{popitem()}
Fred Drake4856d011999-01-12 04:15:20 +00001560 \ttindex{clear()}
1561 \ttindex{copy()}
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00001562 \ttindex{update()}
1563 \ttindex{__contains__()}}
Fred Drake4856d011999-01-12 04:15:20 +00001564\withsubitem{(sequence object method)}{
1565 \ttindex{append()}
1566 \ttindex{count()}
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001567 \ttindex{extend()}
Fred Drake4856d011999-01-12 04:15:20 +00001568 \ttindex{index()}
1569 \ttindex{insert()}
1570 \ttindex{pop()}
1571 \ttindex{remove()}
1572 \ttindex{reverse()}
1573 \ttindex{sort()}
1574 \ttindex{__add__()}
1575 \ttindex{__radd__()}
Thomas Wouters12bba852000-08-24 20:06:04 +00001576 \ttindex{__iadd__()}
Fred Drake4856d011999-01-12 04:15:20 +00001577 \ttindex{__mul__()}
Thomas Wouters12bba852000-08-24 20:06:04 +00001578 \ttindex{__rmul__()}
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00001579 \ttindex{__imul__()}
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001580 \ttindex{__contains__()}
1581 \ttindex{__iter__()}}
Fred Drakeae3e5741999-01-28 23:21:49 +00001582\withsubitem{(numeric object method)}{\ttindex{__coerce__()}}
Fred Drakef6669171998-05-06 19:52:49 +00001583
Fred Drake73921b02001-10-01 16:32:13 +00001584\begin{methoddesc}[container object]{__len__}{self}
Fred Draked82575d1998-08-28 20:03:12 +00001585Called to implement the built-in function
1586\function{len()}\bifuncindex{len}. Should return the length of the
1587object, an integer \code{>=} 0. Also, an object that doesn't define a
1588\method{__nonzero__()} method and whose \method{__len__()} method
1589returns zero is considered to be false in a Boolean context.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001590\withsubitem{(object method)}{\ttindex{__nonzero__()}}
1591\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001592
Fred Drake73921b02001-10-01 16:32:13 +00001593\begin{methoddesc}[container object]{__getitem__}{self, key}
Fred Draked82575d1998-08-28 20:03:12 +00001594Called to implement evaluation of \code{\var{self}[\var{key}]}.
Fred Drake31575ce2000-09-21 05:28:26 +00001595For sequence types, the accepted keys should be integers and slice
1596objects.\obindex{slice} Note that
1597the special interpretation of negative indexes (if the class wishes to
Fred Drakef6669171998-05-06 19:52:49 +00001598emulate a sequence type) is up to the \method{__getitem__()} method.
Fred Drake91826ed2000-07-13 04:57:58 +00001599If \var{key} is of an inappropriate type, \exception{TypeError} may be
1600raised; if of a value outside the set of indexes for the sequence
1601(after any special interpretation of negative values),
1602\exception{IndexError} should be raised.
Fred Drake0aa811c2001-10-20 04:24:09 +00001603\note{\keyword{for} loops expect that an
Fred Drake91826ed2000-07-13 04:57:58 +00001604\exception{IndexError} will be raised for illegal indexes to allow
Fred Drake0aa811c2001-10-20 04:24:09 +00001605proper detection of the end of the sequence.}
Fred Drake1e42d8a1998-11-25 17:58:50 +00001606\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001607
Fred Drake73921b02001-10-01 16:32:13 +00001608\begin{methoddesc}[container object]{__setitem__}{self, key, value}
Fred Draked82575d1998-08-28 20:03:12 +00001609Called to implement assignment to \code{\var{self}[\var{key}]}. Same
Fred Drake1e42d8a1998-11-25 17:58:50 +00001610note as for \method{__getitem__()}. This should only be implemented
1611for mappings if the objects support changes to the values for keys, or
1612if new keys can be added, or for sequences if elements can be
Fred Drake91826ed2000-07-13 04:57:58 +00001613replaced. The same exceptions should be raised for improper
1614\var{key} values as for the \method{__getitem__()} method.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001615\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001616
Fred Drake73921b02001-10-01 16:32:13 +00001617\begin{methoddesc}[container object]{__delitem__}{self, key}
Fred Draked82575d1998-08-28 20:03:12 +00001618Called to implement deletion of \code{\var{self}[\var{key}]}. Same
Fred Drake1e42d8a1998-11-25 17:58:50 +00001619note as for \method{__getitem__()}. This should only be implemented
1620for mappings if the objects support removal of keys, or for sequences
Fred Drake91826ed2000-07-13 04:57:58 +00001621if elements can be removed from the sequence. The same exceptions
1622should be raised for improper \var{key} values as for the
1623\method{__getitem__()} method.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001624\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001625
Fred Drake73921b02001-10-01 16:32:13 +00001626\begin{methoddesc}[container object]{__iter__}{self}
1627This method is called when an iterator is required for a container.
1628This method should return a new iterator object that can iterate over
1629all the objects in the container. For mappings, it should iterate
1630over the keys of the container, and should also be made available as
1631the method \method{iterkeys()}.
1632
1633Iterator objects also need to implement this method; they are required
1634to return themselves. For more information on iterator objects, see
1635``\ulink{Iterator Types}{../lib/typeiter.html}'' in the
1636\citetitle[../lib/lib.html]{Python Library Reference}.
1637\end{methoddesc}
1638
1639The membership test operators (\keyword{in} and \keyword{not in}) are
1640normally implemented as an iteration through a sequence. However,
1641container objects can supply the following special method with a more
1642efficient implementation, which also does not require the object be a
1643sequence.
1644
1645\begin{methoddesc}[container object]{__contains__}{self, item}
1646Called to implement membership test operators. Should return true if
1647\var{item} is in \var{self}, false otherwise. For mapping objects,
1648this should consider the keys of the mapping rather than the values or
1649the key-item pairs.
1650\end{methoddesc}
1651
Fred Drakef6669171998-05-06 19:52:49 +00001652
Fred Drake3041b071998-10-21 00:25:32 +00001653\subsection{Additional methods for emulation of sequence types
Fred Drake61c77281998-07-28 19:34:22 +00001654 \label{sequence-methods}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001655
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001656The following optional methods can be defined to further emulate sequence
1657objects. Immutable sequences methods should at most only define
1658\method{__getslice__()}; mutable sequences might define all three
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001659three methods.
Fred Drakef6669171998-05-06 19:52:49 +00001660
Fred Drake1e42d8a1998-11-25 17:58:50 +00001661\begin{methoddesc}[sequence object]{__getslice__}{self, i, j}
Fred Drakea0073822000-08-18 02:42:14 +00001662\deprecated{2.0}{Support slice objects as parameters to the
1663\method{__getitem__()} method.}
Fred Draked82575d1998-08-28 20:03:12 +00001664Called to implement evaluation of \code{\var{self}[\var{i}:\var{j}]}.
1665The returned object should be of the same type as \var{self}. Note
1666that missing \var{i} or \var{j} in the slice expression are replaced
Fred Drakee15956b2000-04-03 04:51:13 +00001667by zero or \code{sys.maxint}, respectively. If negative indexes are
1668used in the slice, the length of the sequence is added to that index.
1669If the instance does not implement the \method{__len__()} method, an
1670\exception{AttributeError} is raised.
1671No guarantee is made that indexes adjusted this way are not still
1672negative. Indexes which are greater than the length of the sequence
1673are not modified.
Fred Drakea0073822000-08-18 02:42:14 +00001674If no \method{__getslice__()} is found, a slice
Thomas Wouters1d75a792000-08-17 22:37:32 +00001675object is created instead, and passed to \method{__getitem__()} instead.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001676\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001677
Fred Drake1e42d8a1998-11-25 17:58:50 +00001678\begin{methoddesc}[sequence object]{__setslice__}{self, i, j, sequence}
Fred Draked82575d1998-08-28 20:03:12 +00001679Called to implement assignment to \code{\var{self}[\var{i}:\var{j}]}.
1680Same notes for \var{i} and \var{j} as for \method{__getslice__()}.
Thomas Wouters1d75a792000-08-17 22:37:32 +00001681
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001682This method is deprecated. If no \method{__setslice__()} is found,
1683or for extended slicing of the form
1684\code{\var{self}[\var{i}:\var{j}:\var{k}]}, a
1685slice object is created, and passed to \method{__setitem__()},
1686instead of \method{__setslice__()} being called.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001687\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001688
Fred Drake1e42d8a1998-11-25 17:58:50 +00001689\begin{methoddesc}[sequence object]{__delslice__}{self, i, j}
Fred Draked82575d1998-08-28 20:03:12 +00001690Called to implement deletion of \code{\var{self}[\var{i}:\var{j}]}.
1691Same notes for \var{i} and \var{j} as for \method{__getslice__()}.
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001692This method is deprecated. If no \method{__delslice__()} is found,
1693or for extended slicing of the form
1694\code{\var{self}[\var{i}:\var{j}:\var{k}]}, a
1695slice object is created, and passed to \method{__delitem__()},
1696instead of \method{__delslice__()} being called.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001697\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001698
Fred Drakefb8ffe62001-04-13 15:54:41 +00001699Notice that these methods are only invoked when a single slice with a
1700single colon is used, and the slice method is available. For slice
1701operations involving extended slice notation, or in absence of the
1702slice methods, \method{__getitem__()}, \method{__setitem__()} or
1703\method{__delitem__()} is called with a slice object as argument.
Fred Drakef6669171998-05-06 19:52:49 +00001704
Fred Drakef89259782000-09-21 22:27:16 +00001705The following example demonstrate how to make your program or module
1706compatible with earlier versions of Python (assuming that methods
1707\method{__getitem__()}, \method{__setitem__()} and \method{__delitem__()}
1708support slice objects as arguments):
1709
1710\begin{verbatim}
1711class MyClass:
1712 ...
1713 def __getitem__(self, index):
1714 ...
1715 def __setitem__(self, index, value):
1716 ...
1717 def __delitem__(self, index):
1718 ...
1719
1720 if sys.version_info < (2, 0):
1721 # They won't be defined if version is at least 2.0 final
1722
1723 def __getslice__(self, i, j):
1724 return self[max(0, i):max(0, j):]
1725 def __setslice__(self, i, j, seq):
1726 self[max(0, i):max(0, j):] = seq
1727 def __delslice__(self, i, j):
1728 del self[max(0, i):max(0, j):]
1729 ...
1730\end{verbatim}
1731
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001732Note the calls to \function{max()}; these are necessary because of
1733the handling of negative indices before the
Fred Drakef89259782000-09-21 22:27:16 +00001734\method{__*slice__()} methods are called. When negative indexes are
1735used, the \method{__*item__()} methods receive them as provided, but
1736the \method{__*slice__()} methods get a ``cooked'' form of the index
1737values. For each negative index value, the length of the sequence is
1738added to the index before calling the method (which may still result
1739in a negative index); this is the customary handling of negative
1740indexes by the built-in sequence types, and the \method{__*item__()}
1741methods are expected to do this as well. However, since they should
1742already be doing that, negative indexes cannot be passed in; they must
1743be be constrained to the bounds of the sequence before being passed to
1744the \method{__*item__()} methods.
1745Calling \code{max(0, i)} conveniently returns the proper value.
1746
Fred Drake15988fd1999-02-12 18:14:57 +00001747
Fred Drake61c77281998-07-28 19:34:22 +00001748\subsection{Emulating numeric types\label{numeric-types}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001749
1750The following methods can be defined to emulate numeric objects.
1751Methods corresponding to operations that are not supported by the
1752particular kind of number implemented (e.g., bitwise operations for
1753non-integral numbers) should be left undefined.
Fred Drakef6669171998-05-06 19:52:49 +00001754
Fred Drakeb8943701999-05-10 13:43:22 +00001755\begin{methoddesc}[numeric object]{__add__}{self, other}
1756\methodline[numeric object]{__sub__}{self, other}
1757\methodline[numeric object]{__mul__}{self, other}
Fred Drake3e2aca42001-08-14 20:28:08 +00001758\methodline[numeric object]{__floordiv__}{self, other}
Fred Drakeb8943701999-05-10 13:43:22 +00001759\methodline[numeric object]{__mod__}{self, other}
1760\methodline[numeric object]{__divmod__}{self, other}
1761\methodline[numeric object]{__pow__}{self, other\optional{, modulo}}
1762\methodline[numeric object]{__lshift__}{self, other}
1763\methodline[numeric object]{__rshift__}{self, other}
1764\methodline[numeric object]{__and__}{self, other}
1765\methodline[numeric object]{__xor__}{self, other}
1766\methodline[numeric object]{__or__}{self, other}
Fred Drake3e2aca42001-08-14 20:28:08 +00001767These methods are
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001768called to implement the binary arithmetic operations (\code{+},
Fred Drake3e2aca42001-08-14 20:28:08 +00001769\code{-}, \code{*}, \code{//}, \code{\%},
Fred Draked82575d1998-08-28 20:03:12 +00001770\function{divmod()}\bifuncindex{divmod},
Fred Drakefb8ffe62001-04-13 15:54:41 +00001771\function{pow()}\bifuncindex{pow}, \code{**}, \code{<}\code{<},
1772\code{>}\code{>}, \code{\&}, \code{\^}, \code{|}). For instance, to
1773evaluate the expression \var{x}\code{+}\var{y}, where \var{x} is an
1774instance of a class that has an \method{__add__()} method,
Fred Drake3e2aca42001-08-14 20:28:08 +00001775\code{\var{x}.__add__(\var{y})} is called. The \method{__divmod__()}
1776method should be the equivalent to using \method{__floordiv__()} and
1777\method{__mod__()}; it should not be related to \method{__truediv__()}
1778(described below). Note that
Fred Draked82575d1998-08-28 20:03:12 +00001779\method{__pow__()} should be defined to accept an optional third
1780argument if the ternary version of the built-in
1781\function{pow()}\bifuncindex{pow} function is to be supported.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001782\end{methoddesc}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001783
Fred Drake3e2aca42001-08-14 20:28:08 +00001784\begin{methoddesc}[numeric object]{__div__}{self, other}
1785\methodline[numeric object]{__truediv__}{self, other}
1786The division operator (\code{/}) is implemented by these methods. The
1787\method{__truediv__()} method is used when \code{__future__.division}
1788is in effect, otherwise \method{__div__()} is used. If only one of
1789these two methods is defined, the object will not support division in
1790the alternate context; \exception{TypeError} will be raised instead.
1791\end{methoddesc}
1792
Fred Drakeb8943701999-05-10 13:43:22 +00001793\begin{methoddesc}[numeric object]{__radd__}{self, other}
1794\methodline[numeric object]{__rsub__}{self, other}
1795\methodline[numeric object]{__rmul__}{self, other}
1796\methodline[numeric object]{__rdiv__}{self, other}
Raymond Hettinger10cbe8d2002-06-20 06:12:37 +00001797\methodline[numeric object]{__rtruediv__}{self, other}
1798\methodline[numeric object]{__rfloordiv__}{self, other}
Fred Drakeb8943701999-05-10 13:43:22 +00001799\methodline[numeric object]{__rmod__}{self, other}
1800\methodline[numeric object]{__rdivmod__}{self, other}
1801\methodline[numeric object]{__rpow__}{self, other}
1802\methodline[numeric object]{__rlshift__}{self, other}
1803\methodline[numeric object]{__rrshift__}{self, other}
1804\methodline[numeric object]{__rand__}{self, other}
1805\methodline[numeric object]{__rxor__}{self, other}
1806\methodline[numeric object]{__ror__}{self, other}
Fred Drake3e2aca42001-08-14 20:28:08 +00001807These methods are
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001808called to implement the binary arithmetic operations (\code{+},
Fred Draked82575d1998-08-28 20:03:12 +00001809\code{-}, \code{*}, \code{/}, \code{\%},
1810\function{divmod()}\bifuncindex{divmod},
Fred Drakefb8ffe62001-04-13 15:54:41 +00001811\function{pow()}\bifuncindex{pow}, \code{**}, \code{<}\code{<},
1812\code{>}\code{>}, \code{\&}, \code{\^}, \code{|}) with reflected
1813(swapped) operands. These functions are only called if the left
1814operand does not support the corresponding operation. For instance,
1815to evaluate the expression \var{x}\code{-}\var{y}, where \var{y} is an
1816instance of a class that has an \method{__rsub__()} method,
1817\code{\var{y}.__rsub__(\var{x})} is called. Note that ternary
1818\function{pow()}\bifuncindex{pow} will not try calling
1819\method{__rpow__()} (the coercion rules would become too
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001820complicated).
Fred Drake1e42d8a1998-11-25 17:58:50 +00001821\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001822
Thomas Woutersdc90cc22000-12-11 23:11:51 +00001823\begin{methoddesc}[numeric object]{__iadd__}{self, other}
1824\methodline[numeric object]{__isub__}{self, other}
1825\methodline[numeric object]{__imul__}{self, other}
1826\methodline[numeric object]{__idiv__}{self, other}
Raymond Hettinger10cbe8d2002-06-20 06:12:37 +00001827\methodline[numeric object]{__itruediv__}{self, other}
1828\methodline[numeric object]{__ifloordiv__}{self, other}
1829\methodline[numeric object]{__imod__}{self, other}
Thomas Woutersdc90cc22000-12-11 23:11:51 +00001830\methodline[numeric object]{__ipow__}{self, other\optional{, modulo}}
1831\methodline[numeric object]{__ilshift__}{self, other}
1832\methodline[numeric object]{__irshift__}{self, other}
1833\methodline[numeric object]{__iand__}{self, other}
1834\methodline[numeric object]{__ixor__}{self, other}
1835\methodline[numeric object]{__ior__}{self, other}
Fred Drakefb8ffe62001-04-13 15:54:41 +00001836These methods are called to implement the augmented arithmetic
1837operations (\code{+=}, \code{-=}, \code{*=}, \code{/=}, \code{\%=},
1838\code{**=}, \code{<}\code{<=}, \code{>}\code{>=}, \code{\&=},
Fred Drakea3788642003-07-23 15:18:03 +00001839\code{\textasciicircum=}, \code{|=}). These methods should attempt to do the
Fred Drakefb8ffe62001-04-13 15:54:41 +00001840operation in-place (modifying \var{self}) and return the result (which
1841could be, but does not have to be, \var{self}). If a specific method
1842is not defined, the augmented operation falls back to the normal
1843methods. For instance, to evaluate the expression
1844\var{x}\code{+=}\var{y}, where \var{x} is an instance of a class that
1845has an \method{__iadd__()} method, \code{\var{x}.__iadd__(\var{y})} is
1846called. If \var{x} is an instance of a class that does not define a
1847\method{__iadd()} method, \code{\var{x}.__add__(\var{y})} and
1848\code{\var{y}.__radd__(\var{x})} are considered, as with the
1849evaluation of \var{x}\code{+}\var{y}.
Thomas Woutersdc90cc22000-12-11 23:11:51 +00001850\end{methoddesc}
1851
Fred Drakeb8943701999-05-10 13:43:22 +00001852\begin{methoddesc}[numeric object]{__neg__}{self}
1853\methodline[numeric object]{__pos__}{self}
1854\methodline[numeric object]{__abs__}{self}
1855\methodline[numeric object]{__invert__}{self}
Fred Drakefb8ffe62001-04-13 15:54:41 +00001856Called to implement the unary arithmetic operations (\code{-},
1857\code{+}, \function{abs()}\bifuncindex{abs} and \code{\~{}}).
Fred Drake1e42d8a1998-11-25 17:58:50 +00001858\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001859
Fred Drakeb8943701999-05-10 13:43:22 +00001860\begin{methoddesc}[numeric object]{__complex__}{self}
1861\methodline[numeric object]{__int__}{self}
1862\methodline[numeric object]{__long__}{self}
1863\methodline[numeric object]{__float__}{self}
Fred Draked82575d1998-08-28 20:03:12 +00001864Called to implement the built-in functions
Fred Drake15988fd1999-02-12 18:14:57 +00001865\function{complex()}\bifuncindex{complex},
1866\function{int()}\bifuncindex{int}, \function{long()}\bifuncindex{long},
Fred Draked82575d1998-08-28 20:03:12 +00001867and \function{float()}\bifuncindex{float}. Should return a value of
1868the appropriate type.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001869\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001870
Fred Drakeb8943701999-05-10 13:43:22 +00001871\begin{methoddesc}[numeric object]{__oct__}{self}
1872\methodline[numeric object]{__hex__}{self}
Fred Draked82575d1998-08-28 20:03:12 +00001873Called to implement the built-in functions
1874\function{oct()}\bifuncindex{oct} and
1875\function{hex()}\bifuncindex{hex}. Should return a string value.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001876\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001877
Fred Drakeb8943701999-05-10 13:43:22 +00001878\begin{methoddesc}[numeric object]{__coerce__}{self, other}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001879Called to implement ``mixed-mode'' numeric arithmetic. Should either
Fred Draked82575d1998-08-28 20:03:12 +00001880return a 2-tuple containing \var{self} and \var{other} converted to
Fred Drakeb8943701999-05-10 13:43:22 +00001881a common numeric type, or \code{None} if conversion is impossible. When
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001882the common type would be the type of \code{other}, it is sufficient to
1883return \code{None}, since the interpreter will also ask the other
1884object to attempt a coercion (but sometimes, if the implementation of
1885the other type cannot be changed, it is useful to do the conversion to
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001886the other type here). A return value of \code{NotImplemented} is
1887equivalent to returning \code{None}.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001888\end{methoddesc}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001889
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001890\subsection{Coercion rules\label{coercion-rules}}
1891
1892This section used to document the rules for coercion. As the language
1893has evolved, the coercion rules have become hard to document
1894precisely; documenting what one version of one particular
1895implementation does is undesirable. Instead, here are some informal
1896guidelines regarding coercion. In Python 3.0, coercion will not be
1897supported.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001898
1899\begin{itemize}
1900
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001901\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001902
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001903If the left operand of a \% operator is a string or Unicode object, no
1904coercion takes place and the string formatting operation is invoked
1905instead.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001906
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001907\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001908
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001909It is no longer recommended to define a coercion operation.
1910Mixed-mode operations on types that don't define coercion pass the
1911original arguments to the operation.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001912
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001913\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001914
Fred Drake293dd4b2002-06-04 16:25:57 +00001915New-style classes (those derived from \class{object}) never invoke the
1916\method{__coerce__()} method in response to a binary operator; the only
1917time \method{__coerce__()} is invoked is when the built-in function
1918\function{coerce()} is called.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001919
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001920\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001921
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001922For most intents and purposes, an operator that returns
1923\code{NotImplemented} is treated the same as one that is not
1924implemented at all.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001925
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001926\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001927
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001928Below, \method{__op__()} and \method{__rop__()} are used to signify
1929the generic method names corresponding to an operator;
1930\method{__iop__} is used for the corresponding in-place operator. For
1931example, for the operator `\code{+}', \method{__add__()} and
1932\method{__radd__()} are used for the left and right variant of the
1933binary operator, and \method{__iadd__} for the in-place variant.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001934
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001935\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001936
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001937For objects \var{x} and \var{y}, first \code{\var{x}.__op__(\var{y})}
1938is tried. If this is not implemented or returns \code{NotImplemented},
1939\code{\var{y}.__rop__(\var{x})} is tried. If this is also not
Fred Drake293dd4b2002-06-04 16:25:57 +00001940implemented or returns \code{NotImplemented}, a \exception{TypeError}
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001941exception is raised. But see the following exception:
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001942
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001943\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001944
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001945Exception to the previous item: if the left operand is an instance of
1946a built-in type or a new-style class, and the right operand is an
1947instance of a proper subclass of that type or class, the right
Fred Drake293dd4b2002-06-04 16:25:57 +00001948operand's \method{__rop__()} method is tried \emph{before} the left
1949operand's \method{__op__()} method. This is done so that a subclass can
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001950completely override binary operators. Otherwise, the left operand's
1951__op__ method would always accept the right operand: when an instance
1952of a given class is expected, an instance of a subclass of that class
1953is always acceptable.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001954
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001955\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001956
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001957When either operand type defines a coercion, this coercion is called
Fred Drake293dd4b2002-06-04 16:25:57 +00001958before that type's \method{__op__()} or \method{__rop__()} method is
1959called, but no sooner. If the coercion returns an object of a
1960different type for the operand whose coercion is invoked, part of the
1961process is redone using the new object.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001962
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001963\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001964
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001965When an in-place operator (like `\code{+=}') is used, if the left
Fred Drake293dd4b2002-06-04 16:25:57 +00001966operand implements \method{__iop__()}, it is invoked without any
1967coercion. When the operation falls back to \method{__op__()} and/or
1968\method{__rop__()}, the normal coercion rules apply.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001969
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001970\item
1971
1972In \var{x}\code{+}\var{y}, if \var{x} is a sequence that implements
1973sequence concatenation, sequence concatenation is invoked.
1974
1975\item
1976
1977In \var{x}\code{*}\var{y}, if one operator is a sequence that
1978implements sequence repetition, and the other is an integer
Fred Drake293dd4b2002-06-04 16:25:57 +00001979(\class{int} or \class{long}), sequence repetition is invoked.
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001980
1981\item
1982
Fred Drake293dd4b2002-06-04 16:25:57 +00001983Rich comparisons (implemented by methods \method{__eq__()} and so on)
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001984never use coercion. Three-way comparison (implemented by
Fred Drake293dd4b2002-06-04 16:25:57 +00001985\method{__cmp__()}) does use coercion under the same conditions as
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001986other binary operations use it.
1987
1988\item
1989
Fred Drake293dd4b2002-06-04 16:25:57 +00001990In the current implementation, the built-in numeric types \class{int},
1991\class{long} and \class{float} do not use coercion; the type
1992\class{complex} however does use it. The difference can become
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001993apparent when subclassing these types. Over time, the type
Fred Drake293dd4b2002-06-04 16:25:57 +00001994\class{complex} may be fixed to avoid coercion. All these types
1995implement a \method{__coerce__()} method, for use by the built-in
1996\function{coerce()} function.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001997
1998\end{itemize}