blob: 46388b2faaebe3b674dd0bfa1cc97c62b2db4ab6 [file] [log] [blame]
Fred Drake61c77281998-07-28 19:34:22 +00001\chapter{Data model\label{datamodel}}
Fred Drakef6669171998-05-06 19:52:49 +00002
Fred Drake2829f1c2001-06-23 05:27:20 +00003
Fred Drake61c77281998-07-28 19:34:22 +00004\section{Objects, values and types\label{objects}}
Fred Drakef6669171998-05-06 19:52:49 +00005
6\dfn{Objects} are Python's abstraction for data. All data in a Python
7program is represented by objects or by relations between objects.
8(In a sense, and in conformance to Von Neumann's model of a
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00009``stored program computer,'' code is also represented by objects.)
Fred Drakef6669171998-05-06 19:52:49 +000010\index{object}
11\index{data}
12
13Every object has an identity, a type and a value. An object's
14\emph{identity} never changes once it has been created; you may think
Fred Drake293dd4b2002-06-04 16:25:57 +000015of it as the object's address in memory. The `\keyword{is}' operator
Fred Drake82385871998-10-01 20:40:43 +000016compares the identity of two objects; the
17\function{id()}\bifuncindex{id} function returns an integer
18representing its identity (currently implemented as its address).
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000019An object's \dfn{type} is
Raymond Hettingere701dcb2003-01-19 13:08:18 +000020also unchangeable.\footnote{Since Python 2.2, a gradual merging of
21types and classes has been started that makes this and a few other
22assertions made in this manual not 100\% accurate and complete:
23for example, it \emph{is} now possible in some cases to change an
24object's type, under certain controlled conditions. Until this manual
25undergoes extensive revision, it must now be taken as authoritative
26only regarding ``classic classes'', that are still the default, for
27compatibility purposes, in Python 2.2 and 2.3.}
28An object's type determines the operations that the object
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000029supports (e.g., ``does it have a length?'') and also defines the
Fred Drake82385871998-10-01 20:40:43 +000030possible values for objects of that type. The
31\function{type()}\bifuncindex{type} function returns an object's type
32(which is an object itself). The \emph{value} of some
Fred Drakef6669171998-05-06 19:52:49 +000033objects can change. Objects whose value can change are said to be
34\emph{mutable}; objects whose value is unchangeable once they are
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000035created are called \emph{immutable}.
Guido van Rossum264bd591999-02-23 16:40:55 +000036(The value of an immutable container object that contains a reference
37to a mutable object can change when the latter's value is changed;
38however the container is still considered immutable, because the
39collection of objects it contains cannot be changed. So, immutability
40is not strictly the same as having an unchangeable value, it is more
41subtle.)
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000042An object's mutability is determined by its type; for instance,
43numbers, strings and tuples are immutable, while dictionaries and
44lists are mutable.
Fred Drakef6669171998-05-06 19:52:49 +000045\index{identity of an object}
46\index{value of an object}
47\index{type of an object}
48\index{mutable object}
49\index{immutable object}
50
51Objects are never explicitly destroyed; however, when they become
52unreachable they may be garbage-collected. An implementation is
Barry Warsaw92a6ed91998-08-07 16:33:51 +000053allowed to postpone garbage collection or omit it altogether --- it is
54a matter of implementation quality how garbage collection is
Fred Drakef6669171998-05-06 19:52:49 +000055implemented, as long as no objects are collected that are still
56reachable. (Implementation note: the current implementation uses a
Fred Drakec8e82812001-01-22 17:46:18 +000057reference-counting scheme with (optional) delayed detection of
Raymond Hettingere701dcb2003-01-19 13:08:18 +000058cyclically linked garbage, which collects most objects as soon as they
Fred Drakec8e82812001-01-22 17:46:18 +000059become unreachable, but is not guaranteed to collect garbage
60containing circular references. See the
61\citetitle[../lib/module-gc.html]{Python Library Reference} for
62information on controlling the collection of cyclic garbage.)
Fred Drakef6669171998-05-06 19:52:49 +000063\index{garbage collection}
64\index{reference counting}
65\index{unreachable object}
66
67Note that the use of the implementation's tracing or debugging
68facilities may keep objects alive that would normally be collectable.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000069Also note that catching an exception with a
Fred Drake4856d011999-01-12 04:15:20 +000070`\keyword{try}...\keyword{except}' statement may keep objects alive.
Fred Drakef6669171998-05-06 19:52:49 +000071
72Some objects contain references to ``external'' resources such as open
73files or windows. It is understood that these resources are freed
74when the object is garbage-collected, but since garbage collection is
75not guaranteed to happen, such objects also provide an explicit way to
76release the external resource, usually a \method{close()} method.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000077Programs are strongly recommended to explicitly close such
Fred Drake4856d011999-01-12 04:15:20 +000078objects. The `\keyword{try}...\keyword{finally}' statement provides
79a convenient way to do this.
Fred Drakef6669171998-05-06 19:52:49 +000080
81Some objects contain references to other objects; these are called
82\emph{containers}. Examples of containers are tuples, lists and
83dictionaries. The references are part of a container's value. In
84most cases, when we talk about the value of a container, we imply the
85values, not the identities of the contained objects; however, when we
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000086talk about the mutability of a container, only the identities of
87the immediately contained objects are implied. So, if an immutable
88container (like a tuple)
89contains a reference to a mutable object, its value changes
90if that mutable object is changed.
Fred Drakef6669171998-05-06 19:52:49 +000091\index{container}
92
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000093Types affect almost all aspects of object behavior. Even the importance
Fred Drakef6669171998-05-06 19:52:49 +000094of object identity is affected in some sense: for immutable types,
95operations that compute new values may actually return a reference to
96any existing object with the same type and value, while for mutable
Guido van Rossum83b2f8a1998-07-23 17:12:46 +000097objects this is not allowed. E.g., after
Fred Drake82385871998-10-01 20:40:43 +000098\samp{a = 1; b = 1},
Fred Drakef6669171998-05-06 19:52:49 +000099\code{a} and \code{b} may or may not refer to the same object with the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000100value one, depending on the implementation, but after
Fred Drake82385871998-10-01 20:40:43 +0000101\samp{c = []; d = []}, \code{c} and \code{d}
Fred Drakef6669171998-05-06 19:52:49 +0000102are guaranteed to refer to two different, unique, newly created empty
103lists.
Fred Drake82385871998-10-01 20:40:43 +0000104(Note that \samp{c = d = []} assigns the same object to both
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000105\code{c} and \code{d}.)
Fred Drakef6669171998-05-06 19:52:49 +0000106
Fred Drake2829f1c2001-06-23 05:27:20 +0000107
Fred Drake61c77281998-07-28 19:34:22 +0000108\section{The standard type hierarchy\label{types}}
Fred Drakef6669171998-05-06 19:52:49 +0000109
110Below is a list of the types that are built into Python. Extension
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000111modules (written in C, Java, or other languages, depending on
112the implementation) can define additional types. Future versions of
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000113Python may add types to the type hierarchy (e.g., rational
Fred Drakef6669171998-05-06 19:52:49 +0000114numbers, efficiently stored arrays of integers, etc.).
115\index{type}
116\indexii{data}{type}
117\indexii{type}{hierarchy}
118\indexii{extension}{module}
119\indexii{C}{language}
120
121Some of the type descriptions below contain a paragraph listing
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000122`special attributes.' These are attributes that provide access to the
Fred Drakef6669171998-05-06 19:52:49 +0000123implementation and are not intended for general use. Their definition
Fred Drake35705512001-12-03 17:32:27 +0000124may change in the future.
Fred Drakef6669171998-05-06 19:52:49 +0000125\index{attribute}
126\indexii{special}{attribute}
127\indexiii{generic}{special}{attribute}
Fred Drakef6669171998-05-06 19:52:49 +0000128
129\begin{description}
130
131\item[None]
132This type has a single value. There is a single object with this value.
133This object is accessed through the built-in name \code{None}.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000134It is used to signify the absence of a value in many situations, e.g.,
135it is returned from functions that don't explicitly return anything.
136Its truth value is false.
Fred Drake7a700b82004-01-01 05:43:53 +0000137\obindex{None}
Fred Drakef6669171998-05-06 19:52:49 +0000138
Neil Schemenauer48c2eb92001-01-04 01:25:50 +0000139\item[NotImplemented]
140This type has a single value. There is a single object with this value.
141This object is accessed through the built-in name \code{NotImplemented}.
Guido van Rossumab782dd2001-01-18 15:17:06 +0000142Numeric methods and rich comparison methods may return this value if
143they do not implement the operation for the operands provided. (The
144interpreter will then try the reflected operation, or some other
145fallback, depending on the operator.) Its truth value is true.
Fred Drake7a700b82004-01-01 05:43:53 +0000146\obindex{NotImplemented}
Neil Schemenauer48c2eb92001-01-04 01:25:50 +0000147
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000148\item[Ellipsis]
149This type has a single value. There is a single object with this value.
150This object is accessed through the built-in name \code{Ellipsis}.
Fred Drake82385871998-10-01 20:40:43 +0000151It is used to indicate the presence of the \samp{...} syntax in a
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000152slice. Its truth value is true.
Fred Drakec0a02c02002-04-16 02:03:05 +0000153\obindex{Ellipsis}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000154
Fred Drakef6669171998-05-06 19:52:49 +0000155\item[Numbers]
156These are created by numeric literals and returned as results by
157arithmetic operators and arithmetic built-in functions. Numeric
158objects are immutable; once created their value never changes. Python
159numbers are of course strongly related to mathematical numbers, but
160subject to the limitations of numerical representation in computers.
Fred Drakef6669171998-05-06 19:52:49 +0000161\obindex{numeric}
162
Fred Drakeb3384d32001-05-14 16:04:22 +0000163Python distinguishes between integers, floating point numbers, and
164complex numbers:
Fred Drakef6669171998-05-06 19:52:49 +0000165
166\begin{description}
167\item[Integers]
168These represent elements from the mathematical set of whole numbers.
169\obindex{integer}
170
Guido van Rossum77f6a652002-04-03 22:41:51 +0000171There are three types of integers:
Fred Drakef6669171998-05-06 19:52:49 +0000172
173\begin{description}
174
175\item[Plain integers]
176These represent numbers in the range -2147483648 through 2147483647.
177(The range may be larger on machines with a larger natural word
178size, but not smaller.)
Fred Drakee15956b2000-04-03 04:51:13 +0000179When the result of an operation would fall outside this range, the
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000180result is normally returned as a long integer (in some cases, the
181exception \exception{OverflowError} is raised instead).
Fred Drakef6669171998-05-06 19:52:49 +0000182For the purpose of shift and mask operations, integers are assumed to
183have a binary, 2's complement notation using 32 or more bits, and
184hiding no bits from the user (i.e., all 4294967296 different bit
185patterns correspond to different values).
186\obindex{plain integer}
187\withsubitem{(built-in exception)}{\ttindex{OverflowError}}
188
189\item[Long integers]
190These represent numbers in an unlimited range, subject to available
191(virtual) memory only. For the purpose of shift and mask operations,
192a binary representation is assumed, and negative numbers are
193represented in a variant of 2's complement which gives the illusion of
194an infinite string of sign bits extending to the left.
195\obindex{long integer}
196
Guido van Rossum77f6a652002-04-03 22:41:51 +0000197\item[Booleans]
198These represent the truth values False and True. The two objects
199representing the values False and True are the only Boolean objects.
200The Boolean type is a subtype of plain integers, and Boolean values
201behave like the values 0 and 1, respectively, in almost all contexts,
202the exception being that when converted to a string, the strings
203\code{"False"} or \code{"True"} are returned, respectively.
204\obindex{Boolean}
205\ttindex{False}
206\ttindex{True}
207
Fred Drakef6669171998-05-06 19:52:49 +0000208\end{description} % Integers
209
210The rules for integer representation are intended to give the most
211meaningful interpretation of shift and mask operations involving
212negative integers and the least surprises when switching between the
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000213plain and long integer domains. Any operation except left shift,
Fred Drakef6669171998-05-06 19:52:49 +0000214if it yields a result in the plain integer domain without causing
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000215overflow, will yield the same result in the long integer domain or
Fred Drakef6669171998-05-06 19:52:49 +0000216when using mixed operands.
217\indexii{integer}{representation}
218
219\item[Floating point numbers]
220These represent machine-level double precision floating point numbers.
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000221You are at the mercy of the underlying machine architecture (and
222C or Java implementation) for the accepted range and handling of overflow.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000223Python does not support single-precision floating point numbers; the
Fred Drake6e5e1d92001-07-14 02:12:27 +0000224savings in processor and memory usage that are usually the reason for using
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000225these is dwarfed by the overhead of using objects in Python, so there
226is no reason to complicate the language with two kinds of floating
227point numbers.
Fred Drakef6669171998-05-06 19:52:49 +0000228\obindex{floating point}
229\indexii{floating point}{number}
230\indexii{C}{language}
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000231\indexii{Java}{language}
Fred Drakef6669171998-05-06 19:52:49 +0000232
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000233\item[Complex numbers]
234These represent complex numbers as a pair of machine-level double
235precision floating point numbers. The same caveats apply as for
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000236floating point numbers. The real and imaginary parts of a complex
237number \code{z} can be retrieved through the read-only attributes
238\code{z.real} and \code{z.imag}.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000239\obindex{complex}
240\indexii{complex}{number}
241
Fred Drakef6669171998-05-06 19:52:49 +0000242\end{description} % Numbers
243
Guido van Rossum77f6a652002-04-03 22:41:51 +0000244
Fred Drakef6669171998-05-06 19:52:49 +0000245\item[Sequences]
Fred Drake230d17d2001-02-22 21:28:04 +0000246These represent finite ordered sets indexed by non-negative numbers.
Fred Drakef6669171998-05-06 19:52:49 +0000247The built-in function \function{len()}\bifuncindex{len} returns the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000248number of items of a sequence.
Thomas Woutersf9b526d2000-07-16 19:05:38 +0000249When the length of a sequence is \var{n}, the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000250index set contains the numbers 0, 1, \ldots, \var{n}-1. Item
Fred Drakef6669171998-05-06 19:52:49 +0000251\var{i} of sequence \var{a} is selected by \code{\var{a}[\var{i}]}.
Fred Drakee15956b2000-04-03 04:51:13 +0000252\obindex{sequence}
Fred Drakef6669171998-05-06 19:52:49 +0000253\index{index operation}
254\index{item selection}
255\index{subscription}
256
257Sequences also support slicing: \code{\var{a}[\var{i}:\var{j}]}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000258selects all items with index \var{k} such that \var{i} \code{<=}
Fred Drakef6669171998-05-06 19:52:49 +0000259\var{k} \code{<} \var{j}. When used as an expression, a slice is a
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000260sequence of the same type. This implies that the index set is
261renumbered so that it starts at 0.
Fred Drakef6669171998-05-06 19:52:49 +0000262\index{slicing}
263
Michael W. Hudson5efaf7e2002-06-11 10:55:12 +0000264Some sequences also support ``extended slicing'' with a third ``step''
265parameter: \code{\var{a}[\var{i}:\var{j}:\var{k}]} selects all items
266of \var{a} with index \var{x} where \code{\var{x} = \var{i} +
267\var{n}*\var{k}}, \var{n} \code{>=} \code{0} and \var{i} \code{<=}
268\var{x} \code{<} \var{j}.
269\index{extended slicing}
270
Fred Drakef6669171998-05-06 19:52:49 +0000271Sequences are distinguished according to their mutability:
272
273\begin{description}
Fred Drake4856d011999-01-12 04:15:20 +0000274
Fred Drakef6669171998-05-06 19:52:49 +0000275\item[Immutable sequences]
276An object of an immutable sequence type cannot change once it is
277created. (If the object contains references to other objects,
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000278these other objects may be mutable and may be changed; however,
Fred Drakef6669171998-05-06 19:52:49 +0000279the collection of objects directly referenced by an immutable object
280cannot change.)
281\obindex{immutable sequence}
282\obindex{immutable}
283
284The following types are immutable sequences:
285
286\begin{description}
287
288\item[Strings]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000289The items of a string are characters. There is no separate
290character type; a character is represented by a string of one item.
Fred Drakef6669171998-05-06 19:52:49 +0000291Characters represent (at least) 8-bit bytes. The built-in
292functions \function{chr()}\bifuncindex{chr} and
293\function{ord()}\bifuncindex{ord} convert between characters and
294nonnegative integers representing the byte values. Bytes with the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000295values 0-127 usually represent the corresponding \ASCII{} values, but
296the interpretation of values is up to the program. The string
297data type is also used to represent arrays of bytes, e.g., to hold data
Fred Drakef6669171998-05-06 19:52:49 +0000298read from a file.
299\obindex{string}
300\index{character}
301\index{byte}
Fred Drakec37b65e2001-11-28 07:26:15 +0000302\index{ASCII@\ASCII}
Fred Drakef6669171998-05-06 19:52:49 +0000303
Fred Drakec37b65e2001-11-28 07:26:15 +0000304(On systems whose native character set is not \ASCII, strings may use
Fred Drakef6669171998-05-06 19:52:49 +0000305EBCDIC in their internal representation, provided the functions
306\function{chr()} and \function{ord()} implement a mapping between \ASCII{} and
307EBCDIC, and string comparison preserves the \ASCII{} order.
308Or perhaps someone can propose a better rule?)
Fred Drakec37b65e2001-11-28 07:26:15 +0000309\index{ASCII@\ASCII}
Fred Drakef6669171998-05-06 19:52:49 +0000310\index{EBCDIC}
311\index{character set}
312\indexii{string}{comparison}
313\bifuncindex{chr}
314\bifuncindex{ord}
315
Fred Drakef0aff8e2000-04-06 13:57:21 +0000316\item[Unicode]
Fred Drake5ec22f22002-09-24 21:09:13 +0000317The items of a Unicode object are Unicode code units. A Unicode code
318unit is represented by a Unicode object of one item and can hold
319either a 16-bit or 32-bit value representing a Unicode ordinal (the
320maximum value for the ordinal is given in \code{sys.maxunicode}, and
321depends on how Python is configured at compile time). Surrogate pairs
322may be present in the Unicode object, and will be reported as two
323separate items. The built-in functions
Fred Drakef0aff8e2000-04-06 13:57:21 +0000324\function{unichr()}\bifuncindex{unichr} and
Fred Drake5ec22f22002-09-24 21:09:13 +0000325\function{ord()}\bifuncindex{ord} convert between code units and
Fred Drakef0aff8e2000-04-06 13:57:21 +0000326nonnegative integers representing the Unicode ordinals as defined in
327the Unicode Standard 3.0. Conversion from and to other encodings are
328possible through the Unicode method \method{encode} and the built-in
Fred Drake5ec22f22002-09-24 21:09:13 +0000329function \function{unicode()}.\bifuncindex{unicode}
Fred Drakef0aff8e2000-04-06 13:57:21 +0000330\obindex{unicode}
331\index{character}
332\index{integer}
Fred Drake8b3ce9e2000-04-06 14:00:14 +0000333\index{Unicode}
Fred Drakef0aff8e2000-04-06 13:57:21 +0000334
Fred Drakef6669171998-05-06 19:52:49 +0000335\item[Tuples]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000336The items of a tuple are arbitrary Python objects.
337Tuples of two or more items are formed by comma-separated lists
338of expressions. A tuple of one item (a `singleton') can be formed
Fred Drakef6669171998-05-06 19:52:49 +0000339by affixing a comma to an expression (an expression by itself does
340not create a tuple, since parentheses must be usable for grouping of
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000341expressions). An empty tuple can be formed by an empty pair of
Fred Drakef6669171998-05-06 19:52:49 +0000342parentheses.
343\obindex{tuple}
344\indexii{singleton}{tuple}
345\indexii{empty}{tuple}
346
347\end{description} % Immutable sequences
348
349\item[Mutable sequences]
350Mutable sequences can be changed after they are created. The
351subscription and slicing notations can be used as the target of
352assignment and \keyword{del} (delete) statements.
Thomas Woutersf9b526d2000-07-16 19:05:38 +0000353\obindex{mutable sequence}
Fred Drakef6669171998-05-06 19:52:49 +0000354\obindex{mutable}
355\indexii{assignment}{statement}
356\index{delete}
357\stindex{del}
358\index{subscription}
359\index{slicing}
360
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000361There is currently a single intrinsic mutable sequence type:
Fred Drakef6669171998-05-06 19:52:49 +0000362
363\begin{description}
364
365\item[Lists]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000366The items of a list are arbitrary Python objects. Lists are formed
Fred Drakef6669171998-05-06 19:52:49 +0000367by placing a comma-separated list of expressions in square brackets.
368(Note that there are no special cases needed to form lists of length 0
369or 1.)
370\obindex{list}
371
372\end{description} % Mutable sequences
373
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000374The extension module \module{array}\refstmodindex{array} provides an
375additional example of a mutable sequence type.
376
377
Fred Drakef6669171998-05-06 19:52:49 +0000378\end{description} % Sequences
379
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000380\item[Mappings]
Fred Drakef6669171998-05-06 19:52:49 +0000381These represent finite sets of objects indexed by arbitrary index sets.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000382The subscript notation \code{a[k]} selects the item indexed
Fred Drakef6669171998-05-06 19:52:49 +0000383by \code{k} from the mapping \code{a}; this can be used in
384expressions and as the target of assignments or \keyword{del} statements.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000385The built-in function \function{len()} returns the number of items
Fred Drakef6669171998-05-06 19:52:49 +0000386in a mapping.
387\bifuncindex{len}
388\index{subscription}
389\obindex{mapping}
390
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000391There is currently a single intrinsic mapping type:
Fred Drakef6669171998-05-06 19:52:49 +0000392
393\begin{description}
394
395\item[Dictionaries]
Fred Drake8cdee961999-02-23 18:50:38 +0000396These\obindex{dictionary} represent finite sets of objects indexed by
397nearly arbitrary values. The only types of values not acceptable as
398keys are values containing lists or dictionaries or other mutable
399types that are compared by value rather than by object identity, the
400reason being that the efficient implementation of dictionaries
401requires a key's hash value to remain constant.
Fred Drakef6669171998-05-06 19:52:49 +0000402Numeric types used for keys obey the normal rules for numeric
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000403comparison: if two numbers compare equal (e.g., \code{1} and
Fred Drakef6669171998-05-06 19:52:49 +0000404\code{1.0}) then they can be used interchangeably to index the same
405dictionary entry.
406
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000407Dictionaries are mutable; they can be created by the
Fred Drake62364ff2003-03-20 18:17:16 +0000408\code{\{...\}} notation (see section~\ref{dict}, ``Dictionary
Fred Drake8cdee961999-02-23 18:50:38 +0000409Displays'').
Fred Drakef6669171998-05-06 19:52:49 +0000410
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000411The extension modules \module{dbm}\refstmodindex{dbm},
412\module{gdbm}\refstmodindex{gdbm}, \module{bsddb}\refstmodindex{bsddb}
413provide additional examples of mapping types.
414
Fred Drakef6669171998-05-06 19:52:49 +0000415\end{description} % Mapping types
416
417\item[Callable types]
Fred Drake8cdee961999-02-23 18:50:38 +0000418These\obindex{callable} are the types to which the function call
Fred Drake62364ff2003-03-20 18:17:16 +0000419operation (see section~\ref{calls}, ``Calls'') can be applied:
Fred Drakef6669171998-05-06 19:52:49 +0000420\indexii{function}{call}
421\index{invocation}
422\indexii{function}{argument}
Fred Drakef6669171998-05-06 19:52:49 +0000423
424\begin{description}
425
426\item[User-defined functions]
427A user-defined function object is created by a function definition
Fred Drake62364ff2003-03-20 18:17:16 +0000428(see section~\ref{function}, ``Function definitions''). It should be
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000429called with an argument
Fred Drakef6669171998-05-06 19:52:49 +0000430list containing the same number of items as the function's formal
431parameter list.
432\indexii{user-defined}{function}
433\obindex{function}
434\obindex{user-defined function}
435
Michael W. Hudson5e897952004-08-12 18:12:44 +0000436Special attributes:
Barry Warsaw7a5e80e2001-02-27 03:36:30 +0000437
Michael W. Hudson5e897952004-08-12 18:12:44 +0000438\begin{tableiii}{lll}{member}{Attribute}{Meaning}{}
439 \lineiii{func_doc}{The function's documentation string, or
440 \code{None} if unavailable}{Writable}
441
442 \lineiii{__doc__}{Another way of spelling
443 \member{func_doc}}{Writable}
444
445 \lineiii{func_name}{The function's name}{Writable}
446
447 \lineiii{__name__}{Another way of spelling
448 \member{func_name}}{Writable}
449
450 \lineiii{__module__}{The name of the module the function was defined
451 in, or \code{None} if unavailable.}{Writable}
452
Raymond Hettingerf21569e2005-04-26 05:18:53 +0000453 \lineiii{func_defaults}{A tuple containing default argument values
Michael W. Hudson5e897952004-08-12 18:12:44 +0000454 for those arguments that have defaults, or \code{None} if no
455 arguments have a default value}{Writable}
456
457 \lineiii{func_code}{The code object representing the compiled
458 function body.}{Writable}
459
460 \lineiii{func_globals}{A reference to the dictionary that holds the
461 function's global variables --- the global namespace of the module
462 in which the function was defined.}{Read-only}
463
464 \lineiii{func_dict}{The namespace supporting arbitrary function
465 attributes.}{Writable}
466
467 \lineiii{func_closure}{\code{None} or a tuple of cells that contain
468 bindings for the function's free variables.}{Read-only}
469\end{tableiii}
470
471Most of the attributes labelled ``Writable'' check the type of the
472assigned value.
473
474\versionchanged[\code{func_name} is now writable]{2.4}
475
476Function objects also support getting and setting arbitrary
477attributes, which can be used, for example, to attach metadata to
478functions. Regular attribute dot-notation is used to get and set such
479attributes. \emph{Note that the current implementation only supports
480function attributes on user-defined functions. Function attributes on
481built-in functions may be supported in the future.}
482
483Additional information about a function's definition can be retrieved
484from its code object; see the description of internal types below.
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000485
Fred Drake4856d011999-01-12 04:15:20 +0000486\withsubitem{(function attribute)}{
487 \ttindex{func_doc}
488 \ttindex{__doc__}
489 \ttindex{__name__}
Jeremy Hyltonf9b0cc72003-01-31 18:52:45 +0000490 \ttindex{__module__}
Barry Warsaw7a5e80e2001-02-27 03:36:30 +0000491 \ttindex{__dict__}
Fred Drake4856d011999-01-12 04:15:20 +0000492 \ttindex{func_defaults}
Jeremy Hylton26c49b62002-04-01 17:58:39 +0000493 \ttindex{func_closure}
Fred Drake4856d011999-01-12 04:15:20 +0000494 \ttindex{func_code}
Barry Warsaw7a5e80e2001-02-27 03:36:30 +0000495 \ttindex{func_globals}
496 \ttindex{func_dict}}
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000497\indexii{global}{namespace}
Fred Drakef6669171998-05-06 19:52:49 +0000498
499\item[User-defined methods]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000500A user-defined method object combines a class, a class instance (or
Fred Drake8dd6ffd2001-08-02 21:34:53 +0000501\code{None}) and any callable object (normally a user-defined
502function).
Fred Drakef6669171998-05-06 19:52:49 +0000503\obindex{method}
504\obindex{user-defined method}
505\indexii{user-defined}{method}
Fred Drakef6669171998-05-06 19:52:49 +0000506
507Special read-only attributes: \member{im_self} is the class instance
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000508object, \member{im_func} is the function object;
Raymond Hettinger03ec6d52003-06-25 18:29:36 +0000509\member{im_class} is the class of \member{im_self} for bound methods
510or the class that asked for the method for unbound methods;
Fred Drake82385871998-10-01 20:40:43 +0000511\member{__doc__} is the method's documentation (same as
512\code{im_func.__doc__}); \member{__name__} is the method name (same as
Jeremy Hyltonf9b0cc72003-01-31 18:52:45 +0000513\code{im_func.__name__}); \member{__module__} is the name of the
514module the method was defined in, or \code{None} if unavailable.
Fred Drakef9d58032001-12-07 23:13:53 +0000515\versionchanged[\member{im_self} used to refer to the class that
516 defined the method]{2.2}
Fred Drake4856d011999-01-12 04:15:20 +0000517\withsubitem{(method attribute)}{
Jeremy Hyltonf9b0cc72003-01-31 18:52:45 +0000518 \ttindex{__doc__}
519 \ttindex{__name__}
520 \ttindex{__module__}
Fred Drake4856d011999-01-12 04:15:20 +0000521 \ttindex{im_func}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000522 \ttindex{im_self}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000523
Barry Warsaw7a5e80e2001-02-27 03:36:30 +0000524Methods also support accessing (but not setting) the arbitrary
525function attributes on the underlying function object.
526
Fred Drakeea690c42003-07-17 05:26:53 +0000527User-defined method objects may be created when getting an attribute
528of a class (perhaps via an instance of that class), if that attribute
529is a user-defined function object, an unbound user-defined method object,
530or a class method object.
531When the attribute is a user-defined method object, a new
532method object is only created if the class from which it is being
533retrieved is the same as, or a derived class of, the class stored
534in the original method object; otherwise, the original method object
535is used as it is.
536
537When a user-defined method object is created by retrieving
538a user-defined function object from a class, its \member{im_self}
539attribute is \code{None} and the method object is said to be unbound.
540When one is created by retrieving a user-defined function object
541from a class via one of its instances, its \member{im_self} attribute
542is the instance, and the method object is said to be bound.
543In either case, the new method's \member{im_class} attribute
544is the class from which the retrieval takes place, and
545its \member{im_func} attribute is the original function object.
546\withsubitem{(method attribute)}{
547 \ttindex{im_class}\ttindex{im_func}\ttindex{im_self}}
548
549When a user-defined method object is created by retrieving another
550method object from a class or instance, the behaviour is the same
551as for a function object, except that the \member{im_func} attribute
552of the new instance is not the original method object but its
553\member{im_func} attribute.
554\withsubitem{(method attribute)}{
555 \ttindex{im_func}}
556
557When a user-defined method object is created by retrieving a
558class method object from a class or instance, its \member{im_self}
559attribute is the class itself (the same as the \member{im_class}
560attribute), and its \member{im_func} attribute is the function
561object underlying the class method.
Fred Drake4856d011999-01-12 04:15:20 +0000562\withsubitem{(method attribute)}{
Fred Drake35c09f22000-06-28 20:15:47 +0000563 \ttindex{im_class}\ttindex{im_func}\ttindex{im_self}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000564
565When an unbound user-defined method object is called, the underlying
Fred Drake82385871998-10-01 20:40:43 +0000566function (\member{im_func}) is called, with the restriction that the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000567first argument must be an instance of the proper class
Fred Drake82385871998-10-01 20:40:43 +0000568(\member{im_class}) or of a derived class thereof.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000569
570When a bound user-defined method object is called, the underlying
Fred Drake82385871998-10-01 20:40:43 +0000571function (\member{im_func}) is called, inserting the class instance
572(\member{im_self}) in front of the argument list. For instance, when
573\class{C} is a class which contains a definition for a function
574\method{f()}, and \code{x} is an instance of \class{C}, calling
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000575\code{x.f(1)} is equivalent to calling \code{C.f(x, 1)}.
576
Fred Drakeea690c42003-07-17 05:26:53 +0000577When a user-defined method object is derived from a class method object,
578the ``class instance'' stored in \member{im_self} will actually be the
579class itself, so that calling either \code{x.f(1)} or \code{C.f(1)} is
580equivalent to calling \code{f(C,1)} where \code{f} is the underlying
581function.
582
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000583Note that the transformation from function object to (unbound or
584bound) method object happens each time the attribute is retrieved from
585the class or instance. In some cases, a fruitful optimization is to
586assign the attribute to a local variable and call that local variable.
587Also notice that this transformation only happens for user-defined
588functions; other callable objects (and all non-callable objects) are
Fred Drake35c09f22000-06-28 20:15:47 +0000589retrieved without transformation. It is also important to note that
590user-defined functions which are attributes of a class instance are
591not converted to bound methods; this \emph{only} happens when the
592function is an attribute of the class.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000593
Fred Drakee31e9ce2001-12-11 21:10:08 +0000594\item[Generator functions\index{generator!function}\index{generator!iterator}]
595A function or method which uses the \keyword{yield} statement (see
596section~\ref{yield}, ``The \keyword{yield} statement'') is called a
597\dfn{generator function}. Such a function, when called, always
598returns an iterator object which can be used to execute the body of
599the function: calling the iterator's \method{next()} method will
600cause the function to execute until it provides a value using the
601\keyword{yield} statement. When the function executes a
602\keyword{return} statement or falls off the end, a
603\exception{StopIteration} exception is raised and the iterator will
604have reached the end of the set of values to be returned.
605
Fred Drakef6669171998-05-06 19:52:49 +0000606\item[Built-in functions]
Georg Brandl1c330eb2005-07-02 10:27:31 +0000607A built-in function object is a wrapper around a C function. Examples
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000608of built-in functions are \function{len()} and \function{math.sin()}
609(\module{math} is a standard built-in module).
610The number and type of the arguments are
Fred Drakef6669171998-05-06 19:52:49 +0000611determined by the C function.
Fred Drake82385871998-10-01 20:40:43 +0000612Special read-only attributes: \member{__doc__} is the function's
613documentation string, or \code{None} if unavailable; \member{__name__}
614is the function's name; \member{__self__} is set to \code{None} (but see
Jeremy Hyltonf9b0cc72003-01-31 18:52:45 +0000615the next item); \member{__module__} is the name of the module the
616function was defined in or \code{None} if unavailable.
Fred Drakef6669171998-05-06 19:52:49 +0000617\obindex{built-in function}
618\obindex{function}
619\indexii{C}{language}
620
621\item[Built-in methods]
622This is really a different disguise of a built-in function, this time
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000623containing an object passed to the C function as an implicit extra
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000624argument. An example of a built-in method is
Raymond Hettingere701dcb2003-01-19 13:08:18 +0000625\code{\var{alist}.append()}, assuming
626\var{alist} is a list object.
Fred Drake82385871998-10-01 20:40:43 +0000627In this case, the special read-only attribute \member{__self__} is set
Fred Drakee31e9ce2001-12-11 21:10:08 +0000628to the object denoted by \var{list}.
Fred Drakef6669171998-05-06 19:52:49 +0000629\obindex{built-in method}
630\obindex{method}
631\indexii{built-in}{method}
632
Fred Drakee37b4ed2003-07-15 20:45:16 +0000633\item[Class Types]
634Class types, or ``new-style classes,'' are callable. These objects
635normally act as factories for new instances of themselves, but
636variations are possible for class types that override
637\method{__new__()}. The arguments of the call are passed to
638\method{__new__()} and, in the typical case, to \method{__init__()} to
639initialize the new instance.
640
641\item[Classic Classes]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000642Class objects are described below. When a class object is called,
643a new class instance (also described below) is created and
Fred Drakef6669171998-05-06 19:52:49 +0000644returned. This implies a call to the class's \method{__init__()} method
645if it has one. Any arguments are passed on to the \method{__init__()}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000646method. If there is no \method{__init__()} method, the class must be called
Fred Drakef6669171998-05-06 19:52:49 +0000647without arguments.
Fred Drake1e42d8a1998-11-25 17:58:50 +0000648\withsubitem{(object method)}{\ttindex{__init__()}}
Fred Drakef6669171998-05-06 19:52:49 +0000649\obindex{class}
650\obindex{class instance}
651\obindex{instance}
652\indexii{class object}{call}
653
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000654\item[Class instances]
655Class instances are described below. Class instances are callable
Fred Drake82385871998-10-01 20:40:43 +0000656only when the class has a \method{__call__()} method; \code{x(arguments)}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000657is a shorthand for \code{x.__call__(arguments)}.
658
Fred Drakef6669171998-05-06 19:52:49 +0000659\end{description}
660
661\item[Modules]
Fred Drake62364ff2003-03-20 18:17:16 +0000662Modules are imported by the \keyword{import} statement (see
Fred Draked51ce7d2003-07-15 22:03:00 +0000663section~\ref{import}, ``The \keyword{import} statement'').%
664\stindex{import}\obindex{module}
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000665A module object has a namespace implemented by a dictionary object
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000666(this is the dictionary referenced by the func_globals attribute of
667functions defined in the module). Attribute references are translated
668to lookups in this dictionary, e.g., \code{m.x} is equivalent to
669\code{m.__dict__["x"]}.
670A module object does not contain the code object used to
Fred Drakef6669171998-05-06 19:52:49 +0000671initialize the module (since it isn't needed once the initialization
672is done).
Fred Drakef6669171998-05-06 19:52:49 +0000673
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000674Attribute assignment updates the module's namespace dictionary,
Fred Drake82385871998-10-01 20:40:43 +0000675e.g., \samp{m.x = 1} is equivalent to \samp{m.__dict__["x"] = 1}.
Fred Drakef6669171998-05-06 19:52:49 +0000676
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000677Special read-only attribute: \member{__dict__} is the module's
678namespace as a dictionary object.
Fred Drake1e42d8a1998-11-25 17:58:50 +0000679\withsubitem{(module attribute)}{\ttindex{__dict__}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000680
681Predefined (writable) attributes: \member{__name__}
682is the module's name; \member{__doc__} is the
683module's documentation string, or
Fred Drake82385871998-10-01 20:40:43 +0000684\code{None} if unavailable; \member{__file__} is the pathname of the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000685file from which the module was loaded, if it was loaded from a file.
Fred Drake82385871998-10-01 20:40:43 +0000686The \member{__file__} attribute is not present for C{} modules that are
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000687statically linked into the interpreter; for extension modules loaded
688dynamically from a shared library, it is the pathname of the shared
689library file.
Fred Drake4856d011999-01-12 04:15:20 +0000690\withsubitem{(module attribute)}{
691 \ttindex{__name__}
692 \ttindex{__doc__}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000693 \ttindex{__file__}}
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000694\indexii{module}{namespace}
Fred Drakef6669171998-05-06 19:52:49 +0000695
696\item[Classes]
Fred Drake62364ff2003-03-20 18:17:16 +0000697Class objects are created by class definitions (see
698section~\ref{class}, ``Class definitions'').
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000699A class has a namespace implemented by a dictionary object.
700Class attribute references are translated to
701lookups in this dictionary,
Fred Drake82385871998-10-01 20:40:43 +0000702e.g., \samp{C.x} is translated to \samp{C.__dict__["x"]}.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000703When the attribute name is not found
Fred Drakef6669171998-05-06 19:52:49 +0000704there, the attribute search continues in the base classes. The search
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000705is depth-first, left-to-right in the order of occurrence in the
Fred Drakef6669171998-05-06 19:52:49 +0000706base class list.
Fred Drakeea690c42003-07-17 05:26:53 +0000707
708When a class attribute reference (for class \class{C}, say)
709would yield a user-defined function object or
710an unbound user-defined method object whose associated class is either
711\class{C} or one of its base classes, it is transformed into an unbound
712user-defined method object whose \member{im_class} attribute is~\class{C}.
713When it would yield a class method object, it is transformed into
714a bound user-defined method object whose \member{im_class} and
715\member{im_self} attributes are both~\class{C}. When it would yield
716a static method object, it is transformed into the object wrapped
717by the static method object. See section~\ref{descriptors} for another
718way in which attributes retrieved from a class may differ from those
719actually contained in its \member{__dict__}.
Fred Drakef6669171998-05-06 19:52:49 +0000720\obindex{class}
721\obindex{class instance}
722\obindex{instance}
723\indexii{class object}{call}
724\index{container}
725\obindex{dictionary}
726\indexii{class}{attribute}
727
728Class attribute assignments update the class's dictionary, never the
729dictionary of a base class.
730\indexiii{class}{attribute}{assignment}
731
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000732A class object can be called (see above) to yield a class instance (see
733below).
Fred Drakef6669171998-05-06 19:52:49 +0000734\indexii{class object}{call}
735
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000736Special attributes: \member{__name__} is the class name;
737\member{__module__} is the module name in which the class was defined;
Guido van Rossumdfb658c1998-07-23 17:54:36 +0000738\member{__dict__} is the dictionary containing the class's namespace;
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000739\member{__bases__} is a tuple (possibly empty or a singleton)
740containing the base classes, in the order of their occurrence in the
Fred Drake82385871998-10-01 20:40:43 +0000741base class list; \member{__doc__} is the class's documentation string,
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000742or None if undefined.
Fred Drake4856d011999-01-12 04:15:20 +0000743\withsubitem{(class attribute)}{
744 \ttindex{__name__}
745 \ttindex{__module__}
746 \ttindex{__dict__}
747 \ttindex{__bases__}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000748 \ttindex{__doc__}}
Fred Drakef6669171998-05-06 19:52:49 +0000749
750\item[Class instances]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000751A class instance is created by calling a class object (see above).
752A class instance has a namespace implemented as a dictionary which
753is the first place in which
Fred Drakef6669171998-05-06 19:52:49 +0000754attribute references are searched. When an attribute is not found
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000755there, and the instance's class has an attribute by that name,
756the search continues with the class attributes. If a class attribute
Fred Drakeea690c42003-07-17 05:26:53 +0000757is found that is a user-defined function object or an unbound
758user-defined method object whose associated class is the class
759(call it~\class{C}) of the instance for which the attribute reference
760was initiated or one of its bases,
761it is transformed into a bound user-defined method object whose
762\member{im_class} attribute is~\class{C} whose \member{im_self} attribute
763is the instance. Static method and class method objects are also
764transformed, as if they had been retrieved from class~\class{C};
765see above under ``Classes''. See section~\ref{descriptors} for
766another way in which attributes of a class retrieved via its
767instances may differ from the objects actually stored in the
768class's \member{__dict__}.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000769If no class attribute is found, and the object's class has a
Fred Drake82385871998-10-01 20:40:43 +0000770\method{__getattr__()} method, that is called to satisfy the lookup.
Fred Drakef6669171998-05-06 19:52:49 +0000771\obindex{class instance}
772\obindex{instance}
773\indexii{class}{instance}
774\indexii{class instance}{attribute}
775
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000776Attribute assignments and deletions update the instance's dictionary,
Fred Drake82385871998-10-01 20:40:43 +0000777never a class's dictionary. If the class has a \method{__setattr__()} or
778\method{__delattr__()} method, this is called instead of updating the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000779instance dictionary directly.
Fred Drakef6669171998-05-06 19:52:49 +0000780\indexiii{class instance}{attribute}{assignment}
781
782Class instances can pretend to be numbers, sequences, or mappings if
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000783they have methods with certain special names. See
Fred Drake62364ff2003-03-20 18:17:16 +0000784section~\ref{specialnames}, ``Special method names.''
Fred Drakee15956b2000-04-03 04:51:13 +0000785\obindex{numeric}
Fred Drakef6669171998-05-06 19:52:49 +0000786\obindex{sequence}
787\obindex{mapping}
788
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000789Special attributes: \member{__dict__} is the attribute
790dictionary; \member{__class__} is the instance's class.
Fred Drake4856d011999-01-12 04:15:20 +0000791\withsubitem{(instance attribute)}{
792 \ttindex{__dict__}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000793 \ttindex{__class__}}
Fred Drakef6669171998-05-06 19:52:49 +0000794
795\item[Files]
Fred Drakee15eb351999-11-10 16:13:25 +0000796A file\obindex{file} object represents an open file. File objects are
797created by the \function{open()}\bifuncindex{open} built-in function,
798and also by
799\withsubitem{(in module os)}{\ttindex{popen()}}\function{os.popen()},
800\function{os.fdopen()}, and the
801\method{makefile()}\withsubitem{(socket method)}{\ttindex{makefile()}}
802method of socket objects (and perhaps by other functions or methods
803provided by extension modules). The objects
804\ttindex{sys.stdin}\code{sys.stdin},
805\ttindex{sys.stdout}\code{sys.stdout} and
806\ttindex{sys.stderr}\code{sys.stderr} are initialized to file objects
807corresponding to the interpreter's standard\index{stdio} input, output
808and error streams. See the \citetitle[../lib/lib.html]{Python Library
809Reference} for complete documentation of file objects.
Fred Drake4856d011999-01-12 04:15:20 +0000810\withsubitem{(in module sys)}{
811 \ttindex{stdin}
812 \ttindex{stdout}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000813 \ttindex{stderr}}
Fred Drakee15eb351999-11-10 16:13:25 +0000814
Fred Drakef6669171998-05-06 19:52:49 +0000815
816\item[Internal types]
817A few types used internally by the interpreter are exposed to the user.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000818Their definitions may change with future versions of the interpreter,
Fred Drakef6669171998-05-06 19:52:49 +0000819but they are mentioned here for completeness.
820\index{internal type}
821\index{types, internal}
822
823\begin{description}
824
825\item[Code objects]
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000826Code objects represent \emph{byte-compiled} executable Python code, or
827\emph{bytecode}.
Fred Drakef6669171998-05-06 19:52:49 +0000828The difference between a code
829object and a function object is that the function object contains an
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000830explicit reference to the function's globals (the module in which it
831was defined), while a code object contains no context;
832also the default argument values are stored in the function object,
833not in the code object (because they represent values calculated at
834run-time). Unlike function objects, code objects are immutable and
835contain no references (directly or indirectly) to mutable objects.
836\index{bytecode}
Fred Drakef6669171998-05-06 19:52:49 +0000837\obindex{code}
838
Fred Drake1e42d8a1998-11-25 17:58:50 +0000839Special read-only attributes: \member{co_name} gives the function
840name; \member{co_argcount} is the number of positional arguments
841(including arguments with default values); \member{co_nlocals} is the
842number of local variables used by the function (including arguments);
843\member{co_varnames} is a tuple containing the names of the local
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000844variables (starting with the argument names); \member{co_cellvars} is
845a tuple containing the names of local variables that are referenced by
846nested functions; \member{co_freevars} is a tuple containing the names
Jeremy Hylton8392f362002-04-01 18:53:36 +0000847of free variables; \member{co_code} is a string representing the
848sequence of bytecode instructions;
Fred Drake1e42d8a1998-11-25 17:58:50 +0000849\member{co_consts} is a tuple containing the literals used by the
850bytecode; \member{co_names} is a tuple containing the names used by
851the bytecode; \member{co_filename} is the filename from which the code
852was compiled; \member{co_firstlineno} is the first line number of the
853function; \member{co_lnotab} is a string encoding the mapping from
Thomas Woutersf9b526d2000-07-16 19:05:38 +0000854byte code offsets to line numbers (for details see the source code of
Fred Drake1e42d8a1998-11-25 17:58:50 +0000855the interpreter); \member{co_stacksize} is the required stack size
856(including local variables); \member{co_flags} is an integer encoding
857a number of flags for the interpreter.
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000858
Fred Drake4856d011999-01-12 04:15:20 +0000859\withsubitem{(code object attribute)}{
860 \ttindex{co_argcount}
861 \ttindex{co_code}
862 \ttindex{co_consts}
863 \ttindex{co_filename}
864 \ttindex{co_firstlineno}
865 \ttindex{co_flags}
866 \ttindex{co_lnotab}
867 \ttindex{co_name}
868 \ttindex{co_names}
869 \ttindex{co_nlocals}
870 \ttindex{co_stacksize}
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000871 \ttindex{co_varnames}
872 \ttindex{co_cellvars}
873 \ttindex{co_freevars}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000874
Fred Drakee15956b2000-04-03 04:51:13 +0000875The following flag bits are defined for \member{co_flags}: bit
876\code{0x04} is set if the function uses the \samp{*arguments} syntax
877to accept an arbitrary number of positional arguments; bit
878\code{0x08} is set if the function uses the \samp{**keywords} syntax
Jeremy Hylton8392f362002-04-01 18:53:36 +0000879to accept arbitrary keyword arguments; bit \code{0x20} is set if the
Brett Cannon9e6fedd2003-06-15 22:57:44 +0000880function is a generator.
881\obindex{generator}
Jeremy Hylton8392f362002-04-01 18:53:36 +0000882
883Future feature declarations (\samp{from __future__ import division})
884also use bits in \member{co_flags} to indicate whether a code object
885was compiled with a particular feature enabled: bit \code{0x2000} is
886set if the function was compiled with future division enabled; bits
887\code{0x10} and \code{0x1000} were used in earlier versions of Python.
888
889Other bits in \member{co_flags} are reserved for internal use.
890
891If\index{documentation string} a code object represents a function,
892the first item in
Jeremy Hyltonaa90adc2001-03-23 17:23:50 +0000893\member{co_consts} is the documentation string of the function, or
894\code{None} if undefined.
Fred Drakef6669171998-05-06 19:52:49 +0000895
896\item[Frame objects]
897Frame objects represent execution frames. They may occur in traceback
898objects (see below).
899\obindex{frame}
900
901Special read-only attributes: \member{f_back} is to the previous
902stack frame (towards the caller), or \code{None} if this is the bottom
903stack frame; \member{f_code} is the code object being executed in this
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000904frame; \member{f_locals} is the dictionary used to look up local
905variables; \member{f_globals} is used for global variables;
Fred Drake82385871998-10-01 20:40:43 +0000906\member{f_builtins} is used for built-in (intrinsic) names;
907\member{f_restricted} is a flag indicating whether the function is
Michael W. Hudsoncfd38842002-12-17 16:15:34 +0000908executing in restricted execution mode; \member{f_lasti} gives the
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000909precise instruction (this is an index into the bytecode string of
Fred Drakef6669171998-05-06 19:52:49 +0000910the code object).
Fred Drake4856d011999-01-12 04:15:20 +0000911\withsubitem{(frame attribute)}{
912 \ttindex{f_back}
913 \ttindex{f_code}
914 \ttindex{f_globals}
915 \ttindex{f_locals}
Fred Drake4856d011999-01-12 04:15:20 +0000916 \ttindex{f_lasti}
917 \ttindex{f_builtins}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000918 \ttindex{f_restricted}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000919
Georg Brandl1c330eb2005-07-02 10:27:31 +0000920Special writable attributes: \member{f_trace}, if not \code{None}, is
921a function called at the start of each source code line (this is used
922by the debugger); \member{f_exc_type}, \member{f_exc_value},
923\member{f_exc_traceback} represent the last exception raised in the
924parent frame provided another exception was ever raised in the current
925frame (in all other cases they are None); \member{f_lineno} is the
926current line number of the frame --- writing to this from within a
927trace function jumps to the given line (only for the bottom-most
928frame). A debugger can implement a Jump command (aka Set Next
929Statement) by writing to f_lineno.
Fred Drake4856d011999-01-12 04:15:20 +0000930\withsubitem{(frame attribute)}{
931 \ttindex{f_trace}
932 \ttindex{f_exc_type}
933 \ttindex{f_exc_value}
Michael W. Hudsoncfd38842002-12-17 16:15:34 +0000934 \ttindex{f_exc_traceback}
935 \ttindex{f_lineno}}
Fred Drakef6669171998-05-06 19:52:49 +0000936
937\item[Traceback objects] \label{traceback}
938Traceback objects represent a stack trace of an exception. A
939traceback object is created when an exception occurs. When the search
940for an exception handler unwinds the execution stack, at each unwound
941level a traceback object is inserted in front of the current
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000942traceback. When an exception handler is entered, the stack trace is
943made available to the program.
Fred Drake62364ff2003-03-20 18:17:16 +0000944(See section~\ref{try}, ``The \code{try} statement.'')
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000945It is accessible as \code{sys.exc_traceback}, and also as the third
946item of the tuple returned by \code{sys.exc_info()}. The latter is
947the preferred interface, since it works correctly when the program is
948using multiple threads.
949When the program contains no suitable handler, the stack trace is written
Fred Drakef6669171998-05-06 19:52:49 +0000950(nicely formatted) to the standard error stream; if the interpreter is
951interactive, it is also made available to the user as
952\code{sys.last_traceback}.
953\obindex{traceback}
954\indexii{stack}{trace}
955\indexii{exception}{handler}
956\indexii{execution}{stack}
Fred Drake4856d011999-01-12 04:15:20 +0000957\withsubitem{(in module sys)}{
958 \ttindex{exc_info}
959 \ttindex{exc_traceback}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000960 \ttindex{last_traceback}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000961\ttindex{sys.exc_info}
Fred Drakef6669171998-05-06 19:52:49 +0000962\ttindex{sys.exc_traceback}
963\ttindex{sys.last_traceback}
964
965Special read-only attributes: \member{tb_next} is the next level in the
966stack trace (towards the frame where the exception occurred), or
967\code{None} if there is no next level; \member{tb_frame} points to the
968execution frame of the current level; \member{tb_lineno} gives the line
969number where the exception occurred; \member{tb_lasti} indicates the
970precise instruction. The line number and last instruction in the
971traceback may differ from the line number of its frame object if the
972exception occurred in a \keyword{try} statement with no matching
973except clause or with a finally clause.
Fred Drake4856d011999-01-12 04:15:20 +0000974\withsubitem{(traceback attribute)}{
975 \ttindex{tb_next}
976 \ttindex{tb_frame}
977 \ttindex{tb_lineno}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000978 \ttindex{tb_lasti}}
Fred Drakef6669171998-05-06 19:52:49 +0000979\stindex{try}
980
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000981\item[Slice objects]
982Slice objects are used to represent slices when \emph{extended slice
983syntax} is used. This is a slice using two colons, or multiple slices
984or ellipses separated by commas, e.g., \code{a[i:j:step]}, \code{a[i:j,
Raymond Hettinger03ec6d52003-06-25 18:29:36 +0000985k:l]}, or \code{a[..., i:j]}. They are also created by the built-in
Fred Drake1e42d8a1998-11-25 17:58:50 +0000986\function{slice()}\bifuncindex{slice} function.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000987
Thomas Woutersf9b526d2000-07-16 19:05:38 +0000988Special read-only attributes: \member{start} is the lower bound;
989\member{stop} is the upper bound; \member{step} is the step value; each is
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000990\code{None} if omitted. These attributes can have any type.
Fred Drake4856d011999-01-12 04:15:20 +0000991\withsubitem{(slice object attribute)}{
992 \ttindex{start}
993 \ttindex{stop}
Fred Drake1e42d8a1998-11-25 17:58:50 +0000994 \ttindex{step}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +0000995
Fred Drake5ec22f22002-09-24 21:09:13 +0000996Slice objects support one method:
997
998\begin{methoddesc}[slice]{indices}{self, length}
999This method takes a single integer argument \var{length} and computes
1000information about the extended slice that the slice object would
1001describe if applied to a sequence of \var{length} items. It returns a
1002tuple of three integers; respectively these are the \var{start} and
1003\var{stop} indices and the \var{step} or stride length of the slice.
1004Missing or out-of-bounds indices are handled in a manner consistent
1005with regular slices.
Michael W. Hudsonf0d777c2002-07-19 15:47:06 +00001006\versionadded{2.3}
Fred Drake5ec22f22002-09-24 21:09:13 +00001007\end{methoddesc}
Michael W. Hudsonf0d777c2002-07-19 15:47:06 +00001008
Fred Drakeea690c42003-07-17 05:26:53 +00001009\item[Static method objects]
1010Static method objects provide a way of defeating the transformation
1011of function objects to method objects described above. A static method
1012object is a wrapper around any other object, usually a user-defined
1013method object. When a static method object is retrieved from a class
1014or a class instance, the object actually returned is the wrapped object,
1015which is not subject to any further transformation. Static method
1016objects are not themselves callable, although the objects they
1017wrap usually are. Static method objects are created by the built-in
1018\function{staticmethod()} constructor.
1019
1020\item[Class method objects]
1021A class method object, like a static method object, is a wrapper
1022around another object that alters the way in which that object
1023is retrieved from classes and class instances. The behaviour of
1024class method objects upon such retrieval is described above,
1025under ``User-defined methods''. Class method objects are created
1026by the built-in \function{classmethod()} constructor.
1027
Fred Drakef6669171998-05-06 19:52:49 +00001028\end{description} % Internal types
1029
1030\end{description} % Types
1031
1032
Fred Drake61c77281998-07-28 19:34:22 +00001033\section{Special method names\label{specialnames}}
Fred Drakef6669171998-05-06 19:52:49 +00001034
1035A class can implement certain operations that are invoked by special
Fred Draked82575d1998-08-28 20:03:12 +00001036syntax (such as arithmetic operations or subscripting and slicing) by
Fred Drake7af9f4d2003-05-12 13:50:11 +00001037defining methods with special names.\indexii{operator}{overloading}
1038This is Python's approach to \dfn{operator overloading}, allowing
1039classes to define their own behavior with respect to language
1040operators. For instance, if a class defines
Fred Draked82575d1998-08-28 20:03:12 +00001041a method named \method{__getitem__()}, and \code{x} is an instance of
1042this class, then \code{x[i]} is equivalent to
Raymond Hettinger94153092002-05-12 03:09:25 +00001043\code{x.__getitem__(i)}. Except where mentioned, attempts to execute
1044an operation raise an exception when no appropriate method is defined.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001045\withsubitem{(mapping object method)}{\ttindex{__getitem__()}}
Fred Drakef6669171998-05-06 19:52:49 +00001046
Fred Drake0c475592000-12-07 04:49:34 +00001047When implementing a class that emulates any built-in type, it is
1048important that the emulation only be implemented to the degree that it
1049makes sense for the object being modelled. For example, some
1050sequences may work well with retrieval of individual elements, but
1051extracting a slice may not make sense. (One example of this is the
1052\class{NodeList} interface in the W3C's Document Object Model.)
1053
Fred Drakef6669171998-05-06 19:52:49 +00001054
Fred Drake61c77281998-07-28 19:34:22 +00001055\subsection{Basic customization\label{customization}}
Fred Drakef6669171998-05-06 19:52:49 +00001056
Greg Wardff564d32005-03-08 01:10:20 +00001057\begin{methoddesc}[object]{__new__}{cls\optional{, \moreargs}}
1058Called to create a new instance of class \var{cls}. \method{__new__()}
Georg Brandl15ad9352005-08-26 12:56:22 +00001059is a static method (special-cased so you need not declare it as such)
Greg Wardff564d32005-03-08 01:10:20 +00001060that takes the class of which an instance was requested as its first
1061argument. The remaining arguments are those passed to the object
1062constructor expression (the call to the class). The return value of
1063\method{__new__()} should be the new object instance (usually an
1064instance of \var{cls}).
1065
1066Typical implementations create a new instance of the class by invoking
1067the superclass's \method{__new__()} method using
1068\samp{super(\var{currentclass}, \var{cls}).__new__(\var{cls}[, ...])}
1069with appropriate arguments and then modifying the newly-created instance
1070as necessary before returning it.
1071
1072If \method{__new__()} returns an instance of \var{cls}, then the new
1073instance's \method{__init__()} method will be invoked like
1074\samp{__init__(\var{self}[, ...])}, where \var{self} is the new instance
1075and the remaining arguments are the same as were passed to
1076\method{__new__()}.
1077
1078If \method{__new__()} does not return an instance of \var{cls}, then the
1079new instance's \method{__init__()} method will not be invoked.
1080
1081\method{__new__()} is intended mainly to allow subclasses of
1082immutable types (like int, str, or tuple) to customize instance
1083creation.
1084\end{methoddesc}
1085
Fred Drake044bb4d2001-08-02 15:53:05 +00001086\begin{methoddesc}[object]{__init__}{self\optional{, \moreargs}}
1087Called\indexii{class}{constructor} when the instance is created. The
1088arguments are those passed to the class constructor expression. If a
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001089base class has an \method{__init__()} method, the derived class's
1090\method{__init__()} method, if any, must explicitly call it to ensure proper
Fred Drake044bb4d2001-08-02 15:53:05 +00001091initialization of the base class part of the instance; for example:
1092\samp{BaseClass.__init__(\var{self}, [\var{args}...])}. As a special
Martin v. Löwis95cf84a2003-10-19 07:32:24 +00001093constraint on constructors, no value may be returned; doing so will
Fred Drake044bb4d2001-08-02 15:53:05 +00001094cause a \exception{TypeError} to be raised at runtime.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001095\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001096
1097
Fred Drake1e42d8a1998-11-25 17:58:50 +00001098\begin{methoddesc}[object]{__del__}{self}
Guido van Rossum7c0240f1998-07-24 15:36:43 +00001099Called when the instance is about to be destroyed. This is also
1100called a destructor\index{destructor}. If a base class
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001101has a \method{__del__()} method, the derived class's \method{__del__()}
1102method, if any,
Fred Drakef6669171998-05-06 19:52:49 +00001103must explicitly call it to ensure proper deletion of the base class
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001104part of the instance. Note that it is possible (though not recommended!)
1105for the \method{__del__()}
Fred Drakef6669171998-05-06 19:52:49 +00001106method to postpone destruction of the instance by creating a new
1107reference to it. It may then be called at a later time when this new
1108reference is deleted. It is not guaranteed that
1109\method{__del__()} methods are called for objects that still exist when
1110the interpreter exits.
Fred Drakef6669171998-05-06 19:52:49 +00001111\stindex{del}
1112
Fred Drake591dd8f2001-12-14 22:52:41 +00001113\begin{notice}
1114\samp{del x} doesn't directly call
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001115\code{x.__del__()} --- the former decrements the reference count for
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001116\code{x} by one, and the latter is only called when \code{x}'s reference
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001117count reaches zero. Some common situations that may prevent the
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001118reference count of an object from going to zero include: circular
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001119references between objects (e.g., a doubly-linked list or a tree data
1120structure with parent and child pointers); a reference to the object
1121on the stack frame of a function that caught an exception (the
1122traceback stored in \code{sys.exc_traceback} keeps the stack frame
1123alive); or a reference to the object on the stack frame that raised an
1124unhandled exception in interactive mode (the traceback stored in
1125\code{sys.last_traceback} keeps the stack frame alive). The first
1126situation can only be remedied by explicitly breaking the cycles; the
Fred Drake591dd8f2001-12-14 22:52:41 +00001127latter two situations can be resolved by storing \code{None} in
1128\code{sys.exc_traceback} or \code{sys.last_traceback}. Circular
1129references which are garbage are detected when the option cycle
1130detector is enabled (it's on by default), but can only be cleaned up
1131if there are no Python-level \method{__del__()} methods involved.
1132Refer to the documentation for the \ulink{\module{gc}
1133module}{../lib/module-gc.html} for more information about how
1134\method{__del__()} methods are handled by the cycle detector,
1135particularly the description of the \code{garbage} value.
1136\end{notice}
Fred Drakef6669171998-05-06 19:52:49 +00001137
Fred Drake591dd8f2001-12-14 22:52:41 +00001138\begin{notice}[warning]
1139Due to the precarious circumstances under which
Fred Draked82575d1998-08-28 20:03:12 +00001140\method{__del__()} methods are invoked, exceptions that occur during their
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001141execution are ignored, and a warning is printed to \code{sys.stderr}
Fred Drake591dd8f2001-12-14 22:52:41 +00001142instead. Also, when \method{__del__()} is invoked in response to a module
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001143being deleted (e.g., when execution of the program is done), other
Fred Draked82575d1998-08-28 20:03:12 +00001144globals referenced by the \method{__del__()} method may already have been
1145deleted. For this reason, \method{__del__()} methods should do the
Raymond Hettingera0e4d6c2002-09-08 21:10:54 +00001146absolute minimum needed to maintain external invariants. Starting with
1147version 1.5, Python guarantees that globals whose name begins with a single
1148underscore are deleted from their module before other globals are deleted;
1149if no other references to such globals exist, this may help in assuring that
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001150imported modules are still available at the time when the
Fred Drake591dd8f2001-12-14 22:52:41 +00001151\method{__del__()} method is called.
1152\end{notice}
Fred Drake1e42d8a1998-11-25 17:58:50 +00001153\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001154
Fred Drake1e42d8a1998-11-25 17:58:50 +00001155\begin{methoddesc}[object]{__repr__}{self}
Fred Drake82385871998-10-01 20:40:43 +00001156Called by the \function{repr()}\bifuncindex{repr} built-in function
1157and by string conversions (reverse quotes) to compute the ``official''
Andrew M. Kuchling68abe832000-12-19 14:09:21 +00001158string representation of an object. If at all possible, this should
Guido van Rossum035f7e82000-12-19 04:18:13 +00001159look like a valid Python expression that could be used to recreate an
1160object with the same value (given an appropriate environment). If
1161this is not possible, a string of the form \samp{<\var{...some useful
1162description...}>} should be returned. The return value must be a
1163string object.
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001164If a class defines \method{__repr__()} but not \method{__str__()},
1165then \method{__repr__()} is also used when an ``informal'' string
1166representation of instances of that class is required.
Guido van Rossum035f7e82000-12-19 04:18:13 +00001167
1168This is typically used for debugging, so it is important that the
1169representation is information-rich and unambiguous.
Fred Drakef6669171998-05-06 19:52:49 +00001170\indexii{string}{conversion}
1171\indexii{reverse}{quotes}
1172\indexii{backward}{quotes}
1173\index{back-quotes}
Fred Drake1e42d8a1998-11-25 17:58:50 +00001174\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001175
Fred Drake1e42d8a1998-11-25 17:58:50 +00001176\begin{methoddesc}[object]{__str__}{self}
Fred Draked82575d1998-08-28 20:03:12 +00001177Called by the \function{str()}\bifuncindex{str} built-in function and
1178by the \keyword{print}\stindex{print} statement to compute the
Fred Drake82385871998-10-01 20:40:43 +00001179``informal'' string representation of an object. This differs from
1180\method{__repr__()} in that it does not have to be a valid Python
1181expression: a more convenient or concise representation may be used
Guido van Rossum035f7e82000-12-19 04:18:13 +00001182instead. The return value must be a string object.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001183\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001184
Guido van Rossumab782dd2001-01-18 15:17:06 +00001185\begin{methoddesc}[object]{__lt__}{self, other}
1186\methodline[object]{__le__}{self, other}
1187\methodline[object]{__eq__}{self, other}
1188\methodline[object]{__ne__}{self, other}
1189\methodline[object]{__gt__}{self, other}
1190\methodline[object]{__ge__}{self, other}
1191\versionadded{2.1}
1192These are the so-called ``rich comparison'' methods, and are called
1193for comparison operators in preference to \method{__cmp__()} below.
1194The correspondence between operator symbols and method names is as
1195follows:
1196\code{\var{x}<\var{y}} calls \code{\var{x}.__lt__(\var{y})},
1197\code{\var{x}<=\var{y}} calls \code{\var{x}.__le__(\var{y})},
1198\code{\var{x}==\var{y}} calls \code{\var{x}.__eq__(\var{y})},
1199\code{\var{x}!=\var{y}} and \code{\var{x}<>\var{y}} call
1200\code{\var{x}.__ne__(\var{y})},
1201\code{\var{x}>\var{y}} calls \code{\var{x}.__gt__(\var{y})}, and
1202\code{\var{x}>=\var{y}} calls \code{\var{x}.__ge__(\var{y})}.
1203These methods can return any value, but if the comparison operator is
1204used in a Boolean context, the return value should be interpretable as
1205a Boolean value, else a \exception{TypeError} will be raised.
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001206By convention, \code{False} is used for false and \code{True} for true.
Guido van Rossumab782dd2001-01-18 15:17:06 +00001207
Raymond Hettinger4d6e8fe2003-07-16 19:40:23 +00001208There are no implied relationships among the comparison operators.
Raymond Hettinger943277e2003-07-17 14:47:12 +00001209The truth of \code{\var{x}==\var{y}} does not imply that \code{\var{x}!=\var{y}}
Raymond Hettinger4d6e8fe2003-07-16 19:40:23 +00001210is false. Accordingly, when defining \method{__eq__}, one should also
1211define \method{__ne__} so that the operators will behave as expected.
1212
Guido van Rossumab782dd2001-01-18 15:17:06 +00001213There are no reflected (swapped-argument) versions of these methods
1214(to be used when the left argument does not support the operation but
1215the right argument does); rather, \method{__lt__()} and
1216\method{__gt__()} are each other's reflection, \method{__le__()} and
1217\method{__ge__()} are each other's reflection, and \method{__eq__()}
1218and \method{__ne__()} are their own reflection.
1219
1220Arguments to rich comparison methods are never coerced. A rich
1221comparison method may return \code{NotImplemented} if it does not
1222implement the operation for a given pair of arguments.
1223\end{methoddesc}
1224
Fred Drake1e42d8a1998-11-25 17:58:50 +00001225\begin{methoddesc}[object]{__cmp__}{self, other}
Guido van Rossumab782dd2001-01-18 15:17:06 +00001226Called by comparison operations if rich comparison (see above) is not
Fred Drake597bc1d2001-05-29 16:02:35 +00001227defined. Should return a negative integer if \code{self < other},
1228zero if \code{self == other}, a positive integer if \code{self >
1229other}. If no \method{__cmp__()}, \method{__eq__()} or
1230\method{__ne__()} operation is defined, class instances are compared
1231by object identity (``address''). See also the description of
1232\method{__hash__()} for some important notes on creating objects which
1233support custom comparison operations and are usable as dictionary
1234keys.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001235(Note: the restriction that exceptions are not propagated by
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001236\method{__cmp__()} has been removed since Python 1.5.)
Fred Drakef6669171998-05-06 19:52:49 +00001237\bifuncindex{cmp}
1238\index{comparisons}
Fred Drake1e42d8a1998-11-25 17:58:50 +00001239\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001240
Fred Drakee57a1142000-06-15 20:07:25 +00001241\begin{methoddesc}[object]{__rcmp__}{self, other}
Fred Drake445f8322001-01-04 15:11:48 +00001242 \versionchanged[No longer supported]{2.1}
Fred Drakee57a1142000-06-15 20:07:25 +00001243\end{methoddesc}
1244
Fred Drake1e42d8a1998-11-25 17:58:50 +00001245\begin{methoddesc}[object]{__hash__}{self}
Brett Cannona031a082004-06-29 04:14:02 +00001246Called for the key object for dictionary \obindex{dictionary}
Fred Draked82575d1998-08-28 20:03:12 +00001247operations, and by the built-in function
Fred Drakef6669171998-05-06 19:52:49 +00001248\function{hash()}\bifuncindex{hash}. Should return a 32-bit integer
1249usable as a hash value
1250for dictionary operations. The only required property is that objects
1251which compare equal have the same hash value; it is advised to somehow
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001252mix together (e.g., using exclusive or) the hash values for the
Fred Drakef6669171998-05-06 19:52:49 +00001253components of the object that also play a part in comparison of
1254objects. If a class does not define a \method{__cmp__()} method it should
1255not define a \method{__hash__()} operation either; if it defines
Fred Drake597bc1d2001-05-29 16:02:35 +00001256\method{__cmp__()} or \method{__eq__()} but not \method{__hash__()},
1257its instances will not be usable as dictionary keys. If a class
1258defines mutable objects and implements a \method{__cmp__()} or
1259\method{__eq__()} method, it should not implement \method{__hash__()},
1260since the dictionary implementation requires that a key's hash value
1261is immutable (if the object's hash value changes, it will be in the
1262wrong hash bucket).
Fred Drake1e42d8a1998-11-25 17:58:50 +00001263\withsubitem{(object method)}{\ttindex{__cmp__()}}
1264\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001265
Fred Drake1e42d8a1998-11-25 17:58:50 +00001266\begin{methoddesc}[object]{__nonzero__}{self}
Guido van Rossum77f6a652002-04-03 22:41:51 +00001267Called to implement truth value testing, and the built-in operation
1268\code{bool()}; should return \code{False} or \code{True}, or their
1269integer equivalents \code{0} or \code{1}.
1270When this method is not defined, \method{__len__()} is
Fred Draked82575d1998-08-28 20:03:12 +00001271called, if it is defined (see below). If a class defines neither
1272\method{__len__()} nor \method{__nonzero__()}, all its instances are
1273considered true.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001274\withsubitem{(mapping object method)}{\ttindex{__len__()}}
1275\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001276
Martin v. Löwis2a519f82002-04-11 12:39:35 +00001277\begin{methoddesc}[object]{__unicode__}{self}
1278Called to implement \function{unicode()}\bifuncindex{unicode} builtin;
1279should return a Unicode object. When this method is not defined, string
1280conversion is attempted, and the result of string conversion is converted
1281to Unicode using the system default encoding.
1282\end{methoddesc}
1283
Fred Drakef6669171998-05-06 19:52:49 +00001284
Fred Drake61c77281998-07-28 19:34:22 +00001285\subsection{Customizing attribute access\label{attribute-access}}
Fred Drakef6669171998-05-06 19:52:49 +00001286
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001287The following methods can be defined to customize the meaning of
1288attribute access (use of, assignment to, or deletion of \code{x.name})
1289for class instances.
Fred Drakef6669171998-05-06 19:52:49 +00001290
Fred Drake1e42d8a1998-11-25 17:58:50 +00001291\begin{methoddesc}[object]{__getattr__}{self, name}
Fred Drakef6669171998-05-06 19:52:49 +00001292Called when an attribute lookup has not found the attribute in the
1293usual places (i.e. it is not an instance attribute nor is it found in
1294the class tree for \code{self}). \code{name} is the attribute name.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001295This method should return the (computed) attribute value or raise an
Fred Draked82575d1998-08-28 20:03:12 +00001296\exception{AttributeError} exception.
Fred Drakef6669171998-05-06 19:52:49 +00001297
1298Note that if the attribute is found through the normal mechanism,
Fred Draked82575d1998-08-28 20:03:12 +00001299\method{__getattr__()} is not called. (This is an intentional
1300asymmetry between \method{__getattr__()} and \method{__setattr__()}.)
Fred Drakef6669171998-05-06 19:52:49 +00001301This is done both for efficiency reasons and because otherwise
Fred Draked82575d1998-08-28 20:03:12 +00001302\method{__setattr__()} would have no way to access other attributes of
Guido van Rossumd41eea02003-02-28 14:11:45 +00001303the instance. Note that at least for instance variables, you can fake
1304total control by not inserting any values in the instance attribute
1305dictionary (but instead inserting them in another object). See the
1306\method{__getattribute__()} method below for a way to actually get
1307total control in new-style classes.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001308\withsubitem{(object method)}{\ttindex{__setattr__()}}
1309\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001310
Fred Drake1e42d8a1998-11-25 17:58:50 +00001311\begin{methoddesc}[object]{__setattr__}{self, name, value}
Fred Drakef6669171998-05-06 19:52:49 +00001312Called when an attribute assignment is attempted. This is called
Fred Draked82575d1998-08-28 20:03:12 +00001313instead of the normal mechanism (i.e.\ store the value in the instance
1314dictionary). \var{name} is the attribute name, \var{value} is the
Fred Drakef6669171998-05-06 19:52:49 +00001315value to be assigned to it.
Fred Drakef6669171998-05-06 19:52:49 +00001316
Fred Draked82575d1998-08-28 20:03:12 +00001317If \method{__setattr__()} wants to assign to an instance attribute, it
1318should not simply execute \samp{self.\var{name} = value} --- this
1319would cause a recursive call to itself. Instead, it should insert the
1320value in the dictionary of instance attributes, e.g.,
Guido van Rossumd41eea02003-02-28 14:11:45 +00001321\samp{self.__dict__[\var{name}] = value}. For new-style classes,
1322rather than accessing the instance dictionary, it should call the base
1323class method with the same name, for example,
1324\samp{object.__setattr__(self, name, value)}.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001325\withsubitem{(instance attribute)}{\ttindex{__dict__}}
1326\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001327
Fred Drake1e42d8a1998-11-25 17:58:50 +00001328\begin{methoddesc}[object]{__delattr__}{self, name}
Fred Draked82575d1998-08-28 20:03:12 +00001329Like \method{__setattr__()} but for attribute deletion instead of
Fred Drake1e42d8a1998-11-25 17:58:50 +00001330assignment. This should only be implemented if \samp{del
1331obj.\var{name}} is meaningful for the object.
1332\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001333
Michael W. Hudson2ab1d082003-03-05 14:20:58 +00001334\subsubsection{More attribute access for new-style classes \label{new-style-attribute-access}}
Guido van Rossumd41eea02003-02-28 14:11:45 +00001335
1336The following methods only apply to new-style classes.
1337
1338\begin{methoddesc}[object]{__getattribute__}{self, name}
1339Called unconditionally to implement attribute accesses for instances
Georg Brandl1c330eb2005-07-02 10:27:31 +00001340of the class. If the class also defines \method{__getattr__}, the latter
1341will not be called unless \method{__getattribute__()} either calls it
1342explicitly or raises an \exception{AttributeError}.
Guido van Rossumd41eea02003-02-28 14:11:45 +00001343This method should return the (computed) attribute
1344value or raise an \exception{AttributeError} exception.
1345In order to avoid infinite recursion in this method, its
1346implementation should always call the base class method with the same
Fred Drake62364ff2003-03-20 18:17:16 +00001347name to access any attributes it needs, for example,
Guido van Rossumd41eea02003-02-28 14:11:45 +00001348\samp{object.__getattribute__(self, name)}.
1349\end{methoddesc}
1350
Michael W. Hudson2ab1d082003-03-05 14:20:58 +00001351\subsubsection{Implementing Descriptors \label{descriptors}}
Guido van Rossumd41eea02003-02-28 14:11:45 +00001352
1353The following methods only apply when an instance of the class
Fred Drake62364ff2003-03-20 18:17:16 +00001354containing the method (a so-called \emph{descriptor} class) appears in
Guido van Rossumd41eea02003-02-28 14:11:45 +00001355the class dictionary of another new-style class, known as the
1356\emph{owner} class. In the examples below, ``the attribute'' refers to
Fred Drake62364ff2003-03-20 18:17:16 +00001357the attribute whose name is the key of the property in the owner
Fred Drake67a521e2004-05-06 12:44:29 +00001358class' \code{__dict__}. Descriptors can only be implemented as
1359new-style classes themselves.
Guido van Rossumd41eea02003-02-28 14:11:45 +00001360
1361\begin{methoddesc}[object]{__get__}{self, instance, owner}
Fred Drake62364ff2003-03-20 18:17:16 +00001362Called to get the attribute of the owner class (class attribute access)
Raymond Hettinger68804312005-01-01 00:28:46 +00001363or of an instance of that class (instance attribute access).
Guido van Rossumd41eea02003-02-28 14:11:45 +00001364\var{owner} is always the owner class, while \var{instance} is the
1365instance that the attribute was accessed through, or \code{None} when
1366the attribute is accessed through the \var{owner}. This method should
1367return the (computed) attribute value or raise an
1368\exception{AttributeError} exception.
1369\end{methoddesc}
1370
1371\begin{methoddesc}[object]{__set__}{self, instance, value}
Michael W. Hudson2ab1d082003-03-05 14:20:58 +00001372Called to set the attribute on an instance \var{instance} of the owner
Guido van Rossumd41eea02003-02-28 14:11:45 +00001373class to a new value, \var{value}.
1374\end{methoddesc}
1375
1376\begin{methoddesc}[object]{__delete__}{self, instance}
Michael W. Hudson2ab1d082003-03-05 14:20:58 +00001377Called to delete the attribute on an instance \var{instance} of the
1378owner class.
Guido van Rossumd41eea02003-02-28 14:11:45 +00001379\end{methoddesc}
1380
Fred Drakef6669171998-05-06 19:52:49 +00001381
Fred Drake4db36612003-06-26 03:11:20 +00001382\subsubsection{Invoking Descriptors \label{descriptor-invocation}}
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001383
1384In general, a descriptor is an object attribute with ``binding behavior'',
1385one whose attribute access has been overridden by methods in the descriptor
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001386protocol: \method{__get__()}, \method{__set__()}, and \method{__delete__()}.
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001387If any of those methods are defined for an object, it is said to be a
1388descriptor.
1389
1390The default behavior for attribute access is to get, set, or delete the
1391attribute from an object's dictionary. For instance, \code{a.x} has a
1392lookup chain starting with \code{a.__dict__['x']}, then
1393\code{type(a).__dict__['x']}, and continuing
1394through the base classes of \code{type(a)} excluding metaclasses.
1395
1396However, if the looked-up value is an object defining one of the descriptor
1397methods, then Python may override the default behavior and invoke the
1398descriptor method instead. Where this occurs in the precedence chain depends
1399on which descriptor methods were defined and how they were called. Note that
1400descriptors are only invoked for new style objects or classes
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001401(ones that subclass \class{object()} or \class{type()}).
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001402
1403The starting point for descriptor invocation is a binding, \code{a.x}.
1404How the arguments are assembled depends on \code{a}:
1405
1406\begin{itemize}
1407
1408 \item[Direct Call] The simplest and least common call is when user code
1409 directly invokes a descriptor method: \code{x.__get__(a)}.
1410
1411 \item[Instance Binding] If binding to a new-style object instance,
1412 \code{a.x} is transformed into the call:
1413 \code{type(a).__dict__['x'].__get__(a, type(a))}.
1414
1415 \item[Class Binding] If binding to a new-style class, \code{A.x}
1416 is transformed into the call: \code{A.__dict__['x'].__get__(None, A)}.
1417
1418 \item[Super Binding] If \code{a} is an instance of \class{super},
1419 then the binding \code{super(B, obj).m()} searches
1420 \code{obj.__class__.__mro__} for the base class \code{A} immediately
1421 preceding \code{B} and then invokes the descriptor with the call:
1422 \code{A.__dict__['m'].__get__(obj, A)}.
1423
1424\end{itemize}
1425
1426For instance bindings, the precedence of descriptor invocation depends
1427on the which descriptor methods are defined. Data descriptors define
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001428both \method{__get__()} and \method{__set__()}. Non-data descriptors have
1429just the \method{__get__()} method. Data descriptors always override
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001430a redefinition in an instance dictionary. In contrast, non-data
1431descriptors can be overridden by instances.
1432
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001433Python methods (including \function{staticmethod()} and \function{classmethod()})
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001434are implemented as non-data descriptors. Accordingly, instances can
1435redefine and override methods. This allows individual instances to acquire
1436behaviors that differ from other instances of the same class.
1437
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001438The \function{property()} function is implemented as a data descriptor.
1439Accordingly, instances cannot override the behavior of a property.
1440
1441
1442\subsubsection{__slots__\label{slots}}
1443
1444By default, instances of both old and new-style classes have a dictionary
1445for attribute storage. This wastes space for objects having very few instance
1446variables. The space consumption can become acute when creating large numbers
1447of instances.
1448
1449The default can be overridden by defining \var{__slots__} in a new-style class
1450definition. The \var{__slots__} declaration takes a sequence of instance
1451variables and reserves just enough space in each instance to hold a value
1452for each variable. Space is saved because \var{__dict__} is not created for
1453each instance.
1454
1455\begin{datadesc}{__slots__}
1456This class variable can be assigned a string, iterable, or sequence of strings
Raymond Hettinger5918f8d2003-06-29 04:53:23 +00001457with variable names used by instances. If defined in a new-style class,
1458\var{__slots__} reserves space for the declared variables
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001459and prevents the automatic creation of \var{__dict__} and \var{__weakref__}
1460for each instance.
1461\versionadded{2.2}
1462\end{datadesc}
1463
1464\noindent
Raymond Hettinger5918f8d2003-06-29 04:53:23 +00001465Notes on using \var{__slots__}
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001466
1467\begin{itemize}
1468
1469\item Without a \var{__dict__} variable, instances cannot be assigned new
1470variables not listed in the \var{__slots__} definition. Attempts to assign
1471to an unlisted variable name raises \exception{AttributeError}. If dynamic
1472assignment of new variables is desired, then add \code{'__dict__'} to the
1473sequence of strings in the \var{__slots__} declaration.
1474\versionchanged[Previously, adding \code{'__dict__'} to the \var{__slots__}
1475declaration would not enable the assignment of new attributes not
1476specifically listed in the sequence of instance variable names]{2.3}
1477
1478\item Without a \var{__weakref__} variable for each instance, classes
1479defining \var{__slots__} do not support weak references to its instances.
1480If weak reference support is needed, then add \code{'__weakref__'} to the
1481sequence of strings in the \var{__slots__} declaration.
Raymond Hettinger5918f8d2003-06-29 04:53:23 +00001482\versionchanged[Previously, adding \code{'__weakref__'} to the \var{__slots__}
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001483declaration would not enable support for weak references]{2.3}
1484
1485\item \var{__slots__} are implemented at the class level by creating
1486descriptors (\ref{descriptors}) for each variable name. As a result,
1487class attributes cannot be used to set default values for instance
1488variables defined by \var{__slots__}; otherwise, the class attribute would
1489overwrite the descriptor assignment.
1490
1491\item If a class defines a slot also defined in a base class, the instance
1492variable defined by the base class slot is inaccessible (except by retrieving
1493its descriptor directly from the base class). This renders the meaning of the
1494program undefined. In the future, a check may be added to prevent this.
1495
1496\item The action of a \var{__slots__} declaration is limited to the class
1497where it is defined. As a result, subclasses will have a \var{__dict__}
1498unless they also define \var{__slots__}.
1499
1500\item \var{__slots__} do not work for classes derived from ``variable-length''
1501built-in types such as \class{long}, \class{str} and \class{tuple}.
1502
Raymond Hettinger5918f8d2003-06-29 04:53:23 +00001503\item Any non-string iterable may be assigned to \var{__slots__}.
Raymond Hettingerdf9eff02003-06-27 06:57:56 +00001504Mappings may also be used; however, in the future, special meaning may
1505be assigned to the values corresponding to each key.
1506
1507\end{itemize}
1508
1509
1510\subsection{Customizing class creation\label{metaclasses}}
1511
1512By default, new-style classes are constructed using \function{type()}.
1513A class definition is read into a separate namespace and the value
1514of class name is bound to the result of \code{type(name, bases, dict)}.
1515
1516When the class definition is read, if \var{__metaclass__} is defined
1517then the callable assigned to it will be called instead of \function{type()}.
1518The allows classes or functions to be written which monitor or alter the class
1519creation process:
1520
1521\begin{itemize}
1522\item Modifying the class dictionary prior to the class being created.
1523\item Returning an instance of another class -- essentially performing
1524the role of a factory function.
1525\end{itemize}
1526
1527\begin{datadesc}{__metaclass__}
1528This variable can be any callable accepting arguments for \code{name},
1529\code{bases}, and \code{dict}. Upon class creation, the callable is
1530used instead of the built-in \function{type()}.
1531\versionadded{2.2}
1532\end{datadesc}
1533
1534The appropriate metaclass is determined by the following precedence rules:
1535
1536\begin{itemize}
1537
1538\item If \code{dict['__metaclass__']} exists, it is used.
1539
1540\item Otherwise, if there is at least one base class, its metaclass is used
1541(this looks for a \var{__class__} attribute first and if not found, uses its
1542type).
1543
1544\item Otherwise, if a global variable named __metaclass__ exists, it is used.
1545
1546\item Otherwise, the old-style, classic metaclass (types.ClassType) is used.
1547
1548\end{itemize}
1549
1550The potential uses for metaclasses are boundless. Some ideas that have
1551been explored including logging, interface checking, automatic delegation,
1552automatic property creation, proxies, frameworks, and automatic resource
1553locking/synchronization.
Raymond Hettinger03ec6d52003-06-25 18:29:36 +00001554
1555
Fred Drake61c77281998-07-28 19:34:22 +00001556\subsection{Emulating callable objects\label{callable-types}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001557
Fred Drake1e42d8a1998-11-25 17:58:50 +00001558\begin{methoddesc}[object]{__call__}{self\optional{, args...}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001559Called when the instance is ``called'' as a function; if this method
Fred Draked82575d1998-08-28 20:03:12 +00001560is defined, \code{\var{x}(arg1, arg2, ...)} is a shorthand for
1561\code{\var{x}.__call__(arg1, arg2, ...)}.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001562\indexii{call}{instance}
Fred Drake1e42d8a1998-11-25 17:58:50 +00001563\end{methoddesc}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001564
1565
Fred Drake73921b02001-10-01 16:32:13 +00001566\subsection{Emulating container types\label{sequence-types}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001567
Fred Drake73921b02001-10-01 16:32:13 +00001568The following methods can be defined to implement container
1569objects. Containers usually are sequences (such as lists or tuples)
1570or mappings (like dictionaries), but can represent other containers as
1571well. The first set of methods is used either to emulate a
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001572sequence or to emulate a mapping; the difference is that for a
1573sequence, the allowable keys should be the integers \var{k} for which
1574\code{0 <= \var{k} < \var{N}} where \var{N} is the length of the
Thomas Wouters1d75a792000-08-17 22:37:32 +00001575sequence, or slice objects, which define a range of items. (For backwards
1576compatibility, the method \method{__getslice__()} (see below) can also be
1577defined to handle simple, but not extended slices.) It is also recommended
Fred Drakea0073822000-08-18 02:42:14 +00001578that mappings provide the methods \method{keys()}, \method{values()},
Thomas Wouters1d75a792000-08-17 22:37:32 +00001579\method{items()}, \method{has_key()}, \method{get()}, \method{clear()},
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001580\method{setdefault()}, \method{iterkeys()}, \method{itervalues()},
Raymond Hettingerf4ca5a22003-01-19 14:57:12 +00001581\method{iteritems()}, \method{pop()}, \method{popitem()},
Thomas Wouters1d75a792000-08-17 22:37:32 +00001582\method{copy()}, and \method{update()} behaving similar to those for
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001583Python's standard dictionary objects. The \module{UserDict} module
1584provides a \class{DictMixin} class to help create those methods
1585from a base set of \method{__getitem__()}, \method{__setitem__()},
1586\method{__delitem__()}, and \method{keys()}.
1587Mutable sequences should provide
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001588methods \method{append()}, \method{count()}, \method{index()},
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001589\method{extend()},
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001590\method{insert()}, \method{pop()}, \method{remove()}, \method{reverse()}
1591and \method{sort()}, like Python standard list objects. Finally,
1592sequence types should implement addition (meaning concatenation) and
1593multiplication (meaning repetition) by defining the methods
Thomas Wouters12bba852000-08-24 20:06:04 +00001594\method{__add__()}, \method{__radd__()}, \method{__iadd__()},
1595\method{__mul__()}, \method{__rmul__()} and \method{__imul__()} described
1596below; they should not define \method{__coerce__()} or other numerical
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00001597operators. It is recommended that both mappings and sequences
Fred Drake18d8d5a2001-09-18 17:58:20 +00001598implement the \method{__contains__()} method to allow efficient use of
1599the \code{in} operator; for mappings, \code{in} should be equivalent
1600of \method{has_key()}; for sequences, it should search through the
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001601values. It is further recommended that both mappings and sequences
1602implement the \method{__iter__()} method to allow efficient iteration
1603through the container; for mappings, \method{__iter__()} should be
1604the same as \method{iterkeys()}; for sequences, it should iterate
1605through the values.
Fred Drake4856d011999-01-12 04:15:20 +00001606\withsubitem{(mapping object method)}{
1607 \ttindex{keys()}
1608 \ttindex{values()}
1609 \ttindex{items()}
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001610 \ttindex{iterkeys()}
1611 \ttindex{itervalues()}
1612 \ttindex{iteritems()}
Fred Drake4856d011999-01-12 04:15:20 +00001613 \ttindex{has_key()}
1614 \ttindex{get()}
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001615 \ttindex{setdefault()}
1616 \ttindex{pop()}
1617 \ttindex{popitem()}
Fred Drake4856d011999-01-12 04:15:20 +00001618 \ttindex{clear()}
1619 \ttindex{copy()}
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00001620 \ttindex{update()}
1621 \ttindex{__contains__()}}
Fred Drake4856d011999-01-12 04:15:20 +00001622\withsubitem{(sequence object method)}{
1623 \ttindex{append()}
1624 \ttindex{count()}
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001625 \ttindex{extend()}
Fred Drake4856d011999-01-12 04:15:20 +00001626 \ttindex{index()}
1627 \ttindex{insert()}
1628 \ttindex{pop()}
1629 \ttindex{remove()}
1630 \ttindex{reverse()}
1631 \ttindex{sort()}
1632 \ttindex{__add__()}
1633 \ttindex{__radd__()}
Thomas Wouters12bba852000-08-24 20:06:04 +00001634 \ttindex{__iadd__()}
Fred Drake4856d011999-01-12 04:15:20 +00001635 \ttindex{__mul__()}
Thomas Wouters12bba852000-08-24 20:06:04 +00001636 \ttindex{__rmul__()}
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00001637 \ttindex{__imul__()}
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001638 \ttindex{__contains__()}
1639 \ttindex{__iter__()}}
Fred Drakeae3e5741999-01-28 23:21:49 +00001640\withsubitem{(numeric object method)}{\ttindex{__coerce__()}}
Fred Drakef6669171998-05-06 19:52:49 +00001641
Fred Drake73921b02001-10-01 16:32:13 +00001642\begin{methoddesc}[container object]{__len__}{self}
Fred Draked82575d1998-08-28 20:03:12 +00001643Called to implement the built-in function
1644\function{len()}\bifuncindex{len}. Should return the length of the
1645object, an integer \code{>=} 0. Also, an object that doesn't define a
1646\method{__nonzero__()} method and whose \method{__len__()} method
1647returns zero is considered to be false in a Boolean context.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001648\withsubitem{(object method)}{\ttindex{__nonzero__()}}
1649\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001650
Fred Drake73921b02001-10-01 16:32:13 +00001651\begin{methoddesc}[container object]{__getitem__}{self, key}
Fred Draked82575d1998-08-28 20:03:12 +00001652Called to implement evaluation of \code{\var{self}[\var{key}]}.
Fred Drake31575ce2000-09-21 05:28:26 +00001653For sequence types, the accepted keys should be integers and slice
1654objects.\obindex{slice} Note that
1655the special interpretation of negative indexes (if the class wishes to
Fred Drakef6669171998-05-06 19:52:49 +00001656emulate a sequence type) is up to the \method{__getitem__()} method.
Fred Drake91826ed2000-07-13 04:57:58 +00001657If \var{key} is of an inappropriate type, \exception{TypeError} may be
1658raised; if of a value outside the set of indexes for the sequence
1659(after any special interpretation of negative values),
1660\exception{IndexError} should be raised.
Raymond Hettingera30616a2005-08-21 11:26:14 +00001661For mapping types, if \var{key} is missing (not in the container),
1662\exception{KeyError} should be raised.
Fred Drake0aa811c2001-10-20 04:24:09 +00001663\note{\keyword{for} loops expect that an
Fred Drake91826ed2000-07-13 04:57:58 +00001664\exception{IndexError} will be raised for illegal indexes to allow
Fred Drake0aa811c2001-10-20 04:24:09 +00001665proper detection of the end of the sequence.}
Fred Drake1e42d8a1998-11-25 17:58:50 +00001666\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001667
Fred Drake73921b02001-10-01 16:32:13 +00001668\begin{methoddesc}[container object]{__setitem__}{self, key, value}
Fred Draked82575d1998-08-28 20:03:12 +00001669Called to implement assignment to \code{\var{self}[\var{key}]}. Same
Fred Drake1e42d8a1998-11-25 17:58:50 +00001670note as for \method{__getitem__()}. This should only be implemented
1671for mappings if the objects support changes to the values for keys, or
1672if new keys can be added, or for sequences if elements can be
Fred Drake91826ed2000-07-13 04:57:58 +00001673replaced. The same exceptions should be raised for improper
1674\var{key} values as for the \method{__getitem__()} method.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001675\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001676
Fred Drake73921b02001-10-01 16:32:13 +00001677\begin{methoddesc}[container object]{__delitem__}{self, key}
Fred Draked82575d1998-08-28 20:03:12 +00001678Called to implement deletion of \code{\var{self}[\var{key}]}. Same
Fred Drake1e42d8a1998-11-25 17:58:50 +00001679note as for \method{__getitem__()}. This should only be implemented
1680for mappings if the objects support removal of keys, or for sequences
Fred Drake91826ed2000-07-13 04:57:58 +00001681if elements can be removed from the sequence. The same exceptions
1682should be raised for improper \var{key} values as for the
1683\method{__getitem__()} method.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001684\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001685
Fred Drake73921b02001-10-01 16:32:13 +00001686\begin{methoddesc}[container object]{__iter__}{self}
1687This method is called when an iterator is required for a container.
1688This method should return a new iterator object that can iterate over
1689all the objects in the container. For mappings, it should iterate
1690over the keys of the container, and should also be made available as
1691the method \method{iterkeys()}.
1692
1693Iterator objects also need to implement this method; they are required
1694to return themselves. For more information on iterator objects, see
1695``\ulink{Iterator Types}{../lib/typeiter.html}'' in the
1696\citetitle[../lib/lib.html]{Python Library Reference}.
1697\end{methoddesc}
1698
1699The membership test operators (\keyword{in} and \keyword{not in}) are
1700normally implemented as an iteration through a sequence. However,
1701container objects can supply the following special method with a more
1702efficient implementation, which also does not require the object be a
1703sequence.
1704
1705\begin{methoddesc}[container object]{__contains__}{self, item}
1706Called to implement membership test operators. Should return true if
1707\var{item} is in \var{self}, false otherwise. For mapping objects,
1708this should consider the keys of the mapping rather than the values or
1709the key-item pairs.
1710\end{methoddesc}
1711
Fred Drakef6669171998-05-06 19:52:49 +00001712
Fred Drake3041b071998-10-21 00:25:32 +00001713\subsection{Additional methods for emulation of sequence types
Fred Drake61c77281998-07-28 19:34:22 +00001714 \label{sequence-methods}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001715
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001716The following optional methods can be defined to further emulate sequence
1717objects. Immutable sequences methods should at most only define
1718\method{__getslice__()}; mutable sequences might define all three
Raymond Hettinger92016dc2003-09-22 15:27:11 +00001719methods.
Fred Drakef6669171998-05-06 19:52:49 +00001720
Fred Drake1e42d8a1998-11-25 17:58:50 +00001721\begin{methoddesc}[sequence object]{__getslice__}{self, i, j}
Fred Drakea0073822000-08-18 02:42:14 +00001722\deprecated{2.0}{Support slice objects as parameters to the
1723\method{__getitem__()} method.}
Fred Draked82575d1998-08-28 20:03:12 +00001724Called to implement evaluation of \code{\var{self}[\var{i}:\var{j}]}.
1725The returned object should be of the same type as \var{self}. Note
1726that missing \var{i} or \var{j} in the slice expression are replaced
Fred Drakee15956b2000-04-03 04:51:13 +00001727by zero or \code{sys.maxint}, respectively. If negative indexes are
1728used in the slice, the length of the sequence is added to that index.
1729If the instance does not implement the \method{__len__()} method, an
1730\exception{AttributeError} is raised.
1731No guarantee is made that indexes adjusted this way are not still
1732negative. Indexes which are greater than the length of the sequence
1733are not modified.
Fred Drakea0073822000-08-18 02:42:14 +00001734If no \method{__getslice__()} is found, a slice
Thomas Wouters1d75a792000-08-17 22:37:32 +00001735object is created instead, and passed to \method{__getitem__()} instead.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001736\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001737
Fred Drake1e42d8a1998-11-25 17:58:50 +00001738\begin{methoddesc}[sequence object]{__setslice__}{self, i, j, sequence}
Fred Draked82575d1998-08-28 20:03:12 +00001739Called to implement assignment to \code{\var{self}[\var{i}:\var{j}]}.
1740Same notes for \var{i} and \var{j} as for \method{__getslice__()}.
Thomas Wouters1d75a792000-08-17 22:37:32 +00001741
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001742This method is deprecated. If no \method{__setslice__()} is found,
1743or for extended slicing of the form
1744\code{\var{self}[\var{i}:\var{j}:\var{k}]}, a
1745slice object is created, and passed to \method{__setitem__()},
1746instead of \method{__setslice__()} being called.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001747\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001748
Fred Drake1e42d8a1998-11-25 17:58:50 +00001749\begin{methoddesc}[sequence object]{__delslice__}{self, i, j}
Fred Draked82575d1998-08-28 20:03:12 +00001750Called to implement deletion of \code{\var{self}[\var{i}:\var{j}]}.
1751Same notes for \var{i} and \var{j} as for \method{__getslice__()}.
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001752This method is deprecated. If no \method{__delslice__()} is found,
1753or for extended slicing of the form
1754\code{\var{self}[\var{i}:\var{j}:\var{k}]}, a
1755slice object is created, and passed to \method{__delitem__()},
1756instead of \method{__delslice__()} being called.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001757\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001758
Fred Drakefb8ffe62001-04-13 15:54:41 +00001759Notice that these methods are only invoked when a single slice with a
1760single colon is used, and the slice method is available. For slice
1761operations involving extended slice notation, or in absence of the
1762slice methods, \method{__getitem__()}, \method{__setitem__()} or
1763\method{__delitem__()} is called with a slice object as argument.
Fred Drakef6669171998-05-06 19:52:49 +00001764
Fred Drakef89259782000-09-21 22:27:16 +00001765The following example demonstrate how to make your program or module
1766compatible with earlier versions of Python (assuming that methods
1767\method{__getitem__()}, \method{__setitem__()} and \method{__delitem__()}
1768support slice objects as arguments):
1769
1770\begin{verbatim}
1771class MyClass:
1772 ...
1773 def __getitem__(self, index):
1774 ...
1775 def __setitem__(self, index, value):
1776 ...
1777 def __delitem__(self, index):
1778 ...
1779
1780 if sys.version_info < (2, 0):
1781 # They won't be defined if version is at least 2.0 final
1782
1783 def __getslice__(self, i, j):
1784 return self[max(0, i):max(0, j):]
1785 def __setslice__(self, i, j, seq):
1786 self[max(0, i):max(0, j):] = seq
1787 def __delslice__(self, i, j):
1788 del self[max(0, i):max(0, j):]
1789 ...
1790\end{verbatim}
1791
Raymond Hettingere701dcb2003-01-19 13:08:18 +00001792Note the calls to \function{max()}; these are necessary because of
1793the handling of negative indices before the
Fred Drakef89259782000-09-21 22:27:16 +00001794\method{__*slice__()} methods are called. When negative indexes are
1795used, the \method{__*item__()} methods receive them as provided, but
1796the \method{__*slice__()} methods get a ``cooked'' form of the index
1797values. For each negative index value, the length of the sequence is
1798added to the index before calling the method (which may still result
1799in a negative index); this is the customary handling of negative
1800indexes by the built-in sequence types, and the \method{__*item__()}
1801methods are expected to do this as well. However, since they should
1802already be doing that, negative indexes cannot be passed in; they must
Raymond Hettingere41d4c82003-08-25 04:39:55 +00001803be constrained to the bounds of the sequence before being passed to
Fred Drakef89259782000-09-21 22:27:16 +00001804the \method{__*item__()} methods.
1805Calling \code{max(0, i)} conveniently returns the proper value.
1806
Fred Drake15988fd1999-02-12 18:14:57 +00001807
Fred Drake61c77281998-07-28 19:34:22 +00001808\subsection{Emulating numeric types\label{numeric-types}}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001809
1810The following methods can be defined to emulate numeric objects.
1811Methods corresponding to operations that are not supported by the
1812particular kind of number implemented (e.g., bitwise operations for
1813non-integral numbers) should be left undefined.
Fred Drakef6669171998-05-06 19:52:49 +00001814
Fred Drakeb8943701999-05-10 13:43:22 +00001815\begin{methoddesc}[numeric object]{__add__}{self, other}
1816\methodline[numeric object]{__sub__}{self, other}
1817\methodline[numeric object]{__mul__}{self, other}
Fred Drake3e2aca42001-08-14 20:28:08 +00001818\methodline[numeric object]{__floordiv__}{self, other}
Fred Drakeb8943701999-05-10 13:43:22 +00001819\methodline[numeric object]{__mod__}{self, other}
1820\methodline[numeric object]{__divmod__}{self, other}
1821\methodline[numeric object]{__pow__}{self, other\optional{, modulo}}
1822\methodline[numeric object]{__lshift__}{self, other}
1823\methodline[numeric object]{__rshift__}{self, other}
1824\methodline[numeric object]{__and__}{self, other}
1825\methodline[numeric object]{__xor__}{self, other}
1826\methodline[numeric object]{__or__}{self, other}
Fred Drake3e2aca42001-08-14 20:28:08 +00001827These methods are
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001828called to implement the binary arithmetic operations (\code{+},
Fred Drake3e2aca42001-08-14 20:28:08 +00001829\code{-}, \code{*}, \code{//}, \code{\%},
Fred Draked82575d1998-08-28 20:03:12 +00001830\function{divmod()}\bifuncindex{divmod},
Fred Drakefb8ffe62001-04-13 15:54:41 +00001831\function{pow()}\bifuncindex{pow}, \code{**}, \code{<}\code{<},
1832\code{>}\code{>}, \code{\&}, \code{\^}, \code{|}). For instance, to
1833evaluate the expression \var{x}\code{+}\var{y}, where \var{x} is an
1834instance of a class that has an \method{__add__()} method,
Fred Drake3e2aca42001-08-14 20:28:08 +00001835\code{\var{x}.__add__(\var{y})} is called. The \method{__divmod__()}
1836method should be the equivalent to using \method{__floordiv__()} and
1837\method{__mod__()}; it should not be related to \method{__truediv__()}
1838(described below). Note that
Fred Draked82575d1998-08-28 20:03:12 +00001839\method{__pow__()} should be defined to accept an optional third
1840argument if the ternary version of the built-in
1841\function{pow()}\bifuncindex{pow} function is to be supported.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001842\end{methoddesc}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001843
Fred Drake3e2aca42001-08-14 20:28:08 +00001844\begin{methoddesc}[numeric object]{__div__}{self, other}
1845\methodline[numeric object]{__truediv__}{self, other}
1846The division operator (\code{/}) is implemented by these methods. The
1847\method{__truediv__()} method is used when \code{__future__.division}
1848is in effect, otherwise \method{__div__()} is used. If only one of
1849these two methods is defined, the object will not support division in
1850the alternate context; \exception{TypeError} will be raised instead.
1851\end{methoddesc}
1852
Fred Drakeb8943701999-05-10 13:43:22 +00001853\begin{methoddesc}[numeric object]{__radd__}{self, other}
1854\methodline[numeric object]{__rsub__}{self, other}
1855\methodline[numeric object]{__rmul__}{self, other}
1856\methodline[numeric object]{__rdiv__}{self, other}
Raymond Hettinger10cbe8d2002-06-20 06:12:37 +00001857\methodline[numeric object]{__rtruediv__}{self, other}
1858\methodline[numeric object]{__rfloordiv__}{self, other}
Fred Drakeb8943701999-05-10 13:43:22 +00001859\methodline[numeric object]{__rmod__}{self, other}
1860\methodline[numeric object]{__rdivmod__}{self, other}
1861\methodline[numeric object]{__rpow__}{self, other}
1862\methodline[numeric object]{__rlshift__}{self, other}
1863\methodline[numeric object]{__rrshift__}{self, other}
1864\methodline[numeric object]{__rand__}{self, other}
1865\methodline[numeric object]{__rxor__}{self, other}
1866\methodline[numeric object]{__ror__}{self, other}
Fred Drake3e2aca42001-08-14 20:28:08 +00001867These methods are
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001868called to implement the binary arithmetic operations (\code{+},
Fred Draked82575d1998-08-28 20:03:12 +00001869\code{-}, \code{*}, \code{/}, \code{\%},
1870\function{divmod()}\bifuncindex{divmod},
Fred Drakefb8ffe62001-04-13 15:54:41 +00001871\function{pow()}\bifuncindex{pow}, \code{**}, \code{<}\code{<},
1872\code{>}\code{>}, \code{\&}, \code{\^}, \code{|}) with reflected
1873(swapped) operands. These functions are only called if the left
1874operand does not support the corresponding operation. For instance,
1875to evaluate the expression \var{x}\code{-}\var{y}, where \var{y} is an
1876instance of a class that has an \method{__rsub__()} method,
1877\code{\var{y}.__rsub__(\var{x})} is called. Note that ternary
1878\function{pow()}\bifuncindex{pow} will not try calling
1879\method{__rpow__()} (the coercion rules would become too
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001880complicated).
Fred Drake1e42d8a1998-11-25 17:58:50 +00001881\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001882
Thomas Woutersdc90cc22000-12-11 23:11:51 +00001883\begin{methoddesc}[numeric object]{__iadd__}{self, other}
1884\methodline[numeric object]{__isub__}{self, other}
1885\methodline[numeric object]{__imul__}{self, other}
1886\methodline[numeric object]{__idiv__}{self, other}
Raymond Hettinger10cbe8d2002-06-20 06:12:37 +00001887\methodline[numeric object]{__itruediv__}{self, other}
1888\methodline[numeric object]{__ifloordiv__}{self, other}
1889\methodline[numeric object]{__imod__}{self, other}
Thomas Woutersdc90cc22000-12-11 23:11:51 +00001890\methodline[numeric object]{__ipow__}{self, other\optional{, modulo}}
1891\methodline[numeric object]{__ilshift__}{self, other}
1892\methodline[numeric object]{__irshift__}{self, other}
1893\methodline[numeric object]{__iand__}{self, other}
1894\methodline[numeric object]{__ixor__}{self, other}
1895\methodline[numeric object]{__ior__}{self, other}
Fred Drakefb8ffe62001-04-13 15:54:41 +00001896These methods are called to implement the augmented arithmetic
1897operations (\code{+=}, \code{-=}, \code{*=}, \code{/=}, \code{\%=},
1898\code{**=}, \code{<}\code{<=}, \code{>}\code{>=}, \code{\&=},
Fred Drakea3788642003-07-23 15:18:03 +00001899\code{\textasciicircum=}, \code{|=}). These methods should attempt to do the
Fred Drakefb8ffe62001-04-13 15:54:41 +00001900operation in-place (modifying \var{self}) and return the result (which
1901could be, but does not have to be, \var{self}). If a specific method
1902is not defined, the augmented operation falls back to the normal
1903methods. For instance, to evaluate the expression
1904\var{x}\code{+=}\var{y}, where \var{x} is an instance of a class that
1905has an \method{__iadd__()} method, \code{\var{x}.__iadd__(\var{y})} is
1906called. If \var{x} is an instance of a class that does not define a
1907\method{__iadd()} method, \code{\var{x}.__add__(\var{y})} and
1908\code{\var{y}.__radd__(\var{x})} are considered, as with the
1909evaluation of \var{x}\code{+}\var{y}.
Thomas Woutersdc90cc22000-12-11 23:11:51 +00001910\end{methoddesc}
1911
Fred Drakeb8943701999-05-10 13:43:22 +00001912\begin{methoddesc}[numeric object]{__neg__}{self}
1913\methodline[numeric object]{__pos__}{self}
1914\methodline[numeric object]{__abs__}{self}
1915\methodline[numeric object]{__invert__}{self}
Fred Drakefb8ffe62001-04-13 15:54:41 +00001916Called to implement the unary arithmetic operations (\code{-},
1917\code{+}, \function{abs()}\bifuncindex{abs} and \code{\~{}}).
Fred Drake1e42d8a1998-11-25 17:58:50 +00001918\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001919
Fred Drakeb8943701999-05-10 13:43:22 +00001920\begin{methoddesc}[numeric object]{__complex__}{self}
1921\methodline[numeric object]{__int__}{self}
1922\methodline[numeric object]{__long__}{self}
1923\methodline[numeric object]{__float__}{self}
Fred Draked82575d1998-08-28 20:03:12 +00001924Called to implement the built-in functions
Fred Drake15988fd1999-02-12 18:14:57 +00001925\function{complex()}\bifuncindex{complex},
1926\function{int()}\bifuncindex{int}, \function{long()}\bifuncindex{long},
Fred Draked82575d1998-08-28 20:03:12 +00001927and \function{float()}\bifuncindex{float}. Should return a value of
1928the appropriate type.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001929\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001930
Fred Drakeb8943701999-05-10 13:43:22 +00001931\begin{methoddesc}[numeric object]{__oct__}{self}
1932\methodline[numeric object]{__hex__}{self}
Fred Draked82575d1998-08-28 20:03:12 +00001933Called to implement the built-in functions
1934\function{oct()}\bifuncindex{oct} and
1935\function{hex()}\bifuncindex{hex}. Should return a string value.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001936\end{methoddesc}
Fred Drakef6669171998-05-06 19:52:49 +00001937
Fred Drakeb8943701999-05-10 13:43:22 +00001938\begin{methoddesc}[numeric object]{__coerce__}{self, other}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001939Called to implement ``mixed-mode'' numeric arithmetic. Should either
Fred Draked82575d1998-08-28 20:03:12 +00001940return a 2-tuple containing \var{self} and \var{other} converted to
Fred Drakeb8943701999-05-10 13:43:22 +00001941a common numeric type, or \code{None} if conversion is impossible. When
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001942the common type would be the type of \code{other}, it is sufficient to
1943return \code{None}, since the interpreter will also ask the other
1944object to attempt a coercion (but sometimes, if the implementation of
1945the other type cannot be changed, it is useful to do the conversion to
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001946the other type here). A return value of \code{NotImplemented} is
1947equivalent to returning \code{None}.
Fred Drake1e42d8a1998-11-25 17:58:50 +00001948\end{methoddesc}
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001949
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001950\subsection{Coercion rules\label{coercion-rules}}
1951
1952This section used to document the rules for coercion. As the language
1953has evolved, the coercion rules have become hard to document
1954precisely; documenting what one version of one particular
1955implementation does is undesirable. Instead, here are some informal
1956guidelines regarding coercion. In Python 3.0, coercion will not be
1957supported.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001958
1959\begin{itemize}
1960
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001961\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001962
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001963If the left operand of a \% operator is a string or Unicode object, no
1964coercion takes place and the string formatting operation is invoked
1965instead.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001966
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001967\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001968
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001969It is no longer recommended to define a coercion operation.
1970Mixed-mode operations on types that don't define coercion pass the
1971original arguments to the operation.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001972
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001973\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001974
Fred Drake293dd4b2002-06-04 16:25:57 +00001975New-style classes (those derived from \class{object}) never invoke the
1976\method{__coerce__()} method in response to a binary operator; the only
1977time \method{__coerce__()} is invoked is when the built-in function
1978\function{coerce()} is called.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001979
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001980\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001981
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001982For most intents and purposes, an operator that returns
1983\code{NotImplemented} is treated the same as one that is not
1984implemented at all.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001985
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001986\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001987
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001988Below, \method{__op__()} and \method{__rop__()} are used to signify
1989the generic method names corresponding to an operator;
1990\method{__iop__} is used for the corresponding in-place operator. For
1991example, for the operator `\code{+}', \method{__add__()} and
1992\method{__radd__()} are used for the left and right variant of the
1993binary operator, and \method{__iadd__} for the in-place variant.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001994
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001995\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00001996
Guido van Rossum92cf95f2002-06-03 19:06:41 +00001997For objects \var{x} and \var{y}, first \code{\var{x}.__op__(\var{y})}
1998is tried. If this is not implemented or returns \code{NotImplemented},
1999\code{\var{y}.__rop__(\var{x})} is tried. If this is also not
Fred Drake293dd4b2002-06-04 16:25:57 +00002000implemented or returns \code{NotImplemented}, a \exception{TypeError}
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002001exception is raised. But see the following exception:
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00002002
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002003\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00002004
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002005Exception to the previous item: if the left operand is an instance of
2006a built-in type or a new-style class, and the right operand is an
2007instance of a proper subclass of that type or class, the right
Fred Drake293dd4b2002-06-04 16:25:57 +00002008operand's \method{__rop__()} method is tried \emph{before} the left
2009operand's \method{__op__()} method. This is done so that a subclass can
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002010completely override binary operators. Otherwise, the left operand's
2011__op__ method would always accept the right operand: when an instance
2012of a given class is expected, an instance of a subclass of that class
2013is always acceptable.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00002014
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002015\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00002016
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002017When either operand type defines a coercion, this coercion is called
Fred Drake293dd4b2002-06-04 16:25:57 +00002018before that type's \method{__op__()} or \method{__rop__()} method is
2019called, but no sooner. If the coercion returns an object of a
2020different type for the operand whose coercion is invoked, part of the
2021process is redone using the new object.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00002022
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002023\item
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00002024
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002025When an in-place operator (like `\code{+=}') is used, if the left
Fred Drake293dd4b2002-06-04 16:25:57 +00002026operand implements \method{__iop__()}, it is invoked without any
2027coercion. When the operation falls back to \method{__op__()} and/or
2028\method{__rop__()}, the normal coercion rules apply.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00002029
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002030\item
2031
2032In \var{x}\code{+}\var{y}, if \var{x} is a sequence that implements
2033sequence concatenation, sequence concatenation is invoked.
2034
2035\item
2036
2037In \var{x}\code{*}\var{y}, if one operator is a sequence that
2038implements sequence repetition, and the other is an integer
Fred Drake293dd4b2002-06-04 16:25:57 +00002039(\class{int} or \class{long}), sequence repetition is invoked.
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002040
2041\item
2042
Fred Drake293dd4b2002-06-04 16:25:57 +00002043Rich comparisons (implemented by methods \method{__eq__()} and so on)
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002044never use coercion. Three-way comparison (implemented by
Fred Drake293dd4b2002-06-04 16:25:57 +00002045\method{__cmp__()}) does use coercion under the same conditions as
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002046other binary operations use it.
2047
2048\item
2049
Fred Drake293dd4b2002-06-04 16:25:57 +00002050In the current implementation, the built-in numeric types \class{int},
2051\class{long} and \class{float} do not use coercion; the type
2052\class{complex} however does use it. The difference can become
Guido van Rossum92cf95f2002-06-03 19:06:41 +00002053apparent when subclassing these types. Over time, the type
Fred Drake293dd4b2002-06-04 16:25:57 +00002054\class{complex} may be fixed to avoid coercion. All these types
2055implement a \method{__coerce__()} method, for use by the built-in
2056\function{coerce()} function.
Guido van Rossum83b2f8a1998-07-23 17:12:46 +00002057
2058\end{itemize}