Fred Drake | 61c7728 | 1998-07-28 19:34:22 +0000 | [diff] [blame] | 1 | \chapter{Data model\label{datamodel}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 2 | |
Fred Drake | 2829f1c | 2001-06-23 05:27:20 +0000 | [diff] [blame] | 3 | |
Fred Drake | 61c7728 | 1998-07-28 19:34:22 +0000 | [diff] [blame] | 4 | \section{Objects, values and types\label{objects}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 5 | |
| 6 | \dfn{Objects} are Python's abstraction for data. All data in a Python |
| 7 | program is represented by objects or by relations between objects. |
| 8 | (In a sense, and in conformance to Von Neumann's model of a |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 9 | ``stored program computer,'' code is also represented by objects.) |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 10 | \index{object} |
| 11 | \index{data} |
| 12 | |
| 13 | Every object has an identity, a type and a value. An object's |
| 14 | \emph{identity} never changes once it has been created; you may think |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 15 | of it as the object's address in memory. The `\keyword{is}' operator |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 16 | compares the identity of two objects; the |
| 17 | \function{id()}\bifuncindex{id} function returns an integer |
| 18 | representing its identity (currently implemented as its address). |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 19 | An object's \dfn{type} is |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 20 | also unchangeable.\footnote{Since Python 2.2, a gradual merging of |
| 21 | types and classes has been started that makes this and a few other |
| 22 | assertions made in this manual not 100\% accurate and complete: |
| 23 | for example, it \emph{is} now possible in some cases to change an |
| 24 | object's type, under certain controlled conditions. Until this manual |
| 25 | undergoes extensive revision, it must now be taken as authoritative |
| 26 | only regarding ``classic classes'', that are still the default, for |
| 27 | compatibility purposes, in Python 2.2 and 2.3.} |
| 28 | An object's type determines the operations that the object |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 29 | supports (e.g., ``does it have a length?'') and also defines the |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 30 | possible values for objects of that type. The |
| 31 | \function{type()}\bifuncindex{type} function returns an object's type |
| 32 | (which is an object itself). The \emph{value} of some |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 33 | objects can change. Objects whose value can change are said to be |
| 34 | \emph{mutable}; objects whose value is unchangeable once they are |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 35 | created are called \emph{immutable}. |
Guido van Rossum | 264bd59 | 1999-02-23 16:40:55 +0000 | [diff] [blame] | 36 | (The value of an immutable container object that contains a reference |
| 37 | to a mutable object can change when the latter's value is changed; |
| 38 | however the container is still considered immutable, because the |
| 39 | collection of objects it contains cannot be changed. So, immutability |
| 40 | is not strictly the same as having an unchangeable value, it is more |
| 41 | subtle.) |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 42 | An object's mutability is determined by its type; for instance, |
| 43 | numbers, strings and tuples are immutable, while dictionaries and |
| 44 | lists are mutable. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 45 | \index{identity of an object} |
| 46 | \index{value of an object} |
| 47 | \index{type of an object} |
| 48 | \index{mutable object} |
| 49 | \index{immutable object} |
| 50 | |
| 51 | Objects are never explicitly destroyed; however, when they become |
| 52 | unreachable they may be garbage-collected. An implementation is |
Barry Warsaw | 92a6ed9 | 1998-08-07 16:33:51 +0000 | [diff] [blame] | 53 | allowed to postpone garbage collection or omit it altogether --- it is |
| 54 | a matter of implementation quality how garbage collection is |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 55 | implemented, as long as no objects are collected that are still |
| 56 | reachable. (Implementation note: the current implementation uses a |
Fred Drake | c8e8281 | 2001-01-22 17:46:18 +0000 | [diff] [blame] | 57 | reference-counting scheme with (optional) delayed detection of |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 58 | cyclically linked garbage, which collects most objects as soon as they |
Fred Drake | c8e8281 | 2001-01-22 17:46:18 +0000 | [diff] [blame] | 59 | become unreachable, but is not guaranteed to collect garbage |
| 60 | containing circular references. See the |
| 61 | \citetitle[../lib/module-gc.html]{Python Library Reference} for |
| 62 | information on controlling the collection of cyclic garbage.) |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 63 | \index{garbage collection} |
| 64 | \index{reference counting} |
| 65 | \index{unreachable object} |
| 66 | |
| 67 | Note that the use of the implementation's tracing or debugging |
| 68 | facilities may keep objects alive that would normally be collectable. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 69 | Also note that catching an exception with a |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 70 | `\keyword{try}...\keyword{except}' statement may keep objects alive. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 71 | |
| 72 | Some objects contain references to ``external'' resources such as open |
| 73 | files or windows. It is understood that these resources are freed |
| 74 | when the object is garbage-collected, but since garbage collection is |
| 75 | not guaranteed to happen, such objects also provide an explicit way to |
| 76 | release the external resource, usually a \method{close()} method. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 77 | Programs are strongly recommended to explicitly close such |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 78 | objects. The `\keyword{try}...\keyword{finally}' statement provides |
| 79 | a convenient way to do this. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 80 | |
| 81 | Some objects contain references to other objects; these are called |
| 82 | \emph{containers}. Examples of containers are tuples, lists and |
| 83 | dictionaries. The references are part of a container's value. In |
| 84 | most cases, when we talk about the value of a container, we imply the |
| 85 | values, not the identities of the contained objects; however, when we |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 86 | talk about the mutability of a container, only the identities of |
| 87 | the immediately contained objects are implied. So, if an immutable |
| 88 | container (like a tuple) |
| 89 | contains a reference to a mutable object, its value changes |
| 90 | if that mutable object is changed. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 91 | \index{container} |
| 92 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 93 | Types affect almost all aspects of object behavior. Even the importance |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 94 | of object identity is affected in some sense: for immutable types, |
| 95 | operations that compute new values may actually return a reference to |
| 96 | any existing object with the same type and value, while for mutable |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 97 | objects this is not allowed. E.g., after |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 98 | \samp{a = 1; b = 1}, |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 99 | \code{a} and \code{b} may or may not refer to the same object with the |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 100 | value one, depending on the implementation, but after |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 101 | \samp{c = []; d = []}, \code{c} and \code{d} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 102 | are guaranteed to refer to two different, unique, newly created empty |
| 103 | lists. |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 104 | (Note that \samp{c = d = []} assigns the same object to both |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 105 | \code{c} and \code{d}.) |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 106 | |
Fred Drake | 2829f1c | 2001-06-23 05:27:20 +0000 | [diff] [blame] | 107 | |
Fred Drake | 61c7728 | 1998-07-28 19:34:22 +0000 | [diff] [blame] | 108 | \section{The standard type hierarchy\label{types}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 109 | |
| 110 | Below is a list of the types that are built into Python. Extension |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 111 | modules (written in C, Java, or other languages, depending on |
| 112 | the implementation) can define additional types. Future versions of |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 113 | Python may add types to the type hierarchy (e.g., rational |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 114 | numbers, efficiently stored arrays of integers, etc.). |
| 115 | \index{type} |
| 116 | \indexii{data}{type} |
| 117 | \indexii{type}{hierarchy} |
| 118 | \indexii{extension}{module} |
| 119 | \indexii{C}{language} |
| 120 | |
| 121 | Some of the type descriptions below contain a paragraph listing |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 122 | `special attributes.' These are attributes that provide access to the |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 123 | implementation and are not intended for general use. Their definition |
Fred Drake | 3570551 | 2001-12-03 17:32:27 +0000 | [diff] [blame] | 124 | may change in the future. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 125 | \index{attribute} |
| 126 | \indexii{special}{attribute} |
| 127 | \indexiii{generic}{special}{attribute} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 128 | |
| 129 | \begin{description} |
| 130 | |
| 131 | \item[None] |
| 132 | This type has a single value. There is a single object with this value. |
| 133 | This object is accessed through the built-in name \code{None}. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 134 | It is used to signify the absence of a value in many situations, e.g., |
| 135 | it is returned from functions that don't explicitly return anything. |
| 136 | Its truth value is false. |
Fred Drake | 7a700b8 | 2004-01-01 05:43:53 +0000 | [diff] [blame] | 137 | \obindex{None} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 138 | |
Neil Schemenauer | 48c2eb9 | 2001-01-04 01:25:50 +0000 | [diff] [blame] | 139 | \item[NotImplemented] |
| 140 | This type has a single value. There is a single object with this value. |
| 141 | This object is accessed through the built-in name \code{NotImplemented}. |
Guido van Rossum | ab782dd | 2001-01-18 15:17:06 +0000 | [diff] [blame] | 142 | Numeric methods and rich comparison methods may return this value if |
| 143 | they do not implement the operation for the operands provided. (The |
| 144 | interpreter will then try the reflected operation, or some other |
| 145 | fallback, depending on the operator.) Its truth value is true. |
Fred Drake | 7a700b8 | 2004-01-01 05:43:53 +0000 | [diff] [blame] | 146 | \obindex{NotImplemented} |
Neil Schemenauer | 48c2eb9 | 2001-01-04 01:25:50 +0000 | [diff] [blame] | 147 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 148 | \item[Ellipsis] |
| 149 | This type has a single value. There is a single object with this value. |
| 150 | This object is accessed through the built-in name \code{Ellipsis}. |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 151 | It is used to indicate the presence of the \samp{...} syntax in a |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 152 | slice. Its truth value is true. |
Fred Drake | c0a02c0 | 2002-04-16 02:03:05 +0000 | [diff] [blame] | 153 | \obindex{Ellipsis} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 154 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 155 | \item[Numbers] |
| 156 | These are created by numeric literals and returned as results by |
| 157 | arithmetic operators and arithmetic built-in functions. Numeric |
| 158 | objects are immutable; once created their value never changes. Python |
| 159 | numbers are of course strongly related to mathematical numbers, but |
| 160 | subject to the limitations of numerical representation in computers. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 161 | \obindex{numeric} |
| 162 | |
Fred Drake | b3384d3 | 2001-05-14 16:04:22 +0000 | [diff] [blame] | 163 | Python distinguishes between integers, floating point numbers, and |
| 164 | complex numbers: |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 165 | |
| 166 | \begin{description} |
| 167 | \item[Integers] |
| 168 | These represent elements from the mathematical set of whole numbers. |
| 169 | \obindex{integer} |
| 170 | |
Guido van Rossum | 77f6a65 | 2002-04-03 22:41:51 +0000 | [diff] [blame] | 171 | There are three types of integers: |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 172 | |
| 173 | \begin{description} |
| 174 | |
| 175 | \item[Plain integers] |
| 176 | These represent numbers in the range -2147483648 through 2147483647. |
| 177 | (The range may be larger on machines with a larger natural word |
| 178 | size, but not smaller.) |
Fred Drake | e15956b | 2000-04-03 04:51:13 +0000 | [diff] [blame] | 179 | When the result of an operation would fall outside this range, the |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 180 | result is normally returned as a long integer (in some cases, the |
| 181 | exception \exception{OverflowError} is raised instead). |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 182 | For the purpose of shift and mask operations, integers are assumed to |
| 183 | have a binary, 2's complement notation using 32 or more bits, and |
| 184 | hiding no bits from the user (i.e., all 4294967296 different bit |
| 185 | patterns correspond to different values). |
| 186 | \obindex{plain integer} |
| 187 | \withsubitem{(built-in exception)}{\ttindex{OverflowError}} |
| 188 | |
| 189 | \item[Long integers] |
| 190 | These represent numbers in an unlimited range, subject to available |
| 191 | (virtual) memory only. For the purpose of shift and mask operations, |
| 192 | a binary representation is assumed, and negative numbers are |
| 193 | represented in a variant of 2's complement which gives the illusion of |
| 194 | an infinite string of sign bits extending to the left. |
| 195 | \obindex{long integer} |
| 196 | |
Guido van Rossum | 77f6a65 | 2002-04-03 22:41:51 +0000 | [diff] [blame] | 197 | \item[Booleans] |
| 198 | These represent the truth values False and True. The two objects |
| 199 | representing the values False and True are the only Boolean objects. |
| 200 | The Boolean type is a subtype of plain integers, and Boolean values |
| 201 | behave like the values 0 and 1, respectively, in almost all contexts, |
| 202 | the exception being that when converted to a string, the strings |
| 203 | \code{"False"} or \code{"True"} are returned, respectively. |
| 204 | \obindex{Boolean} |
| 205 | \ttindex{False} |
| 206 | \ttindex{True} |
| 207 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 208 | \end{description} % Integers |
| 209 | |
| 210 | The rules for integer representation are intended to give the most |
| 211 | meaningful interpretation of shift and mask operations involving |
| 212 | negative integers and the least surprises when switching between the |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 213 | plain and long integer domains. Any operation except left shift, |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 214 | if it yields a result in the plain integer domain without causing |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 215 | overflow, will yield the same result in the long integer domain or |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 216 | when using mixed operands. |
| 217 | \indexii{integer}{representation} |
| 218 | |
| 219 | \item[Floating point numbers] |
| 220 | These represent machine-level double precision floating point numbers. |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 221 | You are at the mercy of the underlying machine architecture (and |
| 222 | C or Java implementation) for the accepted range and handling of overflow. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 223 | Python does not support single-precision floating point numbers; the |
Fred Drake | 6e5e1d9 | 2001-07-14 02:12:27 +0000 | [diff] [blame] | 224 | savings in processor and memory usage that are usually the reason for using |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 225 | these is dwarfed by the overhead of using objects in Python, so there |
| 226 | is no reason to complicate the language with two kinds of floating |
| 227 | point numbers. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 228 | \obindex{floating point} |
| 229 | \indexii{floating point}{number} |
| 230 | \indexii{C}{language} |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 231 | \indexii{Java}{language} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 232 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 233 | \item[Complex numbers] |
| 234 | These represent complex numbers as a pair of machine-level double |
| 235 | precision floating point numbers. The same caveats apply as for |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 236 | floating point numbers. The real and imaginary parts of a complex |
| 237 | number \code{z} can be retrieved through the read-only attributes |
| 238 | \code{z.real} and \code{z.imag}. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 239 | \obindex{complex} |
| 240 | \indexii{complex}{number} |
| 241 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 242 | \end{description} % Numbers |
| 243 | |
Guido van Rossum | 77f6a65 | 2002-04-03 22:41:51 +0000 | [diff] [blame] | 244 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 245 | \item[Sequences] |
Fred Drake | 230d17d | 2001-02-22 21:28:04 +0000 | [diff] [blame] | 246 | These represent finite ordered sets indexed by non-negative numbers. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 247 | The built-in function \function{len()}\bifuncindex{len} returns the |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 248 | number of items of a sequence. |
Thomas Wouters | f9b526d | 2000-07-16 19:05:38 +0000 | [diff] [blame] | 249 | When the length of a sequence is \var{n}, the |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 250 | index set contains the numbers 0, 1, \ldots, \var{n}-1. Item |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 251 | \var{i} of sequence \var{a} is selected by \code{\var{a}[\var{i}]}. |
Fred Drake | e15956b | 2000-04-03 04:51:13 +0000 | [diff] [blame] | 252 | \obindex{sequence} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 253 | \index{index operation} |
| 254 | \index{item selection} |
| 255 | \index{subscription} |
| 256 | |
| 257 | Sequences also support slicing: \code{\var{a}[\var{i}:\var{j}]} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 258 | selects all items with index \var{k} such that \var{i} \code{<=} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 259 | \var{k} \code{<} \var{j}. When used as an expression, a slice is a |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 260 | sequence of the same type. This implies that the index set is |
| 261 | renumbered so that it starts at 0. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 262 | \index{slicing} |
| 263 | |
Michael W. Hudson | 5efaf7e | 2002-06-11 10:55:12 +0000 | [diff] [blame] | 264 | Some sequences also support ``extended slicing'' with a third ``step'' |
| 265 | parameter: \code{\var{a}[\var{i}:\var{j}:\var{k}]} selects all items |
| 266 | of \var{a} with index \var{x} where \code{\var{x} = \var{i} + |
| 267 | \var{n}*\var{k}}, \var{n} \code{>=} \code{0} and \var{i} \code{<=} |
| 268 | \var{x} \code{<} \var{j}. |
| 269 | \index{extended slicing} |
| 270 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 271 | Sequences are distinguished according to their mutability: |
| 272 | |
| 273 | \begin{description} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 274 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 275 | \item[Immutable sequences] |
| 276 | An object of an immutable sequence type cannot change once it is |
| 277 | created. (If the object contains references to other objects, |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 278 | these other objects may be mutable and may be changed; however, |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 279 | the collection of objects directly referenced by an immutable object |
| 280 | cannot change.) |
| 281 | \obindex{immutable sequence} |
| 282 | \obindex{immutable} |
| 283 | |
| 284 | The following types are immutable sequences: |
| 285 | |
| 286 | \begin{description} |
| 287 | |
| 288 | \item[Strings] |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 289 | The items of a string are characters. There is no separate |
| 290 | character type; a character is represented by a string of one item. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 291 | Characters represent (at least) 8-bit bytes. The built-in |
| 292 | functions \function{chr()}\bifuncindex{chr} and |
| 293 | \function{ord()}\bifuncindex{ord} convert between characters and |
| 294 | nonnegative integers representing the byte values. Bytes with the |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 295 | values 0-127 usually represent the corresponding \ASCII{} values, but |
| 296 | the interpretation of values is up to the program. The string |
| 297 | data type is also used to represent arrays of bytes, e.g., to hold data |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 298 | read from a file. |
| 299 | \obindex{string} |
| 300 | \index{character} |
| 301 | \index{byte} |
Fred Drake | c37b65e | 2001-11-28 07:26:15 +0000 | [diff] [blame] | 302 | \index{ASCII@\ASCII} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 303 | |
Fred Drake | c37b65e | 2001-11-28 07:26:15 +0000 | [diff] [blame] | 304 | (On systems whose native character set is not \ASCII, strings may use |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 305 | EBCDIC in their internal representation, provided the functions |
| 306 | \function{chr()} and \function{ord()} implement a mapping between \ASCII{} and |
| 307 | EBCDIC, and string comparison preserves the \ASCII{} order. |
| 308 | Or perhaps someone can propose a better rule?) |
Fred Drake | c37b65e | 2001-11-28 07:26:15 +0000 | [diff] [blame] | 309 | \index{ASCII@\ASCII} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 310 | \index{EBCDIC} |
| 311 | \index{character set} |
| 312 | \indexii{string}{comparison} |
| 313 | \bifuncindex{chr} |
| 314 | \bifuncindex{ord} |
| 315 | |
Fred Drake | f0aff8e | 2000-04-06 13:57:21 +0000 | [diff] [blame] | 316 | \item[Unicode] |
Fred Drake | 5ec22f2 | 2002-09-24 21:09:13 +0000 | [diff] [blame] | 317 | The items of a Unicode object are Unicode code units. A Unicode code |
| 318 | unit is represented by a Unicode object of one item and can hold |
| 319 | either a 16-bit or 32-bit value representing a Unicode ordinal (the |
| 320 | maximum value for the ordinal is given in \code{sys.maxunicode}, and |
| 321 | depends on how Python is configured at compile time). Surrogate pairs |
| 322 | may be present in the Unicode object, and will be reported as two |
| 323 | separate items. The built-in functions |
Fred Drake | f0aff8e | 2000-04-06 13:57:21 +0000 | [diff] [blame] | 324 | \function{unichr()}\bifuncindex{unichr} and |
Fred Drake | 5ec22f2 | 2002-09-24 21:09:13 +0000 | [diff] [blame] | 325 | \function{ord()}\bifuncindex{ord} convert between code units and |
Fred Drake | f0aff8e | 2000-04-06 13:57:21 +0000 | [diff] [blame] | 326 | nonnegative integers representing the Unicode ordinals as defined in |
| 327 | the Unicode Standard 3.0. Conversion from and to other encodings are |
| 328 | possible through the Unicode method \method{encode} and the built-in |
Fred Drake | 5ec22f2 | 2002-09-24 21:09:13 +0000 | [diff] [blame] | 329 | function \function{unicode()}.\bifuncindex{unicode} |
Fred Drake | f0aff8e | 2000-04-06 13:57:21 +0000 | [diff] [blame] | 330 | \obindex{unicode} |
| 331 | \index{character} |
| 332 | \index{integer} |
Fred Drake | 8b3ce9e | 2000-04-06 14:00:14 +0000 | [diff] [blame] | 333 | \index{Unicode} |
Fred Drake | f0aff8e | 2000-04-06 13:57:21 +0000 | [diff] [blame] | 334 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 335 | \item[Tuples] |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 336 | The items of a tuple are arbitrary Python objects. |
| 337 | Tuples of two or more items are formed by comma-separated lists |
| 338 | of expressions. A tuple of one item (a `singleton') can be formed |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 339 | by affixing a comma to an expression (an expression by itself does |
| 340 | not create a tuple, since parentheses must be usable for grouping of |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 341 | expressions). An empty tuple can be formed by an empty pair of |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 342 | parentheses. |
| 343 | \obindex{tuple} |
| 344 | \indexii{singleton}{tuple} |
| 345 | \indexii{empty}{tuple} |
| 346 | |
| 347 | \end{description} % Immutable sequences |
| 348 | |
| 349 | \item[Mutable sequences] |
| 350 | Mutable sequences can be changed after they are created. The |
| 351 | subscription and slicing notations can be used as the target of |
| 352 | assignment and \keyword{del} (delete) statements. |
Thomas Wouters | f9b526d | 2000-07-16 19:05:38 +0000 | [diff] [blame] | 353 | \obindex{mutable sequence} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 354 | \obindex{mutable} |
| 355 | \indexii{assignment}{statement} |
| 356 | \index{delete} |
| 357 | \stindex{del} |
| 358 | \index{subscription} |
| 359 | \index{slicing} |
| 360 | |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 361 | There is currently a single intrinsic mutable sequence type: |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 362 | |
| 363 | \begin{description} |
| 364 | |
| 365 | \item[Lists] |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 366 | The items of a list are arbitrary Python objects. Lists are formed |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 367 | by placing a comma-separated list of expressions in square brackets. |
| 368 | (Note that there are no special cases needed to form lists of length 0 |
| 369 | or 1.) |
| 370 | \obindex{list} |
| 371 | |
| 372 | \end{description} % Mutable sequences |
| 373 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 374 | The extension module \module{array}\refstmodindex{array} provides an |
| 375 | additional example of a mutable sequence type. |
| 376 | |
| 377 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 378 | \end{description} % Sequences |
| 379 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 380 | \item[Mappings] |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 381 | These represent finite sets of objects indexed by arbitrary index sets. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 382 | The subscript notation \code{a[k]} selects the item indexed |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 383 | by \code{k} from the mapping \code{a}; this can be used in |
| 384 | expressions and as the target of assignments or \keyword{del} statements. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 385 | The built-in function \function{len()} returns the number of items |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 386 | in a mapping. |
| 387 | \bifuncindex{len} |
| 388 | \index{subscription} |
| 389 | \obindex{mapping} |
| 390 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 391 | There is currently a single intrinsic mapping type: |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 392 | |
| 393 | \begin{description} |
| 394 | |
| 395 | \item[Dictionaries] |
Fred Drake | 8cdee96 | 1999-02-23 18:50:38 +0000 | [diff] [blame] | 396 | These\obindex{dictionary} represent finite sets of objects indexed by |
| 397 | nearly arbitrary values. The only types of values not acceptable as |
| 398 | keys are values containing lists or dictionaries or other mutable |
| 399 | types that are compared by value rather than by object identity, the |
| 400 | reason being that the efficient implementation of dictionaries |
| 401 | requires a key's hash value to remain constant. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 402 | Numeric types used for keys obey the normal rules for numeric |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 403 | comparison: if two numbers compare equal (e.g., \code{1} and |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 404 | \code{1.0}) then they can be used interchangeably to index the same |
| 405 | dictionary entry. |
| 406 | |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 407 | Dictionaries are mutable; they can be created by the |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 408 | \code{\{...\}} notation (see section~\ref{dict}, ``Dictionary |
Fred Drake | 8cdee96 | 1999-02-23 18:50:38 +0000 | [diff] [blame] | 409 | Displays''). |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 410 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 411 | The extension modules \module{dbm}\refstmodindex{dbm}, |
| 412 | \module{gdbm}\refstmodindex{gdbm}, \module{bsddb}\refstmodindex{bsddb} |
| 413 | provide additional examples of mapping types. |
| 414 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 415 | \end{description} % Mapping types |
| 416 | |
| 417 | \item[Callable types] |
Fred Drake | 8cdee96 | 1999-02-23 18:50:38 +0000 | [diff] [blame] | 418 | These\obindex{callable} are the types to which the function call |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 419 | operation (see section~\ref{calls}, ``Calls'') can be applied: |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 420 | \indexii{function}{call} |
| 421 | \index{invocation} |
| 422 | \indexii{function}{argument} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 423 | |
| 424 | \begin{description} |
| 425 | |
| 426 | \item[User-defined functions] |
| 427 | A user-defined function object is created by a function definition |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 428 | (see section~\ref{function}, ``Function definitions''). It should be |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 429 | called with an argument |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 430 | list containing the same number of items as the function's formal |
| 431 | parameter list. |
| 432 | \indexii{user-defined}{function} |
| 433 | \obindex{function} |
| 434 | \obindex{user-defined function} |
| 435 | |
Guido van Rossum | 264bd59 | 1999-02-23 16:40:55 +0000 | [diff] [blame] | 436 | Special attributes: \member{func_doc} or \member{__doc__} is the |
Jeremy Hylton | f9b0cc7 | 2003-01-31 18:52:45 +0000 | [diff] [blame] | 437 | function's documentation string, or \code{None} if unavailable; |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 438 | \member{func_name} or \member{__name__} is the function's name; |
Jeremy Hylton | f9b0cc7 | 2003-01-31 18:52:45 +0000 | [diff] [blame] | 439 | \member{__module__} is the name of the module the function was defined |
| 440 | in, or \code{None} if unavailable; |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 441 | \member{func_defaults} is a tuple containing default argument values for |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 442 | those arguments that have defaults, or \code{None} if no arguments |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 443 | have a default value; \member{func_code} is the code object representing |
| 444 | the compiled function body; \member{func_globals} is (a reference to) |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 445 | the dictionary that holds the function's global variables --- it |
Guido van Rossum | dfb658c | 1998-07-23 17:54:36 +0000 | [diff] [blame] | 446 | defines the global namespace of the module in which the function was |
Barry Warsaw | 7a5e80e | 2001-02-27 03:36:30 +0000 | [diff] [blame] | 447 | defined; \member{func_dict} or \member{__dict__} contains the |
Jeremy Hylton | aa90adc | 2001-03-23 17:23:50 +0000 | [diff] [blame] | 448 | namespace supporting arbitrary function attributes; |
| 449 | \member{func_closure} is \code{None} or a tuple of cells that contain |
Jeremy Hylton | 26c49b6 | 2002-04-01 17:58:39 +0000 | [diff] [blame] | 450 | bindings for the function's free variables. |
Barry Warsaw | 7a5e80e | 2001-02-27 03:36:30 +0000 | [diff] [blame] | 451 | |
Jeremy Hylton | 26c49b6 | 2002-04-01 17:58:39 +0000 | [diff] [blame] | 452 | Of these, \member{func_code}, \member{func_defaults}, |
Barry Warsaw | 7a5e80e | 2001-02-27 03:36:30 +0000 | [diff] [blame] | 453 | \member{func_doc}/\member{__doc__}, and |
| 454 | \member{func_dict}/\member{__dict__} may be writable; the |
Jeremy Hylton | aa90adc | 2001-03-23 17:23:50 +0000 | [diff] [blame] | 455 | others can never be changed. Additional information about a |
| 456 | function's definition can be retrieved from its code object; see the |
| 457 | description of internal types below. |
| 458 | |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 459 | \withsubitem{(function attribute)}{ |
| 460 | \ttindex{func_doc} |
| 461 | \ttindex{__doc__} |
| 462 | \ttindex{__name__} |
Jeremy Hylton | f9b0cc7 | 2003-01-31 18:52:45 +0000 | [diff] [blame] | 463 | \ttindex{__module__} |
Barry Warsaw | 7a5e80e | 2001-02-27 03:36:30 +0000 | [diff] [blame] | 464 | \ttindex{__dict__} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 465 | \ttindex{func_defaults} |
Jeremy Hylton | 26c49b6 | 2002-04-01 17:58:39 +0000 | [diff] [blame] | 466 | \ttindex{func_closure} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 467 | \ttindex{func_code} |
Barry Warsaw | 7a5e80e | 2001-02-27 03:36:30 +0000 | [diff] [blame] | 468 | \ttindex{func_globals} |
| 469 | \ttindex{func_dict}} |
Guido van Rossum | dfb658c | 1998-07-23 17:54:36 +0000 | [diff] [blame] | 470 | \indexii{global}{namespace} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 471 | |
| 472 | \item[User-defined methods] |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 473 | A user-defined method object combines a class, a class instance (or |
Fred Drake | 8dd6ffd | 2001-08-02 21:34:53 +0000 | [diff] [blame] | 474 | \code{None}) and any callable object (normally a user-defined |
| 475 | function). |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 476 | \obindex{method} |
| 477 | \obindex{user-defined method} |
| 478 | \indexii{user-defined}{method} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 479 | |
| 480 | Special read-only attributes: \member{im_self} is the class instance |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 481 | object, \member{im_func} is the function object; |
Raymond Hettinger | 03ec6d5 | 2003-06-25 18:29:36 +0000 | [diff] [blame] | 482 | \member{im_class} is the class of \member{im_self} for bound methods |
| 483 | or the class that asked for the method for unbound methods; |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 484 | \member{__doc__} is the method's documentation (same as |
| 485 | \code{im_func.__doc__}); \member{__name__} is the method name (same as |
Jeremy Hylton | f9b0cc7 | 2003-01-31 18:52:45 +0000 | [diff] [blame] | 486 | \code{im_func.__name__}); \member{__module__} is the name of the |
| 487 | module the method was defined in, or \code{None} if unavailable. |
Fred Drake | f9d5803 | 2001-12-07 23:13:53 +0000 | [diff] [blame] | 488 | \versionchanged[\member{im_self} used to refer to the class that |
| 489 | defined the method]{2.2} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 490 | \withsubitem{(method attribute)}{ |
Jeremy Hylton | f9b0cc7 | 2003-01-31 18:52:45 +0000 | [diff] [blame] | 491 | \ttindex{__doc__} |
| 492 | \ttindex{__name__} |
| 493 | \ttindex{__module__} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 494 | \ttindex{im_func} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 495 | \ttindex{im_self}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 496 | |
Barry Warsaw | 7a5e80e | 2001-02-27 03:36:30 +0000 | [diff] [blame] | 497 | Methods also support accessing (but not setting) the arbitrary |
| 498 | function attributes on the underlying function object. |
| 499 | |
Fred Drake | ea690c4 | 2003-07-17 05:26:53 +0000 | [diff] [blame] | 500 | User-defined method objects may be created when getting an attribute |
| 501 | of a class (perhaps via an instance of that class), if that attribute |
| 502 | is a user-defined function object, an unbound user-defined method object, |
| 503 | or a class method object. |
| 504 | When the attribute is a user-defined method object, a new |
| 505 | method object is only created if the class from which it is being |
| 506 | retrieved is the same as, or a derived class of, the class stored |
| 507 | in the original method object; otherwise, the original method object |
| 508 | is used as it is. |
| 509 | |
| 510 | When a user-defined method object is created by retrieving |
| 511 | a user-defined function object from a class, its \member{im_self} |
| 512 | attribute is \code{None} and the method object is said to be unbound. |
| 513 | When one is created by retrieving a user-defined function object |
| 514 | from a class via one of its instances, its \member{im_self} attribute |
| 515 | is the instance, and the method object is said to be bound. |
| 516 | In either case, the new method's \member{im_class} attribute |
| 517 | is the class from which the retrieval takes place, and |
| 518 | its \member{im_func} attribute is the original function object. |
| 519 | \withsubitem{(method attribute)}{ |
| 520 | \ttindex{im_class}\ttindex{im_func}\ttindex{im_self}} |
| 521 | |
| 522 | When a user-defined method object is created by retrieving another |
| 523 | method object from a class or instance, the behaviour is the same |
| 524 | as for a function object, except that the \member{im_func} attribute |
| 525 | of the new instance is not the original method object but its |
| 526 | \member{im_func} attribute. |
| 527 | \withsubitem{(method attribute)}{ |
| 528 | \ttindex{im_func}} |
| 529 | |
| 530 | When a user-defined method object is created by retrieving a |
| 531 | class method object from a class or instance, its \member{im_self} |
| 532 | attribute is the class itself (the same as the \member{im_class} |
| 533 | attribute), and its \member{im_func} attribute is the function |
| 534 | object underlying the class method. |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 535 | \withsubitem{(method attribute)}{ |
Fred Drake | 35c09f2 | 2000-06-28 20:15:47 +0000 | [diff] [blame] | 536 | \ttindex{im_class}\ttindex{im_func}\ttindex{im_self}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 537 | |
| 538 | When an unbound user-defined method object is called, the underlying |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 539 | function (\member{im_func}) is called, with the restriction that the |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 540 | first argument must be an instance of the proper class |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 541 | (\member{im_class}) or of a derived class thereof. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 542 | |
| 543 | When a bound user-defined method object is called, the underlying |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 544 | function (\member{im_func}) is called, inserting the class instance |
| 545 | (\member{im_self}) in front of the argument list. For instance, when |
| 546 | \class{C} is a class which contains a definition for a function |
| 547 | \method{f()}, and \code{x} is an instance of \class{C}, calling |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 548 | \code{x.f(1)} is equivalent to calling \code{C.f(x, 1)}. |
| 549 | |
Fred Drake | ea690c4 | 2003-07-17 05:26:53 +0000 | [diff] [blame] | 550 | When a user-defined method object is derived from a class method object, |
| 551 | the ``class instance'' stored in \member{im_self} will actually be the |
| 552 | class itself, so that calling either \code{x.f(1)} or \code{C.f(1)} is |
| 553 | equivalent to calling \code{f(C,1)} where \code{f} is the underlying |
| 554 | function. |
| 555 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 556 | Note that the transformation from function object to (unbound or |
| 557 | bound) method object happens each time the attribute is retrieved from |
| 558 | the class or instance. In some cases, a fruitful optimization is to |
| 559 | assign the attribute to a local variable and call that local variable. |
| 560 | Also notice that this transformation only happens for user-defined |
| 561 | functions; other callable objects (and all non-callable objects) are |
Fred Drake | 35c09f2 | 2000-06-28 20:15:47 +0000 | [diff] [blame] | 562 | retrieved without transformation. It is also important to note that |
| 563 | user-defined functions which are attributes of a class instance are |
| 564 | not converted to bound methods; this \emph{only} happens when the |
| 565 | function is an attribute of the class. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 566 | |
Fred Drake | e31e9ce | 2001-12-11 21:10:08 +0000 | [diff] [blame] | 567 | \item[Generator functions\index{generator!function}\index{generator!iterator}] |
| 568 | A function or method which uses the \keyword{yield} statement (see |
| 569 | section~\ref{yield}, ``The \keyword{yield} statement'') is called a |
| 570 | \dfn{generator function}. Such a function, when called, always |
| 571 | returns an iterator object which can be used to execute the body of |
| 572 | the function: calling the iterator's \method{next()} method will |
| 573 | cause the function to execute until it provides a value using the |
| 574 | \keyword{yield} statement. When the function executes a |
| 575 | \keyword{return} statement or falls off the end, a |
| 576 | \exception{StopIteration} exception is raised and the iterator will |
| 577 | have reached the end of the set of values to be returned. |
| 578 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 579 | \item[Built-in functions] |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 580 | A built-in function object is a wrapper around a \C{} function. Examples |
| 581 | of built-in functions are \function{len()} and \function{math.sin()} |
| 582 | (\module{math} is a standard built-in module). |
| 583 | The number and type of the arguments are |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 584 | determined by the C function. |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 585 | Special read-only attributes: \member{__doc__} is the function's |
| 586 | documentation string, or \code{None} if unavailable; \member{__name__} |
| 587 | is the function's name; \member{__self__} is set to \code{None} (but see |
Jeremy Hylton | f9b0cc7 | 2003-01-31 18:52:45 +0000 | [diff] [blame] | 588 | the next item); \member{__module__} is the name of the module the |
| 589 | function was defined in or \code{None} if unavailable. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 590 | \obindex{built-in function} |
| 591 | \obindex{function} |
| 592 | \indexii{C}{language} |
| 593 | |
| 594 | \item[Built-in methods] |
| 595 | This is really a different disguise of a built-in function, this time |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 596 | containing an object passed to the C function as an implicit extra |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 597 | argument. An example of a built-in method is |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 598 | \code{\var{alist}.append()}, assuming |
| 599 | \var{alist} is a list object. |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 600 | In this case, the special read-only attribute \member{__self__} is set |
Fred Drake | e31e9ce | 2001-12-11 21:10:08 +0000 | [diff] [blame] | 601 | to the object denoted by \var{list}. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 602 | \obindex{built-in method} |
| 603 | \obindex{method} |
| 604 | \indexii{built-in}{method} |
| 605 | |
Fred Drake | e37b4ed | 2003-07-15 20:45:16 +0000 | [diff] [blame] | 606 | \item[Class Types] |
| 607 | Class types, or ``new-style classes,'' are callable. These objects |
| 608 | normally act as factories for new instances of themselves, but |
| 609 | variations are possible for class types that override |
| 610 | \method{__new__()}. The arguments of the call are passed to |
| 611 | \method{__new__()} and, in the typical case, to \method{__init__()} to |
| 612 | initialize the new instance. |
| 613 | |
| 614 | \item[Classic Classes] |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 615 | Class objects are described below. When a class object is called, |
| 616 | a new class instance (also described below) is created and |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 617 | returned. This implies a call to the class's \method{__init__()} method |
| 618 | if it has one. Any arguments are passed on to the \method{__init__()} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 619 | method. If there is no \method{__init__()} method, the class must be called |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 620 | without arguments. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 621 | \withsubitem{(object method)}{\ttindex{__init__()}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 622 | \obindex{class} |
| 623 | \obindex{class instance} |
| 624 | \obindex{instance} |
| 625 | \indexii{class object}{call} |
| 626 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 627 | \item[Class instances] |
| 628 | Class instances are described below. Class instances are callable |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 629 | only when the class has a \method{__call__()} method; \code{x(arguments)} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 630 | is a shorthand for \code{x.__call__(arguments)}. |
| 631 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 632 | \end{description} |
| 633 | |
| 634 | \item[Modules] |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 635 | Modules are imported by the \keyword{import} statement (see |
Fred Drake | d51ce7d | 2003-07-15 22:03:00 +0000 | [diff] [blame] | 636 | section~\ref{import}, ``The \keyword{import} statement'').% |
| 637 | \stindex{import}\obindex{module} |
Guido van Rossum | dfb658c | 1998-07-23 17:54:36 +0000 | [diff] [blame] | 638 | A module object has a namespace implemented by a dictionary object |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 639 | (this is the dictionary referenced by the func_globals attribute of |
| 640 | functions defined in the module). Attribute references are translated |
| 641 | to lookups in this dictionary, e.g., \code{m.x} is equivalent to |
| 642 | \code{m.__dict__["x"]}. |
| 643 | A module object does not contain the code object used to |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 644 | initialize the module (since it isn't needed once the initialization |
| 645 | is done). |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 646 | |
Guido van Rossum | dfb658c | 1998-07-23 17:54:36 +0000 | [diff] [blame] | 647 | Attribute assignment updates the module's namespace dictionary, |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 648 | e.g., \samp{m.x = 1} is equivalent to \samp{m.__dict__["x"] = 1}. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 649 | |
Guido van Rossum | dfb658c | 1998-07-23 17:54:36 +0000 | [diff] [blame] | 650 | Special read-only attribute: \member{__dict__} is the module's |
| 651 | namespace as a dictionary object. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 652 | \withsubitem{(module attribute)}{\ttindex{__dict__}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 653 | |
| 654 | Predefined (writable) attributes: \member{__name__} |
| 655 | is the module's name; \member{__doc__} is the |
| 656 | module's documentation string, or |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 657 | \code{None} if unavailable; \member{__file__} is the pathname of the |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 658 | file from which the module was loaded, if it was loaded from a file. |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 659 | The \member{__file__} attribute is not present for C{} modules that are |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 660 | statically linked into the interpreter; for extension modules loaded |
| 661 | dynamically from a shared library, it is the pathname of the shared |
| 662 | library file. |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 663 | \withsubitem{(module attribute)}{ |
| 664 | \ttindex{__name__} |
| 665 | \ttindex{__doc__} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 666 | \ttindex{__file__}} |
Guido van Rossum | dfb658c | 1998-07-23 17:54:36 +0000 | [diff] [blame] | 667 | \indexii{module}{namespace} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 668 | |
| 669 | \item[Classes] |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 670 | Class objects are created by class definitions (see |
| 671 | section~\ref{class}, ``Class definitions''). |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 672 | A class has a namespace implemented by a dictionary object. |
| 673 | Class attribute references are translated to |
| 674 | lookups in this dictionary, |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 675 | e.g., \samp{C.x} is translated to \samp{C.__dict__["x"]}. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 676 | When the attribute name is not found |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 677 | there, the attribute search continues in the base classes. The search |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 678 | is depth-first, left-to-right in the order of occurrence in the |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 679 | base class list. |
Fred Drake | ea690c4 | 2003-07-17 05:26:53 +0000 | [diff] [blame] | 680 | |
| 681 | When a class attribute reference (for class \class{C}, say) |
| 682 | would yield a user-defined function object or |
| 683 | an unbound user-defined method object whose associated class is either |
| 684 | \class{C} or one of its base classes, it is transformed into an unbound |
| 685 | user-defined method object whose \member{im_class} attribute is~\class{C}. |
| 686 | When it would yield a class method object, it is transformed into |
| 687 | a bound user-defined method object whose \member{im_class} and |
| 688 | \member{im_self} attributes are both~\class{C}. When it would yield |
| 689 | a static method object, it is transformed into the object wrapped |
| 690 | by the static method object. See section~\ref{descriptors} for another |
| 691 | way in which attributes retrieved from a class may differ from those |
| 692 | actually contained in its \member{__dict__}. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 693 | \obindex{class} |
| 694 | \obindex{class instance} |
| 695 | \obindex{instance} |
| 696 | \indexii{class object}{call} |
| 697 | \index{container} |
| 698 | \obindex{dictionary} |
| 699 | \indexii{class}{attribute} |
| 700 | |
| 701 | Class attribute assignments update the class's dictionary, never the |
| 702 | dictionary of a base class. |
| 703 | \indexiii{class}{attribute}{assignment} |
| 704 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 705 | A class object can be called (see above) to yield a class instance (see |
| 706 | below). |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 707 | \indexii{class object}{call} |
| 708 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 709 | Special attributes: \member{__name__} is the class name; |
| 710 | \member{__module__} is the module name in which the class was defined; |
Guido van Rossum | dfb658c | 1998-07-23 17:54:36 +0000 | [diff] [blame] | 711 | \member{__dict__} is the dictionary containing the class's namespace; |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 712 | \member{__bases__} is a tuple (possibly empty or a singleton) |
| 713 | containing the base classes, in the order of their occurrence in the |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 714 | base class list; \member{__doc__} is the class's documentation string, |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 715 | or None if undefined. |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 716 | \withsubitem{(class attribute)}{ |
| 717 | \ttindex{__name__} |
| 718 | \ttindex{__module__} |
| 719 | \ttindex{__dict__} |
| 720 | \ttindex{__bases__} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 721 | \ttindex{__doc__}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 722 | |
| 723 | \item[Class instances] |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 724 | A class instance is created by calling a class object (see above). |
| 725 | A class instance has a namespace implemented as a dictionary which |
| 726 | is the first place in which |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 727 | attribute references are searched. When an attribute is not found |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 728 | there, and the instance's class has an attribute by that name, |
| 729 | the search continues with the class attributes. If a class attribute |
Fred Drake | ea690c4 | 2003-07-17 05:26:53 +0000 | [diff] [blame] | 730 | is found that is a user-defined function object or an unbound |
| 731 | user-defined method object whose associated class is the class |
| 732 | (call it~\class{C}) of the instance for which the attribute reference |
| 733 | was initiated or one of its bases, |
| 734 | it is transformed into a bound user-defined method object whose |
| 735 | \member{im_class} attribute is~\class{C} whose \member{im_self} attribute |
| 736 | is the instance. Static method and class method objects are also |
| 737 | transformed, as if they had been retrieved from class~\class{C}; |
| 738 | see above under ``Classes''. See section~\ref{descriptors} for |
| 739 | another way in which attributes of a class retrieved via its |
| 740 | instances may differ from the objects actually stored in the |
| 741 | class's \member{__dict__}. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 742 | If no class attribute is found, and the object's class has a |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 743 | \method{__getattr__()} method, that is called to satisfy the lookup. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 744 | \obindex{class instance} |
| 745 | \obindex{instance} |
| 746 | \indexii{class}{instance} |
| 747 | \indexii{class instance}{attribute} |
| 748 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 749 | Attribute assignments and deletions update the instance's dictionary, |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 750 | never a class's dictionary. If the class has a \method{__setattr__()} or |
| 751 | \method{__delattr__()} method, this is called instead of updating the |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 752 | instance dictionary directly. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 753 | \indexiii{class instance}{attribute}{assignment} |
| 754 | |
| 755 | Class instances can pretend to be numbers, sequences, or mappings if |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 756 | they have methods with certain special names. See |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 757 | section~\ref{specialnames}, ``Special method names.'' |
Fred Drake | e15956b | 2000-04-03 04:51:13 +0000 | [diff] [blame] | 758 | \obindex{numeric} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 759 | \obindex{sequence} |
| 760 | \obindex{mapping} |
| 761 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 762 | Special attributes: \member{__dict__} is the attribute |
| 763 | dictionary; \member{__class__} is the instance's class. |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 764 | \withsubitem{(instance attribute)}{ |
| 765 | \ttindex{__dict__} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 766 | \ttindex{__class__}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 767 | |
| 768 | \item[Files] |
Fred Drake | e15eb35 | 1999-11-10 16:13:25 +0000 | [diff] [blame] | 769 | A file\obindex{file} object represents an open file. File objects are |
| 770 | created by the \function{open()}\bifuncindex{open} built-in function, |
| 771 | and also by |
| 772 | \withsubitem{(in module os)}{\ttindex{popen()}}\function{os.popen()}, |
| 773 | \function{os.fdopen()}, and the |
| 774 | \method{makefile()}\withsubitem{(socket method)}{\ttindex{makefile()}} |
| 775 | method of socket objects (and perhaps by other functions or methods |
| 776 | provided by extension modules). The objects |
| 777 | \ttindex{sys.stdin}\code{sys.stdin}, |
| 778 | \ttindex{sys.stdout}\code{sys.stdout} and |
| 779 | \ttindex{sys.stderr}\code{sys.stderr} are initialized to file objects |
| 780 | corresponding to the interpreter's standard\index{stdio} input, output |
| 781 | and error streams. See the \citetitle[../lib/lib.html]{Python Library |
| 782 | Reference} for complete documentation of file objects. |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 783 | \withsubitem{(in module sys)}{ |
| 784 | \ttindex{stdin} |
| 785 | \ttindex{stdout} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 786 | \ttindex{stderr}} |
Fred Drake | e15eb35 | 1999-11-10 16:13:25 +0000 | [diff] [blame] | 787 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 788 | |
| 789 | \item[Internal types] |
| 790 | A few types used internally by the interpreter are exposed to the user. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 791 | Their definitions may change with future versions of the interpreter, |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 792 | but they are mentioned here for completeness. |
| 793 | \index{internal type} |
| 794 | \index{types, internal} |
| 795 | |
| 796 | \begin{description} |
| 797 | |
| 798 | \item[Code objects] |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 799 | Code objects represent \emph{byte-compiled} executable Python code, or |
| 800 | \emph{bytecode}. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 801 | The difference between a code |
| 802 | object and a function object is that the function object contains an |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 803 | explicit reference to the function's globals (the module in which it |
| 804 | was defined), while a code object contains no context; |
| 805 | also the default argument values are stored in the function object, |
| 806 | not in the code object (because they represent values calculated at |
| 807 | run-time). Unlike function objects, code objects are immutable and |
| 808 | contain no references (directly or indirectly) to mutable objects. |
| 809 | \index{bytecode} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 810 | \obindex{code} |
| 811 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 812 | Special read-only attributes: \member{co_name} gives the function |
| 813 | name; \member{co_argcount} is the number of positional arguments |
| 814 | (including arguments with default values); \member{co_nlocals} is the |
| 815 | number of local variables used by the function (including arguments); |
| 816 | \member{co_varnames} is a tuple containing the names of the local |
Jeremy Hylton | aa90adc | 2001-03-23 17:23:50 +0000 | [diff] [blame] | 817 | variables (starting with the argument names); \member{co_cellvars} is |
| 818 | a tuple containing the names of local variables that are referenced by |
| 819 | nested functions; \member{co_freevars} is a tuple containing the names |
Jeremy Hylton | 8392f36 | 2002-04-01 18:53:36 +0000 | [diff] [blame] | 820 | of free variables; \member{co_code} is a string representing the |
| 821 | sequence of bytecode instructions; |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 822 | \member{co_consts} is a tuple containing the literals used by the |
| 823 | bytecode; \member{co_names} is a tuple containing the names used by |
| 824 | the bytecode; \member{co_filename} is the filename from which the code |
| 825 | was compiled; \member{co_firstlineno} is the first line number of the |
| 826 | function; \member{co_lnotab} is a string encoding the mapping from |
Thomas Wouters | f9b526d | 2000-07-16 19:05:38 +0000 | [diff] [blame] | 827 | byte code offsets to line numbers (for details see the source code of |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 828 | the interpreter); \member{co_stacksize} is the required stack size |
| 829 | (including local variables); \member{co_flags} is an integer encoding |
| 830 | a number of flags for the interpreter. |
Jeremy Hylton | aa90adc | 2001-03-23 17:23:50 +0000 | [diff] [blame] | 831 | |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 832 | \withsubitem{(code object attribute)}{ |
| 833 | \ttindex{co_argcount} |
| 834 | \ttindex{co_code} |
| 835 | \ttindex{co_consts} |
| 836 | \ttindex{co_filename} |
| 837 | \ttindex{co_firstlineno} |
| 838 | \ttindex{co_flags} |
| 839 | \ttindex{co_lnotab} |
| 840 | \ttindex{co_name} |
| 841 | \ttindex{co_names} |
| 842 | \ttindex{co_nlocals} |
| 843 | \ttindex{co_stacksize} |
Jeremy Hylton | aa90adc | 2001-03-23 17:23:50 +0000 | [diff] [blame] | 844 | \ttindex{co_varnames} |
| 845 | \ttindex{co_cellvars} |
| 846 | \ttindex{co_freevars}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 847 | |
Fred Drake | e15956b | 2000-04-03 04:51:13 +0000 | [diff] [blame] | 848 | The following flag bits are defined for \member{co_flags}: bit |
| 849 | \code{0x04} is set if the function uses the \samp{*arguments} syntax |
| 850 | to accept an arbitrary number of positional arguments; bit |
| 851 | \code{0x08} is set if the function uses the \samp{**keywords} syntax |
Jeremy Hylton | 8392f36 | 2002-04-01 18:53:36 +0000 | [diff] [blame] | 852 | to accept arbitrary keyword arguments; bit \code{0x20} is set if the |
Brett Cannon | 9e6fedd | 2003-06-15 22:57:44 +0000 | [diff] [blame] | 853 | function is a generator. |
| 854 | \obindex{generator} |
Jeremy Hylton | 8392f36 | 2002-04-01 18:53:36 +0000 | [diff] [blame] | 855 | |
| 856 | Future feature declarations (\samp{from __future__ import division}) |
| 857 | also use bits in \member{co_flags} to indicate whether a code object |
| 858 | was compiled with a particular feature enabled: bit \code{0x2000} is |
| 859 | set if the function was compiled with future division enabled; bits |
| 860 | \code{0x10} and \code{0x1000} were used in earlier versions of Python. |
| 861 | |
| 862 | Other bits in \member{co_flags} are reserved for internal use. |
| 863 | |
| 864 | If\index{documentation string} a code object represents a function, |
| 865 | the first item in |
Jeremy Hylton | aa90adc | 2001-03-23 17:23:50 +0000 | [diff] [blame] | 866 | \member{co_consts} is the documentation string of the function, or |
| 867 | \code{None} if undefined. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 868 | |
| 869 | \item[Frame objects] |
| 870 | Frame objects represent execution frames. They may occur in traceback |
| 871 | objects (see below). |
| 872 | \obindex{frame} |
| 873 | |
| 874 | Special read-only attributes: \member{f_back} is to the previous |
| 875 | stack frame (towards the caller), or \code{None} if this is the bottom |
| 876 | stack frame; \member{f_code} is the code object being executed in this |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 877 | frame; \member{f_locals} is the dictionary used to look up local |
| 878 | variables; \member{f_globals} is used for global variables; |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 879 | \member{f_builtins} is used for built-in (intrinsic) names; |
| 880 | \member{f_restricted} is a flag indicating whether the function is |
Michael W. Hudson | cfd3884 | 2002-12-17 16:15:34 +0000 | [diff] [blame] | 881 | executing in restricted execution mode; \member{f_lasti} gives the |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 882 | precise instruction (this is an index into the bytecode string of |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 883 | the code object). |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 884 | \withsubitem{(frame attribute)}{ |
| 885 | \ttindex{f_back} |
| 886 | \ttindex{f_code} |
| 887 | \ttindex{f_globals} |
| 888 | \ttindex{f_locals} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 889 | \ttindex{f_lasti} |
| 890 | \ttindex{f_builtins} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 891 | \ttindex{f_restricted}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 892 | |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 893 | Special writable attributes: \member{f_trace}, if not \code{None}, is a |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 894 | function called at the start of each source code line (this is used by |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 895 | the debugger); \member{f_exc_type}, \member{f_exc_value}, |
| 896 | \member{f_exc_traceback} represent the most recent exception caught in |
Michael W. Hudson | cfd3884 | 2002-12-17 16:15:34 +0000 | [diff] [blame] | 897 | this frame; \member{f_lineno} is the current line number of the frame |
| 898 | --- writing to this from within a trace function jumps to the given line |
| 899 | (only for the bottom-most frame). A debugger can implement a Jump |
| 900 | command (aka Set Next Statement) by writing to f_lineno. |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 901 | \withsubitem{(frame attribute)}{ |
| 902 | \ttindex{f_trace} |
| 903 | \ttindex{f_exc_type} |
| 904 | \ttindex{f_exc_value} |
Michael W. Hudson | cfd3884 | 2002-12-17 16:15:34 +0000 | [diff] [blame] | 905 | \ttindex{f_exc_traceback} |
| 906 | \ttindex{f_lineno}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 907 | |
| 908 | \item[Traceback objects] \label{traceback} |
| 909 | Traceback objects represent a stack trace of an exception. A |
| 910 | traceback object is created when an exception occurs. When the search |
| 911 | for an exception handler unwinds the execution stack, at each unwound |
| 912 | level a traceback object is inserted in front of the current |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 913 | traceback. When an exception handler is entered, the stack trace is |
| 914 | made available to the program. |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 915 | (See section~\ref{try}, ``The \code{try} statement.'') |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 916 | It is accessible as \code{sys.exc_traceback}, and also as the third |
| 917 | item of the tuple returned by \code{sys.exc_info()}. The latter is |
| 918 | the preferred interface, since it works correctly when the program is |
| 919 | using multiple threads. |
| 920 | When the program contains no suitable handler, the stack trace is written |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 921 | (nicely formatted) to the standard error stream; if the interpreter is |
| 922 | interactive, it is also made available to the user as |
| 923 | \code{sys.last_traceback}. |
| 924 | \obindex{traceback} |
| 925 | \indexii{stack}{trace} |
| 926 | \indexii{exception}{handler} |
| 927 | \indexii{execution}{stack} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 928 | \withsubitem{(in module sys)}{ |
| 929 | \ttindex{exc_info} |
| 930 | \ttindex{exc_traceback} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 931 | \ttindex{last_traceback}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 932 | \ttindex{sys.exc_info} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 933 | \ttindex{sys.exc_traceback} |
| 934 | \ttindex{sys.last_traceback} |
| 935 | |
| 936 | Special read-only attributes: \member{tb_next} is the next level in the |
| 937 | stack trace (towards the frame where the exception occurred), or |
| 938 | \code{None} if there is no next level; \member{tb_frame} points to the |
| 939 | execution frame of the current level; \member{tb_lineno} gives the line |
| 940 | number where the exception occurred; \member{tb_lasti} indicates the |
| 941 | precise instruction. The line number and last instruction in the |
| 942 | traceback may differ from the line number of its frame object if the |
| 943 | exception occurred in a \keyword{try} statement with no matching |
| 944 | except clause or with a finally clause. |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 945 | \withsubitem{(traceback attribute)}{ |
| 946 | \ttindex{tb_next} |
| 947 | \ttindex{tb_frame} |
| 948 | \ttindex{tb_lineno} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 949 | \ttindex{tb_lasti}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 950 | \stindex{try} |
| 951 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 952 | \item[Slice objects] |
| 953 | Slice objects are used to represent slices when \emph{extended slice |
| 954 | syntax} is used. This is a slice using two colons, or multiple slices |
| 955 | or ellipses separated by commas, e.g., \code{a[i:j:step]}, \code{a[i:j, |
Raymond Hettinger | 03ec6d5 | 2003-06-25 18:29:36 +0000 | [diff] [blame] | 956 | k:l]}, or \code{a[..., i:j]}. They are also created by the built-in |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 957 | \function{slice()}\bifuncindex{slice} function. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 958 | |
Thomas Wouters | f9b526d | 2000-07-16 19:05:38 +0000 | [diff] [blame] | 959 | Special read-only attributes: \member{start} is the lower bound; |
| 960 | \member{stop} is the upper bound; \member{step} is the step value; each is |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 961 | \code{None} if omitted. These attributes can have any type. |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 962 | \withsubitem{(slice object attribute)}{ |
| 963 | \ttindex{start} |
| 964 | \ttindex{stop} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 965 | \ttindex{step}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 966 | |
Fred Drake | 5ec22f2 | 2002-09-24 21:09:13 +0000 | [diff] [blame] | 967 | Slice objects support one method: |
| 968 | |
| 969 | \begin{methoddesc}[slice]{indices}{self, length} |
| 970 | This method takes a single integer argument \var{length} and computes |
| 971 | information about the extended slice that the slice object would |
| 972 | describe if applied to a sequence of \var{length} items. It returns a |
| 973 | tuple of three integers; respectively these are the \var{start} and |
| 974 | \var{stop} indices and the \var{step} or stride length of the slice. |
| 975 | Missing or out-of-bounds indices are handled in a manner consistent |
| 976 | with regular slices. |
Michael W. Hudson | f0d777c | 2002-07-19 15:47:06 +0000 | [diff] [blame] | 977 | \versionadded{2.3} |
Fred Drake | 5ec22f2 | 2002-09-24 21:09:13 +0000 | [diff] [blame] | 978 | \end{methoddesc} |
Michael W. Hudson | f0d777c | 2002-07-19 15:47:06 +0000 | [diff] [blame] | 979 | |
Fred Drake | ea690c4 | 2003-07-17 05:26:53 +0000 | [diff] [blame] | 980 | \item[Static method objects] |
| 981 | Static method objects provide a way of defeating the transformation |
| 982 | of function objects to method objects described above. A static method |
| 983 | object is a wrapper around any other object, usually a user-defined |
| 984 | method object. When a static method object is retrieved from a class |
| 985 | or a class instance, the object actually returned is the wrapped object, |
| 986 | which is not subject to any further transformation. Static method |
| 987 | objects are not themselves callable, although the objects they |
| 988 | wrap usually are. Static method objects are created by the built-in |
| 989 | \function{staticmethod()} constructor. |
| 990 | |
| 991 | \item[Class method objects] |
| 992 | A class method object, like a static method object, is a wrapper |
| 993 | around another object that alters the way in which that object |
| 994 | is retrieved from classes and class instances. The behaviour of |
| 995 | class method objects upon such retrieval is described above, |
| 996 | under ``User-defined methods''. Class method objects are created |
| 997 | by the built-in \function{classmethod()} constructor. |
| 998 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 999 | \end{description} % Internal types |
| 1000 | |
| 1001 | \end{description} % Types |
| 1002 | |
| 1003 | |
Fred Drake | 61c7728 | 1998-07-28 19:34:22 +0000 | [diff] [blame] | 1004 | \section{Special method names\label{specialnames}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1005 | |
| 1006 | A class can implement certain operations that are invoked by special |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1007 | syntax (such as arithmetic operations or subscripting and slicing) by |
Fred Drake | 7af9f4d | 2003-05-12 13:50:11 +0000 | [diff] [blame] | 1008 | defining methods with special names.\indexii{operator}{overloading} |
| 1009 | This is Python's approach to \dfn{operator overloading}, allowing |
| 1010 | classes to define their own behavior with respect to language |
| 1011 | operators. For instance, if a class defines |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1012 | a method named \method{__getitem__()}, and \code{x} is an instance of |
| 1013 | this class, then \code{x[i]} is equivalent to |
Raymond Hettinger | 9415309 | 2002-05-12 03:09:25 +0000 | [diff] [blame] | 1014 | \code{x.__getitem__(i)}. Except where mentioned, attempts to execute |
| 1015 | an operation raise an exception when no appropriate method is defined. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1016 | \withsubitem{(mapping object method)}{\ttindex{__getitem__()}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1017 | |
Fred Drake | 0c47559 | 2000-12-07 04:49:34 +0000 | [diff] [blame] | 1018 | When implementing a class that emulates any built-in type, it is |
| 1019 | important that the emulation only be implemented to the degree that it |
| 1020 | makes sense for the object being modelled. For example, some |
| 1021 | sequences may work well with retrieval of individual elements, but |
| 1022 | extracting a slice may not make sense. (One example of this is the |
| 1023 | \class{NodeList} interface in the W3C's Document Object Model.) |
| 1024 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1025 | |
Fred Drake | 61c7728 | 1998-07-28 19:34:22 +0000 | [diff] [blame] | 1026 | \subsection{Basic customization\label{customization}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1027 | |
Fred Drake | 044bb4d | 2001-08-02 15:53:05 +0000 | [diff] [blame] | 1028 | \begin{methoddesc}[object]{__init__}{self\optional{, \moreargs}} |
| 1029 | Called\indexii{class}{constructor} when the instance is created. The |
| 1030 | arguments are those passed to the class constructor expression. If a |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1031 | base class has an \method{__init__()} method, the derived class's |
| 1032 | \method{__init__()} method, if any, must explicitly call it to ensure proper |
Fred Drake | 044bb4d | 2001-08-02 15:53:05 +0000 | [diff] [blame] | 1033 | initialization of the base class part of the instance; for example: |
| 1034 | \samp{BaseClass.__init__(\var{self}, [\var{args}...])}. As a special |
Martin v. Löwis | 95cf84a | 2003-10-19 07:32:24 +0000 | [diff] [blame] | 1035 | constraint on constructors, no value may be returned; doing so will |
Fred Drake | 044bb4d | 2001-08-02 15:53:05 +0000 | [diff] [blame] | 1036 | cause a \exception{TypeError} to be raised at runtime. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1037 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1038 | |
| 1039 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1040 | \begin{methoddesc}[object]{__del__}{self} |
Guido van Rossum | 7c0240f | 1998-07-24 15:36:43 +0000 | [diff] [blame] | 1041 | Called when the instance is about to be destroyed. This is also |
| 1042 | called a destructor\index{destructor}. If a base class |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1043 | has a \method{__del__()} method, the derived class's \method{__del__()} |
| 1044 | method, if any, |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1045 | must explicitly call it to ensure proper deletion of the base class |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1046 | part of the instance. Note that it is possible (though not recommended!) |
| 1047 | for the \method{__del__()} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1048 | method to postpone destruction of the instance by creating a new |
| 1049 | reference to it. It may then be called at a later time when this new |
| 1050 | reference is deleted. It is not guaranteed that |
| 1051 | \method{__del__()} methods are called for objects that still exist when |
| 1052 | the interpreter exits. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1053 | \stindex{del} |
| 1054 | |
Fred Drake | 591dd8f | 2001-12-14 22:52:41 +0000 | [diff] [blame] | 1055 | \begin{notice} |
| 1056 | \samp{del x} doesn't directly call |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1057 | \code{x.__del__()} --- the former decrements the reference count for |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1058 | \code{x} by one, and the latter is only called when \code{x}'s reference |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1059 | count reaches zero. Some common situations that may prevent the |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1060 | reference count of an object from going to zero include: circular |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1061 | references between objects (e.g., a doubly-linked list or a tree data |
| 1062 | structure with parent and child pointers); a reference to the object |
| 1063 | on the stack frame of a function that caught an exception (the |
| 1064 | traceback stored in \code{sys.exc_traceback} keeps the stack frame |
| 1065 | alive); or a reference to the object on the stack frame that raised an |
| 1066 | unhandled exception in interactive mode (the traceback stored in |
| 1067 | \code{sys.last_traceback} keeps the stack frame alive). The first |
| 1068 | situation can only be remedied by explicitly breaking the cycles; the |
Fred Drake | 591dd8f | 2001-12-14 22:52:41 +0000 | [diff] [blame] | 1069 | latter two situations can be resolved by storing \code{None} in |
| 1070 | \code{sys.exc_traceback} or \code{sys.last_traceback}. Circular |
| 1071 | references which are garbage are detected when the option cycle |
| 1072 | detector is enabled (it's on by default), but can only be cleaned up |
| 1073 | if there are no Python-level \method{__del__()} methods involved. |
| 1074 | Refer to the documentation for the \ulink{\module{gc} |
| 1075 | module}{../lib/module-gc.html} for more information about how |
| 1076 | \method{__del__()} methods are handled by the cycle detector, |
| 1077 | particularly the description of the \code{garbage} value. |
| 1078 | \end{notice} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1079 | |
Fred Drake | 591dd8f | 2001-12-14 22:52:41 +0000 | [diff] [blame] | 1080 | \begin{notice}[warning] |
| 1081 | Due to the precarious circumstances under which |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1082 | \method{__del__()} methods are invoked, exceptions that occur during their |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1083 | execution are ignored, and a warning is printed to \code{sys.stderr} |
Fred Drake | 591dd8f | 2001-12-14 22:52:41 +0000 | [diff] [blame] | 1084 | instead. Also, when \method{__del__()} is invoked in response to a module |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1085 | being deleted (e.g., when execution of the program is done), other |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1086 | globals referenced by the \method{__del__()} method may already have been |
| 1087 | deleted. For this reason, \method{__del__()} methods should do the |
Raymond Hettinger | a0e4d6c | 2002-09-08 21:10:54 +0000 | [diff] [blame] | 1088 | absolute minimum needed to maintain external invariants. Starting with |
| 1089 | version 1.5, Python guarantees that globals whose name begins with a single |
| 1090 | underscore are deleted from their module before other globals are deleted; |
| 1091 | if no other references to such globals exist, this may help in assuring that |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1092 | imported modules are still available at the time when the |
Fred Drake | 591dd8f | 2001-12-14 22:52:41 +0000 | [diff] [blame] | 1093 | \method{__del__()} method is called. |
| 1094 | \end{notice} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1095 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1096 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1097 | \begin{methoddesc}[object]{__repr__}{self} |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 1098 | Called by the \function{repr()}\bifuncindex{repr} built-in function |
| 1099 | and by string conversions (reverse quotes) to compute the ``official'' |
Andrew M. Kuchling | 68abe83 | 2000-12-19 14:09:21 +0000 | [diff] [blame] | 1100 | string representation of an object. If at all possible, this should |
Guido van Rossum | 035f7e8 | 2000-12-19 04:18:13 +0000 | [diff] [blame] | 1101 | look like a valid Python expression that could be used to recreate an |
| 1102 | object with the same value (given an appropriate environment). If |
| 1103 | this is not possible, a string of the form \samp{<\var{...some useful |
| 1104 | description...}>} should be returned. The return value must be a |
| 1105 | string object. |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1106 | If a class defines \method{__repr__()} but not \method{__str__()}, |
| 1107 | then \method{__repr__()} is also used when an ``informal'' string |
| 1108 | representation of instances of that class is required. |
Guido van Rossum | 035f7e8 | 2000-12-19 04:18:13 +0000 | [diff] [blame] | 1109 | |
| 1110 | This is typically used for debugging, so it is important that the |
| 1111 | representation is information-rich and unambiguous. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1112 | \indexii{string}{conversion} |
| 1113 | \indexii{reverse}{quotes} |
| 1114 | \indexii{backward}{quotes} |
| 1115 | \index{back-quotes} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1116 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1117 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1118 | \begin{methoddesc}[object]{__str__}{self} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1119 | Called by the \function{str()}\bifuncindex{str} built-in function and |
| 1120 | by the \keyword{print}\stindex{print} statement to compute the |
Fred Drake | 8238587 | 1998-10-01 20:40:43 +0000 | [diff] [blame] | 1121 | ``informal'' string representation of an object. This differs from |
| 1122 | \method{__repr__()} in that it does not have to be a valid Python |
| 1123 | expression: a more convenient or concise representation may be used |
Guido van Rossum | 035f7e8 | 2000-12-19 04:18:13 +0000 | [diff] [blame] | 1124 | instead. The return value must be a string object. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1125 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1126 | |
Guido van Rossum | ab782dd | 2001-01-18 15:17:06 +0000 | [diff] [blame] | 1127 | \begin{methoddesc}[object]{__lt__}{self, other} |
| 1128 | \methodline[object]{__le__}{self, other} |
| 1129 | \methodline[object]{__eq__}{self, other} |
| 1130 | \methodline[object]{__ne__}{self, other} |
| 1131 | \methodline[object]{__gt__}{self, other} |
| 1132 | \methodline[object]{__ge__}{self, other} |
| 1133 | \versionadded{2.1} |
| 1134 | These are the so-called ``rich comparison'' methods, and are called |
| 1135 | for comparison operators in preference to \method{__cmp__()} below. |
| 1136 | The correspondence between operator symbols and method names is as |
| 1137 | follows: |
| 1138 | \code{\var{x}<\var{y}} calls \code{\var{x}.__lt__(\var{y})}, |
| 1139 | \code{\var{x}<=\var{y}} calls \code{\var{x}.__le__(\var{y})}, |
| 1140 | \code{\var{x}==\var{y}} calls \code{\var{x}.__eq__(\var{y})}, |
| 1141 | \code{\var{x}!=\var{y}} and \code{\var{x}<>\var{y}} call |
| 1142 | \code{\var{x}.__ne__(\var{y})}, |
| 1143 | \code{\var{x}>\var{y}} calls \code{\var{x}.__gt__(\var{y})}, and |
| 1144 | \code{\var{x}>=\var{y}} calls \code{\var{x}.__ge__(\var{y})}. |
| 1145 | These methods can return any value, but if the comparison operator is |
| 1146 | used in a Boolean context, the return value should be interpretable as |
| 1147 | a Boolean value, else a \exception{TypeError} will be raised. |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1148 | By convention, \code{False} is used for false and \code{True} for true. |
Guido van Rossum | ab782dd | 2001-01-18 15:17:06 +0000 | [diff] [blame] | 1149 | |
Raymond Hettinger | 4d6e8fe | 2003-07-16 19:40:23 +0000 | [diff] [blame] | 1150 | There are no implied relationships among the comparison operators. |
Raymond Hettinger | 943277e | 2003-07-17 14:47:12 +0000 | [diff] [blame] | 1151 | The truth of \code{\var{x}==\var{y}} does not imply that \code{\var{x}!=\var{y}} |
Raymond Hettinger | 4d6e8fe | 2003-07-16 19:40:23 +0000 | [diff] [blame] | 1152 | is false. Accordingly, when defining \method{__eq__}, one should also |
| 1153 | define \method{__ne__} so that the operators will behave as expected. |
| 1154 | |
Guido van Rossum | ab782dd | 2001-01-18 15:17:06 +0000 | [diff] [blame] | 1155 | There are no reflected (swapped-argument) versions of these methods |
| 1156 | (to be used when the left argument does not support the operation but |
| 1157 | the right argument does); rather, \method{__lt__()} and |
| 1158 | \method{__gt__()} are each other's reflection, \method{__le__()} and |
| 1159 | \method{__ge__()} are each other's reflection, and \method{__eq__()} |
| 1160 | and \method{__ne__()} are their own reflection. |
| 1161 | |
| 1162 | Arguments to rich comparison methods are never coerced. A rich |
| 1163 | comparison method may return \code{NotImplemented} if it does not |
| 1164 | implement the operation for a given pair of arguments. |
| 1165 | \end{methoddesc} |
| 1166 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1167 | \begin{methoddesc}[object]{__cmp__}{self, other} |
Guido van Rossum | ab782dd | 2001-01-18 15:17:06 +0000 | [diff] [blame] | 1168 | Called by comparison operations if rich comparison (see above) is not |
Fred Drake | 597bc1d | 2001-05-29 16:02:35 +0000 | [diff] [blame] | 1169 | defined. Should return a negative integer if \code{self < other}, |
| 1170 | zero if \code{self == other}, a positive integer if \code{self > |
| 1171 | other}. If no \method{__cmp__()}, \method{__eq__()} or |
| 1172 | \method{__ne__()} operation is defined, class instances are compared |
| 1173 | by object identity (``address''). See also the description of |
| 1174 | \method{__hash__()} for some important notes on creating objects which |
| 1175 | support custom comparison operations and are usable as dictionary |
| 1176 | keys. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1177 | (Note: the restriction that exceptions are not propagated by |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1178 | \method{__cmp__()} has been removed since Python 1.5.) |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1179 | \bifuncindex{cmp} |
| 1180 | \index{comparisons} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1181 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1182 | |
Fred Drake | e57a114 | 2000-06-15 20:07:25 +0000 | [diff] [blame] | 1183 | \begin{methoddesc}[object]{__rcmp__}{self, other} |
Fred Drake | 445f832 | 2001-01-04 15:11:48 +0000 | [diff] [blame] | 1184 | \versionchanged[No longer supported]{2.1} |
Fred Drake | e57a114 | 2000-06-15 20:07:25 +0000 | [diff] [blame] | 1185 | \end{methoddesc} |
| 1186 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1187 | \begin{methoddesc}[object]{__hash__}{self} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1188 | Called for the key object for dictionary\obindex{dictionary} |
| 1189 | operations, and by the built-in function |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1190 | \function{hash()}\bifuncindex{hash}. Should return a 32-bit integer |
| 1191 | usable as a hash value |
| 1192 | for dictionary operations. The only required property is that objects |
| 1193 | which compare equal have the same hash value; it is advised to somehow |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1194 | mix together (e.g., using exclusive or) the hash values for the |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1195 | components of the object that also play a part in comparison of |
| 1196 | objects. If a class does not define a \method{__cmp__()} method it should |
| 1197 | not define a \method{__hash__()} operation either; if it defines |
Fred Drake | 597bc1d | 2001-05-29 16:02:35 +0000 | [diff] [blame] | 1198 | \method{__cmp__()} or \method{__eq__()} but not \method{__hash__()}, |
| 1199 | its instances will not be usable as dictionary keys. If a class |
| 1200 | defines mutable objects and implements a \method{__cmp__()} or |
| 1201 | \method{__eq__()} method, it should not implement \method{__hash__()}, |
| 1202 | since the dictionary implementation requires that a key's hash value |
| 1203 | is immutable (if the object's hash value changes, it will be in the |
| 1204 | wrong hash bucket). |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1205 | \withsubitem{(object method)}{\ttindex{__cmp__()}} |
| 1206 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1207 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1208 | \begin{methoddesc}[object]{__nonzero__}{self} |
Guido van Rossum | 77f6a65 | 2002-04-03 22:41:51 +0000 | [diff] [blame] | 1209 | Called to implement truth value testing, and the built-in operation |
| 1210 | \code{bool()}; should return \code{False} or \code{True}, or their |
| 1211 | integer equivalents \code{0} or \code{1}. |
| 1212 | When this method is not defined, \method{__len__()} is |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1213 | called, if it is defined (see below). If a class defines neither |
| 1214 | \method{__len__()} nor \method{__nonzero__()}, all its instances are |
| 1215 | considered true. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1216 | \withsubitem{(mapping object method)}{\ttindex{__len__()}} |
| 1217 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1218 | |
Martin v. Löwis | 2a519f8 | 2002-04-11 12:39:35 +0000 | [diff] [blame] | 1219 | \begin{methoddesc}[object]{__unicode__}{self} |
| 1220 | Called to implement \function{unicode()}\bifuncindex{unicode} builtin; |
| 1221 | should return a Unicode object. When this method is not defined, string |
| 1222 | conversion is attempted, and the result of string conversion is converted |
| 1223 | to Unicode using the system default encoding. |
| 1224 | \end{methoddesc} |
| 1225 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1226 | |
Fred Drake | 61c7728 | 1998-07-28 19:34:22 +0000 | [diff] [blame] | 1227 | \subsection{Customizing attribute access\label{attribute-access}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1228 | |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1229 | The following methods can be defined to customize the meaning of |
| 1230 | attribute access (use of, assignment to, or deletion of \code{x.name}) |
| 1231 | for class instances. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1232 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1233 | \begin{methoddesc}[object]{__getattr__}{self, name} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1234 | Called when an attribute lookup has not found the attribute in the |
| 1235 | usual places (i.e. it is not an instance attribute nor is it found in |
| 1236 | the class tree for \code{self}). \code{name} is the attribute name. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1237 | This method should return the (computed) attribute value or raise an |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1238 | \exception{AttributeError} exception. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1239 | |
| 1240 | Note that if the attribute is found through the normal mechanism, |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1241 | \method{__getattr__()} is not called. (This is an intentional |
| 1242 | asymmetry between \method{__getattr__()} and \method{__setattr__()}.) |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1243 | This is done both for efficiency reasons and because otherwise |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1244 | \method{__setattr__()} would have no way to access other attributes of |
Guido van Rossum | d41eea0 | 2003-02-28 14:11:45 +0000 | [diff] [blame] | 1245 | the instance. Note that at least for instance variables, you can fake |
| 1246 | total control by not inserting any values in the instance attribute |
| 1247 | dictionary (but instead inserting them in another object). See the |
| 1248 | \method{__getattribute__()} method below for a way to actually get |
| 1249 | total control in new-style classes. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1250 | \withsubitem{(object method)}{\ttindex{__setattr__()}} |
| 1251 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1252 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1253 | \begin{methoddesc}[object]{__setattr__}{self, name, value} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1254 | Called when an attribute assignment is attempted. This is called |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1255 | instead of the normal mechanism (i.e.\ store the value in the instance |
| 1256 | dictionary). \var{name} is the attribute name, \var{value} is the |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1257 | value to be assigned to it. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1258 | |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1259 | If \method{__setattr__()} wants to assign to an instance attribute, it |
| 1260 | should not simply execute \samp{self.\var{name} = value} --- this |
| 1261 | would cause a recursive call to itself. Instead, it should insert the |
| 1262 | value in the dictionary of instance attributes, e.g., |
Guido van Rossum | d41eea0 | 2003-02-28 14:11:45 +0000 | [diff] [blame] | 1263 | \samp{self.__dict__[\var{name}] = value}. For new-style classes, |
| 1264 | rather than accessing the instance dictionary, it should call the base |
| 1265 | class method with the same name, for example, |
| 1266 | \samp{object.__setattr__(self, name, value)}. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1267 | \withsubitem{(instance attribute)}{\ttindex{__dict__}} |
| 1268 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1269 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1270 | \begin{methoddesc}[object]{__delattr__}{self, name} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1271 | Like \method{__setattr__()} but for attribute deletion instead of |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1272 | assignment. This should only be implemented if \samp{del |
| 1273 | obj.\var{name}} is meaningful for the object. |
| 1274 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1275 | |
Michael W. Hudson | 2ab1d08 | 2003-03-05 14:20:58 +0000 | [diff] [blame] | 1276 | \subsubsection{More attribute access for new-style classes \label{new-style-attribute-access}} |
Guido van Rossum | d41eea0 | 2003-02-28 14:11:45 +0000 | [diff] [blame] | 1277 | |
| 1278 | The following methods only apply to new-style classes. |
| 1279 | |
| 1280 | \begin{methoddesc}[object]{__getattribute__}{self, name} |
| 1281 | Called unconditionally to implement attribute accesses for instances |
| 1282 | of the class. If the class also defines \method{__getattr__}, it will |
| 1283 | never be called (unless called explicitly). |
| 1284 | This method should return the (computed) attribute |
| 1285 | value or raise an \exception{AttributeError} exception. |
| 1286 | In order to avoid infinite recursion in this method, its |
| 1287 | implementation should always call the base class method with the same |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 1288 | name to access any attributes it needs, for example, |
Guido van Rossum | d41eea0 | 2003-02-28 14:11:45 +0000 | [diff] [blame] | 1289 | \samp{object.__getattribute__(self, name)}. |
| 1290 | \end{methoddesc} |
| 1291 | |
Michael W. Hudson | 2ab1d08 | 2003-03-05 14:20:58 +0000 | [diff] [blame] | 1292 | \subsubsection{Implementing Descriptors \label{descriptors}} |
Guido van Rossum | d41eea0 | 2003-02-28 14:11:45 +0000 | [diff] [blame] | 1293 | |
| 1294 | The following methods only apply when an instance of the class |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 1295 | containing the method (a so-called \emph{descriptor} class) appears in |
Guido van Rossum | d41eea0 | 2003-02-28 14:11:45 +0000 | [diff] [blame] | 1296 | the class dictionary of another new-style class, known as the |
| 1297 | \emph{owner} class. In the examples below, ``the attribute'' refers to |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 1298 | the attribute whose name is the key of the property in the owner |
Fred Drake | 67a521e | 2004-05-06 12:44:29 +0000 | [diff] [blame^] | 1299 | class' \code{__dict__}. Descriptors can only be implemented as |
| 1300 | new-style classes themselves. |
Guido van Rossum | d41eea0 | 2003-02-28 14:11:45 +0000 | [diff] [blame] | 1301 | |
| 1302 | \begin{methoddesc}[object]{__get__}{self, instance, owner} |
Fred Drake | 62364ff | 2003-03-20 18:17:16 +0000 | [diff] [blame] | 1303 | Called to get the attribute of the owner class (class attribute access) |
Guido van Rossum | d41eea0 | 2003-02-28 14:11:45 +0000 | [diff] [blame] | 1304 | or of an instance of that class (instance attribute acces). |
| 1305 | \var{owner} is always the owner class, while \var{instance} is the |
| 1306 | instance that the attribute was accessed through, or \code{None} when |
| 1307 | the attribute is accessed through the \var{owner}. This method should |
| 1308 | return the (computed) attribute value or raise an |
| 1309 | \exception{AttributeError} exception. |
| 1310 | \end{methoddesc} |
| 1311 | |
| 1312 | \begin{methoddesc}[object]{__set__}{self, instance, value} |
Michael W. Hudson | 2ab1d08 | 2003-03-05 14:20:58 +0000 | [diff] [blame] | 1313 | Called to set the attribute on an instance \var{instance} of the owner |
Guido van Rossum | d41eea0 | 2003-02-28 14:11:45 +0000 | [diff] [blame] | 1314 | class to a new value, \var{value}. |
| 1315 | \end{methoddesc} |
| 1316 | |
| 1317 | \begin{methoddesc}[object]{__delete__}{self, instance} |
Michael W. Hudson | 2ab1d08 | 2003-03-05 14:20:58 +0000 | [diff] [blame] | 1318 | Called to delete the attribute on an instance \var{instance} of the |
| 1319 | owner class. |
Guido van Rossum | d41eea0 | 2003-02-28 14:11:45 +0000 | [diff] [blame] | 1320 | \end{methoddesc} |
| 1321 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1322 | |
Fred Drake | 4db3661 | 2003-06-26 03:11:20 +0000 | [diff] [blame] | 1323 | \subsubsection{Invoking Descriptors \label{descriptor-invocation}} |
Raymond Hettinger | 03ec6d5 | 2003-06-25 18:29:36 +0000 | [diff] [blame] | 1324 | |
| 1325 | In general, a descriptor is an object attribute with ``binding behavior'', |
| 1326 | one whose attribute access has been overridden by methods in the descriptor |
Raymond Hettinger | df9eff0 | 2003-06-27 06:57:56 +0000 | [diff] [blame] | 1327 | protocol: \method{__get__()}, \method{__set__()}, and \method{__delete__()}. |
Raymond Hettinger | 03ec6d5 | 2003-06-25 18:29:36 +0000 | [diff] [blame] | 1328 | If any of those methods are defined for an object, it is said to be a |
| 1329 | descriptor. |
| 1330 | |
| 1331 | The default behavior for attribute access is to get, set, or delete the |
| 1332 | attribute from an object's dictionary. For instance, \code{a.x} has a |
| 1333 | lookup chain starting with \code{a.__dict__['x']}, then |
| 1334 | \code{type(a).__dict__['x']}, and continuing |
| 1335 | through the base classes of \code{type(a)} excluding metaclasses. |
| 1336 | |
| 1337 | However, if the looked-up value is an object defining one of the descriptor |
| 1338 | methods, then Python may override the default behavior and invoke the |
| 1339 | descriptor method instead. Where this occurs in the precedence chain depends |
| 1340 | on which descriptor methods were defined and how they were called. Note that |
| 1341 | descriptors are only invoked for new style objects or classes |
Raymond Hettinger | df9eff0 | 2003-06-27 06:57:56 +0000 | [diff] [blame] | 1342 | (ones that subclass \class{object()} or \class{type()}). |
Raymond Hettinger | 03ec6d5 | 2003-06-25 18:29:36 +0000 | [diff] [blame] | 1343 | |
| 1344 | The starting point for descriptor invocation is a binding, \code{a.x}. |
| 1345 | How the arguments are assembled depends on \code{a}: |
| 1346 | |
| 1347 | \begin{itemize} |
| 1348 | |
| 1349 | \item[Direct Call] The simplest and least common call is when user code |
| 1350 | directly invokes a descriptor method: \code{x.__get__(a)}. |
| 1351 | |
| 1352 | \item[Instance Binding] If binding to a new-style object instance, |
| 1353 | \code{a.x} is transformed into the call: |
| 1354 | \code{type(a).__dict__['x'].__get__(a, type(a))}. |
| 1355 | |
| 1356 | \item[Class Binding] If binding to a new-style class, \code{A.x} |
| 1357 | is transformed into the call: \code{A.__dict__['x'].__get__(None, A)}. |
| 1358 | |
| 1359 | \item[Super Binding] If \code{a} is an instance of \class{super}, |
| 1360 | then the binding \code{super(B, obj).m()} searches |
| 1361 | \code{obj.__class__.__mro__} for the base class \code{A} immediately |
| 1362 | preceding \code{B} and then invokes the descriptor with the call: |
| 1363 | \code{A.__dict__['m'].__get__(obj, A)}. |
| 1364 | |
| 1365 | \end{itemize} |
| 1366 | |
| 1367 | For instance bindings, the precedence of descriptor invocation depends |
| 1368 | on the which descriptor methods are defined. Data descriptors define |
Raymond Hettinger | df9eff0 | 2003-06-27 06:57:56 +0000 | [diff] [blame] | 1369 | both \method{__get__()} and \method{__set__()}. Non-data descriptors have |
| 1370 | just the \method{__get__()} method. Data descriptors always override |
Raymond Hettinger | 03ec6d5 | 2003-06-25 18:29:36 +0000 | [diff] [blame] | 1371 | a redefinition in an instance dictionary. In contrast, non-data |
| 1372 | descriptors can be overridden by instances. |
| 1373 | |
Raymond Hettinger | df9eff0 | 2003-06-27 06:57:56 +0000 | [diff] [blame] | 1374 | Python methods (including \function{staticmethod()} and \function{classmethod()}) |
Raymond Hettinger | 03ec6d5 | 2003-06-25 18:29:36 +0000 | [diff] [blame] | 1375 | are implemented as non-data descriptors. Accordingly, instances can |
| 1376 | redefine and override methods. This allows individual instances to acquire |
| 1377 | behaviors that differ from other instances of the same class. |
| 1378 | |
Raymond Hettinger | df9eff0 | 2003-06-27 06:57:56 +0000 | [diff] [blame] | 1379 | The \function{property()} function is implemented as a data descriptor. |
| 1380 | Accordingly, instances cannot override the behavior of a property. |
| 1381 | |
| 1382 | |
| 1383 | \subsubsection{__slots__\label{slots}} |
| 1384 | |
| 1385 | By default, instances of both old and new-style classes have a dictionary |
| 1386 | for attribute storage. This wastes space for objects having very few instance |
| 1387 | variables. The space consumption can become acute when creating large numbers |
| 1388 | of instances. |
| 1389 | |
| 1390 | The default can be overridden by defining \var{__slots__} in a new-style class |
| 1391 | definition. The \var{__slots__} declaration takes a sequence of instance |
| 1392 | variables and reserves just enough space in each instance to hold a value |
| 1393 | for each variable. Space is saved because \var{__dict__} is not created for |
| 1394 | each instance. |
| 1395 | |
| 1396 | \begin{datadesc}{__slots__} |
| 1397 | This class variable can be assigned a string, iterable, or sequence of strings |
Raymond Hettinger | 5918f8d | 2003-06-29 04:53:23 +0000 | [diff] [blame] | 1398 | with variable names used by instances. If defined in a new-style class, |
| 1399 | \var{__slots__} reserves space for the declared variables |
Raymond Hettinger | df9eff0 | 2003-06-27 06:57:56 +0000 | [diff] [blame] | 1400 | and prevents the automatic creation of \var{__dict__} and \var{__weakref__} |
| 1401 | for each instance. |
| 1402 | \versionadded{2.2} |
| 1403 | \end{datadesc} |
| 1404 | |
| 1405 | \noindent |
Raymond Hettinger | 5918f8d | 2003-06-29 04:53:23 +0000 | [diff] [blame] | 1406 | Notes on using \var{__slots__} |
Raymond Hettinger | df9eff0 | 2003-06-27 06:57:56 +0000 | [diff] [blame] | 1407 | |
| 1408 | \begin{itemize} |
| 1409 | |
| 1410 | \item Without a \var{__dict__} variable, instances cannot be assigned new |
| 1411 | variables not listed in the \var{__slots__} definition. Attempts to assign |
| 1412 | to an unlisted variable name raises \exception{AttributeError}. If dynamic |
| 1413 | assignment of new variables is desired, then add \code{'__dict__'} to the |
| 1414 | sequence of strings in the \var{__slots__} declaration. |
| 1415 | \versionchanged[Previously, adding \code{'__dict__'} to the \var{__slots__} |
| 1416 | declaration would not enable the assignment of new attributes not |
| 1417 | specifically listed in the sequence of instance variable names]{2.3} |
| 1418 | |
| 1419 | \item Without a \var{__weakref__} variable for each instance, classes |
| 1420 | defining \var{__slots__} do not support weak references to its instances. |
| 1421 | If weak reference support is needed, then add \code{'__weakref__'} to the |
| 1422 | sequence of strings in the \var{__slots__} declaration. |
Raymond Hettinger | 5918f8d | 2003-06-29 04:53:23 +0000 | [diff] [blame] | 1423 | \versionchanged[Previously, adding \code{'__weakref__'} to the \var{__slots__} |
Raymond Hettinger | df9eff0 | 2003-06-27 06:57:56 +0000 | [diff] [blame] | 1424 | declaration would not enable support for weak references]{2.3} |
| 1425 | |
| 1426 | \item \var{__slots__} are implemented at the class level by creating |
| 1427 | descriptors (\ref{descriptors}) for each variable name. As a result, |
| 1428 | class attributes cannot be used to set default values for instance |
| 1429 | variables defined by \var{__slots__}; otherwise, the class attribute would |
| 1430 | overwrite the descriptor assignment. |
| 1431 | |
| 1432 | \item If a class defines a slot also defined in a base class, the instance |
| 1433 | variable defined by the base class slot is inaccessible (except by retrieving |
| 1434 | its descriptor directly from the base class). This renders the meaning of the |
| 1435 | program undefined. In the future, a check may be added to prevent this. |
| 1436 | |
| 1437 | \item The action of a \var{__slots__} declaration is limited to the class |
| 1438 | where it is defined. As a result, subclasses will have a \var{__dict__} |
| 1439 | unless they also define \var{__slots__}. |
| 1440 | |
| 1441 | \item \var{__slots__} do not work for classes derived from ``variable-length'' |
| 1442 | built-in types such as \class{long}, \class{str} and \class{tuple}. |
| 1443 | |
Raymond Hettinger | 5918f8d | 2003-06-29 04:53:23 +0000 | [diff] [blame] | 1444 | \item Any non-string iterable may be assigned to \var{__slots__}. |
Raymond Hettinger | df9eff0 | 2003-06-27 06:57:56 +0000 | [diff] [blame] | 1445 | Mappings may also be used; however, in the future, special meaning may |
| 1446 | be assigned to the values corresponding to each key. |
| 1447 | |
| 1448 | \end{itemize} |
| 1449 | |
| 1450 | |
| 1451 | \subsection{Customizing class creation\label{metaclasses}} |
| 1452 | |
| 1453 | By default, new-style classes are constructed using \function{type()}. |
| 1454 | A class definition is read into a separate namespace and the value |
| 1455 | of class name is bound to the result of \code{type(name, bases, dict)}. |
| 1456 | |
| 1457 | When the class definition is read, if \var{__metaclass__} is defined |
| 1458 | then the callable assigned to it will be called instead of \function{type()}. |
| 1459 | The allows classes or functions to be written which monitor or alter the class |
| 1460 | creation process: |
| 1461 | |
| 1462 | \begin{itemize} |
| 1463 | \item Modifying the class dictionary prior to the class being created. |
| 1464 | \item Returning an instance of another class -- essentially performing |
| 1465 | the role of a factory function. |
| 1466 | \end{itemize} |
| 1467 | |
| 1468 | \begin{datadesc}{__metaclass__} |
| 1469 | This variable can be any callable accepting arguments for \code{name}, |
| 1470 | \code{bases}, and \code{dict}. Upon class creation, the callable is |
| 1471 | used instead of the built-in \function{type()}. |
| 1472 | \versionadded{2.2} |
| 1473 | \end{datadesc} |
| 1474 | |
| 1475 | The appropriate metaclass is determined by the following precedence rules: |
| 1476 | |
| 1477 | \begin{itemize} |
| 1478 | |
| 1479 | \item If \code{dict['__metaclass__']} exists, it is used. |
| 1480 | |
| 1481 | \item Otherwise, if there is at least one base class, its metaclass is used |
| 1482 | (this looks for a \var{__class__} attribute first and if not found, uses its |
| 1483 | type). |
| 1484 | |
| 1485 | \item Otherwise, if a global variable named __metaclass__ exists, it is used. |
| 1486 | |
| 1487 | \item Otherwise, the old-style, classic metaclass (types.ClassType) is used. |
| 1488 | |
| 1489 | \end{itemize} |
| 1490 | |
| 1491 | The potential uses for metaclasses are boundless. Some ideas that have |
| 1492 | been explored including logging, interface checking, automatic delegation, |
| 1493 | automatic property creation, proxies, frameworks, and automatic resource |
| 1494 | locking/synchronization. |
Raymond Hettinger | 03ec6d5 | 2003-06-25 18:29:36 +0000 | [diff] [blame] | 1495 | |
| 1496 | |
Fred Drake | 61c7728 | 1998-07-28 19:34:22 +0000 | [diff] [blame] | 1497 | \subsection{Emulating callable objects\label{callable-types}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1498 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1499 | \begin{methoddesc}[object]{__call__}{self\optional{, args...}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1500 | Called when the instance is ``called'' as a function; if this method |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1501 | is defined, \code{\var{x}(arg1, arg2, ...)} is a shorthand for |
| 1502 | \code{\var{x}.__call__(arg1, arg2, ...)}. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1503 | \indexii{call}{instance} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1504 | \end{methoddesc} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1505 | |
| 1506 | |
Fred Drake | 73921b0 | 2001-10-01 16:32:13 +0000 | [diff] [blame] | 1507 | \subsection{Emulating container types\label{sequence-types}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1508 | |
Fred Drake | 73921b0 | 2001-10-01 16:32:13 +0000 | [diff] [blame] | 1509 | The following methods can be defined to implement container |
| 1510 | objects. Containers usually are sequences (such as lists or tuples) |
| 1511 | or mappings (like dictionaries), but can represent other containers as |
| 1512 | well. The first set of methods is used either to emulate a |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1513 | sequence or to emulate a mapping; the difference is that for a |
| 1514 | sequence, the allowable keys should be the integers \var{k} for which |
| 1515 | \code{0 <= \var{k} < \var{N}} where \var{N} is the length of the |
Thomas Wouters | 1d75a79 | 2000-08-17 22:37:32 +0000 | [diff] [blame] | 1516 | sequence, or slice objects, which define a range of items. (For backwards |
| 1517 | compatibility, the method \method{__getslice__()} (see below) can also be |
| 1518 | defined to handle simple, but not extended slices.) It is also recommended |
Fred Drake | a007382 | 2000-08-18 02:42:14 +0000 | [diff] [blame] | 1519 | that mappings provide the methods \method{keys()}, \method{values()}, |
Thomas Wouters | 1d75a79 | 2000-08-17 22:37:32 +0000 | [diff] [blame] | 1520 | \method{items()}, \method{has_key()}, \method{get()}, \method{clear()}, |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1521 | \method{setdefault()}, \method{iterkeys()}, \method{itervalues()}, |
Raymond Hettinger | f4ca5a2 | 2003-01-19 14:57:12 +0000 | [diff] [blame] | 1522 | \method{iteritems()}, \method{pop()}, \method{popitem()}, |
Thomas Wouters | 1d75a79 | 2000-08-17 22:37:32 +0000 | [diff] [blame] | 1523 | \method{copy()}, and \method{update()} behaving similar to those for |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1524 | Python's standard dictionary objects. The \module{UserDict} module |
| 1525 | provides a \class{DictMixin} class to help create those methods |
| 1526 | from a base set of \method{__getitem__()}, \method{__setitem__()}, |
| 1527 | \method{__delitem__()}, and \method{keys()}. |
| 1528 | Mutable sequences should provide |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1529 | methods \method{append()}, \method{count()}, \method{index()}, |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1530 | \method{extend()}, |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1531 | \method{insert()}, \method{pop()}, \method{remove()}, \method{reverse()} |
| 1532 | and \method{sort()}, like Python standard list objects. Finally, |
| 1533 | sequence types should implement addition (meaning concatenation) and |
| 1534 | multiplication (meaning repetition) by defining the methods |
Thomas Wouters | 12bba85 | 2000-08-24 20:06:04 +0000 | [diff] [blame] | 1535 | \method{__add__()}, \method{__radd__()}, \method{__iadd__()}, |
| 1536 | \method{__mul__()}, \method{__rmul__()} and \method{__imul__()} described |
| 1537 | below; they should not define \method{__coerce__()} or other numerical |
Guido van Rossum | 0dbb4fb | 2001-04-20 16:50:40 +0000 | [diff] [blame] | 1538 | operators. It is recommended that both mappings and sequences |
Fred Drake | 18d8d5a | 2001-09-18 17:58:20 +0000 | [diff] [blame] | 1539 | implement the \method{__contains__()} method to allow efficient use of |
| 1540 | the \code{in} operator; for mappings, \code{in} should be equivalent |
| 1541 | of \method{has_key()}; for sequences, it should search through the |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1542 | values. It is further recommended that both mappings and sequences |
| 1543 | implement the \method{__iter__()} method to allow efficient iteration |
| 1544 | through the container; for mappings, \method{__iter__()} should be |
| 1545 | the same as \method{iterkeys()}; for sequences, it should iterate |
| 1546 | through the values. |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 1547 | \withsubitem{(mapping object method)}{ |
| 1548 | \ttindex{keys()} |
| 1549 | \ttindex{values()} |
| 1550 | \ttindex{items()} |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1551 | \ttindex{iterkeys()} |
| 1552 | \ttindex{itervalues()} |
| 1553 | \ttindex{iteritems()} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 1554 | \ttindex{has_key()} |
| 1555 | \ttindex{get()} |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1556 | \ttindex{setdefault()} |
| 1557 | \ttindex{pop()} |
| 1558 | \ttindex{popitem()} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 1559 | \ttindex{clear()} |
| 1560 | \ttindex{copy()} |
Guido van Rossum | 0dbb4fb | 2001-04-20 16:50:40 +0000 | [diff] [blame] | 1561 | \ttindex{update()} |
| 1562 | \ttindex{__contains__()}} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 1563 | \withsubitem{(sequence object method)}{ |
| 1564 | \ttindex{append()} |
| 1565 | \ttindex{count()} |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1566 | \ttindex{extend()} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 1567 | \ttindex{index()} |
| 1568 | \ttindex{insert()} |
| 1569 | \ttindex{pop()} |
| 1570 | \ttindex{remove()} |
| 1571 | \ttindex{reverse()} |
| 1572 | \ttindex{sort()} |
| 1573 | \ttindex{__add__()} |
| 1574 | \ttindex{__radd__()} |
Thomas Wouters | 12bba85 | 2000-08-24 20:06:04 +0000 | [diff] [blame] | 1575 | \ttindex{__iadd__()} |
Fred Drake | 4856d01 | 1999-01-12 04:15:20 +0000 | [diff] [blame] | 1576 | \ttindex{__mul__()} |
Thomas Wouters | 12bba85 | 2000-08-24 20:06:04 +0000 | [diff] [blame] | 1577 | \ttindex{__rmul__()} |
Guido van Rossum | 0dbb4fb | 2001-04-20 16:50:40 +0000 | [diff] [blame] | 1578 | \ttindex{__imul__()} |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1579 | \ttindex{__contains__()} |
| 1580 | \ttindex{__iter__()}} |
Fred Drake | ae3e574 | 1999-01-28 23:21:49 +0000 | [diff] [blame] | 1581 | \withsubitem{(numeric object method)}{\ttindex{__coerce__()}} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1582 | |
Fred Drake | 73921b0 | 2001-10-01 16:32:13 +0000 | [diff] [blame] | 1583 | \begin{methoddesc}[container object]{__len__}{self} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1584 | Called to implement the built-in function |
| 1585 | \function{len()}\bifuncindex{len}. Should return the length of the |
| 1586 | object, an integer \code{>=} 0. Also, an object that doesn't define a |
| 1587 | \method{__nonzero__()} method and whose \method{__len__()} method |
| 1588 | returns zero is considered to be false in a Boolean context. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1589 | \withsubitem{(object method)}{\ttindex{__nonzero__()}} |
| 1590 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1591 | |
Fred Drake | 73921b0 | 2001-10-01 16:32:13 +0000 | [diff] [blame] | 1592 | \begin{methoddesc}[container object]{__getitem__}{self, key} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1593 | Called to implement evaluation of \code{\var{self}[\var{key}]}. |
Fred Drake | 31575ce | 2000-09-21 05:28:26 +0000 | [diff] [blame] | 1594 | For sequence types, the accepted keys should be integers and slice |
| 1595 | objects.\obindex{slice} Note that |
| 1596 | the special interpretation of negative indexes (if the class wishes to |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1597 | emulate a sequence type) is up to the \method{__getitem__()} method. |
Fred Drake | 91826ed | 2000-07-13 04:57:58 +0000 | [diff] [blame] | 1598 | If \var{key} is of an inappropriate type, \exception{TypeError} may be |
| 1599 | raised; if of a value outside the set of indexes for the sequence |
| 1600 | (after any special interpretation of negative values), |
| 1601 | \exception{IndexError} should be raised. |
Fred Drake | 0aa811c | 2001-10-20 04:24:09 +0000 | [diff] [blame] | 1602 | \note{\keyword{for} loops expect that an |
Fred Drake | 91826ed | 2000-07-13 04:57:58 +0000 | [diff] [blame] | 1603 | \exception{IndexError} will be raised for illegal indexes to allow |
Fred Drake | 0aa811c | 2001-10-20 04:24:09 +0000 | [diff] [blame] | 1604 | proper detection of the end of the sequence.} |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1605 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1606 | |
Fred Drake | 73921b0 | 2001-10-01 16:32:13 +0000 | [diff] [blame] | 1607 | \begin{methoddesc}[container object]{__setitem__}{self, key, value} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1608 | Called to implement assignment to \code{\var{self}[\var{key}]}. Same |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1609 | note as for \method{__getitem__()}. This should only be implemented |
| 1610 | for mappings if the objects support changes to the values for keys, or |
| 1611 | if new keys can be added, or for sequences if elements can be |
Fred Drake | 91826ed | 2000-07-13 04:57:58 +0000 | [diff] [blame] | 1612 | replaced. The same exceptions should be raised for improper |
| 1613 | \var{key} values as for the \method{__getitem__()} method. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1614 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1615 | |
Fred Drake | 73921b0 | 2001-10-01 16:32:13 +0000 | [diff] [blame] | 1616 | \begin{methoddesc}[container object]{__delitem__}{self, key} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1617 | Called to implement deletion of \code{\var{self}[\var{key}]}. Same |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1618 | note as for \method{__getitem__()}. This should only be implemented |
| 1619 | for mappings if the objects support removal of keys, or for sequences |
Fred Drake | 91826ed | 2000-07-13 04:57:58 +0000 | [diff] [blame] | 1620 | if elements can be removed from the sequence. The same exceptions |
| 1621 | should be raised for improper \var{key} values as for the |
| 1622 | \method{__getitem__()} method. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1623 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1624 | |
Fred Drake | 73921b0 | 2001-10-01 16:32:13 +0000 | [diff] [blame] | 1625 | \begin{methoddesc}[container object]{__iter__}{self} |
| 1626 | This method is called when an iterator is required for a container. |
| 1627 | This method should return a new iterator object that can iterate over |
| 1628 | all the objects in the container. For mappings, it should iterate |
| 1629 | over the keys of the container, and should also be made available as |
| 1630 | the method \method{iterkeys()}. |
| 1631 | |
| 1632 | Iterator objects also need to implement this method; they are required |
| 1633 | to return themselves. For more information on iterator objects, see |
| 1634 | ``\ulink{Iterator Types}{../lib/typeiter.html}'' in the |
| 1635 | \citetitle[../lib/lib.html]{Python Library Reference}. |
| 1636 | \end{methoddesc} |
| 1637 | |
| 1638 | The membership test operators (\keyword{in} and \keyword{not in}) are |
| 1639 | normally implemented as an iteration through a sequence. However, |
| 1640 | container objects can supply the following special method with a more |
| 1641 | efficient implementation, which also does not require the object be a |
| 1642 | sequence. |
| 1643 | |
| 1644 | \begin{methoddesc}[container object]{__contains__}{self, item} |
| 1645 | Called to implement membership test operators. Should return true if |
| 1646 | \var{item} is in \var{self}, false otherwise. For mapping objects, |
| 1647 | this should consider the keys of the mapping rather than the values or |
| 1648 | the key-item pairs. |
| 1649 | \end{methoddesc} |
| 1650 | |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1651 | |
Fred Drake | 3041b07 | 1998-10-21 00:25:32 +0000 | [diff] [blame] | 1652 | \subsection{Additional methods for emulation of sequence types |
Fred Drake | 61c7728 | 1998-07-28 19:34:22 +0000 | [diff] [blame] | 1653 | \label{sequence-methods}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1654 | |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1655 | The following optional methods can be defined to further emulate sequence |
| 1656 | objects. Immutable sequences methods should at most only define |
| 1657 | \method{__getslice__()}; mutable sequences might define all three |
Raymond Hettinger | 92016dc | 2003-09-22 15:27:11 +0000 | [diff] [blame] | 1658 | methods. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1659 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1660 | \begin{methoddesc}[sequence object]{__getslice__}{self, i, j} |
Fred Drake | a007382 | 2000-08-18 02:42:14 +0000 | [diff] [blame] | 1661 | \deprecated{2.0}{Support slice objects as parameters to the |
| 1662 | \method{__getitem__()} method.} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1663 | Called to implement evaluation of \code{\var{self}[\var{i}:\var{j}]}. |
| 1664 | The returned object should be of the same type as \var{self}. Note |
| 1665 | that missing \var{i} or \var{j} in the slice expression are replaced |
Fred Drake | e15956b | 2000-04-03 04:51:13 +0000 | [diff] [blame] | 1666 | by zero or \code{sys.maxint}, respectively. If negative indexes are |
| 1667 | used in the slice, the length of the sequence is added to that index. |
| 1668 | If the instance does not implement the \method{__len__()} method, an |
| 1669 | \exception{AttributeError} is raised. |
| 1670 | No guarantee is made that indexes adjusted this way are not still |
| 1671 | negative. Indexes which are greater than the length of the sequence |
| 1672 | are not modified. |
Fred Drake | a007382 | 2000-08-18 02:42:14 +0000 | [diff] [blame] | 1673 | If no \method{__getslice__()} is found, a slice |
Thomas Wouters | 1d75a79 | 2000-08-17 22:37:32 +0000 | [diff] [blame] | 1674 | object is created instead, and passed to \method{__getitem__()} instead. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1675 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1676 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1677 | \begin{methoddesc}[sequence object]{__setslice__}{self, i, j, sequence} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1678 | Called to implement assignment to \code{\var{self}[\var{i}:\var{j}]}. |
| 1679 | Same notes for \var{i} and \var{j} as for \method{__getslice__()}. |
Thomas Wouters | 1d75a79 | 2000-08-17 22:37:32 +0000 | [diff] [blame] | 1680 | |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1681 | This method is deprecated. If no \method{__setslice__()} is found, |
| 1682 | or for extended slicing of the form |
| 1683 | \code{\var{self}[\var{i}:\var{j}:\var{k}]}, a |
| 1684 | slice object is created, and passed to \method{__setitem__()}, |
| 1685 | instead of \method{__setslice__()} being called. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1686 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1687 | |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1688 | \begin{methoddesc}[sequence object]{__delslice__}{self, i, j} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1689 | Called to implement deletion of \code{\var{self}[\var{i}:\var{j}]}. |
| 1690 | Same notes for \var{i} and \var{j} as for \method{__getslice__()}. |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1691 | This method is deprecated. If no \method{__delslice__()} is found, |
| 1692 | or for extended slicing of the form |
| 1693 | \code{\var{self}[\var{i}:\var{j}:\var{k}]}, a |
| 1694 | slice object is created, and passed to \method{__delitem__()}, |
| 1695 | instead of \method{__delslice__()} being called. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1696 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1697 | |
Fred Drake | fb8ffe6 | 2001-04-13 15:54:41 +0000 | [diff] [blame] | 1698 | Notice that these methods are only invoked when a single slice with a |
| 1699 | single colon is used, and the slice method is available. For slice |
| 1700 | operations involving extended slice notation, or in absence of the |
| 1701 | slice methods, \method{__getitem__()}, \method{__setitem__()} or |
| 1702 | \method{__delitem__()} is called with a slice object as argument. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1703 | |
Fred Drake | f8925978 | 2000-09-21 22:27:16 +0000 | [diff] [blame] | 1704 | The following example demonstrate how to make your program or module |
| 1705 | compatible with earlier versions of Python (assuming that methods |
| 1706 | \method{__getitem__()}, \method{__setitem__()} and \method{__delitem__()} |
| 1707 | support slice objects as arguments): |
| 1708 | |
| 1709 | \begin{verbatim} |
| 1710 | class MyClass: |
| 1711 | ... |
| 1712 | def __getitem__(self, index): |
| 1713 | ... |
| 1714 | def __setitem__(self, index, value): |
| 1715 | ... |
| 1716 | def __delitem__(self, index): |
| 1717 | ... |
| 1718 | |
| 1719 | if sys.version_info < (2, 0): |
| 1720 | # They won't be defined if version is at least 2.0 final |
| 1721 | |
| 1722 | def __getslice__(self, i, j): |
| 1723 | return self[max(0, i):max(0, j):] |
| 1724 | def __setslice__(self, i, j, seq): |
| 1725 | self[max(0, i):max(0, j):] = seq |
| 1726 | def __delslice__(self, i, j): |
| 1727 | del self[max(0, i):max(0, j):] |
| 1728 | ... |
| 1729 | \end{verbatim} |
| 1730 | |
Raymond Hettinger | e701dcb | 2003-01-19 13:08:18 +0000 | [diff] [blame] | 1731 | Note the calls to \function{max()}; these are necessary because of |
| 1732 | the handling of negative indices before the |
Fred Drake | f8925978 | 2000-09-21 22:27:16 +0000 | [diff] [blame] | 1733 | \method{__*slice__()} methods are called. When negative indexes are |
| 1734 | used, the \method{__*item__()} methods receive them as provided, but |
| 1735 | the \method{__*slice__()} methods get a ``cooked'' form of the index |
| 1736 | values. For each negative index value, the length of the sequence is |
| 1737 | added to the index before calling the method (which may still result |
| 1738 | in a negative index); this is the customary handling of negative |
| 1739 | indexes by the built-in sequence types, and the \method{__*item__()} |
| 1740 | methods are expected to do this as well. However, since they should |
| 1741 | already be doing that, negative indexes cannot be passed in; they must |
Raymond Hettinger | e41d4c8 | 2003-08-25 04:39:55 +0000 | [diff] [blame] | 1742 | be constrained to the bounds of the sequence before being passed to |
Fred Drake | f8925978 | 2000-09-21 22:27:16 +0000 | [diff] [blame] | 1743 | the \method{__*item__()} methods. |
| 1744 | Calling \code{max(0, i)} conveniently returns the proper value. |
| 1745 | |
Fred Drake | 15988fd | 1999-02-12 18:14:57 +0000 | [diff] [blame] | 1746 | |
Fred Drake | 61c7728 | 1998-07-28 19:34:22 +0000 | [diff] [blame] | 1747 | \subsection{Emulating numeric types\label{numeric-types}} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1748 | |
| 1749 | The following methods can be defined to emulate numeric objects. |
| 1750 | Methods corresponding to operations that are not supported by the |
| 1751 | particular kind of number implemented (e.g., bitwise operations for |
| 1752 | non-integral numbers) should be left undefined. |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1753 | |
Fred Drake | b894370 | 1999-05-10 13:43:22 +0000 | [diff] [blame] | 1754 | \begin{methoddesc}[numeric object]{__add__}{self, other} |
| 1755 | \methodline[numeric object]{__sub__}{self, other} |
| 1756 | \methodline[numeric object]{__mul__}{self, other} |
Fred Drake | 3e2aca4 | 2001-08-14 20:28:08 +0000 | [diff] [blame] | 1757 | \methodline[numeric object]{__floordiv__}{self, other} |
Fred Drake | b894370 | 1999-05-10 13:43:22 +0000 | [diff] [blame] | 1758 | \methodline[numeric object]{__mod__}{self, other} |
| 1759 | \methodline[numeric object]{__divmod__}{self, other} |
| 1760 | \methodline[numeric object]{__pow__}{self, other\optional{, modulo}} |
| 1761 | \methodline[numeric object]{__lshift__}{self, other} |
| 1762 | \methodline[numeric object]{__rshift__}{self, other} |
| 1763 | \methodline[numeric object]{__and__}{self, other} |
| 1764 | \methodline[numeric object]{__xor__}{self, other} |
| 1765 | \methodline[numeric object]{__or__}{self, other} |
Fred Drake | 3e2aca4 | 2001-08-14 20:28:08 +0000 | [diff] [blame] | 1766 | These methods are |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1767 | called to implement the binary arithmetic operations (\code{+}, |
Fred Drake | 3e2aca4 | 2001-08-14 20:28:08 +0000 | [diff] [blame] | 1768 | \code{-}, \code{*}, \code{//}, \code{\%}, |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1769 | \function{divmod()}\bifuncindex{divmod}, |
Fred Drake | fb8ffe6 | 2001-04-13 15:54:41 +0000 | [diff] [blame] | 1770 | \function{pow()}\bifuncindex{pow}, \code{**}, \code{<}\code{<}, |
| 1771 | \code{>}\code{>}, \code{\&}, \code{\^}, \code{|}). For instance, to |
| 1772 | evaluate the expression \var{x}\code{+}\var{y}, where \var{x} is an |
| 1773 | instance of a class that has an \method{__add__()} method, |
Fred Drake | 3e2aca4 | 2001-08-14 20:28:08 +0000 | [diff] [blame] | 1774 | \code{\var{x}.__add__(\var{y})} is called. The \method{__divmod__()} |
| 1775 | method should be the equivalent to using \method{__floordiv__()} and |
| 1776 | \method{__mod__()}; it should not be related to \method{__truediv__()} |
| 1777 | (described below). Note that |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1778 | \method{__pow__()} should be defined to accept an optional third |
| 1779 | argument if the ternary version of the built-in |
| 1780 | \function{pow()}\bifuncindex{pow} function is to be supported. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1781 | \end{methoddesc} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1782 | |
Fred Drake | 3e2aca4 | 2001-08-14 20:28:08 +0000 | [diff] [blame] | 1783 | \begin{methoddesc}[numeric object]{__div__}{self, other} |
| 1784 | \methodline[numeric object]{__truediv__}{self, other} |
| 1785 | The division operator (\code{/}) is implemented by these methods. The |
| 1786 | \method{__truediv__()} method is used when \code{__future__.division} |
| 1787 | is in effect, otherwise \method{__div__()} is used. If only one of |
| 1788 | these two methods is defined, the object will not support division in |
| 1789 | the alternate context; \exception{TypeError} will be raised instead. |
| 1790 | \end{methoddesc} |
| 1791 | |
Fred Drake | b894370 | 1999-05-10 13:43:22 +0000 | [diff] [blame] | 1792 | \begin{methoddesc}[numeric object]{__radd__}{self, other} |
| 1793 | \methodline[numeric object]{__rsub__}{self, other} |
| 1794 | \methodline[numeric object]{__rmul__}{self, other} |
| 1795 | \methodline[numeric object]{__rdiv__}{self, other} |
Raymond Hettinger | 10cbe8d | 2002-06-20 06:12:37 +0000 | [diff] [blame] | 1796 | \methodline[numeric object]{__rtruediv__}{self, other} |
| 1797 | \methodline[numeric object]{__rfloordiv__}{self, other} |
Fred Drake | b894370 | 1999-05-10 13:43:22 +0000 | [diff] [blame] | 1798 | \methodline[numeric object]{__rmod__}{self, other} |
| 1799 | \methodline[numeric object]{__rdivmod__}{self, other} |
| 1800 | \methodline[numeric object]{__rpow__}{self, other} |
| 1801 | \methodline[numeric object]{__rlshift__}{self, other} |
| 1802 | \methodline[numeric object]{__rrshift__}{self, other} |
| 1803 | \methodline[numeric object]{__rand__}{self, other} |
| 1804 | \methodline[numeric object]{__rxor__}{self, other} |
| 1805 | \methodline[numeric object]{__ror__}{self, other} |
Fred Drake | 3e2aca4 | 2001-08-14 20:28:08 +0000 | [diff] [blame] | 1806 | These methods are |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1807 | called to implement the binary arithmetic operations (\code{+}, |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1808 | \code{-}, \code{*}, \code{/}, \code{\%}, |
| 1809 | \function{divmod()}\bifuncindex{divmod}, |
Fred Drake | fb8ffe6 | 2001-04-13 15:54:41 +0000 | [diff] [blame] | 1810 | \function{pow()}\bifuncindex{pow}, \code{**}, \code{<}\code{<}, |
| 1811 | \code{>}\code{>}, \code{\&}, \code{\^}, \code{|}) with reflected |
| 1812 | (swapped) operands. These functions are only called if the left |
| 1813 | operand does not support the corresponding operation. For instance, |
| 1814 | to evaluate the expression \var{x}\code{-}\var{y}, where \var{y} is an |
| 1815 | instance of a class that has an \method{__rsub__()} method, |
| 1816 | \code{\var{y}.__rsub__(\var{x})} is called. Note that ternary |
| 1817 | \function{pow()}\bifuncindex{pow} will not try calling |
| 1818 | \method{__rpow__()} (the coercion rules would become too |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1819 | complicated). |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1820 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1821 | |
Thomas Wouters | dc90cc2 | 2000-12-11 23:11:51 +0000 | [diff] [blame] | 1822 | \begin{methoddesc}[numeric object]{__iadd__}{self, other} |
| 1823 | \methodline[numeric object]{__isub__}{self, other} |
| 1824 | \methodline[numeric object]{__imul__}{self, other} |
| 1825 | \methodline[numeric object]{__idiv__}{self, other} |
Raymond Hettinger | 10cbe8d | 2002-06-20 06:12:37 +0000 | [diff] [blame] | 1826 | \methodline[numeric object]{__itruediv__}{self, other} |
| 1827 | \methodline[numeric object]{__ifloordiv__}{self, other} |
| 1828 | \methodline[numeric object]{__imod__}{self, other} |
Thomas Wouters | dc90cc2 | 2000-12-11 23:11:51 +0000 | [diff] [blame] | 1829 | \methodline[numeric object]{__ipow__}{self, other\optional{, modulo}} |
| 1830 | \methodline[numeric object]{__ilshift__}{self, other} |
| 1831 | \methodline[numeric object]{__irshift__}{self, other} |
| 1832 | \methodline[numeric object]{__iand__}{self, other} |
| 1833 | \methodline[numeric object]{__ixor__}{self, other} |
| 1834 | \methodline[numeric object]{__ior__}{self, other} |
Fred Drake | fb8ffe6 | 2001-04-13 15:54:41 +0000 | [diff] [blame] | 1835 | These methods are called to implement the augmented arithmetic |
| 1836 | operations (\code{+=}, \code{-=}, \code{*=}, \code{/=}, \code{\%=}, |
| 1837 | \code{**=}, \code{<}\code{<=}, \code{>}\code{>=}, \code{\&=}, |
Fred Drake | a378864 | 2003-07-23 15:18:03 +0000 | [diff] [blame] | 1838 | \code{\textasciicircum=}, \code{|=}). These methods should attempt to do the |
Fred Drake | fb8ffe6 | 2001-04-13 15:54:41 +0000 | [diff] [blame] | 1839 | operation in-place (modifying \var{self}) and return the result (which |
| 1840 | could be, but does not have to be, \var{self}). If a specific method |
| 1841 | is not defined, the augmented operation falls back to the normal |
| 1842 | methods. For instance, to evaluate the expression |
| 1843 | \var{x}\code{+=}\var{y}, where \var{x} is an instance of a class that |
| 1844 | has an \method{__iadd__()} method, \code{\var{x}.__iadd__(\var{y})} is |
| 1845 | called. If \var{x} is an instance of a class that does not define a |
| 1846 | \method{__iadd()} method, \code{\var{x}.__add__(\var{y})} and |
| 1847 | \code{\var{y}.__radd__(\var{x})} are considered, as with the |
| 1848 | evaluation of \var{x}\code{+}\var{y}. |
Thomas Wouters | dc90cc2 | 2000-12-11 23:11:51 +0000 | [diff] [blame] | 1849 | \end{methoddesc} |
| 1850 | |
Fred Drake | b894370 | 1999-05-10 13:43:22 +0000 | [diff] [blame] | 1851 | \begin{methoddesc}[numeric object]{__neg__}{self} |
| 1852 | \methodline[numeric object]{__pos__}{self} |
| 1853 | \methodline[numeric object]{__abs__}{self} |
| 1854 | \methodline[numeric object]{__invert__}{self} |
Fred Drake | fb8ffe6 | 2001-04-13 15:54:41 +0000 | [diff] [blame] | 1855 | Called to implement the unary arithmetic operations (\code{-}, |
| 1856 | \code{+}, \function{abs()}\bifuncindex{abs} and \code{\~{}}). |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1857 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1858 | |
Fred Drake | b894370 | 1999-05-10 13:43:22 +0000 | [diff] [blame] | 1859 | \begin{methoddesc}[numeric object]{__complex__}{self} |
| 1860 | \methodline[numeric object]{__int__}{self} |
| 1861 | \methodline[numeric object]{__long__}{self} |
| 1862 | \methodline[numeric object]{__float__}{self} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1863 | Called to implement the built-in functions |
Fred Drake | 15988fd | 1999-02-12 18:14:57 +0000 | [diff] [blame] | 1864 | \function{complex()}\bifuncindex{complex}, |
| 1865 | \function{int()}\bifuncindex{int}, \function{long()}\bifuncindex{long}, |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1866 | and \function{float()}\bifuncindex{float}. Should return a value of |
| 1867 | the appropriate type. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1868 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1869 | |
Fred Drake | b894370 | 1999-05-10 13:43:22 +0000 | [diff] [blame] | 1870 | \begin{methoddesc}[numeric object]{__oct__}{self} |
| 1871 | \methodline[numeric object]{__hex__}{self} |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1872 | Called to implement the built-in functions |
| 1873 | \function{oct()}\bifuncindex{oct} and |
| 1874 | \function{hex()}\bifuncindex{hex}. Should return a string value. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1875 | \end{methoddesc} |
Fred Drake | f666917 | 1998-05-06 19:52:49 +0000 | [diff] [blame] | 1876 | |
Fred Drake | b894370 | 1999-05-10 13:43:22 +0000 | [diff] [blame] | 1877 | \begin{methoddesc}[numeric object]{__coerce__}{self, other} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1878 | Called to implement ``mixed-mode'' numeric arithmetic. Should either |
Fred Drake | d82575d | 1998-08-28 20:03:12 +0000 | [diff] [blame] | 1879 | return a 2-tuple containing \var{self} and \var{other} converted to |
Fred Drake | b894370 | 1999-05-10 13:43:22 +0000 | [diff] [blame] | 1880 | a common numeric type, or \code{None} if conversion is impossible. When |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1881 | the common type would be the type of \code{other}, it is sufficient to |
| 1882 | return \code{None}, since the interpreter will also ask the other |
| 1883 | object to attempt a coercion (but sometimes, if the implementation of |
| 1884 | the other type cannot be changed, it is useful to do the conversion to |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1885 | the other type here). A return value of \code{NotImplemented} is |
| 1886 | equivalent to returning \code{None}. |
Fred Drake | 1e42d8a | 1998-11-25 17:58:50 +0000 | [diff] [blame] | 1887 | \end{methoddesc} |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1888 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1889 | \subsection{Coercion rules\label{coercion-rules}} |
| 1890 | |
| 1891 | This section used to document the rules for coercion. As the language |
| 1892 | has evolved, the coercion rules have become hard to document |
| 1893 | precisely; documenting what one version of one particular |
| 1894 | implementation does is undesirable. Instead, here are some informal |
| 1895 | guidelines regarding coercion. In Python 3.0, coercion will not be |
| 1896 | supported. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1897 | |
| 1898 | \begin{itemize} |
| 1899 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1900 | \item |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1901 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1902 | If the left operand of a \% operator is a string or Unicode object, no |
| 1903 | coercion takes place and the string formatting operation is invoked |
| 1904 | instead. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1905 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1906 | \item |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1907 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1908 | It is no longer recommended to define a coercion operation. |
| 1909 | Mixed-mode operations on types that don't define coercion pass the |
| 1910 | original arguments to the operation. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1911 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1912 | \item |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1913 | |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 1914 | New-style classes (those derived from \class{object}) never invoke the |
| 1915 | \method{__coerce__()} method in response to a binary operator; the only |
| 1916 | time \method{__coerce__()} is invoked is when the built-in function |
| 1917 | \function{coerce()} is called. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1918 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1919 | \item |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1920 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1921 | For most intents and purposes, an operator that returns |
| 1922 | \code{NotImplemented} is treated the same as one that is not |
| 1923 | implemented at all. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1924 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1925 | \item |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1926 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1927 | Below, \method{__op__()} and \method{__rop__()} are used to signify |
| 1928 | the generic method names corresponding to an operator; |
| 1929 | \method{__iop__} is used for the corresponding in-place operator. For |
| 1930 | example, for the operator `\code{+}', \method{__add__()} and |
| 1931 | \method{__radd__()} are used for the left and right variant of the |
| 1932 | binary operator, and \method{__iadd__} for the in-place variant. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1933 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1934 | \item |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1935 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1936 | For objects \var{x} and \var{y}, first \code{\var{x}.__op__(\var{y})} |
| 1937 | is tried. If this is not implemented or returns \code{NotImplemented}, |
| 1938 | \code{\var{y}.__rop__(\var{x})} is tried. If this is also not |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 1939 | implemented or returns \code{NotImplemented}, a \exception{TypeError} |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1940 | exception is raised. But see the following exception: |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1941 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1942 | \item |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1943 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1944 | Exception to the previous item: if the left operand is an instance of |
| 1945 | a built-in type or a new-style class, and the right operand is an |
| 1946 | instance of a proper subclass of that type or class, the right |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 1947 | operand's \method{__rop__()} method is tried \emph{before} the left |
| 1948 | operand's \method{__op__()} method. This is done so that a subclass can |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1949 | completely override binary operators. Otherwise, the left operand's |
| 1950 | __op__ method would always accept the right operand: when an instance |
| 1951 | of a given class is expected, an instance of a subclass of that class |
| 1952 | is always acceptable. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1953 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1954 | \item |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1955 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1956 | When either operand type defines a coercion, this coercion is called |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 1957 | before that type's \method{__op__()} or \method{__rop__()} method is |
| 1958 | called, but no sooner. If the coercion returns an object of a |
| 1959 | different type for the operand whose coercion is invoked, part of the |
| 1960 | process is redone using the new object. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1961 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1962 | \item |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1963 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1964 | When an in-place operator (like `\code{+=}') is used, if the left |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 1965 | operand implements \method{__iop__()}, it is invoked without any |
| 1966 | coercion. When the operation falls back to \method{__op__()} and/or |
| 1967 | \method{__rop__()}, the normal coercion rules apply. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1968 | |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1969 | \item |
| 1970 | |
| 1971 | In \var{x}\code{+}\var{y}, if \var{x} is a sequence that implements |
| 1972 | sequence concatenation, sequence concatenation is invoked. |
| 1973 | |
| 1974 | \item |
| 1975 | |
| 1976 | In \var{x}\code{*}\var{y}, if one operator is a sequence that |
| 1977 | implements sequence repetition, and the other is an integer |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 1978 | (\class{int} or \class{long}), sequence repetition is invoked. |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1979 | |
| 1980 | \item |
| 1981 | |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 1982 | Rich comparisons (implemented by methods \method{__eq__()} and so on) |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1983 | never use coercion. Three-way comparison (implemented by |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 1984 | \method{__cmp__()}) does use coercion under the same conditions as |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1985 | other binary operations use it. |
| 1986 | |
| 1987 | \item |
| 1988 | |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 1989 | In the current implementation, the built-in numeric types \class{int}, |
| 1990 | \class{long} and \class{float} do not use coercion; the type |
| 1991 | \class{complex} however does use it. The difference can become |
Guido van Rossum | 92cf95f | 2002-06-03 19:06:41 +0000 | [diff] [blame] | 1992 | apparent when subclassing these types. Over time, the type |
Fred Drake | 293dd4b | 2002-06-04 16:25:57 +0000 | [diff] [blame] | 1993 | \class{complex} may be fixed to avoid coercion. All these types |
| 1994 | implement a \method{__coerce__()} method, for use by the built-in |
| 1995 | \function{coerce()} function. |
Guido van Rossum | 83b2f8a | 1998-07-23 17:12:46 +0000 | [diff] [blame] | 1996 | |
| 1997 | \end{itemize} |