blob: 5a3fa71e301a131fd338eff73a50dee6409ee73d [file] [log] [blame]
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001\documentclass{howto}
2\usepackage{distutils}
3% $Id$
4
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +00005% Don't write extensive text for new sections; I'll do that.
6% Feel free to add commented-out reminders of things that need
7% to be covered. --amk
8
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +00009% XXX pydoc can display links to module docs -- but when?
10%
11
Fred Drakeed0fa3d2003-07-30 19:14:09 +000012\title{What's New in Python 2.4}
Andrew M. Kuchlingba59be02004-08-06 18:55:48 +000013\release{0.3}
Fred Drakeed0fa3d2003-07-30 19:14:09 +000014\author{A.M.\ Kuchling}
Fred Drakeb914ef02004-01-02 06:57:50 +000015\authoraddress{
16 \strong{Python Software Foundation}\\
17 Email: \email{amk@amk.ca}
18}
Fred Drakeed0fa3d2003-07-30 19:14:09 +000019
20\begin{document}
21\maketitle
22\tableofcontents
23
Andrew M. Kuchling89ba1ff2004-07-14 21:56:19 +000024This article explains the new features in Python 2.4 alpha2, scheduled
25for release in late July 2004. The final version of Python 2.4 is
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +000026expected to be released around September 2004.
Fred Drakeed0fa3d2003-07-30 19:14:09 +000027
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +000028Python 2.4 is a medium-sized release. It doesn't introduce as many
Andrew M. Kuchling3b790912004-07-04 16:39:40 +000029changes as the radical Python 2.2, but introduces more features than
30the conservative 2.3 release did. The most significant new language
31feature (as of this writing) is the addition of generator expressions;
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +000032most other changes are to the standard library.
Fred Drakeed0fa3d2003-07-30 19:14:09 +000033
34This article doesn't attempt to provide a complete specification of
Andrew M. Kuchling3b790912004-07-04 16:39:40 +000035every single new feature, but instead provides a convenient overview.
36For full details, you should refer to the documentation for Python
372.4, such as the \citetitle[../lib/lib.html]{Python Library Reference}
38and the \citetitle[../ref/ref.html]{Python Reference Manual}. If you
39want to understand the complete implementation and design rationale,
40refer to the PEP for a particular new feature or to the module
41documentation.
Fred Drakeed0fa3d2003-07-30 19:14:09 +000042
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +000043
Raymond Hettinger7e0282f2003-11-24 07:14:54 +000044%======================================================================
45\section{PEP 218: Built-In Set Objects}
46
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +000047Python 2.3 introduced the \module{sets} module. C implementations of
48set data types have now been added to the Python core as two new
49built-in types, \function{set(\var{iterable})} and
50\function{frozenset(\var{iterable})}. They provide high speed
51operations for membership testing, for eliminating duplicates from
52sequences, and for mathematical operations like unions, intersections,
53differences, and symmetric differences.
Raymond Hettinger7e0282f2003-11-24 07:14:54 +000054
55\begin{verbatim}
56>>> a = set('abracadabra') # form a set from a string
57>>> 'z' in a # fast membership testing
58False
59>>> a # unique letters in a
60set(['a', 'r', 'b', 'c', 'd'])
61>>> ''.join(a) # convert back into a string
62'arbcd'
Raymond Hettingerd4462302003-11-26 17:52:45 +000063
Raymond Hettinger7e0282f2003-11-24 07:14:54 +000064>>> b = set('alacazam') # form a second set
65>>> a - b # letters in a but not in b
66set(['r', 'd', 'b'])
67>>> a | b # letters in either a or b
68set(['a', 'c', 'r', 'd', 'b', 'm', 'z', 'l'])
69>>> a & b # letters in both a and b
70set(['a', 'c'])
71>>> a ^ b # letters in a or b but not both
72set(['r', 'd', 'b', 'm', 'z', 'l'])
Raymond Hettingerd4462302003-11-26 17:52:45 +000073
Raymond Hettinger7e0282f2003-11-24 07:14:54 +000074>>> a.add('z') # add a new element
75>>> a.update('wxy') # add multiple new elements
76>>> a
77set(['a', 'c', 'b', 'd', 'r', 'w', 'y', 'x', 'z'])
78>>> a.remove('x') # take one element out
79>>> a
80set(['a', 'c', 'b', 'd', 'r', 'w', 'y', 'z'])
81\end{verbatim}
82
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +000083The \function{frozenset} type is an immutable version of \function{set}.
Raymond Hettinger7e0282f2003-11-24 07:14:54 +000084Since it is immutable and hashable, it may be used as a dictionary key or
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +000085as a member of another set.
Raymond Hettinger7e0282f2003-11-24 07:14:54 +000086
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +000087The \module{sets} module remains in the standard library, and may be
88useful if you wish to subclass the \class{Set} or \class{ImmutableSet}
89classes. There are currently no plans to deprecate the module.
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +000090
Raymond Hettinger7e0282f2003-11-24 07:14:54 +000091\begin{seealso}
92\seepep{218}{Adding a Built-In Set Object Type}{Originally proposed by
93Greg Wilson and ultimately implemented by Raymond Hettinger.}
94\end{seealso}
Fred Drakeed0fa3d2003-07-30 19:14:09 +000095
96%======================================================================
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +000097\section{PEP 237: Unifying Long Integers and Integers}
98
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +000099The lengthy transition process for this PEP, begun in Python 2.2,
Andrew M. Kuchlingd4be86c2004-07-04 01:44:04 +0000100takes another step forward in Python 2.4. In 2.3, certain integer
101operations that would behave differently after int/long unification
102triggered \exception{FutureWarning} warnings and returned values
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000103limited to 32 or 64 bits (depending on your platform). In 2.4, these
104expressions no longer produce a warning and instead produce a
105different result that's usually a long integer.
Andrew M. Kuchlingd4be86c2004-07-04 01:44:04 +0000106
107The problematic expressions are primarily left shifts and lengthy
Raymond Hettingerca1a7752004-07-12 13:00:45 +0000108hexadecimal and octal constants. For example,
109\code{2 \textless{}\textless{} 32} results
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000110in a warning in 2.3, evaluating to 0 on 32-bit platforms. In Python
1112.4, this expression now returns the correct answer, 8589934592.
Andrew M. Kuchlingd4be86c2004-07-04 01:44:04 +0000112
113\begin{seealso}
114\seepep{237}{Unifying Long Integers and Integers}{Original PEP
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000115written by Moshe Zadka and GvR. The changes for 2.4 were implemented by
Andrew M. Kuchlingd4be86c2004-07-04 01:44:04 +0000116Kalle Svensson.}
117\end{seealso}
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +0000118
119%======================================================================
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000120\section{PEP 289: Generator Expressions}
Raymond Hettinger354433a2004-05-19 08:20:33 +0000121
Andrew M. Kuchling38dc2a62004-08-07 13:24:12 +0000122The iterator feature introduced in Python 2.2 and the
123\module{itertools} module make it easier to write programs that loop
124through large data sets without having the entire data set in memory
125at one time. List comprehensions don't fit into this picture very
126well because they produce a Python list object containing all of the
127items, unavoidably pulling them all into memory. When trying to write
128a functionally-styled program, it would be natural to write something
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000129like:
Raymond Hettinger354433a2004-05-19 08:20:33 +0000130
131\begin{verbatim}
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000132links = [link for link in get_all_links() if not link.followed]
133for link in links:
134 ...
Raymond Hettinger354433a2004-05-19 08:20:33 +0000135\end{verbatim}
136
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000137instead of
Raymond Hettinger354433a2004-05-19 08:20:33 +0000138
139\begin{verbatim}
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000140for link in get_all_links():
141 if link.followed:
142 continue
143 ...
144\end{verbatim}
Raymond Hettinger354433a2004-05-19 08:20:33 +0000145
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000146The first form is more concise and perhaps more readable, but if
147you're dealing with a large number of link objects the second form
Andrew M. Kuchling38dc2a62004-08-07 13:24:12 +0000148would have to be used to avoid having all link objects in memory at
149the same time.
Raymond Hettinger354433a2004-05-19 08:20:33 +0000150
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000151Generator expressions work similarly to list comprehensions but don't
152materialize the entire list; instead they create a generator that will
153return elements one by one. The above example could be written as:
Raymond Hettinger354433a2004-05-19 08:20:33 +0000154
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000155\begin{verbatim}
156links = (link for link in get_all_links() if not link.followed)
157for link in links:
158 ...
159\end{verbatim}
Raymond Hettinger170a6222004-05-19 19:45:19 +0000160
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000161Generator expressions always have to be written inside parentheses, as
162in the above example. The parentheses signalling a function call also
163count, so if you want to create a iterator that will be immediately
164passed to a function you could write:
Raymond Hettinger170a6222004-05-19 19:45:19 +0000165
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000166\begin{verbatim}
167print sum(obj.count for obj in list_all_objects())
168\end{verbatim}
Raymond Hettinger170a6222004-05-19 19:45:19 +0000169
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000170Generator expressions differ from list comprehensions in various small
171ways. Most notably, the loop variable (\var{obj} in the above
172example) is not accessible outside of the generator expression. List
173comprehensions leave the variable assigned to its last value; future
174versions of Python will change this, making list comprehensions match
175generator expressions in this respect.
Raymond Hettinger354433a2004-05-19 08:20:33 +0000176
177\begin{seealso}
178\seepep{289}{Generator Expressions}{Proposed by Raymond Hettinger and
179implemented by Jiwon Seo with early efforts steered by Hye-Shik Chang.}
180\end{seealso}
181
182%======================================================================
Andrew M. Kuchlingd91fcbe2004-08-02 12:44:28 +0000183\section{PEP 318: Decorators for Functions, Methods and Classes}
184
Andrew M. Kuchling77a602f2004-08-02 13:48:18 +0000185Python 2.2 extended Python's object model by adding static methods and
186class methods, but it didn't extend Python's syntax to provide any new
187way of defining static or class methods. Instead, you had to write a
188\keyword{def} statement in the usual way, and pass the resulting
189method to a \function{staticmethod()} or \function{classmethod()}
190function that would wrap up the function as a method of the new type.
191Your code would look like this:
192
193\begin{verbatim}
194class C:
195 def meth (cls):
196 ...
197
198 meth = classmethod(meth) # Rebind name to wrapped-up class method
199\end{verbatim}
200
201If the method was very long, it would be easy to miss or forget the
202\function{classmethod()} invocation after the function body.
203
204The intention was always to add some syntax to make such definitions
205more readable, but at the time of 2.2's release a good syntax was not
206obvious. Years later, when Python 2.4 is coming out, a good syntax
207\emph{still} isn't obvious but users are asking for easier access to
208the feature, so a new syntactic feature has been added.
209
210The feature is called ``function decorators''. The name comes from
211the idea that \function{classmethod}, \function{staticmethod}, and
212friends are storing additional information on a function object; they're
213\emph{decorating} functions with more details.
214
Fred Drake3f5c6542004-08-06 03:34:20 +0000215The notation borrows from Java and uses the \character{@} character as an
Andrew M. Kuchling77a602f2004-08-02 13:48:18 +0000216indicator. Using the new syntax, the example above would be written:
217
218\begin{verbatim}
219class C:
220
221 @classmethod
222 def meth (cls):
223 ...
224
225\end{verbatim}
226
227The \code{@classmethod} is shorthand for the
Fred Drake3f5c6542004-08-06 03:34:20 +0000228\code{meth=classmethod(meth)} assignment. More generally, if you have
Andrew M. Kuchling77a602f2004-08-02 13:48:18 +0000229the following:
230
231\begin{verbatim}
232@A @B @C
233def f ():
234 ...
235\end{verbatim}
236
237It's equivalent to:
238
239\begin{verbatim}
240def f(): ...
241f = C(B(A(f)))
242\end{verbatim}
243
244Decorators must come on the line before a function definition, and
245can't be on the same line, meaning that \code{@A def f(): ...} is
246illegal. You can only decorate function definitions, either at the
247module-level or inside a class; you can't decorate class definitions.
248
249A decorator is just a function that takes the function to be decorated
250as an argument and returns either the same function or some new
251callable thing. It's easy to write your own decorators. The
252following simple example just sets an attribute on the function
253object:
254
255\begin{verbatim}
256>>> def deco(func):
257... func.attr = 'decorated'
258... return func
259...
260>>> @deco
261... def f(): pass
262...
263>>> f
264<function f at 0x402ef0d4>
265>>> f.attr
266'decorated'
267>>>
268\end{verbatim}
269
270As a slightly more realistic example, the following decorator checks
271that the supplied argument is an integer:
272
273\begin{verbatim}
274def require_int (func):
275 def wrapper (arg):
276 assert isinstance(arg, int)
277 return func(arg)
278
279 return wrapper
280
281@require_int
282def p1 (arg):
283 print arg
284
285@require_int
286def p2(arg):
287 print arg*2
288\end{verbatim}
289
290An example in \pep{318} contains a fancier version of this idea that
291lets you specify the required type and check the returned type as
292well.
293
294Decorator functions can take arguments. If arguments are supplied,
295the decorator function is called with only those arguments and must
296return a new decorator function; this new function must take a single
297function and return a function, as previously described. In other
298words, \code{@A @B @C(args)} becomes:
299
300\begin{verbatim}
301def f(): ...
302_deco = C(args)
303f = _deco(B(A(f)))
304\end{verbatim}
305
306Getting this right can be slightly brain-bending, but it's not too
307difficult.
308
309The new syntax was provisionally added in 2.4alpha2, and is subject to
310change during the 2.4alpha release cycle depending on the Python
311community's reaction. Post-2.4 versions of Python will preserve
312compatibility with whatever syntax is used in 2.4final.
Andrew M. Kuchlingd91fcbe2004-08-02 12:44:28 +0000313
314\begin{seealso}
315\seepep{318}{Decorators for Functions, Methods and Classes}{Written
Andrew M. Kuchling77a602f2004-08-02 13:48:18 +0000316by Kevin D. Smith, Jim Jewett, and Skip Montanaro. Several people
317wrote patches implementing function decorators, but the one that was
Fred Drakee72bd4d2004-08-02 21:50:26 +0000318actually checked in was patch \#979728, written by Mark Russell.}
Andrew M. Kuchlingd91fcbe2004-08-02 12:44:28 +0000319\end{seealso}
320
321%======================================================================
Andrew M. Kuchling1a420252003-11-08 15:58:49 +0000322\section{PEP 322: Reverse Iteration}
Fred Drakeed0fa3d2003-07-30 19:14:09 +0000323
Fred Drake56fcc232004-05-06 02:55:35 +0000324A new built-in function, \function{reversed(\var{seq})}, takes a sequence
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000325and returns an iterator that loops over the elements of the sequence
Andrew M. Kuchling1a420252003-11-08 15:58:49 +0000326in reverse order.
327
328\begin{verbatim}
Raymond Hettingerbc3cba22003-11-12 16:39:30 +0000329>>> for i in reversed(xrange(1,4)):
Andrew M. Kuchling1a420252003-11-08 15:58:49 +0000330... print i
331...
3323
3332
3341
335\end{verbatim}
336
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000337Compared to extended slicing, such as \code{range(1,4)[::-1]},
338\function{reversed()} is easier to read, runs faster, and uses
339substantially less memory.
Raymond Hettingerbc3cba22003-11-12 16:39:30 +0000340
Andrew M. Kuchling1a420252003-11-08 15:58:49 +0000341Note that \function{reversed()} only accepts sequences, not arbitrary
Raymond Hettingerbc3cba22003-11-12 16:39:30 +0000342iterators. If you want to reverse an iterator, first convert it to
343a list with \function{list()}.
Andrew M. Kuchling1a420252003-11-08 15:58:49 +0000344
345\begin{verbatim}
Andrew M. Kuchling44a31e12004-01-01 18:33:34 +0000346>>> input= open('/etc/passwd', 'r')
347>>> for line in reversed(list(input)):
Andrew M. Kuchling1a420252003-11-08 15:58:49 +0000348... print line
349...
350root:*:0:0:System Administrator:/var/root:/bin/tcsh
351 ...
352\end{verbatim}
Fred Drakeed0fa3d2003-07-30 19:14:09 +0000353
Andrew M. Kuchlingf7a6b672003-11-08 16:05:37 +0000354\begin{seealso}
355\seepep{322}{Reverse Iteration}{Written and implemented by Raymond Hettinger.}
356
357\end{seealso}
358
Fred Drakeed0fa3d2003-07-30 19:14:09 +0000359
360%======================================================================
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000361\section{PEP 327: Decimal Data Type}
362
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000363Python has always supported floating-point (FP) numbers as a data
364type, based on the underlying C \ctype{double} type. However, while
365most programming languages provide a floating-point type, most people
366(even programmers) are unaware that computing with floating-point
367numbers entails certain unavoidable inaccuracies. The new decimal
368type provides a way to avoid these inaccuracies.
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000369
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000370\subsection{Why is Decimal needed?}
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000371
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000372The limitations arise from the representation used for floating-point numbers.
373FP numbers are made up of three components:
374
375\begin{itemize}
376\item The sign, which is -1 or +1.
377\item The mantissa, which is a single-digit binary number
378followed by a fractional part. For example, \code{1.01} in base-2 notation
379is \code{1 + 0/2 + 1/4}, or 1.25 in decimal notation.
380\item The exponent, which tells where the decimal point is located in the number represented.
381\end{itemize}
382
383For example, the number 1.25 has sign +1, mantissa 1.01 (in binary),
384and exponent of 0 (the decimal point doesn't need to be shifted). The
385number 5 has the same sign and mantissa, but the exponent is 2
386because the mantissa is multiplied by 4 (2 to the power of the exponent 2).
387
388Modern systems usually provide floating-point support that conforms to
389a relevant standard called IEEE 754. C's \ctype{double} type is
390usually implemented as a 64-bit IEEE 754 number, which uses 52 bits of
391space for the mantissa. This means that numbers can only be specified
392to 52 bits of precision. If you're trying to represent numbers whose
393expansion repeats endlessly, the expansion is cut off after 52 bits.
394Unfortunately, most software needs to produce output in base 10, and
395base 10 often gives rise to such repeating decimals. For example, 1.1
396decimal is binary \code{1.0001100110011 ...}; .1 = 1/16 + 1/32 + 1/256
397plus an infinite number of additional terms. IEEE 754 has to chop off
398that infinitely repeated decimal after 52 digits, so the
399representation is slightly inaccurate.
400
401Sometimes you can see this inaccuracy when the number is printed:
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000402\begin{verbatim}
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000403>>> 1.1
4041.1000000000000001
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000405\end{verbatim}
406
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000407The inaccuracy isn't always visible when you print the number because
Andrew M. Kuchling0ad20f12004-07-21 13:00:06 +0000408the FP-to-decimal-string conversion is provided by the C library and
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000409most C libraries try to produce sensible output, but the inaccuracy is
410still there and subsequent operations can magnify the error.
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000411
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000412For many applications this doesn't matter. If I'm plotting points and
413displaying them on my monitor, the difference between 1.1 and
4141.1000000000000001 is too small to be visible. Reports often limit
415output to a certain number of decimal places, and if you round the
416number to two or three or even eight decimal places, the error is
417never apparent. However, for applications where it does matter,
418it's a lot of work to implement your own custom arithmetic routines.
419
420\subsection{The \class{Decimal} type}
421
422A new module, \module{decimal}, was added to Python's standard library.
423It contains two classes, \class{Decimal} and \class{Context}.
424\class{Decimal} instances represent numbers, and
425\class{Context} instances are used to wrap up various settings such as the precision and default rounding mode.
426
427\class{Decimal} instances, like regular Python integers and FP numbers, are immutable; once they've been created, you can't change the value it represents.
428\class{Decimal} instances can be created from integers or strings:
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000429
430\begin{verbatim}
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000431>>> import decimal
432>>> decimal.Decimal(1972)
433Decimal("1972")
434>>> decimal.Decimal("1.1")
435Decimal("1.1")
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000436\end{verbatim}
437
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000438You can also provide tuples containing the sign, mantissa represented
439as a tuple of decimal digits, and exponent:
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000440
441\begin{verbatim}
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000442>>> decimal.Decimal((1, (1, 4, 7, 5), -2))
443Decimal("-14.75")
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000444\end{verbatim}
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000445
446Cautionary note: the sign bit is a Boolean value, so 0 is positive and 1 is negative.
447
448Floating-point numbers posed a bit of a problem: should the FP number
449representing 1.1 turn into the decimal number for exactly 1.1, or for
4501.1 plus whatever inaccuracies are introduced? The decision was to
451leave such a conversion out of the API. Instead, you should convert
452the floating-point number into a string using the desired precision and
453pass the string to the \class{Decimal} constructor:
454
455\begin{verbatim}
456>>> f = 1.1
457>>> decimal.Decimal(str(f))
458Decimal("1.1")
Andrew M. Kuchling0ad20f12004-07-21 13:00:06 +0000459>>> decimal.Decimal('%.12f' % f)
460Decimal("1.100000000000")
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000461\end{verbatim}
462
463Once you have \class{Decimal} instances, you can perform the usual
464mathematical operations on them. One limitation: exponentiation
465requires an integer exponent:
466
467\begin{verbatim}
468>>> a = decimal.Decimal('35.72')
469>>> b = decimal.Decimal('1.73')
470>>> a+b
471Decimal("37.45")
472>>> a-b
473Decimal("33.99")
474>>> a*b
475Decimal("61.7956")
476>>> a/b
Andrew M. Kuchling0ad20f12004-07-21 13:00:06 +0000477Decimal("20.64739884393063583815028902")
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000478>>> a ** 2
479Decimal("1275.9184")
Andrew M. Kuchling0ad20f12004-07-21 13:00:06 +0000480>>> a**b
481Traceback (most recent call last):
482 ...
483decimal.InvalidOperation: x ** (non-integer)
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000484\end{verbatim}
485
486You can combine \class{Decimal} instances with integers, but not with
487floating-point numbers:
488
489\begin{verbatim}
490>>> a + 4
491Decimal("39.72")
492>>> a + 4.5
493Traceback (most recent call last):
494 ...
495TypeError: You can interact Decimal only with int, long or Decimal data types.
496>>>
497\end{verbatim}
498
499\class{Decimal} numbers can be used with the \module{math} and
Andrew M. Kuchling0ad20f12004-07-21 13:00:06 +0000500\module{cmath} modules, but note that they'll be immediately converted to
501floating-point numbers before the operation is performed, resulting in
502a possible loss of precision and accuracy. You'll also get back a
503regular floating-point number and not a \class{Decimal}.
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000504
505\begin{verbatim}
506>>> import math, cmath
507>>> d = decimal.Decimal('123456789012.345')
508>>> math.sqrt(d)
509351364.18288201344
510>>> cmath.sqrt(-d)
511351364.18288201344j
Andrew M. Kuchling0ad20f12004-07-21 13:00:06 +0000512\end{verbatim}
513
514Instances also have a \method{sqrt()} method that returns a
515\class{Decimal}, but if you need other things such as trigonometric
516functions you'll have to implement them.
517
518\begin{verbatim}
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000519>>> d.sqrt()
Raymond Hettingerca1a7752004-07-12 13:00:45 +0000520Decimal("351364.1828820134592177245001")
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000521\end{verbatim}
522
523
524\subsection{The \class{Context} type}
525
526Instances of the \class{Context} class encapsulate several settings for
527decimal operations:
528
529\begin{itemize}
530 \item \member{prec} is the precision, the number of decimal places.
531 \item \member{rounding} specifies the rounding mode. The \module{decimal}
532 module has constants for the various possibilities:
533 \constant{ROUND_DOWN}, \constant{ROUND_CEILING}, \constant{ROUND_HALF_EVEN}, and various others.
Andrew M. Kuchling0ad20f12004-07-21 13:00:06 +0000534 \item \member{traps} is a dictionary specifying what happens on
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000535encountering certain error conditions: either an exception is raised or
536a value is returned. Some examples of error conditions are
537division by zero, loss of precision, and overflow.
538\end{itemize}
539
540There's a thread-local default context available by calling
541\function{getcontext()}; you can change the properties of this context
542to alter the default precision, rounding, or trap handling.
543
544\begin{verbatim}
545>>> decimal.getcontext().prec
54628
547>>> decimal.Decimal(1) / decimal.Decimal(7)
Raymond Hettingerca1a7752004-07-12 13:00:45 +0000548Decimal("0.1428571428571428571428571429")
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000549>>> decimal.getcontext().prec = 9
550>>> decimal.Decimal(1) / decimal.Decimal(7)
Raymond Hettingerca1a7752004-07-12 13:00:45 +0000551Decimal("0.142857143")
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000552\end{verbatim}
553
Andrew M. Kuchling0ad20f12004-07-21 13:00:06 +0000554The default action for error conditions is selectable; the module can
555either return a special value such as infinity or not-a-number, or
556exceptions can be raised:
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000557
558\begin{verbatim}
559>>> decimal.Decimal(1) / decimal.Decimal(0)
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000560Traceback (most recent call last):
561 ...
562decimal.DivisionByZero: x / 0
Andrew M. Kuchling0ad20f12004-07-21 13:00:06 +0000563>>> decimal.getcontext().traps[decimal.DivisionByZero] = False
564>>> decimal.Decimal(1) / decimal.Decimal(0)
565Decimal("Infinity")
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000566>>>
567\end{verbatim}
568
569The \class{Context} instance also has various methods for formatting
570numbers such as \method{to_eng_string()} and \method{to_sci_string()}.
571
Andrew M. Kuchling0ad20f12004-07-21 13:00:06 +0000572For more information, see the documentation for the \module{decimal}
573module, which includes a quick-start tutorial and a reference.
574
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000575\begin{seealso}
576\seepep{327}{Decimal Data Type}{Written by Facundo Batista and implemented
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000577 by Facundo Batista, Eric Price, Raymond Hettinger, Aahz, and Tim Peters.}
578
Raymond Hettingerca1a7752004-07-12 13:00:45 +0000579\seeurl{http://research.microsoft.com/\textasciitilde hollasch/cgindex/coding/ieeefloat.html}
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000580{A more detailed overview of the IEEE-754 representation.}
581
582\seeurl{http://www.lahey.com/float.htm}
583{The article uses Fortran code to illustrate many of the problems
584that floating-point inaccuracy can cause.}
585
586\seeurl{http://www2.hursley.ibm.com/decimal/}
587{A description of a decimal-based representation. This representation
588is being proposed as a standard, and underlies the new Python decimal
589type. Much of this material was written by Mike Cowlishaw, designer of the
Raymond Hettingerca1a7752004-07-12 13:00:45 +0000590Rexx language.}
Andrew M. Kuchlingc8f8a812004-07-04 01:26:42 +0000591
Raymond Hettinger0fff62f2004-07-01 11:52:15 +0000592\end{seealso}
593
594
595%======================================================================
Andrew M. Kuchling65a33322004-07-21 12:41:38 +0000596\section{PEP 331: Locale-Independent Float/String Conversions}
597
598The \module{locale} modules lets Python software select various
599conversions and display conventions that are localized to a particular
600country or language. However, the module was careful to not change
601the numeric locale because various functions in Python's
602implementation required that the numeric locale remain set to the
603\code{'C'} locale. Often this was because the code was using the C library's
604\cfunction{atof()} function.
605
606Not setting the numeric locale caused trouble for extensions that used
607third-party C libraries, however, because they wouldn't have the
608correct locale set. The motivating example was GTK+, whose user
609interface widgets weren't displaying numbers in the current locale.
610
611The solution described in the PEP is to add three new functions to the
612Python API that perform ASCII-only conversions, ignoring the locale
613setting:
614
615\begin{itemize}
616 \item \cfunction{PyOS_ascii_strtod(\var{str}, \var{ptr})}
617and \cfunction{PyOS_ascii_atof(\var{str}, \var{ptr})}
618both convert a string to a C \ctype{double}.
619 \item \cfunction{PyOS_ascii_formatd(\var{buffer}, \var{buf_len}, \var{format}, \var{d})} converts a \ctype{double} to an ASCII string.
620\end{itemize}
621
622The code for these functions came from the GLib library
623(\url{http://developer.gnome.org/arch/gtk/glib.html}), whose
624developers kindly relicensed the relevant functions and donated them
625to the Python Software Foundation. The \module{locale} module
626can now change the numeric locale, letting extensions such as GTK+
627produce the correct results.
628
629\begin{seealso}
630\seepep{331}{Locale-Independent Float/String Conversions}{Written by Christian R. Reis, and implemented by Gustavo Carneiro.}
631\end{seealso}
632
633%======================================================================
Fred Drakeed0fa3d2003-07-30 19:14:09 +0000634\section{Other Language Changes}
635
636Here are all of the changes that Python 2.4 makes to the core Python
637language.
638
639\begin{itemize}
Raymond Hettingerd4462302003-11-26 17:52:45 +0000640
Raymond Hettinger31017ae2004-03-04 08:25:44 +0000641\item The \method{dict.update()} method now accepts the same
642argument forms as the \class{dict} constructor. This includes any
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +0000643mapping, any iterable of key/value pairs, and keyword arguments.
Raymond Hettinger31017ae2004-03-04 08:25:44 +0000644
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000645\item The string methods \method{ljust()}, \method{rjust()}, and
Andrew M. Kuchling67087562003-11-26 18:03:48 +0000646\method{center()} now take an optional argument for specifying a
Raymond Hettingerd4462302003-11-26 17:52:45 +0000647fill character other than a space.
648
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +0000649\item Strings also gained an \method{rsplit()} method that
Raymond Hettingered54d912003-12-31 01:59:18 +0000650works like the \method{split()} method but splits from the end of
Andrew M. Kuchling44a31e12004-01-01 18:33:34 +0000651the string.
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +0000652
653\begin{verbatim}
Raymond Hettinger7a6d2972004-02-13 19:00:07 +0000654>>> 'www.python.org'.split('.', 1)
655['www', 'python.org']
656'www.python.org'.rsplit('.', 1)
657['www.python', 'org']
658\end{verbatim}
Raymond Hettinger97ef8de2004-01-05 00:29:57 +0000659
Andrew M. Kuchling2fb4d512003-10-21 12:31:16 +0000660\item The \method{sort()} method of lists gained three keyword
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000661arguments: \var{cmp}, \var{key}, and \var{reverse}. These arguments
Andrew M. Kuchling2fb4d512003-10-21 12:31:16 +0000662make some common usages of \method{sort()} simpler. All are optional.
663
664\var{cmp} is the same as the previous single argument to
665\method{sort()}; if provided, the value should be a comparison
666function that takes two arguments and returns -1, 0, or +1 depending
667on how the arguments compare.
668
669\var{key} should be a single-argument function that takes a list
670element and returns a comparison key for the element. The list is
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000671then sorted using the comparison keys. The following example sorts a
672list case-insensitively:
Andrew M. Kuchling2fb4d512003-10-21 12:31:16 +0000673
674\begin{verbatim}
675>>> L = ['A', 'b', 'c', 'D']
676>>> L.sort() # Case-sensitive sort
677>>> L
678['A', 'D', 'b', 'c']
679>>> L.sort(key=lambda x: x.lower())
680>>> L
681['A', 'b', 'c', 'D']
682>>> L.sort(cmp=lambda x,y: cmp(x.lower(), y.lower()))
683>>> L
684['A', 'b', 'c', 'D']
685\end{verbatim}
686
687The last example, which uses the \var{cmp} parameter, is the old way
Raymond Hettingered54d912003-12-31 01:59:18 +0000688to perform a case-insensitive sort. It works but is slower than
Andrew M. Kuchling2fb4d512003-10-21 12:31:16 +0000689using a \var{key} parameter. Using \var{key} results in calling the
690\method{lower()} method once for each element in the list while using
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000691\var{cmp} will call it twice for each comparison.
Andrew M. Kuchling2fb4d512003-10-21 12:31:16 +0000692
Andrew M. Kuchling981a9182003-11-13 21:33:26 +0000693For simple key functions and comparison functions, it is often
694possible to avoid a \keyword{lambda} expression by using an unbound
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000695method instead. For example, the above case-insensitive sort is best
696coded as:
697
698\begin{verbatim}
699>>> L.sort(key=str.lower)
700>>> L
701['A', 'b', 'c', 'D']
702\end{verbatim}
703
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000704The \var{reverse} parameter should have a Boolean value. If the value
705is \constant{True}, the list will be sorted into reverse order.
706Instead of \code{L.sort(lambda x,y: cmp(x.score, y.score)) ;
707L.reverse()}, you can now write: \code{L.sort(key = lambda x: x.score,
708reverse=True)}.
Fred Drakeed0fa3d2003-07-30 19:14:09 +0000709
Andrew M. Kuchling981a9182003-11-13 21:33:26 +0000710The results of sorting are now guaranteed to be stable. This means
711that two entries with equal keys will be returned in the same order as
712they were input. For example, you can sort a list of people by name,
713and then sort the list by age, resulting in a list sorted by age where
714people with the same age are in name-sorted order.
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000715
Fred Drake56fcc232004-05-06 02:55:35 +0000716\item There is a new built-in function
717\function{sorted(\var{iterable})} that works like the in-place
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000718\method{list.sort()} method but can be used in
Fred Drake56fcc232004-05-06 02:55:35 +0000719expressions. The differences are:
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000720 \begin{itemize}
Raymond Hettinger7d1dd042003-11-12 16:42:10 +0000721 \item the input may be any iterable;
722 \item a newly formed copy is sorted, leaving the original intact; and
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000723 \item the expression returns the new sorted copy
724 \end{itemize}
Andrew M. Kuchling1a420252003-11-08 15:58:49 +0000725
726\begin{verbatim}
727>>> L = [9,7,8,3,2,4,1,6,5]
Raymond Hettinger64958a12003-12-17 20:43:33 +0000728>>> [10+i for i in sorted(L)] # usable in a list comprehension
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000729[11, 12, 13, 14, 15, 16, 17, 18, 19]
Hye-Shik Chang2b052482004-07-17 13:53:48 +0000730>>> L # original is left unchanged
Andrew M. Kuchlinge3e1eca2004-07-26 18:52:48 +0000731[9,7,8,3,2,4,1,6,5]
732>>> sorted('Monty Python') # any iterable may be an input
733[' ', 'M', 'P', 'h', 'n', 'n', 'o', 'o', 't', 't', 'y', 'y']
Raymond Hettingerd4462302003-11-26 17:52:45 +0000734
735>>> # List the contents of a dict sorted by key values
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000736>>> colormap = dict(red=1, blue=2, green=3, black=4, yellow=5)
Raymond Hettinger64958a12003-12-17 20:43:33 +0000737>>> for k, v in sorted(colormap.iteritems()):
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000738... print k, v
739...
740black 4
741blue 2
742green 3
743red 1
744yellow 5
Andrew M. Kuchling1a420252003-11-08 15:58:49 +0000745\end{verbatim}
746
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +0000747\item The \function{eval(\var{expr}, \var{globals}, \var{locals})}
Andrew M. Kuchling1455f792004-08-02 12:09:58 +0000748and \function{execfile(\var{filename}, \var{globals}, \var{locals})}
749functions and the \keyword{exec} statement now accept any mapping type
750for the \var{locals} argument. Previously this had to be a regular
751Python dictionary. (Contributed by Raymond Hettinger.)
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +0000752
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000753\item The \function{zip()} built-in function and \function{itertools.izip()}
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000754 now return an empty list if called with no arguments.
755 Previously they raised a \exception{TypeError}
756 exception. This makes them more
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000757 suitable for use with variable length argument lists:
758
759\begin{verbatim}
760>>> def transpose(array):
761... return zip(*array)
762...
763>>> transpose([(1,2,3), (4,5,6)])
764[(1, 4), (2, 5), (3, 6)]
765>>> transpose([])
766[]
767\end{verbatim}
Andrew M. Kuchling6aedcfc2003-10-21 12:48:23 +0000768
Andrew M. Kuchlingd91fcbe2004-08-02 12:44:28 +0000769\item Encountering a failure while importing a module no longer leaves
770a partially-initialized module object in \code{sys.modules}. The
771incomplete module object left behind would fool further imports of the
772same module into succeeding, leading to confusing errors.
773
Andrew M. Kuchling65a33322004-07-21 12:41:38 +0000774\item \constant{None} is now a constant; code that binds a new value to
775the name \samp{None} is now a syntax error.
776
Fred Drakeed0fa3d2003-07-30 19:14:09 +0000777\end{itemize}
778
779
780%======================================================================
781\subsection{Optimizations}
782
783\begin{itemize}
784
Raymond Hettingerca1a7752004-07-12 13:00:45 +0000785\item The inner loops for list and tuple slicing
Andrew M. Kuchling65a33322004-07-21 12:41:38 +0000786 were optimized and now run about one-third faster. The inner loops
787 were also optimized for dictionaries, resulting in performance boosts for
788 \method{keys()}, \method{values()}, \method{items()},
789 \method{iterkeys()}, \method{itervalues()}, and \method{iteritems()}.
Raymond Hettingerb7d05db2004-03-08 07:25:05 +0000790
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000791\item The machinery for growing and shrinking lists was optimized for
792 speed and for space efficiency. Appending and popping from lists now
793 runs faster due to more efficient code paths and less frequent use of
794 the underlying system \cfunction{realloc()}. List comprehensions
795 also benefit. \method{list.extend()} was also optimized and no
796 longer converts its argument into a temporary list before extending
797 the base list.
Raymond Hettinger7a6d2972004-02-13 19:00:07 +0000798
Raymond Hettinger97ef8de2004-01-05 00:29:57 +0000799\item \function{list()}, \function{tuple()}, \function{map()},
800 \function{filter()}, and \function{zip()} now run several times
801 faster with non-sequence arguments that supply a \method{__len__()}
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000802 method.
Raymond Hettinger97ef8de2004-01-05 00:29:57 +0000803
Raymond Hettinger23a0f4e2004-01-05 08:15:20 +0000804\item The methods \method{list.__getitem__()},
Raymond Hettinger97ef8de2004-01-05 00:29:57 +0000805 \method{dict.__getitem__()}, and \method{dict.__contains__()} are
806 are now implemented as \class{method_descriptor} objects rather
807 than \class{wrapper_descriptor} objects. This form of optimized
808 access doubles their performance and makes them more suitable for
Raymond Hettinger23a0f4e2004-01-05 08:15:20 +0000809 use as arguments to functionals:
810 \samp{map(mydict.__getitem__, keylist)}.
Fred Drakeed0fa3d2003-07-30 19:14:09 +0000811
Fred Draked6d35d92004-06-03 13:31:22 +0000812\item Added a new opcode, \code{LIST_APPEND}, that simplifies
Raymond Hettingerdd80f762004-03-07 07:31:06 +0000813 the generated bytecode for list comprehensions and speeds them up
814 by about a third.
815
Andrew M. Kuchlingac642872004-08-07 13:13:31 +0000816\item String concatenations in statements of the form \code{s = s +
817"abc"} and \code{s += "abc"} are now performed more efficiently in
818certain circumstances. This optimization won't be present in other
819Python implementations such as Jython, so you shouldn't rely on it;
820using the \method{join()} method of strings is still recommended when
821you want to efficiently glue a large number of strings together.
822
Fred Drakeed0fa3d2003-07-30 19:14:09 +0000823\end{itemize}
824
825The net result of the 2.4 optimizations is that Python 2.4 runs the
826pystone benchmark around XX\% faster than Python 2.3 and YY\% faster
827than Python 2.2.
828
829
830%======================================================================
831\section{New, Improved, and Deprecated Modules}
832
833As usual, Python's standard library received a number of enhancements and
834bug fixes. Here's a partial list of the most notable changes, sorted
835alphabetically by module name. Consult the
836\file{Misc/NEWS} file in the source tree for a more
837complete list of changes, or look through the CVS logs for all the
838details.
839
840\begin{itemize}
841
Anthony Baxter5da4c832004-07-09 16:16:46 +0000842% XXX new email parser
843
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +0000844\item The \module{asyncore} module's \function{loop()} now has a
845 \var{count} parameter that lets you perform a limited number
846 of passes through the polling loop. The default is still to loop
847 forever.
848
Andrew M. Kuchling69f31eb2003-08-13 23:11:04 +0000849\item The \module{curses} modules now supports the ncurses extension
Fred Draked6d35d92004-06-03 13:31:22 +0000850 \function{use_default_colors()}. On platforms where the terminal
851 supports transparency, this makes it possible to use a transparent
852 background. (Contributed by J\"org Lehmann.)
Andrew M. Kuchling6aedcfc2003-10-21 12:48:23 +0000853
Raymond Hettinger0c410272004-01-05 10:13:35 +0000854\item The \module{bisect} module now has an underlying C implementation
855 for improved performance.
856 (Contributed by Dmitry Vasiliev.)
857
Andrew M. Kuchling5303a962004-01-18 15:55:51 +0000858\item The CJKCodecs collections of East Asian codecs, maintained
859by Hye-Shik Chang, was integrated into 2.4.
860The new encodings are:
861
862\begin{itemize}
Andrew M. Kuchling671c5062004-07-28 15:29:39 +0000863 \item Chinese (PRC): gb2312, gbk, gb18030, big5hkscs, hz
Andrew M. Kuchling5303a962004-01-18 15:55:51 +0000864 \item Chinese (ROC): big5, cp950
Andrew M. Kuchling671c5062004-07-28 15:29:39 +0000865 \item Japanese: cp932, euc-jis-2004, euc-jp,
Andrew M. Kuchling5303a962004-01-18 15:55:51 +0000866euc-jisx0213, iso-2022-jp, iso-2022-jp-1, iso-2022-jp-2,
Andrew M. Kuchling671c5062004-07-28 15:29:39 +0000867 iso-2022-jp-3, iso-2022-jp-ext, iso-2022-jp-2004,
868 shift-jis, shift-jisx0213, shift-jis-2004
Andrew M. Kuchling5303a962004-01-18 15:55:51 +0000869 \item Korean: cp949, euc-kr, johab, iso-2022-kr
870\end{itemize}
871
Fred Drake9bae19e2004-08-07 14:28:37 +0000872\item Some other new encodings were added: ISO_8859-11, ISO_8859-16, PCTP-154,
Andrew M. Kuchlinge30c4d42004-08-07 13:58:02 +0000873and TIS-620.
874
Andrew M. Kuchlingfd0e4942004-02-09 13:23:34 +0000875\item There is a new \module{collections} module for
876 various specialized collection datatypes.
877 Currently it contains just one type, \class{deque},
878 a double-ended queue that supports efficiently adding and removing
879 elements from either end.
Raymond Hettinger756b3f32004-01-29 06:37:52 +0000880
881\begin{verbatim}
882>>> from collections import deque
883>>> d = deque('ghi') # make a new deque with three items
884>>> d.append('j') # add a new entry to the right side
885>>> d.appendleft('f') # add a new entry to the left side
886>>> d # show the representation of the deque
887deque(['f', 'g', 'h', 'i', 'j'])
888>>> d.pop() # return and remove the rightmost item
889'j'
890>>> d.popleft() # return and remove the leftmost item
891'f'
892>>> list(d) # list the contents of the deque
893['g', 'h', 'i']
894>>> 'h' in d # search the deque
895True
896\end{verbatim}
897
Andrew M. Kuchlingfd0e4942004-02-09 13:23:34 +0000898Several modules now take advantage of \class{collections.deque} for
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000899improved performance, such as the \module{Queue} and
900\module{threading} modules.
Andrew M. Kuchling5303a962004-01-18 15:55:51 +0000901
Fred Drake9f15b5c2004-05-18 04:30:00 +0000902\item The \module{ConfigParser} classes have been enhanced slightly.
903 The \method{read()} method now returns a list of the files that
904 were successfully parsed, and the \method{set()} method raises
905 \exception{TypeError} if passed a \var{value} argument that isn't a
906 string.
907
Raymond Hettinger607c00f2003-11-12 16:27:50 +0000908\item The \module{heapq} module has been converted to C. The resulting
Andrew M. Kuchlingfd0e4942004-02-09 13:23:34 +0000909 tenfold improvement in speed makes the module suitable for handling
Raymond Hettinger33ecffb2004-06-10 05:03:17 +0000910 high volumes of data. In addition, the module has two new functions
911 \function{nlargest()} and \function{nsmallest()} that use heaps to
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +0000912 find the N largest or smallest values in a dataset without the
Raymond Hettinger33ecffb2004-06-10 05:03:17 +0000913 expense of a full sort.
Andrew M. Kuchling1a420252003-11-08 15:58:49 +0000914
Andrew M. Kuchlingce4bae62004-07-27 12:13:25 +0000915\item The \module{imaplib} module now supports IMAP's THREAD command
916(contributed by Yves Dionne) and new \method{deleteacl()} and
917\method{myrights()} methods (contributed by Arnaud Mazin).
Andrew M. Kuchlingdff9dbd2003-11-20 22:22:19 +0000918
Andrew M. Kuchlingad809552003-12-06 23:19:23 +0000919\item The \module{itertools} module gained a
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000920 \function{groupby(\var{iterable}\optional{, \var{func}})} function.
Andrew M. Kuchlingad809552003-12-06 23:19:23 +0000921 \var{iterable} returns a succession of elements, and the optional
922 \var{func} is a function that takes an element and returns a key
923 value; if omitted, the key is simply the element itself.
924 \function{groupby()} then groups the elements into subsequences
925 which have matching values of the key, and returns a series of 2-tuples
926 containing the key value and an iterator over the subsequence.
927
928Here's an example. The \var{key} function simply returns whether a
929number is even or odd, so the result of \function{groupby()} is to
930return consecutive runs of odd or even numbers.
931
932\begin{verbatim}
933>>> import itertools
934>>> L = [2,4,6, 7,8,9,11, 12, 14]
935>>> for key_val, it in itertools.groupby(L, lambda x: x % 2):
936... print key_val, list(it)
937...
9380 [2, 4, 6]
9391 [7]
9400 [8]
9411 [9, 11]
9420 [12, 14]
943>>>
944\end{verbatim}
945
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000946\function{groupby()} is typically used with sorted input. The logic
947for \function{groupby()} is similar to the \UNIX{} \code{uniq} filter
948which makes it handy for eliminating, counting, or identifying
949duplicate elements:
Raymond Hettingerfeb78c92003-12-12 13:13:47 +0000950
951\begin{verbatim}
952>>> word = 'abracadabra'
Raymond Hettingered54d912003-12-31 01:59:18 +0000953>>> letters = sorted(word) # Turn string into a sorted list of letters
Raymond Hettinger64958a12003-12-17 20:43:33 +0000954>>> letters
Andrew M. Kuchling4612bc52003-12-16 20:59:37 +0000955['a', 'a', 'a', 'a', 'a', 'b', 'b', 'c', 'd', 'r', 'r']
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000956>>> for k, g in itertools.groupby(letters):
957... print k, list(g)
958...
959a ['a', 'a', 'a', 'a', 'a']
960b ['b', 'b']
961c ['c']
962d ['d']
963r ['r', 'r']
964>>> # List unique letters
965>>> [k for k, g in groupby(letters)]
Raymond Hettingerfeb78c92003-12-12 13:13:47 +0000966['a', 'b', 'c', 'd', 'r']
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000967>>> # Count letter occurences
968>>> [(k, len(list(g))) for k, g in groupby(letters)]
Raymond Hettingerfeb78c92003-12-12 13:13:47 +0000969[('a', 5), ('b', 2), ('c', 1), ('d', 1), ('r', 2)]
Raymond Hettingerfeb78c92003-12-12 13:13:47 +0000970\end{verbatim}
971
Raymond Hettingered54d912003-12-31 01:59:18 +0000972\item \module{itertools} also gained a function named
973\function{tee(\var{iterator}, \var{N})} that returns \var{N} independent
974iterators that replicate \var{iterator}. If \var{N} is omitted, the
975default is 2.
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +0000976
977\begin{verbatim}
978>>> L = [1,2,3]
979>>> i1, i2 = itertools.tee(L)
980>>> i1,i2
981(<itertools.tee object at 0x402c2080>, <itertools.tee object at 0x402c2090>)
Raymond Hettingered54d912003-12-31 01:59:18 +0000982>>> list(i1) # Run the first iterator to exhaustion
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +0000983[1, 2, 3]
Raymond Hettingered54d912003-12-31 01:59:18 +0000984>>> list(i2) # Run the second iterator to exhaustion
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +0000985[1, 2, 3]
986>\end{verbatim}
987
988Note that \function{tee()} has to keep copies of the values returned
Raymond Hettingered54d912003-12-31 01:59:18 +0000989by the iterator; in the worst case, it may need to keep all of them.
Andrew M. Kuchling44a31e12004-01-01 18:33:34 +0000990This should therefore be used carefully if the leading iterator
Raymond Hettingered54d912003-12-31 01:59:18 +0000991can run far ahead of the trailing iterator in a long stream of inputs.
Andrew M. Kuchling3bf85f12004-07-05 01:37:07 +0000992If the separation is large, then you might as well use
Raymond Hettingered54d912003-12-31 01:59:18 +0000993\function{list()} instead. When the iterators track closely with one
994another, \function{tee()} is ideal. Possible applications include
995bookmarking, windowing, or lookahead iterators.
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +0000996
Andrew M. Kuchling5785a132004-07-26 19:28:46 +0000997\item A number of functions were added to the \module{locale}
998module, such as \function{bind_textdomain_codeset()} to specify a
999particular encoding, and a family of \function{l*gettext()} functions
1000that return messages in the chosen encoding.
1001(Contributed by Gustavo Niemeyer.)
1002
Andrew M. Kuchling23406892004-07-15 11:44:42 +00001003\item The \module{logging} package's \function{basicConfig} function
1004gained some keyword arguments to simplify log configuration. The
1005default behavior is to log messages to standard error, but
1006various keyword arguments can be specified to log to a particular file,
1007change the logging format, or set the logging level. For example:
Andrew M. Kuchlingbcefe692004-07-07 13:01:53 +00001008
1009\begin{verbatim}
1010import logging
1011logging.basicConfig(filename = '/var/log/application.log',
1012 level=0, # Log all messages, including debugging,
1013 format='%(levelname):%(process):%(thread):%(message)')
1014\end{verbatim}
1015
1016Another addition to \module{logging} is a
1017\class{TimedRotatingFileHandler} class which rotates its log files at
1018a timed interval. The module already had \class{RotatingFileHandler},
1019which rotated logs once the file exceeded a certain size. Both
1020classes derive from a new \class{BaseRotatingHandler} class that can
1021be used to implement other rotating handlers.
1022
Andrew M. Kuchling5785a132004-07-26 19:28:46 +00001023\item The \module{nntplib} module's \class{NNTP} class gained
1024\method{description()} and \method{descriptions()} methods to retrieve
1025newsgroup descriptions for a single group or for a range of groups.
1026(Contributed by J\"urgen A. Erhard.)
1027
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +00001028\item The \module{operator} module gained two new functions,
1029\function{attrgetter(\var{attr})} and \function{itemgetter(\var{index})}.
1030Both functions return callables that take a single argument and return
Raymond Hettingered54d912003-12-31 01:59:18 +00001031the corresponding attribute or item; these callables make excellent
Andrew M. Kuchlingbcefe692004-07-07 13:01:53 +00001032data extractors when used with \function{map()} or
1033\function{sorted()}. For example:
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +00001034
1035\begin{verbatim}
Raymond Hettingered54d912003-12-31 01:59:18 +00001036>>> L = [('c', 2), ('d', 1), ('a', 4), ('b', 3)]
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +00001037>>> map(operator.itemgetter(0), L)
1038['c', 'd', 'a', 'b']
1039>>> map(operator.itemgetter(1), L)
Raymond Hettingered54d912003-12-31 01:59:18 +00001040[2, 1, 4, 3]
1041>>> sorted(L, key=operator.itemgetter(1)) # Sort list by second tuple item
1042[('d', 1), ('c', 2), ('b', 3), ('a', 4)]
Andrew M. Kuchling35f2b052003-12-18 13:28:13 +00001043\end{verbatim}
1044
Andrew M. Kuchlinge30c4d42004-08-07 13:58:02 +00001045\item The \module{optparse} module was updated. The module now passes
1046its messages through \function{gettext.gettext()}, making it possible
1047to internationalize Optik's help and error messages. Help messages
Fred Drake9bae19e2004-08-07 14:28:37 +00001048for options can now include the string \code{'\%default'}, which will
Andrew M. Kuchlinge30c4d42004-08-07 13:58:02 +00001049be replaced by the option's default value.
1050
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +00001051\item A new \function{getsid()} function was added to the
1052\module{posix} module that underlies the \module{os} module.
1053(Contributed by J. Raynor.)
1054
1055\item The \module{poplib} module now supports POP over SSL.
1056
1057\item The \module{profile} module can now profile C extension functions.
1058% XXX more to say about this?
1059
Andrew M. Kuchling6aedcfc2003-10-21 12:48:23 +00001060\item The \module{random} module has a new method called \method{getrandbits(N)}
Raymond Hettinger607c00f2003-11-12 16:27:50 +00001061 which returns an N-bit long integer. This method supports the existing
1062 \method{randrange()} method, making it possible to efficiently generate
Andrew M. Kuchling44a31e12004-01-01 18:33:34 +00001063 arbitrarily large random numbers.
Andrew M. Kuchling6aedcfc2003-10-21 12:48:23 +00001064
1065\item The regular expression language accepted by the \module{re} module
1066 was extended with simple conditional expressions, written as
1067 \code{(?(\var{group})\var{A}|\var{B})}. \var{group} is either a
1068 numeric group ID or a group name defined with \code{(?P<group>...)}
1069 earlier in the expression. If the specified group matched, the
1070 regular expression pattern \var{A} will be tested against the string; if
1071 the group didn't match, the pattern \var{B} will be used instead.
Raymond Hettinger874ebd52004-05-31 03:15:02 +00001072
Andrew M. Kuchling7f203b82004-08-09 14:48:28 +00001073\item A new \function{socketpair()} function was added to the
1074 \module{socket} module, returning a pair of connected sockets.
1075 (Contributed by Dave Cole.)
1076
Anthony Baxter1869df12004-07-12 08:15:37 +00001077% XXX sre is now non-recursive.
1078
Andrew M. Kuchling00457172004-07-15 11:52:40 +00001079\item The \module{threading} module now has an elegantly simple way to support
1080thread-local data. The module contains a \class{local} class whose
1081attribute values are local to different threads.
1082
1083\begin{verbatim}
1084import threading
1085
1086data = threading.local()
1087data.number = 42
1088data.url = ('www.python.org', 80)
1089\end{verbatim}
1090
1091Other threads can assign and retrieve their own values for the
1092\member{number} and \member{url} attributes. You can subclass
1093\class{local} to initialize attributes or to add methods.
1094(Contributed by Jim Fulton.)
1095
Raymond Hettinger874ebd52004-05-31 03:15:02 +00001096\item The \module{weakref} module now supports a wider variety of objects
1097 including Python functions, class instances, sets, frozensets, deques,
1098 arrays, files, sockets, and regular expression pattern objects.
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +00001099
1100\item The \module{xmlrpclib} module now supports a multi-call extension for
Andrew M. Kuchling00457172004-07-15 11:52:40 +00001101transmitting multiple XML-RPC calls in a single HTTP operation.
Andrew M. Kuchling69f31eb2003-08-13 23:11:04 +00001102
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001103\end{itemize}
1104
1105
1106%======================================================================
Raymond Hettingerca1a7752004-07-12 13:00:45 +00001107% whole new modules get described in subsections here
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001108
Martin v. Löwis2a6ba902004-05-31 18:22:40 +00001109\subsection{cookielib}
1110
1111The \module{cookielib} library supports client-side handling for HTTP
1112cookies, just as the \module{Cookie} provides server-side cookie
Andrew M. Kuchling71432f12004-07-05 01:40:07 +00001113support in CGI scripts. Cookies are stored in cookie jars; the library
Martin v. Löwis2a6ba902004-05-31 18:22:40 +00001114transparently stores cookies offered by the web server in the cookie
1115jar, and fetches the cookie from the jar when connecting to the
1116server. Similar to web browsers, policy objects control whether
1117cookies are accepted or not.
1118
1119In order to store cookies across sessions, two implementations of
1120cookie jars are provided: one that stores cookies in the Netscape
1121format, so applications can use the Mozilla or Lynx cookie jars, and
1122one that stores cookies in the same format as the Perl libwww libary.
1123
1124\module{urllib2} has been changed to interact with \module{cookielib}:
1125\class{HTTPCookieProcessor} manages a cookie jar that is used when
1126accessing URLs.
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001127
1128% ======================================================================
1129\section{Build and C API Changes}
1130
1131Changes to Python's build process and to the C API include:
1132
1133\begin{itemize}
1134
Andrew M. Kuchling6aedcfc2003-10-21 12:48:23 +00001135 \item Three new convenience macros were added for common return
1136 values from extension functions: \csimplemacro{Py_RETURN_NONE},
1137 \csimplemacro{Py_RETURN_TRUE}, and \csimplemacro{Py_RETURN_FALSE}.
1138
Andrew M. Kuchling5785a132004-07-26 19:28:46 +00001139 \item Another new macro, \csimplemacro{Py_CLEAR(\var{obj})},
1140 decreases the reference count of \var{obj} and sets \var{obj} to the
1141 null pointer.
1142
Fred Drakece3caf22004-02-12 18:13:12 +00001143 \item A new function, \cfunction{PyTuple_Pack(\var{N}, \var{obj1},
1144 \var{obj2}, ..., \var{objN})}, constructs tuples from a variable
1145 length argument list of Python objects.
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001146
Fred Drakece3caf22004-02-12 18:13:12 +00001147 \item A new function, \cfunction{PyDict_Contains(\var{d}, \var{k})},
1148 implements fast dictionary lookups without masking exceptions raised
1149 during the look-up process.
Raymond Hettingerd4462302003-11-26 17:52:45 +00001150
Andrew M. Kuchlinge30c4d42004-08-07 13:58:02 +00001151 \item A new function, \cfunction{PyArg_VaParseTupleAndKeywords()},
1152 is the same as \cfunction{PyArg_ParseTupleAndKeywords()} but takes a
1153 \ctype{va_list} instead of a number of arguments.
1154 (Contributed by Greg Chapman.)
1155
Fred Drakece3caf22004-02-12 18:13:12 +00001156 \item A new method flag, \constant{METH_COEXISTS}, allows a function
Andrew M. Kuchling71432f12004-07-05 01:40:07 +00001157 defined in slots to co-exist with a \ctype{PyCFunction} having the
1158 same name. This can halve the access time for a method such as
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +00001159 \method{set.__contains__()}.
1160
1161 \item Python can now be built with additional profiling for the interpreter
Andrew M. Kuchling71432f12004-07-05 01:40:07 +00001162 itself. This is intended for people developing on the Python core.
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +00001163 Providing \longprogramopt{--enable-profiling} to the
1164 \program{configure} script will let you profile the interpreter with
1165 \program{gprof}, and providing the \longprogramopt{--with-tsc} switch
Andrew M. Kuchling71432f12004-07-05 01:40:07 +00001166 enables profiling using the Pentium's Time-Stamp-Counter register.
Andrew M. Kuchlingd0b6d9d2004-07-04 15:35:00 +00001167
1168 \item The \ctype{tracebackobject} type has been renamed to \ctype{PyTracebackObject}.
Raymond Hettinger97ef8de2004-01-05 00:29:57 +00001169
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001170\end{itemize}
1171
1172
1173%======================================================================
1174\subsection{Port-Specific Changes}
1175
Raymond Hettinger97ef8de2004-01-05 00:29:57 +00001176\begin{itemize}
1177
1178\item The Windows port now builds under MSVC++ 7.1 as well as version 6.
1179
1180\end{itemize}
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001181
1182
1183%======================================================================
1184\section{Other Changes and Fixes \label{section-other}}
1185
1186As usual, there were a bunch of other improvements and bugfixes
1187scattered throughout the source tree. A search through the CVS change
1188logs finds there were XXX patches applied and YYY bugs fixed between
1189Python 2.3 and 2.4. Both figures are likely to be underestimates.
1190
1191Some of the more notable changes are:
1192
1193\begin{itemize}
1194
Raymond Hettinger97ef8de2004-01-05 00:29:57 +00001195\item The \module{timeit} module now automatically disables periodic
1196 garbarge collection during the timing loop. This change makes
1197 consecutive timings more comparable.
1198
1199\item The \module{base64} module now has more complete RFC 3548 support
1200 for Base64, Base32, and Base16 encoding and decoding, including
1201 optional case folding and optional alternative alphabets.
1202 (Contributed by Barry Warsaw.)
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001203
1204\end{itemize}
1205
1206
1207%======================================================================
1208\section{Porting to Python 2.4}
1209
1210This section lists previously described changes that may require
1211changes to your code:
1212
1213\begin{itemize}
1214
Raymond Hettinger607c00f2003-11-12 16:27:50 +00001215\item The \function{zip()} built-in function and \function{itertools.izip()}
1216 now return an empty list instead of raising a \exception{TypeError}
1217 exception if called with no arguments.
Andrew M. Kuchling6aedcfc2003-10-21 12:48:23 +00001218
1219\item \function{dircache.listdir()} now passes exceptions to the caller
1220 instead of returning empty lists.
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001221
Andrew M. Kuchling71432f12004-07-05 01:40:07 +00001222\item \function{LexicalHandler.startDTD()} used to receive the public and
1223 system IDs in the wrong order. This has been corrected; applications
Fred Drake56fcc232004-05-06 02:55:35 +00001224 relying on the wrong order need to be fixed.
Martin v. Löwis456ab1d2004-05-06 01:54:36 +00001225
Andrew M. Kuchling71432f12004-07-05 01:40:07 +00001226\item \function{fcntl.ioctl} now warns if the \var{mutate}
1227 argument is omitted and relevant.
Martin v. Löwis77ca6c42004-06-03 12:47:26 +00001228
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001229\end{itemize}
1230
1231
1232%======================================================================
1233\section{Acknowledgements \label{acks}}
1234
1235The author would like to thank the following people for offering
1236suggestions, corrections and assistance with various drafts of this
Andrew M. Kuchling671c5062004-07-28 15:29:39 +00001237article: Hye-Shik Chang, Michael Dyck, Raymond Hettinger.
Fred Drakeed0fa3d2003-07-30 19:14:09 +00001238
1239\end{document}