blob: 76f0b26efa29cd0147a8a99e8658105f3fac314f [file] [log] [blame]
Georg Brandlc47408a2008-12-04 18:44:53 +00001****************************
Georg Brandl48310cd2009-01-03 21:18:54 +00002 What's New In Python 3.1
Georg Brandlc47408a2008-12-04 18:44:53 +00003****************************
4
5.. XXX Add trademark info for Apple, Microsoft.
6
Raymond Hettinger1f251a02009-04-04 10:47:35 +00007:Author: Raymond Hettinger
Benjamin Peterson34328292008-12-05 03:05:29 +00008:Release: |release|
9:Date: |today|
Georg Brandlc47408a2008-12-04 18:44:53 +000010
11.. $Id$
12 Rules for maintenance:
Georg Brandl48310cd2009-01-03 21:18:54 +000013
Georg Brandlc47408a2008-12-04 18:44:53 +000014 * Anyone can add text to this document. Do not spend very much time
15 on the wording of your changes, because your text will probably
16 get rewritten to some degree.
Georg Brandl48310cd2009-01-03 21:18:54 +000017
Georg Brandlc47408a2008-12-04 18:44:53 +000018 * The maintainer will go through Misc/NEWS periodically and add
19 changes; it's therefore more important to add your changes to
20 Misc/NEWS than to this file. (Note: I didn't get to this for 3.0.
21 GvR.)
Georg Brandl48310cd2009-01-03 21:18:54 +000022
Georg Brandlc47408a2008-12-04 18:44:53 +000023 * This is not a complete list of every single change; completeness
24 is the purpose of Misc/NEWS. Some changes I consider too small
25 or esoteric to include. If such a change is added to the text,
26 I'll just remove it. (This is another reason you shouldn't spend
27 too much time on writing your addition.)
Georg Brandl48310cd2009-01-03 21:18:54 +000028
Georg Brandlc47408a2008-12-04 18:44:53 +000029 * If you want to draw your new text to the attention of the
30 maintainer, add 'XXX' to the beginning of the paragraph or
31 section.
Georg Brandl48310cd2009-01-03 21:18:54 +000032
Georg Brandlc47408a2008-12-04 18:44:53 +000033 * It's OK to just add a fragmentary note about a change. For
34 example: "XXX Describe the transmogrify() function added to the
35 socket module." The maintainer will research the change and
36 write the necessary text.
Georg Brandl48310cd2009-01-03 21:18:54 +000037
Georg Brandlc47408a2008-12-04 18:44:53 +000038 * You can comment out your additions if you like, but it's not
39 necessary (especially when a final release is some months away).
Georg Brandl48310cd2009-01-03 21:18:54 +000040
Georg Brandlc47408a2008-12-04 18:44:53 +000041 * Credit the author of a patch or bugfix. Just the name is
42 sufficient; the e-mail address isn't necessary. (Due to time
43 constraints I haven't managed to do this for 3.0. GvR.)
Georg Brandl48310cd2009-01-03 21:18:54 +000044
Georg Brandlc47408a2008-12-04 18:44:53 +000045 * It's helpful to add the bug/patch number as a comment:
Georg Brandl48310cd2009-01-03 21:18:54 +000046
Georg Brandlc47408a2008-12-04 18:44:53 +000047 % Patch 12345
48 XXX Describe the transmogrify() function added to the socket
49 module.
50 (Contributed by P.Y. Developer.)
Georg Brandl48310cd2009-01-03 21:18:54 +000051
Georg Brandlc47408a2008-12-04 18:44:53 +000052 This saves the maintainer the effort of going through the SVN log
53 when researching a change. (Again, I didn't get to this for 3.0.
54 GvR.)
55
56This article explains the new features in Python 3.1, compared to 3.0.
57
58.. Compare with previous release in 2 - 3 sentences here.
59.. add hyperlink when the documentation becomes available online.
60
61.. ======================================================================
62.. Large, PEP-level features and changes should be described here.
63.. Should there be a new section here for 3k migration?
64.. Or perhaps a more general section describing module changes/deprecation?
65.. sets module deprecated
66.. ======================================================================
67
68
Raymond Hettinger1f251a02009-04-04 10:47:35 +000069PEP 372: Ordered Dictionaries
70=============================
71
72Regular Python dictionaries iterate over key/value pairs in arbitrary order.
73Over the years, a number of authors have written alternative implementations
74that remember the order that the keys were originally inserted. Based on
75the experiences from those implementations, the :mod:`collections` module
76now has an :class:`OrderedDict` class.
77
78The OrderedDict API is substantially the same as regular dictionaries
79but will iterate over keys and values in a guaranteed order depending on
80when a key was first inserted. If a new entry overwrites an existing entry,
81the original insertion position is left unchanged. Deleting an entry and
82reinserting it will move it to the end.
83
84The standard library now supports use of ordered dictionaries in several
85modules. The :mod:`ConfigParser` modules uses them by default. This lets
86configuration files be read, modified, and then written back in their original
87order. The :mod:`collections` module's :meth:`namedtuple._asdict` method now
88returns a dictionary with the values appearing in the same order as the
89underlying tuple.count The :mod:`json` module is being built-out with an
90*object_pairs_hook* to allow OrderedDicts to be built by the decoder.
Raymond Hettingerf41857e2009-04-04 11:59:00 +000091Support was also added for third-party tools like PyYAML.
Raymond Hettinger1f251a02009-04-04 10:47:35 +000092
93.. seealso::
94
95 :pep:`372` - Ordered Dictionaries
Raymond Hettingerf41857e2009-04-04 11:59:00 +000096 PEP written by Armin Ronacher and Raymond Hettinger; implemented by
Raymond Hettinger1f251a02009-04-04 10:47:35 +000097 Raymond Hettinger
98
99PEP 378: Format Specifier for Thousands Separator
100=================================================
101
102The builtin :func:`format` function and the :meth:`str.format` method use
103a mini-language that now includes a simple, non-locale aware way to format
104a number with a thousands separator. That provides a way to humanize a
105program's output, improving its professional appearance and readability::
106
107 >>> format(Decimal('1234567.89'), ',f')
108 '1,234,567.89'
109
110The currently supported types are :class:`int` and :class:`Decimal`.
111Support for :class:`float` is expected before the beta release.
112Discussions are underway about how to specify alternative separators
113like dots, spaces, apostrophes, or underscores.
114
115.. seealso::
116
117 :pep:`378` - Format Specifier for Thousands Separator
118 PEP written by Raymond Hettinger; implemented by Eric Smith and
119 Mark Dickinson.
120
121
Mark Dickinson54bc1ec2008-12-17 16:19:07 +0000122Other Language Changes
123======================
124
125Some smaller changes made to the core Python language are:
126
127* The :func:`int` type gained a ``bit_length`` method that returns the
128 number of bits necessary to represent its argument in binary::
129
130 >>> n = 37
131 >>> bin(37)
132 '0b100101'
133 >>> n.bit_length()
134 6
135 >>> n = 2**123-1
136 >>> n.bit_length()
137 123
138 >>> (n+1).bit_length()
139 124
140
141 (Contributed by Fredrik Johansson and Victor Stinner; :issue:`3439`.)
142
Mark Dickinsonbd792642009-03-18 20:06:12 +0000143* Integers are now stored internally either in base 2**15 or in base
144 2**30, the base being determined at build time. Previously, they
145 were always stored in base 2**15. Using base 2**30 gives
146 significant performance improvements on 64-bit machines, but
147 benchmark results on 32-bit machines have been mixed. Therefore,
148 the default is to use base 2**30 on 64-bit machines and base 2**15
149 on 32-bit machines; on Unix, there's a new configure option
150 --enable-big-digits that can be used to override this default.
151
152 Apart from the performance improvements this change should be
153 invisible to end users, with one exception: for testing and
154 debugging purposes there's a new structseq ``sys.int_info`` that
155 provides information about the internal format, giving the number of
156 bits per digit and the size in bytes of the C type used to store
157 each digit::
158
159 >>> import sys
160 >>> sys.int_info
161 sys.int_info(bits_per_digit=30, sizeof_digit=4)
162
Mark Dickinsonbd792642009-03-18 20:06:12 +0000163 (Contributed by Mark Dickinson; :issue:`4258`.)
164
Mark Dickinson54bc1ec2008-12-17 16:19:07 +0000165
Raymond Hettinger1f251a02009-04-04 10:47:35 +0000166* Added a :class:`collections.Counter` class to support convenient
167 counting of unique items in a sequence or iterable::
168
169 >>> Counter(['red', 'blue', 'red', 'green', 'blue', 'blue'])
170 Counter({'blue': 3, 'red': 2, 'green': 1})
171
172 (Contributed by Raymond Hettinger; :issue:`1696199`.)
173
174* The :class:`gzip.GzipFile` and :class:`bz2.BZ2File` classs now support
175 the context manager protocol.
176
177 (Contributed by Jacques Frechet; :issue:`4272`.)
178
179* The :mod:`Decimal` module now supports two new methods to create a
180 decimal object that from a binary :class:`float`. The conversion is
181 exact but can sometimes be surprising::
182
183 >>> Decimal.from_float(1.1)
184 Decimal('1.100000000000000088817841970012523233890533447265625')
185
186 The long decimal result shows the actual binary fraction being
187 stored for *1.1*. The fraction has many digits because *1.1* cannot
188 be exactly represented in binary.
189
190 (Contributed by Raymond Hettinger and Mark Dickinson.)
191
Raymond Hettingere7ec57d2009-04-04 11:08:48 +0000192* The fields in :func:`format` strings can now be automatically
193 numbered::
194
195 >>> 'Sir {} of {}'.format('Gallahad', 'Camelot')
196 'Sir Gallahad of Camelot'
197
198 Formerly, the string would have required numbered fields such as:
199 ``'Sir {0} of {1}'``.
200
201 (Contributed by Eric Smith; :issue:`5237`.)
202
203* The :mod:`itertools` module grew two new functions. The
204 :func:`itertools.combinations_with_replacement` function is one of
205 four for generating combinatorics including permutations and Cartesian
206 products. The :func:`itertools.compress` function mimics its namesake
207 from APL. Also, the existing :func:`itertools.count` function now has
208 an optional *step* argument and can accept any type of counting
209 sequence including :class:`fractions.Fraction` and
210 :class:`decimal.Decimal`.
211
212 (Contributed by Raymond Hettinger.)
213
Raymond Hettingerf41857e2009-04-04 11:59:00 +0000214* :class:`collections.deque` objects now have a read-only attribute
215 called *maxlen*.
216
217 (Contributed by Raymond Hettinger.)
218
219* :func:`collections.namedtuple` now supports a keyword argument
220 *rename* which lets invalid fieldnames be automatically converted to
221 positional names in the form _0, _1, etc. This is useful when
222 the field names are being created by an external source such as a
223 CSV header, SQL field list, or user input.
224
225 (Contributed by Raymond Hettinger; :issue:`1818`.)
226
227* :func:`round`(x, n) now returns an integer if *x* is an integer.
228 Previously it returned a float.
229
230 (Contributed by Mark Dickinson; :issue:`4707`.)
231
232* The :func:`re.sub`, :func:`re.subn` and :func:`re.split` functions now
233 accept a flags parameter.
234
235 (Contributed by Gregory Smith.)
236
237* The :mod:`runpy` module which supports the ``-m`` command line switch
238 now supports the execution of packages by looking for and executing
239 a ``__main__`` submodule when a package name is supplied.
240
241 (Contributed by Andi Vajda; :issue:`4195`.)
242
243* The :mod:`pdb` module can now access and display source code loaded via
244 :mod:`zipimport` (or any other conformant :pep:`302` loader).
245
246 (Contributed by Alexander Belopolsky; :issue:`4201`.)
247
248* :class:`functools.partial` objects can now be pickled.
249
250 (Suggested by Antoine Pitrou and Jesse Noller. Implemented by
251 Jack Diedrich; :issue:`5228`.)
252
253XXX Brett Cannon's importlib package
254
255XXX New unittest assert methods
Raymond Hettinger1f251a02009-04-04 10:47:35 +0000256
Georg Brandlc47408a2008-12-04 18:44:53 +0000257.. ======================================================================
Antoine Pitroub5564522009-03-28 19:45:26 +0000258
259
260Optimizations
261-------------
262
263Major performance enhancements have been added:
264
265* The new I/O library (as defined in :pep:`3116`) was mostly written in
266 Python and quickly proved to be a problematic bottleneck in Python 3.0.
267 In Python 3.1, the I/O library has been entirely rewritten in C and is
268 2 to 20 times faster depending on the task at hand. The pure Python
269 version is still available for experimentation purposes through
270 the ``_pyio`` module.
271
272 (Contributed by Amaury Forgeot d'Arc and Antoine Pitrou.)
273
274* A new configure flag, ``--with-computed-gotos``, enables a faster opcode
275 dispatch mechanism on compilers which support it. Speedups of up to 20%
276 have been observed, depending on the system and compiler.
277
278 (Contributed by Antoine Pitrou, :issue:`4753`.)
279
Raymond Hettingere7ec57d2009-04-04 11:08:48 +0000280* Add a heuristic so that tuples and dicts containing only untrackable objects
281 are not tracked by the garbage collector. This can reduce the size of
282 collections and therefore the garbage collection overhead on long-running
283 programs, depending on their particular use of datatypes.
284
285 (Contributed by Antoine Pitrou, :issue:`4688`.)
286
Raymond Hettingerf41857e2009-04-04 11:59:00 +0000287* Enabling a configure option named ``--with-computed-gotos``
288 on compilers that support it (notably: gcc, SunPro, icc), the bytecode
289 evaluation loop is compiled with a new dispatch mechanism which gives
290 speedups of up to 20%, depending on the system, on various benchmarks.
291
292 (Contributed by Antoine Pitrou and Jeffrey Yasskin along with a number
293 of other participants, :issue:`4753`).
294
295* The decoding of UTF-8, UTF-16 and LATIN-1 is now two to four times
296 faster.
297
298 (Contributed by Antoine Pitrou and Amaury Forgeot d'Arc, :issue:`4868`.)
299
Raymond Hettinger1f251a02009-04-04 10:47:35 +0000300XXX The JSON module is getting a C extension for speed.
Antoine Pitroub5564522009-03-28 19:45:26 +0000301
302.. ======================================================================