blob: ca123cddc246974e95957d838475e4bba92ba2ae [file] [log] [blame]
Giampaolo Rodolà3108f982011-02-24 20:59:48 +00001****************************
2 What's New In Python 3.3
3****************************
4
5:Author: Raymond Hettinger
6:Release: |release|
7:Date: |today|
8
Éric Araujob07b97f2011-10-05 01:03:34 +02009.. Rules for maintenance:
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000010
11 * Anyone can add text to this document. Do not spend very much time
12 on the wording of your changes, because your text will probably
13 get rewritten to some degree.
14
15 * The maintainer will go through Misc/NEWS periodically and add
16 changes; it's therefore more important to add your changes to
17 Misc/NEWS than to this file.
18
19 * This is not a complete list of every single change; completeness
20 is the purpose of Misc/NEWS. Some changes I consider too small
21 or esoteric to include. If such a change is added to the text,
22 I'll just remove it. (This is another reason you shouldn't spend
23 too much time on writing your addition.)
24
25 * If you want to draw your new text to the attention of the
26 maintainer, add 'XXX' to the beginning of the paragraph or
27 section.
28
29 * It's OK to just add a fragmentary note about a change. For
30 example: "XXX Describe the transmogrify() function added to the
31 socket module." The maintainer will research the change and
32 write the necessary text.
33
34 * You can comment out your additions if you like, but it's not
35 necessary (especially when a final release is some months away).
36
37 * Credit the author of a patch or bugfix. Just the name is
38 sufficient; the e-mail address isn't necessary.
39
40 * It's helpful to add the bug/patch number as a comment:
41
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000042 XXX Describe the transmogrify() function added to the socket
43 module.
Éric Araujob07b97f2011-10-05 01:03:34 +020044 (Contributed by P.Y. Developer in :issue:`12345`.)
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000045
Éric Araujob07b97f2011-10-05 01:03:34 +020046 This saves the maintainer the effort of going through the Mercurial log
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000047 when researching a change.
48
49This article explains the new features in Python 3.3, compared to 3.2.
50
51
Nick Coghlan98e20702012-03-06 21:50:13 +100052.. pep-3118-update:
53
Stefan Krah9a2d99e2012-02-25 12:24:21 +010054PEP 3118: New memoryview implementation and buffer protocol documentation
55=========================================================================
56
57:issue:`10181` - memoryview bug fixes and features.
58 Written by Stefan Krah.
59
60The new memoryview implementation comprehensively fixes all ownership and
61lifetime issues of dynamically allocated fields in the Py_buffer struct
62that led to multiple crash reports. Additionally, several functions that
63crashed or returned incorrect results for non-contiguous or multi-dimensional
64input have been fixed.
65
66The memoryview object now has a PEP-3118 compliant getbufferproc()
67that checks the consumer's request type. Many new features have been
68added, most of them work in full generality for non-contiguous arrays
69and arrays with suboffsets.
70
71The documentation has been updated, clearly spelling out responsibilities
72for both exporters and consumers. Buffer request flags are grouped into
73basic and compound flags. The memory layout of non-contiguous and
74multi-dimensional NumPy-style arrays is explained.
75
76Features
77--------
78
79* All native single character format specifiers in struct module syntax
80 (optionally prefixed with '@') are now supported.
81
82* With some restrictions, the cast() method allows changing of format and
83 shape of C-contiguous arrays.
84
85* Multi-dimensional list representations are supported for any array type.
86
87* Multi-dimensional comparisons are supported for any array type.
88
89* All array types are hashable if the exporting object is hashable
Nick Coghlan98e20702012-03-06 21:50:13 +100090 and the view is read-only. (Contributed by Antoine Pitrou in
91 :issue:`13411`)
92
Stefan Krah9a2d99e2012-02-25 12:24:21 +010093
94* Arbitrary slicing of any 1-D arrays type is supported. For example, it
95 is now possible to reverse a memoryview in O(1) by using a negative step.
96
97API changes
98-----------
99
100* The maximum number of dimensions is officially limited to 64.
101
102* The representation of empty shape, strides and suboffsets is now
103 an empty tuple instead of None.
104
105* Accessing a memoryview element with format 'B' (unsigned bytes)
106 now returns an integer (in accordance with the struct module syntax).
107 For returning a bytes object the view must be cast to 'c' first.
108
Stefan Krah54c32032012-02-29 17:47:21 +0100109* For further changes see `Build and C API Changes`_ and `Porting C code`_ .
Stefan Krah9a2d99e2012-02-25 12:24:21 +0100110
Antoine Pitrou037ffbf2011-10-24 00:25:41 +0200111.. _pep-393:
112
Ezio Melotti48a2f8f2011-09-29 00:18:19 +0300113PEP 393: Flexible String Representation
114=======================================
115
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200116The Unicode string type is changed to support multiple internal
117representations, depending on the character with the largest Unicode ordinal
118(1, 2, or 4 bytes) in the represented string. This allows a space-efficient
119representation in common cases, but gives access to full UCS-4 on all
120systems. For compatibility with existing APIs, several representations may
121exist in parallel; over time, this compatibility should be phased out.
Ezio Melotti397546a2011-09-29 08:34:36 +0300122
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200123On the Python side, there should be no downside to this change.
Ezio Melotti397546a2011-09-29 08:34:36 +0300124
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200125On the C API side, PEP 393 is fully backward compatible. The legacy API
126should remain available at least five years. Applications using the legacy
127API will not fully benefit of the memory reduction, or - worse - may use
128a bit more memory, because Python may have to maintain two versions of each
129string (in the legacy format and in the new efficient storage).
130
Antoine Pitrou0599b5b2011-11-29 22:45:07 +0100131Functionality
132-------------
133
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200134Changes introduced by :pep:`393` are the following:
Ezio Melotti48a2f8f2011-09-29 00:18:19 +0300135
Ezio Melotti397546a2011-09-29 08:34:36 +0300136* Python now always supports the full range of Unicode codepoints, including
137 non-BMP ones (i.e. from ``U+0000`` to ``U+10FFFF``). The distinction between
138 narrow and wide builds no longer exists and Python now behaves like a wide
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200139 build, even under Windows.
Ezio Melotti397546a2011-09-29 08:34:36 +0300140
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200141* With the death of narrow builds, the problems specific to narrow builds have
142 also been fixed, for example:
Ezio Melotti397546a2011-09-29 08:34:36 +0300143
144 * :func:`len` now always returns 1 for non-BMP characters,
145 so ``len('\U0010FFFF') == 1``;
146
147 * surrogate pairs are not recombined in string literals,
148 so ``'\uDBFF\uDFFF' != '\U0010FFFF'``;
149
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200150 * indexing or slicing non-BMP characters returns the expected value,
Ezio Melotti397546a2011-09-29 08:34:36 +0300151 so ``'\U0010FFFF'[0]`` now returns ``'\U0010FFFF'`` and not ``'\uDBFF'``;
152
Antoine Pitroud136aec2011-11-17 01:48:06 +0100153 * all other functions in the standard library now correctly handle
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200154 non-BMP codepoints.
Ezio Melotti397546a2011-09-29 08:34:36 +0300155
Ezio Melotti48a2f8f2011-09-29 00:18:19 +0300156* The value of :data:`sys.maxunicode` is now always ``1114111`` (``0x10FFFF``
157 in hexadecimal). The :c:func:`PyUnicode_GetMax` function still returns
158 either ``0xFFFF`` or ``0x10FFFF`` for backward compatibility, and it should
159 not be used with the new Unicode API (see :issue:`13054`).
160
Ezio Melotti397546a2011-09-29 08:34:36 +0300161* The :file:`./configure` flag ``--with-wide-unicode`` has been removed.
Victor Stinner7d637ab2011-09-29 02:56:16 +0200162
Antoine Pitrou0599b5b2011-11-29 22:45:07 +0100163Performance and resource usage
164------------------------------
165
166The storage of Unicode strings now depends on the highest codepoint in the string:
167
168* pure ASCII and Latin1 strings (``U+0000-U+00FF``) use 1 byte per codepoint;
169
170* BMP strings (``U+0000-U+FFFF``) use 2 bytes per codepoint;
171
172* non-BMP strings (``U+10000-U+10FFFF``) use 4 bytes per codepoint.
173
Martin v. Löwisde157cc2012-03-06 08:42:17 +0100174The net effect is that for most applications, memory usage of string
175storage should decrease significantly - especially compared to former
176wide unicode builds - as, in many cases, strings will be pure ASCII
177even in international contexts (because many strings store non-human
178language data, such as XML fragments, HTTP headers, JSON-encoded data,
179etc.). We also hope that it will, for the same reasons, increase CPU
180cache efficiency on non-trivial applications. The memory usage of
181Python 3.3 is two to three times smaller than Python 3.2, and a little
182bit better than Python 2.7, on a Django benchmark (see the PEP for
183details).
Antoine Pitrou0599b5b2011-11-29 22:45:07 +0100184
Éric Araujob07b97f2011-10-05 01:03:34 +0200185
Victor Stinnera1bf2982011-10-12 20:35:02 +0200186PEP 3151: Reworking the OS and IO exception hierarchy
187=====================================================
188
189:pep:`3151` - Reworking the OS and IO exception hierarchy
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200190 PEP written and implemented by Antoine Pitrou.
Victor Stinnera1bf2982011-10-12 20:35:02 +0200191
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200192The hierarchy of exceptions raised by operating system errors is now both
193simplified and finer-grained.
Victor Stinnera1bf2982011-10-12 20:35:02 +0200194
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200195You don't have to worry anymore about choosing the appropriate exception
196type between :exc:`OSError`, :exc:`IOError`, :exc:`EnvironmentError`,
197:exc:`WindowsError`, :exc:`mmap.error`, :exc:`socket.error` or
198:exc:`select.error`. All these exception types are now only one:
199:exc:`OSError`. The other names are kept as aliases for compatibility
200reasons.
Victor Stinnera1bf2982011-10-12 20:35:02 +0200201
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200202Also, it is now easier to catch a specific error condition. Instead of
203inspecting the ``errno`` attribute (or ``args[0]``) for a particular
204constant from the :mod:`errno` module, you can catch the adequate
205:exc:`OSError` subclass. The available subclasses are the following:
Victor Stinnera1bf2982011-10-12 20:35:02 +0200206
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200207* :exc:`BlockingIOError`
208* :exc:`ChildProcessError`
209* :exc:`ConnectionError`
210* :exc:`FileExistsError`
211* :exc:`FileNotFoundError`
212* :exc:`InterruptedError`
213* :exc:`IsADirectoryError`
214* :exc:`NotADirectoryError`
215* :exc:`PermissionError`
216* :exc:`ProcessLookupError`
217* :exc:`TimeoutError`
Victor Stinnera1bf2982011-10-12 20:35:02 +0200218
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200219And the :exc:`ConnectionError` itself has finer-grained subclasses:
Victor Stinnera1bf2982011-10-12 20:35:02 +0200220
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200221* :exc:`BrokenPipeError`
222* :exc:`ConnectionAbortedError`
223* :exc:`ConnectionRefusedError`
224* :exc:`ConnectionResetError`
Victor Stinnera1bf2982011-10-12 20:35:02 +0200225
226Thanks to the new exceptions, common usages of the :mod:`errno` can now be
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200227avoided. For example, the following code written for Python 3.2::
Victor Stinnera1bf2982011-10-12 20:35:02 +0200228
229 from errno import ENOENT, EACCES, EPERM
230
231 try:
232 with open("document.txt") as f:
233 content = f.read()
234 except IOError as err:
235 if err.errno == ENOENT:
236 print("document.txt file is missing")
237 elif err.errno in (EACCES, EPERM):
238 print("You are not allowed to read document.txt")
239 else:
240 raise
241
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200242can now be written without the :mod:`errno` import and without manual
243inspection of exception attributes::
Victor Stinnera1bf2982011-10-12 20:35:02 +0200244
245 try:
246 with open("document.txt") as f:
247 content = f.read()
248 except FileNotFoundError:
249 print("document.txt file is missing")
250 except PermissionError:
251 print("You are not allowed to read document.txt")
252
253
Nick Coghlan1f7ce622012-01-13 21:43:40 +1000254PEP 380: Syntax for Delegating to a Subgenerator
255================================================
256
Nick Coghlanab7bf212012-02-26 17:49:52 +1000257:pep:`380` - Syntax for Delegating to a Subgenerator
258 PEP written by Greg Ewing.
259
Nick Coghlan1f7ce622012-01-13 21:43:40 +1000260PEP 380 adds the ``yield from`` expression, allowing a generator to delegate
261part of its operations to another generator. This allows a section of code
262containing 'yield' to be factored out and placed in another generator.
263Additionally, the subgenerator is allowed to return with a value, and the
264value is made available to the delegating generator.
Nick Coghlanb9b281b2012-03-06 22:31:12 +1000265
Nick Coghlan1f7ce622012-01-13 21:43:40 +1000266While designed primarily for use in delegating to a subgenerator, the ``yield
267from`` expression actually allows delegation to arbitrary subiterators.
268
Nick Coghlanb9b281b2012-03-06 22:31:12 +1000269For simple iterators, ``yield from iterable`` is essentially just a shortened
270form of ``for item in iterable: yield item``::
271
272 >>> def g(x):
273 ... yield from range(x, 0, -1)
274 ... yield from range(x)
275 ...
276 >>> list(g(5))
277 [5, 4, 3, 2, 1, 0, 1, 2, 3, 4]
278
279However, unlike an ordinary loop, ``yield from`` allows subgenerators to
280receive sent and thrown values directly from the calling scope, and
281return a final value to the outer generator::
282
283 >>> def accumulate(start=0):
284 ... tally = start
285 ... while 1:
286 ... next = yield
287 ... if next is None:
288 ... return tally
289 ... tally += next
290 ...
291 >>> def gather_tallies(tallies, start=0):
292 ... while 1:
293 ... tally = yield from accumulate()
294 ... tallies.append(tally)
295 ...
296 >>> tallies = []
297 >>> acc = gather_tallies(tallies)
298 >>> next(acc) # Ensure the accumulator is ready to accept values
299 >>> for i in range(10):
300 ... acc.send(i)
301 ...
302 >>> acc.send(None) # Finish the first tally
303 >>> for i in range(5):
304 ... acc.send(i)
305 ...
306 >>> acc.send(None) # Finish the second tally
307 >>> tallies
308 [45, 10]
309
310The main principle driving this change is to allow even generators that are
311designed to be used with the ``send`` and ``throw`` methods to be split into
312multiple subgenerators as easily as a single large function can be split into
313multiple subfunctions.
314
Nick Coghlan1f7ce622012-01-13 21:43:40 +1000315(Implementation by Greg Ewing, integrated into 3.3 by Renaud Blanch, Ryan
316Kelly and Nick Coghlan, documentation by Zbigniew Jędrzejewski-Szmek and
317Nick Coghlan)
318
319
Nick Coghlanab7bf212012-02-26 17:49:52 +1000320PEP 409: Suppressing exception context
321======================================
322
323:pep:`409` - Suppressing exception context
324 PEP written by Ethan Furman, implemented by Ethan Furman and Nick Coghlan.
325
326PEP 409 introduces new syntax that allows the display of the chained
327exception context to be disabled. This allows cleaner error messages in
328applications that convert between exception types::
329
330 >>> class D:
331 ... def __init__(self, extra):
332 ... self._extra_attributes = extra
333 ... def __getattr__(self, attr):
334 ... try:
335 ... return self._extra_attributes[attr]
336 ... except KeyError:
337 ... raise AttributeError(attr) from None
338 ...
339 >>> D({}).x
340 Traceback (most recent call last):
341 File "<stdin>", line 1, in <module>
342 File "<stdin>", line 8, in __getattr__
343 AttributeError: x
344
345Without the ``from None`` suffix to suppress the cause, the original
346exception would be displayed by default::
347
348 >>> class C:
349 ... def __init__(self, extra):
350 ... self._extra_attributes = extra
351 ... def __getattr__(self, attr):
352 ... try:
353 ... return self._extra_attributes[attr]
354 ... except KeyError:
355 ... raise AttributeError(attr)
356 ...
357 >>> C({}).x
358 Traceback (most recent call last):
359 File "<stdin>", line 6, in __getattr__
360 KeyError: 'x'
361
362 During handling of the above exception, another exception occurred:
363
364 Traceback (most recent call last):
365 File "<stdin>", line 1, in <module>
366 File "<stdin>", line 8, in __getattr__
367 AttributeError: x
368
369No debugging capability is lost, as the original exception context remains
370available if needed (for example, if an intervening library has incorrectly
371suppressed valuable underlying details)::
372
373 >>> try:
374 ... D({}).x
375 ... except AttributeError as exc:
376 ... print(repr(exc.__context__))
377 ...
378 KeyError('x',)
379
380
Nick Coghlan98e20702012-03-06 21:50:13 +1000381PEP 414: Explicit Unicode literals
382======================================
383
384:pep:`414` - Explicit Unicode literals
385 PEP written by Armin Ronacher.
386
387To ease the transition from Python 2 for Unicode aware Python applications
388that make heavy use of Unicode literals, Python 3.3 once again supports the
389"``u``" prefix for string literals. This prefix has no semantic significance
390in Python 3, it is provided solely to reduce the number of purely mechanical
391changes in migrating to Python 3, making it easier for developers to focus on
392the more significant semantic changes (such as the stricter default
393separation of binary and text data).
394
395
Antoine Pitrou6bbd76b2011-11-25 19:10:05 +0100396PEP 3155: Qualified name for classes and functions
397==================================================
398
399:pep:`3155` - Qualified name for classes and functions
400 PEP written and implemented by Antoine Pitrou.
401
402Functions and class objects have a new ``__qualname__`` attribute representing
403the "path" from the module top-level to their definition. For global functions
404and classes, this is the same as ``__name__``. For other functions and classes,
405it provides better information about where they were actually defined, and
406how they might be accessible from the global scope.
407
408Example with (non-bound) methods::
Nick Coghlan2dfe6b02012-01-14 14:19:49 +1000409
Antoine Pitrou6bbd76b2011-11-25 19:10:05 +0100410 >>> class C:
411 ... def meth(self):
412 ... pass
413 >>> C.meth.__name__
414 'meth'
415 >>> C.meth.__qualname__
416 'C.meth'
417
418Example with nested classes::
419
420 >>> class C:
421 ... class D:
422 ... def meth(self):
423 ... pass
424 ...
425 >>> C.D.__name__
426 'D'
427 >>> C.D.__qualname__
428 'C.D'
429 >>> C.D.meth.__name__
430 'meth'
431 >>> C.D.meth.__qualname__
432 'C.D.meth'
433
434Example with nested functions::
435
436 >>> def outer():
437 ... def inner():
438 ... pass
439 ... return inner
440 ...
441 >>> outer().__name__
442 'inner'
443 >>> outer().__qualname__
444 'outer.<locals>.inner'
445
Antoine Pitroue7ede062011-11-25 19:11:26 +0100446The string representation of those objects is also changed to include the
Antoine Pitrou6bbd76b2011-11-25 19:10:05 +0100447new, more precise information::
448
449 >>> str(C.D)
450 "<class '__main__.C.D'>"
451 >>> str(C.D.meth)
452 '<function C.D.meth at 0x7f46b9fe31e0>'
453
454
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000455Other Language Changes
456======================
457
458Some smaller changes made to the core Python language are:
459
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100460* Added support for Unicode name aliases and named sequences.
461 Both :func:`unicodedata.lookup()` and ``'\N{...}'`` now resolve name aliases,
462 and :func:`unicodedata.lookup()` resolves named sequences too.
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000463
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100464 (Contributed by Ezio Melotti in :issue:`12753`)
Ezio Melotti931b8aa2011-10-21 21:57:36 +0300465
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100466* Equality comparisons on :func:`range` objects now return a result reflecting
467 the equality of the underlying sequences generated by those range objects.
Ezio Melotti931b8aa2011-10-21 21:57:36 +0300468
Sandro Tosicd899122012-01-22 12:16:04 +0100469 (:issue:`13201`)
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000470
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100471* The ``count()``, ``find()``, ``rfind()``, ``index()`` and ``rindex()``
472 methods of :class:`bytes` and :class:`bytearray` objects now accept an
473 integer between 0 and 255 as their first argument.
Mark Dickinson36645682011-10-23 19:53:01 +0100474
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100475 (:issue:`12170`)
Mark Dickinson36645682011-10-23 19:53:01 +0100476
Victor Stinner8c43e692012-03-09 14:04:01 +0100477* A dict lookup now raises a :exc:`RuntimeError` if the dict is modified during
Georg Brandla08e7e12012-03-10 20:52:16 +0100478 the lookup. If you implement your own comparison function for objects used
Victor Stinner8c43e692012-03-09 14:04:01 +0100479 as dict keys and the dict is shared by multiple threads, access to the dict
480 should be protected by a lock.
481
482 (:issue:`14205`)
483
Eli Bendersky7add4ea2012-03-17 15:14:35 +0200484* New methods have been added to :class:`list` and :class:`bytearray`:
485 ``copy()`` and ``clear()``.
486
487 (:issue:`10516`)
Petri Lehtinen61ea8a02011-11-24 22:00:46 +0200488
Victor Stinner46606ce2011-11-20 18:27:55 +0100489New and Improved Modules
490========================
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000491
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100492abc
493---
494
495Improved support for abstract base classes containing descriptors composed with
496abstract methods. The recommended approach to declaring abstract descriptors is
497now to provide :attr:`__isabstractmethod__` as a dynamically updated
498property. The built-in descriptors have been updated accordingly.
499
500 * :class:`abc.abstractproperty` has been deprecated, use :class:`property`
501 with :func:`abc.abstractmethod` instead.
502 * :class:`abc.abstractclassmethod` has been deprecated, use
503 :class:`classmethod` with :func:`abc.abstractmethod` instead.
504 * :class:`abc.abstractstaticmethod` has been deprecated, use
505 :class:`staticmethod` with :func:`abc.abstractmethod` instead.
506
507(Contributed by Darren Dale in :issue:`11610`)
508
Meador Ingec5dbb3d2011-09-20 21:48:16 -0500509array
510-----
511
512The :mod:`array` module supports the :c:type:`long long` type using ``q`` and
513``Q`` type codes.
514
515(Contributed by Oren Tirosh and Hirokazu Yamamoto in :issue:`1172711`)
516
517
Nadeem Vawdad7e5c6e2012-02-12 01:34:18 +0200518bz2
519---
520
521The :mod:`bz2` module has been rewritten from scratch. In the process, several
522new features have been added:
523
524* :class:`bz2.BZ2File` can now read from and write to arbitrary file-like
525 objects, by means of its constructor's *fileobj* argument.
526
527 (Contributed by Nadeem Vawda in :issue:`5863`)
528
529* :class:`bz2.BZ2File` and :func:`bz2.decompress` can now decompress
530 multi-stream inputs (such as those produced by the :program:`pbzip2` tool).
531 :class:`bz2.BZ2File` can now also be used to create this type of file, using
532 the ``'a'`` (append) mode.
533
534 (Contributed by Nir Aides in :issue:`1625`)
535
536* :class:`bz2.BZ2File` now implements all of the :class:`io.BufferedIOBase` API,
537 except for the :meth:`detach` and :meth:`truncate` methods.
538
539
Victor Stinner2cded9c2011-07-08 01:45:13 +0200540codecs
541------
542
Antoine Pitrou4f863432012-02-12 02:12:47 +0100543The :mod:`~encodings.mbcs` codec has been rewritten to handle correctly
Georg Brandlff962c52012-02-04 08:55:56 +0100544``replace`` and ``ignore`` error handlers on all Windows versions. The
545:mod:`~encodings.mbcs` codec now supports all error handlers, instead of only
546``replace`` to encode and ``ignore`` to decode.
Victor Stinner3a50e702011-10-18 21:21:00 +0200547
Georg Brandlff962c52012-02-04 08:55:56 +0100548A new Windows-only codec has been added: ``cp65001`` (:issue:`13216`). It is the
549Windows code page 65001 (Windows UTF-8, ``CP_UTF8``). For example, it is used
550by ``sys.stdout`` if the console output code page is set to cp65001 (e.g., using
551``chcp 65001`` command).
Victor Stinner2f3ca9f2011-10-27 01:38:56 +0200552
Georg Brandlff962c52012-02-04 08:55:56 +0100553Multibyte CJK decoders now resynchronize faster. They only ignore the first
Georg Brandl6c0929b2011-07-09 11:43:33 +0200554byte of an invalid byte sequence. For example, ``b'\xff\n'.decode('gb2312',
555'replace')`` now returns a ``\n`` after the replacement character.
Victor Stinner2cded9c2011-07-08 01:45:13 +0200556
Georg Brandl6c0929b2011-07-09 11:43:33 +0200557(:issue:`12016`)
Victor Stinner2cded9c2011-07-08 01:45:13 +0200558
Georg Brandlff962c52012-02-04 08:55:56 +0100559Incremental CJK codec encoders are no longer reset at each call to their
560encode() methods. For example::
Victor Stinner2cded9c2011-07-08 01:45:13 +0200561
562 $ ./python -q
563 >>> import codecs
564 >>> encoder = codecs.getincrementalencoder('hz')('strict')
565 >>> b''.join(encoder.encode(x) for x in '\u52ff\u65bd\u65bc\u4eba\u3002 Bye.')
566 b'~{NpJ)l6HK!#~} Bye.'
567
Georg Brandl6c0929b2011-07-09 11:43:33 +0200568This example gives ``b'~{Np~}~{J)~}~{l6~}~{HK~}~{!#~} Bye.'`` with older Python
Victor Stinner2cded9c2011-07-08 01:45:13 +0200569versions.
570
Georg Brandl6c0929b2011-07-09 11:43:33 +0200571(:issue:`12100`)
Victor Stinner2cded9c2011-07-08 01:45:13 +0200572
Victor Stinner9f4b1e92011-11-10 20:56:30 +0100573The ``unicode_internal`` codec has been deprecated.
574
Éric Araujo84b8ed82011-08-29 21:42:47 +0200575crypt
576-----
577
Victor Stinnerc78fb332011-09-21 03:35:44 +0200578Addition of salt and modular crypt format and the :func:`~crypt.mksalt`
579function to the :mod:`crypt` module.
Éric Araujo84b8ed82011-08-29 21:42:47 +0200580
581(:issue:`10924`)
582
Victor Stinnera7878b72011-07-14 23:07:44 +0200583curses
584------
585
Victor Stinner0fdfceb2011-11-25 22:10:02 +0100586 * If the :mod:`curses` module is linked to the ncursesw library, use Unicode
587 functions when Unicode strings or characters are passed (e.g.
588 :c:func:`waddwstr`), and bytes functions otherwise (e.g. :c:func:`waddstr`).
589 * Use the locale encoding instead of ``utf-8`` to encode Unicode strings.
590 * :class:`curses.window` has a new :attr:`curses.window.encoding` attribute.
Victor Stinnerc78fb332011-09-21 03:35:44 +0200591 * The :class:`curses.window` class has a new :meth:`~curses.window.get_wch`
592 method to get a wide character
593 * The :mod:`curses` module has a new :meth:`~curses.unget_wch` function to
594 push a wide character so the next :meth:`~curses.window.get_wch` will return
595 it
Victor Stinnera7878b72011-07-14 23:07:44 +0200596
Victor Stinnerc78fb332011-09-21 03:35:44 +0200597(Contributed by Iñigo Serna in :issue:`6755`)
Victor Stinnera7878b72011-07-14 23:07:44 +0200598
Victor Stinner024e37a2011-03-31 01:31:06 +0200599faulthandler
600------------
601
602New module: :mod:`faulthandler`.
603
604 * :envvar:`PYTHONFAULTHANDLER`
605 * :option:`-X` ``faulthandler``
606
Victor Stinner811db3b2011-09-21 03:20:03 +0200607ftplib
608------
609
610The :class:`~ftplib.FTP_TLS` class now provides a new
611:func:`~ftplib.FTP_TLS.ccc` function to revert control channel back to
Florent Xicluna6d57d212011-10-23 22:23:57 +0200612plaintext. This can be useful to take advantage of firewalls that know how to
Victor Stinner811db3b2011-09-21 03:20:03 +0200613handle NAT with non-secure FTP without opening fixed ports.
614
615(Contributed by Giampaolo Rodolà in :issue:`12139`)
616
617
Antoine Pitrou5a8bc6f2011-11-17 02:20:48 +0100618imaplib
619-------
620
621The :class:`~imaplib.IMAP4_SSL` constructor now accepts an SSLContext
622parameter to control parameters of the secure channel.
623
624(Contributed by Sijin Joseph in :issue:`8808`)
625
626
Charles-François Natalidc3044c2012-01-09 22:40:02 +0100627io
628--
629
Charles-François Natalid612de12012-01-14 11:51:00 +0100630The :func:`~io.open` function has a new ``'x'`` mode that can be used to
631exclusively create a new file, and raise a :exc:`FileExistsError` if the file
632already exists. It is based on the C11 'x' mode to fopen().
Charles-François Natalidc3044c2012-01-09 22:40:02 +0100633
634(Contributed by David Townshend in :issue:`12760`)
635
636
Nadeem Vawda34599222011-12-09 01:32:46 +0200637lzma
638----
639
640The newly-added :mod:`lzma` module provides data compression and decompression
641using the LZMA algorithm, including support for the ``.xz`` and ``.lzma``
642file formats.
643
644(Contributed by Nadeem Vawda and Per Øyvind Karlsen in :issue:`6715`)
645
646
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200647math
648----
649
650The :mod:`math` module has a new function:
651
652 * :func:`~math.log2`: return the base-2 logarithm of *x*
653 (Written by Mark Dickinson in :issue:`11888`).
654
655
656nntplib
657-------
658
659The :class:`nntplib.NNTP` class now supports the context manager protocol to
660unconditionally consume :exc:`socket.error` exceptions and to close the NNTP
661connection when done::
662
663 >>> from nntplib import NNTP
Ezio Melotti3c14b4e2011-07-13 11:44:44 +0300664 >>> with NNTP('news.gmane.org') as n:
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200665 ... n.group('gmane.comp.python.committers')
666 ...
Ezio Melotti04f648c2011-07-26 09:37:46 +0300667 ('211 1755 1 1755 gmane.comp.python.committers', 1755, 1, 1755, 'gmane.comp.python.committers')
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200668 >>>
669
670(Contributed by Giampaolo Rodolà in :issue:`9795`)
671
672
Giampaolo Rodolàc9c2c8b2011-02-25 14:39:16 +0000673os
674--
675
Charles-François Natalia003af12011-06-01 20:30:52 +0200676* The :mod:`os` module has a new :func:`~os.pipe2` function that makes it
677 possible to create a pipe with :data:`~os.O_CLOEXEC` or
678 :data:`~os.O_NONBLOCK` flags set atomically. This is especially useful to
679 avoid race conditions in multi-threaded programs.
680
Giampaolo Rodolà18e8bcb2011-02-25 20:57:54 +0000681* The :mod:`os` module has a new :func:`~os.sendfile` function which provides
682 an efficent "zero-copy" way for copying data from one file (or socket)
683 descriptor to another. The phrase "zero-copy" refers to the fact that all of
684 the copying of data between the two descriptors is done entirely by the
685 kernel, with no copying of data into userspace buffers. :func:`~os.sendfile`
686 can be used to efficiently copy data from a file on disk to a network socket,
687 e.g. for downloading a file.
Giampaolo Rodolàc9c2c8b2011-02-25 14:39:16 +0000688
Giampaolo Rodolà18e8bcb2011-02-25 20:57:54 +0000689 (Patch submitted by Ross Lagerwall and Giampaolo Rodolà in :issue:`10882`.)
690
691* The :mod:`os` module has two new functions: :func:`~os.getpriority` and
692 :func:`~os.setpriority`. They can be used to get or set process
693 niceness/priority in a fashion similar to :func:`os.nice` but extended to all
694 processes instead of just the current one.
695
696 (Patch submitted by Giampaolo Rodolà in :issue:`10784`.)
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000697
Charles-François Natali7372b062012-02-05 15:15:38 +0100698* The :mod:`os` module has a new :func:`~os.fwalk` function similar to
699 :func:`~os.walk` except that it also yields file descriptors referring to the
700 directories visited. This is especially useful to avoid symlink races.
701
Victor Stinnere5064372011-10-14 00:08:29 +0200702* "at" functions (:issue:`4761`):
703
704 * :func:`~os.faccessat`
705 * :func:`~os.fchmodat`
706 * :func:`~os.fchownat`
707 * :func:`~os.fstatat`
708 * :func:`~os.futimesat`
Victor Stinnere5064372011-10-14 00:08:29 +0200709 * :func:`~os.linkat`
710 * :func:`~os.mkdirat`
711 * :func:`~os.mkfifoat`
712 * :func:`~os.mknodat`
713 * :func:`~os.openat`
714 * :func:`~os.readlinkat`
715 * :func:`~os.renameat`
716 * :func:`~os.symlinkat`
717 * :func:`~os.unlinkat`
718 * :func:`~os.utimensat`
Victor Stinnere5064372011-10-14 00:08:29 +0200719
720* extended attributes (:issue:`12720`):
721
722 * :func:`~os.fgetxattr`
723 * :func:`~os.flistxattr`
724 * :func:`~os.fremovexattr`
725 * :func:`~os.fsetxattr`
726 * :func:`~os.getxattr`
727 * :func:`~os.lgetxattr`
728 * :func:`~os.listxattr`
729 * :func:`~os.llistxattr`
730 * :func:`~os.lremovexattr`
731 * :func:`~os.lsetxattr`
732 * :func:`~os.removexattr`
733 * :func:`~os.setxattr`
734
735* Scheduler functions (:issue:`12655`):
736
737 * :func:`~os.sched_get_priority_max`
738 * :func:`~os.sched_get_priority_min`
739 * :func:`~os.sched_getaffinity`
740 * :func:`~os.sched_getparam`
741 * :func:`~os.sched_getscheduler`
742 * :func:`~os.sched_rr_get_interval`
743 * :func:`~os.sched_setaffinity`
744 * :func:`~os.sched_setparam`
745 * :func:`~os.sched_setscheduler`
746 * :func:`~os.sched_yield`
747
748* Add some extra posix functions to the os module (:issue:`10812`):
749
750 * :func:`~os.fexecve`
751 * :func:`~os.futimens`
Victor Stinnere5064372011-10-14 00:08:29 +0200752 * :func:`~os.futimes`
753 * :func:`~os.lockf`
754 * :func:`~os.lutimes`
Victor Stinnere5064372011-10-14 00:08:29 +0200755 * :func:`~os.posix_fadvise`
756 * :func:`~os.posix_fallocate`
757 * :func:`~os.pread`
758 * :func:`~os.pwrite`
759 * :func:`~os.readv`
760 * :func:`~os.sync`
761 * :func:`~os.truncate`
762 * :func:`~os.waitid`
763 * :func:`~os.writev`
764
765* Other new functions:
766
Charles-François Natali77940902012-02-06 19:54:48 +0100767 * :func:`~os.flistdir` (:issue:`10755`)
Victor Stinnere5064372011-10-14 00:08:29 +0200768 * :func:`~os.getgrouplist` (:issue:`9344`)
769
Giampaolo Rodolà424298a2011-03-03 18:34:06 +0000770
Éric Araujo765e94f2011-06-03 17:26:59 +0200771packaging
772---------
773
774:mod:`distutils` has undergone additions and refactoring under a new name,
775:mod:`packaging`, to allow developers to break backward compatibility.
776:mod:`distutils` is still provided in the standard library, but users are
777encouraged to transition to :mod:`packaging`. For older versions of Python, a
778backport compatible with 2.4+ and 3.1+ will be made available on PyPI under the
779name :mod:`distutils2`.
780
781.. TODO add examples and howto to the packaging docs and link to them
782
783
Georg Brandl4c7c3c52012-03-10 22:36:48 +0100784pdb
785---
786
787* Tab-completion is now available not only for command names, but also their
788 arguments. For example, for the ``break`` command, function and file names
789 are completed. (Contributed by Georg Brandl in :issue:`14210`)
790
791
Victor Stinner383c3fc2011-05-25 01:35:05 +0200792pydoc
793-----
794
Victor Stinner6daa33c2011-05-25 01:41:22 +0200795The Tk GUI and the :func:`~pydoc.serve` function have been removed from the
796:mod:`pydoc` module: ``pydoc -g`` and :func:`~pydoc.serve` have been deprecated
797in Python 3.2.
Victor Stinner383c3fc2011-05-25 01:35:05 +0200798
799
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100800sched
801-----
Victor Stinner754851f2011-04-19 23:58:51 +0200802
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100803* :meth:`~sched.scheduler.run` now accepts a *blocking* parameter which when
804 set to False makes the method execute the scheduled events due to expire
805 soonest (if any) and then return immediately.
806 This is useful in case you want to use the :class:`~sched.scheduler` in
807 non-blocking applications. (Contributed by Giampaolo Rodolà in :issue:`13449`)
Victor Stinner754851f2011-04-19 23:58:51 +0200808
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100809* :class:`~sched.scheduler` class can now be safely used in multi-threaded
810 environments. (Contributed by Josiah Carlson and Giampaolo Rodolà in
811 :issue:`8684`)
812
813* *timefunc* and *delayfunct* parameters of :class:`~sched.scheduler` class
814 constructor are now optional and defaults to :func:`time.time` and
815 :func:`time.sleep` respectively. (Contributed by Chris Clark in
816 :issue:`13245`)
817
818* :meth:`~sched.scheduler.enter` and :meth:`~sched.scheduler.enterabs`
819 *argument* parameter is now optional. (Contributed by Chris Clark in
820 :issue:`13245`)
821
822* :meth:`~sched.scheduler.enter` and :meth:`~sched.scheduler.enterabs`
823 now accept a *kwargs* parameter. (Contributed by Chris Clark in
824 :issue:`13245`)
825
826
827shutil
828------
829
830* The :mod:`shutil` module has these new fuctions:
831
832 * :func:`~shutil.disk_usage`: provides total, used and free disk space
833 statistics. (Contributed by Giampaolo Rodolà in :issue:`12442`)
834 * :func:`~shutil.chown`: allows one to change user and/or group of the given
835 path also specifying the user/group names and not only their numeric
836 ids. (Contributed by Sandro Tosi in :issue:`12191`)
Victor Stinnera9293352011-04-30 15:21:58 +0200837
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200838
Victor Stinnera9293352011-04-30 15:21:58 +0200839signal
840------
841
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200842* The :mod:`signal` module has new functions:
Victor Stinnera9293352011-04-30 15:21:58 +0200843
Victor Stinnerb3e72192011-05-08 01:46:11 +0200844 * :func:`~signal.pthread_sigmask`: fetch and/or change the signal mask of the
845 calling thread (Contributed by Jean-Paul Calderone in :issue:`8407`) ;
846 * :func:`~signal.pthread_kill`: send a signal to a thread ;
847 * :func:`~signal.sigpending`: examine pending functions ;
848 * :func:`~signal.sigwait`: wait a signal.
Ross Lagerwallbc808222011-06-25 12:13:40 +0200849 * :func:`~signal.sigwaitinfo`: wait for a signal, returning detailed
850 information about it.
851 * :func:`~signal.sigtimedwait`: like :func:`~signal.sigwaitinfo` but with a
852 timeout.
Victor Stinnera9293352011-04-30 15:21:58 +0200853
Victor Stinnerd49b1f12011-05-08 02:03:15 +0200854* The signal handler writes the signal number as a single byte instead of
855 a nul byte into the wakeup file descriptor. So it is possible to wait more
856 than one signal and know which signals were raised.
857
Victor Stinner388196e2011-05-10 17:13:00 +0200858* :func:`signal.signal` and :func:`signal.siginterrupt` raise an OSError,
859 instead of a RuntimeError: OSError has an errno attribute.
860
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100861smtplib
862-------
863
864The :class:`~smtplib.SMTP_SSL` constructor and the :meth:`~smtplib.SMTP.starttls`
865method now accept an SSLContext parameter to control parameters of the secure
866channel.
867
868(Contributed by Kasun Herath in :issue:`8809`)
869
870
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000871socket
872------
873
Charles-François Natali47413c12011-10-06 19:47:44 +0200874* The :class:`~socket.socket` class now exposes additional methods to process
875 ancillary data when supported by the underlying platform:
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000876
Charles-François Natali47413c12011-10-06 19:47:44 +0200877 * :func:`~socket.socket.sendmsg`
878 * :func:`~socket.socket.recvmsg`
879 * :func:`~socket.socket.recvmsg_into`
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000880
Charles-François Natali47413c12011-10-06 19:47:44 +0200881 (Contributed by David Watson in :issue:`6560`, based on an earlier patch by
882 Heiko Wundram)
883
884* The :class:`~socket.socket` class now supports the PF_CAN protocol family
885 (http://en.wikipedia.org/wiki/Socketcan), on Linux
886 (http://lwn.net/Articles/253425).
887
888 (Contributed by Matthias Fuchs, updated by Tiago Gonçalves in :issue:`10141`)
889
Charles-François Natali10b8cf42011-11-10 19:21:37 +0100890* The :class:`~socket.socket` class now supports the PF_RDS protocol family
891 (http://en.wikipedia.org/wiki/Reliable_Datagram_Sockets and
892 http://oss.oracle.com/projects/rds/).
Victor Stinner754851f2011-04-19 23:58:51 +0200893
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100894
Victor Stinner99c8b162011-05-24 12:05:19 +0200895ssl
896---
897
Antoine Pitrou2c0a9672011-11-17 02:09:13 +0100898* The :mod:`ssl` module has two new random generation functions:
Victor Stinner99c8b162011-05-24 12:05:19 +0200899
900 * :func:`~ssl.RAND_bytes`: generate cryptographically strong
901 pseudo-random bytes.
902 * :func:`~ssl.RAND_pseudo_bytes`: generate pseudo-random bytes.
903
Antoine Pitrou2c0a9672011-11-17 02:09:13 +0100904 (Contributed by Victor Stinner in :issue:`12049`)
905
906* The :mod:`ssl` module now exposes a finer-grained exception hierarchy
907 in order to make it easier to inspect the various kinds of errors.
908
909 (Contributed by Antoine Pitrou in :issue:`11183`)
910
911* :meth:`~ssl.SSLContext.load_cert_chain` now accepts a *password* argument
912 to be used if the private key is encrypted.
913
914 (Contributed by Adam Simpkins in :issue:`12803`)
915
Antoine Pitrou73fc8142011-12-23 20:58:36 +0100916* Diffie-Hellman key exchange, both regular and Elliptic Curve-based, is
917 now supported through the :meth:`~ssl.SSLContext.load_dh_params` and
918 :meth:`~ssl.SSLContext.set_ecdh_curve` methods.
919
920 (Contributed by Antoine Pitrou in :issue:`13626` and :issue:`13627`)
921
Antoine Pitrou2c0a9672011-11-17 02:09:13 +0100922* SSL sockets have a new :meth:`~ssl.SSLSocket.get_channel_binding` method
923 allowing the implementation of certain authentication mechanisms such as
924 SCRAM-SHA-1-PLUS.
925
926 (Contributed by Jacek Konieczny in :issue:`12551`)
927
Antoine Pitrou73fc8142011-12-23 20:58:36 +0100928* You can query the SSL compression algorithm used by an SSL socket, thanks
929 to its new :meth:`~ssl.SSLSocket.compression` method.
930
931 (Contributed by Antoine Pitrou in :issue:`13634`)
932
933
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100934sys
935---
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200936
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100937* The :mod:`sys` module has a new :data:`~sys.thread_info` :term:`struct
938 sequence` holding informations about the thread implementation.
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200939
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100940 (:issue:`11223`)
Giampaolo Rodola'096dcb12011-06-27 11:17:51 +0200941
Antoine Pitrou5a8bc6f2011-11-17 02:20:48 +0100942
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100943time
944----
Antoine Pitrou5a8bc6f2011-11-17 02:20:48 +0100945
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100946The :mod:`time` module has new functions:
947
948* :func:`~time.clock_getres` and :func:`~time.clock_gettime` functions and
949 ``CLOCK_xxx`` constants.
Victor Stinnerec919cc2012-03-15 00:58:32 +0100950* :func:`~time.steady`.
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100951
952(Contributed by Victor Stinner in :issue:`10278`)
953
Antoine Pitrou5a8bc6f2011-11-17 02:20:48 +0100954
Senthil Kumarande49d642011-10-16 23:54:44 +0800955urllib
956------
957
958The :class:`~urllib.request.Request` class, now accepts a *method* argument
959used by :meth:`~urllib.request.Request.get_method` to determine what HTTP method
Senthil Kumarana41c9422011-10-20 02:37:08 +0800960should be used. For example, this will send a ``'HEAD'`` request::
Senthil Kumarande49d642011-10-16 23:54:44 +0800961
962 >>> urlopen(Request('http://www.python.org', method='HEAD'))
963
964(:issue:`1673007`)
Giampaolo Rodola'096dcb12011-06-27 11:17:51 +0200965
Giampaolo Rodola'be55d992011-11-22 13:33:34 +0100966
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000967Optimizations
968=============
969
970Major performance enhancements have been added:
971
Victor Stinner46606ce2011-11-20 18:27:55 +0100972* Thanks to the :pep:`393`, some operations on Unicode strings has been optimized:
973
974 * the memory footprint is divided by 2 to 4 depending on the text
Victor Stinnera996f1e2011-11-21 13:14:43 +0100975 * encode an ASCII string to UTF-8 doesn't need to encode characters anymore,
976 the UTF-8 representation is shared with the ASCII representation
Victor Stinner6099a032011-12-18 14:22:26 +0100977 * the UTF-8 encoder has been optimized
978 * repeating a single ASCII letter and getting a substring of a ASCII strings
979 is 4 times faster
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000980
981
982Build and C API Changes
983=======================
984
985Changes to Python's build process and to the C API include:
986
Stefan Krah95b1ba62012-02-29 17:27:21 +0100987* New :pep:`3118` related function:
988
989 * :c:func:`PyMemoryView_FromMemory`
990
Victor Stinner46606ce2011-11-20 18:27:55 +0100991* The :pep:`393` added new Unicode types, macros and functions:
992
Victor Stinnera996f1e2011-11-21 13:14:43 +0100993 * High-level API:
994
995 * :c:func:`PyUnicode_CopyCharacters`
996 * :c:func:`PyUnicode_FindChar`
997 * :c:func:`PyUnicode_GetLength`, :c:macro:`PyUnicode_GET_LENGTH`
998 * :c:func:`PyUnicode_New`
999 * :c:func:`PyUnicode_Substring`
1000 * :c:func:`PyUnicode_ReadChar`, :c:func:`PyUnicode_WriteChar`
1001
1002 * Low-level API:
1003
1004 * :c:type:`Py_UCS1`, :c:type:`Py_UCS2`, :c:type:`Py_UCS4` types
1005 * :c:type:`PyASCIIObject` and :c:type:`PyCompactUnicodeObject` structures
1006 * :c:macro:`PyUnicode_READY`
1007 * :c:func:`PyUnicode_FromKindAndData`
1008 * :c:func:`PyUnicode_AsUCS4`, :c:func:`PyUnicode_AsUCS4Copy`
1009 * :c:macro:`PyUnicode_DATA`, :c:macro:`PyUnicode_1BYTE_DATA`,
1010 :c:macro:`PyUnicode_2BYTE_DATA`, :c:macro:`PyUnicode_4BYTE_DATA`
1011 * :c:macro:`PyUnicode_KIND` with :c:type:`PyUnicode_Kind` enum:
1012 :c:data:`PyUnicode_WCHAR_KIND`, :c:data:`PyUnicode_1BYTE_KIND`,
1013 :c:data:`PyUnicode_2BYTE_KIND`, :c:data:`PyUnicode_4BYTE_KIND`
1014 * :c:macro:`PyUnicode_READ`, :c:macro:`PyUnicode_READ_CHAR`, :c:macro:`PyUnicode_WRITE`
1015 * :c:macro:`PyUnicode_MAX_CHAR_VALUE`
1016
Giampaolo Rodolà3108f982011-02-24 20:59:48 +00001017
1018
Victor Stinnerd1be8782011-12-09 00:10:41 +01001019Deprecated
1020==========
1021
Georg Brandl0cd25c92011-04-29 13:45:54 +02001022Unsupported Operating Systems
Victor Stinnerd1be8782011-12-09 00:10:41 +01001023-----------------------------
Victor Stinnerb90db4c2011-04-26 22:48:24 +02001024
Brian Curtin49a40cd2011-05-02 22:30:06 -05001025OS/2 and VMS are no longer supported due to the lack of a maintainer.
1026
1027Windows 2000 and Windows platforms which set ``COMSPEC`` to ``command.com``
1028are no longer supported due to maintenance burden.
Victor Stinnerb90db4c2011-04-26 22:48:24 +02001029
1030
Victor Stinner46606ce2011-11-20 18:27:55 +01001031Deprecated Python modules, functions and methods
Victor Stinnerd1be8782011-12-09 00:10:41 +01001032------------------------------------------------
Victor Stinner19bd0692011-11-16 00:18:57 +01001033
1034* The :mod:`packaging` module replaces the :mod:`distutils` module
1035* The ``unicode_internal`` codec has been deprecated because of the
Sandro Tosicd899122012-01-22 12:16:04 +01001036 :pep:`393`, use UTF-8, UTF-16 (``utf-16-le`` or ``utf-16-be``), or UTF-32
1037 (``utf-32-le`` or ``utf-32-be``)
Victor Stinner19bd0692011-11-16 00:18:57 +01001038* :meth:`ftplib.FTP.nlst` and :meth:`ftplib.FTP.dir`: use
Victor Stinner46606ce2011-11-20 18:27:55 +01001039 :meth:`ftplib.FTP.mlsd`
Victor Stinner19bd0692011-11-16 00:18:57 +01001040* :func:`platform.popen`: use the :mod:`subprocess` module. Check especially
1041 the :ref:`subprocess-replacements` section.
1042* :issue:`13374`: The Windows bytes API has been deprecated in the :mod:`os`
Victor Stinner46606ce2011-11-20 18:27:55 +01001043 module. Use Unicode filenames, instead of bytes filenames, to not depend on
Victor Stinner19bd0692011-11-16 00:18:57 +01001044 the ANSI code page anymore and to support any filename.
Florent Xiclunaa72a98f2012-02-13 11:03:30 +01001045* :issue:`13988`: The :mod:`xml.etree.cElementTree` module is deprecated. The
1046 accelerator is used automatically whenever available.
Victor Stinner19bd0692011-11-16 00:18:57 +01001047
1048
Victor Stinner46606ce2011-11-20 18:27:55 +01001049Deprecated functions and types of the C API
Victor Stinnerd1be8782011-12-09 00:10:41 +01001050-------------------------------------------
Victor Stinner46606ce2011-11-20 18:27:55 +01001051
1052The :c:type:`Py_UNICODE` has been deprecated by the :pep:`393` and will be
1053removed in Python 4. All functions using this type are deprecated:
1054
Victor Stinner46606ce2011-11-20 18:27:55 +01001055Unicode functions and methods using :c:type:`Py_UNICODE` and
1056:c:type:`Py_UNICODE*` types:
1057
1058 * :c:macro:`PyUnicode_FromUnicode`: use :c:func:`PyUnicode_FromWideChar` or
1059 :c:func:`PyUnicode_FromKindAndData`
1060 * :c:macro:`PyUnicode_AS_UNICODE`, :c:func:`PyUnicode_AsUnicode`,
1061 :c:func:`PyUnicode_AsUnicodeAndSize`: use :c:func:`PyUnicode_AsWideCharString`
1062 * :c:macro:`PyUnicode_AS_DATA`: use :c:macro:`PyUnicode_DATA` with
1063 :c:macro:`PyUnicode_READ` and :c:macro:`PyUnicode_WRITE`
1064 * :c:macro:`PyUnicode_GET_SIZE`, :c:func:`PyUnicode_GetSize`: use
1065 :c:macro:`PyUnicode_GET_LENGTH` or :c:func:`PyUnicode_GetLength`
1066 * :c:macro:`PyUnicode_GET_DATA_SIZE`: use
1067 ``PyUnicode_GET_LENGTH(str) * PyUnicode_KIND(str)`` (only work on ready
1068 strings)
Victor Stinnerbf6e5602011-12-12 01:53:47 +01001069 * :c:func:`PyUnicode_AsUnicodeCopy`: use :c:func:`PyUnicode_AsUCS4Copy` or
1070 :c:func:`PyUnicode_AsWideCharString`
Victor Stinnerab595942011-12-17 04:59:06 +01001071 * :c:func:`PyUnicode_GetMax`
1072
Victor Stinner46606ce2011-11-20 18:27:55 +01001073
Victor Stinnera996f1e2011-11-21 13:14:43 +01001074Functions and macros manipulating Py_UNICODE* strings:
1075
1076 * :c:macro:`Py_UNICODE_strlen`: use :c:func:`PyUnicode_GetLength` or
1077 :c:macro:`PyUnicode_GET_LENGTH`
1078 * :c:macro:`Py_UNICODE_strcat`: use :c:func:`PyUnicode_CopyCharacters` or
1079 :c:func:`PyUnicode_FromFormat`
1080 * :c:macro:`Py_UNICODE_strcpy`, :c:macro:`Py_UNICODE_strncpy`,
1081 :c:macro:`Py_UNICODE_COPY`: use :c:func:`PyUnicode_CopyCharacters` or
1082 :c:func:`PyUnicode_Substring`
1083 * :c:macro:`Py_UNICODE_strcmp`: use :c:func:`PyUnicode_Compare`
1084 * :c:macro:`Py_UNICODE_strncmp`: use :c:func:`PyUnicode_Tailmatch`
1085 * :c:macro:`Py_UNICODE_strchr`, :c:macro:`Py_UNICODE_strrchr`: use
1086 :c:func:`PyUnicode_FindChar`
Victor Stinner606e19d2012-01-04 03:59:16 +01001087 * :c:macro:`Py_UNICODE_FILL`: use :c:func:`PyUnicode_Fill`
Victor Stinnerab595942011-12-17 04:59:06 +01001088 * :c:macro:`Py_UNICODE_MATCH`
Victor Stinnera996f1e2011-11-21 13:14:43 +01001089
Victor Stinner46606ce2011-11-20 18:27:55 +01001090Encoders:
1091
1092 * :c:func:`PyUnicode_Encode`: use :c:func:`PyUnicode_AsEncodedObject`
1093 * :c:func:`PyUnicode_EncodeUTF7`
Victor Stinnera996f1e2011-11-21 13:14:43 +01001094 * :c:func:`PyUnicode_EncodeUTF8`: use :c:func:`PyUnicode_AsUTF8` or
1095 :c:func:`PyUnicode_AsUTF8String`
Victor Stinner46606ce2011-11-20 18:27:55 +01001096 * :c:func:`PyUnicode_EncodeUTF32`
1097 * :c:func:`PyUnicode_EncodeUTF16`
1098 * :c:func:`PyUnicode_EncodeUnicodeEscape:` use
1099 :c:func:`PyUnicode_AsUnicodeEscapeString`
1100 * :c:func:`PyUnicode_EncodeRawUnicodeEscape:` use
1101 :c:func:`PyUnicode_AsRawUnicodeEscapeString`
1102 * :c:func:`PyUnicode_EncodeLatin1`: use :c:func:`PyUnicode_AsLatin1String`
1103 * :c:func:`PyUnicode_EncodeASCII`: use :c:func:`PyUnicode_AsASCIIString`
1104 * :c:func:`PyUnicode_EncodeCharmap`
1105 * :c:func:`PyUnicode_TranslateCharmap`
1106 * :c:func:`PyUnicode_EncodeMBCS`: use :c:func:`PyUnicode_AsMBCSString` or
1107 :c:func:`PyUnicode_EncodeCodePage` (with ``CP_ACP`` code_page)
1108 * :c:func:`PyUnicode_EncodeDecimal`,
1109 :c:func:`PyUnicode_TransformDecimalToASCII`
1110
1111
Giampaolo Rodolà3108f982011-02-24 20:59:48 +00001112Porting to Python 3.3
1113=====================
1114
1115This section lists previously described changes and other bugfixes
Antoine Pitrou037ffbf2011-10-24 00:25:41 +02001116that may require changes to your code.
1117
1118Porting Python code
1119-------------------
Giampaolo Rodolà3108f982011-02-24 20:59:48 +00001120
Georg Brandld6c43402012-03-07 08:55:52 +01001121.. XXX add a point about hash randomization and that it's always on in 3.3
1122
Victor Stinner30c825c2012-03-14 00:40:57 +01001123* :issue:`14205`: A dict lookup now raises a :exc:`RuntimeError` if the dict is
1124 modified during the lookup. If you implement your own comparison function for
1125 objects used as dict keys and the dict is shared by multiple threads, access
1126 to the dict should be protected by a lock.
1127
Victor Stinner19bd0692011-11-16 00:18:57 +01001128* :issue:`12326`: On Linux, sys.platform doesn't contain the major version
Victor Stinnerff3d9392011-08-20 23:39:26 +02001129 anymore. It is now always 'linux', instead of 'linux2' or 'linux3' depending
1130 on the Linux version used to build Python. Replace sys.platform == 'linux2'
1131 with sys.platform.startswith('linux'), or directly sys.platform == 'linux' if
1132 you don't need to support older Python versions.
Éric Araujoc09fca62011-03-23 02:06:24 +01001133
Victor Stinnerecc6e662012-03-14 00:39:29 +01001134* :issue:`13847`, :issue:`14180`: :mod:`time` and :mod:`datetime`:
1135 :exc:`OverflowError` is now raised instead of :exc:`ValueError` if a
1136 timestamp is out of range. :exc:`OSError` is now raised if C functions
1137 :c:func:`gmtime` or :c:func:`localtime` failed.
1138
Antoine Pitrou037ffbf2011-10-24 00:25:41 +02001139Porting C code
1140--------------
1141
Stefan Krah54c32032012-02-29 17:47:21 +01001142* In the course of changes to the buffer API the undocumented
1143 :c:member:`~Py_buffer.smalltable` member of the
1144 :c:type:`Py_buffer` structure has been removed and the
1145 layout of the :c:type:`PyMemoryViewObject` has changed.
1146
1147 All extensions relying on the relevant parts in ``memoryobject.h``
1148 or ``object.h`` must be rebuilt.
1149
Antoine Pitrou037ffbf2011-10-24 00:25:41 +02001150* Due to :ref:`PEP 393 <pep-393>`, the :c:type:`Py_UNICODE` type and all
1151 functions using this type are deprecated (but will stay available for
1152 at least five years). If you were using low-level Unicode APIs to
1153 construct and access unicode objects and you want to benefit of the
1154 memory footprint reduction provided by the PEP 393, you have to convert
1155 your code to the new :doc:`Unicode API <../c-api/unicode>`.
1156
1157 However, if you only have been using high-level functions such as
1158 :c:func:`PyUnicode_Concat()`, :c:func:`PyUnicode_Join` or
1159 :c:func:`PyUnicode_FromFormat()`, your code will automatically take
1160 advantage of the new unicode representations.
1161
Antoine Pitrouc229e6e2012-02-20 19:41:11 +01001162Building C extensions
1163---------------------
1164
1165* The range of possible file names for C extensions has been narrowed.
1166 Very rarely used spellings have been suppressed: under POSIX, files
1167 named ``xxxmodule.so``, ``xxxmodule.abi3.so`` and
1168 ``xxxmodule.cpython-*.so`` are no longer recognized as implementing
1169 the ``xxx`` module. If you had been generating such files, you have
1170 to switch to the other spellings (i.e., remove the ``module`` string
1171 from the file names).
1172
1173 (implemented in :issue:`14040`.)
1174
1175
Antoine Pitrou037ffbf2011-10-24 00:25:41 +02001176Other issues
1177------------
1178
Éric Araujoc09fca62011-03-23 02:06:24 +01001179.. Issue #11591: When :program:`python` was started with :option:`-S`,
1180 ``import site`` will not add site-specific paths to the module search
1181 paths. In previous versions, it did. See changeset for doc changes in
1182 various files. Contributed by Carl Meyer with editions by Éric Araujo.
Éric Araujobe3bd572011-03-26 01:55:15 +01001183
Éric Araujobfc97292011-11-14 18:18:15 +01001184.. Issue #10998: the -Q command-line flag and related artifacts have been
Éric Araujobe3bd572011-03-26 01:55:15 +01001185 removed. Code checking sys.flags.division_warning will need updating.
1186 Contributed by Éric Araujo.