****************************
  What's New In Python 3.3
****************************

:Author: Raymond Hettinger
:Release: |release|
:Date: |today|

.. Rules for maintenance:

   * Anyone can add text to this document. Do not spend very much time
   on the wording of your changes, because your text will probably
   get rewritten to some degree.

   * The maintainer will go through Misc/NEWS periodically and add
   changes; it's therefore more important to add your changes to
   Misc/NEWS than to this file.

   * This is not a complete list of every single change; completeness
   is the purpose of Misc/NEWS. Some changes I consider too small
   or esoteric to include. If such a change is added to the text,
   I'll just remove it. (This is another reason you shouldn't spend
   too much time on writing your addition.)

   * If you want to draw your new text to the attention of the
   maintainer, add 'XXX' to the beginning of the paragraph or
   section.

   * It's OK to just add a fragmentary note about a change. For
   example: "XXX Describe the transmogrify() function added to the
   socket module." The maintainer will research the change and
   write the necessary text.

   * You can comment out your additions if you like, but it's not
   necessary (especially when a final release is some months away).

   * Credit the author of a patch or bugfix. Just the name is
   sufficient; the e-mail address isn't necessary.

   * It's helpful to add the bug/patch number as a comment:

   XXX Describe the transmogrify() function added to the socket
   module.
   (Contributed by P.Y. Developer in :issue:`12345`.)

   This saves the maintainer the effort of going through the Mercurial log
   when researching a change.

This article explains the new features in Python 3.3, compared to 3.2.


.. _pep-3118-update:

PEP 3118: New memoryview implementation and buffer protocol documentation
=========================================================================

:issue:`10181` - memoryview bug fixes and features.
  Written by Stefan Krah.

The new memoryview implementation comprehensively fixes all ownership and
lifetime issues of dynamically allocated fields in the Py_buffer struct
that led to multiple crash reports. Additionally, several functions that
crashed or returned incorrect results for non-contiguous or multi-dimensional
input have been fixed.

The memoryview object now has a PEP-3118 compliant getbufferproc()
that checks the consumer's request type. Many new features have been
added; most of them work in full generality for non-contiguous arrays
and arrays with suboffsets.

The documentation has been updated, clearly spelling out responsibilities
for both exporters and consumers. Buffer request flags are grouped into
basic and compound flags. The memory layout of non-contiguous and
multi-dimensional NumPy-style arrays is explained.

Features
--------

* All native single character format specifiers in struct module syntax
  (optionally prefixed with '@') are now supported.

* With some restrictions, the cast() method allows changing of format and
  shape of C-contiguous arrays.

* Multi-dimensional list representations are supported for any array type.

* Multi-dimensional comparisons are supported for any array type.

* All array types are hashable if the exporting object is hashable
  and the view is read-only. (Contributed by Antoine Pitrou in
  :issue:`13411`)

* Arbitrary slicing of any 1-D array type is supported. For example, it
  is now possible to reverse a memoryview in O(1) by using a negative step.
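
A few of the features listed above can be tried from the interactive prompt.
The following is only a small sketch; the variable names are illustrative and
the six-byte buffer is an arbitrary choice.

```python
# A read-only view over an immutable (and therefore hashable) bytes object.
data = bytes(range(6))
view = memoryview(data)

# cast() reshapes the C-contiguous 1-D view into a 2x3 array.
matrix = view.cast('B', shape=[2, 3])

# Multi-dimensional list representation.
nested = matrix.tolist()

# A negative step reverses the view without copying the underlying buffer;
# tobytes() is only used here to make the result easy to inspect.
reversed_bytes = view[::-1].tobytes()

# A read-only view of a hashable exporter hashes like the exporter.
same_hash = hash(view) == hash(data)

# Multi-dimensional views compare by value.
equal_views = matrix == memoryview(data).cast('B', shape=[2, 3])
```

Here ``nested`` is a list of two rows of three integers each, and
``reversed_bytes`` holds the original bytes in reverse order.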

API changes
-----------

* The maximum number of dimensions is officially limited to 64.

* The representation of empty shape, strides and suboffsets is now
  an empty tuple instead of None.

* Accessing a memoryview element with format 'B' (unsigned bytes)
  now returns an integer (in accordance with the struct module syntax).
  For returning a bytes object the view must be cast to 'c' first.

* For further changes see `Build and C API Changes`_ and `Porting C code`_ .
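
The change to element access for format 'B' is the one most likely to affect
existing Python code. A minimal sketch, with illustrative variable names:

```python
view = memoryview(b'abc')

# Format 'B' is the default for a view over bytes; indexing yields an int.
as_int = view[0]

# Casting to 'c' first restores the old behaviour of returning bytes.
as_bytes = view.cast('c')[0]
```

Code that relied on the old behaviour can either cast the view to 'c' or
compare against integers instead of length-1 bytes objects.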

.. _pep-393:

PEP 393: Flexible String Representation
=======================================

The Unicode string type is changed to support multiple internal
representations, depending on the character with the largest Unicode ordinal
(1, 2, or 4 bytes) in the represented string. This allows a space-efficient
representation in common cases, but gives access to full UCS-4 on all
systems. For compatibility with existing APIs, several representations may
exist in parallel; over time, this compatibility should be phased out.

On the Python side, there should be no downside to this change.

On the C API side, PEP 393 is fully backward compatible. The legacy API
should remain available for at least five years. Applications using the legacy
API will not fully benefit from the memory reduction, or - worse - may use
a bit more memory, because Python may have to maintain two versions of each
string (in the legacy format and in the new efficient storage).

Functionality
-------------

Changes introduced by :pep:`393` are the following:

* Python now always supports the full range of Unicode codepoints, including
  non-BMP ones (i.e. from ``U+0000`` to ``U+10FFFF``). The distinction between
  narrow and wide builds no longer exists and Python now behaves like a wide
  build, even under Windows.

* With the death of narrow builds, the problems specific to narrow builds have
  also been fixed, for example:

  * :func:`len` now always returns 1 for non-BMP characters,
    so ``len('\U0010FFFF') == 1``;

  * surrogate pairs are not recombined in string literals,
    so ``'\uDBFF\uDFFF' != '\U0010FFFF'``;

  * indexing or slicing non-BMP characters returns the expected value,
    so ``'\U0010FFFF'[0]`` now returns ``'\U0010FFFF'`` and not ``'\uDBFF'``;

  * all other functions in the standard library now correctly handle
    non-BMP codepoints.

* The value of :data:`sys.maxunicode` is now always ``1114111`` (``0x10FFFF``
  in hexadecimal). The :c:func:`PyUnicode_GetMax` function still returns
  either ``0xFFFF`` or ``0x10FFFF`` for backward compatibility, and it should
  not be used with the new Unicode API (see :issue:`13054`).

* The :file:`./configure` flag ``--with-wide-unicode`` has been removed.
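
The behaviour described in the list above can be checked directly. This is a
short sketch; the variable names are illustrative only.

```python
import sys

astral = '\U0010FFFF'          # the highest codepoint, outside the BMP
surrogates = '\uDBFF\uDFFF'    # two lone surrogates, not recombined

max_is_full_range = sys.maxunicode == 0x10FFFF
length = len(astral)           # one codepoint, so one item
first_item = astral[0]         # indexing returns the whole character
```

On a former narrow build, ``length`` would have been 2 and ``first_item``
would have been the high surrogate.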

Performance and resource usage
------------------------------

The storage of Unicode strings now depends on the highest codepoint in the string:

* pure ASCII and Latin1 strings (``U+0000-U+00FF``) use 1 byte per codepoint;

* BMP strings (``U+0000-U+FFFF``) use 2 bytes per codepoint;

* non-BMP strings (``U+10000-U+10FFFF``) use 4 bytes per codepoint.

The net effect is that for most applications, memory usage of string
storage should decrease significantly - especially compared to former
wide unicode builds - as, in many cases, strings will be pure ASCII
even in international contexts (because many strings store non-human
language data, such as XML fragments, HTTP headers, JSON-encoded data,
etc.). We also hope that it will, for the same reasons, increase CPU
cache efficiency on non-trivial applications. The memory usage of
Python 3.3 is two to three times smaller than Python 3.2, and a little
bit better than Python 2.7, on a Django benchmark (see the PEP for
details).
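
The per-codepoint storage cost can be observed with :func:`sys.getsizeof`.
The helper below is a hypothetical sketch, not part of any module, and it
assumes CPython, where the reported size grows linearly with the string
length once the fixed header is subtracted out.

```python
import sys

def bytes_per_codepoint(char, count=1000):
    """Estimate storage per codepoint by comparing two string lengths.

    Subtracting the two sizes cancels the fixed per-object header, which
    differs between the internal representations.
    """
    short = sys.getsizeof(char * count)
    longer = sys.getsizeof(char * (2 * count))
    return (longer - short) // count

ascii_cost = bytes_per_codepoint('a')            # pure ASCII
latin1_cost = bytes_per_codepoint('\xe9')        # Latin1
bmp_cost = bytes_per_codepoint('\u20ac')         # BMP
astral_cost = bytes_per_codepoint('\U0001F600')  # non-BMP
```

The four results correspond to the three storage widths listed above, with
ASCII and Latin1 sharing the narrowest one.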


PEP 3151: Reworking the OS and IO exception hierarchy
=====================================================

:pep:`3151` - Reworking the OS and IO exception hierarchy
  PEP written and implemented by Antoine Pitrou.

The hierarchy of exceptions raised by operating system errors is now both
simplified and finer-grained.

You don't have to worry anymore about choosing the appropriate exception
type between :exc:`OSError`, :exc:`IOError`, :exc:`EnvironmentError`,
:exc:`WindowsError`, :exc:`mmap.error`, :exc:`socket.error` or
:exc:`select.error`. All these exception types are now a single type:
:exc:`OSError`. The other names are kept as aliases for compatibility
reasons.

Also, it is now easier to catch a specific error condition. Instead of
inspecting the ``errno`` attribute (or ``args[0]``) for a particular
constant from the :mod:`errno` module, you can catch the appropriate
:exc:`OSError` subclass. The available subclasses are the following:

* :exc:`BlockingIOError`
* :exc:`ChildProcessError`
* :exc:`ConnectionError`
* :exc:`FileExistsError`
* :exc:`FileNotFoundError`
* :exc:`InterruptedError`
* :exc:`IsADirectoryError`
* :exc:`NotADirectoryError`
* :exc:`PermissionError`
* :exc:`ProcessLookupError`
* :exc:`TimeoutError`

And the :exc:`ConnectionError` itself has finer-grained subclasses:

* :exc:`BrokenPipeError`
* :exc:`ConnectionAbortedError`
* :exc:`ConnectionRefusedError`
* :exc:`ConnectionResetError`
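
The aliasing and the mapping from ``errno`` values to subclasses can be
verified with a few lines. This is an illustrative sketch; the temporary
directory and the file name ``missing.txt`` are arbitrary, and
:exc:`WindowsError` is left out because it only exists on Windows.

```python
import errno
import mmap
import os
import select
import socket
import tempfile

# The legacy names are plain aliases of OSError.
aliases_unified = all(
    alias is OSError
    for alias in (IOError, EnvironmentError,
                  socket.error, select.error, mmap.error)
)

# Constructing OSError with a known errno yields the matching subclass.
mapped_type = type(OSError(errno.ENOENT, "No such file or directory"))

# The errno attribute is still set on the finer-grained exceptions.
caught_errno = None
with tempfile.TemporaryDirectory() as workdir:
    try:
        open(os.path.join(workdir, "missing.txt"))
    except FileNotFoundError as err:
        caught_errno = err.errno
```

Existing ``except IOError`` or ``except socket.error`` clauses therefore keep
working, and they also catch every new subclass.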

Thanks to the new exceptions, common usages of the :mod:`errno` module can now
be avoided. For example, the following code written for Python 3.2::

    from errno import ENOENT, EACCES, EPERM

    try:
        with open("document.txt") as f:
            content = f.read()
    except IOError as err:
        if err.errno == ENOENT:
            print("document.txt file is missing")
        elif err.errno in (EACCES, EPERM):
            print("You are not allowed to read document.txt")
        else:
            raise

can now be written without the :mod:`errno` import and without manual
inspection of exception attributes::

    try:
        with open("document.txt") as f:
            content = f.read()
    except FileNotFoundError:
        print("document.txt file is missing")
    except PermissionError:
        print("You are not allowed to read document.txt")


PEP 380: Syntax for Delegating to a Subgenerator
================================================

:pep:`380` - Syntax for Delegating to a Subgenerator
  PEP written by Greg Ewing.

PEP 380 adds the ``yield from`` expression, allowing a generator to delegate
part of its operations to another generator. This allows a section of code
containing 'yield' to be factored out and placed in another generator.
Additionally, the subgenerator is allowed to return with a value, and the
value is made available to the delegating generator.

While designed primarily for use in delegating to a subgenerator, the ``yield
from`` expression actually allows delegation to arbitrary subiterators.

For simple iterators, ``yield from iterable`` is essentially just a shortened
form of ``for item in iterable: yield item``::

    >>> def g(x):
    ...     yield from range(x, 0, -1)
    ...     yield from range(x)
    ...
    >>> list(g(5))
    [5, 4, 3, 2, 1, 0, 1, 2, 3, 4]

However, unlike an ordinary loop, ``yield from`` allows subgenerators to
receive sent and thrown values directly from the calling scope, and
return a final value to the outer generator::

    >>> def accumulate(start=0):
    ...     tally = start
    ...     while 1:
    ...         next = yield
    ...         if next is None:
    ...             return tally
    ...         tally += next
    ...
    >>> def gather_tallies(tallies, start=0):
    ...     while 1:
    ...         tally = yield from accumulate()
    ...         tallies.append(tally)
    ...
    >>> tallies = []
    >>> acc = gather_tallies(tallies)
    >>> next(acc) # Ensure the accumulator is ready to accept values
    >>> for i in range(10):
    ...     acc.send(i)
    ...
    >>> acc.send(None) # Finish the first tally
    >>> for i in range(5):
    ...     acc.send(i)
    ...
    >>> acc.send(None) # Finish the second tally
    >>> tallies
    [45, 10]

The main principle driving this change is to allow even generators that are
designed to be used with the ``send`` and ``throw`` methods to be split into
multiple subgenerators as easily as a single large function can be split into
multiple subfunctions.

(Implementation by Greg Ewing, integrated into 3.3 by Renaud Blanch, Ryan
Kelly and Nick Coghlan, documentation by Zbigniew Jędrzejewski-Szmek and
Nick Coghlan)
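
The examples above exercise ``send``; the same forwarding applies to
``throw``. The following sketch uses two hypothetical generators, ``worker``
and ``supervisor``, to show an exception thrown into the outer generator
being handled inside the subgenerator.

```python
def worker(log):
    """Record sent items until a ValueError is thrown in."""
    try:
        while True:
            item = yield
            log.append(item)
    except ValueError:
        log.append("reset")
        return len(log)          # becomes the value of ``yield from``

def supervisor(log):
    """Delegate to worker(), then report how many entries it logged."""
    count = yield from worker(log)
    yield count

log = []
gen = supervisor(log)
next(gen)                        # advance to the first ``yield`` in worker()
gen.send("a")
gen.send("b")
result = gen.throw(ValueError)   # forwarded to worker(), which returns 3
```

The caller never interacts with ``worker`` directly, yet the exception is
raised at the subgenerator's ``yield`` and its return value flows back
through ``supervisor``.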


PEP 409: Suppressing exception context
======================================

:pep:`409` - Suppressing exception context
  PEP written by Ethan Furman, implemented by Ethan Furman and Nick Coghlan.

PEP 409 introduces new syntax that allows the display of the chained
exception context to be disabled. This allows cleaner error messages in
applications that convert between exception types::

    >>> class D:
    ...     def __init__(self, extra):
    ...         self._extra_attributes = extra
    ...     def __getattr__(self, attr):
    ...         try:
    ...             return self._extra_attributes[attr]
    ...         except KeyError:
    ...             raise AttributeError(attr) from None
    ...
    >>> D({}).x
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "<stdin>", line 8, in __getattr__
    AttributeError: x

Without the ``from None`` suffix to suppress the cause, the original
exception would be displayed by default::

    >>> class C:
    ...     def __init__(self, extra):
    ...         self._extra_attributes = extra
    ...     def __getattr__(self, attr):
    ...         try:
    ...             return self._extra_attributes[attr]
    ...         except KeyError:
    ...             raise AttributeError(attr)
    ...
    >>> C({}).x
    Traceback (most recent call last):
      File "<stdin>", line 6, in __getattr__
    KeyError: 'x'

    During handling of the above exception, another exception occurred:

    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "<stdin>", line 8, in __getattr__
    AttributeError: x

No debugging capability is lost, as the original exception context remains
available if needed (for example, if an intervening library has incorrectly
suppressed valuable underlying details)::

    >>> try:
    ...     D({}).x
    ... except AttributeError as exc:
    ...     print(repr(exc.__context__))
    ...
    KeyError('x',)


PEP 414: Explicit Unicode literals
==================================

:pep:`414` - Explicit Unicode literals
  PEP written by Armin Ronacher.

To ease the transition from Python 2 for Unicode aware Python applications
that make heavy use of Unicode literals, Python 3.3 once again supports the
"``u``" prefix for string literals. This prefix has no semantic significance
in Python 3; it is provided solely to reduce the number of purely mechanical
changes in migrating to Python 3, making it easier for developers to focus on
the more significant semantic changes (such as the stricter default
separation of binary and text data).
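
A brief sketch of what "no semantic significance" means in practice; the
sample text is arbitrary.

```python
legacy = u'caf\xe9'    # Python 2 style literal, accepted again
modern = 'caf\xe9'     # the same text without the prefix

same_value = legacy == modern
same_type = type(legacy) is type(modern) is str
```

Both spellings produce an ordinary :class:`str`, so code shared between
Python 2 and Python 3.3 can keep its ``u`` prefixes.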


PEP 3155: Qualified name for classes and functions
==================================================

:pep:`3155` - Qualified name for classes and functions
  PEP written and implemented by Antoine Pitrou.

Functions and class objects have a new ``__qualname__`` attribute representing
the "path" from the module top-level to their definition. For global functions
and classes, this is the same as ``__name__``. For other functions and classes,
it provides better information about where they were actually defined, and
how they might be accessible from the global scope.

Example with (non-bound) methods::

    >>> class C:
    ...     def meth(self):
    ...         pass
    >>> C.meth.__name__
    'meth'
    >>> C.meth.__qualname__
    'C.meth'

Example with nested classes::

    >>> class C:
    ...     class D:
    ...         def meth(self):
    ...             pass
    ...
    >>> C.D.__name__
    'D'
    >>> C.D.__qualname__
    'C.D'
    >>> C.D.meth.__name__
    'meth'
    >>> C.D.meth.__qualname__
    'C.D.meth'

Example with nested functions::

    >>> def outer():
    ...     def inner():
    ...         pass
    ...     return inner
    ...
    >>> outer().__name__
    'inner'
    >>> outer().__qualname__
    'outer.<locals>.inner'

The string representation of those objects is also changed to include the
new, more precise information::

    >>> str(C.D)
    "<class '__main__.C.D'>"
    >>> str(C.D.meth)
    '<function C.D.meth at 0x7f46b9fe31e0>'


Other Language Changes
======================

Some smaller changes made to the core Python language are:

* Added support for Unicode name aliases and named sequences.
  Both :func:`unicodedata.lookup()` and ``'\N{...}'`` now resolve name aliases,
  and :func:`unicodedata.lookup()` resolves named sequences too.

  (Contributed by Ezio Melotti in :issue:`12753`)

* Equality comparisons on :func:`range` objects now return a result reflecting
  the equality of the underlying sequences generated by those range objects.

  (:issue:`13201`)

* The ``count()``, ``find()``, ``rfind()``, ``index()`` and ``rindex()``
  methods of :class:`bytes` and :class:`bytearray` objects now accept an
  integer between 0 and 255 as their first argument.

  (:issue:`12170`)

* A dict lookup now raises a :exc:`RuntimeError` if the dict is modified during
  the lookup. If you implement your own comparison function for objects used
  as dict keys and the dict is shared by multiple threads, access to the dict
  should be protected by a lock.

  (:issue:`14205`)
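
Two of the changes above, :func:`range` equality and integer arguments to the
bytes search methods, are easy to demonstrate. This is a small sketch with
arbitrary sample values.

```python
# Ranges compare equal when they generate the same sequence of values,
# even if their start/stop/step arguments differ.
same_items = range(0, 3, 2) == range(0, 4, 2)     # both yield 0, 2
both_empty = range(0) == range(2, 1, 3)           # both yield nothing
different = range(3) == range(4)

# An integer in range(256) stands for a single byte value.
data = b'banana'
count_a = data.count(97)                  # 97 is ord('a')
first_n = bytearray(data).find(110)       # 110 is ord('n')
last_a = data.rindex(97)
```

Passing an integer avoids building a length-1 bytes object when the byte
value is already at hand, for example after indexing into another bytes
object.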


New and Improved Modules
========================

abc
---

Improved support for abstract base classes containing descriptors composed with
abstract methods. The recommended approach to declaring abstract descriptors is
now to provide :attr:`__isabstractmethod__` as a dynamically updated
property. The built-in descriptors have been updated accordingly.

 * :class:`abc.abstractproperty` has been deprecated, use :class:`property`
   with :func:`abc.abstractmethod` instead.
 * :class:`abc.abstractclassmethod` has been deprecated, use
   :class:`classmethod` with :func:`abc.abstractmethod` instead.
 * :class:`abc.abstractstaticmethod` has been deprecated, use
   :class:`staticmethod` with :func:`abc.abstractmethod` instead.

(Contributed by Darren Dale in :issue:`11610`)
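
The recommended replacements stack :func:`abc.abstractmethod` as the innermost
decorator. The classes ``Shape`` and ``Square`` below are hypothetical and
only serve as a sketch of the pattern.

```python
import abc

class Shape(metaclass=abc.ABCMeta):
    @property
    @abc.abstractmethod
    def area(self):
        """Concrete subclasses must provide the area."""

    @classmethod
    @abc.abstractmethod
    def unit(cls):
        """Concrete subclasses must name their unit."""

class Square(Shape):
    def __init__(self, side):
        self.side = side

    @property
    def area(self):
        return self.side ** 2

    @classmethod
    def unit(cls):
        return "m^2"

# The abstract base cannot be instantiated; the concrete subclass can.
try:
    Shape()
except TypeError:
    abstract_rejected = True
else:
    abstract_rejected = False

square_area = Square(3).area
```

Both the property and the class method are registered as abstract, so a
subclass that overrides only one of them still cannot be instantiated.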

array
-----

The :mod:`array` module supports the :c:type:`long long` type using ``q`` and
``Q`` type codes.

(Contributed by Oren Tirosh and Hirokazu Yamamoto in :issue:`1172711`)
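
A minimal sketch of the new type codes, assuming a platform whose C compiler
provides ``long long`` (the values are arbitrary):

```python
from array import array

# 'q' is signed long long, 'Q' is unsigned long long.
signed = array('q', [-2**40, 2**40])
unsigned = array('Q', [2**63])

item_size = signed.itemsize    # size in bytes of one element
```

Values of this magnitude do not fit in a 32-bit element, which is why they
need the ``q`` and ``Q`` codes.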


bz2
---

The :mod:`bz2` module has been rewritten from scratch. In the process, several
new features have been added:

* :class:`bz2.BZ2File` can now read from and write to arbitrary file-like
  objects, by means of its constructor's *fileobj* argument.

  (Contributed by Nadeem Vawda in :issue:`5863`)

* :class:`bz2.BZ2File` and :func:`bz2.decompress` can now decompress
  multi-stream inputs (such as those produced by the :program:`pbzip2` tool).
  :class:`bz2.BZ2File` can now also be used to create this type of file, using
  the ``'a'`` (append) mode.

  (Contributed by Nir Aides in :issue:`1625`)

* :class:`bz2.BZ2File` now implements all of the :class:`io.BufferedIOBase` API,
  except for the :meth:`detach` and :meth:`truncate` methods.
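
The multi-stream and file-like-object support can be sketched as follows.
Note one assumption: the in-memory buffer is passed as the first positional
argument, which is how released Python versions accept a file object; the
sample payloads are arbitrary.

```python
import bz2
import io

# Two independently compressed streams, concatenated back to back.
multi_stream = bz2.compress(b"first stream, ") + bz2.compress(b"second stream")
combined = bz2.decompress(multi_stream)

# Assumption: a file-like object is given as the first positional argument.
buffer = io.BytesIO()
with bz2.BZ2File(buffer, "w") as archive:
    archive.write(b"payload")

# Closing the BZ2File leaves the caller's buffer open and readable.
round_trip = bz2.decompress(buffer.getvalue())
```

Writing to an in-memory buffer this way avoids a temporary file when the
compressed data is going to be sent elsewhere.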


codecs
------

The :mod:`~encodings.mbcs` codec has been rewritten to correctly handle
``replace`` and ``ignore`` error handlers on all Windows versions. The
:mod:`~encodings.mbcs` codec now supports all error handlers, instead of only
``replace`` to encode and ``ignore`` to decode.

A new Windows-only codec has been added: ``cp65001`` (:issue:`13216`). It is the
Windows code page 65001 (Windows UTF-8, ``CP_UTF8``). For example, it is used
by ``sys.stdout`` if the console output code page is set to cp65001 (e.g., using
``chcp 65001`` command).

Multibyte CJK decoders now resynchronize faster. They only ignore the first
byte of an invalid byte sequence. For example, ``b'\xff\n'.decode('gb2312',
'replace')`` now returns a ``\n`` after the replacement character.

(:issue:`12016`)

Incremental CJK codec encoders are no longer reset at each call to their
encode() methods. For example::

    $ ./python -q
    >>> import codecs
    >>> encoder = codecs.getincrementalencoder('hz')('strict')
    >>> b''.join(encoder.encode(x) for x in '\u52ff\u65bd\u65bc\u4eba\u3002 Bye.')
    b'~{NpJ)l6HK!#~} Bye.'

This example gives ``b'~{Np~}~{J)~}~{l6~}~{HK~}~{!#~} Bye.'`` with older Python
versions.

(:issue:`12100`)

The ``unicode_internal`` codec has been deprecated.

crypt
-----

Addition of salt and modular crypt format and the :func:`~crypt.mksalt`
function to the :mod:`crypt` module.

(:issue:`10924`)

curses
------

 * If the :mod:`curses` module is linked to the ncursesw library, use Unicode
   functions when Unicode strings or characters are passed (e.g.
   :c:func:`waddwstr`), and bytes functions otherwise (e.g. :c:func:`waddstr`).
 * Use the locale encoding instead of ``utf-8`` to encode Unicode strings.
 * :class:`curses.window` has a new :attr:`curses.window.encoding` attribute.
 * The :class:`curses.window` class has a new :meth:`~curses.window.get_wch`
   method to get a wide character.
 * The :mod:`curses` module has a new :func:`~curses.unget_wch` function to
   push a wide character so the next :meth:`~curses.window.get_wch` will return
   it.

(Contributed by Iñigo Serna in :issue:`6755`)

faulthandler
------------

New module: :mod:`faulthandler`. It can be enabled at startup with either of:

 * :envvar:`PYTHONFAULTHANDLER`
 * :option:`-X` ``faulthandler``
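
Besides the startup switches, the module can be used programmatically. The
sketch below only calls :func:`faulthandler.dump_traceback`, which writes the
current Python traceback to a file and does not change any global state; the
temporary file is an arbitrary choice of destination.

```python
import faulthandler
import tempfile

# dump_traceback() writes straight to the file descriptor, so the target
# must be a real file (anything with a working fileno()).
with tempfile.TemporaryFile() as log:
    faulthandler.dump_traceback(file=log)
    log.seek(0)
    report = log.read()

currently_enabled = faulthandler.is_enabled()
```

In a long-running program, :func:`faulthandler.enable` installs handlers that
produce the same kind of report automatically when the interpreter crashes.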

ftplib
------

The :class:`~ftplib.FTP_TLS` class now provides a new
:meth:`~ftplib.FTP_TLS.ccc` method to revert the control channel back to
plaintext. This can be useful to take advantage of firewalls that know how to
handle NAT with non-secure FTP without opening fixed ports.

(Contributed by Giampaolo Rodolà in :issue:`12139`)


imaplib
-------

The :class:`~imaplib.IMAP4_SSL` constructor now accepts an
:class:`~ssl.SSLContext` parameter to control parameters of the secure channel.

(Contributed by Sijin Joseph in :issue:`8808`)


io
--

The :func:`~io.open` function has a new ``'x'`` mode that can be used to
exclusively create a new file, and raise a :exc:`FileExistsError` if the file
already exists. It is based on the C11 'x' mode to fopen().

(Contributed by David Townshend in :issue:`12760`)
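
A short sketch of exclusive creation; the temporary directory and the file
name ``report.txt`` are arbitrary.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as workdir:
    path = os.path.join(workdir, "report.txt")

    # The first exclusive open creates the file.
    with open(path, "x") as handle:
        handle.write("first version")

    # A second exclusive open fails instead of truncating the file.
    try:
        open(path, "x")
    except FileExistsError:
        second_attempt_refused = True
    else:
        second_attempt_refused = False

    with open(path) as handle:
        contents = handle.read()
```

Unlike checking :func:`os.path.exists` before opening with ``'w'``, the check
and the creation happen in a single step, so there is no window in which
another process can create the file.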


lzma
----

The newly-added :mod:`lzma` module provides data compression and decompression
using the LZMA algorithm, including support for the ``.xz`` and ``.lzma``
file formats.

(Contributed by Nadeem Vawda and Per Øyvind Karlsen in :issue:`6715`)
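
A minimal round-trip sketch covering both container formats; the payload is
arbitrary.

```python
import lzma

payload = b"What's New In Python 3.3 " * 100

# The default container is .xz; FORMAT_ALONE selects the legacy .lzma one.
xz_blob = lzma.compress(payload)
legacy_blob = lzma.compress(payload, format=lzma.FORMAT_ALONE)

# decompress() detects the container format automatically by default.
restored = lzma.decompress(xz_blob)
restored_legacy = lzma.decompress(legacy_blob)
```

For file access, :class:`lzma.LZMAFile` offers an interface similar to
:class:`bz2.BZ2File`.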


math
----

The :mod:`math` module has a new function:

 * :func:`~math.log2`: return the base-2 logarithm of *x*
   (Written by Mark Dickinson in :issue:`11888`).
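
As a quick sketch, :func:`~math.log2` can be used where ``math.log(x, 2)``
was used before. The checks below assume CPython, where the result is exact
for powers of two.

```python
import math

ten = math.log2(1024)

# Every power of two that fits in 64 bits maps back to its exponent.
exact_for_powers = all(math.log2(2 ** k) == k for k in range(64))
```

The two-argument form ``math.log(x, 2)`` divides two rounded logarithms and
so cannot make the same guarantee.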
650
651
652nntplib
653-------
654
655The :class:`nntplib.NNTP` class now supports the context manager protocol to
656unconditionally consume :exc:`socket.error` exceptions and to close the NNTP
657connection when done::
658
659 >>> from nntplib import NNTP
Ezio Melotti3c14b4e2011-07-13 11:44:44 +0300660 >>> with NNTP('news.gmane.org') as n:
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200661 ... n.group('gmane.comp.python.committers')
662 ...
Ezio Melotti04f648c2011-07-26 09:37:46 +0300663 ('211 1755 1 1755 gmane.comp.python.committers', 1755, 1, 1755, 'gmane.comp.python.committers')
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200664 >>>
665
(Contributed by Giampaolo Rodolà in :issue:`9795`)


os
--

* The :mod:`os` module has a new :func:`~os.pipe2` function that makes it
  possible to create a pipe with :data:`~os.O_CLOEXEC` or
  :data:`~os.O_NONBLOCK` flags set atomically. This is especially useful to
  avoid race conditions in multi-threaded programs.

* The :mod:`os` module has a new :func:`~os.sendfile` function which provides
  an efficient "zero-copy" way for copying data from one file (or socket)
  descriptor to another. The phrase "zero-copy" refers to the fact that all of
  the copying of data between the two descriptors is done entirely by the
  kernel, with no copying of data into userspace buffers. :func:`~os.sendfile`
  can be used to efficiently copy data from a file on disk to a network socket,
  e.g. for downloading a file.

  (Patch submitted by Ross Lagerwall and Giampaolo Rodolà in :issue:`10882`.)

* The :mod:`os` module has two new functions: :func:`~os.getpriority` and
  :func:`~os.setpriority`. They can be used to get or set process
  niceness/priority in a fashion similar to :func:`os.nice` but extended to all
  processes instead of just the current one.

  (Patch submitted by Giampaolo Rodolà in :issue:`10784`.)

* The :mod:`os` module has a new :func:`~os.fwalk` function similar to
  :func:`~os.walk` except that it also yields file descriptors referring to the
  directories visited. This is especially useful to avoid symlink races.

* "at" functions (:issue:`4761`):

  * :func:`~os.faccessat`
  * :func:`~os.fchmodat`
  * :func:`~os.fchownat`
  * :func:`~os.fstatat`
  * :func:`~os.futimesat`
  * :func:`~os.linkat`
  * :func:`~os.mkdirat`
  * :func:`~os.mkfifoat`
  * :func:`~os.mknodat`
  * :func:`~os.openat`
  * :func:`~os.readlinkat`
  * :func:`~os.renameat`
  * :func:`~os.symlinkat`
  * :func:`~os.unlinkat`
  * :func:`~os.utimensat`

* Extended attributes (:issue:`12720`):

  * :func:`~os.fgetxattr`
  * :func:`~os.flistxattr`
  * :func:`~os.fremovexattr`
  * :func:`~os.fsetxattr`
  * :func:`~os.getxattr`
  * :func:`~os.lgetxattr`
  * :func:`~os.listxattr`
  * :func:`~os.llistxattr`
  * :func:`~os.lremovexattr`
  * :func:`~os.lsetxattr`
  * :func:`~os.removexattr`
  * :func:`~os.setxattr`

* Scheduler functions (:issue:`12655`):

  * :func:`~os.sched_get_priority_max`
  * :func:`~os.sched_get_priority_min`
  * :func:`~os.sched_getaffinity`
  * :func:`~os.sched_getparam`
  * :func:`~os.sched_getscheduler`
  * :func:`~os.sched_rr_get_interval`
  * :func:`~os.sched_setaffinity`
  * :func:`~os.sched_setparam`
  * :func:`~os.sched_setscheduler`
  * :func:`~os.sched_yield`

* Additional POSIX functions (:issue:`10812`):

  * :func:`~os.fexecve`
  * :func:`~os.futimens`
  * :func:`~os.futimes`
  * :func:`~os.lockf`
  * :func:`~os.lutimes`
  * :func:`~os.posix_fadvise`
  * :func:`~os.posix_fallocate`
  * :func:`~os.pread`
  * :func:`~os.pwrite`
  * :func:`~os.readv`
  * :func:`~os.sync`
  * :func:`~os.truncate`
  * :func:`~os.waitid`
  * :func:`~os.writev`

* Other new functions:

  * :func:`~os.flistdir` (:issue:`10755`)
  * :func:`~os.getgrouplist` (:issue:`9344`)

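The following is a minimal sketch, not taken from the patches themselves, of
how :func:`~os.pipe2`, :func:`~os.sendfile`, :func:`~os.getpriority` and
:func:`~os.fwalk` might be used. It assumes a Linux system; the payload and
the variable names are purely illustrative::

    import os
    import socket
    import tempfile

    # pipe2(): both ends are created close-on-exec in a single step, so no
    # other thread can fork between pipe creation and setting the flag.
    read_fd, write_fd = os.pipe2(os.O_CLOEXEC)
    os.close(read_fd)
    os.close(write_fd)

    # sendfile(): the kernel copies 15 bytes from a file to a socket without
    # passing them through a userspace buffer.
    with tempfile.TemporaryFile() as source:
        source.write(b"hello, sendfile")
        source.flush()
        sender, receiver = socket.socketpair()
        with sender, receiver:
            sent = os.sendfile(sender.fileno(), source.fileno(), 0, 15)
            received = receiver.recv(sent)

    # getpriority(): a "who" value of 0 refers to the calling process.
    niceness = os.getpriority(os.PRIO_PROCESS, 0)

    # fwalk(): each step also yields an open descriptor for the directory.
    for root, dirs, files, dir_fd in os.fwalk("."):
        same_directory = os.path.samestat(os.fstat(dir_fd), os.stat(root))
        break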

packaging
---------

:mod:`distutils` has undergone additions and refactoring under a new name,
:mod:`packaging`, to allow developers to break backward compatibility.
:mod:`distutils` is still provided in the standard library, but users are
encouraged to transition to :mod:`packaging`. For older versions of Python, a
backport compatible with 2.4+ and 3.1+ will be made available on PyPI under the
name :mod:`distutils2`.

.. TODO add examples and howto to the packaging docs and link to them


pdb
---

* Tab-completion is now available not only for command names, but also their
  arguments. For example, for the ``break`` command, function and file names
  are completed. (Contributed by Georg Brandl in :issue:`14210`)


pydoc
-----

The Tk GUI and the :func:`~pydoc.serve` function have been removed from the
:mod:`pydoc` module: ``pydoc -g`` and :func:`~pydoc.serve` were deprecated
in Python 3.2.


sched
-----

* :meth:`~sched.scheduler.run` now accepts a *blocking* parameter which, when
  set to ``False``, makes the method execute the scheduled events due to expire
  soonest (if any) and then return immediately.
  This is useful in case you want to use the :class:`~sched.scheduler` in
  non-blocking applications. (Contributed by Giampaolo Rodolà in :issue:`13449`)

* The :class:`~sched.scheduler` class can now be safely used in multi-threaded
  environments. (Contributed by Josiah Carlson and Giampaolo Rodolà in
  :issue:`8684`)

* The *timefunc* and *delayfunc* parameters of the :class:`~sched.scheduler`
  class constructor are now optional and default to :func:`time.time` and
  :func:`time.sleep` respectively. (Contributed by Chris Clark in
  :issue:`13245`)

* The *argument* parameter of :meth:`~sched.scheduler.enter` and
  :meth:`~sched.scheduler.enterabs` is now optional. (Contributed by Chris
  Clark in :issue:`13245`)

* :meth:`~sched.scheduler.enter` and :meth:`~sched.scheduler.enterabs`
  now accept a *kwargs* parameter. (Contributed by Chris Clark in
  :issue:`13245`)

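The scheduler changes listed above can be sketched as follows. This is only an
illustration: the ``record`` helper, its arguments and the delays are made up
for the example::

    import sched

    calls = []

    def record(label, tag=None):
        calls.append((label, tag))

    # timefunc and delayfunc are omitted, so the defaults apply.
    scheduler = sched.scheduler()

    # Due immediately; keyword arguments are passed through kwargs.
    scheduler.enter(0, 1, record, argument=("now",), kwargs={"tag": "due"})
    # Due in an hour; argument is left out for an action that takes none.
    scheduler.enter(3600, 1, calls.clear)

    # Run whatever is already due, then return instead of sleeping.
    scheduler.run(blocking=False)

After the call, only the first event has run; the second one is still queued
and would be picked up by a later :meth:`~sched.scheduler.run` call.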

shutil
------

* The :mod:`shutil` module has these new functions:

  * :func:`~shutil.disk_usage`: provides total, used and free disk space
    statistics. (Contributed by Giampaolo Rodolà in :issue:`12442`)
  * :func:`~shutil.chown`: allows one to change the user and/or group of the
    given path, specifying them by name as well as by numeric id.
    (Contributed by Sandro Tosi in :issue:`12191`)

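As a brief illustration of :func:`~shutil.disk_usage` (a sketch only; the path
``"."`` is an arbitrary choice)::

    import shutil

    # The result is a named tuple of byte counts for the filesystem that
    # contains the given path.
    usage = shutil.disk_usage(".")
    print(usage.total, usage.used, usage.free)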

signal
------

* The :mod:`signal` module has new functions:

  * :func:`~signal.pthread_sigmask`: fetch and/or change the signal mask of the
    calling thread (Contributed by Jean-Paul Calderone in :issue:`8407`);
  * :func:`~signal.pthread_kill`: send a signal to a thread;
  * :func:`~signal.sigpending`: examine pending signals;
  * :func:`~signal.sigwait`: wait for a signal;
  * :func:`~signal.sigwaitinfo`: wait for a signal, returning detailed
    information about it;
  * :func:`~signal.sigtimedwait`: like :func:`~signal.sigwaitinfo` but with a
    timeout.

* The signal handler writes the signal number as a single byte instead of
  a nul byte into the wakeup file descriptor. So it is possible to wait for
  more than one signal and know which signals were raised.

* :func:`signal.signal` and :func:`signal.siginterrupt` raise an
  :exc:`OSError` instead of a :exc:`RuntimeError`: :exc:`OSError` has an
  ``errno`` attribute.

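A minimal sketch of how the new signal functions fit together, assuming a
POSIX platform where ``SIGUSR1`` is available. The choice of signal is
arbitrary::

    import signal
    import threading

    # Block SIGUSR1 in the calling thread so it is queued, not delivered.
    signal.pthread_sigmask(signal.SIG_BLOCK, {signal.SIGUSR1})

    # Send the signal to this very thread; it stays pending while blocked.
    signal.pthread_kill(threading.get_ident(), signal.SIGUSR1)
    pending = signal.sigpending()

    # Consume the pending signal synchronously, then unblock it again.
    received = signal.sigwait({signal.SIGUSR1})
    signal.pthread_sigmask(signal.SIG_UNBLOCK, {signal.SIGUSR1})

Because the signal is consumed by :func:`~signal.sigwait`, no Python-level
handler is needed in this sketch.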

smtplib
-------

The :class:`~smtplib.SMTP_SSL` constructor and the :meth:`~smtplib.SMTP.starttls`
method now accept an :class:`~ssl.SSLContext` parameter to control parameters
of the secure channel.

(Contributed by Kasun Herath in :issue:`8809`)


socket
------

* The :class:`~socket.socket` class now exposes additional methods to process
  ancillary data when supported by the underlying platform:

  * :meth:`~socket.socket.sendmsg`
  * :meth:`~socket.socket.recvmsg`
  * :meth:`~socket.socket.recvmsg_into`

  (Contributed by David Watson in :issue:`6560`, based on an earlier patch by
  Heiko Wundram)

* The :class:`~socket.socket` class now supports the PF_CAN protocol family
  (http://en.wikipedia.org/wiki/Socketcan), on Linux
  (http://lwn.net/Articles/253425).

  (Contributed by Matthias Fuchs, updated by Tiago Gonçalves in :issue:`10141`)

* The :class:`~socket.socket` class now supports the PF_RDS protocol family
  (http://en.wikipedia.org/wiki/Reliable_Datagram_Sockets and
  http://oss.oracle.com/projects/rds/).

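A minimal sketch of the new message-oriented socket methods, assuming a
platform that provides :func:`socket.socketpair`. No ancillary data is
attached here, so the ancillary list comes back empty::

    import socket

    left, right = socket.socketpair()
    with left, right:
        # sendmsg() gathers several buffers into a single message.
        left.sendmsg([b"hello ", b"world"])
        # recvmsg() returns the data together with any ancillary data,
        # the message flags and the sender address.
        data, ancdata, flags, address = right.recvmsg(1024, 1024)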

ssl
---

* The :mod:`ssl` module has two new random generation functions:

  * :func:`~ssl.RAND_bytes`: generate cryptographically strong
    pseudo-random bytes.
  * :func:`~ssl.RAND_pseudo_bytes`: generate pseudo-random bytes.

  (Contributed by Victor Stinner in :issue:`12049`)

* The :mod:`ssl` module now exposes a finer-grained exception hierarchy
  in order to make it easier to inspect the various kinds of errors.

  (Contributed by Antoine Pitrou in :issue:`11183`)

* :meth:`~ssl.SSLContext.load_cert_chain` now accepts a *password* argument
  to be used if the private key is encrypted.

  (Contributed by Adam Simpkins in :issue:`12803`)

* Diffie-Hellman key exchange, both regular and Elliptic Curve-based, is
  now supported through the :meth:`~ssl.SSLContext.load_dh_params` and
  :meth:`~ssl.SSLContext.set_ecdh_curve` methods.

  (Contributed by Antoine Pitrou in :issue:`13626` and :issue:`13627`)

* SSL sockets have a new :meth:`~ssl.SSLSocket.get_channel_binding` method
  allowing the implementation of certain authentication mechanisms such as
  SCRAM-SHA-1-PLUS.

  (Contributed by Jacek Konieczny in :issue:`12551`)

* You can query the SSL compression algorithm used by an SSL socket, thanks
  to its new :meth:`~ssl.SSLSocket.compression` method.

  (Contributed by Antoine Pitrou in :issue:`13634`)

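Two of the additions above can be tried without a network connection. This is
a minimal sketch; the 16-byte length is an arbitrary choice and
``SSLZeroReturnError`` is used only as one example of the new exception
classes::

    import ssl

    # 16 cryptographically strong pseudo-random bytes, e.g. for a nonce.
    nonce = ssl.RAND_bytes(16)

    # The finer-grained exceptions derive from SSLError, itself an OSError.
    hierarchy_ok = (issubclass(ssl.SSLZeroReturnError, ssl.SSLError)
                    and issubclass(ssl.SSLError, OSError))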

sys
---

* The :mod:`sys` module has a new :data:`~sys.thread_info` :term:`struct
  sequence` holding information about the thread implementation.

  (:issue:`11223`)

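For instance, the fields can be inspected like this (a sketch; the values
printed depend on the platform)::

    import sys

    info = sys.thread_info
    # name identifies the thread implementation; lock and version describe
    # the lock implementation and the thread library, and may be None.
    print(info.name, info.lock, info.version)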

time
----

The :mod:`time` module has new functions:

* :func:`~time.clock_getres` and :func:`~time.clock_gettime` functions and
  ``CLOCK_xxx`` constants.
* :func:`~time.monotonic`: monotonic clock.
* :func:`~time.wallclock`.

(Contributed by Victor Stinner in :issue:`10278`)

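A minimal sketch of measuring an interval with the monotonic clock. It assumes
a Unix platform where ``time.CLOCK_MONOTONIC`` is defined; the sleep duration
is arbitrary::

    import time

    # monotonic() cannot go backwards, so the difference is never negative
    # even if the system clock is adjusted in the meantime.
    start = time.monotonic()
    time.sleep(0.01)
    elapsed = time.monotonic() - start

    # clock_gettime() and clock_getres() take one of the CLOCK_xxx constants.
    now = time.clock_gettime(time.CLOCK_MONOTONIC)
    resolution = time.clock_getres(time.CLOCK_MONOTONIC)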

urllib
------

The :class:`~urllib.request.Request` class now accepts a *method* argument
used by :meth:`~urllib.request.Request.get_method` to determine what HTTP method
should be used. For example, this will send a ``'HEAD'`` request::

   >>> from urllib.request import Request, urlopen
   >>> urlopen(Request('http://www.python.org', method='HEAD'))

(:issue:`1673007`)

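The effect of the *method* argument can also be checked without opening a
connection. In this sketch the URL and the request body are placeholders::

    from urllib.request import Request

    head = Request("http://www.python.org", method="HEAD")
    default = Request("http://www.python.org")
    with_data = Request("http://www.python.org", data=b"x=1")

    # An explicit method wins; otherwise the presence of data decides.
    methods = [r.get_method() for r in (head, default, with_data)]
    print(methods)   # ['HEAD', 'GET', 'POST']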

Optimizations
=============

Major performance enhancements have been added:

* Thanks to :pep:`393`, some operations on Unicode strings have been optimized:

  * the memory footprint is divided by 2 to 4 depending on the text
  * encoding an ASCII string to UTF-8 doesn't need to encode characters anymore,
    the UTF-8 representation is shared with the ASCII representation
  * the UTF-8 encoder has been optimized
  * repeating a single ASCII letter and getting a substring of an ASCII string
    is 4 times faster

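The dependence of the memory footprint on the text can be observed with
:func:`sys.getsizeof`. This is a CPython-specific sketch; the sample strings
are arbitrary and the exact byte counts vary between builds::

    import sys

    samples = {
        "ASCII": "a" * 1000,               # 1 byte per character
        "BMP": "\u20ac" * 1000,            # 2 bytes per character
        "non-BMP": "\U0001F600" * 1000,    # 4 bytes per character
    }
    sizes = {name: sys.getsizeof(text) for name, text in samples.items()}
    print(sizes)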

Build and C API Changes
=======================

Changes to Python's build process and to the C API include:

* New :pep:`3118` related function:

  * :c:func:`PyMemoryView_FromMemory`

* :pep:`393` added new Unicode types, macros and functions:

  * High-level API:

    * :c:func:`PyUnicode_CopyCharacters`
    * :c:func:`PyUnicode_FindChar`
    * :c:func:`PyUnicode_GetLength`, :c:macro:`PyUnicode_GET_LENGTH`
    * :c:func:`PyUnicode_New`
    * :c:func:`PyUnicode_Substring`
    * :c:func:`PyUnicode_ReadChar`, :c:func:`PyUnicode_WriteChar`

  * Low-level API:

    * :c:type:`Py_UCS1`, :c:type:`Py_UCS2`, :c:type:`Py_UCS4` types
    * :c:type:`PyASCIIObject` and :c:type:`PyCompactUnicodeObject` structures
    * :c:macro:`PyUnicode_READY`
    * :c:func:`PyUnicode_FromKindAndData`
    * :c:func:`PyUnicode_AsUCS4`, :c:func:`PyUnicode_AsUCS4Copy`
    * :c:macro:`PyUnicode_DATA`, :c:macro:`PyUnicode_1BYTE_DATA`,
      :c:macro:`PyUnicode_2BYTE_DATA`, :c:macro:`PyUnicode_4BYTE_DATA`
    * :c:macro:`PyUnicode_KIND` with :c:type:`PyUnicode_Kind` enum:
      :c:data:`PyUnicode_WCHAR_KIND`, :c:data:`PyUnicode_1BYTE_KIND`,
      :c:data:`PyUnicode_2BYTE_KIND`, :c:data:`PyUnicode_4BYTE_KIND`
    * :c:macro:`PyUnicode_READ`, :c:macro:`PyUnicode_READ_CHAR`,
      :c:macro:`PyUnicode_WRITE`
    * :c:macro:`PyUnicode_MAX_CHAR_VALUE`



Deprecated
==========

Unsupported Operating Systems
-----------------------------

OS/2 and VMS are no longer supported due to the lack of a maintainer.

Windows 2000 and Windows platforms which set ``COMSPEC`` to ``command.com``
are no longer supported due to maintenance burden.


Deprecated Python modules, functions and methods
------------------------------------------------

* The :mod:`packaging` module replaces the :mod:`distutils` module.
* The ``unicode_internal`` codec has been deprecated because of
  :pep:`393`; use UTF-8, UTF-16 (``utf-16-le`` or ``utf-16-be``), or UTF-32
  (``utf-32-le`` or ``utf-32-be``) instead.
* :meth:`ftplib.FTP.nlst` and :meth:`ftplib.FTP.dir`: use
  :meth:`ftplib.FTP.mlsd`.
* :func:`platform.popen`: use the :mod:`subprocess` module. Check especially
  the :ref:`subprocess-replacements` section.
* :issue:`13374`: The Windows bytes API has been deprecated in the :mod:`os`
  module. Use Unicode filenames, instead of bytes filenames, to not depend on
  the ANSI code page anymore and to support any filename.
* :issue:`13988`: The :mod:`xml.etree.cElementTree` module is deprecated. The
  accelerator is used automatically whenever available.


Deprecated functions and types of the C API
-------------------------------------------

The :c:type:`Py_UNICODE` type has been deprecated by :pep:`393` and will be
removed in Python 4. All functions using this type are deprecated:

Unicode functions and methods using :c:type:`Py_UNICODE` and
:c:type:`Py_UNICODE*` types:

 * :c:macro:`PyUnicode_FromUnicode`: use :c:func:`PyUnicode_FromWideChar` or
   :c:func:`PyUnicode_FromKindAndData`
 * :c:macro:`PyUnicode_AS_UNICODE`, :c:func:`PyUnicode_AsUnicode`,
   :c:func:`PyUnicode_AsUnicodeAndSize`: use :c:func:`PyUnicode_AsWideCharString`
 * :c:macro:`PyUnicode_AS_DATA`: use :c:macro:`PyUnicode_DATA` with
   :c:macro:`PyUnicode_READ` and :c:macro:`PyUnicode_WRITE`
 * :c:macro:`PyUnicode_GET_SIZE`, :c:func:`PyUnicode_GetSize`: use
   :c:macro:`PyUnicode_GET_LENGTH` or :c:func:`PyUnicode_GetLength`
 * :c:macro:`PyUnicode_GET_DATA_SIZE`: use
   ``PyUnicode_GET_LENGTH(str) * PyUnicode_KIND(str)`` (only works on ready
   strings)
 * :c:func:`PyUnicode_AsUnicodeCopy`: use :c:func:`PyUnicode_AsUCS4Copy` or
   :c:func:`PyUnicode_AsWideCharString`
 * :c:func:`PyUnicode_GetMax`


Functions and macros manipulating Py_UNICODE* strings:

 * :c:macro:`Py_UNICODE_strlen`: use :c:func:`PyUnicode_GetLength` or
   :c:macro:`PyUnicode_GET_LENGTH`
 * :c:macro:`Py_UNICODE_strcat`: use :c:func:`PyUnicode_CopyCharacters` or
   :c:func:`PyUnicode_FromFormat`
 * :c:macro:`Py_UNICODE_strcpy`, :c:macro:`Py_UNICODE_strncpy`,
   :c:macro:`Py_UNICODE_COPY`: use :c:func:`PyUnicode_CopyCharacters` or
   :c:func:`PyUnicode_Substring`
 * :c:macro:`Py_UNICODE_strcmp`: use :c:func:`PyUnicode_Compare`
 * :c:macro:`Py_UNICODE_strncmp`: use :c:func:`PyUnicode_Tailmatch`
 * :c:macro:`Py_UNICODE_strchr`, :c:macro:`Py_UNICODE_strrchr`: use
   :c:func:`PyUnicode_FindChar`
 * :c:macro:`Py_UNICODE_FILL`: use :c:func:`PyUnicode_Fill`
 * :c:macro:`Py_UNICODE_MATCH`

Encoders:

 * :c:func:`PyUnicode_Encode`: use :c:func:`PyUnicode_AsEncodedObject`
 * :c:func:`PyUnicode_EncodeUTF7`
 * :c:func:`PyUnicode_EncodeUTF8`: use :c:func:`PyUnicode_AsUTF8` or
   :c:func:`PyUnicode_AsUTF8String`
 * :c:func:`PyUnicode_EncodeUTF32`
 * :c:func:`PyUnicode_EncodeUTF16`
 * :c:func:`PyUnicode_EncodeUnicodeEscape`: use
   :c:func:`PyUnicode_AsUnicodeEscapeString`
 * :c:func:`PyUnicode_EncodeRawUnicodeEscape`: use
   :c:func:`PyUnicode_AsRawUnicodeEscapeString`
 * :c:func:`PyUnicode_EncodeLatin1`: use :c:func:`PyUnicode_AsLatin1String`
 * :c:func:`PyUnicode_EncodeASCII`: use :c:func:`PyUnicode_AsASCIIString`
 * :c:func:`PyUnicode_EncodeCharmap`
 * :c:func:`PyUnicode_TranslateCharmap`
 * :c:func:`PyUnicode_EncodeMBCS`: use :c:func:`PyUnicode_AsMBCSString` or
   :c:func:`PyUnicode_EncodeCodePage` (with ``CP_ACP`` code_page)
 * :c:func:`PyUnicode_EncodeDecimal`,
   :c:func:`PyUnicode_TransformDecimalToASCII`


Porting to Python 3.3
=====================

This section lists previously described changes and other bugfixes
that may require changes to your code.

Porting Python code
-------------------

.. XXX add a point about hash randomization and that it's always on in 3.3

* :issue:`12326`: On Linux, ``sys.platform`` doesn't contain the major version
  anymore. It is now always ``'linux'``, instead of ``'linux2'`` or
  ``'linux3'`` depending on the Linux version used to build Python. Replace
  ``sys.platform == 'linux2'`` with ``sys.platform.startswith('linux')``, or
  directly ``sys.platform == 'linux'`` if you don't need to support older
  Python versions.

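A check that works on both older and newer Python versions might look like
this (a sketch; the ``on_linux`` name is arbitrary)::

    import sys

    # Matches 'linux' as well as the older 'linux2' and 'linux3' values.
    on_linux = sys.platform.startswith("linux")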

Porting C code
--------------

* In the course of changes to the buffer API the undocumented
  :c:member:`~Py_buffer.smalltable` member of the
  :c:type:`Py_buffer` structure has been removed and the
  layout of the :c:type:`PyMemoryViewObject` has changed.

  All extensions relying on the relevant parts in ``memoryobject.h``
  or ``object.h`` must be rebuilt.

* Due to :ref:`PEP 393 <pep-393>`, the :c:type:`Py_UNICODE` type and all
  functions using this type are deprecated (but will stay available for
  at least five years). If you were using low-level Unicode APIs to
  construct and access unicode objects and you want to benefit from the
  memory footprint reduction provided by PEP 393, you have to convert
  your code to the new :doc:`Unicode API <../c-api/unicode>`.

  However, if you only have been using high-level functions such as
  :c:func:`PyUnicode_Concat`, :c:func:`PyUnicode_Join` or
  :c:func:`PyUnicode_FromFormat`, your code will automatically take
  advantage of the new unicode representations.

Building C extensions
---------------------

* The range of possible file names for C extensions has been narrowed.
  Very rarely used spellings have been suppressed: under POSIX, files
  named ``xxxmodule.so``, ``xxxmodule.abi3.so`` and
  ``xxxmodule.cpython-*.so`` are no longer recognized as implementing
  the ``xxx`` module. If you had been generating such files, you have
  to switch to the other spellings (i.e., remove the ``module`` string
  from the file names).

  (Implemented in :issue:`14040`.)


Other issues
------------

.. Issue #11591: When :program:`python` was started with :option:`-S`,
   ``import site`` will not add site-specific paths to the module search
   paths. In previous versions, it did. See changeset for doc changes in
   various files. Contributed by Carl Meyer with edits by Éric Araujo.

.. Issue #10998: the -Q command-line flag and related artifacts have been
   removed. Code checking sys.flags.division_warning will need updating.
   Contributed by Éric Araujo.