****************************
  What's New In Python 3.3
****************************

:Author: Raymond Hettinger
:Release: |release|
:Date: |today|

.. Rules for maintenance:

   * Anyone can add text to this document. Do not spend very much time
   on the wording of your changes, because your text will probably
   get rewritten to some degree.

   * The maintainer will go through Misc/NEWS periodically and add
   changes; it's therefore more important to add your changes to
   Misc/NEWS than to this file.

   * This is not a complete list of every single change; completeness
   is the purpose of Misc/NEWS. Some changes I consider too small
   or esoteric to include. If such a change is added to the text,
   I'll just remove it. (This is another reason you shouldn't spend
   too much time on writing your addition.)

   * If you want to draw your new text to the attention of the
   maintainer, add 'XXX' to the beginning of the paragraph or
   section.

   * It's OK to just add a fragmentary note about a change. For
   example: "XXX Describe the transmogrify() function added to the
   socket module." The maintainer will research the change and
   write the necessary text.

   * You can comment out your additions if you like, but it's not
   necessary (especially when a final release is some months away).

   * Credit the author of a patch or bugfix. Just the name is
   sufficient; the e-mail address isn't necessary.

   * It's helpful to add the bug/patch number as a comment:

   XXX Describe the transmogrify() function added to the socket
   module.
   (Contributed by P.Y. Developer in :issue:`12345`.)

   This saves the maintainer the effort of going through the Mercurial log
   when researching a change.

This article explains the new features in Python 3.3, compared to 3.2.


.. _pep-3118-update:

PEP 3118: New memoryview implementation and buffer protocol documentation
=========================================================================

:issue:`10181` - memoryview bug fixes and features.
 Written by Stefan Krah.

The new memoryview implementation comprehensively fixes all ownership and
lifetime issues of dynamically allocated fields in the Py_buffer struct
that led to multiple crash reports. Additionally, several functions that
crashed or returned incorrect results for non-contiguous or multi-dimensional
input have been fixed.

The memoryview object now has a PEP-3118 compliant getbufferproc()
that checks the consumer's request type. Many new features have been
added; most of them work in full generality for non-contiguous arrays
and arrays with suboffsets.

The documentation has been updated, clearly spelling out responsibilities
for both exporters and consumers. Buffer request flags are grouped into
basic and compound flags. The memory layout of non-contiguous and
multi-dimensional NumPy-style arrays is explained.

Features
--------

* All native single character format specifiers in struct module syntax
  (optionally prefixed with '@') are now supported.

* With some restrictions, the cast() method allows changing of format and
  shape of C-contiguous arrays.

* Multi-dimensional list representations are supported for any array type.

* Multi-dimensional comparisons are supported for any array type.

* All array types are hashable if the exporting object is hashable
  and the view is read-only. (Contributed by Antoine Pitrou in
  :issue:`13411`)

* Arbitrary slicing of any 1-D array type is supported. For example, it
  is now possible to reverse a memoryview in O(1) by using a negative step.
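
The features listed above can be tried out with a few lines of Python. This is only a minimal sketch: the variable names are illustrative, and the six-byte buffer is an arbitrary choice made for the example.

```python
# A flat, read-only buffer holding the six bytes 0, 1, 2, 3, 4, 5.
flat = memoryview(bytes(range(6)))

# cast() reinterprets the same memory with a new shape (2 rows x 3 columns).
grid = flat.cast('B', shape=[2, 3])
nested = grid.tolist()  # multi-dimensional list representation

# Multi-dimensional comparison of two views with equal shape and contents.
same_grid = memoryview(bytes(range(6))).cast('B', shape=[2, 3])
grids_equal = (grid == same_grid)

# A read-only view over a hashable exporter (bytes) is itself hashable.
hashes_match = hash(memoryview(b'abc')) == hash(b'abc')

# Slicing with a negative step reverses the view without copying the data.
reversed_bytes = flat[::-1].tobytes()
```

Only the final ``tobytes()`` call copies data; the cast and the reversed slice are views over the original buffer.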

API changes
-----------

* The maximum number of dimensions is officially limited to 64.

* The representation of empty shape, strides and suboffsets is now
  an empty tuple instead of None.

* Accessing a memoryview element with format 'B' (unsigned bytes)
  now returns an integer (in accordance with the struct module syntax).
  For returning a bytes object the view must be cast to 'c' first.

* For further changes see `Build and C API Changes`_ and `Porting C code`_.
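
The change in element access can be illustrated as follows. This is a small sketch with illustrative variable names, using a bytes object as the exporter.

```python
# bytes objects export their data with format 'B' (unsigned bytes).
view = memoryview(b'abc')

# Indexing a 'B' view yields an integer.
first_as_int = view[0]

# Casting to 'c' first yields a bytes object of length 1 instead.
first_as_bytes = view.cast('c')[0]
```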

.. _pep-393:

PEP 393: Flexible String Representation
=======================================

The Unicode string type is changed to support multiple internal
representations, depending on the character with the largest Unicode ordinal
(1, 2, or 4 bytes) in the represented string. This allows a space-efficient
representation in common cases, but gives access to full UCS-4 on all
systems. For compatibility with existing APIs, several representations may
exist in parallel; over time, this compatibility should be phased out.

On the Python side, there should be no downside to this change.

On the C API side, PEP 393 is fully backward compatible. The legacy API
should remain available for at least five years. Applications using the legacy
API will not fully benefit from the memory reduction, or - worse - may use
a bit more memory, because Python may have to maintain two versions of each
string (in the legacy format and in the new efficient storage).

Functionality
-------------

Changes introduced by :pep:`393` are the following:

* Python now always supports the full range of Unicode codepoints, including
  non-BMP ones (i.e. from ``U+0000`` to ``U+10FFFF``). The distinction between
  narrow and wide builds no longer exists and Python now behaves like a wide
  build, even under Windows.

* With the death of narrow builds, the problems specific to narrow builds have
  also been fixed, for example:

  * :func:`len` now always returns 1 for non-BMP characters,
    so ``len('\U0010FFFF') == 1``;

  * surrogate pairs are not recombined in string literals,
    so ``'\uDBFF\uDFFF' != '\U0010FFFF'``;

  * indexing or slicing non-BMP characters returns the expected value,
    so ``'\U0010FFFF'[0]`` now returns ``'\U0010FFFF'`` and not ``'\uDBFF'``;

  * all other functions in the standard library now correctly handle
    non-BMP codepoints.

* The value of :data:`sys.maxunicode` is now always ``1114111`` (``0x10FFFF``
  in hexadecimal). The :c:func:`PyUnicode_GetMax` function still returns
  either ``0xFFFF`` or ``0x10FFFF`` for backward compatibility, and it should
  not be used with the new Unicode API (see :issue:`13054`).

* The :file:`./configure` flag ``--with-wide-unicode`` has been removed.
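
The Python-level behaviour described in this list can be checked directly. The sketch below only restates the expressions quoted above; the variable names are illustrative.

```python
import sys

highest = '\U0010FFFF'            # the last valid Unicode codepoint (non-BMP)
surrogate_pair = '\uDBFF\uDFFF'   # two separate surrogate code points

length_of_non_bmp = len(highest)              # one codepoint, one character
first_character = highest[0]                  # indexing returns the whole character
pair_is_distinct = surrogate_pair != highest  # literals are not recombined
max_codepoint = sys.maxunicode                # same on every build and platform
```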

Performance and resource usage
------------------------------

The storage of Unicode strings now depends on the highest codepoint in the string:

* pure ASCII and Latin1 strings (``U+0000-U+00FF``) use 1 byte per codepoint;

* BMP strings (``U+0000-U+FFFF``) use 2 bytes per codepoint;

* non-BMP strings (``U+10000-U+10FFFF``) use 4 bytes per codepoint.

The net effect is that for most applications, memory usage of string
storage should decrease significantly - especially compared to former
wide unicode builds - as, in many cases, strings will be pure ASCII
even in international contexts (because many strings store non-human
language data, such as XML fragments, HTTP headers, JSON-encoded data,
etc.). We also hope that it will, for the same reasons, increase CPU
cache efficiency on non-trivial applications. The memory usage of
Python 3.3 is two to three times smaller than Python 3.2, and a little
bit better than Python 2.7, on a Django benchmark (see the PEP for
details).
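
One rough way to observe the per-codepoint storage is to measure how much :func:`sys.getsizeof` grows when a string gets one character longer. This is a sketch that relies on CPython implementation details, not a guaranteed API contract; the helper name ``bytes_per_codepoint`` is hypothetical, and other implementations may report different numbers.

```python
import sys

def bytes_per_codepoint(char, length=100):
    """Estimate storage per codepoint from the growth of sys.getsizeof().

    Hypothetical helper: compares two strings made of the same character
    that differ in length by exactly one codepoint.
    """
    return sys.getsizeof(char * (length + 1)) - sys.getsizeof(char * length)

ascii_cost = bytes_per_codepoint('a')             # pure ASCII
latin1_cost = bytes_per_codepoint('\xe9')         # Latin1, above U+007F
bmp_cost = bytes_per_codepoint('\u20ac')          # BMP, above U+00FF
non_bmp_cost = bytes_per_codepoint('\U0001F600')  # outside the BMP
```

On a CPython build with the PEP 393 representation, the four values are expected to be 1, 1, 2 and 4 respectively.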


PEP 3151: Reworking the OS and IO exception hierarchy
=====================================================

:pep:`3151` - Reworking the OS and IO exception hierarchy
 PEP written and implemented by Antoine Pitrou.

The hierarchy of exceptions raised by operating system errors is now both
simplified and finer-grained.

You don't have to worry anymore about choosing the appropriate exception
type between :exc:`OSError`, :exc:`IOError`, :exc:`EnvironmentError`,
:exc:`WindowsError`, :exc:`mmap.error`, :exc:`socket.error` or
:exc:`select.error`. All these exception types are now only one:
:exc:`OSError`. The other names are kept as aliases for compatibility
reasons.
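
The aliasing can be verified with identity checks. The sketch below leaves out :exc:`WindowsError` because that name only exists on Windows; the ``legacy_names`` dictionary is purely illustrative.

```python
import mmap
import select
import socket

# Each legacy name is expected to be the very same class object as OSError.
legacy_names = {
    'IOError': IOError,
    'EnvironmentError': EnvironmentError,
    'mmap.error': mmap.error,
    'socket.error': socket.error,
    'select.error': select.error,
}
all_are_oserror = all(cls is OSError for cls in legacy_names.values())
```

Because the names are aliases rather than subclasses, an ``except IOError:`` clause in older code keeps catching everything that ``except OSError:`` would.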

Also, it is now easier to catch a specific error condition. Instead of
inspecting the ``errno`` attribute (or ``args[0]``) for a particular
constant from the :mod:`errno` module, you can catch the appropriate
:exc:`OSError` subclass. The available subclasses are the following:

* :exc:`BlockingIOError`
* :exc:`ChildProcessError`
* :exc:`ConnectionError`
* :exc:`FileExistsError`
* :exc:`FileNotFoundError`
* :exc:`InterruptedError`
* :exc:`IsADirectoryError`
* :exc:`NotADirectoryError`
* :exc:`PermissionError`
* :exc:`ProcessLookupError`
* :exc:`TimeoutError`

And the :exc:`ConnectionError` itself has finer-grained subclasses:

* :exc:`BrokenPipeError`
* :exc:`ConnectionAbortedError`
* :exc:`ConnectionRefusedError`
* :exc:`ConnectionResetError`
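
As a sketch of how the subclasses relate to ``errno`` values: constructing :exc:`OSError` with an error number and a message yields an instance of the matching subclass, so handlers can dispatch on type alone. The ``classify`` helper and its labels are hypothetical and exist only for this example.

```python
import errno

# OSError(errno, message) produces the subclass mapped to that error number.
missing = OSError(errno.ENOENT, "No such file or directory")
refused = OSError(errno.ECONNREFUSED, "Connection refused")
disk_full = OSError(errno.ENOSPC, "No space left on device")

def classify(exc):
    """Return a short label using exception types only, never errno checks."""
    try:
        raise exc
    except FileNotFoundError:
        return "missing file"
    except ConnectionError:   # also catches ConnectionRefusedError and friends
        return "connection problem"
    except OSError:
        return "other OS error"

labels = [classify(e) for e in (missing, refused, disk_full)]
```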

Thanks to the new exceptions, common usages of the :mod:`errno` module can now
be avoided. For example, the following code written for Python 3.2::

    from errno import ENOENT, EACCES, EPERM

    try:
        with open("document.txt") as f:
            content = f.read()
    except IOError as err:
        if err.errno == ENOENT:
            print("document.txt file is missing")
        elif err.errno in (EACCES, EPERM):
            print("You are not allowed to read document.txt")
        else:
            raise

can now be written without the :mod:`errno` import and without manual
inspection of exception attributes::

    try:
        with open("document.txt") as f:
            content = f.read()
    except FileNotFoundError:
        print("document.txt file is missing")
    except PermissionError:
        print("You are not allowed to read document.txt")


PEP 380: Syntax for Delegating to a Subgenerator
================================================

:pep:`380` - Syntax for Delegating to a Subgenerator
 PEP written by Greg Ewing.

PEP 380 adds the ``yield from`` expression, allowing a generator to delegate
part of its operations to another generator. This allows a section of code
containing 'yield' to be factored out and placed in another generator.
Additionally, the subgenerator is allowed to return with a value, and the
value is made available to the delegating generator.

While designed primarily for use in delegating to a subgenerator, the ``yield
from`` expression actually allows delegation to arbitrary subiterators.

For simple iterators, ``yield from iterable`` is essentially just a shortened
form of ``for item in iterable: yield item``::

    >>> def g(x):
    ...     yield from range(x, 0, -1)
    ...     yield from range(x)
    ...
    >>> list(g(5))
    [5, 4, 3, 2, 1, 0, 1, 2, 3, 4]

However, unlike an ordinary loop, ``yield from`` allows subgenerators to
receive sent and thrown values directly from the calling scope, and
return a final value to the outer generator::

    >>> def accumulate(start=0):
    ...     tally = start
    ...     while 1:
    ...         next = yield
    ...         if next is None:
    ...             return tally
    ...         tally += next
    ...
    >>> def gather_tallies(tallies, start=0):
    ...     while 1:
    ...         tally = yield from accumulate()
    ...         tallies.append(tally)
    ...
    >>> tallies = []
    >>> acc = gather_tallies(tallies)
    >>> next(acc) # Ensure the accumulator is ready to accept values
    >>> for i in range(10):
    ...     acc.send(i)
    ...
    >>> acc.send(None) # Finish the first tally
    >>> for i in range(5):
    ...     acc.send(i)
    ...
    >>> acc.send(None) # Finish the second tally
    >>> tallies
    [45, 10]

The main principle driving this change is to allow even generators that are
designed to be used with the ``send`` and ``throw`` methods to be split into
multiple subgenerators as easily as a single large function can be split into
multiple subfunctions.

(Implementation by Greg Ewing, integrated into 3.3 by Renaud Blanch, Ryan
Kelly and Nick Coghlan, documentation by Zbigniew Jędrzejewski-Szmek and
Nick Coghlan)


PEP 409: Suppressing exception context
======================================

:pep:`409` - Suppressing exception context
 PEP written by Ethan Furman, implemented by Ethan Furman and Nick Coghlan.

PEP 409 introduces new syntax that allows the display of the chained
exception context to be disabled. This allows cleaner error messages in
applications that convert between exception types::

    >>> class D:
    ...     def __init__(self, extra):
    ...         self._extra_attributes = extra
    ...     def __getattr__(self, attr):
    ...         try:
    ...             return self._extra_attributes[attr]
    ...         except KeyError:
    ...             raise AttributeError(attr) from None
    ...
    >>> D({}).x
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "<stdin>", line 8, in __getattr__
    AttributeError: x

Without the ``from None`` suffix to suppress the cause, the original
exception would be displayed by default::

    >>> class C:
    ...     def __init__(self, extra):
    ...         self._extra_attributes = extra
    ...     def __getattr__(self, attr):
    ...         try:
    ...             return self._extra_attributes[attr]
    ...         except KeyError:
    ...             raise AttributeError(attr)
    ...
    >>> C({}).x
    Traceback (most recent call last):
      File "<stdin>", line 6, in __getattr__
    KeyError: 'x'

    During handling of the above exception, another exception occurred:

    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "<stdin>", line 8, in __getattr__
    AttributeError: x

No debugging capability is lost, as the original exception context remains
available if needed (for example, if an intervening library has incorrectly
suppressed valuable underlying details)::

    >>> try:
    ...     D({}).x
    ... except AttributeError as exc:
    ...     print(repr(exc.__context__))
    ...
    KeyError('x',)


PEP 414: Explicit Unicode literals
==================================

:pep:`414` - Explicit Unicode literals
 PEP written by Armin Ronacher.

To ease the transition from Python 2 for Unicode aware Python applications
that make heavy use of Unicode literals, Python 3.3 once again supports the
"``u``" prefix for string literals. This prefix has no semantic significance
in Python 3; it is provided solely to reduce the number of purely mechanical
changes in migrating to Python 3, making it easier for developers to focus on
the more significant semantic changes (such as the stricter default
separation of binary and text data).
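
A short illustration that the prefix changes nothing about the resulting object; the variable names are illustrative only.

```python
# The same text written with and without the Python 2 style prefix.
prefixed = u"caf\xe9"
unprefixed = "caf\xe9"

same_value = (prefixed == unprefixed)
same_type = (type(prefixed) is str)
```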


PEP 3155: Qualified name for classes and functions
==================================================

:pep:`3155` - Qualified name for classes and functions
 PEP written and implemented by Antoine Pitrou.

Functions and class objects have a new ``__qualname__`` attribute representing
the "path" from the module top-level to their definition. For global functions
and classes, this is the same as ``__name__``. For other functions and classes,
it provides better information about where they were actually defined, and
how they might be accessible from the global scope.

Example with (non-bound) methods::

    >>> class C:
    ...     def meth(self):
    ...         pass
    ...
    >>> C.meth.__name__
    'meth'
    >>> C.meth.__qualname__
    'C.meth'

Example with nested classes::

    >>> class C:
    ...     class D:
    ...         def meth(self):
    ...             pass
    ...
    >>> C.D.__name__
    'D'
    >>> C.D.__qualname__
    'C.D'
    >>> C.D.meth.__name__
    'meth'
    >>> C.D.meth.__qualname__
    'C.D.meth'

Example with nested functions::

    >>> def outer():
    ...     def inner():
    ...         pass
    ...     return inner
    ...
    >>> outer().__name__
    'inner'
    >>> outer().__qualname__
    'outer.<locals>.inner'

The string representation of those objects is also changed to include the
new, more precise information::

    >>> str(C.D)
    "<class '__main__.C.D'>"
    >>> str(C.D.meth)
    '<function C.D.meth at 0x7f46b9fe31e0>'


Other Language Changes
======================

Some smaller changes made to the core Python language are:

* Added support for Unicode name aliases and named sequences.
  Both :func:`unicodedata.lookup()` and ``'\N{...}'`` now resolve name aliases,
  and :func:`unicodedata.lookup()` resolves named sequences too.

  (Contributed by Ezio Melotti in :issue:`12753`)

* Equality comparisons on :func:`range` objects now return a result reflecting
  the equality of the underlying sequences generated by those range objects.

  (:issue:`13201`)

* The ``count()``, ``find()``, ``rfind()``, ``index()`` and ``rindex()``
  methods of :class:`bytes` and :class:`bytearray` objects now accept an
  integer between 0 and 255 as their first argument.

  (:issue:`12170`)
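
The range and bytes changes above can be sketched as follows. The particular ranges and byte strings are arbitrary values chosen for the example.

```python
# Ranges compare by the sequence they generate, not by their arguments.
both_empty = (range(0) == range(2, 1, 3))            # neither yields anything
same_items = (range(0, 3, 2) == range(0, 4, 2))      # both yield 0, 2
different_items = (range(3) != range(4))

# bytes and bytearray search methods accept an integer in range(256).
data = b'banana'
count_of_a = data.count(97)              # 97 is ord('a')
first_n = data.find(110)                 # 110 is ord('n')
last_n = bytearray(data).rindex(110)
```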


New and Improved Modules
========================

abc
---

Improved support for abstract base classes containing descriptors composed with
abstract methods. The recommended approach to declaring abstract descriptors is
now to provide :attr:`__isabstractmethod__` as a dynamically updated
property. The built-in descriptors have been updated accordingly.

  * :class:`abc.abstractproperty` has been deprecated; use :class:`property`
    with :func:`abc.abstractmethod` instead.
  * :class:`abc.abstractclassmethod` has been deprecated; use
    :class:`classmethod` with :func:`abc.abstractmethod` instead.
  * :class:`abc.abstractstaticmethod` has been deprecated; use
    :class:`staticmethod` with :func:`abc.abstractmethod` instead.

(Contributed by Darren Dale in :issue:`11610`)
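
A minimal sketch of the recommended spelling, stacking the built-in descriptors on top of :func:`abc.abstractmethod`. The ``Shape`` and ``Square`` classes are hypothetical and exist only for illustration.

```python
import abc

class Shape(metaclass=abc.ABCMeta):
    """Hypothetical abstract base class used only for this example."""

    @property
    @abc.abstractmethod
    def area(self):
        """Abstract property: replaces abc.abstractproperty."""

    @classmethod
    @abc.abstractmethod
    def unit(cls):
        """Abstract class method: replaces abc.abstractclassmethod."""

class Square(Shape):
    def __init__(self, side):
        self.side = side

    @property
    def area(self):
        return self.side ** 2

    @classmethod
    def unit(cls):
        return cls(1)

def can_instantiate(cls, *args):
    """Return True if cls(*args) succeeds, False if it raises TypeError."""
    try:
        cls(*args)
    except TypeError:
        return False
    return True
```

In both cases ``abc.abstractmethod`` is applied innermost, so that the outer descriptor can report ``__isabstractmethod__`` for the function it wraps.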

array
-----

The :mod:`array` module supports the :c:type:`long long` type using ``q`` and
``Q`` type codes.

(Contributed by Oren Tirosh and Hirokazu Yamamoto in :issue:`1172711`)
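
A small sketch of the new type codes. The values are arbitrary, and the example assumes a platform where ``long long`` is 8 bytes wide, which is the common case.

```python
from array import array

# 'q' stores signed long long values, 'Q' the unsigned variant.
signed = array('q', [-2**62, 0, 2**62])
unsigned = array('Q', [0, 2**64 - 1])

signed_item_size = signed.itemsize       # size in bytes of one element
largest_unsigned = unsigned[1]
```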


bz2
---

The :mod:`bz2` module has been rewritten from scratch. In the process, several
new features have been added:

* :class:`bz2.BZ2File` can now read from and write to arbitrary file-like
  objects, by means of its constructor's *fileobj* argument.

  (Contributed by Nadeem Vawda in :issue:`5863`)

* :class:`bz2.BZ2File` and :func:`bz2.decompress` can now decompress
  multi-stream inputs (such as those produced by the :program:`pbzip2` tool).
  :class:`bz2.BZ2File` can now also be used to create this type of file, using
  the ``'a'`` (append) mode.

  (Contributed by Nir Aides in :issue:`1625`)

* :class:`bz2.BZ2File` now implements all of the :class:`io.BufferedIOBase` API,
  except for the :meth:`detach` and :meth:`truncate` methods.
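
The following sketch exercises multi-stream decompression and a file-like target. One assumption to flag: in the Python releases I am aware of, the file object is passed as the first positional argument of :class:`bz2.BZ2File` rather than through a *fileobj* keyword, so the example uses the positional form. The payloads are arbitrary.

```python
import bz2
import io

# Two independently compressed streams, concatenated into one input.
multi_stream = bz2.compress(b"first ") + bz2.compress(b"second")
combined = bz2.decompress(multi_stream)

# Write compressed data into an in-memory file-like object.
buffer = io.BytesIO()
with bz2.BZ2File(buffer, "w") as compressed_file:
    compressed_file.write(b"payload")

# Closing the BZ2File does not close the wrapped object, so it is still usable.
round_trip = bz2.decompress(buffer.getvalue())
```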


codecs
------

The :mod:`~encodings.mbcs` codec has been rewritten to correctly handle
``replace`` and ``ignore`` error handlers on all Windows versions. The
:mod:`~encodings.mbcs` codec now supports all error handlers, instead of only
``replace`` to encode and ``ignore`` to decode.

A new Windows-only codec has been added: ``cp65001`` (:issue:`13216`). It is the
Windows code page 65001 (Windows UTF-8, ``CP_UTF8``). For example, it is used
by ``sys.stdout`` if the console output code page is set to cp65001 (e.g., using
the ``chcp 65001`` command).

Multibyte CJK decoders now resynchronize faster. They only ignore the first
byte of an invalid byte sequence. For example, ``b'\xff\n'.decode('gb2312',
'replace')`` now returns a ``\n`` after the replacement character.

(:issue:`12016`)

Incremental CJK codec encoders are no longer reset at each call to their
encode() methods. For example::

    $ ./python -q
    >>> import codecs
    >>> encoder = codecs.getincrementalencoder('hz')('strict')
    >>> b''.join(encoder.encode(x) for x in '\u52ff\u65bd\u65bc\u4eba\u3002 Bye.')
    b'~{NpJ)l6HK!#~} Bye.'

This example gives ``b'~{Np~}~{J)~}~{l6~}~{HK~}~{!#~} Bye.'`` with older Python
versions.

(:issue:`12100`)

The ``unicode_internal`` codec has been deprecated.

crypt
-----

The :mod:`crypt` module gains support for salt and the modular crypt format,
along with the new :func:`~crypt.mksalt` function.

(:issue:`10924`)

curses
------

 * If the :mod:`curses` module is linked to the ncursesw library, use Unicode
   functions when Unicode strings or characters are passed (e.g.
   :c:func:`waddwstr`), and bytes functions otherwise (e.g. :c:func:`waddstr`).
 * Use the locale encoding instead of ``utf-8`` to encode Unicode strings.
 * :class:`curses.window` has a new :attr:`curses.window.encoding` attribute.
 * The :class:`curses.window` class has a new :meth:`~curses.window.get_wch`
   method to get a wide character.
 * The :mod:`curses` module has a new :func:`~curses.unget_wch` function to
   push a wide character so the next :meth:`~curses.window.get_wch` will return
   it.

(Contributed by Iñigo Serna in :issue:`6755`)

faulthandler
------------

New module: :mod:`faulthandler`.

 * :envvar:`PYTHONFAULTHANDLER`
 * :option:`-X` ``faulthandler``
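
Besides the environment variable and the command line option, the module can be enabled from code. The sketch below writes to a temporary file so that it does not depend on ``sys.stderr`` having a real file descriptor; the function name ``capture_traceback_dump`` is hypothetical.

```python
import faulthandler
import tempfile

def capture_traceback_dump():
    """Enable faulthandler on a temporary file and return what it dumps.

    Hypothetical helper for illustration. Returns a pair:
    (whether the handler was enabled, the dumped bytes).
    """
    with tempfile.TemporaryFile() as log:
        faulthandler.enable(file=log)          # install the fault handlers
        was_enabled = faulthandler.is_enabled()
        faulthandler.dump_traceback(file=log)  # explicit dump, no crash needed
        faulthandler.disable()                 # uninstall before the file closes
        log.seek(0)
        return was_enabled, log.read()

was_enabled, dump = capture_traceback_dump()
```

The handler is disabled before the temporary file is closed because faulthandler keeps using the file descriptor for as long as it is enabled.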

ftplib
------

The :class:`~ftplib.FTP_TLS` class now provides a new
:meth:`~ftplib.FTP_TLS.ccc` method to revert the control channel back to
plaintext. This can be useful to take advantage of firewalls that know how to
handle NAT with non-secure FTP without opening fixed ports.

(Contributed by Giampaolo Rodolà in :issue:`12139`)


imaplib
-------

The :class:`~imaplib.IMAP4_SSL` constructor now accepts an SSLContext
parameter to control parameters of the secure channel.

(Contributed by Sijin Joseph in :issue:`8808`)


io
--

The :func:`~io.open` function has a new ``'x'`` mode that can be used to
exclusively create a new file, and raise a :exc:`FileExistsError` if the file
already exists. It is based on the C11 'x' mode to fopen().

(Contributed by David Townshend in :issue:`12760`)
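
A minimal sketch of exclusive creation, run inside a temporary directory. The helper name ``create_exclusively`` and the file name are hypothetical.

```python
import os
import tempfile

def create_exclusively(path, text):
    """Create path with the given text; return False if it already exists."""
    try:
        with open(path, "x") as f:   # 'x' fails instead of truncating
            f.write(text)
    except FileExistsError:
        return False
    return True

with tempfile.TemporaryDirectory() as directory:
    target = os.path.join(directory, "lockfile.txt")
    first_attempt = create_exclusively(target, "first")
    second_attempt = create_exclusively(target, "second")
    with open(target) as f:
        surviving_content = f.read()
```

The check and the creation happen in a single step, which avoids the race inherent in testing ``os.path.exists()`` before opening with ``'w'``.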


lzma
----

The newly-added :mod:`lzma` module provides data compression and decompression
using the LZMA algorithm, including support for the ``.xz`` and ``.lzma``
file formats.

(Contributed by Nadeem Vawda and Per Øyvind Karlsen in :issue:`6715`)
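
A short round-trip sketch covering both container formats. The sample data is arbitrary.

```python
import lzma

original = b"spam and eggs " * 100

# Default container: the .xz format.
xz_data = lzma.compress(original)

# Legacy container: the .lzma format, selected with FORMAT_ALONE.
legacy_data = lzma.compress(original, format=lzma.FORMAT_ALONE)

# decompress() detects either container automatically by default.
from_xz = lzma.decompress(xz_data)
from_legacy = lzma.decompress(legacy_data)
```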


math
----

The :mod:`math` module has a new function:

 * :func:`~math.log2`: return the base-2 logarithm of *x*
   (Written by Mark Dickinson in :issue:`11888`).
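
For example, with arbitrarily chosen powers of two:

```python
import math

three = math.log2(8)          # 2 ** 3 == 8
ten = math.log2(1024)         # 2 ** 10 == 1024
minus_one = math.log2(0.5)    # 2 ** -1 == 0.5
```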


nntplib
-------

The :class:`nntplib.NNTP` class now supports the context manager protocol to
unconditionally consume :exc:`socket.error` exceptions and to close the NNTP
connection when done::

    >>> from nntplib import NNTP
    >>> with NNTP('news.gmane.org') as n:
    ...     n.group('gmane.comp.python.committers')
    ...
    ('211 1755 1 1755 gmane.comp.python.committers', 1755, 1, 1755, 'gmane.comp.python.committers')
    >>>

(Contributed by Giampaolo Rodolà in :issue:`9795`)
660
661
Giampaolo Rodolàc9c2c8b2011-02-25 14:39:16 +0000662os
663--
664
Charles-François Natalia003af12011-06-01 20:30:52 +0200665* The :mod:`os` module has a new :func:`~os.pipe2` function that makes it
666 possible to create a pipe with :data:`~os.O_CLOEXEC` or
667 :data:`~os.O_NONBLOCK` flags set atomically. This is especially useful to
668 avoid race conditions in multi-threaded programs.
669
Giampaolo Rodolà18e8bcb2011-02-25 20:57:54 +0000670* The :mod:`os` module has a new :func:`~os.sendfile` function which provides
671 an efficent "zero-copy" way for copying data from one file (or socket)
672 descriptor to another. The phrase "zero-copy" refers to the fact that all of
673 the copying of data between the two descriptors is done entirely by the
674 kernel, with no copying of data into userspace buffers. :func:`~os.sendfile`
675 can be used to efficiently copy data from a file on disk to a network socket,
676 e.g. for downloading a file.
Giampaolo Rodolàc9c2c8b2011-02-25 14:39:16 +0000677
Giampaolo Rodolà18e8bcb2011-02-25 20:57:54 +0000678 (Patch submitted by Ross Lagerwall and Giampaolo Rodolà in :issue:`10882`.)
679
680* The :mod:`os` module has two new functions: :func:`~os.getpriority` and
681 :func:`~os.setpriority`. They can be used to get or set process
682 niceness/priority in a fashion similar to :func:`os.nice` but extended to all
683 processes instead of just the current one.
684
685 (Patch submitted by Giampaolo Rodolà in :issue:`10784`.)
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000686
Charles-François Natali7372b062012-02-05 15:15:38 +0100687* The :mod:`os` module has a new :func:`~os.fwalk` function similar to
688 :func:`~os.walk` except that it also yields file descriptors referring to the
689 directories visited. This is especially useful to avoid symlink races.
690
* "at" functions (:issue:`4761`):

  * :func:`~os.faccessat`
  * :func:`~os.fchmodat`
  * :func:`~os.fchownat`
  * :func:`~os.fstatat`
  * :func:`~os.futimesat`
  * :func:`~os.linkat`
  * :func:`~os.mkdirat`
  * :func:`~os.mkfifoat`
  * :func:`~os.mknodat`
  * :func:`~os.openat`
  * :func:`~os.readlinkat`
  * :func:`~os.renameat`
  * :func:`~os.symlinkat`
  * :func:`~os.unlinkat`
  * :func:`~os.utimensat`

* Extended attributes (:issue:`12720`):

  * :func:`~os.fgetxattr`
  * :func:`~os.flistxattr`
  * :func:`~os.fremovexattr`
  * :func:`~os.fsetxattr`
  * :func:`~os.getxattr`
  * :func:`~os.lgetxattr`
  * :func:`~os.listxattr`
  * :func:`~os.llistxattr`
  * :func:`~os.lremovexattr`
  * :func:`~os.lsetxattr`
  * :func:`~os.removexattr`
  * :func:`~os.setxattr`

* Scheduler functions (:issue:`12655`):

  * :func:`~os.sched_get_priority_max`
  * :func:`~os.sched_get_priority_min`
  * :func:`~os.sched_getaffinity`
  * :func:`~os.sched_getparam`
  * :func:`~os.sched_getscheduler`
  * :func:`~os.sched_rr_get_interval`
  * :func:`~os.sched_setaffinity`
  * :func:`~os.sched_setparam`
  * :func:`~os.sched_setscheduler`
  * :func:`~os.sched_yield`

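To give a feel for the scheduler functions, here is a small sketch that only
queries and re-applies existing settings. It assumes a Linux system where these
calls are available; the variable names are illustrative.

```python
import os

# pid 0 means "the caller".
cpus = os.sched_getaffinity(0)
print("allowed CPUs:", sorted(cpus))

# Re-apply the same CPU set; a smaller set would pin the caller.
os.sched_setaffinity(0, cpus)

# Valid static priority range for the SCHED_FIFO policy.
fifo_range = (
    os.sched_get_priority_min(os.SCHED_FIFO),
    os.sched_get_priority_max(os.SCHED_FIFO),
)
print("SCHED_FIFO priorities:", fifo_range)

# Voluntarily give up the CPU to other runnable tasks.
os.sched_yield()
```
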
* Additional POSIX functions in the :mod:`os` module (:issue:`10812`):

  * :func:`~os.fexecve`
  * :func:`~os.futimens`
  * :func:`~os.futimes`
  * :func:`~os.lockf`
  * :func:`~os.lutimes`
  * :func:`~os.posix_fadvise`
  * :func:`~os.posix_fallocate`
  * :func:`~os.pread`
  * :func:`~os.pwrite`
  * :func:`~os.readv`
  * :func:`~os.sync`
  * :func:`~os.truncate`
  * :func:`~os.waitid`
  * :func:`~os.writev`

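As one example from this group, :func:`~os.pread` and :func:`~os.pwrite` read
and write at an explicit offset without moving the descriptor's file position.
A minimal sketch using a temporary file:

```python
import os
import tempfile

fd, path = tempfile.mkstemp()
try:
    os.write(fd, b"0123456789")        # file position is now 10
    # Overwrite two bytes at offset 4 without moving the file position.
    os.pwrite(fd, b"XY", 4)
    # Read four bytes starting at offset 3, again leaving the position alone.
    chunk = os.pread(fd, 4, 3)
    position = os.lseek(fd, 0, os.SEEK_CUR)
finally:
    os.close(fd)
    os.remove(path)

print(chunk, position)
```
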
* Other new functions:

  * :func:`~os.flistdir` (:issue:`10755`)
  * :func:`~os.getgrouplist` (:issue:`9344`)


packaging
---------

:mod:`distutils` has undergone additions and refactoring under a new name,
:mod:`packaging`, to allow developers to break backward compatibility.
:mod:`distutils` is still provided in the standard library, but users are
encouraged to transition to :mod:`packaging`. For older versions of Python, a
backport compatible with 2.4+ and 3.1+ will be made available on PyPI under the
name :mod:`distutils2`.

.. TODO add examples and howto to the packaging docs and link to them


pydoc
-----

The Tk GUI and the :func:`~pydoc.serve` function have been removed from the
:mod:`pydoc` module: ``pydoc -g`` and :func:`~pydoc.serve` had been deprecated
in Python 3.2.


sched
-----

* :meth:`~sched.scheduler.run` now accepts a *blocking* parameter which, when
  set to False, makes the method execute the scheduled events due to expire
  soonest (if any) and then return immediately.
  This is useful in case you want to use the :class:`~sched.scheduler` in
  non-blocking applications. (Contributed by Giampaolo Rodolà in :issue:`13449`.)

* The :class:`~sched.scheduler` class can now be safely used in multi-threaded
  environments. (Contributed by Josiah Carlson and Giampaolo Rodolà in
  :issue:`8684`.)

* The *timefunc* and *delayfunc* parameters of the :class:`~sched.scheduler`
  class constructor are now optional and default to :func:`time.time` and
  :func:`time.sleep` respectively. (Contributed by Chris Clark in
  :issue:`13245`.)

* The *argument* parameter of :meth:`~sched.scheduler.enter` and
  :meth:`~sched.scheduler.enterabs` is now optional. (Contributed by Chris
  Clark in :issue:`13245`.)

* :meth:`~sched.scheduler.enter` and :meth:`~sched.scheduler.enterabs`
  now accept a *kwargs* parameter. (Contributed by Chris Clark in
  :issue:`13245`.)

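The changes to :mod:`sched` can be seen together in a short sketch. The
``greet`` callback and the ``results`` list are illustrative names, not part of
the module.

```python
import sched

results = []


def greet(greeting, name="world"):
    results.append("{} {}".format(greeting, name))


# No timefunc/delayfunc arguments: the constructor defaults are used.
s = sched.scheduler()

# One event that is due immediately, passing a keyword argument via *kwargs*.
s.enter(0, 1, greet, argument=("hello",), kwargs={"name": "sched"})
# One event far in the future.
later = s.enter(60, 1, greet, argument=("later",))

# Runs only what is already due, then returns instead of sleeping.
s.run(blocking=False)
pending = len(s.queue)

print(results, pending)
s.cancel(later)  # tidy up the event that never ran
```

With ``blocking=False`` the call returns after the first event; the second one
stays queued until a later call to :meth:`~sched.scheduler.run`.
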

shutil
------

* The :mod:`shutil` module has these new functions:

  * :func:`~shutil.disk_usage`: provides total, used and free disk space
    statistics. (Contributed by Giampaolo Rodolà in :issue:`12442`.)
  * :func:`~shutil.chown`: changes the user and/or group of the given path,
    accepting user and group names as well as numeric ids. (Contributed by
    Sandro Tosi in :issue:`12191`.)

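For instance, a minimal sketch that reports disk space for the current
directory (the path ``"."`` is just an example):

```python
import shutil

usage = shutil.disk_usage(".")

# The result unpacks into three byte counts.
total, used, free = usage
print("total: {:.1f} GiB".format(total / 2**30))
print("used:  {:.1f} GiB".format(used / 2**30))
print("free:  {:.1f} GiB".format(free / 2**30))
```
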

signal
------

* The :mod:`signal` module has new functions:

  * :func:`~signal.pthread_sigmask`: fetch and/or change the signal mask of the
    calling thread (Contributed by Jean-Paul Calderone in :issue:`8407`);
  * :func:`~signal.pthread_kill`: send a signal to a thread;
  * :func:`~signal.sigpending`: examine pending signals;
  * :func:`~signal.sigwait`: wait for a signal;
  * :func:`~signal.sigwaitinfo`: wait for a signal, returning detailed
    information about it;
  * :func:`~signal.sigtimedwait`: like :func:`~signal.sigwaitinfo` but with a
    timeout.

* The signal handler writes the signal number as a single byte instead of
  a NUL byte into the wakeup file descriptor. This makes it possible to wait
  for more than one signal and know which signals were raised.

* :func:`signal.signal` and :func:`signal.siginterrupt` now raise an
  :exc:`OSError` instead of a :exc:`RuntimeError`: :exc:`OSError` has an
  ``errno`` attribute.

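Several of the new functions combine naturally. The sketch below blocks a
signal in the calling thread, sends it to that same thread, checks that it is
pending, and then collects it synchronously. The choice of ``SIGUSR1`` is an
assumption made for the example, and a POSIX platform is required.

```python
import signal
import threading

sig = signal.SIGUSR1

# Block the signal in this thread so it becomes pending instead of
# triggering its default action.
old_mask = signal.pthread_sigmask(signal.SIG_BLOCK, {sig})
try:
    # Deliver the signal to the calling thread only.
    signal.pthread_kill(threading.get_ident(), sig)
    was_pending = sig in signal.sigpending()
    # sigwait() consumes the pending signal and returns its number.
    received = signal.sigwait({sig})
finally:
    # Restore whatever mask was in place before.
    signal.pthread_sigmask(signal.SIG_SETMASK, old_mask)

print(was_pending, received)
```
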
smtplib
-------

The :class:`~smtplib.SMTP_SSL` constructor and the :meth:`~smtplib.SMTP.starttls`
method now accept an SSLContext parameter to control parameters of the secure
channel.

(Contributed by Kasun Herath in :issue:`8809`.)


socket
------

* The :class:`~socket.socket` class now exposes additional methods to process
  ancillary data when supported by the underlying platform:

  * :meth:`~socket.socket.sendmsg`
  * :meth:`~socket.socket.recvmsg`
  * :meth:`~socket.socket.recvmsg_into`

  (Contributed by David Watson in :issue:`6560`, based on an earlier patch by
  Heiko Wundram.)

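A common use of ancillary data is passing an open file descriptor to another
process over a Unix domain socket. The sketch below does this within a single
process for brevity; the helpers ``send_fd`` and ``recv_fd`` are illustrative
names, and a platform supporting ``SCM_RIGHTS`` is assumed.

```python
import array
import os
import socket


def send_fd(sock, fd):
    """Send one byte of data plus *fd* as SCM_RIGHTS ancillary data."""
    fds = array.array("i", [fd])
    sock.sendmsg([b"F"], [(socket.SOL_SOCKET, socket.SCM_RIGHTS, fds)])


def recv_fd(sock):
    """Receive one byte of data and return the descriptor sent with it."""
    fds = array.array("i")
    data, ancdata, flags, addr = sock.recvmsg(1, socket.CMSG_LEN(fds.itemsize))
    for level, kind, payload in ancdata:
        if level == socket.SOL_SOCKET and kind == socket.SCM_RIGHTS:
            # Ignore any trailing bytes that do not form a whole int.
            usable = len(payload) - (len(payload) % fds.itemsize)
            fds.frombytes(payload[:usable])
    return fds[0]


# Usage: hand the read end of a pipe across a socket pair.
left, right = socket.socketpair()
read_end, write_end = os.pipe()
try:
    send_fd(left, read_end)
    received_fd = recv_fd(right)
    os.write(write_end, b"ping")
    message = os.read(received_fd, 4)
    os.close(received_fd)
finally:
    os.close(read_end)
    os.close(write_end)
    left.close()
    right.close()

print(message)
```

The received descriptor is a new number referring to the same open pipe, which
is why data written to ``write_end`` can be read through it.
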
* The :class:`~socket.socket` class now supports the PF_CAN protocol family
  (http://en.wikipedia.org/wiki/Socketcan), on Linux
  (http://lwn.net/Articles/253425).

  (Contributed by Matthias Fuchs, updated by Tiago Gonçalves in :issue:`10141`.)

* The :class:`~socket.socket` class now supports the PF_RDS protocol family
  (http://en.wikipedia.org/wiki/Reliable_Datagram_Sockets and
  http://oss.oracle.com/projects/rds/).


ssl
---

* The :mod:`ssl` module has two new random generation functions:

  * :func:`~ssl.RAND_bytes`: generate cryptographically strong
    pseudo-random bytes.
  * :func:`~ssl.RAND_pseudo_bytes`: generate pseudo-random bytes.

  (Contributed by Victor Stinner in :issue:`12049`.)

* The :mod:`ssl` module now exposes a finer-grained exception hierarchy
  in order to make it easier to inspect the various kinds of errors.

  (Contributed by Antoine Pitrou in :issue:`11183`.)

* :meth:`~ssl.SSLContext.load_cert_chain` now accepts a *password* argument
  to be used if the private key is encrypted.

  (Contributed by Adam Simpkins in :issue:`12803`.)

* Diffie-Hellman key exchange, both regular and Elliptic Curve-based, is
  now supported through the :meth:`~ssl.SSLContext.load_dh_params` and
  :meth:`~ssl.SSLContext.set_ecdh_curve` methods.

  (Contributed by Antoine Pitrou in :issue:`13626` and :issue:`13627`.)

* SSL sockets have a new :meth:`~ssl.SSLSocket.get_channel_binding` method
  allowing the implementation of certain authentication mechanisms such as
  SCRAM-SHA-1-PLUS.

  (Contributed by Jacek Konieczny in :issue:`12551`.)

* You can query the SSL compression algorithm used by an SSL socket, thanks
  to its new :meth:`~ssl.SSLSocket.compression` method.

  (Contributed by Antoine Pitrou in :issue:`13634`.)


sys
---

* The :mod:`sys` module has a new :data:`~sys.thread_info` :term:`struct
  sequence` holding information about the thread implementation.

  (:issue:`11223`)


time
----

The :mod:`time` module has new functions:

* :func:`~time.clock_getres` and :func:`~time.clock_gettime` functions and
  ``CLOCK_xxx`` constants.
* :func:`~time.monotonic`: monotonic clock.
* :func:`~time.wallclock`.

(Contributed by Victor Stinner in :issue:`10278`.)

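A monotonic clock is the natural choice for measuring elapsed time, since it is
not affected by adjustments to the system clock. A small sketch, assuming a
platform that provides ``CLOCK_MONOTONIC``:

```python
import time

start = time.monotonic()
time.sleep(0.05)
elapsed = time.monotonic() - start
print("elapsed: {:.3f} s".format(elapsed))

# Resolution and current reading of the same kind of clock, queried directly.
resolution = time.clock_getres(time.CLOCK_MONOTONIC)
reading = time.clock_gettime(time.CLOCK_MONOTONIC)
print("resolution: {} s".format(resolution))
```
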

urllib
------

The :class:`~urllib.request.Request` class now accepts a *method* argument
used by :meth:`~urllib.request.Request.get_method` to determine what HTTP method
should be used. For example, this will send a ``'HEAD'`` request::

   >>> from urllib.request import Request, urlopen
   >>> urlopen(Request('http://www.python.org', method='HEAD'))

(:issue:`1673007`)


Optimizations
=============

Major performance enhancements have been added:

* Thanks to :pep:`393`, some operations on Unicode strings have been optimized:

  * the memory footprint is divided by 2 to 4 depending on the text
  * encoding an ASCII string to UTF-8 no longer needs to encode characters:
    the UTF-8 representation is shared with the ASCII representation
  * the UTF-8 encoder has been optimized
  * repeating a single ASCII letter and getting a substring of an ASCII string
    are 4 times faster


Build and C API Changes
=======================

Changes to Python's build process and to the C API include:

* New :pep:`3118`-related function:

  * :c:func:`PyMemoryView_FromMemory`

* :pep:`393` added new Unicode types, macros and functions:

  * High-level API:

    * :c:func:`PyUnicode_CopyCharacters`
    * :c:func:`PyUnicode_FindChar`
    * :c:func:`PyUnicode_GetLength`, :c:macro:`PyUnicode_GET_LENGTH`
    * :c:func:`PyUnicode_New`
    * :c:func:`PyUnicode_Substring`
    * :c:func:`PyUnicode_ReadChar`, :c:func:`PyUnicode_WriteChar`

  * Low-level API:

    * :c:type:`Py_UCS1`, :c:type:`Py_UCS2`, :c:type:`Py_UCS4` types
    * :c:type:`PyASCIIObject` and :c:type:`PyCompactUnicodeObject` structures
    * :c:macro:`PyUnicode_READY`
    * :c:func:`PyUnicode_FromKindAndData`
    * :c:func:`PyUnicode_AsUCS4`, :c:func:`PyUnicode_AsUCS4Copy`
    * :c:macro:`PyUnicode_DATA`, :c:macro:`PyUnicode_1BYTE_DATA`,
      :c:macro:`PyUnicode_2BYTE_DATA`, :c:macro:`PyUnicode_4BYTE_DATA`
    * :c:macro:`PyUnicode_KIND` with :c:type:`PyUnicode_Kind` enum:
      :c:data:`PyUnicode_WCHAR_KIND`, :c:data:`PyUnicode_1BYTE_KIND`,
      :c:data:`PyUnicode_2BYTE_KIND`, :c:data:`PyUnicode_4BYTE_KIND`
    * :c:macro:`PyUnicode_READ`, :c:macro:`PyUnicode_READ_CHAR`, :c:macro:`PyUnicode_WRITE`
    * :c:macro:`PyUnicode_MAX_CHAR_VALUE`



Deprecated
==========

Unsupported Operating Systems
-----------------------------

OS/2 and VMS are no longer supported due to the lack of a maintainer.

Windows 2000 and Windows platforms which set ``COMSPEC`` to ``command.com``
are no longer supported due to maintenance burden.


Deprecated Python modules, functions and methods
------------------------------------------------

* The :mod:`packaging` module replaces the :mod:`distutils` module.
* The ``unicode_internal`` codec has been deprecated because of
  :pep:`393`; use UTF-8, UTF-16 (``utf-16-le`` or ``utf-16-be``), or UTF-32
  (``utf-32-le`` or ``utf-32-be``) instead.
* :meth:`ftplib.FTP.nlst` and :meth:`ftplib.FTP.dir`: use
  :meth:`ftplib.FTP.mlsd`.
* :func:`platform.popen`: use the :mod:`subprocess` module. Check especially
  the :ref:`subprocess-replacements` section.
* :issue:`13374`: The Windows bytes API has been deprecated in the :mod:`os`
  module. Use Unicode filenames instead of bytes filenames, so that code no
  longer depends on the ANSI code page and supports any filename.
* :issue:`13988`: The :mod:`xml.etree.cElementTree` module is deprecated. The
  accelerator is used automatically whenever available.


Deprecated functions and types of the C API
-------------------------------------------

The :c:type:`Py_UNICODE` type has been deprecated by :pep:`393` and will be
removed in Python 4. All functions using this type are deprecated.

Unicode functions and methods using :c:type:`Py_UNICODE` and
:c:type:`Py_UNICODE*` types:

 * :c:macro:`PyUnicode_FromUnicode`: use :c:func:`PyUnicode_FromWideChar` or
   :c:func:`PyUnicode_FromKindAndData`
 * :c:macro:`PyUnicode_AS_UNICODE`, :c:func:`PyUnicode_AsUnicode`,
   :c:func:`PyUnicode_AsUnicodeAndSize`: use :c:func:`PyUnicode_AsWideCharString`
 * :c:macro:`PyUnicode_AS_DATA`: use :c:macro:`PyUnicode_DATA` with
   :c:macro:`PyUnicode_READ` and :c:macro:`PyUnicode_WRITE`
 * :c:macro:`PyUnicode_GET_SIZE`, :c:func:`PyUnicode_GetSize`: use
   :c:macro:`PyUnicode_GET_LENGTH` or :c:func:`PyUnicode_GetLength`
 * :c:macro:`PyUnicode_GET_DATA_SIZE`: use
   ``PyUnicode_GET_LENGTH(str) * PyUnicode_KIND(str)`` (only works on ready
   strings)
 * :c:func:`PyUnicode_AsUnicodeCopy`: use :c:func:`PyUnicode_AsUCS4Copy` or
   :c:func:`PyUnicode_AsWideCharString`
 * :c:func:`PyUnicode_GetMax`


Functions and macros manipulating Py_UNICODE* strings:

 * :c:macro:`Py_UNICODE_strlen`: use :c:func:`PyUnicode_GetLength` or
   :c:macro:`PyUnicode_GET_LENGTH`
 * :c:macro:`Py_UNICODE_strcat`: use :c:func:`PyUnicode_CopyCharacters` or
   :c:func:`PyUnicode_FromFormat`
 * :c:macro:`Py_UNICODE_strcpy`, :c:macro:`Py_UNICODE_strncpy`,
   :c:macro:`Py_UNICODE_COPY`: use :c:func:`PyUnicode_CopyCharacters` or
   :c:func:`PyUnicode_Substring`
 * :c:macro:`Py_UNICODE_strcmp`: use :c:func:`PyUnicode_Compare`
 * :c:macro:`Py_UNICODE_strncmp`: use :c:func:`PyUnicode_Tailmatch`
 * :c:macro:`Py_UNICODE_strchr`, :c:macro:`Py_UNICODE_strrchr`: use
   :c:func:`PyUnicode_FindChar`
 * :c:macro:`Py_UNICODE_FILL`: use :c:func:`PyUnicode_Fill`
 * :c:macro:`Py_UNICODE_MATCH`

Encoders:

 * :c:func:`PyUnicode_Encode`: use :c:func:`PyUnicode_AsEncodedObject`
 * :c:func:`PyUnicode_EncodeUTF7`
 * :c:func:`PyUnicode_EncodeUTF8`: use :c:func:`PyUnicode_AsUTF8` or
   :c:func:`PyUnicode_AsUTF8String`
 * :c:func:`PyUnicode_EncodeUTF32`
 * :c:func:`PyUnicode_EncodeUTF16`
 * :c:func:`PyUnicode_EncodeUnicodeEscape`: use
   :c:func:`PyUnicode_AsUnicodeEscapeString`
 * :c:func:`PyUnicode_EncodeRawUnicodeEscape`: use
   :c:func:`PyUnicode_AsRawUnicodeEscapeString`
 * :c:func:`PyUnicode_EncodeLatin1`: use :c:func:`PyUnicode_AsLatin1String`
 * :c:func:`PyUnicode_EncodeASCII`: use :c:func:`PyUnicode_AsASCIIString`
 * :c:func:`PyUnicode_EncodeCharmap`
 * :c:func:`PyUnicode_TranslateCharmap`
 * :c:func:`PyUnicode_EncodeMBCS`: use :c:func:`PyUnicode_AsMBCSString` or
   :c:func:`PyUnicode_EncodeCodePage` (with ``CP_ACP`` code_page)
 * :c:func:`PyUnicode_EncodeDecimal`,
   :c:func:`PyUnicode_TransformDecimalToASCII`


Porting to Python 3.3
=====================

This section lists previously described changes and other bugfixes
that may require changes to your code.

Porting Python code
-------------------

.. XXX add a point about hash randomization and that it's always on in 3.3

* :issue:`12326`: On Linux, ``sys.platform`` doesn't contain the major version
  anymore. It is now always ``'linux'``, instead of ``'linux2'`` or
  ``'linux3'`` depending on the Linux version used to build Python. Replace
  ``sys.platform == 'linux2'`` with ``sys.platform.startswith('linux')``, or
  directly ``sys.platform == 'linux'`` if you don't need to support older
  Python versions.

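A version-independent check might look like the following sketch; the helper
name ``is_linux`` is illustrative.

```python
import sys


def is_linux(platform=sys.platform):
    """Return True for 'linux' as well as the older 'linux2' and 'linux3'."""
    return platform.startswith("linux")


print(is_linux())
```

Because it matches on the prefix, the same code works both on Python 3.3 and
on older versions that still report a major version number.
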
Porting C code
--------------

* In the course of changes to the buffer API the undocumented
  :c:member:`~Py_buffer.smalltable` member of the
  :c:type:`Py_buffer` structure has been removed and the
  layout of the :c:type:`PyMemoryViewObject` has changed.

  All extensions relying on the relevant parts in ``memoryobject.h``
  or ``object.h`` must be rebuilt.

* Due to :ref:`PEP 393 <pep-393>`, the :c:type:`Py_UNICODE` type and all
  functions using this type are deprecated (but will stay available for
  at least five years). If you were using low-level Unicode APIs to
  construct and access unicode objects and you want to benefit from the
  memory footprint reduction provided by PEP 393, you have to convert
  your code to the new :doc:`Unicode API <../c-api/unicode>`.

  However, if you only have been using high-level functions such as
  :c:func:`PyUnicode_Concat`, :c:func:`PyUnicode_Join` or
  :c:func:`PyUnicode_FromFormat`, your code will automatically take
  advantage of the new unicode representations.

Building C extensions
---------------------

* The range of possible file names for C extensions has been narrowed.
  Very rarely used spellings have been suppressed: under POSIX, files
  named ``xxxmodule.so``, ``xxxmodule.abi3.so`` and
  ``xxxmodule.cpython-*.so`` are no longer recognized as implementing
  the ``xxx`` module. If you had been generating such files, you have
  to switch to the other spellings (i.e., remove the ``module`` string
  from the file names).

  (Implemented in :issue:`14040`.)


Other issues
------------

.. Issue #11591: When :program:`python` was started with :option:`-S`,
   ``import site`` will not add site-specific paths to the module search
   paths. In previous versions, it did. See changeset for doc changes in
   various files. Contributed by Carl Meyer with editions by Éric Araujo.

.. Issue #10998: the -Q command-line flag and related artifacts have been
   removed. Code checking sys.flags.division_warning will need updating.
   Contributed by Éric Araujo.