blob: 28294cb977d80d2b44ee73168f4dc4fe4a37c2c5 [file] [log] [blame]
Giampaolo Rodolà3108f982011-02-24 20:59:48 +00001****************************
2 What's New In Python 3.3
3****************************
4
5:Author: Raymond Hettinger
6:Release: |release|
7:Date: |today|
8
Éric Araujob07b97f2011-10-05 01:03:34 +02009.. Rules for maintenance:
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000010
11 * Anyone can add text to this document. Do not spend very much time
12 on the wording of your changes, because your text will probably
13 get rewritten to some degree.
14
15 * The maintainer will go through Misc/NEWS periodically and add
16 changes; it's therefore more important to add your changes to
17 Misc/NEWS than to this file.
18
19 * This is not a complete list of every single change; completeness
20 is the purpose of Misc/NEWS. Some changes I consider too small
21 or esoteric to include. If such a change is added to the text,
22 I'll just remove it. (This is another reason you shouldn't spend
23 too much time on writing your addition.)
24
25 * If you want to draw your new text to the attention of the
26 maintainer, add 'XXX' to the beginning of the paragraph or
27 section.
28
29 * It's OK to just add a fragmentary note about a change. For
30 example: "XXX Describe the transmogrify() function added to the
31 socket module." The maintainer will research the change and
32 write the necessary text.
33
34 * You can comment out your additions if you like, but it's not
35 necessary (especially when a final release is some months away).
36
37 * Credit the author of a patch or bugfix. Just the name is
38 sufficient; the e-mail address isn't necessary.
39
40 * It's helpful to add the bug/patch number as a comment:
41
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000042 XXX Describe the transmogrify() function added to the socket
43 module.
Éric Araujob07b97f2011-10-05 01:03:34 +020044 (Contributed by P.Y. Developer in :issue:`12345`.)
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000045
Éric Araujob07b97f2011-10-05 01:03:34 +020046 This saves the maintainer the effort of going through the Mercurial log
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000047 when researching a change.
48
49This article explains the new features in Python 3.3, compared to 3.2.
50
51
Nick Coghlan98e20702012-03-06 21:50:13 +100052.. pep-3118-update:
53
Stefan Krah9a2d99e2012-02-25 12:24:21 +010054PEP 3118: New memoryview implementation and buffer protocol documentation
55=========================================================================
56
57:issue:`10181` - memoryview bug fixes and features.
58 Written by Stefan Krah.
59
60The new memoryview implementation comprehensively fixes all ownership and
61lifetime issues of dynamically allocated fields in the Py_buffer struct
62that led to multiple crash reports. Additionally, several functions that
63crashed or returned incorrect results for non-contiguous or multi-dimensional
64input have been fixed.
65
66The memoryview object now has a PEP-3118 compliant getbufferproc()
67that checks the consumer's request type. Many new features have been
68added, most of them work in full generality for non-contiguous arrays
69and arrays with suboffsets.
70
71The documentation has been updated, clearly spelling out responsibilities
72for both exporters and consumers. Buffer request flags are grouped into
73basic and compound flags. The memory layout of non-contiguous and
74multi-dimensional NumPy-style arrays is explained.
75
76Features
77--------
78
79* All native single character format specifiers in struct module syntax
80 (optionally prefixed with '@') are now supported.
81
82* With some restrictions, the cast() method allows changing of format and
83 shape of C-contiguous arrays.
84
85* Multi-dimensional list representations are supported for any array type.
86
87* Multi-dimensional comparisons are supported for any array type.
88
89* All array types are hashable if the exporting object is hashable
Nick Coghlan98e20702012-03-06 21:50:13 +100090 and the view is read-only. (Contributed by Antoine Pitrou in
91 :issue:`13411`)
92
Stefan Krah9a2d99e2012-02-25 12:24:21 +010093
94* Arbitrary slicing of any 1-D arrays type is supported. For example, it
95 is now possible to reverse a memoryview in O(1) by using a negative step.
96
97API changes
98-----------
99
100* The maximum number of dimensions is officially limited to 64.
101
102* The representation of empty shape, strides and suboffsets is now
103 an empty tuple instead of None.
104
105* Accessing a memoryview element with format 'B' (unsigned bytes)
106 now returns an integer (in accordance with the struct module syntax).
107 For returning a bytes object the view must be cast to 'c' first.
108
Stefan Krah54c32032012-02-29 17:47:21 +0100109* For further changes see `Build and C API Changes`_ and `Porting C code`_ .
Stefan Krah9a2d99e2012-02-25 12:24:21 +0100110
Antoine Pitrou037ffbf2011-10-24 00:25:41 +0200111.. _pep-393:
112
Ezio Melotti48a2f8f2011-09-29 00:18:19 +0300113PEP 393: Flexible String Representation
114=======================================
115
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200116The Unicode string type is changed to support multiple internal
117representations, depending on the character with the largest Unicode ordinal
118(1, 2, or 4 bytes) in the represented string. This allows a space-efficient
119representation in common cases, but gives access to full UCS-4 on all
120systems. For compatibility with existing APIs, several representations may
121exist in parallel; over time, this compatibility should be phased out.
Ezio Melotti397546a2011-09-29 08:34:36 +0300122
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200123On the Python side, there should be no downside to this change.
Ezio Melotti397546a2011-09-29 08:34:36 +0300124
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200125On the C API side, PEP 393 is fully backward compatible. The legacy API
126should remain available at least five years. Applications using the legacy
127API will not fully benefit of the memory reduction, or - worse - may use
128a bit more memory, because Python may have to maintain two versions of each
129string (in the legacy format and in the new efficient storage).
130
Antoine Pitrou0599b5b2011-11-29 22:45:07 +0100131Functionality
132-------------
133
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200134Changes introduced by :pep:`393` are the following:
Ezio Melotti48a2f8f2011-09-29 00:18:19 +0300135
Ezio Melotti397546a2011-09-29 08:34:36 +0300136* Python now always supports the full range of Unicode codepoints, including
137 non-BMP ones (i.e. from ``U+0000`` to ``U+10FFFF``). The distinction between
138 narrow and wide builds no longer exists and Python now behaves like a wide
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200139 build, even under Windows.
Ezio Melotti397546a2011-09-29 08:34:36 +0300140
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200141* With the death of narrow builds, the problems specific to narrow builds have
142 also been fixed, for example:
Ezio Melotti397546a2011-09-29 08:34:36 +0300143
144 * :func:`len` now always returns 1 for non-BMP characters,
145 so ``len('\U0010FFFF') == 1``;
146
147 * surrogate pairs are not recombined in string literals,
148 so ``'\uDBFF\uDFFF' != '\U0010FFFF'``;
149
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200150 * indexing or slicing non-BMP characters returns the expected value,
Ezio Melotti397546a2011-09-29 08:34:36 +0300151 so ``'\U0010FFFF'[0]`` now returns ``'\U0010FFFF'`` and not ``'\uDBFF'``;
152
Antoine Pitroud136aec2011-11-17 01:48:06 +0100153 * all other functions in the standard library now correctly handle
Antoine Pitroufd9b4162011-10-24 00:14:43 +0200154 non-BMP codepoints.
Ezio Melotti397546a2011-09-29 08:34:36 +0300155
Ezio Melotti48a2f8f2011-09-29 00:18:19 +0300156* The value of :data:`sys.maxunicode` is now always ``1114111`` (``0x10FFFF``
157 in hexadecimal). The :c:func:`PyUnicode_GetMax` function still returns
158 either ``0xFFFF`` or ``0x10FFFF`` for backward compatibility, and it should
159 not be used with the new Unicode API (see :issue:`13054`).
160
Ezio Melotti397546a2011-09-29 08:34:36 +0300161* The :file:`./configure` flag ``--with-wide-unicode`` has been removed.
Victor Stinner7d637ab2011-09-29 02:56:16 +0200162
Antoine Pitrou0599b5b2011-11-29 22:45:07 +0100163Performance and resource usage
164------------------------------
165
166The storage of Unicode strings now depends on the highest codepoint in the string:
167
168* pure ASCII and Latin1 strings (``U+0000-U+00FF``) use 1 byte per codepoint;
169
170* BMP strings (``U+0000-U+FFFF``) use 2 bytes per codepoint;
171
172* non-BMP strings (``U+10000-U+10FFFF``) use 4 bytes per codepoint.
173
Martin v. Löwisde157cc2012-03-06 08:42:17 +0100174The net effect is that for most applications, memory usage of string
175storage should decrease significantly - especially compared to former
176wide unicode builds - as, in many cases, strings will be pure ASCII
177even in international contexts (because many strings store non-human
178language data, such as XML fragments, HTTP headers, JSON-encoded data,
179etc.). We also hope that it will, for the same reasons, increase CPU
180cache efficiency on non-trivial applications. The memory usage of
181Python 3.3 is two to three times smaller than Python 3.2, and a little
182bit better than Python 2.7, on a Django benchmark (see the PEP for
183details).
Antoine Pitrou0599b5b2011-11-29 22:45:07 +0100184
Éric Araujob07b97f2011-10-05 01:03:34 +0200185
Victor Stinnera1bf2982011-10-12 20:35:02 +0200186PEP 3151: Reworking the OS and IO exception hierarchy
187=====================================================
188
189:pep:`3151` - Reworking the OS and IO exception hierarchy
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200190 PEP written and implemented by Antoine Pitrou.
Victor Stinnera1bf2982011-10-12 20:35:02 +0200191
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200192The hierarchy of exceptions raised by operating system errors is now both
193simplified and finer-grained.
Victor Stinnera1bf2982011-10-12 20:35:02 +0200194
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200195You don't have to worry anymore about choosing the appropriate exception
196type between :exc:`OSError`, :exc:`IOError`, :exc:`EnvironmentError`,
197:exc:`WindowsError`, :exc:`mmap.error`, :exc:`socket.error` or
198:exc:`select.error`. All these exception types are now only one:
199:exc:`OSError`. The other names are kept as aliases for compatibility
200reasons.
Victor Stinnera1bf2982011-10-12 20:35:02 +0200201
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200202Also, it is now easier to catch a specific error condition. Instead of
203inspecting the ``errno`` attribute (or ``args[0]``) for a particular
204constant from the :mod:`errno` module, you can catch the adequate
205:exc:`OSError` subclass. The available subclasses are the following:
Victor Stinnera1bf2982011-10-12 20:35:02 +0200206
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200207* :exc:`BlockingIOError`
208* :exc:`ChildProcessError`
209* :exc:`ConnectionError`
210* :exc:`FileExistsError`
211* :exc:`FileNotFoundError`
212* :exc:`InterruptedError`
213* :exc:`IsADirectoryError`
214* :exc:`NotADirectoryError`
215* :exc:`PermissionError`
216* :exc:`ProcessLookupError`
217* :exc:`TimeoutError`
Victor Stinnera1bf2982011-10-12 20:35:02 +0200218
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200219And the :exc:`ConnectionError` itself has finer-grained subclasses:
Victor Stinnera1bf2982011-10-12 20:35:02 +0200220
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200221* :exc:`BrokenPipeError`
222* :exc:`ConnectionAbortedError`
223* :exc:`ConnectionRefusedError`
224* :exc:`ConnectionResetError`
Victor Stinnera1bf2982011-10-12 20:35:02 +0200225
226Thanks to the new exceptions, common usages of the :mod:`errno` can now be
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200227avoided. For example, the following code written for Python 3.2::
Victor Stinnera1bf2982011-10-12 20:35:02 +0200228
229 from errno import ENOENT, EACCES, EPERM
230
231 try:
232 with open("document.txt") as f:
233 content = f.read()
234 except IOError as err:
235 if err.errno == ENOENT:
236 print("document.txt file is missing")
237 elif err.errno in (EACCES, EPERM):
238 print("You are not allowed to read document.txt")
239 else:
240 raise
241
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200242can now be written without the :mod:`errno` import and without manual
243inspection of exception attributes::
Victor Stinnera1bf2982011-10-12 20:35:02 +0200244
245 try:
246 with open("document.txt") as f:
247 content = f.read()
248 except FileNotFoundError:
249 print("document.txt file is missing")
250 except PermissionError:
251 print("You are not allowed to read document.txt")
252
253
Nick Coghlan1f7ce622012-01-13 21:43:40 +1000254PEP 380: Syntax for Delegating to a Subgenerator
255================================================
256
Nick Coghlanab7bf212012-02-26 17:49:52 +1000257:pep:`380` - Syntax for Delegating to a Subgenerator
258 PEP written by Greg Ewing.
259
Nick Coghlan1f7ce622012-01-13 21:43:40 +1000260PEP 380 adds the ``yield from`` expression, allowing a generator to delegate
261part of its operations to another generator. This allows a section of code
262containing 'yield' to be factored out and placed in another generator.
263Additionally, the subgenerator is allowed to return with a value, and the
264value is made available to the delegating generator.
Nick Coghlanb9b281b2012-03-06 22:31:12 +1000265
Nick Coghlan1f7ce622012-01-13 21:43:40 +1000266While designed primarily for use in delegating to a subgenerator, the ``yield
267from`` expression actually allows delegation to arbitrary subiterators.
268
Nick Coghlanb9b281b2012-03-06 22:31:12 +1000269For simple iterators, ``yield from iterable`` is essentially just a shortened
270form of ``for item in iterable: yield item``::
271
272 >>> def g(x):
273 ... yield from range(x, 0, -1)
274 ... yield from range(x)
275 ...
276 >>> list(g(5))
277 [5, 4, 3, 2, 1, 0, 1, 2, 3, 4]
278
279However, unlike an ordinary loop, ``yield from`` allows subgenerators to
280receive sent and thrown values directly from the calling scope, and
281return a final value to the outer generator::
282
283 >>> def accumulate(start=0):
284 ... tally = start
285 ... while 1:
286 ... next = yield
287 ... if next is None:
288 ... return tally
289 ... tally += next
290 ...
291 >>> def gather_tallies(tallies, start=0):
292 ... while 1:
293 ... tally = yield from accumulate()
294 ... tallies.append(tally)
295 ...
296 >>> tallies = []
297 >>> acc = gather_tallies(tallies)
298 >>> next(acc) # Ensure the accumulator is ready to accept values
299 >>> for i in range(10):
300 ... acc.send(i)
301 ...
302 >>> acc.send(None) # Finish the first tally
303 >>> for i in range(5):
304 ... acc.send(i)
305 ...
306 >>> acc.send(None) # Finish the second tally
307 >>> tallies
308 [45, 10]
309
310The main principle driving this change is to allow even generators that are
311designed to be used with the ``send`` and ``throw`` methods to be split into
312multiple subgenerators as easily as a single large function can be split into
313multiple subfunctions.
314
Nick Coghlan1f7ce622012-01-13 21:43:40 +1000315(Implementation by Greg Ewing, integrated into 3.3 by Renaud Blanch, Ryan
316Kelly and Nick Coghlan, documentation by Zbigniew Jędrzejewski-Szmek and
317Nick Coghlan)
318
319
Nick Coghlanab7bf212012-02-26 17:49:52 +1000320PEP 409: Suppressing exception context
321======================================
322
323:pep:`409` - Suppressing exception context
324 PEP written by Ethan Furman, implemented by Ethan Furman and Nick Coghlan.
325
326PEP 409 introduces new syntax that allows the display of the chained
327exception context to be disabled. This allows cleaner error messages in
328applications that convert between exception types::
329
330 >>> class D:
331 ... def __init__(self, extra):
332 ... self._extra_attributes = extra
333 ... def __getattr__(self, attr):
334 ... try:
335 ... return self._extra_attributes[attr]
336 ... except KeyError:
337 ... raise AttributeError(attr) from None
338 ...
339 >>> D({}).x
340 Traceback (most recent call last):
341 File "<stdin>", line 1, in <module>
342 File "<stdin>", line 8, in __getattr__
343 AttributeError: x
344
345Without the ``from None`` suffix to suppress the cause, the original
346exception would be displayed by default::
347
348 >>> class C:
349 ... def __init__(self, extra):
350 ... self._extra_attributes = extra
351 ... def __getattr__(self, attr):
352 ... try:
353 ... return self._extra_attributes[attr]
354 ... except KeyError:
355 ... raise AttributeError(attr)
356 ...
357 >>> C({}).x
358 Traceback (most recent call last):
359 File "<stdin>", line 6, in __getattr__
360 KeyError: 'x'
361
362 During handling of the above exception, another exception occurred:
363
364 Traceback (most recent call last):
365 File "<stdin>", line 1, in <module>
366 File "<stdin>", line 8, in __getattr__
367 AttributeError: x
368
369No debugging capability is lost, as the original exception context remains
370available if needed (for example, if an intervening library has incorrectly
371suppressed valuable underlying details)::
372
373 >>> try:
374 ... D({}).x
375 ... except AttributeError as exc:
376 ... print(repr(exc.__context__))
377 ...
378 KeyError('x',)
379
380
Nick Coghlan98e20702012-03-06 21:50:13 +1000381PEP 414: Explicit Unicode literals
382======================================
383
384:pep:`414` - Explicit Unicode literals
385 PEP written by Armin Ronacher.
386
387To ease the transition from Python 2 for Unicode aware Python applications
388that make heavy use of Unicode literals, Python 3.3 once again supports the
389"``u``" prefix for string literals. This prefix has no semantic significance
390in Python 3, it is provided solely to reduce the number of purely mechanical
391changes in migrating to Python 3, making it easier for developers to focus on
392the more significant semantic changes (such as the stricter default
393separation of binary and text data).
394
395
Antoine Pitrou6bbd76b2011-11-25 19:10:05 +0100396PEP 3155: Qualified name for classes and functions
397==================================================
398
399:pep:`3155` - Qualified name for classes and functions
400 PEP written and implemented by Antoine Pitrou.
401
402Functions and class objects have a new ``__qualname__`` attribute representing
403the "path" from the module top-level to their definition. For global functions
404and classes, this is the same as ``__name__``. For other functions and classes,
405it provides better information about where they were actually defined, and
406how they might be accessible from the global scope.
407
408Example with (non-bound) methods::
Nick Coghlan2dfe6b02012-01-14 14:19:49 +1000409
Antoine Pitrou6bbd76b2011-11-25 19:10:05 +0100410 >>> class C:
411 ... def meth(self):
412 ... pass
413 >>> C.meth.__name__
414 'meth'
415 >>> C.meth.__qualname__
416 'C.meth'
417
418Example with nested classes::
419
420 >>> class C:
421 ... class D:
422 ... def meth(self):
423 ... pass
424 ...
425 >>> C.D.__name__
426 'D'
427 >>> C.D.__qualname__
428 'C.D'
429 >>> C.D.meth.__name__
430 'meth'
431 >>> C.D.meth.__qualname__
432 'C.D.meth'
433
434Example with nested functions::
435
436 >>> def outer():
437 ... def inner():
438 ... pass
439 ... return inner
440 ...
441 >>> outer().__name__
442 'inner'
443 >>> outer().__qualname__
444 'outer.<locals>.inner'
445
Antoine Pitroue7ede062011-11-25 19:11:26 +0100446The string representation of those objects is also changed to include the
Antoine Pitrou6bbd76b2011-11-25 19:10:05 +0100447new, more precise information::
448
449 >>> str(C.D)
450 "<class '__main__.C.D'>"
451 >>> str(C.D.meth)
452 '<function C.D.meth at 0x7f46b9fe31e0>'
453
454
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000455Other Language Changes
456======================
457
458Some smaller changes made to the core Python language are:
459
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100460* Added support for Unicode name aliases and named sequences.
461 Both :func:`unicodedata.lookup()` and ``'\N{...}'`` now resolve name aliases,
462 and :func:`unicodedata.lookup()` resolves named sequences too.
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000463
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100464 (Contributed by Ezio Melotti in :issue:`12753`)
Ezio Melotti931b8aa2011-10-21 21:57:36 +0300465
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100466* Equality comparisons on :func:`range` objects now return a result reflecting
467 the equality of the underlying sequences generated by those range objects.
Ezio Melotti931b8aa2011-10-21 21:57:36 +0300468
Sandro Tosicd899122012-01-22 12:16:04 +0100469 (:issue:`13201`)
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000470
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100471* The ``count()``, ``find()``, ``rfind()``, ``index()`` and ``rindex()``
472 methods of :class:`bytes` and :class:`bytearray` objects now accept an
473 integer between 0 and 255 as their first argument.
Mark Dickinson36645682011-10-23 19:53:01 +0100474
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100475 (:issue:`12170`)
Mark Dickinson36645682011-10-23 19:53:01 +0100476
Victor Stinner8c43e692012-03-09 14:04:01 +0100477* A dict lookup now raises a :exc:`RuntimeError` if the dict is modified during
478 the lookup. If you implement your own comparaison function for objects used
479 as dict keys and the dict is shared by multiple threads, access to the dict
480 should be protected by a lock.
481
482 (:issue:`14205`)
483
Petri Lehtinen61ea8a02011-11-24 22:00:46 +0200484
Victor Stinner46606ce2011-11-20 18:27:55 +0100485New and Improved Modules
486========================
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000487
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100488abc
489---
490
491Improved support for abstract base classes containing descriptors composed with
492abstract methods. The recommended approach to declaring abstract descriptors is
493now to provide :attr:`__isabstractmethod__` as a dynamically updated
494property. The built-in descriptors have been updated accordingly.
495
496 * :class:`abc.abstractproperty` has been deprecated, use :class:`property`
497 with :func:`abc.abstractmethod` instead.
498 * :class:`abc.abstractclassmethod` has been deprecated, use
499 :class:`classmethod` with :func:`abc.abstractmethod` instead.
500 * :class:`abc.abstractstaticmethod` has been deprecated, use
501 :class:`staticmethod` with :func:`abc.abstractmethod` instead.
502
503(Contributed by Darren Dale in :issue:`11610`)
504
Meador Ingec5dbb3d2011-09-20 21:48:16 -0500505array
506-----
507
508The :mod:`array` module supports the :c:type:`long long` type using ``q`` and
509``Q`` type codes.
510
511(Contributed by Oren Tirosh and Hirokazu Yamamoto in :issue:`1172711`)
512
513
Nadeem Vawdad7e5c6e2012-02-12 01:34:18 +0200514bz2
515---
516
517The :mod:`bz2` module has been rewritten from scratch. In the process, several
518new features have been added:
519
520* :class:`bz2.BZ2File` can now read from and write to arbitrary file-like
521 objects, by means of its constructor's *fileobj* argument.
522
523 (Contributed by Nadeem Vawda in :issue:`5863`)
524
525* :class:`bz2.BZ2File` and :func:`bz2.decompress` can now decompress
526 multi-stream inputs (such as those produced by the :program:`pbzip2` tool).
527 :class:`bz2.BZ2File` can now also be used to create this type of file, using
528 the ``'a'`` (append) mode.
529
530 (Contributed by Nir Aides in :issue:`1625`)
531
532* :class:`bz2.BZ2File` now implements all of the :class:`io.BufferedIOBase` API,
533 except for the :meth:`detach` and :meth:`truncate` methods.
534
535
Victor Stinner2cded9c2011-07-08 01:45:13 +0200536codecs
537------
538
Antoine Pitrou4f863432012-02-12 02:12:47 +0100539The :mod:`~encodings.mbcs` codec has been rewritten to handle correctly
Georg Brandlff962c52012-02-04 08:55:56 +0100540``replace`` and ``ignore`` error handlers on all Windows versions. The
541:mod:`~encodings.mbcs` codec now supports all error handlers, instead of only
542``replace`` to encode and ``ignore`` to decode.
Victor Stinner3a50e702011-10-18 21:21:00 +0200543
Georg Brandlff962c52012-02-04 08:55:56 +0100544A new Windows-only codec has been added: ``cp65001`` (:issue:`13216`). It is the
545Windows code page 65001 (Windows UTF-8, ``CP_UTF8``). For example, it is used
546by ``sys.stdout`` if the console output code page is set to cp65001 (e.g., using
547``chcp 65001`` command).
Victor Stinner2f3ca9f2011-10-27 01:38:56 +0200548
Georg Brandlff962c52012-02-04 08:55:56 +0100549Multibyte CJK decoders now resynchronize faster. They only ignore the first
Georg Brandl6c0929b2011-07-09 11:43:33 +0200550byte of an invalid byte sequence. For example, ``b'\xff\n'.decode('gb2312',
551'replace')`` now returns a ``\n`` after the replacement character.
Victor Stinner2cded9c2011-07-08 01:45:13 +0200552
Georg Brandl6c0929b2011-07-09 11:43:33 +0200553(:issue:`12016`)
Victor Stinner2cded9c2011-07-08 01:45:13 +0200554
Georg Brandlff962c52012-02-04 08:55:56 +0100555Incremental CJK codec encoders are no longer reset at each call to their
556encode() methods. For example::
Victor Stinner2cded9c2011-07-08 01:45:13 +0200557
558 $ ./python -q
559 >>> import codecs
560 >>> encoder = codecs.getincrementalencoder('hz')('strict')
561 >>> b''.join(encoder.encode(x) for x in '\u52ff\u65bd\u65bc\u4eba\u3002 Bye.')
562 b'~{NpJ)l6HK!#~} Bye.'
563
Georg Brandl6c0929b2011-07-09 11:43:33 +0200564This example gives ``b'~{Np~}~{J)~}~{l6~}~{HK~}~{!#~} Bye.'`` with older Python
Victor Stinner2cded9c2011-07-08 01:45:13 +0200565versions.
566
Georg Brandl6c0929b2011-07-09 11:43:33 +0200567(:issue:`12100`)
Victor Stinner2cded9c2011-07-08 01:45:13 +0200568
Victor Stinner9f4b1e92011-11-10 20:56:30 +0100569The ``unicode_internal`` codec has been deprecated.
570
Éric Araujo84b8ed82011-08-29 21:42:47 +0200571crypt
572-----
573
Victor Stinnerc78fb332011-09-21 03:35:44 +0200574Addition of salt and modular crypt format and the :func:`~crypt.mksalt`
575function to the :mod:`crypt` module.
Éric Araujo84b8ed82011-08-29 21:42:47 +0200576
577(:issue:`10924`)
578
Victor Stinnera7878b72011-07-14 23:07:44 +0200579curses
580------
581
Victor Stinner0fdfceb2011-11-25 22:10:02 +0100582 * If the :mod:`curses` module is linked to the ncursesw library, use Unicode
583 functions when Unicode strings or characters are passed (e.g.
584 :c:func:`waddwstr`), and bytes functions otherwise (e.g. :c:func:`waddstr`).
585 * Use the locale encoding instead of ``utf-8`` to encode Unicode strings.
586 * :class:`curses.window` has a new :attr:`curses.window.encoding` attribute.
Victor Stinnerc78fb332011-09-21 03:35:44 +0200587 * The :class:`curses.window` class has a new :meth:`~curses.window.get_wch`
588 method to get a wide character
589 * The :mod:`curses` module has a new :meth:`~curses.unget_wch` function to
590 push a wide character so the next :meth:`~curses.window.get_wch` will return
591 it
Victor Stinnera7878b72011-07-14 23:07:44 +0200592
Victor Stinnerc78fb332011-09-21 03:35:44 +0200593(Contributed by Iñigo Serna in :issue:`6755`)
Victor Stinnera7878b72011-07-14 23:07:44 +0200594
Victor Stinner024e37a2011-03-31 01:31:06 +0200595faulthandler
596------------
597
598New module: :mod:`faulthandler`.
599
600 * :envvar:`PYTHONFAULTHANDLER`
601 * :option:`-X` ``faulthandler``
602
Victor Stinner811db3b2011-09-21 03:20:03 +0200603ftplib
604------
605
606The :class:`~ftplib.FTP_TLS` class now provides a new
607:func:`~ftplib.FTP_TLS.ccc` function to revert control channel back to
Florent Xicluna6d57d212011-10-23 22:23:57 +0200608plaintext. This can be useful to take advantage of firewalls that know how to
Victor Stinner811db3b2011-09-21 03:20:03 +0200609handle NAT with non-secure FTP without opening fixed ports.
610
611(Contributed by Giampaolo Rodolà in :issue:`12139`)
612
613
Antoine Pitrou5a8bc6f2011-11-17 02:20:48 +0100614imaplib
615-------
616
617The :class:`~imaplib.IMAP4_SSL` constructor now accepts an SSLContext
618parameter to control parameters of the secure channel.
619
620(Contributed by Sijin Joseph in :issue:`8808`)
621
622
Charles-François Natalidc3044c2012-01-09 22:40:02 +0100623io
624--
625
Charles-François Natalid612de12012-01-14 11:51:00 +0100626The :func:`~io.open` function has a new ``'x'`` mode that can be used to
627exclusively create a new file, and raise a :exc:`FileExistsError` if the file
628already exists. It is based on the C11 'x' mode to fopen().
Charles-François Natalidc3044c2012-01-09 22:40:02 +0100629
630(Contributed by David Townshend in :issue:`12760`)
631
632
Nadeem Vawda34599222011-12-09 01:32:46 +0200633lzma
634----
635
636The newly-added :mod:`lzma` module provides data compression and decompression
637using the LZMA algorithm, including support for the ``.xz`` and ``.lzma``
638file formats.
639
640(Contributed by Nadeem Vawda and Per Øyvind Karlsen in :issue:`6715`)
641
642
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200643math
644----
645
646The :mod:`math` module has a new function:
647
648 * :func:`~math.log2`: return the base-2 logarithm of *x*
649 (Written by Mark Dickinson in :issue:`11888`).
650
651
652nntplib
653-------
654
655The :class:`nntplib.NNTP` class now supports the context manager protocol to
656unconditionally consume :exc:`socket.error` exceptions and to close the NNTP
657connection when done::
658
659 >>> from nntplib import NNTP
Ezio Melotti3c14b4e2011-07-13 11:44:44 +0300660 >>> with NNTP('news.gmane.org') as n:
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200661 ... n.group('gmane.comp.python.committers')
662 ...
Ezio Melotti04f648c2011-07-26 09:37:46 +0300663 ('211 1755 1 1755 gmane.comp.python.committers', 1755, 1, 1755, 'gmane.comp.python.committers')
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200664 >>>
665
666(Contributed by Giampaolo Rodolà in :issue:`9795`)
667
668
Giampaolo Rodolàc9c2c8b2011-02-25 14:39:16 +0000669os
670--
671
Charles-François Natalia003af12011-06-01 20:30:52 +0200672* The :mod:`os` module has a new :func:`~os.pipe2` function that makes it
673 possible to create a pipe with :data:`~os.O_CLOEXEC` or
674 :data:`~os.O_NONBLOCK` flags set atomically. This is especially useful to
675 avoid race conditions in multi-threaded programs.
676
Giampaolo Rodolà18e8bcb2011-02-25 20:57:54 +0000677* The :mod:`os` module has a new :func:`~os.sendfile` function which provides
678 an efficent "zero-copy" way for copying data from one file (or socket)
679 descriptor to another. The phrase "zero-copy" refers to the fact that all of
680 the copying of data between the two descriptors is done entirely by the
681 kernel, with no copying of data into userspace buffers. :func:`~os.sendfile`
682 can be used to efficiently copy data from a file on disk to a network socket,
683 e.g. for downloading a file.
Giampaolo Rodolàc9c2c8b2011-02-25 14:39:16 +0000684
Giampaolo Rodolà18e8bcb2011-02-25 20:57:54 +0000685 (Patch submitted by Ross Lagerwall and Giampaolo Rodolà in :issue:`10882`.)
686
687* The :mod:`os` module has two new functions: :func:`~os.getpriority` and
688 :func:`~os.setpriority`. They can be used to get or set process
689 niceness/priority in a fashion similar to :func:`os.nice` but extended to all
690 processes instead of just the current one.
691
692 (Patch submitted by Giampaolo Rodolà in :issue:`10784`.)
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000693
Charles-François Natali7372b062012-02-05 15:15:38 +0100694* The :mod:`os` module has a new :func:`~os.fwalk` function similar to
695 :func:`~os.walk` except that it also yields file descriptors referring to the
696 directories visited. This is especially useful to avoid symlink races.
697
Victor Stinnere5064372011-10-14 00:08:29 +0200698* "at" functions (:issue:`4761`):
699
700 * :func:`~os.faccessat`
701 * :func:`~os.fchmodat`
702 * :func:`~os.fchownat`
703 * :func:`~os.fstatat`
704 * :func:`~os.futimesat`
Victor Stinnere5064372011-10-14 00:08:29 +0200705 * :func:`~os.linkat`
706 * :func:`~os.mkdirat`
707 * :func:`~os.mkfifoat`
708 * :func:`~os.mknodat`
709 * :func:`~os.openat`
710 * :func:`~os.readlinkat`
711 * :func:`~os.renameat`
712 * :func:`~os.symlinkat`
713 * :func:`~os.unlinkat`
714 * :func:`~os.utimensat`
Victor Stinnere5064372011-10-14 00:08:29 +0200715
716* extended attributes (:issue:`12720`):
717
718 * :func:`~os.fgetxattr`
719 * :func:`~os.flistxattr`
720 * :func:`~os.fremovexattr`
721 * :func:`~os.fsetxattr`
722 * :func:`~os.getxattr`
723 * :func:`~os.lgetxattr`
724 * :func:`~os.listxattr`
725 * :func:`~os.llistxattr`
726 * :func:`~os.lremovexattr`
727 * :func:`~os.lsetxattr`
728 * :func:`~os.removexattr`
729 * :func:`~os.setxattr`
730
731* Scheduler functions (:issue:`12655`):
732
733 * :func:`~os.sched_get_priority_max`
734 * :func:`~os.sched_get_priority_min`
735 * :func:`~os.sched_getaffinity`
736 * :func:`~os.sched_getparam`
737 * :func:`~os.sched_getscheduler`
738 * :func:`~os.sched_rr_get_interval`
739 * :func:`~os.sched_setaffinity`
740 * :func:`~os.sched_setparam`
741 * :func:`~os.sched_setscheduler`
742 * :func:`~os.sched_yield`
743
744* Add some extra posix functions to the os module (:issue:`10812`):
745
746 * :func:`~os.fexecve`
747 * :func:`~os.futimens`
Victor Stinnere5064372011-10-14 00:08:29 +0200748 * :func:`~os.futimes`
749 * :func:`~os.lockf`
750 * :func:`~os.lutimes`
Victor Stinnere5064372011-10-14 00:08:29 +0200751 * :func:`~os.posix_fadvise`
752 * :func:`~os.posix_fallocate`
753 * :func:`~os.pread`
754 * :func:`~os.pwrite`
755 * :func:`~os.readv`
756 * :func:`~os.sync`
757 * :func:`~os.truncate`
758 * :func:`~os.waitid`
759 * :func:`~os.writev`
760
761* Other new functions:
762
Charles-François Natali77940902012-02-06 19:54:48 +0100763 * :func:`~os.flistdir` (:issue:`10755`)
Victor Stinnere5064372011-10-14 00:08:29 +0200764 * :func:`~os.getgrouplist` (:issue:`9344`)
765
Giampaolo Rodolà424298a2011-03-03 18:34:06 +0000766
Éric Araujo765e94f2011-06-03 17:26:59 +0200767packaging
768---------
769
770:mod:`distutils` has undergone additions and refactoring under a new name,
771:mod:`packaging`, to allow developers to break backward compatibility.
772:mod:`distutils` is still provided in the standard library, but users are
773encouraged to transition to :mod:`packaging`. For older versions of Python, a
774backport compatible with 2.4+ and 3.1+ will be made available on PyPI under the
775name :mod:`distutils2`.
776
777.. TODO add examples and howto to the packaging docs and link to them
778
779
Victor Stinner383c3fc2011-05-25 01:35:05 +0200780pydoc
781-----
782
Victor Stinner6daa33c2011-05-25 01:41:22 +0200783The Tk GUI and the :func:`~pydoc.serve` function have been removed from the
784:mod:`pydoc` module: ``pydoc -g`` and :func:`~pydoc.serve` have been deprecated
785in Python 3.2.
Victor Stinner383c3fc2011-05-25 01:35:05 +0200786
787
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100788sched
789-----
Victor Stinner754851f2011-04-19 23:58:51 +0200790
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100791* :meth:`~sched.scheduler.run` now accepts a *blocking* parameter which when
792 set to False makes the method execute the scheduled events due to expire
793 soonest (if any) and then return immediately.
794 This is useful in case you want to use the :class:`~sched.scheduler` in
795 non-blocking applications. (Contributed by Giampaolo Rodolà in :issue:`13449`)
Victor Stinner754851f2011-04-19 23:58:51 +0200796
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100797* :class:`~sched.scheduler` class can now be safely used in multi-threaded
798 environments. (Contributed by Josiah Carlson and Giampaolo Rodolà in
799 :issue:`8684`)
800
801* *timefunc* and *delayfunct* parameters of :class:`~sched.scheduler` class
802 constructor are now optional and defaults to :func:`time.time` and
803 :func:`time.sleep` respectively. (Contributed by Chris Clark in
804 :issue:`13245`)
805
806* :meth:`~sched.scheduler.enter` and :meth:`~sched.scheduler.enterabs`
807 *argument* parameter is now optional. (Contributed by Chris Clark in
808 :issue:`13245`)
809
810* :meth:`~sched.scheduler.enter` and :meth:`~sched.scheduler.enterabs`
811 now accept a *kwargs* parameter. (Contributed by Chris Clark in
812 :issue:`13245`)
813
814
815shutil
816------
817
818* The :mod:`shutil` module has these new fuctions:
819
820 * :func:`~shutil.disk_usage`: provides total, used and free disk space
821 statistics. (Contributed by Giampaolo Rodolà in :issue:`12442`)
822 * :func:`~shutil.chown`: allows one to change user and/or group of the given
823 path also specifying the user/group names and not only their numeric
824 ids. (Contributed by Sandro Tosi in :issue:`12191`)
Victor Stinnera9293352011-04-30 15:21:58 +0200825
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200826
Victor Stinnera9293352011-04-30 15:21:58 +0200827signal
828------
829
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200830* The :mod:`signal` module has new functions:
Victor Stinnera9293352011-04-30 15:21:58 +0200831
Victor Stinnerb3e72192011-05-08 01:46:11 +0200832 * :func:`~signal.pthread_sigmask`: fetch and/or change the signal mask of the
833 calling thread (Contributed by Jean-Paul Calderone in :issue:`8407`) ;
834 * :func:`~signal.pthread_kill`: send a signal to a thread ;
835 * :func:`~signal.sigpending`: examine pending functions ;
836 * :func:`~signal.sigwait`: wait a signal.
Ross Lagerwallbc808222011-06-25 12:13:40 +0200837 * :func:`~signal.sigwaitinfo`: wait for a signal, returning detailed
838 information about it.
839 * :func:`~signal.sigtimedwait`: like :func:`~signal.sigwaitinfo` but with a
840 timeout.
Victor Stinnera9293352011-04-30 15:21:58 +0200841
Victor Stinnerd49b1f12011-05-08 02:03:15 +0200842* The signal handler writes the signal number as a single byte instead of
843 a nul byte into the wakeup file descriptor. So it is possible to wait more
844 than one signal and know which signals were raised.
845
Victor Stinner388196e2011-05-10 17:13:00 +0200846* :func:`signal.signal` and :func:`signal.siginterrupt` raise an OSError,
847 instead of a RuntimeError: OSError has an errno attribute.
848
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100849smtplib
850-------
851
852The :class:`~smtplib.SMTP_SSL` constructor and the :meth:`~smtplib.SMTP.starttls`
853method now accept an SSLContext parameter to control parameters of the secure
854channel.
855
856(Contributed by Kasun Herath in :issue:`8809`)
857
858
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000859socket
860------
861
Charles-François Natali47413c12011-10-06 19:47:44 +0200862* The :class:`~socket.socket` class now exposes additional methods to process
863 ancillary data when supported by the underlying platform:
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000864
Charles-François Natali47413c12011-10-06 19:47:44 +0200865 * :func:`~socket.socket.sendmsg`
866 * :func:`~socket.socket.recvmsg`
867 * :func:`~socket.socket.recvmsg_into`
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000868
Charles-François Natali47413c12011-10-06 19:47:44 +0200869 (Contributed by David Watson in :issue:`6560`, based on an earlier patch by
870 Heiko Wundram)
871
872* The :class:`~socket.socket` class now supports the PF_CAN protocol family
873 (http://en.wikipedia.org/wiki/Socketcan), on Linux
874 (http://lwn.net/Articles/253425).
875
876 (Contributed by Matthias Fuchs, updated by Tiago Gonçalves in :issue:`10141`)
877
Charles-François Natali10b8cf42011-11-10 19:21:37 +0100878* The :class:`~socket.socket` class now supports the PF_RDS protocol family
879 (http://en.wikipedia.org/wiki/Reliable_Datagram_Sockets and
880 http://oss.oracle.com/projects/rds/).
Victor Stinner754851f2011-04-19 23:58:51 +0200881
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100882
Victor Stinner99c8b162011-05-24 12:05:19 +0200883ssl
884---
885
Antoine Pitrou2c0a9672011-11-17 02:09:13 +0100886* The :mod:`ssl` module has two new random generation functions:
Victor Stinner99c8b162011-05-24 12:05:19 +0200887
888 * :func:`~ssl.RAND_bytes`: generate cryptographically strong
889 pseudo-random bytes.
890 * :func:`~ssl.RAND_pseudo_bytes`: generate pseudo-random bytes.
891
Antoine Pitrou2c0a9672011-11-17 02:09:13 +0100892 (Contributed by Victor Stinner in :issue:`12049`)
893
894* The :mod:`ssl` module now exposes a finer-grained exception hierarchy
895 in order to make it easier to inspect the various kinds of errors.
896
897 (Contributed by Antoine Pitrou in :issue:`11183`)
898
899* :meth:`~ssl.SSLContext.load_cert_chain` now accepts a *password* argument
900 to be used if the private key is encrypted.
901
902 (Contributed by Adam Simpkins in :issue:`12803`)
903
Antoine Pitrou73fc8142011-12-23 20:58:36 +0100904* Diffie-Hellman key exchange, both regular and Elliptic Curve-based, is
905 now supported through the :meth:`~ssl.SSLContext.load_dh_params` and
906 :meth:`~ssl.SSLContext.set_ecdh_curve` methods.
907
908 (Contributed by Antoine Pitrou in :issue:`13626` and :issue:`13627`)
909
Antoine Pitrou2c0a9672011-11-17 02:09:13 +0100910* SSL sockets have a new :meth:`~ssl.SSLSocket.get_channel_binding` method
911 allowing the implementation of certain authentication mechanisms such as
912 SCRAM-SHA-1-PLUS.
913
914 (Contributed by Jacek Konieczny in :issue:`12551`)
915
Antoine Pitrou73fc8142011-12-23 20:58:36 +0100916* You can query the SSL compression algorithm used by an SSL socket, thanks
917 to its new :meth:`~ssl.SSLSocket.compression` method.
918
919 (Contributed by Antoine Pitrou in :issue:`13634`)
920
921
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100922sys
923---
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200924
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100925* The :mod:`sys` module has a new :data:`~sys.thread_info` :term:`struct
926 sequence` holding informations about the thread implementation.
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200927
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100928 (:issue:`11223`)
Giampaolo Rodola'096dcb12011-06-27 11:17:51 +0200929
Antoine Pitrou5a8bc6f2011-11-17 02:20:48 +0100930
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100931time
932----
Antoine Pitrou5a8bc6f2011-11-17 02:20:48 +0100933
Victor Stinnerf4c54ff2012-02-08 01:48:34 +0100934The :mod:`time` module has new functions:
935
936* :func:`~time.clock_getres` and :func:`~time.clock_gettime` functions and
937 ``CLOCK_xxx`` constants.
938* :func:`~time.monotonic`: monotonic clock.
939* :func:`~time.wallclock`.
940
941(Contributed by Victor Stinner in :issue:`10278`)
942
Antoine Pitrou5a8bc6f2011-11-17 02:20:48 +0100943
Senthil Kumarande49d642011-10-16 23:54:44 +0800944urllib
945------
946
947The :class:`~urllib.request.Request` class, now accepts a *method* argument
948used by :meth:`~urllib.request.Request.get_method` to determine what HTTP method
Senthil Kumarana41c9422011-10-20 02:37:08 +0800949should be used. For example, this will send a ``'HEAD'`` request::
Senthil Kumarande49d642011-10-16 23:54:44 +0800950
951 >>> urlopen(Request('http://www.python.org', method='HEAD'))
952
953(:issue:`1673007`)
Giampaolo Rodola'096dcb12011-06-27 11:17:51 +0200954
Giampaolo Rodola'be55d992011-11-22 13:33:34 +0100955
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000956Optimizations
957=============
958
959Major performance enhancements have been added:
960
Victor Stinner46606ce2011-11-20 18:27:55 +0100961* Thanks to the :pep:`393`, some operations on Unicode strings has been optimized:
962
963 * the memory footprint is divided by 2 to 4 depending on the text
Victor Stinnera996f1e2011-11-21 13:14:43 +0100964 * encode an ASCII string to UTF-8 doesn't need to encode characters anymore,
965 the UTF-8 representation is shared with the ASCII representation
Victor Stinner6099a032011-12-18 14:22:26 +0100966 * the UTF-8 encoder has been optimized
967 * repeating a single ASCII letter and getting a substring of a ASCII strings
968 is 4 times faster
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000969
970
971Build and C API Changes
972=======================
973
974Changes to Python's build process and to the C API include:
975
Stefan Krah95b1ba62012-02-29 17:27:21 +0100976* New :pep:`3118` related function:
977
978 * :c:func:`PyMemoryView_FromMemory`
979
Victor Stinner46606ce2011-11-20 18:27:55 +0100980* The :pep:`393` added new Unicode types, macros and functions:
981
Victor Stinnera996f1e2011-11-21 13:14:43 +0100982 * High-level API:
983
984 * :c:func:`PyUnicode_CopyCharacters`
985 * :c:func:`PyUnicode_FindChar`
986 * :c:func:`PyUnicode_GetLength`, :c:macro:`PyUnicode_GET_LENGTH`
987 * :c:func:`PyUnicode_New`
988 * :c:func:`PyUnicode_Substring`
989 * :c:func:`PyUnicode_ReadChar`, :c:func:`PyUnicode_WriteChar`
990
991 * Low-level API:
992
993 * :c:type:`Py_UCS1`, :c:type:`Py_UCS2`, :c:type:`Py_UCS4` types
994 * :c:type:`PyASCIIObject` and :c:type:`PyCompactUnicodeObject` structures
995 * :c:macro:`PyUnicode_READY`
996 * :c:func:`PyUnicode_FromKindAndData`
997 * :c:func:`PyUnicode_AsUCS4`, :c:func:`PyUnicode_AsUCS4Copy`
998 * :c:macro:`PyUnicode_DATA`, :c:macro:`PyUnicode_1BYTE_DATA`,
999 :c:macro:`PyUnicode_2BYTE_DATA`, :c:macro:`PyUnicode_4BYTE_DATA`
1000 * :c:macro:`PyUnicode_KIND` with :c:type:`PyUnicode_Kind` enum:
1001 :c:data:`PyUnicode_WCHAR_KIND`, :c:data:`PyUnicode_1BYTE_KIND`,
1002 :c:data:`PyUnicode_2BYTE_KIND`, :c:data:`PyUnicode_4BYTE_KIND`
1003 * :c:macro:`PyUnicode_READ`, :c:macro:`PyUnicode_READ_CHAR`, :c:macro:`PyUnicode_WRITE`
1004 * :c:macro:`PyUnicode_MAX_CHAR_VALUE`
1005
Giampaolo Rodolà3108f982011-02-24 20:59:48 +00001006
1007
Victor Stinnerd1be8782011-12-09 00:10:41 +01001008Deprecated
1009==========
1010
Georg Brandl0cd25c92011-04-29 13:45:54 +02001011Unsupported Operating Systems
Victor Stinnerd1be8782011-12-09 00:10:41 +01001012-----------------------------
Victor Stinnerb90db4c2011-04-26 22:48:24 +02001013
Brian Curtin49a40cd2011-05-02 22:30:06 -05001014OS/2 and VMS are no longer supported due to the lack of a maintainer.
1015
1016Windows 2000 and Windows platforms which set ``COMSPEC`` to ``command.com``
1017are no longer supported due to maintenance burden.
Victor Stinnerb90db4c2011-04-26 22:48:24 +02001018
1019
Victor Stinner46606ce2011-11-20 18:27:55 +01001020Deprecated Python modules, functions and methods
Victor Stinnerd1be8782011-12-09 00:10:41 +01001021------------------------------------------------
Victor Stinner19bd0692011-11-16 00:18:57 +01001022
1023* The :mod:`packaging` module replaces the :mod:`distutils` module
1024* The ``unicode_internal`` codec has been deprecated because of the
Sandro Tosicd899122012-01-22 12:16:04 +01001025 :pep:`393`, use UTF-8, UTF-16 (``utf-16-le`` or ``utf-16-be``), or UTF-32
1026 (``utf-32-le`` or ``utf-32-be``)
Victor Stinner19bd0692011-11-16 00:18:57 +01001027* :meth:`ftplib.FTP.nlst` and :meth:`ftplib.FTP.dir`: use
Victor Stinner46606ce2011-11-20 18:27:55 +01001028 :meth:`ftplib.FTP.mlsd`
Victor Stinner19bd0692011-11-16 00:18:57 +01001029* :func:`platform.popen`: use the :mod:`subprocess` module. Check especially
1030 the :ref:`subprocess-replacements` section.
1031* :issue:`13374`: The Windows bytes API has been deprecated in the :mod:`os`
Victor Stinner46606ce2011-11-20 18:27:55 +01001032 module. Use Unicode filenames, instead of bytes filenames, to not depend on
Victor Stinner19bd0692011-11-16 00:18:57 +01001033 the ANSI code page anymore and to support any filename.
Florent Xiclunaa72a98f2012-02-13 11:03:30 +01001034* :issue:`13988`: The :mod:`xml.etree.cElementTree` module is deprecated. The
1035 accelerator is used automatically whenever available.
Victor Stinner19bd0692011-11-16 00:18:57 +01001036
1037
Victor Stinner46606ce2011-11-20 18:27:55 +01001038Deprecated functions and types of the C API
Victor Stinnerd1be8782011-12-09 00:10:41 +01001039-------------------------------------------
Victor Stinner46606ce2011-11-20 18:27:55 +01001040
1041The :c:type:`Py_UNICODE` has been deprecated by the :pep:`393` and will be
1042removed in Python 4. All functions using this type are deprecated:
1043
Victor Stinner46606ce2011-11-20 18:27:55 +01001044Unicode functions and methods using :c:type:`Py_UNICODE` and
1045:c:type:`Py_UNICODE*` types:
1046
1047 * :c:macro:`PyUnicode_FromUnicode`: use :c:func:`PyUnicode_FromWideChar` or
1048 :c:func:`PyUnicode_FromKindAndData`
1049 * :c:macro:`PyUnicode_AS_UNICODE`, :c:func:`PyUnicode_AsUnicode`,
1050 :c:func:`PyUnicode_AsUnicodeAndSize`: use :c:func:`PyUnicode_AsWideCharString`
1051 * :c:macro:`PyUnicode_AS_DATA`: use :c:macro:`PyUnicode_DATA` with
1052 :c:macro:`PyUnicode_READ` and :c:macro:`PyUnicode_WRITE`
1053 * :c:macro:`PyUnicode_GET_SIZE`, :c:func:`PyUnicode_GetSize`: use
1054 :c:macro:`PyUnicode_GET_LENGTH` or :c:func:`PyUnicode_GetLength`
1055 * :c:macro:`PyUnicode_GET_DATA_SIZE`: use
1056 ``PyUnicode_GET_LENGTH(str) * PyUnicode_KIND(str)`` (only work on ready
1057 strings)
Victor Stinnerbf6e5602011-12-12 01:53:47 +01001058 * :c:func:`PyUnicode_AsUnicodeCopy`: use :c:func:`PyUnicode_AsUCS4Copy` or
1059 :c:func:`PyUnicode_AsWideCharString`
Victor Stinnerab595942011-12-17 04:59:06 +01001060 * :c:func:`PyUnicode_GetMax`
1061
Victor Stinner46606ce2011-11-20 18:27:55 +01001062
Victor Stinnera996f1e2011-11-21 13:14:43 +01001063Functions and macros manipulating Py_UNICODE* strings:
1064
1065 * :c:macro:`Py_UNICODE_strlen`: use :c:func:`PyUnicode_GetLength` or
1066 :c:macro:`PyUnicode_GET_LENGTH`
1067 * :c:macro:`Py_UNICODE_strcat`: use :c:func:`PyUnicode_CopyCharacters` or
1068 :c:func:`PyUnicode_FromFormat`
1069 * :c:macro:`Py_UNICODE_strcpy`, :c:macro:`Py_UNICODE_strncpy`,
1070 :c:macro:`Py_UNICODE_COPY`: use :c:func:`PyUnicode_CopyCharacters` or
1071 :c:func:`PyUnicode_Substring`
1072 * :c:macro:`Py_UNICODE_strcmp`: use :c:func:`PyUnicode_Compare`
1073 * :c:macro:`Py_UNICODE_strncmp`: use :c:func:`PyUnicode_Tailmatch`
1074 * :c:macro:`Py_UNICODE_strchr`, :c:macro:`Py_UNICODE_strrchr`: use
1075 :c:func:`PyUnicode_FindChar`
Victor Stinner606e19d2012-01-04 03:59:16 +01001076 * :c:macro:`Py_UNICODE_FILL`: use :c:func:`PyUnicode_Fill`
Victor Stinnerab595942011-12-17 04:59:06 +01001077 * :c:macro:`Py_UNICODE_MATCH`
Victor Stinnera996f1e2011-11-21 13:14:43 +01001078
Victor Stinner46606ce2011-11-20 18:27:55 +01001079Encoders:
1080
1081 * :c:func:`PyUnicode_Encode`: use :c:func:`PyUnicode_AsEncodedObject`
1082 * :c:func:`PyUnicode_EncodeUTF7`
Victor Stinnera996f1e2011-11-21 13:14:43 +01001083 * :c:func:`PyUnicode_EncodeUTF8`: use :c:func:`PyUnicode_AsUTF8` or
1084 :c:func:`PyUnicode_AsUTF8String`
Victor Stinner46606ce2011-11-20 18:27:55 +01001085 * :c:func:`PyUnicode_EncodeUTF32`
1086 * :c:func:`PyUnicode_EncodeUTF16`
1087 * :c:func:`PyUnicode_EncodeUnicodeEscape:` use
1088 :c:func:`PyUnicode_AsUnicodeEscapeString`
1089 * :c:func:`PyUnicode_EncodeRawUnicodeEscape:` use
1090 :c:func:`PyUnicode_AsRawUnicodeEscapeString`
1091 * :c:func:`PyUnicode_EncodeLatin1`: use :c:func:`PyUnicode_AsLatin1String`
1092 * :c:func:`PyUnicode_EncodeASCII`: use :c:func:`PyUnicode_AsASCIIString`
1093 * :c:func:`PyUnicode_EncodeCharmap`
1094 * :c:func:`PyUnicode_TranslateCharmap`
1095 * :c:func:`PyUnicode_EncodeMBCS`: use :c:func:`PyUnicode_AsMBCSString` or
1096 :c:func:`PyUnicode_EncodeCodePage` (with ``CP_ACP`` code_page)
1097 * :c:func:`PyUnicode_EncodeDecimal`,
1098 :c:func:`PyUnicode_TransformDecimalToASCII`
1099
1100
Giampaolo Rodolà3108f982011-02-24 20:59:48 +00001101Porting to Python 3.3
1102=====================
1103
1104This section lists previously described changes and other bugfixes
Antoine Pitrou037ffbf2011-10-24 00:25:41 +02001105that may require changes to your code.
1106
1107Porting Python code
1108-------------------
Giampaolo Rodolà3108f982011-02-24 20:59:48 +00001109
Georg Brandld6c43402012-03-07 08:55:52 +01001110.. XXX add a point about hash randomization and that it's always on in 3.3
1111
Victor Stinner19bd0692011-11-16 00:18:57 +01001112* :issue:`12326`: On Linux, sys.platform doesn't contain the major version
Victor Stinnerff3d9392011-08-20 23:39:26 +02001113 anymore. It is now always 'linux', instead of 'linux2' or 'linux3' depending
1114 on the Linux version used to build Python. Replace sys.platform == 'linux2'
1115 with sys.platform.startswith('linux'), or directly sys.platform == 'linux' if
1116 you don't need to support older Python versions.
Éric Araujoc09fca62011-03-23 02:06:24 +01001117
Antoine Pitrou037ffbf2011-10-24 00:25:41 +02001118Porting C code
1119--------------
1120
Stefan Krah54c32032012-02-29 17:47:21 +01001121* In the course of changes to the buffer API the undocumented
1122 :c:member:`~Py_buffer.smalltable` member of the
1123 :c:type:`Py_buffer` structure has been removed and the
1124 layout of the :c:type:`PyMemoryViewObject` has changed.
1125
1126 All extensions relying on the relevant parts in ``memoryobject.h``
1127 or ``object.h`` must be rebuilt.
1128
Antoine Pitrou037ffbf2011-10-24 00:25:41 +02001129* Due to :ref:`PEP 393 <pep-393>`, the :c:type:`Py_UNICODE` type and all
1130 functions using this type are deprecated (but will stay available for
1131 at least five years). If you were using low-level Unicode APIs to
1132 construct and access unicode objects and you want to benefit of the
1133 memory footprint reduction provided by the PEP 393, you have to convert
1134 your code to the new :doc:`Unicode API <../c-api/unicode>`.
1135
1136 However, if you only have been using high-level functions such as
1137 :c:func:`PyUnicode_Concat()`, :c:func:`PyUnicode_Join` or
1138 :c:func:`PyUnicode_FromFormat()`, your code will automatically take
1139 advantage of the new unicode representations.
1140
Antoine Pitrouc229e6e2012-02-20 19:41:11 +01001141Building C extensions
1142---------------------
1143
1144* The range of possible file names for C extensions has been narrowed.
1145 Very rarely used spellings have been suppressed: under POSIX, files
1146 named ``xxxmodule.so``, ``xxxmodule.abi3.so`` and
1147 ``xxxmodule.cpython-*.so`` are no longer recognized as implementing
1148 the ``xxx`` module. If you had been generating such files, you have
1149 to switch to the other spellings (i.e., remove the ``module`` string
1150 from the file names).
1151
1152 (implemented in :issue:`14040`.)
1153
1154
Antoine Pitrou037ffbf2011-10-24 00:25:41 +02001155Other issues
1156------------
1157
Éric Araujoc09fca62011-03-23 02:06:24 +01001158.. Issue #11591: When :program:`python` was started with :option:`-S`,
1159 ``import site`` will not add site-specific paths to the module search
1160 paths. In previous versions, it did. See changeset for doc changes in
1161 various files. Contributed by Carl Meyer with editions by Éric Araujo.
Éric Araujobe3bd572011-03-26 01:55:15 +01001162
Éric Araujobfc97292011-11-14 18:18:15 +01001163.. Issue #10998: the -Q command-line flag and related artifacts have been
Éric Araujobe3bd572011-03-26 01:55:15 +01001164 removed. Code checking sys.flags.division_warning will need updating.
1165 Contributed by Éric Araujo.