blob: 56a16da0ae364f2d512a3bac02de68dc06fee785 [file] [log] [blame]
Giampaolo Rodolà3108f982011-02-24 20:59:48 +00001****************************
2 What's New In Python 3.3
3****************************
4
5:Author: Raymond Hettinger
6:Release: |release|
7:Date: |today|
8
Éric Araujob07b97f2011-10-05 01:03:34 +02009.. Rules for maintenance:
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000010
11 * Anyone can add text to this document. Do not spend very much time
12 on the wording of your changes, because your text will probably
13 get rewritten to some degree.
14
15 * The maintainer will go through Misc/NEWS periodically and add
16 changes; it's therefore more important to add your changes to
17 Misc/NEWS than to this file.
18
19 * This is not a complete list of every single change; completeness
20 is the purpose of Misc/NEWS. Some changes I consider too small
21 or esoteric to include. If such a change is added to the text,
22 I'll just remove it. (This is another reason you shouldn't spend
23 too much time on writing your addition.)
24
25 * If you want to draw your new text to the attention of the
26 maintainer, add 'XXX' to the beginning of the paragraph or
27 section.
28
29 * It's OK to just add a fragmentary note about a change. For
30 example: "XXX Describe the transmogrify() function added to the
31 socket module." The maintainer will research the change and
32 write the necessary text.
33
34 * You can comment out your additions if you like, but it's not
35 necessary (especially when a final release is some months away).
36
37 * Credit the author of a patch or bugfix. Just the name is
38 sufficient; the e-mail address isn't necessary.
39
40 * It's helpful to add the bug/patch number as a comment:
41
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000042 XXX Describe the transmogrify() function added to the socket
43 module.
Éric Araujob07b97f2011-10-05 01:03:34 +020044 (Contributed by P.Y. Developer in :issue:`12345`.)
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000045
Éric Araujob07b97f2011-10-05 01:03:34 +020046 This saves the maintainer the effort of going through the Mercurial log
Giampaolo Rodolà3108f982011-02-24 20:59:48 +000047 when researching a change.
48
49This article explains the new features in Python 3.3, compared to 3.2.
50
51
Antoine Pitrou037ffbf2011-10-24 00:25:41 +020052.. _pep-393:
53
Ezio Melotti48a2f8f2011-09-29 00:18:19 +030054PEP 393: Flexible String Representation
55=======================================
56
Antoine Pitroufd9b4162011-10-24 00:14:43 +020057The Unicode string type is changed to support multiple internal
58representations, depending on the character with the largest Unicode ordinal
59(1, 2, or 4 bytes) in the represented string. This allows a space-efficient
60representation in common cases, but gives access to full UCS-4 on all
61systems. For compatibility with existing APIs, several representations may
62exist in parallel; over time, this compatibility should be phased out.
Ezio Melotti397546a2011-09-29 08:34:36 +030063
Antoine Pitroufd9b4162011-10-24 00:14:43 +020064On the Python side, there should be no downside to this change.
Ezio Melotti397546a2011-09-29 08:34:36 +030065
Antoine Pitroufd9b4162011-10-24 00:14:43 +020066On the C API side, PEP 393 is fully backward compatible. The legacy API
67should remain available at least five years. Applications using the legacy
68API will not fully benefit of the memory reduction, or - worse - may use
69a bit more memory, because Python may have to maintain two versions of each
70string (in the legacy format and in the new efficient storage).
71
Antoine Pitrou0599b5b2011-11-29 22:45:07 +010072Functionality
73-------------
74
Antoine Pitroufd9b4162011-10-24 00:14:43 +020075Changes introduced by :pep:`393` are the following:
Ezio Melotti48a2f8f2011-09-29 00:18:19 +030076
Ezio Melotti397546a2011-09-29 08:34:36 +030077* Python now always supports the full range of Unicode codepoints, including
78 non-BMP ones (i.e. from ``U+0000`` to ``U+10FFFF``). The distinction between
79 narrow and wide builds no longer exists and Python now behaves like a wide
Antoine Pitroufd9b4162011-10-24 00:14:43 +020080 build, even under Windows.
Ezio Melotti397546a2011-09-29 08:34:36 +030081
Antoine Pitroufd9b4162011-10-24 00:14:43 +020082* With the death of narrow builds, the problems specific to narrow builds have
83 also been fixed, for example:
Ezio Melotti397546a2011-09-29 08:34:36 +030084
85 * :func:`len` now always returns 1 for non-BMP characters,
86 so ``len('\U0010FFFF') == 1``;
87
88 * surrogate pairs are not recombined in string literals,
89 so ``'\uDBFF\uDFFF' != '\U0010FFFF'``;
90
Antoine Pitroufd9b4162011-10-24 00:14:43 +020091 * indexing or slicing non-BMP characters returns the expected value,
Ezio Melotti397546a2011-09-29 08:34:36 +030092 so ``'\U0010FFFF'[0]`` now returns ``'\U0010FFFF'`` and not ``'\uDBFF'``;
93
Antoine Pitroud136aec2011-11-17 01:48:06 +010094 * all other functions in the standard library now correctly handle
Antoine Pitroufd9b4162011-10-24 00:14:43 +020095 non-BMP codepoints.
Ezio Melotti397546a2011-09-29 08:34:36 +030096
Ezio Melotti48a2f8f2011-09-29 00:18:19 +030097* The value of :data:`sys.maxunicode` is now always ``1114111`` (``0x10FFFF``
98 in hexadecimal). The :c:func:`PyUnicode_GetMax` function still returns
99 either ``0xFFFF`` or ``0x10FFFF`` for backward compatibility, and it should
100 not be used with the new Unicode API (see :issue:`13054`).
101
Ezio Melotti397546a2011-09-29 08:34:36 +0300102* The :file:`./configure` flag ``--with-wide-unicode`` has been removed.
Victor Stinner7d637ab2011-09-29 02:56:16 +0200103
Antoine Pitrou0599b5b2011-11-29 22:45:07 +0100104Performance and resource usage
105------------------------------
106
107The storage of Unicode strings now depends on the highest codepoint in the string:
108
109* pure ASCII and Latin1 strings (``U+0000-U+00FF``) use 1 byte per codepoint;
110
111* BMP strings (``U+0000-U+FFFF``) use 2 bytes per codepoint;
112
113* non-BMP strings (``U+10000-U+10FFFF``) use 4 bytes per codepoint.
114
115The net effect is that for most applications, memory usage of string storage
116should decrease significantly - especially compared to former wide unicode
117builds - as, in many cases, strings will be pure ASCII even in international
118contexts (because many strings store non-human language data, such as XML
119fragments, HTTP headers, JSON-encoded data, etc.). We also hope that it
120will, for the same reasons, increase CPU cache efficiency on non-trivial
121applications.
122
123.. The memory usage of Python 3.3 is two to three times smaller than Python 3.2,
124 and a little bit better than Python 2.7, on a `Django benchmark
125 <http://mail.python.org/pipermail/python-dev/2011-September/113714.html>`_.
126 XXX The result should be moved in the PEP and a link to the PEP should
127 be added here.
128
Éric Araujob07b97f2011-10-05 01:03:34 +0200129
Victor Stinnera1bf2982011-10-12 20:35:02 +0200130PEP 3151: Reworking the OS and IO exception hierarchy
131=====================================================
132
133:pep:`3151` - Reworking the OS and IO exception hierarchy
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200134 PEP written and implemented by Antoine Pitrou.
Victor Stinnera1bf2982011-10-12 20:35:02 +0200135
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200136The hierarchy of exceptions raised by operating system errors is now both
137simplified and finer-grained.
Victor Stinnera1bf2982011-10-12 20:35:02 +0200138
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200139You don't have to worry anymore about choosing the appropriate exception
140type between :exc:`OSError`, :exc:`IOError`, :exc:`EnvironmentError`,
141:exc:`WindowsError`, :exc:`mmap.error`, :exc:`socket.error` or
142:exc:`select.error`. All these exception types are now only one:
143:exc:`OSError`. The other names are kept as aliases for compatibility
144reasons.
Victor Stinnera1bf2982011-10-12 20:35:02 +0200145
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200146Also, it is now easier to catch a specific error condition. Instead of
147inspecting the ``errno`` attribute (or ``args[0]``) for a particular
148constant from the :mod:`errno` module, you can catch the adequate
149:exc:`OSError` subclass. The available subclasses are the following:
Victor Stinnera1bf2982011-10-12 20:35:02 +0200150
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200151* :exc:`BlockingIOError`
152* :exc:`ChildProcessError`
153* :exc:`ConnectionError`
154* :exc:`FileExistsError`
155* :exc:`FileNotFoundError`
156* :exc:`InterruptedError`
157* :exc:`IsADirectoryError`
158* :exc:`NotADirectoryError`
159* :exc:`PermissionError`
160* :exc:`ProcessLookupError`
161* :exc:`TimeoutError`
Victor Stinnera1bf2982011-10-12 20:35:02 +0200162
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200163And the :exc:`ConnectionError` itself has finer-grained subclasses:
Victor Stinnera1bf2982011-10-12 20:35:02 +0200164
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200165* :exc:`BrokenPipeError`
166* :exc:`ConnectionAbortedError`
167* :exc:`ConnectionRefusedError`
168* :exc:`ConnectionResetError`
Victor Stinnera1bf2982011-10-12 20:35:02 +0200169
170Thanks to the new exceptions, common usages of the :mod:`errno` can now be
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200171avoided. For example, the following code written for Python 3.2::
Victor Stinnera1bf2982011-10-12 20:35:02 +0200172
173 from errno import ENOENT, EACCES, EPERM
174
175 try:
176 with open("document.txt") as f:
177 content = f.read()
178 except IOError as err:
179 if err.errno == ENOENT:
180 print("document.txt file is missing")
181 elif err.errno in (EACCES, EPERM):
182 print("You are not allowed to read document.txt")
183 else:
184 raise
185
Antoine Pitrou01fd26c2011-10-24 00:07:02 +0200186can now be written without the :mod:`errno` import and without manual
187inspection of exception attributes::
Victor Stinnera1bf2982011-10-12 20:35:02 +0200188
189 try:
190 with open("document.txt") as f:
191 content = f.read()
192 except FileNotFoundError:
193 print("document.txt file is missing")
194 except PermissionError:
195 print("You are not allowed to read document.txt")
196
197
Antoine Pitrou6bbd76b2011-11-25 19:10:05 +0100198PEP 3155: Qualified name for classes and functions
199==================================================
200
201:pep:`3155` - Qualified name for classes and functions
202 PEP written and implemented by Antoine Pitrou.
203
204Functions and class objects have a new ``__qualname__`` attribute representing
205the "path" from the module top-level to their definition. For global functions
206and classes, this is the same as ``__name__``. For other functions and classes,
207it provides better information about where they were actually defined, and
208how they might be accessible from the global scope.
209
210Example with (non-bound) methods::
211
212 >>> class C:
213 ... def meth(self):
214 ... pass
215 >>> C.meth.__name__
216 'meth'
217 >>> C.meth.__qualname__
218 'C.meth'
219
220Example with nested classes::
221
222 >>> class C:
223 ... class D:
224 ... def meth(self):
225 ... pass
226 ...
227 >>> C.D.__name__
228 'D'
229 >>> C.D.__qualname__
230 'C.D'
231 >>> C.D.meth.__name__
232 'meth'
233 >>> C.D.meth.__qualname__
234 'C.D.meth'
235
236Example with nested functions::
237
238 >>> def outer():
239 ... def inner():
240 ... pass
241 ... return inner
242 ...
243 >>> outer().__name__
244 'inner'
245 >>> outer().__qualname__
246 'outer.<locals>.inner'
247
Antoine Pitroue7ede062011-11-25 19:11:26 +0100248The string representation of those objects is also changed to include the
Antoine Pitrou6bbd76b2011-11-25 19:10:05 +0100249new, more precise information::
250
251 >>> str(C.D)
252 "<class '__main__.C.D'>"
253 >>> str(C.D.meth)
254 '<function C.D.meth at 0x7f46b9fe31e0>'
255
256
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000257Other Language Changes
258======================
259
260Some smaller changes made to the core Python language are:
261
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100262* Added support for Unicode name aliases and named sequences.
263 Both :func:`unicodedata.lookup()` and ``'\N{...}'`` now resolve name aliases,
264 and :func:`unicodedata.lookup()` resolves named sequences too.
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000265
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100266 (Contributed by Ezio Melotti in :issue:`12753`)
Ezio Melotti931b8aa2011-10-21 21:57:36 +0300267
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100268* Equality comparisons on :func:`range` objects now return a result reflecting
269 the equality of the underlying sequences generated by those range objects.
Ezio Melotti931b8aa2011-10-21 21:57:36 +0300270
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100271 (:issue:`13021`)
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000272
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100273* The ``count()``, ``find()``, ``rfind()``, ``index()`` and ``rindex()``
274 methods of :class:`bytes` and :class:`bytearray` objects now accept an
275 integer between 0 and 255 as their first argument.
Mark Dickinson36645682011-10-23 19:53:01 +0100276
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100277 (:issue:`12170`)
Mark Dickinson36645682011-10-23 19:53:01 +0100278
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100279* Memoryview objects are now hashable when the underlying object is hashable.
Mark Dickinson36645682011-10-23 19:53:01 +0100280
Antoine Pitrou7b578b32011-11-29 22:47:11 +0100281 (Contributed by Antoine Pitrou in :issue:`13411`)
Petri Lehtinen61ea8a02011-11-24 22:00:46 +0200282
283
Victor Stinner46606ce2011-11-20 18:27:55 +0100284New and Improved Modules
285========================
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000286
Meador Ingec5dbb3d2011-09-20 21:48:16 -0500287array
288-----
289
290The :mod:`array` module supports the :c:type:`long long` type using ``q`` and
291``Q`` type codes.
292
293(Contributed by Oren Tirosh and Hirokazu Yamamoto in :issue:`1172711`)
294
295
Victor Stinner2cded9c2011-07-08 01:45:13 +0200296codecs
297------
298
Victor Stinner3a50e702011-10-18 21:21:00 +0200299The :mod:`~encodings.mbcs` codec has be rewritten to handle correclty
300``replace`` and ``ignore`` error handlers on all Windows versions. The
301:mod:`~encodings.mbcs` codec is now supporting all error handlers, instead of
302only ``replace`` to encode and ``ignore`` to decode.
303
Victor Stinner7592d052011-10-27 01:43:48 +0200304A new Windows-only codec has been added: ``cp65001`` (:issue:`13216`). It is
Victor Stinner2f3ca9f2011-10-27 01:38:56 +0200305the Windows code page 65001 (Windows UTF-8, ``CP_UTF8``). For example, it is
306used by ``sys.stdout`` if the console output code page is set to cp65001 (e.g.
307using ``chcp 65001`` command).
308
Victor Stinner2cded9c2011-07-08 01:45:13 +0200309Multibyte CJK decoders now resynchronize faster. They only ignore the first
Georg Brandl6c0929b2011-07-09 11:43:33 +0200310byte of an invalid byte sequence. For example, ``b'\xff\n'.decode('gb2312',
311'replace')`` now returns a ``\n`` after the replacement character.
Victor Stinner2cded9c2011-07-08 01:45:13 +0200312
Georg Brandl6c0929b2011-07-09 11:43:33 +0200313(:issue:`12016`)
Victor Stinner2cded9c2011-07-08 01:45:13 +0200314
315Don't reset incremental encoders of CJK codecs at each call to their encode()
Georg Brandl6c0929b2011-07-09 11:43:33 +0200316method anymore. For example::
Victor Stinner2cded9c2011-07-08 01:45:13 +0200317
318 $ ./python -q
319 >>> import codecs
320 >>> encoder = codecs.getincrementalencoder('hz')('strict')
321 >>> b''.join(encoder.encode(x) for x in '\u52ff\u65bd\u65bc\u4eba\u3002 Bye.')
322 b'~{NpJ)l6HK!#~} Bye.'
323
Georg Brandl6c0929b2011-07-09 11:43:33 +0200324This example gives ``b'~{Np~}~{J)~}~{l6~}~{HK~}~{!#~} Bye.'`` with older Python
Victor Stinner2cded9c2011-07-08 01:45:13 +0200325versions.
326
Georg Brandl6c0929b2011-07-09 11:43:33 +0200327(:issue:`12100`)
Victor Stinner2cded9c2011-07-08 01:45:13 +0200328
Victor Stinner9f4b1e92011-11-10 20:56:30 +0100329The ``unicode_internal`` codec has been deprecated.
330
Éric Araujo84b8ed82011-08-29 21:42:47 +0200331crypt
332-----
333
Victor Stinnerc78fb332011-09-21 03:35:44 +0200334Addition of salt and modular crypt format and the :func:`~crypt.mksalt`
335function to the :mod:`crypt` module.
Éric Araujo84b8ed82011-08-29 21:42:47 +0200336
337(:issue:`10924`)
338
Victor Stinnera7878b72011-07-14 23:07:44 +0200339curses
340------
341
Victor Stinner0fdfceb2011-11-25 22:10:02 +0100342 * If the :mod:`curses` module is linked to the ncursesw library, use Unicode
343 functions when Unicode strings or characters are passed (e.g.
344 :c:func:`waddwstr`), and bytes functions otherwise (e.g. :c:func:`waddstr`).
345 * Use the locale encoding instead of ``utf-8`` to encode Unicode strings.
346 * :class:`curses.window` has a new :attr:`curses.window.encoding` attribute.
Victor Stinnerc78fb332011-09-21 03:35:44 +0200347 * The :class:`curses.window` class has a new :meth:`~curses.window.get_wch`
348 method to get a wide character
349 * The :mod:`curses` module has a new :meth:`~curses.unget_wch` function to
350 push a wide character so the next :meth:`~curses.window.get_wch` will return
351 it
Victor Stinnera7878b72011-07-14 23:07:44 +0200352
Victor Stinnerc78fb332011-09-21 03:35:44 +0200353(Contributed by Iñigo Serna in :issue:`6755`)
Victor Stinnera7878b72011-07-14 23:07:44 +0200354
Victor Stinner024e37a2011-03-31 01:31:06 +0200355faulthandler
356------------
357
358New module: :mod:`faulthandler`.
359
360 * :envvar:`PYTHONFAULTHANDLER`
361 * :option:`-X` ``faulthandler``
362
Victor Stinnere0be4232011-10-25 13:06:09 +0200363time
364----
365
366* The :mod:`time` module has new :func:`~time.clock_getres` and
367 :func:`~time.clock_gettime` functions and ``CLOCK_xxx`` constants.
368 :func:`~time.clock_gettime` can be used with :data:`time.CLOCK_MONOTONIC` to
369 get a monotonic clock.
370
371 (Contributed by Victor Stinner in :issue:`10278`)
372
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200373
Victor Stinner811db3b2011-09-21 03:20:03 +0200374ftplib
375------
376
377The :class:`~ftplib.FTP_TLS` class now provides a new
378:func:`~ftplib.FTP_TLS.ccc` function to revert control channel back to
Florent Xicluna6d57d212011-10-23 22:23:57 +0200379plaintext. This can be useful to take advantage of firewalls that know how to
Victor Stinner811db3b2011-09-21 03:20:03 +0200380handle NAT with non-secure FTP without opening fixed ports.
381
382(Contributed by Giampaolo Rodolà in :issue:`12139`)
383
384
Antoine Pitrou5a8bc6f2011-11-17 02:20:48 +0100385imaplib
386-------
387
388The :class:`~imaplib.IMAP4_SSL` constructor now accepts an SSLContext
389parameter to control parameters of the secure channel.
390
391(Contributed by Sijin Joseph in :issue:`8808`)
392
393
Nadeem Vawda34599222011-12-09 01:32:46 +0200394lzma
395----
396
397The newly-added :mod:`lzma` module provides data compression and decompression
398using the LZMA algorithm, including support for the ``.xz`` and ``.lzma``
399file formats.
400
401(Contributed by Nadeem Vawda and Per Øyvind Karlsen in :issue:`6715`)
402
403
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200404math
405----
406
407The :mod:`math` module has a new function:
408
409 * :func:`~math.log2`: return the base-2 logarithm of *x*
410 (Written by Mark Dickinson in :issue:`11888`).
411
412
413nntplib
414-------
415
416The :class:`nntplib.NNTP` class now supports the context manager protocol to
417unconditionally consume :exc:`socket.error` exceptions and to close the NNTP
418connection when done::
419
420 >>> from nntplib import NNTP
Ezio Melotti3c14b4e2011-07-13 11:44:44 +0300421 >>> with NNTP('news.gmane.org') as n:
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200422 ... n.group('gmane.comp.python.committers')
423 ...
Ezio Melotti04f648c2011-07-26 09:37:46 +0300424 ('211 1755 1 1755 gmane.comp.python.committers', 1755, 1, 1755, 'gmane.comp.python.committers')
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200425 >>>
426
427(Contributed by Giampaolo Rodolà in :issue:`9795`)
428
429
Giampaolo Rodolàc9c2c8b2011-02-25 14:39:16 +0000430os
431--
432
Charles-François Natalia003af12011-06-01 20:30:52 +0200433* The :mod:`os` module has a new :func:`~os.pipe2` function that makes it
434 possible to create a pipe with :data:`~os.O_CLOEXEC` or
435 :data:`~os.O_NONBLOCK` flags set atomically. This is especially useful to
436 avoid race conditions in multi-threaded programs.
437
Giampaolo Rodolà18e8bcb2011-02-25 20:57:54 +0000438* The :mod:`os` module has a new :func:`~os.sendfile` function which provides
439 an efficent "zero-copy" way for copying data from one file (or socket)
440 descriptor to another. The phrase "zero-copy" refers to the fact that all of
441 the copying of data between the two descriptors is done entirely by the
442 kernel, with no copying of data into userspace buffers. :func:`~os.sendfile`
443 can be used to efficiently copy data from a file on disk to a network socket,
444 e.g. for downloading a file.
Giampaolo Rodolàc9c2c8b2011-02-25 14:39:16 +0000445
Giampaolo Rodolà18e8bcb2011-02-25 20:57:54 +0000446 (Patch submitted by Ross Lagerwall and Giampaolo Rodolà in :issue:`10882`.)
447
448* The :mod:`os` module has two new functions: :func:`~os.getpriority` and
449 :func:`~os.setpriority`. They can be used to get or set process
450 niceness/priority in a fashion similar to :func:`os.nice` but extended to all
451 processes instead of just the current one.
452
453 (Patch submitted by Giampaolo Rodolà in :issue:`10784`.)
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000454
Victor Stinnere5064372011-10-14 00:08:29 +0200455* "at" functions (:issue:`4761`):
456
457 * :func:`~os.faccessat`
458 * :func:`~os.fchmodat`
459 * :func:`~os.fchownat`
460 * :func:`~os.fstatat`
461 * :func:`~os.futimesat`
462 * :func:`~os.futimesat`
463 * :func:`~os.linkat`
464 * :func:`~os.mkdirat`
465 * :func:`~os.mkfifoat`
466 * :func:`~os.mknodat`
467 * :func:`~os.openat`
468 * :func:`~os.readlinkat`
469 * :func:`~os.renameat`
470 * :func:`~os.symlinkat`
471 * :func:`~os.unlinkat`
472 * :func:`~os.utimensat`
473 * :func:`~os.utimensat`
474
475* extended attributes (:issue:`12720`):
476
477 * :func:`~os.fgetxattr`
478 * :func:`~os.flistxattr`
479 * :func:`~os.fremovexattr`
480 * :func:`~os.fsetxattr`
481 * :func:`~os.getxattr`
482 * :func:`~os.lgetxattr`
483 * :func:`~os.listxattr`
484 * :func:`~os.llistxattr`
485 * :func:`~os.lremovexattr`
486 * :func:`~os.lsetxattr`
487 * :func:`~os.removexattr`
488 * :func:`~os.setxattr`
489
490* Scheduler functions (:issue:`12655`):
491
492 * :func:`~os.sched_get_priority_max`
493 * :func:`~os.sched_get_priority_min`
494 * :func:`~os.sched_getaffinity`
495 * :func:`~os.sched_getparam`
496 * :func:`~os.sched_getscheduler`
497 * :func:`~os.sched_rr_get_interval`
498 * :func:`~os.sched_setaffinity`
499 * :func:`~os.sched_setparam`
500 * :func:`~os.sched_setscheduler`
501 * :func:`~os.sched_yield`
502
503* Add some extra posix functions to the os module (:issue:`10812`):
504
505 * :func:`~os.fexecve`
506 * :func:`~os.futimens`
507 * :func:`~os.futimens`
508 * :func:`~os.futimes`
509 * :func:`~os.futimes`
510 * :func:`~os.lockf`
511 * :func:`~os.lutimes`
512 * :func:`~os.lutimes`
513 * :func:`~os.posix_fadvise`
514 * :func:`~os.posix_fallocate`
515 * :func:`~os.pread`
516 * :func:`~os.pwrite`
517 * :func:`~os.readv`
518 * :func:`~os.sync`
519 * :func:`~os.truncate`
520 * :func:`~os.waitid`
521 * :func:`~os.writev`
522
523* Other new functions:
524
525 * :func:`~os.fdlistdir` (:issue:`10755`)
526 * :func:`~os.getgrouplist` (:issue:`9344`)
527
Giampaolo Rodolà424298a2011-03-03 18:34:06 +0000528
Éric Araujo765e94f2011-06-03 17:26:59 +0200529packaging
530---------
531
532:mod:`distutils` has undergone additions and refactoring under a new name,
533:mod:`packaging`, to allow developers to break backward compatibility.
534:mod:`distutils` is still provided in the standard library, but users are
535encouraged to transition to :mod:`packaging`. For older versions of Python, a
536backport compatible with 2.4+ and 3.1+ will be made available on PyPI under the
537name :mod:`distutils2`.
538
539.. TODO add examples and howto to the packaging docs and link to them
540
541
Victor Stinner383c3fc2011-05-25 01:35:05 +0200542pydoc
543-----
544
Victor Stinner6daa33c2011-05-25 01:41:22 +0200545The Tk GUI and the :func:`~pydoc.serve` function have been removed from the
546:mod:`pydoc` module: ``pydoc -g`` and :func:`~pydoc.serve` have been deprecated
547in Python 3.2.
Victor Stinner383c3fc2011-05-25 01:35:05 +0200548
549
Victor Stinnerd5c355c2011-04-30 14:53:09 +0200550sys
551---
Victor Stinner754851f2011-04-19 23:58:51 +0200552
Éric Araujo84b8ed82011-08-29 21:42:47 +0200553* The :mod:`sys` module has a new :data:`~sys.thread_info` :term:`struct
Victor Stinnerd5c355c2011-04-30 14:53:09 +0200554 sequence` holding informations about the thread implementation.
Victor Stinner754851f2011-04-19 23:58:51 +0200555
Georg Brandl00db5822011-04-30 15:30:03 +0200556 (:issue:`11223`)
Victor Stinnera9293352011-04-30 15:21:58 +0200557
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200558
Victor Stinnera9293352011-04-30 15:21:58 +0200559signal
560------
561
Victor Stinnerfa0e3d52011-05-09 01:01:09 +0200562* The :mod:`signal` module has new functions:
Victor Stinnera9293352011-04-30 15:21:58 +0200563
Victor Stinnerb3e72192011-05-08 01:46:11 +0200564 * :func:`~signal.pthread_sigmask`: fetch and/or change the signal mask of the
565 calling thread (Contributed by Jean-Paul Calderone in :issue:`8407`) ;
566 * :func:`~signal.pthread_kill`: send a signal to a thread ;
567 * :func:`~signal.sigpending`: examine pending functions ;
568 * :func:`~signal.sigwait`: wait a signal.
Ross Lagerwallbc808222011-06-25 12:13:40 +0200569 * :func:`~signal.sigwaitinfo`: wait for a signal, returning detailed
570 information about it.
571 * :func:`~signal.sigtimedwait`: like :func:`~signal.sigwaitinfo` but with a
572 timeout.
Victor Stinnera9293352011-04-30 15:21:58 +0200573
Victor Stinnerd49b1f12011-05-08 02:03:15 +0200574* The signal handler writes the signal number as a single byte instead of
575 a nul byte into the wakeup file descriptor. So it is possible to wait more
576 than one signal and know which signals were raised.
577
Victor Stinner388196e2011-05-10 17:13:00 +0200578* :func:`signal.signal` and :func:`signal.siginterrupt` raise an OSError,
579 instead of a RuntimeError: OSError has an errno attribute.
580
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000581socket
582------
583
Charles-François Natali47413c12011-10-06 19:47:44 +0200584* The :class:`~socket.socket` class now exposes additional methods to process
585 ancillary data when supported by the underlying platform:
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000586
Charles-François Natali47413c12011-10-06 19:47:44 +0200587 * :func:`~socket.socket.sendmsg`
588 * :func:`~socket.socket.recvmsg`
589 * :func:`~socket.socket.recvmsg_into`
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000590
Charles-François Natali47413c12011-10-06 19:47:44 +0200591 (Contributed by David Watson in :issue:`6560`, based on an earlier patch by
592 Heiko Wundram)
593
594* The :class:`~socket.socket` class now supports the PF_CAN protocol family
595 (http://en.wikipedia.org/wiki/Socketcan), on Linux
596 (http://lwn.net/Articles/253425).
597
598 (Contributed by Matthias Fuchs, updated by Tiago Gonçalves in :issue:`10141`)
599
Charles-François Natali10b8cf42011-11-10 19:21:37 +0100600* The :class:`~socket.socket` class now supports the PF_RDS protocol family
601 (http://en.wikipedia.org/wiki/Reliable_Datagram_Sockets and
602 http://oss.oracle.com/projects/rds/).
Victor Stinner754851f2011-04-19 23:58:51 +0200603
Victor Stinner99c8b162011-05-24 12:05:19 +0200604ssl
605---
606
Antoine Pitrou2c0a9672011-11-17 02:09:13 +0100607* The :mod:`ssl` module has two new random generation functions:
Victor Stinner99c8b162011-05-24 12:05:19 +0200608
609 * :func:`~ssl.RAND_bytes`: generate cryptographically strong
610 pseudo-random bytes.
611 * :func:`~ssl.RAND_pseudo_bytes`: generate pseudo-random bytes.
612
Antoine Pitrou2c0a9672011-11-17 02:09:13 +0100613 (Contributed by Victor Stinner in :issue:`12049`)
614
615* The :mod:`ssl` module now exposes a finer-grained exception hierarchy
616 in order to make it easier to inspect the various kinds of errors.
617
618 (Contributed by Antoine Pitrou in :issue:`11183`)
619
620* :meth:`~ssl.SSLContext.load_cert_chain` now accepts a *password* argument
621 to be used if the private key is encrypted.
622
623 (Contributed by Adam Simpkins in :issue:`12803`)
624
625* SSL sockets have a new :meth:`~ssl.SSLSocket.get_channel_binding` method
626 allowing the implementation of certain authentication mechanisms such as
627 SCRAM-SHA-1-PLUS.
628
629 (Contributed by Jacek Konieczny in :issue:`12551`)
630
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200631shutil
632------
633
Sandro Tosiaec2f212011-08-23 00:58:21 +0200634* The :mod:`shutil` module has these new fuctions:
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200635
Sandro Tosiaec2f212011-08-23 00:58:21 +0200636 * :func:`~shutil.disk_usage`: provides total, used and free disk space
637 statistics. (Contributed by Giampaolo Rodolà in :issue:`12442`)
638 * :func:`~shutil.chown`: allows one to change user and/or group of the given
639 path also specifying the user/group names and not only their numeric
640 ids. (Contributed by Sandro Tosi in :issue:`12191`)
Giampaolo Rodola'096dcb12011-06-27 11:17:51 +0200641
Antoine Pitrou5a8bc6f2011-11-17 02:20:48 +0100642smtplib
643-------
644
645The :class:`~smtplib.SMTP_SSL` constructor and the :meth:`~smtplib.SMTP.starttls`
646method now accept an SSLContext parameter to control parameters of the secure
647channel.
648
649(Contributed by Kasun Herath in :issue:`8809`)
650
Senthil Kumarande49d642011-10-16 23:54:44 +0800651urllib
652------
653
654The :class:`~urllib.request.Request` class, now accepts a *method* argument
655used by :meth:`~urllib.request.Request.get_method` to determine what HTTP method
Senthil Kumarana41c9422011-10-20 02:37:08 +0800656should be used. For example, this will send a ``'HEAD'`` request::
Senthil Kumarande49d642011-10-16 23:54:44 +0800657
658 >>> urlopen(Request('http://www.python.org', method='HEAD'))
659
660(:issue:`1673007`)
Giampaolo Rodola'096dcb12011-06-27 11:17:51 +0200661
Giampaolo Rodola'be55d992011-11-22 13:33:34 +0100662sched
663-----
664
665* *timefunc* and *delayfunct* parameters of :class:`~sched.scheduler` class
666 constructor are now optional and defaults to :func:`time.time` and
Giampaolo Rodola'bc7ea582011-11-22 13:37:58 +0100667 :func:`time.sleep` respectively. (Contributed by Chris Clark in
668 :issue:`13245`)
Giampaolo Rodola'be55d992011-11-22 13:33:34 +0100669
670* :meth:`~sched.scheduler.enter` and :meth:`~sched.scheduler.enterabs`
Giampaolo Rodola'bc7ea582011-11-22 13:37:58 +0100671 *argument* parameter is now optional. (Contributed by Chris Clark in
672 :issue:`13245`)
Giampaolo Rodola'be55d992011-11-22 13:33:34 +0100673
674* :meth:`~sched.scheduler.enter` and :meth:`~sched.scheduler.enterabs`
Giampaolo Rodola'bc7ea582011-11-22 13:37:58 +0100675 now accept a *kwargs* parameter. (Contributed by Chris Clark in
676 :issue:`13245`)
Giampaolo Rodola'be55d992011-11-22 13:33:34 +0100677
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000678Optimizations
679=============
680
681Major performance enhancements have been added:
682
Victor Stinner46606ce2011-11-20 18:27:55 +0100683* Thanks to the :pep:`393`, some operations on Unicode strings has been optimized:
684
685 * the memory footprint is divided by 2 to 4 depending on the text
Victor Stinnera996f1e2011-11-21 13:14:43 +0100686 * encode an ASCII string to UTF-8 doesn't need to encode characters anymore,
687 the UTF-8 representation is shared with the ASCII representation
Victor Stinner46606ce2011-11-20 18:27:55 +0100688 * getting a substring of a latin1 strings is 4 times faster
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000689
690
691Build and C API Changes
692=======================
693
694Changes to Python's build process and to the C API include:
695
Victor Stinner46606ce2011-11-20 18:27:55 +0100696* The :pep:`393` added new Unicode types, macros and functions:
697
Victor Stinnera996f1e2011-11-21 13:14:43 +0100698 * High-level API:
699
700 * :c:func:`PyUnicode_CopyCharacters`
701 * :c:func:`PyUnicode_FindChar`
702 * :c:func:`PyUnicode_GetLength`, :c:macro:`PyUnicode_GET_LENGTH`
703 * :c:func:`PyUnicode_New`
704 * :c:func:`PyUnicode_Substring`
705 * :c:func:`PyUnicode_ReadChar`, :c:func:`PyUnicode_WriteChar`
706
707 * Low-level API:
708
709 * :c:type:`Py_UCS1`, :c:type:`Py_UCS2`, :c:type:`Py_UCS4` types
710 * :c:type:`PyASCIIObject` and :c:type:`PyCompactUnicodeObject` structures
711 * :c:macro:`PyUnicode_READY`
712 * :c:func:`PyUnicode_FromKindAndData`
713 * :c:func:`PyUnicode_AsUCS4`, :c:func:`PyUnicode_AsUCS4Copy`
714 * :c:macro:`PyUnicode_DATA`, :c:macro:`PyUnicode_1BYTE_DATA`,
715 :c:macro:`PyUnicode_2BYTE_DATA`, :c:macro:`PyUnicode_4BYTE_DATA`
716 * :c:macro:`PyUnicode_KIND` with :c:type:`PyUnicode_Kind` enum:
717 :c:data:`PyUnicode_WCHAR_KIND`, :c:data:`PyUnicode_1BYTE_KIND`,
718 :c:data:`PyUnicode_2BYTE_KIND`, :c:data:`PyUnicode_4BYTE_KIND`
719 * :c:macro:`PyUnicode_READ`, :c:macro:`PyUnicode_READ_CHAR`, :c:macro:`PyUnicode_WRITE`
720 * :c:macro:`PyUnicode_MAX_CHAR_VALUE`
721
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000722
723
Victor Stinnerd1be8782011-12-09 00:10:41 +0100724Deprecated
725==========
726
Georg Brandl0cd25c92011-04-29 13:45:54 +0200727Unsupported Operating Systems
Victor Stinnerd1be8782011-12-09 00:10:41 +0100728-----------------------------
Victor Stinnerb90db4c2011-04-26 22:48:24 +0200729
Brian Curtin49a40cd2011-05-02 22:30:06 -0500730OS/2 and VMS are no longer supported due to the lack of a maintainer.
731
732Windows 2000 and Windows platforms which set ``COMSPEC`` to ``command.com``
733are no longer supported due to maintenance burden.
Victor Stinnerb90db4c2011-04-26 22:48:24 +0200734
735
Victor Stinner46606ce2011-11-20 18:27:55 +0100736Deprecated Python modules, functions and methods
Victor Stinnerd1be8782011-12-09 00:10:41 +0100737------------------------------------------------
Victor Stinner19bd0692011-11-16 00:18:57 +0100738
739* The :mod:`packaging` module replaces the :mod:`distutils` module
740* The ``unicode_internal`` codec has been deprecated because of the
741 :pep:`393`, use UTF-8, UTF-16 (``utf-16-le`` or ``utf-16-le``), or UTF-32
Victor Stinner46606ce2011-11-20 18:27:55 +0100742 (``utf-32-le`` or ``utf-32-le``)
Victor Stinner19bd0692011-11-16 00:18:57 +0100743* :meth:`ftplib.FTP.nlst` and :meth:`ftplib.FTP.dir`: use
Victor Stinner46606ce2011-11-20 18:27:55 +0100744 :meth:`ftplib.FTP.mlsd`
Victor Stinner19bd0692011-11-16 00:18:57 +0100745* :func:`platform.popen`: use the :mod:`subprocess` module. Check especially
746 the :ref:`subprocess-replacements` section.
747* :issue:`13374`: The Windows bytes API has been deprecated in the :mod:`os`
Victor Stinner46606ce2011-11-20 18:27:55 +0100748 module. Use Unicode filenames, instead of bytes filenames, to not depend on
Victor Stinner19bd0692011-11-16 00:18:57 +0100749 the ANSI code page anymore and to support any filename.
750
751
Victor Stinner46606ce2011-11-20 18:27:55 +0100752Deprecated functions and types of the C API
Victor Stinnerd1be8782011-12-09 00:10:41 +0100753-------------------------------------------
Victor Stinner46606ce2011-11-20 18:27:55 +0100754
755The :c:type:`Py_UNICODE` has been deprecated by the :pep:`393` and will be
756removed in Python 4. All functions using this type are deprecated:
757
Victor Stinner46606ce2011-11-20 18:27:55 +0100758Unicode functions and methods using :c:type:`Py_UNICODE` and
759:c:type:`Py_UNICODE*` types:
760
761 * :c:macro:`PyUnicode_FromUnicode`: use :c:func:`PyUnicode_FromWideChar` or
762 :c:func:`PyUnicode_FromKindAndData`
763 * :c:macro:`PyUnicode_AS_UNICODE`, :c:func:`PyUnicode_AsUnicode`,
764 :c:func:`PyUnicode_AsUnicodeAndSize`: use :c:func:`PyUnicode_AsWideCharString`
765 * :c:macro:`PyUnicode_AS_DATA`: use :c:macro:`PyUnicode_DATA` with
766 :c:macro:`PyUnicode_READ` and :c:macro:`PyUnicode_WRITE`
767 * :c:macro:`PyUnicode_GET_SIZE`, :c:func:`PyUnicode_GetSize`: use
768 :c:macro:`PyUnicode_GET_LENGTH` or :c:func:`PyUnicode_GetLength`
769 * :c:macro:`PyUnicode_GET_DATA_SIZE`: use
770 ``PyUnicode_GET_LENGTH(str) * PyUnicode_KIND(str)`` (only work on ready
771 strings)
772 * :c:func:`PyUnicode_AsUnicodeCopy`: use :c:func:`PyUnicode_AsUCS4Copy`,
773 :c:func:`PyUnicode_AsWideCharString` or :c:func:`PyUnicode_Copy`
774
Victor Stinnera996f1e2011-11-21 13:14:43 +0100775Functions and macros manipulating Py_UNICODE* strings:
776
777 * :c:macro:`Py_UNICODE_strlen`: use :c:func:`PyUnicode_GetLength` or
778 :c:macro:`PyUnicode_GET_LENGTH`
779 * :c:macro:`Py_UNICODE_strcat`: use :c:func:`PyUnicode_CopyCharacters` or
780 :c:func:`PyUnicode_FromFormat`
781 * :c:macro:`Py_UNICODE_strcpy`, :c:macro:`Py_UNICODE_strncpy`,
782 :c:macro:`Py_UNICODE_COPY`: use :c:func:`PyUnicode_CopyCharacters` or
783 :c:func:`PyUnicode_Substring`
784 * :c:macro:`Py_UNICODE_strcmp`: use :c:func:`PyUnicode_Compare`
785 * :c:macro:`Py_UNICODE_strncmp`: use :c:func:`PyUnicode_Tailmatch`
786 * :c:macro:`Py_UNICODE_strchr`, :c:macro:`Py_UNICODE_strrchr`: use
787 :c:func:`PyUnicode_FindChar`
788 * :c:macro:`Py_UNICODE_FILL`
789
Victor Stinner46606ce2011-11-20 18:27:55 +0100790Encoders:
791
792 * :c:func:`PyUnicode_Encode`: use :c:func:`PyUnicode_AsEncodedObject`
793 * :c:func:`PyUnicode_EncodeUTF7`
Victor Stinnera996f1e2011-11-21 13:14:43 +0100794 * :c:func:`PyUnicode_EncodeUTF8`: use :c:func:`PyUnicode_AsUTF8` or
795 :c:func:`PyUnicode_AsUTF8String`
Victor Stinner46606ce2011-11-20 18:27:55 +0100796 * :c:func:`PyUnicode_EncodeUTF32`
797 * :c:func:`PyUnicode_EncodeUTF16`
798 * :c:func:`PyUnicode_EncodeUnicodeEscape:` use
799 :c:func:`PyUnicode_AsUnicodeEscapeString`
800 * :c:func:`PyUnicode_EncodeRawUnicodeEscape:` use
801 :c:func:`PyUnicode_AsRawUnicodeEscapeString`
802 * :c:func:`PyUnicode_EncodeLatin1`: use :c:func:`PyUnicode_AsLatin1String`
803 * :c:func:`PyUnicode_EncodeASCII`: use :c:func:`PyUnicode_AsASCIIString`
804 * :c:func:`PyUnicode_EncodeCharmap`
805 * :c:func:`PyUnicode_TranslateCharmap`
806 * :c:func:`PyUnicode_EncodeMBCS`: use :c:func:`PyUnicode_AsMBCSString` or
807 :c:func:`PyUnicode_EncodeCodePage` (with ``CP_ACP`` code_page)
808 * :c:func:`PyUnicode_EncodeDecimal`,
809 :c:func:`PyUnicode_TransformDecimalToASCII`
810
811
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000812Porting to Python 3.3
813=====================
814
815This section lists previously described changes and other bugfixes
Antoine Pitrou037ffbf2011-10-24 00:25:41 +0200816that may require changes to your code.
817
818Porting Python code
819-------------------
Giampaolo Rodolà3108f982011-02-24 20:59:48 +0000820
Victor Stinner19bd0692011-11-16 00:18:57 +0100821* :issue:`12326`: On Linux, sys.platform doesn't contain the major version
Victor Stinnerff3d9392011-08-20 23:39:26 +0200822 anymore. It is now always 'linux', instead of 'linux2' or 'linux3' depending
823 on the Linux version used to build Python. Replace sys.platform == 'linux2'
824 with sys.platform.startswith('linux'), or directly sys.platform == 'linux' if
825 you don't need to support older Python versions.
Éric Araujoc09fca62011-03-23 02:06:24 +0100826
Antoine Pitrou037ffbf2011-10-24 00:25:41 +0200827Porting C code
828--------------
829
830* Due to :ref:`PEP 393 <pep-393>`, the :c:type:`Py_UNICODE` type and all
831 functions using this type are deprecated (but will stay available for
832 at least five years). If you were using low-level Unicode APIs to
833 construct and access unicode objects and you want to benefit of the
834 memory footprint reduction provided by the PEP 393, you have to convert
835 your code to the new :doc:`Unicode API <../c-api/unicode>`.
836
837 However, if you only have been using high-level functions such as
838 :c:func:`PyUnicode_Concat()`, :c:func:`PyUnicode_Join` or
839 :c:func:`PyUnicode_FromFormat()`, your code will automatically take
840 advantage of the new unicode representations.
841
842Other issues
843------------
844
Éric Araujoc09fca62011-03-23 02:06:24 +0100845.. Issue #11591: When :program:`python` was started with :option:`-S`,
846 ``import site`` will not add site-specific paths to the module search
847 paths. In previous versions, it did. See changeset for doc changes in
848 various files. Contributed by Carl Meyer with editions by Éric Araujo.
Éric Araujobe3bd572011-03-26 01:55:15 +0100849
Éric Araujobfc97292011-11-14 18:18:15 +0100850.. Issue #10998: the -Q command-line flag and related artifacts have been
Éric Araujobe3bd572011-03-26 01:55:15 +0100851 removed. Code checking sys.flags.division_warning will need updating.
852 Contributed by Éric Araujo.