****************************
  What's New In Python 3.3
****************************

:Author: Raymond Hettinger
:Release: |release|
:Date: |today|

.. Rules for maintenance:

   * Anyone can add text to this document. Do not spend very much time
   on the wording of your changes, because your text will probably
   get rewritten to some degree.

   * The maintainer will go through Misc/NEWS periodically and add
   changes; it's therefore more important to add your changes to
   Misc/NEWS than to this file.

   * This is not a complete list of every single change; completeness
   is the purpose of Misc/NEWS. Some changes I consider too small
   or esoteric to include. If such a change is added to the text,
   I'll just remove it. (This is another reason you shouldn't spend
   too much time on writing your addition.)

   * If you want to draw your new text to the attention of the
   maintainer, add 'XXX' to the beginning of the paragraph or
   section.

   * It's OK to just add a fragmentary note about a change. For
   example: "XXX Describe the transmogrify() function added to the
   socket module." The maintainer will research the change and
   write the necessary text.

   * You can comment out your additions if you like, but it's not
   necessary (especially when a final release is some months away).

   * Credit the author of a patch or bugfix. Just the name is
   sufficient; the e-mail address isn't necessary.

   * It's helpful to add the bug/patch number as a comment:

   XXX Describe the transmogrify() function added to the socket
   module.
   (Contributed by P.Y. Developer in :issue:`12345`.)

   This saves the maintainer the effort of going through the Mercurial log
   when researching a change.

This article explains the new features in Python 3.3, compared to 3.2.


.. _pep-393:

PEP 393: Flexible String Representation
=======================================

The Unicode string type is changed to support multiple internal
representations, depending on the character with the largest Unicode ordinal
(1, 2, or 4 bytes) in the represented string. This allows a space-efficient
representation in common cases, but gives access to full UCS-4 on all
systems. For compatibility with existing APIs, several representations may
exist in parallel; over time, this compatibility should be phased out.

On the Python side, there should be no downside to this change.

On the C API side, PEP 393 is fully backward compatible. The legacy API
should remain available for at least five years. Applications using the legacy
API will not fully benefit from the memory reduction, or - worse - may use
a bit more memory, because Python may have to maintain two versions of each
string (in the legacy format and in the new efficient storage).

Changes introduced by :pep:`393` are the following:

* Python now always supports the full range of Unicode codepoints, including
  non-BMP ones (i.e. from ``U+0000`` to ``U+10FFFF``). The distinction between
  narrow and wide builds no longer exists and Python now behaves like a wide
  build, even under Windows.

* The storage of Unicode strings now depends on the highest codepoint in the string:

  * pure ASCII and Latin1 strings (``U+0000-U+00FF``) use 1 byte per codepoint;

  * BMP strings (``U+0000-U+FFFF``) use 2 bytes per codepoint;

  * non-BMP strings (``U+10000-U+10FFFF``) use 4 bytes per codepoint.

  The net effect is that for most applications, memory usage of string storage
  should decrease significantly - especially compared to former wide unicode
  builds - as, in many cases, strings will be pure ASCII even in international
  contexts (because many strings store non-human language data, such as XML
  fragments, HTTP headers, JSON-encoded data, etc.). We also hope that it
  will, for the same reasons, increase CPU cache efficiency on non-trivial
  applications.

  .. The memory usage of Python 3.3 is two to three times smaller than Python 3.2,
     and a little bit better than Python 2.7, on a `Django benchmark
     <http://mail.python.org/pipermail/python-dev/2011-September/113714.html>`_.
     XXX The result should be moved in the PEP and a link to the PEP should
     be added here.

* With the death of narrow builds, the problems specific to narrow builds have
  also been fixed, for example:

  * :func:`len` now always returns 1 for non-BMP characters,
    so ``len('\U0010FFFF') == 1``;

  * surrogate pairs are not recombined in string literals,
    so ``'\uDBFF\uDFFF' != '\U0010FFFF'``;

  * indexing or slicing non-BMP characters returns the expected value,
    so ``'\U0010FFFF'[0]`` now returns ``'\U0010FFFF'`` and not ``'\uDBFF'``;

  * all other functions in the standard library now correctly handle
    non-BMP codepoints.

* The value of :data:`sys.maxunicode` is now always ``1114111`` (``0x10FFFF``
  in hexadecimal). The :c:func:`PyUnicode_GetMax` function still returns
  either ``0xFFFF`` or ``0x10FFFF`` for backward compatibility, and it should
  not be used with the new Unicode API (see :issue:`13054`).

* The :file:`./configure` flag ``--with-wide-unicode`` has been removed.
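
The storage rule described in the list above can be observed from Python
itself. The following is only a sketch and assumes CPython 3.3 or later:
``bytes_per_codepoint`` is a hypothetical helper written for this
illustration, and the sizes reported by :func:`sys.getsizeof` are an
implementation detail that other interpreters need not share.

```python
import sys


def bytes_per_codepoint(ch):
    """Estimate the storage used per code point for strings made of *ch*.

    Hypothetical helper: it compares two strings of different lengths so
    that the fixed object header cancels out of the difference.
    """
    short, longer = ch * 1000, ch * 2000
    return (sys.getsizeof(longer) - sys.getsizeof(short)) // 1000


widths = {
    "ASCII": bytes_per_codepoint("a"),
    "Latin-1": bytes_per_codepoint("\xe9"),
    "BMP": bytes_per_codepoint("\u20ac"),
    "non-BMP": bytes_per_codepoint("\U0001F600"),
}
print(widths)

# Narrow-build problems are gone: one code point is one character.
print(len("\U0010FFFF"), hex(sys.maxunicode))
```

The helper measures the growth between two lengths rather than a single
size, because the fixed header of a string object differs between the
internal representations.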


PEP 3151: Reworking the OS and IO exception hierarchy
=====================================================

:pep:`3151` - Reworking the OS and IO exception hierarchy
   PEP written and implemented by Antoine Pitrou.

The hierarchy of exceptions raised by operating system errors is now both
simplified and finer-grained.

You don't have to worry anymore about choosing the appropriate exception
type between :exc:`OSError`, :exc:`IOError`, :exc:`EnvironmentError`,
:exc:`WindowsError`, :exc:`mmap.error`, :exc:`socket.error` or
:exc:`select.error`. All these exception types are now a single type:
:exc:`OSError`. The other names are kept as aliases for compatibility
reasons.

Also, it is now easier to catch a specific error condition. Instead of
inspecting the ``errno`` attribute (or ``args[0]``) for a particular
constant from the :mod:`errno` module, you can catch the appropriate
:exc:`OSError` subclass. The available subclasses are the following:

* :exc:`BlockingIOError`
* :exc:`ChildProcessError`
* :exc:`ConnectionError`
* :exc:`FileExistsError`
* :exc:`FileNotFoundError`
* :exc:`InterruptedError`
* :exc:`IsADirectoryError`
* :exc:`NotADirectoryError`
* :exc:`PermissionError`
* :exc:`ProcessLookupError`
* :exc:`TimeoutError`

And the :exc:`ConnectionError` itself has finer-grained subclasses:

* :exc:`BrokenPipeError`
* :exc:`ConnectionAbortedError`
* :exc:`ConnectionRefusedError`
* :exc:`ConnectionResetError`

Thanks to the new exceptions, common uses of the :mod:`errno` module can now
be avoided. For example, the following code written for Python 3.2::

    from errno import ENOENT, EACCES, EPERM

    try:
        with open("document.txt") as f:
            content = f.read()
    except IOError as err:
        if err.errno == ENOENT:
            print("document.txt file is missing")
        elif err.errno in (EACCES, EPERM):
            print("You are not allowed to read document.txt")
        else:
            raise

can now be written without the :mod:`errno` import and without manual
inspection of exception attributes::

    try:
        with open("document.txt") as f:
            content = f.read()
    except FileNotFoundError:
        print("document.txt file is missing")
    except PermissionError:
        print("You are not allowed to read document.txt")
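
The aliasing and the mapping from ``errno`` values to subclasses can be
checked directly. A small sketch, assuming Python 3.3 or later; the message
and file name passed to :exc:`OSError` are only illustrative values:

```python
import errno
import select
import socket

# The legacy names are now plain aliases of OSError.
builtin_aliases_ok = IOError is OSError and EnvironmentError is OSError
module_aliases_ok = socket.error is OSError and select.error is OSError

# Constructing OSError with a known errno yields the matching subclass.
err = OSError(errno.ENOENT, "No such file or directory", "document.txt")
print(type(err).__name__)

# Code that still inspects errno keeps working with the new subclasses.
try:
    raise err
except FileNotFoundError as exc:
    caught_errno = exc.errno
```

Because the subclass is chosen from the ``errno`` value, existing code that
raises :exc:`OSError` (or one of its aliases) with an error number can be
caught by the new, more specific ``except`` clauses without modification.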


PEP 3155: Qualified name for classes and functions
==================================================

:pep:`3155` - Qualified name for classes and functions
   PEP written and implemented by Antoine Pitrou.

Functions and class objects have a new ``__qualname__`` attribute representing
the "path" from the module top-level to their definition. For global functions
and classes, this is the same as ``__name__``. For other functions and classes,
it provides better information about where they were actually defined, and
how they might be accessible from the global scope.

Example with (non-bound) methods::

   >>> class C:
   ...     def meth(self):
   ...         pass
   ...
   >>> C.meth.__name__
   'meth'
   >>> C.meth.__qualname__
   'C.meth'

Example with nested classes::

   >>> class C:
   ...     class D:
   ...         def meth(self):
   ...             pass
   ...
   >>> C.D.__name__
   'D'
   >>> C.D.__qualname__
   'C.D'
   >>> C.D.meth.__name__
   'meth'
   >>> C.D.meth.__qualname__
   'C.D.meth'

Example with nested functions::

   >>> def outer():
   ...     def inner():
   ...         pass
   ...     return inner
   ...
   >>> outer().__name__
   'inner'
   >>> outer().__qualname__
   'outer.<locals>.inner'

The string representation of those objects is also changed to include the
new, more precise information::

   >>> str(C.D)
   "<class '__main__.C.D'>"
   >>> str(C.D.meth)
   '<function C.D.meth at 0x7f46b9fe31e0>'


Other Language Changes
======================

Some smaller changes made to the core Python language are:

* Stub

Added support for Unicode name aliases and named sequences.
Both :func:`unicodedata.lookup()` and ``'\N{...}'`` now resolve name aliases,
and :func:`unicodedata.lookup()` resolves named sequences too.

(Contributed by Ezio Melotti in :issue:`12753`)
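
As an illustration, the sketch below looks up one alias and one named
sequence. The two names are taken from the Unicode data files and are an
assumption about the Unicode database shipped with the interpreter; any
other alias or named sequence would serve equally well.

```python
import unicodedata

# "LATIN CAPITAL LETTER GHA" is a name alias (a formal correction) for U+01A2.
by_alias = unicodedata.lookup("LATIN CAPITAL LETTER GHA")
by_escape = "\N{LATIN CAPITAL LETTER GHA}"

# A named sequence maps one name to several code points; only lookup()
# resolves these, so no \N{...} escape is used here.
sequence = unicodedata.lookup("LATIN SMALL LETTER R WITH TILDE")
print(ascii(by_alias), ascii(by_escape), ascii(sequence))
```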


Equality comparisons on :func:`range` objects now return a result reflecting
the equality of the underlying sequences generated by those range objects.

(:issue:`13021`)
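
A short sketch of the new comparison semantics (the variable names are only
illustrative):

```python
# Two ranges are equal when they produce the same sequence of values,
# even if they were built from different arguments.
empty_ranges_equal = range(0) == range(2, 1, 3)        # both are empty
same_values_equal = range(0, 3, 2) == range(0, 4, 2)   # both yield 0, 2
different_values = range(0, 3) == range(0, 4)
print(empty_ranges_equal, same_values_equal, different_values)
```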


The ``count()``, ``find()``, ``rfind()``, ``index()`` and ``rindex()``
methods of :class:`bytes` and :class:`bytearray` objects now accept an
integer between 0 and 255 as their first argument.

(:issue:`12170`)
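
For instance, a byte value can be searched for without first wrapping it in
a one-byte :class:`bytes` object. A minimal sketch:

```python
data = b"hello"

# 108 is the byte value of ASCII "l"; an integer and a one-byte bytes
# object are interchangeable as the search argument.
first = data.find(108)
last = data.rindex(108)
count = bytearray(data).count(108)
print(first, last, count)

# Integers outside range(256) are rejected.
try:
    data.find(256)
except ValueError:
    rejected = True
else:
    rejected = False
```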


New and Improved Modules
========================

array
-----

The :mod:`array` module supports the :c:type:`long long` type using ``q`` and
``Q`` type codes.

(Contributed by Oren Tirosh and Hirokazu Yamamoto in :issue:`1172711`)
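
A brief sketch of the two type codes, assuming a platform where
``long long`` is 8 bytes wide (which is the case on mainstream platforms,
but is not guaranteed by the C standard):

```python
from array import array

# "q" is signed long long, "Q" is unsigned long long.
signed = array("q", [-2**63, 2**63 - 1])
unsigned = array("Q", [0, 2**64 - 1])
print(signed.itemsize, unsigned.itemsize)
```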


codecs
------

The :mod:`~encodings.mbcs` codec has been rewritten to correctly handle the
``replace`` and ``ignore`` error handlers on all Windows versions. The
:mod:`~encodings.mbcs` codec now supports all error handlers, instead of
only ``replace`` to encode and ``ignore`` to decode.

A new Windows-only codec has been added: ``cp65001`` (:issue:`13216`). It is
the Windows code page 65001 (Windows UTF-8, ``CP_UTF8``). For example, it is
used by ``sys.stdout`` if the console output code page is set to cp65001 (e.g.
using the ``chcp 65001`` command).

Multibyte CJK decoders now resynchronize faster. They only ignore the first
byte of an invalid byte sequence. For example, ``b'\xff\n'.decode('gb2312',
'replace')`` now returns a ``\n`` after the replacement character.

(:issue:`12016`)

Incremental encoders of CJK codecs are no longer reset at each call to their
``encode()`` method. For example::

    $ ./python -q
    >>> import codecs
    >>> encoder = codecs.getincrementalencoder('hz')('strict')
    >>> b''.join(encoder.encode(x) for x in '\u52ff\u65bd\u65bc\u4eba\u3002 Bye.')
    b'~{NpJ)l6HK!#~} Bye.'

This example gives ``b'~{Np~}~{J)~}~{l6~}~{HK~}~{!#~} Bye.'`` with older Python
versions.

(:issue:`12100`)

The ``unicode_internal`` codec has been deprecated.

crypt
-----

Addition of salt and modular crypt format and the :func:`~crypt.mksalt`
function to the :mod:`crypt` module.

(:issue:`10924`)

curses
------

 * The :class:`curses.window` class has a new :meth:`~curses.window.get_wch`
   method to get a wide character
 * The :mod:`curses` module has a new :func:`~curses.unget_wch` function to
   push a wide character so the next :meth:`~curses.window.get_wch` will return
   it

(Contributed by Iñigo Serna in :issue:`6755`)

faulthandler
------------

New module: :mod:`faulthandler`. It can be enabled at startup with either of
the following:

 * :envvar:`PYTHONFAULTHANDLER`
 * :option:`-X` ``faulthandler``
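
The module can also be driven from code. The sketch below is one possible
way to do so: it writes to a temporary file only so that the example does not
depend on ``sys.stderr`` having a real file descriptor, which is an
assumption about the environment rather than a requirement of the module.

```python
import faulthandler
import tempfile

# faulthandler writes straight to a file descriptor, so it needs a file
# object with a usable fileno(); a temporary file always provides one.
with tempfile.TemporaryFile() as log:
    faulthandler.enable(file=log)
    enabled_now = faulthandler.is_enabled()
    # A traceback can also be dumped explicitly, without any fault.
    faulthandler.dump_traceback(file=log)
    # Disable before the file is closed so no stale descriptor is kept.
    faulthandler.disable()
    log.seek(0)
    dump = log.read().decode("ascii", "replace")

print(dump.splitlines()[0])
```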
351
Victor Stinnere0be4232011-10-25 13:06:09 +0200352time
353----
354
355* The :mod:`time` module has new :func:`~time.clock_getres` and
356 :func:`~time.clock_gettime` functions and ``CLOCK_xxx`` constants.
357 :func:`~time.clock_gettime` can be used with :data:`time.CLOCK_MONOTONIC` to
358 get a monotonic clock.
359
360 (Contributed by Victor Stinner in :issue:`10278`)
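
A possible use of the monotonic clock is to time a piece of work without
being affected by system clock adjustments. In this sketch,
``monotonic_elapsed`` is a hypothetical helper, and the ``hasattr`` checks
reflect the assumption that these functions are missing on some platforms:

```python
import time


def monotonic_elapsed(work):
    """Time *work()* with CLOCK_MONOTONIC; return None where unsupported."""
    if not (hasattr(time, "clock_gettime") and hasattr(time, "CLOCK_MONOTONIC")):
        return None
    start = time.clock_gettime(time.CLOCK_MONOTONIC)
    work()
    return time.clock_gettime(time.CLOCK_MONOTONIC) - start


elapsed = monotonic_elapsed(lambda: sum(range(100000)))
if elapsed is not None:
    resolution = time.clock_getres(time.CLOCK_MONOTONIC)
    print("elapsed %.6f s (clock resolution %.9f s)" % (elapsed, resolution))
```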


ftplib
------

The :class:`~ftplib.FTP_TLS` class now provides a new
:meth:`~ftplib.FTP_TLS.ccc` method to revert the control channel back to
plaintext. This can be useful to take advantage of firewalls that know how to
handle NAT with non-secure FTP without opening fixed ports.

(Contributed by Giampaolo Rodolà in :issue:`12139`)


imaplib
-------

The :class:`~imaplib.IMAP4_SSL` constructor now accepts an SSLContext
parameter to control parameters of the secure channel.

(Contributed by Sijin Joseph in :issue:`8808`)


math
----

The :mod:`math` module has a new function:

 * :func:`~math.log2`: return the base-2 logarithm of *x*
   (Written by Mark Dickinson in :issue:`11888`).
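
A short sketch comparing the new function with the older
``math.log(x, 2)`` spelling; the exactness shown for powers of two is what
CPython is expected to deliver on common platforms:

```python
import math

# log2() gives an exact result for powers of two, whereas log(x, 2) goes
# through a floating-point division and may be slightly off.
exact = math.log2(2 ** 29)
print(exact, math.log(2 ** 29, 2))
```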


nntplib
-------

The :class:`nntplib.NNTP` class now supports the context manager protocol to
unconditionally consume :exc:`socket.error` exceptions and to close the NNTP
connection when done::

    >>> from nntplib import NNTP
    >>> with NNTP('news.gmane.org') as n:
    ...     n.group('gmane.comp.python.committers')
    ...
    ('211 1755 1 1755 gmane.comp.python.committers', 1755, 1, 1755, 'gmane.comp.python.committers')
    >>>

(Contributed by Giampaolo Rodolà in :issue:`9795`)


os
--

* The :mod:`os` module has a new :func:`~os.pipe2` function that makes it
  possible to create a pipe with :data:`~os.O_CLOEXEC` or
  :data:`~os.O_NONBLOCK` flags set atomically. This is especially useful to
  avoid race conditions in multi-threaded programs.

* The :mod:`os` module has a new :func:`~os.sendfile` function which provides
  an efficient "zero-copy" way for copying data from one file (or socket)
  descriptor to another. The phrase "zero-copy" refers to the fact that all of
  the copying of data between the two descriptors is done entirely by the
  kernel, with no copying of data into userspace buffers. :func:`~os.sendfile`
  can be used to efficiently copy data from a file on disk to a network socket,
  e.g. for downloading a file.

  (Patch submitted by Ross Lagerwall and Giampaolo Rodolà in :issue:`10882`.)

* The :mod:`os` module has two new functions: :func:`~os.getpriority` and
  :func:`~os.setpriority`. They can be used to get or set process
  niceness/priority in a fashion similar to :func:`os.nice` but extended to all
  processes instead of just the current one.

  (Patch submitted by Giampaolo Rodolà in :issue:`10784`.)

* "at" functions (:issue:`4761`):

  * :func:`~os.faccessat`
  * :func:`~os.fchmodat`
  * :func:`~os.fchownat`
  * :func:`~os.fstatat`
  * :func:`~os.futimesat`
  * :func:`~os.linkat`
  * :func:`~os.mkdirat`
  * :func:`~os.mkfifoat`
  * :func:`~os.mknodat`
  * :func:`~os.openat`
  * :func:`~os.readlinkat`
  * :func:`~os.renameat`
  * :func:`~os.symlinkat`
  * :func:`~os.unlinkat`
  * :func:`~os.utimensat`

* Extended attributes (:issue:`12720`):

  * :func:`~os.fgetxattr`
  * :func:`~os.flistxattr`
  * :func:`~os.fremovexattr`
  * :func:`~os.fsetxattr`
  * :func:`~os.getxattr`
  * :func:`~os.lgetxattr`
  * :func:`~os.listxattr`
  * :func:`~os.llistxattr`
  * :func:`~os.lremovexattr`
  * :func:`~os.lsetxattr`
  * :func:`~os.removexattr`
  * :func:`~os.setxattr`

* Scheduler functions (:issue:`12655`):

  * :func:`~os.sched_get_priority_max`
  * :func:`~os.sched_get_priority_min`
  * :func:`~os.sched_getaffinity`
  * :func:`~os.sched_getparam`
  * :func:`~os.sched_getscheduler`
  * :func:`~os.sched_rr_get_interval`
  * :func:`~os.sched_setaffinity`
  * :func:`~os.sched_setparam`
  * :func:`~os.sched_setscheduler`
  * :func:`~os.sched_yield`

* Additional POSIX functions (:issue:`10812`):

  * :func:`~os.fexecve`
  * :func:`~os.futimens`
  * :func:`~os.futimes`
  * :func:`~os.lockf`
  * :func:`~os.lutimes`
  * :func:`~os.posix_fadvise`
  * :func:`~os.posix_fallocate`
  * :func:`~os.pread`
  * :func:`~os.pwrite`
  * :func:`~os.readv`
  * :func:`~os.sync`
  * :func:`~os.truncate`
  * :func:`~os.waitid`
  * :func:`~os.writev`

* Other new functions:

  * :func:`~os.fdlistdir` (:issue:`10755`)
  * :func:`~os.getgrouplist` (:issue:`9344`)
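
As an illustration of :func:`~os.pipe2`, the sketch below creates a
non-blocking, close-on-exec pipe in a single call. ``nonblocking_pipe_demo``
is a hypothetical helper, and the ``hasattr`` check reflects the assumption
that the function is only available on some Unix platforms:

```python
import os


def nonblocking_pipe_demo():
    """Create a pipe with both flags set atomically and exercise it.

    Returns None on platforms that do not provide os.pipe2().
    """
    if not hasattr(os, "pipe2"):
        return None
    read_fd, write_fd = os.pipe2(os.O_NONBLOCK | os.O_CLOEXEC)
    try:
        try:
            os.read(read_fd, 1)           # nothing written yet
            empty_read_blocked = False
        except BlockingIOError:           # one of the PEP 3151 subclasses
            empty_read_blocked = True
        os.write(write_fd, b"x")
        data = os.read(read_fd, 1)
    finally:
        os.close(read_fd)
        os.close(write_fd)
    return empty_read_blocked, data


print(nonblocking_pipe_demo())
```

Setting the flags in the same call that creates the pipe leaves no window in
which another thread could fork and inherit the descriptors before
``O_CLOEXEC`` is applied.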


packaging
---------

:mod:`distutils` has undergone additions and refactoring under a new name,
:mod:`packaging`, to allow developers to break backward compatibility.
:mod:`distutils` is still provided in the standard library, but users are
encouraged to transition to :mod:`packaging`. For older versions of Python, a
backport compatible with 2.4+ and 3.1+ will be made available on PyPI under the
name :mod:`distutils2`.

.. TODO add examples and howto to the packaging docs and link to them


pydoc
-----

The Tk GUI and the :func:`~pydoc.serve` function have been removed from the
:mod:`pydoc` module: ``pydoc -g`` and :func:`~pydoc.serve` were deprecated
in Python 3.2.


sys
---

* The :mod:`sys` module has a new :data:`~sys.thread_info` :term:`struct
  sequence` holding information about the thread implementation.

  (:issue:`11223`)


signal
------

* The :mod:`signal` module has new functions:

  * :func:`~signal.pthread_sigmask`: fetch and/or change the signal mask of the
    calling thread (Contributed by Jean-Paul Calderone in :issue:`8407`);
  * :func:`~signal.pthread_kill`: send a signal to a thread;
  * :func:`~signal.sigpending`: examine pending signals;
  * :func:`~signal.sigwait`: wait for a signal;
  * :func:`~signal.sigwaitinfo`: wait for a signal, returning detailed
    information about it;
  * :func:`~signal.sigtimedwait`: like :func:`~signal.sigwaitinfo` but with a
    timeout.

* The signal handler writes the signal number as a single byte instead of
  a nul byte into the wakeup file descriptor. So it is possible to wait for
  more than one signal and know which signals were raised.

* :func:`signal.signal` and :func:`signal.siginterrupt` raise an
  :exc:`OSError` instead of a :exc:`RuntimeError`: :exc:`OSError` has an
  ``errno`` attribute.
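
The new functions can be combined to receive a signal synchronously instead
of through a handler. The following is a sketch under several assumptions:
``catch_blocked_signal`` is a hypothetical helper, ``SIGUSR1`` is chosen
arbitrarily, and the functions are assumed to exist only on Unix platforms.

```python
import signal
import threading


def catch_blocked_signal(signum=None):
    """Block a signal, send it to this thread, then collect it with sigwait().

    Returns None on platforms without pthread_sigmask() (e.g. Windows).
    """
    if not hasattr(signal, "pthread_sigmask"):
        return None
    if signum is None:
        signum = signal.SIGUSR1
    old_mask = signal.pthread_sigmask(signal.SIG_BLOCK, {signum})
    try:
        # The signal is blocked, so it stays pending instead of being handled.
        signal.pthread_kill(threading.get_ident(), signum)
        was_pending = signum in signal.sigpending()
        received = signal.sigwait({signum})   # consumes the pending signal
    finally:
        signal.pthread_sigmask(signal.SIG_SETMASK, old_mask)
    return was_pending, received == signum


print(catch_blocked_signal())
```

The signal is sent with :func:`~signal.pthread_kill` rather than to the whole
process so that it is directed at the thread that has it blocked.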

socket
------

* The :class:`~socket.socket` class now exposes additional methods to process
  ancillary data when supported by the underlying platform:

  * :func:`~socket.socket.sendmsg`
  * :func:`~socket.socket.recvmsg`
  * :func:`~socket.socket.recvmsg_into`

  (Contributed by David Watson in :issue:`6560`, based on an earlier patch by
  Heiko Wundram)

* The :class:`~socket.socket` class now supports the PF_CAN protocol family
  (http://en.wikipedia.org/wiki/Socketcan), on Linux
  (http://lwn.net/Articles/253425).

  (Contributed by Matthias Fuchs, updated by Tiago Gonçalves in :issue:`10141`)

* The :class:`~socket.socket` class now supports the PF_RDS protocol family
  (http://en.wikipedia.org/wiki/Reliable_Datagram_Sockets and
  http://oss.oracle.com/projects/rds/).
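
Even without ancillary data, ``sendmsg()`` and ``recvmsg()`` are useful for
sending several buffers in one call. A minimal sketch, in which
``scatter_gather_roundtrip`` is a hypothetical helper and a local socket pair
stands in for a real connection:

```python
import socket


def scatter_gather_roundtrip():
    """Send two buffers with one sendmsg() call and read them back.

    Returns None where sendmsg() is not available (e.g. Windows).
    """
    if not hasattr(socket.socket, "sendmsg"):
        return None
    left, right = socket.socketpair()
    with left, right:
        sent = left.sendmsg([b"hello ", b"world"])
        # recvmsg() returns the data plus ancillary data, flags and address.
        data, ancdata, msg_flags, address = right.recvmsg(1024)
    return sent, data, ancdata


print(scatter_gather_roundtrip())
```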

ssl
---

* The :mod:`ssl` module has two new random generation functions:

  * :func:`~ssl.RAND_bytes`: generate cryptographically strong
    pseudo-random bytes.
  * :func:`~ssl.RAND_pseudo_bytes`: generate pseudo-random bytes.

  (Contributed by Victor Stinner in :issue:`12049`)

* The :mod:`ssl` module now exposes a finer-grained exception hierarchy
  in order to make it easier to inspect the various kinds of errors.

  (Contributed by Antoine Pitrou in :issue:`11183`)

* :meth:`~ssl.SSLContext.load_cert_chain` now accepts a *password* argument
  to be used if the private key is encrypted.

  (Contributed by Adam Simpkins in :issue:`12803`)

* SSL sockets have a new :meth:`~ssl.SSLSocket.get_channel_binding` method
  allowing the implementation of certain authentication mechanisms such as
  SCRAM-SHA-1-PLUS.

  (Contributed by Jacek Konieczny in :issue:`12551`)
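
A sketch of the first of the random generation functions; the token length
of 16 bytes is an arbitrary choice for the illustration:

```python
import ssl

# 16 cryptographically strong random bytes, e.g. for a session token.
token = ssl.RAND_bytes(16)
print(len(token), type(token).__name__)
```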

shutil
------

* The :mod:`shutil` module has these new functions:

  * :func:`~shutil.disk_usage`: provides total, used and free disk space
    statistics. (Contributed by Giampaolo Rodolà in :issue:`12442`)
  * :func:`~shutil.chown`: allows one to change user and/or group of the given
    path also specifying the user/group names and not only their numeric
    ids. (Contributed by Sandro Tosi in :issue:`12191`)
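
For example, the statistics for the file system holding the current
directory can be read as follows (the choice of path is only illustrative):

```python
import os
import shutil

# disk_usage() returns a named tuple with sizes in bytes.
usage = shutil.disk_usage(os.getcwd())
print("total=%d used=%d free=%d" % (usage.total, usage.used, usage.free))
```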

smtplib
-------

The :class:`~smtplib.SMTP_SSL` constructor and the :meth:`~smtplib.SMTP.starttls`
method now accept an SSLContext parameter to control parameters of the secure
channel.

(Contributed by Kasun Herath in :issue:`8809`)

urllib
------

The :class:`~urllib.request.Request` class now accepts a *method* argument
used by :meth:`~urllib.request.Request.get_method` to determine what HTTP method
should be used. For example, this will send a ``'HEAD'`` request::

    >>> urlopen(Request('http://www.python.org', method='HEAD'))

(:issue:`1673007`)
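
The effect of the argument can be inspected without sending anything over
the network. A small sketch that only builds the request objects:

```python
from urllib.request import Request

# An explicit method overrides the default, which depends on whether the
# request carries data.
head = Request("http://www.python.org", method="HEAD")
default_get = Request("http://www.python.org")
default_post = Request("http://www.python.org", data=b"payload")
print(head.get_method(), default_get.get_method(), default_post.get_method())
```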

sched
-----

* The *timefunc* and *delayfunc* parameters of the :class:`~sched.scheduler`
  class constructor are now optional and default to :func:`time.time` and
  :func:`time.sleep` respectively. (Contributed by Chris Clark in
  :issue:`13245`)

* The *argument* parameter of :meth:`~sched.scheduler.enter` and
  :meth:`~sched.scheduler.enterabs` is now optional. (Contributed by Chris
  Clark in :issue:`13245`)

* :meth:`~sched.scheduler.enter` and :meth:`~sched.scheduler.enterabs`
  now accept a *kwargs* parameter. (Contributed by Chris Clark in
  :issue:`13245`)
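
The three changes can be seen together in the following sketch, where
``record`` and ``calls`` are hypothetical names used only for the
illustration:

```python
import sched

calls = []


def record(label, suffix=""):
    calls.append(label + suffix)


scheduler = sched.scheduler()   # timefunc and delayfunc use their defaults
scheduler.enter(0, 1, record, argument=("first",))
scheduler.enter(0, 2, record, argument=("second",), kwargs={"suffix": "!"})
scheduler.enter(0, 3, lambda: calls.append("no-args"))   # argument omitted
scheduler.run()
print(calls)
```

Events scheduled with the same delay run in priority order, so the three
calls are recorded in the order in which they were entered.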

Optimizations
=============

Major performance enhancements have been added:

* Thanks to :pep:`393`, some operations on Unicode strings have been optimized:

  * the memory footprint is divided by 2 to 4 depending on the text
  * encoding an ASCII string to UTF-8 doesn't need to encode characters anymore,
    the UTF-8 representation is shared with the ASCII representation
  * getting a substring of a latin1 string is 4 times faster


Build and C API Changes
=======================

Changes to Python's build process and to the C API include:

* :pep:`393` added new Unicode types, macros and functions:

  * High-level API:

    * :c:func:`PyUnicode_CopyCharacters`
    * :c:func:`PyUnicode_FindChar`
    * :c:func:`PyUnicode_GetLength`, :c:macro:`PyUnicode_GET_LENGTH`
    * :c:func:`PyUnicode_New`
    * :c:func:`PyUnicode_Substring`
    * :c:func:`PyUnicode_ReadChar`, :c:func:`PyUnicode_WriteChar`

  * Low-level API:

    * :c:type:`Py_UCS1`, :c:type:`Py_UCS2`, :c:type:`Py_UCS4` types
    * :c:type:`PyASCIIObject` and :c:type:`PyCompactUnicodeObject` structures
    * :c:macro:`PyUnicode_READY`
    * :c:func:`PyUnicode_FromKindAndData`
    * :c:func:`PyUnicode_AsUCS4`, :c:func:`PyUnicode_AsUCS4Copy`
    * :c:macro:`PyUnicode_DATA`, :c:macro:`PyUnicode_1BYTE_DATA`,
      :c:macro:`PyUnicode_2BYTE_DATA`, :c:macro:`PyUnicode_4BYTE_DATA`
    * :c:macro:`PyUnicode_KIND` with :c:type:`PyUnicode_Kind` enum:
      :c:data:`PyUnicode_WCHAR_KIND`, :c:data:`PyUnicode_1BYTE_KIND`,
      :c:data:`PyUnicode_2BYTE_KIND`, :c:data:`PyUnicode_4BYTE_KIND`
    * :c:macro:`PyUnicode_READ`, :c:macro:`PyUnicode_READ_CHAR`, :c:macro:`PyUnicode_WRITE`
    * :c:macro:`PyUnicode_MAX_CHAR_VALUE`



Georg Brandl0cd25c92011-04-29 13:45:54 +0200703Unsupported Operating Systems
Victor Stinnerb90db4c2011-04-26 22:48:24 +0200704=============================
705
Brian Curtin49a40cd2011-05-02 22:30:06 -0500706OS/2 and VMS are no longer supported due to the lack of a maintainer.
707
708Windows 2000 and Windows platforms which set ``COMSPEC`` to ``command.com``
709are no longer supported due to maintenance burden.
Victor Stinnerb90db4c2011-04-26 22:48:24 +0200710
711
Deprecated Python modules, functions and methods
================================================

* The :mod:`packaging` module replaces the :mod:`distutils` module
* The ``unicode_internal`` codec has been deprecated because of
  :pep:`393`; use UTF-8, UTF-16 (``utf-16-le`` or ``utf-16-be``), or UTF-32
  (``utf-32-le`` or ``utf-32-be``)
* :meth:`ftplib.FTP.nlst` and :meth:`ftplib.FTP.dir`: use
  :meth:`ftplib.FTP.mlsd`
* :func:`platform.popen`: use the :mod:`subprocess` module. Check especially
  the :ref:`subprocess-replacements` section.
* :issue:`13374`: The Windows bytes API has been deprecated in the :mod:`os`
  module. Use Unicode filenames instead of bytes filenames, so that code no
  longer depends on the ANSI code page and can support any filename.
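
As an illustration of the :func:`platform.popen` replacement listed above,
here is a minimal sketch using :func:`subprocess.check_output`. The helper
name ``read_command_output`` is hypothetical, and passing the command as a
list of arguments (rather than a shell string) is a choice made for this
example:

```python
import subprocess
import sys


def read_command_output(args):
    """Run a command given as a list of arguments and return its stdout as text."""
    # universal_newlines=True makes check_output() return str instead of bytes.
    return subprocess.check_output(args, universal_newlines=True)


# Example: run the current interpreter and capture what it prints.
output = read_command_output([sys.executable, "-c", "print('hello')"])
print(output.strip())
```

Unlike a pipe opened with :func:`platform.popen`, this call raises
:exc:`subprocess.CalledProcessError` when the command exits with a non-zero
status, so failures are not silently ignored.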


Deprecated functions and types of the C API
===========================================

The :c:type:`Py_UNICODE` type has been deprecated by :pep:`393` and will be
removed in Python 4. All functions using this type are deprecated:

Unicode functions and methods using :c:type:`Py_UNICODE` and
:c:type:`Py_UNICODE*` types:

 * :c:func:`PyUnicode_FromUnicode`: use :c:func:`PyUnicode_FromWideChar` or
   :c:func:`PyUnicode_FromKindAndData`
 * :c:macro:`PyUnicode_AS_UNICODE`, :c:func:`PyUnicode_AsUnicode`,
   :c:func:`PyUnicode_AsUnicodeAndSize`: use :c:func:`PyUnicode_AsWideCharString`
 * :c:macro:`PyUnicode_AS_DATA`: use :c:macro:`PyUnicode_DATA` with
   :c:macro:`PyUnicode_READ` and :c:macro:`PyUnicode_WRITE`
 * :c:macro:`PyUnicode_GET_SIZE`, :c:func:`PyUnicode_GetSize`: use
   :c:macro:`PyUnicode_GET_LENGTH` or :c:func:`PyUnicode_GetLength`
 * :c:macro:`PyUnicode_GET_DATA_SIZE`: use
   ``PyUnicode_GET_LENGTH(str) * PyUnicode_KIND(str)`` (only works on ready
   strings)
 * :c:func:`PyUnicode_AsUnicodeCopy`: use :c:func:`PyUnicode_AsUCS4Copy`,
   :c:func:`PyUnicode_AsWideCharString` or :c:func:`PyUnicode_Copy`

Functions and macros manipulating ``Py_UNICODE*`` strings:

 * :c:macro:`Py_UNICODE_strlen`: use :c:func:`PyUnicode_GetLength` or
   :c:macro:`PyUnicode_GET_LENGTH`
 * :c:macro:`Py_UNICODE_strcat`: use :c:func:`PyUnicode_CopyCharacters` or
   :c:func:`PyUnicode_FromFormat`
 * :c:macro:`Py_UNICODE_strcpy`, :c:macro:`Py_UNICODE_strncpy`,
   :c:macro:`Py_UNICODE_COPY`: use :c:func:`PyUnicode_CopyCharacters` or
   :c:func:`PyUnicode_Substring`
 * :c:macro:`Py_UNICODE_strcmp`: use :c:func:`PyUnicode_Compare`
 * :c:macro:`Py_UNICODE_strncmp`: use :c:func:`PyUnicode_Tailmatch`
 * :c:macro:`Py_UNICODE_strchr`, :c:macro:`Py_UNICODE_strrchr`: use
   :c:func:`PyUnicode_FindChar`
 * :c:macro:`Py_UNICODE_FILL`

Encoders:

 * :c:func:`PyUnicode_Encode`: use :c:func:`PyUnicode_AsEncodedObject`
 * :c:func:`PyUnicode_EncodeUTF7`
 * :c:func:`PyUnicode_EncodeUTF8`: use :c:func:`PyUnicode_AsUTF8` or
   :c:func:`PyUnicode_AsUTF8String`
 * :c:func:`PyUnicode_EncodeUTF32`
 * :c:func:`PyUnicode_EncodeUTF16`
 * :c:func:`PyUnicode_EncodeUnicodeEscape`: use
   :c:func:`PyUnicode_AsUnicodeEscapeString`
 * :c:func:`PyUnicode_EncodeRawUnicodeEscape`: use
   :c:func:`PyUnicode_AsRawUnicodeEscapeString`
 * :c:func:`PyUnicode_EncodeLatin1`: use :c:func:`PyUnicode_AsLatin1String`
 * :c:func:`PyUnicode_EncodeASCII`: use :c:func:`PyUnicode_AsASCIIString`
 * :c:func:`PyUnicode_EncodeCharmap`
 * :c:func:`PyUnicode_TranslateCharmap`
 * :c:func:`PyUnicode_EncodeMBCS`: use :c:func:`PyUnicode_AsMBCSString` or
   :c:func:`PyUnicode_EncodeCodePage` (with ``CP_ACP`` code_page)
 * :c:func:`PyUnicode_EncodeDecimal`,
   :c:func:`PyUnicode_TransformDecimalToASCII`


Porting to Python 3.3
=====================

This section lists previously described changes and other bugfixes
that may require changes to your code.

Porting Python code
-------------------

* :issue:`12326`: On Linux, ``sys.platform`` doesn't contain the major version
  anymore. It is now always ``'linux'``, instead of ``'linux2'`` or ``'linux3'``
  depending on the Linux version used to build Python. Replace
  ``sys.platform == 'linux2'`` with ``sys.platform.startswith('linux')``, or
  directly ``sys.platform == 'linux'`` if you don't need to support older
  Python versions.
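
The portable check recommended above can be written as follows. This is a
small sketch; the helper name ``is_linux`` and its optional ``platform``
argument are hypothetical and exist only to make the example easy to try:

```python
import sys


def is_linux(platform=None):
    """Return True if the platform string denotes Linux.

    Accepts both the new value 'linux' and the older 'linux2' / 'linux3'
    values, so the same check works across Python versions.
    """
    if platform is None:
        platform = sys.platform
    return platform.startswith("linux")


if is_linux():
    print("Running on Linux")
else:
    print("Not running on Linux:", sys.platform)
```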

Porting C code
--------------

* Due to :ref:`PEP 393 <pep-393>`, the :c:type:`Py_UNICODE` type and all
  functions using this type are deprecated (but will stay available for
  at least five years). If you were using low-level Unicode APIs to
  construct and access unicode objects and you want to benefit from the
  memory footprint reduction provided by PEP 393, you have to convert
  your code to the new :doc:`Unicode API <../c-api/unicode>`.

  However, if you have only been using high-level functions such as
  :c:func:`PyUnicode_Concat`, :c:func:`PyUnicode_Join` or
  :c:func:`PyUnicode_FromFormat`, your code will automatically take
  advantage of the new unicode representations.

Other issues
------------

.. Issue #11591: When :program:`python` was started with :option:`-S`,
   ``import site`` will not add site-specific paths to the module search
   paths. In previous versions, it did. See changeset for doc changes in
   various files. Contributed by Carl Meyer with editions by Éric Araujo.

.. Issue #10998: the -Q command-line flag and related artifacts have been
   removed. Code checking sys.flags.division_warning will need updating.
   Contributed by Éric Araujo.