****************************
  What's New In Python 3.3
****************************

:Author: Raymond Hettinger
:Release: |release|
:Date: |today|

.. Rules for maintenance:

   * Anyone can add text to this document. Do not spend very much time
     on the wording of your changes, because your text will probably
     get rewritten to some degree.

   * The maintainer will go through Misc/NEWS periodically and add
     changes; it's therefore more important to add your changes to
     Misc/NEWS than to this file.

   * This is not a complete list of every single change; completeness
     is the purpose of Misc/NEWS. Some changes I consider too small
     or esoteric to include. If such a change is added to the text,
     I'll just remove it. (This is another reason you shouldn't spend
     too much time on writing your addition.)

   * If you want to draw your new text to the attention of the
     maintainer, add 'XXX' to the beginning of the paragraph or
     section.

   * It's OK to just add a fragmentary note about a change. For
     example: "XXX Describe the transmogrify() function added to the
     socket module." The maintainer will research the change and
     write the necessary text.

   * You can comment out your additions if you like, but it's not
     necessary (especially when a final release is some months away).

   * Credit the author of a patch or bugfix. Just the name is
     sufficient; the e-mail address isn't necessary.

   * It's helpful to add the bug/patch number as a comment:

     XXX Describe the transmogrify() function added to the socket
     module.
     (Contributed by P.Y. Developer in :issue:`12345`.)

   This saves the maintainer the effort of going through the Mercurial log
   when researching a change.

This article explains the new features in Python 3.3, compared to 3.2.


.. _pep-393:

PEP 393: Flexible String Representation
=======================================

The Unicode string type is changed to support multiple internal
representations, depending on the character with the largest Unicode ordinal
(1, 2, or 4 bytes) in the represented string. This allows a space-efficient
representation in common cases, but gives access to full UCS-4 on all
systems. For compatibility with existing APIs, several representations may
exist in parallel; over time, this compatibility should be phased out.

On the Python side, there should be no downside to this change.

On the C API side, PEP 393 is fully backward compatible. The legacy API
should remain available for at least five years. Applications using the legacy
API will not fully benefit from the memory reduction and may even use
a bit more memory, because Python may have to maintain two versions of each
string (in the legacy format and in the new efficient storage).

Changes introduced by :pep:`393` are the following:

* Python now always supports the full range of Unicode codepoints, including
  non-BMP ones (i.e. from ``U+0000`` to ``U+10FFFF``). The distinction between
  narrow and wide builds no longer exists and Python now behaves like a wide
  build, even under Windows.

* The storage of Unicode strings now depends on the highest codepoint in the string:

  * pure ASCII and Latin1 strings (``U+0000-U+00FF``) use 1 byte per codepoint;

  * BMP strings (``U+0000-U+FFFF``) use 2 bytes per codepoint;

  * non-BMP strings (``U+10000-U+10FFFF``) use 4 bytes per codepoint.

  The net effect is that for most applications, memory usage of string storage
  should decrease significantly, especially compared to former wide Unicode
  builds, because in many cases strings will be pure ASCII even in international
  contexts (many strings store non-human language data, such as XML
  fragments, HTTP headers, JSON-encoded data, etc.). We also hope that it
  will, for the same reasons, increase CPU cache efficiency on non-trivial
  applications.

  .. The memory usage of Python 3.3 is two to three times smaller than Python 3.2,
     and a little bit better than Python 2.7, on a `Django benchmark
     <http://mail.python.org/pipermail/python-dev/2011-September/113714.html>`_.
     XXX The result should be moved in the PEP and a link to the PEP should
     be added here.

* With the death of narrow builds, the problems specific to narrow builds have
  also been fixed, for example:

  * :func:`len` now always returns 1 for non-BMP characters,
    so ``len('\U0010FFFF') == 1``;

  * surrogate pairs are not recombined in string literals,
    so ``'\uDBFF\uDFFF' != '\U0010FFFF'``;

  * indexing or slicing non-BMP characters returns the expected value,
    so ``'\U0010FFFF'[0]`` now returns ``'\U0010FFFF'`` and not ``'\uDBFF'``;

  * all other functions in the standard library now correctly handle
    non-BMP codepoints.

* The value of :data:`sys.maxunicode` is now always ``1114111`` (``0x10FFFF``
  in hexadecimal). The :c:func:`PyUnicode_GetMax` function still returns
  either ``0xFFFF`` or ``0x10FFFF`` for backward compatibility, and it should
  not be used with the new Unicode API (see :issue:`13054`).

* The :file:`./configure` flag ``--with-wide-unicode`` has been removed.
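
The effects listed above can be observed from Python code. The following is
only a sketch, assuming CPython 3.3 or later: the helper name
``bytes_per_codepoint`` is hypothetical, and the measurement relies on
:func:`sys.getsizeof`, whose results are implementation-specific::

    import sys

    def bytes_per_codepoint(ch, n=1000):
        """Estimate per-codepoint storage by growing a one-character string."""
        # The fixed object overhead cancels out in the difference.
        return (sys.getsizeof(ch * 2 * n) - sys.getsizeof(ch * n)) // n

    print(bytes_per_codepoint("a"))           # ASCII
    print(bytes_per_codepoint("\u20ac"))      # BMP (EURO SIGN)
    print(bytes_per_codepoint("\U0001F600"))  # non-BMP

    # The narrow-build quirks are gone on every platform.
    assert len("\U0010FFFF") == 1
    assert "\U0010FFFF"[0] == "\U0010FFFF"
    assert "\uDBFF\uDFFF" != "\U0010FFFF"
    assert sys.maxunicode == 0x10FFFF

On other implementations the per-codepoint figures may differ, while the
assertions about length, indexing and ``sys.maxunicode`` should still hold.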


PEP 3151: Reworking the OS and IO exception hierarchy
=====================================================

:pep:`3151` - Reworking the OS and IO exception hierarchy
   PEP written and implemented by Antoine Pitrou.

The hierarchy of exceptions raised by operating system errors is now both
simplified and finer-grained.

You no longer have to worry about choosing the appropriate exception
type between :exc:`OSError`, :exc:`IOError`, :exc:`EnvironmentError`,
:exc:`WindowsError`, :exc:`mmap.error`, :exc:`socket.error` or
:exc:`select.error`. All of these exception types are now a single type:
:exc:`OSError`. The other names are kept as aliases for compatibility
reasons.

Also, it is now easier to catch a specific error condition. Instead of
inspecting the ``errno`` attribute (or ``args[0]``) for a particular
constant from the :mod:`errno` module, you can catch the appropriate
:exc:`OSError` subclass. The available subclasses are the following:

* :exc:`BlockingIOError`
* :exc:`ChildProcessError`
* :exc:`ConnectionError`
* :exc:`FileExistsError`
* :exc:`FileNotFoundError`
* :exc:`InterruptedError`
* :exc:`IsADirectoryError`
* :exc:`NotADirectoryError`
* :exc:`PermissionError`
* :exc:`ProcessLookupError`
* :exc:`TimeoutError`

And the :exc:`ConnectionError` itself has finer-grained subclasses:

* :exc:`BrokenPipeError`
* :exc:`ConnectionAbortedError`
* :exc:`ConnectionRefusedError`
* :exc:`ConnectionResetError`

Thanks to the new exceptions, common usages of the :mod:`errno` module can now be
avoided. For example, the following code written for Python 3.2::

    from errno import ENOENT, EACCES, EPERM

    try:
        with open("document.txt") as f:
            content = f.read()
    except IOError as err:
        if err.errno == ENOENT:
            print("document.txt file is missing")
        elif err.errno in (EACCES, EPERM):
            print("You are not allowed to read document.txt")
        else:
            raise

can now be written without the :mod:`errno` import and without manual
inspection of exception attributes::

    try:
        with open("document.txt") as f:
            content = f.read()
    except FileNotFoundError:
        print("document.txt file is missing")
    except PermissionError:
        print("You are not allowed to read document.txt")
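
The aliasing and the mapping from ``errno`` values to subclasses can be
checked directly. This is a small illustrative sketch, assuming Python 3.3 or
later; the error messages passed to the constructors are arbitrary::

    import errno
    import mmap
    import select
    import socket

    # The legacy names are plain aliases of OSError.
    assert IOError is OSError
    assert EnvironmentError is OSError
    assert socket.error is OSError
    assert select.error is OSError
    assert mmap.error is OSError

    # Constructing OSError with an errno value picks the matching subclass.
    err = OSError(errno.ENOENT, "No such file or directory")
    assert isinstance(err, FileNotFoundError)

    # Catching a base class still handles the finer-grained subclasses.
    try:
        raise ConnectionResetError(errno.ECONNRESET, "Connection reset by peer")
    except ConnectionError as exc:
        assert exc.errno == errno.ECONNRESET

Because the old names are aliases, existing ``except IOError:`` clauses keep
working and now also catch errors that used to be raised under another name.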


Other Language Changes
======================

Some smaller changes made to the core Python language are:

Support for Unicode name aliases and named sequences has been added.
Both :func:`unicodedata.lookup()` and ``'\N{...}'`` now resolve name aliases,
and :func:`unicodedata.lookup()` resolves named sequences too.

(Contributed by Ezio Melotti in :issue:`12753`)
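
As a short illustration, assuming a Unicode database that defines the alias
and the named sequence used here (both names are taken from the Unicode data
files, not from this document)::

    import unicodedata

    # U+01A2 is formally LATIN CAPITAL LETTER OI; "GHA" is its name alias.
    assert unicodedata.lookup("LATIN CAPITAL LETTER GHA") == "\u01a2"
    assert "\N{LATIN CAPITAL LETTER GHA}" == "\u01a2"

    # A named sequence resolves to more than one code point.
    sequence = unicodedata.lookup("LATIN SMALL LETTER R WITH TILDE")
    assert sequence == "r\u0303"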


Equality comparisons on :func:`range` objects now return a result reflecting
the equality of the underlying sequences generated by those range objects.

(:issue:`13021`)
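
For instance, two ranges built from different arguments compare equal when
they produce the same values::

    assert range(0) == range(2, 1, 3)        # both are empty
    assert range(0, 3, 2) == range(0, 4, 2)  # both yield 0, 2
    assert range(3) != range(4)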


The ``count()``, ``find()``, ``rfind()``, ``index()`` and ``rindex()``
methods of :class:`bytes` and :class:`bytearray` objects now accept an
integer between 0 and 255 as their first argument.

(:issue:`12170`)
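
A brief sketch of the integer form, using an arbitrary sample value::

    data = b"banana"

    assert data.count(97) == 3               # 97 == ord("a")
    assert data.find(110) == 2               # 110 == ord("n")
    assert data.rfind(110) == 4
    assert bytearray(data).index(98) == 0    # 98 == ord("b")
    assert bytearray(data).rindex(97) == 5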


New and Improved Modules
========================

array
-----

The :mod:`array` module supports the :c:type:`long long` type using ``q`` and
``Q`` type codes.

(Contributed by Oren Tirosh and Hirokazu Yamamoto in :issue:`1172711`)
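
A minimal sketch, assuming a platform where :c:type:`long long` is at least
64 bits wide::

    from array import array

    signed = array("q", [-2**63, 2**63 - 1])
    unsigned = array("Q", [0, 2**64 - 1])

    # itemsize reports the width in bytes of one element.
    print(signed.itemsize, unsigned.itemsize)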


codecs
------

The :mod:`~encodings.mbcs` codec has been rewritten to correctly handle the
``replace`` and ``ignore`` error handlers on all Windows versions. The
:mod:`~encodings.mbcs` codec now supports all error handlers, instead of
only ``replace`` to encode and ``ignore`` to decode.

A new Windows-only codec has been added: ``cp65001`` (:issue:`13216`). It is
the Windows code page 65001 (Windows UTF-8, ``CP_UTF8``). For example, it is
used by ``sys.stdout`` if the console output code page is set to cp65001 (e.g.
using the ``chcp 65001`` command).

Multibyte CJK decoders now resynchronize faster. They only ignore the first
byte of an invalid byte sequence. For example, ``b'\xff\n'.decode('gb2312',
'replace')`` now returns a ``\n`` after the replacement character.

(:issue:`12016`)
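
The resynchronization can be checked as follows (a sketch, assuming Python 3.3
or later)::

    decoded = b"\xff\n".decode("gb2312", "replace")

    # U+FFFD is the replacement character; the newline is no longer swallowed.
    assert decoded == "\ufffd\n"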

Incremental encoders of CJK codecs are no longer reset at each call to their
``encode()`` method. For example::

    $ ./python -q
    >>> import codecs
    >>> encoder = codecs.getincrementalencoder('hz')('strict')
    >>> b''.join(encoder.encode(x) for x in '\u52ff\u65bd\u65bc\u4eba\u3002 Bye.')
    b'~{NpJ)l6HK!#~} Bye.'

This example gives ``b'~{Np~}~{J)~}~{l6~}~{HK~}~{!#~} Bye.'`` with older Python
versions.

(:issue:`12100`)

The ``unicode_internal`` codec has been deprecated.

crypt
-----

The :mod:`crypt` module gains support for salts and the modular crypt format,
along with a new :func:`~crypt.mksalt` function.

(:issue:`10924`)

curses
------

* The :class:`curses.window` class has a new :meth:`~curses.window.get_wch`
  method to get a wide character.
* The :mod:`curses` module has a new :func:`~curses.unget_wch` function to
  push a wide character so the next :meth:`~curses.window.get_wch` will return
  it.

(Contributed by Iñigo Serna in :issue:`6755`)

faulthandler
------------

New module: :mod:`faulthandler`. It can be enabled at startup with either of:

* the :envvar:`PYTHONFAULTHANDLER` environment variable
* the :option:`-X` ``faulthandler`` command line option
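
The module can also be used programmatically. The following sketch dumps the
current tracebacks into a temporary file; the variable names are illustrative
only::

    import faulthandler
    import tempfile

    # faulthandler writes through the file descriptor, so the target
    # must be a real file that provides fileno().
    with tempfile.TemporaryFile() as report:
        faulthandler.dump_traceback(file=report, all_threads=True)
        report.seek(0)
        text = report.read().decode("utf-8", "replace")

    print(text.splitlines()[0])

In a long-running program, ``faulthandler.enable()`` is the more usual entry
point: it installs handlers that dump the tracebacks on fatal errors such as
a segmentation fault.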
292
Victor Stinnere0be4232011-10-25 13:06:09 +0200293time
294----
295
296* The :mod:`time` module has new :func:`~time.clock_getres` and
297 :func:`~time.clock_gettime` functions and ``CLOCK_xxx`` constants.
298 :func:`~time.clock_gettime` can be used with :data:`time.CLOCK_MONOTONIC` to
299 get a monotonic clock.
300
301 (Contributed by Victor Stinner in :issue:`10278`)
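
A minimal sketch of measuring an interval with the monotonic clock, guarded
because these functions are only exposed on platforms that provide them::

    import time

    if hasattr(time, "clock_gettime"):
        start = time.clock_gettime(time.CLOCK_MONOTONIC)
        time.sleep(0.01)
        elapsed = time.clock_gettime(time.CLOCK_MONOTONIC) - start
        resolution = time.clock_getres(time.CLOCK_MONOTONIC)
        print("elapsed: %.4f s, resolution: %.9f s" % (elapsed, resolution))

Unlike wall-clock time, a monotonic clock cannot go backwards when the system
time is adjusted, which makes it suitable for timeouts and benchmarks.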


ftplib
------

The :class:`~ftplib.FTP_TLS` class now provides a new
:meth:`~ftplib.FTP_TLS.ccc` method to revert the control channel back to
plaintext. This can be useful to take advantage of firewalls that know how to
handle NAT with non-secure FTP without opening fixed ports.

(Contributed by Giampaolo Rodolà in :issue:`12139`)


imaplib
-------

The :class:`~imaplib.IMAP4_SSL` constructor now accepts an SSLContext
parameter to control parameters of the secure channel.

(Contributed by Sijin Joseph in :issue:`8808`)


math
----

The :mod:`math` module has a new function:

* :func:`~math.log2`: return the base-2 logarithm of *x*
  (Written by Mark Dickinson in :issue:`11888`).
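
For example::

    import math

    # Exact powers of two give exact results.
    assert math.log2(8) == 3.0
    assert math.log2(2 ** 100) == 100.0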


nntplib
-------

The :class:`nntplib.NNTP` class now supports the context manager protocol to
unconditionally consume :exc:`socket.error` exceptions and to close the NNTP
connection when done::

    >>> from nntplib import NNTP
    >>> with NNTP('news.gmane.org') as n:
    ...     n.group('gmane.comp.python.committers')
    ...
    ('211 1755 1 1755 gmane.comp.python.committers', 1755, 1, 1755, 'gmane.comp.python.committers')
    >>>

(Contributed by Giampaolo Rodolà in :issue:`9795`)


os
--

* The :mod:`os` module has a new :func:`~os.pipe2` function that makes it
  possible to create a pipe with :data:`~os.O_CLOEXEC` or
  :data:`~os.O_NONBLOCK` flags set atomically. This is especially useful to
  avoid race conditions in multi-threaded programs.

* The :mod:`os` module has a new :func:`~os.sendfile` function which provides
  an efficient "zero-copy" way for copying data from one file (or socket)
  descriptor to another. The phrase "zero-copy" refers to the fact that all of
  the copying of data between the two descriptors is done entirely by the
  kernel, with no copying of data into userspace buffers. :func:`~os.sendfile`
  can be used to efficiently copy data from a file on disk to a network socket,
  e.g. for downloading a file.

  (Patch submitted by Ross Lagerwall and Giampaolo Rodolà in :issue:`10882`.)

* The :mod:`os` module has two new functions: :func:`~os.getpriority` and
  :func:`~os.setpriority`. They can be used to get or set process
  niceness/priority in a fashion similar to :func:`os.nice` but extended to all
  processes instead of just the current one.

  (Patch submitted by Giampaolo Rodolà in :issue:`10784`.)

* "at" functions (:issue:`4761`):

  * :func:`~os.faccessat`
  * :func:`~os.fchmodat`
  * :func:`~os.fchownat`
  * :func:`~os.fstatat`
  * :func:`~os.futimesat`
  * :func:`~os.linkat`
  * :func:`~os.mkdirat`
  * :func:`~os.mkfifoat`
  * :func:`~os.mknodat`
  * :func:`~os.openat`
  * :func:`~os.readlinkat`
  * :func:`~os.renameat`
  * :func:`~os.symlinkat`
  * :func:`~os.unlinkat`
  * :func:`~os.utimensat`

* Extended attributes (:issue:`12720`):

  * :func:`~os.fgetxattr`
  * :func:`~os.flistxattr`
  * :func:`~os.fremovexattr`
  * :func:`~os.fsetxattr`
  * :func:`~os.getxattr`
  * :func:`~os.lgetxattr`
  * :func:`~os.listxattr`
  * :func:`~os.llistxattr`
  * :func:`~os.lremovexattr`
  * :func:`~os.lsetxattr`
  * :func:`~os.removexattr`
  * :func:`~os.setxattr`

* Scheduler functions (:issue:`12655`):

  * :func:`~os.sched_get_priority_max`
  * :func:`~os.sched_get_priority_min`
  * :func:`~os.sched_getaffinity`
  * :func:`~os.sched_getparam`
  * :func:`~os.sched_getscheduler`
  * :func:`~os.sched_rr_get_interval`
  * :func:`~os.sched_setaffinity`
  * :func:`~os.sched_setparam`
  * :func:`~os.sched_setscheduler`
  * :func:`~os.sched_yield`

* Additional POSIX functions (:issue:`10812`):

  * :func:`~os.fexecve`
  * :func:`~os.futimens`
  * :func:`~os.futimes`
  * :func:`~os.lockf`
  * :func:`~os.lutimes`
  * :func:`~os.posix_fadvise`
  * :func:`~os.posix_fallocate`
  * :func:`~os.pread`
  * :func:`~os.pwrite`
  * :func:`~os.readv`
  * :func:`~os.sync`
  * :func:`~os.truncate`
  * :func:`~os.waitid`
  * :func:`~os.writev`

* Other new functions:

  * :func:`~os.fdlistdir` (:issue:`10755`)
  * :func:`~os.getgrouplist` (:issue:`9344`)
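
As an illustration of :func:`~os.pipe2` and :func:`~os.getpriority`, here is a
small sketch. Both functions are only available on some Unix platforms, hence
the ``hasattr`` guards; the payload is arbitrary::

    import os

    if hasattr(os, "pipe2"):
        # The close-on-exec flag is set atomically when the pipe is created.
        read_fd, write_fd = os.pipe2(os.O_CLOEXEC)
        try:
            os.write(write_fd, b"ping")
            assert os.read(read_fd, 4) == b"ping"
        finally:
            os.close(read_fd)
            os.close(write_fd)

    if hasattr(os, "getpriority"):
        # With PRIO_PROCESS, a *who* of 0 designates the calling process.
        print(os.getpriority(os.PRIO_PROCESS, 0))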


packaging
---------

:mod:`distutils` has undergone additions and refactoring under a new name,
:mod:`packaging`, to allow developers to break backward compatibility.
:mod:`distutils` is still provided in the standard library, but users are
encouraged to transition to :mod:`packaging`. For older versions of Python, a
backport compatible with 2.4+ and 3.1+ will be made available on PyPI under the
name :mod:`distutils2`.

.. TODO add examples and howto to the packaging docs and link to them


pydoc
-----

The Tk GUI and the :func:`~pydoc.serve` function have been removed from the
:mod:`pydoc` module. Both ``pydoc -g`` and :func:`~pydoc.serve` were deprecated
in Python 3.2.


sys
---

* The :mod:`sys` module has a new :data:`~sys.thread_info` :term:`struct
  sequence` holding information about the thread implementation.

  (:issue:`11223`)


signal
------

* The :mod:`signal` module has new functions:

  * :func:`~signal.pthread_sigmask`: fetch and/or change the signal mask of the
    calling thread (Contributed by Jean-Paul Calderone in :issue:`8407`);
  * :func:`~signal.pthread_kill`: send a signal to a thread;
  * :func:`~signal.sigpending`: examine pending signals;
  * :func:`~signal.sigwait`: wait for a signal;
  * :func:`~signal.sigwaitinfo`: wait for a signal, returning detailed
    information about it;
  * :func:`~signal.sigtimedwait`: like :func:`~signal.sigwaitinfo` but with a
    timeout.

* The signal handler writes the signal number as a single byte instead of
  a null byte into the wakeup file descriptor. So it is possible to wait for
  more than one signal and know which signals were raised.

* :func:`signal.signal` and :func:`signal.siginterrupt` raise an
  :exc:`OSError`, instead of a :exc:`RuntimeError`: :exc:`OSError` has an
  ``errno`` attribute.
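
The new functions can be combined to receive a signal synchronously instead
of through a handler. This is only a sketch: it assumes a Unix platform that
provides these functions, and the choice of ``SIGUSR1`` is arbitrary::

    import signal
    import threading

    if hasattr(signal, "pthread_sigmask"):
        sig = signal.SIGUSR1
        # Block the signal so that it stays pending instead of being delivered.
        old_mask = signal.pthread_sigmask(signal.SIG_BLOCK, {sig})
        try:
            signal.pthread_kill(threading.get_ident(), sig)
            assert sig in signal.sigpending()
            # sigwait() consumes the pending signal and returns its number.
            assert signal.sigwait({sig}) == sig
        finally:
            signal.pthread_sigmask(signal.SIG_SETMASK, old_mask)

Blocking the signal before sending it matters here: the default action for
``SIGUSR1`` would otherwise terminate the process.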
500
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000501socket
502------
503
Charles-François Natali47413c12011-10-06 19:47:44 +0200504* The :class:`~socket.socket` class now exposes additional methods to process
505 ancillary data when supported by the underlying platform:
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000506
Charles-François Natali47413c12011-10-06 19:47:44 +0200507 * :func:`~socket.socket.sendmsg`
508 * :func:`~socket.socket.recvmsg`
509 * :func:`~socket.socket.recvmsg_into`
Nick Coghlan96fe56a2011-08-22 11:55:57 +1000510
Charles-François Natali47413c12011-10-06 19:47:44 +0200511 (Contributed by David Watson in :issue:`6560`, based on an earlier patch by
512 Heiko Wundram)
513
514* The :class:`~socket.socket` class now supports the PF_CAN protocol family
515 (http://en.wikipedia.org/wiki/Socketcan), on Linux
516 (http://lwn.net/Articles/253425).
517
518 (Contributed by Matthias Fuchs, updated by Tiago Gonçalves in :issue:`10141`)
519
Charles-François Natali10b8cf42011-11-10 19:21:37 +0100520* The :class:`~socket.socket` class now supports the PF_RDS protocol family
521 (http://en.wikipedia.org/wiki/Reliable_Datagram_Sockets and
522 http://oss.oracle.com/projects/rds/).
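
A minimal sketch of ``sendmsg()`` and ``recvmsg()`` over a local socket pair,
assuming a Unix platform; no ancillary data is attached, so only the
scatter/gather behaviour is shown::

    import socket

    if hasattr(socket.socket, "sendmsg"):
        left, right = socket.socketpair(socket.AF_UNIX, socket.SOCK_DGRAM)
        try:
            # sendmsg() gathers several buffers into a single message.
            left.sendmsg([b"hello, ", b"world"])
            data, ancdata, msg_flags, address = right.recvmsg(1024)
            assert data == b"hello, world"
            assert ancdata == []   # no ancillary data was sent
        finally:
            left.close()
            right.close()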

ssl
---

* The :mod:`ssl` module has two new random generation functions:

  * :func:`~ssl.RAND_bytes`: generate cryptographically strong
    pseudo-random bytes.
  * :func:`~ssl.RAND_pseudo_bytes`: generate pseudo-random bytes.

  (Contributed by Victor Stinner in :issue:`12049`)

* The :mod:`ssl` module now exposes a finer-grained exception hierarchy
  in order to make it easier to inspect the various kinds of errors.

  (Contributed by Antoine Pitrou in :issue:`11183`)

* :meth:`~ssl.SSLContext.load_cert_chain` now accepts a *password* argument
  to be used if the private key is encrypted.

  (Contributed by Adam Simpkins in :issue:`12803`)

* SSL sockets have a new :meth:`~ssl.SSLSocket.get_channel_binding` method
  allowing the implementation of certain authentication mechanisms such as
  SCRAM-SHA-1-PLUS.

  (Contributed by Jacek Konieczny in :issue:`12551`)

shutil
------

* The :mod:`shutil` module has these new functions:

  * :func:`~shutil.disk_usage`: provides total, used and free disk space
    statistics. (Contributed by Giampaolo Rodolà in :issue:`12442`)
  * :func:`~shutil.chown`: allows one to change user and/or group of the given
    path also specifying the user/group names and not only their numeric
    ids. (Contributed by Sandro Tosi in :issue:`12191`)
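
For example, :func:`~shutil.disk_usage` can be queried for the current
directory (the path is an arbitrary choice for this sketch)::

    import shutil

    usage = shutil.disk_usage(".")

    # The result is a named tuple of byte counts.
    print("total=%d used=%d free=%d" % (usage.total, usage.used, usage.free))
    assert usage.total >= usage.free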

smtplib
-------

The :class:`~smtplib.SMTP_SSL` constructor and the :meth:`~smtplib.SMTP.starttls`
method now accept an SSLContext parameter to control parameters of the secure
channel.

(Contributed by Kasun Herath in :issue:`8809`)

urllib
------

The :class:`~urllib.request.Request` class now accepts a *method* argument
used by :meth:`~urllib.request.Request.get_method` to determine what HTTP method
should be used. For example, this will send a ``'HEAD'`` request::

    >>> urlopen(Request('http://www.python.org', method='HEAD'))

(:issue:`1673007`)
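
The effect of the argument can be checked without any network access, since
:meth:`~urllib.request.Request.get_method` only inspects the request object::

    from urllib.request import Request

    head = Request("http://www.python.org", method="HEAD")
    assert head.get_method() == "HEAD"

    # Without *method*, the verb still depends on whether data is attached.
    assert Request("http://www.python.org").get_method() == "GET"
    assert Request("http://www.python.org", data=b"x=1").get_method() == "POST"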

sched
-----

* The *timefunc* and *delayfunc* parameters of the :class:`~sched.scheduler`
  class constructor are now optional and default to :func:`time.time` and
  :func:`time.sleep` respectively. (Contributed by Chris Clark in
  :issue:`13245`)

* The *argument* parameter of :meth:`~sched.scheduler.enter` and
  :meth:`~sched.scheduler.enterabs` is now optional. (Contributed by Chris
  Clark in :issue:`13245`)

* :meth:`~sched.scheduler.enter` and :meth:`~sched.scheduler.enterabs`
  now accept a *kwargs* parameter. (Contributed by Chris Clark in
  :issue:`13245`)
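
The three changes can be combined as in the following sketch; the ``record``
callback and its parameters are hypothetical names used for illustration::

    import sched

    calls = []

    def record(label, suffix=""):
        calls.append(label + suffix)

    # No timefunc/delayfunc given: the scheduler falls back to its defaults.
    scheduler = sched.scheduler()
    scheduler.enter(0.02, 1, record, argument=("second",))
    scheduler.enter(0.01, 1, record, argument=("first",), kwargs={"suffix": "!"})
    scheduler.run()

    assert calls == ["first!", "second"]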

Optimizations
=============

Major performance enhancements have been added:

* Thanks to :pep:`393`, some operations on Unicode strings have been optimized:

  * the memory footprint is divided by 2 to 4 depending on the text
  * encoding an ASCII string to UTF-8 no longer needs to encode characters,
    because the UTF-8 representation is shared with the ASCII representation
  * getting a substring of a Latin-1 string is 4 times faster


Build and C API Changes
=======================

Changes to Python's build process and to the C API include:

* :pep:`393` added new Unicode types, macros and functions:

  * High-level API:

    * :c:func:`PyUnicode_CopyCharacters`
    * :c:func:`PyUnicode_FindChar`
    * :c:func:`PyUnicode_GetLength`, :c:macro:`PyUnicode_GET_LENGTH`
    * :c:func:`PyUnicode_New`
    * :c:func:`PyUnicode_Substring`
    * :c:func:`PyUnicode_ReadChar`, :c:func:`PyUnicode_WriteChar`

  * Low-level API:

    * :c:type:`Py_UCS1`, :c:type:`Py_UCS2`, :c:type:`Py_UCS4` types
    * :c:type:`PyASCIIObject` and :c:type:`PyCompactUnicodeObject` structures
    * :c:macro:`PyUnicode_READY`
    * :c:func:`PyUnicode_FromKindAndData`
    * :c:func:`PyUnicode_AsUCS4`, :c:func:`PyUnicode_AsUCS4Copy`
    * :c:macro:`PyUnicode_DATA`, :c:macro:`PyUnicode_1BYTE_DATA`,
      :c:macro:`PyUnicode_2BYTE_DATA`, :c:macro:`PyUnicode_4BYTE_DATA`
    * :c:macro:`PyUnicode_KIND` with :c:type:`PyUnicode_Kind` enum:
      :c:data:`PyUnicode_WCHAR_KIND`, :c:data:`PyUnicode_1BYTE_KIND`,
      :c:data:`PyUnicode_2BYTE_KIND`, :c:data:`PyUnicode_4BYTE_KIND`
    * :c:macro:`PyUnicode_READ`, :c:macro:`PyUnicode_READ_CHAR`, :c:macro:`PyUnicode_WRITE`
    * :c:macro:`PyUnicode_MAX_CHAR_VALUE`


Unsupported Operating Systems
=============================

OS/2 and VMS are no longer supported due to the lack of a maintainer.

Windows 2000 and Windows platforms which set ``COMSPEC`` to ``command.com``
are no longer supported due to maintenance burden.


Deprecated Python modules, functions and methods
================================================

* The :mod:`packaging` module replaces the :mod:`distutils` module
* The ``unicode_internal`` codec has been deprecated because of
  :pep:`393`; use UTF-8, UTF-16 (``utf-16-le`` or ``utf-16-be``), or UTF-32
  (``utf-32-le`` or ``utf-32-be``) instead
* :meth:`ftplib.FTP.nlst` and :meth:`ftplib.FTP.dir`: use
  :meth:`ftplib.FTP.mlsd`
* :func:`platform.popen`: use the :mod:`subprocess` module. Check especially
  the :ref:`subprocess-replacements` section.
* :issue:`13374`: The Windows bytes API has been deprecated in the :mod:`os`
  module. Use Unicode filenames, instead of bytes filenames, to not depend on
  the ANSI code page anymore and to support any filename.


Deprecated functions and types of the C API
===========================================

The :c:type:`Py_UNICODE` type has been deprecated by :pep:`393` and will be
removed in Python 4. All functions using this type are deprecated:

Unicode functions and methods using :c:type:`Py_UNICODE` and
:c:type:`Py_UNICODE*` types:

 * :c:func:`PyUnicode_FromUnicode`: use :c:func:`PyUnicode_FromWideChar` or
   :c:func:`PyUnicode_FromKindAndData`
 * :c:macro:`PyUnicode_AS_UNICODE`, :c:func:`PyUnicode_AsUnicode`,
   :c:func:`PyUnicode_AsUnicodeAndSize`: use :c:func:`PyUnicode_AsWideCharString`
 * :c:macro:`PyUnicode_AS_DATA`: use :c:macro:`PyUnicode_DATA` with
   :c:macro:`PyUnicode_READ` and :c:macro:`PyUnicode_WRITE`
 * :c:macro:`PyUnicode_GET_SIZE`, :c:func:`PyUnicode_GetSize`: use
   :c:macro:`PyUnicode_GET_LENGTH` or :c:func:`PyUnicode_GetLength`
 * :c:macro:`PyUnicode_GET_DATA_SIZE`: use
   ``PyUnicode_GET_LENGTH(str) * PyUnicode_KIND(str)`` (only works on ready
   strings)
 * :c:func:`PyUnicode_AsUnicodeCopy`: use :c:func:`PyUnicode_AsUCS4Copy`,
   :c:func:`PyUnicode_AsWideCharString` or :c:func:`PyUnicode_Copy`

Functions and macros manipulating Py_UNICODE* strings:

 * :c:macro:`Py_UNICODE_strlen`: use :c:func:`PyUnicode_GetLength` or
   :c:macro:`PyUnicode_GET_LENGTH`
 * :c:macro:`Py_UNICODE_strcat`: use :c:func:`PyUnicode_CopyCharacters` or
   :c:func:`PyUnicode_FromFormat`
 * :c:macro:`Py_UNICODE_strcpy`, :c:macro:`Py_UNICODE_strncpy`,
   :c:macro:`Py_UNICODE_COPY`: use :c:func:`PyUnicode_CopyCharacters` or
   :c:func:`PyUnicode_Substring`
 * :c:macro:`Py_UNICODE_strcmp`: use :c:func:`PyUnicode_Compare`
 * :c:macro:`Py_UNICODE_strncmp`: use :c:func:`PyUnicode_Tailmatch`
 * :c:macro:`Py_UNICODE_strchr`, :c:macro:`Py_UNICODE_strrchr`: use
   :c:func:`PyUnicode_FindChar`
 * :c:macro:`Py_UNICODE_FILL`

Encoders:

 * :c:func:`PyUnicode_Encode`: use :c:func:`PyUnicode_AsEncodedObject`
 * :c:func:`PyUnicode_EncodeUTF7`
 * :c:func:`PyUnicode_EncodeUTF8`: use :c:func:`PyUnicode_AsUTF8` or
   :c:func:`PyUnicode_AsUTF8String`
 * :c:func:`PyUnicode_EncodeUTF32`
 * :c:func:`PyUnicode_EncodeUTF16`
 * :c:func:`PyUnicode_EncodeUnicodeEscape`: use
   :c:func:`PyUnicode_AsUnicodeEscapeString`
 * :c:func:`PyUnicode_EncodeRawUnicodeEscape`: use
   :c:func:`PyUnicode_AsRawUnicodeEscapeString`
 * :c:func:`PyUnicode_EncodeLatin1`: use :c:func:`PyUnicode_AsLatin1String`
 * :c:func:`PyUnicode_EncodeASCII`: use :c:func:`PyUnicode_AsASCIIString`
 * :c:func:`PyUnicode_EncodeCharmap`
 * :c:func:`PyUnicode_TranslateCharmap`
 * :c:func:`PyUnicode_EncodeMBCS`: use :c:func:`PyUnicode_AsMBCSString` or
   :c:func:`PyUnicode_EncodeCodePage` (with ``CP_ACP`` code_page)
 * :c:func:`PyUnicode_EncodeDecimal`,
   :c:func:`PyUnicode_TransformDecimalToASCII`


Porting to Python 3.3
=====================

This section lists previously described changes and other bugfixes
that may require changes to your code.

Porting Python code
-------------------

* :issue:`12326`: On Linux, ``sys.platform`` doesn't contain the major version
  anymore. It is now always ``'linux'``, instead of ``'linux2'`` or
  ``'linux3'`` depending on the Linux version used to build Python. Replace
  ``sys.platform == 'linux2'`` with ``sys.platform.startswith('linux')``, or
  directly ``sys.platform == 'linux'`` if you don't need to support older
  Python versions.
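
A minimal sketch of a platform check that works both before and after this
change is shown below. The helper name ``is_linux`` and its optional
``platform`` argument are hypothetical; the argument exists only so that the
check can be exercised with values other than the running interpreter's.

.. code-block:: python

   import sys

   def is_linux(platform=None):
       """Return True for 'linux' as well as the older 'linux2' and 'linux3'.

       Hypothetical helper; *platform* defaults to ``sys.platform``.
       """
       if platform is None:
           platform = sys.platform
       # A prefix test matches the new value and the old versioned values.
       return platform.startswith('linux')

   if is_linux():
       pass  # Linux-specific code goes here

Code that only targets Python 3.3 or newer can compare against ``'linux'``
directly instead.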

Porting C code
--------------

* Due to :ref:`PEP 393 <pep-393>`, the :c:type:`Py_UNICODE` type and all
  functions using this type are deprecated (but will stay available for
  at least five years). If you were using low-level Unicode APIs to
  construct and access unicode objects and you want to benefit from the
  memory footprint reduction provided by PEP 393, you have to convert
  your code to the new :doc:`Unicode API <../c-api/unicode>`.

  However, if you have only been using high-level functions such as
  :c:func:`PyUnicode_Concat`, :c:func:`PyUnicode_Join` or
  :c:func:`PyUnicode_FromFormat`, your code will automatically take
  advantage of the new unicode representations.

Other issues
------------

.. Issue #11591: When :program:`python` is started with :option:`-S`,
   ``import site`` will no longer add site-specific paths to the module search
   paths. In previous versions, it did. See changeset for doc changes in
   various files. Contributed by Carl Meyer with edits by Éric Araujo.

.. Issue #10998: the -Q command-line flag and related artifacts have been
   removed. Code checking sys.flags.division_warning will need updating.
   Contributed by Éric Araujo.