Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1 | **************************** |
Raymond Hettinger | f558ddd | 2009-06-28 21:37:08 +0000 | [diff] [blame] | 2 | What's New In Python 3.2 |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 3 | **************************** |
| 4 | |
| 5 | :Author: Raymond Hettinger |
| 6 | :Release: |release| |
| 7 | :Date: |today| |
| 8 | |
| 9 | .. $Id$ |
| 10 | Rules for maintenance: |
| 11 | |
| 12 | * Anyone can add text to this document. Do not spend very much time |
| 13 | on the wording of your changes, because your text will probably |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 14 | get rewritten. |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 15 | |
| 16 | * The maintainer will go through Misc/NEWS periodically and add |
| 17 | changes; it's therefore more important to add your changes to |
| 18 | Misc/NEWS than to this file. |
| 19 | |
| 20 | * This is not a complete list of every single change; completeness |
| 21 | is the purpose of Misc/NEWS. Some changes I consider too small |
| 22 | or esoteric to include. If such a change is added to the text, |
| 23 | I'll just remove it. (This is another reason you shouldn't spend |
| 24 | too much time on writing your addition.) |
| 25 | |
| 26 | * If you want to draw your new text to the attention of the |
| 27 | maintainer, add 'XXX' to the beginning of the paragraph or |
| 28 | section. |
| 29 | |
| 30 | * It's OK to just add a fragmentary note about a change. For |
| 31 | example: "XXX Describe the transmogrify() function added to the |
| 32 | socket module." The maintainer will research the change and |
| 33 | write the necessary text. |
| 34 | |
| 35 | * You can comment out your additions if you like, but it's not |
| 36 | necessary (especially when a final release is some months away). |
| 37 | |
| 38 | * Credit the author of a patch or bugfix. Just the name is |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 39 | sufficient; the e-mail address isn't necessary. It's helpful to |
| 40 | add the issue number: |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 41 | |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 42 | XXX Describe the transmogrify() function added to the socket |
| 43 | module. |
| 44 | |
| 45 | (Contributed by P.Y. Developer; :issue:`12345`.) |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 46 | |
| 47 | This saves the maintainer the effort of going through the SVN log |
| 48 | when researching a change. |
| 49 | |
Raymond Hettinger | ffad35e | 2010-12-14 21:12:03 +0000 | [diff] [blame] | 50 | This article explains the new features in Python 3.2 as compared to 3.1. It |
| 51 | focuses on a few highlights and gives a few examples. For full details, see the |
| 52 | :source:`Misc/NEWS <Misc/NEWS>` file. |
Raymond Hettinger | 2c1ecc3 | 2010-12-07 09:55:02 +0000 | [diff] [blame] | 53 | |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 54 | |
Martin v. Löwis | 932e49e | 2010-12-04 13:49:32 +0000 | [diff] [blame] | 55 | PEP 384: Defining a Stable ABI |
Martin v. Löwis | 4d0d471 | 2010-12-03 20:14:31 +0000 | [diff] [blame] | 56 | ============================== |
| 57 | |
| 58 | In the past, extension modules built for one Python version were often |
| 59 | not usable with other Python versions. Particularly on Windows, every |
| 60 | feature release of Python required rebuilding all extension modules that |
| 61 | one wanted to use. This requirement was the result of the free access to |
| 62 | Python interpreter internals that extension modules could use. |
| 63 | |
| 64 | With Python 3.2, an alternative approach becomes available: extension |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 65 | modules which restrict themselves to a limited API (by defining |
Martin v. Löwis | 4d0d471 | 2010-12-03 20:14:31 +0000 | [diff] [blame] | 66 | Py_LIMITED_API) cannot use many of the internals, but are constrained |
| 67 | to a set of API functions that are promised to be stable for several |
| 68 | releases. As a consequence, extension modules built for 3.2 in that |
| 69 | mode will also work with 3.3, 3.4, and so on. Extension modules that |
| 70 | make use of details of memory structures can still be built, but will |
| 71 | need to be recompiled for every feature release. |
| 72 | |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 73 | .. seealso:: |
| 74 | |
Georg Brandl | 65b2eb9 | 2010-12-05 11:42:38 +0000 | [diff] [blame] | 75 | :pep:`384` - Defining a Stable ABI |
Raymond Hettinger | 2c1ecc3 | 2010-12-07 09:55:02 +0000 | [diff] [blame] | 76 | PEP written by Martin von Löwis. |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 77 | |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 78 | PEP 389: Argparse Command Line Parsing Module |
| 79 | ============================================= |
| 80 | |
| 81 | A new module for command line parsing, :mod:`argparse`, was introduced to |
| 82 | overcome the limitations of :mod:`optparse` which did not provide support for |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 83 | positional arguments (not just options), subcommands, required options and other |
Raymond Hettinger | 413abbc | 2010-12-05 07:06:47 +0000 | [diff] [blame] | 84 | common patterns of specifying and validating options. |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 85 | |
| 86 | This module has already has wide-spread success in the community as a |
Raymond Hettinger | b1ff402 | 2010-12-08 11:19:45 +0000 | [diff] [blame] | 87 | third-party module. Being more fully featured than its predecessor, the |
| 88 | :mod:`argparse` module is now the preferred module for command-line processing. |
| 89 | The older module is still being kept available because of the substantial amount |
| 90 | of legacy code that depends on it. |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 91 | |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 92 | Here's an annotated example parser showing features like limiting results to a |
| 93 | set of choices, specifying a *metavar* in the help screen, validating that one |
Raymond Hettinger | 68f1e8d | 2010-12-07 09:24:30 +0000 | [diff] [blame] | 94 | or more positional arguments is present, and making a required option:: |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 95 | |
| 96 | import argparse |
| 97 | parser = argparse.ArgumentParser( |
| 98 | description = 'Manage servers', # main description for help |
| 99 | epilog = 'Tested on Solaris and Linux') # displayed after help |
| 100 | parser.add_argument('action', # argument name |
| 101 | choices = ['deploy', 'start', 'stop'], # one of four allowed values |
| 102 | help = 'action on each target') # help msg |
| 103 | parser.add_argument('targets', |
| 104 | metavar = 'HOSTNAME', # var name used in help msg |
| 105 | nargs = '+', # require 1 or more targets |
| 106 | help = 'url for target machines') # help msg explanation |
| 107 | parser.add_argument('-u', '--user', # -u or --user option |
| 108 | required = True, # make this a required argument |
| 109 | help = 'login as user') |
| 110 | |
| 111 | Example of calling the parser on a command string:: |
| 112 | |
| 113 | >>> cmd = 'deploy sneezy.example.com sleepy.example.com -u skycaptain' |
| 114 | >>> result = parser.parse_args(cmd.split()) |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 115 | >>> result.action |
| 116 | 'deploy' |
| 117 | >>> result.targets |
| 118 | ['sneezy.example.com', 'sleepy.example.com'] |
| 119 | >>> result.user |
| 120 | 'skycaptain' |
| 121 | |
| 122 | Example of the parser's automatically generated help:: |
| 123 | |
| 124 | >>> parser.parse_args('-h'.split()) |
| 125 | |
Raymond Hettinger | 3fcf002 | 2010-12-08 01:13:53 +0000 | [diff] [blame] | 126 | usage: manage_cloud.py [-h] -u USER |
| 127 | {deploy,start,stop} HOSTNAME [HOSTNAME ...] |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 128 | |
| 129 | Manage servers |
| 130 | |
| 131 | positional arguments: |
| 132 | {deploy,start,stop} action on each target |
| 133 | HOSTNAME url for target machines |
| 134 | |
| 135 | optional arguments: |
| 136 | -h, --help show this help message and exit |
| 137 | -u USER, --user USER login as user |
| 138 | |
| 139 | Tested on Solaris and Linux |
| 140 | |
Raymond Hettinger | b1ff402 | 2010-12-08 11:19:45 +0000 | [diff] [blame] | 141 | An especially nice :mod:`argparse` feature is the ability to define subparsers, |
| 142 | each with their own argument patterns and help displays:: |
| 143 | |
| 144 | import argparse |
| 145 | parser = argparse.ArgumentParser(prog='HELM') |
| 146 | subparsers = parser.add_subparsers() |
| 147 | |
| 148 | parser_l = subparsers.add_parser('launch', help='Launch Control') # first subgroup |
Raymond Hettinger | bb9686f | 2010-12-16 00:53:05 +0000 | [diff] [blame] | 149 | parser_l.add_argument('-m', '--missiles', action='store_true') |
Raymond Hettinger | b1ff402 | 2010-12-08 11:19:45 +0000 | [diff] [blame] | 150 | parser_l.add_argument('-t', '--torpedos', action='store_true') |
| 151 | |
| 152 | parser_m = subparsers.add_parser('move', help='Move Vessel') # second subgroup |
| 153 | parser_m.add_argument('-c', '--course', type=int, required=True) |
| 154 | parser_m.add_argument('-s', '--speed', type=int, default=0) |
| 155 | |
| 156 | $ ./helm.py --help # top level help (launch and move) |
| 157 | $ ./helm.py launch --help # help for launch options |
| 158 | $ ./helm.py launch --missiles # set missiles=True and torpedos=False |
| 159 | $ ./helm.py move --course 180 --speed 5 # set movement parameters |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 160 | |
| 161 | .. seealso:: |
| 162 | |
| 163 | :pep:`389` - New Command Line Parsing Module |
| 164 | PEP written by Steven Bethard. |
| 165 | |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 166 | :ref:`upgrading-optparse-code` for details on the differences from |
| 167 | :mod:`optparse`. |
| 168 | |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 169 | |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 170 | PEP 391: Dictionary Based Configuration for Logging |
| 171 | ==================================================== |
Raymond Hettinger | ef2335c | 2010-09-05 08:35:38 +0000 | [diff] [blame] | 172 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 173 | The :mod:`logging` module provided two kinds of configuration, one style with |
| 174 | function calls for each option or another style driven by an external file saved |
| 175 | in a :mod:`ConfigParser` format. Those options did not provide the flexibility |
Georg Brandl | 9e75cad | 2010-09-06 06:45:47 +0000 | [diff] [blame] | 176 | to create configurations from JSON or YAML files, nor did they support |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 177 | incremental configuration, which is needed for specifying logger options from a |
| 178 | command line. |
Raymond Hettinger | ef2335c | 2010-09-05 08:35:38 +0000 | [diff] [blame] | 179 | |
| 180 | To support a more flexible style, the module now offers |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 181 | :func:`logging.config.dictConfig` for specifying logging configuration with |
| 182 | plain Python dictionaries. The configuration options include formatters, |
| 183 | handlers, filters, and loggers. Here's a working example of a configuration |
| 184 | dictionary:: |
Raymond Hettinger | ef2335c | 2010-09-05 08:35:38 +0000 | [diff] [blame] | 185 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 186 | {"version": 1, |
| 187 | "formatters": {"brief": {"format": "%(levelname)-8s: %(name)-15s: %(message)s"}, |
| 188 | "full": {"format": "%(asctime)s %(name)-15s %(levelname)-8s %(message)s"}, |
| 189 | }, |
| 190 | "handlers": {"console": { |
| 191 | "class": "logging.StreamHandler", |
| 192 | "formatter": "brief", |
| 193 | "level": "INFO", |
| 194 | "stream": "ext://sys.stdout"}, |
| 195 | "console_priority": { |
| 196 | "class": "logging.StreamHandler", |
| 197 | "formatter": "full", |
| 198 | "level": "ERROR", |
| 199 | "stream": "ext://sys.stderr"}, |
| 200 | }, |
| 201 | "root": {"level": "DEBUG", "handlers": ["console", "console_priority"]}} |
Raymond Hettinger | ef2335c | 2010-09-05 08:35:38 +0000 | [diff] [blame] | 202 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 203 | |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 204 | If that dictionary is stored in a file called :file:`conf.json`, it can loaded |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 205 | and called with code like this:: |
| 206 | |
| 207 | >>> import logging.config |
| 208 | >>> logging.config.dictConfig(json.load(open('conf.json', 'rb'))) |
| 209 | >>> logging.info("Transaction completed normally") |
| 210 | >>> logging.critical("Abnormal termination") |
| 211 | |
Raymond Hettinger | ef2335c | 2010-09-05 08:35:38 +0000 | [diff] [blame] | 212 | .. seealso:: |
| 213 | |
| 214 | :pep:`391` - Dictionary Based Configuration for Logging |
| 215 | PEP written by Vinay Sajip. |
| 216 | |
Georg Brandl | 97b20da | 2010-11-16 15:15:29 +0000 | [diff] [blame] | 217 | PEP 3148: The ``concurrent.futures`` module |
| 218 | ============================================ |
| 219 | |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 220 | Code for creating and managing concurrency is being collected in a new toplevel |
| 221 | namespace, *concurrent*. Its first member is a *futures* package which provides |
| 222 | a uniform high level interface for managing threads and processes. |
| 223 | |
| 224 | The design for :mod:`concurrent.futures` was inspired by |
| 225 | *java.util.concurrent.package*. In that model, a running call and its result |
| 226 | are represented by a :class:`~concurrent.futures.Future` object which abstracts |
| 227 | features common to threads, processes, and remote procedure calls. That object |
| 228 | supports status checks (running or done), timeouts, cancellations, adding |
Raymond Hettinger | 24a0941 | 2010-12-08 06:50:02 +0000 | [diff] [blame] | 229 | callbacks, and access to results or exceptions. |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 230 | |
| 231 | The primary offering of the new module is a pair of executor classes for |
| 232 | launching and managing calls. The goal of the executors is to make it easier to |
| 233 | use existing tools for making parallel calls. They save the effort needed to |
| 234 | setup a pool of resources, launch the calls, create a results queue, add |
| 235 | time-out handling, and limit the total number of threads, processes, or remote |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 236 | procedure calls. |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 237 | |
| 238 | Ideally, each application should share a single executor across multiple |
| 239 | components so that process and thread limits can be centrally managed. This |
| 240 | solves the design challenge that arises when each component has its own |
| 241 | competing strategy for resource management. |
| 242 | |
Raymond Hettinger | b105519 | 2010-12-08 06:42:41 +0000 | [diff] [blame] | 243 | Both classes share a common interface with three methods: |
| 244 | :meth:`~concurrent.futures.Executor.submit` for scheduling a callable and |
| 245 | returning a :class:`~concurrent.futures.Future` object; |
| 246 | :meth:`~concurrent.futures.Executor.map` for scheduling many asynchronous calls |
Raymond Hettinger | 83d8079 | 2010-12-08 06:48:33 +0000 | [diff] [blame] | 247 | at a time, and :meth:`~concurrent.futures.Executor.shutdown` for freeing |
| 248 | resources. The class is a :term:`context manager` and can be used within a |
| 249 | :keyword:`with` statement to assure that resources are automatically released |
| 250 | when currently pending futures are done executing. |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 251 | |
Raymond Hettinger | b105519 | 2010-12-08 06:42:41 +0000 | [diff] [blame] | 252 | A simple of example of :class:`~concurrent.futures.ThreadPoolExecutor` is a |
Raymond Hettinger | 83d8079 | 2010-12-08 06:48:33 +0000 | [diff] [blame] | 253 | launch of four parallel threads for copying files:: |
Raymond Hettinger | b105519 | 2010-12-08 06:42:41 +0000 | [diff] [blame] | 254 | |
| 255 | import shutil |
| 256 | with ThreadPoolExecutor(max_workers=4) as e: |
| 257 | e.submit(shutil.copy, 'src1.txt', 'dest1.txt') |
| 258 | e.submit(shutil.copy, 'src2.txt', 'dest2.txt') |
| 259 | e.submit(shutil.copy, 'src3.txt', 'dest3.txt') |
| 260 | e.submit(shutil.copy, 'src3.txt', 'dest4.txt') |
| 261 | |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 262 | .. seealso:: |
| 263 | |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 264 | :pep:`3148` - Futures -- Execute Computations Asynchronously |
Andrew M. Kuchling | 42877fe | 2010-12-15 02:37:01 +0000 | [diff] [blame] | 265 | PEP written by Brian Quinlan. |
Georg Brandl | 97b20da | 2010-11-16 15:15:29 +0000 | [diff] [blame] | 266 | |
Raymond Hettinger | 83d8079 | 2010-12-08 06:48:33 +0000 | [diff] [blame] | 267 | :ref:`Code for Threaded Parallel URL reads<threadpoolexecutor-example>`, an |
| 268 | example using threads to fetch multiple web pages in parallel. |
| 269 | |
| 270 | :ref:`Code for computing prime numbers in |
| 271 | parallel<processpoolexecutor-example>`, an example demonstrating |
| 272 | :class:`~concurrent.futures.ProcessPoolExecutor`. |
| 273 | |
| 274 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 275 | |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 276 | PEP 3147: PYC Repository Directories |
| 277 | ===================================== |
| 278 | |
David Malcolm | 778645a | 2010-12-07 00:32:04 +0000 | [diff] [blame] | 279 | Python's scheme for caching bytecode in *.pyc* files did not work well in |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 280 | environments with multiple python interpreters. If one interpreter encountered |
| 281 | a cached file created by another interpreter, it would recompile the source and |
| 282 | overwrite the cached file, thus losing the benefits of caching. |
| 283 | |
| 284 | The issue of "pyc fights" has become more pronounced as it has become |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 285 | commonplace for Linux distributions to ship with multiple versions of Python. |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 286 | These conflicts also arise with CPython alternatives such as Unladen Swallow. |
| 287 | |
| 288 | To solve this problem, Python's import machinery has been extended to use |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 289 | distinct filenames for each interpreter. Instead of Python 3.2 and Python 3.3 and |
| 290 | Unladen Swallow each competing for a file called "mymodule.pyc", they will now |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 291 | look for "mymodule.cpython-32.pyc", "mymodule.cpython-33.pyc", and |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 292 | "mymodule.unladen10.pyc". And to prevent all of these new files from |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 293 | cluttering source directories, the *pyc* files are now collected in a |
| 294 | "__pycache__" directory stored under the package directory. |
| 295 | |
| 296 | Aside from the filenames and target directories, the new scheme has a few |
| 297 | aspects that are visible to the programmer: |
| 298 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 299 | * Imported modules now have a :attr:`__cached__` attribute which stores the name |
| 300 | of the actual file that was imported: |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 301 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 302 | >>> import collections |
| 303 | >>> collections.__cached__ |
| 304 | 'c:/py32/lib/__pycache__/collections.cpython-32.pyc' |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 305 | |
| 306 | * The tag that is unique to each interpreter is accessible from the :mod:`imp` |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 307 | module: |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 308 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 309 | >>> import imp |
| 310 | >>> imp.get_tag() |
| 311 | 'cpython-32' |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 312 | |
| 313 | * Scripts that try to deduce source filename from the imported file now need to |
| 314 | be smarter. It is no longer sufficient to simply strip the "c" from a ".pyc" |
| 315 | filename. Instead, use the new functions in the :mod:`imp` module: |
| 316 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 317 | >>> imp.source_from_cache('c:/py32/lib/__pycache__/collections.cpython-32.pyc') |
| 318 | 'c:/py32/lib/collections.py' |
| 319 | >>> imp.cache_from_source('c:/py32/lib/collections.py') |
| 320 | 'c:/py32/lib/__pycache__/collections.cpython-32.pyc' |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 321 | |
| 322 | * The :mod:`py_compile` and :mod:`compileall` modules have been updated to |
| 323 | reflect the new naming convention and target directory. |
| 324 | |
| 325 | .. seealso:: |
| 326 | |
| 327 | :pep:`3147` - PYC Repository Directories |
| 328 | PEP written by Barry Warsaw. |
| 329 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 330 | |
Georg Brandl | 3ad4675 | 2010-12-05 07:59:29 +0000 | [diff] [blame] | 331 | PEP 3149: ABI Version Tagged .so Files |
| 332 | ====================================== |
Georg Brandl | f11c6c4 | 2010-09-03 22:20:58 +0000 | [diff] [blame] | 333 | |
Raymond Hettinger | ebea6fa | 2010-09-05 00:27:25 +0000 | [diff] [blame] | 334 | The PYC repository directory allows multiple bytecode cache files to be |
| 335 | co-located. This PEP implements a similar mechanism for shared object files by |
| 336 | giving them a common directory and distinct names for each version. |
Georg Brandl | f11c6c4 | 2010-09-03 22:20:58 +0000 | [diff] [blame] | 337 | |
Raymond Hettinger | ebea6fa | 2010-09-05 00:27:25 +0000 | [diff] [blame] | 338 | The common directory is "pyshared" and the file names are made distinct by |
| 339 | identifying the Python implementation (such as CPython, PyPy, Jython, etc.), the |
| 340 | major and minor version numbers, and optional build flags (such as "d" for |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 341 | debug, "m" for pymalloc, "u" for wide-unicode). For an arbitrary package "foo", |
Raymond Hettinger | ebea6fa | 2010-09-05 00:27:25 +0000 | [diff] [blame] | 342 | you may see these files when the distribution package is installed:: |
| 343 | |
| 344 | /usr/share/pyshared/foo.cpython-32m.so |
| 345 | /usr/share/pyshared/foo.cpython-33md.so |
| 346 | |
| 347 | In Python itself, the tags are accessible from functions in the :mod:`sysconfig` |
| 348 | module:: |
| 349 | |
| 350 | >>> import sysconfig |
| 351 | >>> sysconfig.get_config_var('SOABI') # find the version tag |
| 352 | 'cpython-32mu' |
| 353 | >>> sysconfig.get_config_var('SO') # find the full filename extension |
| 354 | 'cpython-32mu.so' |
| 355 | |
| 356 | .. seealso:: |
| 357 | |
| 358 | :pep:`3149` - ABI Version Tagged .so Files |
| 359 | PEP written by Barry Warsaw. |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 360 | |
| 361 | |
| 362 | Other Language Changes |
| 363 | ====================== |
| 364 | |
| 365 | Some smaller changes made to the core Python language are: |
| 366 | |
Raymond Hettinger | e5e1a98 | 2010-12-05 08:35:21 +0000 | [diff] [blame] | 367 | * String formatting for :func:`format` and :meth:`str.format` gained new |
| 368 | capabilities for the format character **#**. Previously, for integers in |
| 369 | binary, octal, or hexadecimal, it caused the output to be prefixed with '0b', |
| 370 | '0o', or '0x' respectively. Now it can also handle floats, complex, and |
| 371 | Decimal, causing the output to always have a decimal point even when no digits |
| 372 | follow it. |
Raymond Hettinger | e5e728b | 2010-12-05 06:35:16 +0000 | [diff] [blame] | 373 | |
| 374 | >>> format(20, '#o') |
| 375 | '0o24' |
| 376 | >>> format(12.34, '#5.0f') |
| 377 | ' 12.' |
| 378 | |
| 379 | (Suggested by Mark Dickinson and implemented by Eric Smith in :issue:`7094`.) |
Raymond Hettinger | 43b5a85 | 2010-12-05 04:04:21 +0000 | [diff] [blame] | 380 | |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 381 | * The interpreter can now be started with a quiet option, ``-q``, to suppress |
| 382 | the copyright and version information in an interactive mode. |
| 383 | |
| 384 | (Contributed by Marcin Wojdyr in issue:`1772833`). |
| 385 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 386 | * The :func:`hasattr` function used to catch and suppress any Exception. Now, |
| 387 | it only catches :exc:`AttributeError`. Under the hood, :func:`hasattr` works |
| 388 | by calling :func:`getattr` and throwing away the results. This is necessary |
| 389 | because dynamic attribute creation is possible using :meth:`__getattribute__` |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 390 | or :meth:`__getattr__`. If :func:`hasattr` were to just scan instance and class |
Éric Araujo | cc6aac6 | 2010-09-07 21:35:35 +0000 | [diff] [blame] | 391 | dictionaries it would miss the dynamic methods and make it difficult to |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 392 | implement proxy objects. |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 393 | |
Raymond Hettinger | a55ffbc | 2010-12-15 18:31:57 +0000 | [diff] [blame] | 394 | (Discovered by Yury Selivanov and fixed by Benjamin Peterson; :issue:`9666`.) |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 395 | |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 396 | * The :func:`str` of a float or complex number is now the same as its |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 397 | :func:`repr`. Previously, the :func:`str` form was shorter but that just |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 398 | caused confusion and is no longer needed now that the shortest possible |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 399 | :func:`repr` is displayed by default: |
Raymond Hettinger | bb734c6 | 2010-09-05 05:56:44 +0000 | [diff] [blame] | 400 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 401 | >>> repr(math.pi) |
| 402 | '3.141592653589793' |
| 403 | >>> str(math.pi) |
| 404 | '3.141592653589793' |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 405 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 406 | (Proposed and implemented by Mark Dickinson; :issue:`9337`.) |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 407 | |
Raymond Hettinger | 21ec4bc | 2010-12-10 01:09:01 +0000 | [diff] [blame] | 408 | * :class:`memoryview` objects now have a :meth:`~memoryview.release()` method |
| 409 | and they also now support the context manager protocol. This allows timely |
| 410 | release of any resources that were acquired when requesting a buffer from the |
| 411 | original object. |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 412 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 413 | >>> with memoryview(b'abcdefgh') as v: |
| 414 | ... print(v.tolist()) |
| 415 | ... |
| 416 | [97, 98, 99, 100, 101, 102, 103, 104] |
| 417 | |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 418 | (Added by Antoine Pitrou; :issue:`9757`.) |
| 419 | |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 420 | |
Amaury Forgeot d'Arc | ba117ef | 2010-09-10 21:39:53 +0000 | [diff] [blame] | 421 | * Previously it was illegal to delete a name from the local namespace if it |
| 422 | occurs as a free variable in a nested block:: |
| 423 | |
| 424 | >>> def outer(x): |
| 425 | ... def inner(): |
| 426 | ... return x |
| 427 | ... inner() |
| 428 | ... del x |
| 429 | |
| 430 | This is now allowed. Remember that the target of an :keyword:`except` clause |
| 431 | is cleared, so this code which used to work with Python 2.6, raised a |
| 432 | :exc:`SyntaxError` with Python 3.1 and now works again:: |
| 433 | |
| 434 | >>> def f(): |
| 435 | ... def print_error(): |
| 436 | ... print(e) |
| 437 | ... try: |
| 438 | ... something |
| 439 | ... except Exception as e: |
| 440 | ... print_error() |
| 441 | ... # implicit "del e" here |
| 442 | |
| 443 | (See :issue:`4617`.) |
| 444 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 445 | * The internal :c:type:`structsequence` tool now creates subclasses of tuple. |
| 446 | This means that C generated structures like those returned by :func:`os.stat`, |
| 447 | :func:`time.gmtime`, and :func:`sys.version_info` now work like a |
| 448 | :term:`named tuple` and are more interoperable with functions and methods that |
| 449 | expect a tuple as an argument. The is a big step forward in making the C |
| 450 | structures as flexible as their pure Python counterparts. |
| 451 | |
| 452 | (Suggested by Arfrever Frehtes Taifersar Arahesis and implemented |
| 453 | by Benjamin Peterson in :issue:`8413`.) |
| 454 | |
| 455 | * Warnings are now easier control. An :envvar:`PYTHONWARNINGS` environment |
| 456 | variable is now available as an alternative to using ``-W`` at the command |
| 457 | line. |
| 458 | |
| 459 | (Suggested by Barry Warsaw and implemented by Philip Jenvey in :issue:`7301`.) |
| 460 | |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 461 | * A new warning category, :exc:`ResourceWarning`, has been added. It is |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 462 | emitted when potential issues with resource consumption or cleanup |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 463 | are detected. It is silenced by default in normal release builds, but |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 464 | can be enabled through the means provided by the :mod:`warnings` |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 465 | module, or on the command line. |
| 466 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 467 | A :exc:`ResourceWarning` is issued at interpreter shutdown if the |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 468 | :data:`gc.garbage` list isn't empty. This is meant to make the programmer |
| 469 | aware that their code contains object finalization issues. |
| 470 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 471 | A :exc:`ResourceWarning` is also issued when a :term:`file object` is destroyed |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 472 | without having been explicitly closed. While the deallocator for such |
| 473 | object ensures it closes the underlying operating system resource |
| 474 | (usually, a file descriptor), the delay in deallocating the object could |
| 475 | produce various issues, especially under Windows. Here is an example |
| 476 | of enabling the warning from the command line:: |
| 477 | |
Raymond Hettinger | 673ccf2 | 2010-12-07 09:37:11 +0000 | [diff] [blame] | 478 | $ ./python -q -Wdefault |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 479 | >>> f = open("foo", "wb") |
| 480 | >>> del f |
| 481 | __main__:1: ResourceWarning: unclosed file <_io.BufferedWriter name='foo'> |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 482 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 483 | (Added by Antoine Pitrou and Georg Brandl in :issue:`10093` and :issue:`477863`.) |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 484 | |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 485 | * :class:`range` objects now support *index* and *count* methods. This is part |
| 486 | of an effort to make more objects fully implement the |
| 487 | :class:`collections.Sequence` :term:`abstract base class`. As a result, the |
| 488 | language will have a more uniform API. In addition, :class:`range` objects |
| 489 | now support slicing and negative indices. This makes *range* more |
Raymond Hettinger | 2ffa671 | 2010-12-08 10:18:21 +0000 | [diff] [blame] | 490 | interoperable with lists:: |
| 491 | |
| 492 | >>> range(0, 100, 2).count(10) |
| 493 | 1 |
| 494 | >>> range(0, 100, 2).index(10) |
| 495 | 5 |
| 496 | >>> range(0, 100, 2)[5] |
| 497 | 10 |
| 498 | >>> range(0, 100, 2)[0:5] |
| 499 | range(0, 10, 2) |
Raymond Hettinger | dadf93c | 2010-12-05 02:56:21 +0000 | [diff] [blame] | 500 | |
| 501 | (Contributed by Daniel Stuzback in :issue:`9213` and by Alexander Belopolsky |
| 502 | in :issue:`2690`.) |
Nick Coghlan | 37ee850 | 2010-12-03 14:26:13 +0000 | [diff] [blame] | 503 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 504 | * The :func:`callable` builtin function from Py2.x was resurrected. It provides |
Raymond Hettinger | b87ba26 | 2010-12-06 04:31:40 +0000 | [diff] [blame] | 505 | a concise, readable alternative to using an :term:`abstract base class` in an |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 506 | expression like ``isinstance(x, collections.Callable)``: |
| 507 | |
| 508 | >>> callable(max) |
| 509 | True |
| 510 | >>> callable(20) |
| 511 | False |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 512 | |
| 513 | (See :issue:`10518`.) |
Amaury Forgeot d'Arc | ba117ef | 2010-09-10 21:39:53 +0000 | [diff] [blame] | 514 | |
Raymond Hettinger | 070ec70 | 2010-12-10 17:45:13 +0000 | [diff] [blame] | 515 | * Python's import mechanism can now load module installed in directories with |
| 516 | non-ASCII characters in the path name. |
| 517 | |
| 518 | (Required extensive work by Victor Stinner in :issue:`9425`.) |
| 519 | |
| 520 | |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 521 | New, Improved, and Deprecated Modules |
| 522 | ===================================== |
| 523 | |
Raymond Hettinger | 99db3fd | 2010-12-15 19:33:49 +0000 | [diff] [blame] | 524 | Python's standard library has undergone significant maintenance efforts and |
| 525 | quality improvements. |
Raymond Hettinger | e434b3b | 2010-12-15 19:20:01 +0000 | [diff] [blame] | 526 | |
| 527 | The biggest news for Python 3.2 is that the :mod:`email` package and |
Raymond Hettinger | 99db3fd | 2010-12-15 19:33:49 +0000 | [diff] [blame] | 528 | :mod:`nntplib` modules now work correctly with the bytes/text model in Python 3. |
Raymond Hettinger | e434b3b | 2010-12-15 19:20:01 +0000 | [diff] [blame] | 529 | For the first time, there is correct handling of inputs with mixed encodings. |
| 530 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 531 | Throughout the standard library, there has been more careful attention to |
| 532 | encodings and text versus bytes issues. In particular, interactions with the |
| 533 | operating system are now better able to pass non-ASCII data using the Windows |
| 534 | mcbs encoding, locale aware encodings, or UTF-8. |
| 535 | |
Raymond Hettinger | e434b3b | 2010-12-15 19:20:01 +0000 | [diff] [blame] | 536 | Another significant win is the addition of substantially better support for |
| 537 | *SSL* connections and security certificates. |
| 538 | |
Raymond Hettinger | 99db3fd | 2010-12-15 19:33:49 +0000 | [diff] [blame] | 539 | In addition, more functions and classes now have a :term:`context manager` to |
| 540 | support convenient and reliable resource clean-up using the |
Raymond Hettinger | e434b3b | 2010-12-15 19:20:01 +0000 | [diff] [blame] | 541 | :keyword:`with`-statement. |
| 542 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 543 | email |
| 544 | ----- |
| 545 | |
| 546 | The usability of the :mod:`email` package in Python 3 has been mostly fixed by |
| 547 | the extensive efforts of R. David Murray. The problem was that emails are |
| 548 | typically read and stored in the form of :class:`bytes` rather than :class:`str` |
| 549 | text, and they may contain multiple encodings within a single email. So, the |
| 550 | email package had to be extended to parse and generate email messages in bytes |
| 551 | format. |
| 552 | |
| 553 | * New functions :func:`~email.message_from_bytes` and |
| 554 | :func:`~email.message_from_binary_file`, and new classes |
| 555 | :class:`~email.parser.BytesFeedParser` and :class:`~email.parser.BytesParser` |
| 556 | allow binary message data to be parsed into model objects. |
| 557 | |
| 558 | * Given bytes input to the model, :meth:`~email.message.Message.get_payload` |
| 559 | will by default decode a message body that has a |
| 560 | :mailheader:`Content-Transfer-Encoding` of *8bit* using the charset |
| 561 | specified in the MIME headers and return the resulting string. |
| 562 | |
| 563 | * Given bytes input to the model, :class:`~email.generator.Generator` will |
| 564 | convert message bodies that have a :mailheader:`Content-Transfer-Encoding` of |
| 565 | *8bit* to instead have a *7bit* :mailheader:`Content-Transfer-Encoding`. |
| 566 | |
| 567 | * A new class :class:`~email.generator.BytesGenerator` produces bytes as output, |
| 568 | preserving any unchanged non-ASCII data that was present in the input used to |
| 569 | build the model, including message bodies with a |
| 570 | :mailheader:`Content-Transfer-Encoding` of *8bit*. |
| 571 | |
| 572 | * The :mod:`smtplib` :class:`~smtplib.SMTP` class now accepts a byte string |
| 573 | for the *msg* argument to the :meth:`~smtplib.SMTP.sendmail` method, |
| 574 | and a new method, :meth:`~smtplib.SMTP.send_message` accepts a |
| 575 | :class:`~email.message.Message` object and can optionally obtain the |
| 576 | *from_addr* and *to_addrs* addresses directly from the object. |
| 577 | |
| 578 | .. XXX Update before 3.2rc1 to reflect all of the latest work and add examples. |
| 579 | |
| 580 | (Proposed and implemented by R. David Murray, :issue:`4661` and :issue:`10321`.) |
| 581 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 582 | elementtree |
| 583 | ----------- |
| 584 | |
| 585 | The :mod:`xml.etree.ElementTree` package and it's :mod:`xml.etree.cElementTree` |
| 586 | counterpart have been updated to version 1.3. |
| 587 | |
| 588 | Several new and useful functions and methods have been added: |
| 589 | |
| 590 | * :func:`xml.etree.ElementTree.fromstringlist` which builds an XML document |
| 591 | from a sequence of fragments |
| 592 | * :func:`xml.etree.ElementTree.register_namespace` for registering a global |
| 593 | namespace prefix |
| 594 | * :func:`xml.etree.ElementTree.tostringlist` for string representation |
| 595 | including all sublists |
| 596 | * :meth:`xml.etree.ElementTree.Element.extend` for appending a sequence of zero |
| 597 | or more elements |
| 598 | * :meth:`xml.etree.ElementTree.Element.iterfind` searches an element and |
| 599 | subelements |
| 600 | * :meth:`xml.etree.ElementTree.Element.itertext` creates a text iterator over |
| 601 | an element and its sub-elements |
| 602 | * :meth:`xml.etree.ElementTree.TreeBuilder.end` closes the current element |
| 603 | * :meth:`xml.etree.ElementTree.TreeBuilder.doctype` handles a doctype |
| 604 | declaration |
| 605 | |
| 606 | Two methods have been deprecated: |
| 607 | |
| 608 | * :meth:`xml.etree.ElementTree.getchildren` use ``list(elem)`` instead. |
| 609 | * :meth:`xml.etree.ElementTree.getiterator` use ``Element.iter`` instead. |
| 610 | |
| 611 | For details of the update, see `Introducing ElementTree |
| 612 | <http://effbot.org/zone/elementtree-13-intro.htm>`_ on Fredrik Lundh's website. |
| 613 | |
Antoine Pitrou | 12de8ac | 2010-12-16 13:33:56 +0000 | [diff] [blame] | 614 | (Contributed by Florent Xicluna and Fredrik Lundh, :issue:`6472`.) |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 615 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 616 | functools |
| 617 | --------- |
| 618 | |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 619 | * The :mod:`functools` module includes a new decorator for caching function |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 620 | calls. :func:`functools.lru_cache` can save repeated queries to an external |
| 621 | resource whenever the results are expected to be the same. |
Raymond Hettinger | aed05eb | 2010-08-02 01:43:41 +0000 | [diff] [blame] | 622 | |
Raymond Hettinger | 86f9613 | 2010-08-06 23:23:49 +0000 | [diff] [blame] | 623 | For example, adding a caching decorator to a database query function can save |
| 624 | database accesses for popular searches:: |
Raymond Hettinger | aed05eb | 2010-08-02 01:43:41 +0000 | [diff] [blame] | 625 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 626 | @functools.lru_cache(maxsize=300) |
| 627 | def get_phone_number(name): |
| 628 | c = conn.cursor() |
| 629 | c.execute('SELECT phonenumber FROM phonelist WHERE name=?', (name,)) |
| 630 | return c.fetchone()[0] |
Raymond Hettinger | aed05eb | 2010-08-02 01:43:41 +0000 | [diff] [blame] | 631 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 632 | >>> for name in user_requests: |
Raymond Hettinger | 7496b41 | 2010-11-30 19:15:45 +0000 | [diff] [blame] | 633 | ... get_phone_number(name) # cached lookup |
| 634 | |
| 635 | To help with choosing an effective cache size, the wrapped function is |
| 636 | instrumented for tracking cache statistics: |
| 637 | |
Raymond Hettinger | 5e20bab | 2010-11-30 07:13:04 +0000 | [diff] [blame] | 638 | >>> get_phone_number.cache_info() |
Raymond Hettinger | 7496b41 | 2010-11-30 19:15:45 +0000 | [diff] [blame] | 639 | CacheInfo(hits=4805, misses=980, maxsize=300, currsize=300) |
Raymond Hettinger | aed05eb | 2010-08-02 01:43:41 +0000 | [diff] [blame] | 640 | |
Raymond Hettinger | f309828 | 2010-08-15 03:30:45 +0000 | [diff] [blame] | 641 | If the phonelist table gets updated, the outdated contents of the cache can be |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 642 | cleared with: |
Raymond Hettinger | f309828 | 2010-08-15 03:30:45 +0000 | [diff] [blame] | 643 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 644 | >>> get_phone_number.cache_clear() |
Raymond Hettinger | f309828 | 2010-08-15 03:30:45 +0000 | [diff] [blame] | 645 | |
Raymond Hettinger | 6e35394 | 2010-12-04 23:42:12 +0000 | [diff] [blame] | 646 | (Contributed by Raymond Hettinger and incorporating design ideas from |
Raymond Hettinger | b87ba26 | 2010-12-06 04:31:40 +0000 | [diff] [blame] | 647 | Jim Baker, Miki Tebeka, and Nick Coghlan.) |
Raymond Hettinger | aed05eb | 2010-08-02 01:43:41 +0000 | [diff] [blame] | 648 | |
Antoine Pitrou | 7d49bc9 | 2010-09-15 15:13:17 +0000 | [diff] [blame] | 649 | * The :func:`functools.wraps` decorator now adds a :attr:`__wrapped__` attribute |
| 650 | pointing to the original callable function. This allows wrapped functions to |
| 651 | be introspected. It also copies :attr:`__annotations__` if defined. And now |
| 652 | it also gracefully skips over missing attributes such as :attr:`__doc__` which |
Raymond Hettinger | 5eb6390 | 2010-12-09 23:43:34 +0000 | [diff] [blame] | 653 | might not be defined for the wrapped callable. |
Antoine Pitrou | 7d49bc9 | 2010-09-15 15:13:17 +0000 | [diff] [blame] | 654 | |
| 655 | (By Nick Coghlan and Terrence Cole; :issue:`9567`, :issue:`3445`, and |
| 656 | :issue:`8814`.) |
| 657 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 658 | * To help write classes with rich comparison methods, a new decorator |
| 659 | :func:`functools.total_ordering` will use a existing equality and inequality |
| 660 | methods to fill-in the remaining methods. |
| 661 | |
| 662 | For example, supplying *__eq__* and *__lt__* will enable |
| 663 | :func:`~functools.total_ordering` to fill-in *__le__*, *__gt__* and *__ge__*:: |
| 664 | |
| 665 | @total_ordering |
| 666 | class Student: |
| 667 | def __eq__(self, other): |
| 668 | return ((self.lastname.lower(), self.firstname.lower()) == |
| 669 | (other.lastname.lower(), other.firstname.lower())) |
| 670 | def __lt__(self, other): |
| 671 | return ((self.lastname.lower(), self.firstname.lower()) < |
| 672 | (other.lastname.lower(), other.firstname.lower())) |
| 673 | |
| 674 | (Contributed by Raymond Hettinger.) |
| 675 | |
| 676 | * To aid in porting programs from Python 2, the :func:`~functools.cmp_to_key` |
Raymond Hettinger | bb9686f | 2010-12-16 00:53:05 +0000 | [diff] [blame] | 677 | function converts an old-style comparison function to |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 678 | modern :term:`key function`: |
| 679 | |
| 680 | >>> # locale-aware sort order |
| 681 | >>> sorted(iterable, key=cmp_to_key(locale.strcoll)) |
| 682 | |
| 683 | For sorting examples and a brief sorting tutorial, see the `Sorting HowTo |
| 684 | <http://wiki.python.org/moin/HowTo/Sorting/>`_ tutorial. |
| 685 | |
| 686 | (Contributed by Raymond Hettinger.) |
| 687 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 688 | itertools |
| 689 | --------- |
| 690 | |
Raymond Hettinger | 673ccf2 | 2010-12-07 09:37:11 +0000 | [diff] [blame] | 691 | * The :mod:`itertools` module has a new :func:`~itertools.accumulate` function |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 692 | modeled on APL's *scan* operator and on Numpy's *accumulate* function: |
Raymond Hettinger | 6e35394 | 2010-12-04 23:42:12 +0000 | [diff] [blame] | 693 | |
| 694 | >>> list(accumulate(8, 2, 50)) |
| 695 | [8, 10, 60] |
| 696 | |
| 697 | >>> prob_dist = [0.1, 0.4, 0.2, 0.3] |
| 698 | >>> list(accumulate(prob_dist)) # cumulative probability distribution |
| 699 | [0.1, 0.5, 0.7, 1.0] |
| 700 | |
| 701 | For an example using :func:`~itertools.accumulate`, see the :ref:`examples for |
| 702 | the random module <random-examples>`. |
| 703 | |
| 704 | (Contributed by Raymond Hettinger and incorporating design suggestions |
| 705 | from Mark Dickinson.) |
| 706 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 707 | collections |
| 708 | ----------- |
| 709 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 710 | * The :class:`collections.Counter` class now has two forms of in-place |
| 711 | subtraction, the existing *-=* operator for `saturating subtraction |
| 712 | <http://en.wikipedia.org/wiki/Saturation_arithmetic>`_ and the new |
| 713 | :meth:`~collections.Counter.subtract` method for regular subtraction. The |
| 714 | former is suitable for `multisets <http://en.wikipedia.org/wiki/Multiset>`_ |
Raymond Hettinger | ffad35e | 2010-12-14 21:12:03 +0000 | [diff] [blame] | 715 | which only have positive counts, and the latter is more suitable for use cases |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 716 | that allow negative counts: |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 717 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 718 | >>> tally = Counter(dogs=5, cat=3) |
| 719 | >>> tally -= Counter(dogs=2, cats=8) # saturating subtraction |
| 720 | >>> tally |
| 721 | Counter({'dogs': 3}) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 722 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 723 | >>> tally = Counter(dogs=5, cats=3) |
| 724 | >>> tally.subtract(dogs=2, cats=8) # regular subtraction |
| 725 | >>> tally |
| 726 | Counter({'dogs': 3, 'cats': -5}) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 727 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 728 | (Contributed by Raymond Hettinger.) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 729 | |
Raymond Hettinger | e0a9600 | 2010-12-15 17:54:13 +0000 | [diff] [blame] | 730 | * The :class:`collections.OrderedDict` class has a new method |
| 731 | :meth:`~collections.OrderedDict.move_to_end` which takes an existing key and |
| 732 | moves it to either the beginning or end of an ordered sequence. When the |
| 733 | dictionary sequence is being used as a queue, these operations correspond to |
| 734 | "move to the front of the line" or "move to the back of the line": |
| 735 | |
| 736 | >>> d = OrderedDict.fromkeys(['a', 'b', 'X', 'd', 'e']) |
| 737 | >>> list(d) |
| 738 | ['a', 'b', 'X', 'd', 'e'] |
| 739 | >>> d.move_to_end('X', last=True) |
| 740 | >>> list(d) |
| 741 | ['a', 'b', 'd', 'e', 'X'] |
| 742 | >>> d.move_to_end('X', last=False) |
| 743 | >>> list(d) |
| 744 | ['X', 'a', 'b', 'd', 'e'] |
| 745 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 746 | (Contributed by Raymond Hettinger.) |
| 747 | |
| 748 | * The :class:`collections.deque` grew two new methods :meth:`~collections.deque.count` |
| 749 | and :meth:`collections.deque.reverse` that make them more substitutable for |
| 750 | :class:`list` when needed: |
| 751 | |
| 752 | >>> d = deque('simsalabim') |
| 753 | >>> d.count('s') |
| 754 | 2 |
| 755 | >>> d.reverse() |
| 756 | >>> d |
| 757 | deque(['m', 'i', 'b', 'a', 'l', 'a', 's', 'm', 'i', 's']) |
| 758 | |
| 759 | (Contributed by Raymond Hettinger.) |
| 760 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 761 | datetime |
| 762 | -------- |
| 763 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 764 | * The :mod:`datetime` module has a new type :class:`~datetime.timezone` that |
| 765 | implements the :class:`~datetime.tzinfo` interface by returning a fixed UTC |
| 766 | offset and timezone name. This makes it easier to create timezone aware |
| 767 | datetime objects: |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 768 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 769 | >>> datetime.now(timezone.utc) |
| 770 | datetime.datetime(2010, 12, 8, 21, 4, 2, 923754, tzinfo=datetime.timezone.utc) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 771 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 772 | >>> datetime.strptime("01/01/2000 12:00 +0000", "%m/%d/%Y %H:%M %z") |
| 773 | datetime.datetime(2000, 1, 1, 12, 0, tzinfo=datetime.timezone.utc) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 774 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 775 | * Also, :class:`~datetime.timedelta` objects can now be multiplied by |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 776 | :class:`float` and divided by :class:`float` and :class:`int` objects. |
| 777 | |
| 778 | (Contributed by Alexander Belopolsky in :issue:`1289118`, :issue:`5094` and |
| 779 | :issue:`6641`.) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 780 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 781 | abc |
| 782 | --- |
Antoine Pitrou | 7d49bc9 | 2010-09-15 15:13:17 +0000 | [diff] [blame] | 783 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 784 | The :mod:`abc` module now supports :func:`~abc.abstractclassmethod` and |
| 785 | :func:`~abc.abstractstaticmethod`. |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 786 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 787 | These tools make it possible to define an :term:`Abstract Base Class` that |
| 788 | requires a particular :func:`classmethod` or :func:`staticmethod` to be |
| 789 | implemented. |
Antoine Pitrou | 7d49bc9 | 2010-09-15 15:13:17 +0000 | [diff] [blame] | 790 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 791 | (Patch submitted by Daniel Urban; :issue:`5867`.) |
Raymond Hettinger | bcbd696 | 2010-09-05 08:46:36 +0000 | [diff] [blame] | 792 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 793 | contextlib |
| 794 | ---------- |
| 795 | |
| 796 | There is a new and slightly mind-blowing tool |
| 797 | :class:`~contextlib.ContextDecorator` that is helpful for creating a |
| 798 | :term:`context manager` that does double-duty as a function decorator. |
| 799 | |
| 800 | As a convenience, this new functionality is used by |
| 801 | :func:`~contextlib.contextmanager` so that no extra effort is needed to support |
| 802 | both roles. |
| 803 | |
| 804 | The basic idea is that both context managers and function decorators can be used |
| 805 | for pre-action and post-action wrappers. Context managers wrap a group of |
| 806 | statements using the :keyword:`with`-statement, and function decorators wrap a |
| 807 | group of statements enclosed in a function. So, occasionally there is a need to |
| 808 | write a pre/post action wrapper that can be used in either role. |
| 809 | |
| 810 | For example, it is sometimes useful to wrap functions or groups of statements |
| 811 | with a logger that can track the time of entry and time of exit. Rather than |
| 812 | writing both a function decorator and a context manager for the task, the |
| 813 | :func:`~contextlib.contextmanager` provides both capabilities in a single |
| 814 | definition: |
| 815 | |
| 816 | >>> import logging |
| 817 | >>> logging.basicConfig(level=logging.INFO) |
| 818 | >>> @contextmanager |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 819 | ... def track_entry_and_exit(name): |
| 820 | ... logging.info('Entering: {}'.format(name)) |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 821 | ... yield |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 822 | ... logging.info('Exiting: {}'.format(name)) |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 823 | |
| 824 | Formerly, this would have only been usable as a context manager: |
| 825 | |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 826 | >>> with track_entry_and_exit('widget loader'): |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 827 | ... print('Some time consuming activity goes here') |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 828 | ... load_widget() |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 829 | |
| 830 | Now, it can be used as a decorator as well: |
| 831 | |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 832 | >>> @track_entry_and_exit('widget loader') |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 833 | ... def activity(): |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 834 | ... print('Some time consuming activity goes here') |
| 835 | ... load_widget() |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 836 | |
| 837 | Trying to fulfill two roles at once places some limitations on the technique. |
| 838 | Context managers normally have the flexibility to return an argument usable by |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 839 | the :keyword:`with`-statement, but there is no parallel for function decorators. |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 840 | |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 841 | In the above example, there is not a clean way for the *track_entry_and_exit* |
| 842 | context manager does not have a way to return a logging instance for use in the |
| 843 | body of enclosed statements. |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 844 | |
| 845 | (Contributed by Michael Foord in :issue:`9110`.) |
| 846 | |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 847 | decimal and fractions |
| 848 | --------------------- |
| 849 | |
| 850 | Mark Dickinson crafted an elegant and efficient scheme for assuring that |
| 851 | different numeric datatypes will have the same hash value whenever their actual |
| 852 | values are equal (:issue:`8188`):: |
| 853 | |
| 854 | >>> assert hash(Fraction(3, 2)) == hash(1.5) == \ |
| 855 | hash(Decimal("1.5")) == hash(complex(1.5, 0)) |
| 856 | |
| 857 | An early decision to limit the inter-operability of various numeric types has |
| 858 | been relaxed. It is still unsupported (and ill-advised) to to have implicit |
| 859 | mixing in arithmetic expressions such as ``Decimal('1.1') + float('1.1')`` |
| 860 | because the latter loses information in the process of constructing the binary |
| 861 | float. However, since existing floating point value can be converted losslessly |
| 862 | to either a decimal or rational representation, it makes sense to add them to |
| 863 | the constructor and to support mixed-type comparisons. |
| 864 | |
Raymond Hettinger | bb9686f | 2010-12-16 00:53:05 +0000 | [diff] [blame] | 865 | * The :class:`decimal.Decimal` constructor now accepts :class:`float` objects |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 866 | directly so there in no longer a need to use the :meth:`~decimal.Decimal.from_float` |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 867 | method (:issue:`8257`). |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 868 | |
| 869 | * Mixed type comparisons are now fully supported so that |
| 870 | :class:`~decimal.Decimal` objects can be directly compared with :class:`float` |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 871 | and :class:`fractions.Fraction` (:issue:`2531` and :issue:`8188`). |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 872 | |
| 873 | Similar changes were made to :class:`fractions.Fraction` so that the |
| 874 | :meth:`~fractions.Fraction.from_float()` and :meth:`~fractions.Fraction.from_decimal` |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 875 | methods are no longer needed (:issue:`8294`): |
| 876 | |
| 877 | >>> Decimal(1.1) |
| 878 | Decimal('1.100000000000000088817841970012523233890533447265625') |
| 879 | >>> Fraction(1.1) |
| 880 | Fraction(2476979795053773, 2251799813685248) |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 881 | |
| 882 | Another useful change for the :mod:`decimal` module is that the |
| 883 | :attr:`Context.clamp` attribute is now public. This is useful in creating |
| 884 | contexts that correspond to the decimal interchange formats specified in IEEE |
| 885 | 754 (see :issue:`8540`). |
| 886 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 887 | (Contributed by Mark Dickinson and Raymond Hettinger.) |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 888 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 889 | ftp |
| 890 | --- |
Raymond Hettinger | bcbd696 | 2010-09-05 08:46:36 +0000 | [diff] [blame] | 891 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 892 | The :class:`ftplib.FTP` class now supports the context manager protocol to |
| 893 | unconditionally consume :exc:`socket.error` exceptions and to close the FTP |
| 894 | connection when done:: |
Giampaolo Rodolà | bd576b7 | 2010-05-10 14:53:29 +0000 | [diff] [blame] | 895 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 896 | >>> from ftplib import FTP |
| 897 | >>> with FTP("ftp1.at.proftpd.org") as ftp: |
| 898 | ... ftp.login() |
| 899 | ... ftp.dir() |
| 900 | ... |
| 901 | '230 Anonymous login ok, restrictions apply.' |
| 902 | dr-xr-xr-x 9 ftp ftp 154 May 6 10:43 . |
| 903 | dr-xr-xr-x 9 ftp ftp 154 May 6 10:43 .. |
| 904 | dr-xr-xr-x 5 ftp ftp 4096 May 6 10:43 CentOS |
| 905 | dr-xr-xr-x 3 ftp ftp 18 Jul 10 2008 Fedora |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 906 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 907 | Other file-like objects such as :class:`mmap.mmap` and :func:`fileinput.input` |
| 908 | also grew auto-closing context managers:: |
| 909 | |
| 910 | with fileinput.input(files=('log1.txt', 'log2.txt')) as f: |
| 911 | for line in f: |
| 912 | process(line) |
| 913 | |
| 914 | (Contributed by Tarek Ziadé and Giampaolo Rodolà in :issue:`4972`, and |
| 915 | by Georg Brandl in :issue:`8046` and :issue:`1286`.) |
Antoine Pitrou | 696e035 | 2010-08-08 22:18:46 +0000 | [diff] [blame] | 916 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 917 | .. XXX mention os.popen and subprocess.Popen auto-closing of fds |
Georg Brandl | 3ad4675 | 2010-12-05 07:59:29 +0000 | [diff] [blame] | 918 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 919 | gzip and zipfile |
| 920 | ---------------- |
Antoine Pitrou | cd889af | 2010-10-06 21:13:56 +0000 | [diff] [blame] | 921 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 922 | :class:`gzip.GzipFile` now implements the :class:`io.BufferedIOBase` |
| 923 | :term:`abstract base class` (except for ``truncate()``). It also has a |
| 924 | :meth:`~gzip.GzipFile.peek` method and supports unseekable as well as |
| 925 | zero-padded file objects. |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 926 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 927 | The :mod:`gzip` module also gains the :func:`~gzip.compress` and |
| 928 | :func:`~gzip.decompress` functions for easier in-memory compression and |
| 929 | decompression. Keep in mind that text needs to be encoded in to :class:`bytes` |
| 930 | before compressing and decompressing: |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 931 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 932 | >>> s = 'Three shall be the number thou shalt count, ' |
| 933 | >>> s += 'and the number of the counting shall be three' |
| 934 | >>> b = s.encode() # convert to utf-8 |
| 935 | >>> len(b) |
| 936 | 89 |
| 937 | >>> c = gzip.compress(b) |
| 938 | >>> len(c) |
| 939 | 77 |
| 940 | >>> gzip.decompress(c).decode()[:42] # decompress and convert to text |
| 941 | 'Three shall be the number thou shalt count,' |
Antoine Pitrou | cd889af | 2010-10-06 21:13:56 +0000 | [diff] [blame] | 942 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 943 | (Contributed by Anand B. Pillai in :issue:`3488`; and by Antoine Pitrou, Nir |
| 944 | Aides and Brian Curtin in :issue:`9962`, :issue:`1675951`, :issue:`7471` and |
| 945 | :issue:`2846`.) |
| 946 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 947 | Also, the :class:`zipfile.ZipExtFile` class was reworked internally to represent |
| 948 | files stored inside an archive. The new implementation is significantly faster |
| 949 | and can be wrapped in a :class:`io.BufferedReader` object for more speedups. It |
| 950 | also solves an issue where interleaved calls to *read* and *readline* gave the |
| 951 | wrong results. |
| 952 | |
| 953 | (Patch submitted by by Nir Aides in :issue:`7610`.) |
| 954 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 955 | shutil |
| 956 | ------ |
| 957 | |
| 958 | The :func:`shutil.copytree` function has two new options: |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 959 | |
Raymond Hettinger | db9044e | 2010-09-06 01:29:23 +0000 | [diff] [blame] | 960 | * *ignore_dangling_symlinks*: when ``symlinks=False`` so that the function |
| 961 | copies the file pointed to by the symlink, not the symlink itself. This |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 962 | option will silence the error raised if the file doesn't exist. |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 963 | |
Raymond Hettinger | db9044e | 2010-09-06 01:29:23 +0000 | [diff] [blame] | 964 | * *copy_function*: is a callable that will be used to copy files. |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 965 | :func:`shutil.copy2` is used by default. |
| 966 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 967 | (Contributed by Tarek Ziadé.) |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 968 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 969 | sqlite3 |
| 970 | ------- |
Antoine Pitrou | e43f9d0 | 2010-08-08 23:24:50 +0000 | [diff] [blame] | 971 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 972 | The :mod:`sqlite3` module was updated to version 2.6.0. It has two new capabilities. |
Antoine Pitrou | e43f9d0 | 2010-08-08 23:24:50 +0000 | [diff] [blame] | 973 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 974 | * The :attr:`sqlite3.Connection.in_transit` attribute is true if there is an |
| 975 | active transaction for uncommitted changes. |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 976 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 977 | * The :meth:`sqlite3.Connection.enable_load_extension` and |
| 978 | :meth:`sqlite3.Connection.load_extension` methods allows you to load SQLite |
| 979 | extensions from ".so" files. One well-known extension is the fulltext-search |
| 980 | extension distributed with SQLite. |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 981 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 982 | (Contributed by R. David Murray and Shashwat Anand; :issue:`8845`.) |
| 983 | |
| 984 | socket |
| 985 | ------ |
| 986 | |
| 987 | The :mod:`socket` module has two new improvements. |
| 988 | |
| 989 | * Socket objects now have a :meth:`~socket.socket.detach()` method which puts |
| 990 | the socket into closed state without actually closing the underlying file |
| 991 | descriptor. The latter can then be reused for other purposes. |
| 992 | (Added by Antoine Pitrou; :issue:`8524`.) |
| 993 | |
| 994 | * :func:`socket.create_connection` now supports the context manager protocol |
| 995 | to unconditionally consume :exc:`socket.error` exceptions and to close the |
| 996 | socket when done. |
| 997 | (Contributed by Giampaolo Rodolà; :issue:`9794`.) |
| 998 | |
| 999 | ssl |
| 1000 | --- |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 1001 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1002 | * The :mod:`ssl` module has a new class, :class:`~ssl.SSLContext` which serves |
| 1003 | as a container for various persistent SSL data, such as protocol settings, |
| 1004 | certificates, private keys, and various other options. The |
| 1005 | :meth:`~ssl.SSLContext.wrap_socket` method allows to create an SSL socket from |
| 1006 | such an SSL context. (Added by Antoine Pitrou; :issue:`8550`.) |
Antoine Pitrou | 4f2a0a8 | 2010-07-31 18:08:33 +0000 | [diff] [blame] | 1007 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1008 | * A new function, :func:`ssl.match_hostname`, helps implement server identity |
Antoine Pitrou | 0ee4c9f | 2010-10-08 16:46:17 +0000 | [diff] [blame] | 1009 | verification for higher-level protocols by implementing the rules of |
| 1010 | HTTPS (from :rfc:`2818`), which are also suitable for other protocols. |
| 1011 | (Added by Antoine Pitrou, :issue:`1589`). |
| 1012 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1013 | * The :func:`ssl.wrap_socket` constructor function now takes a *ciphers* |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1014 | argument that's a string listing the encryption algorithms to be allowed; the |
| 1015 | format of the string is described `in the OpenSSL documentation |
| 1016 | <http://www.openssl.org/docs/apps/ciphers.html#CIPHER_LIST_FORMAT>`__. (Added |
| 1017 | by Antoine Pitrou; :issue:`8322`.) |
Antoine Pitrou | 4f2a0a8 | 2010-07-31 18:08:33 +0000 | [diff] [blame] | 1018 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1019 | * When linked against a recent enough version of OpenSSL, the :mod:`ssl` |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 1020 | module now supports the Server Name Indication extension to the TLS |
| 1021 | protocol, allowing for several "virtual hosts" using different certificates |
| 1022 | on a single IP/port. This extension is only supported in client mode, |
| 1023 | and is activated by passing the *server_hostname* argument to |
| 1024 | :meth:`SSLContext.wrap_socket`. |
| 1025 | (Added by Antoine Pitrou, :issue:`5639`.) |
| 1026 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1027 | * Various options have been added to the :mod:`ssl` module, such as |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1028 | :data:`~ssl.OP_NO_SSLv2` which allows to force disabling of the insecure and |
| 1029 | obsolete SSLv2 protocol. (Added by Antoine Pitrou; :issue:`4870`.) |
Antoine Pitrou | 4f2a0a8 | 2010-07-31 18:08:33 +0000 | [diff] [blame] | 1030 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1031 | * Another change makes the extension load all of OpenSSL's ciphers and digest |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1032 | algorithms so that they're all available. Some SSL certificates couldn't be |
| 1033 | verified, reporting an "unknown algorithm" error. (Reported by Beda Kosata, |
| 1034 | and fixed by Antoine Pitrou; :issue:`8484`.) |
Antoine Pitrou | 4f2a0a8 | 2010-07-31 18:08:33 +0000 | [diff] [blame] | 1035 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1036 | * The version of OpenSSL being used is now available as the module attributes |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1037 | :data:`ssl.OPENSSL_VERSION` (a string), :data:`ssl.OPENSSL_VERSION_INFO` (a |
| 1038 | 5-tuple), and :data:`ssl.OPENSSL_VERSION_NUMBER` (an integer). (Added by |
| 1039 | Antoine Pitrou; :issue:`8321`.) |
Antoine Pitrou | 4f2a0a8 | 2010-07-31 18:08:33 +0000 | [diff] [blame] | 1040 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1041 | nntp |
| 1042 | ---- |
Raymond Hettinger | 070ec70 | 2010-12-10 17:45:13 +0000 | [diff] [blame] | 1043 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1044 | The :mod:`nntplib` module has a revamped implementation with better bytes and |
| 1045 | unicode semantics as well as more practical APIs. These improvements break |
| 1046 | compatibility with the nntplib version in Python 3.1, which was partly |
| 1047 | dysfunctional in itself. |
Raymond Hettinger | 070ec70 | 2010-12-10 17:45:13 +0000 | [diff] [blame] | 1048 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1049 | (Contributed by Antoine Pitrou in :issue:`9360`) |
| 1050 | |
| 1051 | certificates |
| 1052 | ------------ |
| 1053 | |
| 1054 | :class:`http.client.HTTPSConnection`, :class:`urllib.request.HTTPSHandler` |
| 1055 | and :func:`urllib.request.urlopen` now take optional arguments to allow for |
| 1056 | server certificate checking against a set of Certificate Authorities, |
| 1057 | as recommended in public uses of HTTPS. |
| 1058 | |
| 1059 | (Added by Antoine Pitrou, :issue:`9003`.) |
| 1060 | |
| 1061 | unittest |
| 1062 | -------- |
Antoine Pitrou | afb078d | 2010-11-05 22:18:28 +0000 | [diff] [blame] | 1063 | |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 1064 | * The command-line call, ``python -m unittest`` can now accept file paths |
| 1065 | instead of module names for running specific tests (:issue:`10620`). The new |
| 1066 | test discovery can find tests within packages, locating any test importable |
| 1067 | from the top level directory. The top level directory can be specified with |
| 1068 | the `-t` option, a pattern for matching files with ``-p``, and a directory to |
| 1069 | start discovery with ``-s``:: |
| 1070 | |
| 1071 | $ python -m unittest discover -s my_proj_dir -p '_test.py' |
| 1072 | |
| 1073 | (Contributed by Michael Foord.) |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1074 | |
Raymond Hettinger | dc2f9b5 | 2010-12-05 07:02:45 +0000 | [diff] [blame] | 1075 | * The :mod:`unittest` module has two new methods, |
| 1076 | :meth:`~unittest.TestCase.assertWarns` and |
| 1077 | :meth:`~unittest.TestCase.assertWarnsRegex` to check that a given warning type |
Raymond Hettinger | 413abbc | 2010-12-05 07:06:47 +0000 | [diff] [blame] | 1078 | is triggered by the code under test: |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1079 | |
Raymond Hettinger | dc2f9b5 | 2010-12-05 07:02:45 +0000 | [diff] [blame] | 1080 | >>> with self.assertWarns(DeprecationWarning): |
| 1081 | ... legacy_function('XYZ') |
Ezio Melotti | 2baf1a6 | 2010-11-22 12:56:58 +0000 | [diff] [blame] | 1082 | |
Raymond Hettinger | 21ec4bc | 2010-12-10 01:09:01 +0000 | [diff] [blame] | 1083 | Another new method, :meth:`~unittest.TestCase.assertCountEqual` is used to |
Raymond Hettinger | ffad35e | 2010-12-14 21:12:03 +0000 | [diff] [blame] | 1084 | compare two iterables to determine if their element counts are equal (whether |
| 1085 | the same elements are present with the same number of occurrences regardless |
| 1086 | of order):: |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 1087 | |
| 1088 | def test_anagram(self): |
| 1089 | self.assertCountEqual('algorithm', 'logarithm') |
| 1090 | |
| 1091 | A principal feature of the unittest module is an effort to produce meaningful |
| 1092 | diagnostics when a test fails. When possible the failure is recorded along |
| 1093 | with a diff of the output. This is especially helpful for analyzing log files |
| 1094 | of failed test runs. However, since diffs can sometime be voluminous, there is |
| 1095 | a new :attr:`~unittest.TestCase.maxDiff` attribute which sets maximum length of |
| 1096 | diffs. |
| 1097 | |
Raymond Hettinger | 68f1e8d | 2010-12-07 09:24:30 +0000 | [diff] [blame] | 1098 | In addition the naming in the module has undergone a number of clean-ups. For |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 1099 | example, :meth:`~unittest.TestCase.assertRegex` is the new name for |
| 1100 | :meth:`~unittest.TestCase.assertRegexpMatches` which was misnamed because the |
Raymond Hettinger | 21ec4bc | 2010-12-10 01:09:01 +0000 | [diff] [blame] | 1101 | test uses :func:`re.search`, not :func:`re.match`. Other methods using |
| 1102 | regular expressions are now named using short form "Regex" in preference |
| 1103 | to "Regexp" -- this matches the names used in other unittest implementations, |
| 1104 | matches Python's old name for the :mod:`re` module, and it has unambiguous |
| 1105 | camel-casing. |
Raymond Hettinger | dc2f9b5 | 2010-12-05 07:02:45 +0000 | [diff] [blame] | 1106 | |
| 1107 | To improve consistency, some of long-standing method aliases are being |
| 1108 | deprecated in favor of the preferred names: |
| 1109 | |
| 1110 | - replace :meth:`assert_` with :meth:`.assertTrue` |
| 1111 | - replace :meth:`assertEquals` with :meth:`.assertEqual` |
| 1112 | - replace :meth:`assertNotEquals` with :meth:`.assertNotEqual` |
| 1113 | - replace :meth:`assertAlmostEquals` with :meth:`.assertAlmostEqual` |
| 1114 | - replace :meth:`assertNotAlmostEquals` with :meth:`.assertNotAlmostEqual` |
| 1115 | |
| 1116 | Likewise, the ``TestCase.fail*`` methods deprecated in Python 3.1 are expected |
| 1117 | to be removed in Python 3.3. See also the :ref:`deprecated-aliases` section in |
| 1118 | the :mod:`unittest` documentation. |
Ezio Melotti | 2baf1a6 | 2010-11-22 12:56:58 +0000 | [diff] [blame] | 1119 | |
| 1120 | (Contributed by Ezio Melotti; :issue:`9424`.) |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1121 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1122 | random |
| 1123 | ------ |
Raymond Hettinger | e0a9600 | 2010-12-15 17:54:13 +0000 | [diff] [blame] | 1124 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 1125 | The integer methods in the :mod:`random` module now do a better job of producing |
Raymond Hettinger | 99db3fd | 2010-12-15 19:33:49 +0000 | [diff] [blame] | 1126 | uniform distributions. Previously, they computed selections with |
| 1127 | ``int(n*random())`` which had a slight bias whenever *n* was not a power of two. |
| 1128 | Now, multiple selections are made from a range upto the next power of two and a |
| 1129 | selection is kept only when it falls within the range ``0 <= x < n``. The |
| 1130 | functions and methods affected are :func:`~random.randrange`, |
| 1131 | :func:`~random.randint`, :func:`~random.choice`, :func:`~random.shuffle` and |
| 1132 | :func:`~random.sample`. |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1133 | |
| 1134 | (Contributed by Raymond Hettinger; :issue:`9025`.) |
| 1135 | |
| 1136 | poplib |
| 1137 | ------ |
Raymond Hettinger | e0a9600 | 2010-12-15 17:54:13 +0000 | [diff] [blame] | 1138 | |
Giampaolo Rodolà | 42382fe | 2010-08-17 16:09:53 +0000 | [diff] [blame] | 1139 | * :class:`~poplib.POP3_SSL` class now accepts a *context* parameter, which is a |
| 1140 | :class:`ssl.SSLContext` object allowing bundling SSL configuration options, |
| 1141 | certificates and private keys into a single (potentially long-lived) |
| 1142 | structure. |
| 1143 | |
| 1144 | (Contributed by Giampaolo Rodolà; :issue:`8807`.) |
| 1145 | |
Giampaolo Rodolà | 977c707 | 2010-10-04 21:08:36 +0000 | [diff] [blame] | 1146 | * :class:`asyncore.dispatcher` now provides a |
| 1147 | :meth:`~asyncore.dispatcher.handle_accepted()` method |
| 1148 | returning a `(sock, addr)` pair which is called when a connection has actually |
| 1149 | been established with a new remote endpoint. This is supposed to be used as a |
| 1150 | replacement for old :meth:`~asyncore.dispatcher.handle_accept()` and avoids |
| 1151 | the user to call :meth:`~asyncore.dispatcher.accept()` directly. |
| 1152 | |
| 1153 | (Contributed by Giampaolo Rodolà; :issue:`6706`.) |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1154 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1155 | tempfile |
| 1156 | -------- |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 1157 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1158 | The :mod:`tempfile` module has a new context manager, |
| 1159 | :class:`~tempfile.TemporaryDirectory` which provides easy deterministic |
| 1160 | cleanup of temporary directories: |
Nick Coghlan | 543af75 | 2010-10-24 11:23:25 +0000 | [diff] [blame] | 1161 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1162 | >>> with tempfile.TemporaryDirectory() as tmpdirname: |
| 1163 | ... print 'created temporary directory', tmpdirname |
Nick Coghlan | 543af75 | 2010-10-24 11:23:25 +0000 | [diff] [blame] | 1164 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1165 | (Contributed by Neil Schemenauer and Nick Coghlan; :issue:`5178`.) |
Nick Coghlan | e0f0465 | 2010-11-21 03:44:04 +0000 | [diff] [blame] | 1166 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1167 | inspect |
| 1168 | ------- |
| 1169 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 1170 | * The :mod:`inspect` module has a new function |
| 1171 | :func:`~inspect.getgeneratorstate` to easily identify the current state of a |
| 1172 | generator as one of ``GEN_CREATED``, ``GEN_RUNNING``, ``GEN_SUSPENDED`` or |
| 1173 | ``GEN_CLOSED``. (Contributed by Rodolpho Eckhardt and Nick Coghlan, |
| 1174 | :issue:`10220`.) |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1175 | |
Raymond Hettinger | a55ffbc | 2010-12-15 18:31:57 +0000 | [diff] [blame] | 1176 | * To support lookups without the possibility of activating a dynamic attribute, |
| 1177 | the :mod:`inspect` module has a new function, :func:`~inspect.getattr_static`. |
| 1178 | Unlike, :func:`hasattr`, this is a true read-only search, guaranteed not to |
| 1179 | change state while it is searching. (Contributed by Michael Foord.) |
Nick Coghlan | e0f0465 | 2010-11-21 03:44:04 +0000 | [diff] [blame] | 1180 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1181 | pydoc |
| 1182 | ----- |
Nick Coghlan | 7bb30b7 | 2010-12-03 09:29:11 +0000 | [diff] [blame] | 1183 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1184 | The :mod:`pydoc` module now provides a much improved Web server interface, |
| 1185 | as well as a new command-line option to automatically open a browser |
| 1186 | window to display that server. |
Nick Coghlan | 7bb30b7 | 2010-12-03 09:29:11 +0000 | [diff] [blame] | 1187 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1188 | (Contributed by Ron Adam; :issue:`2001`.) |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1189 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1190 | sysconfig |
| 1191 | --------- |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1192 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1193 | The new :mod:`sysconfig` module makes it straight-forward to discover |
| 1194 | installation paths and configuration variables which vary across platforms and |
| 1195 | installations. |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1196 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1197 | The module offers access simple access functions for platform and version |
| 1198 | information: |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1199 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1200 | * :func:`~sysconfig.get_platform` returning values like *linux-i586* or |
| 1201 | *macosx-10.6-ppc*. |
| 1202 | * :func:`~sysconfig.get_python_version` returns a Python version string in |
| 1203 | the form, "3.2". |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1204 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1205 | It also provides access to the paths and variables corresponding to one of |
| 1206 | seven named schemes used by :mod:`distutils`. Those include *posix_prefix*, |
| 1207 | *posix_home*, *posix_user*, *nt*, *nt_user*, *os2*, *os2_home*: |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1208 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1209 | * :func:`~sysconfig.get_paths` makes a dictionary containing installation paths |
| 1210 | for the current installation scheme. |
| 1211 | * :func:`~sysconfig.get_config_vars` returns a dictionary of platform specific |
| 1212 | variables. |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1213 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1214 | There is also a convenient command-line interface:: |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1215 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1216 | C:\Python32>python -m sysconfig |
| 1217 | Platform: "win32" |
| 1218 | Python version: "3.2" |
| 1219 | Current installation scheme: "nt" |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1220 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1221 | Paths: |
| 1222 | data = "C:\Python32" |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame^] | 1223 | include = "C:\Python32\Include" platinclude = "C:\Python32\Include" |
| 1224 | platlib = "C:\Python32\Lib\site-packages" platstdlib |
| 1225 | = "C:\Python32\Lib" purelib = "C:\Python32\Lib\site-packages" scripts |
| 1226 | = "C:\Python32\Scripts" stdlib = "C:\Python32\Lib" |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1227 | |
| 1228 | Variables: |
| 1229 | BINDIR = "C:\Python32" |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame^] | 1230 | BINLIBDEST = "C:\Python32\Lib" EXE = ".exe" INCLUDEPY |
| 1231 | = "C:\Python32\Include" LIBDEST = "C:\Python32\Lib" SO = ".pyd" |
| 1232 | VERSION = "32" abiflags = "" base = "C:\Python32" exec_prefix |
| 1233 | = "C:\Python32" platbase = "C:\Python32" prefix = "C:\Python32" |
| 1234 | projectbase = "C:\Python32" py_version = "3.2" py_version_nodot = "32" |
| 1235 | py_version_short = "3.2" srcdir = "C:\Python32" userbase |
| 1236 | = "C:\Documents and Settings\Raymond\Application Data\Python" |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1237 | |
| 1238 | pdb |
| 1239 | --- |
| 1240 | |
| 1241 | The :mod:`pdb` debugger module gained a number of usability improvements: |
Raymond Hettinger | b5d7933 | 2010-12-07 02:04:56 +0000 | [diff] [blame] | 1242 | |
Raymond Hettinger | 99db3fd | 2010-12-15 19:33:49 +0000 | [diff] [blame] | 1243 | * :file:`pdb.py` now has a ``-c`` option that executes commands as given in a |
| 1244 | :file:`.pdbrc` script file. |
| 1245 | * A :file:`.pdbrc` script file can contain ``continue`` and ``next`` commands |
| 1246 | that continue debugging. |
| 1247 | * The :class:`Pdb` class constructor now accepts a *nosigint* argument. |
| 1248 | * new commands: ``l(list)``, ``ll(long list`` and ``source`` for |
| 1249 | listing source code. |
| 1250 | * new commands: ``display`` and ``undisplay`` for showing or hiding |
| 1251 | the value of an expression if it has changed. |
| 1252 | * new command: ``interact`` for starting an interactive interpreter containing |
| 1253 | the global and local names found in the current scope. |
| 1254 | * breakpoints can be cleared by breakpoint number |
Raymond Hettinger | b5d7933 | 2010-12-07 02:04:56 +0000 | [diff] [blame] | 1255 | |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame^] | 1256 | configparser |
| 1257 | ------------ |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1258 | |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame^] | 1259 | The :mod:`configparser` module was modified to improve usability and |
| 1260 | predictability of the default parser and its supported INI syntax. The old |
| 1261 | :class:`ConfigParser` class was removed in favor of :class:`SafeConfigParser` |
| 1262 | which has in turn been renamed to :class:`ConfigParser`. Support for inline |
| 1263 | comments is now turned off by default and section or option duplicates are not |
| 1264 | allowed in a single configuration source. |
| 1265 | |
| 1266 | Config parsers gained a new API based on the mapping protocol:: |
| 1267 | |
| 1268 | >>> parser = ConfigParser() |
| 1269 | >>> parser.read_string(""" |
| 1270 | ... [DEFAULT] |
| 1271 | ... monty = python |
| 1272 | ... |
| 1273 | ... [phrases] |
| 1274 | ... the = who |
| 1275 | ... full = metal jacket |
| 1276 | ... """) |
| 1277 | >>> parser['phrases']['full'] |
| 1278 | 'metal jacket' |
| 1279 | >>> section = parser['phrases'] |
| 1280 | >>> section['the'] |
| 1281 | 'who' |
| 1282 | >>> section['british'] = '%(the)s %(full)s %(monty)s!' |
| 1283 | >>> parser['phrases']['british'] |
| 1284 | 'who metal jacket python!' |
| 1285 | >>> 'british' in section |
| 1286 | True |
| 1287 | |
| 1288 | The new API is implemented on top of the classical API e.g. custom parser |
| 1289 | subclasses should be able to use it without modifications. |
| 1290 | |
| 1291 | The INI file structure accepted by config parsers can now be customized. Users |
| 1292 | are able to specify alternative option/value delimiters and comment prefixes, |
| 1293 | change the name of the DEFAULT section or switch the interpolation syntax. |
| 1294 | Along with support for pluggable interpolation, an additional buildout-like |
| 1295 | interpolation handler (ExtendedInterpolation) was introduced:: |
| 1296 | |
| 1297 | >>> parser = ConfigParser(interpolation=ExtendedInterpolation()) |
| 1298 | >>> parser.read_dict({'buildout': {'directory': '/home/ambv/zope9'}, |
| 1299 | ... 'custom': {'prefix': '/usr/local'}}) |
| 1300 | >>> parser.read_string(""" |
| 1301 | ... [buildout] |
| 1302 | ... parts = |
| 1303 | ... zope9 |
| 1304 | ... instance |
| 1305 | ... find-links = |
| 1306 | ... ${buildout:directory}/downloads/dist |
| 1307 | ... |
| 1308 | ... [zope9] |
| 1309 | ... recipe = plone.recipe.zope9install |
| 1310 | ... location = /opt/zope |
| 1311 | ... |
| 1312 | ... [instance] |
| 1313 | ... recipe = plone.recipe.zope9instance |
| 1314 | ... zope9-location = ${zope9:location} |
| 1315 | ... zope-conf = ${custom:prefix}/etc/zope.conf |
| 1316 | ... """) |
| 1317 | >>> parser['buildout']['find-links'] |
| 1318 | '\n/home/ambv/zope9/downloads/dist' |
| 1319 | >>> parser['instance']['zope-conf'] |
| 1320 | '/usr/local/etc/zope.conf' |
| 1321 | >>> instance = parser['instance'] |
| 1322 | >>> instance['zope-conf'] |
| 1323 | '/usr/local/etc/zope.conf' |
| 1324 | >>> instance['zope9-location'] |
| 1325 | '/opt/zope' |
| 1326 | |
| 1327 | A number of smaller features were also introduced, like support for specifying |
| 1328 | encoding in read operations, specifying fallback values in getters, or reading |
| 1329 | directly from dictionaries and strings. |
| 1330 | |
| 1331 | (All changes contributed by Łukasz Langa.) |
| 1332 | |
Raymond Hettinger | a55ffbc | 2010-12-15 18:31:57 +0000 | [diff] [blame] | 1333 | .. XXX: Mention urllib.parse changes |
| 1334 | Issue 9873 (Nick Coghlan): |
| 1335 | - ASCII byte sequence support in URL parsing |
| 1336 | - named tuple for urldefrag return value |
| 1337 | Issue 5468 (Dan Mahn) for urlencode: |
| 1338 | - bytes input support |
| 1339 | - non-UTF8 percent encoding of non-ASCII characters |
| 1340 | Issue 2987 for IPv6 (RFC2732) support in urlparse |
Raymond Hettinger | 202717d | 2010-12-16 10:06:11 +0000 | [diff] [blame] | 1341 | .. XXX: Any updates to the WSGI bytes versus text problem? |
Raymond Hettinger | a55ffbc | 2010-12-15 18:31:57 +0000 | [diff] [blame] | 1342 | |
Antoine Pitrou | d42bc51 | 2009-11-10 23:18:31 +0000 | [diff] [blame] | 1343 | Multi-threading |
| 1344 | =============== |
| 1345 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1346 | * The mechanism for serializing execution of concurrently running Python threads |
| 1347 | (generally known as the GIL or Global Interpreter Lock) has been rewritten. |
| 1348 | Among the objectives were more predictable switching intervals and reduced |
| 1349 | overhead due to lock contention and the number of ensuing system calls. The |
| 1350 | notion of a "check interval" to allow thread switches has been abandoned and |
| 1351 | replaced by an absolute duration expressed in seconds. This parameter is |
| 1352 | tunable through :func:`sys.setswitchinterval()`. It currently defaults to 5 |
| 1353 | milliseconds. |
Antoine Pitrou | d42bc51 | 2009-11-10 23:18:31 +0000 | [diff] [blame] | 1354 | |
| 1355 | Additional details about the implementation can be read from a `python-dev |
| 1356 | mailing-list message |
| 1357 | <http://mail.python.org/pipermail/python-dev/2009-October/093321.html>`_ |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1358 | (however, "priority requests" as exposed in this message have not been kept |
| 1359 | for inclusion). |
Antoine Pitrou | d42bc51 | 2009-11-10 23:18:31 +0000 | [diff] [blame] | 1360 | |
Georg Brandl | 5e73a81 | 2010-04-22 07:02:51 +0000 | [diff] [blame] | 1361 | (Contributed by Antoine Pitrou.) |
Antoine Pitrou | d42bc51 | 2009-11-10 23:18:31 +0000 | [diff] [blame] | 1362 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1363 | * Regular and recursive locks now accept an optional *timeout* argument to their |
Raymond Hettinger | 09e4ebb | 2010-09-06 19:55:51 +0000 | [diff] [blame] | 1364 | :meth:`acquire` method. (Contributed by Antoine Pitrou; :issue:`7316`.) |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1365 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1366 | * Similarly, :meth:`threading.Semaphore.acquire` also gained a *timeout* |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1367 | argument. (Contributed by Torsten Landschoff; :issue:`850728`.) |
Antoine Pitrou | e95a9ff | 2010-05-04 23:31:41 +0000 | [diff] [blame] | 1368 | |
Antoine Pitrou | 810023d | 2010-12-15 22:59:16 +0000 | [diff] [blame] | 1369 | * Regular and recursive lock acquisitions can now be interrupted by signals on |
| 1370 | platforms using pthreads. This means that Python programs that deadlock while |
| 1371 | acquiring locks can be successfully killed by repeatedly sending SIGINT to the |
Raymond Hettinger | 48f3bd3 | 2010-12-16 00:30:53 +0000 | [diff] [blame] | 1372 | process (by pressing Ctl+C in most shells). |
Antoine Pitrou | 810023d | 2010-12-15 22:59:16 +0000 | [diff] [blame] | 1373 | (Contributed by Reid Kleckner; :issue:`8844`.) |
| 1374 | |
Antoine Pitrou | d42bc51 | 2009-11-10 23:18:31 +0000 | [diff] [blame] | 1375 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 1376 | Optimizations |
| 1377 | ============= |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1378 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 1379 | A number of small performance enhancements have been added: |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1380 | |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1381 | * Python's peephole optimizer now recognizes patterns such ``x in {1, 2, 3}`` as |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 1382 | being a test for membership in a set of constants. The optimizer recasts the |
| 1383 | :class:`set` as a :class:`frozenset` and stores the pre-built constant. |
| 1384 | |
| 1385 | Now that the speed penalty is gone, it is practical to start writing |
| 1386 | membership tests using set-notation. This style is both semantically clear |
| 1387 | and operationally fast:: |
| 1388 | |
| 1389 | extension = name.rpartition('.')[2] |
| 1390 | if extension in {'xml', 'html', 'xhtml', 'css'}: |
| 1391 | handle(name) |
| 1392 | |
| 1393 | (Patch and additional tests by Dave Malcolm; :issue:`6690`). |
| 1394 | |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1395 | * Serializing and unserializing data using the :mod:`pickle` module is now |
Raymond Hettinger | dadf93c | 2010-12-05 02:56:21 +0000 | [diff] [blame] | 1396 | several times faster. |
| 1397 | |
| 1398 | (Contributed by Alexandre Vassalotti, Antoine Pitrou |
Antoine Pitrou | ff150f2 | 2010-10-22 21:41:05 +0000 | [diff] [blame] | 1399 | and the Unladen Swallow team in :issue:`9410` and :issue:`3873`.) |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1400 | |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 1401 | * The `Timsort algorithm <http://en.wikipedia.org/wiki/Timsort>`_ used in |
Raymond Hettinger | ffad35e | 2010-12-14 21:12:03 +0000 | [diff] [blame] | 1402 | :meth:`list.sort` and :func:`sorted` now runs faster and uses less memory |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 1403 | when called with a :term:`key function`. Previously, every element of |
| 1404 | a list was wrapped with a temporary object that remembered the key value |
| 1405 | associated with each element. Now, an array of keys and values are |
| 1406 | sorted in parallel. This save the memory consumed by the sort wrappers, |
| 1407 | and it saves time lost from during comparisons which where delegated |
| 1408 | by the sort wrappers. |
| 1409 | |
| 1410 | (Patch by Daniel Stuzback in :issue:`9915`.) |
| 1411 | |
Raymond Hettinger | dadf93c | 2010-12-05 02:56:21 +0000 | [diff] [blame] | 1412 | * JSON decoding performance is improved and memory consumption is reduced |
Raymond Hettinger | 413abbc | 2010-12-05 07:06:47 +0000 | [diff] [blame] | 1413 | whenever the same string is repeated for multiple keys. Also, JSON encoding |
Raymond Hettinger | dadf93c | 2010-12-05 02:56:21 +0000 | [diff] [blame] | 1414 | now uses the C speedups when the ``sort_keys`` argument is true. |
| 1415 | |
| 1416 | (Contributed by Antoine Pitrou in :issue:`7451` and by Raymond Hettinger and |
| 1417 | Antoine Pitrou in :issue:`10314`.) |
| 1418 | |
Raymond Hettinger | 21ec4bc | 2010-12-10 01:09:01 +0000 | [diff] [blame] | 1419 | * Recursive locks (created with the :func:`threading.RLock` API) now benefit |
| 1420 | from a C implementation which makes them as fast as regular locks, and between |
| 1421 | 10x and 15x faster than their previous pure Python implementation. |
| 1422 | |
| 1423 | (Contributed by Antoine Pitrou; :issue:`3001`.) |
| 1424 | |
Raymond Hettinger | dadf93c | 2010-12-05 02:56:21 +0000 | [diff] [blame] | 1425 | * The fast-search algorithm in stringlib is now used by the :meth:`split`, |
| 1426 | :meth:`rsplit`, :meth:`splitlines` and :meth:`replace` methods on |
| 1427 | :class:`bytes`, :class:`bytearray` and :class:`str` objects. Likewise, the |
| 1428 | algorithm is also used by :meth:`rfind`, :meth:`rindex`, :meth:`rsplit` and |
| 1429 | :meth:`rpartition`. |
| 1430 | |
| 1431 | (Patch by Florent Xicluna in :issue:`7622` and :issue:`7462`.) |
| 1432 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 1433 | |
| 1434 | * String to integer conversions now work two "digits" at a time, reducing the |
| 1435 | number of division and modulo operations. |
| 1436 | |
| 1437 | (:issue:`6713` by Gawain Bolton, Mark Dickinson, and Victor Stinner.) |
| 1438 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 1439 | There were several other minor optimizations. Set differencing now runs faster |
| 1440 | when one operand is much larger than the other (Patch by Andress Bennetts in |
| 1441 | :issue:`8685`). The :meth:`array.repeat` method has a faster implementation |
| 1442 | (:issue:`1569291` by Alexander Belopolsky). The :class:`BaseHTTPRequestHandler` |
| 1443 | has more efficient buffering (:issue:`3709` by Andrew Schaaf). The |
| 1444 | multi-argument form of :func:`operator.attrgetter` now function runs slightly |
| 1445 | faster (:issue:`10160` by Christos Georgiou). And :class:`ConfigParser` loads |
| 1446 | multi-line arguments a bit faster (:issue:`7113` by Łukasz Langa). |
| 1447 | |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1448 | |
Victor Stinner | 47ce965 | 2010-10-29 00:57:35 +0000 | [diff] [blame] | 1449 | Unicode |
| 1450 | ======= |
Victor Stinner | 94908bb | 2010-08-18 21:23:25 +0000 | [diff] [blame] | 1451 | |
Alexander Belopolsky | 507e3f8 | 2010-12-02 00:05:57 +0000 | [diff] [blame] | 1452 | Python has been updated to Unicode 6.0.0. The new features of the |
| 1453 | Unicode Standard that will affect Python users include: |
| 1454 | |
Alexander Belopolsky | 84cc062 | 2010-12-08 21:38:46 +0000 | [diff] [blame] | 1455 | * addition of 2,088 characters, including over 1,000 additional |
| 1456 | symbols—chief among them the additional emoji symbols, which are |
| 1457 | especially important for mobile phones; |
Alexander Belopolsky | 507e3f8 | 2010-12-02 00:05:57 +0000 | [diff] [blame] | 1458 | |
Alexander Belopolsky | 84cc062 | 2010-12-08 21:38:46 +0000 | [diff] [blame] | 1459 | * changes to character properties for existing characters including |
Alexander Belopolsky | 507e3f8 | 2010-12-02 00:05:57 +0000 | [diff] [blame] | 1460 | |
Raymond Hettinger | c74d518 | 2010-12-02 01:38:25 +0000 | [diff] [blame] | 1461 | - a general category change to two Kannada characters (U+0CF1, |
| 1462 | U+0CF2), which has the effect of making them newly eligible for |
| 1463 | inclusion in identifiers; |
| 1464 | |
| 1465 | - a general category change to one New Tai Lue numeric character |
Alexander Belopolsky | 84cc062 | 2010-12-08 21:38:46 +0000 | [diff] [blame] | 1466 | (U+19DA), which has the effect of disqualifying it from |
| 1467 | inclusion in identifiers. |
| 1468 | |
| 1469 | For more information, see `Unicode Character Database Changes |
| 1470 | <http://www.unicode.org/versions/Unicode6.0.0/#Database_Changes>`_ |
| 1471 | at the `Unicode Consortium <http://www.unicode.org/>`_ web site. |
Alexander Belopolsky | 507e3f8 | 2010-12-02 00:05:57 +0000 | [diff] [blame] | 1472 | |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 1473 | The :mod:`os` module has two new functions: :func:`~os.fsencode` and |
Victor Stinner | 47ce965 | 2010-10-29 00:57:35 +0000 | [diff] [blame] | 1474 | :func:`~os.fsdecode`. Add :data:`os.environb`: bytes version of |
| 1475 | :data:`os.environ`, :func:`os.getenvb` function and |
| 1476 | :data:`os.supports_bytes_environ` constant. |
Victor Stinner | e8d5145 | 2010-08-19 01:05:19 +0000 | [diff] [blame] | 1477 | |
Georg Brandl | 326c57d | 2010-11-26 12:10:06 +0000 | [diff] [blame] | 1478 | ``'mbcs'`` encoding doesn't ignore the error handler argument any more. By |
Victor Stinner | 47ce965 | 2010-10-29 00:57:35 +0000 | [diff] [blame] | 1479 | default (strict mode), it raises an UnicodeDecodeError on undecodable byte |
| 1480 | sequence and UnicodeEncodeError on unencodable character. To get the ``'mbcs'`` |
| 1481 | encoding of Python 3.1, use ``'ignore'`` error handler to decode and |
| 1482 | ``'replace'`` error handler to encode. ``'mbcs'`` supports ``'strict'`` and |
| 1483 | ``'ignore'`` error handlers for decoding, and ``'strict'`` and ``'replace'`` |
| 1484 | for encoding. |
| 1485 | |
| 1486 | On Mac OS X, Python uses ``'utf-8'`` to decode the command line arguments, |
| 1487 | instead of the locale encoding (which is ISO-8859-1 if the ``LANG`` environment |
| 1488 | variable is not set). |
| 1489 | |
| 1490 | By default, tarfile uses ``'utf-8'`` encoding on Windows (instead of |
| 1491 | ``'mbcs'``), and the ``'surrogateescape'`` error handler on all operating |
| 1492 | systems. |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1493 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 1494 | * Added the *cp720* Arabic DOS encoding (:issue:`1616979`). |
| 1495 | |
Victor Stinner | 94908bb | 2010-08-18 21:23:25 +0000 | [diff] [blame] | 1496 | |
Raymond Hettinger | 1fa7682 | 2010-12-06 23:31:36 +0000 | [diff] [blame] | 1497 | Documentation |
| 1498 | ============= |
| 1499 | |
| 1500 | The documentation continues to be improved. |
| 1501 | |
| 1502 | A table of quick links has been added to the top of lengthy sections such as |
| 1503 | :ref:`built-in-funcs`. In the case of :mod:`itertools`, the links are |
| 1504 | accompanied by tables of cheatsheet-style summaries to provide an overview and |
| 1505 | memory jog without having to read all of the docs. |
| 1506 | |
| 1507 | In some cases, the pure python source code can be helpful adjunct to the docs, |
| 1508 | so now some modules feature quick links to the latest version of the source |
| 1509 | code. For example, the :mod:`functools` module documentation has a quick link |
| 1510 | at the top labeled :source:`functools Python source code <Lib/functools.py>`. |
| 1511 | |
| 1512 | The docs now contain more examples and recipes. In particular, :mod:`re` module |
| 1513 | has an extensive section, :ref:`re-examples`. Likewise, the :mod:`itertools` |
| 1514 | module continues to be updated with new :ref:`itertools-recipes`. |
| 1515 | |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 1516 | The :mod:`datetime` module now has an auxiliary implementation in pure Python. |
| 1517 | No functionality was changed. This just provides an easier-to-read |
| 1518 | alternate implementation. (Contributed by Alexander Belopolsky.) |
| 1519 | |
Raymond Hettinger | 1fa7682 | 2010-12-06 23:31:36 +0000 | [diff] [blame] | 1520 | |
| 1521 | IDLE |
| 1522 | ==== |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1523 | |
Georg Brandl | cc9d237 | 2010-12-10 19:22:11 +0000 | [diff] [blame] | 1524 | * The format menu now has an option to clean-up source files by stripping |
| 1525 | trailing whitespace (:issue:`5150`). |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1526 | |
| 1527 | |
| 1528 | Build and C API Changes |
| 1529 | ======================= |
| 1530 | |
| 1531 | Changes to Python's build process and to the C API include: |
| 1532 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1533 | * The C functions that access the Unicode Database now accept and return |
| 1534 | characters from the full Unicode range, even on narrow unicode builds |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 1535 | (Py_UNICODE_TOLOWER, Py_UNICODE_ISDECIMAL, and others). A visible difference |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1536 | in Python is that :func:`unicodedata.numeric` now returns the correct value |
| 1537 | for large code points, and :func:`repr` may consider more characters as |
| 1538 | printable. |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1539 | |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 1540 | (Reported by Bupjoe Lee and fixed by Amaury Forgeot D'Arc; :issue:`5127`.) |
| 1541 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1542 | * Computed gotos are now enabled by default on supported compilers (which are |
Raymond Hettinger | db9044e | 2010-09-06 01:29:23 +0000 | [diff] [blame] | 1543 | detected by the configure script). They can still be disabled selectively by |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1544 | specifying ``--without-computed-gotos``. |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 1545 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1546 | (Contributed by Antoine Pitrou; :issue:`9203`.) |
| 1547 | |
Amaury Forgeot d'Arc | feb7307 | 2010-09-12 22:42:57 +0000 | [diff] [blame] | 1548 | * The option ``--with-wctype-functions`` was removed. The built-in unicode |
| 1549 | database is now used for all functions. |
| 1550 | |
| 1551 | (Contributed by Amaury Forgeot D'Arc; :issue:`9210`.) |
| 1552 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 1553 | * Hash values are now values of a new type, :c:type:`Py_hash_t`, which is |
| 1554 | defined to be the same size as a pointer. Previously they were of type long, |
| 1555 | which on some 64-bit operating systems is still only 32 bits long. As a |
| 1556 | result of this fix, :class:`set` and :class:`dict` can now hold more than |
| 1557 | ``2**32`` entries on builds with 64-bit pointers (previously, they could grow |
| 1558 | to that size but their performance degraded catastrophically). |
Skip Montanaro | 961aaf5 | 2010-10-17 22:22:24 +0000 | [diff] [blame] | 1559 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 1560 | (Suggested by Raymond Hettinger and implemented by Benjamin Peterson; |
| 1561 | :issue:`9778`.) |
| 1562 | |
| 1563 | * A new macro :c:macro:`Py_VA_COPY` copies the state of the variable argument |
| 1564 | list. It is equivalent to C99 *va_copy* but available on all python platforms |
| 1565 | (:issue:`2443`). |
| 1566 | |
Raymond Hettinger | bb9686f | 2010-12-16 00:53:05 +0000 | [diff] [blame] | 1567 | * A new C API function :c:func:`PySys_SetArgvEx` allows an embedded |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 1568 | interpreter to set sys.argv without also modifying :attr:`sys.path` |
| 1569 | (:issue:`5753`). |
| 1570 | |
| 1571 | * :c:macro:`PyEval_CallObject` is now only available in macro form. The |
| 1572 | function declaration, which was kept for backwards compatibility reasons, is |
| 1573 | now removed -- the macro was introduced in 1997 (:issue:`8276`). |
| 1574 | |
| 1575 | * The is a new function :c:func:`PyLong_AsLongLongAndOverflow` which |
| 1576 | is analogous to :c:func:`PyLong_AsLongAndOverflow`. The both serve to |
| 1577 | convert Python :class:`int` into a native fixed-width type while providing |
| 1578 | detection of cases where the conversion won't fit (:issue:`7767`). |
| 1579 | |
| 1580 | * The :c:func:`PyUnicode_CompareWithASCIIString` now returns *not equal* |
| 1581 | if the Python string in *NUL* terminated. |
| 1582 | |
| 1583 | * There is a new function :c:func:`PyErr_NewExceptionWithDoc` that is |
| 1584 | like :c:func:`PyErr_NewException` but allows a docstring to be specified. |
| 1585 | This lets C exceptions have the same self-documenting capabilities as |
| 1586 | their pure Python counterparts (:issue:`7033`). |
| 1587 | |
| 1588 | * When compiled with the ``--with-valgrind`` option, the pymalloc |
| 1589 | allocator will be automatically disabled when running under Valgrind. This |
| 1590 | gives improved memory leak detection when running under Valgrind, while taking |
| 1591 | advantage of pymalloc at other times (:issue:`2422`). |
| 1592 | |
| 1593 | * Removed the "O?" format from the *PyArg_Parse* functions. The format is no |
| 1594 | longer used and it had never been documented (:issue:`8837`). |
| 1595 | |
| 1596 | There were a number of other small changes to the C-API. See the |
| 1597 | :file:`Misc/NEWS` file for a complete list. |
Skip Montanaro | 961aaf5 | 2010-10-17 22:22:24 +0000 | [diff] [blame] | 1598 | |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1599 | |
Raymond Hettinger | f558ddd | 2009-06-28 21:37:08 +0000 | [diff] [blame] | 1600 | Porting to Python 3.2 |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1601 | ===================== |
| 1602 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1603 | This section lists previously described changes and other bugfixes that may |
| 1604 | require changes to your code: |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1605 | |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame^] | 1606 | * The :mod:`configparser` class :class:`SafeConfigParser` has been updated and |
| 1607 | renamed to :class:`ConfigParser` whereas the old :class:`ConfigParser` class |
| 1608 | has been removed. This means a couple of minor incompatibilities: |
| 1609 | |
| 1610 | * interpolation syntax is now validated on :meth:`get` and :meth:`set` |
| 1611 | operations. In the default interpolation scheme, only two tokens with |
| 1612 | percent signs are valid: %(name)s and %%, the latter being an escaped |
| 1613 | percent sign. If that is not welcome, consider using |
| 1614 | :class:`ExtendedInterpolation` or none at all. |
| 1615 | |
| 1616 | * :meth:`set` and :meth:`add_section` now check whether the given value type |
| 1617 | is a string. :mod:`configparser` was never designed to hold non-string |
| 1618 | values internally. |
| 1619 | |
| 1620 | * exception is raised on any section or option duplicates that appear when |
| 1621 | reading a single source. This exposes mistakes in user configuration. |
| 1622 | |
| 1623 | * inline comments are now disabled by default which means the ``;`` character |
| 1624 | can be safeuly used in values (``#`` was never allowed as inline comment). |
| 1625 | |
| 1626 | * comments now can be indented which means for ``;`` and ``#`` to appear at |
| 1627 | the start of a line in multiline values, it has to be interpolated. This is |
| 1628 | preferable because in INI files a character that is also a comment prefix |
| 1629 | cannot be taken for a comment by mistake. |
| 1630 | |
| 1631 | * ``""`` is now a valid value, no longer automatically converted to an empty |
| 1632 | string. For empty strings users can use ``"option ="`` in a line. |
| 1633 | |
Antoine Pitrou | cd889af | 2010-10-06 21:13:56 +0000 | [diff] [blame] | 1634 | * The :mod:`nntplib` module was reworked extensively, meaning that its APIs |
| 1635 | are often incompatible with the 3.1 APIs. |
| 1636 | |
Raymond Hettinger | 1fa7682 | 2010-12-06 23:31:36 +0000 | [diff] [blame] | 1637 | * :class:`bytearray` objects can no longer be used as filenames; instead, |
| 1638 | they should be converted to :class:`bytes`. |
Victor Stinner | dcb2403 | 2010-04-22 12:08:36 +0000 | [diff] [blame] | 1639 | |
Victor Stinner | 25e8ec4 | 2010-06-25 00:02:38 +0000 | [diff] [blame] | 1640 | * PyArg_Parse*() functions: |
Victor Stinner | 3dcb5ac | 2010-06-08 22:54:19 +0000 | [diff] [blame] | 1641 | |
Victor Stinner | 25e8ec4 | 2010-06-25 00:02:38 +0000 | [diff] [blame] | 1642 | * "t#" format has been removed: use "s#" or "s*" instead |
| 1643 | * "w" and "w#" formats has been removed: use "w*" instead |
| 1644 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 1645 | * The :c:type:`PyCObject` type, deprecated in 3.1, has been removed. To wrap |
| 1646 | opaque C pointers in Python objects, the :c:type:`PyCapsule` API should be used |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 1647 | instead; the new type has a well-defined interface for passing typing safety |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1648 | information and a less complicated signature for calling a destructor. |
Victor Stinner | 0cbec57 | 2010-09-12 20:32:57 +0000 | [diff] [blame] | 1649 | |
Raymond Hettinger | e0a9600 | 2010-12-15 17:54:13 +0000 | [diff] [blame] | 1650 | * The :func:`sys.setfilesystemencoding` function was removed because |
| 1651 | it had a flawed design. |
Raymond Hettinger | 3fcf002 | 2010-12-08 01:13:53 +0000 | [diff] [blame] | 1652 | |
Raymond Hettinger | e0a9600 | 2010-12-15 17:54:13 +0000 | [diff] [blame] | 1653 | * The :func:`random.seed` function and method now salt string seeds with an |
| 1654 | sha512 hash function. To access the previous version of *seed* in order to |
| 1655 | reproduce Python 3.1 sequences, set the *version* argument to *1*, |
| 1656 | ``random.seed(s, version=1)``. |
Raymond Hettinger | 21ec4bc | 2010-12-10 01:09:01 +0000 | [diff] [blame] | 1657 | |
Raymond Hettinger | 522cc0a | 2010-12-10 01:19:15 +0000 | [diff] [blame] | 1658 | * The previously deprecated :func:`string.maketrans` function has been removed |
| 1659 | in favor of the static methods, :meth:`bytes.maketrans` and |
| 1660 | :meth:`bytearray.maketrans`. This change solves the confusion around which |
| 1661 | types were supported by the :mod:`string` module. Now, :class:`str`, |
| 1662 | :class:`bytes`, and :class:`bytearray` each have their own **maketrans** and |
| 1663 | **translate** methods with intermediate translation tables of the appropriate |
| 1664 | type. |
| 1665 | |
| 1666 | (Contributed by Georg Brandl; :issue:`5675`.) |
| 1667 | |
| 1668 | * The previously deprecated :func:`contextlib.nested` function has been removed |
| 1669 | in favor of a plain :keyword:`with` statement which can accept multiple |
| 1670 | context managers. The latter technique is faster (because it is built-in), |
| 1671 | and it does a better job finalizing multiple context managers when one of them |
| 1672 | raises an exception:: |
| 1673 | |
| 1674 | >>> with open('mylog.txt') as infile, open('a.out', 'w') as outfile: |
| 1675 | ... for line in infile: |
| 1676 | ... if '<critical>' in line: |
| 1677 | ... outfile.write(line) |
| 1678 | |
| 1679 | (Contributed by Georg Brandl and Mattias Brändström; |
| 1680 | `appspot issue 53094 <http://codereview.appspot.com/53094>`_.) |