Raymond Hettinger | d73be67 | 2010-12-18 10:48:26 +0000 | [diff] [blame] | 1 | **************************** |
Raymond Hettinger | f558ddd | 2009-06-28 21:37:08 +0000 | [diff] [blame] | 2 | What's New In Python 3.2 |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 3 | **************************** |
| 4 | |
| 5 | :Author: Raymond Hettinger |
| 6 | :Release: |release| |
| 7 | :Date: |today| |
| 8 | |
| 9 | .. $Id$ |
| 10 | Rules for maintenance: |
| 11 | |
| 12 | * Anyone can add text to this document. Do not spend very much time |
| 13 | on the wording of your changes, because your text will probably |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 14 | get rewritten. |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 15 | |
| 16 | * The maintainer will go through Misc/NEWS periodically and add |
| 17 | changes; it's therefore more important to add your changes to |
| 18 | Misc/NEWS than to this file. |
| 19 | |
| 20 | * This is not a complete list of every single change; completeness |
| 21 | is the purpose of Misc/NEWS. Some changes I consider too small |
| 22 | or esoteric to include. If such a change is added to the text, |
| 23 | I'll just remove it. (This is another reason you shouldn't spend |
| 24 | too much time on writing your addition.) |
| 25 | |
| 26 | * If you want to draw your new text to the attention of the |
| 27 | maintainer, add 'XXX' to the beginning of the paragraph or |
| 28 | section. |
| 29 | |
| 30 | * It's OK to just add a fragmentary note about a change. For |
| 31 | example: "XXX Describe the transmogrify() function added to the |
| 32 | socket module." The maintainer will research the change and |
| 33 | write the necessary text. |
| 34 | |
| 35 | * You can comment out your additions if you like, but it's not |
| 36 | necessary (especially when a final release is some months away). |
| 37 | |
| 38 | * Credit the author of a patch or bugfix. Just the name is |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 39 | sufficient; the e-mail address isn't necessary. It's helpful to |
| 40 | add the issue number: |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 41 | |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 42 | XXX Describe the transmogrify() function added to the socket |
| 43 | module. |
| 44 | |
| 45 | (Contributed by P.Y. Developer; :issue:`12345`.) |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 46 | |
| 47 | This saves the maintainer the effort of going through the SVN log |
| 48 | when researching a change. |
| 49 | |
Raymond Hettinger | ffad35e | 2010-12-14 21:12:03 +0000 | [diff] [blame] | 50 | This article explains the new features in Python 3.2 as compared to 3.1. It |
| 51 | focuses on a few highlights and gives a few examples. For full details, see the |
| 52 | :source:`Misc/NEWS <Misc/NEWS>` file. |
Raymond Hettinger | 2c1ecc3 | 2010-12-07 09:55:02 +0000 | [diff] [blame] | 53 | |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 54 | |
Martin v. Löwis | 932e49e | 2010-12-04 13:49:32 +0000 | [diff] [blame] | 55 | PEP 384: Defining a Stable ABI |
Martin v. Löwis | 4d0d471 | 2010-12-03 20:14:31 +0000 | [diff] [blame] | 56 | ============================== |
| 57 | |
| 58 | In the past, extension modules built for one Python version were often |
| 59 | not usable with other Python versions. Particularly on Windows, every |
| 60 | feature release of Python required rebuilding all extension modules that |
| 61 | one wanted to use. This requirement was the result of the free access to |
| 62 | Python interpreter internals that extension modules could use. |
| 63 | |
| 64 | With Python 3.2, an alternative approach becomes available: extension |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 65 | modules which restrict themselves to a limited API (by defining |
Martin v. Löwis | 4d0d471 | 2010-12-03 20:14:31 +0000 | [diff] [blame] | 66 | Py_LIMITED_API) cannot use many of the internals, but are constrained |
| 67 | to a set of API functions that are promised to be stable for several |
| 68 | releases. As a consequence, extension modules built for 3.2 in that |
| 69 | mode will also work with 3.3, 3.4, and so on. Extension modules that |
| 70 | make use of details of memory structures can still be built, but will |
| 71 | need to be recompiled for every feature release. |
| 72 | |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 73 | .. seealso:: |
| 74 | |
Georg Brandl | 65b2eb9 | 2010-12-05 11:42:38 +0000 | [diff] [blame] | 75 | :pep:`384` - Defining a Stable ABI |
Raymond Hettinger | 2c1ecc3 | 2010-12-07 09:55:02 +0000 | [diff] [blame] | 76 | PEP written by Martin von Löwis. |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 77 | |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 78 | PEP 389: Argparse Command Line Parsing Module |
| 79 | ============================================= |
| 80 | |
| 81 | A new module for command line parsing, :mod:`argparse`, was introduced to |
| 82 | overcome the limitations of :mod:`optparse` which did not provide support for |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 83 | positional arguments (not just options), subcommands, required options and other |
Raymond Hettinger | 413abbc | 2010-12-05 07:06:47 +0000 | [diff] [blame] | 84 | common patterns of specifying and validating options. |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 85 | |
| 86 | This module has already has wide-spread success in the community as a |
Raymond Hettinger | b1ff402 | 2010-12-08 11:19:45 +0000 | [diff] [blame] | 87 | third-party module. Being more fully featured than its predecessor, the |
| 88 | :mod:`argparse` module is now the preferred module for command-line processing. |
| 89 | The older module is still being kept available because of the substantial amount |
| 90 | of legacy code that depends on it. |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 91 | |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 92 | Here's an annotated example parser showing features like limiting results to a |
| 93 | set of choices, specifying a *metavar* in the help screen, validating that one |
Raymond Hettinger | 68f1e8d | 2010-12-07 09:24:30 +0000 | [diff] [blame] | 94 | or more positional arguments is present, and making a required option:: |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 95 | |
| 96 | import argparse |
| 97 | parser = argparse.ArgumentParser( |
| 98 | description = 'Manage servers', # main description for help |
| 99 | epilog = 'Tested on Solaris and Linux') # displayed after help |
| 100 | parser.add_argument('action', # argument name |
| 101 | choices = ['deploy', 'start', 'stop'], # one of four allowed values |
| 102 | help = 'action on each target') # help msg |
| 103 | parser.add_argument('targets', |
| 104 | metavar = 'HOSTNAME', # var name used in help msg |
| 105 | nargs = '+', # require 1 or more targets |
| 106 | help = 'url for target machines') # help msg explanation |
| 107 | parser.add_argument('-u', '--user', # -u or --user option |
| 108 | required = True, # make this a required argument |
| 109 | help = 'login as user') |
| 110 | |
| 111 | Example of calling the parser on a command string:: |
| 112 | |
| 113 | >>> cmd = 'deploy sneezy.example.com sleepy.example.com -u skycaptain' |
| 114 | >>> result = parser.parse_args(cmd.split()) |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 115 | >>> result.action |
| 116 | 'deploy' |
| 117 | >>> result.targets |
| 118 | ['sneezy.example.com', 'sleepy.example.com'] |
| 119 | >>> result.user |
| 120 | 'skycaptain' |
| 121 | |
| 122 | Example of the parser's automatically generated help:: |
| 123 | |
| 124 | >>> parser.parse_args('-h'.split()) |
| 125 | |
Raymond Hettinger | 3fcf002 | 2010-12-08 01:13:53 +0000 | [diff] [blame] | 126 | usage: manage_cloud.py [-h] -u USER |
| 127 | {deploy,start,stop} HOSTNAME [HOSTNAME ...] |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 128 | |
| 129 | Manage servers |
| 130 | |
| 131 | positional arguments: |
| 132 | {deploy,start,stop} action on each target |
| 133 | HOSTNAME url for target machines |
| 134 | |
| 135 | optional arguments: |
| 136 | -h, --help show this help message and exit |
| 137 | -u USER, --user USER login as user |
| 138 | |
| 139 | Tested on Solaris and Linux |
| 140 | |
Raymond Hettinger | b1ff402 | 2010-12-08 11:19:45 +0000 | [diff] [blame] | 141 | An especially nice :mod:`argparse` feature is the ability to define subparsers, |
| 142 | each with their own argument patterns and help displays:: |
| 143 | |
| 144 | import argparse |
| 145 | parser = argparse.ArgumentParser(prog='HELM') |
| 146 | subparsers = parser.add_subparsers() |
| 147 | |
| 148 | parser_l = subparsers.add_parser('launch', help='Launch Control') # first subgroup |
Raymond Hettinger | bb9686f | 2010-12-16 00:53:05 +0000 | [diff] [blame] | 149 | parser_l.add_argument('-m', '--missiles', action='store_true') |
Raymond Hettinger | b1ff402 | 2010-12-08 11:19:45 +0000 | [diff] [blame] | 150 | parser_l.add_argument('-t', '--torpedos', action='store_true') |
| 151 | |
Raymond Hettinger | 3094ed8 | 2010-12-18 09:41:32 +0000 | [diff] [blame] | 152 | parser_m = subparsers.add_parser('move', help='Move Vessel', # second subgroup |
| 153 | aliases=('steer', 'turn')) # equivalent names |
Raymond Hettinger | b1ff402 | 2010-12-08 11:19:45 +0000 | [diff] [blame] | 154 | parser_m.add_argument('-c', '--course', type=int, required=True) |
| 155 | parser_m.add_argument('-s', '--speed', type=int, default=0) |
| 156 | |
| 157 | $ ./helm.py --help # top level help (launch and move) |
| 158 | $ ./helm.py launch --help # help for launch options |
| 159 | $ ./helm.py launch --missiles # set missiles=True and torpedos=False |
Raymond Hettinger | 3094ed8 | 2010-12-18 09:41:32 +0000 | [diff] [blame] | 160 | $ ./helm.py steer --course 180 --speed 5 # set movement parameters |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 161 | |
| 162 | .. seealso:: |
| 163 | |
| 164 | :pep:`389` - New Command Line Parsing Module |
| 165 | PEP written by Steven Bethard. |
| 166 | |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 167 | :ref:`upgrading-optparse-code` for details on the differences from |
| 168 | :mod:`optparse`. |
| 169 | |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 170 | |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 171 | PEP 391: Dictionary Based Configuration for Logging |
| 172 | ==================================================== |
Raymond Hettinger | ef2335c | 2010-09-05 08:35:38 +0000 | [diff] [blame] | 173 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 174 | The :mod:`logging` module provided two kinds of configuration, one style with |
| 175 | function calls for each option or another style driven by an external file saved |
| 176 | in a :mod:`ConfigParser` format. Those options did not provide the flexibility |
Georg Brandl | 9e75cad | 2010-09-06 06:45:47 +0000 | [diff] [blame] | 177 | to create configurations from JSON or YAML files, nor did they support |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 178 | incremental configuration, which is needed for specifying logger options from a |
| 179 | command line. |
Raymond Hettinger | ef2335c | 2010-09-05 08:35:38 +0000 | [diff] [blame] | 180 | |
| 181 | To support a more flexible style, the module now offers |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 182 | :func:`logging.config.dictConfig` for specifying logging configuration with |
| 183 | plain Python dictionaries. The configuration options include formatters, |
| 184 | handlers, filters, and loggers. Here's a working example of a configuration |
| 185 | dictionary:: |
Raymond Hettinger | ef2335c | 2010-09-05 08:35:38 +0000 | [diff] [blame] | 186 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 187 | {"version": 1, |
| 188 | "formatters": {"brief": {"format": "%(levelname)-8s: %(name)-15s: %(message)s"}, |
| 189 | "full": {"format": "%(asctime)s %(name)-15s %(levelname)-8s %(message)s"}, |
| 190 | }, |
| 191 | "handlers": {"console": { |
| 192 | "class": "logging.StreamHandler", |
| 193 | "formatter": "brief", |
| 194 | "level": "INFO", |
| 195 | "stream": "ext://sys.stdout"}, |
| 196 | "console_priority": { |
| 197 | "class": "logging.StreamHandler", |
| 198 | "formatter": "full", |
| 199 | "level": "ERROR", |
| 200 | "stream": "ext://sys.stderr"}, |
| 201 | }, |
| 202 | "root": {"level": "DEBUG", "handlers": ["console", "console_priority"]}} |
Raymond Hettinger | ef2335c | 2010-09-05 08:35:38 +0000 | [diff] [blame] | 203 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 204 | |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 205 | If that dictionary is stored in a file called :file:`conf.json`, it can loaded |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 206 | and called with code like this:: |
| 207 | |
| 208 | >>> import logging.config |
| 209 | >>> logging.config.dictConfig(json.load(open('conf.json', 'rb'))) |
| 210 | >>> logging.info("Transaction completed normally") |
| 211 | >>> logging.critical("Abnormal termination") |
| 212 | |
Raymond Hettinger | ef2335c | 2010-09-05 08:35:38 +0000 | [diff] [blame] | 213 | .. seealso:: |
| 214 | |
| 215 | :pep:`391` - Dictionary Based Configuration for Logging |
| 216 | PEP written by Vinay Sajip. |
| 217 | |
Georg Brandl | 97b20da | 2010-11-16 15:15:29 +0000 | [diff] [blame] | 218 | PEP 3148: The ``concurrent.futures`` module |
| 219 | ============================================ |
| 220 | |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 221 | Code for creating and managing concurrency is being collected in a new toplevel |
| 222 | namespace, *concurrent*. Its first member is a *futures* package which provides |
| 223 | a uniform high level interface for managing threads and processes. |
| 224 | |
| 225 | The design for :mod:`concurrent.futures` was inspired by |
| 226 | *java.util.concurrent.package*. In that model, a running call and its result |
| 227 | are represented by a :class:`~concurrent.futures.Future` object which abstracts |
| 228 | features common to threads, processes, and remote procedure calls. That object |
| 229 | supports status checks (running or done), timeouts, cancellations, adding |
Raymond Hettinger | 24a0941 | 2010-12-08 06:50:02 +0000 | [diff] [blame] | 230 | callbacks, and access to results or exceptions. |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 231 | |
| 232 | The primary offering of the new module is a pair of executor classes for |
| 233 | launching and managing calls. The goal of the executors is to make it easier to |
| 234 | use existing tools for making parallel calls. They save the effort needed to |
| 235 | setup a pool of resources, launch the calls, create a results queue, add |
| 236 | time-out handling, and limit the total number of threads, processes, or remote |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 237 | procedure calls. |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 238 | |
| 239 | Ideally, each application should share a single executor across multiple |
| 240 | components so that process and thread limits can be centrally managed. This |
| 241 | solves the design challenge that arises when each component has its own |
| 242 | competing strategy for resource management. |
| 243 | |
Raymond Hettinger | b105519 | 2010-12-08 06:42:41 +0000 | [diff] [blame] | 244 | Both classes share a common interface with three methods: |
| 245 | :meth:`~concurrent.futures.Executor.submit` for scheduling a callable and |
| 246 | returning a :class:`~concurrent.futures.Future` object; |
| 247 | :meth:`~concurrent.futures.Executor.map` for scheduling many asynchronous calls |
Raymond Hettinger | 83d8079 | 2010-12-08 06:48:33 +0000 | [diff] [blame] | 248 | at a time, and :meth:`~concurrent.futures.Executor.shutdown` for freeing |
| 249 | resources. The class is a :term:`context manager` and can be used within a |
| 250 | :keyword:`with` statement to assure that resources are automatically released |
| 251 | when currently pending futures are done executing. |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 252 | |
Raymond Hettinger | b105519 | 2010-12-08 06:42:41 +0000 | [diff] [blame] | 253 | A simple of example of :class:`~concurrent.futures.ThreadPoolExecutor` is a |
Raymond Hettinger | 83d8079 | 2010-12-08 06:48:33 +0000 | [diff] [blame] | 254 | launch of four parallel threads for copying files:: |
Raymond Hettinger | b105519 | 2010-12-08 06:42:41 +0000 | [diff] [blame] | 255 | |
| 256 | import shutil |
| 257 | with ThreadPoolExecutor(max_workers=4) as e: |
| 258 | e.submit(shutil.copy, 'src1.txt', 'dest1.txt') |
| 259 | e.submit(shutil.copy, 'src2.txt', 'dest2.txt') |
| 260 | e.submit(shutil.copy, 'src3.txt', 'dest3.txt') |
| 261 | e.submit(shutil.copy, 'src3.txt', 'dest4.txt') |
| 262 | |
Raymond Hettinger | 6f04adc | 2010-12-04 22:56:25 +0000 | [diff] [blame] | 263 | .. seealso:: |
| 264 | |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 265 | :pep:`3148` - Futures -- Execute Computations Asynchronously |
Andrew M. Kuchling | 42877fe | 2010-12-15 02:37:01 +0000 | [diff] [blame] | 266 | PEP written by Brian Quinlan. |
Georg Brandl | 97b20da | 2010-11-16 15:15:29 +0000 | [diff] [blame] | 267 | |
Raymond Hettinger | 83d8079 | 2010-12-08 06:48:33 +0000 | [diff] [blame] | 268 | :ref:`Code for Threaded Parallel URL reads<threadpoolexecutor-example>`, an |
| 269 | example using threads to fetch multiple web pages in parallel. |
| 270 | |
| 271 | :ref:`Code for computing prime numbers in |
| 272 | parallel<processpoolexecutor-example>`, an example demonstrating |
| 273 | :class:`~concurrent.futures.ProcessPoolExecutor`. |
| 274 | |
| 275 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 276 | |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 277 | PEP 3147: PYC Repository Directories |
| 278 | ===================================== |
| 279 | |
David Malcolm | 778645a | 2010-12-07 00:32:04 +0000 | [diff] [blame] | 280 | Python's scheme for caching bytecode in *.pyc* files did not work well in |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 281 | environments with multiple python interpreters. If one interpreter encountered |
| 282 | a cached file created by another interpreter, it would recompile the source and |
| 283 | overwrite the cached file, thus losing the benefits of caching. |
| 284 | |
| 285 | The issue of "pyc fights" has become more pronounced as it has become |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 286 | commonplace for Linux distributions to ship with multiple versions of Python. |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 287 | These conflicts also arise with CPython alternatives such as Unladen Swallow. |
| 288 | |
| 289 | To solve this problem, Python's import machinery has been extended to use |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 290 | distinct filenames for each interpreter. Instead of Python 3.2 and Python 3.3 and |
| 291 | Unladen Swallow each competing for a file called "mymodule.pyc", they will now |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 292 | look for "mymodule.cpython-32.pyc", "mymodule.cpython-33.pyc", and |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 293 | "mymodule.unladen10.pyc". And to prevent all of these new files from |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 294 | cluttering source directories, the *pyc* files are now collected in a |
| 295 | "__pycache__" directory stored under the package directory. |
| 296 | |
| 297 | Aside from the filenames and target directories, the new scheme has a few |
| 298 | aspects that are visible to the programmer: |
| 299 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 300 | * Imported modules now have a :attr:`__cached__` attribute which stores the name |
| 301 | of the actual file that was imported: |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 302 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 303 | >>> import collections |
| 304 | >>> collections.__cached__ |
| 305 | 'c:/py32/lib/__pycache__/collections.cpython-32.pyc' |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 306 | |
| 307 | * The tag that is unique to each interpreter is accessible from the :mod:`imp` |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 308 | module: |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 309 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 310 | >>> import imp |
| 311 | >>> imp.get_tag() |
| 312 | 'cpython-32' |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 313 | |
| 314 | * Scripts that try to deduce source filename from the imported file now need to |
| 315 | be smarter. It is no longer sufficient to simply strip the "c" from a ".pyc" |
| 316 | filename. Instead, use the new functions in the :mod:`imp` module: |
| 317 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 318 | >>> imp.source_from_cache('c:/py32/lib/__pycache__/collections.cpython-32.pyc') |
| 319 | 'c:/py32/lib/collections.py' |
| 320 | >>> imp.cache_from_source('c:/py32/lib/collections.py') |
| 321 | 'c:/py32/lib/__pycache__/collections.cpython-32.pyc' |
Raymond Hettinger | f95b199 | 2010-09-04 23:53:24 +0000 | [diff] [blame] | 322 | |
| 323 | * The :mod:`py_compile` and :mod:`compileall` modules have been updated to |
| 324 | reflect the new naming convention and target directory. |
| 325 | |
| 326 | .. seealso:: |
| 327 | |
| 328 | :pep:`3147` - PYC Repository Directories |
| 329 | PEP written by Barry Warsaw. |
| 330 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 331 | |
Georg Brandl | 3ad4675 | 2010-12-05 07:59:29 +0000 | [diff] [blame] | 332 | PEP 3149: ABI Version Tagged .so Files |
| 333 | ====================================== |
Georg Brandl | f11c6c4 | 2010-09-03 22:20:58 +0000 | [diff] [blame] | 334 | |
Raymond Hettinger | ebea6fa | 2010-09-05 00:27:25 +0000 | [diff] [blame] | 335 | The PYC repository directory allows multiple bytecode cache files to be |
| 336 | co-located. This PEP implements a similar mechanism for shared object files by |
| 337 | giving them a common directory and distinct names for each version. |
Georg Brandl | f11c6c4 | 2010-09-03 22:20:58 +0000 | [diff] [blame] | 338 | |
Raymond Hettinger | ebea6fa | 2010-09-05 00:27:25 +0000 | [diff] [blame] | 339 | The common directory is "pyshared" and the file names are made distinct by |
| 340 | identifying the Python implementation (such as CPython, PyPy, Jython, etc.), the |
| 341 | major and minor version numbers, and optional build flags (such as "d" for |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 342 | debug, "m" for pymalloc, "u" for wide-unicode). For an arbitrary package "foo", |
Raymond Hettinger | ebea6fa | 2010-09-05 00:27:25 +0000 | [diff] [blame] | 343 | you may see these files when the distribution package is installed:: |
| 344 | |
| 345 | /usr/share/pyshared/foo.cpython-32m.so |
| 346 | /usr/share/pyshared/foo.cpython-33md.so |
| 347 | |
| 348 | In Python itself, the tags are accessible from functions in the :mod:`sysconfig` |
| 349 | module:: |
| 350 | |
| 351 | >>> import sysconfig |
| 352 | >>> sysconfig.get_config_var('SOABI') # find the version tag |
| 353 | 'cpython-32mu' |
| 354 | >>> sysconfig.get_config_var('SO') # find the full filename extension |
| 355 | 'cpython-32mu.so' |
| 356 | |
| 357 | .. seealso:: |
| 358 | |
| 359 | :pep:`3149` - ABI Version Tagged .so Files |
| 360 | PEP written by Barry Warsaw. |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 361 | |
| 362 | |
| 363 | Other Language Changes |
| 364 | ====================== |
| 365 | |
| 366 | Some smaller changes made to the core Python language are: |
| 367 | |
Raymond Hettinger | e5e1a98 | 2010-12-05 08:35:21 +0000 | [diff] [blame] | 368 | * String formatting for :func:`format` and :meth:`str.format` gained new |
| 369 | capabilities for the format character **#**. Previously, for integers in |
| 370 | binary, octal, or hexadecimal, it caused the output to be prefixed with '0b', |
| 371 | '0o', or '0x' respectively. Now it can also handle floats, complex, and |
| 372 | Decimal, causing the output to always have a decimal point even when no digits |
| 373 | follow it. |
Raymond Hettinger | e5e728b | 2010-12-05 06:35:16 +0000 | [diff] [blame] | 374 | |
| 375 | >>> format(20, '#o') |
| 376 | '0o24' |
| 377 | >>> format(12.34, '#5.0f') |
| 378 | ' 12.' |
| 379 | |
| 380 | (Suggested by Mark Dickinson and implemented by Eric Smith in :issue:`7094`.) |
Raymond Hettinger | 43b5a85 | 2010-12-05 04:04:21 +0000 | [diff] [blame] | 381 | |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 382 | * The interpreter can now be started with a quiet option, ``-q``, to suppress |
| 383 | the copyright and version information in an interactive mode. |
| 384 | |
| 385 | (Contributed by Marcin Wojdyr in issue:`1772833`). |
| 386 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 387 | * The :func:`hasattr` function used to catch and suppress any Exception. Now, |
| 388 | it only catches :exc:`AttributeError`. Under the hood, :func:`hasattr` works |
| 389 | by calling :func:`getattr` and throwing away the results. This is necessary |
| 390 | because dynamic attribute creation is possible using :meth:`__getattribute__` |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 391 | or :meth:`__getattr__`. If :func:`hasattr` were to just scan instance and class |
Éric Araujo | cc6aac6 | 2010-09-07 21:35:35 +0000 | [diff] [blame] | 392 | dictionaries it would miss the dynamic methods and make it difficult to |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 393 | implement proxy objects. |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 394 | |
Raymond Hettinger | a55ffbc | 2010-12-15 18:31:57 +0000 | [diff] [blame] | 395 | (Discovered by Yury Selivanov and fixed by Benjamin Peterson; :issue:`9666`.) |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 396 | |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 397 | * The :func:`str` of a float or complex number is now the same as its |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 398 | :func:`repr`. Previously, the :func:`str` form was shorter but that just |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 399 | caused confusion and is no longer needed now that the shortest possible |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 400 | :func:`repr` is displayed by default: |
Raymond Hettinger | bb734c6 | 2010-09-05 05:56:44 +0000 | [diff] [blame] | 401 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 402 | >>> repr(math.pi) |
| 403 | '3.141592653589793' |
| 404 | >>> str(math.pi) |
| 405 | '3.141592653589793' |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 406 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 407 | (Proposed and implemented by Mark Dickinson; :issue:`9337`.) |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 408 | |
Raymond Hettinger | 21ec4bc | 2010-12-10 01:09:01 +0000 | [diff] [blame] | 409 | * :class:`memoryview` objects now have a :meth:`~memoryview.release()` method |
| 410 | and they also now support the context manager protocol. This allows timely |
| 411 | release of any resources that were acquired when requesting a buffer from the |
| 412 | original object. |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 413 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 414 | >>> with memoryview(b'abcdefgh') as v: |
| 415 | ... print(v.tolist()) |
| 416 | ... |
| 417 | [97, 98, 99, 100, 101, 102, 103, 104] |
| 418 | |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 419 | (Added by Antoine Pitrou; :issue:`9757`.) |
| 420 | |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 421 | |
Amaury Forgeot d'Arc | ba117ef | 2010-09-10 21:39:53 +0000 | [diff] [blame] | 422 | * Previously it was illegal to delete a name from the local namespace if it |
| 423 | occurs as a free variable in a nested block:: |
| 424 | |
| 425 | >>> def outer(x): |
| 426 | ... def inner(): |
| 427 | ... return x |
| 428 | ... inner() |
| 429 | ... del x |
| 430 | |
| 431 | This is now allowed. Remember that the target of an :keyword:`except` clause |
| 432 | is cleared, so this code which used to work with Python 2.6, raised a |
| 433 | :exc:`SyntaxError` with Python 3.1 and now works again:: |
| 434 | |
| 435 | >>> def f(): |
| 436 | ... def print_error(): |
| 437 | ... print(e) |
| 438 | ... try: |
| 439 | ... something |
| 440 | ... except Exception as e: |
| 441 | ... print_error() |
| 442 | ... # implicit "del e" here |
| 443 | |
| 444 | (See :issue:`4617`.) |
| 445 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 446 | * The internal :c:type:`structsequence` tool now creates subclasses of tuple. |
| 447 | This means that C generated structures like those returned by :func:`os.stat`, |
| 448 | :func:`time.gmtime`, and :func:`sys.version_info` now work like a |
| 449 | :term:`named tuple` and are more interoperable with functions and methods that |
| 450 | expect a tuple as an argument. The is a big step forward in making the C |
| 451 | structures as flexible as their pure Python counterparts. |
| 452 | |
| 453 | (Suggested by Arfrever Frehtes Taifersar Arahesis and implemented |
| 454 | by Benjamin Peterson in :issue:`8413`.) |
| 455 | |
| 456 | * Warnings are now easier control. An :envvar:`PYTHONWARNINGS` environment |
| 457 | variable is now available as an alternative to using ``-W`` at the command |
| 458 | line. |
| 459 | |
| 460 | (Suggested by Barry Warsaw and implemented by Philip Jenvey in :issue:`7301`.) |
| 461 | |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 462 | * A new warning category, :exc:`ResourceWarning`, has been added. It is |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 463 | emitted when potential issues with resource consumption or cleanup |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 464 | are detected. It is silenced by default in normal release builds, but |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 465 | can be enabled through the means provided by the :mod:`warnings` |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 466 | module, or on the command line. |
| 467 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 468 | A :exc:`ResourceWarning` is issued at interpreter shutdown if the |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 469 | :data:`gc.garbage` list isn't empty. This is meant to make the programmer |
| 470 | aware that their code contains object finalization issues. |
| 471 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 472 | A :exc:`ResourceWarning` is also issued when a :term:`file object` is destroyed |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 473 | without having been explicitly closed. While the deallocator for such |
| 474 | object ensures it closes the underlying operating system resource |
| 475 | (usually, a file descriptor), the delay in deallocating the object could |
| 476 | produce various issues, especially under Windows. Here is an example |
| 477 | of enabling the warning from the command line:: |
| 478 | |
Raymond Hettinger | 673ccf2 | 2010-12-07 09:37:11 +0000 | [diff] [blame] | 479 | $ ./python -q -Wdefault |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 480 | >>> f = open("foo", "wb") |
| 481 | >>> del f |
| 482 | __main__:1: ResourceWarning: unclosed file <_io.BufferedWriter name='foo'> |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 483 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 484 | (Added by Antoine Pitrou and Georg Brandl in :issue:`10093` and :issue:`477863`.) |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 485 | |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 486 | * :class:`range` objects now support *index* and *count* methods. This is part |
| 487 | of an effort to make more objects fully implement the |
| 488 | :class:`collections.Sequence` :term:`abstract base class`. As a result, the |
| 489 | language will have a more uniform API. In addition, :class:`range` objects |
| 490 | now support slicing and negative indices. This makes *range* more |
Raymond Hettinger | 2ffa671 | 2010-12-08 10:18:21 +0000 | [diff] [blame] | 491 | interoperable with lists:: |
| 492 | |
| 493 | >>> range(0, 100, 2).count(10) |
| 494 | 1 |
| 495 | >>> range(0, 100, 2).index(10) |
| 496 | 5 |
| 497 | >>> range(0, 100, 2)[5] |
| 498 | 10 |
| 499 | >>> range(0, 100, 2)[0:5] |
| 500 | range(0, 10, 2) |
Raymond Hettinger | dadf93c | 2010-12-05 02:56:21 +0000 | [diff] [blame] | 501 | |
| 502 | (Contributed by Daniel Stuzback in :issue:`9213` and by Alexander Belopolsky |
| 503 | in :issue:`2690`.) |
Nick Coghlan | 37ee850 | 2010-12-03 14:26:13 +0000 | [diff] [blame] | 504 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 505 | * The :func:`callable` builtin function from Py2.x was resurrected. It provides |
Raymond Hettinger | b87ba26 | 2010-12-06 04:31:40 +0000 | [diff] [blame] | 506 | a concise, readable alternative to using an :term:`abstract base class` in an |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 507 | expression like ``isinstance(x, collections.Callable)``: |
| 508 | |
| 509 | >>> callable(max) |
| 510 | True |
| 511 | >>> callable(20) |
| 512 | False |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 513 | |
| 514 | (See :issue:`10518`.) |
Amaury Forgeot d'Arc | ba117ef | 2010-09-10 21:39:53 +0000 | [diff] [blame] | 515 | |
Raymond Hettinger | 070ec70 | 2010-12-10 17:45:13 +0000 | [diff] [blame] | 516 | * Python's import mechanism can now load module installed in directories with |
| 517 | non-ASCII characters in the path name. |
| 518 | |
| 519 | (Required extensive work by Victor Stinner in :issue:`9425`.) |
| 520 | |
| 521 | |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 522 | New, Improved, and Deprecated Modules |
| 523 | ===================================== |
| 524 | |
Raymond Hettinger | 99db3fd | 2010-12-15 19:33:49 +0000 | [diff] [blame] | 525 | Python's standard library has undergone significant maintenance efforts and |
| 526 | quality improvements. |
Raymond Hettinger | e434b3b | 2010-12-15 19:20:01 +0000 | [diff] [blame] | 527 | |
| 528 | The biggest news for Python 3.2 is that the :mod:`email` package and |
Raymond Hettinger | 99db3fd | 2010-12-15 19:33:49 +0000 | [diff] [blame] | 529 | :mod:`nntplib` modules now work correctly with the bytes/text model in Python 3. |
Raymond Hettinger | e434b3b | 2010-12-15 19:20:01 +0000 | [diff] [blame] | 530 | For the first time, there is correct handling of inputs with mixed encodings. |
| 531 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 532 | Throughout the standard library, there has been more careful attention to |
| 533 | encodings and text versus bytes issues. In particular, interactions with the |
| 534 | operating system are now better able to pass non-ASCII data using the Windows |
| 535 | mcbs encoding, locale aware encodings, or UTF-8. |
| 536 | |
Raymond Hettinger | e434b3b | 2010-12-15 19:20:01 +0000 | [diff] [blame] | 537 | Another significant win is the addition of substantially better support for |
| 538 | *SSL* connections and security certificates. |
| 539 | |
Raymond Hettinger | 99db3fd | 2010-12-15 19:33:49 +0000 | [diff] [blame] | 540 | In addition, more functions and classes now have a :term:`context manager` to |
| 541 | support convenient and reliable resource clean-up using the |
Raymond Hettinger | e434b3b | 2010-12-15 19:20:01 +0000 | [diff] [blame] | 542 | :keyword:`with`-statement. |
| 543 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 544 | email |
| 545 | ----- |
| 546 | |
| 547 | The usability of the :mod:`email` package in Python 3 has been mostly fixed by |
| 548 | the extensive efforts of R. David Murray. The problem was that emails are |
| 549 | typically read and stored in the form of :class:`bytes` rather than :class:`str` |
| 550 | text, and they may contain multiple encodings within a single email. So, the |
| 551 | email package had to be extended to parse and generate email messages in bytes |
| 552 | format. |
| 553 | |
| 554 | * New functions :func:`~email.message_from_bytes` and |
| 555 | :func:`~email.message_from_binary_file`, and new classes |
| 556 | :class:`~email.parser.BytesFeedParser` and :class:`~email.parser.BytesParser` |
| 557 | allow binary message data to be parsed into model objects. |
| 558 | |
| 559 | * Given bytes input to the model, :meth:`~email.message.Message.get_payload` |
| 560 | will by default decode a message body that has a |
| 561 | :mailheader:`Content-Transfer-Encoding` of *8bit* using the charset |
| 562 | specified in the MIME headers and return the resulting string. |
| 563 | |
| 564 | * Given bytes input to the model, :class:`~email.generator.Generator` will |
| 565 | convert message bodies that have a :mailheader:`Content-Transfer-Encoding` of |
| 566 | *8bit* to instead have a *7bit* :mailheader:`Content-Transfer-Encoding`. |
| 567 | |
| 568 | * A new class :class:`~email.generator.BytesGenerator` produces bytes as output, |
| 569 | preserving any unchanged non-ASCII data that was present in the input used to |
| 570 | build the model, including message bodies with a |
| 571 | :mailheader:`Content-Transfer-Encoding` of *8bit*. |
| 572 | |
| 573 | * The :mod:`smtplib` :class:`~smtplib.SMTP` class now accepts a byte string |
| 574 | for the *msg* argument to the :meth:`~smtplib.SMTP.sendmail` method, |
| 575 | and a new method, :meth:`~smtplib.SMTP.send_message` accepts a |
| 576 | :class:`~email.message.Message` object and can optionally obtain the |
| 577 | *from_addr* and *to_addrs* addresses directly from the object. |
| 578 | |
| 579 | .. XXX Update before 3.2rc1 to reflect all of the latest work and add examples. |
| 580 | |
| 581 | (Proposed and implemented by R. David Murray, :issue:`4661` and :issue:`10321`.) |
| 582 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 583 | elementtree |
| 584 | ----------- |
| 585 | |
| 586 | The :mod:`xml.etree.ElementTree` package and it's :mod:`xml.etree.cElementTree` |
| 587 | counterpart have been updated to version 1.3. |
| 588 | |
| 589 | Several new and useful functions and methods have been added: |
| 590 | |
| 591 | * :func:`xml.etree.ElementTree.fromstringlist` which builds an XML document |
| 592 | from a sequence of fragments |
| 593 | * :func:`xml.etree.ElementTree.register_namespace` for registering a global |
| 594 | namespace prefix |
| 595 | * :func:`xml.etree.ElementTree.tostringlist` for string representation |
| 596 | including all sublists |
| 597 | * :meth:`xml.etree.ElementTree.Element.extend` for appending a sequence of zero |
| 598 | or more elements |
| 599 | * :meth:`xml.etree.ElementTree.Element.iterfind` searches an element and |
| 600 | subelements |
| 601 | * :meth:`xml.etree.ElementTree.Element.itertext` creates a text iterator over |
| 602 | an element and its sub-elements |
| 603 | * :meth:`xml.etree.ElementTree.TreeBuilder.end` closes the current element |
| 604 | * :meth:`xml.etree.ElementTree.TreeBuilder.doctype` handles a doctype |
| 605 | declaration |
| 606 | |
| 607 | Two methods have been deprecated: |
| 608 | |
| 609 | * :meth:`xml.etree.ElementTree.getchildren` use ``list(elem)`` instead. |
| 610 | * :meth:`xml.etree.ElementTree.getiterator` use ``Element.iter`` instead. |
| 611 | |
| 612 | For details of the update, see `Introducing ElementTree |
| 613 | <http://effbot.org/zone/elementtree-13-intro.htm>`_ on Fredrik Lundh's website. |
| 614 | |
Antoine Pitrou | 12de8ac | 2010-12-16 13:33:56 +0000 | [diff] [blame] | 615 | (Contributed by Florent Xicluna and Fredrik Lundh, :issue:`6472`.) |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 616 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 617 | functools |
| 618 | --------- |
| 619 | |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 620 | * The :mod:`functools` module includes a new decorator for caching function |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 621 | calls. :func:`functools.lru_cache` can save repeated queries to an external |
| 622 | resource whenever the results are expected to be the same. |
Raymond Hettinger | aed05eb | 2010-08-02 01:43:41 +0000 | [diff] [blame] | 623 | |
Raymond Hettinger | 86f9613 | 2010-08-06 23:23:49 +0000 | [diff] [blame] | 624 | For example, adding a caching decorator to a database query function can save |
| 625 | database accesses for popular searches:: |
Raymond Hettinger | aed05eb | 2010-08-02 01:43:41 +0000 | [diff] [blame] | 626 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 627 | @functools.lru_cache(maxsize=300) |
| 628 | def get_phone_number(name): |
| 629 | c = conn.cursor() |
| 630 | c.execute('SELECT phonenumber FROM phonelist WHERE name=?', (name,)) |
| 631 | return c.fetchone()[0] |
Raymond Hettinger | aed05eb | 2010-08-02 01:43:41 +0000 | [diff] [blame] | 632 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 633 | >>> for name in user_requests: |
Raymond Hettinger | 7496b41 | 2010-11-30 19:15:45 +0000 | [diff] [blame] | 634 | ... get_phone_number(name) # cached lookup |
| 635 | |
| 636 | To help with choosing an effective cache size, the wrapped function is |
| 637 | instrumented for tracking cache statistics: |
| 638 | |
Raymond Hettinger | 5e20bab | 2010-11-30 07:13:04 +0000 | [diff] [blame] | 639 | >>> get_phone_number.cache_info() |
Raymond Hettinger | 7496b41 | 2010-11-30 19:15:45 +0000 | [diff] [blame] | 640 | CacheInfo(hits=4805, misses=980, maxsize=300, currsize=300) |
Raymond Hettinger | aed05eb | 2010-08-02 01:43:41 +0000 | [diff] [blame] | 641 | |
Raymond Hettinger | f309828 | 2010-08-15 03:30:45 +0000 | [diff] [blame] | 642 | If the phonelist table gets updated, the outdated contents of the cache can be |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 643 | cleared with: |
Raymond Hettinger | f309828 | 2010-08-15 03:30:45 +0000 | [diff] [blame] | 644 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 645 | >>> get_phone_number.cache_clear() |
Raymond Hettinger | f309828 | 2010-08-15 03:30:45 +0000 | [diff] [blame] | 646 | |
Raymond Hettinger | 6e35394 | 2010-12-04 23:42:12 +0000 | [diff] [blame] | 647 | (Contributed by Raymond Hettinger and incorporating design ideas from |
Raymond Hettinger | b87ba26 | 2010-12-06 04:31:40 +0000 | [diff] [blame] | 648 | Jim Baker, Miki Tebeka, and Nick Coghlan.) |
Raymond Hettinger | aed05eb | 2010-08-02 01:43:41 +0000 | [diff] [blame] | 649 | |
Antoine Pitrou | 7d49bc9 | 2010-09-15 15:13:17 +0000 | [diff] [blame] | 650 | * The :func:`functools.wraps` decorator now adds a :attr:`__wrapped__` attribute |
| 651 | pointing to the original callable function. This allows wrapped functions to |
| 652 | be introspected. It also copies :attr:`__annotations__` if defined. And now |
| 653 | it also gracefully skips over missing attributes such as :attr:`__doc__` which |
Raymond Hettinger | 5eb6390 | 2010-12-09 23:43:34 +0000 | [diff] [blame] | 654 | might not be defined for the wrapped callable. |
Antoine Pitrou | 7d49bc9 | 2010-09-15 15:13:17 +0000 | [diff] [blame] | 655 | |
| 656 | (By Nick Coghlan and Terrence Cole; :issue:`9567`, :issue:`3445`, and |
| 657 | :issue:`8814`.) |
| 658 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 659 | * To help write classes with rich comparison methods, a new decorator |
| 660 | :func:`functools.total_ordering` will use a existing equality and inequality |
| 661 | methods to fill-in the remaining methods. |
| 662 | |
| 663 | For example, supplying *__eq__* and *__lt__* will enable |
| 664 | :func:`~functools.total_ordering` to fill-in *__le__*, *__gt__* and *__ge__*:: |
| 665 | |
| 666 | @total_ordering |
| 667 | class Student: |
| 668 | def __eq__(self, other): |
| 669 | return ((self.lastname.lower(), self.firstname.lower()) == |
| 670 | (other.lastname.lower(), other.firstname.lower())) |
| 671 | def __lt__(self, other): |
| 672 | return ((self.lastname.lower(), self.firstname.lower()) < |
| 673 | (other.lastname.lower(), other.firstname.lower())) |
| 674 | |
| 675 | (Contributed by Raymond Hettinger.) |
| 676 | |
| 677 | * To aid in porting programs from Python 2, the :func:`~functools.cmp_to_key` |
Raymond Hettinger | bb9686f | 2010-12-16 00:53:05 +0000 | [diff] [blame] | 678 | function converts an old-style comparison function to |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 679 | modern :term:`key function`: |
| 680 | |
| 681 | >>> # locale-aware sort order |
| 682 | >>> sorted(iterable, key=cmp_to_key(locale.strcoll)) |
| 683 | |
| 684 | For sorting examples and a brief sorting tutorial, see the `Sorting HowTo |
| 685 | <http://wiki.python.org/moin/HowTo/Sorting/>`_ tutorial. |
| 686 | |
| 687 | (Contributed by Raymond Hettinger.) |
| 688 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 689 | itertools |
| 690 | --------- |
| 691 | |
Raymond Hettinger | 673ccf2 | 2010-12-07 09:37:11 +0000 | [diff] [blame] | 692 | * The :mod:`itertools` module has a new :func:`~itertools.accumulate` function |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 693 | modeled on APL's *scan* operator and on Numpy's *accumulate* function: |
Raymond Hettinger | 6e35394 | 2010-12-04 23:42:12 +0000 | [diff] [blame] | 694 | |
| 695 | >>> list(accumulate(8, 2, 50)) |
| 696 | [8, 10, 60] |
| 697 | |
| 698 | >>> prob_dist = [0.1, 0.4, 0.2, 0.3] |
| 699 | >>> list(accumulate(prob_dist)) # cumulative probability distribution |
| 700 | [0.1, 0.5, 0.7, 1.0] |
| 701 | |
| 702 | For an example using :func:`~itertools.accumulate`, see the :ref:`examples for |
| 703 | the random module <random-examples>`. |
| 704 | |
| 705 | (Contributed by Raymond Hettinger and incorporating design suggestions |
| 706 | from Mark Dickinson.) |
| 707 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 708 | collections |
| 709 | ----------- |
| 710 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 711 | * The :class:`collections.Counter` class now has two forms of in-place |
| 712 | subtraction, the existing *-=* operator for `saturating subtraction |
| 713 | <http://en.wikipedia.org/wiki/Saturation_arithmetic>`_ and the new |
| 714 | :meth:`~collections.Counter.subtract` method for regular subtraction. The |
| 715 | former is suitable for `multisets <http://en.wikipedia.org/wiki/Multiset>`_ |
Raymond Hettinger | ffad35e | 2010-12-14 21:12:03 +0000 | [diff] [blame] | 716 | which only have positive counts, and the latter is more suitable for use cases |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 717 | that allow negative counts: |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 718 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 719 | >>> tally = Counter(dogs=5, cat=3) |
| 720 | >>> tally -= Counter(dogs=2, cats=8) # saturating subtraction |
| 721 | >>> tally |
| 722 | Counter({'dogs': 3}) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 723 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 724 | >>> tally = Counter(dogs=5, cats=3) |
| 725 | >>> tally.subtract(dogs=2, cats=8) # regular subtraction |
| 726 | >>> tally |
| 727 | Counter({'dogs': 3, 'cats': -5}) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 728 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 729 | (Contributed by Raymond Hettinger.) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 730 | |
Raymond Hettinger | e0a9600 | 2010-12-15 17:54:13 +0000 | [diff] [blame] | 731 | * The :class:`collections.OrderedDict` class has a new method |
| 732 | :meth:`~collections.OrderedDict.move_to_end` which takes an existing key and |
| 733 | moves it to either the beginning or end of an ordered sequence. When the |
| 734 | dictionary sequence is being used as a queue, these operations correspond to |
| 735 | "move to the front of the line" or "move to the back of the line": |
| 736 | |
| 737 | >>> d = OrderedDict.fromkeys(['a', 'b', 'X', 'd', 'e']) |
| 738 | >>> list(d) |
| 739 | ['a', 'b', 'X', 'd', 'e'] |
| 740 | >>> d.move_to_end('X', last=True) |
| 741 | >>> list(d) |
| 742 | ['a', 'b', 'd', 'e', 'X'] |
| 743 | >>> d.move_to_end('X', last=False) |
| 744 | >>> list(d) |
| 745 | ['X', 'a', 'b', 'd', 'e'] |
| 746 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 747 | (Contributed by Raymond Hettinger.) |
| 748 | |
| 749 | * The :class:`collections.deque` grew two new methods :meth:`~collections.deque.count` |
| 750 | and :meth:`collections.deque.reverse` that make them more substitutable for |
| 751 | :class:`list` when needed: |
| 752 | |
| 753 | >>> d = deque('simsalabim') |
| 754 | >>> d.count('s') |
| 755 | 2 |
| 756 | >>> d.reverse() |
| 757 | >>> d |
| 758 | deque(['m', 'i', 'b', 'a', 'l', 'a', 's', 'm', 'i', 's']) |
| 759 | |
| 760 | (Contributed by Raymond Hettinger.) |
| 761 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 762 | datetime |
| 763 | -------- |
| 764 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 765 | * The :mod:`datetime` module has a new type :class:`~datetime.timezone` that |
| 766 | implements the :class:`~datetime.tzinfo` interface by returning a fixed UTC |
| 767 | offset and timezone name. This makes it easier to create timezone aware |
| 768 | datetime objects: |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 769 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 770 | >>> datetime.now(timezone.utc) |
| 771 | datetime.datetime(2010, 12, 8, 21, 4, 2, 923754, tzinfo=datetime.timezone.utc) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 772 | |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 773 | >>> datetime.strptime("01/01/2000 12:00 +0000", "%m/%d/%Y %H:%M %z") |
| 774 | datetime.datetime(2000, 1, 1, 12, 0, tzinfo=datetime.timezone.utc) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 775 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 776 | * Also, :class:`~datetime.timedelta` objects can now be multiplied by |
Raymond Hettinger | 792c076 | 2010-12-09 16:41:54 +0000 | [diff] [blame] | 777 | :class:`float` and divided by :class:`float` and :class:`int` objects. |
| 778 | |
| 779 | (Contributed by Alexander Belopolsky in :issue:`1289118`, :issue:`5094` and |
| 780 | :issue:`6641`.) |
Alexander Belopolsky | 7257231 | 2010-12-08 21:21:56 +0000 | [diff] [blame] | 781 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 782 | abc |
| 783 | --- |
Antoine Pitrou | 7d49bc9 | 2010-09-15 15:13:17 +0000 | [diff] [blame] | 784 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 785 | The :mod:`abc` module now supports :func:`~abc.abstractclassmethod` and |
| 786 | :func:`~abc.abstractstaticmethod`. |
Raymond Hettinger | a5a3554 | 2010-12-05 00:39:18 +0000 | [diff] [blame] | 787 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 788 | These tools make it possible to define an :term:`Abstract Base Class` that |
| 789 | requires a particular :func:`classmethod` or :func:`staticmethod` to be |
| 790 | implemented. |
Antoine Pitrou | 7d49bc9 | 2010-09-15 15:13:17 +0000 | [diff] [blame] | 791 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 792 | (Patch submitted by Daniel Urban; :issue:`5867`.) |
Raymond Hettinger | bcbd696 | 2010-09-05 08:46:36 +0000 | [diff] [blame] | 793 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 794 | contextlib |
| 795 | ---------- |
| 796 | |
| 797 | There is a new and slightly mind-blowing tool |
| 798 | :class:`~contextlib.ContextDecorator` that is helpful for creating a |
| 799 | :term:`context manager` that does double-duty as a function decorator. |
| 800 | |
| 801 | As a convenience, this new functionality is used by |
| 802 | :func:`~contextlib.contextmanager` so that no extra effort is needed to support |
| 803 | both roles. |
| 804 | |
| 805 | The basic idea is that both context managers and function decorators can be used |
| 806 | for pre-action and post-action wrappers. Context managers wrap a group of |
| 807 | statements using the :keyword:`with`-statement, and function decorators wrap a |
| 808 | group of statements enclosed in a function. So, occasionally there is a need to |
| 809 | write a pre/post action wrapper that can be used in either role. |
| 810 | |
| 811 | For example, it is sometimes useful to wrap functions or groups of statements |
| 812 | with a logger that can track the time of entry and time of exit. Rather than |
| 813 | writing both a function decorator and a context manager for the task, the |
| 814 | :func:`~contextlib.contextmanager` provides both capabilities in a single |
| 815 | definition: |
| 816 | |
| 817 | >>> import logging |
| 818 | >>> logging.basicConfig(level=logging.INFO) |
| 819 | >>> @contextmanager |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 820 | ... def track_entry_and_exit(name): |
| 821 | ... logging.info('Entering: {}'.format(name)) |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 822 | ... yield |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 823 | ... logging.info('Exiting: {}'.format(name)) |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 824 | |
| 825 | Formerly, this would have only been usable as a context manager: |
| 826 | |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 827 | >>> with track_entry_and_exit('widget loader'): |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 828 | ... print('Some time consuming activity goes here') |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 829 | ... load_widget() |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 830 | |
| 831 | Now, it can be used as a decorator as well: |
| 832 | |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 833 | >>> @track_entry_and_exit('widget loader') |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 834 | ... def activity(): |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 835 | ... print('Some time consuming activity goes here') |
| 836 | ... load_widget() |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 837 | |
| 838 | Trying to fulfill two roles at once places some limitations on the technique. |
| 839 | Context managers normally have the flexibility to return an argument usable by |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 840 | the :keyword:`with`-statement, but there is no parallel for function decorators. |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 841 | |
Raymond Hettinger | 9743e4f | 2010-12-16 02:24:12 +0000 | [diff] [blame] | 842 | In the above example, there is not a clean way for the *track_entry_and_exit* |
| 843 | context manager does not have a way to return a logging instance for use in the |
| 844 | body of enclosed statements. |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 845 | |
| 846 | (Contributed by Michael Foord in :issue:`9110`.) |
| 847 | |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 848 | decimal and fractions |
| 849 | --------------------- |
| 850 | |
| 851 | Mark Dickinson crafted an elegant and efficient scheme for assuring that |
| 852 | different numeric datatypes will have the same hash value whenever their actual |
| 853 | values are equal (:issue:`8188`):: |
| 854 | |
| 855 | >>> assert hash(Fraction(3, 2)) == hash(1.5) == \ |
| 856 | hash(Decimal("1.5")) == hash(complex(1.5, 0)) |
| 857 | |
| 858 | An early decision to limit the inter-operability of various numeric types has |
| 859 | been relaxed. It is still unsupported (and ill-advised) to to have implicit |
| 860 | mixing in arithmetic expressions such as ``Decimal('1.1') + float('1.1')`` |
| 861 | because the latter loses information in the process of constructing the binary |
| 862 | float. However, since existing floating point value can be converted losslessly |
| 863 | to either a decimal or rational representation, it makes sense to add them to |
| 864 | the constructor and to support mixed-type comparisons. |
| 865 | |
Raymond Hettinger | bb9686f | 2010-12-16 00:53:05 +0000 | [diff] [blame] | 866 | * The :class:`decimal.Decimal` constructor now accepts :class:`float` objects |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 867 | directly so there in no longer a need to use the :meth:`~decimal.Decimal.from_float` |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 868 | method (:issue:`8257`). |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 869 | |
| 870 | * Mixed type comparisons are now fully supported so that |
| 871 | :class:`~decimal.Decimal` objects can be directly compared with :class:`float` |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 872 | and :class:`fractions.Fraction` (:issue:`2531` and :issue:`8188`). |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 873 | |
| 874 | Similar changes were made to :class:`fractions.Fraction` so that the |
| 875 | :meth:`~fractions.Fraction.from_float()` and :meth:`~fractions.Fraction.from_decimal` |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 876 | methods are no longer needed (:issue:`8294`): |
| 877 | |
| 878 | >>> Decimal(1.1) |
| 879 | Decimal('1.100000000000000088817841970012523233890533447265625') |
| 880 | >>> Fraction(1.1) |
| 881 | Fraction(2476979795053773, 2251799813685248) |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 882 | |
| 883 | Another useful change for the :mod:`decimal` module is that the |
| 884 | :attr:`Context.clamp` attribute is now public. This is useful in creating |
| 885 | contexts that correspond to the decimal interchange formats specified in IEEE |
| 886 | 754 (see :issue:`8540`). |
| 887 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 888 | (Contributed by Mark Dickinson and Raymond Hettinger.) |
Raymond Hettinger | 07a605b | 2010-12-15 22:35:03 +0000 | [diff] [blame] | 889 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 890 | ftp |
| 891 | --- |
Raymond Hettinger | bcbd696 | 2010-09-05 08:46:36 +0000 | [diff] [blame] | 892 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 893 | The :class:`ftplib.FTP` class now supports the context manager protocol to |
| 894 | unconditionally consume :exc:`socket.error` exceptions and to close the FTP |
| 895 | connection when done:: |
Giampaolo Rodolà | bd576b7 | 2010-05-10 14:53:29 +0000 | [diff] [blame] | 896 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 897 | >>> from ftplib import FTP |
| 898 | >>> with FTP("ftp1.at.proftpd.org") as ftp: |
| 899 | ... ftp.login() |
| 900 | ... ftp.dir() |
| 901 | ... |
| 902 | '230 Anonymous login ok, restrictions apply.' |
| 903 | dr-xr-xr-x 9 ftp ftp 154 May 6 10:43 . |
| 904 | dr-xr-xr-x 9 ftp ftp 154 May 6 10:43 .. |
| 905 | dr-xr-xr-x 5 ftp ftp 4096 May 6 10:43 CentOS |
| 906 | dr-xr-xr-x 3 ftp ftp 18 Jul 10 2008 Fedora |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 907 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 908 | Other file-like objects such as :class:`mmap.mmap` and :func:`fileinput.input` |
| 909 | also grew auto-closing context managers:: |
| 910 | |
| 911 | with fileinput.input(files=('log1.txt', 'log2.txt')) as f: |
| 912 | for line in f: |
| 913 | process(line) |
| 914 | |
| 915 | (Contributed by Tarek Ziadé and Giampaolo Rodolà in :issue:`4972`, and |
| 916 | by Georg Brandl in :issue:`8046` and :issue:`1286`.) |
Antoine Pitrou | 696e035 | 2010-08-08 22:18:46 +0000 | [diff] [blame] | 917 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 918 | .. XXX mention os.popen and subprocess.Popen auto-closing of fds |
Georg Brandl | 3ad4675 | 2010-12-05 07:59:29 +0000 | [diff] [blame] | 919 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 920 | gzip and zipfile |
| 921 | ---------------- |
Antoine Pitrou | cd889af | 2010-10-06 21:13:56 +0000 | [diff] [blame] | 922 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 923 | :class:`gzip.GzipFile` now implements the :class:`io.BufferedIOBase` |
| 924 | :term:`abstract base class` (except for ``truncate()``). It also has a |
| 925 | :meth:`~gzip.GzipFile.peek` method and supports unseekable as well as |
| 926 | zero-padded file objects. |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 927 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 928 | The :mod:`gzip` module also gains the :func:`~gzip.compress` and |
| 929 | :func:`~gzip.decompress` functions for easier in-memory compression and |
| 930 | decompression. Keep in mind that text needs to be encoded in to :class:`bytes` |
| 931 | before compressing and decompressing: |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 932 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 933 | >>> s = 'Three shall be the number thou shalt count, ' |
| 934 | >>> s += 'and the number of the counting shall be three' |
| 935 | >>> b = s.encode() # convert to utf-8 |
| 936 | >>> len(b) |
| 937 | 89 |
| 938 | >>> c = gzip.compress(b) |
| 939 | >>> len(c) |
| 940 | 77 |
| 941 | >>> gzip.decompress(c).decode()[:42] # decompress and convert to text |
| 942 | 'Three shall be the number thou shalt count,' |
Antoine Pitrou | cd889af | 2010-10-06 21:13:56 +0000 | [diff] [blame] | 943 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 944 | (Contributed by Anand B. Pillai in :issue:`3488`; and by Antoine Pitrou, Nir |
| 945 | Aides and Brian Curtin in :issue:`9962`, :issue:`1675951`, :issue:`7471` and |
| 946 | :issue:`2846`.) |
| 947 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 948 | Also, the :class:`zipfile.ZipExtFile` class was reworked internally to represent |
| 949 | files stored inside an archive. The new implementation is significantly faster |
| 950 | and can be wrapped in a :class:`io.BufferedReader` object for more speedups. It |
| 951 | also solves an issue where interleaved calls to *read* and *readline* gave the |
| 952 | wrong results. |
| 953 | |
| 954 | (Patch submitted by by Nir Aides in :issue:`7610`.) |
| 955 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 956 | shutil |
| 957 | ------ |
| 958 | |
| 959 | The :func:`shutil.copytree` function has two new options: |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 960 | |
Raymond Hettinger | db9044e | 2010-09-06 01:29:23 +0000 | [diff] [blame] | 961 | * *ignore_dangling_symlinks*: when ``symlinks=False`` so that the function |
| 962 | copies the file pointed to by the symlink, not the symlink itself. This |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 963 | option will silence the error raised if the file doesn't exist. |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 964 | |
Raymond Hettinger | db9044e | 2010-09-06 01:29:23 +0000 | [diff] [blame] | 965 | * *copy_function*: is a callable that will be used to copy files. |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 966 | :func:`shutil.copy2` is used by default. |
| 967 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 968 | (Contributed by Tarek Ziadé.) |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 969 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 970 | sqlite3 |
| 971 | ------- |
Antoine Pitrou | e43f9d0 | 2010-08-08 23:24:50 +0000 | [diff] [blame] | 972 | |
Raymond Hettinger | 6046e22 | 2010-12-16 00:21:08 +0000 | [diff] [blame] | 973 | The :mod:`sqlite3` module was updated to version 2.6.0. It has two new capabilities. |
Antoine Pitrou | e43f9d0 | 2010-08-08 23:24:50 +0000 | [diff] [blame] | 974 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 975 | * The :attr:`sqlite3.Connection.in_transit` attribute is true if there is an |
| 976 | active transaction for uncommitted changes. |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 977 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 978 | * The :meth:`sqlite3.Connection.enable_load_extension` and |
| 979 | :meth:`sqlite3.Connection.load_extension` methods allows you to load SQLite |
| 980 | extensions from ".so" files. One well-known extension is the fulltext-search |
| 981 | extension distributed with SQLite. |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 982 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 983 | (Contributed by R. David Murray and Shashwat Anand; :issue:`8845`.) |
| 984 | |
| 985 | socket |
| 986 | ------ |
| 987 | |
| 988 | The :mod:`socket` module has two new improvements. |
| 989 | |
| 990 | * Socket objects now have a :meth:`~socket.socket.detach()` method which puts |
| 991 | the socket into closed state without actually closing the underlying file |
| 992 | descriptor. The latter can then be reused for other purposes. |
| 993 | (Added by Antoine Pitrou; :issue:`8524`.) |
| 994 | |
| 995 | * :func:`socket.create_connection` now supports the context manager protocol |
| 996 | to unconditionally consume :exc:`socket.error` exceptions and to close the |
| 997 | socket when done. |
| 998 | (Contributed by Giampaolo Rodolà; :issue:`9794`.) |
| 999 | |
| 1000 | ssl |
| 1001 | --- |
Antoine Pitrou | d67075e | 2010-07-31 22:48:02 +0000 | [diff] [blame] | 1002 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1003 | * The :mod:`ssl` module has a new class, :class:`~ssl.SSLContext` which serves |
| 1004 | as a container for various persistent SSL data, such as protocol settings, |
| 1005 | certificates, private keys, and various other options. The |
| 1006 | :meth:`~ssl.SSLContext.wrap_socket` method allows to create an SSL socket from |
| 1007 | such an SSL context. (Added by Antoine Pitrou; :issue:`8550`.) |
Antoine Pitrou | 4f2a0a8 | 2010-07-31 18:08:33 +0000 | [diff] [blame] | 1008 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1009 | * A new function, :func:`ssl.match_hostname`, helps implement server identity |
Antoine Pitrou | 0ee4c9f | 2010-10-08 16:46:17 +0000 | [diff] [blame] | 1010 | verification for higher-level protocols by implementing the rules of |
| 1011 | HTTPS (from :rfc:`2818`), which are also suitable for other protocols. |
| 1012 | (Added by Antoine Pitrou, :issue:`1589`). |
| 1013 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1014 | * The :func:`ssl.wrap_socket` constructor function now takes a *ciphers* |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1015 | argument that's a string listing the encryption algorithms to be allowed; the |
| 1016 | format of the string is described `in the OpenSSL documentation |
| 1017 | <http://www.openssl.org/docs/apps/ciphers.html#CIPHER_LIST_FORMAT>`__. (Added |
| 1018 | by Antoine Pitrou; :issue:`8322`.) |
Antoine Pitrou | 4f2a0a8 | 2010-07-31 18:08:33 +0000 | [diff] [blame] | 1019 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1020 | * When linked against a recent enough version of OpenSSL, the :mod:`ssl` |
Antoine Pitrou | 7d15a72 | 2010-11-05 22:13:55 +0000 | [diff] [blame] | 1021 | module now supports the Server Name Indication extension to the TLS |
| 1022 | protocol, allowing for several "virtual hosts" using different certificates |
| 1023 | on a single IP/port. This extension is only supported in client mode, |
| 1024 | and is activated by passing the *server_hostname* argument to |
| 1025 | :meth:`SSLContext.wrap_socket`. |
| 1026 | (Added by Antoine Pitrou, :issue:`5639`.) |
| 1027 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1028 | * Various options have been added to the :mod:`ssl` module, such as |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1029 | :data:`~ssl.OP_NO_SSLv2` which allows to force disabling of the insecure and |
| 1030 | obsolete SSLv2 protocol. (Added by Antoine Pitrou; :issue:`4870`.) |
Antoine Pitrou | 4f2a0a8 | 2010-07-31 18:08:33 +0000 | [diff] [blame] | 1031 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1032 | * Another change makes the extension load all of OpenSSL's ciphers and digest |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1033 | algorithms so that they're all available. Some SSL certificates couldn't be |
| 1034 | verified, reporting an "unknown algorithm" error. (Reported by Beda Kosata, |
| 1035 | and fixed by Antoine Pitrou; :issue:`8484`.) |
Antoine Pitrou | 4f2a0a8 | 2010-07-31 18:08:33 +0000 | [diff] [blame] | 1036 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1037 | * The version of OpenSSL being used is now available as the module attributes |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1038 | :data:`ssl.OPENSSL_VERSION` (a string), :data:`ssl.OPENSSL_VERSION_INFO` (a |
| 1039 | 5-tuple), and :data:`ssl.OPENSSL_VERSION_NUMBER` (an integer). (Added by |
| 1040 | Antoine Pitrou; :issue:`8321`.) |
Antoine Pitrou | 4f2a0a8 | 2010-07-31 18:08:33 +0000 | [diff] [blame] | 1041 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1042 | nntp |
| 1043 | ---- |
Raymond Hettinger | 070ec70 | 2010-12-10 17:45:13 +0000 | [diff] [blame] | 1044 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1045 | The :mod:`nntplib` module has a revamped implementation with better bytes and |
| 1046 | unicode semantics as well as more practical APIs. These improvements break |
| 1047 | compatibility with the nntplib version in Python 3.1, which was partly |
| 1048 | dysfunctional in itself. |
Raymond Hettinger | 070ec70 | 2010-12-10 17:45:13 +0000 | [diff] [blame] | 1049 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1050 | (Contributed by Antoine Pitrou in :issue:`9360`) |
| 1051 | |
| 1052 | certificates |
| 1053 | ------------ |
| 1054 | |
| 1055 | :class:`http.client.HTTPSConnection`, :class:`urllib.request.HTTPSHandler` |
| 1056 | and :func:`urllib.request.urlopen` now take optional arguments to allow for |
| 1057 | server certificate checking against a set of Certificate Authorities, |
| 1058 | as recommended in public uses of HTTPS. |
| 1059 | |
| 1060 | (Added by Antoine Pitrou, :issue:`9003`.) |
| 1061 | |
| 1062 | unittest |
| 1063 | -------- |
Antoine Pitrou | afb078d | 2010-11-05 22:18:28 +0000 | [diff] [blame] | 1064 | |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 1065 | * The command-line call, ``python -m unittest`` can now accept file paths |
| 1066 | instead of module names for running specific tests (:issue:`10620`). The new |
| 1067 | test discovery can find tests within packages, locating any test importable |
| 1068 | from the top level directory. The top level directory can be specified with |
| 1069 | the `-t` option, a pattern for matching files with ``-p``, and a directory to |
| 1070 | start discovery with ``-s``:: |
| 1071 | |
| 1072 | $ python -m unittest discover -s my_proj_dir -p '_test.py' |
| 1073 | |
| 1074 | (Contributed by Michael Foord.) |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1075 | |
Raymond Hettinger | dc2f9b5 | 2010-12-05 07:02:45 +0000 | [diff] [blame] | 1076 | * The :mod:`unittest` module has two new methods, |
| 1077 | :meth:`~unittest.TestCase.assertWarns` and |
| 1078 | :meth:`~unittest.TestCase.assertWarnsRegex` to check that a given warning type |
Raymond Hettinger | 413abbc | 2010-12-05 07:06:47 +0000 | [diff] [blame] | 1079 | is triggered by the code under test: |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1080 | |
Raymond Hettinger | dc2f9b5 | 2010-12-05 07:02:45 +0000 | [diff] [blame] | 1081 | >>> with self.assertWarns(DeprecationWarning): |
| 1082 | ... legacy_function('XYZ') |
Ezio Melotti | 2baf1a6 | 2010-11-22 12:56:58 +0000 | [diff] [blame] | 1083 | |
Raymond Hettinger | 21ec4bc | 2010-12-10 01:09:01 +0000 | [diff] [blame] | 1084 | Another new method, :meth:`~unittest.TestCase.assertCountEqual` is used to |
Raymond Hettinger | ffad35e | 2010-12-14 21:12:03 +0000 | [diff] [blame] | 1085 | compare two iterables to determine if their element counts are equal (whether |
| 1086 | the same elements are present with the same number of occurrences regardless |
| 1087 | of order):: |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 1088 | |
| 1089 | def test_anagram(self): |
| 1090 | self.assertCountEqual('algorithm', 'logarithm') |
| 1091 | |
| 1092 | A principal feature of the unittest module is an effort to produce meaningful |
| 1093 | diagnostics when a test fails. When possible the failure is recorded along |
| 1094 | with a diff of the output. This is especially helpful for analyzing log files |
| 1095 | of failed test runs. However, since diffs can sometime be voluminous, there is |
| 1096 | a new :attr:`~unittest.TestCase.maxDiff` attribute which sets maximum length of |
| 1097 | diffs. |
| 1098 | |
Raymond Hettinger | 68f1e8d | 2010-12-07 09:24:30 +0000 | [diff] [blame] | 1099 | In addition the naming in the module has undergone a number of clean-ups. For |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 1100 | example, :meth:`~unittest.TestCase.assertRegex` is the new name for |
| 1101 | :meth:`~unittest.TestCase.assertRegexpMatches` which was misnamed because the |
Raymond Hettinger | 21ec4bc | 2010-12-10 01:09:01 +0000 | [diff] [blame] | 1102 | test uses :func:`re.search`, not :func:`re.match`. Other methods using |
| 1103 | regular expressions are now named using short form "Regex" in preference |
| 1104 | to "Regexp" -- this matches the names used in other unittest implementations, |
| 1105 | matches Python's old name for the :mod:`re` module, and it has unambiguous |
| 1106 | camel-casing. |
Raymond Hettinger | dc2f9b5 | 2010-12-05 07:02:45 +0000 | [diff] [blame] | 1107 | |
| 1108 | To improve consistency, some of long-standing method aliases are being |
| 1109 | deprecated in favor of the preferred names: |
| 1110 | |
| 1111 | - replace :meth:`assert_` with :meth:`.assertTrue` |
| 1112 | - replace :meth:`assertEquals` with :meth:`.assertEqual` |
| 1113 | - replace :meth:`assertNotEquals` with :meth:`.assertNotEqual` |
| 1114 | - replace :meth:`assertAlmostEquals` with :meth:`.assertAlmostEqual` |
| 1115 | - replace :meth:`assertNotAlmostEquals` with :meth:`.assertNotAlmostEqual` |
| 1116 | |
| 1117 | Likewise, the ``TestCase.fail*`` methods deprecated in Python 3.1 are expected |
| 1118 | to be removed in Python 3.3. See also the :ref:`deprecated-aliases` section in |
| 1119 | the :mod:`unittest` documentation. |
Ezio Melotti | 2baf1a6 | 2010-11-22 12:56:58 +0000 | [diff] [blame] | 1120 | |
| 1121 | (Contributed by Ezio Melotti; :issue:`9424`.) |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1122 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1123 | random |
| 1124 | ------ |
Raymond Hettinger | e0a9600 | 2010-12-15 17:54:13 +0000 | [diff] [blame] | 1125 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 1126 | The integer methods in the :mod:`random` module now do a better job of producing |
Raymond Hettinger | 99db3fd | 2010-12-15 19:33:49 +0000 | [diff] [blame] | 1127 | uniform distributions. Previously, they computed selections with |
| 1128 | ``int(n*random())`` which had a slight bias whenever *n* was not a power of two. |
| 1129 | Now, multiple selections are made from a range upto the next power of two and a |
| 1130 | selection is kept only when it falls within the range ``0 <= x < n``. The |
| 1131 | functions and methods affected are :func:`~random.randrange`, |
| 1132 | :func:`~random.randint`, :func:`~random.choice`, :func:`~random.shuffle` and |
| 1133 | :func:`~random.sample`. |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1134 | |
| 1135 | (Contributed by Raymond Hettinger; :issue:`9025`.) |
| 1136 | |
| 1137 | poplib |
| 1138 | ------ |
Raymond Hettinger | e0a9600 | 2010-12-15 17:54:13 +0000 | [diff] [blame] | 1139 | |
Giampaolo Rodolà | 42382fe | 2010-08-17 16:09:53 +0000 | [diff] [blame] | 1140 | * :class:`~poplib.POP3_SSL` class now accepts a *context* parameter, which is a |
| 1141 | :class:`ssl.SSLContext` object allowing bundling SSL configuration options, |
| 1142 | certificates and private keys into a single (potentially long-lived) |
| 1143 | structure. |
| 1144 | |
| 1145 | (Contributed by Giampaolo Rodolà; :issue:`8807`.) |
| 1146 | |
Giampaolo Rodolà | 977c707 | 2010-10-04 21:08:36 +0000 | [diff] [blame] | 1147 | * :class:`asyncore.dispatcher` now provides a |
| 1148 | :meth:`~asyncore.dispatcher.handle_accepted()` method |
| 1149 | returning a `(sock, addr)` pair which is called when a connection has actually |
| 1150 | been established with a new remote endpoint. This is supposed to be used as a |
| 1151 | replacement for old :meth:`~asyncore.dispatcher.handle_accept()` and avoids |
| 1152 | the user to call :meth:`~asyncore.dispatcher.accept()` directly. |
| 1153 | |
| 1154 | (Contributed by Giampaolo Rodolà; :issue:`6706`.) |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1155 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1156 | tempfile |
| 1157 | -------- |
Raymond Hettinger | a026633 | 2010-12-07 08:52:41 +0000 | [diff] [blame] | 1158 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1159 | The :mod:`tempfile` module has a new context manager, |
| 1160 | :class:`~tempfile.TemporaryDirectory` which provides easy deterministic |
| 1161 | cleanup of temporary directories: |
Nick Coghlan | 543af75 | 2010-10-24 11:23:25 +0000 | [diff] [blame] | 1162 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1163 | >>> with tempfile.TemporaryDirectory() as tmpdirname: |
| 1164 | ... print 'created temporary directory', tmpdirname |
Nick Coghlan | 543af75 | 2010-10-24 11:23:25 +0000 | [diff] [blame] | 1165 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1166 | (Contributed by Neil Schemenauer and Nick Coghlan; :issue:`5178`.) |
Nick Coghlan | e0f0465 | 2010-11-21 03:44:04 +0000 | [diff] [blame] | 1167 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1168 | inspect |
| 1169 | ------- |
| 1170 | |
Raymond Hettinger | 0358a17 | 2010-12-15 19:00:38 +0000 | [diff] [blame] | 1171 | * The :mod:`inspect` module has a new function |
| 1172 | :func:`~inspect.getgeneratorstate` to easily identify the current state of a |
| 1173 | generator as one of ``GEN_CREATED``, ``GEN_RUNNING``, ``GEN_SUSPENDED`` or |
| 1174 | ``GEN_CLOSED``. (Contributed by Rodolpho Eckhardt and Nick Coghlan, |
| 1175 | :issue:`10220`.) |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1176 | |
Raymond Hettinger | a55ffbc | 2010-12-15 18:31:57 +0000 | [diff] [blame] | 1177 | * To support lookups without the possibility of activating a dynamic attribute, |
| 1178 | the :mod:`inspect` module has a new function, :func:`~inspect.getattr_static`. |
| 1179 | Unlike, :func:`hasattr`, this is a true read-only search, guaranteed not to |
| 1180 | change state while it is searching. (Contributed by Michael Foord.) |
Nick Coghlan | e0f0465 | 2010-11-21 03:44:04 +0000 | [diff] [blame] | 1181 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1182 | pydoc |
| 1183 | ----- |
Nick Coghlan | 7bb30b7 | 2010-12-03 09:29:11 +0000 | [diff] [blame] | 1184 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1185 | The :mod:`pydoc` module now provides a much improved Web server interface, |
| 1186 | as well as a new command-line option to automatically open a browser |
| 1187 | window to display that server. |
Nick Coghlan | 7bb30b7 | 2010-12-03 09:29:11 +0000 | [diff] [blame] | 1188 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1189 | (Contributed by Ron Adam; :issue:`2001`.) |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1190 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1191 | sysconfig |
| 1192 | --------- |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1193 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1194 | The new :mod:`sysconfig` module makes it straight-forward to discover |
| 1195 | installation paths and configuration variables which vary across platforms and |
| 1196 | installations. |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1197 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1198 | The module offers access simple access functions for platform and version |
| 1199 | information: |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1200 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1201 | * :func:`~sysconfig.get_platform` returning values like *linux-i586* or |
| 1202 | *macosx-10.6-ppc*. |
| 1203 | * :func:`~sysconfig.get_python_version` returns a Python version string in |
| 1204 | the form, "3.2". |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1205 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1206 | It also provides access to the paths and variables corresponding to one of |
| 1207 | seven named schemes used by :mod:`distutils`. Those include *posix_prefix*, |
| 1208 | *posix_home*, *posix_user*, *nt*, *nt_user*, *os2*, *os2_home*: |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1209 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1210 | * :func:`~sysconfig.get_paths` makes a dictionary containing installation paths |
| 1211 | for the current installation scheme. |
| 1212 | * :func:`~sysconfig.get_config_vars` returns a dictionary of platform specific |
| 1213 | variables. |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1214 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1215 | There is also a convenient command-line interface:: |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1216 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1217 | C:\Python32>python -m sysconfig |
| 1218 | Platform: "win32" |
| 1219 | Python version: "3.2" |
| 1220 | Current installation scheme: "nt" |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1221 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1222 | Paths: |
| 1223 | data = "C:\Python32" |
Łukasz Langa | 79a06ed | 2010-12-17 22:05:46 +0000 | [diff] [blame] | 1224 | include = "C:\Python32\Include" |
| 1225 | platinclude = "C:\Python32\Include" |
| 1226 | platlib = "C:\Python32\Lib\site-packages" |
| 1227 | platstdlib = "C:\Python32\Lib" |
| 1228 | purelib = "C:\Python32\Lib\site-packages" |
| 1229 | scripts = "C:\Python32\Scripts" |
| 1230 | stdlib = "C:\Python32\Lib" |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1231 | |
| 1232 | Variables: |
| 1233 | BINDIR = "C:\Python32" |
Łukasz Langa | 79a06ed | 2010-12-17 22:05:46 +0000 | [diff] [blame] | 1234 | BINLIBDEST = "C:\Python32\Lib" |
| 1235 | EXE = ".exe" |
| 1236 | INCLUDEPY = "C:\Python32\Include" |
| 1237 | LIBDEST = "C:\Python32\Lib" |
| 1238 | SO = ".pyd" |
| 1239 | VERSION = "32" |
| 1240 | abiflags = "" |
| 1241 | base = "C:\Python32" |
| 1242 | exec_prefix = "C:\Python32" |
| 1243 | platbase = "C:\Python32" |
| 1244 | prefix = "C:\Python32" |
| 1245 | projectbase = "C:\Python32" |
| 1246 | py_version = "3.2" |
| 1247 | py_version_nodot = "32" |
| 1248 | py_version_short = "3.2" |
| 1249 | srcdir = "C:\Python32" |
| 1250 | userbase = "C:\Documents and Settings\Raymond\Application Data\Python" |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1251 | |
| 1252 | pdb |
| 1253 | --- |
| 1254 | |
| 1255 | The :mod:`pdb` debugger module gained a number of usability improvements: |
Raymond Hettinger | b5d7933 | 2010-12-07 02:04:56 +0000 | [diff] [blame] | 1256 | |
Raymond Hettinger | 99db3fd | 2010-12-15 19:33:49 +0000 | [diff] [blame] | 1257 | * :file:`pdb.py` now has a ``-c`` option that executes commands as given in a |
| 1258 | :file:`.pdbrc` script file. |
| 1259 | * A :file:`.pdbrc` script file can contain ``continue`` and ``next`` commands |
| 1260 | that continue debugging. |
| 1261 | * The :class:`Pdb` class constructor now accepts a *nosigint* argument. |
| 1262 | * new commands: ``l(list)``, ``ll(long list`` and ``source`` for |
| 1263 | listing source code. |
| 1264 | * new commands: ``display`` and ``undisplay`` for showing or hiding |
| 1265 | the value of an expression if it has changed. |
| 1266 | * new command: ``interact`` for starting an interactive interpreter containing |
| 1267 | the global and local names found in the current scope. |
| 1268 | * breakpoints can be cleared by breakpoint number |
Raymond Hettinger | b5d7933 | 2010-12-07 02:04:56 +0000 | [diff] [blame] | 1269 | |
Georg Brandl | 101234b | 2010-12-18 11:53:25 +0000 | [diff] [blame^] | 1270 | (Contributed by Georg Brandl, Antonio Cuni and Ilya Sandler.) |
| 1271 | |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1272 | configparser |
| 1273 | ------------ |
Raymond Hettinger | 3f9734c | 2010-12-07 01:47:52 +0000 | [diff] [blame] | 1274 | |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1275 | The :mod:`configparser` module was modified to improve usability and |
| 1276 | predictability of the default parser and its supported INI syntax. The old |
| 1277 | :class:`ConfigParser` class was removed in favor of :class:`SafeConfigParser` |
Raymond Hettinger | 0412974 | 2010-12-18 10:57:50 +0000 | [diff] [blame] | 1278 | which has in turn been renamed to :class:`~configparser.ConfigParser`. Support |
| 1279 | for inline comments is now turned off by default and section or option |
| 1280 | duplicates are not allowed in a single configuration source. |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1281 | |
| 1282 | Config parsers gained a new API based on the mapping protocol:: |
| 1283 | |
| 1284 | >>> parser = ConfigParser() |
| 1285 | >>> parser.read_string(""" |
| 1286 | ... [DEFAULT] |
| 1287 | ... monty = python |
| 1288 | ... |
| 1289 | ... [phrases] |
| 1290 | ... the = who |
| 1291 | ... full = metal jacket |
| 1292 | ... """) |
| 1293 | >>> parser['phrases']['full'] |
| 1294 | 'metal jacket' |
| 1295 | >>> section = parser['phrases'] |
| 1296 | >>> section['the'] |
| 1297 | 'who' |
| 1298 | >>> section['british'] = '%(the)s %(full)s %(monty)s!' |
| 1299 | >>> parser['phrases']['british'] |
| 1300 | 'who metal jacket python!' |
| 1301 | >>> 'british' in section |
| 1302 | True |
| 1303 | |
Raymond Hettinger | 0412974 | 2010-12-18 10:57:50 +0000 | [diff] [blame] | 1304 | The new API is implemented on top of the classical API so custom parser |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1305 | subclasses should be able to use it without modifications. |
| 1306 | |
| 1307 | The INI file structure accepted by config parsers can now be customized. Users |
Raymond Hettinger | 0412974 | 2010-12-18 10:57:50 +0000 | [diff] [blame] | 1308 | can specify alternative option/value delimiters and comment prefixes, change the |
| 1309 | name of the *DEFAULT* section or switch the interpolation syntax. Along with |
| 1310 | support for pluggable interpolation, an additional interpolation handler |
| 1311 | :class:`~configparser.ExtendedInterpolation` was introduced:: |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1312 | |
| 1313 | >>> parser = ConfigParser(interpolation=ExtendedInterpolation()) |
| 1314 | >>> parser.read_dict({'buildout': {'directory': '/home/ambv/zope9'}, |
| 1315 | ... 'custom': {'prefix': '/usr/local'}}) |
| 1316 | >>> parser.read_string(""" |
| 1317 | ... [buildout] |
| 1318 | ... parts = |
| 1319 | ... zope9 |
| 1320 | ... instance |
| 1321 | ... find-links = |
| 1322 | ... ${buildout:directory}/downloads/dist |
| 1323 | ... |
| 1324 | ... [zope9] |
| 1325 | ... recipe = plone.recipe.zope9install |
| 1326 | ... location = /opt/zope |
| 1327 | ... |
| 1328 | ... [instance] |
| 1329 | ... recipe = plone.recipe.zope9instance |
| 1330 | ... zope9-location = ${zope9:location} |
| 1331 | ... zope-conf = ${custom:prefix}/etc/zope.conf |
| 1332 | ... """) |
| 1333 | >>> parser['buildout']['find-links'] |
| 1334 | '\n/home/ambv/zope9/downloads/dist' |
| 1335 | >>> parser['instance']['zope-conf'] |
| 1336 | '/usr/local/etc/zope.conf' |
| 1337 | >>> instance = parser['instance'] |
| 1338 | >>> instance['zope-conf'] |
| 1339 | '/usr/local/etc/zope.conf' |
| 1340 | >>> instance['zope9-location'] |
| 1341 | '/opt/zope' |
| 1342 | |
| 1343 | A number of smaller features were also introduced, like support for specifying |
Raymond Hettinger | 0412974 | 2010-12-18 10:57:50 +0000 | [diff] [blame] | 1344 | encoding in read operations, specifying fallback values for get-functions, or |
| 1345 | reading directly from dictionaries and strings. |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1346 | |
| 1347 | (All changes contributed by Łukasz Langa.) |
| 1348 | |
Raymond Hettinger | a55ffbc | 2010-12-15 18:31:57 +0000 | [diff] [blame] | 1349 | .. XXX: Mention urllib.parse changes |
| 1350 | Issue 9873 (Nick Coghlan): |
| 1351 | - ASCII byte sequence support in URL parsing |
| 1352 | - named tuple for urldefrag return value |
| 1353 | Issue 5468 (Dan Mahn) for urlencode: |
| 1354 | - bytes input support |
| 1355 | - non-UTF8 percent encoding of non-ASCII characters |
| 1356 | Issue 2987 for IPv6 (RFC2732) support in urlparse |
Raymond Hettinger | 202717d | 2010-12-16 10:06:11 +0000 | [diff] [blame] | 1357 | .. XXX: Any updates to the WSGI bytes versus text problem? |
Raymond Hettinger | a55ffbc | 2010-12-15 18:31:57 +0000 | [diff] [blame] | 1358 | |
Antoine Pitrou | d42bc51 | 2009-11-10 23:18:31 +0000 | [diff] [blame] | 1359 | Multi-threading |
| 1360 | =============== |
| 1361 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1362 | * The mechanism for serializing execution of concurrently running Python threads |
| 1363 | (generally known as the GIL or Global Interpreter Lock) has been rewritten. |
| 1364 | Among the objectives were more predictable switching intervals and reduced |
| 1365 | overhead due to lock contention and the number of ensuing system calls. The |
| 1366 | notion of a "check interval" to allow thread switches has been abandoned and |
| 1367 | replaced by an absolute duration expressed in seconds. This parameter is |
| 1368 | tunable through :func:`sys.setswitchinterval()`. It currently defaults to 5 |
| 1369 | milliseconds. |
Antoine Pitrou | d42bc51 | 2009-11-10 23:18:31 +0000 | [diff] [blame] | 1370 | |
| 1371 | Additional details about the implementation can be read from a `python-dev |
| 1372 | mailing-list message |
| 1373 | <http://mail.python.org/pipermail/python-dev/2009-October/093321.html>`_ |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1374 | (however, "priority requests" as exposed in this message have not been kept |
| 1375 | for inclusion). |
Antoine Pitrou | d42bc51 | 2009-11-10 23:18:31 +0000 | [diff] [blame] | 1376 | |
Georg Brandl | 5e73a81 | 2010-04-22 07:02:51 +0000 | [diff] [blame] | 1377 | (Contributed by Antoine Pitrou.) |
Antoine Pitrou | d42bc51 | 2009-11-10 23:18:31 +0000 | [diff] [blame] | 1378 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1379 | * Regular and recursive locks now accept an optional *timeout* argument to their |
Raymond Hettinger | 09e4ebb | 2010-09-06 19:55:51 +0000 | [diff] [blame] | 1380 | :meth:`acquire` method. (Contributed by Antoine Pitrou; :issue:`7316`.) |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1381 | |
Raymond Hettinger | bba537b | 2010-12-15 18:20:19 +0000 | [diff] [blame] | 1382 | * Similarly, :meth:`threading.Semaphore.acquire` also gained a *timeout* |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1383 | argument. (Contributed by Torsten Landschoff; :issue:`850728`.) |
Antoine Pitrou | e95a9ff | 2010-05-04 23:31:41 +0000 | [diff] [blame] | 1384 | |
Antoine Pitrou | 810023d | 2010-12-15 22:59:16 +0000 | [diff] [blame] | 1385 | * Regular and recursive lock acquisitions can now be interrupted by signals on |
| 1386 | platforms using pthreads. This means that Python programs that deadlock while |
| 1387 | acquiring locks can be successfully killed by repeatedly sending SIGINT to the |
Raymond Hettinger | 48f3bd3 | 2010-12-16 00:30:53 +0000 | [diff] [blame] | 1388 | process (by pressing Ctl+C in most shells). |
Antoine Pitrou | 810023d | 2010-12-15 22:59:16 +0000 | [diff] [blame] | 1389 | (Contributed by Reid Kleckner; :issue:`8844`.) |
| 1390 | |
Antoine Pitrou | d42bc51 | 2009-11-10 23:18:31 +0000 | [diff] [blame] | 1391 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 1392 | Optimizations |
| 1393 | ============= |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1394 | |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 1395 | A number of small performance enhancements have been added: |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1396 | |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1397 | * Python's peephole optimizer now recognizes patterns such ``x in {1, 2, 3}`` as |
Raymond Hettinger | 92ba286 | 2010-09-06 01:16:46 +0000 | [diff] [blame] | 1398 | being a test for membership in a set of constants. The optimizer recasts the |
| 1399 | :class:`set` as a :class:`frozenset` and stores the pre-built constant. |
| 1400 | |
| 1401 | Now that the speed penalty is gone, it is practical to start writing |
| 1402 | membership tests using set-notation. This style is both semantically clear |
| 1403 | and operationally fast:: |
| 1404 | |
| 1405 | extension = name.rpartition('.')[2] |
| 1406 | if extension in {'xml', 'html', 'xhtml', 'css'}: |
| 1407 | handle(name) |
| 1408 | |
| 1409 | (Patch and additional tests by Dave Malcolm; :issue:`6690`). |
| 1410 | |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1411 | * Serializing and unserializing data using the :mod:`pickle` module is now |
Raymond Hettinger | dadf93c | 2010-12-05 02:56:21 +0000 | [diff] [blame] | 1412 | several times faster. |
| 1413 | |
| 1414 | (Contributed by Alexandre Vassalotti, Antoine Pitrou |
Antoine Pitrou | ff150f2 | 2010-10-22 21:41:05 +0000 | [diff] [blame] | 1415 | and the Unladen Swallow team in :issue:`9410` and :issue:`3873`.) |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1416 | |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 1417 | * The `Timsort algorithm <http://en.wikipedia.org/wiki/Timsort>`_ used in |
Raymond Hettinger | ffad35e | 2010-12-14 21:12:03 +0000 | [diff] [blame] | 1418 | :meth:`list.sort` and :func:`sorted` now runs faster and uses less memory |
Raymond Hettinger | c269ae8 | 2010-12-05 01:01:52 +0000 | [diff] [blame] | 1419 | when called with a :term:`key function`. Previously, every element of |
| 1420 | a list was wrapped with a temporary object that remembered the key value |
| 1421 | associated with each element. Now, an array of keys and values are |
| 1422 | sorted in parallel. This save the memory consumed by the sort wrappers, |
| 1423 | and it saves time lost from during comparisons which where delegated |
| 1424 | by the sort wrappers. |
| 1425 | |
| 1426 | (Patch by Daniel Stuzback in :issue:`9915`.) |
| 1427 | |
Raymond Hettinger | dadf93c | 2010-12-05 02:56:21 +0000 | [diff] [blame] | 1428 | * JSON decoding performance is improved and memory consumption is reduced |
Raymond Hettinger | 413abbc | 2010-12-05 07:06:47 +0000 | [diff] [blame] | 1429 | whenever the same string is repeated for multiple keys. Also, JSON encoding |
Raymond Hettinger | dadf93c | 2010-12-05 02:56:21 +0000 | [diff] [blame] | 1430 | now uses the C speedups when the ``sort_keys`` argument is true. |
| 1431 | |
| 1432 | (Contributed by Antoine Pitrou in :issue:`7451` and by Raymond Hettinger and |
| 1433 | Antoine Pitrou in :issue:`10314`.) |
| 1434 | |
Raymond Hettinger | 21ec4bc | 2010-12-10 01:09:01 +0000 | [diff] [blame] | 1435 | * Recursive locks (created with the :func:`threading.RLock` API) now benefit |
| 1436 | from a C implementation which makes them as fast as regular locks, and between |
| 1437 | 10x and 15x faster than their previous pure Python implementation. |
| 1438 | |
| 1439 | (Contributed by Antoine Pitrou; :issue:`3001`.) |
| 1440 | |
Raymond Hettinger | dadf93c | 2010-12-05 02:56:21 +0000 | [diff] [blame] | 1441 | * The fast-search algorithm in stringlib is now used by the :meth:`split`, |
| 1442 | :meth:`rsplit`, :meth:`splitlines` and :meth:`replace` methods on |
| 1443 | :class:`bytes`, :class:`bytearray` and :class:`str` objects. Likewise, the |
| 1444 | algorithm is also used by :meth:`rfind`, :meth:`rindex`, :meth:`rsplit` and |
| 1445 | :meth:`rpartition`. |
| 1446 | |
| 1447 | (Patch by Florent Xicluna in :issue:`7622` and :issue:`7462`.) |
| 1448 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 1449 | |
| 1450 | * String to integer conversions now work two "digits" at a time, reducing the |
| 1451 | number of division and modulo operations. |
| 1452 | |
| 1453 | (:issue:`6713` by Gawain Bolton, Mark Dickinson, and Victor Stinner.) |
| 1454 | |
Raymond Hettinger | d8fae4e | 2010-12-05 05:39:54 +0000 | [diff] [blame] | 1455 | There were several other minor optimizations. Set differencing now runs faster |
| 1456 | when one operand is much larger than the other (Patch by Andress Bennetts in |
| 1457 | :issue:`8685`). The :meth:`array.repeat` method has a faster implementation |
| 1458 | (:issue:`1569291` by Alexander Belopolsky). The :class:`BaseHTTPRequestHandler` |
| 1459 | has more efficient buffering (:issue:`3709` by Andrew Schaaf). The |
| 1460 | multi-argument form of :func:`operator.attrgetter` now function runs slightly |
| 1461 | faster (:issue:`10160` by Christos Georgiou). And :class:`ConfigParser` loads |
| 1462 | multi-line arguments a bit faster (:issue:`7113` by Łukasz Langa). |
| 1463 | |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1464 | |
Victor Stinner | 47ce965 | 2010-10-29 00:57:35 +0000 | [diff] [blame] | 1465 | Unicode |
| 1466 | ======= |
Victor Stinner | 94908bb | 2010-08-18 21:23:25 +0000 | [diff] [blame] | 1467 | |
Alexander Belopolsky | 507e3f8 | 2010-12-02 00:05:57 +0000 | [diff] [blame] | 1468 | Python has been updated to Unicode 6.0.0. The new features of the |
| 1469 | Unicode Standard that will affect Python users include: |
| 1470 | |
Alexander Belopolsky | 84cc062 | 2010-12-08 21:38:46 +0000 | [diff] [blame] | 1471 | * addition of 2,088 characters, including over 1,000 additional |
| 1472 | symbols—chief among them the additional emoji symbols, which are |
| 1473 | especially important for mobile phones; |
Alexander Belopolsky | 507e3f8 | 2010-12-02 00:05:57 +0000 | [diff] [blame] | 1474 | |
Alexander Belopolsky | 84cc062 | 2010-12-08 21:38:46 +0000 | [diff] [blame] | 1475 | * changes to character properties for existing characters including |
Alexander Belopolsky | 507e3f8 | 2010-12-02 00:05:57 +0000 | [diff] [blame] | 1476 | |
Raymond Hettinger | c74d518 | 2010-12-02 01:38:25 +0000 | [diff] [blame] | 1477 | - a general category change to two Kannada characters (U+0CF1, |
| 1478 | U+0CF2), which has the effect of making them newly eligible for |
| 1479 | inclusion in identifiers; |
| 1480 | |
| 1481 | - a general category change to one New Tai Lue numeric character |
Alexander Belopolsky | 84cc062 | 2010-12-08 21:38:46 +0000 | [diff] [blame] | 1482 | (U+19DA), which has the effect of disqualifying it from |
| 1483 | inclusion in identifiers. |
| 1484 | |
| 1485 | For more information, see `Unicode Character Database Changes |
| 1486 | <http://www.unicode.org/versions/Unicode6.0.0/#Database_Changes>`_ |
| 1487 | at the `Unicode Consortium <http://www.unicode.org/>`_ web site. |
Alexander Belopolsky | 507e3f8 | 2010-12-02 00:05:57 +0000 | [diff] [blame] | 1488 | |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 1489 | The :mod:`os` module has two new functions: :func:`~os.fsencode` and |
Victor Stinner | 47ce965 | 2010-10-29 00:57:35 +0000 | [diff] [blame] | 1490 | :func:`~os.fsdecode`. Add :data:`os.environb`: bytes version of |
| 1491 | :data:`os.environ`, :func:`os.getenvb` function and |
| 1492 | :data:`os.supports_bytes_environ` constant. |
Victor Stinner | e8d5145 | 2010-08-19 01:05:19 +0000 | [diff] [blame] | 1493 | |
Georg Brandl | 326c57d | 2010-11-26 12:10:06 +0000 | [diff] [blame] | 1494 | ``'mbcs'`` encoding doesn't ignore the error handler argument any more. By |
Victor Stinner | 47ce965 | 2010-10-29 00:57:35 +0000 | [diff] [blame] | 1495 | default (strict mode), it raises an UnicodeDecodeError on undecodable byte |
| 1496 | sequence and UnicodeEncodeError on unencodable character. To get the ``'mbcs'`` |
| 1497 | encoding of Python 3.1, use ``'ignore'`` error handler to decode and |
| 1498 | ``'replace'`` error handler to encode. ``'mbcs'`` supports ``'strict'`` and |
| 1499 | ``'ignore'`` error handlers for decoding, and ``'strict'`` and ``'replace'`` |
| 1500 | for encoding. |
| 1501 | |
| 1502 | On Mac OS X, Python uses ``'utf-8'`` to decode the command line arguments, |
| 1503 | instead of the locale encoding (which is ISO-8859-1 if the ``LANG`` environment |
| 1504 | variable is not set). |
| 1505 | |
| 1506 | By default, tarfile uses ``'utf-8'`` encoding on Windows (instead of |
| 1507 | ``'mbcs'``), and the ``'surrogateescape'`` error handler on all operating |
| 1508 | systems. |
Antoine Pitrou | d305200 | 2010-09-15 15:09:40 +0000 | [diff] [blame] | 1509 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 1510 | * Added the *cp720* Arabic DOS encoding (:issue:`1616979`). |
| 1511 | |
Victor Stinner | 94908bb | 2010-08-18 21:23:25 +0000 | [diff] [blame] | 1512 | |
Raymond Hettinger | 1fa7682 | 2010-12-06 23:31:36 +0000 | [diff] [blame] | 1513 | Documentation |
| 1514 | ============= |
| 1515 | |
| 1516 | The documentation continues to be improved. |
| 1517 | |
| 1518 | A table of quick links has been added to the top of lengthy sections such as |
| 1519 | :ref:`built-in-funcs`. In the case of :mod:`itertools`, the links are |
| 1520 | accompanied by tables of cheatsheet-style summaries to provide an overview and |
| 1521 | memory jog without having to read all of the docs. |
| 1522 | |
| 1523 | In some cases, the pure python source code can be helpful adjunct to the docs, |
| 1524 | so now some modules feature quick links to the latest version of the source |
| 1525 | code. For example, the :mod:`functools` module documentation has a quick link |
| 1526 | at the top labeled :source:`functools Python source code <Lib/functools.py>`. |
| 1527 | |
| 1528 | The docs now contain more examples and recipes. In particular, :mod:`re` module |
| 1529 | has an extensive section, :ref:`re-examples`. Likewise, the :mod:`itertools` |
| 1530 | module continues to be updated with new :ref:`itertools-recipes`. |
| 1531 | |
Raymond Hettinger | 677e10a | 2010-12-07 06:45:30 +0000 | [diff] [blame] | 1532 | The :mod:`datetime` module now has an auxiliary implementation in pure Python. |
| 1533 | No functionality was changed. This just provides an easier-to-read |
| 1534 | alternate implementation. (Contributed by Alexander Belopolsky.) |
| 1535 | |
Raymond Hettinger | 1fa7682 | 2010-12-06 23:31:36 +0000 | [diff] [blame] | 1536 | |
| 1537 | IDLE |
| 1538 | ==== |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1539 | |
Georg Brandl | cc9d237 | 2010-12-10 19:22:11 +0000 | [diff] [blame] | 1540 | * The format menu now has an option to clean-up source files by stripping |
| 1541 | trailing whitespace (:issue:`5150`). |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1542 | |
| 1543 | |
| 1544 | Build and C API Changes |
| 1545 | ======================= |
| 1546 | |
| 1547 | Changes to Python's build process and to the C API include: |
| 1548 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1549 | * The C functions that access the Unicode Database now accept and return |
| 1550 | characters from the full Unicode range, even on narrow unicode builds |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 1551 | (Py_UNICODE_TOLOWER, Py_UNICODE_ISDECIMAL, and others). A visible difference |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1552 | in Python is that :func:`unicodedata.numeric` now returns the correct value |
| 1553 | for large code points, and :func:`repr` may consider more characters as |
| 1554 | printable. |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1555 | |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 1556 | (Reported by Bupjoe Lee and fixed by Amaury Forgeot D'Arc; :issue:`5127`.) |
| 1557 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1558 | * Computed gotos are now enabled by default on supported compilers (which are |
Raymond Hettinger | db9044e | 2010-09-06 01:29:23 +0000 | [diff] [blame] | 1559 | detected by the configure script). They can still be disabled selectively by |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1560 | specifying ``--without-computed-gotos``. |
Raymond Hettinger | 1784ff0 | 2010-09-05 01:00:19 +0000 | [diff] [blame] | 1561 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1562 | (Contributed by Antoine Pitrou; :issue:`9203`.) |
| 1563 | |
Amaury Forgeot d'Arc | feb7307 | 2010-09-12 22:42:57 +0000 | [diff] [blame] | 1564 | * The option ``--with-wctype-functions`` was removed. The built-in unicode |
| 1565 | database is now used for all functions. |
| 1566 | |
| 1567 | (Contributed by Amaury Forgeot D'Arc; :issue:`9210`.) |
| 1568 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 1569 | * Hash values are now values of a new type, :c:type:`Py_hash_t`, which is |
| 1570 | defined to be the same size as a pointer. Previously they were of type long, |
| 1571 | which on some 64-bit operating systems is still only 32 bits long. As a |
| 1572 | result of this fix, :class:`set` and :class:`dict` can now hold more than |
| 1573 | ``2**32`` entries on builds with 64-bit pointers (previously, they could grow |
| 1574 | to that size but their performance degraded catastrophically). |
Skip Montanaro | 961aaf5 | 2010-10-17 22:22:24 +0000 | [diff] [blame] | 1575 | |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 1576 | (Suggested by Raymond Hettinger and implemented by Benjamin Peterson; |
| 1577 | :issue:`9778`.) |
| 1578 | |
| 1579 | * A new macro :c:macro:`Py_VA_COPY` copies the state of the variable argument |
| 1580 | list. It is equivalent to C99 *va_copy* but available on all python platforms |
| 1581 | (:issue:`2443`). |
| 1582 | |
Raymond Hettinger | bb9686f | 2010-12-16 00:53:05 +0000 | [diff] [blame] | 1583 | * A new C API function :c:func:`PySys_SetArgvEx` allows an embedded |
Raymond Hettinger | 480ed78 | 2010-12-15 22:07:15 +0000 | [diff] [blame] | 1584 | interpreter to set sys.argv without also modifying :attr:`sys.path` |
| 1585 | (:issue:`5753`). |
| 1586 | |
| 1587 | * :c:macro:`PyEval_CallObject` is now only available in macro form. The |
| 1588 | function declaration, which was kept for backwards compatibility reasons, is |
| 1589 | now removed -- the macro was introduced in 1997 (:issue:`8276`). |
| 1590 | |
| 1591 | * The is a new function :c:func:`PyLong_AsLongLongAndOverflow` which |
| 1592 | is analogous to :c:func:`PyLong_AsLongAndOverflow`. The both serve to |
| 1593 | convert Python :class:`int` into a native fixed-width type while providing |
| 1594 | detection of cases where the conversion won't fit (:issue:`7767`). |
| 1595 | |
| 1596 | * The :c:func:`PyUnicode_CompareWithASCIIString` now returns *not equal* |
| 1597 | if the Python string in *NUL* terminated. |
| 1598 | |
| 1599 | * There is a new function :c:func:`PyErr_NewExceptionWithDoc` that is |
| 1600 | like :c:func:`PyErr_NewException` but allows a docstring to be specified. |
| 1601 | This lets C exceptions have the same self-documenting capabilities as |
| 1602 | their pure Python counterparts (:issue:`7033`). |
| 1603 | |
| 1604 | * When compiled with the ``--with-valgrind`` option, the pymalloc |
| 1605 | allocator will be automatically disabled when running under Valgrind. This |
| 1606 | gives improved memory leak detection when running under Valgrind, while taking |
| 1607 | advantage of pymalloc at other times (:issue:`2422`). |
| 1608 | |
| 1609 | * Removed the "O?" format from the *PyArg_Parse* functions. The format is no |
| 1610 | longer used and it had never been documented (:issue:`8837`). |
| 1611 | |
| 1612 | There were a number of other small changes to the C-API. See the |
| 1613 | :file:`Misc/NEWS` file for a complete list. |
Skip Montanaro | 961aaf5 | 2010-10-17 22:22:24 +0000 | [diff] [blame] | 1614 | |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1615 | |
Raymond Hettinger | f558ddd | 2009-06-28 21:37:08 +0000 | [diff] [blame] | 1616 | Porting to Python 3.2 |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1617 | ===================== |
| 1618 | |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1619 | This section lists previously described changes and other bugfixes that may |
| 1620 | require changes to your code: |
Raymond Hettinger | 6e6565b | 2009-06-28 20:56:11 +0000 | [diff] [blame] | 1621 | |
Raymond Hettinger | d73be67 | 2010-12-18 10:48:26 +0000 | [diff] [blame] | 1622 | * The :mod:`configparser` module has a number of clean-ups. The major change is |
| 1623 | to replace the old :class:`ConfigParser` class with long-standing preferred |
| 1624 | alternative :class:`SafeConfigParser`. In addition there are a number of |
| 1625 | smaller incompatibilites: |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1626 | |
Raymond Hettinger | d73be67 | 2010-12-18 10:48:26 +0000 | [diff] [blame] | 1627 | * The interpolation syntax is now validated on |
| 1628 | :meth:`~configparser.ConfigParser.get` and |
| 1629 | :meth:`~configparser.ConfigParser.set` operations. In the default |
| 1630 | interpolation scheme, only two tokens with percent signs are valid: ``%(name)s`` |
| 1631 | and ``%%``, the latter being an escaped percent sign. |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1632 | |
Raymond Hettinger | d73be67 | 2010-12-18 10:48:26 +0000 | [diff] [blame] | 1633 | * The :meth:`~configparser.ConfigParser.set` and |
| 1634 | :meth:`~configparser.ConfigParser.add_section` methods now verify that |
| 1635 | values are actual strings. Formerly, unsupported types could be introduced |
| 1636 | unintentionally. |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1637 | |
Raymond Hettinger | 2b8861f | 2010-12-18 11:20:52 +0000 | [diff] [blame] | 1638 | * Duplicate sections or options from a single source now raise either |
Raymond Hettinger | d73be67 | 2010-12-18 10:48:26 +0000 | [diff] [blame] | 1639 | :exc:`~configparser.DuplicateSectionError` or |
| 1640 | :exc:`~configparser.DuplicateOptionError`. Formerly, duplicates would |
| 1641 | silently overwrite a previous entry. |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1642 | |
Raymond Hettinger | d73be67 | 2010-12-18 10:48:26 +0000 | [diff] [blame] | 1643 | * Inline comments are now disabled by default so now the **;** character |
| 1644 | can be safely used in values. |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1645 | |
Raymond Hettinger | d73be67 | 2010-12-18 10:48:26 +0000 | [diff] [blame] | 1646 | * Comments now can be indented. Consequently, for **;** or **#** to appear at |
| 1647 | the start of a line in multiline values, it has to be interpolated. This |
Raymond Hettinger | 2b8861f | 2010-12-18 11:20:52 +0000 | [diff] [blame] | 1648 | keeps comment prefix characters in values from being mistaken as comments. |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1649 | |
Raymond Hettinger | d73be67 | 2010-12-18 10:48:26 +0000 | [diff] [blame] | 1650 | * ``""`` is now a valid value and is no longer automatically converted to an |
| 1651 | empty string. For empty strings, use ``"option ="`` in a line. |
Łukasz Langa | 2b38b6c | 2010-12-17 21:57:32 +0000 | [diff] [blame] | 1652 | |
Antoine Pitrou | cd889af | 2010-10-06 21:13:56 +0000 | [diff] [blame] | 1653 | * The :mod:`nntplib` module was reworked extensively, meaning that its APIs |
| 1654 | are often incompatible with the 3.1 APIs. |
| 1655 | |
Raymond Hettinger | 1fa7682 | 2010-12-06 23:31:36 +0000 | [diff] [blame] | 1656 | * :class:`bytearray` objects can no longer be used as filenames; instead, |
| 1657 | they should be converted to :class:`bytes`. |
Victor Stinner | dcb2403 | 2010-04-22 12:08:36 +0000 | [diff] [blame] | 1658 | |
Victor Stinner | 25e8ec4 | 2010-06-25 00:02:38 +0000 | [diff] [blame] | 1659 | * PyArg_Parse*() functions: |
Victor Stinner | 3dcb5ac | 2010-06-08 22:54:19 +0000 | [diff] [blame] | 1660 | |
Victor Stinner | 25e8ec4 | 2010-06-25 00:02:38 +0000 | [diff] [blame] | 1661 | * "t#" format has been removed: use "s#" or "s*" instead |
| 1662 | * "w" and "w#" formats has been removed: use "w*" instead |
| 1663 | |
Georg Brandl | 60203b4 | 2010-10-06 10:11:56 +0000 | [diff] [blame] | 1664 | * The :c:type:`PyCObject` type, deprecated in 3.1, has been removed. To wrap |
| 1665 | opaque C pointers in Python objects, the :c:type:`PyCapsule` API should be used |
Éric Araujo | 4234ad4 | 2010-09-05 17:32:25 +0000 | [diff] [blame] | 1666 | instead; the new type has a well-defined interface for passing typing safety |
Georg Brandl | da0a211 | 2010-09-05 11:28:33 +0000 | [diff] [blame] | 1667 | information and a less complicated signature for calling a destructor. |
Victor Stinner | 0cbec57 | 2010-09-12 20:32:57 +0000 | [diff] [blame] | 1668 | |
Raymond Hettinger | e0a9600 | 2010-12-15 17:54:13 +0000 | [diff] [blame] | 1669 | * The :func:`sys.setfilesystemencoding` function was removed because |
| 1670 | it had a flawed design. |
Raymond Hettinger | 3fcf002 | 2010-12-08 01:13:53 +0000 | [diff] [blame] | 1671 | |
Raymond Hettinger | e0a9600 | 2010-12-15 17:54:13 +0000 | [diff] [blame] | 1672 | * The :func:`random.seed` function and method now salt string seeds with an |
| 1673 | sha512 hash function. To access the previous version of *seed* in order to |
| 1674 | reproduce Python 3.1 sequences, set the *version* argument to *1*, |
| 1675 | ``random.seed(s, version=1)``. |
Raymond Hettinger | 21ec4bc | 2010-12-10 01:09:01 +0000 | [diff] [blame] | 1676 | |
Raymond Hettinger | 522cc0a | 2010-12-10 01:19:15 +0000 | [diff] [blame] | 1677 | * The previously deprecated :func:`string.maketrans` function has been removed |
| 1678 | in favor of the static methods, :meth:`bytes.maketrans` and |
| 1679 | :meth:`bytearray.maketrans`. This change solves the confusion around which |
| 1680 | types were supported by the :mod:`string` module. Now, :class:`str`, |
| 1681 | :class:`bytes`, and :class:`bytearray` each have their own **maketrans** and |
| 1682 | **translate** methods with intermediate translation tables of the appropriate |
| 1683 | type. |
| 1684 | |
| 1685 | (Contributed by Georg Brandl; :issue:`5675`.) |
| 1686 | |
| 1687 | * The previously deprecated :func:`contextlib.nested` function has been removed |
| 1688 | in favor of a plain :keyword:`with` statement which can accept multiple |
| 1689 | context managers. The latter technique is faster (because it is built-in), |
| 1690 | and it does a better job finalizing multiple context managers when one of them |
| 1691 | raises an exception:: |
| 1692 | |
| 1693 | >>> with open('mylog.txt') as infile, open('a.out', 'w') as outfile: |
| 1694 | ... for line in infile: |
| 1695 | ... if '<critical>' in line: |
| 1696 | ... outfile.write(line) |
| 1697 | |
| 1698 | (Contributed by Georg Brandl and Mattias Brändström; |
| 1699 | `appspot issue 53094 <http://codereview.appspot.com/53094>`_.) |