blob: 0d4e3cd23cbba4581c59cf4ff3dc2c6dc9815516 [file] [log] [blame]
Brett Cannon8045d972011-02-03 22:01:54 +00001.. _pyporting-howto:
2
3*********************************
4Porting Python 2 Code to Python 3
5*********************************
6
7:author: Brett Cannon
8
9.. topic:: Abstract
10
Brett Cannon4b0c24a2011-02-03 22:14:58 +000011 With Python 3 being the future of Python while Python 2 is still in active
12 use, it is good to have your project available for both major releases of
Brett Cannon9ca21b32014-01-07 11:52:04 -050013 Python. This guide is meant to help you figure out how best to support both
14 Python 2 & 3 simultaneously.
Brett Cannon8045d972011-02-03 22:01:54 +000015
Brett Cannon4b0c24a2011-02-03 22:14:58 +000016 If you are looking to port an extension module instead of pure Python code,
Éric Araujo5405a0b2011-02-05 16:03:12 +000017 please see :ref:`cporting-howto`.
Brett Cannon8045d972011-02-03 22:01:54 +000018
Brett Cannon9ca21b32014-01-07 11:52:04 -050019 If you would like to read one core Python developer's take on why Python 3
20 came into existence, you can read Nick Coghlan's `Python 3 Q & A`_.
Brett Cannon8045d972011-02-03 22:01:54 +000021
Brett Cannon9ca21b32014-01-07 11:52:04 -050022 If you prefer to read a (free) book on porting a project to Python 3,
23 consider reading `Porting to Python 3`_ by Lennart Regebro which should cover
24 much of what is discussed in this HOWTO.
Georg Brandl2cb2fa92011-02-07 15:30:45 +000025
Brett Cannon9ca21b32014-01-07 11:52:04 -050026 For help with porting, you can email the python-porting_ mailing list with
27 questions.
Brett Cannon8045d972011-02-03 22:01:54 +000028
Larry Hastings3732ed22014-03-15 21:13:56 -070029The Short Version
30=================
31
32* Decide what's the oldest version of Python 2 you want to support (if at all)
33* Make sure you have a thorough test suite and use continuous integration
34 testing to make sure you stay compatible with the versions of Python you care
35 about
36* If you have dependencies, check their Python 3 status using caniusepython3
37 (`command-line tool <https://pypi.python.org/pypi/caniusepython3>`__,
38 `web app <https://caniusepython3.com/>`__)
39
40With that done, your options are:
41
Georg Brandl728e4de2014-10-29 09:00:30 +010042* If you are dropping Python 2 support, use :ref:`2to3 <2to3-reference>` to port
43 to Python 3
44
Larry Hastings3732ed22014-03-15 21:13:56 -070045* If you are keeping Python 2 support, then start writing Python 2/3-compatible
46 code starting **TODAY**
47
48 + If you have dependencies that have not been ported, reach out to them to port
49 their project while working to make your code compatible with Python 3 so
50 you're ready when your dependencies are all ported
51 + If all your dependencies have been ported (or you have none), go ahead and
52 port to Python 3
53
54* If you are creating a new project that wants to have 2/3 compatibility,
55 code in Python 3 and then backport to Python 2
56
Brett Cannon8045d972011-02-03 22:01:54 +000057
Brett Cannon9ca21b32014-01-07 11:52:04 -050058Before You Begin
59================
Georg Brandl2cb2fa92011-02-07 15:30:45 +000060
Brett Cannon9ca21b32014-01-07 11:52:04 -050061If your project is on the Cheeseshop_/PyPI_, make sure it has the proper
62`trove classifiers`_ to signify what versions of Python it **currently**
63supports. At minimum you should specify the major version(s), e.g.
64``Programming Language :: Python :: 2`` if your project currently only supports
65Python 2. It is preferrable that you be as specific as possible by listing every
66major/minor version of Python that you support, e.g. if your project supports
67Python 2.6 and 2.7, then you want the classifiers of::
Brett Cannon8045d972011-02-03 22:01:54 +000068
Brett Cannon9ca21b32014-01-07 11:52:04 -050069 Programming Language :: Python :: 2
70 Programming Language :: Python :: 2.6
71 Programming Language :: Python :: 2.7
Brett Cannon8045d972011-02-03 22:01:54 +000072
Brett Cannon9ca21b32014-01-07 11:52:04 -050073Once your project supports Python 3 you will want to go back and add the
74appropriate classifiers for Python 3 as well. This is important as setting the
75``Programming Language :: Python :: 3`` classifier will lead to your project
76being listed under the `Python 3 Packages`_ section of PyPI.
Brett Cannon8045d972011-02-03 22:01:54 +000077
Brett Cannon9ca21b32014-01-07 11:52:04 -050078Make sure you have a robust test suite. You need to
79make sure everything continues to work, just like when you support a new
80minor/feature release of Python. This means making sure your test suite is
81thorough and is ported properly between Python 2 & 3 (consider using coverage_
82to measure that you have effective test coverage). You will also most likely
83want to use something like tox_ to automate testing between all of your
84supported versions of Python. You will also want to **port your tests first** so
85that you can make sure that you detect breakage during the transition. Tests also
86tend to be simpler than the code they are testing so it gives you an idea of how
87easy it can be to port code.
Brett Cannon8045d972011-02-03 22:01:54 +000088
Georg Brandl9bdcb3b2014-10-29 09:37:43 +010089Drop support for older Python versions if possible. Python 2.5
Eli Bendersky2d062de2011-02-07 04:19:57 +000090introduced a lot of useful syntax and libraries which have become idiomatic
Georg Brandl9bdcb3b2014-10-29 09:37:43 +010091in Python 3. Python 2.6 introduced future statements which makes
Eli Bendersky2d062de2011-02-07 04:19:57 +000092compatibility much easier if you are going from Python 2 to 3.
Georg Brandl9bdcb3b2014-10-29 09:37:43 +010093Python 2.7 continues the trend in the stdlib. Choose the newest version
Eli Bendersky2d062de2011-02-07 04:19:57 +000094of Python which you believe can be your minimum support version
Brett Cannon8045d972011-02-03 22:01:54 +000095and work from there.
96
Brett Cannon9ca21b32014-01-07 11:52:04 -050097Target the newest version of Python 3 that you can. Beyond just the usual
Brett Cannonb7e6b892013-03-09 14:22:35 -050098bugfixes, compatibility has continued to improve between Python 2 and 3 as time
Brett Cannon9ca21b32014-01-07 11:52:04 -050099has passed. E.g. Python 3.3 added back the ``u`` prefix for
100strings, making source-compatible Python code easier to write.
Brett Cannonb7e6b892013-03-09 14:22:35 -0500101
Brett Cannon8045d972011-02-03 22:01:54 +0000102
Brett Cannon9ca21b32014-01-07 11:52:04 -0500103Writing Source-Compatible Python 2/3 Code
104=========================================
105
106Over the years the Python community has discovered that the easiest way to
107support both Python 2 and 3 in parallel is to write Python code that works in
108either version. While this might sound counter-intuitive at first, it actually
109is not difficult and typically only requires following some select
110(non-idiomatic) practices and using some key projects to help make bridging
111between Python 2 and 3 easier.
112
113Projects to Consider
114--------------------
115
Larry Hastings3732ed22014-03-15 21:13:56 -0700116The lowest level library for supporting Python 2 & 3 simultaneously is six_.
Brett Cannon9ca21b32014-01-07 11:52:04 -0500117Reading through its documentation will give you an idea of where exactly the
118Python language changed between versions 2 & 3 and thus what you will want the
119library to help you continue to support.
120
121To help automate porting your code over to using six, you can use
122modernize_. This project will attempt to rewrite your code to be as modern as
123possible while using six to smooth out any differences between Python 2 & 3.
124
125If you want to write your compatible code to feel more like Python 3 there is
126the future_ project. It tries to provide backports of objects from Python 3 so
127that you can use them from Python 2-compatible code, e.g. replacing the
128``bytes`` type from Python 2 with the one from Python 3.
129It also provides a translation script like modernize (its translation code is
130actually partially based on it) to help start working with a pre-existing code
131base. It is also unique in that its translation script will also port Python 3
132code backwards as well as Python 2 code forwards.
Brett Cannon8045d972011-02-03 22:01:54 +0000133
134
Brett Cannon9ca21b32014-01-07 11:52:04 -0500135Tips & Tricks
136-------------
Brett Cannon8045d972011-02-03 22:01:54 +0000137
Brett Cannon9ca21b32014-01-07 11:52:04 -0500138To help with writing source-compatible code using one of the projects mentioned
139in `Projects to Consider`_, consider following the below suggestions. Some of
140them are handled by the suggested projects, so if you do use one of them then
141read their documentation first to see which suggestions below will taken care of
142for you.
Brett Cannon8045d972011-02-03 22:01:54 +0000143
144Support Python 2.7
Brett Cannon9ca21b32014-01-07 11:52:04 -0500145//////////////////
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000146
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100147As a first step, make sure that your project is compatible with Python 2.7.
Brett Cannon8045d972011-02-03 22:01:54 +0000148This is just good to do as Python 2.7 is the last release of Python 2 and thus
149will be used for a rather long time. It also allows for use of the ``-3`` flag
Brett Cannon9ca21b32014-01-07 11:52:04 -0500150to Python to help discover places in your code where compatibility might be an
151issue (the ``-3`` flag is in Python 2.6 but Python 2.7 adds more warnings).
Brett Cannon8045d972011-02-03 22:01:54 +0000152
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100153Try to Support Python 2.6 and Newer Only
154////////////////////////////////////////
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000155
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100156While not possible for all projects, if you can support Python 2.6 and newer
Brett Cannon8045d972011-02-03 22:01:54 +0000157**only**, your life will be much easier. Various future statements, stdlib
158additions, etc. exist only in Python 2.6 and later which greatly assist in
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100159supporting Python 3. But if you project must keep support for Python 2.5 then
Brett Cannon9ca21b32014-01-07 11:52:04 -0500160it is still possible to simultaneously support Python 3.
Brett Cannon8045d972011-02-03 22:01:54 +0000161
162Below are the benefits you gain if you only have to support Python 2.6 and
163newer. Some of these options are personal choice while others are
164**strongly** recommended (the ones that are more for personal choice are
165labeled as such). If you continue to support older versions of Python then you
Brett Cannon9ca21b32014-01-07 11:52:04 -0500166at least need to watch out for situations that these solutions fix and handle
167them appropriately (which is where library help from e.g. six_ comes in handy).
Brett Cannon8045d972011-02-03 22:01:54 +0000168
169
Brett Cannon8045d972011-02-03 22:01:54 +0000170``from __future__ import print_function``
171'''''''''''''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000172
Brett Cannon9ca21b32014-01-07 11:52:04 -0500173It will not only get you used to typing ``print()`` as a function instead of a
174statement, but it will also give you the various benefits the function has over
175the Python 2 statement (six_ provides a function if you support Python 2.5 or
176older).
Brett Cannon8045d972011-02-03 22:01:54 +0000177
178
179``from __future__ import unicode_literals``
180'''''''''''''''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000181
Brett Cannonf0996a92014-01-17 11:45:01 -0500182If you choose to use this future statement then all string literals in
183Python 2 will be assumed to be Unicode (as is already the case in Python 3).
184If you choose not to use this future statement then you should mark all of your
Brett Cannon9ca21b32014-01-07 11:52:04 -0500185text strings with a ``u`` prefix and only support Python 3.3 or newer. But you
186are **strongly** advised to do one or the other (six_ provides a function in
187case you don't want to use the future statement **and** you want to support
188Python 3.2 or older).
Brett Cannon8045d972011-02-03 22:01:54 +0000189
190
Brett Cannonf0996a92014-01-17 11:45:01 -0500191Bytes/string literals
192'''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000193
Brett Cannon9ca21b32014-01-07 11:52:04 -0500194This is a **very** important one. Prefix Python 2 strings that
195are meant to contain bytes with a ``b`` prefix to very clearly delineate
196what is and is not a Python 3 text string (six_ provides a function to use for
197Python 2.5 compatibility).
Brett Cannon8045d972011-02-03 22:01:54 +0000198
Brett Cannonb7e6b892013-03-09 14:22:35 -0500199This point cannot be stressed enough: make sure you know what all of your string
Brett Cannon9ca21b32014-01-07 11:52:04 -0500200literals in Python 2 are meant to be in Python 3. Any string literal that
Brett Cannonb7e6b892013-03-09 14:22:35 -0500201should be treated as bytes should have the ``b`` prefix. Any string literal
202that should be Unicode/text in Python 2 should either have the ``u`` literal
203(supported, but ignored, in Python 3.3 and later) or you should have
204``from __future__ import unicode_literals`` at the top of the file. But the key
Brett Cannon9ca21b32014-01-07 11:52:04 -0500205point is you should know how Python 3 will treat every one one of your string
Brett Cannonb7e6b892013-03-09 14:22:35 -0500206literals and you should mark them as appropriate.
207
Brett Cannon8045d972011-02-03 22:01:54 +0000208There are some differences between byte literals in Python 2 and those in
209Python 3 thanks to the bytes type just being an alias to ``str`` in Python 2.
Brett Cannon9ca21b32014-01-07 11:52:04 -0500210See the `Handle Common "Gotchas"`_ section for what to watch out for.
Brett Cannon8045d972011-02-03 22:01:54 +0000211
Brett Cannon9ca21b32014-01-07 11:52:04 -0500212``from __future__ import absolute_import``
213''''''''''''''''''''''''''''''''''''''''''
214Discussed in more detail below, but you should use this future statement to
215prevent yourself from accidentally using implicit relative imports.
Brett Cannon8045d972011-02-03 22:01:54 +0000216
217
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100218Supporting Python 2.5 and Newer Only
219////////////////////////////////////
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000220
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100221If you are supporting Python 2.5 and newer there are still some features of
Brett Cannonce71ab22011-02-05 22:05:05 +0000222Python that you can utilize.
223
224
Ezio Melottic17c1f62011-04-21 14:49:03 +0300225``from __future__ import absolute_import``
226''''''''''''''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000227
Brett Cannonce71ab22011-02-05 22:05:05 +0000228Implicit relative imports (e.g., importing ``spam.bacon`` from within
Brett Cannon9ca21b32014-01-07 11:52:04 -0500229``spam.eggs`` with the statement ``import bacon``) do not work in Python 3.
Brett Cannonce71ab22011-02-05 22:05:05 +0000230This future statement moves away from that and allows the use of explicit
231relative imports (e.g., ``from . import bacon``).
232
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100233In Python 2.5 you must use
Brett Cannonce71ab22011-02-05 22:05:05 +0000234the __future__ statement to get to use explicit relative imports and prevent
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100235implicit ones. In Python 2.6 explicit relative imports are available without
Brett Cannonce71ab22011-02-05 22:05:05 +0000236the statement, but you still want the __future__ statement to prevent implicit
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100237relative imports. In Python 2.7 the __future__ statement is not needed. In
Brett Cannonce71ab22011-02-05 22:05:05 +0000238other words, unless you are only supporting Python 2.7 or a version earlier
Brett Cannon9ca21b32014-01-07 11:52:04 -0500239than Python 2.5, use this __future__ statement.
Brett Cannonce71ab22011-02-05 22:05:05 +0000240
241
Brett Cannonb7e6b892013-03-09 14:22:35 -0500242Mark all Unicode strings with a ``u`` prefix
243'''''''''''''''''''''''''''''''''''''''''''''
244
245While Python 2.6 has a ``__future__`` statement to automatically cause Python 2
246to treat all string literals as Unicode, Python 2.5 does not have that shortcut.
247This means you should go through and mark all string literals with a ``u``
Brett Cannon9ca21b32014-01-07 11:52:04 -0500248prefix to turn them explicitly into text strings where appropriate and only
249support Python 3.3 or newer. Otherwise use a project like six_ which provides a
250function to pass all text string literals through.
Brett Cannonb7e6b892013-03-09 14:22:35 -0500251
252
Brett Cannon9ca21b32014-01-07 11:52:04 -0500253Capturing the Currently Raised Exception
254''''''''''''''''''''''''''''''''''''''''
255
256In Python 2.5 and earlier the syntax to access the current exception is::
257
258 try:
259 raise Exception()
260 except Exception, exc:
261 # Current exception is 'exc'.
262 pass
263
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100264This syntax changed in Python 3 (and backported to Python 2.6 and later)
Brett Cannon9ca21b32014-01-07 11:52:04 -0500265to::
266
267 try:
268 raise Exception()
269 except Exception as exc:
270 # Current exception is 'exc'.
271 # In Python 3, 'exc' is restricted to the block; in Python 2.6/2.7 it will "leak".
272 pass
273
274Because of this syntax change you must change how you capture the current
275exception in Python 2.5 and earlier to::
276
277 try:
278 raise Exception()
279 except Exception:
280 import sys
281 exc = sys.exc_info()[1]
282 # Current exception is 'exc'.
283 pass
284
285You can get more information about the raised exception from
286:func:`sys.exc_info` than simply the current exception instance, but you most
287likely don't need it.
288
289.. note::
290 In Python 3, the traceback is attached to the exception instance
291 through the ``__traceback__`` attribute. If the instance is saved in
292 a local variable that persists outside of the ``except`` block, the
293 traceback will create a reference cycle with the current frame and its
294 dictionary of local variables. This will delay reclaiming dead
295 resources until the next cyclic :term:`garbage collection` pass.
296
297 In Python 2, this problem only occurs if you save the traceback itself
298 (e.g. the third element of the tuple returned by :func:`sys.exc_info`)
299 in a variable.
300
Brett Cannonce71ab22011-02-05 22:05:05 +0000301
Brett Cannon8045d972011-02-03 22:01:54 +0000302Handle Common "Gotchas"
Brett Cannon9ca21b32014-01-07 11:52:04 -0500303///////////////////////
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000304
Brett Cannon9ca21b32014-01-07 11:52:04 -0500305These are things to watch out for no matter what version of Python 2 you are
306supporting which are not syntactic considerations.
Brett Cannon8045d972011-02-03 22:01:54 +0000307
308
Brett Cannonce71ab22011-02-05 22:05:05 +0000309``from __future__ import division``
310'''''''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000311
Brett Cannonce71ab22011-02-05 22:05:05 +0000312While the exact same outcome can be had by using the ``-Qnew`` argument to
313Python, using this future statement lifts the requirement that your users use
314the flag to get the expected behavior of division in Python 3
315(e.g., ``1/2 == 0.5; 1//2 == 0``).
316
317
318
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000319Specify when opening a file as binary
320'''''''''''''''''''''''''''''''''''''
321
322Unless you have been working on Windows, there is a chance you have not always
323bothered to add the ``b`` mode when opening a binary file (e.g., ``rb`` for
324binary reading). Under Python 3, binary files and text files are clearly
325distinct and mutually incompatible; see the :mod:`io` module for details.
326Therefore, you **must** make a decision of whether a file will be used for
327binary access (allowing to read and/or write bytes data) or text access
328(allowing to read and/or write unicode data).
329
330Text files
331''''''''''
332
333Text files created using ``open()`` under Python 2 return byte strings,
334while under Python 3 they return unicode strings. Depending on your porting
335strategy, this can be an issue.
336
337If you want text files to return unicode strings in Python 2, you have two
338possibilities:
339
340* Under Python 2.6 and higher, use :func:`io.open`. Since :func:`io.open`
341 is essentially the same function in both Python 2 and Python 3, it will
342 help iron out any issues that might arise.
343
344* If pre-2.6 compatibility is needed, then you should use :func:`codecs.open`
345 instead. This will make sure that you get back unicode strings in Python 2.
346
Brett Cannon8045d972011-02-03 22:01:54 +0000347Subclass ``object``
348'''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000349
Georg Brandl9bdcb3b2014-10-29 09:37:43 +0100350New-style classes have been around since Python 2.2. You need to make sure
Brett Cannonce71ab22011-02-05 22:05:05 +0000351you are subclassing from ``object`` to avoid odd edge cases involving method
Brett Cannon8045d972011-02-03 22:01:54 +0000352resolution order, etc. This continues to be totally valid in Python 3 (although
353unneeded as all classes implicitly inherit from ``object``).
354
355
356Deal With the Bytes/String Dichotomy
357''''''''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000358
Brett Cannon8045d972011-02-03 22:01:54 +0000359One of the biggest issues people have when porting code to Python 3 is handling
360the bytes/string dichotomy. Because Python 2 allowed the ``str`` type to hold
361textual data, people have over the years been rather loose in their delineation
362of what ``str`` instances held text compared to bytes. In Python 3 you cannot
Brett Cannon9ca21b32014-01-07 11:52:04 -0500363be so care-free anymore and need to properly handle the difference. The key to
R David Murray790e0052012-04-23 14:44:00 -0400364handling this issue is to make sure that **every** string literal in your
Brett Cannon9ca21b32014-01-07 11:52:04 -0500365Python 2 code is either syntactically or functionally marked as either bytes or
Brett Cannon8045d972011-02-03 22:01:54 +0000366text data. After this is done you then need to make sure your APIs are designed
367to either handle a specific type or made to be properly polymorphic.
368
369
370Mark Up Python 2 String Literals
371********************************
372
373First thing you must do is designate every single string literal in Python 2
374as either textual or bytes data. If you are only supporting Python 2.6 or
375newer, this can be accomplished by marking bytes literals with a ``b`` prefix
376and then designating textual data with a ``u`` prefix or using the
377``unicode_literals`` future statement.
378
R David Murray790e0052012-04-23 14:44:00 -0400379If your project supports versions of Python predating 2.6, then you should use
Brett Cannon8045d972011-02-03 22:01:54 +0000380the six_ project and its ``b()`` function to denote bytes literals. For text
381literals you can either use six's ``u()`` function or use a ``u`` prefix.
382
383
384Decide what APIs Will Accept
385****************************
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000386
Brett Cannon8045d972011-02-03 22:01:54 +0000387In Python 2 it was very easy to accidentally create an API that accepted both
388bytes and textual data. But in Python 3, thanks to the more strict handling of
389disparate types, this loose usage of bytes and text together tends to fail.
390
391Take the dict ``{b'a': 'bytes', u'a': 'text'}`` in Python 2.6. It creates the
392dict ``{u'a': 'text'}`` since ``b'a' == u'a'``. But in Python 3 the equivalent
393dict creates ``{b'a': 'bytes', 'a': 'text'}``, i.e., no lost data. Similar
394issues can crop up when transitioning Python 2 code to Python 3.
395
396This means you need to choose what an API is going to accept and create and
397consistently stick to that API in both Python 2 and 3.
398
399
Brett Cannonce71ab22011-02-05 22:05:05 +0000400Bytes / Unicode Comparison
Antoine Pitrou8d8f7c52011-02-05 11:40:05 +0000401**************************
402
403In Python 3, mixing bytes and unicode is forbidden in most situations; it
404will raise a :class:`TypeError` where Python 2 would have attempted an implicit
405coercion between types. However, there is one case where it doesn't and
406it can be very misleading::
407
408 >>> b"" == ""
409 False
410
Brett Cannona2f15442011-02-09 22:55:13 +0000411This is because an equality comparison is required by the language to always
Antoine Pitrou8d8f7c52011-02-05 11:40:05 +0000412succeed (and return ``False`` for incompatible types). However, this also
413means that code incorrectly ported to Python 3 can display buggy behaviour
414if such comparisons are silently executed. To detect such situations,
415Python 3 has a ``-b`` flag that will display a warning::
416
417 $ python3 -b
418 >>> b"" == ""
419 __main__:1: BytesWarning: Comparison between bytes and string
420 False
421
422To turn the warning into an exception, use the ``-bb`` flag instead::
423
424 $ python3 -bb
425 >>> b"" == ""
426 Traceback (most recent call last):
427 File "<stdin>", line 1, in <module>
428 BytesWarning: Comparison between bytes and string
429
430
Antoine Pitroubd866e92011-02-05 12:13:38 +0000431Indexing bytes objects
432''''''''''''''''''''''
433
434Another potentially surprising change is the indexing behaviour of bytes
435objects in Python 3::
436
437 >>> b"xyz"[0]
438 120
439
440Indeed, Python 3 bytes objects (as well as :class:`bytearray` objects)
441are sequences of integers. But code converted from Python 2 will often
442assume that indexing a bytestring produces another bytestring, not an
443integer. To reconcile both behaviours, use slicing::
444
445 >>> b"xyz"[0:1]
446 b'x'
447 >>> n = 1
448 >>> b"xyz"[n:n+1]
449 b'y'
450
451The only remaining gotcha is that an out-of-bounds slice returns an empty
452bytes object instead of raising ``IndexError``:
453
454 >>> b"xyz"[3]
455 Traceback (most recent call last):
456 File "<stdin>", line 1, in <module>
457 IndexError: index out of range
458 >>> b"xyz"[3:4]
459 b''
460
461
Brett Cannon8045d972011-02-03 22:01:54 +0000462``__str__()``/``__unicode__()``
463'''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000464
Brett Cannon8045d972011-02-03 22:01:54 +0000465In Python 2, objects can specify both a string and unicode representation of
466themselves. In Python 3, though, there is only a string representation. This
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000467becomes an issue as people can inadvertently do things in their ``__str__()``
Brett Cannon8045d972011-02-03 22:01:54 +0000468methods which have unpredictable results (e.g., infinite recursion if you
469happen to use the ``unicode(self).encode('utf8')`` idiom as the body of your
470``__str__()`` method).
471
Brett Cannon9ca21b32014-01-07 11:52:04 -0500472You can use a mixin class to work around this. This allows you to only define a
Brett Cannon8045d972011-02-03 22:01:54 +0000473``__unicode__()`` method for your class and let the mixin derive
474``__str__()`` for you (code from
475http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/)::
476
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000477 import sys
Brett Cannon8045d972011-02-03 22:01:54 +0000478
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000479 class UnicodeMixin(object):
Brett Cannon8045d972011-02-03 22:01:54 +0000480
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000481 """Mixin class to handle defining the proper __str__/__unicode__
482 methods in Python 2 or 3."""
Brett Cannon8045d972011-02-03 22:01:54 +0000483
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000484 if sys.version_info[0] >= 3: # Python 3
485 def __str__(self):
486 return self.__unicode__()
487 else: # Python 2
488 def __str__(self):
489 return self.__unicode__().encode('utf8')
Brett Cannon8045d972011-02-03 22:01:54 +0000490
491
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000492 class Spam(UnicodeMixin):
Brett Cannon8045d972011-02-03 22:01:54 +0000493
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000494 def __unicode__(self):
495 return u'spam-spam-bacon-spam' # 2to3 will remove the 'u' prefix
Brett Cannon8045d972011-02-03 22:01:54 +0000496
497
Brett Cannon8045d972011-02-03 22:01:54 +0000498Don't Index on Exceptions
499'''''''''''''''''''''''''
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000500
Brett Cannon8045d972011-02-03 22:01:54 +0000501In Python 2, the following worked::
502
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000503 >>> exc = Exception(1, 2, 3)
504 >>> exc.args[1]
505 2
506 >>> exc[1] # Python 2 only!
507 2
Brett Cannon8045d972011-02-03 22:01:54 +0000508
Eli Bendersky7ac34192011-02-07 04:44:19 +0000509But in Python 3, indexing directly on an exception is an error. You need to
510make sure to only index on the :attr:`BaseException.args` attribute which is a
Brett Cannon8045d972011-02-03 22:01:54 +0000511sequence containing all arguments passed to the :meth:`__init__` method.
512
Eli Bendersky7ac34192011-02-07 04:44:19 +0000513Even better is to use the documented attributes the exception provides.
Brett Cannon8045d972011-02-03 22:01:54 +0000514
Brett Cannon9ca21b32014-01-07 11:52:04 -0500515
Brett Cannon8045d972011-02-03 22:01:54 +0000516Don't use ``__getslice__`` & Friends
517''''''''''''''''''''''''''''''''''''
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000518
Brett Cannon8045d972011-02-03 22:01:54 +0000519Been deprecated for a while, but Python 3 finally drops support for
520``__getslice__()``, etc. Move completely over to :meth:`__getitem__` and
521friends.
522
523
Brett Cannon45aa7cc2011-02-05 22:16:40 +0000524Updating doctests
525'''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000526
Brett Cannon9ca21b32014-01-07 11:52:04 -0500527Don't forget to make them Python 2/3 compatible as well. If you wrote a
528monolithic set of doctests (e.g., a single docstring containing all of your
529doctests), you should at least consider breaking the doctests up into smaller
530pieces to make it more manageable to fix. Otherwise it might very well be worth
531your time and effort to port your tests to :mod:`unittest`.
Brett Cannon8045d972011-02-03 22:01:54 +0000532
533
Brett Cannon9ca21b32014-01-07 11:52:04 -0500534Update ``map`` for imbalanced input sequences
535'''''''''''''''''''''''''''''''''''''''''''''
Jason R. Coombsa90e3642011-12-03 08:24:21 -0500536
Brett Cannonf0996a92014-01-17 11:45:01 -0500537With Python 2, when ``map`` was given more than one input sequence it would pad
Georg Brandl6b4c8472014-10-30 22:26:26 +0100538the shorter sequences with ``None`` values, returning a sequence as long as the
Brett Cannonf0996a92014-01-17 11:45:01 -0500539longest input sequence.
Jason R. Coombsa90e3642011-12-03 08:24:21 -0500540
Brett Cannon9ca21b32014-01-07 11:52:04 -0500541With Python 3, if the input sequences to ``map`` are of unequal length, ``map``
Jason R. Coombsa90e3642011-12-03 08:24:21 -0500542will stop at the termination of the shortest of the sequences. For full
Brett Cannonf0996a92014-01-17 11:45:01 -0500543compatibility with ``map`` from Python 2.x, wrap the sequence arguments in
Jason R. Coombsa90e3642011-12-03 08:24:21 -0500544:func:`itertools.zip_longest`, e.g. ``map(func, *sequences)`` becomes
545``list(map(func, itertools.zip_longest(*sequences)))``.
546
Brett Cannon8045d972011-02-03 22:01:54 +0000547Eliminate ``-3`` Warnings
548-------------------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000549
Brett Cannon8045d972011-02-03 22:01:54 +0000550When you run your application's test suite, run it using the ``-3`` flag passed
551to Python. This will cause various warnings to be raised during execution about
Brett Cannonf0996a92014-01-17 11:45:01 -0500552things that are semantic changes between Python 2 and 3. Try to eliminate those
553warnings to make your code even more portable to Python 3.
Brett Cannon8045d972011-02-03 22:01:54 +0000554
555
Brett Cannon9ca21b32014-01-07 11:52:04 -0500556Alternative Approaches
557======================
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000558
Brett Cannon9ca21b32014-01-07 11:52:04 -0500559While supporting Python 2 & 3 simultaneously is typically the preferred choice
560by people so that they can continue to improve code and have it work for the
561most number of users, your life may be easier if you only have to support one
562major version of Python going forward.
563
564Supporting Only Python 3 Going Forward From Python 2 Code
565---------------------------------------------------------
566
567If you have Python 2 code but going forward only want to improve it as Python 3
Georg Brandl5aa761d2014-10-30 22:52:02 +0100568code, then you can use :ref:`2to3 <2to3-reference>` to translate your Python 2
569code to Python 3 code. This is only recommended, though, if your current
570version of your project is going into maintenance mode and you want all new
571features to be exclusive to Python 3.
Brett Cannon8045d972011-02-03 22:01:54 +0000572
573
Brett Cannon9ca21b32014-01-07 11:52:04 -0500574Backporting Python 3 code to Python 2
575-------------------------------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000576
Brett Cannon9ca21b32014-01-07 11:52:04 -0500577If you have Python 3 code and have little interest in supporting Python 2 you
578can use 3to2_ to translate from Python 3 code to Python 2 code. This is only
Larry Hastings3732ed22014-03-15 21:13:56 -0700579recommended if you don't plan to heavily support Python 2 users. Otherwise
580write your code for Python 3 and then backport as far back as you want. This
581is typically easier than going from Python 2 to 3 as you will have worked out
582any difficulties with e.g. bytes/strings, etc.
Brett Cannon8045d972011-02-03 22:01:54 +0000583
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000584
Brett Cannon8045d972011-02-03 22:01:54 +0000585Other Resources
586===============
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000587
Brett Cannon6277fa42011-02-18 01:34:28 +0000588The authors of the following blog posts, wiki pages, and books deserve special
589thanks for making public their tips for porting Python 2 code to Python 3 (and
Brett Cannon9ca21b32014-01-07 11:52:04 -0500590thus helping provide information for this document and its various revisions
591over the years):
Brett Cannon8045d972011-02-03 22:01:54 +0000592
Georg Brandle73778c2014-10-29 08:36:35 +0100593* https://wiki.python.org/moin/PortingPythonToPy3k
Brett Cannon6277fa42011-02-18 01:34:28 +0000594* http://python3porting.com/
Brett Cannon8045d972011-02-03 22:01:54 +0000595* http://docs.pythonsprints.com/python3_porting/py-porting.html
596* http://techspot.zzzeek.org/2011/01/24/zzzeek-s-guide-to-python-3-porting/
597* http://dabeaz.blogspot.com/2011/01/porting-py65-and-my-superboard-to.html
598* http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/
599* http://lucumr.pocoo.org/2010/2/11/porting-to-python-3-a-guide/
Brett Cannonb7e6b892013-03-09 14:22:35 -0500600* https://wiki.ubuntu.com/Python/3
Brett Cannon8045d972011-02-03 22:01:54 +0000601
602If you feel there is something missing from this document that should be added,
603please email the python-porting_ mailing list.
604
Brett Cannon9ca21b32014-01-07 11:52:04 -0500605
Brett Cannon9ca21b32014-01-07 11:52:04 -0500606.. _3to2: https://pypi.python.org/pypi/3to2
607.. _Cheeseshop: PyPI_
608.. _coverage: https://pypi.python.org/pypi/coverage
609.. _future: http://python-future.org/
610.. _modernize: https://github.com/mitsuhiko/python-modernize
611.. _Porting to Python 3: http://python3porting.com/
Georg Brandlb7354a62014-10-29 10:57:37 +0100612.. _PyPI: https://pypi.python.org/pypi
Brett Cannon9ca21b32014-01-07 11:52:04 -0500613.. _Python 3 Packages: https://pypi.python.org/pypi?:action=browse&c=533&show=all
614.. _Python 3 Q & A: http://ncoghlan-devs-python-notes.readthedocs.org/en/latest/python3/questions_and_answers.html
Georg Brandle73778c2014-10-29 08:36:35 +0100615.. _python-porting: https://mail.python.org/mailman/listinfo/python-porting
Brett Cannon9ca21b32014-01-07 11:52:04 -0500616.. _six: https://pypi.python.org/pypi/six
617.. _tox: https://pypi.python.org/pypi/tox
618.. _trove classifiers: https://pypi.python.org/pypi?%3Aaction=list_classifiers