blob: 124ef33ae4b52c8c98cd8808557f0b4f1e7caa32 [file] [log] [blame]
Brett Cannon8045d972011-02-03 22:01:54 +00001.. _pyporting-howto:
2
3*********************************
4Porting Python 2 Code to Python 3
5*********************************
6
7:author: Brett Cannon
8
9.. topic:: Abstract
10
Brett Cannon4b0c24a2011-02-03 22:14:58 +000011 With Python 3 being the future of Python while Python 2 is still in active
12 use, it is good to have your project available for both major releases of
13 Python. This guide is meant to help you choose which strategy works best
14 for your project to support both Python 2 & 3 along with how to execute
15 that strategy.
Brett Cannon8045d972011-02-03 22:01:54 +000016
Brett Cannon4b0c24a2011-02-03 22:14:58 +000017 If you are looking to port an extension module instead of pure Python code,
Éric Araujo5405a0b2011-02-05 16:03:12 +000018 please see :ref:`cporting-howto`.
Brett Cannon8045d972011-02-03 22:01:54 +000019
20
21Choosing a Strategy
22===================
Georg Brandl2cb2fa92011-02-07 15:30:45 +000023
Brett Cannon8045d972011-02-03 22:01:54 +000024When a project makes the decision that it's time to support both Python 2 & 3,
25a decision needs to be made as to how to go about accomplishing that goal.
Eli Bendersky2d062de2011-02-07 04:19:57 +000026The chosen strategy will depend on how large the project's existing
Brett Cannon8045d972011-02-03 22:01:54 +000027codebase is and how much divergence you want from your Python 2 codebase from
28your Python 3 one (e.g., starting a new version with Python 3).
29
30If your project is brand-new or does not have a large codebase, then you may
31want to consider writing/porting :ref:`all of your code for Python 3
32and use 3to2 <use_3to2>` to port your code for Python 2.
33
Brett Cannon6277fa42011-02-18 01:34:28 +000034If you would prefer to maintain a codebase which is semantically **and**
35syntactically compatible with Python 2 & 3 simultaneously, you can write
36:ref:`use_same_source`. While this tends to lead to somewhat non-idiomatic
37code, it does mean you keep a rapid development process for you, the developer.
Brett Cannon8045d972011-02-03 22:01:54 +000038
Brett Cannon6277fa42011-02-18 01:34:28 +000039Finally, you do have the option of :ref:`using 2to3 <use_2to3>` to translate
40Python 2 code into Python 3 code (with some manual help). This can take the
41form of branching your code and using 2to3 to start a Python 3 branch. You can
42also have users perform the translation as installation time automatically so
43that you only have to maintain a Python 2 codebase.
Brett Cannon8045d972011-02-03 22:01:54 +000044
Brett Cannon6277fa42011-02-18 01:34:28 +000045Regardless of which approach you choose, porting is not as hard or
Brett Cannon8045d972011-02-03 22:01:54 +000046time-consuming as you might initially think. You can also tackle the problem
47piece-meal as a good portion of porting is simply updating your code to follow
48current best practices in a Python 2/3 compatible way.
49
50
51Universal Bits of Advice
52------------------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +000053
Brett Cannon8045d972011-02-03 22:01:54 +000054Regardless of what strategy you pick, there are a few things you should
55consider.
56
57One is make sure you have a robust test suite. You need to make sure everything
58continues to work, just like when you support a new minor version of Python.
59This means making sure your test suite is thorough and is ported properly
60between Python 2 & 3. You will also most likely want to use something like tox_
61to automate testing between both a Python 2 and Python 3 VM.
62
63Two, once your project has Python 3 support, make sure to add the proper
64classifier on the Cheeseshop_ (PyPI_). To have your project listed as Python 3
65compatible it must have the
66`Python 3 classifier <http://pypi.python.org/pypi?:action=browse&c=533>`_
67(from
68http://techspot.zzzeek.org/2011/01/24/zzzeek-s-guide-to-python-3-porting/)::
69
Brett Cannon4b0c24a2011-02-03 22:14:58 +000070 setup(
71 name='Your Library',
72 version='1.0',
73 classifiers=[
74 # make sure to use :: Python *and* :: Python :: 3 so
75 # that pypi can list the package on the python 3 page
76 'Programming Language :: Python',
77 'Programming Language :: Python :: 3'
78 ],
79 packages=['yourlibrary'],
80 # make sure to add custom_fixers to the MANIFEST.in
81 include_package_data=True,
82 # ...
83 )
Brett Cannon8045d972011-02-03 22:01:54 +000084
85
86Doing so will cause your project to show up in the
87`Python 3 packages list
88<http://pypi.python.org/pypi?:action=browse&c=533&show=all>`_. You will know
89you set the classifier properly as visiting your project page on the Cheeseshop
90will show a Python 3 logo in the upper-left corner of the page.
91
92Three, the six_ project provides a library which helps iron out differences
93between Python 2 & 3. If you find there is a sticky point that is a continual
94point of contention in your translation or maintenance of code, consider using
95a source-compatible solution relying on six. If you have to create your own
96Python 2/3 compatible solution, you can use ``sys.version_info[0] >= 3`` as a
97guard.
98
99Four, read all the approaches. Just because some bit of advice applies to one
100approach more than another doesn't mean that some advice doesn't apply to other
101strategies.
102
Eli Bendersky2d062de2011-02-07 04:19:57 +0000103Five, drop support for older Python versions if possible. `Python 2.5`_
104introduced a lot of useful syntax and libraries which have become idiomatic
105in Python 3. `Python 2.6`_ introduced future statements which makes
106compatibility much easier if you are going from Python 2 to 3.
Brett Cannon8045d972011-02-03 22:01:54 +0000107`Python 2.7`_ continues the trend in the stdlib. So choose the newest version
Eli Bendersky2d062de2011-02-07 04:19:57 +0000108of Python which you believe can be your minimum support version
Brett Cannon8045d972011-02-03 22:01:54 +0000109and work from there.
110
111
112.. _tox: http://codespeak.net/tox/
113.. _Cheeseshop:
114.. _PyPI: http://pypi.python.org/
115.. _six: http://packages.python.org/six
116.. _Python 2.7: http://www.python.org/2.7.x
117.. _Python 2.6: http://www.python.org/2.6.x
118.. _Python 2.5: http://www.python.org/2.5.x
119.. _Python 2.4: http://www.python.org/2.4.x
Brett Cannonce71ab22011-02-05 22:05:05 +0000120.. _Python 2.3: http://www.python.org/2.3.x
121.. _Python 2.2: http://www.python.org/2.2.x
Brett Cannon8045d972011-02-03 22:01:54 +0000122
123
124.. _use_3to2:
125
126Python 3 and 3to2
127=================
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000128
Brett Cannon8045d972011-02-03 22:01:54 +0000129If you are starting a new project or your codebase is small enough, you may
130want to consider writing your code for Python 3 and backporting to Python 2
131using 3to2_. Thanks to Python 3 being more strict about things than Python 2
132(e.g., bytes vs. strings), the source translation can be easier and more
133straightforward than from Python 2 to 3. Plus it gives you more direct
134experience developing in Python 3 which, since it is the future of Python, is a
135good thing long-term.
136
137A drawback of this approach is that 3to2 is a third-party project. This means
138that the Python core developers (and thus this guide) can make no promises
139about how well 3to2 works at any time. There is nothing to suggest, though,
140that 3to2 is not a high-quality project.
141
142
143.. _3to2: https://bitbucket.org/amentajo/lib3to2/overview
144
145
146.. _use_2to3:
147
148Python 2 and 2to3
149=================
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000150
Eli Bendersky7ac34192011-02-07 04:44:19 +0000151Included with Python since 2.6, the 2to3_ tool (and :mod:`lib2to3` module)
152helps with porting Python 2 to Python 3 by performing various source
153translations. This is a perfect solution for projects which wish to branch
154their Python 3 code from their Python 2 codebase and maintain them as
155independent codebases. You can even begin preparing to use this approach
156today by writing future-compatible Python code which works cleanly in
157Python 2 in conjunction with 2to3; all steps outlined below will work
158with Python 2 code up to the point when the actual use of 2to3 occurs.
Brett Cannon8045d972011-02-03 22:01:54 +0000159
160Use of 2to3 as an on-demand translation step at install time is also possible,
161preventing the need to maintain a separate Python 3 codebase, but this approach
162does come with some drawbacks. While users will only have to pay the
163translation cost once at installation, you as a developer will need to pay the
164cost regularly during development. If your codebase is sufficiently large
165enough then the translation step ends up acting like a compilation step,
166robbing you of the rapid development process you are used to with Python.
167Obviously the time required to translate a project will vary, so do an
168experimental translation just to see how long it takes to evaluate whether you
169prefer this approach compared to using :ref:`use_same_source` or simply keeping
170a separate Python 3 codebase.
171
172Below are the typical steps taken by a project which uses a 2to3-based approach
173to supporting Python 2 & 3.
174
175
176Support Python 2.7
177------------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000178
Brett Cannon8045d972011-02-03 22:01:54 +0000179As a first step, make sure that your project is compatible with `Python 2.7`_.
180This is just good to do as Python 2.7 is the last release of Python 2 and thus
181will be used for a rather long time. It also allows for use of the ``-3`` flag
182to Python to help discover places in your code which 2to3 cannot handle but are
183known to cause issues.
184
Brett Cannonce71ab22011-02-05 22:05:05 +0000185Try to Support `Python 2.6`_ and Newer Only
186-------------------------------------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000187
Brett Cannon8045d972011-02-03 22:01:54 +0000188While not possible for all projects, if you can support `Python 2.6`_ and newer
189**only**, your life will be much easier. Various future statements, stdlib
190additions, etc. exist only in Python 2.6 and later which greatly assist in
191porting to Python 3. But if you project must keep support for `Python 2.5`_ (or
192even `Python 2.4`_) then it is still possible to port to Python 3.
193
194Below are the benefits you gain if you only have to support Python 2.6 and
195newer. Some of these options are personal choice while others are
196**strongly** recommended (the ones that are more for personal choice are
197labeled as such). If you continue to support older versions of Python then you
198at least need to watch out for situations that these solutions fix.
199
200
Brett Cannon8045d972011-02-03 22:01:54 +0000201``from __future__ import print_function``
202'''''''''''''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000203
Brett Cannon8045d972011-02-03 22:01:54 +0000204This is a personal choice. 2to3 handles the translation from the print
205statement to the print function rather well so this is an optional step. This
206future statement does help, though, with getting used to typing
207``print('Hello, World')`` instead of ``print 'Hello, World'``.
208
209
210``from __future__ import unicode_literals``
211'''''''''''''''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000212
Brett Cannon8045d972011-02-03 22:01:54 +0000213Another personal choice. You can always mark what you want to be a (unicode)
214string with a ``u`` prefix to get the same effect. But regardless of whether
215you use this future statement or not, you **must** make sure you know exactly
216which Python 2 strings you want to be bytes, and which are to be strings. This
217means you should, **at minimum** mark all strings that are meant to be text
218strings with a ``u`` prefix if you do not use this future statement.
219
220
221Bytes literals
222''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000223
Brett Cannon8045d972011-02-03 22:01:54 +0000224This is a **very** important one. The ability to prefix Python 2 strings that
225are meant to contain bytes with a ``b`` prefix help to very clearly delineate
226what is and is not a Python 3 string. When you run 2to3 on code, all Python 2
227strings become Python 3 strings **unless** they are prefixed with ``b``.
228
229There are some differences between byte literals in Python 2 and those in
230Python 3 thanks to the bytes type just being an alias to ``str`` in Python 2.
231Probably the biggest "gotcha" is that indexing results in different values. In
232Python 2, the value of ``b'py'[1]`` is ``'y'``, while in Python 3 it's ``121``.
233You can avoid this disparity by always slicing at the size of a single element:
234``b'py'[1:2]`` is ``'y'`` in Python 2 and ``b'y'`` in Python 3 (i.e., close
235enough).
236
237You cannot concatenate bytes and strings in Python 3. But since in Python
2382 has bytes aliased to ``str``, it will succeed: ``b'a' + u'b'`` works in
239Python 2, but ``b'a' + 'b'`` in Python 3 is a :exc:`TypeError`. A similar issue
240also comes about when doing comparisons between bytes and strings.
241
242
Brett Cannonce71ab22011-02-05 22:05:05 +0000243Supporting `Python 2.5`_ and Newer Only
244---------------------------------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000245
Brett Cannonce71ab22011-02-05 22:05:05 +0000246If you are supporting `Python 2.5`_ and newer there are still some features of
247Python that you can utilize.
248
249
Ezio Melottic17c1f62011-04-21 14:49:03 +0300250``from __future__ import absolute_import``
251''''''''''''''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000252
Brett Cannonce71ab22011-02-05 22:05:05 +0000253Implicit relative imports (e.g., importing ``spam.bacon`` from within
254``spam.eggs`` with the statement ``import bacon``) does not work in Python 3.
255This future statement moves away from that and allows the use of explicit
256relative imports (e.g., ``from . import bacon``).
257
258In `Python 2.5`_ you must use
259the __future__ statement to get to use explicit relative imports and prevent
260implicit ones. In `Python 2.6`_ explicit relative imports are available without
261the statement, but you still want the __future__ statement to prevent implicit
262relative imports. In `Python 2.7`_ the __future__ statement is not needed. In
263other words, unless you are only supporting Python 2.7 or a version earlier
264than Python 2.5, use the __future__ statement.
265
266
267
Brett Cannon8045d972011-02-03 22:01:54 +0000268Handle Common "Gotchas"
269-----------------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000270
Brett Cannon8045d972011-02-03 22:01:54 +0000271There are a few things that just consistently come up as sticking points for
272people which 2to3 cannot handle automatically or can easily be done in Python 2
273to help modernize your code.
274
275
Brett Cannonce71ab22011-02-05 22:05:05 +0000276``from __future__ import division``
277'''''''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000278
Brett Cannonce71ab22011-02-05 22:05:05 +0000279While the exact same outcome can be had by using the ``-Qnew`` argument to
280Python, using this future statement lifts the requirement that your users use
281the flag to get the expected behavior of division in Python 3
282(e.g., ``1/2 == 0.5; 1//2 == 0``).
283
284
285
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000286Specify when opening a file as binary
287'''''''''''''''''''''''''''''''''''''
288
289Unless you have been working on Windows, there is a chance you have not always
290bothered to add the ``b`` mode when opening a binary file (e.g., ``rb`` for
291binary reading). Under Python 3, binary files and text files are clearly
292distinct and mutually incompatible; see the :mod:`io` module for details.
293Therefore, you **must** make a decision of whether a file will be used for
294binary access (allowing to read and/or write bytes data) or text access
295(allowing to read and/or write unicode data).
296
297Text files
298''''''''''
299
300Text files created using ``open()`` under Python 2 return byte strings,
301while under Python 3 they return unicode strings. Depending on your porting
302strategy, this can be an issue.
303
304If you want text files to return unicode strings in Python 2, you have two
305possibilities:
306
307* Under Python 2.6 and higher, use :func:`io.open`. Since :func:`io.open`
308 is essentially the same function in both Python 2 and Python 3, it will
309 help iron out any issues that might arise.
310
311* If pre-2.6 compatibility is needed, then you should use :func:`codecs.open`
312 instead. This will make sure that you get back unicode strings in Python 2.
313
Brett Cannon8045d972011-02-03 22:01:54 +0000314Subclass ``object``
315'''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000316
Brett Cannonce71ab22011-02-05 22:05:05 +0000317New-style classes have been around since `Python 2.2`_. You need to make sure
318you are subclassing from ``object`` to avoid odd edge cases involving method
Brett Cannon8045d972011-02-03 22:01:54 +0000319resolution order, etc. This continues to be totally valid in Python 3 (although
320unneeded as all classes implicitly inherit from ``object``).
321
322
323Deal With the Bytes/String Dichotomy
324''''''''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000325
Brett Cannon8045d972011-02-03 22:01:54 +0000326One of the biggest issues people have when porting code to Python 3 is handling
327the bytes/string dichotomy. Because Python 2 allowed the ``str`` type to hold
328textual data, people have over the years been rather loose in their delineation
329of what ``str`` instances held text compared to bytes. In Python 3 you cannot
330be so care-free anymore and need to properly handle the difference. The key
331handling this issue to to make sure that **every** string literal in your
332Python 2 code is either syntactically of functionally marked as either bytes or
333text data. After this is done you then need to make sure your APIs are designed
334to either handle a specific type or made to be properly polymorphic.
335
336
337Mark Up Python 2 String Literals
338********************************
339
340First thing you must do is designate every single string literal in Python 2
341as either textual or bytes data. If you are only supporting Python 2.6 or
342newer, this can be accomplished by marking bytes literals with a ``b`` prefix
343and then designating textual data with a ``u`` prefix or using the
344``unicode_literals`` future statement.
345
346If your project supports versions of Python pre-dating 2.6, then you should use
347the six_ project and its ``b()`` function to denote bytes literals. For text
348literals you can either use six's ``u()`` function or use a ``u`` prefix.
349
350
351Decide what APIs Will Accept
352****************************
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000353
Brett Cannon8045d972011-02-03 22:01:54 +0000354In Python 2 it was very easy to accidentally create an API that accepted both
355bytes and textual data. But in Python 3, thanks to the more strict handling of
356disparate types, this loose usage of bytes and text together tends to fail.
357
358Take the dict ``{b'a': 'bytes', u'a': 'text'}`` in Python 2.6. It creates the
359dict ``{u'a': 'text'}`` since ``b'a' == u'a'``. But in Python 3 the equivalent
360dict creates ``{b'a': 'bytes', 'a': 'text'}``, i.e., no lost data. Similar
361issues can crop up when transitioning Python 2 code to Python 3.
362
363This means you need to choose what an API is going to accept and create and
364consistently stick to that API in both Python 2 and 3.
365
366
Brett Cannonce71ab22011-02-05 22:05:05 +0000367Bytes / Unicode Comparison
Antoine Pitrou8d8f7c52011-02-05 11:40:05 +0000368**************************
369
370In Python 3, mixing bytes and unicode is forbidden in most situations; it
371will raise a :class:`TypeError` where Python 2 would have attempted an implicit
372coercion between types. However, there is one case where it doesn't and
373it can be very misleading::
374
375 >>> b"" == ""
376 False
377
Brett Cannona2f15442011-02-09 22:55:13 +0000378This is because an equality comparison is required by the language to always
Antoine Pitrou8d8f7c52011-02-05 11:40:05 +0000379succeed (and return ``False`` for incompatible types). However, this also
380means that code incorrectly ported to Python 3 can display buggy behaviour
381if such comparisons are silently executed. To detect such situations,
382Python 3 has a ``-b`` flag that will display a warning::
383
384 $ python3 -b
385 >>> b"" == ""
386 __main__:1: BytesWarning: Comparison between bytes and string
387 False
388
389To turn the warning into an exception, use the ``-bb`` flag instead::
390
391 $ python3 -bb
392 >>> b"" == ""
393 Traceback (most recent call last):
394 File "<stdin>", line 1, in <module>
395 BytesWarning: Comparison between bytes and string
396
397
Antoine Pitroubd866e92011-02-05 12:13:38 +0000398Indexing bytes objects
399''''''''''''''''''''''
400
401Another potentially surprising change is the indexing behaviour of bytes
402objects in Python 3::
403
404 >>> b"xyz"[0]
405 120
406
407Indeed, Python 3 bytes objects (as well as :class:`bytearray` objects)
408are sequences of integers. But code converted from Python 2 will often
409assume that indexing a bytestring produces another bytestring, not an
410integer. To reconcile both behaviours, use slicing::
411
412 >>> b"xyz"[0:1]
413 b'x'
414 >>> n = 1
415 >>> b"xyz"[n:n+1]
416 b'y'
417
418The only remaining gotcha is that an out-of-bounds slice returns an empty
419bytes object instead of raising ``IndexError``:
420
421 >>> b"xyz"[3]
422 Traceback (most recent call last):
423 File "<stdin>", line 1, in <module>
424 IndexError: index out of range
425 >>> b"xyz"[3:4]
426 b''
427
428
Brett Cannon8045d972011-02-03 22:01:54 +0000429``__str__()``/``__unicode__()``
430'''''''''''''''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000431
Brett Cannon8045d972011-02-03 22:01:54 +0000432In Python 2, objects can specify both a string and unicode representation of
433themselves. In Python 3, though, there is only a string representation. This
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000434becomes an issue as people can inadvertently do things in their ``__str__()``
Brett Cannon8045d972011-02-03 22:01:54 +0000435methods which have unpredictable results (e.g., infinite recursion if you
436happen to use the ``unicode(self).encode('utf8')`` idiom as the body of your
437``__str__()`` method).
438
439There are two ways to solve this issue. One is to use a custom 2to3 fixer. The
440blog post at http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/
441specifies how to do this. That will allow 2to3 to change all instances of ``def
442__unicode(self): ...`` to ``def __str__(self): ...``. This does require you
443define your ``__str__()`` method in Python 2 before your ``__unicode__()``
444method.
445
446The other option is to use a mixin class. This allows you to only define a
447``__unicode__()`` method for your class and let the mixin derive
448``__str__()`` for you (code from
449http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/)::
450
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000451 import sys
Brett Cannon8045d972011-02-03 22:01:54 +0000452
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000453 class UnicodeMixin(object):
Brett Cannon8045d972011-02-03 22:01:54 +0000454
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000455 """Mixin class to handle defining the proper __str__/__unicode__
456 methods in Python 2 or 3."""
Brett Cannon8045d972011-02-03 22:01:54 +0000457
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000458 if sys.version_info[0] >= 3: # Python 3
459 def __str__(self):
460 return self.__unicode__()
461 else: # Python 2
462 def __str__(self):
463 return self.__unicode__().encode('utf8')
Brett Cannon8045d972011-02-03 22:01:54 +0000464
465
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000466 class Spam(UnicodeMixin):
Brett Cannon8045d972011-02-03 22:01:54 +0000467
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000468 def __unicode__(self):
469 return u'spam-spam-bacon-spam' # 2to3 will remove the 'u' prefix
Brett Cannon8045d972011-02-03 22:01:54 +0000470
471
Brett Cannon8045d972011-02-03 22:01:54 +0000472Don't Index on Exceptions
473'''''''''''''''''''''''''
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000474
Brett Cannon8045d972011-02-03 22:01:54 +0000475In Python 2, the following worked::
476
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000477 >>> exc = Exception(1, 2, 3)
478 >>> exc.args[1]
479 2
480 >>> exc[1] # Python 2 only!
481 2
Brett Cannon8045d972011-02-03 22:01:54 +0000482
Eli Bendersky7ac34192011-02-07 04:44:19 +0000483But in Python 3, indexing directly on an exception is an error. You need to
484make sure to only index on the :attr:`BaseException.args` attribute which is a
Brett Cannon8045d972011-02-03 22:01:54 +0000485sequence containing all arguments passed to the :meth:`__init__` method.
486
Eli Bendersky7ac34192011-02-07 04:44:19 +0000487Even better is to use the documented attributes the exception provides.
Brett Cannon8045d972011-02-03 22:01:54 +0000488
Brett Cannon8045d972011-02-03 22:01:54 +0000489Don't use ``__getslice__`` & Friends
490''''''''''''''''''''''''''''''''''''
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000491
Brett Cannon8045d972011-02-03 22:01:54 +0000492Been deprecated for a while, but Python 3 finally drops support for
493``__getslice__()``, etc. Move completely over to :meth:`__getitem__` and
494friends.
495
496
Brett Cannon45aa7cc2011-02-05 22:16:40 +0000497Updating doctests
498'''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000499
Brett Cannon45aa7cc2011-02-05 22:16:40 +00005002to3_ will attempt to generate fixes for doctests that it comes across. It's
501not perfect, though. If you wrote a monolithic set of doctests (e.g., a single
502docstring containing all of your doctests), you should at least consider
503breaking the doctests up into smaller pieces to make it more manageable to fix.
504Otherwise it might very well be worth your time and effort to port your tests
505to :mod:`unittest`.
Brett Cannon8045d972011-02-03 22:01:54 +0000506
507
508Eliminate ``-3`` Warnings
509-------------------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000510
Brett Cannon8045d972011-02-03 22:01:54 +0000511When you run your application's test suite, run it using the ``-3`` flag passed
512to Python. This will cause various warnings to be raised during execution about
513things that 2to3 cannot handle automatically (e.g., modules that have been
514removed). Try to eliminate those warnings to make your code even more portable
515to Python 3.
516
517
518Run 2to3
519--------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000520
Brett Cannon8045d972011-02-03 22:01:54 +0000521Once you have made your Python 2 code future-compatible with Python 3, it's
522time to use 2to3_ to actually port your code.
523
524
525Manually
526''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000527
Brett Cannon8045d972011-02-03 22:01:54 +0000528To manually convert source code using 2to3_, you use the ``2to3`` script that
529is installed with Python 2.6 and later.::
530
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000531 2to3 <directory or file to convert>
Brett Cannon8045d972011-02-03 22:01:54 +0000532
533This will cause 2to3 to write out a diff with all of the fixers applied for the
534converted source code. If you would like 2to3 to go ahead and apply the changes
535you can pass it the ``-w`` flag::
536
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000537 2to3 -w <stuff to convert>
Brett Cannon8045d972011-02-03 22:01:54 +0000538
539There are other flags available to control exactly which fixers are applied,
540etc.
541
542
543During Installation
544'''''''''''''''''''
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000545
Brett Cannon8045d972011-02-03 22:01:54 +0000546When a user installs your project for Python 3, you can have either
547:mod:`distutils` or Distribute_ run 2to3_ on your behalf.
548For distutils, use the following idiom::
549
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000550 try: # Python 3
551 from distutils.command.build_py import build_py_2to3 as build_py
552 except ImportError: # Python 2
553 from distutils.command.build_py import build_py
Brett Cannon8045d972011-02-03 22:01:54 +0000554
Georg Brandl829befb2011-02-13 09:59:39 +0000555 setup(cmdclass = {'build_py': build_py},
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000556 # ...
557 )
Brett Cannon8045d972011-02-03 22:01:54 +0000558
Georg Brandl829befb2011-02-13 09:59:39 +0000559For Distribute::
Brett Cannon8045d972011-02-03 22:01:54 +0000560
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000561 setup(use_2to3=True,
562 # ...
563 )
Brett Cannon8045d972011-02-03 22:01:54 +0000564
565This will allow you to not have to distribute a separate Python 3 version of
566your project. It does require, though, that when you perform development that
567you at least build your project and use the built Python 3 source for testing.
568
569
570Verify & Test
571-------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000572
Brett Cannon8045d972011-02-03 22:01:54 +0000573At this point you should (hopefully) have your project converted in such a way
574that it works in Python 3. Verify it by running your unit tests and making sure
575nothing has gone awry. If you miss something then figure out how to fix it in
576Python 3, backport to your Python 2 code, and run your code through 2to3 again
577to verify the fix transforms properly.
578
579
580.. _2to3: http://docs.python.org/py3k/library/2to3.html
581.. _Distribute: http://packages.python.org/distribute/
582
583
584.. _use_same_source:
585
586Python 2/3 Compatible Source
587============================
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000588
Brett Cannon8045d972011-02-03 22:01:54 +0000589While it may seem counter-intuitive, you can write Python code which is
590source-compatible between Python 2 & 3. It does lead to code that is not
591entirely idiomatic Python (e.g., having to extract the currently raised
592exception from ``sys.exc_info()[1]``), but it can be run under Python 2
Brett Cannon6277fa42011-02-18 01:34:28 +0000593**and** Python 3 without using 2to3_ as a translation step (although the tool
594should be used to help find potential portability problems). This allows you to
Brett Cannon8045d972011-02-03 22:01:54 +0000595continue to have a rapid development process regardless of whether you are
596developing under Python 2 or Python 3. Whether this approach or using
597:ref:`use_2to3` works best for you will be a per-project decision.
598
599To get a complete idea of what issues you will need to deal with, see the
600`What's New in Python 3.0`_. Others have reorganized the data in other formats
601such as http://docs.pythonsprints.com/python3_porting/py-porting.html .
602
603The following are some steps to take to try to support both Python 2 & 3 from
604the same source code.
605
606
607.. _What's New in Python 3.0: http://docs.python.org/release/3.0/whatsnew/3.0.html
608
609
Brett Cannon6277fa42011-02-18 01:34:28 +0000610Follow The Steps for Using 2to3_
611--------------------------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000612
Brett Cannon8045d972011-02-03 22:01:54 +0000613All of the steps outlined in how to
614:ref:`port Python 2 code with 2to3 <use_2to3>` apply
615to creating a Python 2/3 codebase. This includes trying only support Python 2.6
616or newer (the :mod:`__future__` statements work in Python 3 without issue),
617eliminating warnings that are triggered by ``-3``, etc.
618
Brett Cannon98135d02011-02-05 22:22:47 +0000619You should even consider running 2to3_ over your code (without committing the
620changes). This will let you know where potential pain points are within your
621code so that you can fix them properly before they become an issue.
Brett Cannon8045d972011-02-03 22:01:54 +0000622
623
624Use six_
625--------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000626
Brett Cannon8045d972011-02-03 22:01:54 +0000627The six_ project contains many things to help you write portable Python code.
628You should make sure to read its documentation from beginning to end and use
629any and all features it provides. That way you will minimize any mistakes you
630might make in writing cross-version code.
631
632
633Capturing the Currently Raised Exception
634----------------------------------------
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000635
Brett Cannonce71ab22011-02-05 22:05:05 +0000636One change between Python 2 and 3 that will require changing how you code (if
637you support `Python 2.5`_ and earlier) is
638accessing the currently raised exception. In Python 2.5 and earlier the syntax
639to access the current exception is::
Brett Cannon8045d972011-02-03 22:01:54 +0000640
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000641 try:
642 raise Exception()
643 except Exception, exc:
644 # Current exception is 'exc'
645 pass
Brett Cannon8045d972011-02-03 22:01:54 +0000646
Brett Cannonce71ab22011-02-05 22:05:05 +0000647This syntax changed in Python 3 (and backported to `Python 2.6`_ and later)
648to::
Brett Cannon8045d972011-02-03 22:01:54 +0000649
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000650 try:
651 raise Exception()
652 except Exception as exc:
653 # Current exception is 'exc'
Brett Cannonce71ab22011-02-05 22:05:05 +0000654 # In Python 3, 'exc' is restricted to the block; Python 2.6 will "leak"
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000655 pass
Brett Cannon8045d972011-02-03 22:01:54 +0000656
657Because of this syntax change you must change to capturing the current
658exception to::
659
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000660 try:
661 raise Exception()
662 except Exception:
663 import sys
664 exc = sys.exc_info()[1]
665 # Current exception is 'exc'
666 pass
Brett Cannon8045d972011-02-03 22:01:54 +0000667
668You can get more information about the raised exception from
669:func:`sys.exc_info` than simply the current exception instance, but you most
Antoine Pitroue6a14642011-02-05 12:01:07 +0000670likely don't need it.
Brett Cannon8045d972011-02-03 22:01:54 +0000671
Antoine Pitroue6a14642011-02-05 12:01:07 +0000672.. note::
673 In Python 3, the traceback is attached to the exception instance
Brett Cannonce71ab22011-02-05 22:05:05 +0000674 through the ``__traceback__`` attribute. If the instance is saved in
Antoine Pitroue6a14642011-02-05 12:01:07 +0000675 a local variable that persists outside of the ``except`` block, the
676 traceback will create a reference cycle with the current frame and its
677 dictionary of local variables. This will delay reclaiming dead
678 resources until the next cyclic :term:`garbage collection` pass.
679
680 In Python 2, this problem only occurs if you save the traceback itself
681 (e.g. the third element of the tuple returned by :func:`sys.exc_info`)
682 in a variable.
Brett Cannon8045d972011-02-03 22:01:54 +0000683
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000684
Brett Cannon8045d972011-02-03 22:01:54 +0000685Other Resources
686===============
Georg Brandl2cb2fa92011-02-07 15:30:45 +0000687
Brett Cannon6277fa42011-02-18 01:34:28 +0000688The authors of the following blog posts, wiki pages, and books deserve special
689thanks for making public their tips for porting Python 2 code to Python 3 (and
690thus helping provide information for this document):
Brett Cannon8045d972011-02-03 22:01:54 +0000691
Brett Cannon6277fa42011-02-18 01:34:28 +0000692* http://python3porting.com/
Brett Cannon8045d972011-02-03 22:01:54 +0000693* http://docs.pythonsprints.com/python3_porting/py-porting.html
694* http://techspot.zzzeek.org/2011/01/24/zzzeek-s-guide-to-python-3-porting/
695* http://dabeaz.blogspot.com/2011/01/porting-py65-and-my-superboard-to.html
696* http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/
697* http://lucumr.pocoo.org/2010/2/11/porting-to-python-3-a-guide/
698* http://wiki.python.org/moin/PortingPythonToPy3k
699
700If you feel there is something missing from this document that should be added,
701please email the python-porting_ mailing list.
702
703.. _python-porting: http://mail.python.org/mailman/listinfo/python-porting