blob: 33ea92f1e4d700f52e8deae7b3469804374b8985 [file] [log] [blame]
Brett Cannon8045d972011-02-03 22:01:54 +00001.. _pyporting-howto:
2
3*********************************
4Porting Python 2 Code to Python 3
5*********************************
6
7:author: Brett Cannon
8
9.. topic:: Abstract
10
Brett Cannon4b0c24a2011-02-03 22:14:58 +000011 With Python 3 being the future of Python while Python 2 is still in active
12 use, it is good to have your project available for both major releases of
13 Python. This guide is meant to help you choose which strategy works best
14 for your project to support both Python 2 & 3 along with how to execute
15 that strategy.
Brett Cannon8045d972011-02-03 22:01:54 +000016
Brett Cannon4b0c24a2011-02-03 22:14:58 +000017 If you are looking to port an extension module instead of pure Python code,
Éric Araujo5405a0b2011-02-05 16:03:12 +000018 please see :ref:`cporting-howto`.
Brett Cannon8045d972011-02-03 22:01:54 +000019
20
21Choosing a Strategy
22===================
23When a project makes the decision that it's time to support both Python 2 & 3,
24a decision needs to be made as to how to go about accomplishing that goal.
Eli Bendersky2d062de2011-02-07 04:19:57 +000025The chosen strategy will depend on how large the project's existing
Brett Cannon8045d972011-02-03 22:01:54 +000026codebase is and how much divergence you want from your Python 2 codebase from
27your Python 3 one (e.g., starting a new version with Python 3).
28
29If your project is brand-new or does not have a large codebase, then you may
30want to consider writing/porting :ref:`all of your code for Python 3
31and use 3to2 <use_3to2>` to port your code for Python 2.
32
33If your project has a pre-existing Python 2 codebase and you would like Python
343 support to start off a new branch or version of your project, then you will
Eli Bendersky2d062de2011-02-07 04:19:57 +000035most likely want to :ref:`port using 2to3 <use_2to3>`. This will allow you to
36port your Python 2 code to Python 3 in a semi-automated fashion and begin to
Brett Cannon8045d972011-02-03 22:01:54 +000037maintain it separately from your Python 2 code. This approach can also work if
38your codebase is small and/or simple enough for the translation to occur
39quickly.
40
41Finally, if you want to maintain Python 2 and Python 3 versions of your project
42simultaneously and with no differences, then you can write :ref:`Python 2/3
43source-compatible code <use_same_source>`. While the code is not quite as
44idiomatic as it would be written just for Python 3 or automating the port from
45Python 2, it does makes it easier to continue to do rapid development
46regardless of what major version of Python you are developing against at the
47time.
48
49Regardless of which approach you choose, porting is probably not as hard or
50time-consuming as you might initially think. You can also tackle the problem
51piece-meal as a good portion of porting is simply updating your code to follow
52current best practices in a Python 2/3 compatible way.
53
54
55Universal Bits of Advice
56------------------------
57Regardless of what strategy you pick, there are a few things you should
58consider.
59
60One is make sure you have a robust test suite. You need to make sure everything
61continues to work, just like when you support a new minor version of Python.
62This means making sure your test suite is thorough and is ported properly
63between Python 2 & 3. You will also most likely want to use something like tox_
64to automate testing between both a Python 2 and Python 3 VM.
65
66Two, once your project has Python 3 support, make sure to add the proper
67classifier on the Cheeseshop_ (PyPI_). To have your project listed as Python 3
68compatible it must have the
69`Python 3 classifier <http://pypi.python.org/pypi?:action=browse&c=533>`_
70(from
71http://techspot.zzzeek.org/2011/01/24/zzzeek-s-guide-to-python-3-porting/)::
72
Brett Cannon4b0c24a2011-02-03 22:14:58 +000073 setup(
74 name='Your Library',
75 version='1.0',
76 classifiers=[
77 # make sure to use :: Python *and* :: Python :: 3 so
78 # that pypi can list the package on the python 3 page
79 'Programming Language :: Python',
80 'Programming Language :: Python :: 3'
81 ],
82 packages=['yourlibrary'],
83 # make sure to add custom_fixers to the MANIFEST.in
84 include_package_data=True,
85 # ...
86 )
Brett Cannon8045d972011-02-03 22:01:54 +000087
88
89Doing so will cause your project to show up in the
90`Python 3 packages list
91<http://pypi.python.org/pypi?:action=browse&c=533&show=all>`_. You will know
92you set the classifier properly as visiting your project page on the Cheeseshop
93will show a Python 3 logo in the upper-left corner of the page.
94
95Three, the six_ project provides a library which helps iron out differences
96between Python 2 & 3. If you find there is a sticky point that is a continual
97point of contention in your translation or maintenance of code, consider using
98a source-compatible solution relying on six. If you have to create your own
99Python 2/3 compatible solution, you can use ``sys.version_info[0] >= 3`` as a
100guard.
101
102Four, read all the approaches. Just because some bit of advice applies to one
103approach more than another doesn't mean that some advice doesn't apply to other
104strategies.
105
Eli Bendersky2d062de2011-02-07 04:19:57 +0000106Five, drop support for older Python versions if possible. `Python 2.5`_
107introduced a lot of useful syntax and libraries which have become idiomatic
108in Python 3. `Python 2.6`_ introduced future statements which makes
109compatibility much easier if you are going from Python 2 to 3.
Brett Cannon8045d972011-02-03 22:01:54 +0000110`Python 2.7`_ continues the trend in the stdlib. So choose the newest version
Eli Bendersky2d062de2011-02-07 04:19:57 +0000111of Python which you believe can be your minimum support version
Brett Cannon8045d972011-02-03 22:01:54 +0000112and work from there.
113
114
115.. _tox: http://codespeak.net/tox/
116.. _Cheeseshop:
117.. _PyPI: http://pypi.python.org/
118.. _six: http://packages.python.org/six
119.. _Python 2.7: http://www.python.org/2.7.x
120.. _Python 2.6: http://www.python.org/2.6.x
121.. _Python 2.5: http://www.python.org/2.5.x
122.. _Python 2.4: http://www.python.org/2.4.x
Brett Cannonce71ab22011-02-05 22:05:05 +0000123.. _Python 2.3: http://www.python.org/2.3.x
124.. _Python 2.2: http://www.python.org/2.2.x
Brett Cannon8045d972011-02-03 22:01:54 +0000125
126
127.. _use_3to2:
128
129Python 3 and 3to2
130=================
131If you are starting a new project or your codebase is small enough, you may
132want to consider writing your code for Python 3 and backporting to Python 2
133using 3to2_. Thanks to Python 3 being more strict about things than Python 2
134(e.g., bytes vs. strings), the source translation can be easier and more
135straightforward than from Python 2 to 3. Plus it gives you more direct
136experience developing in Python 3 which, since it is the future of Python, is a
137good thing long-term.
138
139A drawback of this approach is that 3to2 is a third-party project. This means
140that the Python core developers (and thus this guide) can make no promises
141about how well 3to2 works at any time. There is nothing to suggest, though,
142that 3to2 is not a high-quality project.
143
144
145.. _3to2: https://bitbucket.org/amentajo/lib3to2/overview
146
147
148.. _use_2to3:
149
150Python 2 and 2to3
151=================
Eli Bendersky7ac34192011-02-07 04:44:19 +0000152Included with Python since 2.6, the 2to3_ tool (and :mod:`lib2to3` module)
153helps with porting Python 2 to Python 3 by performing various source
154translations. This is a perfect solution for projects which wish to branch
155their Python 3 code from their Python 2 codebase and maintain them as
156independent codebases. You can even begin preparing to use this approach
157today by writing future-compatible Python code which works cleanly in
158Python 2 in conjunction with 2to3; all steps outlined below will work
159with Python 2 code up to the point when the actual use of 2to3 occurs.
Brett Cannon8045d972011-02-03 22:01:54 +0000160
161Use of 2to3 as an on-demand translation step at install time is also possible,
162preventing the need to maintain a separate Python 3 codebase, but this approach
163does come with some drawbacks. While users will only have to pay the
164translation cost once at installation, you as a developer will need to pay the
165cost regularly during development. If your codebase is sufficiently large
166enough then the translation step ends up acting like a compilation step,
167robbing you of the rapid development process you are used to with Python.
168Obviously the time required to translate a project will vary, so do an
169experimental translation just to see how long it takes to evaluate whether you
170prefer this approach compared to using :ref:`use_same_source` or simply keeping
171a separate Python 3 codebase.
172
173Below are the typical steps taken by a project which uses a 2to3-based approach
174to supporting Python 2 & 3.
175
176
177Support Python 2.7
178------------------
179As a first step, make sure that your project is compatible with `Python 2.7`_.
180This is just good to do as Python 2.7 is the last release of Python 2 and thus
181will be used for a rather long time. It also allows for use of the ``-3`` flag
182to Python to help discover places in your code which 2to3 cannot handle but are
183known to cause issues.
184
Brett Cannonce71ab22011-02-05 22:05:05 +0000185Try to Support `Python 2.6`_ and Newer Only
186-------------------------------------------
Brett Cannon8045d972011-02-03 22:01:54 +0000187While not possible for all projects, if you can support `Python 2.6`_ and newer
188**only**, your life will be much easier. Various future statements, stdlib
189additions, etc. exist only in Python 2.6 and later which greatly assist in
190porting to Python 3. But if you project must keep support for `Python 2.5`_ (or
191even `Python 2.4`_) then it is still possible to port to Python 3.
192
193Below are the benefits you gain if you only have to support Python 2.6 and
194newer. Some of these options are personal choice while others are
195**strongly** recommended (the ones that are more for personal choice are
196labeled as such). If you continue to support older versions of Python then you
197at least need to watch out for situations that these solutions fix.
198
199
Brett Cannon8045d972011-02-03 22:01:54 +0000200``from __future__ import print_function``
201'''''''''''''''''''''''''''''''''''''''''
202This is a personal choice. 2to3 handles the translation from the print
203statement to the print function rather well so this is an optional step. This
204future statement does help, though, with getting used to typing
205``print('Hello, World')`` instead of ``print 'Hello, World'``.
206
207
208``from __future__ import unicode_literals``
209'''''''''''''''''''''''''''''''''''''''''''
210Another personal choice. You can always mark what you want to be a (unicode)
211string with a ``u`` prefix to get the same effect. But regardless of whether
212you use this future statement or not, you **must** make sure you know exactly
213which Python 2 strings you want to be bytes, and which are to be strings. This
214means you should, **at minimum** mark all strings that are meant to be text
215strings with a ``u`` prefix if you do not use this future statement.
216
217
218Bytes literals
219''''''''''''''
220This is a **very** important one. The ability to prefix Python 2 strings that
221are meant to contain bytes with a ``b`` prefix help to very clearly delineate
222what is and is not a Python 3 string. When you run 2to3 on code, all Python 2
223strings become Python 3 strings **unless** they are prefixed with ``b``.
224
225There are some differences between byte literals in Python 2 and those in
226Python 3 thanks to the bytes type just being an alias to ``str`` in Python 2.
227Probably the biggest "gotcha" is that indexing results in different values. In
228Python 2, the value of ``b'py'[1]`` is ``'y'``, while in Python 3 it's ``121``.
229You can avoid this disparity by always slicing at the size of a single element:
230``b'py'[1:2]`` is ``'y'`` in Python 2 and ``b'y'`` in Python 3 (i.e., close
231enough).
232
233You cannot concatenate bytes and strings in Python 3. But since in Python
2342 has bytes aliased to ``str``, it will succeed: ``b'a' + u'b'`` works in
235Python 2, but ``b'a' + 'b'`` in Python 3 is a :exc:`TypeError`. A similar issue
236also comes about when doing comparisons between bytes and strings.
237
238
Brett Cannonce71ab22011-02-05 22:05:05 +0000239Supporting `Python 2.5`_ and Newer Only
240---------------------------------------
241If you are supporting `Python 2.5`_ and newer there are still some features of
242Python that you can utilize.
243
244
245``from __future__ import absolute_imports``
246'''''''''''''''''''''''''''''''''''''''''''
247Implicit relative imports (e.g., importing ``spam.bacon`` from within
248``spam.eggs`` with the statement ``import bacon``) does not work in Python 3.
249This future statement moves away from that and allows the use of explicit
250relative imports (e.g., ``from . import bacon``).
251
252In `Python 2.5`_ you must use
253the __future__ statement to get to use explicit relative imports and prevent
254implicit ones. In `Python 2.6`_ explicit relative imports are available without
255the statement, but you still want the __future__ statement to prevent implicit
256relative imports. In `Python 2.7`_ the __future__ statement is not needed. In
257other words, unless you are only supporting Python 2.7 or a version earlier
258than Python 2.5, use the __future__ statement.
259
260
261
Brett Cannon8045d972011-02-03 22:01:54 +0000262Handle Common "Gotchas"
263-----------------------
264There are a few things that just consistently come up as sticking points for
265people which 2to3 cannot handle automatically or can easily be done in Python 2
266to help modernize your code.
267
268
Brett Cannonce71ab22011-02-05 22:05:05 +0000269``from __future__ import division``
270'''''''''''''''''''''''''''''''''''
271While the exact same outcome can be had by using the ``-Qnew`` argument to
272Python, using this future statement lifts the requirement that your users use
273the flag to get the expected behavior of division in Python 3
274(e.g., ``1/2 == 0.5; 1//2 == 0``).
275
276
277
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000278Specify when opening a file as binary
279'''''''''''''''''''''''''''''''''''''
280
281Unless you have been working on Windows, there is a chance you have not always
282bothered to add the ``b`` mode when opening a binary file (e.g., ``rb`` for
283binary reading). Under Python 3, binary files and text files are clearly
284distinct and mutually incompatible; see the :mod:`io` module for details.
285Therefore, you **must** make a decision of whether a file will be used for
286binary access (allowing to read and/or write bytes data) or text access
287(allowing to read and/or write unicode data).
288
289Text files
290''''''''''
291
292Text files created using ``open()`` under Python 2 return byte strings,
293while under Python 3 they return unicode strings. Depending on your porting
294strategy, this can be an issue.
295
296If you want text files to return unicode strings in Python 2, you have two
297possibilities:
298
299* Under Python 2.6 and higher, use :func:`io.open`. Since :func:`io.open`
300 is essentially the same function in both Python 2 and Python 3, it will
301 help iron out any issues that might arise.
302
303* If pre-2.6 compatibility is needed, then you should use :func:`codecs.open`
304 instead. This will make sure that you get back unicode strings in Python 2.
305
Brett Cannon8045d972011-02-03 22:01:54 +0000306Subclass ``object``
307'''''''''''''''''''
Brett Cannonce71ab22011-02-05 22:05:05 +0000308New-style classes have been around since `Python 2.2`_. You need to make sure
309you are subclassing from ``object`` to avoid odd edge cases involving method
Brett Cannon8045d972011-02-03 22:01:54 +0000310resolution order, etc. This continues to be totally valid in Python 3 (although
311unneeded as all classes implicitly inherit from ``object``).
312
313
314Deal With the Bytes/String Dichotomy
315''''''''''''''''''''''''''''''''''''
316One of the biggest issues people have when porting code to Python 3 is handling
317the bytes/string dichotomy. Because Python 2 allowed the ``str`` type to hold
318textual data, people have over the years been rather loose in their delineation
319of what ``str`` instances held text compared to bytes. In Python 3 you cannot
320be so care-free anymore and need to properly handle the difference. The key
321handling this issue to to make sure that **every** string literal in your
322Python 2 code is either syntactically of functionally marked as either bytes or
323text data. After this is done you then need to make sure your APIs are designed
324to either handle a specific type or made to be properly polymorphic.
325
326
327Mark Up Python 2 String Literals
328********************************
329
330First thing you must do is designate every single string literal in Python 2
331as either textual or bytes data. If you are only supporting Python 2.6 or
332newer, this can be accomplished by marking bytes literals with a ``b`` prefix
333and then designating textual data with a ``u`` prefix or using the
334``unicode_literals`` future statement.
335
336If your project supports versions of Python pre-dating 2.6, then you should use
337the six_ project and its ``b()`` function to denote bytes literals. For text
338literals you can either use six's ``u()`` function or use a ``u`` prefix.
339
340
341Decide what APIs Will Accept
342****************************
343In Python 2 it was very easy to accidentally create an API that accepted both
344bytes and textual data. But in Python 3, thanks to the more strict handling of
345disparate types, this loose usage of bytes and text together tends to fail.
346
347Take the dict ``{b'a': 'bytes', u'a': 'text'}`` in Python 2.6. It creates the
348dict ``{u'a': 'text'}`` since ``b'a' == u'a'``. But in Python 3 the equivalent
349dict creates ``{b'a': 'bytes', 'a': 'text'}``, i.e., no lost data. Similar
350issues can crop up when transitioning Python 2 code to Python 3.
351
352This means you need to choose what an API is going to accept and create and
353consistently stick to that API in both Python 2 and 3.
354
355
Brett Cannonce71ab22011-02-05 22:05:05 +0000356Bytes / Unicode Comparison
Antoine Pitrou8d8f7c52011-02-05 11:40:05 +0000357**************************
358
359In Python 3, mixing bytes and unicode is forbidden in most situations; it
360will raise a :class:`TypeError` where Python 2 would have attempted an implicit
361coercion between types. However, there is one case where it doesn't and
362it can be very misleading::
363
364 >>> b"" == ""
365 False
366
367This is because comparison for equality is required by the language to always
368succeed (and return ``False`` for incompatible types). However, this also
369means that code incorrectly ported to Python 3 can display buggy behaviour
370if such comparisons are silently executed. To detect such situations,
371Python 3 has a ``-b`` flag that will display a warning::
372
373 $ python3 -b
374 >>> b"" == ""
375 __main__:1: BytesWarning: Comparison between bytes and string
376 False
377
378To turn the warning into an exception, use the ``-bb`` flag instead::
379
380 $ python3 -bb
381 >>> b"" == ""
382 Traceback (most recent call last):
383 File "<stdin>", line 1, in <module>
384 BytesWarning: Comparison between bytes and string
385
386
Antoine Pitroubd866e92011-02-05 12:13:38 +0000387Indexing bytes objects
388''''''''''''''''''''''
389
390Another potentially surprising change is the indexing behaviour of bytes
391objects in Python 3::
392
393 >>> b"xyz"[0]
394 120
395
396Indeed, Python 3 bytes objects (as well as :class:`bytearray` objects)
397are sequences of integers. But code converted from Python 2 will often
398assume that indexing a bytestring produces another bytestring, not an
399integer. To reconcile both behaviours, use slicing::
400
401 >>> b"xyz"[0:1]
402 b'x'
403 >>> n = 1
404 >>> b"xyz"[n:n+1]
405 b'y'
406
407The only remaining gotcha is that an out-of-bounds slice returns an empty
408bytes object instead of raising ``IndexError``:
409
410 >>> b"xyz"[3]
411 Traceback (most recent call last):
412 File "<stdin>", line 1, in <module>
413 IndexError: index out of range
414 >>> b"xyz"[3:4]
415 b''
416
417
Brett Cannon8045d972011-02-03 22:01:54 +0000418``__str__()``/``__unicode__()``
419'''''''''''''''''''''''''''''''
420In Python 2, objects can specify both a string and unicode representation of
421themselves. In Python 3, though, there is only a string representation. This
422becomes an issue as people can inadvertantly do things in their ``__str__()``
423methods which have unpredictable results (e.g., infinite recursion if you
424happen to use the ``unicode(self).encode('utf8')`` idiom as the body of your
425``__str__()`` method).
426
427There are two ways to solve this issue. One is to use a custom 2to3 fixer. The
428blog post at http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/
429specifies how to do this. That will allow 2to3 to change all instances of ``def
430__unicode(self): ...`` to ``def __str__(self): ...``. This does require you
431define your ``__str__()`` method in Python 2 before your ``__unicode__()``
432method.
433
434The other option is to use a mixin class. This allows you to only define a
435``__unicode__()`` method for your class and let the mixin derive
436``__str__()`` for you (code from
437http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/)::
438
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000439 import sys
Brett Cannon8045d972011-02-03 22:01:54 +0000440
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000441 class UnicodeMixin(object):
Brett Cannon8045d972011-02-03 22:01:54 +0000442
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000443 """Mixin class to handle defining the proper __str__/__unicode__
444 methods in Python 2 or 3."""
Brett Cannon8045d972011-02-03 22:01:54 +0000445
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000446 if sys.version_info[0] >= 3: # Python 3
447 def __str__(self):
448 return self.__unicode__()
449 else: # Python 2
450 def __str__(self):
451 return self.__unicode__().encode('utf8')
Brett Cannon8045d972011-02-03 22:01:54 +0000452
453
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000454 class Spam(UnicodeMixin):
Brett Cannon8045d972011-02-03 22:01:54 +0000455
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000456 def __unicode__(self):
457 return u'spam-spam-bacon-spam' # 2to3 will remove the 'u' prefix
Brett Cannon8045d972011-02-03 22:01:54 +0000458
459
Brett Cannon8045d972011-02-03 22:01:54 +0000460Don't Index on Exceptions
461'''''''''''''''''''''''''
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000462
Brett Cannon8045d972011-02-03 22:01:54 +0000463In Python 2, the following worked::
464
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000465 >>> exc = Exception(1, 2, 3)
466 >>> exc.args[1]
467 2
468 >>> exc[1] # Python 2 only!
469 2
Brett Cannon8045d972011-02-03 22:01:54 +0000470
Eli Bendersky7ac34192011-02-07 04:44:19 +0000471But in Python 3, indexing directly on an exception is an error. You need to
472make sure to only index on the :attr:`BaseException.args` attribute which is a
Brett Cannon8045d972011-02-03 22:01:54 +0000473sequence containing all arguments passed to the :meth:`__init__` method.
474
Eli Bendersky7ac34192011-02-07 04:44:19 +0000475Even better is to use the documented attributes the exception provides.
Brett Cannon8045d972011-02-03 22:01:54 +0000476
Brett Cannon8045d972011-02-03 22:01:54 +0000477Don't use ``__getslice__`` & Friends
478''''''''''''''''''''''''''''''''''''
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000479
Brett Cannon8045d972011-02-03 22:01:54 +0000480Been deprecated for a while, but Python 3 finally drops support for
481``__getslice__()``, etc. Move completely over to :meth:`__getitem__` and
482friends.
483
484
Brett Cannon45aa7cc2011-02-05 22:16:40 +0000485Updating doctests
486'''''''''''''''''
4872to3_ will attempt to generate fixes for doctests that it comes across. It's
488not perfect, though. If you wrote a monolithic set of doctests (e.g., a single
489docstring containing all of your doctests), you should at least consider
490breaking the doctests up into smaller pieces to make it more manageable to fix.
491Otherwise it might very well be worth your time and effort to port your tests
492to :mod:`unittest`.
Brett Cannon8045d972011-02-03 22:01:54 +0000493
494
495Eliminate ``-3`` Warnings
496-------------------------
497When you run your application's test suite, run it using the ``-3`` flag passed
498to Python. This will cause various warnings to be raised during execution about
499things that 2to3 cannot handle automatically (e.g., modules that have been
500removed). Try to eliminate those warnings to make your code even more portable
501to Python 3.
502
503
504Run 2to3
505--------
506Once you have made your Python 2 code future-compatible with Python 3, it's
507time to use 2to3_ to actually port your code.
508
509
510Manually
511''''''''
512To manually convert source code using 2to3_, you use the ``2to3`` script that
513is installed with Python 2.6 and later.::
514
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000515 2to3 <directory or file to convert>
Brett Cannon8045d972011-02-03 22:01:54 +0000516
517This will cause 2to3 to write out a diff with all of the fixers applied for the
518converted source code. If you would like 2to3 to go ahead and apply the changes
519you can pass it the ``-w`` flag::
520
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000521 2to3 -w <stuff to convert>
Brett Cannon8045d972011-02-03 22:01:54 +0000522
523There are other flags available to control exactly which fixers are applied,
524etc.
525
526
527During Installation
528'''''''''''''''''''
529When a user installs your project for Python 3, you can have either
530:mod:`distutils` or Distribute_ run 2to3_ on your behalf.
531For distutils, use the following idiom::
532
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000533 try: # Python 3
534 from distutils.command.build_py import build_py_2to3 as build_py
535 except ImportError: # Python 2
536 from distutils.command.build_py import build_py
Brett Cannon8045d972011-02-03 22:01:54 +0000537
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000538 setup(cmdclass = {'build_py':build_py},
539 # ...
540 )
Brett Cannon8045d972011-02-03 22:01:54 +0000541
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000542 For Distribute::
Brett Cannon8045d972011-02-03 22:01:54 +0000543
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000544 setup(use_2to3=True,
545 # ...
546 )
Brett Cannon8045d972011-02-03 22:01:54 +0000547
548This will allow you to not have to distribute a separate Python 3 version of
549your project. It does require, though, that when you perform development that
550you at least build your project and use the built Python 3 source for testing.
551
552
553Verify & Test
554-------------
555At this point you should (hopefully) have your project converted in such a way
556that it works in Python 3. Verify it by running your unit tests and making sure
557nothing has gone awry. If you miss something then figure out how to fix it in
558Python 3, backport to your Python 2 code, and run your code through 2to3 again
559to verify the fix transforms properly.
560
561
562.. _2to3: http://docs.python.org/py3k/library/2to3.html
563.. _Distribute: http://packages.python.org/distribute/
564
565
566.. _use_same_source:
567
568Python 2/3 Compatible Source
569============================
570While it may seem counter-intuitive, you can write Python code which is
571source-compatible between Python 2 & 3. It does lead to code that is not
572entirely idiomatic Python (e.g., having to extract the currently raised
573exception from ``sys.exc_info()[1]``), but it can be run under Python 2
574**and** Python 3 without using 2to3_ as a translation step. This allows you to
575continue to have a rapid development process regardless of whether you are
576developing under Python 2 or Python 3. Whether this approach or using
577:ref:`use_2to3` works best for you will be a per-project decision.
578
579To get a complete idea of what issues you will need to deal with, see the
580`What's New in Python 3.0`_. Others have reorganized the data in other formats
581such as http://docs.pythonsprints.com/python3_porting/py-porting.html .
582
583The following are some steps to take to try to support both Python 2 & 3 from
584the same source code.
585
586
587.. _What's New in Python 3.0: http://docs.python.org/release/3.0/whatsnew/3.0.html
588
589
590Follow The Steps for Using 2to3_ (sans 2to3)
591--------------------------------------------
592All of the steps outlined in how to
593:ref:`port Python 2 code with 2to3 <use_2to3>` apply
594to creating a Python 2/3 codebase. This includes trying only support Python 2.6
595or newer (the :mod:`__future__` statements work in Python 3 without issue),
596eliminating warnings that are triggered by ``-3``, etc.
597
Brett Cannon98135d02011-02-05 22:22:47 +0000598You should even consider running 2to3_ over your code (without committing the
599changes). This will let you know where potential pain points are within your
600code so that you can fix them properly before they become an issue.
Brett Cannon8045d972011-02-03 22:01:54 +0000601
602
603Use six_
604--------
605The six_ project contains many things to help you write portable Python code.
606You should make sure to read its documentation from beginning to end and use
607any and all features it provides. That way you will minimize any mistakes you
608might make in writing cross-version code.
609
610
611Capturing the Currently Raised Exception
612----------------------------------------
Brett Cannonce71ab22011-02-05 22:05:05 +0000613One change between Python 2 and 3 that will require changing how you code (if
614you support `Python 2.5`_ and earlier) is
615accessing the currently raised exception. In Python 2.5 and earlier the syntax
616to access the current exception is::
Brett Cannon8045d972011-02-03 22:01:54 +0000617
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000618 try:
619 raise Exception()
620 except Exception, exc:
621 # Current exception is 'exc'
622 pass
Brett Cannon8045d972011-02-03 22:01:54 +0000623
Brett Cannonce71ab22011-02-05 22:05:05 +0000624This syntax changed in Python 3 (and backported to `Python 2.6`_ and later)
625to::
Brett Cannon8045d972011-02-03 22:01:54 +0000626
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000627 try:
628 raise Exception()
629 except Exception as exc:
630 # Current exception is 'exc'
Brett Cannonce71ab22011-02-05 22:05:05 +0000631 # In Python 3, 'exc' is restricted to the block; Python 2.6 will "leak"
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000632 pass
Brett Cannon8045d972011-02-03 22:01:54 +0000633
634Because of this syntax change you must change to capturing the current
635exception to::
636
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000637 try:
638 raise Exception()
639 except Exception:
640 import sys
641 exc = sys.exc_info()[1]
642 # Current exception is 'exc'
643 pass
Brett Cannon8045d972011-02-03 22:01:54 +0000644
645You can get more information about the raised exception from
646:func:`sys.exc_info` than simply the current exception instance, but you most
Antoine Pitroue6a14642011-02-05 12:01:07 +0000647likely don't need it.
Brett Cannon8045d972011-02-03 22:01:54 +0000648
Antoine Pitroue6a14642011-02-05 12:01:07 +0000649.. note::
650 In Python 3, the traceback is attached to the exception instance
Brett Cannonce71ab22011-02-05 22:05:05 +0000651 through the ``__traceback__`` attribute. If the instance is saved in
Antoine Pitroue6a14642011-02-05 12:01:07 +0000652 a local variable that persists outside of the ``except`` block, the
653 traceback will create a reference cycle with the current frame and its
654 dictionary of local variables. This will delay reclaiming dead
655 resources until the next cyclic :term:`garbage collection` pass.
656
657 In Python 2, this problem only occurs if you save the traceback itself
658 (e.g. the third element of the tuple returned by :func:`sys.exc_info`)
659 in a variable.
Brett Cannon8045d972011-02-03 22:01:54 +0000660
661Other Resources
662===============
663The authors of the following blogs posts and wiki pages deserve special thanks
664for making public their tips for porting Python 2 code to Python 3 (and thus
665helping provide information for this document):
666
667* http://docs.pythonsprints.com/python3_porting/py-porting.html
668* http://techspot.zzzeek.org/2011/01/24/zzzeek-s-guide-to-python-3-porting/
669* http://dabeaz.blogspot.com/2011/01/porting-py65-and-my-superboard-to.html
670* http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/
671* http://lucumr.pocoo.org/2010/2/11/porting-to-python-3-a-guide/
672* http://wiki.python.org/moin/PortingPythonToPy3k
673
674If you feel there is something missing from this document that should be added,
675please email the python-porting_ mailing list.
676
677.. _python-porting: http://mail.python.org/mailman/listinfo/python-porting