blob: 042d1209b9fb910436dfc925e91801ea872407e0 [file] [log] [blame]
Brett Cannon8045d972011-02-03 22:01:54 +00001.. _pyporting-howto:
2
3*********************************
4Porting Python 2 Code to Python 3
5*********************************
6
7:author: Brett Cannon
8
9.. topic:: Abstract
10
Brett Cannon4b0c24a2011-02-03 22:14:58 +000011 With Python 3 being the future of Python while Python 2 is still in active
12 use, it is good to have your project available for both major releases of
13 Python. This guide is meant to help you choose which strategy works best
14 for your project to support both Python 2 & 3 along with how to execute
15 that strategy.
Brett Cannon8045d972011-02-03 22:01:54 +000016
Brett Cannon4b0c24a2011-02-03 22:14:58 +000017 If you are looking to port an extension module instead of pure Python code,
Éric Araujo5405a0b2011-02-05 16:03:12 +000018 please see :ref:`cporting-howto`.
Brett Cannon8045d972011-02-03 22:01:54 +000019
20
21Choosing a Strategy
22===================
23When a project makes the decision that it's time to support both Python 2 & 3,
24a decision needs to be made as to how to go about accomplishing that goal.
25Which strategy goes with will depend on how large the project's existing
26codebase is and how much divergence you want from your Python 2 codebase from
27your Python 3 one (e.g., starting a new version with Python 3).
28
29If your project is brand-new or does not have a large codebase, then you may
30want to consider writing/porting :ref:`all of your code for Python 3
31and use 3to2 <use_3to2>` to port your code for Python 2.
32
33If your project has a pre-existing Python 2 codebase and you would like Python
343 support to start off a new branch or version of your project, then you will
35most likely want to :ref:`port using 2to3 <use_2to3>`. This will allow you port
36your Python 2 code to Python 3 in a semi-automated fashion and begin to
37maintain it separately from your Python 2 code. This approach can also work if
38your codebase is small and/or simple enough for the translation to occur
39quickly.
40
41Finally, if you want to maintain Python 2 and Python 3 versions of your project
42simultaneously and with no differences, then you can write :ref:`Python 2/3
43source-compatible code <use_same_source>`. While the code is not quite as
44idiomatic as it would be written just for Python 3 or automating the port from
45Python 2, it does makes it easier to continue to do rapid development
46regardless of what major version of Python you are developing against at the
47time.
48
49Regardless of which approach you choose, porting is probably not as hard or
50time-consuming as you might initially think. You can also tackle the problem
51piece-meal as a good portion of porting is simply updating your code to follow
52current best practices in a Python 2/3 compatible way.
53
54
55Universal Bits of Advice
56------------------------
57Regardless of what strategy you pick, there are a few things you should
58consider.
59
60One is make sure you have a robust test suite. You need to make sure everything
61continues to work, just like when you support a new minor version of Python.
62This means making sure your test suite is thorough and is ported properly
63between Python 2 & 3. You will also most likely want to use something like tox_
64to automate testing between both a Python 2 and Python 3 VM.
65
66Two, once your project has Python 3 support, make sure to add the proper
67classifier on the Cheeseshop_ (PyPI_). To have your project listed as Python 3
68compatible it must have the
69`Python 3 classifier <http://pypi.python.org/pypi?:action=browse&c=533>`_
70(from
71http://techspot.zzzeek.org/2011/01/24/zzzeek-s-guide-to-python-3-porting/)::
72
Brett Cannon4b0c24a2011-02-03 22:14:58 +000073 setup(
74 name='Your Library',
75 version='1.0',
76 classifiers=[
77 # make sure to use :: Python *and* :: Python :: 3 so
78 # that pypi can list the package on the python 3 page
79 'Programming Language :: Python',
80 'Programming Language :: Python :: 3'
81 ],
82 packages=['yourlibrary'],
83 # make sure to add custom_fixers to the MANIFEST.in
84 include_package_data=True,
85 # ...
86 )
Brett Cannon8045d972011-02-03 22:01:54 +000087
88
89Doing so will cause your project to show up in the
90`Python 3 packages list
91<http://pypi.python.org/pypi?:action=browse&c=533&show=all>`_. You will know
92you set the classifier properly as visiting your project page on the Cheeseshop
93will show a Python 3 logo in the upper-left corner of the page.
94
95Three, the six_ project provides a library which helps iron out differences
96between Python 2 & 3. If you find there is a sticky point that is a continual
97point of contention in your translation or maintenance of code, consider using
98a source-compatible solution relying on six. If you have to create your own
99Python 2/3 compatible solution, you can use ``sys.version_info[0] >= 3`` as a
100guard.
101
102Four, read all the approaches. Just because some bit of advice applies to one
103approach more than another doesn't mean that some advice doesn't apply to other
104strategies.
105
106Five, drop support for older Python versions if possible. While not a
107requirement, `Python 2.5`_) introduced a lot of useful syntax and libraries
108which have become idiomatic in Python 3. `Python 2.6`_ introduced future
109statements which makes compatibility much easier if you are going from Python 2
110to 3.
111`Python 2.7`_ continues the trend in the stdlib. So choose the newest version
112of Python for which you believe you believe can be your minimum support version
113and work from there.
114
115
116.. _tox: http://codespeak.net/tox/
117.. _Cheeseshop:
118.. _PyPI: http://pypi.python.org/
119.. _six: http://packages.python.org/six
120.. _Python 2.7: http://www.python.org/2.7.x
121.. _Python 2.6: http://www.python.org/2.6.x
122.. _Python 2.5: http://www.python.org/2.5.x
123.. _Python 2.4: http://www.python.org/2.4.x
Brett Cannonce71ab22011-02-05 22:05:05 +0000124.. _Python 2.3: http://www.python.org/2.3.x
125.. _Python 2.2: http://www.python.org/2.2.x
Brett Cannon8045d972011-02-03 22:01:54 +0000126
127
128.. _use_3to2:
129
130Python 3 and 3to2
131=================
132If you are starting a new project or your codebase is small enough, you may
133want to consider writing your code for Python 3 and backporting to Python 2
134using 3to2_. Thanks to Python 3 being more strict about things than Python 2
135(e.g., bytes vs. strings), the source translation can be easier and more
136straightforward than from Python 2 to 3. Plus it gives you more direct
137experience developing in Python 3 which, since it is the future of Python, is a
138good thing long-term.
139
140A drawback of this approach is that 3to2 is a third-party project. This means
141that the Python core developers (and thus this guide) can make no promises
142about how well 3to2 works at any time. There is nothing to suggest, though,
143that 3to2 is not a high-quality project.
144
145
146.. _3to2: https://bitbucket.org/amentajo/lib3to2/overview
147
148
149.. _use_2to3:
150
151Python 2 and 2to3
152=================
153Included with Python since 2.6, 2to3_ tool (and :mod:`lib2to3` module) helps
154with porting Python 2 to Python 3 by performing various source translations.
155This is a perfect solution for projects which wish to branch their Python 3
156code from their Python 2 codebase and maintain them as independent codebases.
157You can even begin preparing to use this approach today by writing
158future-compatible Python code which works cleanly in Python 2 in conjunction
159with 2to3; all steps outlined below will work with Python 2 code up to the
160point when the actual use of 2to3 occurs.
161
162Use of 2to3 as an on-demand translation step at install time is also possible,
163preventing the need to maintain a separate Python 3 codebase, but this approach
164does come with some drawbacks. While users will only have to pay the
165translation cost once at installation, you as a developer will need to pay the
166cost regularly during development. If your codebase is sufficiently large
167enough then the translation step ends up acting like a compilation step,
168robbing you of the rapid development process you are used to with Python.
169Obviously the time required to translate a project will vary, so do an
170experimental translation just to see how long it takes to evaluate whether you
171prefer this approach compared to using :ref:`use_same_source` or simply keeping
172a separate Python 3 codebase.
173
174Below are the typical steps taken by a project which uses a 2to3-based approach
175to supporting Python 2 & 3.
176
177
178Support Python 2.7
179------------------
180As a first step, make sure that your project is compatible with `Python 2.7`_.
181This is just good to do as Python 2.7 is the last release of Python 2 and thus
182will be used for a rather long time. It also allows for use of the ``-3`` flag
183to Python to help discover places in your code which 2to3 cannot handle but are
184known to cause issues.
185
Brett Cannonce71ab22011-02-05 22:05:05 +0000186Try to Support `Python 2.6`_ and Newer Only
187-------------------------------------------
Brett Cannon8045d972011-02-03 22:01:54 +0000188While not possible for all projects, if you can support `Python 2.6`_ and newer
189**only**, your life will be much easier. Various future statements, stdlib
190additions, etc. exist only in Python 2.6 and later which greatly assist in
191porting to Python 3. But if you project must keep support for `Python 2.5`_ (or
192even `Python 2.4`_) then it is still possible to port to Python 3.
193
194Below are the benefits you gain if you only have to support Python 2.6 and
195newer. Some of these options are personal choice while others are
196**strongly** recommended (the ones that are more for personal choice are
197labeled as such). If you continue to support older versions of Python then you
198at least need to watch out for situations that these solutions fix.
199
200
Brett Cannon8045d972011-02-03 22:01:54 +0000201``from __future__ import print_function``
202'''''''''''''''''''''''''''''''''''''''''
203This is a personal choice. 2to3 handles the translation from the print
204statement to the print function rather well so this is an optional step. This
205future statement does help, though, with getting used to typing
206``print('Hello, World')`` instead of ``print 'Hello, World'``.
207
208
209``from __future__ import unicode_literals``
210'''''''''''''''''''''''''''''''''''''''''''
211Another personal choice. You can always mark what you want to be a (unicode)
212string with a ``u`` prefix to get the same effect. But regardless of whether
213you use this future statement or not, you **must** make sure you know exactly
214which Python 2 strings you want to be bytes, and which are to be strings. This
215means you should, **at minimum** mark all strings that are meant to be text
216strings with a ``u`` prefix if you do not use this future statement.
217
218
219Bytes literals
220''''''''''''''
221This is a **very** important one. The ability to prefix Python 2 strings that
222are meant to contain bytes with a ``b`` prefix help to very clearly delineate
223what is and is not a Python 3 string. When you run 2to3 on code, all Python 2
224strings become Python 3 strings **unless** they are prefixed with ``b``.
225
226There are some differences between byte literals in Python 2 and those in
227Python 3 thanks to the bytes type just being an alias to ``str`` in Python 2.
228Probably the biggest "gotcha" is that indexing results in different values. In
229Python 2, the value of ``b'py'[1]`` is ``'y'``, while in Python 3 it's ``121``.
230You can avoid this disparity by always slicing at the size of a single element:
231``b'py'[1:2]`` is ``'y'`` in Python 2 and ``b'y'`` in Python 3 (i.e., close
232enough).
233
234You cannot concatenate bytes and strings in Python 3. But since in Python
2352 has bytes aliased to ``str``, it will succeed: ``b'a' + u'b'`` works in
236Python 2, but ``b'a' + 'b'`` in Python 3 is a :exc:`TypeError`. A similar issue
237also comes about when doing comparisons between bytes and strings.
238
239
Brett Cannonce71ab22011-02-05 22:05:05 +0000240Supporting `Python 2.5`_ and Newer Only
241---------------------------------------
242If you are supporting `Python 2.5`_ and newer there are still some features of
243Python that you can utilize.
244
245
246``from __future__ import absolute_imports``
247'''''''''''''''''''''''''''''''''''''''''''
248Implicit relative imports (e.g., importing ``spam.bacon`` from within
249``spam.eggs`` with the statement ``import bacon``) does not work in Python 3.
250This future statement moves away from that and allows the use of explicit
251relative imports (e.g., ``from . import bacon``).
252
253In `Python 2.5`_ you must use
254the __future__ statement to get to use explicit relative imports and prevent
255implicit ones. In `Python 2.6`_ explicit relative imports are available without
256the statement, but you still want the __future__ statement to prevent implicit
257relative imports. In `Python 2.7`_ the __future__ statement is not needed. In
258other words, unless you are only supporting Python 2.7 or a version earlier
259than Python 2.5, use the __future__ statement.
260
261
262
Brett Cannon8045d972011-02-03 22:01:54 +0000263Handle Common "Gotchas"
264-----------------------
265There are a few things that just consistently come up as sticking points for
266people which 2to3 cannot handle automatically or can easily be done in Python 2
267to help modernize your code.
268
269
Brett Cannonce71ab22011-02-05 22:05:05 +0000270``from __future__ import division``
271'''''''''''''''''''''''''''''''''''
272While the exact same outcome can be had by using the ``-Qnew`` argument to
273Python, using this future statement lifts the requirement that your users use
274the flag to get the expected behavior of division in Python 3
275(e.g., ``1/2 == 0.5; 1//2 == 0``).
276
277
278
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000279Specify when opening a file as binary
280'''''''''''''''''''''''''''''''''''''
281
282Unless you have been working on Windows, there is a chance you have not always
283bothered to add the ``b`` mode when opening a binary file (e.g., ``rb`` for
284binary reading). Under Python 3, binary files and text files are clearly
285distinct and mutually incompatible; see the :mod:`io` module for details.
286Therefore, you **must** make a decision of whether a file will be used for
287binary access (allowing to read and/or write bytes data) or text access
288(allowing to read and/or write unicode data).
289
290Text files
291''''''''''
292
293Text files created using ``open()`` under Python 2 return byte strings,
294while under Python 3 they return unicode strings. Depending on your porting
295strategy, this can be an issue.
296
297If you want text files to return unicode strings in Python 2, you have two
298possibilities:
299
300* Under Python 2.6 and higher, use :func:`io.open`. Since :func:`io.open`
301 is essentially the same function in both Python 2 and Python 3, it will
302 help iron out any issues that might arise.
303
304* If pre-2.6 compatibility is needed, then you should use :func:`codecs.open`
305 instead. This will make sure that you get back unicode strings in Python 2.
306
Brett Cannon8045d972011-02-03 22:01:54 +0000307Subclass ``object``
308'''''''''''''''''''
Brett Cannonce71ab22011-02-05 22:05:05 +0000309New-style classes have been around since `Python 2.2`_. You need to make sure
310you are subclassing from ``object`` to avoid odd edge cases involving method
Brett Cannon8045d972011-02-03 22:01:54 +0000311resolution order, etc. This continues to be totally valid in Python 3 (although
312unneeded as all classes implicitly inherit from ``object``).
313
314
315Deal With the Bytes/String Dichotomy
316''''''''''''''''''''''''''''''''''''
317One of the biggest issues people have when porting code to Python 3 is handling
318the bytes/string dichotomy. Because Python 2 allowed the ``str`` type to hold
319textual data, people have over the years been rather loose in their delineation
320of what ``str`` instances held text compared to bytes. In Python 3 you cannot
321be so care-free anymore and need to properly handle the difference. The key
322handling this issue to to make sure that **every** string literal in your
323Python 2 code is either syntactically of functionally marked as either bytes or
324text data. After this is done you then need to make sure your APIs are designed
325to either handle a specific type or made to be properly polymorphic.
326
327
328Mark Up Python 2 String Literals
329********************************
330
331First thing you must do is designate every single string literal in Python 2
332as either textual or bytes data. If you are only supporting Python 2.6 or
333newer, this can be accomplished by marking bytes literals with a ``b`` prefix
334and then designating textual data with a ``u`` prefix or using the
335``unicode_literals`` future statement.
336
337If your project supports versions of Python pre-dating 2.6, then you should use
338the six_ project and its ``b()`` function to denote bytes literals. For text
339literals you can either use six's ``u()`` function or use a ``u`` prefix.
340
341
342Decide what APIs Will Accept
343****************************
344In Python 2 it was very easy to accidentally create an API that accepted both
345bytes and textual data. But in Python 3, thanks to the more strict handling of
346disparate types, this loose usage of bytes and text together tends to fail.
347
348Take the dict ``{b'a': 'bytes', u'a': 'text'}`` in Python 2.6. It creates the
349dict ``{u'a': 'text'}`` since ``b'a' == u'a'``. But in Python 3 the equivalent
350dict creates ``{b'a': 'bytes', 'a': 'text'}``, i.e., no lost data. Similar
351issues can crop up when transitioning Python 2 code to Python 3.
352
353This means you need to choose what an API is going to accept and create and
354consistently stick to that API in both Python 2 and 3.
355
356
Brett Cannonce71ab22011-02-05 22:05:05 +0000357Bytes / Unicode Comparison
Antoine Pitrou8d8f7c52011-02-05 11:40:05 +0000358**************************
359
360In Python 3, mixing bytes and unicode is forbidden in most situations; it
361will raise a :class:`TypeError` where Python 2 would have attempted an implicit
362coercion between types. However, there is one case where it doesn't and
363it can be very misleading::
364
365 >>> b"" == ""
366 False
367
368This is because comparison for equality is required by the language to always
369succeed (and return ``False`` for incompatible types). However, this also
370means that code incorrectly ported to Python 3 can display buggy behaviour
371if such comparisons are silently executed. To detect such situations,
372Python 3 has a ``-b`` flag that will display a warning::
373
374 $ python3 -b
375 >>> b"" == ""
376 __main__:1: BytesWarning: Comparison between bytes and string
377 False
378
379To turn the warning into an exception, use the ``-bb`` flag instead::
380
381 $ python3 -bb
382 >>> b"" == ""
383 Traceback (most recent call last):
384 File "<stdin>", line 1, in <module>
385 BytesWarning: Comparison between bytes and string
386
387
Antoine Pitroubd866e92011-02-05 12:13:38 +0000388Indexing bytes objects
389''''''''''''''''''''''
390
391Another potentially surprising change is the indexing behaviour of bytes
392objects in Python 3::
393
394 >>> b"xyz"[0]
395 120
396
397Indeed, Python 3 bytes objects (as well as :class:`bytearray` objects)
398are sequences of integers. But code converted from Python 2 will often
399assume that indexing a bytestring produces another bytestring, not an
400integer. To reconcile both behaviours, use slicing::
401
402 >>> b"xyz"[0:1]
403 b'x'
404 >>> n = 1
405 >>> b"xyz"[n:n+1]
406 b'y'
407
408The only remaining gotcha is that an out-of-bounds slice returns an empty
409bytes object instead of raising ``IndexError``:
410
411 >>> b"xyz"[3]
412 Traceback (most recent call last):
413 File "<stdin>", line 1, in <module>
414 IndexError: index out of range
415 >>> b"xyz"[3:4]
416 b''
417
418
Brett Cannon8045d972011-02-03 22:01:54 +0000419``__str__()``/``__unicode__()``
420'''''''''''''''''''''''''''''''
421In Python 2, objects can specify both a string and unicode representation of
422themselves. In Python 3, though, there is only a string representation. This
423becomes an issue as people can inadvertantly do things in their ``__str__()``
424methods which have unpredictable results (e.g., infinite recursion if you
425happen to use the ``unicode(self).encode('utf8')`` idiom as the body of your
426``__str__()`` method).
427
428There are two ways to solve this issue. One is to use a custom 2to3 fixer. The
429blog post at http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/
430specifies how to do this. That will allow 2to3 to change all instances of ``def
431__unicode(self): ...`` to ``def __str__(self): ...``. This does require you
432define your ``__str__()`` method in Python 2 before your ``__unicode__()``
433method.
434
435The other option is to use a mixin class. This allows you to only define a
436``__unicode__()`` method for your class and let the mixin derive
437``__str__()`` for you (code from
438http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/)::
439
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000440 import sys
Brett Cannon8045d972011-02-03 22:01:54 +0000441
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000442 class UnicodeMixin(object):
Brett Cannon8045d972011-02-03 22:01:54 +0000443
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000444 """Mixin class to handle defining the proper __str__/__unicode__
445 methods in Python 2 or 3."""
Brett Cannon8045d972011-02-03 22:01:54 +0000446
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000447 if sys.version_info[0] >= 3: # Python 3
448 def __str__(self):
449 return self.__unicode__()
450 else: # Python 2
451 def __str__(self):
452 return self.__unicode__().encode('utf8')
Brett Cannon8045d972011-02-03 22:01:54 +0000453
454
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000455 class Spam(UnicodeMixin):
Brett Cannon8045d972011-02-03 22:01:54 +0000456
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000457 def __unicode__(self):
458 return u'spam-spam-bacon-spam' # 2to3 will remove the 'u' prefix
Brett Cannon8045d972011-02-03 22:01:54 +0000459
460
Brett Cannon8045d972011-02-03 22:01:54 +0000461Don't Index on Exceptions
462'''''''''''''''''''''''''
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000463
Brett Cannon8045d972011-02-03 22:01:54 +0000464In Python 2, the following worked::
465
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000466 >>> exc = Exception(1, 2, 3)
467 >>> exc.args[1]
468 2
469 >>> exc[1] # Python 2 only!
470 2
Brett Cannon8045d972011-02-03 22:01:54 +0000471
472But in Python 3, indexing directly off of an exception is an error. You need to
473make sure to only index on :attr:`BaseException.args` attribute which is a
474sequence containing all arguments passed to the :meth:`__init__` method.
475
476Even better is to use documented attributes the exception provides.
477
Brett Cannon8045d972011-02-03 22:01:54 +0000478Don't use ``__getslice__`` & Friends
479''''''''''''''''''''''''''''''''''''
Antoine Pitrou5c28cfdc2011-02-05 11:53:39 +0000480
Brett Cannon8045d972011-02-03 22:01:54 +0000481Been deprecated for a while, but Python 3 finally drops support for
482``__getslice__()``, etc. Move completely over to :meth:`__getitem__` and
483friends.
484
485
Brett Cannon45aa7cc2011-02-05 22:16:40 +0000486Updating doctests
487'''''''''''''''''
4882to3_ will attempt to generate fixes for doctests that it comes across. It's
489not perfect, though. If you wrote a monolithic set of doctests (e.g., a single
490docstring containing all of your doctests), you should at least consider
491breaking the doctests up into smaller pieces to make it more manageable to fix.
492Otherwise it might very well be worth your time and effort to port your tests
493to :mod:`unittest`.
Brett Cannon8045d972011-02-03 22:01:54 +0000494
495
496Eliminate ``-3`` Warnings
497-------------------------
498When you run your application's test suite, run it using the ``-3`` flag passed
499to Python. This will cause various warnings to be raised during execution about
500things that 2to3 cannot handle automatically (e.g., modules that have been
501removed). Try to eliminate those warnings to make your code even more portable
502to Python 3.
503
504
505Run 2to3
506--------
507Once you have made your Python 2 code future-compatible with Python 3, it's
508time to use 2to3_ to actually port your code.
509
510
511Manually
512''''''''
513To manually convert source code using 2to3_, you use the ``2to3`` script that
514is installed with Python 2.6 and later.::
515
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000516 2to3 <directory or file to convert>
Brett Cannon8045d972011-02-03 22:01:54 +0000517
518This will cause 2to3 to write out a diff with all of the fixers applied for the
519converted source code. If you would like 2to3 to go ahead and apply the changes
520you can pass it the ``-w`` flag::
521
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000522 2to3 -w <stuff to convert>
Brett Cannon8045d972011-02-03 22:01:54 +0000523
524There are other flags available to control exactly which fixers are applied,
525etc.
526
527
528During Installation
529'''''''''''''''''''
530When a user installs your project for Python 3, you can have either
531:mod:`distutils` or Distribute_ run 2to3_ on your behalf.
532For distutils, use the following idiom::
533
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000534 try: # Python 3
535 from distutils.command.build_py import build_py_2to3 as build_py
536 except ImportError: # Python 2
537 from distutils.command.build_py import build_py
Brett Cannon8045d972011-02-03 22:01:54 +0000538
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000539 setup(cmdclass = {'build_py':build_py},
540 # ...
541 )
Brett Cannon8045d972011-02-03 22:01:54 +0000542
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000543 For Distribute::
Brett Cannon8045d972011-02-03 22:01:54 +0000544
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000545 setup(use_2to3=True,
546 # ...
547 )
Brett Cannon8045d972011-02-03 22:01:54 +0000548
549This will allow you to not have to distribute a separate Python 3 version of
550your project. It does require, though, that when you perform development that
551you at least build your project and use the built Python 3 source for testing.
552
553
554Verify & Test
555-------------
556At this point you should (hopefully) have your project converted in such a way
557that it works in Python 3. Verify it by running your unit tests and making sure
558nothing has gone awry. If you miss something then figure out how to fix it in
559Python 3, backport to your Python 2 code, and run your code through 2to3 again
560to verify the fix transforms properly.
561
562
563.. _2to3: http://docs.python.org/py3k/library/2to3.html
564.. _Distribute: http://packages.python.org/distribute/
565
566
567.. _use_same_source:
568
569Python 2/3 Compatible Source
570============================
571While it may seem counter-intuitive, you can write Python code which is
572source-compatible between Python 2 & 3. It does lead to code that is not
573entirely idiomatic Python (e.g., having to extract the currently raised
574exception from ``sys.exc_info()[1]``), but it can be run under Python 2
575**and** Python 3 without using 2to3_ as a translation step. This allows you to
576continue to have a rapid development process regardless of whether you are
577developing under Python 2 or Python 3. Whether this approach or using
578:ref:`use_2to3` works best for you will be a per-project decision.
579
580To get a complete idea of what issues you will need to deal with, see the
581`What's New in Python 3.0`_. Others have reorganized the data in other formats
582such as http://docs.pythonsprints.com/python3_porting/py-porting.html .
583
584The following are some steps to take to try to support both Python 2 & 3 from
585the same source code.
586
587
588.. _What's New in Python 3.0: http://docs.python.org/release/3.0/whatsnew/3.0.html
589
590
591Follow The Steps for Using 2to3_ (sans 2to3)
592--------------------------------------------
593All of the steps outlined in how to
594:ref:`port Python 2 code with 2to3 <use_2to3>` apply
595to creating a Python 2/3 codebase. This includes trying only support Python 2.6
596or newer (the :mod:`__future__` statements work in Python 3 without issue),
597eliminating warnings that are triggered by ``-3``, etc.
598
Brett Cannon98135d02011-02-05 22:22:47 +0000599You should even consider running 2to3_ over your code (without committing the
600changes). This will let you know where potential pain points are within your
601code so that you can fix them properly before they become an issue.
Brett Cannon8045d972011-02-03 22:01:54 +0000602
603
604Use six_
605--------
606The six_ project contains many things to help you write portable Python code.
607You should make sure to read its documentation from beginning to end and use
608any and all features it provides. That way you will minimize any mistakes you
609might make in writing cross-version code.
610
611
612Capturing the Currently Raised Exception
613----------------------------------------
Brett Cannonce71ab22011-02-05 22:05:05 +0000614One change between Python 2 and 3 that will require changing how you code (if
615you support `Python 2.5`_ and earlier) is
616accessing the currently raised exception. In Python 2.5 and earlier the syntax
617to access the current exception is::
Brett Cannon8045d972011-02-03 22:01:54 +0000618
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000619 try:
620 raise Exception()
621 except Exception, exc:
622 # Current exception is 'exc'
623 pass
Brett Cannon8045d972011-02-03 22:01:54 +0000624
Brett Cannonce71ab22011-02-05 22:05:05 +0000625This syntax changed in Python 3 (and backported to `Python 2.6`_ and later)
626to::
Brett Cannon8045d972011-02-03 22:01:54 +0000627
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000628 try:
629 raise Exception()
630 except Exception as exc:
631 # Current exception is 'exc'
Brett Cannonce71ab22011-02-05 22:05:05 +0000632 # In Python 3, 'exc' is restricted to the block; Python 2.6 will "leak"
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000633 pass
Brett Cannon8045d972011-02-03 22:01:54 +0000634
635Because of this syntax change you must change to capturing the current
636exception to::
637
Brett Cannon4b0c24a2011-02-03 22:14:58 +0000638 try:
639 raise Exception()
640 except Exception:
641 import sys
642 exc = sys.exc_info()[1]
643 # Current exception is 'exc'
644 pass
Brett Cannon8045d972011-02-03 22:01:54 +0000645
646You can get more information about the raised exception from
647:func:`sys.exc_info` than simply the current exception instance, but you most
Antoine Pitroue6a14642011-02-05 12:01:07 +0000648likely don't need it.
Brett Cannon8045d972011-02-03 22:01:54 +0000649
Antoine Pitroue6a14642011-02-05 12:01:07 +0000650.. note::
651 In Python 3, the traceback is attached to the exception instance
Brett Cannonce71ab22011-02-05 22:05:05 +0000652 through the ``__traceback__`` attribute. If the instance is saved in
Antoine Pitroue6a14642011-02-05 12:01:07 +0000653 a local variable that persists outside of the ``except`` block, the
654 traceback will create a reference cycle with the current frame and its
655 dictionary of local variables. This will delay reclaiming dead
656 resources until the next cyclic :term:`garbage collection` pass.
657
658 In Python 2, this problem only occurs if you save the traceback itself
659 (e.g. the third element of the tuple returned by :func:`sys.exc_info`)
660 in a variable.
Brett Cannon8045d972011-02-03 22:01:54 +0000661
662Other Resources
663===============
664The authors of the following blogs posts and wiki pages deserve special thanks
665for making public their tips for porting Python 2 code to Python 3 (and thus
666helping provide information for this document):
667
668* http://docs.pythonsprints.com/python3_porting/py-porting.html
669* http://techspot.zzzeek.org/2011/01/24/zzzeek-s-guide-to-python-3-porting/
670* http://dabeaz.blogspot.com/2011/01/porting-py65-and-my-superboard-to.html
671* http://lucumr.pocoo.org/2011/1/22/forwards-compatible-python/
672* http://lucumr.pocoo.org/2010/2/11/porting-to-python-3-a-guide/
673* http://wiki.python.org/moin/PortingPythonToPy3k
674
675If you feel there is something missing from this document that should be added,
676please email the python-porting_ mailing list.
677
678.. _python-porting: http://mail.python.org/mailman/listinfo/python-porting