blob: c0bb03edf8b983780a20fd616f8f5768c78fc340 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001************************************
Georg Brandl48310cd2009-01-03 21:18:54 +00002 Idioms and Anti-Idioms in Python
Georg Brandl116aa622007-08-15 14:28:22 +00003************************************
4
5:Author: Moshe Zadka
6
Christian Heimesdae2a892008-04-19 00:55:37 +00007This document is placed in the public domain.
Georg Brandl116aa622007-08-15 14:28:22 +00008
9
10.. topic:: Abstract
11
12 This document can be considered a companion to the tutorial. It shows how to use
13 Python, and even more importantly, how *not* to use Python.
14
15
16Language Constructs You Should Not Use
17======================================
18
19While Python has relatively few gotchas compared to other languages, it still
20has some constructs which are only useful in corner cases, or are plain
21dangerous.
22
23
24from module import \*
25---------------------
26
27
28Inside Function Definitions
29^^^^^^^^^^^^^^^^^^^^^^^^^^^
30
31``from module import *`` is *invalid* inside function definitions. While many
32versions of Python do not check for the invalidity, it does not make it more
Georg Brandlfe5f4092009-05-22 10:44:31 +000033valid, no more than having a smart lawyer makes a man innocent. Do not use it
Georg Brandl116aa622007-08-15 14:28:22 +000034like that ever. Even in versions where it was accepted, it made the function
35execution slower, because the compiler could not be certain which names are
36local and which are global. In Python 2.1 this construct causes warnings, and
37sometimes even errors.
38
39
40At Module Level
41^^^^^^^^^^^^^^^
42
43While it is valid to use ``from module import *`` at module level it is usually
44a bad idea. For one, this loses an important property Python otherwise has ---
45you can know where each toplevel name is defined by a simple "search" function
46in your favourite editor. You also open yourself to trouble in the future, if
47some module grows additional functions or classes.
48
49One of the most awful question asked on the newsgroup is why this code::
50
51 f = open("www")
52 f.read()
53
54does not work. Of course, it works just fine (assuming you have a file called
Georg Brandlc4a55fc2010-02-06 18:46:57 +000055"www".) But it does not work if somewhere in the module, the statement ``from
56os import *`` is present. The :mod:`os` module has a function called
57:func:`open` which returns an integer. While it is very useful, shadowing a
58builtin is one of its least useful properties.
Georg Brandl116aa622007-08-15 14:28:22 +000059
60Remember, you can never know for sure what names a module exports, so either
61take what you need --- ``from module import name1, name2``, or keep them in the
Georg Brandl6911e3c2007-09-04 07:15:32 +000062module and access on a per-need basis --- ``import module; print(module.name)``.
Georg Brandl116aa622007-08-15 14:28:22 +000063
64
65When It Is Just Fine
66^^^^^^^^^^^^^^^^^^^^
67
68There are situations in which ``from module import *`` is just fine:
69
70* The interactive prompt. For example, ``from math import *`` makes Python an
71 amazing scientific calculator.
72
73* When extending a module in C with a module in Python.
74
75* When the module advertises itself as ``from import *`` safe.
76
77
Georg Brandl116aa622007-08-15 14:28:22 +000078from module import name1, name2
79-------------------------------
80
Georg Brandlfe5f4092009-05-22 10:44:31 +000081This is a "don't" which is much weaker than the previous "don't"s but is still
Georg Brandl116aa622007-08-15 14:28:22 +000082something you should not do if you don't have good reasons to do that. The
83reason it is usually bad idea is because you suddenly have an object which lives
Christian Heimesc3f30c42008-02-22 16:37:40 +000084in two separate namespaces. When the binding in one namespace changes, the
Georg Brandl116aa622007-08-15 14:28:22 +000085binding in the other will not, so there will be a discrepancy between them. This
86happens when, for example, one module is reloaded, or changes the definition of
87a function at runtime.
88
89Bad example::
90
91 # foo.py
92 a = 1
93
94 # bar.py
95 from foo import a
96 if something():
Georg Brandl48310cd2009-01-03 21:18:54 +000097 a = 2 # danger: foo.a != a
Georg Brandl116aa622007-08-15 14:28:22 +000098
99Good example::
100
101 # foo.py
102 a = 1
103
104 # bar.py
105 import foo
106 if something():
107 foo.a = 2
108
109
110except:
111-------
112
113Python has the ``except:`` clause, which catches all exceptions. Since *every*
R. David Murray44ef7742010-09-11 18:12:25 +0000114error in Python raises an exception, using ``except:`` can make many
115programming errors look like runtime problems, which hinders the debugging
116process.
Georg Brandl116aa622007-08-15 14:28:22 +0000117
R. David Murray44ef7742010-09-11 18:12:25 +0000118The following code shows a great example of why this is bad::
Georg Brandl116aa622007-08-15 14:28:22 +0000119
120 try:
121 foo = opne("file") # misspelled "open"
122 except:
123 sys.exit("could not open file!")
124
R. David Murray44ef7742010-09-11 18:12:25 +0000125The second line triggers a :exc:`NameError`, which is caught by the except
126clause. The program will exit, and the error message the program prints will
127make you think the problem is the readability of ``"file"`` when in fact
128the real error has nothing to do with ``"file"``.
Georg Brandl116aa622007-08-15 14:28:22 +0000129
R. David Murray44ef7742010-09-11 18:12:25 +0000130A better way to write the above is ::
Georg Brandl116aa622007-08-15 14:28:22 +0000131
132 try:
R. David Murray44ef7742010-09-11 18:12:25 +0000133 foo = opne("file")
Georg Brandl116aa622007-08-15 14:28:22 +0000134 except IOError:
135 sys.exit("could not open file")
136
R. David Murray44ef7742010-09-11 18:12:25 +0000137When this is run, Python will produce a traceback showing the :exc:`NameError`,
138and it will be immediately apparent what needs to be fixed.
139
140.. index:: bare except, except; bare
141
142Because ``except:`` catches *all* exceptions, including :exc:`SystemExit`,
143:exc:`KeyboardInterrupt`, and :exc:`GeneratorExit` (which is not an error and
144should not normally be caught by user code), using a bare ``except:`` is almost
145never a good idea. In situations where you need to catch all "normal" errors,
146such as in a framework that runs callbacks, you can catch the base class for
147all normal exceptions, :exc:`Exception`.
Georg Brandl116aa622007-08-15 14:28:22 +0000148
149
150Exceptions
151==========
152
153Exceptions are a useful feature of Python. You should learn to raise them
154whenever something unexpected occurs, and catch them only where you can do
155something about them.
156
157The following is a very popular anti-idiom ::
158
159 def get_status(file):
160 if not os.path.exists(file):
Georg Brandl6911e3c2007-09-04 07:15:32 +0000161 print("file not found")
Georg Brandl116aa622007-08-15 14:28:22 +0000162 sys.exit(1)
163 return open(file).readline()
164
R. David Murray44ef7742010-09-11 18:12:25 +0000165Consider the case where the file gets deleted between the time the call to
166:func:`os.path.exists` is made and the time :func:`open` is called. In that
167case the last line will raise an :exc:`IOError`. The same thing would happen
168if *file* exists but has no read permission. Since testing this on a normal
169machine on existent and non-existent files makes it seem bugless, the test
170results will seem fine, and the code will get shipped. Later an unhandled
171:exc:`IOError` (or perhaps some other :exc:`EnvironmentError`) escapes to the
172user, who gets to watch the ugly traceback.
Georg Brandl116aa622007-08-15 14:28:22 +0000173
R. David Murray44ef7742010-09-11 18:12:25 +0000174Here is a somewhat better way to do it. ::
Georg Brandl116aa622007-08-15 14:28:22 +0000175
176 def get_status(file):
177 try:
178 return open(file).readline()
R. David Murray44ef7742010-09-11 18:12:25 +0000179 except EnvironmentError as err:
180 print("Unable to open file: {}".format(err))
Georg Brandl116aa622007-08-15 14:28:22 +0000181 sys.exit(1)
182
R. David Murray44ef7742010-09-11 18:12:25 +0000183In this version, *either* the file gets opened and the line is read (so it
184works even on flaky NFS or SMB connections), or an error message is printed
185that provides all the available information on why the open failed, and the
186application is aborted.
Georg Brandl116aa622007-08-15 14:28:22 +0000187
R. David Murray44ef7742010-09-11 18:12:25 +0000188However, even this version of :func:`get_status` makes too many assumptions ---
189that it will only be used in a short running script, and not, say, in a long
190running server. Sure, the caller could do something like ::
Georg Brandl116aa622007-08-15 14:28:22 +0000191
192 try:
193 status = get_status(log)
194 except SystemExit:
195 status = None
196
R. David Murray44ef7742010-09-11 18:12:25 +0000197But there is a better way. You should try to use as few ``except`` clauses in
198your code as you can --- the ones you do use will usually be inside calls which
199should always succeed, or a catch-all in a main function.
Georg Brandl116aa622007-08-15 14:28:22 +0000200
R. David Murray44ef7742010-09-11 18:12:25 +0000201So, an even better version of :func:`get_status()` is probably ::
Georg Brandl116aa622007-08-15 14:28:22 +0000202
203 def get_status(file):
204 return open(file).readline()
205
R. David Murray44ef7742010-09-11 18:12:25 +0000206The caller can deal with the exception if it wants (for example, if it tries
Georg Brandl116aa622007-08-15 14:28:22 +0000207several files in a loop), or just let the exception filter upwards to *its*
208caller.
209
R. David Murray44ef7742010-09-11 18:12:25 +0000210But the last version still has a serious problem --- due to implementation
211details in CPython, the file would not be closed when an exception is raised
212until the exception handler finishes; and, worse, in other implementations
213(e.g., Jython) it might not be closed at all regardless of whether or not
214an exception is raised.
215
216The best version of this function uses the ``open()`` call as a context
217manager, which will ensure that the file gets closed as soon as the
218function returns::
Georg Brandl116aa622007-08-15 14:28:22 +0000219
220 def get_status(file):
Georg Brandl386bc6d2010-04-25 10:19:53 +0000221 with open(file) as fp:
Georg Brandl116aa622007-08-15 14:28:22 +0000222 return fp.readline()
Georg Brandl116aa622007-08-15 14:28:22 +0000223
224
225Using the Batteries
226===================
227
228Every so often, people seem to be writing stuff in the Python library again,
229usually poorly. While the occasional module has a poor interface, it is usually
230much better to use the rich standard library and data types that come with
Georg Brandlfe5f4092009-05-22 10:44:31 +0000231Python than inventing your own.
Georg Brandl116aa622007-08-15 14:28:22 +0000232
233A useful module very few people know about is :mod:`os.path`. It always has the
234correct path arithmetic for your operating system, and will usually be much
Georg Brandlfe5f4092009-05-22 10:44:31 +0000235better than whatever you come up with yourself.
Georg Brandl116aa622007-08-15 14:28:22 +0000236
237Compare::
238
239 # ugh!
240 return dir+"/"+file
241 # better
242 return os.path.join(dir, file)
243
244More useful functions in :mod:`os.path`: :func:`basename`, :func:`dirname` and
245:func:`splitext`.
246
Georg Brandl22b34312009-07-26 14:54:51 +0000247There are also many useful built-in functions people seem not to be aware of for
Georg Brandl116aa622007-08-15 14:28:22 +0000248some reason: :func:`min` and :func:`max` can find the minimum/maximum of any
249sequence with comparable semantics, for example, yet many people write their own
Georg Brandl6911e3c2007-09-04 07:15:32 +0000250:func:`max`/:func:`min`. Another highly useful function is
251:func:`functools.reduce`. A classical use of :func:`reduce` is something like
252::
Georg Brandl116aa622007-08-15 14:28:22 +0000253
Georg Brandl6911e3c2007-09-04 07:15:32 +0000254 import sys, operator, functools
Benjamin Peterson09310422008-09-23 13:44:44 +0000255 nums = list(map(float, sys.argv[1:]))
Georg Brandl6911e3c2007-09-04 07:15:32 +0000256 print(functools.reduce(operator.add, nums) / len(nums))
Georg Brandl116aa622007-08-15 14:28:22 +0000257
258This cute little script prints the average of all numbers given on the command
259line. The :func:`reduce` adds up all the numbers, and the rest is just some
260pre- and postprocessing.
261
Georg Brandl5c106642007-11-29 17:41:05 +0000262On the same note, note that :func:`float` and :func:`int` accept arguments of
263type string, and so are suited to parsing --- assuming you are ready to deal
264with the :exc:`ValueError` they raise.
Georg Brandl116aa622007-08-15 14:28:22 +0000265
266
267Using Backslash to Continue Statements
268======================================
269
270Since Python treats a newline as a statement terminator, and since statements
Georg Brandlfe5f4092009-05-22 10:44:31 +0000271are often more than is comfortable to put in one line, many people do::
Georg Brandl116aa622007-08-15 14:28:22 +0000272
273 if foo.bar()['first'][0] == baz.quux(1, 2)[5:9] and \
274 calculate_number(10, 20) != forbulate(500, 360):
275 pass
276
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000277You should realize that this is dangerous: a stray space after the ``\`` would
Georg Brandl116aa622007-08-15 14:28:22 +0000278make this line wrong, and stray spaces are notoriously hard to see in editors.
279In this case, at least it would be a syntax error, but if the code was::
280
281 value = foo.bar()['first'][0]*baz.quux(1, 2)[5:9] \
282 + calculate_number(10, 20)*forbulate(500, 360)
283
284then it would just be subtly wrong.
285
286It is usually much better to use the implicit continuation inside parenthesis:
287
288This version is bulletproof::
289
Georg Brandl48310cd2009-01-03 21:18:54 +0000290 value = (foo.bar()['first'][0]*baz.quux(1, 2)[5:9]
Georg Brandl116aa622007-08-15 14:28:22 +0000291 + calculate_number(10, 20)*forbulate(500, 360))
292