blob: 62b9bb5af530a22393273f5932e9f475813d3e81 [file] [log] [blame]
Georg Brandl8ec7f652007-08-15 14:28:01 +00001.. _tut-modules:
2
3*******
4Modules
5*******
6
7If you quit from the Python interpreter and enter it again, the definitions you
8have made (functions and variables) are lost. Therefore, if you want to write a
9somewhat longer program, you are better off using a text editor to prepare the
10input for the interpreter and running it with that file as input instead. This
11is known as creating a *script*. As your program gets longer, you may want to
12split it into several files for easier maintenance. You may also want to use a
13handy function that you've written in several programs without copying its
14definition into each program.
15
16To support this, Python has a way to put definitions in a file and use them in a
17script or in an interactive instance of the interpreter. Such a file is called a
18*module*; definitions from a module can be *imported* into other modules or into
19the *main* module (the collection of variables that you have access to in a
20script executed at the top level and in calculator mode).
21
22A module is a file containing Python definitions and statements. The file name
23is the module name with the suffix :file:`.py` appended. Within a module, the
24module's name (as a string) is available as the value of the global variable
25``__name__``. For instance, use your favorite text editor to create a file
26called :file:`fibo.py` in the current directory with the following contents::
27
28 # Fibonacci numbers module
29
30 def fib(n): # write Fibonacci series up to n
31 a, b = 0, 1
32 while b < n:
33 print b,
34 a, b = b, a+b
35
36 def fib2(n): # return Fibonacci series up to n
37 result = []
38 a, b = 0, 1
39 while b < n:
40 result.append(b)
41 a, b = b, a+b
42 return result
43
44Now enter the Python interpreter and import this module with the following
45command::
46
47 >>> import fibo
48
49This does not enter the names of the functions defined in ``fibo`` directly in
50the current symbol table; it only enters the module name ``fibo`` there. Using
51the module name you can access the functions::
52
53 >>> fibo.fib(1000)
54 1 1 2 3 5 8 13 21 34 55 89 144 233 377 610 987
55 >>> fibo.fib2(100)
56 [1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89]
57 >>> fibo.__name__
58 'fibo'
59
60If you intend to use a function often you can assign it to a local name::
61
62 >>> fib = fibo.fib
63 >>> fib(500)
64 1 1 2 3 5 8 13 21 34 55 89 144 233 377
65
66
67.. _tut-moremodules:
68
69More on Modules
70===============
71
72A module can contain executable statements as well as function definitions.
73These statements are intended to initialize the module. They are executed only
74the *first* time the module is imported somewhere. [#]_
75
76Each module has its own private symbol table, which is used as the global symbol
77table by all functions defined in the module. Thus, the author of a module can
78use global variables in the module without worrying about accidental clashes
79with a user's global variables. On the other hand, if you know what you are
80doing you can touch a module's global variables with the same notation used to
81refer to its functions, ``modname.itemname``.
82
83Modules can import other modules. It is customary but not required to place all
84:keyword:`import` statements at the beginning of a module (or script, for that
85matter). The imported module names are placed in the importing module's global
86symbol table.
87
88There is a variant of the :keyword:`import` statement that imports names from a
89module directly into the importing module's symbol table. For example::
90
91 >>> from fibo import fib, fib2
92 >>> fib(500)
93 1 1 2 3 5 8 13 21 34 55 89 144 233 377
94
95This does not introduce the module name from which the imports are taken in the
96local symbol table (so in the example, ``fibo`` is not defined).
97
98There is even a variant to import all names that a module defines::
99
100 >>> from fibo import *
101 >>> fib(500)
102 1 1 2 3 5 8 13 21 34 55 89 144 233 377
103
104This imports all names except those beginning with an underscore (``_``).
105
Michael Foordee2df032009-09-13 17:07:46 +0000106Note that in general the practice of importing ``*`` from a module or package is
107frowned upon, since it often causes poorly readable code. However, it is okay to
108use it to save typing in interactive sessions.
109
Georg Brandl68fd63b2008-05-09 06:39:58 +0000110.. note::
111
112 For efficiency reasons, each module is only imported once per interpreter
113 session. Therefore, if you change your modules, you must restart the
114 interpreter -- or, if it's just one module you want to test interactively,
Georg Brandl605a0c62008-05-11 07:02:17 +0000115 use :func:`reload`, e.g. ``reload(modulename)``.
Georg Brandl68fd63b2008-05-09 06:39:58 +0000116
Georg Brandl8ec7f652007-08-15 14:28:01 +0000117
118.. _tut-modulesasscripts:
119
120Executing modules as scripts
121----------------------------
122
123When you run a Python module with ::
124
125 python fibo.py <arguments>
126
127the code in the module will be executed, just as if you imported it, but with
128the ``__name__`` set to ``"__main__"``. That means that by adding this code at
129the end of your module::
130
131 if __name__ == "__main__":
132 import sys
133 fib(int(sys.argv[1]))
134
135you can make the file usable as a script as well as an importable module,
136because the code that parses the command line only runs if the module is
137executed as the "main" file::
138
139 $ python fibo.py 50
140 1 1 2 3 5 8 13 21 34
141
142If the module is imported, the code is not run::
143
144 >>> import fibo
145 >>>
146
147This is often used either to provide a convenient user interface to a module, or
148for testing purposes (running the module as a script executes a test suite).
149
150
151.. _tut-searchpath:
152
153The Module Search Path
154----------------------
155
156.. index:: triple: module; search; path
157
Sandro Tosid53abd32012-01-19 11:28:15 +0100158When a module named :mod:`spam` is imported, the interpreter first searches for
159a built-in module with that name. If not found, it then searches for a file
160named :file:`spam.py` in a list of directories given by the variable
161:data:`sys.path`. :data:`sys.path` is initialized from these locations:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000162
Sandro Tosid53abd32012-01-19 11:28:15 +0100163* the directory containing the input script (or the current directory).
164* :envvar:`PYTHONPATH` (a list of directory names, with the same syntax as the
165 shell variable :envvar:`PATH`).
166* the installation-dependent default.
167
168After initialization, Python programs can modify :data:`sys.path`. The
169directory containing the script being run is placed at the beginning of the
170search path, ahead of the standard library path. This means that scripts in that
171directory will be loaded instead of modules of the same name in the library
172directory. This is an error unless the replacement is intended. See section
173:ref:`tut-standardmodules` for more information.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000174
175
176"Compiled" Python files
177-----------------------
178
179As an important speed-up of the start-up time for short programs that use a lot
180of standard modules, if a file called :file:`spam.pyc` exists in the directory
181where :file:`spam.py` is found, this is assumed to contain an
182already-"byte-compiled" version of the module :mod:`spam`. The modification time
183of the version of :file:`spam.py` used to create :file:`spam.pyc` is recorded in
184:file:`spam.pyc`, and the :file:`.pyc` file is ignored if these don't match.
185
186Normally, you don't need to do anything to create the :file:`spam.pyc` file.
187Whenever :file:`spam.py` is successfully compiled, an attempt is made to write
188the compiled version to :file:`spam.pyc`. It is not an error if this attempt
189fails; if for any reason the file is not written completely, the resulting
190:file:`spam.pyc` file will be recognized as invalid and thus ignored later. The
191contents of the :file:`spam.pyc` file are platform independent, so a Python
192module directory can be shared by machines of different architectures.
193
194Some tips for experts:
195
196* When the Python interpreter is invoked with the :option:`-O` flag, optimized
197 code is generated and stored in :file:`.pyo` files. The optimizer currently
198 doesn't help much; it only removes :keyword:`assert` statements. When
Georg Brandl5e52db02007-10-21 10:45:46 +0000199 :option:`-O` is used, *all* :term:`bytecode` is optimized; ``.pyc`` files are
200 ignored and ``.py`` files are compiled to optimized bytecode.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000201
202* Passing two :option:`-O` flags to the Python interpreter (:option:`-OO`) will
203 cause the bytecode compiler to perform optimizations that could in some rare
204 cases result in malfunctioning programs. Currently only ``__doc__`` strings are
205 removed from the bytecode, resulting in more compact :file:`.pyo` files. Since
206 some programs may rely on having these available, you should only use this
207 option if you know what you're doing.
208
209* A program doesn't run any faster when it is read from a :file:`.pyc` or
210 :file:`.pyo` file than when it is read from a :file:`.py` file; the only thing
211 that's faster about :file:`.pyc` or :file:`.pyo` files is the speed with which
212 they are loaded.
213
214* When a script is run by giving its name on the command line, the bytecode for
215 the script is never written to a :file:`.pyc` or :file:`.pyo` file. Thus, the
216 startup time of a script may be reduced by moving most of its code to a module
217 and having a small bootstrap script that imports that module. It is also
218 possible to name a :file:`.pyc` or :file:`.pyo` file directly on the command
219 line.
220
221* It is possible to have a file called :file:`spam.pyc` (or :file:`spam.pyo`
222 when :option:`-O` is used) without a file :file:`spam.py` for the same module.
223 This can be used to distribute a library of Python code in a form that is
224 moderately hard to reverse engineer.
225
226 .. index:: module: compileall
227
228* The module :mod:`compileall` can create :file:`.pyc` files (or :file:`.pyo`
229 files when :option:`-O` is used) for all modules in a directory.
230
Georg Brandl8ec7f652007-08-15 14:28:01 +0000231
232.. _tut-standardmodules:
233
234Standard Modules
235================
236
237.. index:: module: sys
238
239Python comes with a library of standard modules, described in a separate
240document, the Python Library Reference ("Library Reference" hereafter). Some
241modules are built into the interpreter; these provide access to operations that
242are not part of the core of the language but are nevertheless built in, either
243for efficiency or to provide access to operating system primitives such as
244system calls. The set of such modules is a configuration option which also
245depends on the underlying platform For example, the :mod:`winreg` module is only
246provided on Windows systems. One particular module deserves some attention:
247:mod:`sys`, which is built into every Python interpreter. The variables
248``sys.ps1`` and ``sys.ps2`` define the strings used as primary and secondary
Georg Brandlb19be572007-12-29 10:57:00 +0000249prompts::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000250
251 >>> import sys
252 >>> sys.ps1
253 '>>> '
254 >>> sys.ps2
255 '... '
256 >>> sys.ps1 = 'C> '
257 C> print 'Yuck!'
258 Yuck!
259 C>
260
261
262These two variables are only defined if the interpreter is in interactive mode.
263
264The variable ``sys.path`` is a list of strings that determines the interpreter's
265search path for modules. It is initialized to a default path taken from the
266environment variable :envvar:`PYTHONPATH`, or from a built-in default if
267:envvar:`PYTHONPATH` is not set. You can modify it using standard list
268operations::
269
270 >>> import sys
271 >>> sys.path.append('/ufs/guido/lib/python')
272
273
274.. _tut-dir:
275
276The :func:`dir` Function
277========================
278
279The built-in function :func:`dir` is used to find out which names a module
280defines. It returns a sorted list of strings::
281
282 >>> import fibo, sys
283 >>> dir(fibo)
284 ['__name__', 'fib', 'fib2']
285 >>> dir(sys)
286 ['__displayhook__', '__doc__', '__excepthook__', '__name__', '__stderr__',
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000287 '__stdin__', '__stdout__', '_getframe', 'api_version', 'argv',
Georg Brandl8ec7f652007-08-15 14:28:01 +0000288 'builtin_module_names', 'byteorder', 'callstats', 'copyright',
289 'displayhook', 'exc_clear', 'exc_info', 'exc_type', 'excepthook',
290 'exec_prefix', 'executable', 'exit', 'getdefaultencoding', 'getdlopenflags',
291 'getrecursionlimit', 'getrefcount', 'hexversion', 'maxint', 'maxunicode',
292 'meta_path', 'modules', 'path', 'path_hooks', 'path_importer_cache',
293 'platform', 'prefix', 'ps1', 'ps2', 'setcheckinterval', 'setdlopenflags',
294 'setprofile', 'setrecursionlimit', 'settrace', 'stderr', 'stdin', 'stdout',
295 'version', 'version_info', 'warnoptions']
296
297Without arguments, :func:`dir` lists the names you have defined currently::
298
299 >>> a = [1, 2, 3, 4, 5]
300 >>> import fibo
301 >>> fib = fibo.fib
302 >>> dir()
303 ['__builtins__', '__doc__', '__file__', '__name__', 'a', 'fib', 'fibo', 'sys']
304
305Note that it lists all types of names: variables, modules, functions, etc.
306
307.. index:: module: __builtin__
308
309:func:`dir` does not list the names of built-in functions and variables. If you
310want a list of those, they are defined in the standard module
311:mod:`__builtin__`::
312
313 >>> import __builtin__
314 >>> dir(__builtin__)
315 ['ArithmeticError', 'AssertionError', 'AttributeError', 'DeprecationWarning',
316 'EOFError', 'Ellipsis', 'EnvironmentError', 'Exception', 'False',
317 'FloatingPointError', 'FutureWarning', 'IOError', 'ImportError',
318 'IndentationError', 'IndexError', 'KeyError', 'KeyboardInterrupt',
319 'LookupError', 'MemoryError', 'NameError', 'None', 'NotImplemented',
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000320 'NotImplementedError', 'OSError', 'OverflowError',
Georg Brandl8ec7f652007-08-15 14:28:01 +0000321 'PendingDeprecationWarning', 'ReferenceError', 'RuntimeError',
322 'RuntimeWarning', 'StandardError', 'StopIteration', 'SyntaxError',
323 'SyntaxWarning', 'SystemError', 'SystemExit', 'TabError', 'True',
324 'TypeError', 'UnboundLocalError', 'UnicodeDecodeError',
325 'UnicodeEncodeError', 'UnicodeError', 'UnicodeTranslateError',
326 'UserWarning', 'ValueError', 'Warning', 'WindowsError',
327 'ZeroDivisionError', '_', '__debug__', '__doc__', '__import__',
328 '__name__', 'abs', 'apply', 'basestring', 'bool', 'buffer',
329 'callable', 'chr', 'classmethod', 'cmp', 'coerce', 'compile',
330 'complex', 'copyright', 'credits', 'delattr', 'dict', 'dir', 'divmod',
331 'enumerate', 'eval', 'execfile', 'exit', 'file', 'filter', 'float',
332 'frozenset', 'getattr', 'globals', 'hasattr', 'hash', 'help', 'hex',
333 'id', 'input', 'int', 'intern', 'isinstance', 'issubclass', 'iter',
Antoine Pitrou789be0c2009-04-02 21:18:34 +0000334 'len', 'license', 'list', 'locals', 'long', 'map', 'max', 'memoryview',
335 'min', 'object', 'oct', 'open', 'ord', 'pow', 'property', 'quit', 'range',
Georg Brandl8ec7f652007-08-15 14:28:01 +0000336 'raw_input', 'reduce', 'reload', 'repr', 'reversed', 'round', 'set',
337 'setattr', 'slice', 'sorted', 'staticmethod', 'str', 'sum', 'super',
338 'tuple', 'type', 'unichr', 'unicode', 'vars', 'xrange', 'zip']
339
340
341.. _tut-packages:
342
343Packages
344========
345
346Packages are a way of structuring Python's module namespace by using "dotted
347module names". For example, the module name :mod:`A.B` designates a submodule
348named ``B`` in a package named ``A``. Just like the use of modules saves the
349authors of different modules from having to worry about each other's global
350variable names, the use of dotted module names saves the authors of multi-module
351packages like NumPy or the Python Imaging Library from having to worry about
352each other's module names.
353
354Suppose you want to design a collection of modules (a "package") for the uniform
355handling of sound files and sound data. There are many different sound file
356formats (usually recognized by their extension, for example: :file:`.wav`,
357:file:`.aiff`, :file:`.au`), so you may need to create and maintain a growing
358collection of modules for the conversion between the various file formats.
359There are also many different operations you might want to perform on sound data
360(such as mixing, adding echo, applying an equalizer function, creating an
361artificial stereo effect), so in addition you will be writing a never-ending
362stream of modules to perform these operations. Here's a possible structure for
363your package (expressed in terms of a hierarchical filesystem)::
364
365 sound/ Top-level package
366 __init__.py Initialize the sound package
367 formats/ Subpackage for file format conversions
368 __init__.py
369 wavread.py
370 wavwrite.py
371 aiffread.py
372 aiffwrite.py
373 auread.py
374 auwrite.py
375 ...
376 effects/ Subpackage for sound effects
377 __init__.py
378 echo.py
379 surround.py
380 reverse.py
381 ...
382 filters/ Subpackage for filters
383 __init__.py
384 equalizer.py
385 vocoder.py
386 karaoke.py
387 ...
388
389When importing the package, Python searches through the directories on
390``sys.path`` looking for the package subdirectory.
391
392The :file:`__init__.py` files are required to make Python treat the directories
393as containing packages; this is done to prevent directories with a common name,
394such as ``string``, from unintentionally hiding valid modules that occur later
395on the module search path. In the simplest case, :file:`__init__.py` can just be
396an empty file, but it can also execute initialization code for the package or
397set the ``__all__`` variable, described later.
398
399Users of the package can import individual modules from the package, for
400example::
401
402 import sound.effects.echo
403
404This loads the submodule :mod:`sound.effects.echo`. It must be referenced with
405its full name. ::
406
407 sound.effects.echo.echofilter(input, output, delay=0.7, atten=4)
408
409An alternative way of importing the submodule is::
410
411 from sound.effects import echo
412
413This also loads the submodule :mod:`echo`, and makes it available without its
414package prefix, so it can be used as follows::
415
416 echo.echofilter(input, output, delay=0.7, atten=4)
417
418Yet another variation is to import the desired function or variable directly::
419
420 from sound.effects.echo import echofilter
421
422Again, this loads the submodule :mod:`echo`, but this makes its function
423:func:`echofilter` directly available::
424
425 echofilter(input, output, delay=0.7, atten=4)
426
427Note that when using ``from package import item``, the item can be either a
428submodule (or subpackage) of the package, or some other name defined in the
429package, like a function, class or variable. The ``import`` statement first
430tests whether the item is defined in the package; if not, it assumes it is a
431module and attempts to load it. If it fails to find it, an :exc:`ImportError`
432exception is raised.
433
434Contrarily, when using syntax like ``import item.subitem.subsubitem``, each item
435except for the last must be a package; the last item can be a module or a
436package but can't be a class or function or variable defined in the previous
437item.
438
439
440.. _tut-pkg-import-star:
441
442Importing \* From a Package
443---------------------------
444
445.. index:: single: __all__
446
447Now what happens when the user writes ``from sound.effects import *``? Ideally,
448one would hope that this somehow goes out to the filesystem, finds which
Michael Foordee2df032009-09-13 17:07:46 +0000449submodules are present in the package, and imports them all. This could take a
450long time and importing sub-modules might have unwanted side-effects that should
451only happen when the sub-module is explicitly imported.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000452
Georg Brandl8ec7f652007-08-15 14:28:01 +0000453The only solution is for the package author to provide an explicit index of the
Georg Brandlffefd5a2009-07-29 17:07:21 +0000454package. The :keyword:`import` statement uses the following convention: if a package's
Georg Brandl8ec7f652007-08-15 14:28:01 +0000455:file:`__init__.py` code defines a list named ``__all__``, it is taken to be the
456list of module names that should be imported when ``from package import *`` is
457encountered. It is up to the package author to keep this list up-to-date when a
458new version of the package is released. Package authors may also decide not to
459support it, if they don't see a use for importing \* from their package. For
460example, the file :file:`sounds/effects/__init__.py` could contain the following
461code::
462
463 __all__ = ["echo", "surround", "reverse"]
464
465This would mean that ``from sound.effects import *`` would import the three
466named submodules of the :mod:`sound` package.
467
468If ``__all__`` is not defined, the statement ``from sound.effects import *``
469does *not* import all submodules from the package :mod:`sound.effects` into the
470current namespace; it only ensures that the package :mod:`sound.effects` has
471been imported (possibly running any initialization code in :file:`__init__.py`)
472and then imports whatever names are defined in the package. This includes any
473names defined (and submodules explicitly loaded) by :file:`__init__.py`. It
474also includes any submodules of the package that were explicitly loaded by
Georg Brandlffefd5a2009-07-29 17:07:21 +0000475previous :keyword:`import` statements. Consider this code::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000476
477 import sound.effects.echo
478 import sound.effects.surround
479 from sound.effects import *
480
Georg Brandlffefd5a2009-07-29 17:07:21 +0000481In this example, the :mod:`echo` and :mod:`surround` modules are imported in the
482current namespace because they are defined in the :mod:`sound.effects` package
483when the ``from...import`` statement is executed. (This also works when
484``__all__`` is defined.)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000485
Michael Foordee2df032009-09-13 17:07:46 +0000486Although certain modules are designed to export only names that follow certain
487patterns when you use ``import *``, it is still considered bad practise in
488production code.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000489
490Remember, there is nothing wrong with using ``from Package import
491specific_submodule``! In fact, this is the recommended notation unless the
492importing module needs to use submodules with the same name from different
493packages.
494
495
496Intra-package References
497------------------------
498
499The submodules often need to refer to each other. For example, the
500:mod:`surround` module might use the :mod:`echo` module. In fact, such
501references are so common that the :keyword:`import` statement first looks in the
502containing package before looking in the standard module search path. Thus, the
503:mod:`surround` module can simply use ``import echo`` or ``from echo import
504echofilter``. If the imported module is not found in the current package (the
505package of which the current module is a submodule), the :keyword:`import`
506statement looks for a top-level module with the given name.
507
508When packages are structured into subpackages (as with the :mod:`sound` package
509in the example), you can use absolute imports to refer to submodules of siblings
510packages. For example, if the module :mod:`sound.filters.vocoder` needs to use
511the :mod:`echo` module in the :mod:`sound.effects` package, it can use ``from
512sound.effects import echo``.
513
514Starting with Python 2.5, in addition to the implicit relative imports described
515above, you can write explicit relative imports with the ``from module import
516name`` form of import statement. These explicit relative imports use leading
517dots to indicate the current and parent packages involved in the relative
518import. From the :mod:`surround` module for example, you might use::
519
520 from . import echo
521 from .. import formats
522 from ..filters import equalizer
523
524Note that both explicit and implicit relative imports are based on the name of
525the current module. Since the name of the main module is always ``"__main__"``,
526modules intended for use as the main module of a Python application should
527always use absolute imports.
528
529
530Packages in Multiple Directories
531--------------------------------
532
533Packages support one more special attribute, :attr:`__path__`. This is
534initialized to be a list containing the name of the directory holding the
535package's :file:`__init__.py` before the code in that file is executed. This
536variable can be modified; doing so affects future searches for modules and
537subpackages contained in the package.
538
539While this feature is not often needed, it can be used to extend the set of
540modules found in a package.
541
542
543.. rubric:: Footnotes
544
545.. [#] In fact function definitions are also 'statements' that are 'executed'; the
Georg Brandlffefd5a2009-07-29 17:07:21 +0000546 execution of a module-level function enters the function name in the module's
547 global symbol table.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000548