blob: 6f51357a66b496cdbee015bed5e0df1b6d193519 [file] [log] [blame]
Georg Brandl8ec7f652007-08-15 14:28:01 +00001.. _tut-modules:
2
3*******
4Modules
5*******
6
7If you quit from the Python interpreter and enter it again, the definitions you
8have made (functions and variables) are lost. Therefore, if you want to write a
9somewhat longer program, you are better off using a text editor to prepare the
10input for the interpreter and running it with that file as input instead. This
11is known as creating a *script*. As your program gets longer, you may want to
12split it into several files for easier maintenance. You may also want to use a
13handy function that you've written in several programs without copying its
14definition into each program.
15
16To support this, Python has a way to put definitions in a file and use them in a
17script or in an interactive instance of the interpreter. Such a file is called a
18*module*; definitions from a module can be *imported* into other modules or into
19the *main* module (the collection of variables that you have access to in a
20script executed at the top level and in calculator mode).
21
22A module is a file containing Python definitions and statements. The file name
23is the module name with the suffix :file:`.py` appended. Within a module, the
24module's name (as a string) is available as the value of the global variable
25``__name__``. For instance, use your favorite text editor to create a file
26called :file:`fibo.py` in the current directory with the following contents::
27
28 # Fibonacci numbers module
29
30 def fib(n): # write Fibonacci series up to n
31 a, b = 0, 1
32 while b < n:
33 print b,
34 a, b = b, a+b
35
36 def fib2(n): # return Fibonacci series up to n
37 result = []
38 a, b = 0, 1
39 while b < n:
40 result.append(b)
41 a, b = b, a+b
42 return result
43
44Now enter the Python interpreter and import this module with the following
45command::
46
47 >>> import fibo
48
49This does not enter the names of the functions defined in ``fibo`` directly in
50the current symbol table; it only enters the module name ``fibo`` there. Using
51the module name you can access the functions::
52
53 >>> fibo.fib(1000)
54 1 1 2 3 5 8 13 21 34 55 89 144 233 377 610 987
55 >>> fibo.fib2(100)
56 [1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89]
57 >>> fibo.__name__
58 'fibo'
59
60If you intend to use a function often you can assign it to a local name::
61
62 >>> fib = fibo.fib
63 >>> fib(500)
64 1 1 2 3 5 8 13 21 34 55 89 144 233 377
65
66
67.. _tut-moremodules:
68
69More on Modules
70===============
71
72A module can contain executable statements as well as function definitions.
73These statements are intended to initialize the module. They are executed only
74the *first* time the module is imported somewhere. [#]_
75
76Each module has its own private symbol table, which is used as the global symbol
77table by all functions defined in the module. Thus, the author of a module can
78use global variables in the module without worrying about accidental clashes
79with a user's global variables. On the other hand, if you know what you are
80doing you can touch a module's global variables with the same notation used to
81refer to its functions, ``modname.itemname``.
82
83Modules can import other modules. It is customary but not required to place all
84:keyword:`import` statements at the beginning of a module (or script, for that
85matter). The imported module names are placed in the importing module's global
86symbol table.
87
88There is a variant of the :keyword:`import` statement that imports names from a
89module directly into the importing module's symbol table. For example::
90
91 >>> from fibo import fib, fib2
92 >>> fib(500)
93 1 1 2 3 5 8 13 21 34 55 89 144 233 377
94
95This does not introduce the module name from which the imports are taken in the
96local symbol table (so in the example, ``fibo`` is not defined).
97
98There is even a variant to import all names that a module defines::
99
100 >>> from fibo import *
101 >>> fib(500)
102 1 1 2 3 5 8 13 21 34 55 89 144 233 377
103
104This imports all names except those beginning with an underscore (``_``).
105
Michael Foordee2df032009-09-13 17:07:46 +0000106Note that in general the practice of importing ``*`` from a module or package is
107frowned upon, since it often causes poorly readable code. However, it is okay to
108use it to save typing in interactive sessions.
109
Georg Brandl68fd63b2008-05-09 06:39:58 +0000110.. note::
111
112 For efficiency reasons, each module is only imported once per interpreter
113 session. Therefore, if you change your modules, you must restart the
114 interpreter -- or, if it's just one module you want to test interactively,
Georg Brandl605a0c62008-05-11 07:02:17 +0000115 use :func:`reload`, e.g. ``reload(modulename)``.
Georg Brandl68fd63b2008-05-09 06:39:58 +0000116
Georg Brandl8ec7f652007-08-15 14:28:01 +0000117
118.. _tut-modulesasscripts:
119
120Executing modules as scripts
121----------------------------
122
123When you run a Python module with ::
124
125 python fibo.py <arguments>
126
127the code in the module will be executed, just as if you imported it, but with
128the ``__name__`` set to ``"__main__"``. That means that by adding this code at
129the end of your module::
130
131 if __name__ == "__main__":
132 import sys
133 fib(int(sys.argv[1]))
134
135you can make the file usable as a script as well as an importable module,
136because the code that parses the command line only runs if the module is
137executed as the "main" file::
138
139 $ python fibo.py 50
140 1 1 2 3 5 8 13 21 34
141
142If the module is imported, the code is not run::
143
144 >>> import fibo
145 >>>
146
147This is often used either to provide a convenient user interface to a module, or
148for testing purposes (running the module as a script executes a test suite).
149
150
151.. _tut-searchpath:
152
153The Module Search Path
154----------------------
155
156.. index:: triple: module; search; path
157
158When a module named :mod:`spam` is imported, the interpreter searches for a file
159named :file:`spam.py` in the current directory, and then in the list of
160directories specified by the environment variable :envvar:`PYTHONPATH`. This
161has the same syntax as the shell variable :envvar:`PATH`, that is, a list of
162directory names. When :envvar:`PYTHONPATH` is not set, or when the file is not
163found there, the search continues in an installation-dependent default path; on
164Unix, this is usually :file:`.:/usr/local/lib/python`.
165
166Actually, modules are searched in the list of directories given by the variable
167``sys.path`` which is initialized from the directory containing the input script
168(or the current directory), :envvar:`PYTHONPATH` and the installation- dependent
169default. This allows Python programs that know what they're doing to modify or
170replace the module search path. Note that because the directory containing the
171script being run is on the search path, it is important that the script not have
172the same name as a standard module, or Python will attempt to load the script as
173a module when that module is imported. This will generally be an error. See
174section :ref:`tut-standardmodules` for more information.
175
176
177"Compiled" Python files
178-----------------------
179
180As an important speed-up of the start-up time for short programs that use a lot
181of standard modules, if a file called :file:`spam.pyc` exists in the directory
182where :file:`spam.py` is found, this is assumed to contain an
183already-"byte-compiled" version of the module :mod:`spam`. The modification time
184of the version of :file:`spam.py` used to create :file:`spam.pyc` is recorded in
185:file:`spam.pyc`, and the :file:`.pyc` file is ignored if these don't match.
186
187Normally, you don't need to do anything to create the :file:`spam.pyc` file.
188Whenever :file:`spam.py` is successfully compiled, an attempt is made to write
189the compiled version to :file:`spam.pyc`. It is not an error if this attempt
190fails; if for any reason the file is not written completely, the resulting
191:file:`spam.pyc` file will be recognized as invalid and thus ignored later. The
192contents of the :file:`spam.pyc` file are platform independent, so a Python
193module directory can be shared by machines of different architectures.
194
195Some tips for experts:
196
197* When the Python interpreter is invoked with the :option:`-O` flag, optimized
198 code is generated and stored in :file:`.pyo` files. The optimizer currently
199 doesn't help much; it only removes :keyword:`assert` statements. When
Georg Brandl5e52db02007-10-21 10:45:46 +0000200 :option:`-O` is used, *all* :term:`bytecode` is optimized; ``.pyc`` files are
201 ignored and ``.py`` files are compiled to optimized bytecode.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000202
203* Passing two :option:`-O` flags to the Python interpreter (:option:`-OO`) will
204 cause the bytecode compiler to perform optimizations that could in some rare
205 cases result in malfunctioning programs. Currently only ``__doc__`` strings are
206 removed from the bytecode, resulting in more compact :file:`.pyo` files. Since
207 some programs may rely on having these available, you should only use this
208 option if you know what you're doing.
209
210* A program doesn't run any faster when it is read from a :file:`.pyc` or
211 :file:`.pyo` file than when it is read from a :file:`.py` file; the only thing
212 that's faster about :file:`.pyc` or :file:`.pyo` files is the speed with which
213 they are loaded.
214
215* When a script is run by giving its name on the command line, the bytecode for
216 the script is never written to a :file:`.pyc` or :file:`.pyo` file. Thus, the
217 startup time of a script may be reduced by moving most of its code to a module
218 and having a small bootstrap script that imports that module. It is also
219 possible to name a :file:`.pyc` or :file:`.pyo` file directly on the command
220 line.
221
222* It is possible to have a file called :file:`spam.pyc` (or :file:`spam.pyo`
223 when :option:`-O` is used) without a file :file:`spam.py` for the same module.
224 This can be used to distribute a library of Python code in a form that is
225 moderately hard to reverse engineer.
226
227 .. index:: module: compileall
228
229* The module :mod:`compileall` can create :file:`.pyc` files (or :file:`.pyo`
230 files when :option:`-O` is used) for all modules in a directory.
231
Georg Brandl8ec7f652007-08-15 14:28:01 +0000232
233.. _tut-standardmodules:
234
235Standard Modules
236================
237
238.. index:: module: sys
239
240Python comes with a library of standard modules, described in a separate
241document, the Python Library Reference ("Library Reference" hereafter). Some
242modules are built into the interpreter; these provide access to operations that
243are not part of the core of the language but are nevertheless built in, either
244for efficiency or to provide access to operating system primitives such as
245system calls. The set of such modules is a configuration option which also
246depends on the underlying platform For example, the :mod:`winreg` module is only
247provided on Windows systems. One particular module deserves some attention:
248:mod:`sys`, which is built into every Python interpreter. The variables
249``sys.ps1`` and ``sys.ps2`` define the strings used as primary and secondary
Georg Brandlb19be572007-12-29 10:57:00 +0000250prompts::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000251
252 >>> import sys
253 >>> sys.ps1
254 '>>> '
255 >>> sys.ps2
256 '... '
257 >>> sys.ps1 = 'C> '
258 C> print 'Yuck!'
259 Yuck!
260 C>
261
262
263These two variables are only defined if the interpreter is in interactive mode.
264
265The variable ``sys.path`` is a list of strings that determines the interpreter's
266search path for modules. It is initialized to a default path taken from the
267environment variable :envvar:`PYTHONPATH`, or from a built-in default if
268:envvar:`PYTHONPATH` is not set. You can modify it using standard list
269operations::
270
271 >>> import sys
272 >>> sys.path.append('/ufs/guido/lib/python')
273
274
275.. _tut-dir:
276
277The :func:`dir` Function
278========================
279
280The built-in function :func:`dir` is used to find out which names a module
281defines. It returns a sorted list of strings::
282
283 >>> import fibo, sys
284 >>> dir(fibo)
285 ['__name__', 'fib', 'fib2']
286 >>> dir(sys)
287 ['__displayhook__', '__doc__', '__excepthook__', '__name__', '__stderr__',
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000288 '__stdin__', '__stdout__', '_getframe', 'api_version', 'argv',
Georg Brandl8ec7f652007-08-15 14:28:01 +0000289 'builtin_module_names', 'byteorder', 'callstats', 'copyright',
290 'displayhook', 'exc_clear', 'exc_info', 'exc_type', 'excepthook',
291 'exec_prefix', 'executable', 'exit', 'getdefaultencoding', 'getdlopenflags',
292 'getrecursionlimit', 'getrefcount', 'hexversion', 'maxint', 'maxunicode',
293 'meta_path', 'modules', 'path', 'path_hooks', 'path_importer_cache',
294 'platform', 'prefix', 'ps1', 'ps2', 'setcheckinterval', 'setdlopenflags',
295 'setprofile', 'setrecursionlimit', 'settrace', 'stderr', 'stdin', 'stdout',
296 'version', 'version_info', 'warnoptions']
297
298Without arguments, :func:`dir` lists the names you have defined currently::
299
300 >>> a = [1, 2, 3, 4, 5]
301 >>> import fibo
302 >>> fib = fibo.fib
303 >>> dir()
304 ['__builtins__', '__doc__', '__file__', '__name__', 'a', 'fib', 'fibo', 'sys']
305
306Note that it lists all types of names: variables, modules, functions, etc.
307
308.. index:: module: __builtin__
309
310:func:`dir` does not list the names of built-in functions and variables. If you
311want a list of those, they are defined in the standard module
312:mod:`__builtin__`::
313
314 >>> import __builtin__
315 >>> dir(__builtin__)
316 ['ArithmeticError', 'AssertionError', 'AttributeError', 'DeprecationWarning',
317 'EOFError', 'Ellipsis', 'EnvironmentError', 'Exception', 'False',
318 'FloatingPointError', 'FutureWarning', 'IOError', 'ImportError',
319 'IndentationError', 'IndexError', 'KeyError', 'KeyboardInterrupt',
320 'LookupError', 'MemoryError', 'NameError', 'None', 'NotImplemented',
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000321 'NotImplementedError', 'OSError', 'OverflowError',
Georg Brandl8ec7f652007-08-15 14:28:01 +0000322 'PendingDeprecationWarning', 'ReferenceError', 'RuntimeError',
323 'RuntimeWarning', 'StandardError', 'StopIteration', 'SyntaxError',
324 'SyntaxWarning', 'SystemError', 'SystemExit', 'TabError', 'True',
325 'TypeError', 'UnboundLocalError', 'UnicodeDecodeError',
326 'UnicodeEncodeError', 'UnicodeError', 'UnicodeTranslateError',
327 'UserWarning', 'ValueError', 'Warning', 'WindowsError',
328 'ZeroDivisionError', '_', '__debug__', '__doc__', '__import__',
329 '__name__', 'abs', 'apply', 'basestring', 'bool', 'buffer',
330 'callable', 'chr', 'classmethod', 'cmp', 'coerce', 'compile',
331 'complex', 'copyright', 'credits', 'delattr', 'dict', 'dir', 'divmod',
332 'enumerate', 'eval', 'execfile', 'exit', 'file', 'filter', 'float',
333 'frozenset', 'getattr', 'globals', 'hasattr', 'hash', 'help', 'hex',
334 'id', 'input', 'int', 'intern', 'isinstance', 'issubclass', 'iter',
Antoine Pitrou789be0c2009-04-02 21:18:34 +0000335 'len', 'license', 'list', 'locals', 'long', 'map', 'max', 'memoryview',
336 'min', 'object', 'oct', 'open', 'ord', 'pow', 'property', 'quit', 'range',
Georg Brandl8ec7f652007-08-15 14:28:01 +0000337 'raw_input', 'reduce', 'reload', 'repr', 'reversed', 'round', 'set',
338 'setattr', 'slice', 'sorted', 'staticmethod', 'str', 'sum', 'super',
339 'tuple', 'type', 'unichr', 'unicode', 'vars', 'xrange', 'zip']
340
341
342.. _tut-packages:
343
344Packages
345========
346
347Packages are a way of structuring Python's module namespace by using "dotted
348module names". For example, the module name :mod:`A.B` designates a submodule
349named ``B`` in a package named ``A``. Just like the use of modules saves the
350authors of different modules from having to worry about each other's global
351variable names, the use of dotted module names saves the authors of multi-module
352packages like NumPy or the Python Imaging Library from having to worry about
353each other's module names.
354
355Suppose you want to design a collection of modules (a "package") for the uniform
356handling of sound files and sound data. There are many different sound file
357formats (usually recognized by their extension, for example: :file:`.wav`,
358:file:`.aiff`, :file:`.au`), so you may need to create and maintain a growing
359collection of modules for the conversion between the various file formats.
360There are also many different operations you might want to perform on sound data
361(such as mixing, adding echo, applying an equalizer function, creating an
362artificial stereo effect), so in addition you will be writing a never-ending
363stream of modules to perform these operations. Here's a possible structure for
364your package (expressed in terms of a hierarchical filesystem)::
365
366 sound/ Top-level package
367 __init__.py Initialize the sound package
368 formats/ Subpackage for file format conversions
369 __init__.py
370 wavread.py
371 wavwrite.py
372 aiffread.py
373 aiffwrite.py
374 auread.py
375 auwrite.py
376 ...
377 effects/ Subpackage for sound effects
378 __init__.py
379 echo.py
380 surround.py
381 reverse.py
382 ...
383 filters/ Subpackage for filters
384 __init__.py
385 equalizer.py
386 vocoder.py
387 karaoke.py
388 ...
389
390When importing the package, Python searches through the directories on
391``sys.path`` looking for the package subdirectory.
392
393The :file:`__init__.py` files are required to make Python treat the directories
394as containing packages; this is done to prevent directories with a common name,
395such as ``string``, from unintentionally hiding valid modules that occur later
396on the module search path. In the simplest case, :file:`__init__.py` can just be
397an empty file, but it can also execute initialization code for the package or
398set the ``__all__`` variable, described later.
399
400Users of the package can import individual modules from the package, for
401example::
402
403 import sound.effects.echo
404
405This loads the submodule :mod:`sound.effects.echo`. It must be referenced with
406its full name. ::
407
408 sound.effects.echo.echofilter(input, output, delay=0.7, atten=4)
409
410An alternative way of importing the submodule is::
411
412 from sound.effects import echo
413
414This also loads the submodule :mod:`echo`, and makes it available without its
415package prefix, so it can be used as follows::
416
417 echo.echofilter(input, output, delay=0.7, atten=4)
418
419Yet another variation is to import the desired function or variable directly::
420
421 from sound.effects.echo import echofilter
422
423Again, this loads the submodule :mod:`echo`, but this makes its function
424:func:`echofilter` directly available::
425
426 echofilter(input, output, delay=0.7, atten=4)
427
428Note that when using ``from package import item``, the item can be either a
429submodule (or subpackage) of the package, or some other name defined in the
430package, like a function, class or variable. The ``import`` statement first
431tests whether the item is defined in the package; if not, it assumes it is a
432module and attempts to load it. If it fails to find it, an :exc:`ImportError`
433exception is raised.
434
435Contrarily, when using syntax like ``import item.subitem.subsubitem``, each item
436except for the last must be a package; the last item can be a module or a
437package but can't be a class or function or variable defined in the previous
438item.
439
440
441.. _tut-pkg-import-star:
442
443Importing \* From a Package
444---------------------------
445
446.. index:: single: __all__
447
448Now what happens when the user writes ``from sound.effects import *``? Ideally,
449one would hope that this somehow goes out to the filesystem, finds which
Michael Foordee2df032009-09-13 17:07:46 +0000450submodules are present in the package, and imports them all. This could take a
451long time and importing sub-modules might have unwanted side-effects that should
452only happen when the sub-module is explicitly imported.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000453
Georg Brandl8ec7f652007-08-15 14:28:01 +0000454The only solution is for the package author to provide an explicit index of the
Georg Brandlffefd5a2009-07-29 17:07:21 +0000455package. The :keyword:`import` statement uses the following convention: if a package's
Georg Brandl8ec7f652007-08-15 14:28:01 +0000456:file:`__init__.py` code defines a list named ``__all__``, it is taken to be the
457list of module names that should be imported when ``from package import *`` is
458encountered. It is up to the package author to keep this list up-to-date when a
459new version of the package is released. Package authors may also decide not to
460support it, if they don't see a use for importing \* from their package. For
461example, the file :file:`sounds/effects/__init__.py` could contain the following
462code::
463
464 __all__ = ["echo", "surround", "reverse"]
465
466This would mean that ``from sound.effects import *`` would import the three
467named submodules of the :mod:`sound` package.
468
469If ``__all__`` is not defined, the statement ``from sound.effects import *``
470does *not* import all submodules from the package :mod:`sound.effects` into the
471current namespace; it only ensures that the package :mod:`sound.effects` has
472been imported (possibly running any initialization code in :file:`__init__.py`)
473and then imports whatever names are defined in the package. This includes any
474names defined (and submodules explicitly loaded) by :file:`__init__.py`. It
475also includes any submodules of the package that were explicitly loaded by
Georg Brandlffefd5a2009-07-29 17:07:21 +0000476previous :keyword:`import` statements. Consider this code::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000477
478 import sound.effects.echo
479 import sound.effects.surround
480 from sound.effects import *
481
Georg Brandlffefd5a2009-07-29 17:07:21 +0000482In this example, the :mod:`echo` and :mod:`surround` modules are imported in the
483current namespace because they are defined in the :mod:`sound.effects` package
484when the ``from...import`` statement is executed. (This also works when
485``__all__`` is defined.)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000486
Michael Foordee2df032009-09-13 17:07:46 +0000487Although certain modules are designed to export only names that follow certain
488patterns when you use ``import *``, it is still considered bad practise in
489production code.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000490
491Remember, there is nothing wrong with using ``from Package import
492specific_submodule``! In fact, this is the recommended notation unless the
493importing module needs to use submodules with the same name from different
494packages.
495
496
497Intra-package References
498------------------------
499
500The submodules often need to refer to each other. For example, the
501:mod:`surround` module might use the :mod:`echo` module. In fact, such
502references are so common that the :keyword:`import` statement first looks in the
503containing package before looking in the standard module search path. Thus, the
504:mod:`surround` module can simply use ``import echo`` or ``from echo import
505echofilter``. If the imported module is not found in the current package (the
506package of which the current module is a submodule), the :keyword:`import`
507statement looks for a top-level module with the given name.
508
509When packages are structured into subpackages (as with the :mod:`sound` package
510in the example), you can use absolute imports to refer to submodules of siblings
511packages. For example, if the module :mod:`sound.filters.vocoder` needs to use
512the :mod:`echo` module in the :mod:`sound.effects` package, it can use ``from
513sound.effects import echo``.
514
515Starting with Python 2.5, in addition to the implicit relative imports described
516above, you can write explicit relative imports with the ``from module import
517name`` form of import statement. These explicit relative imports use leading
518dots to indicate the current and parent packages involved in the relative
519import. From the :mod:`surround` module for example, you might use::
520
521 from . import echo
522 from .. import formats
523 from ..filters import equalizer
524
525Note that both explicit and implicit relative imports are based on the name of
526the current module. Since the name of the main module is always ``"__main__"``,
527modules intended for use as the main module of a Python application should
528always use absolute imports.
529
530
531Packages in Multiple Directories
532--------------------------------
533
534Packages support one more special attribute, :attr:`__path__`. This is
535initialized to be a list containing the name of the directory holding the
536package's :file:`__init__.py` before the code in that file is executed. This
537variable can be modified; doing so affects future searches for modules and
538subpackages contained in the package.
539
540While this feature is not often needed, it can be used to extend the set of
541modules found in a package.
542
543
544.. rubric:: Footnotes
545
546.. [#] In fact function definitions are also 'statements' that are 'executed'; the
Georg Brandlffefd5a2009-07-29 17:07:21 +0000547 execution of a module-level function enters the function name in the module's
548 global symbol table.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000549