blob: 06cfefc3da58a5bc0bc46d4df4d4f1fd82f935c8 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`imp` --- Access the :keyword:`import` internals
2=====================================================
3
4.. module:: imp
5 :synopsis: Access the implementation of the import statement.
6
7
8.. index:: statement: import
9
10This module provides an interface to the mechanisms used to implement the
11:keyword:`import` statement. It defines the following constants and functions:
12
13
14.. function:: get_magic()
15
16 .. index:: pair: file; byte-code
17
18 Return the magic string value used to recognize byte-compiled code files
19 (:file:`.pyc` files). (This value may be different for each Python version.)
20
21
22.. function:: get_suffixes()
23
Guido van Rossum04110fb2007-08-24 16:32:05 +000024 Return a list of 3-element tuples, each describing a particular type of
25 module. Each triple has the form ``(suffix, mode, type)``, where *suffix* is
26 a string to be appended to the module name to form the filename to search
27 for, *mode* is the mode string to pass to the built-in :func:`open` function
28 to open the file (this can be ``'r'`` for text files or ``'rb'`` for binary
29 files), and *type* is the file type, which has one of the values
30 :const:`PY_SOURCE`, :const:`PY_COMPILED`, or :const:`C_EXTENSION`, described
31 below.
Georg Brandl116aa622007-08-15 14:28:22 +000032
33
34.. function:: find_module(name[, path])
35
Guido van Rossum04110fb2007-08-24 16:32:05 +000036 Try to find the module *name* on the search path *path*. If *path* is a list
37 of directory names, each directory is searched for files with any of the
38 suffixes returned by :func:`get_suffixes` above. Invalid names in the list
39 are silently ignored (but all list items must be strings). If *path* is
40 omitted or ``None``, the list of directory names given by ``sys.path`` is
41 searched, but first it searches a few special places: it tries to find a
42 built-in module with the given name (:const:`C_BUILTIN`), then a frozen
43 module (:const:`PY_FROZEN`), and on some systems some other places are looked
Georg Brandlc575c902008-09-13 17:46:05 +000044 in as well (on Windows, it looks in the registry which may point to a
45 specific file).
Georg Brandl116aa622007-08-15 14:28:22 +000046
Guido van Rossum04110fb2007-08-24 16:32:05 +000047 If search is successful, the return value is a 3-element tuple ``(file,
48 pathname, description)``:
49
50 *file* is an open file object positioned at the beginning, *pathname* is the
51 pathname of the file found, and *description* is a 3-element tuple as
Georg Brandl116aa622007-08-15 14:28:22 +000052 contained in the list returned by :func:`get_suffixes` describing the kind of
Guido van Rossum04110fb2007-08-24 16:32:05 +000053 module found.
Georg Brandl116aa622007-08-15 14:28:22 +000054
Guido van Rossum04110fb2007-08-24 16:32:05 +000055 If the module does not live in a file, the returned *file* is ``None``,
56 *pathname* is the empty string, and the *description* tuple contains empty
57 strings for its suffix and mode; the module type is indicated as given in
58 parentheses above. If the search is unsuccessful, :exc:`ImportError` is
59 raised. Other exceptions indicate problems with the arguments or
60 environment.
61
62 If the module is a package, *file* is ``None``, *pathname* is the package
63 path and the last item in the *description* tuple is :const:`PKG_DIRECTORY`.
64
65 This function does not handle hierarchical module names (names containing
66 dots). In order to find *P*.*M*, that is, submodule *M* of package *P*, use
Georg Brandl116aa622007-08-15 14:28:22 +000067 :func:`find_module` and :func:`load_module` to find and load package *P*, and
68 then use :func:`find_module` with the *path* argument set to ``P.__path__``.
69 When *P* itself has a dotted name, apply this recipe recursively.
70
71
Guido van Rossum04110fb2007-08-24 16:32:05 +000072.. function:: load_module(name, file, pathname, description)
Georg Brandl116aa622007-08-15 14:28:22 +000073
74 Load a module that was previously found by :func:`find_module` (or by an
75 otherwise conducted search yielding compatible results). This function does
76 more than importing the module: if the module was already imported, it will
Guido van Rossum04110fb2007-08-24 16:32:05 +000077 reload the module! The *name* argument indicates the full
78 module name (including the package name, if this is a submodule of a
79 package). The *file* argument is an open file, and *pathname* is the
80 corresponding file name; these can be ``None`` and ``''``, respectively, when
81 the module is a package or not being loaded from a file. The *description*
82 argument is a tuple, as would be returned by :func:`get_suffixes`, describing
83 what kind of module must be loaded.
Georg Brandl116aa622007-08-15 14:28:22 +000084
Guido van Rossum04110fb2007-08-24 16:32:05 +000085 If the load is successful, the return value is the module object; otherwise,
86 an exception (usually :exc:`ImportError`) is raised.
Georg Brandl116aa622007-08-15 14:28:22 +000087
Guido van Rossum04110fb2007-08-24 16:32:05 +000088 **Important:** the caller is responsible for closing the *file* argument, if
89 it was not ``None``, even when an exception is raised. This is best done
90 using a :keyword:`try` ... :keyword:`finally` statement.
Georg Brandl116aa622007-08-15 14:28:22 +000091
92
93.. function:: new_module(name)
94
95 Return a new empty module object called *name*. This object is *not* inserted
96 in ``sys.modules``.
97
98
99.. function:: lock_held()
100
101 Return ``True`` if the import lock is currently held, else ``False``. On
102 platforms without threads, always return ``False``.
103
104 On platforms with threads, a thread executing an import holds an internal lock
105 until the import is complete. This lock blocks other threads from doing an
106 import until the original import completes, which in turn prevents other threads
107 from seeing incomplete module objects constructed by the original thread while
108 in the process of completing its import (and the imports, if any, triggered by
109 that).
110
111
112.. function:: acquire_lock()
113
Alexandre Vassalottia79e33e2008-05-15 22:51:26 +0000114 Acquire the interpreter's import lock for the current thread. This lock should
Georg Brandl116aa622007-08-15 14:28:22 +0000115 be used by import hooks to ensure thread-safety when importing modules. On
116 platforms without threads, this function does nothing.
117
Alexandre Vassalottia79e33e2008-05-15 22:51:26 +0000118 Once a thread has acquired the import lock, the same thread may acquire it
119 again without blocking; the thread must release it once for each time it has
120 acquired it.
121
122 On platforms without threads, this function does nothing.
123
Georg Brandl116aa622007-08-15 14:28:22 +0000124
125.. function:: release_lock()
126
127 Release the interpreter's import lock. On platforms without threads, this
128 function does nothing.
129
Georg Brandl116aa622007-08-15 14:28:22 +0000130
Christian Heimes043d6f62008-01-07 17:19:16 +0000131.. function:: reload(module)
132
133 Reload a previously imported *module*. The argument must be a module object, so
134 it must have been successfully imported before. This is useful if you have
135 edited the module source file using an external editor and want to try out the
136 new version without leaving the Python interpreter. The return value is the
137 module object (the same as the *module* argument).
138
139 When ``reload(module)`` is executed:
140
141 * Python modules' code is recompiled and the module-level code reexecuted,
142 defining a new set of objects which are bound to names in the module's
143 dictionary. The ``init`` function of extension modules is not called a second
144 time.
145
146 * As with all other objects in Python the old objects are only reclaimed after
147 their reference counts drop to zero.
148
149 * The names in the module namespace are updated to point to any new or changed
150 objects.
151
152 * Other references to the old objects (such as names external to the module) are
153 not rebound to refer to the new objects and must be updated in each namespace
154 where they occur if that is desired.
155
156 There are a number of other caveats:
157
158 If a module is syntactically correct but its initialization fails, the first
159 :keyword:`import` statement for it does not bind its name locally, but does
160 store a (partially initialized) module object in ``sys.modules``. To reload the
161 module you must first :keyword:`import` it again (this will bind the name to the
162 partially initialized module object) before you can :func:`reload` it.
163
164 When a module is reloaded, its dictionary (containing the module's global
165 variables) is retained. Redefinitions of names will override the old
166 definitions, so this is generally not a problem. If the new version of a module
167 does not define a name that was defined by the old version, the old definition
168 remains. This feature can be used to the module's advantage if it maintains a
169 global table or cache of objects --- with a :keyword:`try` statement it can test
170 for the table's presence and skip its initialization if desired::
171
172 try:
173 cache
174 except NameError:
175 cache = {}
176
177 It is legal though generally not very useful to reload built-in or dynamically
178 loaded modules, except for :mod:`sys`, :mod:`__main__` and :mod:`__builtin__`.
179 In many cases, however, extension modules are not designed to be initialized
180 more than once, and may fail in arbitrary ways when reloaded.
181
182 If a module imports objects from another module using :keyword:`from` ...
183 :keyword:`import` ..., calling :func:`reload` for the other module does not
184 redefine the objects imported from it --- one way around this is to re-execute
185 the :keyword:`from` statement, another is to use :keyword:`import` and qualified
186 names (*module*.*name*) instead.
187
188 If a module instantiates instances of a class, reloading the module that defines
189 the class does not affect the method definitions of the instances --- they
190 continue to use the old class definition. The same is true for derived classes.
191
192
Georg Brandlf6945182008-02-01 11:56:49 +0000193.. function:: acquire_lock()
194
195 Acquires the interpreter's import lock for the current thread. This lock should
196 be used by import hooks to ensure thread-safety when importing modules. On
197 platforms without threads, this function does nothing.
198
199
200.. function:: release_lock()
201
202 Release the interpreter's import lock. On platforms without threads, this
203 function does nothing.
204
205
Georg Brandl116aa622007-08-15 14:28:22 +0000206The following constants with integer values, defined in this module, are used to
207indicate the search result of :func:`find_module`.
208
209
210.. data:: PY_SOURCE
211
212 The module was found as a source file.
213
214
215.. data:: PY_COMPILED
216
217 The module was found as a compiled code object file.
218
219
220.. data:: C_EXTENSION
221
222 The module was found as dynamically loadable shared library.
223
224
Georg Brandl116aa622007-08-15 14:28:22 +0000225.. data:: PKG_DIRECTORY
226
227 The module was found as a package directory.
228
229
230.. data:: C_BUILTIN
231
232 The module was found as a built-in module.
233
234
235.. data:: PY_FROZEN
236
237 The module was found as a frozen module (see :func:`init_frozen`).
238
239The following constant and functions are obsolete; their functionality is
240available through :func:`find_module` or :func:`load_module`. They are kept
241around for backward compatibility:
242
243
244.. data:: SEARCH_ERROR
245
246 Unused.
247
248
249.. function:: init_builtin(name)
250
251 Initialize the built-in module called *name* and return its module object along
252 with storing it in ``sys.modules``. If the module was already initialized, it
253 will be initialized *again*. Re-initialization involves the copying of the
254 built-in module's ``__dict__`` from the cached module over the module's entry in
255 ``sys.modules``. If there is no built-in module called *name*, ``None`` is
256 returned.
257
258
259.. function:: init_frozen(name)
260
261 Initialize the frozen module called *name* and return its module object. If
262 the module was already initialized, it will be initialized *again*. If there
263 is no frozen module called *name*, ``None`` is returned. (Frozen modules are
264 modules written in Python whose compiled byte-code object is incorporated
265 into a custom-built Python interpreter by Python's :program:`freeze`
266 utility. See :file:`Tools/freeze/` for now.)
267
268
269.. function:: is_builtin(name)
270
271 Return ``1`` if there is a built-in module called *name* which can be
272 initialized again. Return ``-1`` if there is a built-in module called *name*
273 which cannot be initialized again (see :func:`init_builtin`). Return ``0`` if
274 there is no built-in module called *name*.
275
276
277.. function:: is_frozen(name)
278
279 Return ``True`` if there is a frozen module (see :func:`init_frozen`) called
280 *name*, or ``False`` if there is no such module.
281
282
283.. function:: load_compiled(name, pathname, [file])
284
285 .. index:: pair: file; byte-code
286
287 Load and initialize a module implemented as a byte-compiled code file and return
288 its module object. If the module was already initialized, it will be
289 initialized *again*. The *name* argument is used to create or access a module
290 object. The *pathname* argument points to the byte-compiled code file. The
291 *file* argument is the byte-compiled code file, open for reading in binary mode,
292 from the beginning. It must currently be a real file object, not a user-defined
293 class emulating a file.
294
295
296.. function:: load_dynamic(name, pathname[, file])
297
298 Load and initialize a module implemented as a dynamically loadable shared
299 library and return its module object. If the module was already initialized, it
300 will be initialized *again*. Re-initialization involves copying the ``__dict__``
301 attribute of the cached instance of the module over the value used in the module
302 cached in ``sys.modules``. The *pathname* argument must point to the shared
303 library. The *name* argument is used to construct the name of the
304 initialization function: an external C function called ``initname()`` in the
305 shared library is called. The optional *file* argument is ignored. (Note:
306 using shared libraries is highly system dependent, and not all systems support
307 it.)
308
309
310.. function:: load_source(name, pathname[, file])
311
312 Load and initialize a module implemented as a Python source file and return its
313 module object. If the module was already initialized, it will be initialized
314 *again*. The *name* argument is used to create or access a module object. The
315 *pathname* argument points to the source file. The *file* argument is the
316 source file, open for reading as text, from the beginning. It must currently be
317 a real file object, not a user-defined class emulating a file. Note that if a
318 properly matching byte-compiled file (with suffix :file:`.pyc` or :file:`.pyo`)
319 exists, it will be used instead of parsing the given source file.
320
321
322.. class:: NullImporter(path_string)
323
324 The :class:`NullImporter` type is a :pep:`302` import hook that handles
325 non-directory path strings by failing to find any modules. Calling this type
326 with an existing directory or empty string raises :exc:`ImportError`.
327 Otherwise, a :class:`NullImporter` instance is returned.
328
329 Python adds instances of this type to ``sys.path_importer_cache`` for any path
330 entries that are not directories and are not handled by any other path hooks on
331 ``sys.path_hooks``. Instances have only one method:
332
333
334 .. method:: NullImporter.find_module(fullname [, path])
335
336 This method always returns ``None``, indicating that the requested module could
337 not be found.
338
Georg Brandl116aa622007-08-15 14:28:22 +0000339
340.. _examples-imp:
341
342Examples
343--------
344
345The following function emulates what was the standard import statement up to
346Python 1.4 (no hierarchical module names). (This *implementation* wouldn't work
347in that version, since :func:`find_module` has been extended and
348:func:`load_module` has been added in 1.4.) ::
349
350 import imp
351 import sys
352
353 def __import__(name, globals=None, locals=None, fromlist=None):
354 # Fast path: see if the module has already been imported.
355 try:
356 return sys.modules[name]
357 except KeyError:
358 pass
359
360 # If any of the following calls raises an exception,
361 # there's a problem we can't handle -- let the caller handle it.
362
363 fp, pathname, description = imp.find_module(name)
364
365 try:
366 return imp.load_module(name, fp, pathname, description)
367 finally:
368 # Since we may exit via an exception, close fp explicitly.
369 if fp:
370 fp.close()
371
372.. index:: module: knee
373
374A more complete example that implements hierarchical module names and includes a
375:func:`reload` function can be found in the module :mod:`knee`. The :mod:`knee`
376module can be found in :file:`Demo/imputil/` in the Python source distribution.
377