blob: e245a0c6a5b97c3cf07183d5565ae0fb6cef6e47 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001
2:mod:`imp` --- Access the :keyword:`import` internals
3=====================================================
4
5.. module:: imp
6 :synopsis: Access the implementation of the import statement.
7
8
9.. index:: statement: import
10
11This module provides an interface to the mechanisms used to implement the
12:keyword:`import` statement. It defines the following constants and functions:
13
14
15.. function:: get_magic()
16
17 .. index:: pair: file; byte-code
18
19 Return the magic string value used to recognize byte-compiled code files
20 (:file:`.pyc` files). (This value may be different for each Python version.)
21
22
23.. function:: get_suffixes()
24
Guido van Rossum04110fb2007-08-24 16:32:05 +000025 Return a list of 3-element tuples, each describing a particular type of
26 module. Each triple has the form ``(suffix, mode, type)``, where *suffix* is
27 a string to be appended to the module name to form the filename to search
28 for, *mode* is the mode string to pass to the built-in :func:`open` function
29 to open the file (this can be ``'r'`` for text files or ``'rb'`` for binary
30 files), and *type* is the file type, which has one of the values
31 :const:`PY_SOURCE`, :const:`PY_COMPILED`, or :const:`C_EXTENSION`, described
32 below.
Georg Brandl116aa622007-08-15 14:28:22 +000033
34
35.. function:: find_module(name[, path])
36
Guido van Rossum04110fb2007-08-24 16:32:05 +000037 Try to find the module *name* on the search path *path*. If *path* is a list
38 of directory names, each directory is searched for files with any of the
39 suffixes returned by :func:`get_suffixes` above. Invalid names in the list
40 are silently ignored (but all list items must be strings). If *path* is
41 omitted or ``None``, the list of directory names given by ``sys.path`` is
42 searched, but first it searches a few special places: it tries to find a
43 built-in module with the given name (:const:`C_BUILTIN`), then a frozen
44 module (:const:`PY_FROZEN`), and on some systems some other places are looked
45 in as well (on the Mac, it looks for a resource (:const:`PY_RESOURCE`); on
46 Windows, it looks in the registry which may point to a specific file).
Georg Brandl116aa622007-08-15 14:28:22 +000047
Guido van Rossum04110fb2007-08-24 16:32:05 +000048 If search is successful, the return value is a 3-element tuple ``(file,
49 pathname, description)``:
50
51 *file* is an open file object positioned at the beginning, *pathname* is the
52 pathname of the file found, and *description* is a 3-element tuple as
Georg Brandl116aa622007-08-15 14:28:22 +000053 contained in the list returned by :func:`get_suffixes` describing the kind of
Guido van Rossum04110fb2007-08-24 16:32:05 +000054 module found.
Georg Brandl116aa622007-08-15 14:28:22 +000055
Guido van Rossum04110fb2007-08-24 16:32:05 +000056 If the module does not live in a file, the returned *file* is ``None``,
57 *pathname* is the empty string, and the *description* tuple contains empty
58 strings for its suffix and mode; the module type is indicated as given in
59 parentheses above. If the search is unsuccessful, :exc:`ImportError` is
60 raised. Other exceptions indicate problems with the arguments or
61 environment.
62
63 If the module is a package, *file* is ``None``, *pathname* is the package
64 path and the last item in the *description* tuple is :const:`PKG_DIRECTORY`.
65
66 This function does not handle hierarchical module names (names containing
67 dots). In order to find *P*.*M*, that is, submodule *M* of package *P*, use
Georg Brandl116aa622007-08-15 14:28:22 +000068 :func:`find_module` and :func:`load_module` to find and load package *P*, and
69 then use :func:`find_module` with the *path* argument set to ``P.__path__``.
70 When *P* itself has a dotted name, apply this recipe recursively.
71
72
Guido van Rossum04110fb2007-08-24 16:32:05 +000073.. function:: load_module(name, file, pathname, description)
Georg Brandl116aa622007-08-15 14:28:22 +000074
75 Load a module that was previously found by :func:`find_module` (or by an
76 otherwise conducted search yielding compatible results). This function does
77 more than importing the module: if the module was already imported, it will
Guido van Rossum04110fb2007-08-24 16:32:05 +000078 reload the module! The *name* argument indicates the full
79 module name (including the package name, if this is a submodule of a
80 package). The *file* argument is an open file, and *pathname* is the
81 corresponding file name; these can be ``None`` and ``''``, respectively, when
82 the module is a package or not being loaded from a file. The *description*
83 argument is a tuple, as would be returned by :func:`get_suffixes`, describing
84 what kind of module must be loaded.
Georg Brandl116aa622007-08-15 14:28:22 +000085
Guido van Rossum04110fb2007-08-24 16:32:05 +000086 If the load is successful, the return value is the module object; otherwise,
87 an exception (usually :exc:`ImportError`) is raised.
Georg Brandl116aa622007-08-15 14:28:22 +000088
Guido van Rossum04110fb2007-08-24 16:32:05 +000089 **Important:** the caller is responsible for closing the *file* argument, if
90 it was not ``None``, even when an exception is raised. This is best done
91 using a :keyword:`try` ... :keyword:`finally` statement.
Georg Brandl116aa622007-08-15 14:28:22 +000092
93
94.. function:: new_module(name)
95
96 Return a new empty module object called *name*. This object is *not* inserted
97 in ``sys.modules``.
98
99
100.. function:: lock_held()
101
102 Return ``True`` if the import lock is currently held, else ``False``. On
103 platforms without threads, always return ``False``.
104
105 On platforms with threads, a thread executing an import holds an internal lock
106 until the import is complete. This lock blocks other threads from doing an
107 import until the original import completes, which in turn prevents other threads
108 from seeing incomplete module objects constructed by the original thread while
109 in the process of completing its import (and the imports, if any, triggered by
110 that).
111
112
113.. function:: acquire_lock()
114
Alexandre Vassalottia79e33e2008-05-15 22:51:26 +0000115 Acquire the interpreter's import lock for the current thread. This lock should
Georg Brandl116aa622007-08-15 14:28:22 +0000116 be used by import hooks to ensure thread-safety when importing modules. On
117 platforms without threads, this function does nothing.
118
Alexandre Vassalottia79e33e2008-05-15 22:51:26 +0000119 Once a thread has acquired the import lock, the same thread may acquire it
120 again without blocking; the thread must release it once for each time it has
121 acquired it.
122
123 On platforms without threads, this function does nothing.
124
Georg Brandl116aa622007-08-15 14:28:22 +0000125
126.. function:: release_lock()
127
128 Release the interpreter's import lock. On platforms without threads, this
129 function does nothing.
130
Georg Brandl116aa622007-08-15 14:28:22 +0000131
Christian Heimes043d6f62008-01-07 17:19:16 +0000132.. function:: reload(module)
133
134 Reload a previously imported *module*. The argument must be a module object, so
135 it must have been successfully imported before. This is useful if you have
136 edited the module source file using an external editor and want to try out the
137 new version without leaving the Python interpreter. The return value is the
138 module object (the same as the *module* argument).
139
140 When ``reload(module)`` is executed:
141
142 * Python modules' code is recompiled and the module-level code reexecuted,
143 defining a new set of objects which are bound to names in the module's
144 dictionary. The ``init`` function of extension modules is not called a second
145 time.
146
147 * As with all other objects in Python the old objects are only reclaimed after
148 their reference counts drop to zero.
149
150 * The names in the module namespace are updated to point to any new or changed
151 objects.
152
153 * Other references to the old objects (such as names external to the module) are
154 not rebound to refer to the new objects and must be updated in each namespace
155 where they occur if that is desired.
156
157 There are a number of other caveats:
158
159 If a module is syntactically correct but its initialization fails, the first
160 :keyword:`import` statement for it does not bind its name locally, but does
161 store a (partially initialized) module object in ``sys.modules``. To reload the
162 module you must first :keyword:`import` it again (this will bind the name to the
163 partially initialized module object) before you can :func:`reload` it.
164
165 When a module is reloaded, its dictionary (containing the module's global
166 variables) is retained. Redefinitions of names will override the old
167 definitions, so this is generally not a problem. If the new version of a module
168 does not define a name that was defined by the old version, the old definition
169 remains. This feature can be used to the module's advantage if it maintains a
170 global table or cache of objects --- with a :keyword:`try` statement it can test
171 for the table's presence and skip its initialization if desired::
172
173 try:
174 cache
175 except NameError:
176 cache = {}
177
178 It is legal though generally not very useful to reload built-in or dynamically
179 loaded modules, except for :mod:`sys`, :mod:`__main__` and :mod:`__builtin__`.
180 In many cases, however, extension modules are not designed to be initialized
181 more than once, and may fail in arbitrary ways when reloaded.
182
183 If a module imports objects from another module using :keyword:`from` ...
184 :keyword:`import` ..., calling :func:`reload` for the other module does not
185 redefine the objects imported from it --- one way around this is to re-execute
186 the :keyword:`from` statement, another is to use :keyword:`import` and qualified
187 names (*module*.*name*) instead.
188
189 If a module instantiates instances of a class, reloading the module that defines
190 the class does not affect the method definitions of the instances --- they
191 continue to use the old class definition. The same is true for derived classes.
192
193
Georg Brandlf6945182008-02-01 11:56:49 +0000194.. function:: acquire_lock()
195
196 Acquires the interpreter's import lock for the current thread. This lock should
197 be used by import hooks to ensure thread-safety when importing modules. On
198 platforms without threads, this function does nothing.
199
200
201.. function:: release_lock()
202
203 Release the interpreter's import lock. On platforms without threads, this
204 function does nothing.
205
206
Georg Brandl116aa622007-08-15 14:28:22 +0000207The following constants with integer values, defined in this module, are used to
208indicate the search result of :func:`find_module`.
209
210
211.. data:: PY_SOURCE
212
213 The module was found as a source file.
214
215
216.. data:: PY_COMPILED
217
218 The module was found as a compiled code object file.
219
220
221.. data:: C_EXTENSION
222
223 The module was found as dynamically loadable shared library.
224
225
226.. data:: PY_RESOURCE
227
228 The module was found as a Mac OS 9 resource. This value can only be returned on
229 a Mac OS 9 or earlier Macintosh.
230
231
232.. data:: PKG_DIRECTORY
233
234 The module was found as a package directory.
235
236
237.. data:: C_BUILTIN
238
239 The module was found as a built-in module.
240
241
242.. data:: PY_FROZEN
243
244 The module was found as a frozen module (see :func:`init_frozen`).
245
246The following constant and functions are obsolete; their functionality is
247available through :func:`find_module` or :func:`load_module`. They are kept
248around for backward compatibility:
249
250
251.. data:: SEARCH_ERROR
252
253 Unused.
254
255
256.. function:: init_builtin(name)
257
258 Initialize the built-in module called *name* and return its module object along
259 with storing it in ``sys.modules``. If the module was already initialized, it
260 will be initialized *again*. Re-initialization involves the copying of the
261 built-in module's ``__dict__`` from the cached module over the module's entry in
262 ``sys.modules``. If there is no built-in module called *name*, ``None`` is
263 returned.
264
265
266.. function:: init_frozen(name)
267
268 Initialize the frozen module called *name* and return its module object. If
269 the module was already initialized, it will be initialized *again*. If there
270 is no frozen module called *name*, ``None`` is returned. (Frozen modules are
271 modules written in Python whose compiled byte-code object is incorporated
272 into a custom-built Python interpreter by Python's :program:`freeze`
273 utility. See :file:`Tools/freeze/` for now.)
274
275
276.. function:: is_builtin(name)
277
278 Return ``1`` if there is a built-in module called *name* which can be
279 initialized again. Return ``-1`` if there is a built-in module called *name*
280 which cannot be initialized again (see :func:`init_builtin`). Return ``0`` if
281 there is no built-in module called *name*.
282
283
284.. function:: is_frozen(name)
285
286 Return ``True`` if there is a frozen module (see :func:`init_frozen`) called
287 *name*, or ``False`` if there is no such module.
288
289
290.. function:: load_compiled(name, pathname, [file])
291
292 .. index:: pair: file; byte-code
293
294 Load and initialize a module implemented as a byte-compiled code file and return
295 its module object. If the module was already initialized, it will be
296 initialized *again*. The *name* argument is used to create or access a module
297 object. The *pathname* argument points to the byte-compiled code file. The
298 *file* argument is the byte-compiled code file, open for reading in binary mode,
299 from the beginning. It must currently be a real file object, not a user-defined
300 class emulating a file.
301
302
303.. function:: load_dynamic(name, pathname[, file])
304
305 Load and initialize a module implemented as a dynamically loadable shared
306 library and return its module object. If the module was already initialized, it
307 will be initialized *again*. Re-initialization involves copying the ``__dict__``
308 attribute of the cached instance of the module over the value used in the module
309 cached in ``sys.modules``. The *pathname* argument must point to the shared
310 library. The *name* argument is used to construct the name of the
311 initialization function: an external C function called ``initname()`` in the
312 shared library is called. The optional *file* argument is ignored. (Note:
313 using shared libraries is highly system dependent, and not all systems support
314 it.)
315
316
317.. function:: load_source(name, pathname[, file])
318
319 Load and initialize a module implemented as a Python source file and return its
320 module object. If the module was already initialized, it will be initialized
321 *again*. The *name* argument is used to create or access a module object. The
322 *pathname* argument points to the source file. The *file* argument is the
323 source file, open for reading as text, from the beginning. It must currently be
324 a real file object, not a user-defined class emulating a file. Note that if a
325 properly matching byte-compiled file (with suffix :file:`.pyc` or :file:`.pyo`)
326 exists, it will be used instead of parsing the given source file.
327
328
329.. class:: NullImporter(path_string)
330
331 The :class:`NullImporter` type is a :pep:`302` import hook that handles
332 non-directory path strings by failing to find any modules. Calling this type
333 with an existing directory or empty string raises :exc:`ImportError`.
334 Otherwise, a :class:`NullImporter` instance is returned.
335
336 Python adds instances of this type to ``sys.path_importer_cache`` for any path
337 entries that are not directories and are not handled by any other path hooks on
338 ``sys.path_hooks``. Instances have only one method:
339
340
341 .. method:: NullImporter.find_module(fullname [, path])
342
343 This method always returns ``None``, indicating that the requested module could
344 not be found.
345
Georg Brandl116aa622007-08-15 14:28:22 +0000346
347.. _examples-imp:
348
349Examples
350--------
351
352The following function emulates what was the standard import statement up to
353Python 1.4 (no hierarchical module names). (This *implementation* wouldn't work
354in that version, since :func:`find_module` has been extended and
355:func:`load_module` has been added in 1.4.) ::
356
357 import imp
358 import sys
359
360 def __import__(name, globals=None, locals=None, fromlist=None):
361 # Fast path: see if the module has already been imported.
362 try:
363 return sys.modules[name]
364 except KeyError:
365 pass
366
367 # If any of the following calls raises an exception,
368 # there's a problem we can't handle -- let the caller handle it.
369
370 fp, pathname, description = imp.find_module(name)
371
372 try:
373 return imp.load_module(name, fp, pathname, description)
374 finally:
375 # Since we may exit via an exception, close fp explicitly.
376 if fp:
377 fp.close()
378
379.. index:: module: knee
380
381A more complete example that implements hierarchical module names and includes a
382:func:`reload` function can be found in the module :mod:`knee`. The :mod:`knee`
383module can be found in :file:`Demo/imputil/` in the Python source distribution.
384