:mod:`imp` --- Access the :keyword:`import` internals
=====================================================

.. module:: imp
   :synopsis: Access the implementation of the import statement.


.. index:: statement: import

This module provides an interface to the mechanisms used to implement the
:keyword:`import` statement. It defines the following constants and functions:


.. function:: get_magic()

   .. index:: pair: file; byte-code

   Return the magic string value used to recognize byte-compiled code files
   (:file:`.pyc` files). (This value may be different for each Python version.)


.. function:: get_suffixes()

   Return a list of 3-element tuples, each describing a particular type of
   module. Each triple has the form ``(suffix, mode, type)``, where *suffix* is
   a string to be appended to the module name to form the filename to search
   for, *mode* is the mode string to pass to the built-in :func:`open` function
   to open the file (this can be ``'r'`` for text files or ``'rb'`` for binary
   files), and *type* is the file type, which has one of the values
   :const:`PY_SOURCE`, :const:`PY_COMPILED`, or :const:`C_EXTENSION`, described
   below.


.. function:: find_module(name[, path])

   Try to find the module *name*. If *path* is omitted or ``None``, the list of
   directory names given by ``sys.path`` is searched, but first a few special
   places are searched: the function tries to find a built-in module with the
   given name (:const:`C_BUILTIN`), then a frozen module (:const:`PY_FROZEN`),
   and, on some systems, other platform-specific places as well (on Windows, it
   looks in the registry, which may point to a specific file).

   Otherwise, *path* must be a list of directory names; each directory is
   searched for files with any of the suffixes returned by :func:`get_suffixes`
   above. Invalid names in the list are silently ignored (but all list items
   must be strings).

   If the search is successful, the return value is a 3-element tuple
   ``(file, pathname, description)``:

   *file* is an open :term:`file object` positioned at the beginning, *pathname*
   is the pathname of the file found, and *description* is a 3-element tuple as
   contained in the list returned by :func:`get_suffixes` describing the kind of
   module found.

   If the module does not live in a file, the returned *file* is ``None``,
   *pathname* is the empty string, and the *description* tuple contains empty
   strings for its suffix and mode; the module type is indicated as given in
   parentheses above. If the search is unsuccessful, :exc:`ImportError` is
   raised. Other exceptions indicate problems with the arguments or
   environment.

   If the module is a package, *file* is ``None``, *pathname* is the package
   path and the last item in the *description* tuple is :const:`PKG_DIRECTORY`.

   This function does not handle hierarchical module names (names containing
   dots). In order to find *P.M*, that is, submodule *M* of package *P*, use
   :func:`find_module` and :func:`load_module` to find and load package *P*, and
   then use :func:`find_module` with the *path* argument set to ``P.__path__``.
   When *P* itself has a dotted name, apply this recipe recursively.

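The recipe for dotted names can be sketched as a small helper. This is only an
illustration under stated assumptions: ``import_dotted`` is a hypothetical name
and not part of :mod:`imp`; modules already present in ``sys.modules`` are
reused instead of being reloaded; and the ``imp = None`` guard exists solely so
that the snippet can still be defined on interpreters that do not provide
:mod:`imp` (where the helper cannot actually load anything).

```python
import sys

try:
    import imp
except ImportError:
    # Assumption: some interpreters do not provide imp. The helper below
    # can then be defined, but it cannot find or load new modules.
    imp = None


def import_dotted(name):
    """Hypothetical helper: find and load *name* one component at a time."""
    module = None
    path = None  # None means "search sys.path" for the first component
    parts = name.split(".")
    for index, part in enumerate(parts):
        full_name = ".".join(parts[:index + 1])
        if index > 0 and path is None:
            # The parent has no __path__, so it is not a package.
            raise ImportError("%s is not a package" % ".".join(parts[:index]))
        if full_name in sys.modules:
            # Reuse the existing module; load_module() would reload it.
            module = sys.modules[full_name]
        else:
            fp, pathname, description = imp.find_module(part, path)
            try:
                module = imp.load_module(full_name, fp, pathname, description)
            finally:
                # The caller must close the file, even after an exception.
                if fp:
                    fp.close()
        package_path = getattr(module, "__path__", None)
        path = list(package_path) if package_path is not None else None
    return module
```

Where :mod:`imp` is available, ``import_dotted("email.mime")`` would first
resolve ``email`` and then search ``email.__path__`` for ``mime``.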

.. function:: load_module(name, file, pathname, description)

   Load a module that was previously found by :func:`find_module` (or by an
   otherwise conducted search yielding compatible results). This function does
   more than importing the module: if the module was already imported, it will
   reload the module! The *name* argument indicates the full
   module name (including the package name, if this is a submodule of a
   package). The *file* argument is an open file, and *pathname* is the
   corresponding file name; these can be ``None`` and ``''``, respectively, when
   the module is a package or not being loaded from a file. The *description*
   argument is a tuple, as would be returned by :func:`get_suffixes`, describing
   what kind of module must be loaded.

   If the load is successful, the return value is the module object; otherwise,
   an exception (usually :exc:`ImportError`) is raised.

   **Important:** the caller is responsible for closing the *file* argument, if
   it was not ``None``, even when an exception is raised. This is best done
   using a :keyword:`try` ... :keyword:`finally` statement.


.. function:: new_module(name)

   Return a new empty module object called *name*. This object is *not* inserted
   in ``sys.modules``.

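As a minimal sketch of :func:`new_module`, the snippet below creates an empty
module, fills it by executing code in its namespace, and checks that nothing
was registered in ``sys.modules``. The module name ``scratch_demo`` is
hypothetical, and the fallback to ``types.ModuleType`` is an assumption made
only for interpreters that do not provide :mod:`imp`.

```python
import sys
import types

try:
    from imp import new_module
except ImportError:
    # Assumption: where imp is unavailable, types.ModuleType(name) also
    # yields a new, empty module object.
    new_module = types.ModuleType

scratch = new_module("scratch_demo")  # hypothetical module name

# Populate the module by executing code in its namespace.
exec("answer = 6 * 7", scratch.__dict__)

# Creating the module did not register it anywhere.
registered = "scratch_demo" in sys.modules
```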

.. function:: lock_held()

   Return ``True`` if the import lock is currently held, else ``False``. On
   platforms without threads, always return ``False``.

   On platforms with threads, a thread executing an import holds an internal lock
   until the import is complete. This lock blocks other threads from doing an
   import until the original import completes, which in turn prevents other threads
   from seeing incomplete module objects constructed by the original thread while
   in the process of completing its import (and the imports, if any, triggered by
   that).


.. function:: acquire_lock()

   Acquire the interpreter's import lock for the current thread. This lock should
   be used by import hooks to ensure thread-safety when importing modules.

   Once a thread has acquired the import lock, the same thread may acquire it
   again without blocking; the thread must release it once for each time it has
   acquired it.

   On platforms without threads, this function does nothing.


.. function:: release_lock()

   Release the interpreter's import lock. On platforms without threads, this
   function does nothing.

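The pairing of :func:`acquire_lock` and :func:`release_lock` might be wrapped
as follows. This is a sketch, not a prescribed pattern:
``call_with_import_lock`` and ``nested_probe`` are hypothetical helpers, and the
fallback to the private ``_imp`` module is an assumption about CPython builds
that do not provide :mod:`imp`.

```python
try:
    import imp as lock_api
except ImportError:
    # Assumption: on CPython builds without imp, the private _imp module
    # exposes the same three lock functions.
    import _imp as lock_api


def call_with_import_lock(func, *args):
    """Hypothetical helper: run func(*args) while holding the import lock."""
    lock_api.acquire_lock()
    try:
        return func(*args)
    finally:
        # Release exactly once per acquisition, even if func() raised.
        lock_api.release_lock()


def nested_probe():
    """Acquire the lock again in the same thread and report its state."""
    lock_api.acquire_lock()  # does not block: this thread already holds it
    try:
        return lock_api.lock_held()
    finally:
        lock_api.release_lock()


held_inside = call_with_import_lock(nested_probe)
```

The ``try`` ... ``finally`` form keeps acquisitions and releases balanced, which
matters because the lock must be released once for each time it was acquired.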

.. function:: reload(module)

   Reload a previously imported *module*. The argument must be a module object, so
   it must have been successfully imported before. This is useful if you have
   edited the module source file using an external editor and want to try out the
   new version without leaving the Python interpreter. The return value is the
   module object (the same as the *module* argument).

   When ``reload(module)`` is executed:

   * Python modules' code is recompiled and the module-level code reexecuted,
     defining a new set of objects which are bound to names in the module's
     dictionary. The ``init`` function of extension modules is not called a second
     time.

   * As with all other objects in Python the old objects are only reclaimed after
     their reference counts drop to zero.

   * The names in the module namespace are updated to point to any new or changed
     objects.

   * Other references to the old objects (such as names external to the module) are
     not rebound to refer to the new objects and must be updated in each namespace
     where they occur if that is desired.

   There are a number of other caveats:

   If a module is syntactically correct but its initialization fails, the first
   :keyword:`import` statement for it does not bind its name locally, but does
   store a (partially initialized) module object in ``sys.modules``. To reload the
   module you must first :keyword:`import` it again (this will bind the name to the
   partially initialized module object) before you can :func:`reload` it.

   When a module is reloaded, its dictionary (containing the module's global
   variables) is retained. Redefinitions of names will override the old
   definitions, so this is generally not a problem. If the new version of a module
   does not define a name that was defined by the old version, the old definition
   remains. This feature can be used to the module's advantage if it maintains a
   global table or cache of objects: with a :keyword:`try` statement it can test
   for the table's presence and skip its initialization if desired::

      try:
          cache
      except NameError:
          cache = {}

   It is legal though generally not very useful to reload built-in or dynamically
   loaded modules, except for :mod:`sys`, :mod:`__main__` and :mod:`builtins`.
   In many cases, however, extension modules are not designed to be initialized
   more than once, and may fail in arbitrary ways when reloaded.

   If a module imports objects from another module using :keyword:`from` ...
   :keyword:`import` ..., calling :func:`reload` for the other module does not
   redefine the objects imported from it. One way around this is to re-execute
   the :keyword:`from` statement; another is to use :keyword:`import` and qualified
   names (*module*.*name*) instead.

   If a module creates instances of a class, reloading the module that defines
   the class does not affect the method definitions of the instances; they
   continue to use the old class definition. The same is true for derived classes.

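Several of these caveats can be observed with a throwaway module. The sketch
below writes a hypothetical module ``reload_demo_mod`` to a temporary directory,
imports it, rewrites its source, and reloads it. The file name, its contents,
and the fallback to ``importlib.reload`` (for interpreters without :mod:`imp`)
are all assumptions made for illustration.

```python
import os
import sys
import tempfile

try:
    from imp import reload
except ImportError:
    # Assumption: where imp is unavailable, importlib.reload() behaves
    # the same way for this purpose.
    from importlib import reload

sys.dont_write_bytecode = True  # keep the demo independent of cached bytecode

OLD_SOURCE = (
    "legacy = 1\n"
    "class Greeter:\n"
    "    def hello(self):\n"
    "        return 'old'\n"
)
NEW_SOURCE = (
    "class Greeter:\n"
    "    def hello(self):\n"
    "        return 'new'\n"
)

demo_dir = tempfile.mkdtemp()
demo_path = os.path.join(demo_dir, "reload_demo_mod.py")  # hypothetical module

with open(demo_path, "w") as f:
    f.write(OLD_SOURCE)

sys.path.insert(0, demo_dir)
import reload_demo_mod
from reload_demo_mod import Greeter as ImportedGreeter  # external reference

old_instance = reload_demo_mod.Greeter()

# "Edit" the source, then reload the module in place.
with open(demo_path, "w") as f:
    f.write(NEW_SOURCE)
reloaded = reload(reload_demo_mod)

new_instance = reload_demo_mod.Greeter()
```

After the reload, ``reload_demo_mod.Greeter`` is the new class, while
``ImportedGreeter`` and ``old_instance`` still refer to the old one. The name
``legacy`` survives in the module dictionary even though the new source no
longer defines it.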

The following functions are conveniences for handling :pep:`3147` byte-compiled
file paths.

.. versionadded:: 3.2

.. function:: cache_from_source(path, debug_override=None)

   Return the :pep:`3147` path to the byte-compiled file associated with the
   source *path*. For example, if *path* is ``/foo/bar/baz.py`` the return
   value would be ``/foo/bar/__pycache__/baz.cpython-32.pyc`` for Python 3.2.
   The ``cpython-32`` string comes from the current magic tag (see
   :func:`get_tag`). The returned path will end in ``.pyc`` when
   ``__debug__`` is ``True`` or ``.pyo`` for an optimized Python
   (i.e. ``__debug__`` is ``False``). By passing in ``True`` or ``False`` for
   *debug_override* you can override the system's value for ``__debug__`` for
   extension selection.

   *path* need not exist.


.. function:: source_from_cache(path)

   Given the *path* to a :pep:`3147` file name, return the associated source code
   file path. For example, if *path* is
   ``/foo/bar/__pycache__/baz.cpython-32.pyc`` the returned path would be
   ``/foo/bar/baz.py``. *path* need not exist; however, if it does not conform
   to :pep:`3147` format, a :exc:`ValueError` is raised.


.. function:: get_tag()

   Return the :pep:`3147` magic tag string matching this version of Python's
   magic number, as returned by :func:`get_magic`.

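The relationship between these three functions can be sketched with a round
trip. The path ``pkg/mod.py`` is hypothetical and need not exist. The fallback
branch is an assumption for interpreters without :mod:`imp`: it uses the
functions of the same names in ``importlib.util`` and reads the tag from
``sys.implementation.cache_tag``.

```python
import os
import sys

try:
    from imp import cache_from_source, source_from_cache, get_tag
except ImportError:
    # Assumption: where imp is unavailable, importlib.util provides
    # functions with the same names, and sys.implementation.cache_tag
    # holds the magic tag.
    from importlib.util import cache_from_source, source_from_cache

    def get_tag():
        return sys.implementation.cache_tag

source = os.path.join("pkg", "mod.py")  # hypothetical path; need not exist
cached = cache_from_source(source)      # lives in pkg/__pycache__/
round_trip = source_from_cache(cached)  # maps back to the source path

# A byte-compiled path outside __pycache__ does not conform to PEP 3147.
try:
    source_from_cache(os.path.join("pkg", "mod.pyc"))
    rejected = False
except ValueError:
    rejected = True
```

The exact file name in ``cached`` depends on the running interpreter, which is
why the sketch compares it against :func:`get_tag` rather than a fixed string.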

The following constants with integer values, defined in this module, are used
to indicate the search result of :func:`find_module`.


.. data:: PY_SOURCE

   The module was found as a source file.


.. data:: PY_COMPILED

   The module was found as a compiled code object file.


.. data:: C_EXTENSION

   The module was found as a dynamically loadable shared library.


.. data:: PKG_DIRECTORY

   The module was found as a package directory.


.. data:: C_BUILTIN

   The module was found as a built-in module.


.. data:: PY_FROZEN

   The module was found as a frozen module.


.. class:: NullImporter(path_string)

   The :class:`NullImporter` type is a :pep:`302` import hook that handles
   non-directory path strings by failing to find any modules. Calling this type
   with an existing directory or empty string raises :exc:`ImportError`.
   Otherwise, a :class:`NullImporter` instance is returned.

   Python adds instances of this type to ``sys.path_importer_cache`` for any path
   entries that are not directories and are not handled by any other path hooks on
   ``sys.path_hooks``. Instances have only one method:


   .. method:: NullImporter.find_module(fullname [, path])

      This method always returns ``None``, indicating that the requested module could
      not be found.


.. _examples-imp:

Examples
--------

The following function emulates what was the standard import statement up to
Python 1.4 (no hierarchical module names). (This *implementation* wouldn't work
in that version, since :func:`find_module` has been extended and
:func:`load_module` has been added in 1.4.) ::

   import imp
   import sys

   def __import__(name, globals=None, locals=None, fromlist=None):
       # Fast path: see if the module has already been imported.
       try:
           return sys.modules[name]
       except KeyError:
           pass

       # If any of the following calls raises an exception,
       # there's a problem we can't handle -- let the caller handle it.

       fp, pathname, description = imp.find_module(name)

       try:
           return imp.load_module(name, fp, pathname, description)
       finally:
           # Since we may exit via an exception, close fp explicitly.
           if fp:
               fp.close()