:mod:`imp` --- Access the :keyword:`import` internals
=====================================================

.. module:: imp
   :synopsis: Access the implementation of the import statement.


.. index:: statement: import

This module provides an interface to the mechanisms used to implement the
:keyword:`import` statement.  It defines the following constants and functions:


.. function:: get_magic()

   .. index:: pair: file; byte-code

   Return the magic number, a :class:`bytes` value used to recognize
   byte-compiled code files (:file:`.pyc` files).  This value may be different
   for each Python version.

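As a minimal sketch of how the magic number is used, the following compiles a
throwaway source file and compares the start of the resulting :file:`.pyc`
file with the interpreter's magic number.  The file names are hypothetical, and
the fallback to ``importlib.util.MAGIC_NUMBER`` is an assumption for
interpreters that provide that constant.

```python
import os
import py_compile
import tempfile

try:
    # Assumption: newer interpreters expose the same value here.
    from importlib.util import MAGIC_NUMBER as magic
except ImportError:
    import imp
    magic = imp.get_magic()

# Hypothetical scratch files, created only for this demonstration.
tmpdir = tempfile.mkdtemp()
source = os.path.join(tmpdir, "magic_demo.py")
compiled = os.path.join(tmpdir, "magic_demo.pyc")
with open(source, "w") as f:
    f.write("x = 1\n")

# Byte-compile the source to an explicit target path.
py_compile.compile(source, cfile=compiled, doraise=True)

# A byte-compiled file starts with the magic number.
with open(compiled, "rb") as f:
    header = f.read(len(magic))

print(header == magic)
```

A mismatch between the two values is how the interpreter recognizes a
byte-compiled file written by a different Python version.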

.. function:: get_suffixes()

   Return a list of 3-element tuples, each describing a particular type of
   module. Each triple has the form ``(suffix, mode, type)``, where *suffix* is
   a string to be appended to the module name to form the filename to search
   for, *mode* is the mode string to pass to the built-in :func:`open` function
   to open the file (this can be ``'r'`` for text files or ``'rb'`` for binary
   files), and *type* is the file type, which has one of the values
   :const:`PY_SOURCE`, :const:`PY_COMPILED`, or :const:`C_EXTENSION`, described
   below.


.. function:: find_module(name[, path])

   Try to find the module *name*.  If *path* is omitted or ``None``, the
   function first searches a few special places: it looks for a built-in module
   with the given name (:const:`C_BUILTIN`), then for a frozen module
   (:const:`PY_FROZEN`), and on some systems in other places as well (on
   Windows, it looks in the registry, which may point to a specific file).
   After that, it searches the list of directory names given by ``sys.path``.

   Otherwise, *path* must be a list of directory names; each directory is
   searched for files with any of the suffixes returned by :func:`get_suffixes`
   above.  Invalid names in the list are silently ignored (but all list items
   must be strings).

   If the search is successful, the return value is a 3-element tuple ``(file,
   pathname, description)``:

   *file* is an open :term:`file object` positioned at the beginning, *pathname*
   is the pathname of the file found, and *description* is a 3-element tuple as
   contained in the list returned by :func:`get_suffixes` describing the kind of
   module found.

   If the module does not live in a file, the returned *file* is ``None``,
   *pathname* is the empty string, and the *description* tuple contains empty
   strings for its suffix and mode; the module type is indicated as given in
   parentheses above.  If the search is unsuccessful, :exc:`ImportError` is
   raised.  Other exceptions indicate problems with the arguments or
   environment.

   If the module is a package, *file* is ``None``, *pathname* is the package
   path and the last item in the *description* tuple is :const:`PKG_DIRECTORY`.

   This function does not handle hierarchical module names (names containing
   dots).  In order to find *P.M*, that is, submodule *M* of package *P*, use
   :func:`find_module` and :func:`load_module` to find and load package *P*, and
   then use :func:`find_module` with the *path* argument set to ``P.__path__``.
   When *P* itself has a dotted name, apply this recipe recursively.

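The recipe for hierarchical names can be sketched as a loop over the dotted
components.  This is an illustration under stated assumptions, not a
replacement for the :keyword:`import` statement: ``import_dotted`` and the
package ``imp_demo_pkg`` are hypothetical names, and the import of :mod:`imp`
is guarded because the module was removed in Python 3.12.

```python
import os
import sys
import tempfile

try:
    import imp  # assumption: this interpreter still ships the imp module
except ImportError:
    imp = None


def import_dotted(name):
    """Find and load *name* one dotted component at a time."""
    path = None  # None: search the special places, then sys.path
    module = None
    parts = name.split(".")
    for index, part in enumerate(parts):
        full_name = ".".join(parts[:index + 1])
        if full_name in sys.modules:
            # load_module() would reload an already imported module.
            module = sys.modules[full_name]
        else:
            fp, pathname, description = imp.find_module(part, path)
            try:
                module = imp.load_module(full_name, fp, pathname, description)
            finally:
                # The caller is responsible for closing the file.
                if fp:
                    fp.close()
        # Submodules are searched for in the package's __path__; a plain
        # module has none, so the next lookup fails with ImportError.
        path = list(getattr(module, "__path__", []))
    return module


# Hypothetical package written to a temporary directory for the demo.
root = tempfile.mkdtemp()
package_dir = os.path.join(root, "imp_demo_pkg")
os.mkdir(package_dir)
with open(os.path.join(package_dir, "__init__.py"), "w") as f:
    f.write("")
with open(os.path.join(package_dir, "leaf.py"), "w") as f:
    f.write("VALUE = 42\n")
sys.path.insert(0, root)

leaf = import_dotted("imp_demo_pkg.leaf") if imp is not None else None
```

Checking ``sys.modules`` first matters here because :func:`load_module`
reloads a module that has already been imported.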

.. function:: load_module(name, file, pathname, description)

   Load a module that was previously found by :func:`find_module` (or by an
   otherwise conducted search yielding compatible results).  This function does
   more than import the module: if the module was already imported, it
   reloads the module.  The *name* argument indicates the full
   module name (including the package name, if this is a submodule of a
   package).  The *file* argument is an open file, and *pathname* is the
   corresponding file name; these can be ``None`` and ``''``, respectively, when
   the module is a package or not being loaded from a file.  The *description*
   argument is a tuple, as would be returned by :func:`get_suffixes`, describing
   what kind of module must be loaded.

   If the load is successful, the return value is the module object; otherwise,
   an exception (usually :exc:`ImportError`) is raised.

   **Important:** the caller is responsible for closing the *file* argument, if
   it was not ``None``, even when an exception is raised.  This is best done
   using a :keyword:`try` ... :keyword:`finally` statement.


.. function:: new_module(name)

   Return a new empty module object called *name*.  This object is *not* inserted
   in ``sys.modules``.

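The following sketch illustrates that a freshly created module object is not
registered automatically.  It assumes that ``types.ModuleType(name)`` builds
the same kind of object as ``imp.new_module(name)``, so that the example does
not depend on :mod:`imp` being available; ``scratch_demo`` is a hypothetical
module name.

```python
import sys
import types

# Assumption: types.ModuleType(name) stands in for imp.new_module(name).
scratch = types.ModuleType("scratch_demo")
scratch.answer = 42

# The new module exists, but the import system does not know about it yet.
registered_before = "scratch_demo" in sys.modules

# Registering it by hand makes a later import statement return it.
sys.modules["scratch_demo"] = scratch
import scratch_demo

print(registered_before, scratch_demo is scratch)
```

Registering the object in ``sys.modules`` is a deliberate, separate step; code
that only needs a namespace object can skip it.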

.. function:: lock_held()

   Return ``True`` if the import lock is currently held, else ``False``. On
   platforms without threads, always return ``False``.

   On platforms with threads, a thread executing an import holds an internal lock
   until the import is complete. This lock blocks other threads from doing an
   import until the original import completes, which in turn prevents other threads
   from seeing incomplete module objects constructed by the original thread while
   it completes its import (and any imports triggered by it).


.. function:: acquire_lock()

   Acquire the interpreter's import lock for the current thread.  This lock should
   be used by import hooks to ensure thread-safety when importing modules.

   Once a thread has acquired the import lock, the same thread may acquire it
   again without blocking; the thread must release it once for each time it has
   acquired it.

   On platforms without threads, this function does nothing.


.. function:: release_lock()

   Release the interpreter's import lock. On platforms without threads, this
   function does nothing.

| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 130 |  | 
| Christian Heimes | 043d6f6 | 2008-01-07 17:19:16 +0000 | [diff] [blame] | 131 | .. function:: reload(module) | 
 | 132 |  | 
 | 133 |    Reload a previously imported *module*.  The argument must be a module object, so | 
 | 134 |    it must have been successfully imported before.  This is useful if you have | 
 | 135 |    edited the module source file using an external editor and want to try out the | 
 | 136 |    new version without leaving the Python interpreter.  The return value is the | 
 | 137 |    module object (the same as the *module* argument). | 
 | 138 |  | 
 | 139 |    When ``reload(module)`` is executed: | 
 | 140 |  | 
 | 141 |    * Python modules' code is recompiled and the module-level code reexecuted, | 
 | 142 |      defining a new set of objects which are bound to names in the module's | 
 | 143 |      dictionary.  The ``init`` function of extension modules is not called a second | 
 | 144 |      time. | 
 | 145 |  | 
 | 146 |    * As with all other objects in Python the old objects are only reclaimed after | 
 | 147 |      their reference counts drop to zero. | 
 | 148 |  | 
 | 149 |    * The names in the module namespace are updated to point to any new or changed | 
 | 150 |      objects. | 
 | 151 |  | 
 | 152 |    * Other references to the old objects (such as names external to the module) are | 
 | 153 |      not rebound to refer to the new objects and must be updated in each namespace | 
 | 154 |      where they occur if that is desired. | 
 | 155 |  | 
 | 156 |    There are a number of other caveats: | 
 | 157 |  | 
 | 158 |    If a module is syntactically correct but its initialization fails, the first | 
 | 159 |    :keyword:`import` statement for it does not bind its name locally, but does | 
 | 160 |    store a (partially initialized) module object in ``sys.modules``.  To reload the | 
 | 161 |    module you must first :keyword:`import` it again (this will bind the name to the | 
 | 162 |    partially initialized module object) before you can :func:`reload` it. | 
 | 163 |  | 
 | 164 |    When a module is reloaded, its dictionary (containing the module's global | 
 | 165 |    variables) is retained.  Redefinitions of names will override the old | 
 | 166 |    definitions, so this is generally not a problem.  If the new version of a module | 
 | 167 |    does not define a name that was defined by the old version, the old definition | 
 | 168 |    remains.  This feature can be used to the module's advantage if it maintains a | 
 | 169 |    global table or cache of objects --- with a :keyword:`try` statement it can test | 
 | 170 |    for the table's presence and skip its initialization if desired:: | 
 | 171 |  | 
 | 172 |       try: | 
 | 173 |           cache | 
 | 174 |       except NameError: | 
 | 175 |           cache = {} | 
 | 176 |  | 
 | 177 |    It is legal though generally not very useful to reload built-in or dynamically | 
 | 178 |    loaded modules, except for :mod:`sys`, :mod:`__main__` and :mod:`__builtin__`. | 
 | 179 |    In many cases, however, extension modules are not designed to be initialized | 
 | 180 |    more than once, and may fail in arbitrary ways when reloaded. | 
 | 181 |  | 
 | 182 |    If a module imports objects from another module using :keyword:`from` ... | 
 | 183 |    :keyword:`import` ..., calling :func:`reload` for the other module does not | 
 | 184 |    redefine the objects imported from it --- one way around this is to re-execute | 
 | 185 |    the :keyword:`from` statement, another is to use :keyword:`import` and qualified | 
 | 186 |    names (*module*.*name*) instead. | 
 | 187 |  | 
 | 188 |    If a module instantiates instances of a class, reloading the module that defines | 
 | 189 |    the class does not affect the method definitions of the instances --- they | 
 | 190 |    continue to use the old class definition.  The same is true for derived classes. | 
 | 191 |  | 
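The reload behaviour described above can be observed with a small throwaway
module.  This is a sketch under stated assumptions: ``reload_demo`` and its
contents are hypothetical, bytecode caching is switched off so that the
rewritten source is always recompiled, and ``reload`` is taken from
:mod:`importlib` where available, falling back to :mod:`imp` otherwise.

```python
import os
import sys
import tempfile

try:
    from importlib import reload  # assumption: newer interpreters
except ImportError:
    from imp import reload  # the function documented above

# Assumption for the demo: skip .pyc files so that the rewritten source is
# always recompiled, even when both versions are written within one second.
sys.dont_write_bytecode = True

VERSION_1 = '''\
try:
    cache
except NameError:
    cache = {}

leftover = 1

class Greeter:
    def hello(self):
        return "old"
'''
# Version 2 drops "leftover" and changes the method body.
VERSION_2 = VERSION_1.replace("leftover = 1\n", "").replace('"old"', '"new"')


def write_source(path, text):
    with open(path, "w") as f:
        f.write(text)


demo_dir = tempfile.mkdtemp()
demo_file = os.path.join(demo_dir, "reload_demo.py")
sys.path.insert(0, demo_dir)

write_source(demo_file, VERSION_1)
import reload_demo
from reload_demo import Greeter as OldGreeter  # external name for the old class

reload_demo.cache["key"] = "value"
instance = reload_demo.Greeter()

write_source(demo_file, VERSION_2)
result = reload(reload_demo)

print(reload_demo.Greeter().hello())  # class looked up through the module
print(instance.hello())               # instance created before the reload
print(reload_demo.leftover, reload_demo.cache)
```

After the reload, the name looked up through the module refers to the new
class, while ``OldGreeter`` and ``instance`` still use the old definition.
``leftover`` and the populated ``cache`` survive because the module dictionary
is retained.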

The following functions are conveniences for handling :pep:`3147` byte-compiled
file paths.

.. versionadded:: 3.2

.. function:: cache_from_source(path, debug_override=None)

   Return the :pep:`3147` path to the byte-compiled file associated with the
   source *path*.  For example, if *path* is ``/foo/bar/baz.py`` the return
   value would be ``/foo/bar/__pycache__/baz.cpython-32.pyc`` for Python 3.2.
   The ``cpython-32`` string comes from the current magic tag (see
   :func:`get_tag`).  The returned path will end in ``.pyc`` when
   ``__debug__`` is ``True`` or ``.pyo`` for an optimized Python
   (i.e. ``__debug__`` is ``False``).  By passing in ``True`` or ``False`` for
   *debug_override* you can override the system's value for ``__debug__`` for
   extension selection.

   *path* need not exist.


.. function:: source_from_cache(path)

   Given the *path* to a :pep:`3147` file name, return the associated source code
   file path.  For example, if *path* is
   ``/foo/bar/__pycache__/baz.cpython-32.pyc`` the returned path would be
   ``/foo/bar/baz.py``.  *path* need not exist; however, if it does not conform
   to :pep:`3147` format, a :exc:`ValueError` is raised.


.. function:: get_tag()

   Return the :pep:`3147` magic tag string matching this version of Python's
   magic number, as returned by :func:`get_magic`.

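A short sketch of how the three path helpers relate to each other follows.  It
assumes a default configuration in which cache files live in a
:file:`__pycache__` directory next to the source, and it takes the functions
from :mod:`importlib.util` and the tag from ``sys.implementation.cache_tag``
where those exist, falling back to :mod:`imp` otherwise.  The path
``pkg/baz.py`` is hypothetical and is never created.

```python
import os
import sys

try:
    # Assumption: newer interpreters provide the same helpers here.
    from importlib.util import cache_from_source, source_from_cache
except ImportError:
    from imp import cache_from_source, source_from_cache

try:
    tag = sys.implementation.cache_tag
except AttributeError:
    import imp
    tag = imp.get_tag()

source = os.path.join("pkg", "baz.py")  # hypothetical; need not exist
cached = cache_from_source(source)
round_trip = source_from_cache(cached)

# A path that is not in PEP 3147 format is rejected.
try:
    source_from_cache(source)
except ValueError:
    rejected = True
else:
    rejected = False

print(cached)
print(round_trip == source, rejected)
```

The printed cache path depends on the running interpreter, since the magic tag
is part of the file name.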

The following constants with integer values, defined in this module, are used
to indicate the search result of :func:`find_module`.


.. data:: PY_SOURCE

   The module was found as a source file.


.. data:: PY_COMPILED

   The module was found as a compiled code object file.


.. data:: C_EXTENSION

   The module was found as a dynamically loadable shared library.


.. data:: PKG_DIRECTORY

   The module was found as a package directory.


.. data:: C_BUILTIN

   The module was found as a built-in module.


.. data:: PY_FROZEN

   The module was found as a frozen module.


.. class:: NullImporter(path_string)

   The :class:`NullImporter` type is a :pep:`302` import hook that handles
   non-directory path strings by failing to find any modules.  Calling this type
   with an existing directory or empty string raises :exc:`ImportError`.
   Otherwise, a :class:`NullImporter` instance is returned.

   Python adds instances of this type to ``sys.path_importer_cache`` for any path
   entries that are not directories and are not handled by any other path hooks on
   ``sys.path_hooks``.  Instances have only one method:


   .. method:: NullImporter.find_module(fullname [, path])

      This method always returns ``None``, indicating that the requested module could
      not be found.


.. _examples-imp:

Examples
--------

The following function emulates what was the standard import statement up to
Python 1.4 (no hierarchical module names).  (This *implementation* wouldn't work
in that version, since :func:`find_module` has been extended and
:func:`load_module` has been added in 1.4.) ::

   import imp
   import sys

   def __import__(name, globals=None, locals=None, fromlist=None):
       # Fast path: see if the module has already been imported.
       try:
           return sys.modules[name]
       except KeyError:
           pass

       # If any of the following calls raises an exception,
       # there's a problem we can't handle -- let the caller handle it.

       fp, pathname, description = imp.find_module(name)

       try:
           return imp.load_module(name, fp, pathname, description)
       finally:
           # Since we may exit via an exception, close fp explicitly.
           if fp:
               fp.close()