| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 1 |  | 
 | 2 | :mod:`imp` --- Access the :keyword:`import` internals | 
 | 3 | ===================================================== | 
 | 4 |  | 
 | 5 | .. module:: imp | 
 | 6 |    :synopsis: Access the implementation of the import statement. | 
 | 7 |  | 
 | 8 |  | 
 | 9 | .. index:: statement: import | 
 | 10 |  | 
 | 11 | This module provides an interface to the mechanisms used to implement the | 
 | 12 | :keyword:`import` statement.  It defines the following constants and functions: | 
 | 13 |  | 
 | 14 |  | 
 | 15 | .. function:: get_magic() | 
 | 16 |  | 
 | 17 |    .. index:: pair: file; byte-code | 
 | 18 |  | 
 | 19 |    Return the magic string value used to recognize byte-compiled code files | 
 | 20 |    (:file:`.pyc` files).  (This value may be different for each Python version.) | 
 | 21 |  | 
 | 22 |  | 
 | 23 | .. function:: get_suffixes() | 
 | 24 |  | 
| Guido van Rossum | 04110fb | 2007-08-24 16:32:05 +0000 | [diff] [blame] | 25 |    Return a list of 3-element tuples, each describing a particular type of | 
 | 26 |    module. Each triple has the form ``(suffix, mode, type)``, where *suffix* is | 
 | 27 |    a string to be appended to the module name to form the filename to search | 
 | 28 |    for, *mode* is the mode string to pass to the built-in :func:`open` function | 
 | 29 |    to open the file (this can be ``'r'`` for text files or ``'rb'`` for binary | 
 | 30 |    files), and *type* is the file type, which has one of the values | 
 | 31 |    :const:`PY_SOURCE`, :const:`PY_COMPILED`, or :const:`C_EXTENSION`, described | 
 | 32 |    below. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 33 |  | 
 | 34 |  | 
 | 35 | .. function:: find_module(name[, path]) | 
 | 36 |  | 
| Guido van Rossum | 04110fb | 2007-08-24 16:32:05 +0000 | [diff] [blame] | 37 |    Try to find the module *name* on the search path *path*.  If *path* is a list | 
 | 38 |    of directory names, each directory is searched for files with any of the | 
 | 39 |    suffixes returned by :func:`get_suffixes` above.  Invalid names in the list | 
 | 40 |    are silently ignored (but all list items must be strings).  If *path* is | 
 | 41 |    omitted or ``None``, the list of directory names given by ``sys.path`` is | 
 | 42 |    searched, but first it searches a few special places: it tries to find a | 
 | 43 |    built-in module with the given name (:const:`C_BUILTIN`), then a frozen | 
 | 44 |    module (:const:`PY_FROZEN`), and on some systems some other places are looked | 
 | 45 |    in as well (on the Mac, it looks for a resource (:const:`PY_RESOURCE`); on | 
 | 46 |    Windows, it looks in the registry which may point to a specific file). | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 47 |  | 
| Guido van Rossum | 04110fb | 2007-08-24 16:32:05 +0000 | [diff] [blame] | 48 |    If search is successful, the return value is a 3-element tuple ``(file, | 
 | 49 |    pathname, description)``: | 
 | 50 |  | 
 | 51 |    *file* is an open file object positioned at the beginning, *pathname* is the | 
 | 52 |    pathname of the file found, and *description* is a 3-element tuple as | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 53 |    contained in the list returned by :func:`get_suffixes` describing the kind of | 
| Guido van Rossum | 04110fb | 2007-08-24 16:32:05 +0000 | [diff] [blame] | 54 |    module found. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 55 |  | 
| Guido van Rossum | 04110fb | 2007-08-24 16:32:05 +0000 | [diff] [blame] | 56 |    If the module does not live in a file, the returned *file* is ``None``, | 
 | 57 |    *pathname* is the empty string, and the *description* tuple contains empty | 
 | 58 |    strings for its suffix and mode; the module type is indicated as given in | 
 | 59 |    parentheses above.  If the search is unsuccessful, :exc:`ImportError` is | 
 | 60 |    raised.  Other exceptions indicate problems with the arguments or | 
 | 61 |    environment. | 
 | 62 |  | 
 | 63 |    If the module is a package, *file* is ``None``, *pathname* is the package | 
 | 64 |    path and the last item in the *description* tuple is :const:`PKG_DIRECTORY`. | 
 | 65 |  | 
 | 66 |    This function does not handle hierarchical module names (names containing | 
 | 67 |    dots).  In order to find *P*.*M*, that is, submodule *M* of package *P*, use | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 68 |    :func:`find_module` and :func:`load_module` to find and load package *P*, and | 
 | 69 |    then use :func:`find_module` with the *path* argument set to ``P.__path__``. | 
 | 70 |    When *P* itself has a dotted name, apply this recipe recursively. | 
 | 71 |  | 
 | 72 |  | 
| Guido van Rossum | 04110fb | 2007-08-24 16:32:05 +0000 | [diff] [blame] | 73 | .. function:: load_module(name, file, pathname, description) | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 74 |  | 
 | 75 |    Load a module that was previously found by :func:`find_module` (or by an | 
 | 76 |    otherwise conducted search yielding compatible results).  This function does | 
 | 77 |    more than importing the module: if the module was already imported, it will | 
| Guido van Rossum | 04110fb | 2007-08-24 16:32:05 +0000 | [diff] [blame] | 78 |    reload the module!  The *name* argument indicates the full | 
 | 79 |    module name (including the package name, if this is a submodule of a | 
 | 80 |    package).  The *file* argument is an open file, and *pathname* is the | 
 | 81 |    corresponding file name; these can be ``None`` and ``''``, respectively, when | 
 | 82 |    the module is a package or not being loaded from a file.  The *description* | 
 | 83 |    argument is a tuple, as would be returned by :func:`get_suffixes`, describing | 
 | 84 |    what kind of module must be loaded. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 85 |  | 
| Guido van Rossum | 04110fb | 2007-08-24 16:32:05 +0000 | [diff] [blame] | 86 |    If the load is successful, the return value is the module object; otherwise, | 
 | 87 |    an exception (usually :exc:`ImportError`) is raised. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 88 |  | 
| Guido van Rossum | 04110fb | 2007-08-24 16:32:05 +0000 | [diff] [blame] | 89 |    **Important:** the caller is responsible for closing the *file* argument, if | 
 | 90 |    it was not ``None``, even when an exception is raised.  This is best done | 
 | 91 |    using a :keyword:`try` ... :keyword:`finally` statement. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 92 |  | 
 | 93 |  | 
 | 94 | .. function:: new_module(name) | 
 | 95 |  | 
 | 96 |    Return a new empty module object called *name*.  This object is *not* inserted | 
 | 97 |    in ``sys.modules``. | 
 | 98 |  | 
 | 99 |  | 
 | 100 | .. function:: lock_held() | 
 | 101 |  | 
 | 102 |    Return ``True`` if the import lock is currently held, else ``False``. On | 
 | 103 |    platforms without threads, always return ``False``. | 
 | 104 |  | 
 | 105 |    On platforms with threads, a thread executing an import holds an internal lock | 
 | 106 |    until the import is complete. This lock blocks other threads from doing an | 
 | 107 |    import until the original import completes, which in turn prevents other threads | 
 | 108 |    from seeing incomplete module objects constructed by the original thread while | 
 | 109 |    in the process of completing its import (and the imports, if any, triggered by | 
 | 110 |    that). | 
 | 111 |  | 
 | 112 |  | 
 | 113 | .. function:: acquire_lock() | 
 | 114 |  | 
 | 115 |    Acquires the interpreter's import lock for the current thread.  This lock should | 
 | 116 |    be used by import hooks to ensure thread-safety when importing modules. On | 
 | 117 |    platforms without threads, this function does nothing. | 
 | 118 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 119 |  | 
 | 120 | .. function:: release_lock() | 
 | 121 |  | 
 | 122 |    Release the interpreter's import lock. On platforms without threads, this | 
 | 123 |    function does nothing. | 
 | 124 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 125 |  | 
| Christian Heimes | 043d6f6 | 2008-01-07 17:19:16 +0000 | [diff] [blame] | 126 | .. function:: reload(module) | 
 | 127 |  | 
 | 128 |    Reload a previously imported *module*.  The argument must be a module object, so | 
 | 129 |    it must have been successfully imported before.  This is useful if you have | 
 | 130 |    edited the module source file using an external editor and want to try out the | 
 | 131 |    new version without leaving the Python interpreter.  The return value is the | 
 | 132 |    module object (the same as the *module* argument). | 
 | 133 |  | 
 | 134 |    When ``reload(module)`` is executed: | 
 | 135 |  | 
 | 136 |    * Python modules' code is recompiled and the module-level code reexecuted, | 
 | 137 |      defining a new set of objects which are bound to names in the module's | 
 | 138 |      dictionary.  The ``init`` function of extension modules is not called a second | 
 | 139 |      time. | 
 | 140 |  | 
 | 141 |    * As with all other objects in Python the old objects are only reclaimed after | 
 | 142 |      their reference counts drop to zero. | 
 | 143 |  | 
 | 144 |    * The names in the module namespace are updated to point to any new or changed | 
 | 145 |      objects. | 
 | 146 |  | 
 | 147 |    * Other references to the old objects (such as names external to the module) are | 
 | 148 |      not rebound to refer to the new objects and must be updated in each namespace | 
 | 149 |      where they occur if that is desired. | 
 | 150 |  | 
 | 151 |    There are a number of other caveats: | 
 | 152 |  | 
 | 153 |    If a module is syntactically correct but its initialization fails, the first | 
 | 154 |    :keyword:`import` statement for it does not bind its name locally, but does | 
 | 155 |    store a (partially initialized) module object in ``sys.modules``.  To reload the | 
 | 156 |    module you must first :keyword:`import` it again (this will bind the name to the | 
 | 157 |    partially initialized module object) before you can :func:`reload` it. | 
 | 158 |  | 
 | 159 |    When a module is reloaded, its dictionary (containing the module's global | 
 | 160 |    variables) is retained.  Redefinitions of names will override the old | 
 | 161 |    definitions, so this is generally not a problem.  If the new version of a module | 
 | 162 |    does not define a name that was defined by the old version, the old definition | 
 | 163 |    remains.  This feature can be used to the module's advantage if it maintains a | 
 | 164 |    global table or cache of objects --- with a :keyword:`try` statement it can test | 
 | 165 |    for the table's presence and skip its initialization if desired:: | 
 | 166 |  | 
 | 167 |       try: | 
 | 168 |           cache | 
 | 169 |       except NameError: | 
 | 170 |           cache = {} | 
 | 171 |  | 
 | 172 |    It is legal though generally not very useful to reload built-in or dynamically | 
 | 173 |    loaded modules, except for :mod:`sys`, :mod:`__main__` and :mod:`__builtin__`. | 
 | 174 |    In many cases, however, extension modules are not designed to be initialized | 
 | 175 |    more than once, and may fail in arbitrary ways when reloaded. | 
 | 176 |  | 
 | 177 |    If a module imports objects from another module using :keyword:`from` ... | 
 | 178 |    :keyword:`import` ..., calling :func:`reload` for the other module does not | 
 | 179 |    redefine the objects imported from it --- one way around this is to re-execute | 
 | 180 |    the :keyword:`from` statement, another is to use :keyword:`import` and qualified | 
 | 181 |    names (*module*.*name*) instead. | 
 | 182 |  | 
 | 183 |    If a module instantiates instances of a class, reloading the module that defines | 
 | 184 |    the class does not affect the method definitions of the instances --- they | 
 | 185 |    continue to use the old class definition.  The same is true for derived classes. | 
 | 186 |  | 
 | 187 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 188 | The following constants with integer values, defined in this module, are used to | 
 | 189 | indicate the search result of :func:`find_module`. | 
 | 190 |  | 
 | 191 |  | 
 | 192 | .. data:: PY_SOURCE | 
 | 193 |  | 
 | 194 |    The module was found as a source file. | 
 | 195 |  | 
 | 196 |  | 
 | 197 | .. data:: PY_COMPILED | 
 | 198 |  | 
 | 199 |    The module was found as a compiled code object file. | 
 | 200 |  | 
 | 201 |  | 
 | 202 | .. data:: C_EXTENSION | 
 | 203 |  | 
 | 204 |    The module was found as dynamically loadable shared library. | 
 | 205 |  | 
 | 206 |  | 
 | 207 | .. data:: PY_RESOURCE | 
 | 208 |  | 
 | 209 |    The module was found as a Mac OS 9 resource.  This value can only be returned on | 
 | 210 |    a Mac OS 9 or earlier Macintosh. | 
 | 211 |  | 
 | 212 |  | 
 | 213 | .. data:: PKG_DIRECTORY | 
 | 214 |  | 
 | 215 |    The module was found as a package directory. | 
 | 216 |  | 
 | 217 |  | 
 | 218 | .. data:: C_BUILTIN | 
 | 219 |  | 
 | 220 |    The module was found as a built-in module. | 
 | 221 |  | 
 | 222 |  | 
 | 223 | .. data:: PY_FROZEN | 
 | 224 |  | 
 | 225 |    The module was found as a frozen module (see :func:`init_frozen`). | 
 | 226 |  | 
 | 227 | The following constant and functions are obsolete; their functionality is | 
 | 228 | available through :func:`find_module` or :func:`load_module`. They are kept | 
 | 229 | around for backward compatibility: | 
 | 230 |  | 
 | 231 |  | 
 | 232 | .. data:: SEARCH_ERROR | 
 | 233 |  | 
 | 234 |    Unused. | 
 | 235 |  | 
 | 236 |  | 
 | 237 | .. function:: init_builtin(name) | 
 | 238 |  | 
 | 239 |    Initialize the built-in module called *name* and return its module object along | 
 | 240 |    with storing it in ``sys.modules``.  If the module was already initialized, it | 
 | 241 |    will be initialized *again*.  Re-initialization involves the copying of the | 
 | 242 |    built-in module's ``__dict__`` from the cached module over the module's entry in | 
 | 243 |    ``sys.modules``.  If there is no built-in module called *name*, ``None`` is | 
 | 244 |    returned. | 
 | 245 |  | 
 | 246 |  | 
 | 247 | .. function:: init_frozen(name) | 
 | 248 |  | 
 | 249 |    Initialize the frozen module called *name* and return its module object.  If | 
 | 250 |    the module was already initialized, it will be initialized *again*.  If there | 
 | 251 |    is no frozen module called *name*, ``None`` is returned.  (Frozen modules are | 
 | 252 |    modules written in Python whose compiled byte-code object is incorporated | 
 | 253 |    into a custom-built Python interpreter by Python's :program:`freeze` | 
 | 254 |    utility. See :file:`Tools/freeze/` for now.) | 
 | 255 |  | 
 | 256 |  | 
 | 257 | .. function:: is_builtin(name) | 
 | 258 |  | 
 | 259 |    Return ``1`` if there is a built-in module called *name* which can be | 
 | 260 |    initialized again.  Return ``-1`` if there is a built-in module called *name* | 
 | 261 |    which cannot be initialized again (see :func:`init_builtin`).  Return ``0`` if | 
 | 262 |    there is no built-in module called *name*. | 
 | 263 |  | 
 | 264 |  | 
 | 265 | .. function:: is_frozen(name) | 
 | 266 |  | 
 | 267 |    Return ``True`` if there is a frozen module (see :func:`init_frozen`) called | 
 | 268 |    *name*, or ``False`` if there is no such module. | 
 | 269 |  | 
 | 270 |  | 
 | 271 | .. function:: load_compiled(name, pathname, [file]) | 
 | 272 |  | 
 | 273 |    .. index:: pair: file; byte-code | 
 | 274 |  | 
 | 275 |    Load and initialize a module implemented as a byte-compiled code file and return | 
 | 276 |    its module object.  If the module was already initialized, it will be | 
 | 277 |    initialized *again*.  The *name* argument is used to create or access a module | 
 | 278 |    object.  The *pathname* argument points to the byte-compiled code file.  The | 
 | 279 |    *file* argument is the byte-compiled code file, open for reading in binary mode, | 
 | 280 |    from the beginning. It must currently be a real file object, not a user-defined | 
 | 281 |    class emulating a file. | 
 | 282 |  | 
 | 283 |  | 
 | 284 | .. function:: load_dynamic(name, pathname[, file]) | 
 | 285 |  | 
 | 286 |    Load and initialize a module implemented as a dynamically loadable shared | 
 | 287 |    library and return its module object.  If the module was already initialized, it | 
 | 288 |    will be initialized *again*. Re-initialization involves copying the ``__dict__`` | 
 | 289 |    attribute of the cached instance of the module over the value used in the module | 
 | 290 |    cached in ``sys.modules``.  The *pathname* argument must point to the shared | 
 | 291 |    library.  The *name* argument is used to construct the name of the | 
 | 292 |    initialization function: an external C function called ``initname()`` in the | 
 | 293 |    shared library is called.  The optional *file* argument is ignored.  (Note: | 
 | 294 |    using shared libraries is highly system dependent, and not all systems support | 
 | 295 |    it.) | 
 | 296 |  | 
 | 297 |  | 
 | 298 | .. function:: load_source(name, pathname[, file]) | 
 | 299 |  | 
 | 300 |    Load and initialize a module implemented as a Python source file and return its | 
 | 301 |    module object.  If the module was already initialized, it will be initialized | 
 | 302 |    *again*.  The *name* argument is used to create or access a module object.  The | 
 | 303 |    *pathname* argument points to the source file.  The *file* argument is the | 
 | 304 |    source file, open for reading as text, from the beginning. It must currently be | 
 | 305 |    a real file object, not a user-defined class emulating a file.  Note that if a | 
 | 306 |    properly matching byte-compiled file (with suffix :file:`.pyc` or :file:`.pyo`) | 
 | 307 |    exists, it will be used instead of parsing the given source file. | 
 | 308 |  | 
 | 309 |  | 
 | 310 | .. class:: NullImporter(path_string) | 
 | 311 |  | 
 | 312 |    The :class:`NullImporter` type is a :pep:`302` import hook that handles | 
 | 313 |    non-directory path strings by failing to find any modules.  Calling this type | 
 | 314 |    with an existing directory or empty string raises :exc:`ImportError`. | 
 | 315 |    Otherwise, a :class:`NullImporter` instance is returned. | 
 | 316 |  | 
 | 317 |    Python adds instances of this type to ``sys.path_importer_cache`` for any path | 
 | 318 |    entries that are not directories and are not handled by any other path hooks on | 
 | 319 |    ``sys.path_hooks``.  Instances have only one method: | 
 | 320 |  | 
 | 321 |  | 
 | 322 |    .. method:: NullImporter.find_module(fullname [, path]) | 
 | 323 |  | 
 | 324 |       This method always returns ``None``, indicating that the requested module could | 
 | 325 |       not be found. | 
 | 326 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 327 |  | 
 | 328 | .. _examples-imp: | 
 | 329 |  | 
 | 330 | Examples | 
 | 331 | -------- | 
 | 332 |  | 
 | 333 | The following function emulates what was the standard import statement up to | 
 | 334 | Python 1.4 (no hierarchical module names).  (This *implementation* wouldn't work | 
 | 335 | in that version, since :func:`find_module` has been extended and | 
 | 336 | :func:`load_module` has been added in 1.4.) :: | 
 | 337 |  | 
 | 338 |    import imp | 
 | 339 |    import sys | 
 | 340 |  | 
 | 341 |    def __import__(name, globals=None, locals=None, fromlist=None): | 
 | 342 |        # Fast path: see if the module has already been imported. | 
 | 343 |        try: | 
 | 344 |            return sys.modules[name] | 
 | 345 |        except KeyError: | 
 | 346 |            pass | 
 | 347 |  | 
 | 348 |        # If any of the following calls raises an exception, | 
 | 349 |        # there's a problem we can't handle -- let the caller handle it. | 
 | 350 |  | 
 | 351 |        fp, pathname, description = imp.find_module(name) | 
 | 352 |  | 
 | 353 |        try: | 
 | 354 |            return imp.load_module(name, fp, pathname, description) | 
 | 355 |        finally: | 
 | 356 |            # Since we may exit via an exception, close fp explicitly. | 
 | 357 |            if fp: | 
 | 358 |                fp.close() | 
 | 359 |  | 
 | 360 | .. index:: module: knee | 
 | 361 |  | 
 | 362 | A more complete example that implements hierarchical module names and includes a | 
 | 363 | :func:`reload` function can be found in the module :mod:`knee`.  The :mod:`knee` | 
 | 364 | module can be found in :file:`Demo/imputil/` in the Python source distribution. | 
 | 365 |  |