imp.rst 15.1 KB
Newer Older
1 2 3
:mod:`imp` --- Access the :ref:`import <importsystem>` internals
================================================================

4 5
.. module:: imp
   :synopsis: Access the implementation of the import statement.
6
   :deprecated:
7

8 9
.. deprecated:: 3.4
   The :mod:`imp` package is pending deprecation in favor of :mod:`importlib`.
10 11 12 13 14 15 16 17 18 19 20 21 22 23

.. index:: statement: import

This module provides an interface to the mechanisms used to implement the
:keyword:`import` statement.  It defines the following constants and functions:


.. function:: get_magic()

   .. index:: pair: file; byte-code

   Return the magic string value used to recognize byte-compiled code files
   (:file:`.pyc` files).  (This value may be different for each Python version.)

24 25 26
   .. deprecated:: 3.4
       Use :attr:`importlib.util.MAGIC_NUMBER` instead.

27 28 29

.. function:: get_suffixes()

30 31 32 33 34 35 36 37
   Return a list of 3-element tuples, each describing a particular type of
   module. Each triple has the form ``(suffix, mode, type)``, where *suffix* is
   a string to be appended to the module name to form the filename to search
   for, *mode* is the mode string to pass to the built-in :func:`open` function
   to open the file (this can be ``'r'`` for text files or ``'rb'`` for binary
   files), and *type* is the file type, which has one of the values
   :const:`PY_SOURCE`, :const:`PY_COMPILED`, or :const:`C_EXTENSION`, described
   below.
38

39 40 41
   .. deprecated:: 3.3
      Use the constants defined on :mod:`importlib.machinery` instead.

42 43 44

.. function:: find_module(name[, path])

45 46 47 48 49 50 51 52 53 54 55
   Try to find the module *name*.  If *path* is omitted or ``None``, the list of
   directory names given by ``sys.path`` is searched, but first a few special
   places are searched: the function tries to find a built-in module with the
   given name (:const:`C_BUILTIN`), then a frozen module (:const:`PY_FROZEN`),
   and on some systems some other places are looked in as well (on Windows, it
   looks in the registry which may point to a specific file).

   Otherwise, *path* must be a list of directory names; each directory is
   searched for files with any of the suffixes returned by :func:`get_suffixes`
   above.  Invalid names in the list are silently ignored (but all list items
   must be strings).
56 57 58 59

   If search is successful, the return value is a 3-element tuple ``(file,
   pathname, description)``:

60 61
   *file* is an open :term:`file object` positioned at the beginning, *pathname*
   is the pathname of the file found, and *description* is a 3-element tuple as
62
   contained in the list returned by :func:`get_suffixes` describing the kind of
63 64 65 66 67 68 69 70 71 72 73 74 75
   module found.

   If the module does not live in a file, the returned *file* is ``None``,
   *pathname* is the empty string, and the *description* tuple contains empty
   strings for its suffix and mode; the module type is indicated as given in
   parentheses above.  If the search is unsuccessful, :exc:`ImportError` is
   raised.  Other exceptions indicate problems with the arguments or
   environment.

   If the module is a package, *file* is ``None``, *pathname* is the package
   path and the last item in the *description* tuple is :const:`PKG_DIRECTORY`.

   This function does not handle hierarchical module names (names containing
Senthil Kumaran's avatar
Senthil Kumaran committed
76
   dots).  In order to find *P.M*, that is, submodule *M* of package *P*, use
77 78 79 80
   :func:`find_module` and :func:`load_module` to find and load package *P*, and
   then use :func:`find_module` with the *path* argument set to ``P.__path__``.
   When *P* itself has a dotted name, apply this recipe recursively.

81
   .. deprecated:: 3.3
82 83
      Use :func:`importlib.util.find_spec` instead unless Python 3.3
      compatibility is required, in which case use
84 85 86
      :func:`importlib.find_loader`. For example usage of the former case,
      see the :ref:`importlib-examples` section of the :mod:`importlib`
      documentation.
87

88

89
.. function:: load_module(name, file, pathname, description)
90 91 92 93

   Load a module that was previously found by :func:`find_module` (or by an
   otherwise conducted search yielding compatible results).  This function does
   more than importing the module: if the module was already imported, it will
94 95 96 97 98 99 100 101 102 103 104 105 106 107
   reload the module!  The *name* argument indicates the full
   module name (including the package name, if this is a submodule of a
   package).  The *file* argument is an open file, and *pathname* is the
   corresponding file name; these can be ``None`` and ``''``, respectively, when
   the module is a package or not being loaded from a file.  The *description*
   argument is a tuple, as would be returned by :func:`get_suffixes`, describing
   what kind of module must be loaded.

   If the load is successful, the return value is the module object; otherwise,
   an exception (usually :exc:`ImportError`) is raised.

   **Important:** the caller is responsible for closing the *file* argument, if
   it was not ``None``, even when an exception is raised.  This is best done
   using a :keyword:`try` ... :keyword:`finally` statement.
108

109
   .. deprecated:: 3.3
110
      If previously used in conjunction with :func:`imp.find_module` then
111 112
      consider using :func:`importlib.import_module`, otherwise use the loader
      returned by the replacement you chose for :func:`imp.find_module`. If you
113 114 115 116 117 118
      called :func:`imp.load_module` and related functions directly with file
      path arguments then use a combination of
      :func:`importlib.util.spec_from_file_location` and
      :func:`importlib.util.module_from_spec`. See the :ref:`importlib-examples`
      section of the :mod:`importlib` documentation for details of the various
      approaches.
119

120 121 122 123 124 125

.. function:: new_module(name)

   Return a new empty module object called *name*.  This object is *not* inserted
   in ``sys.modules``.

126
   .. deprecated:: 3.4
127
      Use :func:`importlib.util.module_from_spec` instead.
128

129

130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170
.. function:: reload(module)

   Reload a previously imported *module*.  The argument must be a module object, so
   it must have been successfully imported before.  This is useful if you have
   edited the module source file using an external editor and want to try out the
   new version without leaving the Python interpreter.  The return value is the
   module object (the same as the *module* argument).

   When ``reload(module)`` is executed:

   * Python modules' code is recompiled and the module-level code reexecuted,
     defining a new set of objects which are bound to names in the module's
     dictionary.  The ``init`` function of extension modules is not called a second
     time.

   * As with all other objects in Python the old objects are only reclaimed after
     their reference counts drop to zero.

   * The names in the module namespace are updated to point to any new or changed
     objects.

   * Other references to the old objects (such as names external to the module) are
     not rebound to refer to the new objects and must be updated in each namespace
     where they occur if that is desired.

   There are a number of other caveats:

   When a module is reloaded, its dictionary (containing the module's global
   variables) is retained.  Redefinitions of names will override the old
   definitions, so this is generally not a problem.  If the new version of a module
   does not define a name that was defined by the old version, the old definition
   remains.  This feature can be used to the module's advantage if it maintains a
   global table or cache of objects --- with a :keyword:`try` statement it can test
   for the table's presence and skip its initialization if desired::

      try:
          cache
      except NameError:
          cache = {}

   It is legal though generally not very useful to reload built-in or dynamically
171
   loaded modules, except for :mod:`sys`, :mod:`__main__` and :mod:`builtins`.
172 173 174 175 176 177 178 179 180 181 182 183 184
   In many cases, however, extension modules are not designed to be initialized
   more than once, and may fail in arbitrary ways when reloaded.

   If a module imports objects from another module using :keyword:`from` ...
   :keyword:`import` ..., calling :func:`reload` for the other module does not
   redefine the objects imported from it --- one way around this is to re-execute
   the :keyword:`from` statement, another is to use :keyword:`import` and qualified
   names (*module*.*name*) instead.

   If a module instantiates instances of a class, reloading the module that defines
   the class does not affect the method definitions of the instances --- they
   continue to use the old class definition.  The same is true for derived classes.

185 186 187 188
   .. versionchanged:: 3.3
      Relies on both ``__name__`` and ``__loader__`` being defined on the module
      being reloaded instead of just ``__name__``.

189 190 191
   .. deprecated:: 3.4
      Use :func:`importlib.reload` instead.

192

193 194
The following functions are conveniences for handling :pep:`3147` byte-compiled
file paths.
Barry Warsaw's avatar
Barry Warsaw committed
195 196 197 198 199

.. versionadded:: 3.2

.. function:: cache_from_source(path, debug_override=None)

200
   Return the :pep:`3147` path to the byte-compiled file associated with the
Barry Warsaw's avatar
Barry Warsaw committed
201 202 203
   source *path*.  For example, if *path* is ``/foo/bar/baz.py`` the return
   value would be ``/foo/bar/__pycache__/baz.cpython-32.pyc`` for Python 3.2.
   The ``cpython-32`` string comes from the current magic tag (see
204
   :func:`get_tag`; if :attr:`sys.implementation.cache_tag` is not defined then
205 206 207
   :exc:`NotImplementedError` will be raised). By passing in ``True`` or
   ``False`` for *debug_override* you can override the system's value for
   ``__debug__``, leading to optimized bytecode.
Barry Warsaw's avatar
Barry Warsaw committed
208 209 210

   *path* need not exist.

211 212 213 214
   .. versionchanged:: 3.3
      If :attr:`sys.implementation.cache_tag` is ``None``, then
      :exc:`NotImplementedError` is raised.

215 216 217
   .. deprecated:: 3.4
      Use :func:`importlib.util.cache_from_source` instead.

218 219 220
   .. versionchanged:: 3.5
      The *debug_override* parameter no longer creates a ``.pyo`` file.

Benjamin Peterson's avatar
Benjamin Peterson committed
221

Barry Warsaw's avatar
Barry Warsaw committed
222 223
.. function:: source_from_cache(path)

224
   Given the *path* to a :pep:`3147` file name, return the associated source code
Barry Warsaw's avatar
Barry Warsaw committed
225 226 227
   file path.  For example, if *path* is
   ``/foo/bar/__pycache__/baz.cpython-32.pyc`` the returned path would be
   ``/foo/bar/baz.py``.  *path* need not exist, however if it does not conform
228 229 230 231 232 233 234
   to :pep:`3147` format, a ``ValueError`` is raised. If
   :attr:`sys.implementation.cache_tag` is not defined,
   :exc:`NotImplementedError` is raised.

   .. versionchanged:: 3.3
      Raise :exc:`NotImplementedError` when
      :attr:`sys.implementation.cache_tag` is not defined.
Barry Warsaw's avatar
Barry Warsaw committed
235

236 237 238
   .. deprecated:: 3.4
      Use :func:`importlib.util.source_from_cache` instead.

Benjamin Peterson's avatar
Benjamin Peterson committed
239

Barry Warsaw's avatar
Barry Warsaw committed
240 241
.. function:: get_tag()

242
   Return the :pep:`3147` magic tag string matching this version of Python's
Barry Warsaw's avatar
Barry Warsaw committed
243 244
   magic number, as returned by :func:`get_magic`.

245
   .. deprecated:: 3.4
246
      Use :attr:`sys.implementation.cache_tag` directly starting
247 248
      in Python 3.3.

Barry Warsaw's avatar
Barry Warsaw committed
249

250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268
The following functions help interact with the import system's internal
locking mechanism.  Locking semantics of imports are an implementation
detail which may vary from release to release.  However, Python ensures
that circular imports work without any deadlocks.


.. function:: lock_held()

   Return ``True`` if the global import lock is currently held, else
   ``False``. On platforms without threads, always return ``False``.

   On platforms with threads, a thread executing an import first holds a
   global import lock, then sets up a per-module lock for the rest of the
   import.  This blocks other threads from importing the same module until
   the original import completes, preventing other threads from seeing
   incomplete module objects constructed by the original thread.  An
   exception is made for circular imports, which by construction have to
   expose an incomplete module object at some point.

269 270 271 272
   .. versionchanged:: 3.3
      The locking scheme has changed to per-module locks for
      the most part.  A global import lock is kept for some critical tasks,
      such as initializing the per-module locks.
273

274
   .. deprecated:: 3.4
275

276

277 278 279 280 281 282 283 284 285 286 287 288
.. function:: acquire_lock()

   Acquire the interpreter's global import lock for the current thread.
   This lock should be used by import hooks to ensure thread-safety when
   importing modules.

   Once a thread has acquired the import lock, the same thread may acquire it
   again without blocking; the thread must release it once for each time it has
   acquired it.

   On platforms without threads, this function does nothing.

289 290 291 292
   .. versionchanged:: 3.3
      The locking scheme has changed to per-module locks for
      the most part.  A global import lock is kept for some critical tasks,
      such as initializing the per-module locks.
293

294
   .. deprecated:: 3.4
295

296 297 298 299 300 301

.. function:: release_lock()

   Release the interpreter's global import lock. On platforms without
   threads, this function does nothing.

302 303 304 305
   .. versionchanged:: 3.3
      The locking scheme has changed to per-module locks for
      the most part.  A global import lock is kept for some critical tasks,
      such as initializing the per-module locks.
306

307
   .. deprecated:: 3.4
308

309

Barry Warsaw's avatar
Barry Warsaw committed
310 311
The following constants with integer values, defined in this module, are used
to indicate the search result of :func:`find_module`.
312 313 314 315 316 317


.. data:: PY_SOURCE

   The module was found as a source file.

318 319
   .. deprecated:: 3.3

320 321 322 323 324

.. data:: PY_COMPILED

   The module was found as a compiled code object file.

325 326
   .. deprecated:: 3.3

327 328 329 330 331

.. data:: C_EXTENSION

   The module was found as dynamically loadable shared library.

332 333
   .. deprecated:: 3.3

334 335 336 337 338

.. data:: PKG_DIRECTORY

   The module was found as a package directory.

339 340
   .. deprecated:: 3.3

341 342 343 344 345

.. data:: C_BUILTIN

   The module was found as a built-in module.

346 347
   .. deprecated:: 3.3

348 349 350

.. data:: PY_FROZEN

351
   The module was found as a frozen module.
352

353 354
   .. deprecated:: 3.3

355 356 357 358 359 360 361 362

.. class:: NullImporter(path_string)

   The :class:`NullImporter` type is a :pep:`302` import hook that handles
   non-directory path strings by failing to find any modules.  Calling this type
   with an existing directory or empty string raises :exc:`ImportError`.
   Otherwise, a :class:`NullImporter` instance is returned.

363
   Instances have only one method:
364 365 366 367 368 369

   .. method:: NullImporter.find_module(fullname [, path])

      This method always returns ``None``, indicating that the requested module could
      not be found.

370 371 372 373
   .. versionchanged:: 3.3
      ``None`` is inserted into ``sys.path_importer_cache`` instead of an
      instance of :class:`NullImporter`.

374 375 376
   .. deprecated:: 3.4
      Insert ``None`` into ``sys.path_importer_cache`` instead.

377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408

.. _examples-imp:

Examples
--------

The following function emulates what was the standard import statement up to
Python 1.4 (no hierarchical module names).  (This *implementation* wouldn't work
in that version, since :func:`find_module` has been extended and
:func:`load_module` has been added in 1.4.) ::

   import imp
   import sys

   def __import__(name, globals=None, locals=None, fromlist=None):
       # Fast path: see if the module has already been imported.
       try:
           return sys.modules[name]
       except KeyError:
           pass

       # If any of the following calls raises an exception,
       # there's a problem we can't handle -- let the caller handle it.

       fp, pathname, description = imp.find_module(name)

       try:
           return imp.load_module(name, fp, pathname, description)
       finally:
           # Since we may exit via an exception, close fp explicitly.
           if fp:
               fp.close()