| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 1 |  | 
 | 2 | :mod:`gc` --- Garbage Collector interface | 
 | 3 | ========================================= | 
 | 4 |  | 
 | 5 | .. module:: gc | 
 | 6 |    :synopsis: Interface to the cycle-detecting garbage collector. | 
 | 7 | .. moduleauthor:: Neil Schemenauer <nas@arctrix.com> | 
 | 8 | .. sectionauthor:: Neil Schemenauer <nas@arctrix.com> | 
 | 9 |  | 
 | 10 |  | 
 | 11 | This module provides an interface to the optional garbage collector.  It | 
 | 12 | provides the ability to disable the collector, tune the collection frequency, | 
 | 13 | and set debugging options.  It also provides access to unreachable objects that | 
 | 14 | the collector found but cannot free.  Since the collector supplements the | 
 | 15 | reference counting already used in Python, you can disable the collector if you | 
 | 16 | are sure your program does not create reference cycles.  Automatic collection | 
 | 17 | can be disabled by calling ``gc.disable()``.  To debug a leaking program call | 
 | 18 | ``gc.set_debug(gc.DEBUG_LEAK)``. Notice that this includes | 
 | 19 | ``gc.DEBUG_SAVEALL``, causing garbage-collected objects to be saved in | 
 | 20 | gc.garbage for inspection. | 
 | 21 |  | 
 | 22 | The :mod:`gc` module provides the following functions: | 
 | 23 |  | 
 | 24 |  | 
 | 25 | .. function:: enable() | 
 | 26 |  | 
 | 27 |    Enable automatic garbage collection. | 
 | 28 |  | 
 | 29 |  | 
 | 30 | .. function:: disable() | 
 | 31 |  | 
 | 32 |    Disable automatic garbage collection. | 
 | 33 |  | 
 | 34 |  | 
 | 35 | .. function:: isenabled() | 
 | 36 |  | 
 | 37 |    Returns true if automatic collection is enabled. | 
 | 38 |  | 
 | 39 |  | 
 | 40 | .. function:: collect([generation]) | 
 | 41 |  | 
 | 42 |    With no arguments, run a full collection.  The optional argument *generation* | 
 | 43 |    may be an integer specifying which generation to collect (from 0 to 2).  A | 
 | 44 |    :exc:`ValueError` is raised if the generation number  is invalid. The number of | 
 | 45 |    unreachable objects found is returned. | 
 | 46 |  | 
| Georg Brandl | 2ee470f | 2008-07-16 12:55:28 +0000 | [diff] [blame] | 47 |    The free lists maintained for a number of builtin types are cleared | 
 | 48 |    whenever a full collection or collection of the highest generation (2) | 
 | 49 |    is run.  Not all items in some free lists may be freed due to the | 
 | 50 |    particular implementation, in particular :class:`float`. | 
 | 51 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 52 |  | 
 | 53 | .. function:: set_debug(flags) | 
 | 54 |  | 
 | 55 |    Set the garbage collection debugging flags. Debugging information will be | 
 | 56 |    written to ``sys.stderr``.  See below for a list of debugging flags which can be | 
 | 57 |    combined using bit operations to control debugging. | 
 | 58 |  | 
 | 59 |  | 
 | 60 | .. function:: get_debug() | 
 | 61 |  | 
 | 62 |    Return the debugging flags currently set. | 
 | 63 |  | 
 | 64 |  | 
 | 65 | .. function:: get_objects() | 
 | 66 |  | 
 | 67 |    Returns a list of all objects tracked by the collector, excluding the list | 
 | 68 |    returned. | 
 | 69 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 70 |  | 
 | 71 | .. function:: set_threshold(threshold0[, threshold1[, threshold2]]) | 
 | 72 |  | 
 | 73 |    Set the garbage collection thresholds (the collection frequency). Setting | 
 | 74 |    *threshold0* to zero disables collection. | 
 | 75 |  | 
 | 76 |    The GC classifies objects into three generations depending on how many | 
 | 77 |    collection sweeps they have survived.  New objects are placed in the youngest | 
 | 78 |    generation (generation ``0``).  If an object survives a collection it is moved | 
 | 79 |    into the next older generation.  Since generation ``2`` is the oldest | 
 | 80 |    generation, objects in that generation remain there after a collection.  In | 
 | 81 |    order to decide when to run, the collector keeps track of the number object | 
 | 82 |    allocations and deallocations since the last collection.  When the number of | 
 | 83 |    allocations minus the number of deallocations exceeds *threshold0*, collection | 
 | 84 |    starts.  Initially only generation ``0`` is examined.  If generation ``0`` has | 
 | 85 |    been examined more than *threshold1* times since generation ``1`` has been | 
 | 86 |    examined, then generation ``1`` is examined as well.  Similarly, *threshold2* | 
 | 87 |    controls the number of collections of generation ``1`` before collecting | 
 | 88 |    generation ``2``. | 
 | 89 |  | 
 | 90 |  | 
 | 91 | .. function:: get_count() | 
 | 92 |  | 
 | 93 |    Return the current collection  counts as a tuple of ``(count0, count1, | 
 | 94 |    count2)``. | 
 | 95 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 96 |  | 
 | 97 | .. function:: get_threshold() | 
 | 98 |  | 
 | 99 |    Return the current collection thresholds as a tuple of ``(threshold0, | 
 | 100 |    threshold1, threshold2)``. | 
 | 101 |  | 
 | 102 |  | 
 | 103 | .. function:: get_referrers(*objs) | 
 | 104 |  | 
 | 105 |    Return the list of objects that directly refer to any of objs. This function | 
 | 106 |    will only locate those containers which support garbage collection; extension | 
 | 107 |    types which do refer to other objects but do not support garbage collection will | 
 | 108 |    not be found. | 
 | 109 |  | 
 | 110 |    Note that objects which have already been dereferenced, but which live in cycles | 
 | 111 |    and have not yet been collected by the garbage collector can be listed among the | 
 | 112 |    resulting referrers.  To get only currently live objects, call :func:`collect` | 
 | 113 |    before calling :func:`get_referrers`. | 
 | 114 |  | 
 | 115 |    Care must be taken when using objects returned by :func:`get_referrers` because | 
 | 116 |    some of them could still be under construction and hence in a temporarily | 
 | 117 |    invalid state. Avoid using :func:`get_referrers` for any purpose other than | 
 | 118 |    debugging. | 
 | 119 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 120 |  | 
 | 121 | .. function:: get_referents(*objs) | 
 | 122 |  | 
 | 123 |    Return a list of objects directly referred to by any of the arguments. The | 
 | 124 |    referents returned are those objects visited by the arguments' C-level | 
 | 125 |    :attr:`tp_traverse` methods (if any), and may not be all objects actually | 
 | 126 |    directly reachable.  :attr:`tp_traverse` methods are supported only by objects | 
 | 127 |    that support garbage collection, and are only required to visit objects that may | 
 | 128 |    be involved in a cycle.  So, for example, if an integer is directly reachable | 
 | 129 |    from an argument, that integer object may or may not appear in the result list. | 
 | 130 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 131 |  | 
| Antoine Pitrou | 3a652b1 | 2009-03-23 18:52:06 +0000 | [diff] [blame] | 132 | .. function:: is_tracked(obj) | 
 | 133 |  | 
 | 134 |    Returns True if the object is currently tracked by the garbage collector, | 
 | 135 |    False otherwise.  As a general rule, instances of atomic types aren't | 
 | 136 |    tracked and instances of non-atomic types (containers, user-defined | 
 | 137 |    objects...) are.  However, some type-specific optimizations can be present | 
 | 138 |    in order to suppress the garbage collector footprint of simple instances | 
 | 139 |    (e.g. dicts containing only atomic keys and values):: | 
 | 140 |  | 
 | 141 |       >>> gc.is_tracked(0) | 
 | 142 |       False | 
 | 143 |       >>> gc.is_tracked("a") | 
 | 144 |       False | 
 | 145 |       >>> gc.is_tracked([]) | 
 | 146 |       True | 
 | 147 |       >>> gc.is_tracked({}) | 
 | 148 |       False | 
 | 149 |       >>> gc.is_tracked({"a": 1}) | 
 | 150 |       False | 
 | 151 |       >>> gc.is_tracked({"a": []}) | 
 | 152 |       True | 
 | 153 |  | 
 | 154 |    .. versionadded:: 2.7 | 
 | 155 |  | 
 | 156 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 157 | The following variable is provided for read-only access (you can mutate its | 
 | 158 | value but should not rebind it): | 
 | 159 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 160 | .. data:: garbage | 
 | 161 |  | 
 | 162 |    A list of objects which the collector found to be unreachable but could not be | 
 | 163 |    freed (uncollectable objects).  By default, this list contains only objects with | 
| Amaury Forgeot d'Arc | ad8dcd5 | 2007-12-10 23:58:35 +0000 | [diff] [blame] | 164 |    :meth:`__del__` methods. Objects that have :meth:`__del__` methods and are | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 165 |    part of a reference cycle cause the entire reference cycle to be uncollectable, | 
 | 166 |    including objects not necessarily in the cycle but reachable only from it. | 
 | 167 |    Python doesn't collect such cycles automatically because, in general, it isn't | 
 | 168 |    possible for Python to guess a safe order in which to run the :meth:`__del__` | 
 | 169 |    methods.  If you know a safe order, you can force the issue by examining the | 
 | 170 |    *garbage* list, and explicitly breaking cycles due to your objects within the | 
 | 171 |    list.  Note that these objects are kept alive even so by virtue of being in the | 
 | 172 |    *garbage* list, so they should be removed from *garbage* too.  For example, | 
 | 173 |    after breaking cycles, do ``del gc.garbage[:]`` to empty the list.  It's | 
 | 174 |    generally better to avoid the issue by not creating cycles containing objects | 
 | 175 |    with :meth:`__del__` methods, and *garbage* can be examined in that case to | 
 | 176 |    verify that no such cycles are being created. | 
 | 177 |  | 
 | 178 |    If :const:`DEBUG_SAVEALL` is set, then all unreachable objects will be added to | 
 | 179 |    this list rather than freed. | 
 | 180 |  | 
 | 181 | The following constants are provided for use with :func:`set_debug`: | 
 | 182 |  | 
 | 183 |  | 
 | 184 | .. data:: DEBUG_STATS | 
 | 185 |  | 
 | 186 |    Print statistics during collection.  This information can be useful when tuning | 
 | 187 |    the collection frequency. | 
 | 188 |  | 
 | 189 |  | 
 | 190 | .. data:: DEBUG_COLLECTABLE | 
 | 191 |  | 
 | 192 |    Print information on collectable objects found. | 
 | 193 |  | 
 | 194 |  | 
 | 195 | .. data:: DEBUG_UNCOLLECTABLE | 
 | 196 |  | 
 | 197 |    Print information of uncollectable objects found (objects which are not | 
 | 198 |    reachable but cannot be freed by the collector).  These objects will be added to | 
 | 199 |    the ``garbage`` list. | 
 | 200 |  | 
 | 201 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 202 | .. data:: DEBUG_SAVEALL | 
 | 203 |  | 
 | 204 |    When set, all unreachable objects found will be appended to *garbage* rather | 
 | 205 |    than being freed.  This can be useful for debugging a leaking program. | 
 | 206 |  | 
 | 207 |  | 
 | 208 | .. data:: DEBUG_LEAK | 
 | 209 |  | 
 | 210 |    The debugging flags necessary for the collector to print information about a | 
 | 211 |    leaking program (equal to ``DEBUG_COLLECTABLE | DEBUG_UNCOLLECTABLE | | 
| Amaury Forgeot d'Arc | ad8dcd5 | 2007-12-10 23:58:35 +0000 | [diff] [blame] | 212 |    DEBUG_SAVEALL``). | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 213 |  | 
 | 214 | .. rubric:: Footnotes |