blob: d847d6b79227cbc79ec298ba74e3b99ac216ea14 [file] [log] [blame]
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001:mod:`collections` --- Container datatypes
2==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00003
4.. module:: collections
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07005 :synopsis: Container datatypes
Terry Jan Reedyfa089b92016-06-11 15:02:54 -04006
Georg Brandl116aa622007-08-15 14:28:22 +00007.. moduleauthor:: Raymond Hettinger <python@rcn.com>
8.. sectionauthor:: Raymond Hettinger <python@rcn.com>
9
Terry Jan Reedyfa089b92016-06-11 15:02:54 -040010**Source code:** :source:`Lib/collections/__init__.py`
11
Christian Heimesfe337bf2008-03-23 21:54:12 +000012.. testsetup:: *
13
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070014 from collections import *
15 import itertools
16 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000017
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000018--------------
19
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000020This module implements specialized container datatypes providing alternatives to
21Python's general purpose built-in containers, :class:`dict`, :class:`list`,
22:class:`set`, and :class:`tuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000023
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000024===================== ====================================================================
25:func:`namedtuple` factory function for creating tuple subclasses with named fields
26:class:`deque` list-like container with fast appends and pops on either end
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000027:class:`ChainMap` dict-like class for creating a single view of multiple mappings
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000028:class:`Counter` dict subclass for counting hashable objects
29:class:`OrderedDict` dict subclass that remembers the order entries were added
30:class:`defaultdict` dict subclass that calls a factory function to supply missing values
31:class:`UserDict` wrapper around dictionary objects for easier dict subclassing
32:class:`UserList` wrapper around list objects for easier list subclassing
33:class:`UserString` wrapper around string objects for easier string subclassing
34===================== ====================================================================
Georg Brandl116aa622007-08-15 14:28:22 +000035
Raymond Hettinger158c9c22011-02-22 00:41:50 +000036.. versionchanged:: 3.3
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070037 Moved :ref:`collections-abstract-base-classes` to the :mod:`collections.abc` module.
Raymond Hettingere6d34212018-01-29 08:27:49 -080038 For backwards compatibility, they continue to be visible in this module through
39 Python 3.7. Subsequently, they will be removed entirely.
Mark Summerfield08898b42007-09-05 08:43:04 +000040
41
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000042:class:`ChainMap` objects
43-------------------------
44
Georg Brandl283b96b2012-04-03 09:16:46 +020045.. versionadded:: 3.3
46
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000047A :class:`ChainMap` class is provided for quickly linking a number of mappings
48so they can be treated as a single unit. It is often much faster than creating
49a new dictionary and running multiple :meth:`~dict.update` calls.
50
51The class can be used to simulate nested scopes and is useful in templating.
52
53.. class:: ChainMap(*maps)
54
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070055 A :class:`ChainMap` groups multiple dicts or other mappings together to
56 create a single, updateable view. If no *maps* are specified, a single empty
57 dictionary is provided so that a new chain always has at least one mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000058
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070059 The underlying mappings are stored in a list. That list is public and can
Martin Panter8d56c022016-05-29 04:13:35 +000060 be accessed or updated using the *maps* attribute. There is no other state.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000061
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070062 Lookups search the underlying mappings successively until a key is found. In
63 contrast, writes, updates, and deletions only operate on the first mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000064
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070065 A :class:`ChainMap` incorporates the underlying mappings by reference. So, if
66 one of the underlying mappings gets updated, those changes will be reflected
67 in :class:`ChainMap`.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000068
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070069 All of the usual dictionary methods are supported. In addition, there is a
70 *maps* attribute, a method for creating new subcontexts, and a property for
71 accessing all but the first mapping:
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000072
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070073 .. attribute:: maps
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000074
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070075 A user updateable list of mappings. The list is ordered from
76 first-searched to last-searched. It is the only stored state and can
77 be modified to change which mappings are searched. The list should
78 always contain at least one mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000079
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000080 .. method:: new_child(m=None)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000081
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000082 Returns a new :class:`ChainMap` containing a new map followed by
83 all of the maps in the current instance. If ``m`` is specified,
84 it becomes the new map at the front of the list of mappings; if not
85 specified, an empty dict is used, so that a call to ``d.new_child()``
86 is equivalent to: ``ChainMap({}, *d.maps)``. This method is used for
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070087 creating subcontexts that can be updated without altering values in any
88 of the parent mappings.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000089
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000090 .. versionchanged:: 3.4
91 The optional ``m`` parameter was added.
92
Raymond Hettinger2a61c452012-07-15 22:37:20 -070093 .. attribute:: parents
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000094
Raymond Hettingerb22ba042012-07-16 02:07:41 -070095 Property returning a new :class:`ChainMap` containing all of the maps in
Raymond Hettinger2a61c452012-07-15 22:37:20 -070096 the current instance except the first one. This is useful for skipping
97 the first map in the search. Use cases are similar to those for the
98 :keyword:`nonlocal` keyword used in :term:`nested scopes <nested
99 scope>`. The use cases also parallel those for the built-in
100 :func:`super` function. A reference to ``d.parents`` is equivalent to:
101 ``ChainMap(*d.maps[1:])``.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000102
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700103
104.. seealso::
105
106 * The `MultiContext class
Sandro Tosiea475302012-08-12 10:37:23 +0200107 <https://github.com/enthought/codetools/blob/4.0.0/codetools/contexts/multi_context.py>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700108 in the Enthought `CodeTools package
109 <https://github.com/enthought/codetools>`_ has options to support
110 writing to any mapping in the chain.
111
112 * Django's `Context class
Georg Brandl525d3552014-10-29 10:26:56 +0100113 <https://github.com/django/django/blob/master/django/template/context.py>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700114 for templating is a read-only chain of mappings. It also features
115 pushing and popping of contexts similar to the
116 :meth:`~collections.ChainMap.new_child` method and the
Michael Seiferte1052942018-03-26 13:40:35 +0200117 :attr:`~collections.ChainMap.parents` property.
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700118
119 * The `Nested Contexts recipe
Serhiy Storchaka6dff0202016-05-07 10:49:07 +0300120 <https://code.activestate.com/recipes/577434/>`_ has options to control
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700121 whether writes and other mutations apply only to the first mapping or to
122 any mapping in the chain.
123
124 * A `greatly simplified read-only version of Chainmap
Serhiy Storchaka6dff0202016-05-07 10:49:07 +0300125 <https://code.activestate.com/recipes/305268/>`_.
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700126
127
128:class:`ChainMap` Examples and Recipes
129^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
130
131This section shows various approaches to working with chained maps.
132
133
134Example of simulating Python's internal lookup chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000135
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700136 import builtins
137 pylookup = ChainMap(locals(), globals(), vars(builtins))
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000138
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700139Example of letting user specified command-line arguments take precedence over
140environment variables which in turn take precedence over default values::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000141
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700142 import os, argparse
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700143
144 defaults = {'color': 'red', 'user': 'guest'}
145
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700146 parser = argparse.ArgumentParser()
147 parser.add_argument('-u', '--user')
148 parser.add_argument('-c', '--color')
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700149 namespace = parser.parse_args()
150 command_line_args = {k:v for k, v in vars(namespace).items() if v}
151
152 combined = ChainMap(command_line_args, os.environ, defaults)
153 print(combined['color'])
154 print(combined['user'])
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000155
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700156Example patterns for using the :class:`ChainMap` class to simulate nested
157contexts::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000158
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700159 c = ChainMap() # Create root context
160 d = c.new_child() # Create nested child context
161 e = c.new_child() # Child of c, independent from d
162 e.maps[0] # Current context dictionary -- like Python's locals()
163 e.maps[-1] # Root context -- like Python's globals()
164 e.parents # Enclosing context chain -- like Python's nonlocals
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000165
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700166 d['x'] = 1 # Set value in current context
Raymond Hettinger9b0c6812019-02-20 13:49:23 -0800167 d['x'] # Get first key in the chain of contexts
Andrew Svetlov1a8db9c2012-10-04 19:29:25 +0300168 del d['x'] # Delete from current context
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700169 list(d) # All nested values
170 k in d # Check all nested values
171 len(d) # Number of nested values
172 d.items() # All nested items
173 dict(d) # Flatten into a regular dictionary
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000174
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700175The :class:`ChainMap` class only makes updates (writes and deletions) to the
176first mapping in the chain while lookups will search the full chain. However,
177if deep writes and deletions are desired, it is easy to make a subclass that
178updates keys found deeper in the chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000179
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700180 class DeepChainMap(ChainMap):
181 'Variant of ChainMap that allows direct updates to inner scopes'
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000182
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700183 def __setitem__(self, key, value):
184 for mapping in self.maps:
185 if key in mapping:
186 mapping[key] = value
187 return
188 self.maps[0][key] = value
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000189
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700190 def __delitem__(self, key):
191 for mapping in self.maps:
192 if key in mapping:
193 del mapping[key]
194 return
195 raise KeyError(key)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000196
Serhiy Storchakaf47036c2013-12-24 11:04:36 +0200197 >>> d = DeepChainMap({'zebra': 'black'}, {'elephant': 'blue'}, {'lion': 'yellow'})
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700198 >>> d['lion'] = 'orange' # update an existing key two levels down
199 >>> d['snake'] = 'red' # new keys get added to the topmost dict
200 >>> del d['elephant'] # remove an existing key one level down
Raymond Hettinger2064bb62018-09-10 18:43:08 -0700201 >>> d # display result
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700202 DeepChainMap({'zebra': 'black', 'snake': 'red'}, {}, {'lion': 'orange'})
Georg Brandl4dcf4742012-03-08 20:35:08 +0100203
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000204
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000205:class:`Counter` objects
206------------------------
207
208A counter tool is provided to support convenient and rapid tallies.
209For example::
210
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000211 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000212 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000213 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000214 ... cnt[word] += 1
215 >>> cnt
216 Counter({'blue': 3, 'red': 2, 'green': 1})
217
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000218 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000219 >>> import re
Raymond Hettingerfaaba592013-03-01 03:30:20 -0800220 >>> words = re.findall(r'\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000221 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000222 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
223 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
224
225.. class:: Counter([iterable-or-mapping])
226
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700227 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Andrés Delfino76e8fd72018-11-11 13:56:47 -0300228 It is a collection where elements are stored as dictionary keys
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700229 and their counts are stored as dictionary values. Counts are allowed to be
230 any integer value including zero or negative counts. The :class:`Counter`
231 class is similar to bags or multisets in other languages.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000232
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700233 Elements are counted from an *iterable* or initialized from another
234 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000235
Raymond Hettinger73662a52009-01-27 02:38:22 +0000236 >>> c = Counter() # a new, empty counter
237 >>> c = Counter('gallahad') # a new counter from an iterable
238 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
239 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000240
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700241 Counter objects have a dictionary interface except that they return a zero
242 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000243
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000244 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000245 >>> c['bacon'] # count of a missing element is zero
246 0
247
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700248 Setting a count to zero does not remove an element from a counter.
249 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000250
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000251 >>> c['sausage'] = 0 # counter entry with a zero count
252 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000253
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700254 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000255
256
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700257 Counter objects support three methods beyond those available for all
258 dictionaries:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000259
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700260 .. method:: elements()
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000261
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700262 Return an iterator over elements repeating each as many times as its
263 count. Elements are returned in arbitrary order. If an element's count
264 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000265
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000266 >>> c = Counter(a=4, b=2, c=0, d=-2)
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500267 >>> sorted(c.elements())
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000268 ['a', 'a', 'a', 'a', 'b', 'b']
269
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700270 .. method:: most_common([n])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000271
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700272 Return a list of the *n* most common elements and their counts from the
Raymond Hettingera3780252015-05-13 02:47:57 -0700273 most common to the least. If *n* is omitted or ``None``,
Michael Seiferte1052942018-03-26 13:40:35 +0200274 :meth:`most_common` returns *all* elements in the counter.
Raymond Hettinger3afdb282015-05-13 14:39:04 -0700275 Elements with equal counts are ordered arbitrarily:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000276
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500277 >>> Counter('abracadabra').most_common(3) # doctest: +SKIP
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000278 [('a', 5), ('r', 2), ('b', 2)]
279
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700280 .. method:: subtract([iterable-or-mapping])
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000281
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700282 Elements are subtracted from an *iterable* or from another *mapping*
283 (or counter). Like :meth:`dict.update` but subtracts counts instead
284 of replacing them. Both inputs and outputs may be zero or negative.
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000285
286 >>> c = Counter(a=4, b=2, c=0, d=-2)
287 >>> d = Counter(a=1, b=2, c=3, d=4)
288 >>> c.subtract(d)
Andrew Svetlovf6351722012-12-17 14:01:16 +0200289 >>> c
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000290 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
291
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700292 .. versionadded:: 3.2
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000293
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700294 The usual dictionary methods are available for :class:`Counter` objects
295 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000296
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700297 .. method:: fromkeys(iterable)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000298
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700299 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000300
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700301 .. method:: update([iterable-or-mapping])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000302
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700303 Elements are counted from an *iterable* or added-in from another
304 *mapping* (or counter). Like :meth:`dict.update` but adds counts
305 instead of replacing them. Also, the *iterable* is expected to be a
306 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000307
308Common patterns for working with :class:`Counter` objects::
309
Raymond Hettinger73662a52009-01-27 02:38:22 +0000310 sum(c.values()) # total of all counts
311 c.clear() # reset all counts
312 list(c) # list unique elements
313 set(c) # convert to a set
314 dict(c) # convert to a regular dictionary
315 c.items() # convert to a list of (elem, cnt) pairs
316 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
Georg Brandl87f3d7b2013-10-06 12:36:39 +0200317 c.most_common()[:-n-1:-1] # n least common elements
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700318 +c # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000319
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000320Several mathematical operations are provided for combining :class:`Counter`
321objects to produce multisets (counters that have counts greater than zero).
322Addition and subtraction combine counters by adding or subtracting the counts
323of corresponding elements. Intersection and union return the minimum and
324maximum of corresponding counts. Each operation can accept inputs with signed
325counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000326
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000327 >>> c = Counter(a=3, b=1)
328 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000329 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000330 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000331 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000332 Counter({'a': 2})
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500333 >>> c & d # intersection: min(c[x], d[x]) # doctest: +SKIP
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000334 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000335 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000336 Counter({'a': 3, 'b': 2})
337
Berker Peksag315e1042015-05-19 01:36:55 +0300338Unary addition and subtraction are shortcuts for adding an empty counter
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700339or subtracting from an empty counter.
340
341 >>> c = Counter(a=2, b=-4)
342 >>> +c
343 Counter({'a': 2})
344 >>> -c
345 Counter({'b': 4})
346
347.. versionadded:: 3.3
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700348 Added support for unary plus, unary minus, and in-place multiset operations.
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700349
Raymond Hettinger22f18852010-04-12 21:45:14 +0000350.. note::
351
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700352 Counters were primarily designed to work with positive integers to represent
353 running counts; however, care was taken to not unnecessarily preclude use
354 cases needing other types or negative values. To help with those use cases,
355 this section documents the minimum range and type restrictions.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000356
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700357 * The :class:`Counter` class itself is a dictionary subclass with no
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200358 restrictions on its keys and values. The values are intended to be numbers
359 representing counts, but you *could* store anything in the value field.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000360
Michael Seiferte1052942018-03-26 13:40:35 +0200361 * The :meth:`~Counter.most_common` method requires only that the values be orderable.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000362
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700363 * For in-place operations such as ``c[key] += 1``, the value type need only
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200364 support addition and subtraction. So fractions, floats, and decimals would
365 work and negative values are supported. The same is also true for
Michael Seiferte1052942018-03-26 13:40:35 +0200366 :meth:`~Counter.update` and :meth:`~Counter.subtract` which allow negative and zero values
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200367 for both inputs and outputs.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000368
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700369 * The multiset methods are designed only for use cases with positive values.
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200370 The inputs may be negative or zero, but only outputs with positive values
371 are created. There are no type restrictions, but the value type needs to
372 support addition, subtraction, and comparison.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000373
Michael Seiferte1052942018-03-26 13:40:35 +0200374 * The :meth:`~Counter.elements` method requires integer counts. It ignores zero and
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200375 negative counts.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000376
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000377.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000378
Georg Brandl5d941342016-02-26 19:37:12 +0100379 * `Bag class <https://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700380 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000381
Georg Brandl5d941342016-02-26 19:37:12 +0100382 * Wikipedia entry for `Multisets <https://en.wikipedia.org/wiki/Multiset>`_.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000383
Georg Brandl5d941342016-02-26 19:37:12 +0100384 * `C++ multisets <http://www.java2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700385 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000386
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000387 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700388 *Knuth, Donald. The Art of Computer Programming Volume II,
389 Section 4.6.3, Exercise 19*.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000390
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000391 * To enumerate all distinct multisets of a given size over a given set of
Michael Seiferte1052942018-03-26 13:40:35 +0200392 elements, see :func:`itertools.combinations_with_replacement`::
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000393
Michael Seiferte1052942018-03-26 13:40:35 +0200394 map(Counter, combinations_with_replacement('ABC', 2)) # --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000395
396
Georg Brandl116aa622007-08-15 14:28:22 +0000397:class:`deque` objects
398----------------------
399
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000400.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000401
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700402 Returns a new deque object initialized left-to-right (using :meth:`append`) with
403 data from *iterable*. If *iterable* is not specified, the new deque is empty.
Georg Brandl116aa622007-08-15 14:28:22 +0000404
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700405 Deques are a generalization of stacks and queues (the name is pronounced "deck"
406 and is short for "double-ended queue"). Deques support thread-safe, memory
407 efficient appends and pops from either side of the deque with approximately the
408 same O(1) performance in either direction.
Georg Brandl116aa622007-08-15 14:28:22 +0000409
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700410 Though :class:`list` objects support similar operations, they are optimized for
411 fast fixed-length operations and incur O(n) memory movement costs for
412 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
413 position of the underlying data representation.
Georg Brandl116aa622007-08-15 14:28:22 +0000414
Georg Brandl116aa622007-08-15 14:28:22 +0000415
Serhiy Storchakaecf41da2016-10-19 16:29:26 +0300416 If *maxlen* is not specified or is ``None``, deques may grow to an
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700417 arbitrary length. Otherwise, the deque is bounded to the specified maximum
418 length. Once a bounded length deque is full, when new items are added, a
419 corresponding number of items are discarded from the opposite end. Bounded
420 length deques provide functionality similar to the ``tail`` filter in
421 Unix. They are also useful for tracking transactions and other pools of data
422 where only the most recent activity is of interest.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000423
Georg Brandl9afde1c2007-11-01 20:32:30 +0000424
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700425 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000426
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700427 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000428
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700429 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000430
431
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700432 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000433
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700434 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000435
436
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700437 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000438
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700439 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000440
441
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700442 .. method:: copy()
443
444 Create a shallow copy of the deque.
445
446 .. versionadded:: 3.5
447
448
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700449 .. method:: count(x)
Raymond Hettinger44459de2010-04-03 23:20:46 +0000450
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700451 Count the number of deque elements equal to *x*.
Raymond Hettinger44459de2010-04-03 23:20:46 +0000452
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700453 .. versionadded:: 3.2
Raymond Hettinger44459de2010-04-03 23:20:46 +0000454
Georg Brandl67b21b72010-08-17 15:07:14 +0000455
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700456 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000457
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700458 Extend the right side of the deque by appending elements from the iterable
459 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000460
461
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700462 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000463
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700464 Extend the left side of the deque by appending elements from *iterable*.
465 Note, the series of left appends results in reversing the order of
466 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000467
468
Raymond Hettinger855482e2015-05-23 08:57:58 -0700469 .. method:: index(x[, start[, stop]])
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700470
Raymond Hettinger855482e2015-05-23 08:57:58 -0700471 Return the position of *x* in the deque (at or after index *start*
472 and before index *stop*). Returns the first match or raises
473 :exc:`ValueError` if not found.
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700474
475 .. versionadded:: 3.5
476
477
478 .. method:: insert(i, x)
479
480 Insert *x* into the deque at position *i*.
481
Raymond Hettingera6389712016-02-01 21:21:19 -0800482 If the insertion would cause a bounded deque to grow beyond *maxlen*,
483 an :exc:`IndexError` is raised.
Raymond Hettinger37434322016-01-26 21:44:16 -0800484
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700485 .. versionadded:: 3.5
486
487
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700488 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000489
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700490 Remove and return an element from the right side of the deque. If no
491 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000492
493
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700494 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000495
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700496 Remove and return an element from the left side of the deque. If no
497 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000498
499
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700500 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000501
Raymond Hettinger855482e2015-05-23 08:57:58 -0700502 Remove the first occurrence of *value*. If not found, raises a
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700503 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000504
Georg Brandl67b21b72010-08-17 15:07:14 +0000505
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700506 .. method:: reverse()
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000507
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700508 Reverse the elements of the deque in-place and then return ``None``.
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000509
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700510 .. versionadded:: 3.2
Georg Brandl116aa622007-08-15 14:28:22 +0000511
Georg Brandl67b21b72010-08-17 15:07:14 +0000512
Raymond Hettinger589c7182018-02-03 08:46:28 -0800513 .. method:: rotate(n=1)
Georg Brandl116aa622007-08-15 14:28:22 +0000514
Raymond Hettinger589c7182018-02-03 08:46:28 -0800515 Rotate the deque *n* steps to the right. If *n* is negative, rotate
516 to the left.
517
Raymond Hettingerca6c1252018-02-04 09:15:01 -0800518 When the deque is not empty, rotating one step to the right is equivalent
Raymond Hettinger589c7182018-02-03 08:46:28 -0800519 to ``d.appendleft(d.pop())``, and rotating one step to the left is
520 equivalent to ``d.append(d.popleft())``.
Benjamin Petersone41251e2008-04-25 01:59:09 +0000521
Georg Brandl116aa622007-08-15 14:28:22 +0000522
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700523 Deque objects also provide one read-only attribute:
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000524
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700525 .. attribute:: maxlen
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000526
Serhiy Storchakaecf41da2016-10-19 16:29:26 +0300527 Maximum size of a deque or ``None`` if unbounded.
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000528
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700529 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000530
531
Georg Brandl116aa622007-08-15 14:28:22 +0000532In addition to the above, deques support iteration, pickling, ``len(d)``,
533``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Windson yang98b85352018-11-05 06:34:22 +0800534the :keyword:`in` operator, and subscript references such as ``d[0]`` to access
535the first element. Indexed access is O(1) at both ends but slows to O(n) in
536the middle. For fast random access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000537
Raymond Hettinger41290a62015-03-31 08:12:23 -0700538Starting in version 3.5, deques support ``__add__()``, ``__mul__()``,
539and ``__imul__()``.
540
Christian Heimesfe337bf2008-03-23 21:54:12 +0000541Example:
542
543.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000544
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700545 >>> from collections import deque
546 >>> d = deque('ghi') # make a new deque with three items
547 >>> for elem in d: # iterate over the deque's elements
548 ... print(elem.upper())
549 G
550 H
551 I
Georg Brandl116aa622007-08-15 14:28:22 +0000552
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700553 >>> d.append('j') # add a new entry to the right side
554 >>> d.appendleft('f') # add a new entry to the left side
555 >>> d # show the representation of the deque
556 deque(['f', 'g', 'h', 'i', 'j'])
Georg Brandl116aa622007-08-15 14:28:22 +0000557
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700558 >>> d.pop() # return and remove the rightmost item
559 'j'
560 >>> d.popleft() # return and remove the leftmost item
561 'f'
562 >>> list(d) # list the contents of the deque
563 ['g', 'h', 'i']
564 >>> d[0] # peek at leftmost item
565 'g'
566 >>> d[-1] # peek at rightmost item
567 'i'
Georg Brandl116aa622007-08-15 14:28:22 +0000568
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700569 >>> list(reversed(d)) # list the contents of a deque in reverse
570 ['i', 'h', 'g']
571 >>> 'h' in d # search the deque
572 True
573 >>> d.extend('jkl') # add multiple elements at once
574 >>> d
575 deque(['g', 'h', 'i', 'j', 'k', 'l'])
576 >>> d.rotate(1) # right rotation
577 >>> d
578 deque(['l', 'g', 'h', 'i', 'j', 'k'])
579 >>> d.rotate(-1) # left rotation
580 >>> d
581 deque(['g', 'h', 'i', 'j', 'k', 'l'])
Georg Brandl116aa622007-08-15 14:28:22 +0000582
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700583 >>> deque(reversed(d)) # make a new deque in reverse order
584 deque(['l', 'k', 'j', 'i', 'h', 'g'])
585 >>> d.clear() # empty the deque
586 >>> d.pop() # cannot pop from an empty deque
587 Traceback (most recent call last):
588 File "<pyshell#6>", line 1, in -toplevel-
589 d.pop()
590 IndexError: pop from an empty deque
Georg Brandl116aa622007-08-15 14:28:22 +0000591
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700592 >>> d.extendleft('abc') # extendleft() reverses the input order
593 >>> d
594 deque(['c', 'b', 'a'])
Georg Brandl116aa622007-08-15 14:28:22 +0000595
596
Georg Brandl9afde1c2007-11-01 20:32:30 +0000597:class:`deque` Recipes
598^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000599
600This section shows various approaches to working with deques.
601
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000602Bounded length deques provide functionality similar to the ``tail`` filter
603in Unix::
604
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700605 def tail(filename, n=10):
606 'Return the last n lines of a file'
607 with open(filename) as f:
608 return deque(f, n)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000609
610Another approach to using deques is to maintain a sequence of recently
611added elements by appending to the right and popping to the left::
612
613 def moving_average(iterable, n=3):
614 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
615 # http://en.wikipedia.org/wiki/Moving_average
616 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000617 d = deque(itertools.islice(it, n-1))
618 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000619 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000620 for elem in it:
621 s += elem - d.popleft()
622 d.append(elem)
623 yield s / n
624
Raymond Hettinger08584952017-11-23 13:32:23 -0800625A `round-robin scheduler
626<https://en.wikipedia.org/wiki/Round-robin_scheduling>`_ can be implemented with
627input iterators stored in a :class:`deque`. Values are yielded from the active
628iterator in position zero. If that iterator is exhausted, it can be removed
629with :meth:`~deque.popleft`; otherwise, it can be cycled back to the end with
630the :meth:`~deque.rotate` method::
631
632 def roundrobin(*iterables):
633 "roundrobin('ABC', 'D', 'EF') --> A D E B F C"
634 iterators = deque(map(iter, iterables))
635 while iterators:
636 try:
637 while True:
638 yield next(iterators[0])
639 iterators.rotate(-1)
640 except StopIteration:
641 # Remove an exhausted iterator.
642 iterators.popleft()
643
Michael Seiferte1052942018-03-26 13:40:35 +0200644The :meth:`~deque.rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti0639d5a2009-12-19 23:26:38 +0000645deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Michael Seiferte1052942018-03-26 13:40:35 +0200646the ``rotate()`` method to position elements to be popped::
Georg Brandl116aa622007-08-15 14:28:22 +0000647
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700648 def delete_nth(d, n):
649 d.rotate(-n)
650 d.popleft()
651 d.rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000652
653To implement :class:`deque` slicing, use a similar approach applying
Michael Seiferte1052942018-03-26 13:40:35 +0200654:meth:`~deque.rotate` to bring a target element to the left side of the deque. Remove
655old entries with :meth:`~deque.popleft`, add new entries with :meth:`~deque.extend`, and then
Georg Brandl116aa622007-08-15 14:28:22 +0000656reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000657With minor variations on that approach, it is easy to implement Forth style
658stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
659``rot``, and ``roll``.
660
Georg Brandl116aa622007-08-15 14:28:22 +0000661
662:class:`defaultdict` objects
663----------------------------
664
Georg Brandl116aa622007-08-15 14:28:22 +0000665.. class:: defaultdict([default_factory[, ...]])
666
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700667 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
668 built-in :class:`dict` class. It overrides one method and adds one writable
669 instance variable. The remaining functionality is the same as for the
670 :class:`dict` class and is not documented here.
Georg Brandl116aa622007-08-15 14:28:22 +0000671
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700672 The first argument provides the initial value for the :attr:`default_factory`
673 attribute; it defaults to ``None``. All remaining arguments are treated the same
674 as if they were passed to the :class:`dict` constructor, including keyword
675 arguments.
Georg Brandl116aa622007-08-15 14:28:22 +0000676
Georg Brandl116aa622007-08-15 14:28:22 +0000677
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700678 :class:`defaultdict` objects support the following method in addition to the
679 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000680
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700681 .. method:: __missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000682
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700683 If the :attr:`default_factory` attribute is ``None``, this raises a
684 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000685
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700686 If :attr:`default_factory` is not ``None``, it is called without arguments
687 to provide a default value for the given *key*, this value is inserted in
688 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000689
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700690 If calling :attr:`default_factory` raises an exception this exception is
691 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000692
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700693 This method is called by the :meth:`__getitem__` method of the
694 :class:`dict` class when the requested key is not found; whatever it
695 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000696
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700697 Note that :meth:`__missing__` is *not* called for any operations besides
698 :meth:`__getitem__`. This means that :meth:`get` will, like normal
699 dictionaries, return ``None`` as a default rather than using
700 :attr:`default_factory`.
Benjamin Peterson871b9d12012-01-27 09:14:01 -0500701
Georg Brandl116aa622007-08-15 14:28:22 +0000702
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700703 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000704
Benjamin Petersone41251e2008-04-25 01:59:09 +0000705
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700706 .. attribute:: default_factory
Benjamin Petersone41251e2008-04-25 01:59:09 +0000707
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700708 This attribute is used by the :meth:`__missing__` method; it is
709 initialized from the first argument to the constructor, if present, or to
710 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000711
712
Georg Brandl116aa622007-08-15 14:28:22 +0000713:class:`defaultdict` Examples
714^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
715
Michael Seiferte1052942018-03-26 13:40:35 +0200716Using :class:`list` as the :attr:`~defaultdict.default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000717sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000718
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700719 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
720 >>> d = defaultdict(list)
721 >>> for k, v in s:
722 ... d[k].append(v)
723 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500724 >>> sorted(d.items())
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700725 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
Georg Brandl116aa622007-08-15 14:28:22 +0000726
727When each key is encountered for the first time, it is not already in the
Michael Seiferte1052942018-03-26 13:40:35 +0200728mapping; so an entry is automatically created using the :attr:`~defaultdict.default_factory`
Georg Brandl116aa622007-08-15 14:28:22 +0000729function which returns an empty :class:`list`. The :meth:`list.append`
730operation then attaches the value to the new list. When keys are encountered
731again, the look-up proceeds normally (returning the list for that key) and the
732:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000733simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000734
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700735 >>> d = {}
736 >>> for k, v in s:
737 ... d.setdefault(k, []).append(v)
738 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500739 >>> sorted(d.items())
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700740 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
Georg Brandl116aa622007-08-15 14:28:22 +0000741
Michael Seiferte1052942018-03-26 13:40:35 +0200742Setting the :attr:`~defaultdict.default_factory` to :class:`int` makes the
Georg Brandl116aa622007-08-15 14:28:22 +0000743:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000744languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000745
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700746 >>> s = 'mississippi'
747 >>> d = defaultdict(int)
748 >>> for k in s:
749 ... d[k] += 1
750 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500751 >>> sorted(d.items())
752 [('i', 4), ('m', 1), ('p', 2), ('s', 4)]
Georg Brandl116aa622007-08-15 14:28:22 +0000753
754When a letter is first encountered, it is missing from the mapping, so the
Michael Seiferte1052942018-03-26 13:40:35 +0200755:attr:`~defaultdict.default_factory` function calls :func:`int` to supply a default count of
Georg Brandl116aa622007-08-15 14:28:22 +0000756zero. The increment operation then builds up the count for each letter.
757
758The function :func:`int` which always returns zero is just a special case of
759constant functions. A faster and more flexible way to create constant functions
760is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000761zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000762
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700763 >>> def constant_factory(value):
764 ... return lambda: value
765 >>> d = defaultdict(constant_factory('<missing>'))
766 >>> d.update(name='John', action='ran')
767 >>> '%(name)s %(action)s to %(object)s' % d
768 'John ran to <missing>'
Georg Brandl116aa622007-08-15 14:28:22 +0000769
Michael Seiferte1052942018-03-26 13:40:35 +0200770Setting the :attr:`~defaultdict.default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000771:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000772
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700773 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
774 >>> d = defaultdict(set)
775 >>> for k, v in s:
776 ... d[k].add(v)
777 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500778 >>> sorted(d.items())
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700779 [('blue', {2, 4}), ('red', {1, 3})]
Georg Brandl116aa622007-08-15 14:28:22 +0000780
781
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000782:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000783----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000784
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000785Named tuples assign meaning to each position in a tuple and allow for more readable,
786self-documenting code. They can be used wherever regular tuples are used, and
787they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000788
Raymond Hettinger39482072018-01-10 21:45:19 -0800789.. function:: namedtuple(typename, field_names, *, rename=False, defaults=None, module=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000790
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700791 Returns a new tuple subclass named *typename*. The new subclass is used to
792 create tuple-like objects that have fields accessible by attribute lookup as
793 well as being indexable and iterable. Instances of the subclass also have a
794 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
795 method which lists the tuple contents in a ``name=value`` format.
Georg Brandl116aa622007-08-15 14:28:22 +0000796
csabella97bf7222017-04-25 12:14:45 -0400797 The *field_names* are a sequence of strings such as ``['x', 'y']``.
798 Alternatively, *field_names* can be a single string with each fieldname
799 separated by whitespace and/or commas, for example ``'x y'`` or ``'x, y'``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000800
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700801 Any valid Python identifier may be used for a fieldname except for names
802 starting with an underscore. Valid identifiers consist of letters, digits,
803 and underscores but do not start with a digit or underscore and cannot be
804 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
805 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000806
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700807 If *rename* is true, invalid fieldnames are automatically replaced
808 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
809 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
810 ``def`` and the duplicate fieldname ``abc``.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000811
Raymond Hettinger39482072018-01-10 21:45:19 -0800812 *defaults* can be ``None`` or an :term:`iterable` of default values.
813 Since fields with a default value must come after any fields without a
814 default, the *defaults* are applied to the rightmost parameters. For
815 example, if the fieldnames are ``['x', 'y', 'z']`` and the defaults are
816 ``(1, 2)``, then ``x`` will be a required argument, ``y`` will default to
817 ``1``, and ``z`` will default to ``2``.
818
Raymond Hettinger0d5048c2016-09-12 00:18:31 -0700819 If *module* is defined, the ``__module__`` attribute of the named tuple is
820 set to that value.
821
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700822 Named tuple instances do not have per-instance dictionaries, so they are
823 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000824
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700825 .. versionchanged:: 3.1
Raymond Hettinger6538b432016-08-16 10:55:43 -0700826 Added support for *rename*.
827
828 .. versionchanged:: 3.6
829 The *verbose* and *rename* parameters became
830 :ref:`keyword-only arguments <keyword-only_parameter>`.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000831
Raymond Hettinger0d5048c2016-09-12 00:18:31 -0700832 .. versionchanged:: 3.6
833 Added the *module* parameter.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000834
Raymond Hettinger8b57d732017-09-10 10:23:36 -0700835 .. versionchanged:: 3.7
836 Remove the *verbose* parameter and the :attr:`_source` attribute.
837
Raymond Hettinger39482072018-01-10 21:45:19 -0800838 .. versionchanged:: 3.7
839 Added the *defaults* parameter and the :attr:`_field_defaults`
840 attribute.
841
Christian Heimesfe337bf2008-03-23 21:54:12 +0000842.. doctest::
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700843 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000844
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700845 >>> # Basic example
846 >>> Point = namedtuple('Point', ['x', 'y'])
847 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
848 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
849 33
850 >>> x, y = p # unpack like a regular tuple
851 >>> x, y
852 (11, 22)
853 >>> p.x + p.y # fields also accessible by name
854 33
855 >>> p # readable __repr__ with a name=value style
856 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000857
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000858Named tuples are especially useful for assigning field names to result tuples returned
859by the :mod:`csv` or :mod:`sqlite3` modules::
860
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700861 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000862
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700863 import csv
864 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
865 print(emp.name, emp.title)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000866
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700867 import sqlite3
868 conn = sqlite3.connect('/companydata')
869 cursor = conn.cursor()
870 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
871 for emp in map(EmployeeRecord._make, cursor.fetchall()):
872 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000873
Christian Heimes99170a52007-12-19 02:07:34 +0000874In addition to the methods inherited from tuples, named tuples support
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700875three additional methods and two attributes. To prevent conflicts with
Christian Heimes2380ac72008-01-09 00:17:24 +0000876field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000877
Benjamin Peterson0b9fb802010-07-18 14:23:36 +0000878.. classmethod:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000879
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700880 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000881
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700882 .. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000883
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700884 >>> t = [11, 22]
885 >>> Point._make(t)
886 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000887
Christian Heimes790c8232008-01-07 21:14:23 +0000888.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000889
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -0800890 Return a new :class:`dict` which maps field names to their corresponding
Raymond Hettingerfd27f622016-08-16 13:13:17 -0700891 values:
892
893 .. doctest::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000894
Raymond Hettinger7a3602e2015-08-30 09:13:48 -0700895 >>> p = Point(x=11, y=22)
896 >>> p._asdict()
Raymond Hettinger0bb4bdf2019-01-31 00:59:50 -0800897 {'x': 11, 'y': 22}
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000898
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700899 .. versionchanged:: 3.1
900 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000901
Raymond Hettinger0bb4bdf2019-01-31 00:59:50 -0800902 .. versionchanged:: 3.8
903 Returns a regular :class:`dict` instead of an :class:`OrderedDict`.
904 As of Python 3.7, regular dicts are guaranteed to be ordered. If the
905 extra features of :class:`OrderedDict` are required, the suggested
906 remediation is to cast the result to the desired type:
907 ``OrderedDict(nt._asdict())``.
908
Ben Hoyt184bd822017-06-13 15:20:51 -0400909.. method:: somenamedtuple._replace(**kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000910
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700911 Return a new instance of the named tuple replacing specified fields with new
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700912 values::
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000913
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700914 >>> p = Point(x=11, y=22)
915 >>> p._replace(x=33)
916 Point(x=33, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000917
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700918 >>> for partnum, record in inventory.items():
919 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000920
Christian Heimes790c8232008-01-07 21:14:23 +0000921.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000922
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700923 Tuple of strings listing the field names. Useful for introspection
924 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000925
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700926 .. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000927
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700928 >>> p._fields # view the field names
929 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000930
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700931 >>> Color = namedtuple('Color', 'red green blue')
932 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
933 >>> Pixel(11, 22, 128, 255, 0)
934 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000935
Raymond Hettinger39482072018-01-10 21:45:19 -0800936.. attribute:: somenamedtuple._fields_defaults
937
938 Dictionary mapping field names to default values.
939
940 .. doctest::
941
942 >>> Account = namedtuple('Account', ['type', 'balance'], defaults=[0])
943 >>> Account._fields_defaults
944 {'balance': 0}
945 >>> Account('premium')
946 Account(type='premium', balance=0)
947
Christian Heimes0449f632007-12-15 01:27:15 +0000948To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000949function:
Christian Heimes0449f632007-12-15 01:27:15 +0000950
951 >>> getattr(p, 'x')
952 11
953
Raymond Hettinger651453a2009-02-11 00:20:02 +0000954To convert a dictionary to a named tuple, use the double-star-operator
955(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000956
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700957 >>> d = {'x': 11, 'y': 22}
958 >>> Point(**d)
959 Point(x=11, y=22)
Christian Heimes99170a52007-12-19 02:07:34 +0000960
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000961Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000962functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000963a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000964
Raymond Hettingerfd27f622016-08-16 13:13:17 -0700965.. doctest::
966
967 >>> class Point(namedtuple('Point', ['x', 'y'])):
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500968 ... __slots__ = ()
969 ... @property
970 ... def hypot(self):
971 ... return (self.x ** 2 + self.y ** 2) ** 0.5
972 ... def __str__(self):
973 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000974
Georg Brandl0df79792008-10-04 18:33:26 +0000975 >>> for p in Point(3, 4), Point(14, 5/7):
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500976 ... print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000977 Point: x= 3.000 y= 4.000 hypot= 5.000
978 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000979
Georg Brandlaf5c2382009-12-28 08:02:38 +0000980The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +0000981keep memory requirements low by preventing the creation of instance dictionaries.
982
Christian Heimes2380ac72008-01-09 00:17:24 +0000983Subclassing is not useful for adding new, stored fields. Instead, simply
Michael Seiferte1052942018-03-26 13:40:35 +0200984create a new named tuple type from the :attr:`~somenamedtuple._fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +0000985
Christian Heimes25bb7832008-01-11 16:17:00 +0000986 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +0000987
Raymond Hettingereac503a2015-05-13 01:09:59 -0700988Docstrings can be customized by making direct assignments to the ``__doc__``
989fields:
990
991 >>> Book = namedtuple('Book', ['id', 'title', 'authors'])
Raymond Hettinger850be0f2015-11-09 08:24:53 -0800992 >>> Book.__doc__ += ': Hardcover book in active collection'
Berker Peksagde7cafa2015-05-13 12:16:27 +0300993 >>> Book.id.__doc__ = '13-digit ISBN'
994 >>> Book.title.__doc__ = 'Title of first printing'
Raymond Hettinger850be0f2015-11-09 08:24:53 -0800995 >>> Book.authors.__doc__ = 'List of authors sorted by last name'
Raymond Hettingereac503a2015-05-13 01:09:59 -0700996
Raymond Hettinger6e701312015-11-23 22:18:55 -0800997.. versionchanged:: 3.5
998 Property docstrings became writeable.
999
Michael Seiferte1052942018-03-26 13:40:35 +02001000Default values can be implemented by using :meth:`~somenamedtuple._replace` to
Christian Heimesfe337bf2008-03-23 21:54:12 +00001001customize a prototype instance:
Guido van Rossum3d392eb2007-11-16 00:35:22 +00001002
1003 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Christian Heimes587c2bf2008-01-19 16:21:02 +00001004 >>> default_account = Account('<owner name>', 0.0, 0)
1005 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerb2d09452011-03-22 22:36:21 -07001006 >>> janes_account = default_account._replace(owner='Jane')
Guido van Rossum3d392eb2007-11-16 00:35:22 +00001007
Georg Brandl8ed75cd2014-10-31 10:25:48 +01001008
1009.. seealso::
1010
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001011 * `Recipe for named tuple abstract base class with a metaclass mix-in
Serhiy Storchaka6dff0202016-05-07 10:49:07 +03001012 <https://code.activestate.com/recipes/577629-namedtupleabc-abstract-base-class-mix-in-for-named/>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -07001013 by Jan Kaliszewski. Besides providing an :term:`abstract base class` for
1014 named tuples, it also supports an alternate :term:`metaclass`-based
1015 constructor that is convenient for use cases where named tuples are being
1016 subclassed.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001017
Raymond Hettingerfd27f622016-08-16 13:13:17 -07001018 * See :meth:`types.SimpleNamespace` for a mutable namespace based on an
1019 underlying dictionary instead of a tuple.
1020
1021 * See :meth:`typing.NamedTuple` for a way to add type hints for named tuples.
Raymond Hettinger2a75e8f2015-08-16 08:32:01 -07001022
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001023
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001024:class:`OrderedDict` objects
1025----------------------------
1026
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001027Ordered dictionaries are just like regular dictionaries but have some extra
1028capabilities relating to ordering operations. They have become less
1029important now that the built-in :class:`dict` class gained the ability
1030to remember insertion order (this new behavior became guaranteed in
1031Python 3.7).
1032
1033Some differences from :class:`dict` still remain:
1034
1035* The regular :class:`dict` was designed to be very good at mapping
1036 operations. Tracking insertion order was secondary.
1037
1038* The :class:`OrderedDict` was designed to be good at reordering operations.
1039 Space efficiency, iteration speed, and the performance of update
1040 operations were secondary.
1041
1042* Algorithmically, :class:`OrderedDict` can handle frequent reordering
1043 operations better than :class:`dict`. This makes it suitable for tracking
1044 recent accesses (for example in an `LRU cache
1045 <https://medium.com/@krishankantsinghal/my-first-blog-on-medium-583159139237>`_).
1046
1047* The equality operation for :class:`OrderedDict` checks for matching order.
1048
1049* The :meth:`popitem` method of :class:`OrderedDict` has a different
1050 signature. It accepts an optional argument to specify which item is popped.
1051
1052* :class:`OrderedDict` has a :meth:`move_to_end` method to
1053 efficiently reposition an element to an endpoint.
1054
1055* Until Python 3.8, :class:`dict` lacked a :meth:`__reversed__` method.
1056
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001057
1058.. class:: OrderedDict([items])
1059
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001060 Return an instance of a :class:`dict` subclass that has methods
1061 specialized for rearranging dictionary order.
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001062
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001063 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001064
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001065 .. method:: popitem(last=True)
Raymond Hettingerdc879f02009-03-19 20:30:56 +00001066
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001067 The :meth:`popitem` method for ordered dictionaries returns and removes a
Serhiy Storchaka4ecfa452016-05-16 09:31:54 +03001068 (key, value) pair. The pairs are returned in
1069 :abbr:`LIFO (last-in, first-out)` order if *last* is true
1070 or :abbr:`FIFO (first-in, first-out)` order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001071
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001072 .. method:: move_to_end(key, last=True)
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001073
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001074 Move an existing *key* to either end of an ordered dictionary. The item
1075 is moved to the right end if *last* is true (the default) or to the
1076 beginning if *last* is false. Raises :exc:`KeyError` if the *key* does
1077 not exist::
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001078
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001079 >>> d = OrderedDict.fromkeys('abcde')
1080 >>> d.move_to_end('b')
1081 >>> ''.join(d.keys())
1082 'acdeb'
1083 >>> d.move_to_end('b', last=False)
1084 >>> ''.join(d.keys())
1085 'bacde'
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001086
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001087 .. versionadded:: 3.2
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001088
Raymond Hettingere9091502009-05-19 17:40:07 +00001089In addition to the usual mapping methods, ordered dictionaries also support
1090reverse iteration using :func:`reversed`.
1091
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001092Equality tests between :class:`OrderedDict` objects are order-sensitive
1093and are implemented as ``list(od1.items())==list(od2.items())``.
1094Equality tests between :class:`OrderedDict` objects and other
Serhiy Storchakabfdcd432013-10-13 23:09:14 +03001095:class:`~collections.abc.Mapping` objects are order-insensitive like regular
1096dictionaries. This allows :class:`OrderedDict` objects to be substituted
1097anywhere a regular dictionary is used.
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001098
Serhiy Storchaka578c9212014-04-04 15:19:36 +03001099.. versionchanged:: 3.5
Martin Panter397625e2015-10-07 10:03:20 +00001100 The items, keys, and values :term:`views <dictionary view>`
1101 of :class:`OrderedDict` now support reverse iteration using :func:`reversed`.
Raymond Hettingerdc879f02009-03-19 20:30:56 +00001102
Raymond Hettingerd15bb262017-01-07 22:05:12 -08001103.. versionchanged:: 3.6
1104 With the acceptance of :pep:`468`, order is retained for keyword arguments
1105 passed to the :class:`OrderedDict` constructor and its :meth:`update`
1106 method.
1107
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001108:class:`OrderedDict` Examples and Recipes
1109^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1110
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001111It is straightforward to create an ordered dictionary variant
Andrew Svetlovff63e7a2012-08-31 13:54:54 +03001112that remembers the order the keys were *last* inserted.
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001113If a new entry overwrites an existing entry, the
1114original insertion position is changed and moved to the end::
1115
1116 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandl77570e22010-12-18 16:21:58 +00001117 'Store items in the order the keys were last added'
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001118
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001119 def __setitem__(self, key, value):
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001120 super().__setitem__(key, value)
1121 super().move_to_end(key)
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001122
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001123An :class:`OrderedDict` would also be useful for implementing
1124variants of :func:`functools.lru_cache`::
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001125
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001126 class LRU(OrderedDict):
1127 'Limit size, evicting the least recently looked-up key when full'
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001128
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001129 def __init__(self, maxsize=128, *args, **kwds):
1130 self.maxsize = maxsize
1131 super().__init__(*args, **kwds)
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001132
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001133 def __getitem__(self, key):
1134 value = super().__getitem__(key)
1135 self.move_to_end(key)
1136 return value
1137
1138 def __setitem__(self, key, value):
1139 super().__setitem__(key, value)
1140 if len(self) > self.maxsize:
1141 oldest = next(iter(self))
1142 del self[oldest]
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001143
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001144
1145:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +00001146-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001147
Georg Brandl48310cd2009-01-03 21:18:54 +00001148The class, :class:`UserDict` acts as a wrapper around dictionary objects.
1149The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001150subclass directly from :class:`dict`; however, this class can be easier
1151to work with because the underlying dictionary is accessible as an
1152attribute.
1153
1154.. class:: UserDict([initialdata])
1155
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001156 Class that simulates a dictionary. The instance's contents are kept in a
1157 regular dictionary, which is accessible via the :attr:`data` attribute of
1158 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
1159 initialized with its contents; note that a reference to *initialdata* will not
1160 be kept, allowing it be used for other purposes.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001161
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001162 In addition to supporting the methods and operations of mappings,
1163 :class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001164
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001165 .. attribute:: data
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001166
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001167 A real dictionary used to store the contents of the :class:`UserDict`
1168 class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001169
1170
1171
1172:class:`UserList` objects
1173-------------------------
1174
1175This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +00001176for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001177existing methods or add new ones. In this way, one can add new behaviors to
1178lists.
1179
Georg Brandl48310cd2009-01-03 21:18:54 +00001180The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001181subclass directly from :class:`list`; however, this class can be easier
1182to work with because the underlying list is accessible as an attribute.
1183
1184.. class:: UserList([list])
1185
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001186 Class that simulates a list. The instance's contents are kept in a regular
1187 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
1188 instances. The instance's contents are initially set to a copy of *list*,
1189 defaulting to the empty list ``[]``. *list* can be any iterable, for
1190 example a real Python list or a :class:`UserList` object.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001191
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001192 In addition to supporting the methods and operations of mutable sequences,
1193 :class:`UserList` instances provide the following attribute:
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001194
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001195 .. attribute:: data
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001196
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001197 A real :class:`list` object used to store the contents of the
1198 :class:`UserList` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001199
Zachary Ware80602e02014-01-13 20:38:57 -06001200**Subclassing requirements:** Subclasses of :class:`UserList` are expected to
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001201offer a constructor which can be called with either no arguments or one
1202argument. List operations which return a new sequence attempt to create an
1203instance of the actual implementation class. To do so, it assumes that the
1204constructor can be called with a single parameter, which is a sequence object
1205used as a data source.
1206
1207If a derived class does not wish to comply with this requirement, all of the
1208special methods supported by this class will need to be overridden; please
1209consult the sources for information about the methods which need to be provided
1210in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001211
1212:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +00001213---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001214
Georg Brandl48310cd2009-01-03 21:18:54 +00001215The class, :class:`UserString` acts as a wrapper around string objects.
1216The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001217subclass directly from :class:`str`; however, this class can be easier
1218to work with because the underlying string is accessible as an
1219attribute.
1220
Michael Seiferte1052942018-03-26 13:40:35 +02001221.. class:: UserString(seq)
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001222
Michael Seiferte1052942018-03-26 13:40:35 +02001223 Class that simulates a string object. The instance's
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001224 content is kept in a regular string object, which is accessible via the
1225 :attr:`data` attribute of :class:`UserString` instances. The instance's
Michael Seiferte1052942018-03-26 13:40:35 +02001226 contents are initially set to a copy of *seq*. The *seq* argument can
1227 be any object which can be converted into a string using the built-in
1228 :func:`str` function.
1229
1230 In addition to supporting the methods and operations of strings,
1231 :class:`UserString` instances provide the following attribute:
1232
1233 .. attribute:: data
1234
1235 A real :class:`str` object used to store the contents of the
1236 :class:`UserString` class.
Yury Selivanov336b37b2015-09-09 12:23:01 -04001237
1238 .. versionchanged:: 3.5
1239 New methods ``__getnewargs__``, ``__rmod__``, ``casefold``,
1240 ``format_map``, ``isprintable``, and ``maketrans``.