blob: 9f3f5dc8ff9f08ed5db31762542dbdc2224ffa5a [file] [log] [blame]
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001:mod:`collections` --- Container datatypes
2==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00003
4.. module:: collections
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07005 :synopsis: Container datatypes
Terry Jan Reedyfa089b92016-06-11 15:02:54 -04006
Georg Brandl116aa622007-08-15 14:28:22 +00007.. moduleauthor:: Raymond Hettinger <python@rcn.com>
8.. sectionauthor:: Raymond Hettinger <python@rcn.com>
9
Terry Jan Reedyfa089b92016-06-11 15:02:54 -040010**Source code:** :source:`Lib/collections/__init__.py`
11
Christian Heimesfe337bf2008-03-23 21:54:12 +000012.. testsetup:: *
13
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070014 from collections import *
15 import itertools
16 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000017
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000018--------------
19
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000020This module implements specialized container datatypes providing alternatives to
21Python's general purpose built-in containers, :class:`dict`, :class:`list`,
22:class:`set`, and :class:`tuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000023
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000024===================== ====================================================================
25:func:`namedtuple` factory function for creating tuple subclasses with named fields
26:class:`deque` list-like container with fast appends and pops on either end
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000027:class:`ChainMap` dict-like class for creating a single view of multiple mappings
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000028:class:`Counter` dict subclass for counting hashable objects
29:class:`OrderedDict` dict subclass that remembers the order entries were added
30:class:`defaultdict` dict subclass that calls a factory function to supply missing values
31:class:`UserDict` wrapper around dictionary objects for easier dict subclassing
32:class:`UserList` wrapper around list objects for easier list subclassing
33:class:`UserString` wrapper around string objects for easier string subclassing
34===================== ====================================================================
Georg Brandl116aa622007-08-15 14:28:22 +000035
Raymond Hettinger158c9c22011-02-22 00:41:50 +000036.. versionchanged:: 3.3
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070037 Moved :ref:`collections-abstract-base-classes` to the :mod:`collections.abc` module.
38 For backwards compatibility, they continue to be visible in this module
39 as well.
Mark Summerfield08898b42007-09-05 08:43:04 +000040
41
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000042:class:`ChainMap` objects
43-------------------------
44
Georg Brandl283b96b2012-04-03 09:16:46 +020045.. versionadded:: 3.3
46
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000047A :class:`ChainMap` class is provided for quickly linking a number of mappings
48so they can be treated as a single unit. It is often much faster than creating
49a new dictionary and running multiple :meth:`~dict.update` calls.
50
51The class can be used to simulate nested scopes and is useful in templating.
52
53.. class:: ChainMap(*maps)
54
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070055 A :class:`ChainMap` groups multiple dicts or other mappings together to
56 create a single, updateable view. If no *maps* are specified, a single empty
57 dictionary is provided so that a new chain always has at least one mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000058
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070059 The underlying mappings are stored in a list. That list is public and can
Martin Panter8d56c022016-05-29 04:13:35 +000060 be accessed or updated using the *maps* attribute. There is no other state.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000061
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070062 Lookups search the underlying mappings successively until a key is found. In
63 contrast, writes, updates, and deletions only operate on the first mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000064
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070065 A :class:`ChainMap` incorporates the underlying mappings by reference. So, if
66 one of the underlying mappings gets updated, those changes will be reflected
67 in :class:`ChainMap`.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000068
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070069 All of the usual dictionary methods are supported. In addition, there is a
70 *maps* attribute, a method for creating new subcontexts, and a property for
71 accessing all but the first mapping:
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000072
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070073 .. attribute:: maps
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000074
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070075 A user updateable list of mappings. The list is ordered from
76 first-searched to last-searched. It is the only stored state and can
77 be modified to change which mappings are searched. The list should
78 always contain at least one mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000079
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000080 .. method:: new_child(m=None)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000081
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000082 Returns a new :class:`ChainMap` containing a new map followed by
83 all of the maps in the current instance. If ``m`` is specified,
84 it becomes the new map at the front of the list of mappings; if not
85 specified, an empty dict is used, so that a call to ``d.new_child()``
86 is equivalent to: ``ChainMap({}, *d.maps)``. This method is used for
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070087 creating subcontexts that can be updated without altering values in any
88 of the parent mappings.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000089
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000090 .. versionchanged:: 3.4
91 The optional ``m`` parameter was added.
92
Raymond Hettinger2a61c452012-07-15 22:37:20 -070093 .. attribute:: parents
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000094
Raymond Hettingerb22ba042012-07-16 02:07:41 -070095 Property returning a new :class:`ChainMap` containing all of the maps in
Raymond Hettinger2a61c452012-07-15 22:37:20 -070096 the current instance except the first one. This is useful for skipping
97 the first map in the search. Use cases are similar to those for the
98 :keyword:`nonlocal` keyword used in :term:`nested scopes <nested
99 scope>`. The use cases also parallel those for the built-in
100 :func:`super` function. A reference to ``d.parents`` is equivalent to:
101 ``ChainMap(*d.maps[1:])``.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000102
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700103
104.. seealso::
105
106 * The `MultiContext class
Sandro Tosiea475302012-08-12 10:37:23 +0200107 <https://github.com/enthought/codetools/blob/4.0.0/codetools/contexts/multi_context.py>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700108 in the Enthought `CodeTools package
109 <https://github.com/enthought/codetools>`_ has options to support
110 writing to any mapping in the chain.
111
112 * Django's `Context class
Georg Brandl525d3552014-10-29 10:26:56 +0100113 <https://github.com/django/django/blob/master/django/template/context.py>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700114 for templating is a read-only chain of mappings. It also features
115 pushing and popping of contexts similar to the
116 :meth:`~collections.ChainMap.new_child` method and the
117 :meth:`~collections.ChainMap.parents` property.
118
119 * The `Nested Contexts recipe
Serhiy Storchaka6dff0202016-05-07 10:49:07 +0300120 <https://code.activestate.com/recipes/577434/>`_ has options to control
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700121 whether writes and other mutations apply only to the first mapping or to
122 any mapping in the chain.
123
124 * A `greatly simplified read-only version of Chainmap
Serhiy Storchaka6dff0202016-05-07 10:49:07 +0300125 <https://code.activestate.com/recipes/305268/>`_.
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700126
127
128:class:`ChainMap` Examples and Recipes
129^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
130
131This section shows various approaches to working with chained maps.
132
133
134Example of simulating Python's internal lookup chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000135
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700136 import builtins
137 pylookup = ChainMap(locals(), globals(), vars(builtins))
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000138
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700139Example of letting user specified command-line arguments take precedence over
140environment variables which in turn take precedence over default values::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000141
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700142 import os, argparse
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700143
144 defaults = {'color': 'red', 'user': 'guest'}
145
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700146 parser = argparse.ArgumentParser()
147 parser.add_argument('-u', '--user')
148 parser.add_argument('-c', '--color')
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700149 namespace = parser.parse_args()
150 command_line_args = {k:v for k, v in vars(namespace).items() if v}
151
152 combined = ChainMap(command_line_args, os.environ, defaults)
153 print(combined['color'])
154 print(combined['user'])
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000155
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700156Example patterns for using the :class:`ChainMap` class to simulate nested
157contexts::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000158
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700159 c = ChainMap() # Create root context
160 d = c.new_child() # Create nested child context
161 e = c.new_child() # Child of c, independent from d
162 e.maps[0] # Current context dictionary -- like Python's locals()
163 e.maps[-1] # Root context -- like Python's globals()
164 e.parents # Enclosing context chain -- like Python's nonlocals
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000165
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700166 d['x'] # Get first key in the chain of contexts
167 d['x'] = 1 # Set value in current context
Andrew Svetlov1a8db9c2012-10-04 19:29:25 +0300168 del d['x'] # Delete from current context
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700169 list(d) # All nested values
170 k in d # Check all nested values
171 len(d) # Number of nested values
172 d.items() # All nested items
173 dict(d) # Flatten into a regular dictionary
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000174
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700175The :class:`ChainMap` class only makes updates (writes and deletions) to the
176first mapping in the chain while lookups will search the full chain. However,
177if deep writes and deletions are desired, it is easy to make a subclass that
178updates keys found deeper in the chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000179
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700180 class DeepChainMap(ChainMap):
181 'Variant of ChainMap that allows direct updates to inner scopes'
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000182
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700183 def __setitem__(self, key, value):
184 for mapping in self.maps:
185 if key in mapping:
186 mapping[key] = value
187 return
188 self.maps[0][key] = value
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000189
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700190 def __delitem__(self, key):
191 for mapping in self.maps:
192 if key in mapping:
193 del mapping[key]
194 return
195 raise KeyError(key)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000196
Serhiy Storchakaf47036c2013-12-24 11:04:36 +0200197 >>> d = DeepChainMap({'zebra': 'black'}, {'elephant': 'blue'}, {'lion': 'yellow'})
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700198 >>> d['lion'] = 'orange' # update an existing key two levels down
199 >>> d['snake'] = 'red' # new keys get added to the topmost dict
200 >>> del d['elephant'] # remove an existing key one level down
201 DeepChainMap({'zebra': 'black', 'snake': 'red'}, {}, {'lion': 'orange'})
Georg Brandl4dcf4742012-03-08 20:35:08 +0100202
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000203
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000204:class:`Counter` objects
205------------------------
206
207A counter tool is provided to support convenient and rapid tallies.
208For example::
209
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000210 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000211 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000212 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000213 ... cnt[word] += 1
214 >>> cnt
215 Counter({'blue': 3, 'red': 2, 'green': 1})
216
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000217 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000218 >>> import re
Raymond Hettingerfaaba592013-03-01 03:30:20 -0800219 >>> words = re.findall(r'\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000220 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000221 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
222 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
223
224.. class:: Counter([iterable-or-mapping])
225
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700226 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
227 It is an unordered collection where elements are stored as dictionary keys
228 and their counts are stored as dictionary values. Counts are allowed to be
229 any integer value including zero or negative counts. The :class:`Counter`
230 class is similar to bags or multisets in other languages.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000231
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700232 Elements are counted from an *iterable* or initialized from another
233 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000234
Raymond Hettinger73662a52009-01-27 02:38:22 +0000235 >>> c = Counter() # a new, empty counter
236 >>> c = Counter('gallahad') # a new counter from an iterable
237 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
238 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000239
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700240 Counter objects have a dictionary interface except that they return a zero
241 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000242
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000243 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000244 >>> c['bacon'] # count of a missing element is zero
245 0
246
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700247 Setting a count to zero does not remove an element from a counter.
248 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000249
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000250 >>> c['sausage'] = 0 # counter entry with a zero count
251 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000252
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700253 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000254
255
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700256 Counter objects support three methods beyond those available for all
257 dictionaries:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000258
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700259 .. method:: elements()
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000260
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700261 Return an iterator over elements repeating each as many times as its
262 count. Elements are returned in arbitrary order. If an element's count
263 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000264
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000265 >>> c = Counter(a=4, b=2, c=0, d=-2)
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500266 >>> sorted(c.elements())
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000267 ['a', 'a', 'a', 'a', 'b', 'b']
268
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700269 .. method:: most_common([n])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000270
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700271 Return a list of the *n* most common elements and their counts from the
Raymond Hettingera3780252015-05-13 02:47:57 -0700272 most common to the least. If *n* is omitted or ``None``,
Raymond Hettinger3afdb282015-05-13 14:39:04 -0700273 :func:`most_common` returns *all* elements in the counter.
274 Elements with equal counts are ordered arbitrarily:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000275
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500276 >>> Counter('abracadabra').most_common(3) # doctest: +SKIP
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000277 [('a', 5), ('r', 2), ('b', 2)]
278
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700279 .. method:: subtract([iterable-or-mapping])
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000280
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700281 Elements are subtracted from an *iterable* or from another *mapping*
282 (or counter). Like :meth:`dict.update` but subtracts counts instead
283 of replacing them. Both inputs and outputs may be zero or negative.
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000284
285 >>> c = Counter(a=4, b=2, c=0, d=-2)
286 >>> d = Counter(a=1, b=2, c=3, d=4)
287 >>> c.subtract(d)
Andrew Svetlovf6351722012-12-17 14:01:16 +0200288 >>> c
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000289 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
290
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700291 .. versionadded:: 3.2
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000292
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700293 The usual dictionary methods are available for :class:`Counter` objects
294 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000295
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700296 .. method:: fromkeys(iterable)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000297
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700298 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000299
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700300 .. method:: update([iterable-or-mapping])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000301
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700302 Elements are counted from an *iterable* or added-in from another
303 *mapping* (or counter). Like :meth:`dict.update` but adds counts
304 instead of replacing them. Also, the *iterable* is expected to be a
305 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000306
307Common patterns for working with :class:`Counter` objects::
308
Raymond Hettinger73662a52009-01-27 02:38:22 +0000309 sum(c.values()) # total of all counts
310 c.clear() # reset all counts
311 list(c) # list unique elements
312 set(c) # convert to a set
313 dict(c) # convert to a regular dictionary
314 c.items() # convert to a list of (elem, cnt) pairs
315 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
Georg Brandl87f3d7b2013-10-06 12:36:39 +0200316 c.most_common()[:-n-1:-1] # n least common elements
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700317 +c # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000318
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000319Several mathematical operations are provided for combining :class:`Counter`
320objects to produce multisets (counters that have counts greater than zero).
321Addition and subtraction combine counters by adding or subtracting the counts
322of corresponding elements. Intersection and union return the minimum and
323maximum of corresponding counts. Each operation can accept inputs with signed
324counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000325
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000326 >>> c = Counter(a=3, b=1)
327 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000328 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000329 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000330 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000331 Counter({'a': 2})
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500332 >>> c & d # intersection: min(c[x], d[x]) # doctest: +SKIP
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000333 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000334 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000335 Counter({'a': 3, 'b': 2})
336
Berker Peksag315e1042015-05-19 01:36:55 +0300337Unary addition and subtraction are shortcuts for adding an empty counter
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700338or subtracting from an empty counter.
339
340 >>> c = Counter(a=2, b=-4)
341 >>> +c
342 Counter({'a': 2})
343 >>> -c
344 Counter({'b': 4})
345
346.. versionadded:: 3.3
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700347 Added support for unary plus, unary minus, and in-place multiset operations.
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700348
Raymond Hettinger22f18852010-04-12 21:45:14 +0000349.. note::
350
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700351 Counters were primarily designed to work with positive integers to represent
352 running counts; however, care was taken to not unnecessarily preclude use
353 cases needing other types or negative values. To help with those use cases,
354 this section documents the minimum range and type restrictions.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000355
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700356 * The :class:`Counter` class itself is a dictionary subclass with no
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200357 restrictions on its keys and values. The values are intended to be numbers
358 representing counts, but you *could* store anything in the value field.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000359
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700360 * The :meth:`most_common` method requires only that the values be orderable.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000361
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700362 * For in-place operations such as ``c[key] += 1``, the value type need only
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200363 support addition and subtraction. So fractions, floats, and decimals would
364 work and negative values are supported. The same is also true for
365 :meth:`update` and :meth:`subtract` which allow negative and zero values
366 for both inputs and outputs.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000367
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700368 * The multiset methods are designed only for use cases with positive values.
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200369 The inputs may be negative or zero, but only outputs with positive values
370 are created. There are no type restrictions, but the value type needs to
371 support addition, subtraction, and comparison.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000372
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700373 * The :meth:`elements` method requires integer counts. It ignores zero and
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200374 negative counts.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000375
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000376.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000377
Georg Brandl5d941342016-02-26 19:37:12 +0100378 * `Bag class <https://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700379 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000380
Georg Brandl5d941342016-02-26 19:37:12 +0100381 * Wikipedia entry for `Multisets <https://en.wikipedia.org/wiki/Multiset>`_.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000382
Georg Brandl5d941342016-02-26 19:37:12 +0100383 * `C++ multisets <http://www.java2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700384 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000385
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000386 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700387 *Knuth, Donald. The Art of Computer Programming Volume II,
388 Section 4.6.3, Exercise 19*.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000389
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000390 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettinger855482e2015-05-23 08:57:58 -0700391 elements, see :func:`itertools.combinations_with_replacement`:
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000392
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700393 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000394
395
Georg Brandl116aa622007-08-15 14:28:22 +0000396:class:`deque` objects
397----------------------
398
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000399.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000400
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700401 Returns a new deque object initialized left-to-right (using :meth:`append`) with
402 data from *iterable*. If *iterable* is not specified, the new deque is empty.
Georg Brandl116aa622007-08-15 14:28:22 +0000403
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700404 Deques are a generalization of stacks and queues (the name is pronounced "deck"
405 and is short for "double-ended queue"). Deques support thread-safe, memory
406 efficient appends and pops from either side of the deque with approximately the
407 same O(1) performance in either direction.
Georg Brandl116aa622007-08-15 14:28:22 +0000408
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700409 Though :class:`list` objects support similar operations, they are optimized for
410 fast fixed-length operations and incur O(n) memory movement costs for
411 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
412 position of the underlying data representation.
Georg Brandl116aa622007-08-15 14:28:22 +0000413
Georg Brandl116aa622007-08-15 14:28:22 +0000414
Serhiy Storchakaecf41da2016-10-19 16:29:26 +0300415 If *maxlen* is not specified or is ``None``, deques may grow to an
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700416 arbitrary length. Otherwise, the deque is bounded to the specified maximum
417 length. Once a bounded length deque is full, when new items are added, a
418 corresponding number of items are discarded from the opposite end. Bounded
419 length deques provide functionality similar to the ``tail`` filter in
420 Unix. They are also useful for tracking transactions and other pools of data
421 where only the most recent activity is of interest.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000422
Georg Brandl9afde1c2007-11-01 20:32:30 +0000423
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700424 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000425
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700426 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000427
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700428 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000429
430
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700431 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000432
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700433 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000434
435
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700436 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000437
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700438 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000439
440
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700441 .. method:: copy()
442
443 Create a shallow copy of the deque.
444
445 .. versionadded:: 3.5
446
447
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700448 .. method:: count(x)
Raymond Hettinger44459de2010-04-03 23:20:46 +0000449
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700450 Count the number of deque elements equal to *x*.
Raymond Hettinger44459de2010-04-03 23:20:46 +0000451
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700452 .. versionadded:: 3.2
Raymond Hettinger44459de2010-04-03 23:20:46 +0000453
Georg Brandl67b21b72010-08-17 15:07:14 +0000454
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700455 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000456
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700457 Extend the right side of the deque by appending elements from the iterable
458 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000459
460
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700461 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000462
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700463 Extend the left side of the deque by appending elements from *iterable*.
464 Note, the series of left appends results in reversing the order of
465 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000466
467
Raymond Hettinger855482e2015-05-23 08:57:58 -0700468 .. method:: index(x[, start[, stop]])
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700469
Raymond Hettinger855482e2015-05-23 08:57:58 -0700470 Return the position of *x* in the deque (at or after index *start*
471 and before index *stop*). Returns the first match or raises
472 :exc:`ValueError` if not found.
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700473
474 .. versionadded:: 3.5
475
476
477 .. method:: insert(i, x)
478
479 Insert *x* into the deque at position *i*.
480
Raymond Hettingera6389712016-02-01 21:21:19 -0800481 If the insertion would cause a bounded deque to grow beyond *maxlen*,
482 an :exc:`IndexError` is raised.
Raymond Hettinger37434322016-01-26 21:44:16 -0800483
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700484 .. versionadded:: 3.5
485
486
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700487 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000488
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700489 Remove and return an element from the right side of the deque. If no
490 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000491
492
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700493 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000494
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700495 Remove and return an element from the left side of the deque. If no
496 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000497
498
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700499 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000500
Raymond Hettinger855482e2015-05-23 08:57:58 -0700501 Remove the first occurrence of *value*. If not found, raises a
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700502 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000503
Georg Brandl67b21b72010-08-17 15:07:14 +0000504
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700505 .. method:: reverse()
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000506
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700507 Reverse the elements of the deque in-place and then return ``None``.
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000508
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700509 .. versionadded:: 3.2
Georg Brandl116aa622007-08-15 14:28:22 +0000510
Georg Brandl67b21b72010-08-17 15:07:14 +0000511
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700512 .. method:: rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000513
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700514 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
515 the left. Rotating one step to the right is equivalent to:
516 ``d.appendleft(d.pop())``.
Benjamin Petersone41251e2008-04-25 01:59:09 +0000517
Georg Brandl116aa622007-08-15 14:28:22 +0000518
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700519 Deque objects also provide one read-only attribute:
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000520
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700521 .. attribute:: maxlen
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000522
Serhiy Storchakaecf41da2016-10-19 16:29:26 +0300523 Maximum size of a deque or ``None`` if unbounded.
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000524
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700525 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000526
527
Georg Brandl116aa622007-08-15 14:28:22 +0000528In addition to the above, deques support iteration, pickling, ``len(d)``,
529``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson206e3072008-10-19 14:07:49 +0000530the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
531access is O(1) at both ends but slows to O(n) in the middle. For fast random
532access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000533
Raymond Hettinger41290a62015-03-31 08:12:23 -0700534Starting in version 3.5, deques support ``__add__()``, ``__mul__()``,
535and ``__imul__()``.
536
Christian Heimesfe337bf2008-03-23 21:54:12 +0000537Example:
538
539.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000540
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700541 >>> from collections import deque
542 >>> d = deque('ghi') # make a new deque with three items
543 >>> for elem in d: # iterate over the deque's elements
544 ... print(elem.upper())
545 G
546 H
547 I
Georg Brandl116aa622007-08-15 14:28:22 +0000548
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700549 >>> d.append('j') # add a new entry to the right side
550 >>> d.appendleft('f') # add a new entry to the left side
551 >>> d # show the representation of the deque
552 deque(['f', 'g', 'h', 'i', 'j'])
Georg Brandl116aa622007-08-15 14:28:22 +0000553
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700554 >>> d.pop() # return and remove the rightmost item
555 'j'
556 >>> d.popleft() # return and remove the leftmost item
557 'f'
558 >>> list(d) # list the contents of the deque
559 ['g', 'h', 'i']
560 >>> d[0] # peek at leftmost item
561 'g'
562 >>> d[-1] # peek at rightmost item
563 'i'
Georg Brandl116aa622007-08-15 14:28:22 +0000564
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700565 >>> list(reversed(d)) # list the contents of a deque in reverse
566 ['i', 'h', 'g']
567 >>> 'h' in d # search the deque
568 True
569 >>> d.extend('jkl') # add multiple elements at once
570 >>> d
571 deque(['g', 'h', 'i', 'j', 'k', 'l'])
572 >>> d.rotate(1) # right rotation
573 >>> d
574 deque(['l', 'g', 'h', 'i', 'j', 'k'])
575 >>> d.rotate(-1) # left rotation
576 >>> d
577 deque(['g', 'h', 'i', 'j', 'k', 'l'])
Georg Brandl116aa622007-08-15 14:28:22 +0000578
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700579 >>> deque(reversed(d)) # make a new deque in reverse order
580 deque(['l', 'k', 'j', 'i', 'h', 'g'])
581 >>> d.clear() # empty the deque
582 >>> d.pop() # cannot pop from an empty deque
583 Traceback (most recent call last):
584 File "<pyshell#6>", line 1, in -toplevel-
585 d.pop()
586 IndexError: pop from an empty deque
Georg Brandl116aa622007-08-15 14:28:22 +0000587
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700588 >>> d.extendleft('abc') # extendleft() reverses the input order
589 >>> d
590 deque(['c', 'b', 'a'])
Georg Brandl116aa622007-08-15 14:28:22 +0000591
592
Georg Brandl9afde1c2007-11-01 20:32:30 +0000593:class:`deque` Recipes
594^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000595
596This section shows various approaches to working with deques.
597
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000598Bounded length deques provide functionality similar to the ``tail`` filter
599in Unix::
600
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700601 def tail(filename, n=10):
602 'Return the last n lines of a file'
603 with open(filename) as f:
604 return deque(f, n)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000605
606Another approach to using deques is to maintain a sequence of recently
607added elements by appending to the right and popping to the left::
608
609 def moving_average(iterable, n=3):
610 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
611 # http://en.wikipedia.org/wiki/Moving_average
612 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000613 d = deque(itertools.islice(it, n-1))
614 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000615 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000616 for elem in it:
617 s += elem - d.popleft()
618 d.append(elem)
619 yield s / n
620
Georg Brandl116aa622007-08-15 14:28:22 +0000621The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti0639d5a2009-12-19 23:26:38 +0000622deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl116aa622007-08-15 14:28:22 +0000623the :meth:`rotate` method to position elements to be popped::
624
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700625 def delete_nth(d, n):
626 d.rotate(-n)
627 d.popleft()
628 d.rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000629
630To implement :class:`deque` slicing, use a similar approach applying
631:meth:`rotate` to bring a target element to the left side of the deque. Remove
632old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
633reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000634With minor variations on that approach, it is easy to implement Forth style
635stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
636``rot``, and ``roll``.
637
Georg Brandl116aa622007-08-15 14:28:22 +0000638
639:class:`defaultdict` objects
640----------------------------
641
Georg Brandl116aa622007-08-15 14:28:22 +0000642.. class:: defaultdict([default_factory[, ...]])
643
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700644 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
645 built-in :class:`dict` class. It overrides one method and adds one writable
646 instance variable. The remaining functionality is the same as for the
647 :class:`dict` class and is not documented here.
Georg Brandl116aa622007-08-15 14:28:22 +0000648
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700649 The first argument provides the initial value for the :attr:`default_factory`
650 attribute; it defaults to ``None``. All remaining arguments are treated the same
651 as if they were passed to the :class:`dict` constructor, including keyword
652 arguments.
Georg Brandl116aa622007-08-15 14:28:22 +0000653
Georg Brandl116aa622007-08-15 14:28:22 +0000654
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700655 :class:`defaultdict` objects support the following method in addition to the
656 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000657
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700658 .. method:: __missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000659
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700660 If the :attr:`default_factory` attribute is ``None``, this raises a
661 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000662
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700663 If :attr:`default_factory` is not ``None``, it is called without arguments
664 to provide a default value for the given *key*, this value is inserted in
665 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000666
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700667 If calling :attr:`default_factory` raises an exception this exception is
668 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000669
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700670 This method is called by the :meth:`__getitem__` method of the
671 :class:`dict` class when the requested key is not found; whatever it
672 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000673
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700674 Note that :meth:`__missing__` is *not* called for any operations besides
675 :meth:`__getitem__`. This means that :meth:`get` will, like normal
676 dictionaries, return ``None`` as a default rather than using
677 :attr:`default_factory`.
Benjamin Peterson871b9d12012-01-27 09:14:01 -0500678
Georg Brandl116aa622007-08-15 14:28:22 +0000679
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700680 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000681
Benjamin Petersone41251e2008-04-25 01:59:09 +0000682
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700683 .. attribute:: default_factory
Benjamin Petersone41251e2008-04-25 01:59:09 +0000684
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700685 This attribute is used by the :meth:`__missing__` method; it is
686 initialized from the first argument to the constructor, if present, or to
687 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000688
689
Georg Brandl116aa622007-08-15 14:28:22 +0000690:class:`defaultdict` Examples
691^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
692
693Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000694sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000695
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700696 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
697 >>> d = defaultdict(list)
698 >>> for k, v in s:
699 ... d[k].append(v)
700 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500701 >>> sorted(d.items())
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700702 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
Georg Brandl116aa622007-08-15 14:28:22 +0000703
704When each key is encountered for the first time, it is not already in the
705mapping; so an entry is automatically created using the :attr:`default_factory`
706function which returns an empty :class:`list`. The :meth:`list.append`
707operation then attaches the value to the new list. When keys are encountered
708again, the look-up proceeds normally (returning the list for that key) and the
709:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000710simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000711
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700712 >>> d = {}
713 >>> for k, v in s:
714 ... d.setdefault(k, []).append(v)
715 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500716 >>> sorted(d.items())
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700717 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
Georg Brandl116aa622007-08-15 14:28:22 +0000718
719Setting the :attr:`default_factory` to :class:`int` makes the
720:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000721languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000722
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700723 >>> s = 'mississippi'
724 >>> d = defaultdict(int)
725 >>> for k in s:
726 ... d[k] += 1
727 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500728 >>> sorted(d.items())
729 [('i', 4), ('m', 1), ('p', 2), ('s', 4)]
Georg Brandl116aa622007-08-15 14:28:22 +0000730
731When a letter is first encountered, it is missing from the mapping, so the
732:attr:`default_factory` function calls :func:`int` to supply a default count of
733zero. The increment operation then builds up the count for each letter.
734
735The function :func:`int` which always returns zero is just a special case of
736constant functions. A faster and more flexible way to create constant functions
737is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000738zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000739
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700740 >>> def constant_factory(value):
741 ... return lambda: value
742 >>> d = defaultdict(constant_factory('<missing>'))
743 >>> d.update(name='John', action='ran')
744 >>> '%(name)s %(action)s to %(object)s' % d
745 'John ran to <missing>'
Georg Brandl116aa622007-08-15 14:28:22 +0000746
747Setting the :attr:`default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000748:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000749
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700750 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
751 >>> d = defaultdict(set)
752 >>> for k, v in s:
753 ... d[k].add(v)
754 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500755 >>> sorted(d.items())
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700756 [('blue', {2, 4}), ('red', {1, 3})]
Georg Brandl116aa622007-08-15 14:28:22 +0000757
758
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000759:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000760----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000761
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000762Named tuples assign meaning to each position in a tuple and allow for more readable,
763self-documenting code. They can be used wherever regular tuples are used, and
764they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000765
Raymond Hettinger0d5048c2016-09-12 00:18:31 -0700766.. function:: namedtuple(typename, field_names, *, verbose=False, rename=False, module=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000767
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700768 Returns a new tuple subclass named *typename*. The new subclass is used to
769 create tuple-like objects that have fields accessible by attribute lookup as
770 well as being indexable and iterable. Instances of the subclass also have a
771 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
772 method which lists the tuple contents in a ``name=value`` format.
Georg Brandl116aa622007-08-15 14:28:22 +0000773
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700774 The *field_names* are a single string with each fieldname separated by whitespace
775 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
776 can be a sequence of strings such as ``['x', 'y']``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000777
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700778 Any valid Python identifier may be used for a fieldname except for names
779 starting with an underscore. Valid identifiers consist of letters, digits,
780 and underscores but do not start with a digit or underscore and cannot be
781 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
782 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000783
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700784 If *rename* is true, invalid fieldnames are automatically replaced
785 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
786 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
787 ``def`` and the duplicate fieldname ``abc``.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000788
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700789 If *verbose* is true, the class definition is printed after it is
790 built. This option is outdated; instead, it is simpler to print the
791 :attr:`_source` attribute.
Georg Brandl116aa622007-08-15 14:28:22 +0000792
Raymond Hettinger0d5048c2016-09-12 00:18:31 -0700793 If *module* is defined, the ``__module__`` attribute of the named tuple is
794 set to that value.
795
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700796 Named tuple instances do not have per-instance dictionaries, so they are
797 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000798
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700799 .. versionchanged:: 3.1
Raymond Hettinger6538b432016-08-16 10:55:43 -0700800 Added support for *rename*.
801
802 .. versionchanged:: 3.6
803 The *verbose* and *rename* parameters became
804 :ref:`keyword-only arguments <keyword-only_parameter>`.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000805
Raymond Hettinger0d5048c2016-09-12 00:18:31 -0700806 .. versionchanged:: 3.6
807 Added the *module* parameter.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000808
809.. doctest::
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700810 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000811
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700812 >>> # Basic example
813 >>> Point = namedtuple('Point', ['x', 'y'])
814 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
815 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
816 33
817 >>> x, y = p # unpack like a regular tuple
818 >>> x, y
819 (11, 22)
820 >>> p.x + p.y # fields also accessible by name
821 33
822 >>> p # readable __repr__ with a name=value style
823 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000824
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000825Named tuples are especially useful for assigning field names to result tuples returned
826by the :mod:`csv` or :mod:`sqlite3` modules::
827
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700828 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000829
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700830 import csv
831 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
832 print(emp.name, emp.title)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000833
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700834 import sqlite3
835 conn = sqlite3.connect('/companydata')
836 cursor = conn.cursor()
837 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
838 for emp in map(EmployeeRecord._make, cursor.fetchall()):
839 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000840
Christian Heimes99170a52007-12-19 02:07:34 +0000841In addition to the methods inherited from tuples, named tuples support
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700842three additional methods and two attributes. To prevent conflicts with
Christian Heimes2380ac72008-01-09 00:17:24 +0000843field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000844
Benjamin Peterson0b9fb802010-07-18 14:23:36 +0000845.. classmethod:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000846
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700847 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000848
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700849 .. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000850
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700851 >>> t = [11, 22]
852 >>> Point._make(t)
853 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000854
Christian Heimes790c8232008-01-07 21:14:23 +0000855.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000856
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700857 Return a new :class:`OrderedDict` which maps field names to their corresponding
Raymond Hettingerfd27f622016-08-16 13:13:17 -0700858 values:
859
860 .. doctest::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000861
Raymond Hettinger7a3602e2015-08-30 09:13:48 -0700862 >>> p = Point(x=11, y=22)
863 >>> p._asdict()
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700864 OrderedDict([('x', 11), ('y', 22)])
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000865
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700866 .. versionchanged:: 3.1
867 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000868
Christian Heimes790c8232008-01-07 21:14:23 +0000869.. method:: somenamedtuple._replace(kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000870
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700871 Return a new instance of the named tuple replacing specified fields with new
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700872 values::
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000873
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700874 >>> p = Point(x=11, y=22)
875 >>> p._replace(x=33)
876 Point(x=33, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000877
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700878 >>> for partnum, record in inventory.items():
879 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000880
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700881.. attribute:: somenamedtuple._source
882
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700883 A string with the pure Python source code used to create the named
884 tuple class. The source makes the named tuple self-documenting.
Raymond Hettinger163e9822013-05-18 00:05:20 -0700885 It can be printed, executed using :func:`exec`, or saved to a file
886 and imported.
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700887
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700888 .. versionadded:: 3.3
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700889
Christian Heimes790c8232008-01-07 21:14:23 +0000890.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000891
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700892 Tuple of strings listing the field names. Useful for introspection
893 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000894
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700895 .. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000896
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700897 >>> p._fields # view the field names
898 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000899
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700900 >>> Color = namedtuple('Color', 'red green blue')
901 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
902 >>> Pixel(11, 22, 128, 255, 0)
903 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000904
Christian Heimes0449f632007-12-15 01:27:15 +0000905To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000906function:
Christian Heimes0449f632007-12-15 01:27:15 +0000907
908 >>> getattr(p, 'x')
909 11
910
Raymond Hettinger651453a2009-02-11 00:20:02 +0000911To convert a dictionary to a named tuple, use the double-star-operator
912(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000913
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700914 >>> d = {'x': 11, 'y': 22}
915 >>> Point(**d)
916 Point(x=11, y=22)
Christian Heimes99170a52007-12-19 02:07:34 +0000917
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000918Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000919functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000920a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000921
Raymond Hettingerfd27f622016-08-16 13:13:17 -0700922.. doctest::
923
924 >>> class Point(namedtuple('Point', ['x', 'y'])):
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500925 ... __slots__ = ()
926 ... @property
927 ... def hypot(self):
928 ... return (self.x ** 2 + self.y ** 2) ** 0.5
929 ... def __str__(self):
930 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000931
Georg Brandl0df79792008-10-04 18:33:26 +0000932 >>> for p in Point(3, 4), Point(14, 5/7):
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500933 ... print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000934 Point: x= 3.000 y= 4.000 hypot= 5.000
935 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000936
Georg Brandlaf5c2382009-12-28 08:02:38 +0000937The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +0000938keep memory requirements low by preventing the creation of instance dictionaries.
939
Christian Heimes2380ac72008-01-09 00:17:24 +0000940Subclassing is not useful for adding new, stored fields. Instead, simply
Christian Heimesfe337bf2008-03-23 21:54:12 +0000941create a new named tuple type from the :attr:`_fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +0000942
Christian Heimes25bb7832008-01-11 16:17:00 +0000943 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +0000944
Raymond Hettingereac503a2015-05-13 01:09:59 -0700945Docstrings can be customized by making direct assignments to the ``__doc__``
946fields:
947
948 >>> Book = namedtuple('Book', ['id', 'title', 'authors'])
Raymond Hettinger850be0f2015-11-09 08:24:53 -0800949 >>> Book.__doc__ += ': Hardcover book in active collection'
Berker Peksagde7cafa2015-05-13 12:16:27 +0300950 >>> Book.id.__doc__ = '13-digit ISBN'
951 >>> Book.title.__doc__ = 'Title of first printing'
Raymond Hettinger850be0f2015-11-09 08:24:53 -0800952 >>> Book.authors.__doc__ = 'List of authors sorted by last name'
Raymond Hettingereac503a2015-05-13 01:09:59 -0700953
Raymond Hettinger6e701312015-11-23 22:18:55 -0800954.. versionchanged:: 3.5
955 Property docstrings became writeable.
956
Christian Heimes2380ac72008-01-09 00:17:24 +0000957Default values can be implemented by using :meth:`_replace` to
Christian Heimesfe337bf2008-03-23 21:54:12 +0000958customize a prototype instance:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000959
960 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Christian Heimes587c2bf2008-01-19 16:21:02 +0000961 >>> default_account = Account('<owner name>', 0.0, 0)
962 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerb2d09452011-03-22 22:36:21 -0700963 >>> janes_account = default_account._replace(owner='Jane')
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000964
Georg Brandl8ed75cd2014-10-31 10:25:48 +0100965
966.. seealso::
967
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700968 * `Recipe for named tuple abstract base class with a metaclass mix-in
Serhiy Storchaka6dff0202016-05-07 10:49:07 +0300969 <https://code.activestate.com/recipes/577629-namedtupleabc-abstract-base-class-mix-in-for-named/>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700970 by Jan Kaliszewski. Besides providing an :term:`abstract base class` for
971 named tuples, it also supports an alternate :term:`metaclass`-based
972 constructor that is convenient for use cases where named tuples are being
973 subclassed.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000974
Raymond Hettingerfd27f622016-08-16 13:13:17 -0700975 * See :meth:`types.SimpleNamespace` for a mutable namespace based on an
976 underlying dictionary instead of a tuple.
977
978 * See :meth:`typing.NamedTuple` for a way to add type hints for named tuples.
Raymond Hettinger2a75e8f2015-08-16 08:32:01 -0700979
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000980
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000981:class:`OrderedDict` objects
982----------------------------
983
984Ordered dictionaries are just like regular dictionaries but they remember the
985order that items were inserted. When iterating over an ordered dictionary,
986the items are returned in the order their keys were first added.
987
988.. class:: OrderedDict([items])
989
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700990 Return an instance of a dict subclass, supporting the usual :class:`dict`
991 methods. An *OrderedDict* is a dict that remembers the order that keys
992 were first inserted. If a new entry overwrites an existing entry, the
993 original insertion position is left unchanged. Deleting an entry and
994 reinserting it will move it to the end.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000995
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700996 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000997
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700998 .. method:: popitem(last=True)
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000999
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001000 The :meth:`popitem` method for ordered dictionaries returns and removes a
Serhiy Storchaka4ecfa452016-05-16 09:31:54 +03001001 (key, value) pair. The pairs are returned in
1002 :abbr:`LIFO (last-in, first-out)` order if *last* is true
1003 or :abbr:`FIFO (first-in, first-out)` order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001004
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001005 .. method:: move_to_end(key, last=True)
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001006
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001007 Move an existing *key* to either end of an ordered dictionary. The item
1008 is moved to the right end if *last* is true (the default) or to the
1009 beginning if *last* is false. Raises :exc:`KeyError` if the *key* does
1010 not exist::
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001011
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001012 >>> d = OrderedDict.fromkeys('abcde')
1013 >>> d.move_to_end('b')
1014 >>> ''.join(d.keys())
1015 'acdeb'
1016 >>> d.move_to_end('b', last=False)
1017 >>> ''.join(d.keys())
1018 'bacde'
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001019
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001020 .. versionadded:: 3.2
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001021
Raymond Hettingere9091502009-05-19 17:40:07 +00001022In addition to the usual mapping methods, ordered dictionaries also support
1023reverse iteration using :func:`reversed`.
1024
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001025Equality tests between :class:`OrderedDict` objects are order-sensitive
1026and are implemented as ``list(od1.items())==list(od2.items())``.
1027Equality tests between :class:`OrderedDict` objects and other
Serhiy Storchakabfdcd432013-10-13 23:09:14 +03001028:class:`~collections.abc.Mapping` objects are order-insensitive like regular
1029dictionaries. This allows :class:`OrderedDict` objects to be substituted
1030anywhere a regular dictionary is used.
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001031
Raymond Hettinger36180782009-04-09 22:34:23 +00001032The :class:`OrderedDict` constructor and :meth:`update` method both accept
1033keyword arguments, but their order is lost because Python's function call
Martin Panter9955a372015-10-07 10:26:23 +00001034semantics pass in keyword arguments using a regular unordered dictionary.
Raymond Hettinger36180782009-04-09 22:34:23 +00001035
Serhiy Storchaka578c9212014-04-04 15:19:36 +03001036.. versionchanged:: 3.5
Martin Panter397625e2015-10-07 10:03:20 +00001037 The items, keys, and values :term:`views <dictionary view>`
1038 of :class:`OrderedDict` now support reverse iteration using :func:`reversed`.
Raymond Hettingerdc879f02009-03-19 20:30:56 +00001039
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001040:class:`OrderedDict` Examples and Recipes
1041^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1042
Raymond Hettinger0e312012009-11-10 18:35:46 +00001043Since an ordered dictionary remembers its insertion order, it can be used
Donald Stufft8b852f12014-05-20 12:58:38 -04001044in conjunction with sorting to make a sorted dictionary::
Raymond Hettinger0e312012009-11-10 18:35:46 +00001045
1046 >>> # regular unsorted dictionary
Serhiy Storchakadba90392016-05-10 12:01:23 +03001047 >>> d = {'banana': 3, 'apple': 4, 'pear': 1, 'orange': 2}
Raymond Hettinger0e312012009-11-10 18:35:46 +00001048
1049 >>> # dictionary sorted by key
1050 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
1051 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
1052
1053 >>> # dictionary sorted by value
1054 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
1055 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
1056
1057 >>> # dictionary sorted by length of the key string
1058 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
1059 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
1060
1061The new sorted dictionaries maintain their sort order when entries
1062are deleted. But when new keys are added, the keys are appended
1063to the end and the sort is not maintained.
1064
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001065It is also straight-forward to create an ordered dictionary variant
Andrew Svetlovff63e7a2012-08-31 13:54:54 +03001066that remembers the order the keys were *last* inserted.
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001067If a new entry overwrites an existing entry, the
1068original insertion position is changed and moved to the end::
1069
1070 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandl77570e22010-12-18 16:21:58 +00001071 'Store items in the order the keys were last added'
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001072
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001073 def __setitem__(self, key, value):
1074 if key in self:
1075 del self[key]
1076 OrderedDict.__setitem__(self, key, value)
1077
Éric Araujo889a7dc2011-08-19 00:40:46 +02001078An ordered dictionary can be combined with the :class:`Counter` class
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001079so that the counter remembers the order elements are first encountered::
1080
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001081 class OrderedCounter(Counter, OrderedDict):
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001082 'Counter that remembers the order elements are first encountered'
1083
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001084 def __repr__(self):
1085 return '%s(%r)' % (self.__class__.__name__, OrderedDict(self))
1086
1087 def __reduce__(self):
1088 return self.__class__, (OrderedDict(self),)
1089
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001090
1091:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +00001092-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001093
Georg Brandl48310cd2009-01-03 21:18:54 +00001094The class, :class:`UserDict` acts as a wrapper around dictionary objects.
1095The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001096subclass directly from :class:`dict`; however, this class can be easier
1097to work with because the underlying dictionary is accessible as an
1098attribute.
1099
1100.. class:: UserDict([initialdata])
1101
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001102 Class that simulates a dictionary. The instance's contents are kept in a
1103 regular dictionary, which is accessible via the :attr:`data` attribute of
1104 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
1105 initialized with its contents; note that a reference to *initialdata* will not
1106 be kept, allowing it be used for other purposes.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001107
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001108 In addition to supporting the methods and operations of mappings,
1109 :class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001110
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001111 .. attribute:: data
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001112
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001113 A real dictionary used to store the contents of the :class:`UserDict`
1114 class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001115
1116
1117
1118:class:`UserList` objects
1119-------------------------
1120
1121This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +00001122for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001123existing methods or add new ones. In this way, one can add new behaviors to
1124lists.
1125
Georg Brandl48310cd2009-01-03 21:18:54 +00001126The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001127subclass directly from :class:`list`; however, this class can be easier
1128to work with because the underlying list is accessible as an attribute.
1129
1130.. class:: UserList([list])
1131
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001132 Class that simulates a list. The instance's contents are kept in a regular
1133 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
1134 instances. The instance's contents are initially set to a copy of *list*,
1135 defaulting to the empty list ``[]``. *list* can be any iterable, for
1136 example a real Python list or a :class:`UserList` object.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001137
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001138 In addition to supporting the methods and operations of mutable sequences,
1139 :class:`UserList` instances provide the following attribute:
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001140
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001141 .. attribute:: data
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001142
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001143 A real :class:`list` object used to store the contents of the
1144 :class:`UserList` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001145
Zachary Ware80602e02014-01-13 20:38:57 -06001146**Subclassing requirements:** Subclasses of :class:`UserList` are expected to
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001147offer a constructor which can be called with either no arguments or one
1148argument. List operations which return a new sequence attempt to create an
1149instance of the actual implementation class. To do so, it assumes that the
1150constructor can be called with a single parameter, which is a sequence object
1151used as a data source.
1152
1153If a derived class does not wish to comply with this requirement, all of the
1154special methods supported by this class will need to be overridden; please
1155consult the sources for information about the methods which need to be provided
1156in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001157
1158:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +00001159---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001160
Georg Brandl48310cd2009-01-03 21:18:54 +00001161The class, :class:`UserString` acts as a wrapper around string objects.
1162The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001163subclass directly from :class:`str`; however, this class can be easier
1164to work with because the underlying string is accessible as an
1165attribute.
1166
1167.. class:: UserString([sequence])
1168
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001169 Class that simulates a string or a Unicode string object. The instance's
1170 content is kept in a regular string object, which is accessible via the
1171 :attr:`data` attribute of :class:`UserString` instances. The instance's
1172 contents are initially set to a copy of *sequence*. The *sequence* can
1173 be an instance of :class:`bytes`, :class:`str`, :class:`UserString` (or a
1174 subclass) or an arbitrary sequence which can be converted into a string using
1175 the built-in :func:`str` function.
Yury Selivanov336b37b2015-09-09 12:23:01 -04001176
1177 .. versionchanged:: 3.5
1178 New methods ``__getnewargs__``, ``__rmod__``, ``casefold``,
1179 ``format_map``, ``isprintable``, and ``maketrans``.