blob: 858a5665dc3c53a3c9d9242ecd7a6818a6c77269 [file] [log] [blame]
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001:mod:`collections` --- Container datatypes
2==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00003
4.. module:: collections
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07005 :synopsis: Container datatypes
Georg Brandl116aa622007-08-15 14:28:22 +00006.. moduleauthor:: Raymond Hettinger <python@rcn.com>
7.. sectionauthor:: Raymond Hettinger <python@rcn.com>
8
Christian Heimesfe337bf2008-03-23 21:54:12 +00009.. testsetup:: *
10
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070011 from collections import *
12 import itertools
13 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000014
Raymond Hettinger158c9c22011-02-22 00:41:50 +000015**Source code:** :source:`Lib/collections/__init__.py`
Raymond Hettinger10480942011-01-10 03:26:08 +000016
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000017--------------
18
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000019This module implements specialized container datatypes providing alternatives to
20Python's general purpose built-in containers, :class:`dict`, :class:`list`,
21:class:`set`, and :class:`tuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000022
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000023===================== ====================================================================
24:func:`namedtuple` factory function for creating tuple subclasses with named fields
25:class:`deque` list-like container with fast appends and pops on either end
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000026:class:`ChainMap` dict-like class for creating a single view of multiple mappings
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000027:class:`Counter` dict subclass for counting hashable objects
28:class:`OrderedDict` dict subclass that remembers the order entries were added
29:class:`defaultdict` dict subclass that calls a factory function to supply missing values
30:class:`UserDict` wrapper around dictionary objects for easier dict subclassing
31:class:`UserList` wrapper around list objects for easier list subclassing
32:class:`UserString` wrapper around string objects for easier string subclassing
33===================== ====================================================================
Georg Brandl116aa622007-08-15 14:28:22 +000034
Raymond Hettinger158c9c22011-02-22 00:41:50 +000035.. versionchanged:: 3.3
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070036 Moved :ref:`collections-abstract-base-classes` to the :mod:`collections.abc` module.
37 For backwards compatibility, they continue to be visible in this module
38 as well.
Mark Summerfield08898b42007-09-05 08:43:04 +000039
40
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000041:class:`ChainMap` objects
42-------------------------
43
Georg Brandl283b96b2012-04-03 09:16:46 +020044.. versionadded:: 3.3
45
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000046A :class:`ChainMap` class is provided for quickly linking a number of mappings
47so they can be treated as a single unit. It is often much faster than creating
48a new dictionary and running multiple :meth:`~dict.update` calls.
49
50The class can be used to simulate nested scopes and is useful in templating.
51
52.. class:: ChainMap(*maps)
53
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070054 A :class:`ChainMap` groups multiple dicts or other mappings together to
55 create a single, updateable view. If no *maps* are specified, a single empty
56 dictionary is provided so that a new chain always has at least one mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000057
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070058 The underlying mappings are stored in a list. That list is public and can
59 accessed or updated using the *maps* attribute. There is no other state.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000060
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070061 Lookups search the underlying mappings successively until a key is found. In
62 contrast, writes, updates, and deletions only operate on the first mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000063
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070064 A :class:`ChainMap` incorporates the underlying mappings by reference. So, if
65 one of the underlying mappings gets updated, those changes will be reflected
66 in :class:`ChainMap`.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000067
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070068 All of the usual dictionary methods are supported. In addition, there is a
69 *maps* attribute, a method for creating new subcontexts, and a property for
70 accessing all but the first mapping:
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000071
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070072 .. attribute:: maps
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000073
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070074 A user updateable list of mappings. The list is ordered from
75 first-searched to last-searched. It is the only stored state and can
76 be modified to change which mappings are searched. The list should
77 always contain at least one mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000078
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000079 .. method:: new_child(m=None)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000080
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000081 Returns a new :class:`ChainMap` containing a new map followed by
82 all of the maps in the current instance. If ``m`` is specified,
83 it becomes the new map at the front of the list of mappings; if not
84 specified, an empty dict is used, so that a call to ``d.new_child()``
85 is equivalent to: ``ChainMap({}, *d.maps)``. This method is used for
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070086 creating subcontexts that can be updated without altering values in any
87 of the parent mappings.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000088
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000089 .. versionchanged:: 3.4
90 The optional ``m`` parameter was added.
91
Raymond Hettinger2a61c452012-07-15 22:37:20 -070092 .. attribute:: parents
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000093
Raymond Hettingerb22ba042012-07-16 02:07:41 -070094 Property returning a new :class:`ChainMap` containing all of the maps in
Raymond Hettinger2a61c452012-07-15 22:37:20 -070095 the current instance except the first one. This is useful for skipping
96 the first map in the search. Use cases are similar to those for the
97 :keyword:`nonlocal` keyword used in :term:`nested scopes <nested
98 scope>`. The use cases also parallel those for the built-in
99 :func:`super` function. A reference to ``d.parents`` is equivalent to:
100 ``ChainMap(*d.maps[1:])``.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000101
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700102
103.. seealso::
104
105 * The `MultiContext class
Sandro Tosiea475302012-08-12 10:37:23 +0200106 <https://github.com/enthought/codetools/blob/4.0.0/codetools/contexts/multi_context.py>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700107 in the Enthought `CodeTools package
108 <https://github.com/enthought/codetools>`_ has options to support
109 writing to any mapping in the chain.
110
111 * Django's `Context class
Georg Brandl525d3552014-10-29 10:26:56 +0100112 <https://github.com/django/django/blob/master/django/template/context.py>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700113 for templating is a read-only chain of mappings. It also features
114 pushing and popping of contexts similar to the
115 :meth:`~collections.ChainMap.new_child` method and the
116 :meth:`~collections.ChainMap.parents` property.
117
118 * The `Nested Contexts recipe
119 <http://code.activestate.com/recipes/577434/>`_ has options to control
120 whether writes and other mutations apply only to the first mapping or to
121 any mapping in the chain.
122
123 * A `greatly simplified read-only version of Chainmap
124 <http://code.activestate.com/recipes/305268/>`_.
125
126
127:class:`ChainMap` Examples and Recipes
128^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
129
130This section shows various approaches to working with chained maps.
131
132
133Example of simulating Python's internal lookup chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000134
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700135 import builtins
136 pylookup = ChainMap(locals(), globals(), vars(builtins))
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000137
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700138Example of letting user specified command-line arguments take precedence over
139environment variables which in turn take precedence over default values::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000140
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700141 import os, argparse
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700142
143 defaults = {'color': 'red', 'user': 'guest'}
144
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700145 parser = argparse.ArgumentParser()
146 parser.add_argument('-u', '--user')
147 parser.add_argument('-c', '--color')
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700148 namespace = parser.parse_args()
149 command_line_args = {k:v for k, v in vars(namespace).items() if v}
150
151 combined = ChainMap(command_line_args, os.environ, defaults)
152 print(combined['color'])
153 print(combined['user'])
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000154
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700155Example patterns for using the :class:`ChainMap` class to simulate nested
156contexts::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000157
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700158 c = ChainMap() # Create root context
159 d = c.new_child() # Create nested child context
160 e = c.new_child() # Child of c, independent from d
161 e.maps[0] # Current context dictionary -- like Python's locals()
162 e.maps[-1] # Root context -- like Python's globals()
163 e.parents # Enclosing context chain -- like Python's nonlocals
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000164
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700165 d['x'] # Get first key in the chain of contexts
166 d['x'] = 1 # Set value in current context
Andrew Svetlov1a8db9c2012-10-04 19:29:25 +0300167 del d['x'] # Delete from current context
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700168 list(d) # All nested values
169 k in d # Check all nested values
170 len(d) # Number of nested values
171 d.items() # All nested items
172 dict(d) # Flatten into a regular dictionary
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000173
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700174The :class:`ChainMap` class only makes updates (writes and deletions) to the
175first mapping in the chain while lookups will search the full chain. However,
176if deep writes and deletions are desired, it is easy to make a subclass that
177updates keys found deeper in the chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000178
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700179 class DeepChainMap(ChainMap):
180 'Variant of ChainMap that allows direct updates to inner scopes'
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000181
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700182 def __setitem__(self, key, value):
183 for mapping in self.maps:
184 if key in mapping:
185 mapping[key] = value
186 return
187 self.maps[0][key] = value
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000188
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700189 def __delitem__(self, key):
190 for mapping in self.maps:
191 if key in mapping:
192 del mapping[key]
193 return
194 raise KeyError(key)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000195
Serhiy Storchakaf47036c2013-12-24 11:04:36 +0200196 >>> d = DeepChainMap({'zebra': 'black'}, {'elephant': 'blue'}, {'lion': 'yellow'})
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700197 >>> d['lion'] = 'orange' # update an existing key two levels down
198 >>> d['snake'] = 'red' # new keys get added to the topmost dict
199 >>> del d['elephant'] # remove an existing key one level down
200 DeepChainMap({'zebra': 'black', 'snake': 'red'}, {}, {'lion': 'orange'})
Georg Brandl4dcf4742012-03-08 20:35:08 +0100201
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000202
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000203:class:`Counter` objects
204------------------------
205
206A counter tool is provided to support convenient and rapid tallies.
207For example::
208
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000209 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000210 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000211 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000212 ... cnt[word] += 1
213 >>> cnt
214 Counter({'blue': 3, 'red': 2, 'green': 1})
215
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000216 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000217 >>> import re
Raymond Hettingerfaaba592013-03-01 03:30:20 -0800218 >>> words = re.findall(r'\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000219 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000220 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
221 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
222
223.. class:: Counter([iterable-or-mapping])
224
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700225 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
226 It is an unordered collection where elements are stored as dictionary keys
227 and their counts are stored as dictionary values. Counts are allowed to be
228 any integer value including zero or negative counts. The :class:`Counter`
229 class is similar to bags or multisets in other languages.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000230
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700231 Elements are counted from an *iterable* or initialized from another
232 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000233
Raymond Hettinger73662a52009-01-27 02:38:22 +0000234 >>> c = Counter() # a new, empty counter
235 >>> c = Counter('gallahad') # a new counter from an iterable
236 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
237 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000238
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700239 Counter objects have a dictionary interface except that they return a zero
240 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000241
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000242 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000243 >>> c['bacon'] # count of a missing element is zero
244 0
245
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700246 Setting a count to zero does not remove an element from a counter.
247 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000248
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000249 >>> c['sausage'] = 0 # counter entry with a zero count
250 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000251
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700252 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000253
254
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700255 Counter objects support three methods beyond those available for all
256 dictionaries:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000257
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700258 .. method:: elements()
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000259
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700260 Return an iterator over elements repeating each as many times as its
261 count. Elements are returned in arbitrary order. If an element's count
262 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000263
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000264 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000265 >>> list(c.elements())
266 ['a', 'a', 'a', 'a', 'b', 'b']
267
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700268 .. method:: most_common([n])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000269
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700270 Return a list of the *n* most common elements and their counts from the
Raymond Hettingera3780252015-05-13 02:47:57 -0700271 most common to the least. If *n* is omitted or ``None``,
Raymond Hettinger3afdb282015-05-13 14:39:04 -0700272 :func:`most_common` returns *all* elements in the counter.
273 Elements with equal counts are ordered arbitrarily:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000274
275 >>> Counter('abracadabra').most_common(3)
276 [('a', 5), ('r', 2), ('b', 2)]
277
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700278 .. method:: subtract([iterable-or-mapping])
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000279
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700280 Elements are subtracted from an *iterable* or from another *mapping*
281 (or counter). Like :meth:`dict.update` but subtracts counts instead
282 of replacing them. Both inputs and outputs may be zero or negative.
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000283
284 >>> c = Counter(a=4, b=2, c=0, d=-2)
285 >>> d = Counter(a=1, b=2, c=3, d=4)
286 >>> c.subtract(d)
Andrew Svetlovf6351722012-12-17 14:01:16 +0200287 >>> c
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000288 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
289
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700290 .. versionadded:: 3.2
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000291
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700292 The usual dictionary methods are available for :class:`Counter` objects
293 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000294
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700295 .. method:: fromkeys(iterable)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000296
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700297 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000298
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700299 .. method:: update([iterable-or-mapping])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000300
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700301 Elements are counted from an *iterable* or added-in from another
302 *mapping* (or counter). Like :meth:`dict.update` but adds counts
303 instead of replacing them. Also, the *iterable* is expected to be a
304 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000305
306Common patterns for working with :class:`Counter` objects::
307
Raymond Hettinger73662a52009-01-27 02:38:22 +0000308 sum(c.values()) # total of all counts
309 c.clear() # reset all counts
310 list(c) # list unique elements
311 set(c) # convert to a set
312 dict(c) # convert to a regular dictionary
313 c.items() # convert to a list of (elem, cnt) pairs
314 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
Georg Brandl87f3d7b2013-10-06 12:36:39 +0200315 c.most_common()[:-n-1:-1] # n least common elements
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700316 +c # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000317
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000318Several mathematical operations are provided for combining :class:`Counter`
319objects to produce multisets (counters that have counts greater than zero).
320Addition and subtraction combine counters by adding or subtracting the counts
321of corresponding elements. Intersection and union return the minimum and
322maximum of corresponding counts. Each operation can accept inputs with signed
323counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000324
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000325 >>> c = Counter(a=3, b=1)
326 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000327 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000328 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000329 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000330 Counter({'a': 2})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000331 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000332 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000333 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000334 Counter({'a': 3, 'b': 2})
335
Berker Peksag315e1042015-05-19 01:36:55 +0300336Unary addition and subtraction are shortcuts for adding an empty counter
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700337or subtracting from an empty counter.
338
339 >>> c = Counter(a=2, b=-4)
340 >>> +c
341 Counter({'a': 2})
342 >>> -c
343 Counter({'b': 4})
344
345.. versionadded:: 3.3
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700346 Added support for unary plus, unary minus, and in-place multiset operations.
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700347
Raymond Hettinger22f18852010-04-12 21:45:14 +0000348.. note::
349
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700350 Counters were primarily designed to work with positive integers to represent
351 running counts; however, care was taken to not unnecessarily preclude use
352 cases needing other types or negative values. To help with those use cases,
353 this section documents the minimum range and type restrictions.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000354
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700355 * The :class:`Counter` class itself is a dictionary subclass with no
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200356 restrictions on its keys and values. The values are intended to be numbers
357 representing counts, but you *could* store anything in the value field.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000358
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700359 * The :meth:`most_common` method requires only that the values be orderable.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000360
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700361 * For in-place operations such as ``c[key] += 1``, the value type need only
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200362 support addition and subtraction. So fractions, floats, and decimals would
363 work and negative values are supported. The same is also true for
364 :meth:`update` and :meth:`subtract` which allow negative and zero values
365 for both inputs and outputs.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000366
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700367 * The multiset methods are designed only for use cases with positive values.
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200368 The inputs may be negative or zero, but only outputs with positive values
369 are created. There are no type restrictions, but the value type needs to
370 support addition, subtraction, and comparison.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000371
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700372 * The :meth:`elements` method requires integer counts. It ignores zero and
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200373 negative counts.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000374
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000375.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000376
Georg Brandl5d941342016-02-26 19:37:12 +0100377 * `Bag class <https://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700378 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000379
Georg Brandl5d941342016-02-26 19:37:12 +0100380 * Wikipedia entry for `Multisets <https://en.wikipedia.org/wiki/Multiset>`_.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000381
Georg Brandl5d941342016-02-26 19:37:12 +0100382 * `C++ multisets <http://www.java2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700383 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000384
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000385 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700386 *Knuth, Donald. The Art of Computer Programming Volume II,
387 Section 4.6.3, Exercise 19*.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000388
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000389 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettinger855482e2015-05-23 08:57:58 -0700390 elements, see :func:`itertools.combinations_with_replacement`:
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000391
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700392 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000393
394
Georg Brandl116aa622007-08-15 14:28:22 +0000395:class:`deque` objects
396----------------------
397
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000398.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000399
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700400 Returns a new deque object initialized left-to-right (using :meth:`append`) with
401 data from *iterable*. If *iterable* is not specified, the new deque is empty.
Georg Brandl116aa622007-08-15 14:28:22 +0000402
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700403 Deques are a generalization of stacks and queues (the name is pronounced "deck"
404 and is short for "double-ended queue"). Deques support thread-safe, memory
405 efficient appends and pops from either side of the deque with approximately the
406 same O(1) performance in either direction.
Georg Brandl116aa622007-08-15 14:28:22 +0000407
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700408 Though :class:`list` objects support similar operations, they are optimized for
409 fast fixed-length operations and incur O(n) memory movement costs for
410 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
411 position of the underlying data representation.
Georg Brandl116aa622007-08-15 14:28:22 +0000412
Georg Brandl116aa622007-08-15 14:28:22 +0000413
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700414 If *maxlen* is not specified or is *None*, deques may grow to an
415 arbitrary length. Otherwise, the deque is bounded to the specified maximum
416 length. Once a bounded length deque is full, when new items are added, a
417 corresponding number of items are discarded from the opposite end. Bounded
418 length deques provide functionality similar to the ``tail`` filter in
419 Unix. They are also useful for tracking transactions and other pools of data
420 where only the most recent activity is of interest.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000421
Georg Brandl9afde1c2007-11-01 20:32:30 +0000422
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700423 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000424
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700425 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000426
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700427 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000428
429
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700430 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000431
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700432 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000433
434
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700435 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000436
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700437 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000438
439
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700440 .. method:: copy()
441
442 Create a shallow copy of the deque.
443
444 .. versionadded:: 3.5
445
446
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700447 .. method:: count(x)
Raymond Hettinger44459de2010-04-03 23:20:46 +0000448
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700449 Count the number of deque elements equal to *x*.
Raymond Hettinger44459de2010-04-03 23:20:46 +0000450
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700451 .. versionadded:: 3.2
Raymond Hettinger44459de2010-04-03 23:20:46 +0000452
Georg Brandl67b21b72010-08-17 15:07:14 +0000453
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700454 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000455
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700456 Extend the right side of the deque by appending elements from the iterable
457 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000458
459
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700460 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000461
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700462 Extend the left side of the deque by appending elements from *iterable*.
463 Note, the series of left appends results in reversing the order of
464 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000465
466
Raymond Hettinger855482e2015-05-23 08:57:58 -0700467 .. method:: index(x[, start[, stop]])
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700468
Raymond Hettinger855482e2015-05-23 08:57:58 -0700469 Return the position of *x* in the deque (at or after index *start*
470 and before index *stop*). Returns the first match or raises
471 :exc:`ValueError` if not found.
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700472
473 .. versionadded:: 3.5
474
475
476 .. method:: insert(i, x)
477
478 Insert *x* into the deque at position *i*.
479
Raymond Hettingerb00da572016-02-01 21:19:22 -0800480 If the insertion would cause a bounded deque to grow beyond *maxlen*,
481 an :exc:`IndexError` is raised.
Raymond Hettinger37434322016-01-26 21:44:16 -0800482
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700483 .. versionadded:: 3.5
484
485
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700486 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000487
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700488 Remove and return an element from the right side of the deque. If no
489 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000490
491
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700492 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000493
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700494 Remove and return an element from the left side of the deque. If no
495 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000496
497
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700498 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000499
Raymond Hettinger855482e2015-05-23 08:57:58 -0700500 Remove the first occurrence of *value*. If not found, raises a
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700501 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000502
Georg Brandl67b21b72010-08-17 15:07:14 +0000503
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700504 .. method:: reverse()
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000505
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700506 Reverse the elements of the deque in-place and then return ``None``.
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000507
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700508 .. versionadded:: 3.2
Georg Brandl116aa622007-08-15 14:28:22 +0000509
Georg Brandl67b21b72010-08-17 15:07:14 +0000510
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700511 .. method:: rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000512
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700513 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
514 the left. Rotating one step to the right is equivalent to:
515 ``d.appendleft(d.pop())``.
Benjamin Petersone41251e2008-04-25 01:59:09 +0000516
Georg Brandl116aa622007-08-15 14:28:22 +0000517
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700518 Deque objects also provide one read-only attribute:
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000519
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700520 .. attribute:: maxlen
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000521
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700522 Maximum size of a deque or *None* if unbounded.
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000523
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700524 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000525
526
Georg Brandl116aa622007-08-15 14:28:22 +0000527In addition to the above, deques support iteration, pickling, ``len(d)``,
528``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson206e3072008-10-19 14:07:49 +0000529the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
530access is O(1) at both ends but slows to O(n) in the middle. For fast random
531access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000532
Raymond Hettinger41290a62015-03-31 08:12:23 -0700533Starting in version 3.5, deques support ``__add__()``, ``__mul__()``,
534and ``__imul__()``.
535
Christian Heimesfe337bf2008-03-23 21:54:12 +0000536Example:
537
538.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000539
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700540 >>> from collections import deque
541 >>> d = deque('ghi') # make a new deque with three items
542 >>> for elem in d: # iterate over the deque's elements
543 ... print(elem.upper())
544 G
545 H
546 I
Georg Brandl116aa622007-08-15 14:28:22 +0000547
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700548 >>> d.append('j') # add a new entry to the right side
549 >>> d.appendleft('f') # add a new entry to the left side
550 >>> d # show the representation of the deque
551 deque(['f', 'g', 'h', 'i', 'j'])
Georg Brandl116aa622007-08-15 14:28:22 +0000552
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700553 >>> d.pop() # return and remove the rightmost item
554 'j'
555 >>> d.popleft() # return and remove the leftmost item
556 'f'
557 >>> list(d) # list the contents of the deque
558 ['g', 'h', 'i']
559 >>> d[0] # peek at leftmost item
560 'g'
561 >>> d[-1] # peek at rightmost item
562 'i'
Georg Brandl116aa622007-08-15 14:28:22 +0000563
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700564 >>> list(reversed(d)) # list the contents of a deque in reverse
565 ['i', 'h', 'g']
566 >>> 'h' in d # search the deque
567 True
568 >>> d.extend('jkl') # add multiple elements at once
569 >>> d
570 deque(['g', 'h', 'i', 'j', 'k', 'l'])
571 >>> d.rotate(1) # right rotation
572 >>> d
573 deque(['l', 'g', 'h', 'i', 'j', 'k'])
574 >>> d.rotate(-1) # left rotation
575 >>> d
576 deque(['g', 'h', 'i', 'j', 'k', 'l'])
Georg Brandl116aa622007-08-15 14:28:22 +0000577
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700578 >>> deque(reversed(d)) # make a new deque in reverse order
579 deque(['l', 'k', 'j', 'i', 'h', 'g'])
580 >>> d.clear() # empty the deque
581 >>> d.pop() # cannot pop from an empty deque
582 Traceback (most recent call last):
583 File "<pyshell#6>", line 1, in -toplevel-
584 d.pop()
585 IndexError: pop from an empty deque
Georg Brandl116aa622007-08-15 14:28:22 +0000586
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700587 >>> d.extendleft('abc') # extendleft() reverses the input order
588 >>> d
589 deque(['c', 'b', 'a'])
Georg Brandl116aa622007-08-15 14:28:22 +0000590
591
Georg Brandl9afde1c2007-11-01 20:32:30 +0000592:class:`deque` Recipes
593^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000594
595This section shows various approaches to working with deques.
596
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000597Bounded length deques provide functionality similar to the ``tail`` filter
598in Unix::
599
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700600 def tail(filename, n=10):
601 'Return the last n lines of a file'
602 with open(filename) as f:
603 return deque(f, n)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000604
605Another approach to using deques is to maintain a sequence of recently
606added elements by appending to the right and popping to the left::
607
608 def moving_average(iterable, n=3):
609 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
610 # http://en.wikipedia.org/wiki/Moving_average
611 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000612 d = deque(itertools.islice(it, n-1))
613 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000614 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000615 for elem in it:
616 s += elem - d.popleft()
617 d.append(elem)
618 yield s / n
619
Georg Brandl116aa622007-08-15 14:28:22 +0000620The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti0639d5a2009-12-19 23:26:38 +0000621deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl116aa622007-08-15 14:28:22 +0000622the :meth:`rotate` method to position elements to be popped::
623
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700624 def delete_nth(d, n):
625 d.rotate(-n)
626 d.popleft()
627 d.rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000628
629To implement :class:`deque` slicing, use a similar approach applying
630:meth:`rotate` to bring a target element to the left side of the deque. Remove
631old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
632reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000633With minor variations on that approach, it is easy to implement Forth style
634stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
635``rot``, and ``roll``.
636
Georg Brandl116aa622007-08-15 14:28:22 +0000637
638:class:`defaultdict` objects
639----------------------------
640
Georg Brandl116aa622007-08-15 14:28:22 +0000641.. class:: defaultdict([default_factory[, ...]])
642
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700643 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
644 built-in :class:`dict` class. It overrides one method and adds one writable
645 instance variable. The remaining functionality is the same as for the
646 :class:`dict` class and is not documented here.
Georg Brandl116aa622007-08-15 14:28:22 +0000647
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700648 The first argument provides the initial value for the :attr:`default_factory`
649 attribute; it defaults to ``None``. All remaining arguments are treated the same
650 as if they were passed to the :class:`dict` constructor, including keyword
651 arguments.
Georg Brandl116aa622007-08-15 14:28:22 +0000652
Georg Brandl116aa622007-08-15 14:28:22 +0000653
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700654 :class:`defaultdict` objects support the following method in addition to the
655 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000656
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700657 .. method:: __missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000658
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700659 If the :attr:`default_factory` attribute is ``None``, this raises a
660 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000661
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700662 If :attr:`default_factory` is not ``None``, it is called without arguments
663 to provide a default value for the given *key*, this value is inserted in
664 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000665
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700666 If calling :attr:`default_factory` raises an exception this exception is
667 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000668
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700669 This method is called by the :meth:`__getitem__` method of the
670 :class:`dict` class when the requested key is not found; whatever it
671 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000672
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700673 Note that :meth:`__missing__` is *not* called for any operations besides
674 :meth:`__getitem__`. This means that :meth:`get` will, like normal
675 dictionaries, return ``None`` as a default rather than using
676 :attr:`default_factory`.
Benjamin Peterson871b9d12012-01-27 09:14:01 -0500677
Georg Brandl116aa622007-08-15 14:28:22 +0000678
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700679 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000680
Benjamin Petersone41251e2008-04-25 01:59:09 +0000681
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700682 .. attribute:: default_factory
Benjamin Petersone41251e2008-04-25 01:59:09 +0000683
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700684 This attribute is used by the :meth:`__missing__` method; it is
685 initialized from the first argument to the constructor, if present, or to
686 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000687
688
Georg Brandl116aa622007-08-15 14:28:22 +0000689:class:`defaultdict` Examples
690^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
691
692Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000693sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000694
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700695 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
696 >>> d = defaultdict(list)
697 >>> for k, v in s:
698 ... d[k].append(v)
699 ...
700 >>> list(d.items())
701 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
Georg Brandl116aa622007-08-15 14:28:22 +0000702
703When each key is encountered for the first time, it is not already in the
704mapping; so an entry is automatically created using the :attr:`default_factory`
705function which returns an empty :class:`list`. The :meth:`list.append`
706operation then attaches the value to the new list. When keys are encountered
707again, the look-up proceeds normally (returning the list for that key) and the
708:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000709simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000710
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700711 >>> d = {}
712 >>> for k, v in s:
713 ... d.setdefault(k, []).append(v)
714 ...
715 >>> list(d.items())
716 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
Georg Brandl116aa622007-08-15 14:28:22 +0000717
718Setting the :attr:`default_factory` to :class:`int` makes the
719:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000720languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000721
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700722 >>> s = 'mississippi'
723 >>> d = defaultdict(int)
724 >>> for k in s:
725 ... d[k] += 1
726 ...
727 >>> list(d.items())
728 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
Georg Brandl116aa622007-08-15 14:28:22 +0000729
730When a letter is first encountered, it is missing from the mapping, so the
731:attr:`default_factory` function calls :func:`int` to supply a default count of
732zero. The increment operation then builds up the count for each letter.
733
734The function :func:`int` which always returns zero is just a special case of
735constant functions. A faster and more flexible way to create constant functions
736is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000737zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000738
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700739 >>> def constant_factory(value):
740 ... return lambda: value
741 >>> d = defaultdict(constant_factory('<missing>'))
742 >>> d.update(name='John', action='ran')
743 >>> '%(name)s %(action)s to %(object)s' % d
744 'John ran to <missing>'
Georg Brandl116aa622007-08-15 14:28:22 +0000745
746Setting the :attr:`default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000747:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000748
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700749 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
750 >>> d = defaultdict(set)
751 >>> for k, v in s:
752 ... d[k].add(v)
753 ...
754 >>> list(d.items())
755 [('blue', {2, 4}), ('red', {1, 3})]
Georg Brandl116aa622007-08-15 14:28:22 +0000756
757
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000758:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000759----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000760
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000761Named tuples assign meaning to each position in a tuple and allow for more readable,
762self-documenting code. They can be used wherever regular tuples are used, and
763they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000764
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000765.. function:: namedtuple(typename, field_names, verbose=False, rename=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000766
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700767 Returns a new tuple subclass named *typename*. The new subclass is used to
768 create tuple-like objects that have fields accessible by attribute lookup as
769 well as being indexable and iterable. Instances of the subclass also have a
770 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
771 method which lists the tuple contents in a ``name=value`` format.
Georg Brandl116aa622007-08-15 14:28:22 +0000772
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700773 The *field_names* are a single string with each fieldname separated by whitespace
774 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
775 can be a sequence of strings such as ``['x', 'y']``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000776
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700777 Any valid Python identifier may be used for a fieldname except for names
778 starting with an underscore. Valid identifiers consist of letters, digits,
779 and underscores but do not start with a digit or underscore and cannot be
780 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
781 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000782
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700783 If *rename* is true, invalid fieldnames are automatically replaced
784 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
785 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
786 ``def`` and the duplicate fieldname ``abc``.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000787
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700788 If *verbose* is true, the class definition is printed after it is
789 built. This option is outdated; instead, it is simpler to print the
790 :attr:`_source` attribute.
Georg Brandl116aa622007-08-15 14:28:22 +0000791
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700792 Named tuple instances do not have per-instance dictionaries, so they are
793 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000794
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700795 .. versionchanged:: 3.1
796 Added support for *rename*.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000797
Christian Heimesfe337bf2008-03-23 21:54:12 +0000798
799.. doctest::
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700800 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000801
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700802 >>> # Basic example
803 >>> Point = namedtuple('Point', ['x', 'y'])
804 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
805 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
806 33
807 >>> x, y = p # unpack like a regular tuple
808 >>> x, y
809 (11, 22)
810 >>> p.x + p.y # fields also accessible by name
811 33
812 >>> p # readable __repr__ with a name=value style
813 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000814
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000815Named tuples are especially useful for assigning field names to result tuples returned
816by the :mod:`csv` or :mod:`sqlite3` modules::
817
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700818 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000819
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700820 import csv
821 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
822 print(emp.name, emp.title)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000823
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700824 import sqlite3
825 conn = sqlite3.connect('/companydata')
826 cursor = conn.cursor()
827 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
828 for emp in map(EmployeeRecord._make, cursor.fetchall()):
829 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000830
Christian Heimes99170a52007-12-19 02:07:34 +0000831In addition to the methods inherited from tuples, named tuples support
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700832three additional methods and two attributes. To prevent conflicts with
Christian Heimes2380ac72008-01-09 00:17:24 +0000833field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000834
Benjamin Peterson0b9fb802010-07-18 14:23:36 +0000835.. classmethod:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000836
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700837 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000838
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700839 .. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000840
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700841 >>> t = [11, 22]
842 >>> Point._make(t)
843 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000844
Christian Heimes790c8232008-01-07 21:14:23 +0000845.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000846
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700847 Return a new :class:`OrderedDict` which maps field names to their corresponding
Raymond Hettinger7a3602e2015-08-30 09:13:48 -0700848 values::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000849
Raymond Hettinger7a3602e2015-08-30 09:13:48 -0700850 >>> p = Point(x=11, y=22)
851 >>> p._asdict()
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700852 OrderedDict([('x', 11), ('y', 22)])
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000853
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700854 .. versionchanged:: 3.1
855 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000856
Christian Heimes790c8232008-01-07 21:14:23 +0000857.. method:: somenamedtuple._replace(kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000858
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700859 Return a new instance of the named tuple replacing specified fields with new
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700860 values::
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000861
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700862 >>> p = Point(x=11, y=22)
863 >>> p._replace(x=33)
864 Point(x=33, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000865
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700866 >>> for partnum, record in inventory.items():
867 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000868
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700869.. attribute:: somenamedtuple._source
870
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700871 A string with the pure Python source code used to create the named
872 tuple class. The source makes the named tuple self-documenting.
Raymond Hettinger163e9822013-05-18 00:05:20 -0700873 It can be printed, executed using :func:`exec`, or saved to a file
874 and imported.
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700875
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700876 .. versionadded:: 3.3
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700877
Christian Heimes790c8232008-01-07 21:14:23 +0000878.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000879
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700880 Tuple of strings listing the field names. Useful for introspection
881 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000882
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700883 .. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000884
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700885 >>> p._fields # view the field names
886 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000887
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700888 >>> Color = namedtuple('Color', 'red green blue')
889 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
890 >>> Pixel(11, 22, 128, 255, 0)
891 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000892
Christian Heimes0449f632007-12-15 01:27:15 +0000893To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000894function:
Christian Heimes0449f632007-12-15 01:27:15 +0000895
896 >>> getattr(p, 'x')
897 11
898
Raymond Hettinger651453a2009-02-11 00:20:02 +0000899To convert a dictionary to a named tuple, use the double-star-operator
900(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000901
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700902 >>> d = {'x': 11, 'y': 22}
903 >>> Point(**d)
904 Point(x=11, y=22)
Christian Heimes99170a52007-12-19 02:07:34 +0000905
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000906Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000907functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000908a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000909
Christian Heimes043d6f62008-01-07 17:19:16 +0000910 >>> class Point(namedtuple('Point', 'x y')):
Raymond Hettingerd2f69742015-11-10 00:09:34 -0800911 __slots__ = ()
912 @property
913 def hypot(self):
914 return (self.x ** 2 + self.y ** 2) ** 0.5
915 def __str__(self):
916 return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000917
Georg Brandl0df79792008-10-04 18:33:26 +0000918 >>> for p in Point(3, 4), Point(14, 5/7):
Raymond Hettingerd2f69742015-11-10 00:09:34 -0800919 print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000920 Point: x= 3.000 y= 4.000 hypot= 5.000
921 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000922
Georg Brandlaf5c2382009-12-28 08:02:38 +0000923The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +0000924keep memory requirements low by preventing the creation of instance dictionaries.
925
Christian Heimes2380ac72008-01-09 00:17:24 +0000926Subclassing is not useful for adding new, stored fields. Instead, simply
Christian Heimesfe337bf2008-03-23 21:54:12 +0000927create a new named tuple type from the :attr:`_fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +0000928
Christian Heimes25bb7832008-01-11 16:17:00 +0000929 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +0000930
Raymond Hettingereac503a2015-05-13 01:09:59 -0700931Docstrings can be customized by making direct assignments to the ``__doc__``
932fields:
933
934 >>> Book = namedtuple('Book', ['id', 'title', 'authors'])
Raymond Hettinger850be0f2015-11-09 08:24:53 -0800935 >>> Book.__doc__ += ': Hardcover book in active collection'
Berker Peksagde7cafa2015-05-13 12:16:27 +0300936 >>> Book.id.__doc__ = '13-digit ISBN'
937 >>> Book.title.__doc__ = 'Title of first printing'
Raymond Hettinger850be0f2015-11-09 08:24:53 -0800938 >>> Book.authors.__doc__ = 'List of authors sorted by last name'
Raymond Hettingereac503a2015-05-13 01:09:59 -0700939
Raymond Hettinger6e701312015-11-23 22:18:55 -0800940.. versionchanged:: 3.5
941 Property docstrings became writeable.
942
Christian Heimes2380ac72008-01-09 00:17:24 +0000943Default values can be implemented by using :meth:`_replace` to
Christian Heimesfe337bf2008-03-23 21:54:12 +0000944customize a prototype instance:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000945
946 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Christian Heimes587c2bf2008-01-19 16:21:02 +0000947 >>> default_account = Account('<owner name>', 0.0, 0)
948 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerb2d09452011-03-22 22:36:21 -0700949 >>> janes_account = default_account._replace(owner='Jane')
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000950
Georg Brandl8ed75cd2014-10-31 10:25:48 +0100951
952.. seealso::
953
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700954 * `Recipe for named tuple abstract base class with a metaclass mix-in
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700955 <http://code.activestate.com/recipes/577629-namedtupleabc-abstract-base-class-mix-in-for-named/>`_
956 by Jan Kaliszewski. Besides providing an :term:`abstract base class` for
957 named tuples, it also supports an alternate :term:`metaclass`-based
958 constructor that is convenient for use cases where named tuples are being
959 subclassed.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000960
961
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000962:class:`OrderedDict` objects
963----------------------------
964
965Ordered dictionaries are just like regular dictionaries but they remember the
966order that items were inserted. When iterating over an ordered dictionary,
967the items are returned in the order their keys were first added.
968
969.. class:: OrderedDict([items])
970
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700971 Return an instance of a dict subclass, supporting the usual :class:`dict`
972 methods. An *OrderedDict* is a dict that remembers the order that keys
973 were first inserted. If a new entry overwrites an existing entry, the
974 original insertion position is left unchanged. Deleting an entry and
975 reinserting it will move it to the end.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000976
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700977 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000978
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700979 .. method:: popitem(last=True)
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000980
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700981 The :meth:`popitem` method for ordered dictionaries returns and removes a
982 (key, value) pair. The pairs are returned in LIFO order if *last* is true
983 or FIFO order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000984
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700985 .. method:: move_to_end(key, last=True)
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000986
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700987 Move an existing *key* to either end of an ordered dictionary. The item
988 is moved to the right end if *last* is true (the default) or to the
989 beginning if *last* is false. Raises :exc:`KeyError` if the *key* does
990 not exist::
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000991
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700992 >>> d = OrderedDict.fromkeys('abcde')
993 >>> d.move_to_end('b')
994 >>> ''.join(d.keys())
995 'acdeb'
996 >>> d.move_to_end('b', last=False)
997 >>> ''.join(d.keys())
998 'bacde'
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000999
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001000 .. versionadded:: 3.2
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001001
Raymond Hettingere9091502009-05-19 17:40:07 +00001002In addition to the usual mapping methods, ordered dictionaries also support
1003reverse iteration using :func:`reversed`.
1004
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001005Equality tests between :class:`OrderedDict` objects are order-sensitive
1006and are implemented as ``list(od1.items())==list(od2.items())``.
1007Equality tests between :class:`OrderedDict` objects and other
Serhiy Storchakabfdcd432013-10-13 23:09:14 +03001008:class:`~collections.abc.Mapping` objects are order-insensitive like regular
1009dictionaries. This allows :class:`OrderedDict` objects to be substituted
1010anywhere a regular dictionary is used.
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001011
Raymond Hettinger36180782009-04-09 22:34:23 +00001012The :class:`OrderedDict` constructor and :meth:`update` method both accept
1013keyword arguments, but their order is lost because Python's function call
Martin Panter9955a372015-10-07 10:26:23 +00001014semantics pass in keyword arguments using a regular unordered dictionary.
Raymond Hettinger36180782009-04-09 22:34:23 +00001015
Serhiy Storchaka578c9212014-04-04 15:19:36 +03001016.. versionchanged:: 3.5
Martin Panter397625e2015-10-07 10:03:20 +00001017 The items, keys, and values :term:`views <dictionary view>`
1018 of :class:`OrderedDict` now support reverse iteration using :func:`reversed`.
Raymond Hettingerdc879f02009-03-19 20:30:56 +00001019
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001020:class:`OrderedDict` Examples and Recipes
1021^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1022
Raymond Hettinger0e312012009-11-10 18:35:46 +00001023Since an ordered dictionary remembers its insertion order, it can be used
Donald Stufft8b852f12014-05-20 12:58:38 -04001024in conjunction with sorting to make a sorted dictionary::
Raymond Hettinger0e312012009-11-10 18:35:46 +00001025
1026 >>> # regular unsorted dictionary
1027 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
1028
1029 >>> # dictionary sorted by key
1030 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
1031 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
1032
1033 >>> # dictionary sorted by value
1034 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
1035 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
1036
1037 >>> # dictionary sorted by length of the key string
1038 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
1039 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
1040
1041The new sorted dictionaries maintain their sort order when entries
1042are deleted. But when new keys are added, the keys are appended
1043to the end and the sort is not maintained.
1044
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001045It is also straight-forward to create an ordered dictionary variant
Andrew Svetlovff63e7a2012-08-31 13:54:54 +03001046that remembers the order the keys were *last* inserted.
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001047If a new entry overwrites an existing entry, the
1048original insertion position is changed and moved to the end::
1049
1050 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandl77570e22010-12-18 16:21:58 +00001051 'Store items in the order the keys were last added'
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001052
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001053 def __setitem__(self, key, value):
1054 if key in self:
1055 del self[key]
1056 OrderedDict.__setitem__(self, key, value)
1057
Éric Araujo889a7dc2011-08-19 00:40:46 +02001058An ordered dictionary can be combined with the :class:`Counter` class
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001059so that the counter remembers the order elements are first encountered::
1060
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001061 class OrderedCounter(Counter, OrderedDict):
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001062 'Counter that remembers the order elements are first encountered'
1063
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001064 def __repr__(self):
1065 return '%s(%r)' % (self.__class__.__name__, OrderedDict(self))
1066
1067 def __reduce__(self):
1068 return self.__class__, (OrderedDict(self),)
1069
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001070
1071:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +00001072-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001073
Georg Brandl48310cd2009-01-03 21:18:54 +00001074The class, :class:`UserDict` acts as a wrapper around dictionary objects.
1075The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001076subclass directly from :class:`dict`; however, this class can be easier
1077to work with because the underlying dictionary is accessible as an
1078attribute.
1079
1080.. class:: UserDict([initialdata])
1081
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001082 Class that simulates a dictionary. The instance's contents are kept in a
1083 regular dictionary, which is accessible via the :attr:`data` attribute of
1084 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
1085 initialized with its contents; note that a reference to *initialdata* will not
1086 be kept, allowing it be used for other purposes.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001087
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001088 In addition to supporting the methods and operations of mappings,
1089 :class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001090
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001091 .. attribute:: data
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001092
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001093 A real dictionary used to store the contents of the :class:`UserDict`
1094 class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001095
1096
1097
1098:class:`UserList` objects
1099-------------------------
1100
1101This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +00001102for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001103existing methods or add new ones. In this way, one can add new behaviors to
1104lists.
1105
Georg Brandl48310cd2009-01-03 21:18:54 +00001106The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001107subclass directly from :class:`list`; however, this class can be easier
1108to work with because the underlying list is accessible as an attribute.
1109
1110.. class:: UserList([list])
1111
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001112 Class that simulates a list. The instance's contents are kept in a regular
1113 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
1114 instances. The instance's contents are initially set to a copy of *list*,
1115 defaulting to the empty list ``[]``. *list* can be any iterable, for
1116 example a real Python list or a :class:`UserList` object.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001117
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001118 In addition to supporting the methods and operations of mutable sequences,
1119 :class:`UserList` instances provide the following attribute:
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001120
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001121 .. attribute:: data
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001122
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001123 A real :class:`list` object used to store the contents of the
1124 :class:`UserList` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001125
Zachary Ware80602e02014-01-13 20:38:57 -06001126**Subclassing requirements:** Subclasses of :class:`UserList` are expected to
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001127offer a constructor which can be called with either no arguments or one
1128argument. List operations which return a new sequence attempt to create an
1129instance of the actual implementation class. To do so, it assumes that the
1130constructor can be called with a single parameter, which is a sequence object
1131used as a data source.
1132
1133If a derived class does not wish to comply with this requirement, all of the
1134special methods supported by this class will need to be overridden; please
1135consult the sources for information about the methods which need to be provided
1136in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001137
1138:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +00001139---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001140
Georg Brandl48310cd2009-01-03 21:18:54 +00001141The class, :class:`UserString` acts as a wrapper around string objects.
1142The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001143subclass directly from :class:`str`; however, this class can be easier
1144to work with because the underlying string is accessible as an
1145attribute.
1146
1147.. class:: UserString([sequence])
1148
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001149 Class that simulates a string or a Unicode string object. The instance's
1150 content is kept in a regular string object, which is accessible via the
1151 :attr:`data` attribute of :class:`UserString` instances. The instance's
1152 contents are initially set to a copy of *sequence*. The *sequence* can
1153 be an instance of :class:`bytes`, :class:`str`, :class:`UserString` (or a
1154 subclass) or an arbitrary sequence which can be converted into a string using
1155 the built-in :func:`str` function.
Yury Selivanov336b37b2015-09-09 12:23:01 -04001156
1157 .. versionchanged:: 3.5
1158 New methods ``__getnewargs__``, ``__rmod__``, ``casefold``,
1159 ``format_map``, ``isprintable``, and ``maketrans``.