blob: a5e8d04099b22f9acc0dc58d78466791dd9d6e37 [file] [log] [blame]
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001:mod:`collections` --- Container datatypes
2==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00003
4.. module:: collections
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07005 :synopsis: Container datatypes
Terry Jan Reedyfa089b92016-06-11 15:02:54 -04006
Georg Brandl116aa622007-08-15 14:28:22 +00007.. moduleauthor:: Raymond Hettinger <python@rcn.com>
8.. sectionauthor:: Raymond Hettinger <python@rcn.com>
9
Terry Jan Reedyfa089b92016-06-11 15:02:54 -040010**Source code:** :source:`Lib/collections/__init__.py`
11
Christian Heimesfe337bf2008-03-23 21:54:12 +000012.. testsetup:: *
13
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070014 from collections import *
15 import itertools
16 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000017
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000018--------------
19
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000020This module implements specialized container datatypes providing alternatives to
21Python's general purpose built-in containers, :class:`dict`, :class:`list`,
22:class:`set`, and :class:`tuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000023
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000024===================== ====================================================================
25:func:`namedtuple` factory function for creating tuple subclasses with named fields
26:class:`deque` list-like container with fast appends and pops on either end
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000027:class:`ChainMap` dict-like class for creating a single view of multiple mappings
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000028:class:`Counter` dict subclass for counting hashable objects
29:class:`OrderedDict` dict subclass that remembers the order entries were added
30:class:`defaultdict` dict subclass that calls a factory function to supply missing values
31:class:`UserDict` wrapper around dictionary objects for easier dict subclassing
32:class:`UserList` wrapper around list objects for easier list subclassing
33:class:`UserString` wrapper around string objects for easier string subclassing
34===================== ====================================================================
Georg Brandl116aa622007-08-15 14:28:22 +000035
Matthias Bussonniereea47e02019-05-30 14:45:48 -070036.. deprecated-removed:: 3.3 3.9
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070037 Moved :ref:`collections-abstract-base-classes` to the :mod:`collections.abc` module.
Raymond Hettingere6d34212018-01-29 08:27:49 -080038 For backwards compatibility, they continue to be visible in this module through
Matthias Bussonniereea47e02019-05-30 14:45:48 -070039 Python 3.8.
Mark Summerfield08898b42007-09-05 08:43:04 +000040
41
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000042:class:`ChainMap` objects
43-------------------------
44
Georg Brandl283b96b2012-04-03 09:16:46 +020045.. versionadded:: 3.3
46
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000047A :class:`ChainMap` class is provided for quickly linking a number of mappings
48so they can be treated as a single unit. It is often much faster than creating
49a new dictionary and running multiple :meth:`~dict.update` calls.
50
51The class can be used to simulate nested scopes and is useful in templating.
52
53.. class:: ChainMap(*maps)
54
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070055 A :class:`ChainMap` groups multiple dicts or other mappings together to
56 create a single, updateable view. If no *maps* are specified, a single empty
57 dictionary is provided so that a new chain always has at least one mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000058
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070059 The underlying mappings are stored in a list. That list is public and can
Martin Panter8d56c022016-05-29 04:13:35 +000060 be accessed or updated using the *maps* attribute. There is no other state.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000061
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070062 Lookups search the underlying mappings successively until a key is found. In
63 contrast, writes, updates, and deletions only operate on the first mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000064
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070065 A :class:`ChainMap` incorporates the underlying mappings by reference. So, if
66 one of the underlying mappings gets updated, those changes will be reflected
67 in :class:`ChainMap`.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000068
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070069 All of the usual dictionary methods are supported. In addition, there is a
70 *maps* attribute, a method for creating new subcontexts, and a property for
71 accessing all but the first mapping:
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000072
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070073 .. attribute:: maps
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000074
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070075 A user updateable list of mappings. The list is ordered from
76 first-searched to last-searched. It is the only stored state and can
77 be modified to change which mappings are searched. The list should
78 always contain at least one mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000079
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000080 .. method:: new_child(m=None)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000081
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000082 Returns a new :class:`ChainMap` containing a new map followed by
83 all of the maps in the current instance. If ``m`` is specified,
84 it becomes the new map at the front of the list of mappings; if not
85 specified, an empty dict is used, so that a call to ``d.new_child()``
86 is equivalent to: ``ChainMap({}, *d.maps)``. This method is used for
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070087 creating subcontexts that can be updated without altering values in any
88 of the parent mappings.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000089
Vinay Sajip1ba81ee2013-01-11 23:39:53 +000090 .. versionchanged:: 3.4
91 The optional ``m`` parameter was added.
92
Raymond Hettinger2a61c452012-07-15 22:37:20 -070093 .. attribute:: parents
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000094
Raymond Hettingerb22ba042012-07-16 02:07:41 -070095 Property returning a new :class:`ChainMap` containing all of the maps in
Raymond Hettinger2a61c452012-07-15 22:37:20 -070096 the current instance except the first one. This is useful for skipping
97 the first map in the search. Use cases are similar to those for the
98 :keyword:`nonlocal` keyword used in :term:`nested scopes <nested
99 scope>`. The use cases also parallel those for the built-in
100 :func:`super` function. A reference to ``d.parents`` is equivalent to:
101 ``ChainMap(*d.maps[1:])``.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000102
Raymond Hettinger86f093f2019-02-21 09:12:15 -0800103 Note, the iteration order of a :class:`ChainMap()` is determined by
104 scanning the mappings last to first::
105
106 >>> baseline = {'music': 'bach', 'art': 'rembrandt'}
107 >>> adjustments = {'art': 'van gogh', 'opera': 'carmen'}
108 >>> list(ChainMap(adjustments, baseline))
109 ['music', 'art', 'opera']
110
111 This gives the same ordering as a series of :meth:`dict.update` calls
112 starting with the last mapping::
113
114 >>> combined = baseline.copy()
115 >>> combined.update(adjustments)
116 >>> list(combined)
117 ['music', 'art', 'opera']
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700118
119.. seealso::
120
121 * The `MultiContext class
Sandro Tosiea475302012-08-12 10:37:23 +0200122 <https://github.com/enthought/codetools/blob/4.0.0/codetools/contexts/multi_context.py>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700123 in the Enthought `CodeTools package
124 <https://github.com/enthought/codetools>`_ has options to support
125 writing to any mapping in the chain.
126
127 * Django's `Context class
Georg Brandl525d3552014-10-29 10:26:56 +0100128 <https://github.com/django/django/blob/master/django/template/context.py>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700129 for templating is a read-only chain of mappings. It also features
130 pushing and popping of contexts similar to the
131 :meth:`~collections.ChainMap.new_child` method and the
Michael Seiferte1052942018-03-26 13:40:35 +0200132 :attr:`~collections.ChainMap.parents` property.
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700133
134 * The `Nested Contexts recipe
Serhiy Storchaka6dff0202016-05-07 10:49:07 +0300135 <https://code.activestate.com/recipes/577434/>`_ has options to control
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700136 whether writes and other mutations apply only to the first mapping or to
137 any mapping in the chain.
138
139 * A `greatly simplified read-only version of Chainmap
Serhiy Storchaka6dff0202016-05-07 10:49:07 +0300140 <https://code.activestate.com/recipes/305268/>`_.
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700141
142
143:class:`ChainMap` Examples and Recipes
144^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
145
146This section shows various approaches to working with chained maps.
147
148
149Example of simulating Python's internal lookup chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000150
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700151 import builtins
152 pylookup = ChainMap(locals(), globals(), vars(builtins))
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000153
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700154Example of letting user specified command-line arguments take precedence over
155environment variables which in turn take precedence over default values::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000156
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700157 import os, argparse
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700158
159 defaults = {'color': 'red', 'user': 'guest'}
160
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700161 parser = argparse.ArgumentParser()
162 parser.add_argument('-u', '--user')
163 parser.add_argument('-c', '--color')
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700164 namespace = parser.parse_args()
Jonathan Scholbach98480ce2019-11-12 01:49:41 +0100165 command_line_args = {k: v for k, v in vars(namespace).items() if v is not None}
Raymond Hettingerb2269ba2012-07-15 23:53:32 -0700166
167 combined = ChainMap(command_line_args, os.environ, defaults)
168 print(combined['color'])
169 print(combined['user'])
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000170
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700171Example patterns for using the :class:`ChainMap` class to simulate nested
172contexts::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000173
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700174 c = ChainMap() # Create root context
175 d = c.new_child() # Create nested child context
176 e = c.new_child() # Child of c, independent from d
177 e.maps[0] # Current context dictionary -- like Python's locals()
178 e.maps[-1] # Root context -- like Python's globals()
179 e.parents # Enclosing context chain -- like Python's nonlocals
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000180
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700181 d['x'] = 1 # Set value in current context
Raymond Hettinger9b0c6812019-02-20 13:49:23 -0800182 d['x'] # Get first key in the chain of contexts
Andrew Svetlov1a8db9c2012-10-04 19:29:25 +0300183 del d['x'] # Delete from current context
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700184 list(d) # All nested values
185 k in d # Check all nested values
186 len(d) # Number of nested values
187 d.items() # All nested items
188 dict(d) # Flatten into a regular dictionary
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000189
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700190The :class:`ChainMap` class only makes updates (writes and deletions) to the
191first mapping in the chain while lookups will search the full chain. However,
192if deep writes and deletions are desired, it is easy to make a subclass that
193updates keys found deeper in the chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000194
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700195 class DeepChainMap(ChainMap):
196 'Variant of ChainMap that allows direct updates to inner scopes'
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000197
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700198 def __setitem__(self, key, value):
199 for mapping in self.maps:
200 if key in mapping:
201 mapping[key] = value
202 return
203 self.maps[0][key] = value
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000204
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700205 def __delitem__(self, key):
206 for mapping in self.maps:
207 if key in mapping:
208 del mapping[key]
209 return
210 raise KeyError(key)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000211
Serhiy Storchakaf47036c2013-12-24 11:04:36 +0200212 >>> d = DeepChainMap({'zebra': 'black'}, {'elephant': 'blue'}, {'lion': 'yellow'})
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700213 >>> d['lion'] = 'orange' # update an existing key two levels down
214 >>> d['snake'] = 'red' # new keys get added to the topmost dict
215 >>> del d['elephant'] # remove an existing key one level down
Raymond Hettinger2064bb62018-09-10 18:43:08 -0700216 >>> d # display result
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700217 DeepChainMap({'zebra': 'black', 'snake': 'red'}, {}, {'lion': 'orange'})
Georg Brandl4dcf4742012-03-08 20:35:08 +0100218
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000219
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000220:class:`Counter` objects
221------------------------
222
223A counter tool is provided to support convenient and rapid tallies.
224For example::
225
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000226 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000227 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000228 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000229 ... cnt[word] += 1
230 >>> cnt
231 Counter({'blue': 3, 'red': 2, 'green': 1})
232
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000233 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000234 >>> import re
Raymond Hettingerfaaba592013-03-01 03:30:20 -0800235 >>> words = re.findall(r'\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000236 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000237 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
238 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
239
240.. class:: Counter([iterable-or-mapping])
241
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700242 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Andrés Delfino76e8fd72018-11-11 13:56:47 -0300243 It is a collection where elements are stored as dictionary keys
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700244 and their counts are stored as dictionary values. Counts are allowed to be
245 any integer value including zero or negative counts. The :class:`Counter`
246 class is similar to bags or multisets in other languages.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000247
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700248 Elements are counted from an *iterable* or initialized from another
249 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000250
Raymond Hettinger73662a52009-01-27 02:38:22 +0000251 >>> c = Counter() # a new, empty counter
252 >>> c = Counter('gallahad') # a new counter from an iterable
253 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
254 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000255
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700256 Counter objects have a dictionary interface except that they return a zero
257 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000258
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000259 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000260 >>> c['bacon'] # count of a missing element is zero
261 0
262
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700263 Setting a count to zero does not remove an element from a counter.
264 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000265
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000266 >>> c['sausage'] = 0 # counter entry with a zero count
267 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000268
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700269 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000270
Raymond Hettinger407c7342019-02-21 09:19:00 -0800271 .. versionchanged:: 3.7 As a :class:`dict` subclass, :class:`Counter`
272 Inherited the capability to remember insertion order. Math operations
273 on *Counter* objects also preserve order. Results are ordered
274 according to when an element is first encountered in the left operand
275 and then by the order encountered in the right operand.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000276
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700277 Counter objects support three methods beyond those available for all
278 dictionaries:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000279
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700280 .. method:: elements()
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000281
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700282 Return an iterator over elements repeating each as many times as its
Raymond Hettinger407c7342019-02-21 09:19:00 -0800283 count. Elements are returned in the order first encountered. If an
284 element's count is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000285
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000286 >>> c = Counter(a=4, b=2, c=0, d=-2)
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500287 >>> sorted(c.elements())
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000288 ['a', 'a', 'a', 'a', 'b', 'b']
289
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700290 .. method:: most_common([n])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000291
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700292 Return a list of the *n* most common elements and their counts from the
Raymond Hettingera3780252015-05-13 02:47:57 -0700293 most common to the least. If *n* is omitted or ``None``,
Michael Seiferte1052942018-03-26 13:40:35 +0200294 :meth:`most_common` returns *all* elements in the counter.
Raymond Hettinger407c7342019-02-21 09:19:00 -0800295 Elements with equal counts are ordered in the order first encountered:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000296
Raymond Hettinger407c7342019-02-21 09:19:00 -0800297 >>> Counter('abracadabra').most_common(3)
298 [('a', 5), ('b', 2), ('r', 2)]
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000299
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700300 .. method:: subtract([iterable-or-mapping])
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000301
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700302 Elements are subtracted from an *iterable* or from another *mapping*
303 (or counter). Like :meth:`dict.update` but subtracts counts instead
304 of replacing them. Both inputs and outputs may be zero or negative.
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000305
306 >>> c = Counter(a=4, b=2, c=0, d=-2)
307 >>> d = Counter(a=1, b=2, c=3, d=4)
308 >>> c.subtract(d)
Andrew Svetlovf6351722012-12-17 14:01:16 +0200309 >>> c
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000310 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
311
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700312 .. versionadded:: 3.2
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000313
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700314 The usual dictionary methods are available for :class:`Counter` objects
315 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000316
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700317 .. method:: fromkeys(iterable)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000318
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700319 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000320
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700321 .. method:: update([iterable-or-mapping])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000322
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700323 Elements are counted from an *iterable* or added-in from another
324 *mapping* (or counter). Like :meth:`dict.update` but adds counts
325 instead of replacing them. Also, the *iterable* is expected to be a
326 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000327
328Common patterns for working with :class:`Counter` objects::
329
Raymond Hettinger73662a52009-01-27 02:38:22 +0000330 sum(c.values()) # total of all counts
331 c.clear() # reset all counts
332 list(c) # list unique elements
333 set(c) # convert to a set
334 dict(c) # convert to a regular dictionary
335 c.items() # convert to a list of (elem, cnt) pairs
336 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
Georg Brandl87f3d7b2013-10-06 12:36:39 +0200337 c.most_common()[:-n-1:-1] # n least common elements
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700338 +c # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000339
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000340Several mathematical operations are provided for combining :class:`Counter`
341objects to produce multisets (counters that have counts greater than zero).
342Addition and subtraction combine counters by adding or subtracting the counts
343of corresponding elements. Intersection and union return the minimum and
344maximum of corresponding counts. Each operation can accept inputs with signed
345counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000346
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000347 >>> c = Counter(a=3, b=1)
348 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000349 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000350 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000351 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000352 Counter({'a': 2})
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500353 >>> c & d # intersection: min(c[x], d[x]) # doctest: +SKIP
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000354 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000355 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000356 Counter({'a': 3, 'b': 2})
357
Berker Peksag315e1042015-05-19 01:36:55 +0300358Unary addition and subtraction are shortcuts for adding an empty counter
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700359or subtracting from an empty counter.
360
361 >>> c = Counter(a=2, b=-4)
362 >>> +c
363 Counter({'a': 2})
364 >>> -c
365 Counter({'b': 4})
366
367.. versionadded:: 3.3
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700368 Added support for unary plus, unary minus, and in-place multiset operations.
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700369
Raymond Hettinger22f18852010-04-12 21:45:14 +0000370.. note::
371
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700372 Counters were primarily designed to work with positive integers to represent
373 running counts; however, care was taken to not unnecessarily preclude use
374 cases needing other types or negative values. To help with those use cases,
375 this section documents the minimum range and type restrictions.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000376
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700377 * The :class:`Counter` class itself is a dictionary subclass with no
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200378 restrictions on its keys and values. The values are intended to be numbers
379 representing counts, but you *could* store anything in the value field.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000380
Michael Seiferte1052942018-03-26 13:40:35 +0200381 * The :meth:`~Counter.most_common` method requires only that the values be orderable.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000382
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700383 * For in-place operations such as ``c[key] += 1``, the value type need only
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200384 support addition and subtraction. So fractions, floats, and decimals would
385 work and negative values are supported. The same is also true for
Michael Seiferte1052942018-03-26 13:40:35 +0200386 :meth:`~Counter.update` and :meth:`~Counter.subtract` which allow negative and zero values
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200387 for both inputs and outputs.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000388
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700389 * The multiset methods are designed only for use cases with positive values.
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200390 The inputs may be negative or zero, but only outputs with positive values
391 are created. There are no type restrictions, but the value type needs to
392 support addition, subtraction, and comparison.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000393
Michael Seiferte1052942018-03-26 13:40:35 +0200394 * The :meth:`~Counter.elements` method requires integer counts. It ignores zero and
Georg Brandl2fdc0f82012-10-06 22:38:20 +0200395 negative counts.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000396
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000397.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000398
Georg Brandl5d941342016-02-26 19:37:12 +0100399 * `Bag class <https://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700400 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000401
Georg Brandl5d941342016-02-26 19:37:12 +0100402 * Wikipedia entry for `Multisets <https://en.wikipedia.org/wiki/Multiset>`_.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000403
Georg Brandl5d941342016-02-26 19:37:12 +0100404 * `C++ multisets <http://www.java2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700405 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000406
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000407 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700408 *Knuth, Donald. The Art of Computer Programming Volume II,
409 Section 4.6.3, Exercise 19*.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000410
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000411 * To enumerate all distinct multisets of a given size over a given set of
Michael Seiferte1052942018-03-26 13:40:35 +0200412 elements, see :func:`itertools.combinations_with_replacement`::
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000413
Michael Seiferte1052942018-03-26 13:40:35 +0200414 map(Counter, combinations_with_replacement('ABC', 2)) # --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000415
416
Georg Brandl116aa622007-08-15 14:28:22 +0000417:class:`deque` objects
418----------------------
419
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000420.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000421
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700422 Returns a new deque object initialized left-to-right (using :meth:`append`) with
423 data from *iterable*. If *iterable* is not specified, the new deque is empty.
Georg Brandl116aa622007-08-15 14:28:22 +0000424
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700425 Deques are a generalization of stacks and queues (the name is pronounced "deck"
426 and is short for "double-ended queue"). Deques support thread-safe, memory
427 efficient appends and pops from either side of the deque with approximately the
428 same O(1) performance in either direction.
Georg Brandl116aa622007-08-15 14:28:22 +0000429
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700430 Though :class:`list` objects support similar operations, they are optimized for
431 fast fixed-length operations and incur O(n) memory movement costs for
432 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
433 position of the underlying data representation.
Georg Brandl116aa622007-08-15 14:28:22 +0000434
Georg Brandl116aa622007-08-15 14:28:22 +0000435
Serhiy Storchakaecf41da2016-10-19 16:29:26 +0300436 If *maxlen* is not specified or is ``None``, deques may grow to an
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700437 arbitrary length. Otherwise, the deque is bounded to the specified maximum
438 length. Once a bounded length deque is full, when new items are added, a
439 corresponding number of items are discarded from the opposite end. Bounded
440 length deques provide functionality similar to the ``tail`` filter in
441 Unix. They are also useful for tracking transactions and other pools of data
442 where only the most recent activity is of interest.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000443
Georg Brandl9afde1c2007-11-01 20:32:30 +0000444
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700445 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000446
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700447 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000448
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700449 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000450
451
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700452 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000453
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700454 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000455
456
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700457 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000458
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700459 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000460
461
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700462 .. method:: copy()
463
464 Create a shallow copy of the deque.
465
466 .. versionadded:: 3.5
467
468
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700469 .. method:: count(x)
Raymond Hettinger44459de2010-04-03 23:20:46 +0000470
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700471 Count the number of deque elements equal to *x*.
Raymond Hettinger44459de2010-04-03 23:20:46 +0000472
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700473 .. versionadded:: 3.2
Raymond Hettinger44459de2010-04-03 23:20:46 +0000474
Georg Brandl67b21b72010-08-17 15:07:14 +0000475
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700476 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000477
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700478 Extend the right side of the deque by appending elements from the iterable
479 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000480
481
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700482 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000483
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700484 Extend the left side of the deque by appending elements from *iterable*.
485 Note, the series of left appends results in reversing the order of
486 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000487
488
Raymond Hettinger855482e2015-05-23 08:57:58 -0700489 .. method:: index(x[, start[, stop]])
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700490
Raymond Hettinger855482e2015-05-23 08:57:58 -0700491 Return the position of *x* in the deque (at or after index *start*
492 and before index *stop*). Returns the first match or raises
493 :exc:`ValueError` if not found.
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700494
495 .. versionadded:: 3.5
496
497
498 .. method:: insert(i, x)
499
500 Insert *x* into the deque at position *i*.
501
Raymond Hettingera6389712016-02-01 21:21:19 -0800502 If the insertion would cause a bounded deque to grow beyond *maxlen*,
503 an :exc:`IndexError` is raised.
Raymond Hettinger37434322016-01-26 21:44:16 -0800504
Raymond Hettinger32ea1652015-03-21 01:37:37 -0700505 .. versionadded:: 3.5
506
507
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700508 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000509
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700510 Remove and return an element from the right side of the deque. If no
511 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000512
513
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700514 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000515
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700516 Remove and return an element from the left side of the deque. If no
517 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000518
519
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700520 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000521
Raymond Hettinger855482e2015-05-23 08:57:58 -0700522 Remove the first occurrence of *value*. If not found, raises a
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700523 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000524
Georg Brandl67b21b72010-08-17 15:07:14 +0000525
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700526 .. method:: reverse()
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000527
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700528 Reverse the elements of the deque in-place and then return ``None``.
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000529
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700530 .. versionadded:: 3.2
Georg Brandl116aa622007-08-15 14:28:22 +0000531
Georg Brandl67b21b72010-08-17 15:07:14 +0000532
Raymond Hettinger589c7182018-02-03 08:46:28 -0800533 .. method:: rotate(n=1)
Georg Brandl116aa622007-08-15 14:28:22 +0000534
Raymond Hettinger589c7182018-02-03 08:46:28 -0800535 Rotate the deque *n* steps to the right. If *n* is negative, rotate
536 to the left.
537
Raymond Hettingerca6c1252018-02-04 09:15:01 -0800538 When the deque is not empty, rotating one step to the right is equivalent
Raymond Hettinger589c7182018-02-03 08:46:28 -0800539 to ``d.appendleft(d.pop())``, and rotating one step to the left is
540 equivalent to ``d.append(d.popleft())``.
Benjamin Petersone41251e2008-04-25 01:59:09 +0000541
Georg Brandl116aa622007-08-15 14:28:22 +0000542
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700543 Deque objects also provide one read-only attribute:
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000544
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700545 .. attribute:: maxlen
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000546
Serhiy Storchakaecf41da2016-10-19 16:29:26 +0300547 Maximum size of a deque or ``None`` if unbounded.
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000548
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700549 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000550
551
Georg Brandl116aa622007-08-15 14:28:22 +0000552In addition to the above, deques support iteration, pickling, ``len(d)``,
553``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Windson yang98b85352018-11-05 06:34:22 +0800554the :keyword:`in` operator, and subscript references such as ``d[0]`` to access
555the first element. Indexed access is O(1) at both ends but slows to O(n) in
556the middle. For fast random access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000557
Raymond Hettinger41290a62015-03-31 08:12:23 -0700558Starting in version 3.5, deques support ``__add__()``, ``__mul__()``,
559and ``__imul__()``.
560
Christian Heimesfe337bf2008-03-23 21:54:12 +0000561Example:
562
563.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000564
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700565 >>> from collections import deque
566 >>> d = deque('ghi') # make a new deque with three items
567 >>> for elem in d: # iterate over the deque's elements
568 ... print(elem.upper())
569 G
570 H
571 I
Georg Brandl116aa622007-08-15 14:28:22 +0000572
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700573 >>> d.append('j') # add a new entry to the right side
574 >>> d.appendleft('f') # add a new entry to the left side
575 >>> d # show the representation of the deque
576 deque(['f', 'g', 'h', 'i', 'j'])
Georg Brandl116aa622007-08-15 14:28:22 +0000577
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700578 >>> d.pop() # return and remove the rightmost item
579 'j'
580 >>> d.popleft() # return and remove the leftmost item
581 'f'
582 >>> list(d) # list the contents of the deque
583 ['g', 'h', 'i']
584 >>> d[0] # peek at leftmost item
585 'g'
586 >>> d[-1] # peek at rightmost item
587 'i'
Georg Brandl116aa622007-08-15 14:28:22 +0000588
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700589 >>> list(reversed(d)) # list the contents of a deque in reverse
590 ['i', 'h', 'g']
591 >>> 'h' in d # search the deque
592 True
593 >>> d.extend('jkl') # add multiple elements at once
594 >>> d
595 deque(['g', 'h', 'i', 'j', 'k', 'l'])
596 >>> d.rotate(1) # right rotation
597 >>> d
598 deque(['l', 'g', 'h', 'i', 'j', 'k'])
599 >>> d.rotate(-1) # left rotation
600 >>> d
601 deque(['g', 'h', 'i', 'j', 'k', 'l'])
Georg Brandl116aa622007-08-15 14:28:22 +0000602
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700603 >>> deque(reversed(d)) # make a new deque in reverse order
604 deque(['l', 'k', 'j', 'i', 'h', 'g'])
605 >>> d.clear() # empty the deque
606 >>> d.pop() # cannot pop from an empty deque
607 Traceback (most recent call last):
608 File "<pyshell#6>", line 1, in -toplevel-
609 d.pop()
610 IndexError: pop from an empty deque
Georg Brandl116aa622007-08-15 14:28:22 +0000611
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700612 >>> d.extendleft('abc') # extendleft() reverses the input order
613 >>> d
614 deque(['c', 'b', 'a'])
Georg Brandl116aa622007-08-15 14:28:22 +0000615
616
Georg Brandl9afde1c2007-11-01 20:32:30 +0000617:class:`deque` Recipes
618^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000619
620This section shows various approaches to working with deques.
621
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000622Bounded length deques provide functionality similar to the ``tail`` filter
623in Unix::
624
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700625 def tail(filename, n=10):
626 'Return the last n lines of a file'
627 with open(filename) as f:
628 return deque(f, n)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000629
630Another approach to using deques is to maintain a sequence of recently
631added elements by appending to the right and popping to the left::
632
633 def moving_average(iterable, n=3):
634 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
635 # http://en.wikipedia.org/wiki/Moving_average
636 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000637 d = deque(itertools.islice(it, n-1))
638 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000639 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000640 for elem in it:
641 s += elem - d.popleft()
642 d.append(elem)
643 yield s / n
644
Raymond Hettinger08584952017-11-23 13:32:23 -0800645A `round-robin scheduler
646<https://en.wikipedia.org/wiki/Round-robin_scheduling>`_ can be implemented with
647input iterators stored in a :class:`deque`. Values are yielded from the active
648iterator in position zero. If that iterator is exhausted, it can be removed
649with :meth:`~deque.popleft`; otherwise, it can be cycled back to the end with
650the :meth:`~deque.rotate` method::
651
652 def roundrobin(*iterables):
653 "roundrobin('ABC', 'D', 'EF') --> A D E B F C"
654 iterators = deque(map(iter, iterables))
655 while iterators:
656 try:
657 while True:
658 yield next(iterators[0])
659 iterators.rotate(-1)
660 except StopIteration:
661 # Remove an exhausted iterator.
662 iterators.popleft()
663
Michael Seiferte1052942018-03-26 13:40:35 +0200664The :meth:`~deque.rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti0639d5a2009-12-19 23:26:38 +0000665deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Michael Seiferte1052942018-03-26 13:40:35 +0200666the ``rotate()`` method to position elements to be popped::
Georg Brandl116aa622007-08-15 14:28:22 +0000667
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700668 def delete_nth(d, n):
669 d.rotate(-n)
670 d.popleft()
671 d.rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000672
673To implement :class:`deque` slicing, use a similar approach applying
Michael Seiferte1052942018-03-26 13:40:35 +0200674:meth:`~deque.rotate` to bring a target element to the left side of the deque. Remove
675old entries with :meth:`~deque.popleft`, add new entries with :meth:`~deque.extend`, and then
Georg Brandl116aa622007-08-15 14:28:22 +0000676reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000677With minor variations on that approach, it is easy to implement Forth style
678stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
679``rot``, and ``roll``.
680
Georg Brandl116aa622007-08-15 14:28:22 +0000681
682:class:`defaultdict` objects
683----------------------------
684
Georg Brandl116aa622007-08-15 14:28:22 +0000685.. class:: defaultdict([default_factory[, ...]])
686
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700687 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
688 built-in :class:`dict` class. It overrides one method and adds one writable
689 instance variable. The remaining functionality is the same as for the
690 :class:`dict` class and is not documented here.
Georg Brandl116aa622007-08-15 14:28:22 +0000691
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700692 The first argument provides the initial value for the :attr:`default_factory`
693 attribute; it defaults to ``None``. All remaining arguments are treated the same
694 as if they were passed to the :class:`dict` constructor, including keyword
695 arguments.
Georg Brandl116aa622007-08-15 14:28:22 +0000696
Georg Brandl116aa622007-08-15 14:28:22 +0000697
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700698 :class:`defaultdict` objects support the following method in addition to the
699 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000700
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700701 .. method:: __missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000702
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700703 If the :attr:`default_factory` attribute is ``None``, this raises a
704 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000705
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700706 If :attr:`default_factory` is not ``None``, it is called without arguments
707 to provide a default value for the given *key*, this value is inserted in
708 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000709
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700710 If calling :attr:`default_factory` raises an exception this exception is
711 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000712
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700713 This method is called by the :meth:`__getitem__` method of the
714 :class:`dict` class when the requested key is not found; whatever it
715 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000716
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700717 Note that :meth:`__missing__` is *not* called for any operations besides
718 :meth:`__getitem__`. This means that :meth:`get` will, like normal
719 dictionaries, return ``None`` as a default rather than using
720 :attr:`default_factory`.
Benjamin Peterson871b9d12012-01-27 09:14:01 -0500721
Georg Brandl116aa622007-08-15 14:28:22 +0000722
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700723 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000724
Benjamin Petersone41251e2008-04-25 01:59:09 +0000725
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700726 .. attribute:: default_factory
Benjamin Petersone41251e2008-04-25 01:59:09 +0000727
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700728 This attribute is used by the :meth:`__missing__` method; it is
729 initialized from the first argument to the constructor, if present, or to
730 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000731
732
Georg Brandl116aa622007-08-15 14:28:22 +0000733:class:`defaultdict` Examples
734^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
735
Michael Seiferte1052942018-03-26 13:40:35 +0200736Using :class:`list` as the :attr:`~defaultdict.default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000737sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000738
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700739 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
740 >>> d = defaultdict(list)
741 >>> for k, v in s:
742 ... d[k].append(v)
743 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500744 >>> sorted(d.items())
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700745 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
Georg Brandl116aa622007-08-15 14:28:22 +0000746
747When each key is encountered for the first time, it is not already in the
Michael Seiferte1052942018-03-26 13:40:35 +0200748mapping; so an entry is automatically created using the :attr:`~defaultdict.default_factory`
Georg Brandl116aa622007-08-15 14:28:22 +0000749function which returns an empty :class:`list`. The :meth:`list.append`
750operation then attaches the value to the new list. When keys are encountered
751again, the look-up proceeds normally (returning the list for that key) and the
752:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000753simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000754
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700755 >>> d = {}
756 >>> for k, v in s:
757 ... d.setdefault(k, []).append(v)
758 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500759 >>> sorted(d.items())
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700760 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
Georg Brandl116aa622007-08-15 14:28:22 +0000761
Michael Seiferte1052942018-03-26 13:40:35 +0200762Setting the :attr:`~defaultdict.default_factory` to :class:`int` makes the
Georg Brandl116aa622007-08-15 14:28:22 +0000763:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000764languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000765
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700766 >>> s = 'mississippi'
767 >>> d = defaultdict(int)
768 >>> for k in s:
769 ... d[k] += 1
770 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500771 >>> sorted(d.items())
772 [('i', 4), ('m', 1), ('p', 2), ('s', 4)]
Georg Brandl116aa622007-08-15 14:28:22 +0000773
774When a letter is first encountered, it is missing from the mapping, so the
Michael Seiferte1052942018-03-26 13:40:35 +0200775:attr:`~defaultdict.default_factory` function calls :func:`int` to supply a default count of
Georg Brandl116aa622007-08-15 14:28:22 +0000776zero. The increment operation then builds up the count for each letter.
777
778The function :func:`int` which always returns zero is just a special case of
779constant functions. A faster and more flexible way to create constant functions
780is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000781zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000782
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700783 >>> def constant_factory(value):
784 ... return lambda: value
785 >>> d = defaultdict(constant_factory('<missing>'))
786 >>> d.update(name='John', action='ran')
787 >>> '%(name)s %(action)s to %(object)s' % d
788 'John ran to <missing>'
Georg Brandl116aa622007-08-15 14:28:22 +0000789
Michael Seiferte1052942018-03-26 13:40:35 +0200790Setting the :attr:`~defaultdict.default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000791:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000792
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700793 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
794 >>> d = defaultdict(set)
795 >>> for k, v in s:
796 ... d[k].add(v)
797 ...
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500798 >>> sorted(d.items())
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700799 [('blue', {2, 4}), ('red', {1, 3})]
Georg Brandl116aa622007-08-15 14:28:22 +0000800
801
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000802:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000803----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000804
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000805Named tuples assign meaning to each position in a tuple and allow for more readable,
806self-documenting code. They can be used wherever regular tuples are used, and
807they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000808
Raymond Hettinger39482072018-01-10 21:45:19 -0800809.. function:: namedtuple(typename, field_names, *, rename=False, defaults=None, module=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000810
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700811 Returns a new tuple subclass named *typename*. The new subclass is used to
812 create tuple-like objects that have fields accessible by attribute lookup as
813 well as being indexable and iterable. Instances of the subclass also have a
814 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
815 method which lists the tuple contents in a ``name=value`` format.
Georg Brandl116aa622007-08-15 14:28:22 +0000816
csabella97bf7222017-04-25 12:14:45 -0400817 The *field_names* are a sequence of strings such as ``['x', 'y']``.
818 Alternatively, *field_names* can be a single string with each fieldname
819 separated by whitespace and/or commas, for example ``'x y'`` or ``'x, y'``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000820
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700821 Any valid Python identifier may be used for a fieldname except for names
822 starting with an underscore. Valid identifiers consist of letters, digits,
823 and underscores but do not start with a digit or underscore and cannot be
824 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
825 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000826
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700827 If *rename* is true, invalid fieldnames are automatically replaced
828 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
829 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
830 ``def`` and the duplicate fieldname ``abc``.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000831
Raymond Hettinger39482072018-01-10 21:45:19 -0800832 *defaults* can be ``None`` or an :term:`iterable` of default values.
833 Since fields with a default value must come after any fields without a
834 default, the *defaults* are applied to the rightmost parameters. For
835 example, if the fieldnames are ``['x', 'y', 'z']`` and the defaults are
836 ``(1, 2)``, then ``x`` will be a required argument, ``y`` will default to
837 ``1``, and ``z`` will default to ``2``.
838
Raymond Hettinger0d5048c2016-09-12 00:18:31 -0700839 If *module* is defined, the ``__module__`` attribute of the named tuple is
840 set to that value.
841
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700842 Named tuple instances do not have per-instance dictionaries, so they are
843 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000844
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700845 .. versionchanged:: 3.1
Raymond Hettinger6538b432016-08-16 10:55:43 -0700846 Added support for *rename*.
847
848 .. versionchanged:: 3.6
849 The *verbose* and *rename* parameters became
850 :ref:`keyword-only arguments <keyword-only_parameter>`.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000851
Raymond Hettinger0d5048c2016-09-12 00:18:31 -0700852 .. versionchanged:: 3.6
853 Added the *module* parameter.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000854
Raymond Hettinger8b57d732017-09-10 10:23:36 -0700855 .. versionchanged:: 3.7
Raymond Hettinger5927cfd2019-03-16 11:16:29 -0700856 Removed the *verbose* parameter and the :attr:`_source` attribute.
Raymond Hettinger8b57d732017-09-10 10:23:36 -0700857
Raymond Hettinger39482072018-01-10 21:45:19 -0800858 .. versionchanged:: 3.7
859 Added the *defaults* parameter and the :attr:`_field_defaults`
860 attribute.
861
Christian Heimesfe337bf2008-03-23 21:54:12 +0000862.. doctest::
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700863 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000864
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700865 >>> # Basic example
866 >>> Point = namedtuple('Point', ['x', 'y'])
867 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
868 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
869 33
870 >>> x, y = p # unpack like a regular tuple
871 >>> x, y
872 (11, 22)
873 >>> p.x + p.y # fields also accessible by name
874 33
875 >>> p # readable __repr__ with a name=value style
876 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000877
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000878Named tuples are especially useful for assigning field names to result tuples returned
879by the :mod:`csv` or :mod:`sqlite3` modules::
880
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700881 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000882
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700883 import csv
884 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
885 print(emp.name, emp.title)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000886
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700887 import sqlite3
888 conn = sqlite3.connect('/companydata')
889 cursor = conn.cursor()
890 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
891 for emp in map(EmployeeRecord._make, cursor.fetchall()):
892 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000893
Christian Heimes99170a52007-12-19 02:07:34 +0000894In addition to the methods inherited from tuples, named tuples support
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700895three additional methods and two attributes. To prevent conflicts with
Christian Heimes2380ac72008-01-09 00:17:24 +0000896field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000897
Benjamin Peterson0b9fb802010-07-18 14:23:36 +0000898.. classmethod:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000899
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700900 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000901
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700902 .. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000903
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700904 >>> t = [11, 22]
905 >>> Point._make(t)
906 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000907
Christian Heimes790c8232008-01-07 21:14:23 +0000908.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000909
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -0800910 Return a new :class:`dict` which maps field names to their corresponding
Raymond Hettingerfd27f622016-08-16 13:13:17 -0700911 values:
912
913 .. doctest::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000914
Raymond Hettinger7a3602e2015-08-30 09:13:48 -0700915 >>> p = Point(x=11, y=22)
916 >>> p._asdict()
Raymond Hettinger0bb4bdf2019-01-31 00:59:50 -0800917 {'x': 11, 'y': 22}
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000918
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700919 .. versionchanged:: 3.1
920 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000921
Raymond Hettinger0bb4bdf2019-01-31 00:59:50 -0800922 .. versionchanged:: 3.8
923 Returns a regular :class:`dict` instead of an :class:`OrderedDict`.
924 As of Python 3.7, regular dicts are guaranteed to be ordered. If the
925 extra features of :class:`OrderedDict` are required, the suggested
926 remediation is to cast the result to the desired type:
927 ``OrderedDict(nt._asdict())``.
928
Ben Hoyt184bd822017-06-13 15:20:51 -0400929.. method:: somenamedtuple._replace(**kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000930
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700931 Return a new instance of the named tuple replacing specified fields with new
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700932 values::
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000933
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700934 >>> p = Point(x=11, y=22)
935 >>> p._replace(x=33)
936 Point(x=33, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000937
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700938 >>> for partnum, record in inventory.items():
939 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000940
Christian Heimes790c8232008-01-07 21:14:23 +0000941.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000942
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700943 Tuple of strings listing the field names. Useful for introspection
944 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000945
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700946 .. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000947
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700948 >>> p._fields # view the field names
949 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000950
Raymond Hettinger6fed9fd2012-06-11 00:38:14 -0700951 >>> Color = namedtuple('Color', 'red green blue')
952 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
953 >>> Pixel(11, 22, 128, 255, 0)
954 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000955
Raymond Hettinger23581c02019-03-18 00:27:39 -0700956.. attribute:: somenamedtuple._field_defaults
Raymond Hettinger39482072018-01-10 21:45:19 -0800957
958 Dictionary mapping field names to default values.
959
960 .. doctest::
961
962 >>> Account = namedtuple('Account', ['type', 'balance'], defaults=[0])
Raymond Hettinger23581c02019-03-18 00:27:39 -0700963 >>> Account._field_defaults
Raymond Hettinger39482072018-01-10 21:45:19 -0800964 {'balance': 0}
965 >>> Account('premium')
966 Account(type='premium', balance=0)
967
Christian Heimes0449f632007-12-15 01:27:15 +0000968To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000969function:
Christian Heimes0449f632007-12-15 01:27:15 +0000970
971 >>> getattr(p, 'x')
972 11
973
Raymond Hettingerfb28fcc2019-03-27 21:03:02 -0700974To convert a dictionary to a named tuple, use the double-star-operator
Raymond Hettinger651453a2009-02-11 00:20:02 +0000975(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000976
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700977 >>> d = {'x': 11, 'y': 22}
978 >>> Point(**d)
979 Point(x=11, y=22)
Christian Heimes99170a52007-12-19 02:07:34 +0000980
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000981Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000982functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000983a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000984
Raymond Hettingerfd27f622016-08-16 13:13:17 -0700985.. doctest::
986
987 >>> class Point(namedtuple('Point', ['x', 'y'])):
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500988 ... __slots__ = ()
989 ... @property
990 ... def hypot(self):
991 ... return (self.x ** 2 + self.y ** 2) ** 0.5
992 ... def __str__(self):
993 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000994
Georg Brandl0df79792008-10-04 18:33:26 +0000995 >>> for p in Point(3, 4), Point(14, 5/7):
Zachary Ware2b52c0a2016-08-09 17:38:22 -0500996 ... print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000997 Point: x= 3.000 y= 4.000 hypot= 5.000
998 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000999
Georg Brandlaf5c2382009-12-28 08:02:38 +00001000The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +00001001keep memory requirements low by preventing the creation of instance dictionaries.
1002
Christian Heimes2380ac72008-01-09 00:17:24 +00001003Subclassing is not useful for adding new, stored fields. Instead, simply
Michael Seiferte1052942018-03-26 13:40:35 +02001004create a new named tuple type from the :attr:`~somenamedtuple._fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +00001005
Christian Heimes25bb7832008-01-11 16:17:00 +00001006 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +00001007
Raymond Hettingereac503a2015-05-13 01:09:59 -07001008Docstrings can be customized by making direct assignments to the ``__doc__``
1009fields:
1010
1011 >>> Book = namedtuple('Book', ['id', 'title', 'authors'])
Raymond Hettinger850be0f2015-11-09 08:24:53 -08001012 >>> Book.__doc__ += ': Hardcover book in active collection'
Berker Peksagde7cafa2015-05-13 12:16:27 +03001013 >>> Book.id.__doc__ = '13-digit ISBN'
1014 >>> Book.title.__doc__ = 'Title of first printing'
Raymond Hettinger850be0f2015-11-09 08:24:53 -08001015 >>> Book.authors.__doc__ = 'List of authors sorted by last name'
Raymond Hettingereac503a2015-05-13 01:09:59 -07001016
Raymond Hettinger6e701312015-11-23 22:18:55 -08001017.. versionchanged:: 3.5
1018 Property docstrings became writeable.
1019
Georg Brandl8ed75cd2014-10-31 10:25:48 +01001020.. seealso::
1021
Raymond Hettinger9c685432019-03-16 12:53:23 -07001022 * See :class:`typing.NamedTuple` for a way to add type hints for named
1023 tuples. It also provides an elegant notation using the :keyword:`class`
1024 keyword::
1025
1026 class Component(NamedTuple):
1027 part_number: int
1028 weight: float
1029 description: Optional[str] = None
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001030
Raymond Hettingerfd27f622016-08-16 13:13:17 -07001031 * See :meth:`types.SimpleNamespace` for a mutable namespace based on an
1032 underlying dictionary instead of a tuple.
1033
Raymond Hettinger9c685432019-03-16 12:53:23 -07001034 * The :mod:`dataclasses` module provides a decorator and functions for
1035 automatically adding generated special methods to user-defined classes.
Raymond Hettinger2a75e8f2015-08-16 08:32:01 -07001036
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001037
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001038:class:`OrderedDict` objects
1039----------------------------
1040
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001041Ordered dictionaries are just like regular dictionaries but have some extra
1042capabilities relating to ordering operations. They have become less
1043important now that the built-in :class:`dict` class gained the ability
1044to remember insertion order (this new behavior became guaranteed in
1045Python 3.7).
1046
1047Some differences from :class:`dict` still remain:
1048
1049* The regular :class:`dict` was designed to be very good at mapping
1050 operations. Tracking insertion order was secondary.
1051
1052* The :class:`OrderedDict` was designed to be good at reordering operations.
1053 Space efficiency, iteration speed, and the performance of update
1054 operations were secondary.
1055
1056* Algorithmically, :class:`OrderedDict` can handle frequent reordering
1057 operations better than :class:`dict`. This makes it suitable for tracking
1058 recent accesses (for example in an `LRU cache
1059 <https://medium.com/@krishankantsinghal/my-first-blog-on-medium-583159139237>`_).
1060
1061* The equality operation for :class:`OrderedDict` checks for matching order.
1062
1063* The :meth:`popitem` method of :class:`OrderedDict` has a different
1064 signature. It accepts an optional argument to specify which item is popped.
1065
1066* :class:`OrderedDict` has a :meth:`move_to_end` method to
1067 efficiently reposition an element to an endpoint.
1068
1069* Until Python 3.8, :class:`dict` lacked a :meth:`__reversed__` method.
1070
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001071
1072.. class:: OrderedDict([items])
1073
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001074 Return an instance of a :class:`dict` subclass that has methods
1075 specialized for rearranging dictionary order.
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001076
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001077 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001078
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001079 .. method:: popitem(last=True)
Raymond Hettingerdc879f02009-03-19 20:30:56 +00001080
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001081 The :meth:`popitem` method for ordered dictionaries returns and removes a
Serhiy Storchaka4ecfa452016-05-16 09:31:54 +03001082 (key, value) pair. The pairs are returned in
1083 :abbr:`LIFO (last-in, first-out)` order if *last* is true
1084 or :abbr:`FIFO (first-in, first-out)` order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001085
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001086 .. method:: move_to_end(key, last=True)
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001087
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001088 Move an existing *key* to either end of an ordered dictionary. The item
1089 is moved to the right end if *last* is true (the default) or to the
1090 beginning if *last* is false. Raises :exc:`KeyError` if the *key* does
1091 not exist::
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001092
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001093 >>> d = OrderedDict.fromkeys('abcde')
1094 >>> d.move_to_end('b')
1095 >>> ''.join(d.keys())
1096 'acdeb'
1097 >>> d.move_to_end('b', last=False)
1098 >>> ''.join(d.keys())
1099 'bacde'
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001100
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001101 .. versionadded:: 3.2
Raymond Hettingerf45abc92010-09-06 21:26:09 +00001102
Raymond Hettingere9091502009-05-19 17:40:07 +00001103In addition to the usual mapping methods, ordered dictionaries also support
1104reverse iteration using :func:`reversed`.
1105
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001106Equality tests between :class:`OrderedDict` objects are order-sensitive
1107and are implemented as ``list(od1.items())==list(od2.items())``.
1108Equality tests between :class:`OrderedDict` objects and other
Serhiy Storchakabfdcd432013-10-13 23:09:14 +03001109:class:`~collections.abc.Mapping` objects are order-insensitive like regular
1110dictionaries. This allows :class:`OrderedDict` objects to be substituted
1111anywhere a regular dictionary is used.
Raymond Hettinger2d32f632009-03-02 21:24:57 +00001112
Serhiy Storchaka578c9212014-04-04 15:19:36 +03001113.. versionchanged:: 3.5
Martin Panter397625e2015-10-07 10:03:20 +00001114 The items, keys, and values :term:`views <dictionary view>`
1115 of :class:`OrderedDict` now support reverse iteration using :func:`reversed`.
Raymond Hettingerdc879f02009-03-19 20:30:56 +00001116
Raymond Hettingerd15bb262017-01-07 22:05:12 -08001117.. versionchanged:: 3.6
1118 With the acceptance of :pep:`468`, order is retained for keyword arguments
1119 passed to the :class:`OrderedDict` constructor and its :meth:`update`
1120 method.
1121
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001122:class:`OrderedDict` Examples and Recipes
1123^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1124
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001125It is straightforward to create an ordered dictionary variant
Andrew Svetlovff63e7a2012-08-31 13:54:54 +03001126that remembers the order the keys were *last* inserted.
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001127If a new entry overwrites an existing entry, the
1128original insertion position is changed and moved to the end::
1129
1130 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandl77570e22010-12-18 16:21:58 +00001131 'Store items in the order the keys were last added'
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001132
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001133 def __setitem__(self, key, value):
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001134 super().__setitem__(key, value)
wim glenn1a10a6b2019-05-13 20:10:14 -05001135 self.move_to_end(key)
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001136
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001137An :class:`OrderedDict` would also be useful for implementing
1138variants of :func:`functools.lru_cache`::
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001139
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001140 class LRU(OrderedDict):
1141 'Limit size, evicting the least recently looked-up key when full'
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001142
Serhiy Storchaka2085bd02019-06-01 11:00:15 +03001143 def __init__(self, maxsize=128, /, *args, **kwds):
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001144 self.maxsize = maxsize
1145 super().__init__(*args, **kwds)
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001146
Raymond Hettinger49fd6dd2019-02-21 00:05:30 -08001147 def __getitem__(self, key):
1148 value = super().__getitem__(key)
1149 self.move_to_end(key)
1150 return value
1151
1152 def __setitem__(self, key, value):
1153 super().__setitem__(key, value)
1154 if len(self) > self.maxsize:
1155 oldest = next(iter(self))
1156 del self[oldest]
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001157
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001158
1159:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +00001160-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001161
Georg Brandl48310cd2009-01-03 21:18:54 +00001162The class, :class:`UserDict` acts as a wrapper around dictionary objects.
1163The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001164subclass directly from :class:`dict`; however, this class can be easier
1165to work with because the underlying dictionary is accessible as an
1166attribute.
1167
1168.. class:: UserDict([initialdata])
1169
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001170 Class that simulates a dictionary. The instance's contents are kept in a
1171 regular dictionary, which is accessible via the :attr:`data` attribute of
1172 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
1173 initialized with its contents; note that a reference to *initialdata* will not
1174 be kept, allowing it be used for other purposes.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001175
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001176 In addition to supporting the methods and operations of mappings,
1177 :class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001178
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001179 .. attribute:: data
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001180
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001181 A real dictionary used to store the contents of the :class:`UserDict`
1182 class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001183
1184
1185
1186:class:`UserList` objects
1187-------------------------
1188
1189This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +00001190for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001191existing methods or add new ones. In this way, one can add new behaviors to
1192lists.
1193
Georg Brandl48310cd2009-01-03 21:18:54 +00001194The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001195subclass directly from :class:`list`; however, this class can be easier
1196to work with because the underlying list is accessible as an attribute.
1197
1198.. class:: UserList([list])
1199
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001200 Class that simulates a list. The instance's contents are kept in a regular
1201 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
1202 instances. The instance's contents are initially set to a copy of *list*,
1203 defaulting to the empty list ``[]``. *list* can be any iterable, for
1204 example a real Python list or a :class:`UserList` object.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001205
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001206 In addition to supporting the methods and operations of mutable sequences,
1207 :class:`UserList` instances provide the following attribute:
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001208
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001209 .. attribute:: data
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001210
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001211 A real :class:`list` object used to store the contents of the
1212 :class:`UserList` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001213
Zachary Ware80602e02014-01-13 20:38:57 -06001214**Subclassing requirements:** Subclasses of :class:`UserList` are expected to
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001215offer a constructor which can be called with either no arguments or one
1216argument. List operations which return a new sequence attempt to create an
1217instance of the actual implementation class. To do so, it assumes that the
1218constructor can be called with a single parameter, which is a sequence object
1219used as a data source.
1220
1221If a derived class does not wish to comply with this requirement, all of the
1222special methods supported by this class will need to be overridden; please
1223consult the sources for information about the methods which need to be provided
1224in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001225
1226:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +00001227---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001228
Georg Brandl48310cd2009-01-03 21:18:54 +00001229The class, :class:`UserString` acts as a wrapper around string objects.
1230The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001231subclass directly from :class:`str`; however, this class can be easier
1232to work with because the underlying string is accessible as an
1233attribute.
1234
Michael Seiferte1052942018-03-26 13:40:35 +02001235.. class:: UserString(seq)
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001236
Michael Seiferte1052942018-03-26 13:40:35 +02001237 Class that simulates a string object. The instance's
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001238 content is kept in a regular string object, which is accessible via the
1239 :attr:`data` attribute of :class:`UserString` instances. The instance's
Michael Seiferte1052942018-03-26 13:40:35 +02001240 contents are initially set to a copy of *seq*. The *seq* argument can
1241 be any object which can be converted into a string using the built-in
1242 :func:`str` function.
1243
1244 In addition to supporting the methods and operations of strings,
1245 :class:`UserString` instances provide the following attribute:
1246
1247 .. attribute:: data
1248
1249 A real :class:`str` object used to store the contents of the
1250 :class:`UserString` class.
Yury Selivanov336b37b2015-09-09 12:23:01 -04001251
1252 .. versionchanged:: 3.5
1253 New methods ``__getnewargs__``, ``__rmod__``, ``casefold``,
1254 ``format_map``, ``isprintable``, and ``maketrans``.