blob: 5298dbcabe8e73f2a501c83d5baeef9286932e05 [file] [log] [blame]
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001:mod:`collections` --- Container datatypes
2==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00003
4.. module:: collections
Raymond Hettinger53dbe392008-02-12 20:03:09 +00005 :synopsis: Container datatypes
Georg Brandl116aa622007-08-15 14:28:22 +00006.. moduleauthor:: Raymond Hettinger <python@rcn.com>
7.. sectionauthor:: Raymond Hettinger <python@rcn.com>
8
Christian Heimesfe337bf2008-03-23 21:54:12 +00009.. testsetup:: *
10
11 from collections import *
12 import itertools
13 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000014
Raymond Hettinger158c9c22011-02-22 00:41:50 +000015**Source code:** :source:`Lib/collections/__init__.py`
Raymond Hettinger10480942011-01-10 03:26:08 +000016
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000017--------------
18
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000019This module implements specialized container datatypes providing alternatives to
20Python's general purpose built-in containers, :class:`dict`, :class:`list`,
21:class:`set`, and :class:`tuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000022
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000023===================== ====================================================================
24:func:`namedtuple` factory function for creating tuple subclasses with named fields
25:class:`deque` list-like container with fast appends and pops on either end
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000026:class:`ChainMap` dict-like class for creating a single view of multiple mappings
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000027:class:`Counter` dict subclass for counting hashable objects
28:class:`OrderedDict` dict subclass that remembers the order entries were added
29:class:`defaultdict` dict subclass that calls a factory function to supply missing values
30:class:`UserDict` wrapper around dictionary objects for easier dict subclassing
31:class:`UserList` wrapper around list objects for easier list subclassing
32:class:`UserString` wrapper around string objects for easier string subclassing
33===================== ====================================================================
Georg Brandl116aa622007-08-15 14:28:22 +000034
Raymond Hettinger158c9c22011-02-22 00:41:50 +000035.. versionchanged:: 3.3
Éric Araujof90112e2011-06-03 19:18:41 +020036 Moved :ref:`collections-abstract-base-classes` to the :mod:`collections.abc` module.
Raymond Hettinger158c9c22011-02-22 00:41:50 +000037 For backwards compatibility, they continue to be visible in this module
38 as well.
Mark Summerfield08898b42007-09-05 08:43:04 +000039
40
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000041:class:`ChainMap` objects
42-------------------------
43
44A :class:`ChainMap` class is provided for quickly linking a number of mappings
45so they can be treated as a single unit. It is often much faster than creating
46a new dictionary and running multiple :meth:`~dict.update` calls.
47
48The class can be used to simulate nested scopes and is useful in templating.
49
50.. class:: ChainMap(*maps)
51
52 A :class:`ChainMap` groups multiple dicts or other mappings together to
53 create a single, updateable view. If no *maps* are specified, a single empty
54 dictionary is provided so that a new chain always has at least one mapping.
55
56 The underlying mappings are stored in a list. That list is public and can
57 accessed or updated using the *maps* attribute. There is no other state.
58
59 Lookups search the underlying mappings successively until a key is found. In
60 contrast, writes, updates, and deletions only operate on the first mapping.
61
Raymond Hettinger4bbde722011-04-11 17:57:21 -070062 A :class:`ChainMap` incorporates the underlying mappings by reference. So, if
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000063 one of the underlying mappings gets updated, those changes will be reflected
Raymond Hettinger4bbde722011-04-11 17:57:21 -070064 in :class:`ChainMap`.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000065
66 All of the usual dictionary methods are supported. In addition, there is a
67 *maps* attribute, a method for creating new subcontexts, and a property for
68 accessing all but the first mapping:
69
70 .. attribute:: maps
71
72 A user updateable list of mappings. The list is ordered from
73 first-searched to last-searched. It is the only stored state and can
Éric Araujoe5905a92011-08-16 19:09:56 +020074 be modified to change which mappings are searched. The list should
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000075 always contain at least one mapping.
76
77 .. method:: new_child()
78
79 Returns a new :class:`ChainMap` containing a new :class:`dict` followed by
80 all of the maps in the current instance. A call to ``d.new_child()`` is
81 equivalent to: ``ChainMap({}, *d.maps)``. This method is used for
82 creating subcontexts that can be updated without altering values in any
83 of the parent mappings.
84
Éric Araujo7ebadd52011-06-11 03:27:03 +020085 .. method:: parents()
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000086
87 Returns a new :class:`ChainMap` containing all of the maps in the current
88 instance except the first one. This is useful for skipping the first map
89 in the search. The use-cases are similar to those for the
90 :keyword:`nonlocal` keyword used in :term:`nested scopes <nested scope>`.
91 The use-cases also parallel those for the builtin :func:`super` function.
92 A reference to ``d.parents`` is equivalent to: ``ChainMap(*d.maps[1:])``.
93
Georg Brandl4dcf4742012-03-08 20:35:08 +010094 .. versionadded:: 3.3
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000095
Georg Brandl4dcf4742012-03-08 20:35:08 +010096 Example of simulating Python's internal lookup chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000097
Georg Brandl4dcf4742012-03-08 20:35:08 +010098 import builtins
99 pylookup = ChainMap(locals(), globals(), vars(builtins))
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000100
Georg Brandl4dcf4742012-03-08 20:35:08 +0100101 Example of letting user specified values take precedence over environment
102 variables which in turn take precedence over default values::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000103
Georg Brandl4dcf4742012-03-08 20:35:08 +0100104 import os, argparse
105 defaults = {'color': 'red', 'user': guest}
106 parser = argparse.ArgumentParser()
107 parser.add_argument('-u', '--user')
108 parser.add_argument('-c', '--color')
109 user_specified = vars(parser.parse_args())
110 combined = ChainMap(user_specified, os.environ, defaults)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000111
Georg Brandl4dcf4742012-03-08 20:35:08 +0100112 Example patterns for using the :class:`ChainMap` class to simulate nested
113 contexts::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000114
Georg Brandl4dcf4742012-03-08 20:35:08 +0100115 c = ChainMap() # Create root context
116 d = c.new_child() # Create nested child context
117 e = c.new_child() # Child of c, independent from d
118 e.maps[0] # Current context dictionary -- like Python's locals()
119 e.maps[-1] # Root context -- like Python's globals()
120 e.parents # Enclosing context chain -- like Python's nonlocals
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000121
Georg Brandl4dcf4742012-03-08 20:35:08 +0100122 d['x'] # Get first key in the chain of contexts
123 d['x'] = 1 # Set value in current context
124 del['x'] # Delete from current context
125 list(d) # All nested values
126 k in d # Check all nested values
127 len(d) # Number of nested values
128 d.items() # All nested items
129 dict(d) # Flatten into a regular dictionary
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000130
Georg Brandl4dcf4742012-03-08 20:35:08 +0100131 .. seealso::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000132
Georg Brandl4dcf4742012-03-08 20:35:08 +0100133 * The `MultiContext class
134 <http://svn.enthought.com/svn/enthought/CodeTools/trunk/enthought/contexts/multi_context.py>`_
135 in the Enthought `CodeTools package
136 <https://github.com/enthought/codetools>`_ has options to support
137 writing to any mapping in the chain.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000138
Georg Brandl4dcf4742012-03-08 20:35:08 +0100139 * Django's `Context class
140 <http://code.djangoproject.com/browser/django/trunk/django/template/context.py>`_
141 for templating is a read-only chain of mappings. It also features
142 pushing and popping of contexts similar to the
143 :meth:`~collections.ChainMap.new_child` method and the
144 :meth:`~collections.ChainMap.parents` property.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000145
Georg Brandl4dcf4742012-03-08 20:35:08 +0100146 * The `Nested Contexts recipe
147 <http://code.activestate.com/recipes/577434/>`_ has options to control
148 whether writes and other mutations apply only to the first mapping or to
149 any mapping in the chain.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000150
Georg Brandl4dcf4742012-03-08 20:35:08 +0100151 * A `greatly simplified read-only version of Chainmap
152 <http://code.activestate.com/recipes/305268/>`_.
153
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000154
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000155:class:`Counter` objects
156------------------------
157
158A counter tool is provided to support convenient and rapid tallies.
159For example::
160
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000161 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000162 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000163 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000164 ... cnt[word] += 1
165 >>> cnt
166 Counter({'blue': 3, 'red': 2, 'green': 1})
167
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000168 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000169 >>> import re
170 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000171 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000172 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
173 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
174
175.. class:: Counter([iterable-or-mapping])
176
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000177 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000178 It is an unordered collection where elements are stored as dictionary keys
179 and their counts are stored as dictionary values. Counts are allowed to be
180 any integer value including zero or negative counts. The :class:`Counter`
181 class is similar to bags or multisets in other languages.
182
183 Elements are counted from an *iterable* or initialized from another
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000184 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000185
Raymond Hettinger73662a52009-01-27 02:38:22 +0000186 >>> c = Counter() # a new, empty counter
187 >>> c = Counter('gallahad') # a new counter from an iterable
188 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
189 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000190
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000191 Counter objects have a dictionary interface except that they return a zero
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000192 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000193
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000194 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000195 >>> c['bacon'] # count of a missing element is zero
196 0
197
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000198 Setting a count to zero does not remove an element from a counter.
199 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000200
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000201 >>> c['sausage'] = 0 # counter entry with a zero count
202 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000203
Benjamin Petersond45bf582009-03-02 21:44:54 +0000204 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000205
206
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000207 Counter objects support three methods beyond those available for all
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000208 dictionaries:
209
210 .. method:: elements()
211
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000212 Return an iterator over elements repeating each as many times as its
213 count. Elements are returned in arbitrary order. If an element's count
214 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000215
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000216 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000217 >>> list(c.elements())
218 ['a', 'a', 'a', 'a', 'b', 'b']
219
220 .. method:: most_common([n])
221
Raymond Hettinger73662a52009-01-27 02:38:22 +0000222 Return a list of the *n* most common elements and their counts from the
Raymond Hettingerd04fa312009-02-04 19:45:13 +0000223 most common to the least. If *n* is not specified, :func:`most_common`
Raymond Hettinger73662a52009-01-27 02:38:22 +0000224 returns *all* elements in the counter. Elements with equal counts are
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000225 ordered arbitrarily:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000226
227 >>> Counter('abracadabra').most_common(3)
228 [('a', 5), ('r', 2), ('b', 2)]
229
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000230 .. method:: subtract([iterable-or-mapping])
231
232 Elements are subtracted from an *iterable* or from another *mapping*
233 (or counter). Like :meth:`dict.update` but subtracts counts instead
234 of replacing them. Both inputs and outputs may be zero or negative.
235
236 >>> c = Counter(a=4, b=2, c=0, d=-2)
237 >>> d = Counter(a=1, b=2, c=3, d=4)
238 >>> c.subtract(d)
239 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
240
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000241 .. versionadded:: 3.2
242
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000243 The usual dictionary methods are available for :class:`Counter` objects
244 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000245
246 .. method:: fromkeys(iterable)
247
Raymond Hettinger73662a52009-01-27 02:38:22 +0000248 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000249
250 .. method:: update([iterable-or-mapping])
251
Raymond Hettinger73662a52009-01-27 02:38:22 +0000252 Elements are counted from an *iterable* or added-in from another
253 *mapping* (or counter). Like :meth:`dict.update` but adds counts
254 instead of replacing them. Also, the *iterable* is expected to be a
255 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000256
257Common patterns for working with :class:`Counter` objects::
258
Raymond Hettinger73662a52009-01-27 02:38:22 +0000259 sum(c.values()) # total of all counts
260 c.clear() # reset all counts
261 list(c) # list unique elements
262 set(c) # convert to a set
263 dict(c) # convert to a regular dictionary
264 c.items() # convert to a list of (elem, cnt) pairs
265 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
266 c.most_common()[:-n:-1] # n least common elements
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700267 +c # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000268
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000269Several mathematical operations are provided for combining :class:`Counter`
270objects to produce multisets (counters that have counts greater than zero).
271Addition and subtraction combine counters by adding or subtracting the counts
272of corresponding elements. Intersection and union return the minimum and
273maximum of corresponding counts. Each operation can accept inputs with signed
274counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000275
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000276 >>> c = Counter(a=3, b=1)
277 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000278 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000279 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000280 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000281 Counter({'a': 2})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000282 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000283 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000284 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000285 Counter({'a': 3, 'b': 2})
286
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700287Unary addition and substraction are shortcuts for adding an empty counter
288or subtracting from an empty counter.
289
290 >>> c = Counter(a=2, b=-4)
291 >>> +c
292 Counter({'a': 2})
293 >>> -c
294 Counter({'b': 4})
295
296.. versionadded:: 3.3
Raymond Hettingerbecd5682011-10-19 13:40:37 -0700297 Added support for unary plus, unary minus, and in-place multiset operations.
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700298
Raymond Hettinger22f18852010-04-12 21:45:14 +0000299.. note::
300
301 Counters were primarily designed to work with positive integers to represent
302 running counts; however, care was taken to not unnecessarily preclude use
303 cases needing other types or negative values. To help with those use cases,
304 this section documents the minimum range and type restrictions.
305
306 * The :class:`Counter` class itself is a dictionary subclass with no
307 restrictions on its keys and values. The values are intended to be numbers
308 representing counts, but you *could* store anything in the value field.
309
310 * The :meth:`most_common` method requires only that the values be orderable.
311
312 * For in-place operations such as ``c[key] += 1``, the value type need only
313 support addition and subtraction. So fractions, floats, and decimals would
314 work and negative values are supported. The same is also true for
315 :meth:`update` and :meth:`subtract` which allow negative and zero values
316 for both inputs and outputs.
317
318 * The multiset methods are designed only for use cases with positive values.
319 The inputs may be negative or zero, but only outputs with positive values
320 are created. There are no type restrictions, but the value type needs to
Ezio Melottie130a522011-10-19 10:58:56 +0300321 support addition, subtraction, and comparison.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000322
323 * The :meth:`elements` method requires integer counts. It ignores zero and
324 negative counts.
325
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000326.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000327
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000328 * `Counter class <http://code.activestate.com/recipes/576611/>`_
329 adapted for Python 2.5 and an early `Bag recipe
330 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
331
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000332 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
333 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000334
Éric Araujo08c9bd52011-04-24 02:59:02 +0200335 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000336
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000337 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000338 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000339
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000340 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000341 *Knuth, Donald. The Art of Computer Programming Volume II,
Éric Araujo08c9bd52011-04-24 02:59:02 +0200342 Section 4.6.3, Exercise 19*.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000343
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000344 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd07d9392009-01-27 04:20:44 +0000345 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000346
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000347 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000348
349
Georg Brandl116aa622007-08-15 14:28:22 +0000350:class:`deque` objects
351----------------------
352
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000353.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000354
355 Returns a new deque object initialized left-to-right (using :meth:`append`) with
356 data from *iterable*. If *iterable* is not specified, the new deque is empty.
357
358 Deques are a generalization of stacks and queues (the name is pronounced "deck"
359 and is short for "double-ended queue"). Deques support thread-safe, memory
360 efficient appends and pops from either side of the deque with approximately the
361 same O(1) performance in either direction.
362
363 Though :class:`list` objects support similar operations, they are optimized for
364 fast fixed-length operations and incur O(n) memory movement costs for
365 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
366 position of the underlying data representation.
367
Georg Brandl116aa622007-08-15 14:28:22 +0000368
Georg Brandl9afde1c2007-11-01 20:32:30 +0000369 If *maxlen* is not specified or is *None*, deques may grow to an
370 arbitrary length. Otherwise, the deque is bounded to the specified maximum
371 length. Once a bounded length deque is full, when new items are added, a
372 corresponding number of items are discarded from the opposite end. Bounded
373 length deques provide functionality similar to the ``tail`` filter in
374 Unix. They are also useful for tracking transactions and other pools of data
375 where only the most recent activity is of interest.
376
Georg Brandl9afde1c2007-11-01 20:32:30 +0000377
Benjamin Petersone41251e2008-04-25 01:59:09 +0000378 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000379
Benjamin Petersone41251e2008-04-25 01:59:09 +0000380 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000381
Benjamin Petersone41251e2008-04-25 01:59:09 +0000382 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000383
384
Benjamin Petersone41251e2008-04-25 01:59:09 +0000385 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000386
Benjamin Petersone41251e2008-04-25 01:59:09 +0000387 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000388
389
Benjamin Petersone41251e2008-04-25 01:59:09 +0000390 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000391
Benjamin Petersone41251e2008-04-25 01:59:09 +0000392 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000393
394
Raymond Hettinger44459de2010-04-03 23:20:46 +0000395 .. method:: count(x)
396
397 Count the number of deque elements equal to *x*.
398
399 .. versionadded:: 3.2
400
Georg Brandl67b21b72010-08-17 15:07:14 +0000401
Benjamin Petersone41251e2008-04-25 01:59:09 +0000402 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000403
Benjamin Petersone41251e2008-04-25 01:59:09 +0000404 Extend the right side of the deque by appending elements from the iterable
405 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000406
407
Benjamin Petersone41251e2008-04-25 01:59:09 +0000408 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000409
Benjamin Petersone41251e2008-04-25 01:59:09 +0000410 Extend the left side of the deque by appending elements from *iterable*.
411 Note, the series of left appends results in reversing the order of
412 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000413
414
Benjamin Petersone41251e2008-04-25 01:59:09 +0000415 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000416
Benjamin Petersone41251e2008-04-25 01:59:09 +0000417 Remove and return an element from the right side of the deque. If no
418 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000419
420
Benjamin Petersone41251e2008-04-25 01:59:09 +0000421 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000422
Benjamin Petersone41251e2008-04-25 01:59:09 +0000423 Remove and return an element from the left side of the deque. If no
424 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000425
426
Benjamin Petersone41251e2008-04-25 01:59:09 +0000427 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000428
Benjamin Petersone41251e2008-04-25 01:59:09 +0000429 Removed the first occurrence of *value*. If not found, raises a
430 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000431
Georg Brandl67b21b72010-08-17 15:07:14 +0000432
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000433 .. method:: reverse()
434
435 Reverse the elements of the deque in-place and then return ``None``.
436
437 .. versionadded:: 3.2
Georg Brandl116aa622007-08-15 14:28:22 +0000438
Georg Brandl67b21b72010-08-17 15:07:14 +0000439
Benjamin Petersone41251e2008-04-25 01:59:09 +0000440 .. method:: rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000441
Benjamin Petersone41251e2008-04-25 01:59:09 +0000442 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
443 the left. Rotating one step to the right is equivalent to:
444 ``d.appendleft(d.pop())``.
445
Georg Brandl116aa622007-08-15 14:28:22 +0000446
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000447 Deque objects also provide one read-only attribute:
448
449 .. attribute:: maxlen
450
451 Maximum size of a deque or *None* if unbounded.
452
Raymond Hettinger150fb9c2009-03-10 22:48:06 +0000453 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000454
455
Georg Brandl116aa622007-08-15 14:28:22 +0000456In addition to the above, deques support iteration, pickling, ``len(d)``,
457``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson206e3072008-10-19 14:07:49 +0000458the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
459access is O(1) at both ends but slows to O(n) in the middle. For fast random
460access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000461
Christian Heimesfe337bf2008-03-23 21:54:12 +0000462Example:
463
464.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000465
466 >>> from collections import deque
467 >>> d = deque('ghi') # make a new deque with three items
468 >>> for elem in d: # iterate over the deque's elements
Neal Norwitz752abd02008-05-13 04:55:24 +0000469 ... print(elem.upper())
Georg Brandl116aa622007-08-15 14:28:22 +0000470 G
471 H
472 I
473
474 >>> d.append('j') # add a new entry to the right side
475 >>> d.appendleft('f') # add a new entry to the left side
476 >>> d # show the representation of the deque
477 deque(['f', 'g', 'h', 'i', 'j'])
478
479 >>> d.pop() # return and remove the rightmost item
480 'j'
481 >>> d.popleft() # return and remove the leftmost item
482 'f'
483 >>> list(d) # list the contents of the deque
484 ['g', 'h', 'i']
485 >>> d[0] # peek at leftmost item
486 'g'
487 >>> d[-1] # peek at rightmost item
488 'i'
489
490 >>> list(reversed(d)) # list the contents of a deque in reverse
491 ['i', 'h', 'g']
492 >>> 'h' in d # search the deque
493 True
494 >>> d.extend('jkl') # add multiple elements at once
495 >>> d
496 deque(['g', 'h', 'i', 'j', 'k', 'l'])
497 >>> d.rotate(1) # right rotation
498 >>> d
499 deque(['l', 'g', 'h', 'i', 'j', 'k'])
500 >>> d.rotate(-1) # left rotation
501 >>> d
502 deque(['g', 'h', 'i', 'j', 'k', 'l'])
503
504 >>> deque(reversed(d)) # make a new deque in reverse order
505 deque(['l', 'k', 'j', 'i', 'h', 'g'])
506 >>> d.clear() # empty the deque
507 >>> d.pop() # cannot pop from an empty deque
508 Traceback (most recent call last):
509 File "<pyshell#6>", line 1, in -toplevel-
510 d.pop()
511 IndexError: pop from an empty deque
512
513 >>> d.extendleft('abc') # extendleft() reverses the input order
514 >>> d
515 deque(['c', 'b', 'a'])
516
517
Georg Brandl9afde1c2007-11-01 20:32:30 +0000518:class:`deque` Recipes
519^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000520
521This section shows various approaches to working with deques.
522
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000523Bounded length deques provide functionality similar to the ``tail`` filter
524in Unix::
525
526 def tail(filename, n=10):
527 'Return the last n lines of a file'
Éric Araujoa3dd56b2011-03-11 17:42:48 +0100528 with open(filename) as f:
529 return deque(f, n)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000530
531Another approach to using deques is to maintain a sequence of recently
532added elements by appending to the right and popping to the left::
533
534 def moving_average(iterable, n=3):
535 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
536 # http://en.wikipedia.org/wiki/Moving_average
537 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000538 d = deque(itertools.islice(it, n-1))
539 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000540 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000541 for elem in it:
542 s += elem - d.popleft()
543 d.append(elem)
544 yield s / n
545
Georg Brandl116aa622007-08-15 14:28:22 +0000546The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti0639d5a2009-12-19 23:26:38 +0000547deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl116aa622007-08-15 14:28:22 +0000548the :meth:`rotate` method to position elements to be popped::
549
550 def delete_nth(d, n):
551 d.rotate(-n)
552 d.popleft()
553 d.rotate(n)
554
555To implement :class:`deque` slicing, use a similar approach applying
556:meth:`rotate` to bring a target element to the left side of the deque. Remove
557old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
558reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000559With minor variations on that approach, it is easy to implement Forth style
560stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
561``rot``, and ``roll``.
562
Georg Brandl116aa622007-08-15 14:28:22 +0000563
564:class:`defaultdict` objects
565----------------------------
566
Georg Brandl116aa622007-08-15 14:28:22 +0000567.. class:: defaultdict([default_factory[, ...]])
568
569 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandl22b34312009-07-26 14:54:51 +0000570 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl116aa622007-08-15 14:28:22 +0000571 instance variable. The remaining functionality is the same as for the
572 :class:`dict` class and is not documented here.
573
574 The first argument provides the initial value for the :attr:`default_factory`
575 attribute; it defaults to ``None``. All remaining arguments are treated the same
576 as if they were passed to the :class:`dict` constructor, including keyword
577 arguments.
578
Georg Brandl116aa622007-08-15 14:28:22 +0000579
Benjamin Petersone41251e2008-04-25 01:59:09 +0000580 :class:`defaultdict` objects support the following method in addition to the
581 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000582
Benjamin Petersond319ad52010-07-18 14:27:02 +0000583 .. method:: __missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000584
Benjamin Peterson5478b472008-09-17 22:25:09 +0000585 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersone41251e2008-04-25 01:59:09 +0000586 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000587
Benjamin Petersone41251e2008-04-25 01:59:09 +0000588 If :attr:`default_factory` is not ``None``, it is called without arguments
589 to provide a default value for the given *key*, this value is inserted in
590 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000591
Benjamin Petersone41251e2008-04-25 01:59:09 +0000592 If calling :attr:`default_factory` raises an exception this exception is
593 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000594
Benjamin Petersone41251e2008-04-25 01:59:09 +0000595 This method is called by the :meth:`__getitem__` method of the
596 :class:`dict` class when the requested key is not found; whatever it
597 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000598
Benjamin Peterson871b9d12012-01-27 09:14:01 -0500599 Note that :meth:`__missing__` is *not* called for any operations besides
600 :meth:`__getitem__`. This means that :meth:`get` will, like normal
601 dictionaries, return ``None`` as a default rather than using
602 :attr:`default_factory`.
603
Georg Brandl116aa622007-08-15 14:28:22 +0000604
Benjamin Petersone41251e2008-04-25 01:59:09 +0000605 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000606
Benjamin Petersone41251e2008-04-25 01:59:09 +0000607
Benjamin Petersond319ad52010-07-18 14:27:02 +0000608 .. attribute:: default_factory
Benjamin Petersone41251e2008-04-25 01:59:09 +0000609
610 This attribute is used by the :meth:`__missing__` method; it is
611 initialized from the first argument to the constructor, if present, or to
612 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000613
614
Georg Brandl116aa622007-08-15 14:28:22 +0000615:class:`defaultdict` Examples
616^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
617
618Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000619sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000620
621 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
622 >>> d = defaultdict(list)
623 >>> for k, v in s:
624 ... d[k].append(v)
625 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000626 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000627 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
628
629When each key is encountered for the first time, it is not already in the
630mapping; so an entry is automatically created using the :attr:`default_factory`
631function which returns an empty :class:`list`. The :meth:`list.append`
632operation then attaches the value to the new list. When keys are encountered
633again, the look-up proceeds normally (returning the list for that key) and the
634:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000635simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000636
637 >>> d = {}
638 >>> for k, v in s:
639 ... d.setdefault(k, []).append(v)
640 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000641 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000642 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
643
644Setting the :attr:`default_factory` to :class:`int` makes the
645:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000646languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000647
648 >>> s = 'mississippi'
649 >>> d = defaultdict(int)
650 >>> for k in s:
651 ... d[k] += 1
652 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000653 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000654 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
655
656When a letter is first encountered, it is missing from the mapping, so the
657:attr:`default_factory` function calls :func:`int` to supply a default count of
658zero. The increment operation then builds up the count for each letter.
659
660The function :func:`int` which always returns zero is just a special case of
661constant functions. A faster and more flexible way to create constant functions
662is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000663zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000664
665 >>> def constant_factory(value):
666 ... return lambda: value
667 >>> d = defaultdict(constant_factory('<missing>'))
668 >>> d.update(name='John', action='ran')
669 >>> '%(name)s %(action)s to %(object)s' % d
670 'John ran to <missing>'
671
672Setting the :attr:`default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000673:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000674
675 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
676 >>> d = defaultdict(set)
677 >>> for k, v in s:
678 ... d[k].add(v)
679 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000680 >>> list(d.items())
Raymond Hettingerba7b5602011-03-22 22:57:49 -0700681 [('blue', {2, 4}), ('red', {1, 3})]
Georg Brandl116aa622007-08-15 14:28:22 +0000682
683
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000684:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000685----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000686
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000687Named tuples assign meaning to each position in a tuple and allow for more readable,
688self-documenting code. They can be used wherever regular tuples are used, and
689they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000690
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000691.. function:: namedtuple(typename, field_names, verbose=False, rename=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000692
693 Returns a new tuple subclass named *typename*. The new subclass is used to
Christian Heimesc3f30c42008-02-22 16:37:40 +0000694 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl116aa622007-08-15 14:28:22 +0000695 well as being indexable and iterable. Instances of the subclass also have a
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000696 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl116aa622007-08-15 14:28:22 +0000697 method which lists the tuple contents in a ``name=value`` format.
698
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000699 The *field_names* are a single string with each fieldname separated by whitespace
700 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
Christian Heimes25bb7832008-01-11 16:17:00 +0000701 can be a sequence of strings such as ``['x', 'y']``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000702
703 Any valid Python identifier may be used for a fieldname except for names
Christian Heimes0449f632007-12-15 01:27:15 +0000704 starting with an underscore. Valid identifiers consist of letters, digits,
705 and underscores but do not start with a digit or underscore and cannot be
Georg Brandlf6945182008-02-01 11:56:49 +0000706 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
Georg Brandl9afde1c2007-11-01 20:32:30 +0000707 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000708
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000709 If *rename* is true, invalid fieldnames are automatically replaced
710 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger85737b82009-04-02 22:37:59 +0000711 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000712 ``def`` and the duplicate fieldname ``abc``.
713
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700714 If *verbose* is true, the class definition is printed after it is
715 built. This option is outdated; instead, it is simpler to print the
716 :attr:`_source` attribute.
Georg Brandl116aa622007-08-15 14:28:22 +0000717
Georg Brandl9afde1c2007-11-01 20:32:30 +0000718 Named tuple instances do not have per-instance dictionaries, so they are
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000719 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000720
Raymond Hettingerb62ad242009-03-02 22:16:43 +0000721 .. versionchanged:: 3.1
Georg Brandl67b21b72010-08-17 15:07:14 +0000722 Added support for *rename*.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000723
Christian Heimesfe337bf2008-03-23 21:54:12 +0000724
725.. doctest::
726 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000727
Raymond Hettinger0ef956f2010-11-21 23:23:29 +0000728 >>> # Basic example
Raymond Hettinger15aded82011-03-15 17:25:51 -0700729 >>> Point = namedtuple('Point', ['x', 'y'])
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000730 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Christian Heimes99170a52007-12-19 02:07:34 +0000731 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000732 33
733 >>> x, y = p # unpack like a regular tuple
734 >>> x, y
735 (11, 22)
Christian Heimesc3f30c42008-02-22 16:37:40 +0000736 >>> p.x + p.y # fields also accessible by name
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000737 33
738 >>> p # readable __repr__ with a name=value style
739 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000740
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000741Named tuples are especially useful for assigning field names to result tuples returned
742by the :mod:`csv` or :mod:`sqlite3` modules::
743
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000744 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000745
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000746 import csv
Christian Heimesfaf2f632008-01-06 16:59:19 +0000747 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000748 print(emp.name, emp.title)
749
Georg Brandl9afde1c2007-11-01 20:32:30 +0000750 import sqlite3
751 conn = sqlite3.connect('/companydata')
752 cursor = conn.cursor()
753 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Christian Heimesfaf2f632008-01-06 16:59:19 +0000754 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Christian Heimes00412232008-01-10 16:02:19 +0000755 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000756
Christian Heimes99170a52007-12-19 02:07:34 +0000757In addition to the methods inherited from tuples, named tuples support
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700758three additional methods and two attributes. To prevent conflicts with
Christian Heimes2380ac72008-01-09 00:17:24 +0000759field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000760
Benjamin Peterson0b9fb802010-07-18 14:23:36 +0000761.. classmethod:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000762
Christian Heimesfaf2f632008-01-06 16:59:19 +0000763 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000764
Christian Heimesfe337bf2008-03-23 21:54:12 +0000765.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000766
Christian Heimesfaf2f632008-01-06 16:59:19 +0000767 >>> t = [11, 22]
768 >>> Point._make(t)
769 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000770
Christian Heimes790c8232008-01-07 21:14:23 +0000771.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000772
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000773 Return a new :class:`OrderedDict` which maps field names to their corresponding
774 values::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000775
Christian Heimes0449f632007-12-15 01:27:15 +0000776 >>> p._asdict()
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000777 OrderedDict([('x', 11), ('y', 22)])
778
Raymond Hettingera88e4da2009-03-03 05:12:27 +0000779 .. versionchanged:: 3.1
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000780 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000781
Christian Heimes790c8232008-01-07 21:14:23 +0000782.. method:: somenamedtuple._replace(kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000783
Christian Heimesfe337bf2008-03-23 21:54:12 +0000784 Return a new instance of the named tuple replacing specified fields with new
785 values:
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000786
787::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000788
789 >>> p = Point(x=11, y=22)
Christian Heimes0449f632007-12-15 01:27:15 +0000790 >>> p._replace(x=33)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000791 Point(x=33, y=22)
792
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000793 >>> for partnum, record in inventory.items():
Christian Heimes454f37b2008-01-10 00:10:02 +0000794 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000795
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700796.. attribute:: somenamedtuple._source
797
798 A string with the pure Python source code used to create the named
799 tuple class. The source makes the named tuple self-documenting.
800 It can be printed, executed using :func:`exec`, or saved to a file
801 and imported.
802
803 .. versionadded:: 3.3
804
Christian Heimes790c8232008-01-07 21:14:23 +0000805.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000806
Christian Heimes2380ac72008-01-09 00:17:24 +0000807 Tuple of strings listing the field names. Useful for introspection
Georg Brandl9afde1c2007-11-01 20:32:30 +0000808 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000809
Christian Heimesfe337bf2008-03-23 21:54:12 +0000810.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000811
Christian Heimes0449f632007-12-15 01:27:15 +0000812 >>> p._fields # view the field names
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000813 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000814
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000815 >>> Color = namedtuple('Color', 'red green blue')
Christian Heimes0449f632007-12-15 01:27:15 +0000816 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000817 >>> Pixel(11, 22, 128, 255, 0)
Christian Heimes454f37b2008-01-10 00:10:02 +0000818 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000819
Christian Heimes0449f632007-12-15 01:27:15 +0000820To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000821function:
Christian Heimes0449f632007-12-15 01:27:15 +0000822
823 >>> getattr(p, 'x')
824 11
825
Raymond Hettinger651453a2009-02-11 00:20:02 +0000826To convert a dictionary to a named tuple, use the double-star-operator
827(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000828
829 >>> d = {'x': 11, 'y': 22}
830 >>> Point(**d)
831 Point(x=11, y=22)
832
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000833Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000834functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000835a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000836
Christian Heimes043d6f62008-01-07 17:19:16 +0000837 >>> class Point(namedtuple('Point', 'x y')):
Raymond Hettinger15aded82011-03-15 17:25:51 -0700838 __slots__ = ()
839 @property
840 def hypot(self):
841 return (self.x ** 2 + self.y ** 2) ** 0.5
842 def __str__(self):
843 return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000844
Georg Brandl0df79792008-10-04 18:33:26 +0000845 >>> for p in Point(3, 4), Point(14, 5/7):
Raymond Hettinger15aded82011-03-15 17:25:51 -0700846 print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000847 Point: x= 3.000 y= 4.000 hypot= 5.000
848 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000849
Georg Brandlaf5c2382009-12-28 08:02:38 +0000850The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +0000851keep memory requirements low by preventing the creation of instance dictionaries.
852
Christian Heimes2380ac72008-01-09 00:17:24 +0000853Subclassing is not useful for adding new, stored fields. Instead, simply
Christian Heimesfe337bf2008-03-23 21:54:12 +0000854create a new named tuple type from the :attr:`_fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +0000855
Christian Heimes25bb7832008-01-11 16:17:00 +0000856 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +0000857
858Default values can be implemented by using :meth:`_replace` to
Christian Heimesfe337bf2008-03-23 21:54:12 +0000859customize a prototype instance:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000860
861 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Christian Heimes587c2bf2008-01-19 16:21:02 +0000862 >>> default_account = Account('<owner name>', 0.0, 0)
863 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerb2d09452011-03-22 22:36:21 -0700864 >>> janes_account = default_account._replace(owner='Jane')
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000865
Christian Heimese4ca8152008-05-08 17:18:53 +0000866Enumerated constants can be implemented with named tuples, but it is simpler
867and more efficient to use a simple class declaration:
868
869 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
870 >>> Status.open, Status.pending, Status.closed
871 (0, 1, 2)
872 >>> class Status:
Raymond Hettinger15aded82011-03-15 17:25:51 -0700873 open, pending, closed = range(3)
Christian Heimese4ca8152008-05-08 17:18:53 +0000874
Raymond Hettinger651453a2009-02-11 00:20:02 +0000875.. seealso::
Thomas Wouters47b49bf2007-08-30 22:15:33 +0000876
Raymond Hettinger6c94e6f2011-03-31 15:46:06 -0700877 * `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
878 adapted for Python 2.4.
879
880 * `Recipe for named tuple abstract base class with a metaclass mix-in
881 <http://code.activestate.com/recipes/577629-namedtupleabc-abstract-base-class-mix-in-for-named/>`_
882 by Jan Kaliszewski. Besides providing an :term:`abstract base class` for
883 named tuples, it also supports an alternate :term:`metaclass`-based
884 constructor that is convenient for use cases where named tuples are being
885 subclassed.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000886
887
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000888:class:`OrderedDict` objects
889----------------------------
890
891Ordered dictionaries are just like regular dictionaries but they remember the
892order that items were inserted. When iterating over an ordered dictionary,
893the items are returned in the order their keys were first added.
894
895.. class:: OrderedDict([items])
896
897 Return an instance of a dict subclass, supporting the usual :class:`dict`
898 methods. An *OrderedDict* is a dict that remembers the order that keys
899 were first inserted. If a new entry overwrites an existing entry, the
900 original insertion position is left unchanged. Deleting an entry and
901 reinserting it will move it to the end.
902
Benjamin Petersond45bf582009-03-02 21:44:54 +0000903 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000904
Benjamin Petersond319ad52010-07-18 14:27:02 +0000905 .. method:: popitem(last=True)
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000906
Benjamin Petersond319ad52010-07-18 14:27:02 +0000907 The :meth:`popitem` method for ordered dictionaries returns and removes a
908 (key, value) pair. The pairs are returned in LIFO order if *last* is true
909 or FIFO order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000910
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000911 .. method:: move_to_end(key, last=True)
912
913 Move an existing *key* to either end of an ordered dictionary. The item
914 is moved to the right end if *last* is true (the default) or to the
915 beginning if *last* is false. Raises :exc:`KeyError` if the *key* does
916 not exist::
917
918 >>> d = OrderedDict.fromkeys('abcde')
919 >>> d.move_to_end('b')
Raymond Hettinger4d5208d2011-06-25 11:39:00 +0200920 >>> ''.join(d.keys())
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000921 'acdeb'
Éric Araujo1cb25aa2010-11-06 07:03:07 +0000922 >>> d.move_to_end('b', last=False)
Raymond Hettinger4d5208d2011-06-25 11:39:00 +0200923 >>> ''.join(d.keys())
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000924 'bacde'
925
926 .. versionadded:: 3.2
927
Raymond Hettingere9091502009-05-19 17:40:07 +0000928In addition to the usual mapping methods, ordered dictionaries also support
929reverse iteration using :func:`reversed`.
930
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000931Equality tests between :class:`OrderedDict` objects are order-sensitive
932and are implemented as ``list(od1.items())==list(od2.items())``.
933Equality tests between :class:`OrderedDict` objects and other
934:class:`Mapping` objects are order-insensitive like regular dictionaries.
935This allows :class:`OrderedDict` objects to be substituted anywhere a
936regular dictionary is used.
937
Raymond Hettinger36180782009-04-09 22:34:23 +0000938The :class:`OrderedDict` constructor and :meth:`update` method both accept
939keyword arguments, but their order is lost because Python's function call
940semantics pass-in keyword arguments using a regular unordered dictionary.
941
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000942.. seealso::
943
944 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
945 that runs on Python 2.4 or later.
946
Raymond Hettinger7bba6832011-04-15 17:43:19 -0700947:class:`OrderedDict` Examples and Recipes
948^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
949
Raymond Hettinger0e312012009-11-10 18:35:46 +0000950Since an ordered dictionary remembers its insertion order, it can be used
951in conjuction with sorting to make a sorted dictionary::
952
953 >>> # regular unsorted dictionary
954 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
955
956 >>> # dictionary sorted by key
957 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
958 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
959
960 >>> # dictionary sorted by value
961 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
962 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
963
964 >>> # dictionary sorted by length of the key string
965 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
966 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
967
968The new sorted dictionaries maintain their sort order when entries
969are deleted. But when new keys are added, the keys are appended
970to the end and the sort is not maintained.
971
Raymond Hettinger4821ef82010-07-31 10:14:41 +0000972It is also straight-forward to create an ordered dictionary variant
973that the remembers the order the keys were *last* inserted.
974If a new entry overwrites an existing entry, the
975original insertion position is changed and moved to the end::
976
977 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandl77570e22010-12-18 16:21:58 +0000978 'Store items in the order the keys were last added'
Raymond Hettinger7bba6832011-04-15 17:43:19 -0700979
Raymond Hettinger4821ef82010-07-31 10:14:41 +0000980 def __setitem__(self, key, value):
981 if key in self:
982 del self[key]
983 OrderedDict.__setitem__(self, key, value)
984
Éric Araujo889a7dc2011-08-19 00:40:46 +0200985An ordered dictionary can be combined with the :class:`Counter` class
Raymond Hettinger7bba6832011-04-15 17:43:19 -0700986so that the counter remembers the order elements are first encountered::
987
988 class OrderedCounter(Counter, OrderedDict):
989 'Counter that remembers the order elements are first encountered'
990
Raymond Hettinger7bba6832011-04-15 17:43:19 -0700991 def __repr__(self):
992 return '%s(%r)' % (self.__class__.__name__, OrderedDict(self))
993
994 def __reduce__(self):
995 return self.__class__, (OrderedDict(self),)
996
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000997
998:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +0000999-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001000
Georg Brandl48310cd2009-01-03 21:18:54 +00001001The class, :class:`UserDict` acts as a wrapper around dictionary objects.
1002The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001003subclass directly from :class:`dict`; however, this class can be easier
1004to work with because the underlying dictionary is accessible as an
1005attribute.
1006
1007.. class:: UserDict([initialdata])
1008
1009 Class that simulates a dictionary. The instance's contents are kept in a
1010 regular dictionary, which is accessible via the :attr:`data` attribute of
1011 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
1012 initialized with its contents; note that a reference to *initialdata* will not
1013 be kept, allowing it be used for other purposes.
1014
Benjamin Petersond319ad52010-07-18 14:27:02 +00001015 In addition to supporting the methods and operations of mappings,
1016 :class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001017
Benjamin Petersond319ad52010-07-18 14:27:02 +00001018 .. attribute:: data
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001019
Benjamin Petersond319ad52010-07-18 14:27:02 +00001020 A real dictionary used to store the contents of the :class:`UserDict`
1021 class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001022
1023
1024
1025:class:`UserList` objects
1026-------------------------
1027
1028This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +00001029for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001030existing methods or add new ones. In this way, one can add new behaviors to
1031lists.
1032
Georg Brandl48310cd2009-01-03 21:18:54 +00001033The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001034subclass directly from :class:`list`; however, this class can be easier
1035to work with because the underlying list is accessible as an attribute.
1036
1037.. class:: UserList([list])
1038
1039 Class that simulates a list. The instance's contents are kept in a regular
1040 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
1041 instances. The instance's contents are initially set to a copy of *list*,
1042 defaulting to the empty list ``[]``. *list* can be any iterable, for
1043 example a real Python list or a :class:`UserList` object.
1044
Benjamin Petersond319ad52010-07-18 14:27:02 +00001045 In addition to supporting the methods and operations of mutable sequences,
1046 :class:`UserList` instances provide the following attribute:
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001047
Benjamin Petersond319ad52010-07-18 14:27:02 +00001048 .. attribute:: data
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001049
Benjamin Petersond319ad52010-07-18 14:27:02 +00001050 A real :class:`list` object used to store the contents of the
1051 :class:`UserList` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001052
1053**Subclassing requirements:** Subclasses of :class:`UserList` are expect to
1054offer a constructor which can be called with either no arguments or one
1055argument. List operations which return a new sequence attempt to create an
1056instance of the actual implementation class. To do so, it assumes that the
1057constructor can be called with a single parameter, which is a sequence object
1058used as a data source.
1059
1060If a derived class does not wish to comply with this requirement, all of the
1061special methods supported by this class will need to be overridden; please
1062consult the sources for information about the methods which need to be provided
1063in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001064
1065:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +00001066---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001067
Georg Brandl48310cd2009-01-03 21:18:54 +00001068The class, :class:`UserString` acts as a wrapper around string objects.
1069The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001070subclass directly from :class:`str`; however, this class can be easier
1071to work with because the underlying string is accessible as an
1072attribute.
1073
1074.. class:: UserString([sequence])
1075
1076 Class that simulates a string or a Unicode string object. The instance's
Georg Brandl48310cd2009-01-03 21:18:54 +00001077 content is kept in a regular string object, which is accessible via the
1078 :attr:`data` attribute of :class:`UserString` instances. The instance's
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001079 contents are initially set to a copy of *sequence*. The *sequence* can
1080 be an instance of :class:`bytes`, :class:`str`, :class:`UserString` (or a
1081 subclass) or an arbitrary sequence which can be converted into a string using
1082 the built-in :func:`str` function.