blob: d084179ca7b6da5b8f08c98a15a740694d2fc7f5 [file] [log] [blame]
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001:mod:`collections` --- Container datatypes
2==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00003
4.. module:: collections
Raymond Hettinger53dbe392008-02-12 20:03:09 +00005 :synopsis: Container datatypes
Georg Brandl116aa622007-08-15 14:28:22 +00006.. moduleauthor:: Raymond Hettinger <python@rcn.com>
7.. sectionauthor:: Raymond Hettinger <python@rcn.com>
8
Christian Heimesfe337bf2008-03-23 21:54:12 +00009.. testsetup:: *
10
11 from collections import *
12 import itertools
13 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000014
Georg Brandl116aa622007-08-15 14:28:22 +000015This module implements high-performance container datatypes. Currently,
Raymond Hettinger2d32f632009-03-02 21:24:57 +000016there are four datatypes, :class:`Counter`, :class:`deque`, :class:`OrderedDict` and
Raymond Hettingeracd82b92009-02-17 20:06:51 +000017:class:`defaultdict`, and one datatype factory function, :func:`namedtuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000018
Raymond Hettingerebcee3f2008-02-06 19:54:00 +000019The specialized containers provided in this module provide alternatives
Christian Heimesfe337bf2008-03-23 21:54:12 +000020to Python's general purpose built-in containers, :class:`dict`,
Raymond Hettingerebcee3f2008-02-06 19:54:00 +000021:class:`list`, :class:`set`, and :class:`tuple`.
Georg Brandl116aa622007-08-15 14:28:22 +000022
Mark Summerfield08898b42007-09-05 08:43:04 +000023In addition to containers, the collections module provides some ABCs
Christian Heimesfe337bf2008-03-23 21:54:12 +000024(abstract base classes) that can be used to test whether a class
Raymond Hettingeracd82b92009-02-17 20:06:51 +000025provides a particular interface, for example, whether it is hashable or
26a mapping.
Raymond Hettingerebcee3f2008-02-06 19:54:00 +000027
28ABCs - abstract base classes
29----------------------------
30
31The collections module offers the following ABCs:
Mark Summerfield08898b42007-09-05 08:43:04 +000032
Georg Brandl86b2fb92008-07-16 03:43:04 +000033========================= ===================== ====================== ====================================================
34ABC Inherits Abstract Methods Mixin Methods
35========================= ===================== ====================== ====================================================
36:class:`Container` ``__contains__``
37:class:`Hashable` ``__hash__``
38:class:`Iterable` ``__iter__``
39:class:`Iterator` :class:`Iterable` ``__next__`` ``__iter__``
Georg Brandla1c6a1c2009-01-03 21:26:05 +000040:class:`Sized` ``__len__``
Georg Brandl86b2fb92008-07-16 03:43:04 +000041:class:`Callable` ``__call__``
Georg Brandl48310cd2009-01-03 21:18:54 +000042
Georg Brandl86b2fb92008-07-16 03:43:04 +000043:class:`Sequence` :class:`Sized`, ``__getitem__`` ``__contains__``. ``__iter__``, ``__reversed__``.
Raymond Hettingerd23e0132009-01-29 00:01:27 +000044 :class:`Iterable`, ``index``, and ``count``
Georg Brandl48310cd2009-01-03 21:18:54 +000045 :class:`Container`
46
Raymond Hettingerd23e0132009-01-29 00:01:27 +000047:class:`MutableSequence` :class:`Sequence` ``__setitem__`` Inherited Sequence methods and
Georg Brandl86b2fb92008-07-16 03:43:04 +000048 ``__delitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
Raymond Hettingerd23e0132009-01-29 00:01:27 +000049 and ``insert`` ``remove``, and ``__iadd__``
Georg Brandl48310cd2009-01-03 21:18:54 +000050
Raymond Hettingerd23e0132009-01-29 00:01:27 +000051:class:`Set` :class:`Sized`, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
52 :class:`Iterable`, ``__gt__``, ``__ge__``, ``__and__``, ``__or__``
53 :class:`Container` ``__sub__``, ``__xor__``, and ``isdisjoint``
Georg Brandl48310cd2009-01-03 21:18:54 +000054
Georg Brandl86b2fb92008-07-16 03:43:04 +000055:class:`MutableSet` :class:`Set` ``add`` and Inherited Set methods and
56 ``discard`` ``clear``, ``pop``, ``remove``, ``__ior__``,
57 ``__iand__``, ``__ixor__``, and ``__isub__``
Georg Brandl48310cd2009-01-03 21:18:54 +000058
Raymond Hettingerd23e0132009-01-29 00:01:27 +000059:class:`Mapping` :class:`Sized`, ``__getitem__`` ``__contains__``, ``keys``, ``items``, ``values``,
60 :class:`Iterable`, ``get``, ``__eq__``, and ``__ne__``
61 :class:`Container`
Georg Brandl48310cd2009-01-03 21:18:54 +000062
Raymond Hettingerd23e0132009-01-29 00:01:27 +000063:class:`MutableMapping` :class:`Mapping` ``__setitem__`` and Inherited Mapping methods and
64 ``__delitem__`` ``pop``, ``popitem``, ``clear``, ``update``,
65 and ``setdefault``
66
Georg Brandl48310cd2009-01-03 21:18:54 +000067
Georg Brandl86b2fb92008-07-16 03:43:04 +000068:class:`MappingView` :class:`Sized` ``__len__``
69:class:`KeysView` :class:`MappingView`, ``__contains__``,
70 :class:`Set` ``__iter__``
71:class:`ItemsView` :class:`MappingView`, ``__contains__``,
72 :class:`Set` ``__iter__``
73:class:`ValuesView` :class:`MappingView` ``__contains__``, ``__iter__``
74========================= ===================== ====================== ====================================================
Mark Summerfield08898b42007-09-05 08:43:04 +000075
Mark Summerfield08898b42007-09-05 08:43:04 +000076These ABCs allow us to ask classes or instances if they provide
77particular functionality, for example::
78
Mark Summerfield08898b42007-09-05 08:43:04 +000079 size = None
Raymond Hettingerebcee3f2008-02-06 19:54:00 +000080 if isinstance(myvar, collections.Sized):
Georg Brandla1c6a1c2009-01-03 21:26:05 +000081 size = len(myvar)
Mark Summerfield08898b42007-09-05 08:43:04 +000082
Raymond Hettingerebcee3f2008-02-06 19:54:00 +000083Several of the ABCs are also useful as mixins that make it easier to develop
84classes supporting container APIs. For example, to write a class supporting
85the full :class:`Set` API, it only necessary to supply the three underlying
86abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
87The ABC supplies the remaining methods such as :meth:`__and__` and
88:meth:`isdisjoint` ::
89
90 class ListBasedSet(collections.Set):
Raymond Hettingerc1b6a4a2008-02-08 23:46:23 +000091 ''' Alternate set implementation favoring space over speed
92 and not requiring the set elements to be hashable. '''
Raymond Hettingerebcee3f2008-02-06 19:54:00 +000093 def __init__(self, iterable):
Raymond Hettingerc1b6a4a2008-02-08 23:46:23 +000094 self.elements = lst = []
95 for value in iterable:
96 if value not in lst:
97 lst.append(value)
Raymond Hettingerebcee3f2008-02-06 19:54:00 +000098 def __iter__(self):
99 return iter(self.elements)
100 def __contains__(self, value):
101 return value in self.elements
102 def __len__(self):
103 return len(self.elements)
104
105 s1 = ListBasedSet('abcdef')
106 s2 = ListBasedSet('defghi')
107 overlap = s1 & s2 # The __and__() method is supported automatically
108
Raymond Hettinger7aebb642008-02-09 03:25:08 +0000109Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
110
Christian Heimesfe337bf2008-03-23 21:54:12 +0000111(1)
Raymond Hettinger7aebb642008-02-09 03:25:08 +0000112 Since some set operations create new sets, the default mixin methods need
Christian Heimesfe337bf2008-03-23 21:54:12 +0000113 a way to create new instances from an iterable. The class constructor is
114 assumed to have a signature in the form ``ClassName(iterable)``.
Benjamin Peterson2b7411d2008-05-26 17:36:47 +0000115 That assumption is factored-out to an internal classmethod called
Raymond Hettinger7aebb642008-02-09 03:25:08 +0000116 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
117 If the :class:`Set` mixin is being used in a class with a different
Christian Heimesfe337bf2008-03-23 21:54:12 +0000118 constructor signature, you will need to override :meth:`from_iterable`
119 with a classmethod that can construct new instances from
Raymond Hettinger7aebb642008-02-09 03:25:08 +0000120 an iterable argument.
121
122(2)
123 To override the comparisons (presumably for speed, as the
124 semantics are fixed), redefine :meth:`__le__` and
125 then the other operations will automatically follow suit.
Raymond Hettingerebcee3f2008-02-06 19:54:00 +0000126
Raymond Hettinger0dbdab22008-02-09 03:48:16 +0000127(3)
128 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
129 for the set; however, :meth:`__hash__` is not defined because not all sets
130 are hashable or immutable. To add set hashabilty using mixins,
131 inherit from both :meth:`Set` and :meth:`Hashable`, then define
132 ``__hash__ = Set._hash``.
133
Raymond Hettingerbe075b12009-03-20 18:33:06 +0000134.. seealso::
135
136 * `OrderedSet recipe <http://code.activestate.com/recipes/576694/>`_ for an
137 example built on :class:`MutableSet`.
138
139 * For more about ABCs, see the :mod:`abc` module and :pep:`3119`.
Mark Summerfield08898b42007-09-05 08:43:04 +0000140
141
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000142:class:`Counter` objects
143------------------------
144
145A counter tool is provided to support convenient and rapid tallies.
146For example::
147
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000148 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000149 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000150 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000151 ... cnt[word] += 1
152 >>> cnt
153 Counter({'blue': 3, 'red': 2, 'green': 1})
154
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000155 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000156 >>> import re
157 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000158 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000159 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
160 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
161
162.. class:: Counter([iterable-or-mapping])
163
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000164 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000165 It is an unordered collection where elements are stored as dictionary keys
166 and their counts are stored as dictionary values. Counts are allowed to be
167 any integer value including zero or negative counts. The :class:`Counter`
168 class is similar to bags or multisets in other languages.
169
170 Elements are counted from an *iterable* or initialized from another
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000171 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000172
Raymond Hettinger73662a52009-01-27 02:38:22 +0000173 >>> c = Counter() # a new, empty counter
174 >>> c = Counter('gallahad') # a new counter from an iterable
175 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
176 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000177
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000178 Counter objects have a dictionary interface except that they return a zero
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000179 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000180
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000181 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000182 >>> c['bacon'] # count of a missing element is zero
183 0
184
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000185 Setting a count to zero does not remove an element from a counter.
186 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000187
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000188 >>> c['sausage'] = 0 # counter entry with a zero count
189 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000190
Benjamin Petersond45bf582009-03-02 21:44:54 +0000191 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000192
193
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000194 Counter objects support three methods beyond those available for all
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000195 dictionaries:
196
197 .. method:: elements()
198
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000199 Return an iterator over elements repeating each as many times as its
200 count. Elements are returned in arbitrary order. If an element's count
201 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000202
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000203 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000204 >>> list(c.elements())
205 ['a', 'a', 'a', 'a', 'b', 'b']
206
207 .. method:: most_common([n])
208
Raymond Hettinger73662a52009-01-27 02:38:22 +0000209 Return a list of the *n* most common elements and their counts from the
Raymond Hettingerd04fa312009-02-04 19:45:13 +0000210 most common to the least. If *n* is not specified, :func:`most_common`
Raymond Hettinger73662a52009-01-27 02:38:22 +0000211 returns *all* elements in the counter. Elements with equal counts are
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000212 ordered arbitrarily:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000213
214 >>> Counter('abracadabra').most_common(3)
215 [('a', 5), ('r', 2), ('b', 2)]
216
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000217 .. method:: subtract([iterable-or-mapping])
218
219 Elements are subtracted from an *iterable* or from another *mapping*
220 (or counter). Like :meth:`dict.update` but subtracts counts instead
221 of replacing them. Both inputs and outputs may be zero or negative.
222
223 >>> c = Counter(a=4, b=2, c=0, d=-2)
224 >>> d = Counter(a=1, b=2, c=3, d=4)
225 >>> c.subtract(d)
226 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
227
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000228 .. versionadded:: 3.2
229
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000230 The usual dictionary methods are available for :class:`Counter` objects
231 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000232
233 .. method:: fromkeys(iterable)
234
Raymond Hettinger73662a52009-01-27 02:38:22 +0000235 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000236
237 .. method:: update([iterable-or-mapping])
238
Raymond Hettinger73662a52009-01-27 02:38:22 +0000239 Elements are counted from an *iterable* or added-in from another
240 *mapping* (or counter). Like :meth:`dict.update` but adds counts
241 instead of replacing them. Also, the *iterable* is expected to be a
242 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000243
244Common patterns for working with :class:`Counter` objects::
245
Raymond Hettinger73662a52009-01-27 02:38:22 +0000246 sum(c.values()) # total of all counts
247 c.clear() # reset all counts
248 list(c) # list unique elements
249 set(c) # convert to a set
250 dict(c) # convert to a regular dictionary
251 c.items() # convert to a list of (elem, cnt) pairs
252 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
253 c.most_common()[:-n:-1] # n least common elements
254 c += Counter() # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000255
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000256Several mathematical operations are provided for combining :class:`Counter`
257objects to produce multisets (counters that have counts greater than zero).
258Addition and subtraction combine counters by adding or subtracting the counts
259of corresponding elements. Intersection and union return the minimum and
260maximum of corresponding counts. Each operation can accept inputs with signed
261counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000262
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000263 >>> c = Counter(a=3, b=1)
264 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000265 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000266 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000267 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000268 Counter({'a': 2})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000269 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000270 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000271 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000272 Counter({'a': 3, 'b': 2})
273
Raymond Hettinger22f18852010-04-12 21:45:14 +0000274.. note::
275
276 Counters were primarily designed to work with positive integers to represent
277 running counts; however, care was taken to not unnecessarily preclude use
278 cases needing other types or negative values. To help with those use cases,
279 this section documents the minimum range and type restrictions.
280
281 * The :class:`Counter` class itself is a dictionary subclass with no
282 restrictions on its keys and values. The values are intended to be numbers
283 representing counts, but you *could* store anything in the value field.
284
285 * The :meth:`most_common` method requires only that the values be orderable.
286
287 * For in-place operations such as ``c[key] += 1``, the value type need only
288 support addition and subtraction. So fractions, floats, and decimals would
289 work and negative values are supported. The same is also true for
290 :meth:`update` and :meth:`subtract` which allow negative and zero values
291 for both inputs and outputs.
292
293 * The multiset methods are designed only for use cases with positive values.
294 The inputs may be negative or zero, but only outputs with positive values
295 are created. There are no type restrictions, but the value type needs to
296 support support addition, subtraction, and comparison.
297
298 * The :meth:`elements` method requires integer counts. It ignores zero and
299 negative counts.
300
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000301.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000302
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000303 * `Counter class <http://code.activestate.com/recipes/576611/>`_
304 adapted for Python 2.5 and an early `Bag recipe
305 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
306
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000307 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
308 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000309
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000310 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_\.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000311
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000312 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000313 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000314
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000315 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000316 *Knuth, Donald. The Art of Computer Programming Volume II,
317 Section 4.6.3, Exercise 19*\.
318
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000319 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd07d9392009-01-27 04:20:44 +0000320 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000321
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000322 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000323
324
Georg Brandl116aa622007-08-15 14:28:22 +0000325:class:`deque` objects
326----------------------
327
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000328.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000329
330 Returns a new deque object initialized left-to-right (using :meth:`append`) with
331 data from *iterable*. If *iterable* is not specified, the new deque is empty.
332
333 Deques are a generalization of stacks and queues (the name is pronounced "deck"
334 and is short for "double-ended queue"). Deques support thread-safe, memory
335 efficient appends and pops from either side of the deque with approximately the
336 same O(1) performance in either direction.
337
338 Though :class:`list` objects support similar operations, they are optimized for
339 fast fixed-length operations and incur O(n) memory movement costs for
340 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
341 position of the underlying data representation.
342
Georg Brandl116aa622007-08-15 14:28:22 +0000343
Georg Brandl9afde1c2007-11-01 20:32:30 +0000344 If *maxlen* is not specified or is *None*, deques may grow to an
345 arbitrary length. Otherwise, the deque is bounded to the specified maximum
346 length. Once a bounded length deque is full, when new items are added, a
347 corresponding number of items are discarded from the opposite end. Bounded
348 length deques provide functionality similar to the ``tail`` filter in
349 Unix. They are also useful for tracking transactions and other pools of data
350 where only the most recent activity is of interest.
351
Georg Brandl9afde1c2007-11-01 20:32:30 +0000352
Benjamin Petersone41251e2008-04-25 01:59:09 +0000353 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000354
Benjamin Petersone41251e2008-04-25 01:59:09 +0000355 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000356
Benjamin Petersone41251e2008-04-25 01:59:09 +0000357 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000358
359
Benjamin Petersone41251e2008-04-25 01:59:09 +0000360 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000361
Benjamin Petersone41251e2008-04-25 01:59:09 +0000362 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000363
364
Benjamin Petersone41251e2008-04-25 01:59:09 +0000365 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000366
Benjamin Petersone41251e2008-04-25 01:59:09 +0000367 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000368
369
Raymond Hettinger44459de2010-04-03 23:20:46 +0000370 .. method:: count(x)
371
372 Count the number of deque elements equal to *x*.
373
374 .. versionadded:: 3.2
375
Benjamin Petersone41251e2008-04-25 01:59:09 +0000376 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000377
Benjamin Petersone41251e2008-04-25 01:59:09 +0000378 Extend the right side of the deque by appending elements from the iterable
379 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000380
381
Benjamin Petersone41251e2008-04-25 01:59:09 +0000382 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000383
Benjamin Petersone41251e2008-04-25 01:59:09 +0000384 Extend the left side of the deque by appending elements from *iterable*.
385 Note, the series of left appends results in reversing the order of
386 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000387
388
Benjamin Petersone41251e2008-04-25 01:59:09 +0000389 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000390
Benjamin Petersone41251e2008-04-25 01:59:09 +0000391 Remove and return an element from the right side of the deque. If no
392 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000393
394
Benjamin Petersone41251e2008-04-25 01:59:09 +0000395 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000396
Benjamin Petersone41251e2008-04-25 01:59:09 +0000397 Remove and return an element from the left side of the deque. If no
398 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000399
400
Benjamin Petersone41251e2008-04-25 01:59:09 +0000401 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000402
Benjamin Petersone41251e2008-04-25 01:59:09 +0000403 Removed the first occurrence of *value*. If not found, raises a
404 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000405
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000406 .. method:: reverse()
407
408 Reverse the elements of the deque in-place and then return ``None``.
409
410 .. versionadded:: 3.2
Georg Brandl116aa622007-08-15 14:28:22 +0000411
Benjamin Petersone41251e2008-04-25 01:59:09 +0000412 .. method:: rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000413
Benjamin Petersone41251e2008-04-25 01:59:09 +0000414 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
415 the left. Rotating one step to the right is equivalent to:
416 ``d.appendleft(d.pop())``.
417
Georg Brandl116aa622007-08-15 14:28:22 +0000418
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000419 Deque objects also provide one read-only attribute:
420
421 .. attribute:: maxlen
422
423 Maximum size of a deque or *None* if unbounded.
424
Raymond Hettinger150fb9c2009-03-10 22:48:06 +0000425 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000426
427
Georg Brandl116aa622007-08-15 14:28:22 +0000428In addition to the above, deques support iteration, pickling, ``len(d)``,
429``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson206e3072008-10-19 14:07:49 +0000430the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
431access is O(1) at both ends but slows to O(n) in the middle. For fast random
432access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000433
Christian Heimesfe337bf2008-03-23 21:54:12 +0000434Example:
435
436.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000437
438 >>> from collections import deque
439 >>> d = deque('ghi') # make a new deque with three items
440 >>> for elem in d: # iterate over the deque's elements
Neal Norwitz752abd02008-05-13 04:55:24 +0000441 ... print(elem.upper())
Georg Brandl116aa622007-08-15 14:28:22 +0000442 G
443 H
444 I
445
446 >>> d.append('j') # add a new entry to the right side
447 >>> d.appendleft('f') # add a new entry to the left side
448 >>> d # show the representation of the deque
449 deque(['f', 'g', 'h', 'i', 'j'])
450
451 >>> d.pop() # return and remove the rightmost item
452 'j'
453 >>> d.popleft() # return and remove the leftmost item
454 'f'
455 >>> list(d) # list the contents of the deque
456 ['g', 'h', 'i']
457 >>> d[0] # peek at leftmost item
458 'g'
459 >>> d[-1] # peek at rightmost item
460 'i'
461
462 >>> list(reversed(d)) # list the contents of a deque in reverse
463 ['i', 'h', 'g']
464 >>> 'h' in d # search the deque
465 True
466 >>> d.extend('jkl') # add multiple elements at once
467 >>> d
468 deque(['g', 'h', 'i', 'j', 'k', 'l'])
469 >>> d.rotate(1) # right rotation
470 >>> d
471 deque(['l', 'g', 'h', 'i', 'j', 'k'])
472 >>> d.rotate(-1) # left rotation
473 >>> d
474 deque(['g', 'h', 'i', 'j', 'k', 'l'])
475
476 >>> deque(reversed(d)) # make a new deque in reverse order
477 deque(['l', 'k', 'j', 'i', 'h', 'g'])
478 >>> d.clear() # empty the deque
479 >>> d.pop() # cannot pop from an empty deque
480 Traceback (most recent call last):
481 File "<pyshell#6>", line 1, in -toplevel-
482 d.pop()
483 IndexError: pop from an empty deque
484
485 >>> d.extendleft('abc') # extendleft() reverses the input order
486 >>> d
487 deque(['c', 'b', 'a'])
488
489
Georg Brandl9afde1c2007-11-01 20:32:30 +0000490:class:`deque` Recipes
491^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000492
493This section shows various approaches to working with deques.
494
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000495Bounded length deques provide functionality similar to the ``tail`` filter
496in Unix::
497
498 def tail(filename, n=10):
499 'Return the last n lines of a file'
500 return deque(open(filename), n)
501
502Another approach to using deques is to maintain a sequence of recently
503added elements by appending to the right and popping to the left::
504
505 def moving_average(iterable, n=3):
506 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
507 # http://en.wikipedia.org/wiki/Moving_average
508 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000509 d = deque(itertools.islice(it, n-1))
510 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000511 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000512 for elem in it:
513 s += elem - d.popleft()
514 d.append(elem)
515 yield s / n
516
Georg Brandl116aa622007-08-15 14:28:22 +0000517The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti0639d5a2009-12-19 23:26:38 +0000518deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl116aa622007-08-15 14:28:22 +0000519the :meth:`rotate` method to position elements to be popped::
520
521 def delete_nth(d, n):
522 d.rotate(-n)
523 d.popleft()
524 d.rotate(n)
525
526To implement :class:`deque` slicing, use a similar approach applying
527:meth:`rotate` to bring a target element to the left side of the deque. Remove
528old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
529reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000530With minor variations on that approach, it is easy to implement Forth style
531stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
532``rot``, and ``roll``.
533
Georg Brandl116aa622007-08-15 14:28:22 +0000534
535:class:`defaultdict` objects
536----------------------------
537
Georg Brandl116aa622007-08-15 14:28:22 +0000538.. class:: defaultdict([default_factory[, ...]])
539
540 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandl22b34312009-07-26 14:54:51 +0000541 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl116aa622007-08-15 14:28:22 +0000542 instance variable. The remaining functionality is the same as for the
543 :class:`dict` class and is not documented here.
544
545 The first argument provides the initial value for the :attr:`default_factory`
546 attribute; it defaults to ``None``. All remaining arguments are treated the same
547 as if they were passed to the :class:`dict` constructor, including keyword
548 arguments.
549
Georg Brandl116aa622007-08-15 14:28:22 +0000550
Benjamin Petersone41251e2008-04-25 01:59:09 +0000551 :class:`defaultdict` objects support the following method in addition to the
552 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000553
Benjamin Petersone41251e2008-04-25 01:59:09 +0000554 .. method:: defaultdict.__missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000555
Benjamin Peterson5478b472008-09-17 22:25:09 +0000556 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersone41251e2008-04-25 01:59:09 +0000557 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000558
Benjamin Petersone41251e2008-04-25 01:59:09 +0000559 If :attr:`default_factory` is not ``None``, it is called without arguments
560 to provide a default value for the given *key*, this value is inserted in
561 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000562
Benjamin Petersone41251e2008-04-25 01:59:09 +0000563 If calling :attr:`default_factory` raises an exception this exception is
564 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000565
Benjamin Petersone41251e2008-04-25 01:59:09 +0000566 This method is called by the :meth:`__getitem__` method of the
567 :class:`dict` class when the requested key is not found; whatever it
568 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000569
570
Benjamin Petersone41251e2008-04-25 01:59:09 +0000571 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000572
Benjamin Petersone41251e2008-04-25 01:59:09 +0000573
574 .. attribute:: defaultdict.default_factory
575
576 This attribute is used by the :meth:`__missing__` method; it is
577 initialized from the first argument to the constructor, if present, or to
578 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000579
580
Georg Brandl116aa622007-08-15 14:28:22 +0000581:class:`defaultdict` Examples
582^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
583
584Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000585sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000586
587 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
588 >>> d = defaultdict(list)
589 >>> for k, v in s:
590 ... d[k].append(v)
591 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000592 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000593 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
594
595When each key is encountered for the first time, it is not already in the
596mapping; so an entry is automatically created using the :attr:`default_factory`
597function which returns an empty :class:`list`. The :meth:`list.append`
598operation then attaches the value to the new list. When keys are encountered
599again, the look-up proceeds normally (returning the list for that key) and the
600:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000601simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000602
603 >>> d = {}
604 >>> for k, v in s:
605 ... d.setdefault(k, []).append(v)
606 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000607 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000608 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
609
610Setting the :attr:`default_factory` to :class:`int` makes the
611:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000612languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000613
614 >>> s = 'mississippi'
615 >>> d = defaultdict(int)
616 >>> for k in s:
617 ... d[k] += 1
618 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000619 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000620 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
621
622When a letter is first encountered, it is missing from the mapping, so the
623:attr:`default_factory` function calls :func:`int` to supply a default count of
624zero. The increment operation then builds up the count for each letter.
625
626The function :func:`int` which always returns zero is just a special case of
627constant functions. A faster and more flexible way to create constant functions
628is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000629zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000630
631 >>> def constant_factory(value):
632 ... return lambda: value
633 >>> d = defaultdict(constant_factory('<missing>'))
634 >>> d.update(name='John', action='ran')
635 >>> '%(name)s %(action)s to %(object)s' % d
636 'John ran to <missing>'
637
638Setting the :attr:`default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000639:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000640
641 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
642 >>> d = defaultdict(set)
643 >>> for k, v in s:
644 ... d[k].add(v)
645 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000646 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000647 [('blue', set([2, 4])), ('red', set([1, 3]))]
648
649
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000650:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000651----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000652
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000653Named tuples assign meaning to each position in a tuple and allow for more readable,
654self-documenting code. They can be used wherever regular tuples are used, and
655they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000656
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000657.. function:: namedtuple(typename, field_names, verbose=False, rename=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000658
659 Returns a new tuple subclass named *typename*. The new subclass is used to
Christian Heimesc3f30c42008-02-22 16:37:40 +0000660 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl116aa622007-08-15 14:28:22 +0000661 well as being indexable and iterable. Instances of the subclass also have a
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000662 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl116aa622007-08-15 14:28:22 +0000663 method which lists the tuple contents in a ``name=value`` format.
664
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000665 The *field_names* are a single string with each fieldname separated by whitespace
666 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
Christian Heimes25bb7832008-01-11 16:17:00 +0000667 can be a sequence of strings such as ``['x', 'y']``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000668
669 Any valid Python identifier may be used for a fieldname except for names
Christian Heimes0449f632007-12-15 01:27:15 +0000670 starting with an underscore. Valid identifiers consist of letters, digits,
671 and underscores but do not start with a digit or underscore and cannot be
Georg Brandlf6945182008-02-01 11:56:49 +0000672 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
Georg Brandl9afde1c2007-11-01 20:32:30 +0000673 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000674
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000675 If *rename* is true, invalid fieldnames are automatically replaced
676 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger85737b82009-04-02 22:37:59 +0000677 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000678 ``def`` and the duplicate fieldname ``abc``.
679
Christian Heimes25bb7832008-01-11 16:17:00 +0000680 If *verbose* is true, the class definition is printed just before being built.
Georg Brandl116aa622007-08-15 14:28:22 +0000681
Georg Brandl9afde1c2007-11-01 20:32:30 +0000682 Named tuple instances do not have per-instance dictionaries, so they are
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000683 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000684
Raymond Hettingerb62ad242009-03-02 22:16:43 +0000685 .. versionchanged:: 3.1
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000686 added support for *rename*.
687
Christian Heimesfe337bf2008-03-23 21:54:12 +0000688Example:
689
690.. doctest::
691 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000692
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000693 >>> Point = namedtuple('Point', 'x y', verbose=True)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000694 class Point(tuple):
695 'Point(x, y)'
Christian Heimesfe337bf2008-03-23 21:54:12 +0000696 <BLANKLINE>
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000697 __slots__ = ()
Christian Heimesfe337bf2008-03-23 21:54:12 +0000698 <BLANKLINE>
Christian Heimesfaf2f632008-01-06 16:59:19 +0000699 _fields = ('x', 'y')
Christian Heimesfe337bf2008-03-23 21:54:12 +0000700 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000701 def __new__(_cls, x, y):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000702 'Create a new instance of Point(x, y)'
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000703 return _tuple.__new__(_cls, (x, y))
Christian Heimesfe337bf2008-03-23 21:54:12 +0000704 <BLANKLINE>
Christian Heimesfaf2f632008-01-06 16:59:19 +0000705 @classmethod
Christian Heimesfe337bf2008-03-23 21:54:12 +0000706 def _make(cls, iterable, new=tuple.__new__, len=len):
Christian Heimesfaf2f632008-01-06 16:59:19 +0000707 'Make a new Point object from a sequence or iterable'
Christian Heimesfe337bf2008-03-23 21:54:12 +0000708 result = new(cls, iterable)
Christian Heimesfaf2f632008-01-06 16:59:19 +0000709 if len(result) != 2:
710 raise TypeError('Expected 2 arguments, got %d' % len(result))
711 return result
Christian Heimesfe337bf2008-03-23 21:54:12 +0000712 <BLANKLINE>
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000713 def __repr__(self):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000714 'Return a nicely formatted representation string'
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000715 return 'Point(x=%r, y=%r)' % self
Christian Heimesfe337bf2008-03-23 21:54:12 +0000716 <BLANKLINE>
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000717 def _asdict(self):
718 'Return a new OrderedDict which maps field names to their values'
719 return OrderedDict(zip(self._fields, self))
Christian Heimesfe337bf2008-03-23 21:54:12 +0000720 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000721 def _replace(_self, **kwds):
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000722 'Return a new Point object replacing specified fields with new values'
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000723 result = _self._make(map(kwds.pop, ('x', 'y'), _self))
Christian Heimesfaf2f632008-01-06 16:59:19 +0000724 if kwds:
Ezio Melotti8f7649e2009-09-13 04:48:45 +0000725 raise ValueError('Got unexpected field names: %r' % list(kwds.keys()))
Christian Heimesfaf2f632008-01-06 16:59:19 +0000726 return result
Georg Brandl48310cd2009-01-03 21:18:54 +0000727 <BLANKLINE>
728 def __getnewargs__(self):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000729 'Return self as a plain tuple. Used by copy and pickle.'
Benjamin Peterson41181742008-07-02 20:22:54 +0000730 return tuple(self)
Christian Heimesfe337bf2008-03-23 21:54:12 +0000731 <BLANKLINE>
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000732 x = _property(_itemgetter(0), doc='Alias for field number 0')
733 y = _property(_itemgetter(1), doc='Alias for field number 1')
Georg Brandl116aa622007-08-15 14:28:22 +0000734
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000735 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Christian Heimes99170a52007-12-19 02:07:34 +0000736 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000737 33
738 >>> x, y = p # unpack like a regular tuple
739 >>> x, y
740 (11, 22)
Christian Heimesc3f30c42008-02-22 16:37:40 +0000741 >>> p.x + p.y # fields also accessible by name
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000742 33
743 >>> p # readable __repr__ with a name=value style
744 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000745
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000746Named tuples are especially useful for assigning field names to result tuples returned
747by the :mod:`csv` or :mod:`sqlite3` modules::
748
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000749 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000750
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000751 import csv
Christian Heimesfaf2f632008-01-06 16:59:19 +0000752 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000753 print(emp.name, emp.title)
754
Georg Brandl9afde1c2007-11-01 20:32:30 +0000755 import sqlite3
756 conn = sqlite3.connect('/companydata')
757 cursor = conn.cursor()
758 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Christian Heimesfaf2f632008-01-06 16:59:19 +0000759 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Christian Heimes00412232008-01-10 16:02:19 +0000760 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000761
Christian Heimes99170a52007-12-19 02:07:34 +0000762In addition to the methods inherited from tuples, named tuples support
Christian Heimes2380ac72008-01-09 00:17:24 +0000763three additional methods and one attribute. To prevent conflicts with
764field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000765
Christian Heimes790c8232008-01-07 21:14:23 +0000766.. method:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000767
Christian Heimesfaf2f632008-01-06 16:59:19 +0000768 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000769
Christian Heimesfe337bf2008-03-23 21:54:12 +0000770.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000771
Christian Heimesfaf2f632008-01-06 16:59:19 +0000772 >>> t = [11, 22]
773 >>> Point._make(t)
774 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000775
Christian Heimes790c8232008-01-07 21:14:23 +0000776.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000777
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000778 Return a new :class:`OrderedDict` which maps field names to their corresponding
779 values::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000780
Christian Heimes0449f632007-12-15 01:27:15 +0000781 >>> p._asdict()
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000782 OrderedDict([('x', 11), ('y', 22)])
783
Raymond Hettingera88e4da2009-03-03 05:12:27 +0000784 .. versionchanged:: 3.1
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000785 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000786
Christian Heimes790c8232008-01-07 21:14:23 +0000787.. method:: somenamedtuple._replace(kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000788
Christian Heimesfe337bf2008-03-23 21:54:12 +0000789 Return a new instance of the named tuple replacing specified fields with new
790 values:
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000791
792::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000793
794 >>> p = Point(x=11, y=22)
Christian Heimes0449f632007-12-15 01:27:15 +0000795 >>> p._replace(x=33)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000796 Point(x=33, y=22)
797
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000798 >>> for partnum, record in inventory.items():
Christian Heimes454f37b2008-01-10 00:10:02 +0000799 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000800
Christian Heimes790c8232008-01-07 21:14:23 +0000801.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000802
Christian Heimes2380ac72008-01-09 00:17:24 +0000803 Tuple of strings listing the field names. Useful for introspection
Georg Brandl9afde1c2007-11-01 20:32:30 +0000804 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000805
Christian Heimesfe337bf2008-03-23 21:54:12 +0000806.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000807
Christian Heimes0449f632007-12-15 01:27:15 +0000808 >>> p._fields # view the field names
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000809 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000810
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000811 >>> Color = namedtuple('Color', 'red green blue')
Christian Heimes0449f632007-12-15 01:27:15 +0000812 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000813 >>> Pixel(11, 22, 128, 255, 0)
Christian Heimes454f37b2008-01-10 00:10:02 +0000814 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000815
Christian Heimes0449f632007-12-15 01:27:15 +0000816To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000817function:
Christian Heimes0449f632007-12-15 01:27:15 +0000818
819 >>> getattr(p, 'x')
820 11
821
Raymond Hettinger651453a2009-02-11 00:20:02 +0000822To convert a dictionary to a named tuple, use the double-star-operator
823(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000824
825 >>> d = {'x': 11, 'y': 22}
826 >>> Point(**d)
827 Point(x=11, y=22)
828
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000829Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000830functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000831a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000832
Christian Heimes043d6f62008-01-07 17:19:16 +0000833 >>> class Point(namedtuple('Point', 'x y')):
Christian Heimes25bb7832008-01-11 16:17:00 +0000834 ... __slots__ = ()
Christian Heimes454f37b2008-01-10 00:10:02 +0000835 ... @property
836 ... def hypot(self):
837 ... return (self.x ** 2 + self.y ** 2) ** 0.5
838 ... def __str__(self):
Christian Heimes25bb7832008-01-11 16:17:00 +0000839 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000840
Georg Brandl0df79792008-10-04 18:33:26 +0000841 >>> for p in Point(3, 4), Point(14, 5/7):
Christian Heimes00412232008-01-10 16:02:19 +0000842 ... print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000843 Point: x= 3.000 y= 4.000 hypot= 5.000
844 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000845
Georg Brandlaf5c2382009-12-28 08:02:38 +0000846The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +0000847keep memory requirements low by preventing the creation of instance dictionaries.
848
Christian Heimes2380ac72008-01-09 00:17:24 +0000849
850Subclassing is not useful for adding new, stored fields. Instead, simply
Christian Heimesfe337bf2008-03-23 21:54:12 +0000851create a new named tuple type from the :attr:`_fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +0000852
Christian Heimes25bb7832008-01-11 16:17:00 +0000853 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +0000854
855Default values can be implemented by using :meth:`_replace` to
Christian Heimesfe337bf2008-03-23 21:54:12 +0000856customize a prototype instance:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000857
858 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Christian Heimes587c2bf2008-01-19 16:21:02 +0000859 >>> default_account = Account('<owner name>', 0.0, 0)
860 >>> johns_account = default_account._replace(owner='John')
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000861
Christian Heimese4ca8152008-05-08 17:18:53 +0000862Enumerated constants can be implemented with named tuples, but it is simpler
863and more efficient to use a simple class declaration:
864
865 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
866 >>> Status.open, Status.pending, Status.closed
867 (0, 1, 2)
868 >>> class Status:
869 ... open, pending, closed = range(3)
870
Raymond Hettinger651453a2009-02-11 00:20:02 +0000871.. seealso::
Thomas Wouters47b49bf2007-08-30 22:15:33 +0000872
Raymond Hettinger651453a2009-02-11 00:20:02 +0000873 `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
874 adapted for Python 2.4.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000875
876
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000877:class:`OrderedDict` objects
878----------------------------
879
880Ordered dictionaries are just like regular dictionaries but they remember the
881order that items were inserted. When iterating over an ordered dictionary,
882the items are returned in the order their keys were first added.
883
884.. class:: OrderedDict([items])
885
886 Return an instance of a dict subclass, supporting the usual :class:`dict`
887 methods. An *OrderedDict* is a dict that remembers the order that keys
888 were first inserted. If a new entry overwrites an existing entry, the
889 original insertion position is left unchanged. Deleting an entry and
890 reinserting it will move it to the end.
891
Benjamin Petersond45bf582009-03-02 21:44:54 +0000892 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000893
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000894.. method:: OrderedDict.popitem(last=True)
895
896 The :meth:`popitem` method for ordered dictionaries returns and removes
897 a (key, value) pair. The pairs are returned in LIFO order if *last* is
898 true or FIFO order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000899
Raymond Hettingere9091502009-05-19 17:40:07 +0000900In addition to the usual mapping methods, ordered dictionaries also support
901reverse iteration using :func:`reversed`.
902
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000903Equality tests between :class:`OrderedDict` objects are order-sensitive
904and are implemented as ``list(od1.items())==list(od2.items())``.
905Equality tests between :class:`OrderedDict` objects and other
906:class:`Mapping` objects are order-insensitive like regular dictionaries.
907This allows :class:`OrderedDict` objects to be substituted anywhere a
908regular dictionary is used.
909
Raymond Hettinger36180782009-04-09 22:34:23 +0000910The :class:`OrderedDict` constructor and :meth:`update` method both accept
911keyword arguments, but their order is lost because Python's function call
912semantics pass-in keyword arguments using a regular unordered dictionary.
913
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000914.. seealso::
915
916 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
917 that runs on Python 2.4 or later.
918
Raymond Hettinger0e312012009-11-10 18:35:46 +0000919Since an ordered dictionary remembers its insertion order, it can be used
920in conjuction with sorting to make a sorted dictionary::
921
922 >>> # regular unsorted dictionary
923 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
924
925 >>> # dictionary sorted by key
926 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
927 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
928
929 >>> # dictionary sorted by value
930 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
931 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
932
933 >>> # dictionary sorted by length of the key string
934 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
935 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
936
937The new sorted dictionaries maintain their sort order when entries
938are deleted. But when new keys are added, the keys are appended
939to the end and the sort is not maintained.
940
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000941
942:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +0000943-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000944
Georg Brandl48310cd2009-01-03 21:18:54 +0000945The class, :class:`UserDict` acts as a wrapper around dictionary objects.
946The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000947subclass directly from :class:`dict`; however, this class can be easier
948to work with because the underlying dictionary is accessible as an
949attribute.
950
951.. class:: UserDict([initialdata])
952
953 Class that simulates a dictionary. The instance's contents are kept in a
954 regular dictionary, which is accessible via the :attr:`data` attribute of
955 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
956 initialized with its contents; note that a reference to *initialdata* will not
957 be kept, allowing it be used for other purposes.
958
Georg Brandl48310cd2009-01-03 21:18:54 +0000959In addition to supporting the methods and operations of mappings,
Raymond Hettingerebcee3f2008-02-06 19:54:00 +0000960:class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000961
962.. attribute:: UserDict.data
963
964 A real dictionary used to store the contents of the :class:`UserDict` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000965
966
967
968:class:`UserList` objects
969-------------------------
970
971This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +0000972for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000973existing methods or add new ones. In this way, one can add new behaviors to
974lists.
975
Georg Brandl48310cd2009-01-03 21:18:54 +0000976The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000977subclass directly from :class:`list`; however, this class can be easier
978to work with because the underlying list is accessible as an attribute.
979
980.. class:: UserList([list])
981
982 Class that simulates a list. The instance's contents are kept in a regular
983 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
984 instances. The instance's contents are initially set to a copy of *list*,
985 defaulting to the empty list ``[]``. *list* can be any iterable, for
986 example a real Python list or a :class:`UserList` object.
987
Georg Brandl48310cd2009-01-03 21:18:54 +0000988In addition to supporting the methods and operations of mutable sequences,
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000989:class:`UserList` instances provide the following attribute:
990
991.. attribute:: UserList.data
992
Georg Brandl48310cd2009-01-03 21:18:54 +0000993 A real :class:`list` object used to store the contents of the
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000994 :class:`UserList` class.
995
996**Subclassing requirements:** Subclasses of :class:`UserList` are expect to
997offer a constructor which can be called with either no arguments or one
998argument. List operations which return a new sequence attempt to create an
999instance of the actual implementation class. To do so, it assumes that the
1000constructor can be called with a single parameter, which is a sequence object
1001used as a data source.
1002
1003If a derived class does not wish to comply with this requirement, all of the
1004special methods supported by this class will need to be overridden; please
1005consult the sources for information about the methods which need to be provided
1006in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001007
1008:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +00001009---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001010
Georg Brandl48310cd2009-01-03 21:18:54 +00001011The class, :class:`UserString` acts as a wrapper around string objects.
1012The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001013subclass directly from :class:`str`; however, this class can be easier
1014to work with because the underlying string is accessible as an
1015attribute.
1016
1017.. class:: UserString([sequence])
1018
1019 Class that simulates a string or a Unicode string object. The instance's
Georg Brandl48310cd2009-01-03 21:18:54 +00001020 content is kept in a regular string object, which is accessible via the
1021 :attr:`data` attribute of :class:`UserString` instances. The instance's
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001022 contents are initially set to a copy of *sequence*. The *sequence* can
1023 be an instance of :class:`bytes`, :class:`str`, :class:`UserString` (or a
1024 subclass) or an arbitrary sequence which can be converted into a string using
1025 the built-in :func:`str` function.