blob: dd3e5a49f70169a6c349f7b285cf1db8bd4a7906 [file] [log] [blame]
Georg Brandl8ec7f652007-08-15 14:28:01 +00001
2:mod:`collections` --- High-performance container datatypes
3===========================================================
4
5.. module:: collections
6 :synopsis: High-performance datatypes
7.. moduleauthor:: Raymond Hettinger <python@rcn.com>
8.. sectionauthor:: Raymond Hettinger <python@rcn.com>
9
Georg Brandl8ec7f652007-08-15 14:28:01 +000010.. versionadded:: 2.4
11
Georg Brandl4c8bbe62008-03-22 21:06:20 +000012.. testsetup:: *
13
14 from collections import *
15 import itertools
16 __name__ = '<doctest>'
17
Georg Brandl8ec7f652007-08-15 14:28:01 +000018This module implements high-performance container datatypes. Currently,
Raymond Hettingerc4f46972009-04-10 06:38:39 +000019there are four datatypes, :class:`Counter`, :class:`deque`, :class:`OrderedDict` and
Raymond Hettingerf746a1f2009-02-17 08:33:01 +000020:class:`defaultdict`, and one datatype factory function, :func:`namedtuple`.
Georg Brandl8ec7f652007-08-15 14:28:01 +000021
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000022The specialized containers provided in this module provide alternatives
Georg Brandl4c8bbe62008-03-22 21:06:20 +000023to Python's general purpose built-in containers, :class:`dict`,
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000024:class:`list`, :class:`set`, and :class:`tuple`.
25
Raymond Hettingerf746a1f2009-02-17 08:33:01 +000026.. versionchanged:: 2.4
27 Added :class:`deque`.
28
29.. versionchanged:: 2.5
30 Added :class:`defaultdict`.
31
32.. versionchanged:: 2.6
33 Added :func:`namedtuple` and added abstract base classes.
34
35.. versionchanged:: 2.7
Raymond Hettingerbc512d32009-03-03 04:45:34 +000036 Added :class:`Counter` and :class:`OrderedDict`.
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000037
38In addition to containers, the collections module provides some ABCs
Georg Brandl4c8bbe62008-03-22 21:06:20 +000039(abstract base classes) that can be used to test whether a class
Raymond Hettingerf746a1f2009-02-17 08:33:01 +000040provides a particular interface, for example, whether it is hashable or
Georg Brandl4c8bbe62008-03-22 21:06:20 +000041a mapping.
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000042
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000043
44ABCs - abstract base classes
45----------------------------
46
47The collections module offers the following ABCs:
48
Georg Brandldbc59872008-07-08 07:05:23 +000049========================= ===================== ====================== ====================================================
50ABC Inherits Abstract Methods Mixin Methods
51========================= ===================== ====================== ====================================================
52:class:`Container` ``__contains__``
53:class:`Hashable` ``__hash__``
54:class:`Iterable` ``__iter__``
Ezio Melotti84424f12010-02-26 23:27:06 +000055:class:`Iterator` :class:`Iterable` ``next`` ``__iter__``
Georg Brandl7044b112009-01-03 21:04:55 +000056:class:`Sized` ``__len__``
Georg Brandldbc59872008-07-08 07:05:23 +000057:class:`Callable` ``__call__``
Georg Brandlc62ef8b2009-01-03 20:55:06 +000058
Georg Brandldbc59872008-07-08 07:05:23 +000059:class:`Sequence` :class:`Sized`, ``__getitem__`` ``__contains__``. ``__iter__``, ``__reversed__``.
Raymond Hettinger1712baa2009-01-28 23:58:16 +000060 :class:`Iterable`, ``index``, and ``count``
Georg Brandlc62ef8b2009-01-03 20:55:06 +000061 :class:`Container`
62
Raymond Hettinger1712baa2009-01-28 23:58:16 +000063:class:`MutableSequence` :class:`Sequence` ``__setitem__`` Inherited Sequence methods and
Georg Brandldbc59872008-07-08 07:05:23 +000064 ``__delitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
Raymond Hettinger1712baa2009-01-28 23:58:16 +000065 and ``insert`` ``remove``, and ``__iadd__``
Georg Brandlc62ef8b2009-01-03 20:55:06 +000066
Raymond Hettinger1712baa2009-01-28 23:58:16 +000067:class:`Set` :class:`Sized`, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
68 :class:`Iterable`, ``__gt__``, ``__ge__``, ``__and__``, ``__or__``
69 :class:`Container` ``__sub__``, ``__xor__``, and ``isdisjoint``
Georg Brandlc62ef8b2009-01-03 20:55:06 +000070
Georg Brandldbc59872008-07-08 07:05:23 +000071:class:`MutableSet` :class:`Set` ``add`` and Inherited Set methods and
72 ``discard`` ``clear``, ``pop``, ``remove``, ``__ior__``,
73 ``__iand__``, ``__ixor__``, and ``__isub__``
Georg Brandlc62ef8b2009-01-03 20:55:06 +000074
Raymond Hettinger1712baa2009-01-28 23:58:16 +000075:class:`Mapping` :class:`Sized`, ``__getitem__`` ``__contains__``, ``keys``, ``items``, ``values``,
76 :class:`Iterable`, ``get``, ``__eq__``, and ``__ne__``
77 :class:`Container`
Georg Brandlc62ef8b2009-01-03 20:55:06 +000078
Raymond Hettinger1712baa2009-01-28 23:58:16 +000079:class:`MutableMapping` :class:`Mapping` ``__setitem__`` and Inherited Mapping methods and
80 ``__delitem__`` ``pop``, ``popitem``, ``clear``, ``update``,
81 and ``setdefault``
82
Georg Brandlc62ef8b2009-01-03 20:55:06 +000083
Georg Brandldbc59872008-07-08 07:05:23 +000084:class:`MappingView` :class:`Sized` ``__len__``
85:class:`KeysView` :class:`MappingView`, ``__contains__``,
86 :class:`Set` ``__iter__``
87:class:`ItemsView` :class:`MappingView`, ``__contains__``,
88 :class:`Set` ``__iter__``
89:class:`ValuesView` :class:`MappingView` ``__contains__``, ``__iter__``
90========================= ===================== ====================== ====================================================
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000091
92These ABCs allow us to ask classes or instances if they provide
93particular functionality, for example::
94
95 size = None
96 if isinstance(myvar, collections.Sized):
Georg Brandl7044b112009-01-03 21:04:55 +000097 size = len(myvar)
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000098
99Several of the ABCs are also useful as mixins that make it easier to develop
100classes supporting container APIs. For example, to write a class supporting
101the full :class:`Set` API, it only necessary to supply the three underlying
102abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
103The ABC supplies the remaining methods such as :meth:`__and__` and
104:meth:`isdisjoint` ::
105
106 class ListBasedSet(collections.Set):
107 ''' Alternate set implementation favoring space over speed
108 and not requiring the set elements to be hashable. '''
109 def __init__(self, iterable):
110 self.elements = lst = []
111 for value in iterable:
112 if value not in lst:
113 lst.append(value)
114 def __iter__(self):
115 return iter(self.elements)
116 def __contains__(self, value):
117 return value in self.elements
118 def __len__(self):
119 return len(self.elements)
120
121 s1 = ListBasedSet('abcdef')
122 s2 = ListBasedSet('defghi')
123 overlap = s1 & s2 # The __and__() method is supported automatically
124
125Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
126
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000127(1)
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +0000128 Since some set operations create new sets, the default mixin methods need
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000129 a way to create new instances from an iterable. The class constructor is
130 assumed to have a signature in the form ``ClassName(iterable)``.
Raymond Hettinger96b42402008-05-23 17:34:34 +0000131 That assumption is factored-out to an internal classmethod called
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +0000132 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
133 If the :class:`Set` mixin is being used in a class with a different
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000134 constructor signature, you will need to override :meth:`from_iterable`
135 with a classmethod that can construct new instances from
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +0000136 an iterable argument.
137
138(2)
139 To override the comparisons (presumably for speed, as the
140 semantics are fixed), redefine :meth:`__le__` and
141 then the other operations will automatically follow suit.
142
143(3)
144 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
145 for the set; however, :meth:`__hash__` is not defined because not all sets
146 are hashable or immutable. To add set hashabilty using mixins,
147 inherit from both :meth:`Set` and :meth:`Hashable`, then define
148 ``__hash__ = Set._hash``.
149
Raymond Hettinger2cef1a52009-03-20 18:25:49 +0000150.. seealso::
151
152 * `OrderedSet recipe <http://code.activestate.com/recipes/576694/>`_ for an
153 example built on :class:`MutableSet`.
154
155 * For more about ABCs, see the :mod:`abc` module and :pep:`3119`.
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +0000156
157
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000158:class:`Counter` objects
159------------------------
160
161A counter tool is provided to support convenient and rapid tallies.
162For example::
163
Raymond Hettinger939a3cc2009-02-04 11:31:30 +0000164 >>> # Tally occurrences of words in a list
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000165 >>> cnt = Counter()
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000166 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000167 ... cnt[word] += 1
168 >>> cnt
Raymond Hettingeraaa6e632009-01-13 01:05:03 +0000169 Counter({'blue': 3, 'red': 2, 'green': 1})
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000170
Raymond Hettinger939a3cc2009-02-04 11:31:30 +0000171 >>> # Find the ten most common words in Hamlet
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000172 >>> import re
173 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger196a0f72009-01-20 12:59:36 +0000174 >>> Counter(words).most_common(10)
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000175 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
176 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
177
Raymond Hettinger82783852009-01-13 03:49:43 +0000178.. class:: Counter([iterable-or-mapping])
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000179
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000180 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingeraaa6e632009-01-13 01:05:03 +0000181 It is an unordered collection where elements are stored as dictionary keys
182 and their counts are stored as dictionary values. Counts are allowed to be
183 any integer value including zero or negative counts. The :class:`Counter`
184 class is similar to bags or multisets in other languages.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000185
Raymond Hettinger82783852009-01-13 03:49:43 +0000186 Elements are counted from an *iterable* or initialized from another
Georg Brandlf6dab952009-04-28 21:48:35 +0000187 *mapping* (or counter):
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000188
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000189 >>> c = Counter() # a new, empty counter
190 >>> c = Counter('gallahad') # a new counter from an iterable
191 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
192 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000193
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000194 Counter objects have a dictionary interface except that they return a zero
Georg Brandlf6dab952009-04-28 21:48:35 +0000195 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000196
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000197 >>> c = Counter(['eggs', 'ham'])
Raymond Hettinger59894122009-01-14 00:15:21 +0000198 >>> c['bacon'] # count of a missing element is zero
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000199 0
200
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000201 Setting a count to zero does not remove an element from a counter.
202 Use ``del`` to remove it entirely:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000203
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000204 >>> c['sausage'] = 0 # counter entry with a zero count
205 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000206
207 .. versionadded:: 2.7
208
209
Ezio Melotti4edfe962010-04-04 06:50:58 +0000210 Counter objects support three methods beyond those available for all
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000211 dictionaries:
212
213 .. method:: elements()
214
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000215 Return an iterator over elements repeating each as many times as its
216 count. Elements are returned in arbitrary order. If an element's count
217 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000218
Raymond Hettinger196a0f72009-01-20 12:59:36 +0000219 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000220 >>> list(c.elements())
221 ['a', 'a', 'a', 'a', 'b', 'b']
222
223 .. method:: most_common([n])
224
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000225 Return a list of the *n* most common elements and their counts from the
Raymond Hettingerd507afd2009-02-04 10:52:32 +0000226 most common to the least. If *n* is not specified, :func:`most_common`
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000227 returns *all* elements in the counter. Elements with equal counts are
Georg Brandlf6dab952009-04-28 21:48:35 +0000228 ordered arbitrarily:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000229
230 >>> Counter('abracadabra').most_common(3)
231 [('a', 5), ('r', 2), ('b', 2)]
232
Raymond Hettinger34c35b22010-04-03 10:22:00 +0000233 .. method:: subtract([iterable-or-mapping])
234
235 Elements are subtracted from an *iterable* or from another *mapping*
236 (or counter). Like :meth:`dict.update` but subtracts counts instead
237 of replacing them. Both inputs and outputs may be zero or negative.
238
239 >>> c = Counter(a=4, b=2, c=0, d=-2)
240 >>> d = Counter(a=1, b=2, c=3, d=4)
241 >>> c.subtract(d)
242 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
243
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000244 The usual dictionary methods are available for :class:`Counter` objects
245 except for two which work differently for counters.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000246
247 .. method:: fromkeys(iterable)
248
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000249 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000250
Raymond Hettinger82783852009-01-13 03:49:43 +0000251 .. method:: update([iterable-or-mapping])
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000252
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000253 Elements are counted from an *iterable* or added-in from another
254 *mapping* (or counter). Like :meth:`dict.update` but adds counts
255 instead of replacing them. Also, the *iterable* is expected to be a
256 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000257
Raymond Hettingerfbcf7492009-01-13 08:38:14 +0000258Common patterns for working with :class:`Counter` objects::
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000259
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000260 sum(c.values()) # total of all counts
261 c.clear() # reset all counts
262 list(c) # list unique elements
263 set(c) # convert to a set
264 dict(c) # convert to a regular dictionary
265 c.items() # convert to a list of (elem, cnt) pairs
266 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
267 c.most_common()[:-n:-1] # n least common elements
268 c += Counter() # remove zero and negative counts
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000269
Raymond Hettingera6658532009-02-25 22:48:24 +0000270Several mathematical operations are provided for combining :class:`Counter`
271objects to produce multisets (counters that have counts greater than zero).
272Addition and subtraction combine counters by adding or subtracting the counts
273of corresponding elements. Intersection and union return the minimum and
274maximum of corresponding counts. Each operation can accept inputs with signed
275counts, but the output will exclude results with counts of zero or less.
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000276
Raymond Hettinger4571f342009-01-21 20:31:50 +0000277 >>> c = Counter(a=3, b=1)
278 >>> d = Counter(a=1, b=2)
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000279 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000280 Counter({'a': 4, 'b': 3})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000281 >>> c - d # subtract (keeping only positive counts)
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000282 Counter({'a': 2})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000283 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000284 Counter({'a': 1, 'b': 1})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000285 >>> c | d # union: max(c[x], d[x])
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000286 Counter({'a': 3, 'b': 2})
287
Raymond Hettinger44340e62010-04-12 21:12:06 +0000288.. note::
289
290 Counters were primarily designed to work with positive integers to represent
291 running counts; however, care was taken to not unnecessarily preclude use
292 cases needing other types or negative values. To help with those use cases,
293 this section documents the minimum range and type restrictions.
294
295 * The :class:`Counter` class itself is a dictionary subclass with no
296 restrictions on its keys and values. The values are intended to be numbers
297 representing counts, but you *could* store anything in the value field.
298
299 * The :meth:`most_common` method requires only that the values be orderable.
300
301 * For in-place operations such as ``c[key] += 1``, the value type need only
302 support addition and subtraction. So fractions, floats, and decimals would
303 work and negative values are supported. The same is also true for
304 :meth:`update` and :meth:`subtract` which allow negative and zero values
305 for both inputs and outputs.
306
307 * The multiset methods are designed only for use cases with positive values.
308 The inputs may be negative or zero, but only outputs with positive values
309 are created. There are no type restrictions, but the value type needs to
310 support support addition, subtraction, and comparison.
311
312 * The :meth:`elements` method requires integer counts. It ignores zero and
313 negative counts.
314
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000315.. seealso::
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000316
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000317 * `Counter class <http://code.activestate.com/recipes/576611/>`_
318 adapted for Python 2.5 and an early `Bag recipe
319 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
320
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000321 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
322 in Smalltalk.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000323
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000324 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_\.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000325
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000326 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000327 tutorial with examples.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000328
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000329 * For mathematical operations on multisets and their use cases, see
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000330 *Knuth, Donald. The Art of Computer Programming Volume II,
331 Section 4.6.3, Exercise 19*\.
332
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000333 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd081abc2009-01-27 02:58:49 +0000334 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000335
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000336 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerfbcf7492009-01-13 08:38:14 +0000337
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000338
Georg Brandl8ec7f652007-08-15 14:28:01 +0000339:class:`deque` objects
340----------------------
341
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000342.. class:: deque([iterable[, maxlen]])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000343
344 Returns a new deque object initialized left-to-right (using :meth:`append`) with
345 data from *iterable*. If *iterable* is not specified, the new deque is empty.
346
347 Deques are a generalization of stacks and queues (the name is pronounced "deck"
348 and is short for "double-ended queue"). Deques support thread-safe, memory
349 efficient appends and pops from either side of the deque with approximately the
350 same O(1) performance in either direction.
351
352 Though :class:`list` objects support similar operations, they are optimized for
353 fast fixed-length operations and incur O(n) memory movement costs for
354 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
355 position of the underlying data representation.
356
357 .. versionadded:: 2.4
358
Raymond Hettinger68995862007-10-10 00:26:46 +0000359 If *maxlen* is not specified or is *None*, deques may grow to an
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000360 arbitrary length. Otherwise, the deque is bounded to the specified maximum
361 length. Once a bounded length deque is full, when new items are added, a
362 corresponding number of items are discarded from the opposite end. Bounded
363 length deques provide functionality similar to the ``tail`` filter in
364 Unix. They are also useful for tracking transactions and other pools of data
365 where only the most recent activity is of interest.
366
367 .. versionchanged:: 2.6
Georg Brandlb19be572007-12-29 10:57:00 +0000368 Added *maxlen* parameter.
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000369
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000370 Deque objects support the following methods:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000371
372
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000373 .. method:: append(x)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000374
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000375 Add *x* to the right side of the deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000376
377
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000378 .. method:: appendleft(x)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000379
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000380 Add *x* to the left side of the deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000381
382
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000383 .. method:: clear()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000384
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000385 Remove all elements from the deque leaving it with length 0.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000386
387
Raymond Hettinger5f516ed2010-04-03 18:10:37 +0000388 .. method:: count(x)
389
390 Count the number of deque elements equal to *x*.
391
392 .. versionadded:: 2.7
393
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000394 .. method:: extend(iterable)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000395
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000396 Extend the right side of the deque by appending elements from the iterable
397 argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000398
399
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000400 .. method:: extendleft(iterable)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000401
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000402 Extend the left side of the deque by appending elements from *iterable*.
403 Note, the series of left appends results in reversing the order of
404 elements in the iterable argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000405
406
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000407 .. method:: pop()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000408
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000409 Remove and return an element from the right side of the deque. If no
410 elements are present, raises an :exc:`IndexError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000411
412
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000413 .. method:: popleft()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000414
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000415 Remove and return an element from the left side of the deque. If no
416 elements are present, raises an :exc:`IndexError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000417
418
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000419 .. method:: remove(value)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000420
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000421 Removed the first occurrence of *value*. If not found, raises a
422 :exc:`ValueError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000423
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000424 .. versionadded:: 2.5
Georg Brandl8ec7f652007-08-15 14:28:01 +0000425
Raymond Hettingera5fd24e2009-12-10 06:42:54 +0000426 .. method:: reverse()
427
428 Reverse the elements of the deque in-place and then return ``None``.
429
430 .. versionadded:: 2.7
Georg Brandl8ec7f652007-08-15 14:28:01 +0000431
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000432 .. method:: rotate(n)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000433
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000434 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
435 the left. Rotating one step to the right is equivalent to:
436 ``d.appendleft(d.pop())``.
437
Georg Brandl8ec7f652007-08-15 14:28:01 +0000438
Raymond Hettinger56411aa2009-03-10 12:50:59 +0000439 Deque objects also provide one read-only attribute:
440
441 .. attribute:: maxlen
442
443 Maximum size of a deque or *None* if unbounded.
444
445 .. versionadded:: 2.7
446
447
Georg Brandl8ec7f652007-08-15 14:28:01 +0000448In addition to the above, deques support iteration, pickling, ``len(d)``,
449``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson5c4e0062008-10-16 18:52:14 +0000450the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
451access is O(1) at both ends but slows to O(n) in the middle. For fast random
452access, use lists instead.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000453
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000454Example:
455
456.. doctest::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000457
458 >>> from collections import deque
459 >>> d = deque('ghi') # make a new deque with three items
460 >>> for elem in d: # iterate over the deque's elements
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000461 ... print elem.upper()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000462 G
463 H
464 I
465
466 >>> d.append('j') # add a new entry to the right side
467 >>> d.appendleft('f') # add a new entry to the left side
468 >>> d # show the representation of the deque
469 deque(['f', 'g', 'h', 'i', 'j'])
470
471 >>> d.pop() # return and remove the rightmost item
472 'j'
473 >>> d.popleft() # return and remove the leftmost item
474 'f'
475 >>> list(d) # list the contents of the deque
476 ['g', 'h', 'i']
477 >>> d[0] # peek at leftmost item
478 'g'
479 >>> d[-1] # peek at rightmost item
480 'i'
481
482 >>> list(reversed(d)) # list the contents of a deque in reverse
483 ['i', 'h', 'g']
484 >>> 'h' in d # search the deque
485 True
486 >>> d.extend('jkl') # add multiple elements at once
487 >>> d
488 deque(['g', 'h', 'i', 'j', 'k', 'l'])
489 >>> d.rotate(1) # right rotation
490 >>> d
491 deque(['l', 'g', 'h', 'i', 'j', 'k'])
492 >>> d.rotate(-1) # left rotation
493 >>> d
494 deque(['g', 'h', 'i', 'j', 'k', 'l'])
495
496 >>> deque(reversed(d)) # make a new deque in reverse order
497 deque(['l', 'k', 'j', 'i', 'h', 'g'])
498 >>> d.clear() # empty the deque
499 >>> d.pop() # cannot pop from an empty deque
500 Traceback (most recent call last):
501 File "<pyshell#6>", line 1, in -toplevel-
502 d.pop()
503 IndexError: pop from an empty deque
504
505 >>> d.extendleft('abc') # extendleft() reverses the input order
506 >>> d
507 deque(['c', 'b', 'a'])
508
509
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000510:class:`deque` Recipes
511^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl8ec7f652007-08-15 14:28:01 +0000512
513This section shows various approaches to working with deques.
514
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000515Bounded length deques provide functionality similar to the ``tail`` filter
516in Unix::
517
518 def tail(filename, n=10):
519 'Return the last n lines of a file'
520 return deque(open(filename), n)
521
522Another approach to using deques is to maintain a sequence of recently
523added elements by appending to the right and popping to the left::
524
525 def moving_average(iterable, n=3):
526 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
527 # http://en.wikipedia.org/wiki/Moving_average
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000528 it = iter(iterable)
Raymond Hettinger9b6f13e2009-05-22 01:06:44 +0000529 d = deque(itertools.islice(it, n-1))
530 d.appendleft(0)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000531 s = sum(d)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000532 for elem in it:
533 s += elem - d.popleft()
534 d.append(elem)
Raymond Hettinger9b6f13e2009-05-22 01:06:44 +0000535 yield s / float(n)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000536
Georg Brandl8ec7f652007-08-15 14:28:01 +0000537The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti062d2b52009-12-19 22:41:49 +0000538deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl8ec7f652007-08-15 14:28:01 +0000539the :meth:`rotate` method to position elements to be popped::
540
541 def delete_nth(d, n):
542 d.rotate(-n)
543 d.popleft()
544 d.rotate(n)
545
546To implement :class:`deque` slicing, use a similar approach applying
547:meth:`rotate` to bring a target element to the left side of the deque. Remove
548old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
549reverse the rotation.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000550With minor variations on that approach, it is easy to implement Forth style
551stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
552``rot``, and ``roll``.
553
Georg Brandl8ec7f652007-08-15 14:28:01 +0000554
555:class:`defaultdict` objects
556----------------------------
557
Georg Brandl8ec7f652007-08-15 14:28:01 +0000558.. class:: defaultdict([default_factory[, ...]])
559
560 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandld7d4fd72009-07-26 14:37:28 +0000561 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl8ec7f652007-08-15 14:28:01 +0000562 instance variable. The remaining functionality is the same as for the
563 :class:`dict` class and is not documented here.
564
565 The first argument provides the initial value for the :attr:`default_factory`
566 attribute; it defaults to ``None``. All remaining arguments are treated the same
567 as if they were passed to the :class:`dict` constructor, including keyword
568 arguments.
569
570 .. versionadded:: 2.5
571
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000572 :class:`defaultdict` objects support the following method in addition to the
573 standard :class:`dict` operations:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000574
575
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000576 .. method:: defaultdict.__missing__(key)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000577
Skip Montanarob40890d2008-09-17 11:50:36 +0000578 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000579 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000580
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000581 If :attr:`default_factory` is not ``None``, it is called without arguments
582 to provide a default value for the given *key*, this value is inserted in
583 the dictionary for the *key*, and returned.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000584
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000585 If calling :attr:`default_factory` raises an exception this exception is
586 propagated unchanged.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000587
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000588 This method is called by the :meth:`__getitem__` method of the
589 :class:`dict` class when the requested key is not found; whatever it
590 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000591
592
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000593 :class:`defaultdict` objects support the following instance variable:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000594
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000595
596 .. attribute:: defaultdict.default_factory
597
598 This attribute is used by the :meth:`__missing__` method; it is
599 initialized from the first argument to the constructor, if present, or to
600 ``None``, if absent.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000601
602
Georg Brandl8ec7f652007-08-15 14:28:01 +0000603:class:`defaultdict` Examples
604^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
605
606Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000607sequence of key-value pairs into a dictionary of lists:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000608
609 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
610 >>> d = defaultdict(list)
611 >>> for k, v in s:
612 ... d[k].append(v)
613 ...
614 >>> d.items()
615 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
616
617When each key is encountered for the first time, it is not already in the
618mapping; so an entry is automatically created using the :attr:`default_factory`
619function which returns an empty :class:`list`. The :meth:`list.append`
620operation then attaches the value to the new list. When keys are encountered
621again, the look-up proceeds normally (returning the list for that key) and the
622:meth:`list.append` operation adds another value to the list. This technique is
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000623simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000624
625 >>> d = {}
626 >>> for k, v in s:
627 ... d.setdefault(k, []).append(v)
628 ...
629 >>> d.items()
630 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
631
632Setting the :attr:`default_factory` to :class:`int` makes the
633:class:`defaultdict` useful for counting (like a bag or multiset in other
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000634languages):
Georg Brandl8ec7f652007-08-15 14:28:01 +0000635
636 >>> s = 'mississippi'
637 >>> d = defaultdict(int)
638 >>> for k in s:
639 ... d[k] += 1
640 ...
641 >>> d.items()
642 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
643
644When a letter is first encountered, it is missing from the mapping, so the
645:attr:`default_factory` function calls :func:`int` to supply a default count of
646zero. The increment operation then builds up the count for each letter.
647
648The function :func:`int` which always returns zero is just a special case of
649constant functions. A faster and more flexible way to create constant functions
650is to use :func:`itertools.repeat` which can supply any constant value (not just
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000651zero):
Georg Brandl8ec7f652007-08-15 14:28:01 +0000652
653 >>> def constant_factory(value):
654 ... return itertools.repeat(value).next
655 >>> d = defaultdict(constant_factory('<missing>'))
656 >>> d.update(name='John', action='ran')
657 >>> '%(name)s %(action)s to %(object)s' % d
658 'John ran to <missing>'
659
660Setting the :attr:`default_factory` to :class:`set` makes the
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000661:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000662
663 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
664 >>> d = defaultdict(set)
665 >>> for k, v in s:
666 ... d[k].add(v)
667 ...
668 >>> d.items()
669 [('blue', set([2, 4])), ('red', set([1, 3]))]
670
671
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000672:func:`namedtuple` Factory Function for Tuples with Named Fields
Georg Brandlb3255ed2008-01-07 16:43:47 +0000673----------------------------------------------------------------
Georg Brandl8ec7f652007-08-15 14:28:01 +0000674
Raymond Hettingercbab5942007-09-18 22:18:02 +0000675Named tuples assign meaning to each position in a tuple and allow for more readable,
676self-documenting code. They can be used wherever regular tuples are used, and
677they add the ability to access fields by name instead of position index.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000678
Raymond Hettinger322daea2009-02-10 01:24:05 +0000679.. function:: namedtuple(typename, field_names, [verbose], [rename])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000680
681 Returns a new tuple subclass named *typename*. The new subclass is used to
Georg Brandl907a7202008-02-22 12:31:45 +0000682 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl8ec7f652007-08-15 14:28:01 +0000683 well as being indexable and iterable. Instances of the subclass also have a
Georg Brandl061d2e22008-11-23 19:17:25 +0000684 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl8ec7f652007-08-15 14:28:01 +0000685 method which lists the tuple contents in a ``name=value`` format.
686
Georg Brandl061d2e22008-11-23 19:17:25 +0000687 The *field_names* are a single string with each fieldname separated by whitespace
688 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000689 can be a sequence of strings such as ``['x', 'y']``.
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000690
691 Any valid Python identifier may be used for a fieldname except for names
Raymond Hettinger42da8742007-12-14 02:49:47 +0000692 starting with an underscore. Valid identifiers consist of letters, digits,
693 and underscores but do not start with a digit or underscore and cannot be
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000694 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*, *print*,
695 or *raise*.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000696
Raymond Hettinger322daea2009-02-10 01:24:05 +0000697 If *rename* is true, invalid fieldnames are automatically replaced
698 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger6df48a32009-04-02 22:34:17 +0000699 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Raymond Hettinger322daea2009-02-10 01:24:05 +0000700 ``def`` and the duplicate fieldname ``abc``.
701
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000702 If *verbose* is true, the class definition is printed just before being built.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000703
Raymond Hettingera48a2992007-10-08 21:26:58 +0000704 Named tuple instances do not have per-instance dictionaries, so they are
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000705 lightweight and require no more memory than regular tuples.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000706
Georg Brandl8ec7f652007-08-15 14:28:01 +0000707 .. versionadded:: 2.6
708
Raymond Hettinger322daea2009-02-10 01:24:05 +0000709 .. versionchanged:: 2.7
710 added support for *rename*.
711
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000712Example:
713
714.. doctest::
715 :options: +NORMALIZE_WHITESPACE
Georg Brandl8ec7f652007-08-15 14:28:01 +0000716
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000717 >>> Point = namedtuple('Point', 'x y', verbose=True)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000718 class Point(tuple):
719 'Point(x, y)'
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000720 <BLANKLINE>
Raymond Hettingercbab5942007-09-18 22:18:02 +0000721 __slots__ = ()
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000722 <BLANKLINE>
Raymond Hettingere0734e72008-01-04 03:22:53 +0000723 _fields = ('x', 'y')
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000724 <BLANKLINE>
Raymond Hettingera68cad12009-05-27 02:24:45 +0000725 def __new__(_cls, x, y):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000726 'Create a new instance of Point(x, y)'
Raymond Hettingera68cad12009-05-27 02:24:45 +0000727 return _tuple.__new__(_cls, (x, y))
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000728 <BLANKLINE>
Raymond Hettinger02740f72008-01-05 01:35:43 +0000729 @classmethod
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000730 def _make(cls, iterable, new=tuple.__new__, len=len):
Raymond Hettinger02740f72008-01-05 01:35:43 +0000731 'Make a new Point object from a sequence or iterable'
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000732 result = new(cls, iterable)
Raymond Hettinger02740f72008-01-05 01:35:43 +0000733 if len(result) != 2:
734 raise TypeError('Expected 2 arguments, got %d' % len(result))
735 return result
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000736 <BLANKLINE>
Raymond Hettingercbab5942007-09-18 22:18:02 +0000737 def __repr__(self):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000738 'Return a nicely formatted representation string'
Raymond Hettingercbab5942007-09-18 22:18:02 +0000739 return 'Point(x=%r, y=%r)' % self
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000740 <BLANKLINE>
Raymond Hettinger88a91642009-03-03 04:51:24 +0000741 def _asdict(self):
742 'Return a new OrderedDict which maps field names to their values'
743 return OrderedDict(zip(self._fields, self))
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000744 <BLANKLINE>
Raymond Hettingera68cad12009-05-27 02:24:45 +0000745 def _replace(_self, **kwds):
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000746 'Return a new Point object replacing specified fields with new values'
Raymond Hettingera68cad12009-05-27 02:24:45 +0000747 result = _self._make(map(kwds.pop, ('x', 'y'), _self))
Raymond Hettinger1b50fd72008-01-05 02:17:24 +0000748 if kwds:
749 raise ValueError('Got unexpected field names: %r' % kwds.keys())
750 return result
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000751 <BLANKLINE>
752 def __getnewargs__(self):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000753 'Return self as a plain tuple. Used by copy and pickle.'
Raymond Hettingeree51cff2008-06-27 21:34:24 +0000754 return tuple(self)
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000755 <BLANKLINE>
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000756 x = _property(_itemgetter(0), doc='Alias for field number 0')
757 y = _property(_itemgetter(1), doc='Alias for field number 1')
Georg Brandl8ec7f652007-08-15 14:28:01 +0000758
Raymond Hettingercbab5942007-09-18 22:18:02 +0000759 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Raymond Hettinger88880b22007-12-18 00:13:45 +0000760 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000761 33
762 >>> x, y = p # unpack like a regular tuple
763 >>> x, y
764 (11, 22)
Georg Brandl907a7202008-02-22 12:31:45 +0000765 >>> p.x + p.y # fields also accessible by name
Raymond Hettingercbab5942007-09-18 22:18:02 +0000766 33
767 >>> p # readable __repr__ with a name=value style
768 Point(x=11, y=22)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000769
Raymond Hettingercbab5942007-09-18 22:18:02 +0000770Named tuples are especially useful for assigning field names to result tuples returned
771by the :mod:`csv` or :mod:`sqlite3` modules::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000772
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000773 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Raymond Hettingera48a2992007-10-08 21:26:58 +0000774
Raymond Hettingercbab5942007-09-18 22:18:02 +0000775 import csv
Raymond Hettinger02740f72008-01-05 01:35:43 +0000776 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Raymond Hettingercbab5942007-09-18 22:18:02 +0000777 print emp.name, emp.title
Georg Brandl8ec7f652007-08-15 14:28:01 +0000778
Raymond Hettingera48a2992007-10-08 21:26:58 +0000779 import sqlite3
780 conn = sqlite3.connect('/companydata')
781 cursor = conn.cursor()
782 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Raymond Hettinger02740f72008-01-05 01:35:43 +0000783 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Raymond Hettingera48a2992007-10-08 21:26:58 +0000784 print emp.name, emp.title
785
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000786In addition to the methods inherited from tuples, named tuples support
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000787three additional methods and one attribute. To prevent conflicts with
788field names, the method and attribute names start with an underscore.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000789
Georg Brandlb3255ed2008-01-07 16:43:47 +0000790.. method:: somenamedtuple._make(iterable)
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000791
Raymond Hettinger02740f72008-01-05 01:35:43 +0000792 Class method that makes a new instance from an existing sequence or iterable.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000793
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000794 .. doctest::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000795
Raymond Hettinger02740f72008-01-05 01:35:43 +0000796 >>> t = [11, 22]
797 >>> Point._make(t)
798 Point(x=11, y=22)
Raymond Hettinger2b03d452007-09-18 03:33:19 +0000799
Georg Brandlb3255ed2008-01-07 16:43:47 +0000800.. method:: somenamedtuple._asdict()
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000801
Raymond Hettinger88a91642009-03-03 04:51:24 +0000802 Return a new :class:`OrderedDict` which maps field names to their corresponding
803 values::
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000804
Raymond Hettinger42da8742007-12-14 02:49:47 +0000805 >>> p._asdict()
Raymond Hettinger88a91642009-03-03 04:51:24 +0000806 OrderedDict([('x', 11), ('y', 22)])
807
Raymond Hettingera07038d2009-03-03 05:11:56 +0000808 .. versionchanged:: 2.7
Raymond Hettinger88a91642009-03-03 04:51:24 +0000809 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000810
Georg Brandlb3255ed2008-01-07 16:43:47 +0000811.. method:: somenamedtuple._replace(kwargs)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000812
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000813 Return a new instance of the named tuple replacing specified fields with new
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000814 values::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000815
Raymond Hettingercbab5942007-09-18 22:18:02 +0000816 >>> p = Point(x=11, y=22)
Raymond Hettinger42da8742007-12-14 02:49:47 +0000817 >>> p._replace(x=33)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000818 Point(x=33, y=22)
819
Raymond Hettinger7c3738e2007-11-15 03:16:09 +0000820 >>> for partnum, record in inventory.items():
Raymond Hettingere11230e2008-01-09 03:02:23 +0000821 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000822
Georg Brandlb3255ed2008-01-07 16:43:47 +0000823.. attribute:: somenamedtuple._fields
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000824
Raymond Hettingerf6b769b2008-01-07 21:33:51 +0000825 Tuple of strings listing the field names. Useful for introspection
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000826 and for creating new named tuple types from existing named tuples.
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000827
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000828 .. doctest::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000829
Raymond Hettinger42da8742007-12-14 02:49:47 +0000830 >>> p._fields # view the field names
Raymond Hettingercbab5942007-09-18 22:18:02 +0000831 ('x', 'y')
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000832
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000833 >>> Color = namedtuple('Color', 'red green blue')
Raymond Hettinger42da8742007-12-14 02:49:47 +0000834 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000835 >>> Pixel(11, 22, 128, 255, 0)
Raymond Hettingerdc1854d2008-01-09 03:13:20 +0000836 Pixel(x=11, y=22, red=128, green=255, blue=0)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000837
Raymond Hettingere846f382007-12-14 21:51:50 +0000838To retrieve a field whose name is stored in a string, use the :func:`getattr`
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000839function:
Raymond Hettingere846f382007-12-14 21:51:50 +0000840
841 >>> getattr(p, 'x')
842 11
843
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000844To convert a dictionary to a named tuple, use the double-star-operator
845(as described in :ref:`tut-unpacking-arguments`):
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000846
847 >>> d = {'x': 11, 'y': 22}
848 >>> Point(**d)
849 Point(x=11, y=22)
850
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000851Since a named tuple is a regular Python class, it is easy to add or change
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000852functionality with a subclass. Here is how to add a calculated field and
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000853a fixed-width print format:
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000854
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000855 >>> class Point(namedtuple('Point', 'x y')):
Raymond Hettingere1655082008-01-10 19:15:10 +0000856 ... __slots__ = ()
Raymond Hettingere11230e2008-01-09 03:02:23 +0000857 ... @property
858 ... def hypot(self):
859 ... return (self.x ** 2 + self.y ** 2) ** 0.5
860 ... def __str__(self):
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000861 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000862
Raymond Hettingere1655082008-01-10 19:15:10 +0000863 >>> for p in Point(3, 4), Point(14, 5/7.):
Raymond Hettingere11230e2008-01-09 03:02:23 +0000864 ... print p
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000865 Point: x= 3.000 y= 4.000 hypot= 5.000
866 Point: x=14.000 y= 0.714 hypot=14.018
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000867
Georg Brandlfe8df4f2009-12-28 08:01:59 +0000868The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Raymond Hettinger171f3912008-01-16 23:38:16 +0000869keep memory requirements low by preventing the creation of instance dictionaries.
Raymond Hettingerf59e9622008-01-15 20:52:42 +0000870
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000871Subclassing is not useful for adding new, stored fields. Instead, simply
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000872create a new named tuple type from the :attr:`_fields` attribute:
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000873
Raymond Hettingere850c462008-01-10 20:37:12 +0000874 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000875
Raymond Hettingerfb3ced62008-01-07 20:17:35 +0000876Default values can be implemented by using :meth:`_replace` to
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000877customize a prototype instance:
Raymond Hettingerbc693492007-11-15 22:39:34 +0000878
879 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Raymond Hettinger0fe6ca42008-01-18 21:14:58 +0000880 >>> default_account = Account('<owner name>', 0.0, 0)
881 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerbc693492007-11-15 22:39:34 +0000882
Raymond Hettinger5a9fed72008-05-08 07:23:30 +0000883Enumerated constants can be implemented with named tuples, but it is simpler
884and more efficient to use a simple class declaration:
885
886 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
887 >>> Status.open, Status.pending, Status.closed
888 (0, 1, 2)
889 >>> class Status:
890 ... open, pending, closed = range(3)
891
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000892.. seealso::
Mark Summerfield7f626f42007-08-30 15:03:03 +0000893
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000894 `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
895 adapted for Python 2.4.
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000896
897
898:class:`OrderedDict` objects
899----------------------------
900
901Ordered dictionaries are just like regular dictionaries but they remember the
902order that items were inserted. When iterating over an ordered dictionary,
903the items are returned in the order their keys were first added.
904
905.. class:: OrderedDict([items])
906
907 Return an instance of a dict subclass, supporting the usual :class:`dict`
908 methods. An *OrderedDict* is a dict that remembers the order that keys
909 were first inserted. If a new entry overwrites an existing entry, the
910 original insertion position is left unchanged. Deleting an entry and
911 reinserting it will move it to the end.
912
913 .. versionadded:: 2.7
914
Raymond Hettinger24122992009-03-19 19:59:58 +0000915.. method:: OrderedDict.popitem(last=True)
916
917 The :meth:`popitem` method for ordered dictionaries returns and removes
918 a (key, value) pair. The pairs are returned in LIFO order if *last* is
919 true or FIFO order if false.
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000920
Raymond Hettinger50f362f2009-05-19 17:43:59 +0000921In addition to the usual mapping methods, ordered dictionaries also support
922reverse iteration using :func:`reversed`.
923
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000924Equality tests between :class:`OrderedDict` objects are order-sensitive
925and are implemented as ``list(od1.items())==list(od2.items())``.
926Equality tests between :class:`OrderedDict` objects and other
927:class:`Mapping` objects are order-insensitive like regular dictionaries.
928This allows :class:`OrderedDict` objects to be substituted anywhere a
929regular dictionary is used.
Raymond Hettinger24122992009-03-19 19:59:58 +0000930
Raymond Hettingerc473c5a2009-04-09 22:31:51 +0000931The :class:`OrderedDict` constructor and :meth:`update` method both accept
932keyword arguments, but their order is lost because Python's function call
933semantics pass-in keyword arguments using a regular unordered dictionary.
934
Raymond Hettinger24122992009-03-19 19:59:58 +0000935.. seealso::
936
937 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
938 that runs on Python 2.4 or later.
Raymond Hettinger610326d2009-11-10 19:35:55 +0000939
940Since an ordered dictionary remembers its insertion order, it can be used
941in conjuction with sorting to make a sorted dictionary::
942
943 >>> # regular unsorted dictionary
944 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
945
946 >>> # dictionary sorted by key
947 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
948 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
949
950 >>> # dictionary sorted by value
951 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
952 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
953
954 >>> # dictionary sorted by length of the key string
955 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
956 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
957
958The new sorted dictionaries maintain their sort order when entries
959are deleted. But when new keys are added, the keys are appended
960to the end and the sort is not maintained.