blob: 58278d1502043800e5b68e0abe9612ac6080527d [file] [log] [blame]
Georg Brandl8ec7f652007-08-15 14:28:01 +00001:mod:`collections` --- High-performance container datatypes
2===========================================================
3
4.. module:: collections
5 :synopsis: High-performance datatypes
6.. moduleauthor:: Raymond Hettinger <python@rcn.com>
7.. sectionauthor:: Raymond Hettinger <python@rcn.com>
8
Georg Brandl8ec7f652007-08-15 14:28:01 +00009.. versionadded:: 2.4
10
Georg Brandl4c8bbe62008-03-22 21:06:20 +000011.. testsetup:: *
12
13 from collections import *
14 import itertools
15 __name__ = '<doctest>'
16
Éric Araujo29a0b572011-08-19 02:14:03 +020017**Source code:** :source:`Lib/collections.py` and :source:`Lib/_abcoll.py`
18
19--------------
20
Raymond Hettinger3de8a302010-08-08 00:35:03 +000021This module implements specialized container datatypes providing alternatives to
22Python's general purpose built-in containers, :class:`dict`, :class:`list`,
23:class:`set`, and :class:`tuple`.
Georg Brandl8ec7f652007-08-15 14:28:01 +000024
Raymond Hettingeraa5f4aa2010-11-06 07:18:07 +000025===================== ==================================================================== ===========================
26:func:`namedtuple` factory function for creating tuple subclasses with named fields .. versionadded:: 2.6
27:class:`deque` list-like container with fast appends and pops on either end .. versionadded:: 2.4
28:class:`Counter` dict subclass for counting hashable objects .. versionadded:: 2.7
29:class:`OrderedDict` dict subclass that remembers the order entries were added .. versionadded:: 2.7
30:class:`defaultdict` dict subclass that calls a factory function to supply missing values .. versionadded:: 2.5
31===================== ==================================================================== ===========================
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000032
Raymond Hettingeracdafa82010-11-30 17:50:53 +000033In addition to the concrete container classes, the collections module provides
Éric Araujo569ff912011-08-13 02:47:38 +020034:ref:`abstract base classes <collections-abstract-base-classes>` that can be
35used to test whether a class provides a particular interface, for example,
36whether it is hashable or a mapping.
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000037
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000038
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000039:class:`Counter` objects
40------------------------
41
42A counter tool is provided to support convenient and rapid tallies.
43For example::
44
Raymond Hettinger939a3cc2009-02-04 11:31:30 +000045 >>> # Tally occurrences of words in a list
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000046 >>> cnt = Counter()
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000047 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000048 ... cnt[word] += 1
49 >>> cnt
Raymond Hettingeraaa6e632009-01-13 01:05:03 +000050 Counter({'blue': 3, 'red': 2, 'green': 1})
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000051
Raymond Hettinger939a3cc2009-02-04 11:31:30 +000052 >>> # Find the ten most common words in Hamlet
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000053 >>> import re
Raymond Hettingerdce969d2013-02-28 11:11:11 -080054 >>> words = re.findall(r'\w+', open('hamlet.txt').read().lower())
Raymond Hettinger196a0f72009-01-20 12:59:36 +000055 >>> Counter(words).most_common(10)
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000056 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
57 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
58
Raymond Hettinger82783852009-01-13 03:49:43 +000059.. class:: Counter([iterable-or-mapping])
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000060
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000061 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingeraaa6e632009-01-13 01:05:03 +000062 It is an unordered collection where elements are stored as dictionary keys
63 and their counts are stored as dictionary values. Counts are allowed to be
64 any integer value including zero or negative counts. The :class:`Counter`
65 class is similar to bags or multisets in other languages.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000066
Raymond Hettinger82783852009-01-13 03:49:43 +000067 Elements are counted from an *iterable* or initialized from another
Georg Brandlf6dab952009-04-28 21:48:35 +000068 *mapping* (or counter):
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000069
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +000070 >>> c = Counter() # a new, empty counter
71 >>> c = Counter('gallahad') # a new counter from an iterable
72 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
73 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000074
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000075 Counter objects have a dictionary interface except that they return a zero
Georg Brandlf6dab952009-04-28 21:48:35 +000076 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000077
Raymond Hettinger7bdca052009-01-22 05:20:47 +000078 >>> c = Counter(['eggs', 'ham'])
Raymond Hettinger59894122009-01-14 00:15:21 +000079 >>> c['bacon'] # count of a missing element is zero
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000080 0
81
Raymond Hettinger7bdca052009-01-22 05:20:47 +000082 Setting a count to zero does not remove an element from a counter.
83 Use ``del`` to remove it entirely:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000084
Raymond Hettinger7bdca052009-01-22 05:20:47 +000085 >>> c['sausage'] = 0 # counter entry with a zero count
86 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000087
88 .. versionadded:: 2.7
89
90
Ezio Melotti4edfe962010-04-04 06:50:58 +000091 Counter objects support three methods beyond those available for all
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000092 dictionaries:
93
94 .. method:: elements()
95
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000096 Return an iterator over elements repeating each as many times as its
97 count. Elements are returned in arbitrary order. If an element's count
98 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000099
Raymond Hettinger196a0f72009-01-20 12:59:36 +0000100 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000101 >>> list(c.elements())
102 ['a', 'a', 'a', 'a', 'b', 'b']
103
104 .. method:: most_common([n])
105
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000106 Return a list of the *n* most common elements and their counts from the
Andrew Kuchlingc711b612015-05-13 11:25:40 -0400107 most common to the least. If *n* is omitted or ``None``,
108 :func:`most_common` returns *all* elements in the counter.
109 Elements with equal counts are ordered arbitrarily:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000110
111 >>> Counter('abracadabra').most_common(3)
112 [('a', 5), ('r', 2), ('b', 2)]
113
Raymond Hettinger34c35b22010-04-03 10:22:00 +0000114 .. method:: subtract([iterable-or-mapping])
115
116 Elements are subtracted from an *iterable* or from another *mapping*
117 (or counter). Like :meth:`dict.update` but subtracts counts instead
118 of replacing them. Both inputs and outputs may be zero or negative.
119
120 >>> c = Counter(a=4, b=2, c=0, d=-2)
121 >>> d = Counter(a=1, b=2, c=3, d=4)
122 >>> c.subtract(d)
Andrew Svetlov78efadb2012-12-17 14:01:03 +0200123 >>> c
Raymond Hettinger34c35b22010-04-03 10:22:00 +0000124 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
125
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000126 The usual dictionary methods are available for :class:`Counter` objects
127 except for two which work differently for counters.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000128
129 .. method:: fromkeys(iterable)
130
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000131 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000132
Raymond Hettinger82783852009-01-13 03:49:43 +0000133 .. method:: update([iterable-or-mapping])
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000134
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000135 Elements are counted from an *iterable* or added-in from another
136 *mapping* (or counter). Like :meth:`dict.update` but adds counts
137 instead of replacing them. Also, the *iterable* is expected to be a
138 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000139
Raymond Hettingerfbcf7492009-01-13 08:38:14 +0000140Common patterns for working with :class:`Counter` objects::
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000141
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000142 sum(c.values()) # total of all counts
143 c.clear() # reset all counts
144 list(c) # list unique elements
145 set(c) # convert to a set
146 dict(c) # convert to a regular dictionary
147 c.items() # convert to a list of (elem, cnt) pairs
148 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
Georg Brandl97ea40a2013-10-06 12:36:39 +0200149 c.most_common()[:-n-1:-1] # n least common elements
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000150 c += Counter() # remove zero and negative counts
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000151
Raymond Hettingera6658532009-02-25 22:48:24 +0000152Several mathematical operations are provided for combining :class:`Counter`
153objects to produce multisets (counters that have counts greater than zero).
154Addition and subtraction combine counters by adding or subtracting the counts
155of corresponding elements. Intersection and union return the minimum and
156maximum of corresponding counts. Each operation can accept inputs with signed
157counts, but the output will exclude results with counts of zero or less.
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000158
Raymond Hettinger4571f342009-01-21 20:31:50 +0000159 >>> c = Counter(a=3, b=1)
160 >>> d = Counter(a=1, b=2)
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000161 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000162 Counter({'a': 4, 'b': 3})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000163 >>> c - d # subtract (keeping only positive counts)
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000164 Counter({'a': 2})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000165 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000166 Counter({'a': 1, 'b': 1})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000167 >>> c | d # union: max(c[x], d[x])
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000168 Counter({'a': 3, 'b': 2})
169
Raymond Hettinger44340e62010-04-12 21:12:06 +0000170.. note::
171
172 Counters were primarily designed to work with positive integers to represent
173 running counts; however, care was taken to not unnecessarily preclude use
174 cases needing other types or negative values. To help with those use cases,
175 this section documents the minimum range and type restrictions.
176
177 * The :class:`Counter` class itself is a dictionary subclass with no
178 restrictions on its keys and values. The values are intended to be numbers
179 representing counts, but you *could* store anything in the value field.
180
181 * The :meth:`most_common` method requires only that the values be orderable.
182
183 * For in-place operations such as ``c[key] += 1``, the value type need only
184 support addition and subtraction. So fractions, floats, and decimals would
185 work and negative values are supported. The same is also true for
186 :meth:`update` and :meth:`subtract` which allow negative and zero values
187 for both inputs and outputs.
188
189 * The multiset methods are designed only for use cases with positive values.
190 The inputs may be negative or zero, but only outputs with positive values
191 are created. There are no type restrictions, but the value type needs to
Ezio Melotti1e87da12011-10-19 10:39:35 +0300192 support addition, subtraction, and comparison.
Raymond Hettinger44340e62010-04-12 21:12:06 +0000193
194 * The :meth:`elements` method requires integer counts. It ignores zero and
195 negative counts.
196
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000197.. seealso::
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000198
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000199 * `Counter class <http://code.activestate.com/recipes/576611/>`_
200 adapted for Python 2.5 and an early `Bag recipe
201 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
202
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000203 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
204 in Smalltalk.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000205
Éric Araujof06568c2011-08-19 09:00:56 +0200206 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000207
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000208 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000209 tutorial with examples.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000210
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000211 * For mathematical operations on multisets and their use cases, see
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000212 *Knuth, Donald. The Art of Computer Programming Volume II,
Éric Araujof06568c2011-08-19 09:00:56 +0200213 Section 4.6.3, Exercise 19*.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000214
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000215 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd081abc2009-01-27 02:58:49 +0000216 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000217
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000218 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerfbcf7492009-01-13 08:38:14 +0000219
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000220
Georg Brandl8ec7f652007-08-15 14:28:01 +0000221:class:`deque` objects
222----------------------
223
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000224.. class:: deque([iterable[, maxlen]])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000225
226 Returns a new deque object initialized left-to-right (using :meth:`append`) with
227 data from *iterable*. If *iterable* is not specified, the new deque is empty.
228
229 Deques are a generalization of stacks and queues (the name is pronounced "deck"
230 and is short for "double-ended queue"). Deques support thread-safe, memory
231 efficient appends and pops from either side of the deque with approximately the
232 same O(1) performance in either direction.
233
234 Though :class:`list` objects support similar operations, they are optimized for
235 fast fixed-length operations and incur O(n) memory movement costs for
236 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
237 position of the underlying data representation.
238
239 .. versionadded:: 2.4
240
Raymond Hettinger68995862007-10-10 00:26:46 +0000241 If *maxlen* is not specified or is *None*, deques may grow to an
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000242 arbitrary length. Otherwise, the deque is bounded to the specified maximum
243 length. Once a bounded length deque is full, when new items are added, a
244 corresponding number of items are discarded from the opposite end. Bounded
245 length deques provide functionality similar to the ``tail`` filter in
246 Unix. They are also useful for tracking transactions and other pools of data
247 where only the most recent activity is of interest.
248
249 .. versionchanged:: 2.6
Georg Brandlb19be572007-12-29 10:57:00 +0000250 Added *maxlen* parameter.
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000251
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000252 Deque objects support the following methods:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000253
254
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000255 .. method:: append(x)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000256
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000257 Add *x* to the right side of the deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000258
259
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000260 .. method:: appendleft(x)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000261
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000262 Add *x* to the left side of the deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000263
264
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000265 .. method:: clear()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000266
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000267 Remove all elements from the deque leaving it with length 0.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000268
269
Raymond Hettinger5f516ed2010-04-03 18:10:37 +0000270 .. method:: count(x)
271
272 Count the number of deque elements equal to *x*.
273
274 .. versionadded:: 2.7
275
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000276 .. method:: extend(iterable)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000277
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000278 Extend the right side of the deque by appending elements from the iterable
279 argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000280
281
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000282 .. method:: extendleft(iterable)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000283
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000284 Extend the left side of the deque by appending elements from *iterable*.
285 Note, the series of left appends results in reversing the order of
286 elements in the iterable argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000287
288
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000289 .. method:: pop()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000290
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000291 Remove and return an element from the right side of the deque. If no
292 elements are present, raises an :exc:`IndexError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000293
294
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000295 .. method:: popleft()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000296
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000297 Remove and return an element from the left side of the deque. If no
298 elements are present, raises an :exc:`IndexError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000299
300
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000301 .. method:: remove(value)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000302
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000303 Removed the first occurrence of *value*. If not found, raises a
304 :exc:`ValueError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000305
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000306 .. versionadded:: 2.5
Georg Brandl8ec7f652007-08-15 14:28:01 +0000307
Raymond Hettingera5fd24e2009-12-10 06:42:54 +0000308 .. method:: reverse()
309
310 Reverse the elements of the deque in-place and then return ``None``.
311
312 .. versionadded:: 2.7
Georg Brandl8ec7f652007-08-15 14:28:01 +0000313
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000314 .. method:: rotate(n)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000315
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000316 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
317 the left. Rotating one step to the right is equivalent to:
318 ``d.appendleft(d.pop())``.
319
Georg Brandl8ec7f652007-08-15 14:28:01 +0000320
Raymond Hettinger56411aa2009-03-10 12:50:59 +0000321 Deque objects also provide one read-only attribute:
322
323 .. attribute:: maxlen
324
325 Maximum size of a deque or *None* if unbounded.
326
327 .. versionadded:: 2.7
328
329
Georg Brandl8ec7f652007-08-15 14:28:01 +0000330In addition to the above, deques support iteration, pickling, ``len(d)``,
331``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson5c4e0062008-10-16 18:52:14 +0000332the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
333access is O(1) at both ends but slows to O(n) in the middle. For fast random
334access, use lists instead.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000335
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000336Example:
337
338.. doctest::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000339
340 >>> from collections import deque
341 >>> d = deque('ghi') # make a new deque with three items
342 >>> for elem in d: # iterate over the deque's elements
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000343 ... print elem.upper()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000344 G
345 H
346 I
347
348 >>> d.append('j') # add a new entry to the right side
349 >>> d.appendleft('f') # add a new entry to the left side
350 >>> d # show the representation of the deque
351 deque(['f', 'g', 'h', 'i', 'j'])
352
353 >>> d.pop() # return and remove the rightmost item
354 'j'
355 >>> d.popleft() # return and remove the leftmost item
356 'f'
357 >>> list(d) # list the contents of the deque
358 ['g', 'h', 'i']
359 >>> d[0] # peek at leftmost item
360 'g'
361 >>> d[-1] # peek at rightmost item
362 'i'
363
364 >>> list(reversed(d)) # list the contents of a deque in reverse
365 ['i', 'h', 'g']
366 >>> 'h' in d # search the deque
367 True
368 >>> d.extend('jkl') # add multiple elements at once
369 >>> d
370 deque(['g', 'h', 'i', 'j', 'k', 'l'])
371 >>> d.rotate(1) # right rotation
372 >>> d
373 deque(['l', 'g', 'h', 'i', 'j', 'k'])
374 >>> d.rotate(-1) # left rotation
375 >>> d
376 deque(['g', 'h', 'i', 'j', 'k', 'l'])
377
378 >>> deque(reversed(d)) # make a new deque in reverse order
379 deque(['l', 'k', 'j', 'i', 'h', 'g'])
380 >>> d.clear() # empty the deque
381 >>> d.pop() # cannot pop from an empty deque
382 Traceback (most recent call last):
383 File "<pyshell#6>", line 1, in -toplevel-
384 d.pop()
385 IndexError: pop from an empty deque
386
387 >>> d.extendleft('abc') # extendleft() reverses the input order
388 >>> d
389 deque(['c', 'b', 'a'])
390
391
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000392:class:`deque` Recipes
393^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl8ec7f652007-08-15 14:28:01 +0000394
395This section shows various approaches to working with deques.
396
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000397Bounded length deques provide functionality similar to the ``tail`` filter
398in Unix::
399
400 def tail(filename, n=10):
401 'Return the last n lines of a file'
402 return deque(open(filename), n)
403
404Another approach to using deques is to maintain a sequence of recently
405added elements by appending to the right and popping to the left::
406
407 def moving_average(iterable, n=3):
408 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
409 # http://en.wikipedia.org/wiki/Moving_average
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000410 it = iter(iterable)
Raymond Hettinger9b6f13e2009-05-22 01:06:44 +0000411 d = deque(itertools.islice(it, n-1))
412 d.appendleft(0)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000413 s = sum(d)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000414 for elem in it:
415 s += elem - d.popleft()
416 d.append(elem)
Raymond Hettinger9b6f13e2009-05-22 01:06:44 +0000417 yield s / float(n)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000418
Georg Brandl8ec7f652007-08-15 14:28:01 +0000419The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti062d2b52009-12-19 22:41:49 +0000420deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl8ec7f652007-08-15 14:28:01 +0000421the :meth:`rotate` method to position elements to be popped::
422
423 def delete_nth(d, n):
424 d.rotate(-n)
425 d.popleft()
426 d.rotate(n)
427
428To implement :class:`deque` slicing, use a similar approach applying
429:meth:`rotate` to bring a target element to the left side of the deque. Remove
430old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
431reverse the rotation.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000432With minor variations on that approach, it is easy to implement Forth style
433stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
434``rot``, and ``roll``.
435
Georg Brandl8ec7f652007-08-15 14:28:01 +0000436
437:class:`defaultdict` objects
438----------------------------
439
Georg Brandl8ec7f652007-08-15 14:28:01 +0000440.. class:: defaultdict([default_factory[, ...]])
441
442 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandld7d4fd72009-07-26 14:37:28 +0000443 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl8ec7f652007-08-15 14:28:01 +0000444 instance variable. The remaining functionality is the same as for the
445 :class:`dict` class and is not documented here.
446
447 The first argument provides the initial value for the :attr:`default_factory`
448 attribute; it defaults to ``None``. All remaining arguments are treated the same
449 as if they were passed to the :class:`dict` constructor, including keyword
450 arguments.
451
452 .. versionadded:: 2.5
453
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000454 :class:`defaultdict` objects support the following method in addition to the
455 standard :class:`dict` operations:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000456
Éric Araujof06568c2011-08-19 09:00:56 +0200457 .. method:: __missing__(key)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000458
Skip Montanarob40890d2008-09-17 11:50:36 +0000459 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000460 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000461
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000462 If :attr:`default_factory` is not ``None``, it is called without arguments
463 to provide a default value for the given *key*, this value is inserted in
464 the dictionary for the *key*, and returned.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000465
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000466 If calling :attr:`default_factory` raises an exception this exception is
467 propagated unchanged.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000468
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000469 This method is called by the :meth:`__getitem__` method of the
470 :class:`dict` class when the requested key is not found; whatever it
471 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000472
Benjamin Peterson78028b02012-01-27 09:14:01 -0500473 Note that :meth:`__missing__` is *not* called for any operations besides
474 :meth:`__getitem__`. This means that :meth:`get` will, like normal
475 dictionaries, return ``None`` as a default rather than using
476 :attr:`default_factory`.
477
Georg Brandl8ec7f652007-08-15 14:28:01 +0000478
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000479 :class:`defaultdict` objects support the following instance variable:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000480
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000481
Éric Araujof06568c2011-08-19 09:00:56 +0200482 .. attribute:: default_factory
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000483
484 This attribute is used by the :meth:`__missing__` method; it is
485 initialized from the first argument to the constructor, if present, or to
486 ``None``, if absent.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000487
488
Georg Brandl8ec7f652007-08-15 14:28:01 +0000489:class:`defaultdict` Examples
490^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
491
492Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000493sequence of key-value pairs into a dictionary of lists:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000494
495 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
496 >>> d = defaultdict(list)
497 >>> for k, v in s:
498 ... d[k].append(v)
499 ...
500 >>> d.items()
501 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
502
503When each key is encountered for the first time, it is not already in the
504mapping; so an entry is automatically created using the :attr:`default_factory`
505function which returns an empty :class:`list`. The :meth:`list.append`
506operation then attaches the value to the new list. When keys are encountered
507again, the look-up proceeds normally (returning the list for that key) and the
508:meth:`list.append` operation adds another value to the list. This technique is
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000509simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000510
511 >>> d = {}
512 >>> for k, v in s:
513 ... d.setdefault(k, []).append(v)
514 ...
515 >>> d.items()
516 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
517
518Setting the :attr:`default_factory` to :class:`int` makes the
519:class:`defaultdict` useful for counting (like a bag or multiset in other
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000520languages):
Georg Brandl8ec7f652007-08-15 14:28:01 +0000521
522 >>> s = 'mississippi'
523 >>> d = defaultdict(int)
524 >>> for k in s:
525 ... d[k] += 1
526 ...
527 >>> d.items()
528 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
529
530When a letter is first encountered, it is missing from the mapping, so the
531:attr:`default_factory` function calls :func:`int` to supply a default count of
532zero. The increment operation then builds up the count for each letter.
533
534The function :func:`int` which always returns zero is just a special case of
535constant functions. A faster and more flexible way to create constant functions
536is to use :func:`itertools.repeat` which can supply any constant value (not just
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000537zero):
Georg Brandl8ec7f652007-08-15 14:28:01 +0000538
539 >>> def constant_factory(value):
540 ... return itertools.repeat(value).next
541 >>> d = defaultdict(constant_factory('<missing>'))
542 >>> d.update(name='John', action='ran')
543 >>> '%(name)s %(action)s to %(object)s' % d
544 'John ran to <missing>'
545
546Setting the :attr:`default_factory` to :class:`set` makes the
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000547:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000548
549 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
550 >>> d = defaultdict(set)
551 >>> for k, v in s:
552 ... d[k].add(v)
553 ...
554 >>> d.items()
555 [('blue', set([2, 4])), ('red', set([1, 3]))]
556
557
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000558:func:`namedtuple` Factory Function for Tuples with Named Fields
Georg Brandlb3255ed2008-01-07 16:43:47 +0000559----------------------------------------------------------------
Georg Brandl8ec7f652007-08-15 14:28:01 +0000560
Raymond Hettingercbab5942007-09-18 22:18:02 +0000561Named tuples assign meaning to each position in a tuple and allow for more readable,
562self-documenting code. They can be used wherever regular tuples are used, and
563they add the ability to access fields by name instead of position index.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000564
Georg Brandl27fa4822010-10-17 06:24:10 +0000565.. function:: namedtuple(typename, field_names, [verbose=False], [rename=False])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000566
567 Returns a new tuple subclass named *typename*. The new subclass is used to
Georg Brandl907a7202008-02-22 12:31:45 +0000568 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl8ec7f652007-08-15 14:28:01 +0000569 well as being indexable and iterable. Instances of the subclass also have a
Georg Brandl061d2e22008-11-23 19:17:25 +0000570 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl8ec7f652007-08-15 14:28:01 +0000571 method which lists the tuple contents in a ``name=value`` format.
572
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700573 The *field_names* are a sequence of strings such as ``['x', 'y']``.
574 Alternatively, *field_names* can be a single string with each fieldname
575 separated by whitespace and/or commas, for example ``'x y'`` or ``'x, y'``.
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000576
577 Any valid Python identifier may be used for a fieldname except for names
Raymond Hettinger42da8742007-12-14 02:49:47 +0000578 starting with an underscore. Valid identifiers consist of letters, digits,
579 and underscores but do not start with a digit or underscore and cannot be
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000580 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*, *print*,
581 or *raise*.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000582
Raymond Hettinger322daea2009-02-10 01:24:05 +0000583 If *rename* is true, invalid fieldnames are automatically replaced
584 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger6df48a32009-04-02 22:34:17 +0000585 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Raymond Hettinger322daea2009-02-10 01:24:05 +0000586 ``def`` and the duplicate fieldname ``abc``.
587
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000588 If *verbose* is true, the class definition is printed just before being built.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000589
Raymond Hettingera48a2992007-10-08 21:26:58 +0000590 Named tuple instances do not have per-instance dictionaries, so they are
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000591 lightweight and require no more memory than regular tuples.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000592
Georg Brandl8ec7f652007-08-15 14:28:01 +0000593 .. versionadded:: 2.6
594
Raymond Hettinger322daea2009-02-10 01:24:05 +0000595 .. versionchanged:: 2.7
596 added support for *rename*.
597
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000598Example:
599
600.. doctest::
601 :options: +NORMALIZE_WHITESPACE
Georg Brandl8ec7f652007-08-15 14:28:01 +0000602
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700603 >>> Point = namedtuple('Point', ['x', 'y'], verbose=True)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000604 class Point(tuple):
Raymond Hettinger491f7072012-06-08 13:24:12 -0700605 'Point(x, y)'
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000606 <BLANKLINE>
Raymond Hettinger491f7072012-06-08 13:24:12 -0700607 __slots__ = ()
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000608 <BLANKLINE>
Raymond Hettinger491f7072012-06-08 13:24:12 -0700609 _fields = ('x', 'y')
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000610 <BLANKLINE>
Raymond Hettinger491f7072012-06-08 13:24:12 -0700611 def __new__(_cls, x, y):
612 'Create a new instance of Point(x, y)'
613 return _tuple.__new__(_cls, (x, y))
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000614 <BLANKLINE>
Raymond Hettinger491f7072012-06-08 13:24:12 -0700615 @classmethod
616 def _make(cls, iterable, new=tuple.__new__, len=len):
617 'Make a new Point object from a sequence or iterable'
618 result = new(cls, iterable)
619 if len(result) != 2:
620 raise TypeError('Expected 2 arguments, got %d' % len(result))
621 return result
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000622 <BLANKLINE>
Raymond Hettinger491f7072012-06-08 13:24:12 -0700623 def __repr__(self):
624 'Return a nicely formatted representation string'
625 return 'Point(x=%r, y=%r)' % self
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000626 <BLANKLINE>
Raymond Hettinger491f7072012-06-08 13:24:12 -0700627 def _asdict(self):
628 'Return a new OrderedDict which maps field names to their values'
629 return OrderedDict(zip(self._fields, self))
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000630 <BLANKLINE>
Raymond Hettinger0801d732013-05-03 00:59:20 -0700631 def _replace(_self, **kwds):
Raymond Hettinger491f7072012-06-08 13:24:12 -0700632 'Return a new Point object replacing specified fields with new values'
633 result = _self._make(map(kwds.pop, ('x', 'y'), _self))
634 if kwds:
635 raise ValueError('Got unexpected field names: %r' % kwds.keys())
636 return result
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000637 <BLANKLINE>
Raymond Hettinger491f7072012-06-08 13:24:12 -0700638 def __getnewargs__(self):
639 'Return self as a plain tuple. Used by copy and pickle.'
640 return tuple(self)
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000641 <BLANKLINE>
Raymond Hettinger7393c692013-05-27 10:58:55 -0700642 __dict__ = _property(_asdict)
643 <BLANKLINE>
644 def __getstate__(self):
645 'Exclude the OrderedDict from pickling'
646 pass
647 <BLANKLINE>
Raymond Hettinger491f7072012-06-08 13:24:12 -0700648 x = _property(_itemgetter(0), doc='Alias for field number 0')
649 <BLANKLINE>
650 y = _property(_itemgetter(1), doc='Alias for field number 1')
651 <BLANKLINE>
Georg Brandl8ec7f652007-08-15 14:28:01 +0000652
Raymond Hettingercbab5942007-09-18 22:18:02 +0000653 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Raymond Hettinger88880b22007-12-18 00:13:45 +0000654 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000655 33
656 >>> x, y = p # unpack like a regular tuple
657 >>> x, y
658 (11, 22)
Georg Brandl907a7202008-02-22 12:31:45 +0000659 >>> p.x + p.y # fields also accessible by name
Raymond Hettingercbab5942007-09-18 22:18:02 +0000660 33
661 >>> p # readable __repr__ with a name=value style
662 Point(x=11, y=22)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000663
Raymond Hettingercbab5942007-09-18 22:18:02 +0000664Named tuples are especially useful for assigning field names to result tuples returned
665by the :mod:`csv` or :mod:`sqlite3` modules::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000666
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000667 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Raymond Hettingera48a2992007-10-08 21:26:58 +0000668
Raymond Hettingercbab5942007-09-18 22:18:02 +0000669 import csv
Raymond Hettinger02740f72008-01-05 01:35:43 +0000670 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Raymond Hettingercbab5942007-09-18 22:18:02 +0000671 print emp.name, emp.title
Georg Brandl8ec7f652007-08-15 14:28:01 +0000672
Raymond Hettingera48a2992007-10-08 21:26:58 +0000673 import sqlite3
674 conn = sqlite3.connect('/companydata')
675 cursor = conn.cursor()
676 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Raymond Hettinger02740f72008-01-05 01:35:43 +0000677 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Raymond Hettingera48a2992007-10-08 21:26:58 +0000678 print emp.name, emp.title
679
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000680In addition to the methods inherited from tuples, named tuples support
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000681three additional methods and one attribute. To prevent conflicts with
682field names, the method and attribute names start with an underscore.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000683
Benjamin Peterson682f6032010-07-18 14:26:34 +0000684.. classmethod:: somenamedtuple._make(iterable)
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000685
Raymond Hettinger02740f72008-01-05 01:35:43 +0000686 Class method that makes a new instance from an existing sequence or iterable.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000687
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000688 .. doctest::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000689
Raymond Hettinger02740f72008-01-05 01:35:43 +0000690 >>> t = [11, 22]
691 >>> Point._make(t)
692 Point(x=11, y=22)
Raymond Hettinger2b03d452007-09-18 03:33:19 +0000693
Georg Brandlb3255ed2008-01-07 16:43:47 +0000694.. method:: somenamedtuple._asdict()
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000695
Raymond Hettinger88a91642009-03-03 04:51:24 +0000696 Return a new :class:`OrderedDict` which maps field names to their corresponding
697 values::
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000698
Raymond Hettingerfe2f2872015-08-30 14:48:42 -0700699 >>> p = Point(x=11, y=22)
Raymond Hettinger42da8742007-12-14 02:49:47 +0000700 >>> p._asdict()
Raymond Hettinger88a91642009-03-03 04:51:24 +0000701 OrderedDict([('x', 11), ('y', 22)])
702
Raymond Hettingera07038d2009-03-03 05:11:56 +0000703 .. versionchanged:: 2.7
Raymond Hettinger88a91642009-03-03 04:51:24 +0000704 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000705
Georg Brandlb3255ed2008-01-07 16:43:47 +0000706.. method:: somenamedtuple._replace(kwargs)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000707
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000708 Return a new instance of the named tuple replacing specified fields with new
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000709 values::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000710
Raymond Hettingercbab5942007-09-18 22:18:02 +0000711 >>> p = Point(x=11, y=22)
Raymond Hettinger42da8742007-12-14 02:49:47 +0000712 >>> p._replace(x=33)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000713 Point(x=33, y=22)
714
Raymond Hettinger7c3738e2007-11-15 03:16:09 +0000715 >>> for partnum, record in inventory.items():
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700716 inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000717
Georg Brandlb3255ed2008-01-07 16:43:47 +0000718.. attribute:: somenamedtuple._fields
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000719
Raymond Hettingerf6b769b2008-01-07 21:33:51 +0000720 Tuple of strings listing the field names. Useful for introspection
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000721 and for creating new named tuple types from existing named tuples.
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000722
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000723 .. doctest::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000724
Raymond Hettinger42da8742007-12-14 02:49:47 +0000725 >>> p._fields # view the field names
Raymond Hettingercbab5942007-09-18 22:18:02 +0000726 ('x', 'y')
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000727
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000728 >>> Color = namedtuple('Color', 'red green blue')
Raymond Hettinger42da8742007-12-14 02:49:47 +0000729 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000730 >>> Pixel(11, 22, 128, 255, 0)
Raymond Hettingerdc1854d2008-01-09 03:13:20 +0000731 Pixel(x=11, y=22, red=128, green=255, blue=0)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000732
Raymond Hettingere846f382007-12-14 21:51:50 +0000733To retrieve a field whose name is stored in a string, use the :func:`getattr`
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000734function:
Raymond Hettingere846f382007-12-14 21:51:50 +0000735
736 >>> getattr(p, 'x')
737 11
738
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000739To convert a dictionary to a named tuple, use the double-star-operator
740(as described in :ref:`tut-unpacking-arguments`):
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000741
742 >>> d = {'x': 11, 'y': 22}
743 >>> Point(**d)
744 Point(x=11, y=22)
745
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000746Since a named tuple is a regular Python class, it is easy to add or change
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000747functionality with a subclass. Here is how to add a calculated field and
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000748a fixed-width print format:
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000749
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000750 >>> class Point(namedtuple('Point', 'x y')):
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700751 __slots__ = ()
752 @property
753 def hypot(self):
754 return (self.x ** 2 + self.y ** 2) ** 0.5
755 def __str__(self):
756 return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000757
Raymond Hettingere1655082008-01-10 19:15:10 +0000758 >>> for p in Point(3, 4), Point(14, 5/7.):
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700759 print p
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000760 Point: x= 3.000 y= 4.000 hypot= 5.000
761 Point: x=14.000 y= 0.714 hypot=14.018
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000762
Georg Brandlfe8df4f2009-12-28 08:01:59 +0000763The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Raymond Hettinger171f3912008-01-16 23:38:16 +0000764keep memory requirements low by preventing the creation of instance dictionaries.
Raymond Hettingerf59e9622008-01-15 20:52:42 +0000765
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000766Subclassing is not useful for adding new, stored fields. Instead, simply
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000767create a new named tuple type from the :attr:`_fields` attribute:
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000768
Raymond Hettingere850c462008-01-10 20:37:12 +0000769 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000770
Raymond Hettingerfb3ced62008-01-07 20:17:35 +0000771Default values can be implemented by using :meth:`_replace` to
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000772customize a prototype instance:
Raymond Hettingerbc693492007-11-15 22:39:34 +0000773
774 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Raymond Hettinger0fe6ca42008-01-18 21:14:58 +0000775 >>> default_account = Account('<owner name>', 0.0, 0)
776 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerbc693492007-11-15 22:39:34 +0000777
Raymond Hettinger5a9fed72008-05-08 07:23:30 +0000778Enumerated constants can be implemented with named tuples, but it is simpler
779and more efficient to use a simple class declaration:
780
781 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
782 >>> Status.open, Status.pending, Status.closed
783 (0, 1, 2)
784 >>> class Status:
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700785 open, pending, closed = range(3)
Raymond Hettinger5a9fed72008-05-08 07:23:30 +0000786
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000787.. seealso::
Mark Summerfield7f626f42007-08-30 15:03:03 +0000788
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000789 `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
790 adapted for Python 2.4.
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000791
792
793:class:`OrderedDict` objects
794----------------------------
795
796Ordered dictionaries are just like regular dictionaries but they remember the
797order that items were inserted. When iterating over an ordered dictionary,
798the items are returned in the order their keys were first added.
799
800.. class:: OrderedDict([items])
801
802 Return an instance of a dict subclass, supporting the usual :class:`dict`
803 methods. An *OrderedDict* is a dict that remembers the order that keys
804 were first inserted. If a new entry overwrites an existing entry, the
805 original insertion position is left unchanged. Deleting an entry and
806 reinserting it will move it to the end.
807
808 .. versionadded:: 2.7
809
Raymond Hettinger24122992009-03-19 19:59:58 +0000810.. method:: OrderedDict.popitem(last=True)
811
812 The :meth:`popitem` method for ordered dictionaries returns and removes
813 a (key, value) pair. The pairs are returned in LIFO order if *last* is
814 true or FIFO order if false.
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000815
Raymond Hettinger50f362f2009-05-19 17:43:59 +0000816In addition to the usual mapping methods, ordered dictionaries also support
817reverse iteration using :func:`reversed`.
818
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000819Equality tests between :class:`OrderedDict` objects are order-sensitive
820and are implemented as ``list(od1.items())==list(od2.items())``.
821Equality tests between :class:`OrderedDict` objects and other
Serhiy Storchakab33336f2013-10-13 23:09:00 +0300822:class:`Mapping` objects are order-insensitive like regular
823dictionaries. This allows :class:`OrderedDict` objects to be substituted
824anywhere a regular dictionary is used.
Raymond Hettinger24122992009-03-19 19:59:58 +0000825
Raymond Hettingerc473c5a2009-04-09 22:31:51 +0000826The :class:`OrderedDict` constructor and :meth:`update` method both accept
827keyword arguments, but their order is lost because Python's function call
828semantics pass-in keyword arguments using a regular unordered dictionary.
829
Raymond Hettinger24122992009-03-19 19:59:58 +0000830.. seealso::
831
832 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
833 that runs on Python 2.4 or later.
Raymond Hettinger610326d2009-11-10 19:35:55 +0000834
Raymond Hettingerdf453fb2011-04-15 17:55:36 -0700835:class:`OrderedDict` Examples and Recipes
836^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
837
Raymond Hettinger610326d2009-11-10 19:35:55 +0000838Since an ordered dictionary remembers its insertion order, it can be used
839in conjuction with sorting to make a sorted dictionary::
840
841 >>> # regular unsorted dictionary
842 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
843
844 >>> # dictionary sorted by key
845 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
846 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
847
848 >>> # dictionary sorted by value
849 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
850 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
851
852 >>> # dictionary sorted by length of the key string
853 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
854 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
855
856The new sorted dictionaries maintain their sort order when entries
857are deleted. But when new keys are added, the keys are appended
858to the end and the sort is not maintained.
Georg Brandlad8ac862010-08-01 19:21:26 +0000859
860It is also straight-forward to create an ordered dictionary variant
Andrew Svetlov2b326e42012-08-31 13:53:30 +0300861that remembers the order the keys were *last* inserted.
Georg Brandlad8ac862010-08-01 19:21:26 +0000862If a new entry overwrites an existing entry, the
863original insertion position is changed and moved to the end::
864
865 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandl28dadd92011-02-25 10:50:32 +0000866 'Store items in the order the keys were last added'
Éric Araujo8fde9502011-07-29 11:34:17 +0200867
Georg Brandlad8ac862010-08-01 19:21:26 +0000868 def __setitem__(self, key, value):
869 if key in self:
870 del self[key]
871 OrderedDict.__setitem__(self, key, value)
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000872
Éric Araujof06568c2011-08-19 09:00:56 +0200873An ordered dictionary can be combined with the :class:`Counter` class
Raymond Hettingerdf453fb2011-04-15 17:55:36 -0700874so that the counter remembers the order elements are first encountered::
875
876 class OrderedCounter(Counter, OrderedDict):
877 'Counter that remembers the order elements are first encountered'
878
Raymond Hettingerdf453fb2011-04-15 17:55:36 -0700879 def __repr__(self):
880 return '%s(%r)' % (self.__class__.__name__, OrderedDict(self))
881
882 def __reduce__(self):
883 return self.__class__, (OrderedDict(self),)
884
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000885
Éric Araujo8fde9502011-07-29 11:34:17 +0200886.. _collections-abstract-base-classes:
Georg Brandl950b3d82010-12-04 19:02:20 +0000887
Éric Araujo8fde9502011-07-29 11:34:17 +0200888Collections Abstract Base Classes
889---------------------------------
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000890
Ezio Melottid1135122011-03-28 13:46:28 +0300891The collections module offers the following :term:`ABCs <abstract base class>`:
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000892
893========================= ===================== ====================== ====================================================
Ezio Melottid1135122011-03-28 13:46:28 +0300894ABC Inherits from Abstract Methods Mixin Methods
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000895========================= ===================== ====================== ====================================================
896:class:`Container` ``__contains__``
897:class:`Hashable` ``__hash__``
898:class:`Iterable` ``__iter__``
899:class:`Iterator` :class:`Iterable` ``next`` ``__iter__``
900:class:`Sized` ``__len__``
901:class:`Callable` ``__call__``
902
Raymond Hettingercb800642013-03-23 08:57:00 -0700903:class:`Sequence` :class:`Sized`, ``__getitem__``, ``__contains__``, ``__iter__``, ``__reversed__``,
904 :class:`Iterable`, ``__len__`` ``index``, and ``count``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000905 :class:`Container`
906
Raymond Hettingercb800642013-03-23 08:57:00 -0700907:class:`MutableSequence` :class:`Sequence` ``__getitem__``, Inherited :class:`Sequence` methods and
908 ``__setitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
909 ``__delitem__``, ``remove``, and ``__iadd__``
910 ``__len__``,
911 ``insert``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000912
Raymond Hettingercb800642013-03-23 08:57:00 -0700913:class:`Set` :class:`Sized`, ``__contains__``, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
914 :class:`Iterable`, ``__iter__``, ``__gt__``, ``__ge__``, ``__and__``, ``__or__``,
915 :class:`Container` ``__len__`` ``__sub__``, ``__xor__``, and ``isdisjoint``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000916
Raymond Hettingercb800642013-03-23 08:57:00 -0700917:class:`MutableSet` :class:`Set` ``__contains__``, Inherited :class:`Set` methods and
918 ``__iter__``, ``clear``, ``pop``, ``remove``, ``__ior__``,
919 ``__len__``, ``__iand__``, ``__ixor__``, and ``__isub__``
920 ``add``,
921 ``discard``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000922
Raymond Hettingercb800642013-03-23 08:57:00 -0700923:class:`Mapping` :class:`Sized`, ``__getitem__``, ``__contains__``, ``keys``, ``items``, ``values``,
924 :class:`Iterable`, ``__iter__``, ``get``, ``__eq__``, and ``__ne__``
925 :class:`Container` ``__len__``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000926
Raymond Hettingercb800642013-03-23 08:57:00 -0700927:class:`MutableMapping` :class:`Mapping` ``__getitem__``, Inherited :class:`Mapping` methods and
928 ``__setitem__``, ``pop``, ``popitem``, ``clear``, ``update``,
929 ``__delitem__``, and ``setdefault``
930 ``__iter__``,
931 ``__len__``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000932
933
934:class:`MappingView` :class:`Sized` ``__len__``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000935:class:`ItemsView` :class:`MappingView`, ``__contains__``,
936 :class:`Set` ``__iter__``
Ezio Melottid1135122011-03-28 13:46:28 +0300937:class:`KeysView` :class:`MappingView`, ``__contains__``,
938 :class:`Set` ``__iter__``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000939:class:`ValuesView` :class:`MappingView` ``__contains__``, ``__iter__``
940========================= ===================== ====================== ====================================================
941
Ezio Melottid1135122011-03-28 13:46:28 +0300942
943.. class:: Container
944 Hashable
945 Sized
946 Callable
947
948 ABCs for classes that provide respectively the methods :meth:`__contains__`,
949 :meth:`__hash__`, :meth:`__len__`, and :meth:`__call__`.
950
951.. class:: Iterable
952
953 ABC for classes that provide the :meth:`__iter__` method.
954 See also the definition of :term:`iterable`.
955
956.. class:: Iterator
957
Serhiy Storchakaea217722014-09-05 23:34:12 +0300958 ABC for classes that provide the :meth:`~iterator.__iter__` and
959 :meth:`~iterator.next` methods. See also the definition of :term:`iterator`.
Ezio Melottid1135122011-03-28 13:46:28 +0300960
961.. class:: Sequence
962 MutableSequence
963
964 ABCs for read-only and mutable :term:`sequences <sequence>`.
965
966.. class:: Set
967 MutableSet
968
969 ABCs for read-only and mutable sets.
970
971.. class:: Mapping
972 MutableMapping
973
974 ABCs for read-only and mutable :term:`mappings <mapping>`.
975
976.. class:: MappingView
977 ItemsView
978 KeysView
979 ValuesView
980
Martin Panter4a337242015-10-07 10:19:39 +0000981 ABCs for mapping, items, keys, and values :term:`views <dictionary view>`.
Ezio Melottid1135122011-03-28 13:46:28 +0300982
983
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000984These ABCs allow us to ask classes or instances if they provide
985particular functionality, for example::
986
987 size = None
988 if isinstance(myvar, collections.Sized):
989 size = len(myvar)
990
991Several of the ABCs are also useful as mixins that make it easier to develop
992classes supporting container APIs. For example, to write a class supporting
993the full :class:`Set` API, it only necessary to supply the three underlying
994abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
995The ABC supplies the remaining methods such as :meth:`__and__` and
996:meth:`isdisjoint` ::
997
998 class ListBasedSet(collections.Set):
999 ''' Alternate set implementation favoring space over speed
1000 and not requiring the set elements to be hashable. '''
1001 def __init__(self, iterable):
1002 self.elements = lst = []
1003 for value in iterable:
1004 if value not in lst:
1005 lst.append(value)
1006 def __iter__(self):
1007 return iter(self.elements)
1008 def __contains__(self, value):
1009 return value in self.elements
1010 def __len__(self):
1011 return len(self.elements)
1012
1013 s1 = ListBasedSet('abcdef')
1014 s2 = ListBasedSet('defghi')
1015 overlap = s1 & s2 # The __and__() method is supported automatically
1016
1017Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
1018
1019(1)
1020 Since some set operations create new sets, the default mixin methods need
1021 a way to create new instances from an iterable. The class constructor is
1022 assumed to have a signature in the form ``ClassName(iterable)``.
1023 That assumption is factored-out to an internal classmethod called
1024 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
1025 If the :class:`Set` mixin is being used in a class with a different
Raymond Hettinger72507302011-03-22 09:06:38 -07001026 constructor signature, you will need to override :meth:`_from_iterable`
Raymond Hettinger3de8a302010-08-08 00:35:03 +00001027 with a classmethod that can construct new instances from
1028 an iterable argument.
1029
1030(2)
1031 To override the comparisons (presumably for speed, as the
Raymond Hettinger809b6652014-07-03 00:30:52 +01001032 semantics are fixed), redefine :meth:`__le__` and :meth:`__ge__`,
Raymond Hettinger3de8a302010-08-08 00:35:03 +00001033 then the other operations will automatically follow suit.
1034
1035(3)
1036 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
1037 for the set; however, :meth:`__hash__` is not defined because not all sets
1038 are hashable or immutable. To add set hashabilty using mixins,
1039 inherit from both :meth:`Set` and :meth:`Hashable`, then define
1040 ``__hash__ = Set._hash``.
1041
1042.. seealso::
1043
1044 * `OrderedSet recipe <http://code.activestate.com/recipes/576694/>`_ for an
1045 example built on :class:`MutableSet`.
1046
1047 * For more about ABCs, see the :mod:`abc` module and :pep:`3119`.