blob: 6433cd290f7ec1f74aa72c86537358567b5888ed [file] [log] [blame]
Georg Brandl8ec7f652007-08-15 14:28:01 +00001
2:mod:`collections` --- High-performance container datatypes
3===========================================================
4
5.. module:: collections
6 :synopsis: High-performance datatypes
7.. moduleauthor:: Raymond Hettinger <python@rcn.com>
8.. sectionauthor:: Raymond Hettinger <python@rcn.com>
9
Georg Brandl8ec7f652007-08-15 14:28:01 +000010.. versionadded:: 2.4
11
Georg Brandl4c8bbe62008-03-22 21:06:20 +000012.. testsetup:: *
13
14 from collections import *
15 import itertools
16 __name__ = '<doctest>'
17
Raymond Hettinger3de8a302010-08-08 00:35:03 +000018This module implements specialized container datatypes providing alternatives to
19Python's general purpose built-in containers, :class:`dict`, :class:`list`,
20:class:`set`, and :class:`tuple`.
Georg Brandl8ec7f652007-08-15 14:28:01 +000021
Raymond Hettinger3de8a302010-08-08 00:35:03 +000022===================== ====================================================================
23:func:`namedtuple` factory function for creating tuple subclasses with named fields
24:class:`deque` list-like container with fast appends and pops on either end
25:class:`Counter` dict subclass for counting hashable objects
26:class:`OrderedDict` dict subclass that remembers the order entries were added
27:class:`defaultdict` dict subclass that calls a factory function to supply missing values
28===================== ====================================================================
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000029
Raymond Hettingerf746a1f2009-02-17 08:33:01 +000030.. versionchanged:: 2.4
31 Added :class:`deque`.
32
33.. versionchanged:: 2.5
34 Added :class:`defaultdict`.
35
36.. versionchanged:: 2.6
37 Added :func:`namedtuple` and added abstract base classes.
38
39.. versionchanged:: 2.7
Raymond Hettingerbc512d32009-03-03 04:45:34 +000040 Added :class:`Counter` and :class:`OrderedDict`.
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000041
42In addition to containers, the collections module provides some ABCs
Georg Brandl4c8bbe62008-03-22 21:06:20 +000043(abstract base classes) that can be used to test whether a class
Raymond Hettingerf746a1f2009-02-17 08:33:01 +000044provides a particular interface, for example, whether it is hashable or
Georg Brandl4c8bbe62008-03-22 21:06:20 +000045a mapping.
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000046
Raymond Hettingere679a372010-11-05 23:58:42 +000047.. seealso::
48
49 Latest version of the `collections module Python source code
50 <http://svn.python.org/view/python/branches/release27-maint/Lib/collections.py?view=markup>`_
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000051
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000052:class:`Counter` objects
53------------------------
54
55A counter tool is provided to support convenient and rapid tallies.
56For example::
57
Raymond Hettinger939a3cc2009-02-04 11:31:30 +000058 >>> # Tally occurrences of words in a list
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000059 >>> cnt = Counter()
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000060 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000061 ... cnt[word] += 1
62 >>> cnt
Raymond Hettingeraaa6e632009-01-13 01:05:03 +000063 Counter({'blue': 3, 'red': 2, 'green': 1})
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000064
Raymond Hettinger939a3cc2009-02-04 11:31:30 +000065 >>> # Find the ten most common words in Hamlet
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000066 >>> import re
67 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger196a0f72009-01-20 12:59:36 +000068 >>> Counter(words).most_common(10)
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000069 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
70 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
71
Raymond Hettinger82783852009-01-13 03:49:43 +000072.. class:: Counter([iterable-or-mapping])
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000073
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000074 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingeraaa6e632009-01-13 01:05:03 +000075 It is an unordered collection where elements are stored as dictionary keys
76 and their counts are stored as dictionary values. Counts are allowed to be
77 any integer value including zero or negative counts. The :class:`Counter`
78 class is similar to bags or multisets in other languages.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000079
Raymond Hettinger82783852009-01-13 03:49:43 +000080 Elements are counted from an *iterable* or initialized from another
Georg Brandlf6dab952009-04-28 21:48:35 +000081 *mapping* (or counter):
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000082
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +000083 >>> c = Counter() # a new, empty counter
84 >>> c = Counter('gallahad') # a new counter from an iterable
85 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
86 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000087
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000088 Counter objects have a dictionary interface except that they return a zero
Georg Brandlf6dab952009-04-28 21:48:35 +000089 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000090
Raymond Hettinger7bdca052009-01-22 05:20:47 +000091 >>> c = Counter(['eggs', 'ham'])
Raymond Hettinger59894122009-01-14 00:15:21 +000092 >>> c['bacon'] # count of a missing element is zero
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000093 0
94
Raymond Hettinger7bdca052009-01-22 05:20:47 +000095 Setting a count to zero does not remove an element from a counter.
96 Use ``del`` to remove it entirely:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000097
Raymond Hettinger7bdca052009-01-22 05:20:47 +000098 >>> c['sausage'] = 0 # counter entry with a zero count
99 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000100
101 .. versionadded:: 2.7
102
103
Ezio Melotti4edfe962010-04-04 06:50:58 +0000104 Counter objects support three methods beyond those available for all
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000105 dictionaries:
106
107 .. method:: elements()
108
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000109 Return an iterator over elements repeating each as many times as its
110 count. Elements are returned in arbitrary order. If an element's count
111 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000112
Raymond Hettinger196a0f72009-01-20 12:59:36 +0000113 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000114 >>> list(c.elements())
115 ['a', 'a', 'a', 'a', 'b', 'b']
116
117 .. method:: most_common([n])
118
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000119 Return a list of the *n* most common elements and their counts from the
Raymond Hettingerd507afd2009-02-04 10:52:32 +0000120 most common to the least. If *n* is not specified, :func:`most_common`
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000121 returns *all* elements in the counter. Elements with equal counts are
Georg Brandlf6dab952009-04-28 21:48:35 +0000122 ordered arbitrarily:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000123
124 >>> Counter('abracadabra').most_common(3)
125 [('a', 5), ('r', 2), ('b', 2)]
126
Raymond Hettinger34c35b22010-04-03 10:22:00 +0000127 .. method:: subtract([iterable-or-mapping])
128
129 Elements are subtracted from an *iterable* or from another *mapping*
130 (or counter). Like :meth:`dict.update` but subtracts counts instead
131 of replacing them. Both inputs and outputs may be zero or negative.
132
133 >>> c = Counter(a=4, b=2, c=0, d=-2)
134 >>> d = Counter(a=1, b=2, c=3, d=4)
135 >>> c.subtract(d)
136 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
137
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000138 The usual dictionary methods are available for :class:`Counter` objects
139 except for two which work differently for counters.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000140
141 .. method:: fromkeys(iterable)
142
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000143 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000144
Raymond Hettinger82783852009-01-13 03:49:43 +0000145 .. method:: update([iterable-or-mapping])
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000146
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000147 Elements are counted from an *iterable* or added-in from another
148 *mapping* (or counter). Like :meth:`dict.update` but adds counts
149 instead of replacing them. Also, the *iterable* is expected to be a
150 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000151
Raymond Hettingerfbcf7492009-01-13 08:38:14 +0000152Common patterns for working with :class:`Counter` objects::
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000153
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000154 sum(c.values()) # total of all counts
155 c.clear() # reset all counts
156 list(c) # list unique elements
157 set(c) # convert to a set
158 dict(c) # convert to a regular dictionary
159 c.items() # convert to a list of (elem, cnt) pairs
160 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
161 c.most_common()[:-n:-1] # n least common elements
162 c += Counter() # remove zero and negative counts
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000163
Raymond Hettingera6658532009-02-25 22:48:24 +0000164Several mathematical operations are provided for combining :class:`Counter`
165objects to produce multisets (counters that have counts greater than zero).
166Addition and subtraction combine counters by adding or subtracting the counts
167of corresponding elements. Intersection and union return the minimum and
168maximum of corresponding counts. Each operation can accept inputs with signed
169counts, but the output will exclude results with counts of zero or less.
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000170
Raymond Hettinger4571f342009-01-21 20:31:50 +0000171 >>> c = Counter(a=3, b=1)
172 >>> d = Counter(a=1, b=2)
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000173 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000174 Counter({'a': 4, 'b': 3})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000175 >>> c - d # subtract (keeping only positive counts)
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000176 Counter({'a': 2})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000177 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000178 Counter({'a': 1, 'b': 1})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000179 >>> c | d # union: max(c[x], d[x])
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000180 Counter({'a': 3, 'b': 2})
181
Raymond Hettinger44340e62010-04-12 21:12:06 +0000182.. note::
183
184 Counters were primarily designed to work with positive integers to represent
185 running counts; however, care was taken to not unnecessarily preclude use
186 cases needing other types or negative values. To help with those use cases,
187 this section documents the minimum range and type restrictions.
188
189 * The :class:`Counter` class itself is a dictionary subclass with no
190 restrictions on its keys and values. The values are intended to be numbers
191 representing counts, but you *could* store anything in the value field.
192
193 * The :meth:`most_common` method requires only that the values be orderable.
194
195 * For in-place operations such as ``c[key] += 1``, the value type need only
196 support addition and subtraction. So fractions, floats, and decimals would
197 work and negative values are supported. The same is also true for
198 :meth:`update` and :meth:`subtract` which allow negative and zero values
199 for both inputs and outputs.
200
201 * The multiset methods are designed only for use cases with positive values.
202 The inputs may be negative or zero, but only outputs with positive values
203 are created. There are no type restrictions, but the value type needs to
204 support support addition, subtraction, and comparison.
205
206 * The :meth:`elements` method requires integer counts. It ignores zero and
207 negative counts.
208
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000209.. seealso::
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000210
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000211 * `Counter class <http://code.activestate.com/recipes/576611/>`_
212 adapted for Python 2.5 and an early `Bag recipe
213 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
214
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000215 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
216 in Smalltalk.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000217
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000218 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_\.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000219
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000220 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000221 tutorial with examples.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000222
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000223 * For mathematical operations on multisets and their use cases, see
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000224 *Knuth, Donald. The Art of Computer Programming Volume II,
225 Section 4.6.3, Exercise 19*\.
226
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000227 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd081abc2009-01-27 02:58:49 +0000228 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000229
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000230 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerfbcf7492009-01-13 08:38:14 +0000231
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000232
Georg Brandl8ec7f652007-08-15 14:28:01 +0000233:class:`deque` objects
234----------------------
235
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000236.. class:: deque([iterable[, maxlen]])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000237
238 Returns a new deque object initialized left-to-right (using :meth:`append`) with
239 data from *iterable*. If *iterable* is not specified, the new deque is empty.
240
241 Deques are a generalization of stacks and queues (the name is pronounced "deck"
242 and is short for "double-ended queue"). Deques support thread-safe, memory
243 efficient appends and pops from either side of the deque with approximately the
244 same O(1) performance in either direction.
245
246 Though :class:`list` objects support similar operations, they are optimized for
247 fast fixed-length operations and incur O(n) memory movement costs for
248 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
249 position of the underlying data representation.
250
251 .. versionadded:: 2.4
252
Raymond Hettinger68995862007-10-10 00:26:46 +0000253 If *maxlen* is not specified or is *None*, deques may grow to an
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000254 arbitrary length. Otherwise, the deque is bounded to the specified maximum
255 length. Once a bounded length deque is full, when new items are added, a
256 corresponding number of items are discarded from the opposite end. Bounded
257 length deques provide functionality similar to the ``tail`` filter in
258 Unix. They are also useful for tracking transactions and other pools of data
259 where only the most recent activity is of interest.
260
261 .. versionchanged:: 2.6
Georg Brandlb19be572007-12-29 10:57:00 +0000262 Added *maxlen* parameter.
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000263
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000264 Deque objects support the following methods:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000265
266
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000267 .. method:: append(x)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000268
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000269 Add *x* to the right side of the deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000270
271
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000272 .. method:: appendleft(x)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000273
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000274 Add *x* to the left side of the deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000275
276
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000277 .. method:: clear()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000278
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000279 Remove all elements from the deque leaving it with length 0.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000280
281
Raymond Hettinger5f516ed2010-04-03 18:10:37 +0000282 .. method:: count(x)
283
284 Count the number of deque elements equal to *x*.
285
286 .. versionadded:: 2.7
287
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000288 .. method:: extend(iterable)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000289
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000290 Extend the right side of the deque by appending elements from the iterable
291 argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000292
293
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000294 .. method:: extendleft(iterable)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000295
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000296 Extend the left side of the deque by appending elements from *iterable*.
297 Note, the series of left appends results in reversing the order of
298 elements in the iterable argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000299
300
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000301 .. method:: pop()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000302
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000303 Remove and return an element from the right side of the deque. If no
304 elements are present, raises an :exc:`IndexError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000305
306
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000307 .. method:: popleft()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000308
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000309 Remove and return an element from the left side of the deque. If no
310 elements are present, raises an :exc:`IndexError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000311
312
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000313 .. method:: remove(value)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000314
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000315 Removed the first occurrence of *value*. If not found, raises a
316 :exc:`ValueError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000317
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000318 .. versionadded:: 2.5
Georg Brandl8ec7f652007-08-15 14:28:01 +0000319
Raymond Hettingera5fd24e2009-12-10 06:42:54 +0000320 .. method:: reverse()
321
322 Reverse the elements of the deque in-place and then return ``None``.
323
324 .. versionadded:: 2.7
Georg Brandl8ec7f652007-08-15 14:28:01 +0000325
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000326 .. method:: rotate(n)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000327
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000328 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
329 the left. Rotating one step to the right is equivalent to:
330 ``d.appendleft(d.pop())``.
331
Georg Brandl8ec7f652007-08-15 14:28:01 +0000332
Raymond Hettinger56411aa2009-03-10 12:50:59 +0000333 Deque objects also provide one read-only attribute:
334
335 .. attribute:: maxlen
336
337 Maximum size of a deque or *None* if unbounded.
338
339 .. versionadded:: 2.7
340
341
Georg Brandl8ec7f652007-08-15 14:28:01 +0000342In addition to the above, deques support iteration, pickling, ``len(d)``,
343``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson5c4e0062008-10-16 18:52:14 +0000344the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
345access is O(1) at both ends but slows to O(n) in the middle. For fast random
346access, use lists instead.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000347
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000348Example:
349
350.. doctest::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000351
352 >>> from collections import deque
353 >>> d = deque('ghi') # make a new deque with three items
354 >>> for elem in d: # iterate over the deque's elements
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000355 ... print elem.upper()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000356 G
357 H
358 I
359
360 >>> d.append('j') # add a new entry to the right side
361 >>> d.appendleft('f') # add a new entry to the left side
362 >>> d # show the representation of the deque
363 deque(['f', 'g', 'h', 'i', 'j'])
364
365 >>> d.pop() # return and remove the rightmost item
366 'j'
367 >>> d.popleft() # return and remove the leftmost item
368 'f'
369 >>> list(d) # list the contents of the deque
370 ['g', 'h', 'i']
371 >>> d[0] # peek at leftmost item
372 'g'
373 >>> d[-1] # peek at rightmost item
374 'i'
375
376 >>> list(reversed(d)) # list the contents of a deque in reverse
377 ['i', 'h', 'g']
378 >>> 'h' in d # search the deque
379 True
380 >>> d.extend('jkl') # add multiple elements at once
381 >>> d
382 deque(['g', 'h', 'i', 'j', 'k', 'l'])
383 >>> d.rotate(1) # right rotation
384 >>> d
385 deque(['l', 'g', 'h', 'i', 'j', 'k'])
386 >>> d.rotate(-1) # left rotation
387 >>> d
388 deque(['g', 'h', 'i', 'j', 'k', 'l'])
389
390 >>> deque(reversed(d)) # make a new deque in reverse order
391 deque(['l', 'k', 'j', 'i', 'h', 'g'])
392 >>> d.clear() # empty the deque
393 >>> d.pop() # cannot pop from an empty deque
394 Traceback (most recent call last):
395 File "<pyshell#6>", line 1, in -toplevel-
396 d.pop()
397 IndexError: pop from an empty deque
398
399 >>> d.extendleft('abc') # extendleft() reverses the input order
400 >>> d
401 deque(['c', 'b', 'a'])
402
403
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000404:class:`deque` Recipes
405^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl8ec7f652007-08-15 14:28:01 +0000406
407This section shows various approaches to working with deques.
408
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000409Bounded length deques provide functionality similar to the ``tail`` filter
410in Unix::
411
412 def tail(filename, n=10):
413 'Return the last n lines of a file'
414 return deque(open(filename), n)
415
416Another approach to using deques is to maintain a sequence of recently
417added elements by appending to the right and popping to the left::
418
419 def moving_average(iterable, n=3):
420 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
421 # http://en.wikipedia.org/wiki/Moving_average
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000422 it = iter(iterable)
Raymond Hettinger9b6f13e2009-05-22 01:06:44 +0000423 d = deque(itertools.islice(it, n-1))
424 d.appendleft(0)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000425 s = sum(d)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000426 for elem in it:
427 s += elem - d.popleft()
428 d.append(elem)
Raymond Hettinger9b6f13e2009-05-22 01:06:44 +0000429 yield s / float(n)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000430
Georg Brandl8ec7f652007-08-15 14:28:01 +0000431The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti062d2b52009-12-19 22:41:49 +0000432deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl8ec7f652007-08-15 14:28:01 +0000433the :meth:`rotate` method to position elements to be popped::
434
435 def delete_nth(d, n):
436 d.rotate(-n)
437 d.popleft()
438 d.rotate(n)
439
440To implement :class:`deque` slicing, use a similar approach applying
441:meth:`rotate` to bring a target element to the left side of the deque. Remove
442old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
443reverse the rotation.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000444With minor variations on that approach, it is easy to implement Forth style
445stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
446``rot``, and ``roll``.
447
Georg Brandl8ec7f652007-08-15 14:28:01 +0000448
449:class:`defaultdict` objects
450----------------------------
451
Georg Brandl8ec7f652007-08-15 14:28:01 +0000452.. class:: defaultdict([default_factory[, ...]])
453
454 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandld7d4fd72009-07-26 14:37:28 +0000455 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl8ec7f652007-08-15 14:28:01 +0000456 instance variable. The remaining functionality is the same as for the
457 :class:`dict` class and is not documented here.
458
459 The first argument provides the initial value for the :attr:`default_factory`
460 attribute; it defaults to ``None``. All remaining arguments are treated the same
461 as if they were passed to the :class:`dict` constructor, including keyword
462 arguments.
463
464 .. versionadded:: 2.5
465
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000466 :class:`defaultdict` objects support the following method in addition to the
467 standard :class:`dict` operations:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000468
469
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000470 .. method:: defaultdict.__missing__(key)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000471
Skip Montanarob40890d2008-09-17 11:50:36 +0000472 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000473 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000474
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000475 If :attr:`default_factory` is not ``None``, it is called without arguments
476 to provide a default value for the given *key*, this value is inserted in
477 the dictionary for the *key*, and returned.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000478
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000479 If calling :attr:`default_factory` raises an exception this exception is
480 propagated unchanged.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000481
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000482 This method is called by the :meth:`__getitem__` method of the
483 :class:`dict` class when the requested key is not found; whatever it
484 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000485
486
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000487 :class:`defaultdict` objects support the following instance variable:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000488
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000489
490 .. attribute:: defaultdict.default_factory
491
492 This attribute is used by the :meth:`__missing__` method; it is
493 initialized from the first argument to the constructor, if present, or to
494 ``None``, if absent.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000495
496
Georg Brandl8ec7f652007-08-15 14:28:01 +0000497:class:`defaultdict` Examples
498^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
499
500Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000501sequence of key-value pairs into a dictionary of lists:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000502
503 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
504 >>> d = defaultdict(list)
505 >>> for k, v in s:
506 ... d[k].append(v)
507 ...
508 >>> d.items()
509 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
510
511When each key is encountered for the first time, it is not already in the
512mapping; so an entry is automatically created using the :attr:`default_factory`
513function which returns an empty :class:`list`. The :meth:`list.append`
514operation then attaches the value to the new list. When keys are encountered
515again, the look-up proceeds normally (returning the list for that key) and the
516:meth:`list.append` operation adds another value to the list. This technique is
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000517simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000518
519 >>> d = {}
520 >>> for k, v in s:
521 ... d.setdefault(k, []).append(v)
522 ...
523 >>> d.items()
524 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
525
526Setting the :attr:`default_factory` to :class:`int` makes the
527:class:`defaultdict` useful for counting (like a bag or multiset in other
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000528languages):
Georg Brandl8ec7f652007-08-15 14:28:01 +0000529
530 >>> s = 'mississippi'
531 >>> d = defaultdict(int)
532 >>> for k in s:
533 ... d[k] += 1
534 ...
535 >>> d.items()
536 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
537
538When a letter is first encountered, it is missing from the mapping, so the
539:attr:`default_factory` function calls :func:`int` to supply a default count of
540zero. The increment operation then builds up the count for each letter.
541
542The function :func:`int` which always returns zero is just a special case of
543constant functions. A faster and more flexible way to create constant functions
544is to use :func:`itertools.repeat` which can supply any constant value (not just
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000545zero):
Georg Brandl8ec7f652007-08-15 14:28:01 +0000546
547 >>> def constant_factory(value):
548 ... return itertools.repeat(value).next
549 >>> d = defaultdict(constant_factory('<missing>'))
550 >>> d.update(name='John', action='ran')
551 >>> '%(name)s %(action)s to %(object)s' % d
552 'John ran to <missing>'
553
554Setting the :attr:`default_factory` to :class:`set` makes the
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000555:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000556
557 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
558 >>> d = defaultdict(set)
559 >>> for k, v in s:
560 ... d[k].add(v)
561 ...
562 >>> d.items()
563 [('blue', set([2, 4])), ('red', set([1, 3]))]
564
565
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000566:func:`namedtuple` Factory Function for Tuples with Named Fields
Georg Brandlb3255ed2008-01-07 16:43:47 +0000567----------------------------------------------------------------
Georg Brandl8ec7f652007-08-15 14:28:01 +0000568
Raymond Hettingercbab5942007-09-18 22:18:02 +0000569Named tuples assign meaning to each position in a tuple and allow for more readable,
570self-documenting code. They can be used wherever regular tuples are used, and
571they add the ability to access fields by name instead of position index.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000572
Georg Brandl27fa4822010-10-17 06:24:10 +0000573.. function:: namedtuple(typename, field_names, [verbose=False], [rename=False])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000574
575 Returns a new tuple subclass named *typename*. The new subclass is used to
Georg Brandl907a7202008-02-22 12:31:45 +0000576 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl8ec7f652007-08-15 14:28:01 +0000577 well as being indexable and iterable. Instances of the subclass also have a
Georg Brandl061d2e22008-11-23 19:17:25 +0000578 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl8ec7f652007-08-15 14:28:01 +0000579 method which lists the tuple contents in a ``name=value`` format.
580
Georg Brandl061d2e22008-11-23 19:17:25 +0000581 The *field_names* are a single string with each fieldname separated by whitespace
582 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000583 can be a sequence of strings such as ``['x', 'y']``.
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000584
585 Any valid Python identifier may be used for a fieldname except for names
Raymond Hettinger42da8742007-12-14 02:49:47 +0000586 starting with an underscore. Valid identifiers consist of letters, digits,
587 and underscores but do not start with a digit or underscore and cannot be
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000588 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*, *print*,
589 or *raise*.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000590
Raymond Hettinger322daea2009-02-10 01:24:05 +0000591 If *rename* is true, invalid fieldnames are automatically replaced
592 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger6df48a32009-04-02 22:34:17 +0000593 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Raymond Hettinger322daea2009-02-10 01:24:05 +0000594 ``def`` and the duplicate fieldname ``abc``.
595
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000596 If *verbose* is true, the class definition is printed just before being built.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000597
Raymond Hettingera48a2992007-10-08 21:26:58 +0000598 Named tuple instances do not have per-instance dictionaries, so they are
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000599 lightweight and require no more memory than regular tuples.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000600
Georg Brandl8ec7f652007-08-15 14:28:01 +0000601 .. versionadded:: 2.6
602
Raymond Hettinger322daea2009-02-10 01:24:05 +0000603 .. versionchanged:: 2.7
604 added support for *rename*.
605
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000606Example:
607
608.. doctest::
609 :options: +NORMALIZE_WHITESPACE
Georg Brandl8ec7f652007-08-15 14:28:01 +0000610
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000611 >>> Point = namedtuple('Point', 'x y', verbose=True)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000612 class Point(tuple):
613 'Point(x, y)'
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000614 <BLANKLINE>
Raymond Hettingercbab5942007-09-18 22:18:02 +0000615 __slots__ = ()
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000616 <BLANKLINE>
Raymond Hettingere0734e72008-01-04 03:22:53 +0000617 _fields = ('x', 'y')
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000618 <BLANKLINE>
Raymond Hettingera68cad12009-05-27 02:24:45 +0000619 def __new__(_cls, x, y):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000620 'Create a new instance of Point(x, y)'
Raymond Hettingera68cad12009-05-27 02:24:45 +0000621 return _tuple.__new__(_cls, (x, y))
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000622 <BLANKLINE>
Raymond Hettinger02740f72008-01-05 01:35:43 +0000623 @classmethod
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000624 def _make(cls, iterable, new=tuple.__new__, len=len):
Raymond Hettinger02740f72008-01-05 01:35:43 +0000625 'Make a new Point object from a sequence or iterable'
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000626 result = new(cls, iterable)
Raymond Hettinger02740f72008-01-05 01:35:43 +0000627 if len(result) != 2:
628 raise TypeError('Expected 2 arguments, got %d' % len(result))
629 return result
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000630 <BLANKLINE>
Raymond Hettingercbab5942007-09-18 22:18:02 +0000631 def __repr__(self):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000632 'Return a nicely formatted representation string'
Raymond Hettingercbab5942007-09-18 22:18:02 +0000633 return 'Point(x=%r, y=%r)' % self
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000634 <BLANKLINE>
Raymond Hettinger88a91642009-03-03 04:51:24 +0000635 def _asdict(self):
636 'Return a new OrderedDict which maps field names to their values'
637 return OrderedDict(zip(self._fields, self))
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000638 <BLANKLINE>
Raymond Hettingera68cad12009-05-27 02:24:45 +0000639 def _replace(_self, **kwds):
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000640 'Return a new Point object replacing specified fields with new values'
Raymond Hettingera68cad12009-05-27 02:24:45 +0000641 result = _self._make(map(kwds.pop, ('x', 'y'), _self))
Raymond Hettinger1b50fd72008-01-05 02:17:24 +0000642 if kwds:
643 raise ValueError('Got unexpected field names: %r' % kwds.keys())
644 return result
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000645 <BLANKLINE>
646 def __getnewargs__(self):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000647 'Return self as a plain tuple. Used by copy and pickle.'
Raymond Hettingeree51cff2008-06-27 21:34:24 +0000648 return tuple(self)
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000649 <BLANKLINE>
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000650 x = _property(_itemgetter(0), doc='Alias for field number 0')
651 y = _property(_itemgetter(1), doc='Alias for field number 1')
Georg Brandl8ec7f652007-08-15 14:28:01 +0000652
Raymond Hettingercbab5942007-09-18 22:18:02 +0000653 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Raymond Hettinger88880b22007-12-18 00:13:45 +0000654 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000655 33
656 >>> x, y = p # unpack like a regular tuple
657 >>> x, y
658 (11, 22)
Georg Brandl907a7202008-02-22 12:31:45 +0000659 >>> p.x + p.y # fields also accessible by name
Raymond Hettingercbab5942007-09-18 22:18:02 +0000660 33
661 >>> p # readable __repr__ with a name=value style
662 Point(x=11, y=22)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000663
Raymond Hettingercbab5942007-09-18 22:18:02 +0000664Named tuples are especially useful for assigning field names to result tuples returned
665by the :mod:`csv` or :mod:`sqlite3` modules::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000666
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000667 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Raymond Hettingera48a2992007-10-08 21:26:58 +0000668
Raymond Hettingercbab5942007-09-18 22:18:02 +0000669 import csv
Raymond Hettinger02740f72008-01-05 01:35:43 +0000670 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Raymond Hettingercbab5942007-09-18 22:18:02 +0000671 print emp.name, emp.title
Georg Brandl8ec7f652007-08-15 14:28:01 +0000672
Raymond Hettingera48a2992007-10-08 21:26:58 +0000673 import sqlite3
674 conn = sqlite3.connect('/companydata')
675 cursor = conn.cursor()
676 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Raymond Hettinger02740f72008-01-05 01:35:43 +0000677 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Raymond Hettingera48a2992007-10-08 21:26:58 +0000678 print emp.name, emp.title
679
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000680In addition to the methods inherited from tuples, named tuples support
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000681three additional methods and one attribute. To prevent conflicts with
682field names, the method and attribute names start with an underscore.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000683
Benjamin Peterson682f6032010-07-18 14:26:34 +0000684.. classmethod:: somenamedtuple._make(iterable)
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000685
Raymond Hettinger02740f72008-01-05 01:35:43 +0000686 Class method that makes a new instance from an existing sequence or iterable.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000687
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000688 .. doctest::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000689
Raymond Hettinger02740f72008-01-05 01:35:43 +0000690 >>> t = [11, 22]
691 >>> Point._make(t)
692 Point(x=11, y=22)
Raymond Hettinger2b03d452007-09-18 03:33:19 +0000693
Georg Brandlb3255ed2008-01-07 16:43:47 +0000694.. method:: somenamedtuple._asdict()
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000695
Raymond Hettinger88a91642009-03-03 04:51:24 +0000696 Return a new :class:`OrderedDict` which maps field names to their corresponding
697 values::
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000698
Raymond Hettinger42da8742007-12-14 02:49:47 +0000699 >>> p._asdict()
Raymond Hettinger88a91642009-03-03 04:51:24 +0000700 OrderedDict([('x', 11), ('y', 22)])
701
Raymond Hettingera07038d2009-03-03 05:11:56 +0000702 .. versionchanged:: 2.7
Raymond Hettinger88a91642009-03-03 04:51:24 +0000703 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000704
Georg Brandlb3255ed2008-01-07 16:43:47 +0000705.. method:: somenamedtuple._replace(kwargs)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000706
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000707 Return a new instance of the named tuple replacing specified fields with new
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000708 values::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000709
Raymond Hettingercbab5942007-09-18 22:18:02 +0000710 >>> p = Point(x=11, y=22)
Raymond Hettinger42da8742007-12-14 02:49:47 +0000711 >>> p._replace(x=33)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000712 Point(x=33, y=22)
713
Raymond Hettinger7c3738e2007-11-15 03:16:09 +0000714 >>> for partnum, record in inventory.items():
Raymond Hettingere11230e2008-01-09 03:02:23 +0000715 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000716
Georg Brandlb3255ed2008-01-07 16:43:47 +0000717.. attribute:: somenamedtuple._fields
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000718
Raymond Hettingerf6b769b2008-01-07 21:33:51 +0000719 Tuple of strings listing the field names. Useful for introspection
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000720 and for creating new named tuple types from existing named tuples.
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000721
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000722 .. doctest::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000723
Raymond Hettinger42da8742007-12-14 02:49:47 +0000724 >>> p._fields # view the field names
Raymond Hettingercbab5942007-09-18 22:18:02 +0000725 ('x', 'y')
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000726
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000727 >>> Color = namedtuple('Color', 'red green blue')
Raymond Hettinger42da8742007-12-14 02:49:47 +0000728 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000729 >>> Pixel(11, 22, 128, 255, 0)
Raymond Hettingerdc1854d2008-01-09 03:13:20 +0000730 Pixel(x=11, y=22, red=128, green=255, blue=0)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000731
Raymond Hettingere846f382007-12-14 21:51:50 +0000732To retrieve a field whose name is stored in a string, use the :func:`getattr`
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000733function:
Raymond Hettingere846f382007-12-14 21:51:50 +0000734
735 >>> getattr(p, 'x')
736 11
737
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000738To convert a dictionary to a named tuple, use the double-star-operator
739(as described in :ref:`tut-unpacking-arguments`):
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000740
741 >>> d = {'x': 11, 'y': 22}
742 >>> Point(**d)
743 Point(x=11, y=22)
744
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000745Since a named tuple is a regular Python class, it is easy to add or change
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000746functionality with a subclass. Here is how to add a calculated field and
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000747a fixed-width print format:
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000748
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000749 >>> class Point(namedtuple('Point', 'x y')):
Raymond Hettingere1655082008-01-10 19:15:10 +0000750 ... __slots__ = ()
Raymond Hettingere11230e2008-01-09 03:02:23 +0000751 ... @property
752 ... def hypot(self):
753 ... return (self.x ** 2 + self.y ** 2) ** 0.5
754 ... def __str__(self):
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000755 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000756
Raymond Hettingere1655082008-01-10 19:15:10 +0000757 >>> for p in Point(3, 4), Point(14, 5/7.):
Raymond Hettingere11230e2008-01-09 03:02:23 +0000758 ... print p
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000759 Point: x= 3.000 y= 4.000 hypot= 5.000
760 Point: x=14.000 y= 0.714 hypot=14.018
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000761
Georg Brandlfe8df4f2009-12-28 08:01:59 +0000762The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Raymond Hettinger171f3912008-01-16 23:38:16 +0000763keep memory requirements low by preventing the creation of instance dictionaries.
Raymond Hettingerf59e9622008-01-15 20:52:42 +0000764
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000765Subclassing is not useful for adding new, stored fields. Instead, simply
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000766create a new named tuple type from the :attr:`_fields` attribute:
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000767
Raymond Hettingere850c462008-01-10 20:37:12 +0000768 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000769
Raymond Hettingerfb3ced62008-01-07 20:17:35 +0000770Default values can be implemented by using :meth:`_replace` to
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000771customize a prototype instance:
Raymond Hettingerbc693492007-11-15 22:39:34 +0000772
773 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Raymond Hettinger0fe6ca42008-01-18 21:14:58 +0000774 >>> default_account = Account('<owner name>', 0.0, 0)
775 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerbc693492007-11-15 22:39:34 +0000776
Raymond Hettinger5a9fed72008-05-08 07:23:30 +0000777Enumerated constants can be implemented with named tuples, but it is simpler
778and more efficient to use a simple class declaration:
779
780 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
781 >>> Status.open, Status.pending, Status.closed
782 (0, 1, 2)
783 >>> class Status:
784 ... open, pending, closed = range(3)
785
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000786.. seealso::
Mark Summerfield7f626f42007-08-30 15:03:03 +0000787
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000788 `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
789 adapted for Python 2.4.
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000790
791
792:class:`OrderedDict` objects
793----------------------------
794
795Ordered dictionaries are just like regular dictionaries but they remember the
796order that items were inserted. When iterating over an ordered dictionary,
797the items are returned in the order their keys were first added.
798
799.. class:: OrderedDict([items])
800
801 Return an instance of a dict subclass, supporting the usual :class:`dict`
802 methods. An *OrderedDict* is a dict that remembers the order that keys
803 were first inserted. If a new entry overwrites an existing entry, the
804 original insertion position is left unchanged. Deleting an entry and
805 reinserting it will move it to the end.
806
807 .. versionadded:: 2.7
808
Raymond Hettinger24122992009-03-19 19:59:58 +0000809.. method:: OrderedDict.popitem(last=True)
810
811 The :meth:`popitem` method for ordered dictionaries returns and removes
812 a (key, value) pair. The pairs are returned in LIFO order if *last* is
813 true or FIFO order if false.
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000814
Raymond Hettinger50f362f2009-05-19 17:43:59 +0000815In addition to the usual mapping methods, ordered dictionaries also support
816reverse iteration using :func:`reversed`.
817
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000818Equality tests between :class:`OrderedDict` objects are order-sensitive
819and are implemented as ``list(od1.items())==list(od2.items())``.
820Equality tests between :class:`OrderedDict` objects and other
821:class:`Mapping` objects are order-insensitive like regular dictionaries.
822This allows :class:`OrderedDict` objects to be substituted anywhere a
823regular dictionary is used.
Raymond Hettinger24122992009-03-19 19:59:58 +0000824
Raymond Hettingerc473c5a2009-04-09 22:31:51 +0000825The :class:`OrderedDict` constructor and :meth:`update` method both accept
826keyword arguments, but their order is lost because Python's function call
827semantics pass-in keyword arguments using a regular unordered dictionary.
828
Raymond Hettinger24122992009-03-19 19:59:58 +0000829.. seealso::
830
831 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
832 that runs on Python 2.4 or later.
Raymond Hettinger610326d2009-11-10 19:35:55 +0000833
834Since an ordered dictionary remembers its insertion order, it can be used
835in conjuction with sorting to make a sorted dictionary::
836
837 >>> # regular unsorted dictionary
838 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
839
840 >>> # dictionary sorted by key
841 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
842 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
843
844 >>> # dictionary sorted by value
845 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
846 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
847
848 >>> # dictionary sorted by length of the key string
849 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
850 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
851
852The new sorted dictionaries maintain their sort order when entries
853are deleted. But when new keys are added, the keys are appended
854to the end and the sort is not maintained.
Georg Brandlad8ac862010-08-01 19:21:26 +0000855
856It is also straight-forward to create an ordered dictionary variant
857that the remembers the order the keys were *last* inserted.
858If a new entry overwrites an existing entry, the
859original insertion position is changed and moved to the end::
860
861 class LastUpdatedOrderedDict(OrderedDict):
862 'Store items is the order the keys were last added'
863 def __setitem__(self, key, value):
864 if key in self:
865 del self[key]
866 OrderedDict.__setitem__(self, key, value)
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000867
868
869ABCs - abstract base classes
870----------------------------
871
872The collections module offers the following ABCs:
873
874========================= ===================== ====================== ====================================================
875ABC Inherits Abstract Methods Mixin Methods
876========================= ===================== ====================== ====================================================
877:class:`Container` ``__contains__``
878:class:`Hashable` ``__hash__``
879:class:`Iterable` ``__iter__``
880:class:`Iterator` :class:`Iterable` ``next`` ``__iter__``
881:class:`Sized` ``__len__``
882:class:`Callable` ``__call__``
883
884:class:`Sequence` :class:`Sized`, ``__getitem__`` ``__contains__``. ``__iter__``, ``__reversed__``.
885 :class:`Iterable`, ``index``, and ``count``
886 :class:`Container`
887
888:class:`MutableSequence` :class:`Sequence` ``__setitem__`` Inherited Sequence methods and
889 ``__delitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
890 and ``insert`` ``remove``, and ``__iadd__``
891
892:class:`Set` :class:`Sized`, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
893 :class:`Iterable`, ``__gt__``, ``__ge__``, ``__and__``, ``__or__``
894 :class:`Container` ``__sub__``, ``__xor__``, and ``isdisjoint``
895
896:class:`MutableSet` :class:`Set` ``add`` and Inherited Set methods and
897 ``discard`` ``clear``, ``pop``, ``remove``, ``__ior__``,
898 ``__iand__``, ``__ixor__``, and ``__isub__``
899
900:class:`Mapping` :class:`Sized`, ``__getitem__`` ``__contains__``, ``keys``, ``items``, ``values``,
901 :class:`Iterable`, ``get``, ``__eq__``, and ``__ne__``
902 :class:`Container`
903
904:class:`MutableMapping` :class:`Mapping` ``__setitem__`` and Inherited Mapping methods and
905 ``__delitem__`` ``pop``, ``popitem``, ``clear``, ``update``,
906 and ``setdefault``
907
908
909:class:`MappingView` :class:`Sized` ``__len__``
910:class:`KeysView` :class:`MappingView`, ``__contains__``,
911 :class:`Set` ``__iter__``
912:class:`ItemsView` :class:`MappingView`, ``__contains__``,
913 :class:`Set` ``__iter__``
914:class:`ValuesView` :class:`MappingView` ``__contains__``, ``__iter__``
915========================= ===================== ====================== ====================================================
916
917These ABCs allow us to ask classes or instances if they provide
918particular functionality, for example::
919
920 size = None
921 if isinstance(myvar, collections.Sized):
922 size = len(myvar)
923
924Several of the ABCs are also useful as mixins that make it easier to develop
925classes supporting container APIs. For example, to write a class supporting
926the full :class:`Set` API, it only necessary to supply the three underlying
927abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
928The ABC supplies the remaining methods such as :meth:`__and__` and
929:meth:`isdisjoint` ::
930
931 class ListBasedSet(collections.Set):
932 ''' Alternate set implementation favoring space over speed
933 and not requiring the set elements to be hashable. '''
934 def __init__(self, iterable):
935 self.elements = lst = []
936 for value in iterable:
937 if value not in lst:
938 lst.append(value)
939 def __iter__(self):
940 return iter(self.elements)
941 def __contains__(self, value):
942 return value in self.elements
943 def __len__(self):
944 return len(self.elements)
945
946 s1 = ListBasedSet('abcdef')
947 s2 = ListBasedSet('defghi')
948 overlap = s1 & s2 # The __and__() method is supported automatically
949
950Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
951
952(1)
953 Since some set operations create new sets, the default mixin methods need
954 a way to create new instances from an iterable. The class constructor is
955 assumed to have a signature in the form ``ClassName(iterable)``.
956 That assumption is factored-out to an internal classmethod called
957 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
958 If the :class:`Set` mixin is being used in a class with a different
959 constructor signature, you will need to override :meth:`from_iterable`
960 with a classmethod that can construct new instances from
961 an iterable argument.
962
963(2)
964 To override the comparisons (presumably for speed, as the
965 semantics are fixed), redefine :meth:`__le__` and
966 then the other operations will automatically follow suit.
967
968(3)
969 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
970 for the set; however, :meth:`__hash__` is not defined because not all sets
971 are hashable or immutable. To add set hashabilty using mixins,
972 inherit from both :meth:`Set` and :meth:`Hashable`, then define
973 ``__hash__ = Set._hash``.
974
975.. seealso::
976
Raymond Hettingere679a372010-11-05 23:58:42 +0000977 * Latest version of the `Python source code for the collections abstract base classes
978 <http://svn.python.org/view/python/branches/release27-maint/Lib/_abcoll.py?view=markup>`_
979
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000980 * `OrderedSet recipe <http://code.activestate.com/recipes/576694/>`_ for an
981 example built on :class:`MutableSet`.
982
983 * For more about ABCs, see the :mod:`abc` module and :pep:`3119`.