blob: 12560e235e222865fdb290ce3855d7fd947b9e4d [file] [log] [blame]
Georg Brandl8ec7f652007-08-15 14:28:01 +00001
2:mod:`collections` --- High-performance container datatypes
3===========================================================
4
5.. module:: collections
6 :synopsis: High-performance datatypes
7.. moduleauthor:: Raymond Hettinger <python@rcn.com>
8.. sectionauthor:: Raymond Hettinger <python@rcn.com>
9
Georg Brandl8ec7f652007-08-15 14:28:01 +000010.. versionadded:: 2.4
11
Georg Brandl4c8bbe62008-03-22 21:06:20 +000012.. testsetup:: *
13
14 from collections import *
15 import itertools
16 __name__ = '<doctest>'
17
Raymond Hettinger3de8a302010-08-08 00:35:03 +000018This module implements specialized container datatypes providing alternatives to
19Python's general purpose built-in containers, :class:`dict`, :class:`list`,
20:class:`set`, and :class:`tuple`.
Georg Brandl8ec7f652007-08-15 14:28:01 +000021
Raymond Hettinger3de8a302010-08-08 00:35:03 +000022===================== ====================================================================
23:func:`namedtuple` factory function for creating tuple subclasses with named fields
24:class:`deque` list-like container with fast appends and pops on either end
25:class:`Counter` dict subclass for counting hashable objects
26:class:`OrderedDict` dict subclass that remembers the order entries were added
27:class:`defaultdict` dict subclass that calls a factory function to supply missing values
28===================== ====================================================================
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000029
Raymond Hettingerf746a1f2009-02-17 08:33:01 +000030.. versionchanged:: 2.4
31 Added :class:`deque`.
32
33.. versionchanged:: 2.5
34 Added :class:`defaultdict`.
35
36.. versionchanged:: 2.6
37 Added :func:`namedtuple` and added abstract base classes.
38
39.. versionchanged:: 2.7
Raymond Hettingerbc512d32009-03-03 04:45:34 +000040 Added :class:`Counter` and :class:`OrderedDict`.
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000041
42In addition to containers, the collections module provides some ABCs
Georg Brandl4c8bbe62008-03-22 21:06:20 +000043(abstract base classes) that can be used to test whether a class
Raymond Hettingerf746a1f2009-02-17 08:33:01 +000044provides a particular interface, for example, whether it is hashable or
Georg Brandl4c8bbe62008-03-22 21:06:20 +000045a mapping.
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000046
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000047
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000048:class:`Counter` objects
49------------------------
50
51A counter tool is provided to support convenient and rapid tallies.
52For example::
53
Raymond Hettinger939a3cc2009-02-04 11:31:30 +000054 >>> # Tally occurrences of words in a list
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000055 >>> cnt = Counter()
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000056 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000057 ... cnt[word] += 1
58 >>> cnt
Raymond Hettingeraaa6e632009-01-13 01:05:03 +000059 Counter({'blue': 3, 'red': 2, 'green': 1})
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000060
Raymond Hettinger939a3cc2009-02-04 11:31:30 +000061 >>> # Find the ten most common words in Hamlet
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000062 >>> import re
63 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger196a0f72009-01-20 12:59:36 +000064 >>> Counter(words).most_common(10)
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000065 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
66 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
67
Raymond Hettinger82783852009-01-13 03:49:43 +000068.. class:: Counter([iterable-or-mapping])
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000069
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000070 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingeraaa6e632009-01-13 01:05:03 +000071 It is an unordered collection where elements are stored as dictionary keys
72 and their counts are stored as dictionary values. Counts are allowed to be
73 any integer value including zero or negative counts. The :class:`Counter`
74 class is similar to bags or multisets in other languages.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000075
Raymond Hettinger82783852009-01-13 03:49:43 +000076 Elements are counted from an *iterable* or initialized from another
Georg Brandlf6dab952009-04-28 21:48:35 +000077 *mapping* (or counter):
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000078
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +000079 >>> c = Counter() # a new, empty counter
80 >>> c = Counter('gallahad') # a new counter from an iterable
81 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
82 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000083
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000084 Counter objects have a dictionary interface except that they return a zero
Georg Brandlf6dab952009-04-28 21:48:35 +000085 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000086
Raymond Hettinger7bdca052009-01-22 05:20:47 +000087 >>> c = Counter(['eggs', 'ham'])
Raymond Hettinger59894122009-01-14 00:15:21 +000088 >>> c['bacon'] # count of a missing element is zero
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000089 0
90
Raymond Hettinger7bdca052009-01-22 05:20:47 +000091 Setting a count to zero does not remove an element from a counter.
92 Use ``del`` to remove it entirely:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000093
Raymond Hettinger7bdca052009-01-22 05:20:47 +000094 >>> c['sausage'] = 0 # counter entry with a zero count
95 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000096
97 .. versionadded:: 2.7
98
99
Ezio Melotti4edfe962010-04-04 06:50:58 +0000100 Counter objects support three methods beyond those available for all
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000101 dictionaries:
102
103 .. method:: elements()
104
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000105 Return an iterator over elements repeating each as many times as its
106 count. Elements are returned in arbitrary order. If an element's count
107 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000108
Raymond Hettinger196a0f72009-01-20 12:59:36 +0000109 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000110 >>> list(c.elements())
111 ['a', 'a', 'a', 'a', 'b', 'b']
112
113 .. method:: most_common([n])
114
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000115 Return a list of the *n* most common elements and their counts from the
Raymond Hettingerd507afd2009-02-04 10:52:32 +0000116 most common to the least. If *n* is not specified, :func:`most_common`
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000117 returns *all* elements in the counter. Elements with equal counts are
Georg Brandlf6dab952009-04-28 21:48:35 +0000118 ordered arbitrarily:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000119
120 >>> Counter('abracadabra').most_common(3)
121 [('a', 5), ('r', 2), ('b', 2)]
122
Raymond Hettinger34c35b22010-04-03 10:22:00 +0000123 .. method:: subtract([iterable-or-mapping])
124
125 Elements are subtracted from an *iterable* or from another *mapping*
126 (or counter). Like :meth:`dict.update` but subtracts counts instead
127 of replacing them. Both inputs and outputs may be zero or negative.
128
129 >>> c = Counter(a=4, b=2, c=0, d=-2)
130 >>> d = Counter(a=1, b=2, c=3, d=4)
131 >>> c.subtract(d)
132 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
133
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000134 The usual dictionary methods are available for :class:`Counter` objects
135 except for two which work differently for counters.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000136
137 .. method:: fromkeys(iterable)
138
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000139 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000140
Raymond Hettinger82783852009-01-13 03:49:43 +0000141 .. method:: update([iterable-or-mapping])
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000142
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000143 Elements are counted from an *iterable* or added-in from another
144 *mapping* (or counter). Like :meth:`dict.update` but adds counts
145 instead of replacing them. Also, the *iterable* is expected to be a
146 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000147
Raymond Hettingerfbcf7492009-01-13 08:38:14 +0000148Common patterns for working with :class:`Counter` objects::
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000149
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000150 sum(c.values()) # total of all counts
151 c.clear() # reset all counts
152 list(c) # list unique elements
153 set(c) # convert to a set
154 dict(c) # convert to a regular dictionary
155 c.items() # convert to a list of (elem, cnt) pairs
156 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
157 c.most_common()[:-n:-1] # n least common elements
158 c += Counter() # remove zero and negative counts
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000159
Raymond Hettingera6658532009-02-25 22:48:24 +0000160Several mathematical operations are provided for combining :class:`Counter`
161objects to produce multisets (counters that have counts greater than zero).
162Addition and subtraction combine counters by adding or subtracting the counts
163of corresponding elements. Intersection and union return the minimum and
164maximum of corresponding counts. Each operation can accept inputs with signed
165counts, but the output will exclude results with counts of zero or less.
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000166
Raymond Hettinger4571f342009-01-21 20:31:50 +0000167 >>> c = Counter(a=3, b=1)
168 >>> d = Counter(a=1, b=2)
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000169 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000170 Counter({'a': 4, 'b': 3})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000171 >>> c - d # subtract (keeping only positive counts)
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000172 Counter({'a': 2})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000173 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000174 Counter({'a': 1, 'b': 1})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000175 >>> c | d # union: max(c[x], d[x])
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000176 Counter({'a': 3, 'b': 2})
177
Raymond Hettinger44340e62010-04-12 21:12:06 +0000178.. note::
179
180 Counters were primarily designed to work with positive integers to represent
181 running counts; however, care was taken to not unnecessarily preclude use
182 cases needing other types or negative values. To help with those use cases,
183 this section documents the minimum range and type restrictions.
184
185 * The :class:`Counter` class itself is a dictionary subclass with no
186 restrictions on its keys and values. The values are intended to be numbers
187 representing counts, but you *could* store anything in the value field.
188
189 * The :meth:`most_common` method requires only that the values be orderable.
190
191 * For in-place operations such as ``c[key] += 1``, the value type need only
192 support addition and subtraction. So fractions, floats, and decimals would
193 work and negative values are supported. The same is also true for
194 :meth:`update` and :meth:`subtract` which allow negative and zero values
195 for both inputs and outputs.
196
197 * The multiset methods are designed only for use cases with positive values.
198 The inputs may be negative or zero, but only outputs with positive values
199 are created. There are no type restrictions, but the value type needs to
200 support support addition, subtraction, and comparison.
201
202 * The :meth:`elements` method requires integer counts. It ignores zero and
203 negative counts.
204
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000205.. seealso::
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000206
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000207 * `Counter class <http://code.activestate.com/recipes/576611/>`_
208 adapted for Python 2.5 and an early `Bag recipe
209 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
210
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000211 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
212 in Smalltalk.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000213
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000214 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_\.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000215
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000216 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000217 tutorial with examples.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000218
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000219 * For mathematical operations on multisets and their use cases, see
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000220 *Knuth, Donald. The Art of Computer Programming Volume II,
221 Section 4.6.3, Exercise 19*\.
222
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000223 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd081abc2009-01-27 02:58:49 +0000224 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000225
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000226 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerfbcf7492009-01-13 08:38:14 +0000227
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000228
Georg Brandl8ec7f652007-08-15 14:28:01 +0000229:class:`deque` objects
230----------------------
231
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000232.. class:: deque([iterable[, maxlen]])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000233
234 Returns a new deque object initialized left-to-right (using :meth:`append`) with
235 data from *iterable*. If *iterable* is not specified, the new deque is empty.
236
237 Deques are a generalization of stacks and queues (the name is pronounced "deck"
238 and is short for "double-ended queue"). Deques support thread-safe, memory
239 efficient appends and pops from either side of the deque with approximately the
240 same O(1) performance in either direction.
241
242 Though :class:`list` objects support similar operations, they are optimized for
243 fast fixed-length operations and incur O(n) memory movement costs for
244 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
245 position of the underlying data representation.
246
247 .. versionadded:: 2.4
248
Raymond Hettinger68995862007-10-10 00:26:46 +0000249 If *maxlen* is not specified or is *None*, deques may grow to an
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000250 arbitrary length. Otherwise, the deque is bounded to the specified maximum
251 length. Once a bounded length deque is full, when new items are added, a
252 corresponding number of items are discarded from the opposite end. Bounded
253 length deques provide functionality similar to the ``tail`` filter in
254 Unix. They are also useful for tracking transactions and other pools of data
255 where only the most recent activity is of interest.
256
257 .. versionchanged:: 2.6
Georg Brandlb19be572007-12-29 10:57:00 +0000258 Added *maxlen* parameter.
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000259
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000260 Deque objects support the following methods:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000261
262
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000263 .. method:: append(x)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000264
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000265 Add *x* to the right side of the deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000266
267
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000268 .. method:: appendleft(x)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000269
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000270 Add *x* to the left side of the deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000271
272
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000273 .. method:: clear()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000274
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000275 Remove all elements from the deque leaving it with length 0.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000276
277
Raymond Hettinger5f516ed2010-04-03 18:10:37 +0000278 .. method:: count(x)
279
280 Count the number of deque elements equal to *x*.
281
282 .. versionadded:: 2.7
283
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000284 .. method:: extend(iterable)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000285
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000286 Extend the right side of the deque by appending elements from the iterable
287 argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000288
289
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000290 .. method:: extendleft(iterable)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000291
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000292 Extend the left side of the deque by appending elements from *iterable*.
293 Note, the series of left appends results in reversing the order of
294 elements in the iterable argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000295
296
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000297 .. method:: pop()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000298
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000299 Remove and return an element from the right side of the deque. If no
300 elements are present, raises an :exc:`IndexError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000301
302
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000303 .. method:: popleft()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000304
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000305 Remove and return an element from the left side of the deque. If no
306 elements are present, raises an :exc:`IndexError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000307
308
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000309 .. method:: remove(value)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000310
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000311 Removed the first occurrence of *value*. If not found, raises a
312 :exc:`ValueError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000313
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000314 .. versionadded:: 2.5
Georg Brandl8ec7f652007-08-15 14:28:01 +0000315
Raymond Hettingera5fd24e2009-12-10 06:42:54 +0000316 .. method:: reverse()
317
318 Reverse the elements of the deque in-place and then return ``None``.
319
320 .. versionadded:: 2.7
Georg Brandl8ec7f652007-08-15 14:28:01 +0000321
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000322 .. method:: rotate(n)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000323
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000324 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
325 the left. Rotating one step to the right is equivalent to:
326 ``d.appendleft(d.pop())``.
327
Georg Brandl8ec7f652007-08-15 14:28:01 +0000328
Raymond Hettinger56411aa2009-03-10 12:50:59 +0000329 Deque objects also provide one read-only attribute:
330
331 .. attribute:: maxlen
332
333 Maximum size of a deque or *None* if unbounded.
334
335 .. versionadded:: 2.7
336
337
Georg Brandl8ec7f652007-08-15 14:28:01 +0000338In addition to the above, deques support iteration, pickling, ``len(d)``,
339``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson5c4e0062008-10-16 18:52:14 +0000340the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
341access is O(1) at both ends but slows to O(n) in the middle. For fast random
342access, use lists instead.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000343
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000344Example:
345
346.. doctest::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000347
348 >>> from collections import deque
349 >>> d = deque('ghi') # make a new deque with three items
350 >>> for elem in d: # iterate over the deque's elements
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000351 ... print elem.upper()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000352 G
353 H
354 I
355
356 >>> d.append('j') # add a new entry to the right side
357 >>> d.appendleft('f') # add a new entry to the left side
358 >>> d # show the representation of the deque
359 deque(['f', 'g', 'h', 'i', 'j'])
360
361 >>> d.pop() # return and remove the rightmost item
362 'j'
363 >>> d.popleft() # return and remove the leftmost item
364 'f'
365 >>> list(d) # list the contents of the deque
366 ['g', 'h', 'i']
367 >>> d[0] # peek at leftmost item
368 'g'
369 >>> d[-1] # peek at rightmost item
370 'i'
371
372 >>> list(reversed(d)) # list the contents of a deque in reverse
373 ['i', 'h', 'g']
374 >>> 'h' in d # search the deque
375 True
376 >>> d.extend('jkl') # add multiple elements at once
377 >>> d
378 deque(['g', 'h', 'i', 'j', 'k', 'l'])
379 >>> d.rotate(1) # right rotation
380 >>> d
381 deque(['l', 'g', 'h', 'i', 'j', 'k'])
382 >>> d.rotate(-1) # left rotation
383 >>> d
384 deque(['g', 'h', 'i', 'j', 'k', 'l'])
385
386 >>> deque(reversed(d)) # make a new deque in reverse order
387 deque(['l', 'k', 'j', 'i', 'h', 'g'])
388 >>> d.clear() # empty the deque
389 >>> d.pop() # cannot pop from an empty deque
390 Traceback (most recent call last):
391 File "<pyshell#6>", line 1, in -toplevel-
392 d.pop()
393 IndexError: pop from an empty deque
394
395 >>> d.extendleft('abc') # extendleft() reverses the input order
396 >>> d
397 deque(['c', 'b', 'a'])
398
399
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000400:class:`deque` Recipes
401^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl8ec7f652007-08-15 14:28:01 +0000402
403This section shows various approaches to working with deques.
404
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000405Bounded length deques provide functionality similar to the ``tail`` filter
406in Unix::
407
408 def tail(filename, n=10):
409 'Return the last n lines of a file'
410 return deque(open(filename), n)
411
412Another approach to using deques is to maintain a sequence of recently
413added elements by appending to the right and popping to the left::
414
415 def moving_average(iterable, n=3):
416 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
417 # http://en.wikipedia.org/wiki/Moving_average
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000418 it = iter(iterable)
Raymond Hettinger9b6f13e2009-05-22 01:06:44 +0000419 d = deque(itertools.islice(it, n-1))
420 d.appendleft(0)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000421 s = sum(d)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000422 for elem in it:
423 s += elem - d.popleft()
424 d.append(elem)
Raymond Hettinger9b6f13e2009-05-22 01:06:44 +0000425 yield s / float(n)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000426
Georg Brandl8ec7f652007-08-15 14:28:01 +0000427The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti062d2b52009-12-19 22:41:49 +0000428deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl8ec7f652007-08-15 14:28:01 +0000429the :meth:`rotate` method to position elements to be popped::
430
431 def delete_nth(d, n):
432 d.rotate(-n)
433 d.popleft()
434 d.rotate(n)
435
436To implement :class:`deque` slicing, use a similar approach applying
437:meth:`rotate` to bring a target element to the left side of the deque. Remove
438old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
439reverse the rotation.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000440With minor variations on that approach, it is easy to implement Forth style
441stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
442``rot``, and ``roll``.
443
Georg Brandl8ec7f652007-08-15 14:28:01 +0000444
445:class:`defaultdict` objects
446----------------------------
447
Georg Brandl8ec7f652007-08-15 14:28:01 +0000448.. class:: defaultdict([default_factory[, ...]])
449
450 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandld7d4fd72009-07-26 14:37:28 +0000451 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl8ec7f652007-08-15 14:28:01 +0000452 instance variable. The remaining functionality is the same as for the
453 :class:`dict` class and is not documented here.
454
455 The first argument provides the initial value for the :attr:`default_factory`
456 attribute; it defaults to ``None``. All remaining arguments are treated the same
457 as if they were passed to the :class:`dict` constructor, including keyword
458 arguments.
459
460 .. versionadded:: 2.5
461
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000462 :class:`defaultdict` objects support the following method in addition to the
463 standard :class:`dict` operations:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000464
465
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000466 .. method:: defaultdict.__missing__(key)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000467
Skip Montanarob40890d2008-09-17 11:50:36 +0000468 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000469 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000470
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000471 If :attr:`default_factory` is not ``None``, it is called without arguments
472 to provide a default value for the given *key*, this value is inserted in
473 the dictionary for the *key*, and returned.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000474
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000475 If calling :attr:`default_factory` raises an exception this exception is
476 propagated unchanged.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000477
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000478 This method is called by the :meth:`__getitem__` method of the
479 :class:`dict` class when the requested key is not found; whatever it
480 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000481
482
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000483 :class:`defaultdict` objects support the following instance variable:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000484
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000485
486 .. attribute:: defaultdict.default_factory
487
488 This attribute is used by the :meth:`__missing__` method; it is
489 initialized from the first argument to the constructor, if present, or to
490 ``None``, if absent.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000491
492
Georg Brandl8ec7f652007-08-15 14:28:01 +0000493:class:`defaultdict` Examples
494^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
495
496Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000497sequence of key-value pairs into a dictionary of lists:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000498
499 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
500 >>> d = defaultdict(list)
501 >>> for k, v in s:
502 ... d[k].append(v)
503 ...
504 >>> d.items()
505 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
506
507When each key is encountered for the first time, it is not already in the
508mapping; so an entry is automatically created using the :attr:`default_factory`
509function which returns an empty :class:`list`. The :meth:`list.append`
510operation then attaches the value to the new list. When keys are encountered
511again, the look-up proceeds normally (returning the list for that key) and the
512:meth:`list.append` operation adds another value to the list. This technique is
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000513simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000514
515 >>> d = {}
516 >>> for k, v in s:
517 ... d.setdefault(k, []).append(v)
518 ...
519 >>> d.items()
520 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
521
522Setting the :attr:`default_factory` to :class:`int` makes the
523:class:`defaultdict` useful for counting (like a bag or multiset in other
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000524languages):
Georg Brandl8ec7f652007-08-15 14:28:01 +0000525
526 >>> s = 'mississippi'
527 >>> d = defaultdict(int)
528 >>> for k in s:
529 ... d[k] += 1
530 ...
531 >>> d.items()
532 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
533
534When a letter is first encountered, it is missing from the mapping, so the
535:attr:`default_factory` function calls :func:`int` to supply a default count of
536zero. The increment operation then builds up the count for each letter.
537
538The function :func:`int` which always returns zero is just a special case of
539constant functions. A faster and more flexible way to create constant functions
540is to use :func:`itertools.repeat` which can supply any constant value (not just
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000541zero):
Georg Brandl8ec7f652007-08-15 14:28:01 +0000542
543 >>> def constant_factory(value):
544 ... return itertools.repeat(value).next
545 >>> d = defaultdict(constant_factory('<missing>'))
546 >>> d.update(name='John', action='ran')
547 >>> '%(name)s %(action)s to %(object)s' % d
548 'John ran to <missing>'
549
550Setting the :attr:`default_factory` to :class:`set` makes the
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000551:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000552
553 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
554 >>> d = defaultdict(set)
555 >>> for k, v in s:
556 ... d[k].add(v)
557 ...
558 >>> d.items()
559 [('blue', set([2, 4])), ('red', set([1, 3]))]
560
561
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000562:func:`namedtuple` Factory Function for Tuples with Named Fields
Georg Brandlb3255ed2008-01-07 16:43:47 +0000563----------------------------------------------------------------
Georg Brandl8ec7f652007-08-15 14:28:01 +0000564
Raymond Hettingercbab5942007-09-18 22:18:02 +0000565Named tuples assign meaning to each position in a tuple and allow for more readable,
566self-documenting code. They can be used wherever regular tuples are used, and
567they add the ability to access fields by name instead of position index.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000568
Raymond Hettinger322daea2009-02-10 01:24:05 +0000569.. function:: namedtuple(typename, field_names, [verbose], [rename])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000570
571 Returns a new tuple subclass named *typename*. The new subclass is used to
Georg Brandl907a7202008-02-22 12:31:45 +0000572 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl8ec7f652007-08-15 14:28:01 +0000573 well as being indexable and iterable. Instances of the subclass also have a
Georg Brandl061d2e22008-11-23 19:17:25 +0000574 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl8ec7f652007-08-15 14:28:01 +0000575 method which lists the tuple contents in a ``name=value`` format.
576
Georg Brandl061d2e22008-11-23 19:17:25 +0000577 The *field_names* are a single string with each fieldname separated by whitespace
578 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000579 can be a sequence of strings such as ``['x', 'y']``.
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000580
581 Any valid Python identifier may be used for a fieldname except for names
Raymond Hettinger42da8742007-12-14 02:49:47 +0000582 starting with an underscore. Valid identifiers consist of letters, digits,
583 and underscores but do not start with a digit or underscore and cannot be
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000584 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*, *print*,
585 or *raise*.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000586
Raymond Hettinger322daea2009-02-10 01:24:05 +0000587 If *rename* is true, invalid fieldnames are automatically replaced
588 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger6df48a32009-04-02 22:34:17 +0000589 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Raymond Hettinger322daea2009-02-10 01:24:05 +0000590 ``def`` and the duplicate fieldname ``abc``.
591
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000592 If *verbose* is true, the class definition is printed just before being built.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000593
Raymond Hettingera48a2992007-10-08 21:26:58 +0000594 Named tuple instances do not have per-instance dictionaries, so they are
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000595 lightweight and require no more memory than regular tuples.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000596
Georg Brandl8ec7f652007-08-15 14:28:01 +0000597 .. versionadded:: 2.6
598
Raymond Hettinger322daea2009-02-10 01:24:05 +0000599 .. versionchanged:: 2.7
600 added support for *rename*.
601
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000602Example:
603
604.. doctest::
605 :options: +NORMALIZE_WHITESPACE
Georg Brandl8ec7f652007-08-15 14:28:01 +0000606
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000607 >>> Point = namedtuple('Point', 'x y', verbose=True)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000608 class Point(tuple):
609 'Point(x, y)'
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000610 <BLANKLINE>
Raymond Hettingercbab5942007-09-18 22:18:02 +0000611 __slots__ = ()
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000612 <BLANKLINE>
Raymond Hettingere0734e72008-01-04 03:22:53 +0000613 _fields = ('x', 'y')
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000614 <BLANKLINE>
Raymond Hettingera68cad12009-05-27 02:24:45 +0000615 def __new__(_cls, x, y):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000616 'Create a new instance of Point(x, y)'
Raymond Hettingera68cad12009-05-27 02:24:45 +0000617 return _tuple.__new__(_cls, (x, y))
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000618 <BLANKLINE>
Raymond Hettinger02740f72008-01-05 01:35:43 +0000619 @classmethod
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000620 def _make(cls, iterable, new=tuple.__new__, len=len):
Raymond Hettinger02740f72008-01-05 01:35:43 +0000621 'Make a new Point object from a sequence or iterable'
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000622 result = new(cls, iterable)
Raymond Hettinger02740f72008-01-05 01:35:43 +0000623 if len(result) != 2:
624 raise TypeError('Expected 2 arguments, got %d' % len(result))
625 return result
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000626 <BLANKLINE>
Raymond Hettingercbab5942007-09-18 22:18:02 +0000627 def __repr__(self):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000628 'Return a nicely formatted representation string'
Raymond Hettingercbab5942007-09-18 22:18:02 +0000629 return 'Point(x=%r, y=%r)' % self
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000630 <BLANKLINE>
Raymond Hettinger88a91642009-03-03 04:51:24 +0000631 def _asdict(self):
632 'Return a new OrderedDict which maps field names to their values'
633 return OrderedDict(zip(self._fields, self))
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000634 <BLANKLINE>
Raymond Hettingera68cad12009-05-27 02:24:45 +0000635 def _replace(_self, **kwds):
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000636 'Return a new Point object replacing specified fields with new values'
Raymond Hettingera68cad12009-05-27 02:24:45 +0000637 result = _self._make(map(kwds.pop, ('x', 'y'), _self))
Raymond Hettinger1b50fd72008-01-05 02:17:24 +0000638 if kwds:
639 raise ValueError('Got unexpected field names: %r' % kwds.keys())
640 return result
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000641 <BLANKLINE>
642 def __getnewargs__(self):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000643 'Return self as a plain tuple. Used by copy and pickle.'
Raymond Hettingeree51cff2008-06-27 21:34:24 +0000644 return tuple(self)
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000645 <BLANKLINE>
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000646 x = _property(_itemgetter(0), doc='Alias for field number 0')
647 y = _property(_itemgetter(1), doc='Alias for field number 1')
Georg Brandl8ec7f652007-08-15 14:28:01 +0000648
Raymond Hettingercbab5942007-09-18 22:18:02 +0000649 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Raymond Hettinger88880b22007-12-18 00:13:45 +0000650 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000651 33
652 >>> x, y = p # unpack like a regular tuple
653 >>> x, y
654 (11, 22)
Georg Brandl907a7202008-02-22 12:31:45 +0000655 >>> p.x + p.y # fields also accessible by name
Raymond Hettingercbab5942007-09-18 22:18:02 +0000656 33
657 >>> p # readable __repr__ with a name=value style
658 Point(x=11, y=22)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000659
Raymond Hettingercbab5942007-09-18 22:18:02 +0000660Named tuples are especially useful for assigning field names to result tuples returned
661by the :mod:`csv` or :mod:`sqlite3` modules::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000662
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000663 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Raymond Hettingera48a2992007-10-08 21:26:58 +0000664
Raymond Hettingercbab5942007-09-18 22:18:02 +0000665 import csv
Raymond Hettinger02740f72008-01-05 01:35:43 +0000666 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Raymond Hettingercbab5942007-09-18 22:18:02 +0000667 print emp.name, emp.title
Georg Brandl8ec7f652007-08-15 14:28:01 +0000668
Raymond Hettingera48a2992007-10-08 21:26:58 +0000669 import sqlite3
670 conn = sqlite3.connect('/companydata')
671 cursor = conn.cursor()
672 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Raymond Hettinger02740f72008-01-05 01:35:43 +0000673 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Raymond Hettingera48a2992007-10-08 21:26:58 +0000674 print emp.name, emp.title
675
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000676In addition to the methods inherited from tuples, named tuples support
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000677three additional methods and one attribute. To prevent conflicts with
678field names, the method and attribute names start with an underscore.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000679
Benjamin Peterson682f6032010-07-18 14:26:34 +0000680.. classmethod:: somenamedtuple._make(iterable)
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000681
Raymond Hettinger02740f72008-01-05 01:35:43 +0000682 Class method that makes a new instance from an existing sequence or iterable.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000683
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000684 .. doctest::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000685
Raymond Hettinger02740f72008-01-05 01:35:43 +0000686 >>> t = [11, 22]
687 >>> Point._make(t)
688 Point(x=11, y=22)
Raymond Hettinger2b03d452007-09-18 03:33:19 +0000689
Georg Brandlb3255ed2008-01-07 16:43:47 +0000690.. method:: somenamedtuple._asdict()
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000691
Raymond Hettinger88a91642009-03-03 04:51:24 +0000692 Return a new :class:`OrderedDict` which maps field names to their corresponding
693 values::
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000694
Raymond Hettinger42da8742007-12-14 02:49:47 +0000695 >>> p._asdict()
Raymond Hettinger88a91642009-03-03 04:51:24 +0000696 OrderedDict([('x', 11), ('y', 22)])
697
Raymond Hettingera07038d2009-03-03 05:11:56 +0000698 .. versionchanged:: 2.7
Raymond Hettinger88a91642009-03-03 04:51:24 +0000699 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000700
Georg Brandlb3255ed2008-01-07 16:43:47 +0000701.. method:: somenamedtuple._replace(kwargs)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000702
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000703 Return a new instance of the named tuple replacing specified fields with new
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000704 values::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000705
Raymond Hettingercbab5942007-09-18 22:18:02 +0000706 >>> p = Point(x=11, y=22)
Raymond Hettinger42da8742007-12-14 02:49:47 +0000707 >>> p._replace(x=33)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000708 Point(x=33, y=22)
709
Raymond Hettinger7c3738e2007-11-15 03:16:09 +0000710 >>> for partnum, record in inventory.items():
Raymond Hettingere11230e2008-01-09 03:02:23 +0000711 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000712
Georg Brandlb3255ed2008-01-07 16:43:47 +0000713.. attribute:: somenamedtuple._fields
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000714
Raymond Hettingerf6b769b2008-01-07 21:33:51 +0000715 Tuple of strings listing the field names. Useful for introspection
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000716 and for creating new named tuple types from existing named tuples.
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000717
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000718 .. doctest::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000719
Raymond Hettinger42da8742007-12-14 02:49:47 +0000720 >>> p._fields # view the field names
Raymond Hettingercbab5942007-09-18 22:18:02 +0000721 ('x', 'y')
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000722
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000723 >>> Color = namedtuple('Color', 'red green blue')
Raymond Hettinger42da8742007-12-14 02:49:47 +0000724 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000725 >>> Pixel(11, 22, 128, 255, 0)
Raymond Hettingerdc1854d2008-01-09 03:13:20 +0000726 Pixel(x=11, y=22, red=128, green=255, blue=0)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000727
Raymond Hettingere846f382007-12-14 21:51:50 +0000728To retrieve a field whose name is stored in a string, use the :func:`getattr`
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000729function:
Raymond Hettingere846f382007-12-14 21:51:50 +0000730
731 >>> getattr(p, 'x')
732 11
733
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000734To convert a dictionary to a named tuple, use the double-star-operator
735(as described in :ref:`tut-unpacking-arguments`):
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000736
737 >>> d = {'x': 11, 'y': 22}
738 >>> Point(**d)
739 Point(x=11, y=22)
740
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000741Since a named tuple is a regular Python class, it is easy to add or change
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000742functionality with a subclass. Here is how to add a calculated field and
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000743a fixed-width print format:
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000744
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000745 >>> class Point(namedtuple('Point', 'x y')):
Raymond Hettingere1655082008-01-10 19:15:10 +0000746 ... __slots__ = ()
Raymond Hettingere11230e2008-01-09 03:02:23 +0000747 ... @property
748 ... def hypot(self):
749 ... return (self.x ** 2 + self.y ** 2) ** 0.5
750 ... def __str__(self):
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000751 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000752
Raymond Hettingere1655082008-01-10 19:15:10 +0000753 >>> for p in Point(3, 4), Point(14, 5/7.):
Raymond Hettingere11230e2008-01-09 03:02:23 +0000754 ... print p
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000755 Point: x= 3.000 y= 4.000 hypot= 5.000
756 Point: x=14.000 y= 0.714 hypot=14.018
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000757
Georg Brandlfe8df4f2009-12-28 08:01:59 +0000758The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Raymond Hettinger171f3912008-01-16 23:38:16 +0000759keep memory requirements low by preventing the creation of instance dictionaries.
Raymond Hettingerf59e9622008-01-15 20:52:42 +0000760
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000761Subclassing is not useful for adding new, stored fields. Instead, simply
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000762create a new named tuple type from the :attr:`_fields` attribute:
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000763
Raymond Hettingere850c462008-01-10 20:37:12 +0000764 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000765
Raymond Hettingerfb3ced62008-01-07 20:17:35 +0000766Default values can be implemented by using :meth:`_replace` to
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000767customize a prototype instance:
Raymond Hettingerbc693492007-11-15 22:39:34 +0000768
769 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Raymond Hettinger0fe6ca42008-01-18 21:14:58 +0000770 >>> default_account = Account('<owner name>', 0.0, 0)
771 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerbc693492007-11-15 22:39:34 +0000772
Raymond Hettinger5a9fed72008-05-08 07:23:30 +0000773Enumerated constants can be implemented with named tuples, but it is simpler
774and more efficient to use a simple class declaration:
775
776 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
777 >>> Status.open, Status.pending, Status.closed
778 (0, 1, 2)
779 >>> class Status:
780 ... open, pending, closed = range(3)
781
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000782.. seealso::
Mark Summerfield7f626f42007-08-30 15:03:03 +0000783
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000784 `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
785 adapted for Python 2.4.
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000786
787
788:class:`OrderedDict` objects
789----------------------------
790
791Ordered dictionaries are just like regular dictionaries but they remember the
792order that items were inserted. When iterating over an ordered dictionary,
793the items are returned in the order their keys were first added.
794
795.. class:: OrderedDict([items])
796
797 Return an instance of a dict subclass, supporting the usual :class:`dict`
798 methods. An *OrderedDict* is a dict that remembers the order that keys
799 were first inserted. If a new entry overwrites an existing entry, the
800 original insertion position is left unchanged. Deleting an entry and
801 reinserting it will move it to the end.
802
803 .. versionadded:: 2.7
804
Raymond Hettinger24122992009-03-19 19:59:58 +0000805.. method:: OrderedDict.popitem(last=True)
806
807 The :meth:`popitem` method for ordered dictionaries returns and removes
808 a (key, value) pair. The pairs are returned in LIFO order if *last* is
809 true or FIFO order if false.
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000810
Raymond Hettinger50f362f2009-05-19 17:43:59 +0000811In addition to the usual mapping methods, ordered dictionaries also support
812reverse iteration using :func:`reversed`.
813
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000814Equality tests between :class:`OrderedDict` objects are order-sensitive
815and are implemented as ``list(od1.items())==list(od2.items())``.
816Equality tests between :class:`OrderedDict` objects and other
817:class:`Mapping` objects are order-insensitive like regular dictionaries.
818This allows :class:`OrderedDict` objects to be substituted anywhere a
819regular dictionary is used.
Raymond Hettinger24122992009-03-19 19:59:58 +0000820
Raymond Hettingerc473c5a2009-04-09 22:31:51 +0000821The :class:`OrderedDict` constructor and :meth:`update` method both accept
822keyword arguments, but their order is lost because Python's function call
823semantics pass-in keyword arguments using a regular unordered dictionary.
824
Raymond Hettinger24122992009-03-19 19:59:58 +0000825.. seealso::
826
827 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
828 that runs on Python 2.4 or later.
Raymond Hettinger610326d2009-11-10 19:35:55 +0000829
830Since an ordered dictionary remembers its insertion order, it can be used
831in conjuction with sorting to make a sorted dictionary::
832
833 >>> # regular unsorted dictionary
834 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
835
836 >>> # dictionary sorted by key
837 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
838 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
839
840 >>> # dictionary sorted by value
841 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
842 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
843
844 >>> # dictionary sorted by length of the key string
845 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
846 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
847
848The new sorted dictionaries maintain their sort order when entries
849are deleted. But when new keys are added, the keys are appended
850to the end and the sort is not maintained.
Georg Brandlad8ac862010-08-01 19:21:26 +0000851
852It is also straight-forward to create an ordered dictionary variant
853that the remembers the order the keys were *last* inserted.
854If a new entry overwrites an existing entry, the
855original insertion position is changed and moved to the end::
856
857 class LastUpdatedOrderedDict(OrderedDict):
858 'Store items is the order the keys were last added'
859 def __setitem__(self, key, value):
860 if key in self:
861 del self[key]
862 OrderedDict.__setitem__(self, key, value)
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000863
864
865ABCs - abstract base classes
866----------------------------
867
868The collections module offers the following ABCs:
869
870========================= ===================== ====================== ====================================================
871ABC Inherits Abstract Methods Mixin Methods
872========================= ===================== ====================== ====================================================
873:class:`Container` ``__contains__``
874:class:`Hashable` ``__hash__``
875:class:`Iterable` ``__iter__``
876:class:`Iterator` :class:`Iterable` ``next`` ``__iter__``
877:class:`Sized` ``__len__``
878:class:`Callable` ``__call__``
879
880:class:`Sequence` :class:`Sized`, ``__getitem__`` ``__contains__``. ``__iter__``, ``__reversed__``.
881 :class:`Iterable`, ``index``, and ``count``
882 :class:`Container`
883
884:class:`MutableSequence` :class:`Sequence` ``__setitem__`` Inherited Sequence methods and
885 ``__delitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
886 and ``insert`` ``remove``, and ``__iadd__``
887
888:class:`Set` :class:`Sized`, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
889 :class:`Iterable`, ``__gt__``, ``__ge__``, ``__and__``, ``__or__``
890 :class:`Container` ``__sub__``, ``__xor__``, and ``isdisjoint``
891
892:class:`MutableSet` :class:`Set` ``add`` and Inherited Set methods and
893 ``discard`` ``clear``, ``pop``, ``remove``, ``__ior__``,
894 ``__iand__``, ``__ixor__``, and ``__isub__``
895
896:class:`Mapping` :class:`Sized`, ``__getitem__`` ``__contains__``, ``keys``, ``items``, ``values``,
897 :class:`Iterable`, ``get``, ``__eq__``, and ``__ne__``
898 :class:`Container`
899
900:class:`MutableMapping` :class:`Mapping` ``__setitem__`` and Inherited Mapping methods and
901 ``__delitem__`` ``pop``, ``popitem``, ``clear``, ``update``,
902 and ``setdefault``
903
904
905:class:`MappingView` :class:`Sized` ``__len__``
906:class:`KeysView` :class:`MappingView`, ``__contains__``,
907 :class:`Set` ``__iter__``
908:class:`ItemsView` :class:`MappingView`, ``__contains__``,
909 :class:`Set` ``__iter__``
910:class:`ValuesView` :class:`MappingView` ``__contains__``, ``__iter__``
911========================= ===================== ====================== ====================================================
912
913These ABCs allow us to ask classes or instances if they provide
914particular functionality, for example::
915
916 size = None
917 if isinstance(myvar, collections.Sized):
918 size = len(myvar)
919
920Several of the ABCs are also useful as mixins that make it easier to develop
921classes supporting container APIs. For example, to write a class supporting
922the full :class:`Set` API, it only necessary to supply the three underlying
923abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
924The ABC supplies the remaining methods such as :meth:`__and__` and
925:meth:`isdisjoint` ::
926
927 class ListBasedSet(collections.Set):
928 ''' Alternate set implementation favoring space over speed
929 and not requiring the set elements to be hashable. '''
930 def __init__(self, iterable):
931 self.elements = lst = []
932 for value in iterable:
933 if value not in lst:
934 lst.append(value)
935 def __iter__(self):
936 return iter(self.elements)
937 def __contains__(self, value):
938 return value in self.elements
939 def __len__(self):
940 return len(self.elements)
941
942 s1 = ListBasedSet('abcdef')
943 s2 = ListBasedSet('defghi')
944 overlap = s1 & s2 # The __and__() method is supported automatically
945
946Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
947
948(1)
949 Since some set operations create new sets, the default mixin methods need
950 a way to create new instances from an iterable. The class constructor is
951 assumed to have a signature in the form ``ClassName(iterable)``.
952 That assumption is factored-out to an internal classmethod called
953 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
954 If the :class:`Set` mixin is being used in a class with a different
955 constructor signature, you will need to override :meth:`from_iterable`
956 with a classmethod that can construct new instances from
957 an iterable argument.
958
959(2)
960 To override the comparisons (presumably for speed, as the
961 semantics are fixed), redefine :meth:`__le__` and
962 then the other operations will automatically follow suit.
963
964(3)
965 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
966 for the set; however, :meth:`__hash__` is not defined because not all sets
967 are hashable or immutable. To add set hashabilty using mixins,
968 inherit from both :meth:`Set` and :meth:`Hashable`, then define
969 ``__hash__ = Set._hash``.
970
971.. seealso::
972
973 * `OrderedSet recipe <http://code.activestate.com/recipes/576694/>`_ for an
974 example built on :class:`MutableSet`.
975
976 * For more about ABCs, see the :mod:`abc` module and :pep:`3119`.