blob: abb36dbbc2ef4a9d213b3bbc533a85fe47887cd5 [file] [log] [blame]
Georg Brandl8ec7f652007-08-15 14:28:01 +00001:mod:`collections` --- High-performance container datatypes
2===========================================================
3
4.. module:: collections
5 :synopsis: High-performance datatypes
6.. moduleauthor:: Raymond Hettinger <python@rcn.com>
7.. sectionauthor:: Raymond Hettinger <python@rcn.com>
8
Georg Brandl8ec7f652007-08-15 14:28:01 +00009.. versionadded:: 2.4
10
Georg Brandl4c8bbe62008-03-22 21:06:20 +000011.. testsetup:: *
12
13 from collections import *
14 import itertools
15 __name__ = '<doctest>'
16
Éric Araujo29a0b572011-08-19 02:14:03 +020017**Source code:** :source:`Lib/collections.py` and :source:`Lib/_abcoll.py`
18
19--------------
20
Raymond Hettinger3de8a302010-08-08 00:35:03 +000021This module implements specialized container datatypes providing alternatives to
22Python's general purpose built-in containers, :class:`dict`, :class:`list`,
23:class:`set`, and :class:`tuple`.
Georg Brandl8ec7f652007-08-15 14:28:01 +000024
Raymond Hettingeraa5f4aa2010-11-06 07:18:07 +000025===================== ==================================================================== ===========================
26:func:`namedtuple` factory function for creating tuple subclasses with named fields .. versionadded:: 2.6
27:class:`deque` list-like container with fast appends and pops on either end .. versionadded:: 2.4
28:class:`Counter` dict subclass for counting hashable objects .. versionadded:: 2.7
29:class:`OrderedDict` dict subclass that remembers the order entries were added .. versionadded:: 2.7
30:class:`defaultdict` dict subclass that calls a factory function to supply missing values .. versionadded:: 2.5
31===================== ==================================================================== ===========================
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000032
Raymond Hettingeracdafa82010-11-30 17:50:53 +000033In addition to the concrete container classes, the collections module provides
Éric Araujo569ff912011-08-13 02:47:38 +020034:ref:`abstract base classes <collections-abstract-base-classes>` that can be
35used to test whether a class provides a particular interface, for example,
36whether it is hashable or a mapping.
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000037
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000038
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000039:class:`Counter` objects
40------------------------
41
42A counter tool is provided to support convenient and rapid tallies.
43For example::
44
Raymond Hettinger939a3cc2009-02-04 11:31:30 +000045 >>> # Tally occurrences of words in a list
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000046 >>> cnt = Counter()
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000047 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000048 ... cnt[word] += 1
49 >>> cnt
Raymond Hettingeraaa6e632009-01-13 01:05:03 +000050 Counter({'blue': 3, 'red': 2, 'green': 1})
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000051
Raymond Hettinger939a3cc2009-02-04 11:31:30 +000052 >>> # Find the ten most common words in Hamlet
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000053 >>> import re
54 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger196a0f72009-01-20 12:59:36 +000055 >>> Counter(words).most_common(10)
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000056 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
57 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
58
Raymond Hettinger82783852009-01-13 03:49:43 +000059.. class:: Counter([iterable-or-mapping])
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000060
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000061 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingeraaa6e632009-01-13 01:05:03 +000062 It is an unordered collection where elements are stored as dictionary keys
63 and their counts are stored as dictionary values. Counts are allowed to be
64 any integer value including zero or negative counts. The :class:`Counter`
65 class is similar to bags or multisets in other languages.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000066
Raymond Hettinger82783852009-01-13 03:49:43 +000067 Elements are counted from an *iterable* or initialized from another
Georg Brandlf6dab952009-04-28 21:48:35 +000068 *mapping* (or counter):
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000069
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +000070 >>> c = Counter() # a new, empty counter
71 >>> c = Counter('gallahad') # a new counter from an iterable
72 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
73 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000074
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000075 Counter objects have a dictionary interface except that they return a zero
Georg Brandlf6dab952009-04-28 21:48:35 +000076 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000077
Raymond Hettinger7bdca052009-01-22 05:20:47 +000078 >>> c = Counter(['eggs', 'ham'])
Raymond Hettinger59894122009-01-14 00:15:21 +000079 >>> c['bacon'] # count of a missing element is zero
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000080 0
81
Raymond Hettinger7bdca052009-01-22 05:20:47 +000082 Setting a count to zero does not remove an element from a counter.
83 Use ``del`` to remove it entirely:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000084
Raymond Hettinger7bdca052009-01-22 05:20:47 +000085 >>> c['sausage'] = 0 # counter entry with a zero count
86 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000087
88 .. versionadded:: 2.7
89
90
Ezio Melotti4edfe962010-04-04 06:50:58 +000091 Counter objects support three methods beyond those available for all
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000092 dictionaries:
93
94 .. method:: elements()
95
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +000096 Return an iterator over elements repeating each as many times as its
97 count. Elements are returned in arbitrary order. If an element's count
98 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +000099
Raymond Hettinger196a0f72009-01-20 12:59:36 +0000100 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000101 >>> list(c.elements())
102 ['a', 'a', 'a', 'a', 'b', 'b']
103
104 .. method:: most_common([n])
105
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000106 Return a list of the *n* most common elements and their counts from the
Raymond Hettingerd507afd2009-02-04 10:52:32 +0000107 most common to the least. If *n* is not specified, :func:`most_common`
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000108 returns *all* elements in the counter. Elements with equal counts are
Georg Brandlf6dab952009-04-28 21:48:35 +0000109 ordered arbitrarily:
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000110
111 >>> Counter('abracadabra').most_common(3)
112 [('a', 5), ('r', 2), ('b', 2)]
113
Raymond Hettinger34c35b22010-04-03 10:22:00 +0000114 .. method:: subtract([iterable-or-mapping])
115
116 Elements are subtracted from an *iterable* or from another *mapping*
117 (or counter). Like :meth:`dict.update` but subtracts counts instead
118 of replacing them. Both inputs and outputs may be zero or negative.
119
120 >>> c = Counter(a=4, b=2, c=0, d=-2)
121 >>> d = Counter(a=1, b=2, c=3, d=4)
122 >>> c.subtract(d)
123 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
124
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000125 The usual dictionary methods are available for :class:`Counter` objects
126 except for two which work differently for counters.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000127
128 .. method:: fromkeys(iterable)
129
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000130 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000131
Raymond Hettinger82783852009-01-13 03:49:43 +0000132 .. method:: update([iterable-or-mapping])
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000133
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000134 Elements are counted from an *iterable* or added-in from another
135 *mapping* (or counter). Like :meth:`dict.update` but adds counts
136 instead of replacing them. Also, the *iterable* is expected to be a
137 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000138
Raymond Hettingerfbcf7492009-01-13 08:38:14 +0000139Common patterns for working with :class:`Counter` objects::
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000140
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000141 sum(c.values()) # total of all counts
142 c.clear() # reset all counts
143 list(c) # list unique elements
144 set(c) # convert to a set
145 dict(c) # convert to a regular dictionary
146 c.items() # convert to a list of (elem, cnt) pairs
147 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
148 c.most_common()[:-n:-1] # n least common elements
149 c += Counter() # remove zero and negative counts
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000150
Raymond Hettingera6658532009-02-25 22:48:24 +0000151Several mathematical operations are provided for combining :class:`Counter`
152objects to produce multisets (counters that have counts greater than zero).
153Addition and subtraction combine counters by adding or subtracting the counts
154of corresponding elements. Intersection and union return the minimum and
155maximum of corresponding counts. Each operation can accept inputs with signed
156counts, but the output will exclude results with counts of zero or less.
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000157
Raymond Hettinger4571f342009-01-21 20:31:50 +0000158 >>> c = Counter(a=3, b=1)
159 >>> d = Counter(a=1, b=2)
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000160 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000161 Counter({'a': 4, 'b': 3})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000162 >>> c - d # subtract (keeping only positive counts)
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000163 Counter({'a': 2})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000164 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000165 Counter({'a': 1, 'b': 1})
Raymond Hettinger22bfa9e2009-01-27 02:36:33 +0000166 >>> c | d # union: max(c[x], d[x])
Raymond Hettingerbad1eb22009-01-20 01:19:26 +0000167 Counter({'a': 3, 'b': 2})
168
Raymond Hettinger44340e62010-04-12 21:12:06 +0000169.. note::
170
171 Counters were primarily designed to work with positive integers to represent
172 running counts; however, care was taken to not unnecessarily preclude use
173 cases needing other types or negative values. To help with those use cases,
174 this section documents the minimum range and type restrictions.
175
176 * The :class:`Counter` class itself is a dictionary subclass with no
177 restrictions on its keys and values. The values are intended to be numbers
178 representing counts, but you *could* store anything in the value field.
179
180 * The :meth:`most_common` method requires only that the values be orderable.
181
182 * For in-place operations such as ``c[key] += 1``, the value type need only
183 support addition and subtraction. So fractions, floats, and decimals would
184 work and negative values are supported. The same is also true for
185 :meth:`update` and :meth:`subtract` which allow negative and zero values
186 for both inputs and outputs.
187
188 * The multiset methods are designed only for use cases with positive values.
189 The inputs may be negative or zero, but only outputs with positive values
190 are created. There are no type restrictions, but the value type needs to
Ezio Melotti1e87da12011-10-19 10:39:35 +0300191 support addition, subtraction, and comparison.
Raymond Hettinger44340e62010-04-12 21:12:06 +0000192
193 * The :meth:`elements` method requires integer counts. It ignores zero and
194 negative counts.
195
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000196.. seealso::
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000197
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000198 * `Counter class <http://code.activestate.com/recipes/576611/>`_
199 adapted for Python 2.5 and an early `Bag recipe
200 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
201
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000202 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
203 in Smalltalk.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000204
Éric Araujof06568c2011-08-19 09:00:56 +0200205 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_.
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000206
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000207 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000208 tutorial with examples.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000209
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000210 * For mathematical operations on multisets and their use cases, see
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000211 *Knuth, Donald. The Art of Computer Programming Volume II,
Éric Araujof06568c2011-08-19 09:00:56 +0200212 Section 4.6.3, Exercise 19*.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000213
Raymond Hettinger0a1f7b82009-01-21 23:12:51 +0000214 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd081abc2009-01-27 02:58:49 +0000215 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingeracdc84a2009-01-20 23:42:54 +0000216
Raymond Hettinger7bdca052009-01-22 05:20:47 +0000217 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerfbcf7492009-01-13 08:38:14 +0000218
Raymond Hettingerf94d7fa2009-01-12 22:58:41 +0000219
Georg Brandl8ec7f652007-08-15 14:28:01 +0000220:class:`deque` objects
221----------------------
222
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000223.. class:: deque([iterable[, maxlen]])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000224
225 Returns a new deque object initialized left-to-right (using :meth:`append`) with
226 data from *iterable*. If *iterable* is not specified, the new deque is empty.
227
228 Deques are a generalization of stacks and queues (the name is pronounced "deck"
229 and is short for "double-ended queue"). Deques support thread-safe, memory
230 efficient appends and pops from either side of the deque with approximately the
231 same O(1) performance in either direction.
232
233 Though :class:`list` objects support similar operations, they are optimized for
234 fast fixed-length operations and incur O(n) memory movement costs for
235 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
236 position of the underlying data representation.
237
238 .. versionadded:: 2.4
239
Raymond Hettinger68995862007-10-10 00:26:46 +0000240 If *maxlen* is not specified or is *None*, deques may grow to an
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000241 arbitrary length. Otherwise, the deque is bounded to the specified maximum
242 length. Once a bounded length deque is full, when new items are added, a
243 corresponding number of items are discarded from the opposite end. Bounded
244 length deques provide functionality similar to the ``tail`` filter in
245 Unix. They are also useful for tracking transactions and other pools of data
246 where only the most recent activity is of interest.
247
248 .. versionchanged:: 2.6
Georg Brandlb19be572007-12-29 10:57:00 +0000249 Added *maxlen* parameter.
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000250
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000251 Deque objects support the following methods:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000252
253
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000254 .. method:: append(x)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000255
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000256 Add *x* to the right side of the deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000257
258
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000259 .. method:: appendleft(x)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000260
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000261 Add *x* to the left side of the deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000262
263
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000264 .. method:: clear()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000265
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000266 Remove all elements from the deque leaving it with length 0.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000267
268
Raymond Hettinger5f516ed2010-04-03 18:10:37 +0000269 .. method:: count(x)
270
271 Count the number of deque elements equal to *x*.
272
273 .. versionadded:: 2.7
274
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000275 .. method:: extend(iterable)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000276
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000277 Extend the right side of the deque by appending elements from the iterable
278 argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000279
280
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000281 .. method:: extendleft(iterable)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000282
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000283 Extend the left side of the deque by appending elements from *iterable*.
284 Note, the series of left appends results in reversing the order of
285 elements in the iterable argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000286
287
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000288 .. method:: pop()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000289
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000290 Remove and return an element from the right side of the deque. If no
291 elements are present, raises an :exc:`IndexError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000292
293
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000294 .. method:: popleft()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000295
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000296 Remove and return an element from the left side of the deque. If no
297 elements are present, raises an :exc:`IndexError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000298
299
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000300 .. method:: remove(value)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000301
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000302 Removed the first occurrence of *value*. If not found, raises a
303 :exc:`ValueError`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000304
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000305 .. versionadded:: 2.5
Georg Brandl8ec7f652007-08-15 14:28:01 +0000306
Raymond Hettingera5fd24e2009-12-10 06:42:54 +0000307 .. method:: reverse()
308
309 Reverse the elements of the deque in-place and then return ``None``.
310
311 .. versionadded:: 2.7
Georg Brandl8ec7f652007-08-15 14:28:01 +0000312
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000313 .. method:: rotate(n)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000314
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000315 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
316 the left. Rotating one step to the right is equivalent to:
317 ``d.appendleft(d.pop())``.
318
Georg Brandl8ec7f652007-08-15 14:28:01 +0000319
Raymond Hettinger56411aa2009-03-10 12:50:59 +0000320 Deque objects also provide one read-only attribute:
321
322 .. attribute:: maxlen
323
324 Maximum size of a deque or *None* if unbounded.
325
326 .. versionadded:: 2.7
327
328
Georg Brandl8ec7f652007-08-15 14:28:01 +0000329In addition to the above, deques support iteration, pickling, ``len(d)``,
330``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson5c4e0062008-10-16 18:52:14 +0000331the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
332access is O(1) at both ends but slows to O(n) in the middle. For fast random
333access, use lists instead.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000334
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000335Example:
336
337.. doctest::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000338
339 >>> from collections import deque
340 >>> d = deque('ghi') # make a new deque with three items
341 >>> for elem in d: # iterate over the deque's elements
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000342 ... print elem.upper()
Georg Brandl8ec7f652007-08-15 14:28:01 +0000343 G
344 H
345 I
346
347 >>> d.append('j') # add a new entry to the right side
348 >>> d.appendleft('f') # add a new entry to the left side
349 >>> d # show the representation of the deque
350 deque(['f', 'g', 'h', 'i', 'j'])
351
352 >>> d.pop() # return and remove the rightmost item
353 'j'
354 >>> d.popleft() # return and remove the leftmost item
355 'f'
356 >>> list(d) # list the contents of the deque
357 ['g', 'h', 'i']
358 >>> d[0] # peek at leftmost item
359 'g'
360 >>> d[-1] # peek at rightmost item
361 'i'
362
363 >>> list(reversed(d)) # list the contents of a deque in reverse
364 ['i', 'h', 'g']
365 >>> 'h' in d # search the deque
366 True
367 >>> d.extend('jkl') # add multiple elements at once
368 >>> d
369 deque(['g', 'h', 'i', 'j', 'k', 'l'])
370 >>> d.rotate(1) # right rotation
371 >>> d
372 deque(['l', 'g', 'h', 'i', 'j', 'k'])
373 >>> d.rotate(-1) # left rotation
374 >>> d
375 deque(['g', 'h', 'i', 'j', 'k', 'l'])
376
377 >>> deque(reversed(d)) # make a new deque in reverse order
378 deque(['l', 'k', 'j', 'i', 'h', 'g'])
379 >>> d.clear() # empty the deque
380 >>> d.pop() # cannot pop from an empty deque
381 Traceback (most recent call last):
382 File "<pyshell#6>", line 1, in -toplevel-
383 d.pop()
384 IndexError: pop from an empty deque
385
386 >>> d.extendleft('abc') # extendleft() reverses the input order
387 >>> d
388 deque(['c', 'b', 'a'])
389
390
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000391:class:`deque` Recipes
392^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl8ec7f652007-08-15 14:28:01 +0000393
394This section shows various approaches to working with deques.
395
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000396Bounded length deques provide functionality similar to the ``tail`` filter
397in Unix::
398
399 def tail(filename, n=10):
400 'Return the last n lines of a file'
401 return deque(open(filename), n)
402
403Another approach to using deques is to maintain a sequence of recently
404added elements by appending to the right and popping to the left::
405
406 def moving_average(iterable, n=3):
407 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
408 # http://en.wikipedia.org/wiki/Moving_average
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000409 it = iter(iterable)
Raymond Hettinger9b6f13e2009-05-22 01:06:44 +0000410 d = deque(itertools.islice(it, n-1))
411 d.appendleft(0)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000412 s = sum(d)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000413 for elem in it:
414 s += elem - d.popleft()
415 d.append(elem)
Raymond Hettinger9b6f13e2009-05-22 01:06:44 +0000416 yield s / float(n)
Raymond Hettinger6bc94cb2009-03-31 22:43:03 +0000417
Georg Brandl8ec7f652007-08-15 14:28:01 +0000418The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti062d2b52009-12-19 22:41:49 +0000419deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl8ec7f652007-08-15 14:28:01 +0000420the :meth:`rotate` method to position elements to be popped::
421
422 def delete_nth(d, n):
423 d.rotate(-n)
424 d.popleft()
425 d.rotate(n)
426
427To implement :class:`deque` slicing, use a similar approach applying
428:meth:`rotate` to bring a target element to the left side of the deque. Remove
429old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
430reverse the rotation.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000431With minor variations on that approach, it is easy to implement Forth style
432stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
433``rot``, and ``roll``.
434
Georg Brandl8ec7f652007-08-15 14:28:01 +0000435
436:class:`defaultdict` objects
437----------------------------
438
Georg Brandl8ec7f652007-08-15 14:28:01 +0000439.. class:: defaultdict([default_factory[, ...]])
440
441 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandld7d4fd72009-07-26 14:37:28 +0000442 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl8ec7f652007-08-15 14:28:01 +0000443 instance variable. The remaining functionality is the same as for the
444 :class:`dict` class and is not documented here.
445
446 The first argument provides the initial value for the :attr:`default_factory`
447 attribute; it defaults to ``None``. All remaining arguments are treated the same
448 as if they were passed to the :class:`dict` constructor, including keyword
449 arguments.
450
451 .. versionadded:: 2.5
452
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000453 :class:`defaultdict` objects support the following method in addition to the
454 standard :class:`dict` operations:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000455
Éric Araujof06568c2011-08-19 09:00:56 +0200456 .. method:: __missing__(key)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000457
Skip Montanarob40890d2008-09-17 11:50:36 +0000458 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000459 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000460
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000461 If :attr:`default_factory` is not ``None``, it is called without arguments
462 to provide a default value for the given *key*, this value is inserted in
463 the dictionary for the *key*, and returned.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000464
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000465 If calling :attr:`default_factory` raises an exception this exception is
466 propagated unchanged.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000467
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000468 This method is called by the :meth:`__getitem__` method of the
469 :class:`dict` class when the requested key is not found; whatever it
470 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000471
Benjamin Peterson78028b02012-01-27 09:14:01 -0500472 Note that :meth:`__missing__` is *not* called for any operations besides
473 :meth:`__getitem__`. This means that :meth:`get` will, like normal
474 dictionaries, return ``None`` as a default rather than using
475 :attr:`default_factory`.
476
Georg Brandl8ec7f652007-08-15 14:28:01 +0000477
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000478 :class:`defaultdict` objects support the following instance variable:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000479
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000480
Éric Araujof06568c2011-08-19 09:00:56 +0200481 .. attribute:: default_factory
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000482
483 This attribute is used by the :meth:`__missing__` method; it is
484 initialized from the first argument to the constructor, if present, or to
485 ``None``, if absent.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000486
487
Georg Brandl8ec7f652007-08-15 14:28:01 +0000488:class:`defaultdict` Examples
489^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
490
491Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000492sequence of key-value pairs into a dictionary of lists:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000493
494 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
495 >>> d = defaultdict(list)
496 >>> for k, v in s:
497 ... d[k].append(v)
498 ...
499 >>> d.items()
500 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
501
502When each key is encountered for the first time, it is not already in the
503mapping; so an entry is automatically created using the :attr:`default_factory`
504function which returns an empty :class:`list`. The :meth:`list.append`
505operation then attaches the value to the new list. When keys are encountered
506again, the look-up proceeds normally (returning the list for that key) and the
507:meth:`list.append` operation adds another value to the list. This technique is
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000508simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000509
510 >>> d = {}
511 >>> for k, v in s:
512 ... d.setdefault(k, []).append(v)
513 ...
514 >>> d.items()
515 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
516
517Setting the :attr:`default_factory` to :class:`int` makes the
518:class:`defaultdict` useful for counting (like a bag or multiset in other
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000519languages):
Georg Brandl8ec7f652007-08-15 14:28:01 +0000520
521 >>> s = 'mississippi'
522 >>> d = defaultdict(int)
523 >>> for k in s:
524 ... d[k] += 1
525 ...
526 >>> d.items()
527 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
528
529When a letter is first encountered, it is missing from the mapping, so the
530:attr:`default_factory` function calls :func:`int` to supply a default count of
531zero. The increment operation then builds up the count for each letter.
532
533The function :func:`int` which always returns zero is just a special case of
534constant functions. A faster and more flexible way to create constant functions
535is to use :func:`itertools.repeat` which can supply any constant value (not just
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000536zero):
Georg Brandl8ec7f652007-08-15 14:28:01 +0000537
538 >>> def constant_factory(value):
539 ... return itertools.repeat(value).next
540 >>> d = defaultdict(constant_factory('<missing>'))
541 >>> d.update(name='John', action='ran')
542 >>> '%(name)s %(action)s to %(object)s' % d
543 'John ran to <missing>'
544
545Setting the :attr:`default_factory` to :class:`set` makes the
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000546:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000547
548 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
549 >>> d = defaultdict(set)
550 >>> for k, v in s:
551 ... d[k].add(v)
552 ...
553 >>> d.items()
554 [('blue', set([2, 4])), ('red', set([1, 3]))]
555
556
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000557:func:`namedtuple` Factory Function for Tuples with Named Fields
Georg Brandlb3255ed2008-01-07 16:43:47 +0000558----------------------------------------------------------------
Georg Brandl8ec7f652007-08-15 14:28:01 +0000559
Raymond Hettingercbab5942007-09-18 22:18:02 +0000560Named tuples assign meaning to each position in a tuple and allow for more readable,
561self-documenting code. They can be used wherever regular tuples are used, and
562they add the ability to access fields by name instead of position index.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000563
Georg Brandl27fa4822010-10-17 06:24:10 +0000564.. function:: namedtuple(typename, field_names, [verbose=False], [rename=False])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000565
566 Returns a new tuple subclass named *typename*. The new subclass is used to
Georg Brandl907a7202008-02-22 12:31:45 +0000567 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl8ec7f652007-08-15 14:28:01 +0000568 well as being indexable and iterable. Instances of the subclass also have a
Georg Brandl061d2e22008-11-23 19:17:25 +0000569 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl8ec7f652007-08-15 14:28:01 +0000570 method which lists the tuple contents in a ``name=value`` format.
571
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700572 The *field_names* are a sequence of strings such as ``['x', 'y']``.
573 Alternatively, *field_names* can be a single string with each fieldname
574 separated by whitespace and/or commas, for example ``'x y'`` or ``'x, y'``.
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000575
576 Any valid Python identifier may be used for a fieldname except for names
Raymond Hettinger42da8742007-12-14 02:49:47 +0000577 starting with an underscore. Valid identifiers consist of letters, digits,
578 and underscores but do not start with a digit or underscore and cannot be
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000579 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*, *print*,
580 or *raise*.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000581
Raymond Hettinger322daea2009-02-10 01:24:05 +0000582 If *rename* is true, invalid fieldnames are automatically replaced
583 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger6df48a32009-04-02 22:34:17 +0000584 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Raymond Hettinger322daea2009-02-10 01:24:05 +0000585 ``def`` and the duplicate fieldname ``abc``.
586
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000587 If *verbose* is true, the class definition is printed just before being built.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000588
Raymond Hettingera48a2992007-10-08 21:26:58 +0000589 Named tuple instances do not have per-instance dictionaries, so they are
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000590 lightweight and require no more memory than regular tuples.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000591
Georg Brandl8ec7f652007-08-15 14:28:01 +0000592 .. versionadded:: 2.6
593
Raymond Hettinger322daea2009-02-10 01:24:05 +0000594 .. versionchanged:: 2.7
595 added support for *rename*.
596
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000597Example:
598
599.. doctest::
600 :options: +NORMALIZE_WHITESPACE
Georg Brandl8ec7f652007-08-15 14:28:01 +0000601
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700602 >>> Point = namedtuple('Point', ['x', 'y'], verbose=True)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000603 class Point(tuple):
604 'Point(x, y)'
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000605 <BLANKLINE>
Raymond Hettingercbab5942007-09-18 22:18:02 +0000606 __slots__ = ()
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000607 <BLANKLINE>
Raymond Hettingere0734e72008-01-04 03:22:53 +0000608 _fields = ('x', 'y')
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000609 <BLANKLINE>
Raymond Hettingera68cad12009-05-27 02:24:45 +0000610 def __new__(_cls, x, y):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000611 'Create a new instance of Point(x, y)'
Raymond Hettingera68cad12009-05-27 02:24:45 +0000612 return _tuple.__new__(_cls, (x, y))
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000613 <BLANKLINE>
Raymond Hettinger02740f72008-01-05 01:35:43 +0000614 @classmethod
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000615 def _make(cls, iterable, new=tuple.__new__, len=len):
Raymond Hettinger02740f72008-01-05 01:35:43 +0000616 'Make a new Point object from a sequence or iterable'
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000617 result = new(cls, iterable)
Raymond Hettinger02740f72008-01-05 01:35:43 +0000618 if len(result) != 2:
619 raise TypeError('Expected 2 arguments, got %d' % len(result))
620 return result
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000621 <BLANKLINE>
Raymond Hettingercbab5942007-09-18 22:18:02 +0000622 def __repr__(self):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000623 'Return a nicely formatted representation string'
Raymond Hettingercbab5942007-09-18 22:18:02 +0000624 return 'Point(x=%r, y=%r)' % self
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000625 <BLANKLINE>
Raymond Hettinger88a91642009-03-03 04:51:24 +0000626 def _asdict(self):
627 'Return a new OrderedDict which maps field names to their values'
628 return OrderedDict(zip(self._fields, self))
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000629 <BLANKLINE>
Raymond Hettinger45b08292011-06-02 20:40:35 -0700630 __dict__ = property(_asdict)
631 <BLANKLINE>
632 def _replace(_self, **kwds):
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000633 'Return a new Point object replacing specified fields with new values'
Raymond Hettingera68cad12009-05-27 02:24:45 +0000634 result = _self._make(map(kwds.pop, ('x', 'y'), _self))
Raymond Hettinger1b50fd72008-01-05 02:17:24 +0000635 if kwds:
636 raise ValueError('Got unexpected field names: %r' % kwds.keys())
637 return result
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000638 <BLANKLINE>
639 def __getnewargs__(self):
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000640 'Return self as a plain tuple. Used by copy and pickle.'
Raymond Hettingeree51cff2008-06-27 21:34:24 +0000641 return tuple(self)
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000642 <BLANKLINE>
Raymond Hettinger9bd35082010-03-09 09:01:46 +0000643 x = _property(_itemgetter(0), doc='Alias for field number 0')
644 y = _property(_itemgetter(1), doc='Alias for field number 1')
Georg Brandl8ec7f652007-08-15 14:28:01 +0000645
Raymond Hettingercbab5942007-09-18 22:18:02 +0000646 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Raymond Hettinger88880b22007-12-18 00:13:45 +0000647 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000648 33
649 >>> x, y = p # unpack like a regular tuple
650 >>> x, y
651 (11, 22)
Georg Brandl907a7202008-02-22 12:31:45 +0000652 >>> p.x + p.y # fields also accessible by name
Raymond Hettingercbab5942007-09-18 22:18:02 +0000653 33
654 >>> p # readable __repr__ with a name=value style
655 Point(x=11, y=22)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000656
Raymond Hettingercbab5942007-09-18 22:18:02 +0000657Named tuples are especially useful for assigning field names to result tuples returned
658by the :mod:`csv` or :mod:`sqlite3` modules::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000659
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000660 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Raymond Hettingera48a2992007-10-08 21:26:58 +0000661
Raymond Hettingercbab5942007-09-18 22:18:02 +0000662 import csv
Raymond Hettinger02740f72008-01-05 01:35:43 +0000663 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Raymond Hettingercbab5942007-09-18 22:18:02 +0000664 print emp.name, emp.title
Georg Brandl8ec7f652007-08-15 14:28:01 +0000665
Raymond Hettingera48a2992007-10-08 21:26:58 +0000666 import sqlite3
667 conn = sqlite3.connect('/companydata')
668 cursor = conn.cursor()
669 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Raymond Hettinger02740f72008-01-05 01:35:43 +0000670 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Raymond Hettingera48a2992007-10-08 21:26:58 +0000671 print emp.name, emp.title
672
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000673In addition to the methods inherited from tuples, named tuples support
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000674three additional methods and one attribute. To prevent conflicts with
675field names, the method and attribute names start with an underscore.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000676
Benjamin Peterson682f6032010-07-18 14:26:34 +0000677.. classmethod:: somenamedtuple._make(iterable)
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000678
Raymond Hettinger02740f72008-01-05 01:35:43 +0000679 Class method that makes a new instance from an existing sequence or iterable.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000680
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000681 .. doctest::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000682
Raymond Hettinger02740f72008-01-05 01:35:43 +0000683 >>> t = [11, 22]
684 >>> Point._make(t)
685 Point(x=11, y=22)
Raymond Hettinger2b03d452007-09-18 03:33:19 +0000686
Georg Brandlb3255ed2008-01-07 16:43:47 +0000687.. method:: somenamedtuple._asdict()
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000688
Raymond Hettinger88a91642009-03-03 04:51:24 +0000689 Return a new :class:`OrderedDict` which maps field names to their corresponding
690 values::
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000691
Raymond Hettinger42da8742007-12-14 02:49:47 +0000692 >>> p._asdict()
Raymond Hettinger88a91642009-03-03 04:51:24 +0000693 OrderedDict([('x', 11), ('y', 22)])
694
Raymond Hettingera07038d2009-03-03 05:11:56 +0000695 .. versionchanged:: 2.7
Raymond Hettinger88a91642009-03-03 04:51:24 +0000696 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000697
Georg Brandlb3255ed2008-01-07 16:43:47 +0000698.. method:: somenamedtuple._replace(kwargs)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000699
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000700 Return a new instance of the named tuple replacing specified fields with new
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000701 values::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000702
Raymond Hettingercbab5942007-09-18 22:18:02 +0000703 >>> p = Point(x=11, y=22)
Raymond Hettinger42da8742007-12-14 02:49:47 +0000704 >>> p._replace(x=33)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000705 Point(x=33, y=22)
706
Raymond Hettinger7c3738e2007-11-15 03:16:09 +0000707 >>> for partnum, record in inventory.items():
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700708 inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000709
Georg Brandlb3255ed2008-01-07 16:43:47 +0000710.. attribute:: somenamedtuple._fields
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000711
Raymond Hettingerf6b769b2008-01-07 21:33:51 +0000712 Tuple of strings listing the field names. Useful for introspection
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000713 and for creating new named tuple types from existing named tuples.
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000714
Raymond Hettinger2950bca2009-01-14 01:39:51 +0000715 .. doctest::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000716
Raymond Hettinger42da8742007-12-14 02:49:47 +0000717 >>> p._fields # view the field names
Raymond Hettingercbab5942007-09-18 22:18:02 +0000718 ('x', 'y')
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000719
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000720 >>> Color = namedtuple('Color', 'red green blue')
Raymond Hettinger42da8742007-12-14 02:49:47 +0000721 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000722 >>> Pixel(11, 22, 128, 255, 0)
Raymond Hettingerdc1854d2008-01-09 03:13:20 +0000723 Pixel(x=11, y=22, red=128, green=255, blue=0)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000724
Raymond Hettingere846f382007-12-14 21:51:50 +0000725To retrieve a field whose name is stored in a string, use the :func:`getattr`
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000726function:
Raymond Hettingere846f382007-12-14 21:51:50 +0000727
728 >>> getattr(p, 'x')
729 11
730
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000731To convert a dictionary to a named tuple, use the double-star-operator
732(as described in :ref:`tut-unpacking-arguments`):
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000733
734 >>> d = {'x': 11, 'y': 22}
735 >>> Point(**d)
736 Point(x=11, y=22)
737
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000738Since a named tuple is a regular Python class, it is easy to add or change
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000739functionality with a subclass. Here is how to add a calculated field and
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000740a fixed-width print format:
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000741
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000742 >>> class Point(namedtuple('Point', 'x y')):
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700743 __slots__ = ()
744 @property
745 def hypot(self):
746 return (self.x ** 2 + self.y ** 2) ** 0.5
747 def __str__(self):
748 return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000749
Raymond Hettingere1655082008-01-10 19:15:10 +0000750 >>> for p in Point(3, 4), Point(14, 5/7.):
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700751 print p
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000752 Point: x= 3.000 y= 4.000 hypot= 5.000
753 Point: x=14.000 y= 0.714 hypot=14.018
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000754
Georg Brandlfe8df4f2009-12-28 08:01:59 +0000755The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Raymond Hettinger171f3912008-01-16 23:38:16 +0000756keep memory requirements low by preventing the creation of instance dictionaries.
Raymond Hettingerf59e9622008-01-15 20:52:42 +0000757
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000758Subclassing is not useful for adding new, stored fields. Instead, simply
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000759create a new named tuple type from the :attr:`_fields` attribute:
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000760
Raymond Hettingere850c462008-01-10 20:37:12 +0000761 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000762
Raymond Hettingerfb3ced62008-01-07 20:17:35 +0000763Default values can be implemented by using :meth:`_replace` to
Georg Brandl4c8bbe62008-03-22 21:06:20 +0000764customize a prototype instance:
Raymond Hettingerbc693492007-11-15 22:39:34 +0000765
766 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Raymond Hettinger0fe6ca42008-01-18 21:14:58 +0000767 >>> default_account = Account('<owner name>', 0.0, 0)
768 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerbc693492007-11-15 22:39:34 +0000769
Raymond Hettinger5a9fed72008-05-08 07:23:30 +0000770Enumerated constants can be implemented with named tuples, but it is simpler
771and more efficient to use a simple class declaration:
772
773 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
774 >>> Status.open, Status.pending, Status.closed
775 (0, 1, 2)
776 >>> class Status:
Raymond Hettingerdbc5e122011-03-19 15:09:00 -0700777 open, pending, closed = range(3)
Raymond Hettinger5a9fed72008-05-08 07:23:30 +0000778
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000779.. seealso::
Mark Summerfield7f626f42007-08-30 15:03:03 +0000780
Raymond Hettingere4ae63c2009-02-11 00:06:17 +0000781 `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
782 adapted for Python 2.4.
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000783
784
785:class:`OrderedDict` objects
786----------------------------
787
788Ordered dictionaries are just like regular dictionaries but they remember the
789order that items were inserted. When iterating over an ordered dictionary,
790the items are returned in the order their keys were first added.
791
792.. class:: OrderedDict([items])
793
794 Return an instance of a dict subclass, supporting the usual :class:`dict`
795 methods. An *OrderedDict* is a dict that remembers the order that keys
796 were first inserted. If a new entry overwrites an existing entry, the
797 original insertion position is left unchanged. Deleting an entry and
798 reinserting it will move it to the end.
799
800 .. versionadded:: 2.7
801
Raymond Hettinger24122992009-03-19 19:59:58 +0000802.. method:: OrderedDict.popitem(last=True)
803
804 The :meth:`popitem` method for ordered dictionaries returns and removes
805 a (key, value) pair. The pairs are returned in LIFO order if *last* is
806 true or FIFO order if false.
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000807
Raymond Hettinger50f362f2009-05-19 17:43:59 +0000808In addition to the usual mapping methods, ordered dictionaries also support
809reverse iteration using :func:`reversed`.
810
Raymond Hettingerbc512d32009-03-03 04:45:34 +0000811Equality tests between :class:`OrderedDict` objects are order-sensitive
812and are implemented as ``list(od1.items())==list(od2.items())``.
813Equality tests between :class:`OrderedDict` objects and other
814:class:`Mapping` objects are order-insensitive like regular dictionaries.
815This allows :class:`OrderedDict` objects to be substituted anywhere a
816regular dictionary is used.
Raymond Hettinger24122992009-03-19 19:59:58 +0000817
Raymond Hettingerc473c5a2009-04-09 22:31:51 +0000818The :class:`OrderedDict` constructor and :meth:`update` method both accept
819keyword arguments, but their order is lost because Python's function call
820semantics pass-in keyword arguments using a regular unordered dictionary.
821
Raymond Hettinger24122992009-03-19 19:59:58 +0000822.. seealso::
823
824 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
825 that runs on Python 2.4 or later.
Raymond Hettinger610326d2009-11-10 19:35:55 +0000826
Raymond Hettingerdf453fb2011-04-15 17:55:36 -0700827:class:`OrderedDict` Examples and Recipes
828^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
829
Raymond Hettinger610326d2009-11-10 19:35:55 +0000830Since an ordered dictionary remembers its insertion order, it can be used
831in conjuction with sorting to make a sorted dictionary::
832
833 >>> # regular unsorted dictionary
834 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
835
836 >>> # dictionary sorted by key
837 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
838 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
839
840 >>> # dictionary sorted by value
841 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
842 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
843
844 >>> # dictionary sorted by length of the key string
845 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
846 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
847
848The new sorted dictionaries maintain their sort order when entries
849are deleted. But when new keys are added, the keys are appended
850to the end and the sort is not maintained.
Georg Brandlad8ac862010-08-01 19:21:26 +0000851
852It is also straight-forward to create an ordered dictionary variant
853that the remembers the order the keys were *last* inserted.
854If a new entry overwrites an existing entry, the
855original insertion position is changed and moved to the end::
856
857 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandl28dadd92011-02-25 10:50:32 +0000858 'Store items in the order the keys were last added'
Éric Araujo8fde9502011-07-29 11:34:17 +0200859
Georg Brandlad8ac862010-08-01 19:21:26 +0000860 def __setitem__(self, key, value):
861 if key in self:
862 del self[key]
863 OrderedDict.__setitem__(self, key, value)
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000864
Éric Araujof06568c2011-08-19 09:00:56 +0200865An ordered dictionary can be combined with the :class:`Counter` class
Raymond Hettingerdf453fb2011-04-15 17:55:36 -0700866so that the counter remembers the order elements are first encountered::
867
868 class OrderedCounter(Counter, OrderedDict):
869 'Counter that remembers the order elements are first encountered'
870
Raymond Hettingerdf453fb2011-04-15 17:55:36 -0700871 def __repr__(self):
872 return '%s(%r)' % (self.__class__.__name__, OrderedDict(self))
873
874 def __reduce__(self):
875 return self.__class__, (OrderedDict(self),)
876
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000877
Éric Araujo8fde9502011-07-29 11:34:17 +0200878.. _collections-abstract-base-classes:
Georg Brandl950b3d82010-12-04 19:02:20 +0000879
Éric Araujo8fde9502011-07-29 11:34:17 +0200880Collections Abstract Base Classes
881---------------------------------
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000882
Ezio Melottid1135122011-03-28 13:46:28 +0300883The collections module offers the following :term:`ABCs <abstract base class>`:
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000884
885========================= ===================== ====================== ====================================================
Ezio Melottid1135122011-03-28 13:46:28 +0300886ABC Inherits from Abstract Methods Mixin Methods
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000887========================= ===================== ====================== ====================================================
888:class:`Container` ``__contains__``
889:class:`Hashable` ``__hash__``
890:class:`Iterable` ``__iter__``
891:class:`Iterator` :class:`Iterable` ``next`` ``__iter__``
892:class:`Sized` ``__len__``
893:class:`Callable` ``__call__``
894
Éric Araujo8fde9502011-07-29 11:34:17 +0200895:class:`Sequence` :class:`Sized`, ``__getitem__`` ``__contains__``, ``__iter__``, ``__reversed__``,
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000896 :class:`Iterable`, ``index``, and ``count``
897 :class:`Container`
898
Ezio Melottid1135122011-03-28 13:46:28 +0300899:class:`MutableSequence` :class:`Sequence` ``__setitem__``, Inherited :class:`Sequence` methods and
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000900 ``__delitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
Ezio Melottid1135122011-03-28 13:46:28 +0300901 ``insert`` ``remove``, and ``__iadd__``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000902
903:class:`Set` :class:`Sized`, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
Georg Brandl28046022011-02-25 11:01:04 +0000904 :class:`Iterable`, ``__gt__``, ``__ge__``, ``__and__``, ``__or__``,
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000905 :class:`Container` ``__sub__``, ``__xor__``, and ``isdisjoint``
906
Ezio Melottid1135122011-03-28 13:46:28 +0300907:class:`MutableSet` :class:`Set` ``add``, Inherited :class:`Set` methods and
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000908 ``discard`` ``clear``, ``pop``, ``remove``, ``__ior__``,
909 ``__iand__``, ``__ixor__``, and ``__isub__``
910
911:class:`Mapping` :class:`Sized`, ``__getitem__`` ``__contains__``, ``keys``, ``items``, ``values``,
912 :class:`Iterable`, ``get``, ``__eq__``, and ``__ne__``
913 :class:`Container`
914
Ezio Melottid1135122011-03-28 13:46:28 +0300915:class:`MutableMapping` :class:`Mapping` ``__setitem__``, Inherited :class:`Mapping` methods and
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000916 ``__delitem__`` ``pop``, ``popitem``, ``clear``, ``update``,
917 and ``setdefault``
918
919
920:class:`MappingView` :class:`Sized` ``__len__``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000921:class:`ItemsView` :class:`MappingView`, ``__contains__``,
922 :class:`Set` ``__iter__``
Ezio Melottid1135122011-03-28 13:46:28 +0300923:class:`KeysView` :class:`MappingView`, ``__contains__``,
924 :class:`Set` ``__iter__``
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000925:class:`ValuesView` :class:`MappingView` ``__contains__``, ``__iter__``
926========================= ===================== ====================== ====================================================
927
Ezio Melottid1135122011-03-28 13:46:28 +0300928
929.. class:: Container
930 Hashable
931 Sized
932 Callable
933
934 ABCs for classes that provide respectively the methods :meth:`__contains__`,
935 :meth:`__hash__`, :meth:`__len__`, and :meth:`__call__`.
936
937.. class:: Iterable
938
939 ABC for classes that provide the :meth:`__iter__` method.
940 See also the definition of :term:`iterable`.
941
942.. class:: Iterator
943
944 ABC for classes that provide the :meth:`__iter__` and :meth:`next` methods.
945 See also the definition of :term:`iterator`.
946
947.. class:: Sequence
948 MutableSequence
949
950 ABCs for read-only and mutable :term:`sequences <sequence>`.
951
952.. class:: Set
953 MutableSet
954
955 ABCs for read-only and mutable sets.
956
957.. class:: Mapping
958 MutableMapping
959
960 ABCs for read-only and mutable :term:`mappings <mapping>`.
961
962.. class:: MappingView
963 ItemsView
964 KeysView
965 ValuesView
966
967 ABCs for mapping, items, keys, and values :term:`views <view>`.
968
969
Raymond Hettinger3de8a302010-08-08 00:35:03 +0000970These ABCs allow us to ask classes or instances if they provide
971particular functionality, for example::
972
973 size = None
974 if isinstance(myvar, collections.Sized):
975 size = len(myvar)
976
977Several of the ABCs are also useful as mixins that make it easier to develop
978classes supporting container APIs. For example, to write a class supporting
979the full :class:`Set` API, it only necessary to supply the three underlying
980abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
981The ABC supplies the remaining methods such as :meth:`__and__` and
982:meth:`isdisjoint` ::
983
984 class ListBasedSet(collections.Set):
985 ''' Alternate set implementation favoring space over speed
986 and not requiring the set elements to be hashable. '''
987 def __init__(self, iterable):
988 self.elements = lst = []
989 for value in iterable:
990 if value not in lst:
991 lst.append(value)
992 def __iter__(self):
993 return iter(self.elements)
994 def __contains__(self, value):
995 return value in self.elements
996 def __len__(self):
997 return len(self.elements)
998
999 s1 = ListBasedSet('abcdef')
1000 s2 = ListBasedSet('defghi')
1001 overlap = s1 & s2 # The __and__() method is supported automatically
1002
1003Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
1004
1005(1)
1006 Since some set operations create new sets, the default mixin methods need
1007 a way to create new instances from an iterable. The class constructor is
1008 assumed to have a signature in the form ``ClassName(iterable)``.
1009 That assumption is factored-out to an internal classmethod called
1010 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
1011 If the :class:`Set` mixin is being used in a class with a different
Raymond Hettinger72507302011-03-22 09:06:38 -07001012 constructor signature, you will need to override :meth:`_from_iterable`
Raymond Hettinger3de8a302010-08-08 00:35:03 +00001013 with a classmethod that can construct new instances from
1014 an iterable argument.
1015
1016(2)
1017 To override the comparisons (presumably for speed, as the
1018 semantics are fixed), redefine :meth:`__le__` and
1019 then the other operations will automatically follow suit.
1020
1021(3)
1022 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
1023 for the set; however, :meth:`__hash__` is not defined because not all sets
1024 are hashable or immutable. To add set hashabilty using mixins,
1025 inherit from both :meth:`Set` and :meth:`Hashable`, then define
1026 ``__hash__ = Set._hash``.
1027
1028.. seealso::
1029
1030 * `OrderedSet recipe <http://code.activestate.com/recipes/576694/>`_ for an
1031 example built on :class:`MutableSet`.
1032
1033 * For more about ABCs, see the :mod:`abc` module and :pep:`3119`.