blob: 454817b7aa42a8919fe5f364ad8759413bfbc76b [file] [log] [blame]
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001:mod:`collections` --- Container datatypes
2==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00003
4.. module:: collections
Raymond Hettinger53dbe392008-02-12 20:03:09 +00005 :synopsis: Container datatypes
Georg Brandl116aa622007-08-15 14:28:22 +00006.. moduleauthor:: Raymond Hettinger <python@rcn.com>
7.. sectionauthor:: Raymond Hettinger <python@rcn.com>
8
Christian Heimesfe337bf2008-03-23 21:54:12 +00009.. testsetup:: *
10
11 from collections import *
12 import itertools
13 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000014
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000015This module implements specialized container datatypes providing alternatives to
16Python's general purpose built-in containers, :class:`dict`, :class:`list`,
17:class:`set`, and :class:`tuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000018
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000019===================== ====================================================================
20:func:`namedtuple` factory function for creating tuple subclasses with named fields
21:class:`deque` list-like container with fast appends and pops on either end
22:class:`Counter` dict subclass for counting hashable objects
23:class:`OrderedDict` dict subclass that remembers the order entries were added
24:class:`defaultdict` dict subclass that calls a factory function to supply missing values
25:class:`UserDict` wrapper around dictionary objects for easier dict subclassing
26:class:`UserList` wrapper around list objects for easier list subclassing
27:class:`UserString` wrapper around string objects for easier string subclassing
28===================== ====================================================================
Georg Brandl116aa622007-08-15 14:28:22 +000029
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000030In addition to the concrete container classes, the collections module provides
31ABCs (abstract base classes) that can be used to test whether a class provides a
32particular interface, for example, whether it is hashable or a mapping.
Mark Summerfield08898b42007-09-05 08:43:04 +000033
34
Raymond Hettingerb8baf632009-01-14 02:20:07 +000035:class:`Counter` objects
36------------------------
37
38A counter tool is provided to support convenient and rapid tallies.
39For example::
40
Raymond Hettinger1c62dc92009-02-04 11:41:45 +000041 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +000042 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +000043 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000044 ... cnt[word] += 1
45 >>> cnt
46 Counter({'blue': 3, 'red': 2, 'green': 1})
47
Raymond Hettinger1c62dc92009-02-04 11:41:45 +000048 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +000049 >>> import re
50 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +000051 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +000052 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
53 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
54
55.. class:: Counter([iterable-or-mapping])
56
Raymond Hettinger670eaec2009-01-21 23:14:07 +000057 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +000058 It is an unordered collection where elements are stored as dictionary keys
59 and their counts are stored as dictionary values. Counts are allowed to be
60 any integer value including zero or negative counts. The :class:`Counter`
61 class is similar to bags or multisets in other languages.
62
63 Elements are counted from an *iterable* or initialized from another
Benjamin Peterson25c95f12009-05-08 20:42:26 +000064 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +000065
Raymond Hettinger73662a52009-01-27 02:38:22 +000066 >>> c = Counter() # a new, empty counter
67 >>> c = Counter('gallahad') # a new counter from an iterable
68 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
69 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +000070
Raymond Hettinger670eaec2009-01-21 23:14:07 +000071 Counter objects have a dictionary interface except that they return a zero
Benjamin Peterson25c95f12009-05-08 20:42:26 +000072 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000073
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000074 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +000075 >>> c['bacon'] # count of a missing element is zero
76 0
77
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000078 Setting a count to zero does not remove an element from a counter.
79 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000080
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000081 >>> c['sausage'] = 0 # counter entry with a zero count
82 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +000083
Benjamin Petersond45bf582009-03-02 21:44:54 +000084 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +000085
86
Ezio Melotti0be8b1c2010-04-04 06:53:44 +000087 Counter objects support three methods beyond those available for all
Raymond Hettingerb8baf632009-01-14 02:20:07 +000088 dictionaries:
89
90 .. method:: elements()
91
Raymond Hettinger670eaec2009-01-21 23:14:07 +000092 Return an iterator over elements repeating each as many times as its
93 count. Elements are returned in arbitrary order. If an element's count
94 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +000095
Raymond Hettinger0bae6622009-01-20 13:00:59 +000096 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerb8baf632009-01-14 02:20:07 +000097 >>> list(c.elements())
98 ['a', 'a', 'a', 'a', 'b', 'b']
99
100 .. method:: most_common([n])
101
Raymond Hettinger73662a52009-01-27 02:38:22 +0000102 Return a list of the *n* most common elements and their counts from the
Raymond Hettingerd04fa312009-02-04 19:45:13 +0000103 most common to the least. If *n* is not specified, :func:`most_common`
Raymond Hettinger73662a52009-01-27 02:38:22 +0000104 returns *all* elements in the counter. Elements with equal counts are
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000105 ordered arbitrarily:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000106
107 >>> Counter('abracadabra').most_common(3)
108 [('a', 5), ('r', 2), ('b', 2)]
109
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000110 .. method:: subtract([iterable-or-mapping])
111
112 Elements are subtracted from an *iterable* or from another *mapping*
113 (or counter). Like :meth:`dict.update` but subtracts counts instead
114 of replacing them. Both inputs and outputs may be zero or negative.
115
116 >>> c = Counter(a=4, b=2, c=0, d=-2)
117 >>> d = Counter(a=1, b=2, c=3, d=4)
118 >>> c.subtract(d)
119 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
120
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000121 .. versionadded:: 3.2
122
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000123 The usual dictionary methods are available for :class:`Counter` objects
124 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000125
126 .. method:: fromkeys(iterable)
127
Raymond Hettinger73662a52009-01-27 02:38:22 +0000128 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000129
130 .. method:: update([iterable-or-mapping])
131
Raymond Hettinger73662a52009-01-27 02:38:22 +0000132 Elements are counted from an *iterable* or added-in from another
133 *mapping* (or counter). Like :meth:`dict.update` but adds counts
134 instead of replacing them. Also, the *iterable* is expected to be a
135 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000136
137Common patterns for working with :class:`Counter` objects::
138
Raymond Hettinger73662a52009-01-27 02:38:22 +0000139 sum(c.values()) # total of all counts
140 c.clear() # reset all counts
141 list(c) # list unique elements
142 set(c) # convert to a set
143 dict(c) # convert to a regular dictionary
144 c.items() # convert to a list of (elem, cnt) pairs
145 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
146 c.most_common()[:-n:-1] # n least common elements
147 c += Counter() # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000148
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000149Several mathematical operations are provided for combining :class:`Counter`
150objects to produce multisets (counters that have counts greater than zero).
151Addition and subtraction combine counters by adding or subtracting the counts
152of corresponding elements. Intersection and union return the minimum and
153maximum of corresponding counts. Each operation can accept inputs with signed
154counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000155
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000156 >>> c = Counter(a=3, b=1)
157 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000158 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000159 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000160 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000161 Counter({'a': 2})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000162 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000163 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000164 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000165 Counter({'a': 3, 'b': 2})
166
Raymond Hettinger22f18852010-04-12 21:45:14 +0000167.. note::
168
169 Counters were primarily designed to work with positive integers to represent
170 running counts; however, care was taken to not unnecessarily preclude use
171 cases needing other types or negative values. To help with those use cases,
172 this section documents the minimum range and type restrictions.
173
174 * The :class:`Counter` class itself is a dictionary subclass with no
175 restrictions on its keys and values. The values are intended to be numbers
176 representing counts, but you *could* store anything in the value field.
177
178 * The :meth:`most_common` method requires only that the values be orderable.
179
180 * For in-place operations such as ``c[key] += 1``, the value type need only
181 support addition and subtraction. So fractions, floats, and decimals would
182 work and negative values are supported. The same is also true for
183 :meth:`update` and :meth:`subtract` which allow negative and zero values
184 for both inputs and outputs.
185
186 * The multiset methods are designed only for use cases with positive values.
187 The inputs may be negative or zero, but only outputs with positive values
188 are created. There are no type restrictions, but the value type needs to
189 support support addition, subtraction, and comparison.
190
191 * The :meth:`elements` method requires integer counts. It ignores zero and
192 negative counts.
193
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000194.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000195
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000196 * `Counter class <http://code.activestate.com/recipes/576611/>`_
197 adapted for Python 2.5 and an early `Bag recipe
198 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
199
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000200 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
201 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000202
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000203 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_\.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000204
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000205 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000206 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000207
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000208 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000209 *Knuth, Donald. The Art of Computer Programming Volume II,
210 Section 4.6.3, Exercise 19*\.
211
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000212 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd07d9392009-01-27 04:20:44 +0000213 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000214
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000215 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000216
217
Georg Brandl116aa622007-08-15 14:28:22 +0000218:class:`deque` objects
219----------------------
220
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000221.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000222
223 Returns a new deque object initialized left-to-right (using :meth:`append`) with
224 data from *iterable*. If *iterable* is not specified, the new deque is empty.
225
226 Deques are a generalization of stacks and queues (the name is pronounced "deck"
227 and is short for "double-ended queue"). Deques support thread-safe, memory
228 efficient appends and pops from either side of the deque with approximately the
229 same O(1) performance in either direction.
230
231 Though :class:`list` objects support similar operations, they are optimized for
232 fast fixed-length operations and incur O(n) memory movement costs for
233 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
234 position of the underlying data representation.
235
Georg Brandl116aa622007-08-15 14:28:22 +0000236
Georg Brandl9afde1c2007-11-01 20:32:30 +0000237 If *maxlen* is not specified or is *None*, deques may grow to an
238 arbitrary length. Otherwise, the deque is bounded to the specified maximum
239 length. Once a bounded length deque is full, when new items are added, a
240 corresponding number of items are discarded from the opposite end. Bounded
241 length deques provide functionality similar to the ``tail`` filter in
242 Unix. They are also useful for tracking transactions and other pools of data
243 where only the most recent activity is of interest.
244
Georg Brandl9afde1c2007-11-01 20:32:30 +0000245
Benjamin Petersone41251e2008-04-25 01:59:09 +0000246 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000247
Benjamin Petersone41251e2008-04-25 01:59:09 +0000248 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000249
Benjamin Petersone41251e2008-04-25 01:59:09 +0000250 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000251
252
Benjamin Petersone41251e2008-04-25 01:59:09 +0000253 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000254
Benjamin Petersone41251e2008-04-25 01:59:09 +0000255 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000256
257
Benjamin Petersone41251e2008-04-25 01:59:09 +0000258 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000259
Benjamin Petersone41251e2008-04-25 01:59:09 +0000260 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000261
262
Raymond Hettinger44459de2010-04-03 23:20:46 +0000263 .. method:: count(x)
264
265 Count the number of deque elements equal to *x*.
266
267 .. versionadded:: 3.2
268
Georg Brandl67b21b72010-08-17 15:07:14 +0000269
Benjamin Petersone41251e2008-04-25 01:59:09 +0000270 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000271
Benjamin Petersone41251e2008-04-25 01:59:09 +0000272 Extend the right side of the deque by appending elements from the iterable
273 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000274
275
Benjamin Petersone41251e2008-04-25 01:59:09 +0000276 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000277
Benjamin Petersone41251e2008-04-25 01:59:09 +0000278 Extend the left side of the deque by appending elements from *iterable*.
279 Note, the series of left appends results in reversing the order of
280 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000281
282
Benjamin Petersone41251e2008-04-25 01:59:09 +0000283 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000284
Benjamin Petersone41251e2008-04-25 01:59:09 +0000285 Remove and return an element from the right side of the deque. If no
286 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000287
288
Benjamin Petersone41251e2008-04-25 01:59:09 +0000289 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000290
Benjamin Petersone41251e2008-04-25 01:59:09 +0000291 Remove and return an element from the left side of the deque. If no
292 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000293
294
Benjamin Petersone41251e2008-04-25 01:59:09 +0000295 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000296
Benjamin Petersone41251e2008-04-25 01:59:09 +0000297 Removed the first occurrence of *value*. If not found, raises a
298 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000299
Georg Brandl67b21b72010-08-17 15:07:14 +0000300
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000301 .. method:: reverse()
302
303 Reverse the elements of the deque in-place and then return ``None``.
304
305 .. versionadded:: 3.2
Georg Brandl116aa622007-08-15 14:28:22 +0000306
Georg Brandl67b21b72010-08-17 15:07:14 +0000307
Benjamin Petersone41251e2008-04-25 01:59:09 +0000308 .. method:: rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000309
Benjamin Petersone41251e2008-04-25 01:59:09 +0000310 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
311 the left. Rotating one step to the right is equivalent to:
312 ``d.appendleft(d.pop())``.
313
Georg Brandl116aa622007-08-15 14:28:22 +0000314
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000315 Deque objects also provide one read-only attribute:
316
317 .. attribute:: maxlen
318
319 Maximum size of a deque or *None* if unbounded.
320
Raymond Hettinger150fb9c2009-03-10 22:48:06 +0000321 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000322
323
Georg Brandl116aa622007-08-15 14:28:22 +0000324In addition to the above, deques support iteration, pickling, ``len(d)``,
325``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson206e3072008-10-19 14:07:49 +0000326the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
327access is O(1) at both ends but slows to O(n) in the middle. For fast random
328access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000329
Christian Heimesfe337bf2008-03-23 21:54:12 +0000330Example:
331
332.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000333
334 >>> from collections import deque
335 >>> d = deque('ghi') # make a new deque with three items
336 >>> for elem in d: # iterate over the deque's elements
Neal Norwitz752abd02008-05-13 04:55:24 +0000337 ... print(elem.upper())
Georg Brandl116aa622007-08-15 14:28:22 +0000338 G
339 H
340 I
341
342 >>> d.append('j') # add a new entry to the right side
343 >>> d.appendleft('f') # add a new entry to the left side
344 >>> d # show the representation of the deque
345 deque(['f', 'g', 'h', 'i', 'j'])
346
347 >>> d.pop() # return and remove the rightmost item
348 'j'
349 >>> d.popleft() # return and remove the leftmost item
350 'f'
351 >>> list(d) # list the contents of the deque
352 ['g', 'h', 'i']
353 >>> d[0] # peek at leftmost item
354 'g'
355 >>> d[-1] # peek at rightmost item
356 'i'
357
358 >>> list(reversed(d)) # list the contents of a deque in reverse
359 ['i', 'h', 'g']
360 >>> 'h' in d # search the deque
361 True
362 >>> d.extend('jkl') # add multiple elements at once
363 >>> d
364 deque(['g', 'h', 'i', 'j', 'k', 'l'])
365 >>> d.rotate(1) # right rotation
366 >>> d
367 deque(['l', 'g', 'h', 'i', 'j', 'k'])
368 >>> d.rotate(-1) # left rotation
369 >>> d
370 deque(['g', 'h', 'i', 'j', 'k', 'l'])
371
372 >>> deque(reversed(d)) # make a new deque in reverse order
373 deque(['l', 'k', 'j', 'i', 'h', 'g'])
374 >>> d.clear() # empty the deque
375 >>> d.pop() # cannot pop from an empty deque
376 Traceback (most recent call last):
377 File "<pyshell#6>", line 1, in -toplevel-
378 d.pop()
379 IndexError: pop from an empty deque
380
381 >>> d.extendleft('abc') # extendleft() reverses the input order
382 >>> d
383 deque(['c', 'b', 'a'])
384
385
Georg Brandl9afde1c2007-11-01 20:32:30 +0000386:class:`deque` Recipes
387^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000388
389This section shows various approaches to working with deques.
390
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000391Bounded length deques provide functionality similar to the ``tail`` filter
392in Unix::
393
394 def tail(filename, n=10):
395 'Return the last n lines of a file'
396 return deque(open(filename), n)
397
398Another approach to using deques is to maintain a sequence of recently
399added elements by appending to the right and popping to the left::
400
401 def moving_average(iterable, n=3):
402 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
403 # http://en.wikipedia.org/wiki/Moving_average
404 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000405 d = deque(itertools.islice(it, n-1))
406 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000407 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000408 for elem in it:
409 s += elem - d.popleft()
410 d.append(elem)
411 yield s / n
412
Georg Brandl116aa622007-08-15 14:28:22 +0000413The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti0639d5a2009-12-19 23:26:38 +0000414deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl116aa622007-08-15 14:28:22 +0000415the :meth:`rotate` method to position elements to be popped::
416
417 def delete_nth(d, n):
418 d.rotate(-n)
419 d.popleft()
420 d.rotate(n)
421
422To implement :class:`deque` slicing, use a similar approach applying
423:meth:`rotate` to bring a target element to the left side of the deque. Remove
424old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
425reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000426With minor variations on that approach, it is easy to implement Forth style
427stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
428``rot``, and ``roll``.
429
Georg Brandl116aa622007-08-15 14:28:22 +0000430
431:class:`defaultdict` objects
432----------------------------
433
Georg Brandl116aa622007-08-15 14:28:22 +0000434.. class:: defaultdict([default_factory[, ...]])
435
436 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandl22b34312009-07-26 14:54:51 +0000437 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl116aa622007-08-15 14:28:22 +0000438 instance variable. The remaining functionality is the same as for the
439 :class:`dict` class and is not documented here.
440
441 The first argument provides the initial value for the :attr:`default_factory`
442 attribute; it defaults to ``None``. All remaining arguments are treated the same
443 as if they were passed to the :class:`dict` constructor, including keyword
444 arguments.
445
Georg Brandl116aa622007-08-15 14:28:22 +0000446
Benjamin Petersone41251e2008-04-25 01:59:09 +0000447 :class:`defaultdict` objects support the following method in addition to the
448 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000449
Benjamin Petersond319ad52010-07-18 14:27:02 +0000450 .. method:: __missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000451
Benjamin Peterson5478b472008-09-17 22:25:09 +0000452 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersone41251e2008-04-25 01:59:09 +0000453 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000454
Benjamin Petersone41251e2008-04-25 01:59:09 +0000455 If :attr:`default_factory` is not ``None``, it is called without arguments
456 to provide a default value for the given *key*, this value is inserted in
457 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000458
Benjamin Petersone41251e2008-04-25 01:59:09 +0000459 If calling :attr:`default_factory` raises an exception this exception is
460 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000461
Benjamin Petersone41251e2008-04-25 01:59:09 +0000462 This method is called by the :meth:`__getitem__` method of the
463 :class:`dict` class when the requested key is not found; whatever it
464 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000465
466
Benjamin Petersone41251e2008-04-25 01:59:09 +0000467 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000468
Benjamin Petersone41251e2008-04-25 01:59:09 +0000469
Benjamin Petersond319ad52010-07-18 14:27:02 +0000470 .. attribute:: default_factory
Benjamin Petersone41251e2008-04-25 01:59:09 +0000471
472 This attribute is used by the :meth:`__missing__` method; it is
473 initialized from the first argument to the constructor, if present, or to
474 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000475
476
Georg Brandl116aa622007-08-15 14:28:22 +0000477:class:`defaultdict` Examples
478^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
479
480Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000481sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000482
483 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
484 >>> d = defaultdict(list)
485 >>> for k, v in s:
486 ... d[k].append(v)
487 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000488 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000489 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
490
491When each key is encountered for the first time, it is not already in the
492mapping; so an entry is automatically created using the :attr:`default_factory`
493function which returns an empty :class:`list`. The :meth:`list.append`
494operation then attaches the value to the new list. When keys are encountered
495again, the look-up proceeds normally (returning the list for that key) and the
496:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000497simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000498
499 >>> d = {}
500 >>> for k, v in s:
501 ... d.setdefault(k, []).append(v)
502 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000503 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000504 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
505
506Setting the :attr:`default_factory` to :class:`int` makes the
507:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000508languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000509
510 >>> s = 'mississippi'
511 >>> d = defaultdict(int)
512 >>> for k in s:
513 ... d[k] += 1
514 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000515 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000516 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
517
518When a letter is first encountered, it is missing from the mapping, so the
519:attr:`default_factory` function calls :func:`int` to supply a default count of
520zero. The increment operation then builds up the count for each letter.
521
522The function :func:`int` which always returns zero is just a special case of
523constant functions. A faster and more flexible way to create constant functions
524is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000525zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000526
527 >>> def constant_factory(value):
528 ... return lambda: value
529 >>> d = defaultdict(constant_factory('<missing>'))
530 >>> d.update(name='John', action='ran')
531 >>> '%(name)s %(action)s to %(object)s' % d
532 'John ran to <missing>'
533
534Setting the :attr:`default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000535:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000536
537 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
538 >>> d = defaultdict(set)
539 >>> for k, v in s:
540 ... d[k].add(v)
541 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000542 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000543 [('blue', set([2, 4])), ('red', set([1, 3]))]
544
545
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000546:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000547----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000548
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000549Named tuples assign meaning to each position in a tuple and allow for more readable,
550self-documenting code. They can be used wherever regular tuples are used, and
551they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000552
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000553.. function:: namedtuple(typename, field_names, verbose=False, rename=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000554
555 Returns a new tuple subclass named *typename*. The new subclass is used to
Christian Heimesc3f30c42008-02-22 16:37:40 +0000556 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl116aa622007-08-15 14:28:22 +0000557 well as being indexable and iterable. Instances of the subclass also have a
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000558 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl116aa622007-08-15 14:28:22 +0000559 method which lists the tuple contents in a ``name=value`` format.
560
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000561 The *field_names* are a single string with each fieldname separated by whitespace
562 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
Christian Heimes25bb7832008-01-11 16:17:00 +0000563 can be a sequence of strings such as ``['x', 'y']``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000564
565 Any valid Python identifier may be used for a fieldname except for names
Christian Heimes0449f632007-12-15 01:27:15 +0000566 starting with an underscore. Valid identifiers consist of letters, digits,
567 and underscores but do not start with a digit or underscore and cannot be
Georg Brandlf6945182008-02-01 11:56:49 +0000568 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
Georg Brandl9afde1c2007-11-01 20:32:30 +0000569 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000570
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000571 If *rename* is true, invalid fieldnames are automatically replaced
572 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger85737b82009-04-02 22:37:59 +0000573 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000574 ``def`` and the duplicate fieldname ``abc``.
575
Christian Heimes25bb7832008-01-11 16:17:00 +0000576 If *verbose* is true, the class definition is printed just before being built.
Georg Brandl116aa622007-08-15 14:28:22 +0000577
Georg Brandl9afde1c2007-11-01 20:32:30 +0000578 Named tuple instances do not have per-instance dictionaries, so they are
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000579 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000580
Raymond Hettingerb62ad242009-03-02 22:16:43 +0000581 .. versionchanged:: 3.1
Georg Brandl67b21b72010-08-17 15:07:14 +0000582 Added support for *rename*.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000583
Christian Heimesfe337bf2008-03-23 21:54:12 +0000584Example:
585
586.. doctest::
587 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000588
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000589 >>> Point = namedtuple('Point', 'x y', verbose=True)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000590 class Point(tuple):
591 'Point(x, y)'
Christian Heimesfe337bf2008-03-23 21:54:12 +0000592 <BLANKLINE>
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000593 __slots__ = ()
Christian Heimesfe337bf2008-03-23 21:54:12 +0000594 <BLANKLINE>
Christian Heimesfaf2f632008-01-06 16:59:19 +0000595 _fields = ('x', 'y')
Christian Heimesfe337bf2008-03-23 21:54:12 +0000596 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000597 def __new__(_cls, x, y):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000598 'Create a new instance of Point(x, y)'
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000599 return _tuple.__new__(_cls, (x, y))
Christian Heimesfe337bf2008-03-23 21:54:12 +0000600 <BLANKLINE>
Christian Heimesfaf2f632008-01-06 16:59:19 +0000601 @classmethod
Christian Heimesfe337bf2008-03-23 21:54:12 +0000602 def _make(cls, iterable, new=tuple.__new__, len=len):
Christian Heimesfaf2f632008-01-06 16:59:19 +0000603 'Make a new Point object from a sequence or iterable'
Christian Heimesfe337bf2008-03-23 21:54:12 +0000604 result = new(cls, iterable)
Christian Heimesfaf2f632008-01-06 16:59:19 +0000605 if len(result) != 2:
606 raise TypeError('Expected 2 arguments, got %d' % len(result))
607 return result
Christian Heimesfe337bf2008-03-23 21:54:12 +0000608 <BLANKLINE>
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000609 def __repr__(self):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000610 'Return a nicely formatted representation string'
Raymond Hettingerd331ce92010-08-08 01:13:42 +0000611 return self.__class__.__name__ + '(x=%r, y=%r)' % self
Christian Heimesfe337bf2008-03-23 21:54:12 +0000612 <BLANKLINE>
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000613 def _asdict(self):
614 'Return a new OrderedDict which maps field names to their values'
615 return OrderedDict(zip(self._fields, self))
Christian Heimesfe337bf2008-03-23 21:54:12 +0000616 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000617 def _replace(_self, **kwds):
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000618 'Return a new Point object replacing specified fields with new values'
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000619 result = _self._make(map(kwds.pop, ('x', 'y'), _self))
Christian Heimesfaf2f632008-01-06 16:59:19 +0000620 if kwds:
Ezio Melotti8f7649e2009-09-13 04:48:45 +0000621 raise ValueError('Got unexpected field names: %r' % list(kwds.keys()))
Christian Heimesfaf2f632008-01-06 16:59:19 +0000622 return result
Georg Brandl48310cd2009-01-03 21:18:54 +0000623 <BLANKLINE>
624 def __getnewargs__(self):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000625 'Return self as a plain tuple. Used by copy and pickle.'
Benjamin Peterson41181742008-07-02 20:22:54 +0000626 return tuple(self)
Christian Heimesfe337bf2008-03-23 21:54:12 +0000627 <BLANKLINE>
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000628 x = _property(_itemgetter(0), doc='Alias for field number 0')
629 y = _property(_itemgetter(1), doc='Alias for field number 1')
Georg Brandl116aa622007-08-15 14:28:22 +0000630
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000631 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Christian Heimes99170a52007-12-19 02:07:34 +0000632 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000633 33
634 >>> x, y = p # unpack like a regular tuple
635 >>> x, y
636 (11, 22)
Christian Heimesc3f30c42008-02-22 16:37:40 +0000637 >>> p.x + p.y # fields also accessible by name
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000638 33
639 >>> p # readable __repr__ with a name=value style
640 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000641
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000642Named tuples are especially useful for assigning field names to result tuples returned
643by the :mod:`csv` or :mod:`sqlite3` modules::
644
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000645 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000646
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000647 import csv
Christian Heimesfaf2f632008-01-06 16:59:19 +0000648 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000649 print(emp.name, emp.title)
650
Georg Brandl9afde1c2007-11-01 20:32:30 +0000651 import sqlite3
652 conn = sqlite3.connect('/companydata')
653 cursor = conn.cursor()
654 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Christian Heimesfaf2f632008-01-06 16:59:19 +0000655 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Christian Heimes00412232008-01-10 16:02:19 +0000656 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000657
Christian Heimes99170a52007-12-19 02:07:34 +0000658In addition to the methods inherited from tuples, named tuples support
Christian Heimes2380ac72008-01-09 00:17:24 +0000659three additional methods and one attribute. To prevent conflicts with
660field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000661
Benjamin Peterson0b9fb802010-07-18 14:23:36 +0000662.. classmethod:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000663
Christian Heimesfaf2f632008-01-06 16:59:19 +0000664 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000665
Christian Heimesfe337bf2008-03-23 21:54:12 +0000666.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000667
Christian Heimesfaf2f632008-01-06 16:59:19 +0000668 >>> t = [11, 22]
669 >>> Point._make(t)
670 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000671
Christian Heimes790c8232008-01-07 21:14:23 +0000672.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000673
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000674 Return a new :class:`OrderedDict` which maps field names to their corresponding
675 values::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000676
Christian Heimes0449f632007-12-15 01:27:15 +0000677 >>> p._asdict()
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000678 OrderedDict([('x', 11), ('y', 22)])
679
Raymond Hettingera88e4da2009-03-03 05:12:27 +0000680 .. versionchanged:: 3.1
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000681 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000682
Christian Heimes790c8232008-01-07 21:14:23 +0000683.. method:: somenamedtuple._replace(kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000684
Christian Heimesfe337bf2008-03-23 21:54:12 +0000685 Return a new instance of the named tuple replacing specified fields with new
686 values:
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000687
688::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000689
690 >>> p = Point(x=11, y=22)
Christian Heimes0449f632007-12-15 01:27:15 +0000691 >>> p._replace(x=33)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000692 Point(x=33, y=22)
693
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000694 >>> for partnum, record in inventory.items():
Christian Heimes454f37b2008-01-10 00:10:02 +0000695 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000696
Christian Heimes790c8232008-01-07 21:14:23 +0000697.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000698
Christian Heimes2380ac72008-01-09 00:17:24 +0000699 Tuple of strings listing the field names. Useful for introspection
Georg Brandl9afde1c2007-11-01 20:32:30 +0000700 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000701
Christian Heimesfe337bf2008-03-23 21:54:12 +0000702.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000703
Christian Heimes0449f632007-12-15 01:27:15 +0000704 >>> p._fields # view the field names
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000705 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000706
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000707 >>> Color = namedtuple('Color', 'red green blue')
Christian Heimes0449f632007-12-15 01:27:15 +0000708 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000709 >>> Pixel(11, 22, 128, 255, 0)
Christian Heimes454f37b2008-01-10 00:10:02 +0000710 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000711
Christian Heimes0449f632007-12-15 01:27:15 +0000712To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000713function:
Christian Heimes0449f632007-12-15 01:27:15 +0000714
715 >>> getattr(p, 'x')
716 11
717
Raymond Hettinger651453a2009-02-11 00:20:02 +0000718To convert a dictionary to a named tuple, use the double-star-operator
719(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000720
721 >>> d = {'x': 11, 'y': 22}
722 >>> Point(**d)
723 Point(x=11, y=22)
724
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000725Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000726functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000727a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000728
Christian Heimes043d6f62008-01-07 17:19:16 +0000729 >>> class Point(namedtuple('Point', 'x y')):
Christian Heimes25bb7832008-01-11 16:17:00 +0000730 ... __slots__ = ()
Christian Heimes454f37b2008-01-10 00:10:02 +0000731 ... @property
732 ... def hypot(self):
733 ... return (self.x ** 2 + self.y ** 2) ** 0.5
734 ... def __str__(self):
Christian Heimes25bb7832008-01-11 16:17:00 +0000735 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000736
Georg Brandl0df79792008-10-04 18:33:26 +0000737 >>> for p in Point(3, 4), Point(14, 5/7):
Christian Heimes00412232008-01-10 16:02:19 +0000738 ... print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000739 Point: x= 3.000 y= 4.000 hypot= 5.000
740 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000741
Georg Brandlaf5c2382009-12-28 08:02:38 +0000742The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +0000743keep memory requirements low by preventing the creation of instance dictionaries.
744
Christian Heimes2380ac72008-01-09 00:17:24 +0000745
746Subclassing is not useful for adding new, stored fields. Instead, simply
Christian Heimesfe337bf2008-03-23 21:54:12 +0000747create a new named tuple type from the :attr:`_fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +0000748
Christian Heimes25bb7832008-01-11 16:17:00 +0000749 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +0000750
751Default values can be implemented by using :meth:`_replace` to
Christian Heimesfe337bf2008-03-23 21:54:12 +0000752customize a prototype instance:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000753
754 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Christian Heimes587c2bf2008-01-19 16:21:02 +0000755 >>> default_account = Account('<owner name>', 0.0, 0)
756 >>> johns_account = default_account._replace(owner='John')
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000757
Christian Heimese4ca8152008-05-08 17:18:53 +0000758Enumerated constants can be implemented with named tuples, but it is simpler
759and more efficient to use a simple class declaration:
760
761 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
762 >>> Status.open, Status.pending, Status.closed
763 (0, 1, 2)
764 >>> class Status:
765 ... open, pending, closed = range(3)
766
Raymond Hettinger651453a2009-02-11 00:20:02 +0000767.. seealso::
Thomas Wouters47b49bf2007-08-30 22:15:33 +0000768
Raymond Hettinger651453a2009-02-11 00:20:02 +0000769 `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
770 adapted for Python 2.4.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000771
772
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000773:class:`OrderedDict` objects
774----------------------------
775
776Ordered dictionaries are just like regular dictionaries but they remember the
777order that items were inserted. When iterating over an ordered dictionary,
778the items are returned in the order their keys were first added.
779
780.. class:: OrderedDict([items])
781
782 Return an instance of a dict subclass, supporting the usual :class:`dict`
783 methods. An *OrderedDict* is a dict that remembers the order that keys
784 were first inserted. If a new entry overwrites an existing entry, the
785 original insertion position is left unchanged. Deleting an entry and
786 reinserting it will move it to the end.
787
Benjamin Petersond45bf582009-03-02 21:44:54 +0000788 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000789
Benjamin Petersond319ad52010-07-18 14:27:02 +0000790 .. method:: popitem(last=True)
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000791
Benjamin Petersond319ad52010-07-18 14:27:02 +0000792 The :meth:`popitem` method for ordered dictionaries returns and removes a
793 (key, value) pair. The pairs are returned in LIFO order if *last* is true
794 or FIFO order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000795
Raymond Hettingere9091502009-05-19 17:40:07 +0000796In addition to the usual mapping methods, ordered dictionaries also support
797reverse iteration using :func:`reversed`.
798
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000799Equality tests between :class:`OrderedDict` objects are order-sensitive
800and are implemented as ``list(od1.items())==list(od2.items())``.
801Equality tests between :class:`OrderedDict` objects and other
802:class:`Mapping` objects are order-insensitive like regular dictionaries.
803This allows :class:`OrderedDict` objects to be substituted anywhere a
804regular dictionary is used.
805
Raymond Hettinger36180782009-04-09 22:34:23 +0000806The :class:`OrderedDict` constructor and :meth:`update` method both accept
807keyword arguments, but their order is lost because Python's function call
808semantics pass-in keyword arguments using a regular unordered dictionary.
809
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000810.. seealso::
811
812 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
813 that runs on Python 2.4 or later.
814
Raymond Hettinger0e312012009-11-10 18:35:46 +0000815Since an ordered dictionary remembers its insertion order, it can be used
816in conjuction with sorting to make a sorted dictionary::
817
818 >>> # regular unsorted dictionary
819 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
820
821 >>> # dictionary sorted by key
822 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
823 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
824
825 >>> # dictionary sorted by value
826 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
827 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
828
829 >>> # dictionary sorted by length of the key string
830 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
831 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
832
833The new sorted dictionaries maintain their sort order when entries
834are deleted. But when new keys are added, the keys are appended
835to the end and the sort is not maintained.
836
Raymond Hettinger4821ef82010-07-31 10:14:41 +0000837It is also straight-forward to create an ordered dictionary variant
838that the remembers the order the keys were *last* inserted.
839If a new entry overwrites an existing entry, the
840original insertion position is changed and moved to the end::
841
842 class LastUpdatedOrderedDict(OrderedDict):
843 'Store items is the order the keys were last added'
844 def __setitem__(self, key, value):
845 if key in self:
846 del self[key]
847 OrderedDict.__setitem__(self, key, value)
848
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000849
850:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +0000851-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000852
Georg Brandl48310cd2009-01-03 21:18:54 +0000853The class, :class:`UserDict` acts as a wrapper around dictionary objects.
854The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000855subclass directly from :class:`dict`; however, this class can be easier
856to work with because the underlying dictionary is accessible as an
857attribute.
858
859.. class:: UserDict([initialdata])
860
861 Class that simulates a dictionary. The instance's contents are kept in a
862 regular dictionary, which is accessible via the :attr:`data` attribute of
863 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
864 initialized with its contents; note that a reference to *initialdata* will not
865 be kept, allowing it be used for other purposes.
866
Benjamin Petersond319ad52010-07-18 14:27:02 +0000867 In addition to supporting the methods and operations of mappings,
868 :class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000869
Benjamin Petersond319ad52010-07-18 14:27:02 +0000870 .. attribute:: data
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000871
Benjamin Petersond319ad52010-07-18 14:27:02 +0000872 A real dictionary used to store the contents of the :class:`UserDict`
873 class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000874
875
876
877:class:`UserList` objects
878-------------------------
879
880This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +0000881for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000882existing methods or add new ones. In this way, one can add new behaviors to
883lists.
884
Georg Brandl48310cd2009-01-03 21:18:54 +0000885The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000886subclass directly from :class:`list`; however, this class can be easier
887to work with because the underlying list is accessible as an attribute.
888
889.. class:: UserList([list])
890
891 Class that simulates a list. The instance's contents are kept in a regular
892 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
893 instances. The instance's contents are initially set to a copy of *list*,
894 defaulting to the empty list ``[]``. *list* can be any iterable, for
895 example a real Python list or a :class:`UserList` object.
896
Benjamin Petersond319ad52010-07-18 14:27:02 +0000897 In addition to supporting the methods and operations of mutable sequences,
898 :class:`UserList` instances provide the following attribute:
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000899
Benjamin Petersond319ad52010-07-18 14:27:02 +0000900 .. attribute:: data
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000901
Benjamin Petersond319ad52010-07-18 14:27:02 +0000902 A real :class:`list` object used to store the contents of the
903 :class:`UserList` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000904
905**Subclassing requirements:** Subclasses of :class:`UserList` are expect to
906offer a constructor which can be called with either no arguments or one
907argument. List operations which return a new sequence attempt to create an
908instance of the actual implementation class. To do so, it assumes that the
909constructor can be called with a single parameter, which is a sequence object
910used as a data source.
911
912If a derived class does not wish to comply with this requirement, all of the
913special methods supported by this class will need to be overridden; please
914consult the sources for information about the methods which need to be provided
915in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000916
917:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +0000918---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000919
Georg Brandl48310cd2009-01-03 21:18:54 +0000920The class, :class:`UserString` acts as a wrapper around string objects.
921The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000922subclass directly from :class:`str`; however, this class can be easier
923to work with because the underlying string is accessible as an
924attribute.
925
926.. class:: UserString([sequence])
927
928 Class that simulates a string or a Unicode string object. The instance's
Georg Brandl48310cd2009-01-03 21:18:54 +0000929 content is kept in a regular string object, which is accessible via the
930 :attr:`data` attribute of :class:`UserString` instances. The instance's
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000931 contents are initially set to a copy of *sequence*. The *sequence* can
932 be an instance of :class:`bytes`, :class:`str`, :class:`UserString` (or a
933 subclass) or an arbitrary sequence which can be converted into a string using
934 the built-in :func:`str` function.
Raymond Hettingera6b76ba2010-08-08 00:29:08 +0000935
936
937ABCs - abstract base classes
938----------------------------
939
940The collections module offers the following ABCs:
941
942========================= ===================== ====================== ====================================================
943ABC Inherits Abstract Methods Mixin Methods
944========================= ===================== ====================== ====================================================
945:class:`Container` ``__contains__``
946:class:`Hashable` ``__hash__``
947:class:`Iterable` ``__iter__``
948:class:`Iterator` :class:`Iterable` ``__next__`` ``__iter__``
949:class:`Sized` ``__len__``
950:class:`Callable` ``__call__``
951
952:class:`Sequence` :class:`Sized`, ``__getitem__`` ``__contains__``. ``__iter__``, ``__reversed__``.
953 :class:`Iterable`, ``index``, and ``count``
954 :class:`Container`
955
956:class:`MutableSequence` :class:`Sequence` ``__setitem__`` Inherited Sequence methods and
957 ``__delitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
958 and ``insert`` ``remove``, and ``__iadd__``
959
960:class:`Set` :class:`Sized`, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
961 :class:`Iterable`, ``__gt__``, ``__ge__``, ``__and__``, ``__or__``
962 :class:`Container` ``__sub__``, ``__xor__``, and ``isdisjoint``
963
964:class:`MutableSet` :class:`Set` ``add`` and Inherited Set methods and
965 ``discard`` ``clear``, ``pop``, ``remove``, ``__ior__``,
966 ``__iand__``, ``__ixor__``, and ``__isub__``
967
968:class:`Mapping` :class:`Sized`, ``__getitem__`` ``__contains__``, ``keys``, ``items``, ``values``,
969 :class:`Iterable`, ``get``, ``__eq__``, and ``__ne__``
970 :class:`Container`
971
972:class:`MutableMapping` :class:`Mapping` ``__setitem__`` and Inherited Mapping methods and
973 ``__delitem__`` ``pop``, ``popitem``, ``clear``, ``update``,
974 and ``setdefault``
975
976
977:class:`MappingView` :class:`Sized` ``__len__``
978:class:`KeysView` :class:`MappingView`, ``__contains__``,
979 :class:`Set` ``__iter__``
980:class:`ItemsView` :class:`MappingView`, ``__contains__``,
981 :class:`Set` ``__iter__``
982:class:`ValuesView` :class:`MappingView` ``__contains__``, ``__iter__``
983========================= ===================== ====================== ====================================================
984
985These ABCs allow us to ask classes or instances if they provide
986particular functionality, for example::
987
988 size = None
989 if isinstance(myvar, collections.Sized):
990 size = len(myvar)
991
992Several of the ABCs are also useful as mixins that make it easier to develop
993classes supporting container APIs. For example, to write a class supporting
994the full :class:`Set` API, it only necessary to supply the three underlying
995abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
996The ABC supplies the remaining methods such as :meth:`__and__` and
997:meth:`isdisjoint` ::
998
999 class ListBasedSet(collections.Set):
1000 ''' Alternate set implementation favoring space over speed
1001 and not requiring the set elements to be hashable. '''
1002 def __init__(self, iterable):
1003 self.elements = lst = []
1004 for value in iterable:
1005 if value not in lst:
1006 lst.append(value)
1007 def __iter__(self):
1008 return iter(self.elements)
1009 def __contains__(self, value):
1010 return value in self.elements
1011 def __len__(self):
1012 return len(self.elements)
1013
1014 s1 = ListBasedSet('abcdef')
1015 s2 = ListBasedSet('defghi')
1016 overlap = s1 & s2 # The __and__() method is supported automatically
1017
1018Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
1019
1020(1)
1021 Since some set operations create new sets, the default mixin methods need
1022 a way to create new instances from an iterable. The class constructor is
1023 assumed to have a signature in the form ``ClassName(iterable)``.
1024 That assumption is factored-out to an internal classmethod called
1025 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
1026 If the :class:`Set` mixin is being used in a class with a different
1027 constructor signature, you will need to override :meth:`from_iterable`
1028 with a classmethod that can construct new instances from
1029 an iterable argument.
1030
1031(2)
1032 To override the comparisons (presumably for speed, as the
1033 semantics are fixed), redefine :meth:`__le__` and
1034 then the other operations will automatically follow suit.
1035
1036(3)
1037 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
1038 for the set; however, :meth:`__hash__` is not defined because not all sets
1039 are hashable or immutable. To add set hashabilty using mixins,
1040 inherit from both :meth:`Set` and :meth:`Hashable`, then define
1041 ``__hash__ = Set._hash``.
1042
1043.. seealso::
1044
1045 * `OrderedSet recipe <http://code.activestate.com/recipes/576694/>`_ for an
1046 example built on :class:`MutableSet`.
1047
1048 * For more about ABCs, see the :mod:`abc` module and :pep:`3119`.