blob: d5ade412e42ced6740f3dab5d05558c4f0d70b7d [file] [log] [blame]
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001:mod:`collections` --- Container datatypes
2==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00003
4.. module:: collections
Raymond Hettinger53dbe392008-02-12 20:03:09 +00005 :synopsis: Container datatypes
Georg Brandl116aa622007-08-15 14:28:22 +00006.. moduleauthor:: Raymond Hettinger <python@rcn.com>
7.. sectionauthor:: Raymond Hettinger <python@rcn.com>
8
Christian Heimesfe337bf2008-03-23 21:54:12 +00009.. testsetup:: *
10
11 from collections import *
12 import itertools
13 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000014
Raymond Hettinger10480942011-01-10 03:26:08 +000015**Source code:** :source:`Lib/collections.py`
16
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000017This module implements specialized container datatypes providing alternatives to
18Python's general purpose built-in containers, :class:`dict`, :class:`list`,
19:class:`set`, and :class:`tuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000020
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000021===================== ====================================================================
22:func:`namedtuple` factory function for creating tuple subclasses with named fields
23:class:`deque` list-like container with fast appends and pops on either end
24:class:`Counter` dict subclass for counting hashable objects
25:class:`OrderedDict` dict subclass that remembers the order entries were added
26:class:`defaultdict` dict subclass that calls a factory function to supply missing values
27:class:`UserDict` wrapper around dictionary objects for easier dict subclassing
28:class:`UserList` wrapper around list objects for easier list subclassing
29:class:`UserString` wrapper around string objects for easier string subclassing
30===================== ====================================================================
Georg Brandl116aa622007-08-15 14:28:22 +000031
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000032In addition to the concrete container classes, the collections module provides
Raymond Hettingerdcb44912010-11-30 17:45:41 +000033:ref:`abstract-base-classes` that can be used to test whether a class provides a
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000034particular interface, for example, whether it is hashable or a mapping.
Mark Summerfield08898b42007-09-05 08:43:04 +000035
36
Raymond Hettingerb8baf632009-01-14 02:20:07 +000037:class:`Counter` objects
38------------------------
39
40A counter tool is provided to support convenient and rapid tallies.
41For example::
42
Raymond Hettinger1c62dc92009-02-04 11:41:45 +000043 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +000044 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +000045 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000046 ... cnt[word] += 1
47 >>> cnt
48 Counter({'blue': 3, 'red': 2, 'green': 1})
49
Raymond Hettinger1c62dc92009-02-04 11:41:45 +000050 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +000051 >>> import re
52 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +000053 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +000054 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
55 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
56
57.. class:: Counter([iterable-or-mapping])
58
Raymond Hettinger670eaec2009-01-21 23:14:07 +000059 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +000060 It is an unordered collection where elements are stored as dictionary keys
61 and their counts are stored as dictionary values. Counts are allowed to be
62 any integer value including zero or negative counts. The :class:`Counter`
63 class is similar to bags or multisets in other languages.
64
65 Elements are counted from an *iterable* or initialized from another
Benjamin Peterson25c95f12009-05-08 20:42:26 +000066 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +000067
Raymond Hettinger73662a52009-01-27 02:38:22 +000068 >>> c = Counter() # a new, empty counter
69 >>> c = Counter('gallahad') # a new counter from an iterable
70 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
71 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +000072
Raymond Hettinger670eaec2009-01-21 23:14:07 +000073 Counter objects have a dictionary interface except that they return a zero
Benjamin Peterson25c95f12009-05-08 20:42:26 +000074 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000075
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000076 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +000077 >>> c['bacon'] # count of a missing element is zero
78 0
79
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000080 Setting a count to zero does not remove an element from a counter.
81 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000082
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000083 >>> c['sausage'] = 0 # counter entry with a zero count
84 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +000085
Benjamin Petersond45bf582009-03-02 21:44:54 +000086 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +000087
88
Ezio Melotti0be8b1c2010-04-04 06:53:44 +000089 Counter objects support three methods beyond those available for all
Raymond Hettingerb8baf632009-01-14 02:20:07 +000090 dictionaries:
91
92 .. method:: elements()
93
Raymond Hettinger670eaec2009-01-21 23:14:07 +000094 Return an iterator over elements repeating each as many times as its
95 count. Elements are returned in arbitrary order. If an element's count
96 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +000097
Raymond Hettinger0bae6622009-01-20 13:00:59 +000098 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerb8baf632009-01-14 02:20:07 +000099 >>> list(c.elements())
100 ['a', 'a', 'a', 'a', 'b', 'b']
101
102 .. method:: most_common([n])
103
Raymond Hettinger73662a52009-01-27 02:38:22 +0000104 Return a list of the *n* most common elements and their counts from the
Raymond Hettingerd04fa312009-02-04 19:45:13 +0000105 most common to the least. If *n* is not specified, :func:`most_common`
Raymond Hettinger73662a52009-01-27 02:38:22 +0000106 returns *all* elements in the counter. Elements with equal counts are
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000107 ordered arbitrarily:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000108
109 >>> Counter('abracadabra').most_common(3)
110 [('a', 5), ('r', 2), ('b', 2)]
111
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000112 .. method:: subtract([iterable-or-mapping])
113
114 Elements are subtracted from an *iterable* or from another *mapping*
115 (or counter). Like :meth:`dict.update` but subtracts counts instead
116 of replacing them. Both inputs and outputs may be zero or negative.
117
118 >>> c = Counter(a=4, b=2, c=0, d=-2)
119 >>> d = Counter(a=1, b=2, c=3, d=4)
120 >>> c.subtract(d)
121 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
122
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000123 .. versionadded:: 3.2
124
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000125 The usual dictionary methods are available for :class:`Counter` objects
126 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000127
128 .. method:: fromkeys(iterable)
129
Raymond Hettinger73662a52009-01-27 02:38:22 +0000130 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000131
132 .. method:: update([iterable-or-mapping])
133
Raymond Hettinger73662a52009-01-27 02:38:22 +0000134 Elements are counted from an *iterable* or added-in from another
135 *mapping* (or counter). Like :meth:`dict.update` but adds counts
136 instead of replacing them. Also, the *iterable* is expected to be a
137 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000138
139Common patterns for working with :class:`Counter` objects::
140
Raymond Hettinger73662a52009-01-27 02:38:22 +0000141 sum(c.values()) # total of all counts
142 c.clear() # reset all counts
143 list(c) # list unique elements
144 set(c) # convert to a set
145 dict(c) # convert to a regular dictionary
146 c.items() # convert to a list of (elem, cnt) pairs
147 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
148 c.most_common()[:-n:-1] # n least common elements
149 c += Counter() # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000150
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000151Several mathematical operations are provided for combining :class:`Counter`
152objects to produce multisets (counters that have counts greater than zero).
153Addition and subtraction combine counters by adding or subtracting the counts
154of corresponding elements. Intersection and union return the minimum and
155maximum of corresponding counts. Each operation can accept inputs with signed
156counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000157
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000158 >>> c = Counter(a=3, b=1)
159 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000160 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000161 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000162 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000163 Counter({'a': 2})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000164 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000165 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000166 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000167 Counter({'a': 3, 'b': 2})
168
Raymond Hettinger22f18852010-04-12 21:45:14 +0000169.. note::
170
171 Counters were primarily designed to work with positive integers to represent
172 running counts; however, care was taken to not unnecessarily preclude use
173 cases needing other types or negative values. To help with those use cases,
174 this section documents the minimum range and type restrictions.
175
176 * The :class:`Counter` class itself is a dictionary subclass with no
177 restrictions on its keys and values. The values are intended to be numbers
178 representing counts, but you *could* store anything in the value field.
179
180 * The :meth:`most_common` method requires only that the values be orderable.
181
182 * For in-place operations such as ``c[key] += 1``, the value type need only
183 support addition and subtraction. So fractions, floats, and decimals would
184 work and negative values are supported. The same is also true for
185 :meth:`update` and :meth:`subtract` which allow negative and zero values
186 for both inputs and outputs.
187
188 * The multiset methods are designed only for use cases with positive values.
189 The inputs may be negative or zero, but only outputs with positive values
190 are created. There are no type restrictions, but the value type needs to
191 support support addition, subtraction, and comparison.
192
193 * The :meth:`elements` method requires integer counts. It ignores zero and
194 negative counts.
195
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000196.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000197
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000198 * `Counter class <http://code.activestate.com/recipes/576611/>`_
199 adapted for Python 2.5 and an early `Bag recipe
200 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
201
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000202 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
203 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000204
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000205 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_\.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000206
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000207 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000208 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000209
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000210 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000211 *Knuth, Donald. The Art of Computer Programming Volume II,
212 Section 4.6.3, Exercise 19*\.
213
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000214 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd07d9392009-01-27 04:20:44 +0000215 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000216
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000217 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000218
219
Georg Brandl116aa622007-08-15 14:28:22 +0000220:class:`deque` objects
221----------------------
222
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000223.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000224
225 Returns a new deque object initialized left-to-right (using :meth:`append`) with
226 data from *iterable*. If *iterable* is not specified, the new deque is empty.
227
228 Deques are a generalization of stacks and queues (the name is pronounced "deck"
229 and is short for "double-ended queue"). Deques support thread-safe, memory
230 efficient appends and pops from either side of the deque with approximately the
231 same O(1) performance in either direction.
232
233 Though :class:`list` objects support similar operations, they are optimized for
234 fast fixed-length operations and incur O(n) memory movement costs for
235 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
236 position of the underlying data representation.
237
Georg Brandl116aa622007-08-15 14:28:22 +0000238
Georg Brandl9afde1c2007-11-01 20:32:30 +0000239 If *maxlen* is not specified or is *None*, deques may grow to an
240 arbitrary length. Otherwise, the deque is bounded to the specified maximum
241 length. Once a bounded length deque is full, when new items are added, a
242 corresponding number of items are discarded from the opposite end. Bounded
243 length deques provide functionality similar to the ``tail`` filter in
244 Unix. They are also useful for tracking transactions and other pools of data
245 where only the most recent activity is of interest.
246
Georg Brandl9afde1c2007-11-01 20:32:30 +0000247
Benjamin Petersone41251e2008-04-25 01:59:09 +0000248 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000249
Benjamin Petersone41251e2008-04-25 01:59:09 +0000250 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000251
Benjamin Petersone41251e2008-04-25 01:59:09 +0000252 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000253
254
Benjamin Petersone41251e2008-04-25 01:59:09 +0000255 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000256
Benjamin Petersone41251e2008-04-25 01:59:09 +0000257 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000258
259
Benjamin Petersone41251e2008-04-25 01:59:09 +0000260 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000261
Benjamin Petersone41251e2008-04-25 01:59:09 +0000262 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000263
264
Raymond Hettinger44459de2010-04-03 23:20:46 +0000265 .. method:: count(x)
266
267 Count the number of deque elements equal to *x*.
268
269 .. versionadded:: 3.2
270
Georg Brandl67b21b72010-08-17 15:07:14 +0000271
Benjamin Petersone41251e2008-04-25 01:59:09 +0000272 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000273
Benjamin Petersone41251e2008-04-25 01:59:09 +0000274 Extend the right side of the deque by appending elements from the iterable
275 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000276
277
Benjamin Petersone41251e2008-04-25 01:59:09 +0000278 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000279
Benjamin Petersone41251e2008-04-25 01:59:09 +0000280 Extend the left side of the deque by appending elements from *iterable*.
281 Note, the series of left appends results in reversing the order of
282 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000283
284
Benjamin Petersone41251e2008-04-25 01:59:09 +0000285 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000286
Benjamin Petersone41251e2008-04-25 01:59:09 +0000287 Remove and return an element from the right side of the deque. If no
288 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000289
290
Benjamin Petersone41251e2008-04-25 01:59:09 +0000291 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000292
Benjamin Petersone41251e2008-04-25 01:59:09 +0000293 Remove and return an element from the left side of the deque. If no
294 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000295
296
Benjamin Petersone41251e2008-04-25 01:59:09 +0000297 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000298
Benjamin Petersone41251e2008-04-25 01:59:09 +0000299 Removed the first occurrence of *value*. If not found, raises a
300 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000301
Georg Brandl67b21b72010-08-17 15:07:14 +0000302
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000303 .. method:: reverse()
304
305 Reverse the elements of the deque in-place and then return ``None``.
306
307 .. versionadded:: 3.2
Georg Brandl116aa622007-08-15 14:28:22 +0000308
Georg Brandl67b21b72010-08-17 15:07:14 +0000309
Benjamin Petersone41251e2008-04-25 01:59:09 +0000310 .. method:: rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000311
Benjamin Petersone41251e2008-04-25 01:59:09 +0000312 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
313 the left. Rotating one step to the right is equivalent to:
314 ``d.appendleft(d.pop())``.
315
Georg Brandl116aa622007-08-15 14:28:22 +0000316
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000317 Deque objects also provide one read-only attribute:
318
319 .. attribute:: maxlen
320
321 Maximum size of a deque or *None* if unbounded.
322
Raymond Hettinger150fb9c2009-03-10 22:48:06 +0000323 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000324
325
Georg Brandl116aa622007-08-15 14:28:22 +0000326In addition to the above, deques support iteration, pickling, ``len(d)``,
327``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson206e3072008-10-19 14:07:49 +0000328the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
329access is O(1) at both ends but slows to O(n) in the middle. For fast random
330access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000331
Christian Heimesfe337bf2008-03-23 21:54:12 +0000332Example:
333
334.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000335
336 >>> from collections import deque
337 >>> d = deque('ghi') # make a new deque with three items
338 >>> for elem in d: # iterate over the deque's elements
Neal Norwitz752abd02008-05-13 04:55:24 +0000339 ... print(elem.upper())
Georg Brandl116aa622007-08-15 14:28:22 +0000340 G
341 H
342 I
343
344 >>> d.append('j') # add a new entry to the right side
345 >>> d.appendleft('f') # add a new entry to the left side
346 >>> d # show the representation of the deque
347 deque(['f', 'g', 'h', 'i', 'j'])
348
349 >>> d.pop() # return and remove the rightmost item
350 'j'
351 >>> d.popleft() # return and remove the leftmost item
352 'f'
353 >>> list(d) # list the contents of the deque
354 ['g', 'h', 'i']
355 >>> d[0] # peek at leftmost item
356 'g'
357 >>> d[-1] # peek at rightmost item
358 'i'
359
360 >>> list(reversed(d)) # list the contents of a deque in reverse
361 ['i', 'h', 'g']
362 >>> 'h' in d # search the deque
363 True
364 >>> d.extend('jkl') # add multiple elements at once
365 >>> d
366 deque(['g', 'h', 'i', 'j', 'k', 'l'])
367 >>> d.rotate(1) # right rotation
368 >>> d
369 deque(['l', 'g', 'h', 'i', 'j', 'k'])
370 >>> d.rotate(-1) # left rotation
371 >>> d
372 deque(['g', 'h', 'i', 'j', 'k', 'l'])
373
374 >>> deque(reversed(d)) # make a new deque in reverse order
375 deque(['l', 'k', 'j', 'i', 'h', 'g'])
376 >>> d.clear() # empty the deque
377 >>> d.pop() # cannot pop from an empty deque
378 Traceback (most recent call last):
379 File "<pyshell#6>", line 1, in -toplevel-
380 d.pop()
381 IndexError: pop from an empty deque
382
383 >>> d.extendleft('abc') # extendleft() reverses the input order
384 >>> d
385 deque(['c', 'b', 'a'])
386
387
Georg Brandl9afde1c2007-11-01 20:32:30 +0000388:class:`deque` Recipes
389^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000390
391This section shows various approaches to working with deques.
392
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000393Bounded length deques provide functionality similar to the ``tail`` filter
394in Unix::
395
396 def tail(filename, n=10):
397 'Return the last n lines of a file'
398 return deque(open(filename), n)
399
400Another approach to using deques is to maintain a sequence of recently
401added elements by appending to the right and popping to the left::
402
403 def moving_average(iterable, n=3):
404 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
405 # http://en.wikipedia.org/wiki/Moving_average
406 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000407 d = deque(itertools.islice(it, n-1))
408 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000409 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000410 for elem in it:
411 s += elem - d.popleft()
412 d.append(elem)
413 yield s / n
414
Georg Brandl116aa622007-08-15 14:28:22 +0000415The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti0639d5a2009-12-19 23:26:38 +0000416deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl116aa622007-08-15 14:28:22 +0000417the :meth:`rotate` method to position elements to be popped::
418
419 def delete_nth(d, n):
420 d.rotate(-n)
421 d.popleft()
422 d.rotate(n)
423
424To implement :class:`deque` slicing, use a similar approach applying
425:meth:`rotate` to bring a target element to the left side of the deque. Remove
426old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
427reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000428With minor variations on that approach, it is easy to implement Forth style
429stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
430``rot``, and ``roll``.
431
Georg Brandl116aa622007-08-15 14:28:22 +0000432
433:class:`defaultdict` objects
434----------------------------
435
Georg Brandl116aa622007-08-15 14:28:22 +0000436.. class:: defaultdict([default_factory[, ...]])
437
438 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandl22b34312009-07-26 14:54:51 +0000439 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl116aa622007-08-15 14:28:22 +0000440 instance variable. The remaining functionality is the same as for the
441 :class:`dict` class and is not documented here.
442
443 The first argument provides the initial value for the :attr:`default_factory`
444 attribute; it defaults to ``None``. All remaining arguments are treated the same
445 as if they were passed to the :class:`dict` constructor, including keyword
446 arguments.
447
Georg Brandl116aa622007-08-15 14:28:22 +0000448
Benjamin Petersone41251e2008-04-25 01:59:09 +0000449 :class:`defaultdict` objects support the following method in addition to the
450 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000451
Benjamin Petersond319ad52010-07-18 14:27:02 +0000452 .. method:: __missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000453
Benjamin Peterson5478b472008-09-17 22:25:09 +0000454 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersone41251e2008-04-25 01:59:09 +0000455 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000456
Benjamin Petersone41251e2008-04-25 01:59:09 +0000457 If :attr:`default_factory` is not ``None``, it is called without arguments
458 to provide a default value for the given *key*, this value is inserted in
459 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000460
Benjamin Petersone41251e2008-04-25 01:59:09 +0000461 If calling :attr:`default_factory` raises an exception this exception is
462 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000463
Benjamin Petersone41251e2008-04-25 01:59:09 +0000464 This method is called by the :meth:`__getitem__` method of the
465 :class:`dict` class when the requested key is not found; whatever it
466 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000467
468
Benjamin Petersone41251e2008-04-25 01:59:09 +0000469 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000470
Benjamin Petersone41251e2008-04-25 01:59:09 +0000471
Benjamin Petersond319ad52010-07-18 14:27:02 +0000472 .. attribute:: default_factory
Benjamin Petersone41251e2008-04-25 01:59:09 +0000473
474 This attribute is used by the :meth:`__missing__` method; it is
475 initialized from the first argument to the constructor, if present, or to
476 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000477
478
Georg Brandl116aa622007-08-15 14:28:22 +0000479:class:`defaultdict` Examples
480^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
481
482Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000483sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000484
485 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
486 >>> d = defaultdict(list)
487 >>> for k, v in s:
488 ... d[k].append(v)
489 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000490 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000491 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
492
493When each key is encountered for the first time, it is not already in the
494mapping; so an entry is automatically created using the :attr:`default_factory`
495function which returns an empty :class:`list`. The :meth:`list.append`
496operation then attaches the value to the new list. When keys are encountered
497again, the look-up proceeds normally (returning the list for that key) and the
498:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000499simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000500
501 >>> d = {}
502 >>> for k, v in s:
503 ... d.setdefault(k, []).append(v)
504 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000505 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000506 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
507
508Setting the :attr:`default_factory` to :class:`int` makes the
509:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000510languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000511
512 >>> s = 'mississippi'
513 >>> d = defaultdict(int)
514 >>> for k in s:
515 ... d[k] += 1
516 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000517 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000518 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
519
520When a letter is first encountered, it is missing from the mapping, so the
521:attr:`default_factory` function calls :func:`int` to supply a default count of
522zero. The increment operation then builds up the count for each letter.
523
524The function :func:`int` which always returns zero is just a special case of
525constant functions. A faster and more flexible way to create constant functions
526is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000527zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000528
529 >>> def constant_factory(value):
530 ... return lambda: value
531 >>> d = defaultdict(constant_factory('<missing>'))
532 >>> d.update(name='John', action='ran')
533 >>> '%(name)s %(action)s to %(object)s' % d
534 'John ran to <missing>'
535
536Setting the :attr:`default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000537:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000538
539 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
540 >>> d = defaultdict(set)
541 >>> for k, v in s:
542 ... d[k].add(v)
543 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000544 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000545 [('blue', set([2, 4])), ('red', set([1, 3]))]
546
547
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000548:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000549----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000550
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000551Named tuples assign meaning to each position in a tuple and allow for more readable,
552self-documenting code. They can be used wherever regular tuples are used, and
553they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000554
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000555.. function:: namedtuple(typename, field_names, verbose=False, rename=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000556
557 Returns a new tuple subclass named *typename*. The new subclass is used to
Christian Heimesc3f30c42008-02-22 16:37:40 +0000558 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl116aa622007-08-15 14:28:22 +0000559 well as being indexable and iterable. Instances of the subclass also have a
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000560 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl116aa622007-08-15 14:28:22 +0000561 method which lists the tuple contents in a ``name=value`` format.
562
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000563 The *field_names* are a single string with each fieldname separated by whitespace
564 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
Christian Heimes25bb7832008-01-11 16:17:00 +0000565 can be a sequence of strings such as ``['x', 'y']``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000566
567 Any valid Python identifier may be used for a fieldname except for names
Christian Heimes0449f632007-12-15 01:27:15 +0000568 starting with an underscore. Valid identifiers consist of letters, digits,
569 and underscores but do not start with a digit or underscore and cannot be
Georg Brandlf6945182008-02-01 11:56:49 +0000570 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
Georg Brandl9afde1c2007-11-01 20:32:30 +0000571 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000572
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000573 If *rename* is true, invalid fieldnames are automatically replaced
574 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger85737b82009-04-02 22:37:59 +0000575 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000576 ``def`` and the duplicate fieldname ``abc``.
577
Christian Heimes25bb7832008-01-11 16:17:00 +0000578 If *verbose* is true, the class definition is printed just before being built.
Georg Brandl116aa622007-08-15 14:28:22 +0000579
Georg Brandl9afde1c2007-11-01 20:32:30 +0000580 Named tuple instances do not have per-instance dictionaries, so they are
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000581 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000582
Raymond Hettingerb62ad242009-03-02 22:16:43 +0000583 .. versionchanged:: 3.1
Georg Brandl67b21b72010-08-17 15:07:14 +0000584 Added support for *rename*.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000585
Christian Heimesfe337bf2008-03-23 21:54:12 +0000586
587.. doctest::
588 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000589
Raymond Hettinger0ef956f2010-11-21 23:23:29 +0000590 >>> # Basic example
591 >>> Point = namedtuple('Point', 'x y')
592 >>> p = Point(x=10, y=11)
593
594 >>> # Example using the verbose option to print the class definition
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000595 >>> Point = namedtuple('Point', 'x y', verbose=True)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000596 class Point(tuple):
597 'Point(x, y)'
Christian Heimesfe337bf2008-03-23 21:54:12 +0000598 <BLANKLINE>
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000599 __slots__ = ()
Christian Heimesfe337bf2008-03-23 21:54:12 +0000600 <BLANKLINE>
Christian Heimesfaf2f632008-01-06 16:59:19 +0000601 _fields = ('x', 'y')
Christian Heimesfe337bf2008-03-23 21:54:12 +0000602 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000603 def __new__(_cls, x, y):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000604 'Create a new instance of Point(x, y)'
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000605 return _tuple.__new__(_cls, (x, y))
Christian Heimesfe337bf2008-03-23 21:54:12 +0000606 <BLANKLINE>
Christian Heimesfaf2f632008-01-06 16:59:19 +0000607 @classmethod
Christian Heimesfe337bf2008-03-23 21:54:12 +0000608 def _make(cls, iterable, new=tuple.__new__, len=len):
Christian Heimesfaf2f632008-01-06 16:59:19 +0000609 'Make a new Point object from a sequence or iterable'
Christian Heimesfe337bf2008-03-23 21:54:12 +0000610 result = new(cls, iterable)
Christian Heimesfaf2f632008-01-06 16:59:19 +0000611 if len(result) != 2:
612 raise TypeError('Expected 2 arguments, got %d' % len(result))
613 return result
Christian Heimesfe337bf2008-03-23 21:54:12 +0000614 <BLANKLINE>
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000615 def __repr__(self):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000616 'Return a nicely formatted representation string'
Raymond Hettingerd331ce92010-08-08 01:13:42 +0000617 return self.__class__.__name__ + '(x=%r, y=%r)' % self
Christian Heimesfe337bf2008-03-23 21:54:12 +0000618 <BLANKLINE>
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000619 def _asdict(self):
620 'Return a new OrderedDict which maps field names to their values'
621 return OrderedDict(zip(self._fields, self))
Christian Heimesfe337bf2008-03-23 21:54:12 +0000622 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000623 def _replace(_self, **kwds):
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000624 'Return a new Point object replacing specified fields with new values'
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000625 result = _self._make(map(kwds.pop, ('x', 'y'), _self))
Christian Heimesfaf2f632008-01-06 16:59:19 +0000626 if kwds:
Ezio Melotti8f7649e2009-09-13 04:48:45 +0000627 raise ValueError('Got unexpected field names: %r' % list(kwds.keys()))
Christian Heimesfaf2f632008-01-06 16:59:19 +0000628 return result
Georg Brandl48310cd2009-01-03 21:18:54 +0000629 <BLANKLINE>
630 def __getnewargs__(self):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000631 'Return self as a plain tuple. Used by copy and pickle.'
Benjamin Peterson41181742008-07-02 20:22:54 +0000632 return tuple(self)
Christian Heimesfe337bf2008-03-23 21:54:12 +0000633 <BLANKLINE>
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000634 x = _property(_itemgetter(0), doc='Alias for field number 0')
635 y = _property(_itemgetter(1), doc='Alias for field number 1')
Georg Brandl116aa622007-08-15 14:28:22 +0000636
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000637 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Christian Heimes99170a52007-12-19 02:07:34 +0000638 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000639 33
640 >>> x, y = p # unpack like a regular tuple
641 >>> x, y
642 (11, 22)
Christian Heimesc3f30c42008-02-22 16:37:40 +0000643 >>> p.x + p.y # fields also accessible by name
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000644 33
645 >>> p # readable __repr__ with a name=value style
646 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000647
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000648Named tuples are especially useful for assigning field names to result tuples returned
649by the :mod:`csv` or :mod:`sqlite3` modules::
650
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000651 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000652
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000653 import csv
Christian Heimesfaf2f632008-01-06 16:59:19 +0000654 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000655 print(emp.name, emp.title)
656
Georg Brandl9afde1c2007-11-01 20:32:30 +0000657 import sqlite3
658 conn = sqlite3.connect('/companydata')
659 cursor = conn.cursor()
660 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Christian Heimesfaf2f632008-01-06 16:59:19 +0000661 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Christian Heimes00412232008-01-10 16:02:19 +0000662 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000663
Christian Heimes99170a52007-12-19 02:07:34 +0000664In addition to the methods inherited from tuples, named tuples support
Christian Heimes2380ac72008-01-09 00:17:24 +0000665three additional methods and one attribute. To prevent conflicts with
666field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000667
Benjamin Peterson0b9fb802010-07-18 14:23:36 +0000668.. classmethod:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000669
Christian Heimesfaf2f632008-01-06 16:59:19 +0000670 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000671
Christian Heimesfe337bf2008-03-23 21:54:12 +0000672.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000673
Christian Heimesfaf2f632008-01-06 16:59:19 +0000674 >>> t = [11, 22]
675 >>> Point._make(t)
676 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000677
Christian Heimes790c8232008-01-07 21:14:23 +0000678.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000679
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000680 Return a new :class:`OrderedDict` which maps field names to their corresponding
681 values::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000682
Christian Heimes0449f632007-12-15 01:27:15 +0000683 >>> p._asdict()
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000684 OrderedDict([('x', 11), ('y', 22)])
685
Raymond Hettingera88e4da2009-03-03 05:12:27 +0000686 .. versionchanged:: 3.1
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000687 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000688
Christian Heimes790c8232008-01-07 21:14:23 +0000689.. method:: somenamedtuple._replace(kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000690
Christian Heimesfe337bf2008-03-23 21:54:12 +0000691 Return a new instance of the named tuple replacing specified fields with new
692 values:
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000693
694::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000695
696 >>> p = Point(x=11, y=22)
Christian Heimes0449f632007-12-15 01:27:15 +0000697 >>> p._replace(x=33)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000698 Point(x=33, y=22)
699
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000700 >>> for partnum, record in inventory.items():
Christian Heimes454f37b2008-01-10 00:10:02 +0000701 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000702
Christian Heimes790c8232008-01-07 21:14:23 +0000703.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000704
Christian Heimes2380ac72008-01-09 00:17:24 +0000705 Tuple of strings listing the field names. Useful for introspection
Georg Brandl9afde1c2007-11-01 20:32:30 +0000706 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000707
Christian Heimesfe337bf2008-03-23 21:54:12 +0000708.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000709
Christian Heimes0449f632007-12-15 01:27:15 +0000710 >>> p._fields # view the field names
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000711 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000712
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000713 >>> Color = namedtuple('Color', 'red green blue')
Christian Heimes0449f632007-12-15 01:27:15 +0000714 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000715 >>> Pixel(11, 22, 128, 255, 0)
Christian Heimes454f37b2008-01-10 00:10:02 +0000716 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000717
Christian Heimes0449f632007-12-15 01:27:15 +0000718To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000719function:
Christian Heimes0449f632007-12-15 01:27:15 +0000720
721 >>> getattr(p, 'x')
722 11
723
Raymond Hettinger651453a2009-02-11 00:20:02 +0000724To convert a dictionary to a named tuple, use the double-star-operator
725(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000726
727 >>> d = {'x': 11, 'y': 22}
728 >>> Point(**d)
729 Point(x=11, y=22)
730
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000731Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000732functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000733a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000734
Christian Heimes043d6f62008-01-07 17:19:16 +0000735 >>> class Point(namedtuple('Point', 'x y')):
Christian Heimes25bb7832008-01-11 16:17:00 +0000736 ... __slots__ = ()
Christian Heimes454f37b2008-01-10 00:10:02 +0000737 ... @property
738 ... def hypot(self):
739 ... return (self.x ** 2 + self.y ** 2) ** 0.5
740 ... def __str__(self):
Christian Heimes25bb7832008-01-11 16:17:00 +0000741 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000742
Georg Brandl0df79792008-10-04 18:33:26 +0000743 >>> for p in Point(3, 4), Point(14, 5/7):
Christian Heimes00412232008-01-10 16:02:19 +0000744 ... print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000745 Point: x= 3.000 y= 4.000 hypot= 5.000
746 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000747
Georg Brandlaf5c2382009-12-28 08:02:38 +0000748The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +0000749keep memory requirements low by preventing the creation of instance dictionaries.
750
Christian Heimes2380ac72008-01-09 00:17:24 +0000751
752Subclassing is not useful for adding new, stored fields. Instead, simply
Christian Heimesfe337bf2008-03-23 21:54:12 +0000753create a new named tuple type from the :attr:`_fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +0000754
Christian Heimes25bb7832008-01-11 16:17:00 +0000755 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +0000756
757Default values can be implemented by using :meth:`_replace` to
Christian Heimesfe337bf2008-03-23 21:54:12 +0000758customize a prototype instance:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000759
760 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Christian Heimes587c2bf2008-01-19 16:21:02 +0000761 >>> default_account = Account('<owner name>', 0.0, 0)
762 >>> johns_account = default_account._replace(owner='John')
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000763
Christian Heimese4ca8152008-05-08 17:18:53 +0000764Enumerated constants can be implemented with named tuples, but it is simpler
765and more efficient to use a simple class declaration:
766
767 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
768 >>> Status.open, Status.pending, Status.closed
769 (0, 1, 2)
770 >>> class Status:
771 ... open, pending, closed = range(3)
772
Raymond Hettinger651453a2009-02-11 00:20:02 +0000773.. seealso::
Thomas Wouters47b49bf2007-08-30 22:15:33 +0000774
Raymond Hettinger651453a2009-02-11 00:20:02 +0000775 `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
776 adapted for Python 2.4.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000777
778
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000779:class:`OrderedDict` objects
780----------------------------
781
782Ordered dictionaries are just like regular dictionaries but they remember the
783order that items were inserted. When iterating over an ordered dictionary,
784the items are returned in the order their keys were first added.
785
786.. class:: OrderedDict([items])
787
788 Return an instance of a dict subclass, supporting the usual :class:`dict`
789 methods. An *OrderedDict* is a dict that remembers the order that keys
790 were first inserted. If a new entry overwrites an existing entry, the
791 original insertion position is left unchanged. Deleting an entry and
792 reinserting it will move it to the end.
793
Benjamin Petersond45bf582009-03-02 21:44:54 +0000794 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000795
Benjamin Petersond319ad52010-07-18 14:27:02 +0000796 .. method:: popitem(last=True)
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000797
Benjamin Petersond319ad52010-07-18 14:27:02 +0000798 The :meth:`popitem` method for ordered dictionaries returns and removes a
799 (key, value) pair. The pairs are returned in LIFO order if *last* is true
800 or FIFO order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000801
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000802 .. method:: move_to_end(key, last=True)
803
804 Move an existing *key* to either end of an ordered dictionary. The item
805 is moved to the right end if *last* is true (the default) or to the
806 beginning if *last* is false. Raises :exc:`KeyError` if the *key* does
807 not exist::
808
809 >>> d = OrderedDict.fromkeys('abcde')
810 >>> d.move_to_end('b')
811 >>> ''.join(d.keys)
812 'acdeb'
Éric Araujo1cb25aa2010-11-06 07:03:07 +0000813 >>> d.move_to_end('b', last=False)
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000814 >>> ''.join(d.keys)
815 'bacde'
816
817 .. versionadded:: 3.2
818
Raymond Hettingere9091502009-05-19 17:40:07 +0000819In addition to the usual mapping methods, ordered dictionaries also support
820reverse iteration using :func:`reversed`.
821
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000822Equality tests between :class:`OrderedDict` objects are order-sensitive
823and are implemented as ``list(od1.items())==list(od2.items())``.
824Equality tests between :class:`OrderedDict` objects and other
825:class:`Mapping` objects are order-insensitive like regular dictionaries.
826This allows :class:`OrderedDict` objects to be substituted anywhere a
827regular dictionary is used.
828
Raymond Hettinger36180782009-04-09 22:34:23 +0000829The :class:`OrderedDict` constructor and :meth:`update` method both accept
830keyword arguments, but their order is lost because Python's function call
831semantics pass-in keyword arguments using a regular unordered dictionary.
832
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000833.. seealso::
834
835 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
836 that runs on Python 2.4 or later.
837
Raymond Hettinger0e312012009-11-10 18:35:46 +0000838Since an ordered dictionary remembers its insertion order, it can be used
839in conjuction with sorting to make a sorted dictionary::
840
841 >>> # regular unsorted dictionary
842 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
843
844 >>> # dictionary sorted by key
845 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
846 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
847
848 >>> # dictionary sorted by value
849 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
850 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
851
852 >>> # dictionary sorted by length of the key string
853 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
854 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
855
856The new sorted dictionaries maintain their sort order when entries
857are deleted. But when new keys are added, the keys are appended
858to the end and the sort is not maintained.
859
Raymond Hettinger4821ef82010-07-31 10:14:41 +0000860It is also straight-forward to create an ordered dictionary variant
861that the remembers the order the keys were *last* inserted.
862If a new entry overwrites an existing entry, the
863original insertion position is changed and moved to the end::
864
865 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandl77570e22010-12-18 16:21:58 +0000866 'Store items in the order the keys were last added'
Raymond Hettinger4821ef82010-07-31 10:14:41 +0000867 def __setitem__(self, key, value):
868 if key in self:
869 del self[key]
870 OrderedDict.__setitem__(self, key, value)
871
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000872
873:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +0000874-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000875
Georg Brandl48310cd2009-01-03 21:18:54 +0000876The class, :class:`UserDict` acts as a wrapper around dictionary objects.
877The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000878subclass directly from :class:`dict`; however, this class can be easier
879to work with because the underlying dictionary is accessible as an
880attribute.
881
882.. class:: UserDict([initialdata])
883
884 Class that simulates a dictionary. The instance's contents are kept in a
885 regular dictionary, which is accessible via the :attr:`data` attribute of
886 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
887 initialized with its contents; note that a reference to *initialdata* will not
888 be kept, allowing it be used for other purposes.
889
Benjamin Petersond319ad52010-07-18 14:27:02 +0000890 In addition to supporting the methods and operations of mappings,
891 :class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000892
Benjamin Petersond319ad52010-07-18 14:27:02 +0000893 .. attribute:: data
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000894
Benjamin Petersond319ad52010-07-18 14:27:02 +0000895 A real dictionary used to store the contents of the :class:`UserDict`
896 class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000897
898
899
900:class:`UserList` objects
901-------------------------
902
903This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +0000904for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000905existing methods or add new ones. In this way, one can add new behaviors to
906lists.
907
Georg Brandl48310cd2009-01-03 21:18:54 +0000908The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000909subclass directly from :class:`list`; however, this class can be easier
910to work with because the underlying list is accessible as an attribute.
911
912.. class:: UserList([list])
913
914 Class that simulates a list. The instance's contents are kept in a regular
915 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
916 instances. The instance's contents are initially set to a copy of *list*,
917 defaulting to the empty list ``[]``. *list* can be any iterable, for
918 example a real Python list or a :class:`UserList` object.
919
Benjamin Petersond319ad52010-07-18 14:27:02 +0000920 In addition to supporting the methods and operations of mutable sequences,
921 :class:`UserList` instances provide the following attribute:
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000922
Benjamin Petersond319ad52010-07-18 14:27:02 +0000923 .. attribute:: data
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000924
Benjamin Petersond319ad52010-07-18 14:27:02 +0000925 A real :class:`list` object used to store the contents of the
926 :class:`UserList` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000927
928**Subclassing requirements:** Subclasses of :class:`UserList` are expect to
929offer a constructor which can be called with either no arguments or one
930argument. List operations which return a new sequence attempt to create an
931instance of the actual implementation class. To do so, it assumes that the
932constructor can be called with a single parameter, which is a sequence object
933used as a data source.
934
935If a derived class does not wish to comply with this requirement, all of the
936special methods supported by this class will need to be overridden; please
937consult the sources for information about the methods which need to be provided
938in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000939
940:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +0000941---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000942
Georg Brandl48310cd2009-01-03 21:18:54 +0000943The class, :class:`UserString` acts as a wrapper around string objects.
944The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000945subclass directly from :class:`str`; however, this class can be easier
946to work with because the underlying string is accessible as an
947attribute.
948
949.. class:: UserString([sequence])
950
951 Class that simulates a string or a Unicode string object. The instance's
Georg Brandl48310cd2009-01-03 21:18:54 +0000952 content is kept in a regular string object, which is accessible via the
953 :attr:`data` attribute of :class:`UserString` instances. The instance's
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000954 contents are initially set to a copy of *sequence*. The *sequence* can
955 be an instance of :class:`bytes`, :class:`str`, :class:`UserString` (or a
956 subclass) or an arbitrary sequence which can be converted into a string using
957 the built-in :func:`str` function.
Raymond Hettingera6b76ba2010-08-08 00:29:08 +0000958
Raymond Hettingerdcb44912010-11-30 17:45:41 +0000959.. _abstract-base-classes:
Raymond Hettingera6b76ba2010-08-08 00:29:08 +0000960
961ABCs - abstract base classes
962----------------------------
963
964The collections module offers the following ABCs:
965
966========================= ===================== ====================== ====================================================
967ABC Inherits Abstract Methods Mixin Methods
968========================= ===================== ====================== ====================================================
969:class:`Container` ``__contains__``
970:class:`Hashable` ``__hash__``
971:class:`Iterable` ``__iter__``
972:class:`Iterator` :class:`Iterable` ``__next__`` ``__iter__``
973:class:`Sized` ``__len__``
974:class:`Callable` ``__call__``
975
976:class:`Sequence` :class:`Sized`, ``__getitem__`` ``__contains__``. ``__iter__``, ``__reversed__``.
977 :class:`Iterable`, ``index``, and ``count``
978 :class:`Container`
979
980:class:`MutableSequence` :class:`Sequence` ``__setitem__`` Inherited Sequence methods and
981 ``__delitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
982 and ``insert`` ``remove``, and ``__iadd__``
983
984:class:`Set` :class:`Sized`, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
985 :class:`Iterable`, ``__gt__``, ``__ge__``, ``__and__``, ``__or__``
986 :class:`Container` ``__sub__``, ``__xor__``, and ``isdisjoint``
987
988:class:`MutableSet` :class:`Set` ``add`` and Inherited Set methods and
989 ``discard`` ``clear``, ``pop``, ``remove``, ``__ior__``,
990 ``__iand__``, ``__ixor__``, and ``__isub__``
991
992:class:`Mapping` :class:`Sized`, ``__getitem__`` ``__contains__``, ``keys``, ``items``, ``values``,
993 :class:`Iterable`, ``get``, ``__eq__``, and ``__ne__``
994 :class:`Container`
995
996:class:`MutableMapping` :class:`Mapping` ``__setitem__`` and Inherited Mapping methods and
997 ``__delitem__`` ``pop``, ``popitem``, ``clear``, ``update``,
998 and ``setdefault``
999
1000
1001:class:`MappingView` :class:`Sized` ``__len__``
1002:class:`KeysView` :class:`MappingView`, ``__contains__``,
1003 :class:`Set` ``__iter__``
1004:class:`ItemsView` :class:`MappingView`, ``__contains__``,
1005 :class:`Set` ``__iter__``
1006:class:`ValuesView` :class:`MappingView` ``__contains__``, ``__iter__``
1007========================= ===================== ====================== ====================================================
1008
1009These ABCs allow us to ask classes or instances if they provide
1010particular functionality, for example::
1011
1012 size = None
1013 if isinstance(myvar, collections.Sized):
1014 size = len(myvar)
1015
1016Several of the ABCs are also useful as mixins that make it easier to develop
1017classes supporting container APIs. For example, to write a class supporting
1018the full :class:`Set` API, it only necessary to supply the three underlying
1019abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
1020The ABC supplies the remaining methods such as :meth:`__and__` and
1021:meth:`isdisjoint` ::
1022
1023 class ListBasedSet(collections.Set):
1024 ''' Alternate set implementation favoring space over speed
1025 and not requiring the set elements to be hashable. '''
1026 def __init__(self, iterable):
1027 self.elements = lst = []
1028 for value in iterable:
1029 if value not in lst:
1030 lst.append(value)
1031 def __iter__(self):
1032 return iter(self.elements)
1033 def __contains__(self, value):
1034 return value in self.elements
1035 def __len__(self):
1036 return len(self.elements)
1037
1038 s1 = ListBasedSet('abcdef')
1039 s2 = ListBasedSet('defghi')
1040 overlap = s1 & s2 # The __and__() method is supported automatically
1041
1042Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
1043
1044(1)
1045 Since some set operations create new sets, the default mixin methods need
1046 a way to create new instances from an iterable. The class constructor is
1047 assumed to have a signature in the form ``ClassName(iterable)``.
1048 That assumption is factored-out to an internal classmethod called
1049 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
1050 If the :class:`Set` mixin is being used in a class with a different
1051 constructor signature, you will need to override :meth:`from_iterable`
1052 with a classmethod that can construct new instances from
1053 an iterable argument.
1054
1055(2)
1056 To override the comparisons (presumably for speed, as the
1057 semantics are fixed), redefine :meth:`__le__` and
1058 then the other operations will automatically follow suit.
1059
1060(3)
1061 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
1062 for the set; however, :meth:`__hash__` is not defined because not all sets
1063 are hashable or immutable. To add set hashabilty using mixins,
1064 inherit from both :meth:`Set` and :meth:`Hashable`, then define
1065 ``__hash__ = Set._hash``.
1066
1067.. seealso::
1068
Éric Araujo6e6cb8e2010-11-16 19:13:50 +00001069 * Latest version of the :source:`Python source code for the collections
1070 abstract base classes <Lib/_abcoll.py>`
1071
Raymond Hettingera6b76ba2010-08-08 00:29:08 +00001072 * `OrderedSet recipe <http://code.activestate.com/recipes/576694/>`_ for an
1073 example built on :class:`MutableSet`.
1074
1075 * For more about ABCs, see the :mod:`abc` module and :pep:`3119`.