blob: 4a92b052aa5a47cb7d558a1762c48c8ce2ccece3 [file] [log] [blame]
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001:mod:`collections` --- Container datatypes
2==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00003
4.. module:: collections
Raymond Hettinger53dbe392008-02-12 20:03:09 +00005 :synopsis: Container datatypes
Georg Brandl116aa622007-08-15 14:28:22 +00006.. moduleauthor:: Raymond Hettinger <python@rcn.com>
7.. sectionauthor:: Raymond Hettinger <python@rcn.com>
8
Christian Heimesfe337bf2008-03-23 21:54:12 +00009.. testsetup:: *
10
11 from collections import *
12 import itertools
13 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000014
Raymond Hettingera1993682011-01-27 01:20:32 +000015**Source code:** :source:`Lib/collections.py` and :source:`Lib/_abcoll.py`
Raymond Hettinger10480942011-01-10 03:26:08 +000016
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000017--------------
18
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000019This module implements specialized container datatypes providing alternatives to
20Python's general purpose built-in containers, :class:`dict`, :class:`list`,
21:class:`set`, and :class:`tuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000022
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000023===================== ====================================================================
24:func:`namedtuple` factory function for creating tuple subclasses with named fields
25:class:`deque` list-like container with fast appends and pops on either end
26:class:`Counter` dict subclass for counting hashable objects
27:class:`OrderedDict` dict subclass that remembers the order entries were added
28:class:`defaultdict` dict subclass that calls a factory function to supply missing values
29:class:`UserDict` wrapper around dictionary objects for easier dict subclassing
30:class:`UserList` wrapper around list objects for easier list subclassing
31:class:`UserString` wrapper around string objects for easier string subclassing
32===================== ====================================================================
Georg Brandl116aa622007-08-15 14:28:22 +000033
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000034In addition to the concrete container classes, the collections module provides
Raymond Hettingerdcb44912010-11-30 17:45:41 +000035:ref:`abstract-base-classes` that can be used to test whether a class provides a
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000036particular interface, for example, whether it is hashable or a mapping.
Mark Summerfield08898b42007-09-05 08:43:04 +000037
38
Raymond Hettingerb8baf632009-01-14 02:20:07 +000039:class:`Counter` objects
40------------------------
41
42A counter tool is provided to support convenient and rapid tallies.
43For example::
44
Raymond Hettinger1c62dc92009-02-04 11:41:45 +000045 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +000046 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +000047 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000048 ... cnt[word] += 1
49 >>> cnt
50 Counter({'blue': 3, 'red': 2, 'green': 1})
51
Raymond Hettinger1c62dc92009-02-04 11:41:45 +000052 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +000053 >>> import re
54 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +000055 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +000056 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
57 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
58
59.. class:: Counter([iterable-or-mapping])
60
Raymond Hettinger670eaec2009-01-21 23:14:07 +000061 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +000062 It is an unordered collection where elements are stored as dictionary keys
63 and their counts are stored as dictionary values. Counts are allowed to be
64 any integer value including zero or negative counts. The :class:`Counter`
65 class is similar to bags or multisets in other languages.
66
67 Elements are counted from an *iterable* or initialized from another
Benjamin Peterson25c95f12009-05-08 20:42:26 +000068 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +000069
Raymond Hettinger73662a52009-01-27 02:38:22 +000070 >>> c = Counter() # a new, empty counter
71 >>> c = Counter('gallahad') # a new counter from an iterable
72 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
73 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +000074
Raymond Hettinger670eaec2009-01-21 23:14:07 +000075 Counter objects have a dictionary interface except that they return a zero
Benjamin Peterson25c95f12009-05-08 20:42:26 +000076 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000077
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000078 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +000079 >>> c['bacon'] # count of a missing element is zero
80 0
81
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000082 Setting a count to zero does not remove an element from a counter.
83 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000084
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000085 >>> c['sausage'] = 0 # counter entry with a zero count
86 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +000087
Benjamin Petersond45bf582009-03-02 21:44:54 +000088 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +000089
90
Ezio Melotti0be8b1c2010-04-04 06:53:44 +000091 Counter objects support three methods beyond those available for all
Raymond Hettingerb8baf632009-01-14 02:20:07 +000092 dictionaries:
93
94 .. method:: elements()
95
Raymond Hettinger670eaec2009-01-21 23:14:07 +000096 Return an iterator over elements repeating each as many times as its
97 count. Elements are returned in arbitrary order. If an element's count
98 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +000099
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000100 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000101 >>> list(c.elements())
102 ['a', 'a', 'a', 'a', 'b', 'b']
103
104 .. method:: most_common([n])
105
Raymond Hettinger73662a52009-01-27 02:38:22 +0000106 Return a list of the *n* most common elements and their counts from the
Raymond Hettingerd04fa312009-02-04 19:45:13 +0000107 most common to the least. If *n* is not specified, :func:`most_common`
Raymond Hettinger73662a52009-01-27 02:38:22 +0000108 returns *all* elements in the counter. Elements with equal counts are
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000109 ordered arbitrarily:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000110
111 >>> Counter('abracadabra').most_common(3)
112 [('a', 5), ('r', 2), ('b', 2)]
113
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000114 .. method:: subtract([iterable-or-mapping])
115
116 Elements are subtracted from an *iterable* or from another *mapping*
117 (or counter). Like :meth:`dict.update` but subtracts counts instead
118 of replacing them. Both inputs and outputs may be zero or negative.
119
120 >>> c = Counter(a=4, b=2, c=0, d=-2)
121 >>> d = Counter(a=1, b=2, c=3, d=4)
122 >>> c.subtract(d)
123 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
124
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000125 .. versionadded:: 3.2
126
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000127 The usual dictionary methods are available for :class:`Counter` objects
128 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000129
130 .. method:: fromkeys(iterable)
131
Raymond Hettinger73662a52009-01-27 02:38:22 +0000132 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000133
134 .. method:: update([iterable-or-mapping])
135
Raymond Hettinger73662a52009-01-27 02:38:22 +0000136 Elements are counted from an *iterable* or added-in from another
137 *mapping* (or counter). Like :meth:`dict.update` but adds counts
138 instead of replacing them. Also, the *iterable* is expected to be a
139 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000140
141Common patterns for working with :class:`Counter` objects::
142
Raymond Hettinger73662a52009-01-27 02:38:22 +0000143 sum(c.values()) # total of all counts
144 c.clear() # reset all counts
145 list(c) # list unique elements
146 set(c) # convert to a set
147 dict(c) # convert to a regular dictionary
148 c.items() # convert to a list of (elem, cnt) pairs
149 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
150 c.most_common()[:-n:-1] # n least common elements
151 c += Counter() # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000152
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000153Several mathematical operations are provided for combining :class:`Counter`
154objects to produce multisets (counters that have counts greater than zero).
155Addition and subtraction combine counters by adding or subtracting the counts
156of corresponding elements. Intersection and union return the minimum and
157maximum of corresponding counts. Each operation can accept inputs with signed
158counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000159
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000160 >>> c = Counter(a=3, b=1)
161 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000162 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000163 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000164 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000165 Counter({'a': 2})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000166 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000167 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000168 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000169 Counter({'a': 3, 'b': 2})
170
Raymond Hettinger22f18852010-04-12 21:45:14 +0000171.. note::
172
173 Counters were primarily designed to work with positive integers to represent
174 running counts; however, care was taken to not unnecessarily preclude use
175 cases needing other types or negative values. To help with those use cases,
176 this section documents the minimum range and type restrictions.
177
178 * The :class:`Counter` class itself is a dictionary subclass with no
179 restrictions on its keys and values. The values are intended to be numbers
180 representing counts, but you *could* store anything in the value field.
181
182 * The :meth:`most_common` method requires only that the values be orderable.
183
184 * For in-place operations such as ``c[key] += 1``, the value type need only
185 support addition and subtraction. So fractions, floats, and decimals would
186 work and negative values are supported. The same is also true for
187 :meth:`update` and :meth:`subtract` which allow negative and zero values
188 for both inputs and outputs.
189
190 * The multiset methods are designed only for use cases with positive values.
191 The inputs may be negative or zero, but only outputs with positive values
192 are created. There are no type restrictions, but the value type needs to
193 support support addition, subtraction, and comparison.
194
195 * The :meth:`elements` method requires integer counts. It ignores zero and
196 negative counts.
197
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000198.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000199
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000200 * `Counter class <http://code.activestate.com/recipes/576611/>`_
201 adapted for Python 2.5 and an early `Bag recipe
202 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
203
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000204 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
205 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000206
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000207 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_\.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000208
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000209 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000210 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000211
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000212 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000213 *Knuth, Donald. The Art of Computer Programming Volume II,
214 Section 4.6.3, Exercise 19*\.
215
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000216 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd07d9392009-01-27 04:20:44 +0000217 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000218
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000219 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000220
221
Georg Brandl116aa622007-08-15 14:28:22 +0000222:class:`deque` objects
223----------------------
224
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000225.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000226
227 Returns a new deque object initialized left-to-right (using :meth:`append`) with
228 data from *iterable*. If *iterable* is not specified, the new deque is empty.
229
230 Deques are a generalization of stacks and queues (the name is pronounced "deck"
231 and is short for "double-ended queue"). Deques support thread-safe, memory
232 efficient appends and pops from either side of the deque with approximately the
233 same O(1) performance in either direction.
234
235 Though :class:`list` objects support similar operations, they are optimized for
236 fast fixed-length operations and incur O(n) memory movement costs for
237 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
238 position of the underlying data representation.
239
Georg Brandl116aa622007-08-15 14:28:22 +0000240
Georg Brandl9afde1c2007-11-01 20:32:30 +0000241 If *maxlen* is not specified or is *None*, deques may grow to an
242 arbitrary length. Otherwise, the deque is bounded to the specified maximum
243 length. Once a bounded length deque is full, when new items are added, a
244 corresponding number of items are discarded from the opposite end. Bounded
245 length deques provide functionality similar to the ``tail`` filter in
246 Unix. They are also useful for tracking transactions and other pools of data
247 where only the most recent activity is of interest.
248
Georg Brandl9afde1c2007-11-01 20:32:30 +0000249
Benjamin Petersone41251e2008-04-25 01:59:09 +0000250 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000251
Benjamin Petersone41251e2008-04-25 01:59:09 +0000252 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000253
Benjamin Petersone41251e2008-04-25 01:59:09 +0000254 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000255
256
Benjamin Petersone41251e2008-04-25 01:59:09 +0000257 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000258
Benjamin Petersone41251e2008-04-25 01:59:09 +0000259 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000260
261
Benjamin Petersone41251e2008-04-25 01:59:09 +0000262 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000263
Benjamin Petersone41251e2008-04-25 01:59:09 +0000264 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000265
266
Raymond Hettinger44459de2010-04-03 23:20:46 +0000267 .. method:: count(x)
268
269 Count the number of deque elements equal to *x*.
270
271 .. versionadded:: 3.2
272
Georg Brandl67b21b72010-08-17 15:07:14 +0000273
Benjamin Petersone41251e2008-04-25 01:59:09 +0000274 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000275
Benjamin Petersone41251e2008-04-25 01:59:09 +0000276 Extend the right side of the deque by appending elements from the iterable
277 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000278
279
Benjamin Petersone41251e2008-04-25 01:59:09 +0000280 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000281
Benjamin Petersone41251e2008-04-25 01:59:09 +0000282 Extend the left side of the deque by appending elements from *iterable*.
283 Note, the series of left appends results in reversing the order of
284 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000285
286
Benjamin Petersone41251e2008-04-25 01:59:09 +0000287 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000288
Benjamin Petersone41251e2008-04-25 01:59:09 +0000289 Remove and return an element from the right side of the deque. If no
290 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000291
292
Benjamin Petersone41251e2008-04-25 01:59:09 +0000293 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000294
Benjamin Petersone41251e2008-04-25 01:59:09 +0000295 Remove and return an element from the left side of the deque. If no
296 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000297
298
Benjamin Petersone41251e2008-04-25 01:59:09 +0000299 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000300
Benjamin Petersone41251e2008-04-25 01:59:09 +0000301 Removed the first occurrence of *value*. If not found, raises a
302 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000303
Georg Brandl67b21b72010-08-17 15:07:14 +0000304
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000305 .. method:: reverse()
306
307 Reverse the elements of the deque in-place and then return ``None``.
308
309 .. versionadded:: 3.2
Georg Brandl116aa622007-08-15 14:28:22 +0000310
Georg Brandl67b21b72010-08-17 15:07:14 +0000311
Benjamin Petersone41251e2008-04-25 01:59:09 +0000312 .. method:: rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000313
Benjamin Petersone41251e2008-04-25 01:59:09 +0000314 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
315 the left. Rotating one step to the right is equivalent to:
316 ``d.appendleft(d.pop())``.
317
Georg Brandl116aa622007-08-15 14:28:22 +0000318
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000319 Deque objects also provide one read-only attribute:
320
321 .. attribute:: maxlen
322
323 Maximum size of a deque or *None* if unbounded.
324
Raymond Hettinger150fb9c2009-03-10 22:48:06 +0000325 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000326
327
Georg Brandl116aa622007-08-15 14:28:22 +0000328In addition to the above, deques support iteration, pickling, ``len(d)``,
329``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson206e3072008-10-19 14:07:49 +0000330the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
331access is O(1) at both ends but slows to O(n) in the middle. For fast random
332access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000333
Christian Heimesfe337bf2008-03-23 21:54:12 +0000334Example:
335
336.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000337
338 >>> from collections import deque
339 >>> d = deque('ghi') # make a new deque with three items
340 >>> for elem in d: # iterate over the deque's elements
Neal Norwitz752abd02008-05-13 04:55:24 +0000341 ... print(elem.upper())
Georg Brandl116aa622007-08-15 14:28:22 +0000342 G
343 H
344 I
345
346 >>> d.append('j') # add a new entry to the right side
347 >>> d.appendleft('f') # add a new entry to the left side
348 >>> d # show the representation of the deque
349 deque(['f', 'g', 'h', 'i', 'j'])
350
351 >>> d.pop() # return and remove the rightmost item
352 'j'
353 >>> d.popleft() # return and remove the leftmost item
354 'f'
355 >>> list(d) # list the contents of the deque
356 ['g', 'h', 'i']
357 >>> d[0] # peek at leftmost item
358 'g'
359 >>> d[-1] # peek at rightmost item
360 'i'
361
362 >>> list(reversed(d)) # list the contents of a deque in reverse
363 ['i', 'h', 'g']
364 >>> 'h' in d # search the deque
365 True
366 >>> d.extend('jkl') # add multiple elements at once
367 >>> d
368 deque(['g', 'h', 'i', 'j', 'k', 'l'])
369 >>> d.rotate(1) # right rotation
370 >>> d
371 deque(['l', 'g', 'h', 'i', 'j', 'k'])
372 >>> d.rotate(-1) # left rotation
373 >>> d
374 deque(['g', 'h', 'i', 'j', 'k', 'l'])
375
376 >>> deque(reversed(d)) # make a new deque in reverse order
377 deque(['l', 'k', 'j', 'i', 'h', 'g'])
378 >>> d.clear() # empty the deque
379 >>> d.pop() # cannot pop from an empty deque
380 Traceback (most recent call last):
381 File "<pyshell#6>", line 1, in -toplevel-
382 d.pop()
383 IndexError: pop from an empty deque
384
385 >>> d.extendleft('abc') # extendleft() reverses the input order
386 >>> d
387 deque(['c', 'b', 'a'])
388
389
Georg Brandl9afde1c2007-11-01 20:32:30 +0000390:class:`deque` Recipes
391^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000392
393This section shows various approaches to working with deques.
394
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000395Bounded length deques provide functionality similar to the ``tail`` filter
396in Unix::
397
398 def tail(filename, n=10):
399 'Return the last n lines of a file'
400 return deque(open(filename), n)
401
402Another approach to using deques is to maintain a sequence of recently
403added elements by appending to the right and popping to the left::
404
405 def moving_average(iterable, n=3):
406 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
407 # http://en.wikipedia.org/wiki/Moving_average
408 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000409 d = deque(itertools.islice(it, n-1))
410 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000411 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000412 for elem in it:
413 s += elem - d.popleft()
414 d.append(elem)
415 yield s / n
416
Georg Brandl116aa622007-08-15 14:28:22 +0000417The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti0639d5a2009-12-19 23:26:38 +0000418deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl116aa622007-08-15 14:28:22 +0000419the :meth:`rotate` method to position elements to be popped::
420
421 def delete_nth(d, n):
422 d.rotate(-n)
423 d.popleft()
424 d.rotate(n)
425
426To implement :class:`deque` slicing, use a similar approach applying
427:meth:`rotate` to bring a target element to the left side of the deque. Remove
428old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
429reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000430With minor variations on that approach, it is easy to implement Forth style
431stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
432``rot``, and ``roll``.
433
Georg Brandl116aa622007-08-15 14:28:22 +0000434
435:class:`defaultdict` objects
436----------------------------
437
Georg Brandl116aa622007-08-15 14:28:22 +0000438.. class:: defaultdict([default_factory[, ...]])
439
440 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandl22b34312009-07-26 14:54:51 +0000441 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl116aa622007-08-15 14:28:22 +0000442 instance variable. The remaining functionality is the same as for the
443 :class:`dict` class and is not documented here.
444
445 The first argument provides the initial value for the :attr:`default_factory`
446 attribute; it defaults to ``None``. All remaining arguments are treated the same
447 as if they were passed to the :class:`dict` constructor, including keyword
448 arguments.
449
Georg Brandl116aa622007-08-15 14:28:22 +0000450
Benjamin Petersone41251e2008-04-25 01:59:09 +0000451 :class:`defaultdict` objects support the following method in addition to the
452 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000453
Benjamin Petersond319ad52010-07-18 14:27:02 +0000454 .. method:: __missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000455
Benjamin Peterson5478b472008-09-17 22:25:09 +0000456 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersone41251e2008-04-25 01:59:09 +0000457 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000458
Benjamin Petersone41251e2008-04-25 01:59:09 +0000459 If :attr:`default_factory` is not ``None``, it is called without arguments
460 to provide a default value for the given *key*, this value is inserted in
461 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000462
Benjamin Petersone41251e2008-04-25 01:59:09 +0000463 If calling :attr:`default_factory` raises an exception this exception is
464 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000465
Benjamin Petersone41251e2008-04-25 01:59:09 +0000466 This method is called by the :meth:`__getitem__` method of the
467 :class:`dict` class when the requested key is not found; whatever it
468 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000469
470
Benjamin Petersone41251e2008-04-25 01:59:09 +0000471 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000472
Benjamin Petersone41251e2008-04-25 01:59:09 +0000473
Benjamin Petersond319ad52010-07-18 14:27:02 +0000474 .. attribute:: default_factory
Benjamin Petersone41251e2008-04-25 01:59:09 +0000475
476 This attribute is used by the :meth:`__missing__` method; it is
477 initialized from the first argument to the constructor, if present, or to
478 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000479
480
Georg Brandl116aa622007-08-15 14:28:22 +0000481:class:`defaultdict` Examples
482^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
483
484Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000485sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000486
487 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
488 >>> d = defaultdict(list)
489 >>> for k, v in s:
490 ... d[k].append(v)
491 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000492 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000493 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
494
495When each key is encountered for the first time, it is not already in the
496mapping; so an entry is automatically created using the :attr:`default_factory`
497function which returns an empty :class:`list`. The :meth:`list.append`
498operation then attaches the value to the new list. When keys are encountered
499again, the look-up proceeds normally (returning the list for that key) and the
500:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000501simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000502
503 >>> d = {}
504 >>> for k, v in s:
505 ... d.setdefault(k, []).append(v)
506 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000507 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000508 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
509
510Setting the :attr:`default_factory` to :class:`int` makes the
511:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000512languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000513
514 >>> s = 'mississippi'
515 >>> d = defaultdict(int)
516 >>> for k in s:
517 ... d[k] += 1
518 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000519 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000520 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
521
522When a letter is first encountered, it is missing from the mapping, so the
523:attr:`default_factory` function calls :func:`int` to supply a default count of
524zero. The increment operation then builds up the count for each letter.
525
526The function :func:`int` which always returns zero is just a special case of
527constant functions. A faster and more flexible way to create constant functions
528is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000529zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000530
531 >>> def constant_factory(value):
532 ... return lambda: value
533 >>> d = defaultdict(constant_factory('<missing>'))
534 >>> d.update(name='John', action='ran')
535 >>> '%(name)s %(action)s to %(object)s' % d
536 'John ran to <missing>'
537
538Setting the :attr:`default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000539:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000540
541 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
542 >>> d = defaultdict(set)
543 >>> for k, v in s:
544 ... d[k].add(v)
545 ...
Ezio Melottic53a8942009-09-12 01:52:05 +0000546 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000547 [('blue', set([2, 4])), ('red', set([1, 3]))]
548
549
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000550:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000551----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000552
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000553Named tuples assign meaning to each position in a tuple and allow for more readable,
554self-documenting code. They can be used wherever regular tuples are used, and
555they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000556
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000557.. function:: namedtuple(typename, field_names, verbose=False, rename=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000558
559 Returns a new tuple subclass named *typename*. The new subclass is used to
Christian Heimesc3f30c42008-02-22 16:37:40 +0000560 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl116aa622007-08-15 14:28:22 +0000561 well as being indexable and iterable. Instances of the subclass also have a
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000562 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl116aa622007-08-15 14:28:22 +0000563 method which lists the tuple contents in a ``name=value`` format.
564
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000565 The *field_names* are a single string with each fieldname separated by whitespace
566 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
Christian Heimes25bb7832008-01-11 16:17:00 +0000567 can be a sequence of strings such as ``['x', 'y']``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000568
569 Any valid Python identifier may be used for a fieldname except for names
Christian Heimes0449f632007-12-15 01:27:15 +0000570 starting with an underscore. Valid identifiers consist of letters, digits,
571 and underscores but do not start with a digit or underscore and cannot be
Georg Brandlf6945182008-02-01 11:56:49 +0000572 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
Georg Brandl9afde1c2007-11-01 20:32:30 +0000573 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000574
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000575 If *rename* is true, invalid fieldnames are automatically replaced
576 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger85737b82009-04-02 22:37:59 +0000577 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000578 ``def`` and the duplicate fieldname ``abc``.
579
Christian Heimes25bb7832008-01-11 16:17:00 +0000580 If *verbose* is true, the class definition is printed just before being built.
Georg Brandl116aa622007-08-15 14:28:22 +0000581
Georg Brandl9afde1c2007-11-01 20:32:30 +0000582 Named tuple instances do not have per-instance dictionaries, so they are
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000583 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000584
Raymond Hettingerb62ad242009-03-02 22:16:43 +0000585 .. versionchanged:: 3.1
Georg Brandl67b21b72010-08-17 15:07:14 +0000586 Added support for *rename*.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000587
Christian Heimesfe337bf2008-03-23 21:54:12 +0000588
589.. doctest::
590 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000591
Raymond Hettinger0ef956f2010-11-21 23:23:29 +0000592 >>> # Basic example
593 >>> Point = namedtuple('Point', 'x y')
594 >>> p = Point(x=10, y=11)
595
596 >>> # Example using the verbose option to print the class definition
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000597 >>> Point = namedtuple('Point', 'x y', verbose=True)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000598 class Point(tuple):
599 'Point(x, y)'
Christian Heimesfe337bf2008-03-23 21:54:12 +0000600 <BLANKLINE>
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000601 __slots__ = ()
Christian Heimesfe337bf2008-03-23 21:54:12 +0000602 <BLANKLINE>
Christian Heimesfaf2f632008-01-06 16:59:19 +0000603 _fields = ('x', 'y')
Christian Heimesfe337bf2008-03-23 21:54:12 +0000604 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000605 def __new__(_cls, x, y):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000606 'Create a new instance of Point(x, y)'
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000607 return _tuple.__new__(_cls, (x, y))
Christian Heimesfe337bf2008-03-23 21:54:12 +0000608 <BLANKLINE>
Christian Heimesfaf2f632008-01-06 16:59:19 +0000609 @classmethod
Christian Heimesfe337bf2008-03-23 21:54:12 +0000610 def _make(cls, iterable, new=tuple.__new__, len=len):
Christian Heimesfaf2f632008-01-06 16:59:19 +0000611 'Make a new Point object from a sequence or iterable'
Christian Heimesfe337bf2008-03-23 21:54:12 +0000612 result = new(cls, iterable)
Christian Heimesfaf2f632008-01-06 16:59:19 +0000613 if len(result) != 2:
614 raise TypeError('Expected 2 arguments, got %d' % len(result))
615 return result
Christian Heimesfe337bf2008-03-23 21:54:12 +0000616 <BLANKLINE>
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000617 def __repr__(self):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000618 'Return a nicely formatted representation string'
Raymond Hettingerd331ce92010-08-08 01:13:42 +0000619 return self.__class__.__name__ + '(x=%r, y=%r)' % self
Christian Heimesfe337bf2008-03-23 21:54:12 +0000620 <BLANKLINE>
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000621 def _asdict(self):
622 'Return a new OrderedDict which maps field names to their values'
623 return OrderedDict(zip(self._fields, self))
Christian Heimesfe337bf2008-03-23 21:54:12 +0000624 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000625 def _replace(_self, **kwds):
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000626 'Return a new Point object replacing specified fields with new values'
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000627 result = _self._make(map(kwds.pop, ('x', 'y'), _self))
Christian Heimesfaf2f632008-01-06 16:59:19 +0000628 if kwds:
Ezio Melotti8f7649e2009-09-13 04:48:45 +0000629 raise ValueError('Got unexpected field names: %r' % list(kwds.keys()))
Christian Heimesfaf2f632008-01-06 16:59:19 +0000630 return result
Georg Brandl48310cd2009-01-03 21:18:54 +0000631 <BLANKLINE>
632 def __getnewargs__(self):
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000633 'Return self as a plain tuple. Used by copy and pickle.'
Benjamin Peterson41181742008-07-02 20:22:54 +0000634 return tuple(self)
Christian Heimesfe337bf2008-03-23 21:54:12 +0000635 <BLANKLINE>
Raymond Hettinger7b0d3c62010-04-02 18:54:02 +0000636 x = _property(_itemgetter(0), doc='Alias for field number 0')
637 y = _property(_itemgetter(1), doc='Alias for field number 1')
Georg Brandl116aa622007-08-15 14:28:22 +0000638
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000639 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Christian Heimes99170a52007-12-19 02:07:34 +0000640 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000641 33
642 >>> x, y = p # unpack like a regular tuple
643 >>> x, y
644 (11, 22)
Christian Heimesc3f30c42008-02-22 16:37:40 +0000645 >>> p.x + p.y # fields also accessible by name
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000646 33
647 >>> p # readable __repr__ with a name=value style
648 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000649
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000650Named tuples are especially useful for assigning field names to result tuples returned
651by the :mod:`csv` or :mod:`sqlite3` modules::
652
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000653 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000654
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000655 import csv
Christian Heimesfaf2f632008-01-06 16:59:19 +0000656 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000657 print(emp.name, emp.title)
658
Georg Brandl9afde1c2007-11-01 20:32:30 +0000659 import sqlite3
660 conn = sqlite3.connect('/companydata')
661 cursor = conn.cursor()
662 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Christian Heimesfaf2f632008-01-06 16:59:19 +0000663 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Christian Heimes00412232008-01-10 16:02:19 +0000664 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000665
Christian Heimes99170a52007-12-19 02:07:34 +0000666In addition to the methods inherited from tuples, named tuples support
Christian Heimes2380ac72008-01-09 00:17:24 +0000667three additional methods and one attribute. To prevent conflicts with
668field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000669
Benjamin Peterson0b9fb802010-07-18 14:23:36 +0000670.. classmethod:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000671
Christian Heimesfaf2f632008-01-06 16:59:19 +0000672 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000673
Christian Heimesfe337bf2008-03-23 21:54:12 +0000674.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000675
Christian Heimesfaf2f632008-01-06 16:59:19 +0000676 >>> t = [11, 22]
677 >>> Point._make(t)
678 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000679
Christian Heimes790c8232008-01-07 21:14:23 +0000680.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000681
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000682 Return a new :class:`OrderedDict` which maps field names to their corresponding
683 values::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000684
Christian Heimes0449f632007-12-15 01:27:15 +0000685 >>> p._asdict()
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000686 OrderedDict([('x', 11), ('y', 22)])
687
Raymond Hettingera88e4da2009-03-03 05:12:27 +0000688 .. versionchanged:: 3.1
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000689 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000690
Christian Heimes790c8232008-01-07 21:14:23 +0000691.. method:: somenamedtuple._replace(kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000692
Christian Heimesfe337bf2008-03-23 21:54:12 +0000693 Return a new instance of the named tuple replacing specified fields with new
694 values:
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000695
696::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000697
698 >>> p = Point(x=11, y=22)
Christian Heimes0449f632007-12-15 01:27:15 +0000699 >>> p._replace(x=33)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000700 Point(x=33, y=22)
701
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000702 >>> for partnum, record in inventory.items():
Christian Heimes454f37b2008-01-10 00:10:02 +0000703 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000704
Christian Heimes790c8232008-01-07 21:14:23 +0000705.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000706
Christian Heimes2380ac72008-01-09 00:17:24 +0000707 Tuple of strings listing the field names. Useful for introspection
Georg Brandl9afde1c2007-11-01 20:32:30 +0000708 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000709
Christian Heimesfe337bf2008-03-23 21:54:12 +0000710.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000711
Christian Heimes0449f632007-12-15 01:27:15 +0000712 >>> p._fields # view the field names
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000713 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000714
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000715 >>> Color = namedtuple('Color', 'red green blue')
Christian Heimes0449f632007-12-15 01:27:15 +0000716 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000717 >>> Pixel(11, 22, 128, 255, 0)
Christian Heimes454f37b2008-01-10 00:10:02 +0000718 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000719
Christian Heimes0449f632007-12-15 01:27:15 +0000720To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000721function:
Christian Heimes0449f632007-12-15 01:27:15 +0000722
723 >>> getattr(p, 'x')
724 11
725
Raymond Hettinger651453a2009-02-11 00:20:02 +0000726To convert a dictionary to a named tuple, use the double-star-operator
727(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000728
729 >>> d = {'x': 11, 'y': 22}
730 >>> Point(**d)
731 Point(x=11, y=22)
732
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000733Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000734functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000735a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000736
Christian Heimes043d6f62008-01-07 17:19:16 +0000737 >>> class Point(namedtuple('Point', 'x y')):
Christian Heimes25bb7832008-01-11 16:17:00 +0000738 ... __slots__ = ()
Christian Heimes454f37b2008-01-10 00:10:02 +0000739 ... @property
740 ... def hypot(self):
741 ... return (self.x ** 2 + self.y ** 2) ** 0.5
742 ... def __str__(self):
Christian Heimes25bb7832008-01-11 16:17:00 +0000743 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000744
Georg Brandl0df79792008-10-04 18:33:26 +0000745 >>> for p in Point(3, 4), Point(14, 5/7):
Christian Heimes00412232008-01-10 16:02:19 +0000746 ... print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000747 Point: x= 3.000 y= 4.000 hypot= 5.000
748 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000749
Georg Brandlaf5c2382009-12-28 08:02:38 +0000750The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +0000751keep memory requirements low by preventing the creation of instance dictionaries.
752
Christian Heimes2380ac72008-01-09 00:17:24 +0000753
754Subclassing is not useful for adding new, stored fields. Instead, simply
Christian Heimesfe337bf2008-03-23 21:54:12 +0000755create a new named tuple type from the :attr:`_fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +0000756
Christian Heimes25bb7832008-01-11 16:17:00 +0000757 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +0000758
759Default values can be implemented by using :meth:`_replace` to
Christian Heimesfe337bf2008-03-23 21:54:12 +0000760customize a prototype instance:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000761
762 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Christian Heimes587c2bf2008-01-19 16:21:02 +0000763 >>> default_account = Account('<owner name>', 0.0, 0)
764 >>> johns_account = default_account._replace(owner='John')
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000765
Christian Heimese4ca8152008-05-08 17:18:53 +0000766Enumerated constants can be implemented with named tuples, but it is simpler
767and more efficient to use a simple class declaration:
768
769 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
770 >>> Status.open, Status.pending, Status.closed
771 (0, 1, 2)
772 >>> class Status:
773 ... open, pending, closed = range(3)
774
Raymond Hettinger651453a2009-02-11 00:20:02 +0000775.. seealso::
Thomas Wouters47b49bf2007-08-30 22:15:33 +0000776
Raymond Hettinger651453a2009-02-11 00:20:02 +0000777 `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
778 adapted for Python 2.4.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000779
780
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000781:class:`OrderedDict` objects
782----------------------------
783
784Ordered dictionaries are just like regular dictionaries but they remember the
785order that items were inserted. When iterating over an ordered dictionary,
786the items are returned in the order their keys were first added.
787
788.. class:: OrderedDict([items])
789
790 Return an instance of a dict subclass, supporting the usual :class:`dict`
791 methods. An *OrderedDict* is a dict that remembers the order that keys
792 were first inserted. If a new entry overwrites an existing entry, the
793 original insertion position is left unchanged. Deleting an entry and
794 reinserting it will move it to the end.
795
Benjamin Petersond45bf582009-03-02 21:44:54 +0000796 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000797
Benjamin Petersond319ad52010-07-18 14:27:02 +0000798 .. method:: popitem(last=True)
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000799
Benjamin Petersond319ad52010-07-18 14:27:02 +0000800 The :meth:`popitem` method for ordered dictionaries returns and removes a
801 (key, value) pair. The pairs are returned in LIFO order if *last* is true
802 or FIFO order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000803
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000804 .. method:: move_to_end(key, last=True)
805
806 Move an existing *key* to either end of an ordered dictionary. The item
807 is moved to the right end if *last* is true (the default) or to the
808 beginning if *last* is false. Raises :exc:`KeyError` if the *key* does
809 not exist::
810
811 >>> d = OrderedDict.fromkeys('abcde')
812 >>> d.move_to_end('b')
813 >>> ''.join(d.keys)
814 'acdeb'
Éric Araujo1cb25aa2010-11-06 07:03:07 +0000815 >>> d.move_to_end('b', last=False)
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000816 >>> ''.join(d.keys)
817 'bacde'
818
819 .. versionadded:: 3.2
820
Raymond Hettingere9091502009-05-19 17:40:07 +0000821In addition to the usual mapping methods, ordered dictionaries also support
822reverse iteration using :func:`reversed`.
823
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000824Equality tests between :class:`OrderedDict` objects are order-sensitive
825and are implemented as ``list(od1.items())==list(od2.items())``.
826Equality tests between :class:`OrderedDict` objects and other
827:class:`Mapping` objects are order-insensitive like regular dictionaries.
828This allows :class:`OrderedDict` objects to be substituted anywhere a
829regular dictionary is used.
830
Raymond Hettinger36180782009-04-09 22:34:23 +0000831The :class:`OrderedDict` constructor and :meth:`update` method both accept
832keyword arguments, but their order is lost because Python's function call
833semantics pass-in keyword arguments using a regular unordered dictionary.
834
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000835.. seealso::
836
837 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
838 that runs on Python 2.4 or later.
839
Raymond Hettinger0e312012009-11-10 18:35:46 +0000840Since an ordered dictionary remembers its insertion order, it can be used
841in conjuction with sorting to make a sorted dictionary::
842
843 >>> # regular unsorted dictionary
844 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
845
846 >>> # dictionary sorted by key
847 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
848 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
849
850 >>> # dictionary sorted by value
851 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
852 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
853
854 >>> # dictionary sorted by length of the key string
855 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
856 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
857
858The new sorted dictionaries maintain their sort order when entries
859are deleted. But when new keys are added, the keys are appended
860to the end and the sort is not maintained.
861
Raymond Hettinger4821ef82010-07-31 10:14:41 +0000862It is also straight-forward to create an ordered dictionary variant
863that the remembers the order the keys were *last* inserted.
864If a new entry overwrites an existing entry, the
865original insertion position is changed and moved to the end::
866
867 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandl77570e22010-12-18 16:21:58 +0000868 'Store items in the order the keys were last added'
Raymond Hettinger4821ef82010-07-31 10:14:41 +0000869 def __setitem__(self, key, value):
870 if key in self:
871 del self[key]
872 OrderedDict.__setitem__(self, key, value)
873
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000874
875:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +0000876-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000877
Georg Brandl48310cd2009-01-03 21:18:54 +0000878The class, :class:`UserDict` acts as a wrapper around dictionary objects.
879The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000880subclass directly from :class:`dict`; however, this class can be easier
881to work with because the underlying dictionary is accessible as an
882attribute.
883
884.. class:: UserDict([initialdata])
885
886 Class that simulates a dictionary. The instance's contents are kept in a
887 regular dictionary, which is accessible via the :attr:`data` attribute of
888 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
889 initialized with its contents; note that a reference to *initialdata* will not
890 be kept, allowing it be used for other purposes.
891
Benjamin Petersond319ad52010-07-18 14:27:02 +0000892 In addition to supporting the methods and operations of mappings,
893 :class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000894
Benjamin Petersond319ad52010-07-18 14:27:02 +0000895 .. attribute:: data
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000896
Benjamin Petersond319ad52010-07-18 14:27:02 +0000897 A real dictionary used to store the contents of the :class:`UserDict`
898 class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000899
900
901
902:class:`UserList` objects
903-------------------------
904
905This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +0000906for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000907existing methods or add new ones. In this way, one can add new behaviors to
908lists.
909
Georg Brandl48310cd2009-01-03 21:18:54 +0000910The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000911subclass directly from :class:`list`; however, this class can be easier
912to work with because the underlying list is accessible as an attribute.
913
914.. class:: UserList([list])
915
916 Class that simulates a list. The instance's contents are kept in a regular
917 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
918 instances. The instance's contents are initially set to a copy of *list*,
919 defaulting to the empty list ``[]``. *list* can be any iterable, for
920 example a real Python list or a :class:`UserList` object.
921
Benjamin Petersond319ad52010-07-18 14:27:02 +0000922 In addition to supporting the methods and operations of mutable sequences,
923 :class:`UserList` instances provide the following attribute:
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000924
Benjamin Petersond319ad52010-07-18 14:27:02 +0000925 .. attribute:: data
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000926
Benjamin Petersond319ad52010-07-18 14:27:02 +0000927 A real :class:`list` object used to store the contents of the
928 :class:`UserList` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000929
930**Subclassing requirements:** Subclasses of :class:`UserList` are expect to
931offer a constructor which can be called with either no arguments or one
932argument. List operations which return a new sequence attempt to create an
933instance of the actual implementation class. To do so, it assumes that the
934constructor can be called with a single parameter, which is a sequence object
935used as a data source.
936
937If a derived class does not wish to comply with this requirement, all of the
938special methods supported by this class will need to be overridden; please
939consult the sources for information about the methods which need to be provided
940in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000941
942:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +0000943---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000944
Georg Brandl48310cd2009-01-03 21:18:54 +0000945The class, :class:`UserString` acts as a wrapper around string objects.
946The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000947subclass directly from :class:`str`; however, this class can be easier
948to work with because the underlying string is accessible as an
949attribute.
950
951.. class:: UserString([sequence])
952
953 Class that simulates a string or a Unicode string object. The instance's
Georg Brandl48310cd2009-01-03 21:18:54 +0000954 content is kept in a regular string object, which is accessible via the
955 :attr:`data` attribute of :class:`UserString` instances. The instance's
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000956 contents are initially set to a copy of *sequence*. The *sequence* can
957 be an instance of :class:`bytes`, :class:`str`, :class:`UserString` (or a
958 subclass) or an arbitrary sequence which can be converted into a string using
959 the built-in :func:`str` function.
Raymond Hettingera6b76ba2010-08-08 00:29:08 +0000960
Raymond Hettingerdcb44912010-11-30 17:45:41 +0000961.. _abstract-base-classes:
Raymond Hettingera6b76ba2010-08-08 00:29:08 +0000962
963ABCs - abstract base classes
964----------------------------
965
966The collections module offers the following ABCs:
967
968========================= ===================== ====================== ====================================================
969ABC Inherits Abstract Methods Mixin Methods
970========================= ===================== ====================== ====================================================
971:class:`Container` ``__contains__``
972:class:`Hashable` ``__hash__``
973:class:`Iterable` ``__iter__``
974:class:`Iterator` :class:`Iterable` ``__next__`` ``__iter__``
975:class:`Sized` ``__len__``
976:class:`Callable` ``__call__``
977
978:class:`Sequence` :class:`Sized`, ``__getitem__`` ``__contains__``. ``__iter__``, ``__reversed__``.
979 :class:`Iterable`, ``index``, and ``count``
980 :class:`Container`
981
982:class:`MutableSequence` :class:`Sequence` ``__setitem__`` Inherited Sequence methods and
983 ``__delitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
984 and ``insert`` ``remove``, and ``__iadd__``
985
986:class:`Set` :class:`Sized`, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
987 :class:`Iterable`, ``__gt__``, ``__ge__``, ``__and__``, ``__or__``
988 :class:`Container` ``__sub__``, ``__xor__``, and ``isdisjoint``
989
990:class:`MutableSet` :class:`Set` ``add`` and Inherited Set methods and
991 ``discard`` ``clear``, ``pop``, ``remove``, ``__ior__``,
992 ``__iand__``, ``__ixor__``, and ``__isub__``
993
994:class:`Mapping` :class:`Sized`, ``__getitem__`` ``__contains__``, ``keys``, ``items``, ``values``,
995 :class:`Iterable`, ``get``, ``__eq__``, and ``__ne__``
996 :class:`Container`
997
998:class:`MutableMapping` :class:`Mapping` ``__setitem__`` and Inherited Mapping methods and
999 ``__delitem__`` ``pop``, ``popitem``, ``clear``, ``update``,
1000 and ``setdefault``
1001
1002
1003:class:`MappingView` :class:`Sized` ``__len__``
1004:class:`KeysView` :class:`MappingView`, ``__contains__``,
1005 :class:`Set` ``__iter__``
1006:class:`ItemsView` :class:`MappingView`, ``__contains__``,
1007 :class:`Set` ``__iter__``
1008:class:`ValuesView` :class:`MappingView` ``__contains__``, ``__iter__``
1009========================= ===================== ====================== ====================================================
1010
1011These ABCs allow us to ask classes or instances if they provide
1012particular functionality, for example::
1013
1014 size = None
1015 if isinstance(myvar, collections.Sized):
1016 size = len(myvar)
1017
1018Several of the ABCs are also useful as mixins that make it easier to develop
1019classes supporting container APIs. For example, to write a class supporting
1020the full :class:`Set` API, it only necessary to supply the three underlying
1021abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
1022The ABC supplies the remaining methods such as :meth:`__and__` and
1023:meth:`isdisjoint` ::
1024
1025 class ListBasedSet(collections.Set):
1026 ''' Alternate set implementation favoring space over speed
1027 and not requiring the set elements to be hashable. '''
1028 def __init__(self, iterable):
1029 self.elements = lst = []
1030 for value in iterable:
1031 if value not in lst:
1032 lst.append(value)
1033 def __iter__(self):
1034 return iter(self.elements)
1035 def __contains__(self, value):
1036 return value in self.elements
1037 def __len__(self):
1038 return len(self.elements)
1039
1040 s1 = ListBasedSet('abcdef')
1041 s2 = ListBasedSet('defghi')
1042 overlap = s1 & s2 # The __and__() method is supported automatically
1043
1044Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
1045
1046(1)
1047 Since some set operations create new sets, the default mixin methods need
1048 a way to create new instances from an iterable. The class constructor is
1049 assumed to have a signature in the form ``ClassName(iterable)``.
1050 That assumption is factored-out to an internal classmethod called
1051 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
1052 If the :class:`Set` mixin is being used in a class with a different
1053 constructor signature, you will need to override :meth:`from_iterable`
1054 with a classmethod that can construct new instances from
1055 an iterable argument.
1056
1057(2)
1058 To override the comparisons (presumably for speed, as the
1059 semantics are fixed), redefine :meth:`__le__` and
1060 then the other operations will automatically follow suit.
1061
1062(3)
1063 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
1064 for the set; however, :meth:`__hash__` is not defined because not all sets
1065 are hashable or immutable. To add set hashabilty using mixins,
1066 inherit from both :meth:`Set` and :meth:`Hashable`, then define
1067 ``__hash__ = Set._hash``.
1068
1069.. seealso::
1070
Éric Araujo6e6cb8e2010-11-16 19:13:50 +00001071 * Latest version of the :source:`Python source code for the collections
1072 abstract base classes <Lib/_abcoll.py>`
1073
Raymond Hettingera6b76ba2010-08-08 00:29:08 +00001074 * `OrderedSet recipe <http://code.activestate.com/recipes/576694/>`_ for an
1075 example built on :class:`MutableSet`.
1076
1077 * For more about ABCs, see the :mod:`abc` module and :pep:`3119`.