blob: 68a79f12d0979f162ee2a4f2e8a166c216cd6af1 [file] [log] [blame]
Raymond Hettingere5820c62011-03-22 09:11:39 -07001
Raymond Hettinger53dbe392008-02-12 20:03:09 +00002:mod:`collections` --- Container datatypes
3==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00004
5.. module:: collections
Raymond Hettinger53dbe392008-02-12 20:03:09 +00006 :synopsis: Container datatypes
Georg Brandl116aa622007-08-15 14:28:22 +00007.. moduleauthor:: Raymond Hettinger <python@rcn.com>
8.. sectionauthor:: Raymond Hettinger <python@rcn.com>
9
Christian Heimesfe337bf2008-03-23 21:54:12 +000010.. testsetup:: *
11
12 from collections import *
13 import itertools
14 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000015
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +000016This module implements specialized container datatypes providing alternatives to
17Python's general purpose built-in containers, :class:`dict`, :class:`list`,
18:class:`set`, and :class:`tuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000019
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +000020===================== ====================================================================
21:func:`namedtuple` factory function for creating tuple subclasses with named fields
22:class:`deque` list-like container with fast appends and pops on either end
23:class:`Counter` dict subclass for counting hashable objects
24:class:`OrderedDict` dict subclass that remembers the order entries were added
25:class:`defaultdict` dict subclass that calls a factory function to supply missing values
26:class:`UserDict` wrapper around dictionary objects for easier dict subclassing
27:class:`UserList` wrapper around list objects for easier list subclassing
28:class:`UserString` wrapper around string objects for easier string subclassing
29===================== ====================================================================
Georg Brandl116aa622007-08-15 14:28:22 +000030
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +000031In addition to the concrete container classes, the collections module provides
32ABCs (abstract base classes) that can be used to test whether a class provides a
33particular interface, for example, whether it is hashable or a mapping.
Mark Summerfield08898b42007-09-05 08:43:04 +000034
35
Raymond Hettingerb8baf632009-01-14 02:20:07 +000036:class:`Counter` objects
37------------------------
38
39A counter tool is provided to support convenient and rapid tallies.
40For example::
41
Raymond Hettinger1c62dc92009-02-04 11:41:45 +000042 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +000043 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +000044 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000045 ... cnt[word] += 1
46 >>> cnt
47 Counter({'blue': 3, 'red': 2, 'green': 1})
48
Raymond Hettinger1c62dc92009-02-04 11:41:45 +000049 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +000050 >>> import re
51 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +000052 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +000053 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
54 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
55
56.. class:: Counter([iterable-or-mapping])
57
Raymond Hettinger670eaec2009-01-21 23:14:07 +000058 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +000059 It is an unordered collection where elements are stored as dictionary keys
60 and their counts are stored as dictionary values. Counts are allowed to be
61 any integer value including zero or negative counts. The :class:`Counter`
62 class is similar to bags or multisets in other languages.
63
64 Elements are counted from an *iterable* or initialized from another
Benjamin Peterson25c95f12009-05-08 20:42:26 +000065 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +000066
Raymond Hettinger73662a52009-01-27 02:38:22 +000067 >>> c = Counter() # a new, empty counter
68 >>> c = Counter('gallahad') # a new counter from an iterable
69 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
70 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +000071
Raymond Hettinger670eaec2009-01-21 23:14:07 +000072 Counter objects have a dictionary interface except that they return a zero
Benjamin Peterson25c95f12009-05-08 20:42:26 +000073 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000074
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000075 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +000076 >>> c['bacon'] # count of a missing element is zero
77 0
78
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000079 Setting a count to zero does not remove an element from a counter.
80 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +000081
Raymond Hettinger94adc8e2009-01-22 05:27:37 +000082 >>> c['sausage'] = 0 # counter entry with a zero count
83 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +000084
Benjamin Petersond45bf582009-03-02 21:44:54 +000085 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +000086
87
88 Counter objects support two methods beyond those available for all
89 dictionaries:
90
91 .. method:: elements()
92
Raymond Hettinger670eaec2009-01-21 23:14:07 +000093 Return an iterator over elements repeating each as many times as its
94 count. Elements are returned in arbitrary order. If an element's count
95 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +000096
Raymond Hettinger0bae6622009-01-20 13:00:59 +000097 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerb8baf632009-01-14 02:20:07 +000098 >>> list(c.elements())
99 ['a', 'a', 'a', 'a', 'b', 'b']
100
101 .. method:: most_common([n])
102
Raymond Hettinger73662a52009-01-27 02:38:22 +0000103 Return a list of the *n* most common elements and their counts from the
Raymond Hettingerd04fa312009-02-04 19:45:13 +0000104 most common to the least. If *n* is not specified, :func:`most_common`
Raymond Hettinger73662a52009-01-27 02:38:22 +0000105 returns *all* elements in the counter. Elements with equal counts are
Benjamin Peterson25c95f12009-05-08 20:42:26 +0000106 ordered arbitrarily:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000107
108 >>> Counter('abracadabra').most_common(3)
109 [('a', 5), ('r', 2), ('b', 2)]
110
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000111 The usual dictionary methods are available for :class:`Counter` objects
112 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000113
114 .. method:: fromkeys(iterable)
115
Raymond Hettinger73662a52009-01-27 02:38:22 +0000116 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000117
118 .. method:: update([iterable-or-mapping])
119
Raymond Hettinger73662a52009-01-27 02:38:22 +0000120 Elements are counted from an *iterable* or added-in from another
121 *mapping* (or counter). Like :meth:`dict.update` but adds counts
122 instead of replacing them. Also, the *iterable* is expected to be a
123 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000124
125Common patterns for working with :class:`Counter` objects::
126
Raymond Hettinger73662a52009-01-27 02:38:22 +0000127 sum(c.values()) # total of all counts
128 c.clear() # reset all counts
129 list(c) # list unique elements
130 set(c) # convert to a set
131 dict(c) # convert to a regular dictionary
132 c.items() # convert to a list of (elem, cnt) pairs
133 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
134 c.most_common()[:-n:-1] # n least common elements
135 c += Counter() # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000136
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000137Several mathematical operations are provided for combining :class:`Counter`
138objects to produce multisets (counters that have counts greater than zero).
139Addition and subtraction combine counters by adding or subtracting the counts
140of corresponding elements. Intersection and union return the minimum and
141maximum of corresponding counts. Each operation can accept inputs with signed
142counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000143
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000144 >>> c = Counter(a=3, b=1)
145 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000146 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000147 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000148 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000149 Counter({'a': 2})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000150 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000151 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000152 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000153 Counter({'a': 3, 'b': 2})
154
Raymond Hettinger4c4d3b12010-04-12 21:47:14 +0000155.. note::
156
157 Counters were primarily designed to work with positive integers to represent
158 running counts; however, care was taken to not unnecessarily preclude use
159 cases needing other types or negative values. To help with those use cases,
160 this section documents the minimum range and type restrictions.
161
162 * The :class:`Counter` class itself is a dictionary subclass with no
163 restrictions on its keys and values. The values are intended to be numbers
164 representing counts, but you *could* store anything in the value field.
165
166 * The :meth:`most_common` method requires only that the values be orderable.
167
168 * For in-place operations such as ``c[key] += 1``, the value type need only
169 support addition and subtraction. So fractions, floats, and decimals would
170 work and negative values are supported. The same is also true for
171 :meth:`update` and :meth:`subtract` which allow negative and zero values
172 for both inputs and outputs.
173
174 * The multiset methods are designed only for use cases with positive values.
175 The inputs may be negative or zero, but only outputs with positive values
176 are created. There are no type restrictions, but the value type needs to
177 support support addition, subtraction, and comparison.
178
179 * The :meth:`elements` method requires integer counts. It ignores zero and
180 negative counts.
181
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000182.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000183
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000184 * `Counter class <http://code.activestate.com/recipes/576611/>`_
185 adapted for Python 2.5 and an early `Bag recipe
186 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
187
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000188 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
189 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000190
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000191 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_\.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000192
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000193 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000194 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000195
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000196 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000197 *Knuth, Donald. The Art of Computer Programming Volume II,
198 Section 4.6.3, Exercise 19*\.
199
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000200 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerd07d9392009-01-27 04:20:44 +0000201 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000202
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000203 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000204
205
Georg Brandl116aa622007-08-15 14:28:22 +0000206:class:`deque` objects
207----------------------
208
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000209.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000210
211 Returns a new deque object initialized left-to-right (using :meth:`append`) with
212 data from *iterable*. If *iterable* is not specified, the new deque is empty.
213
214 Deques are a generalization of stacks and queues (the name is pronounced "deck"
215 and is short for "double-ended queue"). Deques support thread-safe, memory
216 efficient appends and pops from either side of the deque with approximately the
217 same O(1) performance in either direction.
218
219 Though :class:`list` objects support similar operations, they are optimized for
220 fast fixed-length operations and incur O(n) memory movement costs for
221 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
222 position of the underlying data representation.
223
Georg Brandl116aa622007-08-15 14:28:22 +0000224
Georg Brandl9afde1c2007-11-01 20:32:30 +0000225 If *maxlen* is not specified or is *None*, deques may grow to an
226 arbitrary length. Otherwise, the deque is bounded to the specified maximum
227 length. Once a bounded length deque is full, when new items are added, a
228 corresponding number of items are discarded from the opposite end. Bounded
229 length deques provide functionality similar to the ``tail`` filter in
230 Unix. They are also useful for tracking transactions and other pools of data
231 where only the most recent activity is of interest.
232
Georg Brandl9afde1c2007-11-01 20:32:30 +0000233
Benjamin Petersone41251e2008-04-25 01:59:09 +0000234 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000235
Benjamin Petersone41251e2008-04-25 01:59:09 +0000236 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000237
Benjamin Petersone41251e2008-04-25 01:59:09 +0000238 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000239
240
Benjamin Petersone41251e2008-04-25 01:59:09 +0000241 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000242
Benjamin Petersone41251e2008-04-25 01:59:09 +0000243 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000244
245
Benjamin Petersone41251e2008-04-25 01:59:09 +0000246 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000247
Benjamin Petersone41251e2008-04-25 01:59:09 +0000248 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000249
250
Benjamin Petersone41251e2008-04-25 01:59:09 +0000251 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000252
Benjamin Petersone41251e2008-04-25 01:59:09 +0000253 Extend the right side of the deque by appending elements from the iterable
254 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000255
256
Benjamin Petersone41251e2008-04-25 01:59:09 +0000257 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000258
Benjamin Petersone41251e2008-04-25 01:59:09 +0000259 Extend the left side of the deque by appending elements from *iterable*.
260 Note, the series of left appends results in reversing the order of
261 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000262
263
Benjamin Petersone41251e2008-04-25 01:59:09 +0000264 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000265
Benjamin Petersone41251e2008-04-25 01:59:09 +0000266 Remove and return an element from the right side of the deque. If no
267 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000268
269
Benjamin Petersone41251e2008-04-25 01:59:09 +0000270 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000271
Benjamin Petersone41251e2008-04-25 01:59:09 +0000272 Remove and return an element from the left side of the deque. If no
273 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000274
275
Benjamin Petersone41251e2008-04-25 01:59:09 +0000276 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000277
Benjamin Petersone41251e2008-04-25 01:59:09 +0000278 Removed the first occurrence of *value*. If not found, raises a
279 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000280
Georg Brandl116aa622007-08-15 14:28:22 +0000281
Benjamin Petersone41251e2008-04-25 01:59:09 +0000282 .. method:: rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000283
Benjamin Petersone41251e2008-04-25 01:59:09 +0000284 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
285 the left. Rotating one step to the right is equivalent to:
286 ``d.appendleft(d.pop())``.
287
Georg Brandl116aa622007-08-15 14:28:22 +0000288
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000289 Deque objects also provide one read-only attribute:
290
291 .. attribute:: maxlen
292
293 Maximum size of a deque or *None* if unbounded.
294
Raymond Hettinger150fb9c2009-03-10 22:48:06 +0000295 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000296
297
Georg Brandl116aa622007-08-15 14:28:22 +0000298In addition to the above, deques support iteration, pickling, ``len(d)``,
299``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson206e3072008-10-19 14:07:49 +0000300the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
301access is O(1) at both ends but slows to O(n) in the middle. For fast random
302access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000303
Christian Heimesfe337bf2008-03-23 21:54:12 +0000304Example:
305
306.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000307
308 >>> from collections import deque
309 >>> d = deque('ghi') # make a new deque with three items
310 >>> for elem in d: # iterate over the deque's elements
Neal Norwitz752abd02008-05-13 04:55:24 +0000311 ... print(elem.upper())
Georg Brandl116aa622007-08-15 14:28:22 +0000312 G
313 H
314 I
315
316 >>> d.append('j') # add a new entry to the right side
317 >>> d.appendleft('f') # add a new entry to the left side
318 >>> d # show the representation of the deque
319 deque(['f', 'g', 'h', 'i', 'j'])
320
321 >>> d.pop() # return and remove the rightmost item
322 'j'
323 >>> d.popleft() # return and remove the leftmost item
324 'f'
325 >>> list(d) # list the contents of the deque
326 ['g', 'h', 'i']
327 >>> d[0] # peek at leftmost item
328 'g'
329 >>> d[-1] # peek at rightmost item
330 'i'
331
332 >>> list(reversed(d)) # list the contents of a deque in reverse
333 ['i', 'h', 'g']
334 >>> 'h' in d # search the deque
335 True
336 >>> d.extend('jkl') # add multiple elements at once
337 >>> d
338 deque(['g', 'h', 'i', 'j', 'k', 'l'])
339 >>> d.rotate(1) # right rotation
340 >>> d
341 deque(['l', 'g', 'h', 'i', 'j', 'k'])
342 >>> d.rotate(-1) # left rotation
343 >>> d
344 deque(['g', 'h', 'i', 'j', 'k', 'l'])
345
346 >>> deque(reversed(d)) # make a new deque in reverse order
347 deque(['l', 'k', 'j', 'i', 'h', 'g'])
348 >>> d.clear() # empty the deque
349 >>> d.pop() # cannot pop from an empty deque
350 Traceback (most recent call last):
351 File "<pyshell#6>", line 1, in -toplevel-
352 d.pop()
353 IndexError: pop from an empty deque
354
355 >>> d.extendleft('abc') # extendleft() reverses the input order
356 >>> d
357 deque(['c', 'b', 'a'])
358
359
Georg Brandl9afde1c2007-11-01 20:32:30 +0000360:class:`deque` Recipes
361^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000362
363This section shows various approaches to working with deques.
364
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000365Bounded length deques provide functionality similar to the ``tail`` filter
366in Unix::
367
368 def tail(filename, n=10):
369 'Return the last n lines of a file'
370 return deque(open(filename), n)
371
372Another approach to using deques is to maintain a sequence of recently
373added elements by appending to the right and popping to the left::
374
375 def moving_average(iterable, n=3):
376 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
377 # http://en.wikipedia.org/wiki/Moving_average
378 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000379 d = deque(itertools.islice(it, n-1))
380 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000381 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000382 for elem in it:
383 s += elem - d.popleft()
384 d.append(elem)
385 yield s / n
386
Georg Brandl116aa622007-08-15 14:28:22 +0000387The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti890c1932009-12-19 23:33:46 +0000388deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl116aa622007-08-15 14:28:22 +0000389the :meth:`rotate` method to position elements to be popped::
390
391 def delete_nth(d, n):
392 d.rotate(-n)
393 d.popleft()
394 d.rotate(n)
395
396To implement :class:`deque` slicing, use a similar approach applying
397:meth:`rotate` to bring a target element to the left side of the deque. Remove
398old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
399reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000400With minor variations on that approach, it is easy to implement Forth style
401stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
402``rot``, and ``roll``.
403
Georg Brandl116aa622007-08-15 14:28:22 +0000404
405:class:`defaultdict` objects
406----------------------------
407
Georg Brandl116aa622007-08-15 14:28:22 +0000408.. class:: defaultdict([default_factory[, ...]])
409
410 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
Georg Brandlc5605df2009-08-13 08:26:44 +0000411 built-in :class:`dict` class. It overrides one method and adds one writable
Georg Brandl116aa622007-08-15 14:28:22 +0000412 instance variable. The remaining functionality is the same as for the
413 :class:`dict` class and is not documented here.
414
415 The first argument provides the initial value for the :attr:`default_factory`
416 attribute; it defaults to ``None``. All remaining arguments are treated the same
417 as if they were passed to the :class:`dict` constructor, including keyword
418 arguments.
419
Georg Brandl116aa622007-08-15 14:28:22 +0000420
Benjamin Petersone41251e2008-04-25 01:59:09 +0000421 :class:`defaultdict` objects support the following method in addition to the
422 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000423
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000424 .. method:: __missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000425
Benjamin Peterson5478b472008-09-17 22:25:09 +0000426 If the :attr:`default_factory` attribute is ``None``, this raises a
Benjamin Petersone41251e2008-04-25 01:59:09 +0000427 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000428
Benjamin Petersone41251e2008-04-25 01:59:09 +0000429 If :attr:`default_factory` is not ``None``, it is called without arguments
430 to provide a default value for the given *key*, this value is inserted in
431 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000432
Benjamin Petersone41251e2008-04-25 01:59:09 +0000433 If calling :attr:`default_factory` raises an exception this exception is
434 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000435
Benjamin Petersone41251e2008-04-25 01:59:09 +0000436 This method is called by the :meth:`__getitem__` method of the
437 :class:`dict` class when the requested key is not found; whatever it
438 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000439
440
Benjamin Petersone41251e2008-04-25 01:59:09 +0000441 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000442
Benjamin Petersone41251e2008-04-25 01:59:09 +0000443
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000444 .. attribute:: default_factory
Benjamin Petersone41251e2008-04-25 01:59:09 +0000445
446 This attribute is used by the :meth:`__missing__` method; it is
447 initialized from the first argument to the constructor, if present, or to
448 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000449
450
Georg Brandl116aa622007-08-15 14:28:22 +0000451:class:`defaultdict` Examples
452^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
453
454Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000455sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000456
457 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
458 >>> d = defaultdict(list)
459 >>> for k, v in s:
460 ... d[k].append(v)
461 ...
Ezio Melotti306afd32009-09-12 19:50:05 +0000462 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000463 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
464
465When each key is encountered for the first time, it is not already in the
466mapping; so an entry is automatically created using the :attr:`default_factory`
467function which returns an empty :class:`list`. The :meth:`list.append`
468operation then attaches the value to the new list. When keys are encountered
469again, the look-up proceeds normally (returning the list for that key) and the
470:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000471simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000472
473 >>> d = {}
474 >>> for k, v in s:
475 ... d.setdefault(k, []).append(v)
476 ...
Ezio Melotti306afd32009-09-12 19:50:05 +0000477 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000478 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
479
480Setting the :attr:`default_factory` to :class:`int` makes the
481:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000482languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000483
484 >>> s = 'mississippi'
485 >>> d = defaultdict(int)
486 >>> for k in s:
487 ... d[k] += 1
488 ...
Ezio Melotti306afd32009-09-12 19:50:05 +0000489 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000490 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
491
492When a letter is first encountered, it is missing from the mapping, so the
493:attr:`default_factory` function calls :func:`int` to supply a default count of
494zero. The increment operation then builds up the count for each letter.
495
496The function :func:`int` which always returns zero is just a special case of
497constant functions. A faster and more flexible way to create constant functions
498is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000499zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000500
501 >>> def constant_factory(value):
502 ... return lambda: value
503 >>> d = defaultdict(constant_factory('<missing>'))
504 >>> d.update(name='John', action='ran')
505 >>> '%(name)s %(action)s to %(object)s' % d
506 'John ran to <missing>'
507
508Setting the :attr:`default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000509:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000510
511 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
512 >>> d = defaultdict(set)
513 >>> for k, v in s:
514 ... d[k].add(v)
515 ...
Ezio Melotti306afd32009-09-12 19:50:05 +0000516 >>> list(d.items())
Georg Brandl116aa622007-08-15 14:28:22 +0000517 [('blue', set([2, 4])), ('red', set([1, 3]))]
518
519
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000520:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000521----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000522
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000523Named tuples assign meaning to each position in a tuple and allow for more readable,
524self-documenting code. They can be used wherever regular tuples are used, and
525they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000526
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000527.. function:: namedtuple(typename, field_names, verbose=False, rename=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000528
529 Returns a new tuple subclass named *typename*. The new subclass is used to
Christian Heimesc3f30c42008-02-22 16:37:40 +0000530 create tuple-like objects that have fields accessible by attribute lookup as
Georg Brandl116aa622007-08-15 14:28:22 +0000531 well as being indexable and iterable. Instances of the subclass also have a
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000532 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
Georg Brandl116aa622007-08-15 14:28:22 +0000533 method which lists the tuple contents in a ``name=value`` format.
534
Benjamin Peterson4469d0c2008-11-30 22:46:23 +0000535 The *field_names* are a single string with each fieldname separated by whitespace
536 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
Christian Heimes25bb7832008-01-11 16:17:00 +0000537 can be a sequence of strings such as ``['x', 'y']``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000538
539 Any valid Python identifier may be used for a fieldname except for names
Christian Heimes0449f632007-12-15 01:27:15 +0000540 starting with an underscore. Valid identifiers consist of letters, digits,
541 and underscores but do not start with a digit or underscore and cannot be
Georg Brandlf6945182008-02-01 11:56:49 +0000542 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
Georg Brandl9afde1c2007-11-01 20:32:30 +0000543 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000544
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000545 If *rename* is true, invalid fieldnames are automatically replaced
546 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
Raymond Hettinger85737b82009-04-02 22:37:59 +0000547 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000548 ``def`` and the duplicate fieldname ``abc``.
549
Christian Heimes25bb7832008-01-11 16:17:00 +0000550 If *verbose* is true, the class definition is printed just before being built.
Georg Brandl116aa622007-08-15 14:28:22 +0000551
Georg Brandl9afde1c2007-11-01 20:32:30 +0000552 Named tuple instances do not have per-instance dictionaries, so they are
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000553 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000554
Raymond Hettingerb62ad242009-03-02 22:16:43 +0000555 .. versionchanged:: 3.1
Georg Brandl23b4f922010-10-06 08:43:56 +0000556 Added support for *rename*.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000557
Christian Heimesfe337bf2008-03-23 21:54:12 +0000558Example:
559
560.. doctest::
561 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000562
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000563 >>> Point = namedtuple('Point', 'x y', verbose=True)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000564 class Point(tuple):
565 'Point(x, y)'
Christian Heimesfe337bf2008-03-23 21:54:12 +0000566 <BLANKLINE>
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000567 __slots__ = ()
Christian Heimesfe337bf2008-03-23 21:54:12 +0000568 <BLANKLINE>
Christian Heimesfaf2f632008-01-06 16:59:19 +0000569 _fields = ('x', 'y')
Christian Heimesfe337bf2008-03-23 21:54:12 +0000570 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000571 def __new__(_cls, x, y):
572 return _tuple.__new__(_cls, (x, y))
Christian Heimesfe337bf2008-03-23 21:54:12 +0000573 <BLANKLINE>
Christian Heimesfaf2f632008-01-06 16:59:19 +0000574 @classmethod
Christian Heimesfe337bf2008-03-23 21:54:12 +0000575 def _make(cls, iterable, new=tuple.__new__, len=len):
Christian Heimesfaf2f632008-01-06 16:59:19 +0000576 'Make a new Point object from a sequence or iterable'
Christian Heimesfe337bf2008-03-23 21:54:12 +0000577 result = new(cls, iterable)
Christian Heimesfaf2f632008-01-06 16:59:19 +0000578 if len(result) != 2:
579 raise TypeError('Expected 2 arguments, got %d' % len(result))
580 return result
Christian Heimesfe337bf2008-03-23 21:54:12 +0000581 <BLANKLINE>
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000582 def __repr__(self):
583 return 'Point(x=%r, y=%r)' % self
Christian Heimesfe337bf2008-03-23 21:54:12 +0000584 <BLANKLINE>
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000585 def _asdict(self):
586 'Return a new OrderedDict which maps field names to their values'
587 return OrderedDict(zip(self._fields, self))
Christian Heimesfe337bf2008-03-23 21:54:12 +0000588 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000589 def _replace(_self, **kwds):
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000590 'Return a new Point object replacing specified fields with new values'
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000591 result = _self._make(map(kwds.pop, ('x', 'y'), _self))
Christian Heimesfaf2f632008-01-06 16:59:19 +0000592 if kwds:
Ezio Melotti2befd9a2009-09-13 08:08:32 +0000593 raise ValueError('Got unexpected field names: %r' % list(kwds.keys()))
Christian Heimesfaf2f632008-01-06 16:59:19 +0000594 return result
Georg Brandl48310cd2009-01-03 21:18:54 +0000595 <BLANKLINE>
596 def __getnewargs__(self):
Benjamin Peterson41181742008-07-02 20:22:54 +0000597 return tuple(self)
Christian Heimesfe337bf2008-03-23 21:54:12 +0000598 <BLANKLINE>
Raymond Hettinger089ba7f2009-05-27 00:38:24 +0000599 x = _property(_itemgetter(0))
600 y = _property(_itemgetter(1))
Georg Brandl116aa622007-08-15 14:28:22 +0000601
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000602 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Christian Heimes99170a52007-12-19 02:07:34 +0000603 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000604 33
605 >>> x, y = p # unpack like a regular tuple
606 >>> x, y
607 (11, 22)
Christian Heimesc3f30c42008-02-22 16:37:40 +0000608 >>> p.x + p.y # fields also accessible by name
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000609 33
610 >>> p # readable __repr__ with a name=value style
611 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000612
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000613Named tuples are especially useful for assigning field names to result tuples returned
614by the :mod:`csv` or :mod:`sqlite3` modules::
615
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000616 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000617
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000618 import csv
Christian Heimesfaf2f632008-01-06 16:59:19 +0000619 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000620 print(emp.name, emp.title)
621
Georg Brandl9afde1c2007-11-01 20:32:30 +0000622 import sqlite3
623 conn = sqlite3.connect('/companydata')
624 cursor = conn.cursor()
625 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Christian Heimesfaf2f632008-01-06 16:59:19 +0000626 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Christian Heimes00412232008-01-10 16:02:19 +0000627 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000628
Christian Heimes99170a52007-12-19 02:07:34 +0000629In addition to the methods inherited from tuples, named tuples support
Christian Heimes2380ac72008-01-09 00:17:24 +0000630three additional methods and one attribute. To prevent conflicts with
631field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000632
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000633.. classmethod:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000634
Christian Heimesfaf2f632008-01-06 16:59:19 +0000635 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000636
Christian Heimesfe337bf2008-03-23 21:54:12 +0000637.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000638
Christian Heimesfaf2f632008-01-06 16:59:19 +0000639 >>> t = [11, 22]
640 >>> Point._make(t)
641 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000642
Christian Heimes790c8232008-01-07 21:14:23 +0000643.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000644
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000645 Return a new :class:`OrderedDict` which maps field names to their corresponding
646 values::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000647
Christian Heimes0449f632007-12-15 01:27:15 +0000648 >>> p._asdict()
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000649 OrderedDict([('x', 11), ('y', 22)])
650
Raymond Hettingera88e4da2009-03-03 05:12:27 +0000651 .. versionchanged:: 3.1
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000652 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000653
Christian Heimes790c8232008-01-07 21:14:23 +0000654.. method:: somenamedtuple._replace(kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000655
Christian Heimesfe337bf2008-03-23 21:54:12 +0000656 Return a new instance of the named tuple replacing specified fields with new
657 values:
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000658
659::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000660
661 >>> p = Point(x=11, y=22)
Christian Heimes0449f632007-12-15 01:27:15 +0000662 >>> p._replace(x=33)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000663 Point(x=33, y=22)
664
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000665 >>> for partnum, record in inventory.items():
Christian Heimes454f37b2008-01-10 00:10:02 +0000666 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000667
Christian Heimes790c8232008-01-07 21:14:23 +0000668.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000669
Christian Heimes2380ac72008-01-09 00:17:24 +0000670 Tuple of strings listing the field names. Useful for introspection
Georg Brandl9afde1c2007-11-01 20:32:30 +0000671 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000672
Christian Heimesfe337bf2008-03-23 21:54:12 +0000673.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000674
Christian Heimes0449f632007-12-15 01:27:15 +0000675 >>> p._fields # view the field names
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000676 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000677
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000678 >>> Color = namedtuple('Color', 'red green blue')
Christian Heimes0449f632007-12-15 01:27:15 +0000679 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000680 >>> Pixel(11, 22, 128, 255, 0)
Christian Heimes454f37b2008-01-10 00:10:02 +0000681 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000682
Christian Heimes0449f632007-12-15 01:27:15 +0000683To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000684function:
Christian Heimes0449f632007-12-15 01:27:15 +0000685
686 >>> getattr(p, 'x')
687 11
688
Raymond Hettinger651453a2009-02-11 00:20:02 +0000689To convert a dictionary to a named tuple, use the double-star-operator
690(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000691
692 >>> d = {'x': 11, 'y': 22}
693 >>> Point(**d)
694 Point(x=11, y=22)
695
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000696Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000697functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000698a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000699
Christian Heimes043d6f62008-01-07 17:19:16 +0000700 >>> class Point(namedtuple('Point', 'x y')):
Christian Heimes25bb7832008-01-11 16:17:00 +0000701 ... __slots__ = ()
Christian Heimes454f37b2008-01-10 00:10:02 +0000702 ... @property
703 ... def hypot(self):
704 ... return (self.x ** 2 + self.y ** 2) ** 0.5
705 ... def __str__(self):
Christian Heimes25bb7832008-01-11 16:17:00 +0000706 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000707
Georg Brandl0df79792008-10-04 18:33:26 +0000708 >>> for p in Point(3, 4), Point(14, 5/7):
Christian Heimes00412232008-01-10 16:02:19 +0000709 ... print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000710 Point: x= 3.000 y= 4.000 hypot= 5.000
711 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000712
Georg Brandl8ffe0bc2010-10-06 07:17:29 +0000713The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +0000714keep memory requirements low by preventing the creation of instance dictionaries.
715
Christian Heimes2380ac72008-01-09 00:17:24 +0000716
717Subclassing is not useful for adding new, stored fields. Instead, simply
Christian Heimesfe337bf2008-03-23 21:54:12 +0000718create a new named tuple type from the :attr:`_fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +0000719
Christian Heimes25bb7832008-01-11 16:17:00 +0000720 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +0000721
722Default values can be implemented by using :meth:`_replace` to
Christian Heimesfe337bf2008-03-23 21:54:12 +0000723customize a prototype instance:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000724
725 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Christian Heimes587c2bf2008-01-19 16:21:02 +0000726 >>> default_account = Account('<owner name>', 0.0, 0)
727 >>> johns_account = default_account._replace(owner='John')
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000728
Christian Heimese4ca8152008-05-08 17:18:53 +0000729Enumerated constants can be implemented with named tuples, but it is simpler
730and more efficient to use a simple class declaration:
731
732 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
733 >>> Status.open, Status.pending, Status.closed
734 (0, 1, 2)
735 >>> class Status:
736 ... open, pending, closed = range(3)
737
Raymond Hettinger651453a2009-02-11 00:20:02 +0000738.. seealso::
Thomas Wouters47b49bf2007-08-30 22:15:33 +0000739
Raymond Hettinger651453a2009-02-11 00:20:02 +0000740 `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
741 adapted for Python 2.4.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000742
743
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000744:class:`OrderedDict` objects
745----------------------------
746
747Ordered dictionaries are just like regular dictionaries but they remember the
748order that items were inserted. When iterating over an ordered dictionary,
749the items are returned in the order their keys were first added.
750
751.. class:: OrderedDict([items])
752
753 Return an instance of a dict subclass, supporting the usual :class:`dict`
754 methods. An *OrderedDict* is a dict that remembers the order that keys
755 were first inserted. If a new entry overwrites an existing entry, the
756 original insertion position is left unchanged. Deleting an entry and
757 reinserting it will move it to the end.
758
Benjamin Petersond45bf582009-03-02 21:44:54 +0000759 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000760
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000761 .. method:: popitem(last=True)
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000762
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000763 The :meth:`popitem` method for ordered dictionaries returns and removes a
764 (key, value) pair. The pairs are returned in LIFO order if *last* is true
765 or FIFO order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000766
Raymond Hettingere9091502009-05-19 17:40:07 +0000767In addition to the usual mapping methods, ordered dictionaries also support
768reverse iteration using :func:`reversed`.
769
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000770Equality tests between :class:`OrderedDict` objects are order-sensitive
771and are implemented as ``list(od1.items())==list(od2.items())``.
772Equality tests between :class:`OrderedDict` objects and other
773:class:`Mapping` objects are order-insensitive like regular dictionaries.
774This allows :class:`OrderedDict` objects to be substituted anywhere a
775regular dictionary is used.
776
Raymond Hettinger36180782009-04-09 22:34:23 +0000777The :class:`OrderedDict` constructor and :meth:`update` method both accept
778keyword arguments, but their order is lost because Python's function call
779semantics pass-in keyword arguments using a regular unordered dictionary.
780
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000781.. seealso::
782
783 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
784 that runs on Python 2.4 or later.
785
Raymond Hettinger7bba6832011-04-15 17:43:19 -0700786:class:`OrderedDict` Examples and Recipes
787^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
788
Raymond Hettingerc529c2f2009-11-10 18:21:06 +0000789Since an ordered dictionary remembers its insertion order, it can be used
790in conjuction with sorting to make a sorted dictionary::
791
792 >>> # regular unsorted dictionary
793 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
794
795 >>> # dictionary sorted by key
796 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
797 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
798
799 >>> # dictionary sorted by value
800 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
801 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
802
803 >>> # dictionary sorted by length of the key string
804 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
805 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
806
807The new sorted dictionaries maintain their sort order when entries
808are deleted. But when new keys are added, the keys are appended
809to the end and the sort is not maintained.
810
Raymond Hettinger5a34b7f2010-07-31 10:16:57 +0000811It is also straight-forward to create an ordered dictionary variant
812that the remembers the order the keys were *last* inserted.
813If a new entry overwrites an existing entry, the
814original insertion position is changed and moved to the end::
815
816 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandld98934c2011-02-25 10:03:34 +0000817 'Store items in the order the keys were last added'
Raymond Hettinger7bba6832011-04-15 17:43:19 -0700818
Raymond Hettinger5a34b7f2010-07-31 10:16:57 +0000819 def __setitem__(self, key, value):
820 if key in self:
821 del self[key]
822 OrderedDict.__setitem__(self, key, value)
823
Raymond Hettinger7bba6832011-04-15 17:43:19 -0700824An ordered dictionary can combined with the :class:`Counter` class
825so that the counter remembers the order elements are first encountered::
826
827 class OrderedCounter(Counter, OrderedDict):
828 'Counter that remembers the order elements are first encountered'
829
830 def __init__(self, iterable=None, **kwds):
831 OrderedDict.__init__(self)
832 Counter.__init__(self, iterable, **kwds)
833
834 def __repr__(self):
835 return '%s(%r)' % (self.__class__.__name__, OrderedDict(self))
836
837 def __reduce__(self):
838 return self.__class__, (OrderedDict(self),)
839
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000840
841:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +0000842-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000843
Georg Brandl48310cd2009-01-03 21:18:54 +0000844The class, :class:`UserDict` acts as a wrapper around dictionary objects.
845The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000846subclass directly from :class:`dict`; however, this class can be easier
847to work with because the underlying dictionary is accessible as an
848attribute.
849
850.. class:: UserDict([initialdata])
851
852 Class that simulates a dictionary. The instance's contents are kept in a
853 regular dictionary, which is accessible via the :attr:`data` attribute of
854 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
855 initialized with its contents; note that a reference to *initialdata* will not
856 be kept, allowing it be used for other purposes.
857
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000858 In addition to supporting the methods and operations of mappings,
859 :class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000860
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000861 .. attribute:: data
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000862
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000863 A real dictionary used to store the contents of the :class:`UserDict`
864 class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000865
866
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000867:class:`UserList` objects
868-------------------------
869
870This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +0000871for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000872existing methods or add new ones. In this way, one can add new behaviors to
873lists.
874
Georg Brandl48310cd2009-01-03 21:18:54 +0000875The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000876subclass directly from :class:`list`; however, this class can be easier
877to work with because the underlying list is accessible as an attribute.
878
879.. class:: UserList([list])
880
881 Class that simulates a list. The instance's contents are kept in a regular
882 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
883 instances. The instance's contents are initially set to a copy of *list*,
884 defaulting to the empty list ``[]``. *list* can be any iterable, for
885 example a real Python list or a :class:`UserList` object.
886
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000887 In addition to supporting the methods and operations of mutable sequences,
888 :class:`UserList` instances provide the following attribute:
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000889
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000890 .. attribute:: data
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000891
Benjamin Peterson7a502de2010-07-18 14:28:26 +0000892 A real :class:`list` object used to store the contents of the
893 :class:`UserList` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +0000894
895**Subclassing requirements:** Subclasses of :class:`UserList` are expect to
896offer a constructor which can be called with either no arguments or one
897argument. List operations which return a new sequence attempt to create an
898instance of the actual implementation class. To do so, it assumes that the
899constructor can be called with a single parameter, which is a sequence object
900used as a data source.
901
902If a derived class does not wish to comply with this requirement, all of the
903special methods supported by this class will need to be overridden; please
904consult the sources for information about the methods which need to be provided
905in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000906
907:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +0000908---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000909
Georg Brandl48310cd2009-01-03 21:18:54 +0000910The class, :class:`UserString` acts as a wrapper around string objects.
911The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000912subclass directly from :class:`str`; however, this class can be easier
913to work with because the underlying string is accessible as an
914attribute.
915
916.. class:: UserString([sequence])
917
918 Class that simulates a string or a Unicode string object. The instance's
Georg Brandl48310cd2009-01-03 21:18:54 +0000919 content is kept in a regular string object, which is accessible via the
920 :attr:`data` attribute of :class:`UserString` instances. The instance's
Raymond Hettingerb3a65f82008-02-21 22:11:37 +0000921 contents are initially set to a copy of *sequence*. The *sequence* can
922 be an instance of :class:`bytes`, :class:`str`, :class:`UserString` (or a
923 subclass) or an arbitrary sequence which can be converted into a string using
924 the built-in :func:`str` function.
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000925
926
927ABCs - abstract base classes
928----------------------------
929
Ezio Melotti9b2e67c2011-03-28 13:50:41 +0300930The collections module offers the following :term:`ABCs <abstract base class>`:
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000931
932========================= ===================== ====================== ====================================================
Ezio Melotti9b2e67c2011-03-28 13:50:41 +0300933ABC Inherits from Abstract Methods Mixin Methods
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000934========================= ===================== ====================== ====================================================
935:class:`Container` ``__contains__``
936:class:`Hashable` ``__hash__``
937:class:`Iterable` ``__iter__``
938:class:`Iterator` :class:`Iterable` ``__next__`` ``__iter__``
939:class:`Sized` ``__len__``
940:class:`Callable` ``__call__``
941
Georg Brandl27743102011-02-25 10:18:11 +0000942:class:`Sequence` :class:`Sized`, ``__getitem__`` ``__contains__``. ``__iter__``, ``__reversed__``,
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000943 :class:`Iterable`, ``index``, and ``count``
944 :class:`Container`
945
Ezio Melotti9b2e67c2011-03-28 13:50:41 +0300946:class:`MutableSequence` :class:`Sequence` ``__setitem__``, Inherited :class:`Sequence` methods and
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000947 ``__delitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
Ezio Melotti9b2e67c2011-03-28 13:50:41 +0300948 ``insert`` ``remove``, and ``__iadd__``
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000949
950:class:`Set` :class:`Sized`, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
Georg Brandl27743102011-02-25 10:18:11 +0000951 :class:`Iterable`, ``__gt__``, ``__ge__``, ``__and__``, ``__or__``,
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000952 :class:`Container` ``__sub__``, ``__xor__``, and ``isdisjoint``
953
Ezio Melotti9b2e67c2011-03-28 13:50:41 +0300954:class:`MutableSet` :class:`Set` ``add``, Inherited :class:`Set` methods and
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000955 ``discard`` ``clear``, ``pop``, ``remove``, ``__ior__``,
956 ``__iand__``, ``__ixor__``, and ``__isub__``
957
958:class:`Mapping` :class:`Sized`, ``__getitem__`` ``__contains__``, ``keys``, ``items``, ``values``,
959 :class:`Iterable`, ``get``, ``__eq__``, and ``__ne__``
960 :class:`Container`
961
Ezio Melotti9b2e67c2011-03-28 13:50:41 +0300962:class:`MutableMapping` :class:`Mapping` ``__setitem__``, Inherited :class:`Mapping` methods and
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000963 ``__delitem__`` ``pop``, ``popitem``, ``clear``, ``update``,
964 and ``setdefault``
965
966
967:class:`MappingView` :class:`Sized` ``__len__``
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000968:class:`ItemsView` :class:`MappingView`, ``__contains__``,
969 :class:`Set` ``__iter__``
Ezio Melotti9b2e67c2011-03-28 13:50:41 +0300970:class:`KeysView` :class:`MappingView`, ``__contains__``,
971 :class:`Set` ``__iter__``
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +0000972:class:`ValuesView` :class:`MappingView` ``__contains__``, ``__iter__``
973========================= ===================== ====================== ====================================================
974
Ezio Melotti9b2e67c2011-03-28 13:50:41 +0300975
976.. class:: Container
977 Hashable
978 Sized
979 Callable
980
981 ABCs for classes that provide respectively the methods :meth:`__contains__`,
982 :meth:`__hash__`, :meth:`__len__`, and :meth:`__call__`.
983
984.. class:: Iterable
985
986 ABC for classes that provide the :meth:`__iter__` method.
987 See also the definition of :term:`iterable`.
988
989.. class:: Iterator
990
991 ABC for classes that provide the :meth:`__iter__` and :meth:`next` methods.
992 See also the definition of :term:`iterator`.
993
994.. class:: Sequence
995 MutableSequence
996
997 ABCs for read-only and mutable :term:`sequences <sequence>`.
998
999.. class:: Set
1000 MutableSet
1001
1002 ABCs for read-only and mutable sets.
1003
1004.. class:: Mapping
1005 MutableMapping
1006
1007 ABCs for read-only and mutable :term:`mappings <mapping>`.
1008
1009.. class:: MappingView
1010 ItemsView
1011 KeysView
1012 ValuesView
1013
1014 ABCs for mapping, items, keys, and values :term:`views <view>`.
1015
1016
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +00001017These ABCs allow us to ask classes or instances if they provide
1018particular functionality, for example::
1019
1020 size = None
1021 if isinstance(myvar, collections.Sized):
1022 size = len(myvar)
1023
1024Several of the ABCs are also useful as mixins that make it easier to develop
1025classes supporting container APIs. For example, to write a class supporting
1026the full :class:`Set` API, it only necessary to supply the three underlying
1027abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
1028The ABC supplies the remaining methods such as :meth:`__and__` and
1029:meth:`isdisjoint` ::
1030
1031 class ListBasedSet(collections.Set):
1032 ''' Alternate set implementation favoring space over speed
1033 and not requiring the set elements to be hashable. '''
1034 def __init__(self, iterable):
1035 self.elements = lst = []
1036 for value in iterable:
1037 if value not in lst:
1038 lst.append(value)
1039 def __iter__(self):
1040 return iter(self.elements)
1041 def __contains__(self, value):
1042 return value in self.elements
1043 def __len__(self):
1044 return len(self.elements)
1045
1046 s1 = ListBasedSet('abcdef')
1047 s2 = ListBasedSet('defghi')
1048 overlap = s1 & s2 # The __and__() method is supported automatically
1049
1050Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
1051
1052(1)
1053 Since some set operations create new sets, the default mixin methods need
1054 a way to create new instances from an iterable. The class constructor is
1055 assumed to have a signature in the form ``ClassName(iterable)``.
1056 That assumption is factored-out to an internal classmethod called
1057 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
1058 If the :class:`Set` mixin is being used in a class with a different
Raymond Hettingere5820c62011-03-22 09:11:39 -07001059 constructor signature, you will need to override :meth:`_from_iterable`
Raymond Hettingerf56c9cd2010-08-08 00:32:32 +00001060 with a classmethod that can construct new instances from
1061 an iterable argument.
1062
1063(2)
1064 To override the comparisons (presumably for speed, as the
1065 semantics are fixed), redefine :meth:`__le__` and
1066 then the other operations will automatically follow suit.
1067
1068(3)
1069 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
1070 for the set; however, :meth:`__hash__` is not defined because not all sets
1071 are hashable or immutable. To add set hashabilty using mixins,
1072 inherit from both :meth:`Set` and :meth:`Hashable`, then define
1073 ``__hash__ = Set._hash``.
1074
1075.. seealso::
1076
1077 * `OrderedSet recipe <http://code.activestate.com/recipes/576694/>`_ for an
1078 example built on :class:`MutableSet`.
1079
1080 * For more about ABCs, see the :mod:`abc` module and :pep:`3119`.