blob: 67b69a6f5491f874a5b0e0cdad5d066bb7070a9d [file] [log] [blame]
Georg Brandl8ec7f652007-08-15 14:28:01 +00001
2:mod:`collections` --- High-performance container datatypes
3===========================================================
4
5.. module:: collections
6 :synopsis: High-performance datatypes
7.. moduleauthor:: Raymond Hettinger <python@rcn.com>
8.. sectionauthor:: Raymond Hettinger <python@rcn.com>
9
10
11.. versionadded:: 2.4
12
13This module implements high-performance container datatypes. Currently,
14there are two datatypes, :class:`deque` and :class:`defaultdict`, and
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000015one datatype factory function, :func:`namedtuple`.
Georg Brandl8ec7f652007-08-15 14:28:01 +000016
17.. versionchanged:: 2.5
18 Added :class:`defaultdict`.
19
20.. versionchanged:: 2.6
Raymond Hettingereeeb9c42007-11-15 02:44:53 +000021 Added :func:`namedtuple`.
Georg Brandl8ec7f652007-08-15 14:28:01 +000022
Raymond Hettingerbc4ffc12008-02-11 23:38:00 +000023The specialized containers provided in this module provide alternatives
24to Python's general purpose built-in containers, :class:`dict`,
25:class:`list`, :class:`set`, and :class:`tuple`.
26
27Besides the containers provided here, the optional :mod:`bsddb`
28module offers the ability to create in-memory or file based ordered
29dictionaries with string keys using the :meth:`bsddb.btopen` method.
30
31In addition to containers, the collections module provides some ABCs
32(abstract base classes) that can be used to test whether a class
33provides a particular interface, for example, is it hashable or
34a mapping.
35
36.. versionchanged:: 2.6
37 Added abstract base classes.
38
39ABCs - abstract base classes
40----------------------------
41
42The collections module offers the following ABCs:
43
44========================= ==================== ====================== ====================================================
45ABC Inherits Abstract Methods Mixin Methods
46========================= ==================== ====================== ====================================================
47:class:`Container` ``__contains__``
48:class:`Hashable` ``__hash__``
49:class:`Iterable` ``__iter__``
50:class:`Iterator` :class:`Iterable` ``__next__`` ``__iter__``
51:class:`Sized` ``__len__``
52
53:class:`Mapping` :class:`Sized`, ``__getitem__``, ``__contains__``, ``keys``, ``items``, ``values``,
54 :class:`Iterable`, ``__len__``. and ``get``, ``__eq__``, and ``__ne__``
55 :class:`Container` ``__iter__``
56
57:class:`MutableMapping` :class:`Mapping` ``__getitem__`` Inherited Mapping methods and
58 ``__setitem__``, ``pop``, ``popitem``, ``clear``, ``update``,
59 ``__delitem__``, and ``setdefault``
60 ``__iter__``, and
61 ``__len__``
62
63:class:`Sequence` :class:`Sized`, ``__getitem__`` ``__contains__``. ``__iter__``, ``__reversed__``.
64 :class:`Iterable`, and ``__len__`` ``index``, and ``count``
65 :class:`Container`
66
67:class:`MutableSequnce` :class:`Sequence` ``__getitem__`` Inherited Sequence methods and
68 ``__delitem__``, ``append``, ``reverse``, ``extend``, ``pop``,
69 ``insert``, ``remove``, and ``__iadd__``
70 and ``__len__``
71
72:class:`Set` :class:`Sized`, ``__len__``, ``__le__``, ``__lt__``, ``__eq__``, ``__ne__``,
73 :class:`Iterable`, ``__iter__``, and ``__gt__``, ``__ge__``, ``__and__``, ``__or__``
74 :class:`Container` ``__contains__`` ``__sub__``, ``__xor__``, and ``isdisjoint``
75
76:class:`MutableSet` :class:`Set` ``add`` and Inherited Set methods and
77 ``discard`` ``clear``, ``pop``, ``remove``, ``__ior__``,
78 ``__iand__``, ``__ixor__``, and ``__isub__``
79========================= ==================== ====================== ====================================================
80
81These ABCs allow us to ask classes or instances if they provide
82particular functionality, for example::
83
84 size = None
85 if isinstance(myvar, collections.Sized):
86 size = len(myvar)
87
88Several of the ABCs are also useful as mixins that make it easier to develop
89classes supporting container APIs. For example, to write a class supporting
90the full :class:`Set` API, it only necessary to supply the three underlying
91abstract methods: :meth:`__contains__`, :meth:`__iter__`, and :meth:`__len__`.
92The ABC supplies the remaining methods such as :meth:`__and__` and
93:meth:`isdisjoint` ::
94
95 class ListBasedSet(collections.Set):
96 ''' Alternate set implementation favoring space over speed
97 and not requiring the set elements to be hashable. '''
98 def __init__(self, iterable):
99 self.elements = lst = []
100 for value in iterable:
101 if value not in lst:
102 lst.append(value)
103 def __iter__(self):
104 return iter(self.elements)
105 def __contains__(self, value):
106 return value in self.elements
107 def __len__(self):
108 return len(self.elements)
109
110 s1 = ListBasedSet('abcdef')
111 s2 = ListBasedSet('defghi')
112 overlap = s1 & s2 # The __and__() method is supported automatically
113
114Notes on using :class:`Set` and :class:`MutableSet` as a mixin:
115
116(1)
117 Since some set operations create new sets, the default mixin methods need
118 a way to create new instances from an iterable. The class constructor is
119 assumed to have a signature in the form ``ClassName(iterable)``.
120 That assumption is factored-out to a singleinternal classmethod called
121 :meth:`_from_iterable` which calls ``cls(iterable)`` to produce a new set.
122 If the :class:`Set` mixin is being used in a class with a different
123 constructor signature, you will need to override :meth:`from_iterable`
124 with a classmethod that can construct new instances from
125 an iterable argument.
126
127(2)
128 To override the comparisons (presumably for speed, as the
129 semantics are fixed), redefine :meth:`__le__` and
130 then the other operations will automatically follow suit.
131
132(3)
133 The :class:`Set` mixin provides a :meth:`_hash` method to compute a hash value
134 for the set; however, :meth:`__hash__` is not defined because not all sets
135 are hashable or immutable. To add set hashabilty using mixins,
136 inherit from both :meth:`Set` and :meth:`Hashable`, then define
137 ``__hash__ = Set._hash``.
138
139(For more about ABCs, see the :mod:`abc` module and :pep:`3119`.)
140
141
Georg Brandl8ec7f652007-08-15 14:28:01 +0000142
143.. _deque-objects:
144
145:class:`deque` objects
146----------------------
147
148
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000149.. class:: deque([iterable[, maxlen]])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000150
151 Returns a new deque object initialized left-to-right (using :meth:`append`) with
152 data from *iterable*. If *iterable* is not specified, the new deque is empty.
153
154 Deques are a generalization of stacks and queues (the name is pronounced "deck"
155 and is short for "double-ended queue"). Deques support thread-safe, memory
156 efficient appends and pops from either side of the deque with approximately the
157 same O(1) performance in either direction.
158
159 Though :class:`list` objects support similar operations, they are optimized for
160 fast fixed-length operations and incur O(n) memory movement costs for
161 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
162 position of the underlying data representation.
163
164 .. versionadded:: 2.4
165
Raymond Hettinger68995862007-10-10 00:26:46 +0000166 If *maxlen* is not specified or is *None*, deques may grow to an
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000167 arbitrary length. Otherwise, the deque is bounded to the specified maximum
168 length. Once a bounded length deque is full, when new items are added, a
169 corresponding number of items are discarded from the opposite end. Bounded
170 length deques provide functionality similar to the ``tail`` filter in
171 Unix. They are also useful for tracking transactions and other pools of data
172 where only the most recent activity is of interest.
173
174 .. versionchanged:: 2.6
Georg Brandlb19be572007-12-29 10:57:00 +0000175 Added *maxlen* parameter.
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000176
Georg Brandl8ec7f652007-08-15 14:28:01 +0000177Deque objects support the following methods:
178
179
180.. method:: deque.append(x)
181
182 Add *x* to the right side of the deque.
183
184
185.. method:: deque.appendleft(x)
186
187 Add *x* to the left side of the deque.
188
189
190.. method:: deque.clear()
191
192 Remove all elements from the deque leaving it with length 0.
193
194
195.. method:: deque.extend(iterable)
196
197 Extend the right side of the deque by appending elements from the iterable
198 argument.
199
200
201.. method:: deque.extendleft(iterable)
202
203 Extend the left side of the deque by appending elements from *iterable*. Note,
204 the series of left appends results in reversing the order of elements in the
205 iterable argument.
206
207
208.. method:: deque.pop()
209
210 Remove and return an element from the right side of the deque. If no elements
211 are present, raises an :exc:`IndexError`.
212
213
214.. method:: deque.popleft()
215
216 Remove and return an element from the left side of the deque. If no elements are
217 present, raises an :exc:`IndexError`.
218
219
220.. method:: deque.remove(value)
221
222 Removed the first occurrence of *value*. If not found, raises a
223 :exc:`ValueError`.
224
225 .. versionadded:: 2.5
226
227
228.. method:: deque.rotate(n)
229
230 Rotate the deque *n* steps to the right. If *n* is negative, rotate to the
231 left. Rotating one step to the right is equivalent to:
232 ``d.appendleft(d.pop())``.
233
234In addition to the above, deques support iteration, pickling, ``len(d)``,
235``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
236the :keyword:`in` operator, and subscript references such as ``d[-1]``.
237
238Example::
239
240 >>> from collections import deque
241 >>> d = deque('ghi') # make a new deque with three items
242 >>> for elem in d: # iterate over the deque's elements
243 ... print elem.upper()
244 G
245 H
246 I
247
248 >>> d.append('j') # add a new entry to the right side
249 >>> d.appendleft('f') # add a new entry to the left side
250 >>> d # show the representation of the deque
251 deque(['f', 'g', 'h', 'i', 'j'])
252
253 >>> d.pop() # return and remove the rightmost item
254 'j'
255 >>> d.popleft() # return and remove the leftmost item
256 'f'
257 >>> list(d) # list the contents of the deque
258 ['g', 'h', 'i']
259 >>> d[0] # peek at leftmost item
260 'g'
261 >>> d[-1] # peek at rightmost item
262 'i'
263
264 >>> list(reversed(d)) # list the contents of a deque in reverse
265 ['i', 'h', 'g']
266 >>> 'h' in d # search the deque
267 True
268 >>> d.extend('jkl') # add multiple elements at once
269 >>> d
270 deque(['g', 'h', 'i', 'j', 'k', 'l'])
271 >>> d.rotate(1) # right rotation
272 >>> d
273 deque(['l', 'g', 'h', 'i', 'j', 'k'])
274 >>> d.rotate(-1) # left rotation
275 >>> d
276 deque(['g', 'h', 'i', 'j', 'k', 'l'])
277
278 >>> deque(reversed(d)) # make a new deque in reverse order
279 deque(['l', 'k', 'j', 'i', 'h', 'g'])
280 >>> d.clear() # empty the deque
281 >>> d.pop() # cannot pop from an empty deque
282 Traceback (most recent call last):
283 File "<pyshell#6>", line 1, in -toplevel-
284 d.pop()
285 IndexError: pop from an empty deque
286
287 >>> d.extendleft('abc') # extendleft() reverses the input order
288 >>> d
289 deque(['c', 'b', 'a'])
290
291
292.. _deque-recipes:
293
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000294:class:`deque` Recipes
295^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl8ec7f652007-08-15 14:28:01 +0000296
297This section shows various approaches to working with deques.
298
299The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
300deletion. For example, a pure python implementation of ``del d[n]`` relies on
301the :meth:`rotate` method to position elements to be popped::
302
303 def delete_nth(d, n):
304 d.rotate(-n)
305 d.popleft()
306 d.rotate(n)
307
308To implement :class:`deque` slicing, use a similar approach applying
309:meth:`rotate` to bring a target element to the left side of the deque. Remove
310old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
311reverse the rotation.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000312With minor variations on that approach, it is easy to implement Forth style
313stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
314``rot``, and ``roll``.
315
Georg Brandl8ec7f652007-08-15 14:28:01 +0000316Multi-pass data reduction algorithms can be succinctly expressed and efficiently
317coded by extracting elements with multiple calls to :meth:`popleft`, applying
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000318a reduction function, and calling :meth:`append` to add the result back to the
319deque.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000320
321For example, building a balanced binary tree of nested lists entails reducing
322two adjacent nodes into one by grouping them in a list::
323
324 >>> def maketree(iterable):
325 ... d = deque(iterable)
326 ... while len(d) > 1:
327 ... pair = [d.popleft(), d.popleft()]
328 ... d.append(pair)
329 ... return list(d)
330 ...
331 >>> print maketree('abcdefgh')
332 [[[['a', 'b'], ['c', 'd']], [['e', 'f'], ['g', 'h']]]]
333
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000334Bounded length deques provide functionality similar to the ``tail`` filter
335in Unix::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000336
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000337 def tail(filename, n=10):
338 'Return the last n lines of a file'
339 return deque(open(filename), n)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000340
341.. _defaultdict-objects:
342
343:class:`defaultdict` objects
344----------------------------
345
346
347.. class:: defaultdict([default_factory[, ...]])
348
349 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
350 builtin :class:`dict` class. It overrides one method and adds one writable
351 instance variable. The remaining functionality is the same as for the
352 :class:`dict` class and is not documented here.
353
354 The first argument provides the initial value for the :attr:`default_factory`
355 attribute; it defaults to ``None``. All remaining arguments are treated the same
356 as if they were passed to the :class:`dict` constructor, including keyword
357 arguments.
358
359 .. versionadded:: 2.5
360
361:class:`defaultdict` objects support the following method in addition to the
362standard :class:`dict` operations:
363
364
365.. method:: defaultdict.__missing__(key)
366
367 If the :attr:`default_factory` attribute is ``None``, this raises an
368 :exc:`KeyError` exception with the *key* as argument.
369
370 If :attr:`default_factory` is not ``None``, it is called without arguments to
371 provide a default value for the given *key*, this value is inserted in the
372 dictionary for the *key*, and returned.
373
374 If calling :attr:`default_factory` raises an exception this exception is
375 propagated unchanged.
376
377 This method is called by the :meth:`__getitem__` method of the :class:`dict`
378 class when the requested key is not found; whatever it returns or raises is then
379 returned or raised by :meth:`__getitem__`.
380
381:class:`defaultdict` objects support the following instance variable:
382
383
384.. attribute:: defaultdict.default_factory
385
386 This attribute is used by the :meth:`__missing__` method; it is initialized from
387 the first argument to the constructor, if present, or to ``None``, if absent.
388
389
390.. _defaultdict-examples:
391
392:class:`defaultdict` Examples
393^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
394
395Using :class:`list` as the :attr:`default_factory`, it is easy to group a
396sequence of key-value pairs into a dictionary of lists::
397
398 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
399 >>> d = defaultdict(list)
400 >>> for k, v in s:
401 ... d[k].append(v)
402 ...
403 >>> d.items()
404 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
405
406When each key is encountered for the first time, it is not already in the
407mapping; so an entry is automatically created using the :attr:`default_factory`
408function which returns an empty :class:`list`. The :meth:`list.append`
409operation then attaches the value to the new list. When keys are encountered
410again, the look-up proceeds normally (returning the list for that key) and the
411:meth:`list.append` operation adds another value to the list. This technique is
412simpler and faster than an equivalent technique using :meth:`dict.setdefault`::
413
414 >>> d = {}
415 >>> for k, v in s:
416 ... d.setdefault(k, []).append(v)
417 ...
418 >>> d.items()
419 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
420
421Setting the :attr:`default_factory` to :class:`int` makes the
422:class:`defaultdict` useful for counting (like a bag or multiset in other
423languages)::
424
425 >>> s = 'mississippi'
426 >>> d = defaultdict(int)
427 >>> for k in s:
428 ... d[k] += 1
429 ...
430 >>> d.items()
431 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
432
433When a letter is first encountered, it is missing from the mapping, so the
434:attr:`default_factory` function calls :func:`int` to supply a default count of
435zero. The increment operation then builds up the count for each letter.
436
437The function :func:`int` which always returns zero is just a special case of
438constant functions. A faster and more flexible way to create constant functions
439is to use :func:`itertools.repeat` which can supply any constant value (not just
440zero)::
441
442 >>> def constant_factory(value):
443 ... return itertools.repeat(value).next
444 >>> d = defaultdict(constant_factory('<missing>'))
445 >>> d.update(name='John', action='ran')
446 >>> '%(name)s %(action)s to %(object)s' % d
447 'John ran to <missing>'
448
449Setting the :attr:`default_factory` to :class:`set` makes the
450:class:`defaultdict` useful for building a dictionary of sets::
451
452 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
453 >>> d = defaultdict(set)
454 >>> for k, v in s:
455 ... d[k].add(v)
456 ...
457 >>> d.items()
458 [('blue', set([2, 4])), ('red', set([1, 3]))]
459
460
461.. _named-tuple-factory:
462
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000463:func:`namedtuple` Factory Function for Tuples with Named Fields
Georg Brandlb3255ed2008-01-07 16:43:47 +0000464----------------------------------------------------------------
Georg Brandl8ec7f652007-08-15 14:28:01 +0000465
Raymond Hettingercbab5942007-09-18 22:18:02 +0000466Named tuples assign meaning to each position in a tuple and allow for more readable,
467self-documenting code. They can be used wherever regular tuples are used, and
468they add the ability to access fields by name instead of position index.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000469
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000470.. function:: namedtuple(typename, fieldnames, [verbose])
Georg Brandl8ec7f652007-08-15 14:28:01 +0000471
472 Returns a new tuple subclass named *typename*. The new subclass is used to
473 create tuple-like objects that have fields accessable by attribute lookup as
474 well as being indexable and iterable. Instances of the subclass also have a
475 helpful docstring (with typename and fieldnames) and a helpful :meth:`__repr__`
476 method which lists the tuple contents in a ``name=value`` format.
477
Raymond Hettingera48a2992007-10-08 21:26:58 +0000478 The *fieldnames* are a single string with each fieldname separated by whitespace
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000479 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *fieldnames*
480 can be a sequence of strings such as ``['x', 'y']``.
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000481
482 Any valid Python identifier may be used for a fieldname except for names
Raymond Hettinger42da8742007-12-14 02:49:47 +0000483 starting with an underscore. Valid identifiers consist of letters, digits,
484 and underscores but do not start with a digit or underscore and cannot be
Raymond Hettingerabfd8df2007-10-16 21:28:32 +0000485 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*, *print*,
486 or *raise*.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000487
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000488 If *verbose* is true, the class definition is printed just before being built.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000489
Raymond Hettingera48a2992007-10-08 21:26:58 +0000490 Named tuple instances do not have per-instance dictionaries, so they are
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000491 lightweight and require no more memory than regular tuples.
Raymond Hettingercbab5942007-09-18 22:18:02 +0000492
Georg Brandl8ec7f652007-08-15 14:28:01 +0000493 .. versionadded:: 2.6
494
Raymond Hettingercbab5942007-09-18 22:18:02 +0000495Example::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000496
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000497 >>> Point = namedtuple('Point', 'x y', verbose=True)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000498 class Point(tuple):
499 'Point(x, y)'
Raymond Hettinger48eca672007-12-14 18:08:20 +0000500
Raymond Hettingercbab5942007-09-18 22:18:02 +0000501 __slots__ = ()
Raymond Hettinger48eca672007-12-14 18:08:20 +0000502
Raymond Hettingere0734e72008-01-04 03:22:53 +0000503 _fields = ('x', 'y')
504
Raymond Hettingercbab5942007-09-18 22:18:02 +0000505 def __new__(cls, x, y):
506 return tuple.__new__(cls, (x, y))
Raymond Hettinger48eca672007-12-14 18:08:20 +0000507
Raymond Hettinger02740f72008-01-05 01:35:43 +0000508 @classmethod
509 def _make(cls, iterable):
510 'Make a new Point object from a sequence or iterable'
511 result = tuple.__new__(cls, iterable)
512 if len(result) != 2:
513 raise TypeError('Expected 2 arguments, got %d' % len(result))
514 return result
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000515
Raymond Hettingercbab5942007-09-18 22:18:02 +0000516 def __repr__(self):
517 return 'Point(x=%r, y=%r)' % self
Raymond Hettinger48eca672007-12-14 18:08:20 +0000518
Raymond Hettinger8777bca2007-12-18 22:21:27 +0000519 def _asdict(t):
Raymond Hettinger48eca672007-12-14 18:08:20 +0000520 'Return a new dict which maps field names to their values'
Raymond Hettinger8777bca2007-12-18 22:21:27 +0000521 return {'x': t[0], 'y': t[1]}
Raymond Hettinger48eca672007-12-14 18:08:20 +0000522
Raymond Hettinger42da8742007-12-14 02:49:47 +0000523 def _replace(self, **kwds):
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000524 'Return a new Point object replacing specified fields with new values'
Raymond Hettinger11668722008-01-06 09:02:24 +0000525 result = self._make(map(kwds.pop, ('x', 'y'), self))
Raymond Hettinger1b50fd72008-01-05 02:17:24 +0000526 if kwds:
527 raise ValueError('Got unexpected field names: %r' % kwds.keys())
528 return result
Raymond Hettinger88880b22007-12-18 00:13:45 +0000529
Raymond Hettingercbab5942007-09-18 22:18:02 +0000530 x = property(itemgetter(0))
531 y = property(itemgetter(1))
Georg Brandl8ec7f652007-08-15 14:28:01 +0000532
Raymond Hettingercbab5942007-09-18 22:18:02 +0000533 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
Raymond Hettinger88880b22007-12-18 00:13:45 +0000534 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000535 33
536 >>> x, y = p # unpack like a regular tuple
537 >>> x, y
538 (11, 22)
539 >>> p.x + p.y # fields also accessable by name
540 33
541 >>> p # readable __repr__ with a name=value style
542 Point(x=11, y=22)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000543
Raymond Hettingercbab5942007-09-18 22:18:02 +0000544Named tuples are especially useful for assigning field names to result tuples returned
545by the :mod:`csv` or :mod:`sqlite3` modules::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000546
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000547 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Raymond Hettingera48a2992007-10-08 21:26:58 +0000548
Raymond Hettingercbab5942007-09-18 22:18:02 +0000549 import csv
Raymond Hettinger02740f72008-01-05 01:35:43 +0000550 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
Raymond Hettingercbab5942007-09-18 22:18:02 +0000551 print emp.name, emp.title
Georg Brandl8ec7f652007-08-15 14:28:01 +0000552
Raymond Hettingera48a2992007-10-08 21:26:58 +0000553 import sqlite3
554 conn = sqlite3.connect('/companydata')
555 cursor = conn.cursor()
556 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
Raymond Hettinger02740f72008-01-05 01:35:43 +0000557 for emp in map(EmployeeRecord._make, cursor.fetchall()):
Raymond Hettingera48a2992007-10-08 21:26:58 +0000558 print emp.name, emp.title
559
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000560In addition to the methods inherited from tuples, named tuples support
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000561three additional methods and one attribute. To prevent conflicts with
562field names, the method and attribute names start with an underscore.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000563
Georg Brandlb3255ed2008-01-07 16:43:47 +0000564.. method:: somenamedtuple._make(iterable)
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000565
Raymond Hettinger02740f72008-01-05 01:35:43 +0000566 Class method that makes a new instance from an existing sequence or iterable.
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000567
568::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000569
Raymond Hettinger02740f72008-01-05 01:35:43 +0000570 >>> t = [11, 22]
571 >>> Point._make(t)
572 Point(x=11, y=22)
Raymond Hettinger2b03d452007-09-18 03:33:19 +0000573
Georg Brandlb3255ed2008-01-07 16:43:47 +0000574.. method:: somenamedtuple._asdict()
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000575
576 Return a new dict which maps field names to their corresponding values:
577
578::
579
Raymond Hettinger42da8742007-12-14 02:49:47 +0000580 >>> p._asdict()
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000581 {'x': 11, 'y': 22}
582
Georg Brandlb3255ed2008-01-07 16:43:47 +0000583.. method:: somenamedtuple._replace(kwargs)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000584
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000585 Return a new instance of the named tuple replacing specified fields with new values:
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000586
587::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000588
Raymond Hettingercbab5942007-09-18 22:18:02 +0000589 >>> p = Point(x=11, y=22)
Raymond Hettinger42da8742007-12-14 02:49:47 +0000590 >>> p._replace(x=33)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000591 Point(x=33, y=22)
592
Raymond Hettinger7c3738e2007-11-15 03:16:09 +0000593 >>> for partnum, record in inventory.items():
Raymond Hettingere11230e2008-01-09 03:02:23 +0000594 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000595
Georg Brandlb3255ed2008-01-07 16:43:47 +0000596.. attribute:: somenamedtuple._fields
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000597
Raymond Hettingerf6b769b2008-01-07 21:33:51 +0000598 Tuple of strings listing the field names. Useful for introspection
Raymond Hettingera7fc4b12007-10-05 02:47:07 +0000599 and for creating new named tuple types from existing named tuples.
Raymond Hettinger7268e9d2007-09-20 03:03:43 +0000600
601::
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000602
Raymond Hettinger42da8742007-12-14 02:49:47 +0000603 >>> p._fields # view the field names
Raymond Hettingercbab5942007-09-18 22:18:02 +0000604 ('x', 'y')
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000605
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000606 >>> Color = namedtuple('Color', 'red green blue')
Raymond Hettinger42da8742007-12-14 02:49:47 +0000607 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
Raymond Hettingercbab5942007-09-18 22:18:02 +0000608 >>> Pixel(11, 22, 128, 255, 0)
Raymond Hettingerdc1854d2008-01-09 03:13:20 +0000609 Pixel(x=11, y=22, red=128, green=255, blue=0)
Raymond Hettingerd36a60e2007-09-17 00:55:00 +0000610
Raymond Hettingere846f382007-12-14 21:51:50 +0000611To retrieve a field whose name is stored in a string, use the :func:`getattr`
Georg Brandlb3255ed2008-01-07 16:43:47 +0000612function::
Raymond Hettingere846f382007-12-14 21:51:50 +0000613
614 >>> getattr(p, 'x')
615 11
616
Raymond Hettingerd1ef8542008-01-11 00:23:13 +0000617To convert a dictionary to a named tuple, use the double-star-operator [#]_::
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000618
619 >>> d = {'x': 11, 'y': 22}
620 >>> Point(**d)
621 Point(x=11, y=22)
622
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000623Since a named tuple is a regular Python class, it is easy to add or change
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000624functionality with a subclass. Here is how to add a calculated field and
Georg Brandl503f2932008-01-07 09:18:17 +0000625a fixed-width print format::
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000626
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000627 >>> class Point(namedtuple('Point', 'x y')):
Raymond Hettingere1655082008-01-10 19:15:10 +0000628 ... __slots__ = ()
Raymond Hettingere11230e2008-01-09 03:02:23 +0000629 ... @property
630 ... def hypot(self):
631 ... return (self.x ** 2 + self.y ** 2) ** 0.5
632 ... def __str__(self):
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000633 ... return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Raymond Hettingerb8e00722008-01-07 04:24:49 +0000634
Raymond Hettingere1655082008-01-10 19:15:10 +0000635 >>> for p in Point(3, 4), Point(14, 5/7.):
Raymond Hettingere11230e2008-01-09 03:02:23 +0000636 ... print p
Raymond Hettinger9a359212008-01-07 20:07:38 +0000637
Raymond Hettinger15b5e552008-01-10 23:00:01 +0000638 Point: x= 3.000 y= 4.000 hypot= 5.000
639 Point: x=14.000 y= 0.714 hypot=14.018
Raymond Hettingereeeb9c42007-11-15 02:44:53 +0000640
Raymond Hettinger9bba7b72008-01-27 10:47:55 +0000641The subclass shown above sets ``__slots__`` to an empty tuple. This keeps
Raymond Hettinger171f3912008-01-16 23:38:16 +0000642keep memory requirements low by preventing the creation of instance dictionaries.
Raymond Hettingerf59e9622008-01-15 20:52:42 +0000643
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000644Subclassing is not useful for adding new, stored fields. Instead, simply
645create a new named tuple type from the :attr:`_fields` attribute::
646
Raymond Hettingere850c462008-01-10 20:37:12 +0000647 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Raymond Hettingerac5742e2008-01-08 02:24:15 +0000648
Raymond Hettingerfb3ced62008-01-07 20:17:35 +0000649Default values can be implemented by using :meth:`_replace` to
Raymond Hettinger9a359212008-01-07 20:07:38 +0000650customize a prototype instance::
Raymond Hettingerbc693492007-11-15 22:39:34 +0000651
652 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Raymond Hettinger0fe6ca42008-01-18 21:14:58 +0000653 >>> default_account = Account('<owner name>', 0.0, 0)
654 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerbc693492007-11-15 22:39:34 +0000655
Mark Summerfield7f626f42007-08-30 15:03:03 +0000656.. rubric:: Footnotes
657
Raymond Hettinger85dfcf32007-12-18 23:51:15 +0000658.. [#] For information on the double-star-operator see
Mark Summerfield7f626f42007-08-30 15:03:03 +0000659 :ref:`tut-unpacking-arguments` and :ref:`calls`.