blob: b1c1e3aa7fdb485ea170a25ae3b8e0342945da27 [file] [log] [blame]
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001:mod:`collections` --- Container datatypes
2==========================================
Georg Brandl116aa622007-08-15 14:28:22 +00003
4.. module:: collections
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07005 :synopsis: Container datatypes
Georg Brandl116aa622007-08-15 14:28:22 +00006.. moduleauthor:: Raymond Hettinger <python@rcn.com>
7.. sectionauthor:: Raymond Hettinger <python@rcn.com>
8
Christian Heimesfe337bf2008-03-23 21:54:12 +00009.. testsetup:: *
10
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070011 from collections import *
12 import itertools
13 __name__ = '<doctest>'
Georg Brandl116aa622007-08-15 14:28:22 +000014
Raymond Hettinger158c9c22011-02-22 00:41:50 +000015**Source code:** :source:`Lib/collections/__init__.py`
Raymond Hettinger10480942011-01-10 03:26:08 +000016
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000017--------------
18
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000019This module implements specialized container datatypes providing alternatives to
20Python's general purpose built-in containers, :class:`dict`, :class:`list`,
21:class:`set`, and :class:`tuple`.
Christian Heimes0bd4e112008-02-12 22:59:25 +000022
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000023===================== ====================================================================
24:func:`namedtuple` factory function for creating tuple subclasses with named fields
25:class:`deque` list-like container with fast appends and pops on either end
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000026:class:`ChainMap` dict-like class for creating a single view of multiple mappings
Raymond Hettingera6b76ba2010-08-08 00:29:08 +000027:class:`Counter` dict subclass for counting hashable objects
28:class:`OrderedDict` dict subclass that remembers the order entries were added
29:class:`defaultdict` dict subclass that calls a factory function to supply missing values
30:class:`UserDict` wrapper around dictionary objects for easier dict subclassing
31:class:`UserList` wrapper around list objects for easier list subclassing
32:class:`UserString` wrapper around string objects for easier string subclassing
33===================== ====================================================================
Georg Brandl116aa622007-08-15 14:28:22 +000034
Raymond Hettinger158c9c22011-02-22 00:41:50 +000035.. versionchanged:: 3.3
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070036 Moved :ref:`collections-abstract-base-classes` to the :mod:`collections.abc` module.
37 For backwards compatibility, they continue to be visible in this module
38 as well.
Mark Summerfield08898b42007-09-05 08:43:04 +000039
40
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000041:class:`ChainMap` objects
42-------------------------
43
Georg Brandl283b96b2012-04-03 09:16:46 +020044.. versionadded:: 3.3
45
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000046A :class:`ChainMap` class is provided for quickly linking a number of mappings
47so they can be treated as a single unit. It is often much faster than creating
48a new dictionary and running multiple :meth:`~dict.update` calls.
49
50The class can be used to simulate nested scopes and is useful in templating.
51
52.. class:: ChainMap(*maps)
53
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070054 A :class:`ChainMap` groups multiple dicts or other mappings together to
55 create a single, updateable view. If no *maps* are specified, a single empty
56 dictionary is provided so that a new chain always has at least one mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000057
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070058 The underlying mappings are stored in a list. That list is public and can
59 accessed or updated using the *maps* attribute. There is no other state.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000060
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070061 Lookups search the underlying mappings successively until a key is found. In
62 contrast, writes, updates, and deletions only operate on the first mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000063
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070064 A :class:`ChainMap` incorporates the underlying mappings by reference. So, if
65 one of the underlying mappings gets updated, those changes will be reflected
66 in :class:`ChainMap`.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000067
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070068 All of the usual dictionary methods are supported. In addition, there is a
69 *maps* attribute, a method for creating new subcontexts, and a property for
70 accessing all but the first mapping:
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000071
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070072 .. attribute:: maps
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000073
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070074 A user updateable list of mappings. The list is ordered from
75 first-searched to last-searched. It is the only stored state and can
76 be modified to change which mappings are searched. The list should
77 always contain at least one mapping.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000078
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070079 .. method:: new_child()
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000080
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070081 Returns a new :class:`ChainMap` containing a new :class:`dict` followed by
82 all of the maps in the current instance. A call to ``d.new_child()`` is
83 equivalent to: ``ChainMap({}, *d.maps)``. This method is used for
84 creating subcontexts that can be updated without altering values in any
85 of the parent mappings.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000086
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070087 .. method:: parents()
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000088
Raymond Hettinger7929cfb2012-06-09 19:15:26 -070089 Returns a new :class:`ChainMap` containing all of the maps in the current
90 instance except the first one. This is useful for skipping the first map
91 in the search. The use-cases are similar to those for the
92 :keyword:`nonlocal` keyword used in :term:`nested scopes <nested scope>`.
93 The use-cases also parallel those for the builtin :func:`super` function.
94 A reference to ``d.parents`` is equivalent to: ``ChainMap(*d.maps[1:])``.
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +000095
Raymond Hettingerbfcb4292012-06-10 11:39:44 -070096
97.. seealso::
98
99 * The `MultiContext class
100 <http://svn.enthought.com/svn/enthought/CodeTools/trunk/enthought/contexts/multi_context.py>`_
101 in the Enthought `CodeTools package
102 <https://github.com/enthought/codetools>`_ has options to support
103 writing to any mapping in the chain.
104
105 * Django's `Context class
106 <http://code.djangoproject.com/browser/django/trunk/django/template/context.py>`_
107 for templating is a read-only chain of mappings. It also features
108 pushing and popping of contexts similar to the
109 :meth:`~collections.ChainMap.new_child` method and the
110 :meth:`~collections.ChainMap.parents` property.
111
112 * The `Nested Contexts recipe
113 <http://code.activestate.com/recipes/577434/>`_ has options to control
114 whether writes and other mutations apply only to the first mapping or to
115 any mapping in the chain.
116
117 * A `greatly simplified read-only version of Chainmap
118 <http://code.activestate.com/recipes/305268/>`_.
119
120
121:class:`ChainMap` Examples and Recipes
122^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
123
124This section shows various approaches to working with chained maps.
125
126
127Example of simulating Python's internal lookup chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000128
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700129 import builtins
130 pylookup = ChainMap(locals(), globals(), vars(builtins))
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000131
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700132Example of letting user specified values take precedence over environment
133variables which in turn take precedence over default values::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000134
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700135 import os, argparse
136 defaults = {'color': 'red', 'user': guest}
137 parser = argparse.ArgumentParser()
138 parser.add_argument('-u', '--user')
139 parser.add_argument('-c', '--color')
140 user_specified = vars(parser.parse_args())
141 combined = ChainMap(user_specified, os.environ, defaults)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000142
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700143Example patterns for using the :class:`ChainMap` class to simulate nested
144contexts::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000145
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700146 c = ChainMap() # Create root context
147 d = c.new_child() # Create nested child context
148 e = c.new_child() # Child of c, independent from d
149 e.maps[0] # Current context dictionary -- like Python's locals()
150 e.maps[-1] # Root context -- like Python's globals()
151 e.parents # Enclosing context chain -- like Python's nonlocals
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000152
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700153 d['x'] # Get first key in the chain of contexts
154 d['x'] = 1 # Set value in current context
155 del['x'] # Delete from current context
156 list(d) # All nested values
157 k in d # Check all nested values
158 len(d) # Number of nested values
159 d.items() # All nested items
160 dict(d) # Flatten into a regular dictionary
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000161
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700162The :class:`ChainMap` class only makes updates (writes and deletions) to the
163first mapping in the chain while lookups will search the full chain. However,
164if deep writes and deletions are desired, it is easy to make a subclass that
165updates keys found deeper in the chain::
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000166
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700167 class DeepChainMap(ChainMap):
168 'Variant of ChainMap that allows direct updates to inner scopes'
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000169
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700170 def __setitem__(self, key, value):
171 for mapping in self.maps:
172 if key in mapping:
173 mapping[key] = value
174 return
175 self.maps[0][key] = value
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000176
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700177 def __delitem__(self, key):
178 for mapping in self.maps:
179 if key in mapping:
180 del mapping[key]
181 return
182 raise KeyError(key)
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000183
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700184 >>> d = DeepChainMap({'zebra': 'black'}, {'elephant' : 'blue'}, {'lion' : 'yellow'})
185 >>> d['lion'] = 'orange' # update an existing key two levels down
186 >>> d['snake'] = 'red' # new keys get added to the topmost dict
187 >>> del d['elephant'] # remove an existing key one level down
188 DeepChainMap({'zebra': 'black', 'snake': 'red'}, {}, {'lion': 'orange'})
Georg Brandl4dcf4742012-03-08 20:35:08 +0100189
Raymond Hettinger9fe1ccf2011-02-26 01:02:51 +0000190
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000191:class:`Counter` objects
192------------------------
193
194A counter tool is provided to support convenient and rapid tallies.
195For example::
196
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000197 >>> # Tally occurrences of words in a list
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000198 >>> cnt = Counter()
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000199 >>> for word in ['red', 'blue', 'red', 'green', 'blue', 'blue']:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000200 ... cnt[word] += 1
201 >>> cnt
202 Counter({'blue': 3, 'red': 2, 'green': 1})
203
Raymond Hettinger1c62dc92009-02-04 11:41:45 +0000204 >>> # Find the ten most common words in Hamlet
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000205 >>> import re
206 >>> words = re.findall('\w+', open('hamlet.txt').read().lower())
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000207 >>> Counter(words).most_common(10)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000208 [('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
209 ('you', 554), ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]
210
211.. class:: Counter([iterable-or-mapping])
212
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700213 A :class:`Counter` is a :class:`dict` subclass for counting hashable objects.
214 It is an unordered collection where elements are stored as dictionary keys
215 and their counts are stored as dictionary values. Counts are allowed to be
216 any integer value including zero or negative counts. The :class:`Counter`
217 class is similar to bags or multisets in other languages.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000218
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700219 Elements are counted from an *iterable* or initialized from another
220 *mapping* (or counter):
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000221
Raymond Hettinger73662a52009-01-27 02:38:22 +0000222 >>> c = Counter() # a new, empty counter
223 >>> c = Counter('gallahad') # a new counter from an iterable
224 >>> c = Counter({'red': 4, 'blue': 2}) # a new counter from a mapping
225 >>> c = Counter(cats=4, dogs=8) # a new counter from keyword args
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000226
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700227 Counter objects have a dictionary interface except that they return a zero
228 count for missing items instead of raising a :exc:`KeyError`:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000229
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000230 >>> c = Counter(['eggs', 'ham'])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000231 >>> c['bacon'] # count of a missing element is zero
232 0
233
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700234 Setting a count to zero does not remove an element from a counter.
235 Use ``del`` to remove it entirely:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000236
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000237 >>> c['sausage'] = 0 # counter entry with a zero count
238 >>> del c['sausage'] # del actually removes the entry
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000239
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700240 .. versionadded:: 3.1
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000241
242
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700243 Counter objects support three methods beyond those available for all
244 dictionaries:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000245
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700246 .. method:: elements()
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000247
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700248 Return an iterator over elements repeating each as many times as its
249 count. Elements are returned in arbitrary order. If an element's count
250 is less than one, :meth:`elements` will ignore it.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000251
Raymond Hettinger0bae6622009-01-20 13:00:59 +0000252 >>> c = Counter(a=4, b=2, c=0, d=-2)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000253 >>> list(c.elements())
254 ['a', 'a', 'a', 'a', 'b', 'b']
255
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700256 .. method:: most_common([n])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000257
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700258 Return a list of the *n* most common elements and their counts from the
259 most common to the least. If *n* is not specified, :func:`most_common`
260 returns *all* elements in the counter. Elements with equal counts are
261 ordered arbitrarily:
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000262
263 >>> Counter('abracadabra').most_common(3)
264 [('a', 5), ('r', 2), ('b', 2)]
265
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700266 .. method:: subtract([iterable-or-mapping])
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000267
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700268 Elements are subtracted from an *iterable* or from another *mapping*
269 (or counter). Like :meth:`dict.update` but subtracts counts instead
270 of replacing them. Both inputs and outputs may be zero or negative.
Raymond Hettinger9c01e442010-04-03 10:32:58 +0000271
272 >>> c = Counter(a=4, b=2, c=0, d=-2)
273 >>> d = Counter(a=1, b=2, c=3, d=4)
274 >>> c.subtract(d)
275 Counter({'a': 3, 'b': 0, 'c': -3, 'd': -6})
276
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700277 .. versionadded:: 3.2
Ezio Melotti0be8b1c2010-04-04 06:53:44 +0000278
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700279 The usual dictionary methods are available for :class:`Counter` objects
280 except for two which work differently for counters.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000281
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700282 .. method:: fromkeys(iterable)
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000283
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700284 This class method is not implemented for :class:`Counter` objects.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000285
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700286 .. method:: update([iterable-or-mapping])
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000287
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700288 Elements are counted from an *iterable* or added-in from another
289 *mapping* (or counter). Like :meth:`dict.update` but adds counts
290 instead of replacing them. Also, the *iterable* is expected to be a
291 sequence of elements, not a sequence of ``(key, value)`` pairs.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000292
293Common patterns for working with :class:`Counter` objects::
294
Raymond Hettinger73662a52009-01-27 02:38:22 +0000295 sum(c.values()) # total of all counts
296 c.clear() # reset all counts
297 list(c) # list unique elements
298 set(c) # convert to a set
299 dict(c) # convert to a regular dictionary
300 c.items() # convert to a list of (elem, cnt) pairs
301 Counter(dict(list_of_pairs)) # convert from a list of (elem, cnt) pairs
302 c.most_common()[:-n:-1] # n least common elements
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700303 +c # remove zero and negative counts
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000304
Raymond Hettinger72a95cc2009-02-25 22:51:40 +0000305Several mathematical operations are provided for combining :class:`Counter`
306objects to produce multisets (counters that have counts greater than zero).
307Addition and subtraction combine counters by adding or subtracting the counts
308of corresponding elements. Intersection and union return the minimum and
309maximum of corresponding counts. Each operation can accept inputs with signed
310counts, but the output will exclude results with counts of zero or less.
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000311
Raymond Hettingere0d1b9f2009-01-21 20:36:27 +0000312 >>> c = Counter(a=3, b=1)
313 >>> d = Counter(a=1, b=2)
Raymond Hettinger73662a52009-01-27 02:38:22 +0000314 >>> c + d # add two counters together: c[x] + d[x]
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000315 Counter({'a': 4, 'b': 3})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000316 >>> c - d # subtract (keeping only positive counts)
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000317 Counter({'a': 2})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000318 >>> c & d # intersection: min(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000319 Counter({'a': 1, 'b': 1})
Raymond Hettinger73662a52009-01-27 02:38:22 +0000320 >>> c | d # union: max(c[x], d[x])
Raymond Hettinger4d2073a2009-01-20 03:41:22 +0000321 Counter({'a': 3, 'b': 2})
322
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700323Unary addition and substraction are shortcuts for adding an empty counter
324or subtracting from an empty counter.
325
326 >>> c = Counter(a=2, b=-4)
327 >>> +c
328 Counter({'a': 2})
329 >>> -c
330 Counter({'b': 4})
331
332.. versionadded:: 3.3
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700333 Added support for unary plus, unary minus, and in-place multiset operations.
Raymond Hettingerfcb393c2011-08-09 13:00:40 -0700334
Raymond Hettinger22f18852010-04-12 21:45:14 +0000335.. note::
336
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700337 Counters were primarily designed to work with positive integers to represent
338 running counts; however, care was taken to not unnecessarily preclude use
339 cases needing other types or negative values. To help with those use cases,
340 this section documents the minimum range and type restrictions.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000341
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700342 * The :class:`Counter` class itself is a dictionary subclass with no
343 restrictions on its keys and values. The values are intended to be numbers
344 representing counts, but you *could* store anything in the value field.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000345
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700346 * The :meth:`most_common` method requires only that the values be orderable.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000347
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700348 * For in-place operations such as ``c[key] += 1``, the value type need only
349 support addition and subtraction. So fractions, floats, and decimals would
350 work and negative values are supported. The same is also true for
351 :meth:`update` and :meth:`subtract` which allow negative and zero values
352 for both inputs and outputs.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000353
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700354 * The multiset methods are designed only for use cases with positive values.
355 The inputs may be negative or zero, but only outputs with positive values
356 are created. There are no type restrictions, but the value type needs to
357 support addition, subtraction, and comparison.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000358
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700359 * The :meth:`elements` method requires integer counts. It ignores zero and
360 negative counts.
Raymond Hettinger22f18852010-04-12 21:45:14 +0000361
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000362.. seealso::
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000363
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000364 * `Counter class <http://code.activestate.com/recipes/576611/>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700365 adapted for Python 2.5 and an early `Bag recipe
366 <http://code.activestate.com/recipes/259174/>`_ for Python 2.4.
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000367
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000368 * `Bag class <http://www.gnu.org/software/smalltalk/manual-base/html_node/Bag.html>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700369 in Smalltalk.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000370
Éric Araujo08c9bd52011-04-24 02:59:02 +0200371 * Wikipedia entry for `Multisets <http://en.wikipedia.org/wiki/Multiset>`_.
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000372
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000373 * `C++ multisets <http://www.demo2s.com/Tutorial/Cpp/0380__set-multiset/Catalog0380__set-multiset.htm>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700374 tutorial with examples.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000375
Raymond Hettinger94adc8e2009-01-22 05:27:37 +0000376 * For mathematical operations on multisets and their use cases, see
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700377 *Knuth, Donald. The Art of Computer Programming Volume II,
378 Section 4.6.3, Exercise 19*.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000379
Raymond Hettinger670eaec2009-01-21 23:14:07 +0000380 * To enumerate all distinct multisets of a given size over a given set of
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700381 elements, see :func:`itertools.combinations_with_replacement`.
Raymond Hettingerb14043c2009-01-20 23:44:31 +0000382
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700383 map(Counter, combinations_with_replacement('ABC', 2)) --> AA AB AC BB BC CC
Raymond Hettingerb8baf632009-01-14 02:20:07 +0000384
385
Georg Brandl116aa622007-08-15 14:28:22 +0000386:class:`deque` objects
387----------------------
388
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000389.. class:: deque([iterable, [maxlen]])
Georg Brandl116aa622007-08-15 14:28:22 +0000390
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700391 Returns a new deque object initialized left-to-right (using :meth:`append`) with
392 data from *iterable*. If *iterable* is not specified, the new deque is empty.
Georg Brandl116aa622007-08-15 14:28:22 +0000393
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700394 Deques are a generalization of stacks and queues (the name is pronounced "deck"
395 and is short for "double-ended queue"). Deques support thread-safe, memory
396 efficient appends and pops from either side of the deque with approximately the
397 same O(1) performance in either direction.
Georg Brandl116aa622007-08-15 14:28:22 +0000398
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700399 Though :class:`list` objects support similar operations, they are optimized for
400 fast fixed-length operations and incur O(n) memory movement costs for
401 ``pop(0)`` and ``insert(0, v)`` operations which change both the size and
402 position of the underlying data representation.
Georg Brandl116aa622007-08-15 14:28:22 +0000403
Georg Brandl116aa622007-08-15 14:28:22 +0000404
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700405 If *maxlen* is not specified or is *None*, deques may grow to an
406 arbitrary length. Otherwise, the deque is bounded to the specified maximum
407 length. Once a bounded length deque is full, when new items are added, a
408 corresponding number of items are discarded from the opposite end. Bounded
409 length deques provide functionality similar to the ``tail`` filter in
410 Unix. They are also useful for tracking transactions and other pools of data
411 where only the most recent activity is of interest.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000412
Georg Brandl9afde1c2007-11-01 20:32:30 +0000413
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700414 Deque objects support the following methods:
Georg Brandl116aa622007-08-15 14:28:22 +0000415
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700416 .. method:: append(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000417
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700418 Add *x* to the right side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000419
420
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700421 .. method:: appendleft(x)
Georg Brandl116aa622007-08-15 14:28:22 +0000422
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700423 Add *x* to the left side of the deque.
Georg Brandl116aa622007-08-15 14:28:22 +0000424
425
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700426 .. method:: clear()
Georg Brandl116aa622007-08-15 14:28:22 +0000427
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700428 Remove all elements from the deque leaving it with length 0.
Georg Brandl116aa622007-08-15 14:28:22 +0000429
430
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700431 .. method:: count(x)
Raymond Hettinger44459de2010-04-03 23:20:46 +0000432
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700433 Count the number of deque elements equal to *x*.
Raymond Hettinger44459de2010-04-03 23:20:46 +0000434
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700435 .. versionadded:: 3.2
Raymond Hettinger44459de2010-04-03 23:20:46 +0000436
Georg Brandl67b21b72010-08-17 15:07:14 +0000437
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700438 .. method:: extend(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000439
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700440 Extend the right side of the deque by appending elements from the iterable
441 argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000442
443
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700444 .. method:: extendleft(iterable)
Georg Brandl116aa622007-08-15 14:28:22 +0000445
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700446 Extend the left side of the deque by appending elements from *iterable*.
447 Note, the series of left appends results in reversing the order of
448 elements in the iterable argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000449
450
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700451 .. method:: pop()
Georg Brandl116aa622007-08-15 14:28:22 +0000452
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700453 Remove and return an element from the right side of the deque. If no
454 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000455
456
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700457 .. method:: popleft()
Georg Brandl116aa622007-08-15 14:28:22 +0000458
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700459 Remove and return an element from the left side of the deque. If no
460 elements are present, raises an :exc:`IndexError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000461
462
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700463 .. method:: remove(value)
Georg Brandl116aa622007-08-15 14:28:22 +0000464
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700465 Removed the first occurrence of *value*. If not found, raises a
466 :exc:`ValueError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000467
Georg Brandl67b21b72010-08-17 15:07:14 +0000468
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700469 .. method:: reverse()
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000470
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700471 Reverse the elements of the deque in-place and then return ``None``.
Raymond Hettingere5fdedb2009-12-10 00:47:21 +0000472
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700473 .. versionadded:: 3.2
Georg Brandl116aa622007-08-15 14:28:22 +0000474
Georg Brandl67b21b72010-08-17 15:07:14 +0000475
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700476 .. method:: rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000477
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700478 Rotate the deque *n* steps to the right. If *n* is negative, rotate to
479 the left. Rotating one step to the right is equivalent to:
480 ``d.appendleft(d.pop())``.
Benjamin Petersone41251e2008-04-25 01:59:09 +0000481
Georg Brandl116aa622007-08-15 14:28:22 +0000482
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700483 Deque objects also provide one read-only attribute:
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000484
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700485 .. attribute:: maxlen
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000486
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700487 Maximum size of a deque or *None* if unbounded.
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000488
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700489 .. versionadded:: 3.1
Raymond Hettinger5bb0f0e2009-03-10 12:56:32 +0000490
491
Georg Brandl116aa622007-08-15 14:28:22 +0000492In addition to the above, deques support iteration, pickling, ``len(d)``,
493``reversed(d)``, ``copy.copy(d)``, ``copy.deepcopy(d)``, membership testing with
Benjamin Peterson206e3072008-10-19 14:07:49 +0000494the :keyword:`in` operator, and subscript references such as ``d[-1]``. Indexed
495access is O(1) at both ends but slows to O(n) in the middle. For fast random
496access, use lists instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000497
Christian Heimesfe337bf2008-03-23 21:54:12 +0000498Example:
499
500.. doctest::
Georg Brandl116aa622007-08-15 14:28:22 +0000501
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700502 >>> from collections import deque
503 >>> d = deque('ghi') # make a new deque with three items
504 >>> for elem in d: # iterate over the deque's elements
505 ... print(elem.upper())
506 G
507 H
508 I
Georg Brandl116aa622007-08-15 14:28:22 +0000509
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700510 >>> d.append('j') # add a new entry to the right side
511 >>> d.appendleft('f') # add a new entry to the left side
512 >>> d # show the representation of the deque
513 deque(['f', 'g', 'h', 'i', 'j'])
Georg Brandl116aa622007-08-15 14:28:22 +0000514
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700515 >>> d.pop() # return and remove the rightmost item
516 'j'
517 >>> d.popleft() # return and remove the leftmost item
518 'f'
519 >>> list(d) # list the contents of the deque
520 ['g', 'h', 'i']
521 >>> d[0] # peek at leftmost item
522 'g'
523 >>> d[-1] # peek at rightmost item
524 'i'
Georg Brandl116aa622007-08-15 14:28:22 +0000525
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700526 >>> list(reversed(d)) # list the contents of a deque in reverse
527 ['i', 'h', 'g']
528 >>> 'h' in d # search the deque
529 True
530 >>> d.extend('jkl') # add multiple elements at once
531 >>> d
532 deque(['g', 'h', 'i', 'j', 'k', 'l'])
533 >>> d.rotate(1) # right rotation
534 >>> d
535 deque(['l', 'g', 'h', 'i', 'j', 'k'])
536 >>> d.rotate(-1) # left rotation
537 >>> d
538 deque(['g', 'h', 'i', 'j', 'k', 'l'])
Georg Brandl116aa622007-08-15 14:28:22 +0000539
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700540 >>> deque(reversed(d)) # make a new deque in reverse order
541 deque(['l', 'k', 'j', 'i', 'h', 'g'])
542 >>> d.clear() # empty the deque
543 >>> d.pop() # cannot pop from an empty deque
544 Traceback (most recent call last):
545 File "<pyshell#6>", line 1, in -toplevel-
546 d.pop()
547 IndexError: pop from an empty deque
Georg Brandl116aa622007-08-15 14:28:22 +0000548
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700549 >>> d.extendleft('abc') # extendleft() reverses the input order
550 >>> d
551 deque(['c', 'b', 'a'])
Georg Brandl116aa622007-08-15 14:28:22 +0000552
553
Georg Brandl9afde1c2007-11-01 20:32:30 +0000554:class:`deque` Recipes
555^^^^^^^^^^^^^^^^^^^^^^
Georg Brandl116aa622007-08-15 14:28:22 +0000556
557This section shows various approaches to working with deques.
558
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000559Bounded length deques provide functionality similar to the ``tail`` filter
560in Unix::
561
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700562 def tail(filename, n=10):
563 'Return the last n lines of a file'
564 with open(filename) as f:
565 return deque(f, n)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000566
567Another approach to using deques is to maintain a sequence of recently
568added elements by appending to the right and popping to the left::
569
570 def moving_average(iterable, n=3):
571 # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
572 # http://en.wikipedia.org/wiki/Moving_average
573 it = iter(iterable)
Raymond Hettingerd40285a2009-05-22 01:11:26 +0000574 d = deque(itertools.islice(it, n-1))
575 d.appendleft(0)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000576 s = sum(d)
Raymond Hettingerd2ee64d2009-03-31 22:52:48 +0000577 for elem in it:
578 s += elem - d.popleft()
579 d.append(elem)
580 yield s / n
581
Georg Brandl116aa622007-08-15 14:28:22 +0000582The :meth:`rotate` method provides a way to implement :class:`deque` slicing and
Ezio Melotti0639d5a2009-12-19 23:26:38 +0000583deletion. For example, a pure Python implementation of ``del d[n]`` relies on
Georg Brandl116aa622007-08-15 14:28:22 +0000584the :meth:`rotate` method to position elements to be popped::
585
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700586 def delete_nth(d, n):
587 d.rotate(-n)
588 d.popleft()
589 d.rotate(n)
Georg Brandl116aa622007-08-15 14:28:22 +0000590
591To implement :class:`deque` slicing, use a similar approach applying
592:meth:`rotate` to bring a target element to the left side of the deque. Remove
593old entries with :meth:`popleft`, add new entries with :meth:`extend`, and then
594reverse the rotation.
Georg Brandl116aa622007-08-15 14:28:22 +0000595With minor variations on that approach, it is easy to implement Forth style
596stack manipulations such as ``dup``, ``drop``, ``swap``, ``over``, ``pick``,
597``rot``, and ``roll``.
598
Georg Brandl116aa622007-08-15 14:28:22 +0000599
600:class:`defaultdict` objects
601----------------------------
602
Georg Brandl116aa622007-08-15 14:28:22 +0000603.. class:: defaultdict([default_factory[, ...]])
604
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700605 Returns a new dictionary-like object. :class:`defaultdict` is a subclass of the
606 built-in :class:`dict` class. It overrides one method and adds one writable
607 instance variable. The remaining functionality is the same as for the
608 :class:`dict` class and is not documented here.
Georg Brandl116aa622007-08-15 14:28:22 +0000609
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700610 The first argument provides the initial value for the :attr:`default_factory`
611 attribute; it defaults to ``None``. All remaining arguments are treated the same
612 as if they were passed to the :class:`dict` constructor, including keyword
613 arguments.
Georg Brandl116aa622007-08-15 14:28:22 +0000614
Georg Brandl116aa622007-08-15 14:28:22 +0000615
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700616 :class:`defaultdict` objects support the following method in addition to the
617 standard :class:`dict` operations:
Georg Brandl116aa622007-08-15 14:28:22 +0000618
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700619 .. method:: __missing__(key)
Georg Brandl116aa622007-08-15 14:28:22 +0000620
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700621 If the :attr:`default_factory` attribute is ``None``, this raises a
622 :exc:`KeyError` exception with the *key* as argument.
Georg Brandl116aa622007-08-15 14:28:22 +0000623
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700624 If :attr:`default_factory` is not ``None``, it is called without arguments
625 to provide a default value for the given *key*, this value is inserted in
626 the dictionary for the *key*, and returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000627
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700628 If calling :attr:`default_factory` raises an exception this exception is
629 propagated unchanged.
Georg Brandl116aa622007-08-15 14:28:22 +0000630
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700631 This method is called by the :meth:`__getitem__` method of the
632 :class:`dict` class when the requested key is not found; whatever it
633 returns or raises is then returned or raised by :meth:`__getitem__`.
Georg Brandl116aa622007-08-15 14:28:22 +0000634
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700635 Note that :meth:`__missing__` is *not* called for any operations besides
636 :meth:`__getitem__`. This means that :meth:`get` will, like normal
637 dictionaries, return ``None`` as a default rather than using
638 :attr:`default_factory`.
Benjamin Peterson871b9d12012-01-27 09:14:01 -0500639
Georg Brandl116aa622007-08-15 14:28:22 +0000640
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700641 :class:`defaultdict` objects support the following instance variable:
Georg Brandl116aa622007-08-15 14:28:22 +0000642
Benjamin Petersone41251e2008-04-25 01:59:09 +0000643
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700644 .. attribute:: default_factory
Benjamin Petersone41251e2008-04-25 01:59:09 +0000645
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700646 This attribute is used by the :meth:`__missing__` method; it is
647 initialized from the first argument to the constructor, if present, or to
648 ``None``, if absent.
Georg Brandl116aa622007-08-15 14:28:22 +0000649
650
Georg Brandl116aa622007-08-15 14:28:22 +0000651:class:`defaultdict` Examples
652^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
653
654Using :class:`list` as the :attr:`default_factory`, it is easy to group a
Christian Heimesfe337bf2008-03-23 21:54:12 +0000655sequence of key-value pairs into a dictionary of lists:
Georg Brandl116aa622007-08-15 14:28:22 +0000656
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700657 >>> s = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
658 >>> d = defaultdict(list)
659 >>> for k, v in s:
660 ... d[k].append(v)
661 ...
662 >>> list(d.items())
663 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
Georg Brandl116aa622007-08-15 14:28:22 +0000664
665When each key is encountered for the first time, it is not already in the
666mapping; so an entry is automatically created using the :attr:`default_factory`
667function which returns an empty :class:`list`. The :meth:`list.append`
668operation then attaches the value to the new list. When keys are encountered
669again, the look-up proceeds normally (returning the list for that key) and the
670:meth:`list.append` operation adds another value to the list. This technique is
Christian Heimesfe337bf2008-03-23 21:54:12 +0000671simpler and faster than an equivalent technique using :meth:`dict.setdefault`:
Georg Brandl116aa622007-08-15 14:28:22 +0000672
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700673 >>> d = {}
674 >>> for k, v in s:
675 ... d.setdefault(k, []).append(v)
676 ...
677 >>> list(d.items())
678 [('blue', [2, 4]), ('red', [1]), ('yellow', [1, 3])]
Georg Brandl116aa622007-08-15 14:28:22 +0000679
680Setting the :attr:`default_factory` to :class:`int` makes the
681:class:`defaultdict` useful for counting (like a bag or multiset in other
Christian Heimesfe337bf2008-03-23 21:54:12 +0000682languages):
Georg Brandl116aa622007-08-15 14:28:22 +0000683
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700684 >>> s = 'mississippi'
685 >>> d = defaultdict(int)
686 >>> for k in s:
687 ... d[k] += 1
688 ...
689 >>> list(d.items())
690 [('i', 4), ('p', 2), ('s', 4), ('m', 1)]
Georg Brandl116aa622007-08-15 14:28:22 +0000691
692When a letter is first encountered, it is missing from the mapping, so the
693:attr:`default_factory` function calls :func:`int` to supply a default count of
694zero. The increment operation then builds up the count for each letter.
695
696The function :func:`int` which always returns zero is just a special case of
697constant functions. A faster and more flexible way to create constant functions
698is to use a lambda function which can supply any constant value (not just
Christian Heimesfe337bf2008-03-23 21:54:12 +0000699zero):
Georg Brandl116aa622007-08-15 14:28:22 +0000700
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700701 >>> def constant_factory(value):
702 ... return lambda: value
703 >>> d = defaultdict(constant_factory('<missing>'))
704 >>> d.update(name='John', action='ran')
705 >>> '%(name)s %(action)s to %(object)s' % d
706 'John ran to <missing>'
Georg Brandl116aa622007-08-15 14:28:22 +0000707
708Setting the :attr:`default_factory` to :class:`set` makes the
Christian Heimesfe337bf2008-03-23 21:54:12 +0000709:class:`defaultdict` useful for building a dictionary of sets:
Georg Brandl116aa622007-08-15 14:28:22 +0000710
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700711 >>> s = [('red', 1), ('blue', 2), ('red', 3), ('blue', 4), ('red', 1), ('blue', 4)]
712 >>> d = defaultdict(set)
713 >>> for k, v in s:
714 ... d[k].add(v)
715 ...
716 >>> list(d.items())
717 [('blue', {2, 4}), ('red', {1, 3})]
Georg Brandl116aa622007-08-15 14:28:22 +0000718
719
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000720:func:`namedtuple` Factory Function for Tuples with Named Fields
Christian Heimes790c8232008-01-07 21:14:23 +0000721----------------------------------------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +0000722
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000723Named tuples assign meaning to each position in a tuple and allow for more readable,
724self-documenting code. They can be used wherever regular tuples are used, and
725they add the ability to access fields by name instead of position index.
Georg Brandl116aa622007-08-15 14:28:22 +0000726
Georg Brandlc2a4f4f2009-04-10 09:03:43 +0000727.. function:: namedtuple(typename, field_names, verbose=False, rename=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000728
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700729 Returns a new tuple subclass named *typename*. The new subclass is used to
730 create tuple-like objects that have fields accessible by attribute lookup as
731 well as being indexable and iterable. Instances of the subclass also have a
732 helpful docstring (with typename and field_names) and a helpful :meth:`__repr__`
733 method which lists the tuple contents in a ``name=value`` format.
Georg Brandl116aa622007-08-15 14:28:22 +0000734
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700735 The *field_names* are a single string with each fieldname separated by whitespace
736 and/or commas, for example ``'x y'`` or ``'x, y'``. Alternatively, *field_names*
737 can be a sequence of strings such as ``['x', 'y']``.
Georg Brandl9afde1c2007-11-01 20:32:30 +0000738
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700739 Any valid Python identifier may be used for a fieldname except for names
740 starting with an underscore. Valid identifiers consist of letters, digits,
741 and underscores but do not start with a digit or underscore and cannot be
742 a :mod:`keyword` such as *class*, *for*, *return*, *global*, *pass*,
743 or *raise*.
Georg Brandl116aa622007-08-15 14:28:22 +0000744
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700745 If *rename* is true, invalid fieldnames are automatically replaced
746 with positional names. For example, ``['abc', 'def', 'ghi', 'abc']`` is
747 converted to ``['abc', '_1', 'ghi', '_3']``, eliminating the keyword
748 ``def`` and the duplicate fieldname ``abc``.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000749
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700750 If *verbose* is true, the class definition is printed after it is
751 built. This option is outdated; instead, it is simpler to print the
752 :attr:`_source` attribute.
Georg Brandl116aa622007-08-15 14:28:22 +0000753
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700754 Named tuple instances do not have per-instance dictionaries, so they are
755 lightweight and require no more memory than regular tuples.
Georg Brandl116aa622007-08-15 14:28:22 +0000756
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700757 .. versionchanged:: 3.1
758 Added support for *rename*.
Benjamin Petersona86f2c02009-02-10 02:41:10 +0000759
Christian Heimesfe337bf2008-03-23 21:54:12 +0000760
761.. doctest::
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700762 :options: +NORMALIZE_WHITESPACE
Georg Brandl116aa622007-08-15 14:28:22 +0000763
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700764 >>> # Basic example
765 >>> Point = namedtuple('Point', ['x', 'y'])
766 >>> p = Point(11, y=22) # instantiate with positional or keyword arguments
767 >>> p[0] + p[1] # indexable like the plain tuple (11, 22)
768 33
769 >>> x, y = p # unpack like a regular tuple
770 >>> x, y
771 (11, 22)
772 >>> p.x + p.y # fields also accessible by name
773 33
774 >>> p # readable __repr__ with a name=value style
775 Point(x=11, y=22)
Georg Brandl116aa622007-08-15 14:28:22 +0000776
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000777Named tuples are especially useful for assigning field names to result tuples returned
778by the :mod:`csv` or :mod:`sqlite3` modules::
779
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700780 EmployeeRecord = namedtuple('EmployeeRecord', 'name, age, title, department, paygrade')
Georg Brandl9afde1c2007-11-01 20:32:30 +0000781
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700782 import csv
783 for emp in map(EmployeeRecord._make, csv.reader(open("employees.csv", "rb"))):
784 print(emp.name, emp.title)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000785
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700786 import sqlite3
787 conn = sqlite3.connect('/companydata')
788 cursor = conn.cursor()
789 cursor.execute('SELECT name, age, title, department, paygrade FROM employees')
790 for emp in map(EmployeeRecord._make, cursor.fetchall()):
791 print(emp.name, emp.title)
Georg Brandl9afde1c2007-11-01 20:32:30 +0000792
Christian Heimes99170a52007-12-19 02:07:34 +0000793In addition to the methods inherited from tuples, named tuples support
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700794three additional methods and two attributes. To prevent conflicts with
Christian Heimes2380ac72008-01-09 00:17:24 +0000795field names, the method and attribute names start with an underscore.
Christian Heimes99170a52007-12-19 02:07:34 +0000796
Benjamin Peterson0b9fb802010-07-18 14:23:36 +0000797.. classmethod:: somenamedtuple._make(iterable)
Christian Heimes99170a52007-12-19 02:07:34 +0000798
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700799 Class method that makes a new instance from an existing sequence or iterable.
Christian Heimes99170a52007-12-19 02:07:34 +0000800
Christian Heimesfe337bf2008-03-23 21:54:12 +0000801.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000802
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700803 >>> t = [11, 22]
804 >>> Point._make(t)
805 Point(x=11, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000806
Christian Heimes790c8232008-01-07 21:14:23 +0000807.. method:: somenamedtuple._asdict()
Georg Brandl9afde1c2007-11-01 20:32:30 +0000808
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700809 Return a new :class:`OrderedDict` which maps field names to their corresponding
810 values. Note, this method is no longer needed now that the same effect can
811 be achieved by using the built-in :func:`vars` function::
Georg Brandl9afde1c2007-11-01 20:32:30 +0000812
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700813 >>> vars(p)
814 OrderedDict([('x', 11), ('y', 22)])
Raymond Hettingera4f52b12009-03-02 22:28:31 +0000815
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700816 .. versionchanged:: 3.1
817 Returns an :class:`OrderedDict` instead of a regular :class:`dict`.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000818
Christian Heimes790c8232008-01-07 21:14:23 +0000819.. method:: somenamedtuple._replace(kwargs)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000820
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700821 Return a new instance of the named tuple replacing specified fields with new
822 values:
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000823
824::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000825
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700826 >>> p = Point(x=11, y=22)
827 >>> p._replace(x=33)
828 Point(x=33, y=22)
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000829
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700830 >>> for partnum, record in inventory.items():
831 ... inventory[partnum] = record._replace(price=newprices[partnum], timestamp=time.now())
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000832
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700833.. attribute:: somenamedtuple._source
834
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700835 A string with the pure Python source code used to create the named
836 tuple class. The source makes the named tuple self-documenting.
837 It can be printed, executed using :func:`exec`, or saved to a file
838 and imported.
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700839
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700840 .. versionadded:: 3.3
Raymond Hettinger2ebea412011-03-23 12:52:23 -0700841
Christian Heimes790c8232008-01-07 21:14:23 +0000842.. attribute:: somenamedtuple._fields
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000843
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700844 Tuple of strings listing the field names. Useful for introspection
845 and for creating new named tuple types from existing named tuples.
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000846
Christian Heimesfe337bf2008-03-23 21:54:12 +0000847.. doctest::
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000848
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700849 >>> p._fields # view the field names
850 ('x', 'y')
Thomas Wouters1b7f8912007-09-19 03:06:30 +0000851
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700852 >>> Color = namedtuple('Color', 'red green blue')
853 >>> Pixel = namedtuple('Pixel', Point._fields + Color._fields)
854 >>> Pixel(11, 22, 128, 255, 0)
855 Pixel(x=11, y=22, red=128, green=255, blue=0)
Georg Brandl116aa622007-08-15 14:28:22 +0000856
Christian Heimes0449f632007-12-15 01:27:15 +0000857To retrieve a field whose name is stored in a string, use the :func:`getattr`
Christian Heimesfe337bf2008-03-23 21:54:12 +0000858function:
Christian Heimes0449f632007-12-15 01:27:15 +0000859
860 >>> getattr(p, 'x')
861 11
862
Raymond Hettinger651453a2009-02-11 00:20:02 +0000863To convert a dictionary to a named tuple, use the double-star-operator
864(as described in :ref:`tut-unpacking-arguments`):
Christian Heimes99170a52007-12-19 02:07:34 +0000865
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700866 >>> d = {'x': 11, 'y': 22}
867 >>> Point(**d)
868 Point(x=11, y=22)
Christian Heimes99170a52007-12-19 02:07:34 +0000869
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000870Since a named tuple is a regular Python class, it is easy to add or change
Christian Heimes043d6f62008-01-07 17:19:16 +0000871functionality with a subclass. Here is how to add a calculated field and
Christian Heimesfe337bf2008-03-23 21:54:12 +0000872a fixed-width print format:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000873
Christian Heimes043d6f62008-01-07 17:19:16 +0000874 >>> class Point(namedtuple('Point', 'x y')):
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700875 __slots__ = ()
876 @property
877 def hypot(self):
878 return (self.x ** 2 + self.y ** 2) ** 0.5
879 def __str__(self):
880 return 'Point: x=%6.3f y=%6.3f hypot=%6.3f' % (self.x, self.y, self.hypot)
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000881
Georg Brandl0df79792008-10-04 18:33:26 +0000882 >>> for p in Point(3, 4), Point(14, 5/7):
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700883 print(p)
Christian Heimes25bb7832008-01-11 16:17:00 +0000884 Point: x= 3.000 y= 4.000 hypot= 5.000
885 Point: x=14.000 y= 0.714 hypot=14.018
Christian Heimes043d6f62008-01-07 17:19:16 +0000886
Georg Brandlaf5c2382009-12-28 08:02:38 +0000887The subclass shown above sets ``__slots__`` to an empty tuple. This helps
Christian Heimes679db4a2008-01-18 09:56:22 +0000888keep memory requirements low by preventing the creation of instance dictionaries.
889
Christian Heimes2380ac72008-01-09 00:17:24 +0000890Subclassing is not useful for adding new, stored fields. Instead, simply
Christian Heimesfe337bf2008-03-23 21:54:12 +0000891create a new named tuple type from the :attr:`_fields` attribute:
Christian Heimes2380ac72008-01-09 00:17:24 +0000892
Christian Heimes25bb7832008-01-11 16:17:00 +0000893 >>> Point3D = namedtuple('Point3D', Point._fields + ('z',))
Christian Heimes2380ac72008-01-09 00:17:24 +0000894
895Default values can be implemented by using :meth:`_replace` to
Christian Heimesfe337bf2008-03-23 21:54:12 +0000896customize a prototype instance:
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000897
898 >>> Account = namedtuple('Account', 'owner balance transaction_count')
Christian Heimes587c2bf2008-01-19 16:21:02 +0000899 >>> default_account = Account('<owner name>', 0.0, 0)
900 >>> johns_account = default_account._replace(owner='John')
Raymond Hettingerb2d09452011-03-22 22:36:21 -0700901 >>> janes_account = default_account._replace(owner='Jane')
Guido van Rossum3d392eb2007-11-16 00:35:22 +0000902
Christian Heimese4ca8152008-05-08 17:18:53 +0000903Enumerated constants can be implemented with named tuples, but it is simpler
904and more efficient to use a simple class declaration:
905
906 >>> Status = namedtuple('Status', 'open pending closed')._make(range(3))
907 >>> Status.open, Status.pending, Status.closed
908 (0, 1, 2)
909 >>> class Status:
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700910 open, pending, closed = range(3)
Christian Heimese4ca8152008-05-08 17:18:53 +0000911
Raymond Hettinger651453a2009-02-11 00:20:02 +0000912.. seealso::
Thomas Wouters47b49bf2007-08-30 22:15:33 +0000913
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700914 * `Named tuple recipe <http://code.activestate.com/recipes/500261/>`_
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700915 adapted for Python 2.4.
Raymond Hettinger6c94e6f2011-03-31 15:46:06 -0700916
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700917 * `Recipe for named tuple abstract base class with a metaclass mix-in
Raymond Hettingerbfcb4292012-06-10 11:39:44 -0700918 <http://code.activestate.com/recipes/577629-namedtupleabc-abstract-base-class-mix-in-for-named/>`_
919 by Jan Kaliszewski. Besides providing an :term:`abstract base class` for
920 named tuples, it also supports an alternate :term:`metaclass`-based
921 constructor that is convenient for use cases where named tuples are being
922 subclassed.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +0000923
924
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000925:class:`OrderedDict` objects
926----------------------------
927
928Ordered dictionaries are just like regular dictionaries but they remember the
929order that items were inserted. When iterating over an ordered dictionary,
930the items are returned in the order their keys were first added.
931
932.. class:: OrderedDict([items])
933
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700934 Return an instance of a dict subclass, supporting the usual :class:`dict`
935 methods. An *OrderedDict* is a dict that remembers the order that keys
936 were first inserted. If a new entry overwrites an existing entry, the
937 original insertion position is left unchanged. Deleting an entry and
938 reinserting it will move it to the end.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000939
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700940 .. versionadded:: 3.1
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000941
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700942 .. method:: popitem(last=True)
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000943
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700944 The :meth:`popitem` method for ordered dictionaries returns and removes a
945 (key, value) pair. The pairs are returned in LIFO order if *last* is true
946 or FIFO order if false.
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000947
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700948 .. method:: move_to_end(key, last=True)
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000949
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700950 Move an existing *key* to either end of an ordered dictionary. The item
951 is moved to the right end if *last* is true (the default) or to the
952 beginning if *last* is false. Raises :exc:`KeyError` if the *key* does
953 not exist::
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000954
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700955 >>> d = OrderedDict.fromkeys('abcde')
956 >>> d.move_to_end('b')
957 >>> ''.join(d.keys())
958 'acdeb'
959 >>> d.move_to_end('b', last=False)
960 >>> ''.join(d.keys())
961 'bacde'
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000962
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700963 .. versionadded:: 3.2
Raymond Hettingerf45abc92010-09-06 21:26:09 +0000964
Raymond Hettingere9091502009-05-19 17:40:07 +0000965In addition to the usual mapping methods, ordered dictionaries also support
966reverse iteration using :func:`reversed`.
967
Raymond Hettinger2d32f632009-03-02 21:24:57 +0000968Equality tests between :class:`OrderedDict` objects are order-sensitive
969and are implemented as ``list(od1.items())==list(od2.items())``.
970Equality tests between :class:`OrderedDict` objects and other
971:class:`Mapping` objects are order-insensitive like regular dictionaries.
972This allows :class:`OrderedDict` objects to be substituted anywhere a
973regular dictionary is used.
974
Raymond Hettinger36180782009-04-09 22:34:23 +0000975The :class:`OrderedDict` constructor and :meth:`update` method both accept
976keyword arguments, but their order is lost because Python's function call
977semantics pass-in keyword arguments using a regular unordered dictionary.
978
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000979.. seealso::
980
Raymond Hettinger7929cfb2012-06-09 19:15:26 -0700981 `Equivalent OrderedDict recipe <http://code.activestate.com/recipes/576693/>`_
982 that runs on Python 2.4 or later.
Raymond Hettingerdc879f02009-03-19 20:30:56 +0000983
Raymond Hettinger7bba6832011-04-15 17:43:19 -0700984:class:`OrderedDict` Examples and Recipes
985^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
986
Raymond Hettinger0e312012009-11-10 18:35:46 +0000987Since an ordered dictionary remembers its insertion order, it can be used
988in conjuction with sorting to make a sorted dictionary::
989
990 >>> # regular unsorted dictionary
991 >>> d = {'banana': 3, 'apple':4, 'pear': 1, 'orange': 2}
992
993 >>> # dictionary sorted by key
994 >>> OrderedDict(sorted(d.items(), key=lambda t: t[0]))
995 OrderedDict([('apple', 4), ('banana', 3), ('orange', 2), ('pear', 1)])
996
997 >>> # dictionary sorted by value
998 >>> OrderedDict(sorted(d.items(), key=lambda t: t[1]))
999 OrderedDict([('pear', 1), ('orange', 2), ('banana', 3), ('apple', 4)])
1000
1001 >>> # dictionary sorted by length of the key string
1002 >>> OrderedDict(sorted(d.items(), key=lambda t: len(t[0])))
1003 OrderedDict([('pear', 1), ('apple', 4), ('orange', 2), ('banana', 3)])
1004
1005The new sorted dictionaries maintain their sort order when entries
1006are deleted. But when new keys are added, the keys are appended
1007to the end and the sort is not maintained.
1008
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001009It is also straight-forward to create an ordered dictionary variant
1010that the remembers the order the keys were *last* inserted.
1011If a new entry overwrites an existing entry, the
1012original insertion position is changed and moved to the end::
1013
1014 class LastUpdatedOrderedDict(OrderedDict):
Georg Brandl77570e22010-12-18 16:21:58 +00001015 'Store items in the order the keys were last added'
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001016
Raymond Hettinger4821ef82010-07-31 10:14:41 +00001017 def __setitem__(self, key, value):
1018 if key in self:
1019 del self[key]
1020 OrderedDict.__setitem__(self, key, value)
1021
Éric Araujo889a7dc2011-08-19 00:40:46 +02001022An ordered dictionary can be combined with the :class:`Counter` class
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001023so that the counter remembers the order elements are first encountered::
1024
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001025 class OrderedCounter(Counter, OrderedDict):
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001026 'Counter that remembers the order elements are first encountered'
1027
Raymond Hettinger7bba6832011-04-15 17:43:19 -07001028 def __repr__(self):
1029 return '%s(%r)' % (self.__class__.__name__, OrderedDict(self))
1030
1031 def __reduce__(self):
1032 return self.__class__, (OrderedDict(self),)
1033
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001034
1035:class:`UserDict` objects
Mark Summerfield8f2d0062008-02-06 13:30:44 +00001036-------------------------
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001037
Georg Brandl48310cd2009-01-03 21:18:54 +00001038The class, :class:`UserDict` acts as a wrapper around dictionary objects.
1039The need for this class has been partially supplanted by the ability to
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001040subclass directly from :class:`dict`; however, this class can be easier
1041to work with because the underlying dictionary is accessible as an
1042attribute.
1043
1044.. class:: UserDict([initialdata])
1045
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001046 Class that simulates a dictionary. The instance's contents are kept in a
1047 regular dictionary, which is accessible via the :attr:`data` attribute of
1048 :class:`UserDict` instances. If *initialdata* is provided, :attr:`data` is
1049 initialized with its contents; note that a reference to *initialdata* will not
1050 be kept, allowing it be used for other purposes.
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001051
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001052 In addition to supporting the methods and operations of mappings,
1053 :class:`UserDict` instances provide the following attribute:
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001054
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001055 .. attribute:: data
Raymond Hettingere4c96ad2008-02-06 01:23:58 +00001056
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001057 A real dictionary used to store the contents of the :class:`UserDict`
1058 class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001059
1060
1061
1062:class:`UserList` objects
1063-------------------------
1064
1065This class acts as a wrapper around list objects. It is a useful base class
Georg Brandl48310cd2009-01-03 21:18:54 +00001066for your own list-like classes which can inherit from them and override
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001067existing methods or add new ones. In this way, one can add new behaviors to
1068lists.
1069
Georg Brandl48310cd2009-01-03 21:18:54 +00001070The need for this class has been partially supplanted by the ability to
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001071subclass directly from :class:`list`; however, this class can be easier
1072to work with because the underlying list is accessible as an attribute.
1073
1074.. class:: UserList([list])
1075
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001076 Class that simulates a list. The instance's contents are kept in a regular
1077 list, which is accessible via the :attr:`data` attribute of :class:`UserList`
1078 instances. The instance's contents are initially set to a copy of *list*,
1079 defaulting to the empty list ``[]``. *list* can be any iterable, for
1080 example a real Python list or a :class:`UserList` object.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001081
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001082 In addition to supporting the methods and operations of mutable sequences,
1083 :class:`UserList` instances provide the following attribute:
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001084
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001085 .. attribute:: data
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001086
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001087 A real :class:`list` object used to store the contents of the
1088 :class:`UserList` class.
Raymond Hettinger53dbe392008-02-12 20:03:09 +00001089
1090**Subclassing requirements:** Subclasses of :class:`UserList` are expect to
1091offer a constructor which can be called with either no arguments or one
1092argument. List operations which return a new sequence attempt to create an
1093instance of the actual implementation class. To do so, it assumes that the
1094constructor can be called with a single parameter, which is a sequence object
1095used as a data source.
1096
1097If a derived class does not wish to comply with this requirement, all of the
1098special methods supported by this class will need to be overridden; please
1099consult the sources for information about the methods which need to be provided
1100in that case.
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001101
1102:class:`UserString` objects
Christian Heimesc3f30c42008-02-22 16:37:40 +00001103---------------------------
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001104
Georg Brandl48310cd2009-01-03 21:18:54 +00001105The class, :class:`UserString` acts as a wrapper around string objects.
1106The need for this class has been partially supplanted by the ability to
Raymond Hettingerb3a65f82008-02-21 22:11:37 +00001107subclass directly from :class:`str`; however, this class can be easier
1108to work with because the underlying string is accessible as an
1109attribute.
1110
1111.. class:: UserString([sequence])
1112
Raymond Hettinger7929cfb2012-06-09 19:15:26 -07001113 Class that simulates a string or a Unicode string object. The instance's
1114 content is kept in a regular string object, which is accessible via the
1115 :attr:`data` attribute of :class:`UserString` instances. The instance's
1116 contents are initially set to a copy of *sequence*. The *sequence* can
1117 be an instance of :class:`bytes`, :class:`str`, :class:`UserString` (or a
1118 subclass) or an arbitrary sequence which can be converted into a string using
1119 the built-in :func:`str` function.