Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 1 | \section{\module{collections} --- |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 2 | High-performance container datatypes} |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 3 | |
| 4 | \declaremodule{standard}{collections} |
| 5 | \modulesynopsis{High-performance datatypes} |
| 6 | \moduleauthor{Raymond Hettinger}{python@rcn.com} |
| 7 | \sectionauthor{Raymond Hettinger}{python@rcn.com} |
| 8 | \versionadded{2.4} |
| 9 | |
| 10 | |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 11 | This module implements high-performance container datatypes. Currently, the |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 12 | only datatype is a deque. Future additions may include B-trees |
| 13 | and Fibonacci heaps. |
| 14 | |
| 15 | \begin{funcdesc}{deque}{\optional{iterable}} |
| 16 | Returns a new deque objected initialized left-to-right (using |
| 17 | \method{append()}) with data from \var{iterable}. If \var{iterable} |
| 18 | is not specified, the new deque is empty. |
| 19 | |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 20 | Deques are a generalization of stacks and queues (the name is pronounced |
| 21 | ``deck'' and is short for ``double-ended queue''). Deques support |
| 22 | thread-safe, memory efficient appends and pops from either side of the deque |
| 23 | with approximately the same \code{O(1)} performance in either direction. |
| 24 | |
| 25 | Though \class{list} objects support similar operations, they are optimized |
| 26 | for fast fixed-length operations and incur \code{O(n)} memory movement costs |
| 27 | for \samp{pop(0)} and \samp{insert(0, v)} operations which change both the |
| 28 | size and position of the underlying data representation. |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 29 | \versionadded{2.4} |
| 30 | \end{funcdesc} |
| 31 | |
| 32 | Deque objects support the following methods: |
| 33 | |
| 34 | \begin{methoddesc}{append}{x} |
| 35 | Add \var{x} to the right side of the deque. |
| 36 | \end{methoddesc} |
| 37 | |
| 38 | \begin{methoddesc}{appendleft}{x} |
| 39 | Add \var{x} to the left side of the deque. |
| 40 | \end{methoddesc} |
| 41 | |
| 42 | \begin{methoddesc}{clear}{} |
| 43 | Remove all elements from the deque leaving it with length 0. |
| 44 | \end{methoddesc} |
| 45 | |
Raymond Hettinger | 3ba85c2 | 2004-02-06 19:04:56 +0000 | [diff] [blame] | 46 | \begin{methoddesc}{extend}{iterable} |
| 47 | Extend the right side of the deque by appending elements from |
| 48 | the iterable argument. |
| 49 | \end{methoddesc} |
| 50 | |
| 51 | \begin{methoddesc}{extendleft}{iterable} |
| 52 | Extend the left side of the deque by appending elements from |
| 53 | \var{iterable}. Note, the series of left appends results in |
| 54 | reversing the order of elements in the iterable argument. |
| 55 | \end{methoddesc} |
| 56 | |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 57 | \begin{methoddesc}{pop}{} |
| 58 | Remove and return an element from the right side of the deque. |
Raymond Hettinger | 738ec90 | 2004-02-29 02:15:56 +0000 | [diff] [blame] | 59 | If no elements are present, raises a \exception{IndexError}. |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 60 | \end{methoddesc} |
| 61 | |
| 62 | \begin{methoddesc}{popleft}{} |
| 63 | Remove and return an element from the left side of the deque. |
Raymond Hettinger | 738ec90 | 2004-02-29 02:15:56 +0000 | [diff] [blame] | 64 | If no elements are present, raises a \exception{IndexError}. |
| 65 | \end{methoddesc} |
| 66 | |
Raymond Hettinger | 4aec61e | 2005-03-18 21:20:23 +0000 | [diff] [blame^] | 67 | \begin{methoddesc}{remove}{value} |
| 68 | Removed the first occurrence of \var{value}. If not found, |
| 69 | raises a \exception{ValueError}. |
| 70 | \versionadded{2.5} |
| 71 | \end{methoddesc} |
| 72 | |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 73 | \begin{methoddesc}{rotate}{n} |
| 74 | Rotate the deque \var{n} steps to the right. If \var{n} is |
| 75 | negative, rotate to the left. Rotating one step to the right |
Raymond Hettinger | f5f9a370 | 2004-04-30 22:52:50 +0000 | [diff] [blame] | 76 | is equivalent to: \samp{d.appendleft(d.pop())}. |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 77 | \end{methoddesc} |
| 78 | |
| 79 | In addition to the above, deques support iteration, pickling, \samp{len(d)}, |
Raymond Hettinger | 0a4977c | 2004-03-01 23:16:22 +0000 | [diff] [blame] | 80 | \samp{reversed(d)}, \samp{copy.copy(d)}, \samp{copy.deepcopy(d)}, |
| 81 | membership testing with the \keyword{in} operator, and subscript references |
| 82 | such as \samp{d[-1]}. |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 83 | |
| 84 | Example: |
| 85 | |
| 86 | \begin{verbatim} |
| 87 | >>> from collections import deque |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 88 | >>> d = deque('ghi') # make a new deque with three items |
| 89 | >>> for elem in d: # iterate over the deque's elements |
Raymond Hettinger | 738ec90 | 2004-02-29 02:15:56 +0000 | [diff] [blame] | 90 | ... print elem.upper() |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 91 | G |
| 92 | H |
| 93 | I |
Raymond Hettinger | 738ec90 | 2004-02-29 02:15:56 +0000 | [diff] [blame] | 94 | |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 95 | >>> d.append('j') # add a new entry to the right side |
| 96 | >>> d.appendleft('f') # add a new entry to the left side |
| 97 | >>> d # show the representation of the deque |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 98 | deque(['f', 'g', 'h', 'i', 'j']) |
Raymond Hettinger | 738ec90 | 2004-02-29 02:15:56 +0000 | [diff] [blame] | 99 | |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 100 | >>> d.pop() # return and remove the rightmost item |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 101 | 'j' |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 102 | >>> d.popleft() # return and remove the leftmost item |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 103 | 'f' |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 104 | >>> list(d) # list the contents of the deque |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 105 | ['g', 'h', 'i'] |
Raymond Hettinger | 0a4977c | 2004-03-01 23:16:22 +0000 | [diff] [blame] | 106 | >>> d[0] # peek at leftmost item |
Raymond Hettinger | 738ec90 | 2004-02-29 02:15:56 +0000 | [diff] [blame] | 107 | 'g' |
Raymond Hettinger | 0a4977c | 2004-03-01 23:16:22 +0000 | [diff] [blame] | 108 | >>> d[-1] # peek at rightmost item |
Raymond Hettinger | 738ec90 | 2004-02-29 02:15:56 +0000 | [diff] [blame] | 109 | 'i' |
Raymond Hettinger | 0a4977c | 2004-03-01 23:16:22 +0000 | [diff] [blame] | 110 | |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 111 | >>> list(reversed(d)) # list the contents of a deque in reverse |
Raymond Hettinger | c058fd1 | 2004-02-07 02:45:22 +0000 | [diff] [blame] | 112 | ['i', 'h', 'g'] |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 113 | >>> 'h' in d # search the deque |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 114 | True |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 115 | >>> d.extend('jkl') # add multiple elements at once |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 116 | >>> d |
| 117 | deque(['g', 'h', 'i', 'j', 'k', 'l']) |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 118 | >>> d.rotate(1) # right rotation |
| 119 | >>> d |
| 120 | deque(['l', 'g', 'h', 'i', 'j', 'k']) |
| 121 | >>> d.rotate(-1) # left rotation |
| 122 | >>> d |
| 123 | deque(['g', 'h', 'i', 'j', 'k', 'l']) |
Raymond Hettinger | 738ec90 | 2004-02-29 02:15:56 +0000 | [diff] [blame] | 124 | |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 125 | >>> deque(reversed(d)) # make a new deque in reverse order |
| 126 | deque(['l', 'k', 'j', 'i', 'h', 'g']) |
| 127 | >>> d.clear() # empty the deque |
| 128 | >>> d.pop() # cannot pop from an empty deque |
Raymond Hettinger | e52f3b1 | 2004-01-29 07:27:45 +0000 | [diff] [blame] | 129 | Traceback (most recent call last): |
| 130 | File "<pyshell#6>", line 1, in -toplevel- |
| 131 | d.pop() |
Raymond Hettinger | 738ec90 | 2004-02-29 02:15:56 +0000 | [diff] [blame] | 132 | IndexError: pop from an empty deque |
Raymond Hettinger | 3ba85c2 | 2004-02-06 19:04:56 +0000 | [diff] [blame] | 133 | |
Raymond Hettinger | 5c5eb86 | 2004-02-07 21:13:00 +0000 | [diff] [blame] | 134 | >>> d.extendleft('abc') # extendleft() reverses the input order |
Raymond Hettinger | 3ba85c2 | 2004-02-06 19:04:56 +0000 | [diff] [blame] | 135 | >>> d |
| 136 | deque(['c', 'b', 'a']) |
Raymond Hettinger | f5f9a370 | 2004-04-30 22:52:50 +0000 | [diff] [blame] | 137 | \end{verbatim} |
Raymond Hettinger | 3ba85c2 | 2004-02-06 19:04:56 +0000 | [diff] [blame] | 138 | |
Raymond Hettinger | e7169eb | 2004-05-09 01:15:01 +0000 | [diff] [blame] | 139 | \subsection{Recipes \label{deque-recipes}} |
| 140 | |
| 141 | This section shows various approaches to working with deques. |
| 142 | |
| 143 | The \method{rotate()} method provides a way to implement \class{deque} |
Raymond Hettinger | 2e66940 | 2004-06-12 07:59:40 +0000 | [diff] [blame] | 144 | slicing and deletion. For example, a pure python implementation of |
| 145 | \code{del d[n]} relies on the \method{rotate()} method to position |
| 146 | elements to be popped: |
| 147 | |
Raymond Hettinger | e7169eb | 2004-05-09 01:15:01 +0000 | [diff] [blame] | 148 | \begin{verbatim} |
| 149 | def delete_nth(d, n): |
Raymond Hettinger | e7169eb | 2004-05-09 01:15:01 +0000 | [diff] [blame] | 150 | d.rotate(-n) |
| 151 | d.popleft() |
| 152 | d.rotate(n) |
Raymond Hettinger | e7169eb | 2004-05-09 01:15:01 +0000 | [diff] [blame] | 153 | \end{verbatim} |
| 154 | |
Raymond Hettinger | 0e371f2 | 2004-05-12 20:55:56 +0000 | [diff] [blame] | 155 | To implement \class{deque} slicing, use a similar approach applying |
| 156 | \method{rotate()} to bring a target element to the left side of the deque. |
| 157 | Remove old entries with \method{popleft()}, add new entries with |
| 158 | \method{extend()}, and then reverse the rotation. |
Raymond Hettinger | e7169eb | 2004-05-09 01:15:01 +0000 | [diff] [blame] | 159 | |
| 160 | With minor variations on that approach, it is easy to implement Forth style |
| 161 | stack manipulations such as \code{dup}, \code{drop}, \code{swap}, \code{over}, |
| 162 | \code{pick}, \code{rot}, and \code{roll}. |
Raymond Hettinger | f5f9a370 | 2004-04-30 22:52:50 +0000 | [diff] [blame] | 163 | |
| 164 | A roundrobin task server can be built from a \class{deque} using |
| 165 | \method{popleft()} to select the current task and \method{append()} |
| 166 | to add it back to the tasklist if the input stream is not exhausted: |
| 167 | |
| 168 | \begin{verbatim} |
| 169 | def roundrobin(*iterables): |
| 170 | pending = deque(iter(i) for i in iterables) |
| 171 | while pending: |
| 172 | task = pending.popleft() |
| 173 | try: |
| 174 | yield task.next() |
| 175 | except StopIteration: |
| 176 | continue |
| 177 | pending.append(task) |
| 178 | |
| 179 | >>> for value in roundrobin('abc', 'd', 'efgh'): |
Raymond Hettinger | e7169eb | 2004-05-09 01:15:01 +0000 | [diff] [blame] | 180 | ... print value |
Raymond Hettinger | f5f9a370 | 2004-04-30 22:52:50 +0000 | [diff] [blame] | 181 | |
| 182 | a |
| 183 | d |
| 184 | e |
| 185 | b |
| 186 | f |
| 187 | c |
| 188 | g |
| 189 | h |
| 190 | |
| 191 | \end{verbatim} |
Raymond Hettinger | e7169eb | 2004-05-09 01:15:01 +0000 | [diff] [blame] | 192 | |
| 193 | |
| 194 | Multi-pass data reduction algorithms can be succinctly expressed and |
Raymond Hettinger | 2e66940 | 2004-06-12 07:59:40 +0000 | [diff] [blame] | 195 | efficiently coded by extracting elements with multiple calls to |
| 196 | \method{popleft()}, applying the reduction function, and calling |
| 197 | \method{append()} to add the result back to the queue. |
Raymond Hettinger | e7169eb | 2004-05-09 01:15:01 +0000 | [diff] [blame] | 198 | |
| 199 | For example, building a balanced binary tree of nested lists entails |
| 200 | reducing two adjacent nodes into one by grouping them in a list: |
| 201 | |
| 202 | \begin{verbatim} |
| 203 | def maketree(iterable): |
| 204 | d = deque(iterable) |
| 205 | while len(d) > 1: |
| 206 | pair = [d.popleft(), d.popleft()] |
| 207 | d.append(pair) |
| 208 | return list(d) |
| 209 | |
| 210 | >>> print maketree('abcdefgh') |
| 211 | [[[['a', 'b'], ['c', 'd']], [['e', 'f'], ['g', 'h']]]] |
| 212 | |
| 213 | \end{verbatim} |