blob: 8c1f08b354056c0507361bc7cc5623b1e88861a8 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`shelve` --- Python object persistence
2===========================================
3
4.. module:: shelve
5 :synopsis: Python object persistence.
6
7
8.. index:: module: pickle
9
10A "shelf" is a persistent, dictionary-like object. The difference with "dbm"
11databases is that the values (not the keys!) in a shelf can be essentially
12arbitrary Python objects --- anything that the :mod:`pickle` module can handle.
13This includes most class instances, recursive data types, and objects containing
14lots of shared sub-objects. The keys are ordinary strings.
15
16
17.. function:: open(filename[, flag='c'[, protocol=None[, writeback=False]]])
18
19 Open a persistent dictionary. The filename specified is the base filename for
20 the underlying database. As a side-effect, an extension may be added to the
21 filename and more than one file may be created. By default, the underlying
22 database file is opened for reading and writing. The optional *flag* parameter
Georg Brandl0a7ac7d2008-05-26 10:29:35 +000023 has the same interpretation as the *flag* parameter of :func:`dbm.open`.
Georg Brandl116aa622007-08-15 14:28:22 +000024
Raymond Hettinger85602262009-02-03 04:19:10 +000025 By default, version 3 pickles are used to serialize values. The version of the
Georg Brandl116aa622007-08-15 14:28:22 +000026 pickle protocol can be specified with the *protocol* parameter.
27
Georg Brandl116aa622007-08-15 14:28:22 +000028 By default, mutations to persistent-dictionary mutable entries are not
29 automatically written back. If the optional *writeback* parameter is set to
30 *True*, all entries accessed are cached in memory, and written back at close
31 time; this can make it handier to mutate mutable entries in the persistent
32 dictionary, but, if many entries are accessed, it can consume vast amounts of
33 memory for the cache, and it can make the close operation very slow since all
34 accessed entries are written back (there is no way to determine which accessed
35 entries are mutable, nor which ones were actually mutated).
36
Benjamin Peterson25c95f12009-05-08 20:42:26 +000037Shelf objects support all methods supported by dictionaries. This eases the
Georg Brandl116aa622007-08-15 14:28:22 +000038transition from dictionary based scripts to those requiring persistent storage.
39
40One additional method is supported:
41
42
43.. method:: Shelf.sync()
44
45 Write back all entries in the cache if the shelf was opened with *writeback* set
46 to *True*. Also empty the cache and synchronize the persistent dictionary on
47 disk, if feasible. This is called automatically when the shelf is closed with
48 :meth:`close`.
49
Raymond Hettinger65c9eb22009-04-04 05:39:52 +000050.. seealso::
51
52 `Persistent dictionary recipe <http://code.activestate.com/recipes/576642/>`_
53 with widely supported storage formats and having the speed of native
54 dictionaries.
55
Georg Brandl116aa622007-08-15 14:28:22 +000056
57Restrictions
58------------
59
60 .. index::
Georg Brandl0a7ac7d2008-05-26 10:29:35 +000061 module: dbm.ndbm
62 module: dbm.gnu
Georg Brandl116aa622007-08-15 14:28:22 +000063
Benjamin Peterson9a46cab2008-09-08 02:49:30 +000064* The choice of which database package will be used (such as :mod:`dbm.ndbm` or
65 :mod:`dbm.gnu`) depends on which interface is available. Therefore it is not
66 safe to open the database directly using :mod:`dbm`. The database is also
67 (unfortunately) subject to the limitations of :mod:`dbm`, if it is used ---
Georg Brandl116aa622007-08-15 14:28:22 +000068 this means that (the pickled representation of) the objects stored in the
Benjamin Peterson9a46cab2008-09-08 02:49:30 +000069 database should be fairly small, and in rare cases key collisions may cause
70 the database to refuse updates.
Georg Brandl116aa622007-08-15 14:28:22 +000071
72* Depending on the implementation, closing a persistent dictionary may or may
73 not be necessary to flush changes to disk. The :meth:`__del__` method of the
74 :class:`Shelf` class calls the :meth:`close` method, so the programmer generally
75 need not do this explicitly.
76
77* The :mod:`shelve` module does not support *concurrent* read/write access to
78 shelved objects. (Multiple simultaneous read accesses are safe.) When a
79 program has a shelf open for writing, no other program should have it open for
80 reading or writing. Unix file locking can be used to solve this, but this
81 differs across Unix versions and requires knowledge about the database
82 implementation used.
83
84
85.. class:: Shelf(dict[, protocol=None[, writeback=False]])
86
Georg Brandlc7723722008-05-26 17:47:11 +000087 A subclass of :class:`collections.MutableMapping` which stores pickled values
88 in the *dict* object.
Georg Brandl116aa622007-08-15 14:28:22 +000089
90 By default, version 0 pickles are used to serialize values. The version of the
91 pickle protocol can be specified with the *protocol* parameter. See the
92 :mod:`pickle` documentation for a discussion of the pickle protocols.
93
Georg Brandl116aa622007-08-15 14:28:22 +000094 If the *writeback* parameter is ``True``, the object will hold a cache of all
95 entries accessed and write them back to the *dict* at sync and close times.
96 This allows natural operations on mutable entries, but can consume much more
97 memory and make sync and close take a long time.
98
99
100.. class:: BsdDbShelf(dict[, protocol=None[, writeback=False]])
101
102 A subclass of :class:`Shelf` which exposes :meth:`first`, :meth:`next`,
103 :meth:`previous`, :meth:`last` and :meth:`set_location` which are available in
104 the :mod:`bsddb` module but not in other database modules. The *dict* object
105 passed to the constructor must support those methods. This is generally
106 accomplished by calling one of :func:`bsddb.hashopen`, :func:`bsddb.btopen` or
107 :func:`bsddb.rnopen`. The optional *protocol* and *writeback* parameters have
108 the same interpretation as for the :class:`Shelf` class.
109
110
111.. class:: DbfilenameShelf(filename[, flag='c'[, protocol=None[, writeback=False]]])
112
113 A subclass of :class:`Shelf` which accepts a *filename* instead of a dict-like
Georg Brandl0a7ac7d2008-05-26 10:29:35 +0000114 object. The underlying file will be opened using :func:`dbm.open`. By
Georg Brandl116aa622007-08-15 14:28:22 +0000115 default, the file will be created and opened for both read and write. The
116 optional *flag* parameter has the same interpretation as for the :func:`open`
117 function. The optional *protocol* and *writeback* parameters have the same
118 interpretation as for the :class:`Shelf` class.
119
120
121Example
122-------
123
124To summarize the interface (``key`` is a string, ``data`` is an arbitrary
125object)::
126
127 import shelve
128
129 d = shelve.open(filename) # open -- file may get suffix added by low-level
130 # library
131
132 d[key] = data # store data at key (overwrites old data if
133 # using an existing key)
134 data = d[key] # retrieve a COPY of data at key (raise KeyError if no
135 # such key)
136 del d[key] # delete data stored at key (raises KeyError
137 # if no such key)
Collin Winterc79461b2007-09-01 23:34:30 +0000138 flag = key in d # true if the key exists
Georg Brandl116aa622007-08-15 14:28:22 +0000139 klist = d.keys() # a list of all existing keys (slow!)
140
141 # as d was opened WITHOUT writeback=True, beware:
142 d['xx'] = range(4) # this works as expected, but...
Benjamin Petersond23f8222009-04-05 19:13:16 +0000143 d['xx'].append(5) # *this doesn't!* -- d['xx'] is STILL range(4)!
Georg Brandl116aa622007-08-15 14:28:22 +0000144
145 # having opened d without writeback=True, you need to code carefully:
146 temp = d['xx'] # extracts the copy
147 temp.append(5) # mutates the copy
148 d['xx'] = temp # stores the copy right back, to persist it
149
150 # or, d=shelve.open(filename,writeback=True) would let you just code
151 # d['xx'].append(5) and have it work as expected, BUT it would also
152 # consume more memory and make the d.close() operation slower.
153
154 d.close() # close it
155
156
157.. seealso::
158
Georg Brandl0a7ac7d2008-05-26 10:29:35 +0000159 Module :mod:`dbm`
160 Generic interface to ``dbm``-style databases.
Georg Brandl116aa622007-08-15 14:28:22 +0000161
Georg Brandl116aa622007-08-15 14:28:22 +0000162 Module :mod:`pickle`
163 Object serialization used by :mod:`shelve`.
164
165 Module :mod:`cPickle`
166 High-performance version of :mod:`pickle`.
167