blob: 86ac4ef4816d836eb024bfab9f952bed7b123591 [file] [log] [blame]
Guido van Rossum2bc13791999-03-24 19:06:42 +00001/* Dictionary object implementation using a hash table */
Guido van Rossum9bfef441993-03-29 10:43:31 +00002
Raymond Hettinger930427b2003-05-03 06:51:59 +00003/* The distribution includes a separate file, Objects/dictnotes.txt,
Tim Peters60b29962006-01-01 01:19:23 +00004 describing explorations into dictionary design and optimization.
Raymond Hettinger930427b2003-05-03 06:51:59 +00005 It covers typical dictionary use patterns, the parameters for
6 tuning dictionaries, and several ideas for possible optimizations.
7*/
8
Victor Stinner742da042016-09-07 17:40:12 -07009/* PyDictKeysObject
10
11This implements the dictionary's hashtable.
12
Raymond Hettingerb12785d2016-10-22 09:58:14 -070013As of Python 3.6, this is compact and ordered. Basic idea is described here:
14* https://mail.python.org/pipermail/python-dev/2012-December/123028.html
15* https://morepypy.blogspot.com/2015/01/faster-more-memory-efficient-and-more.html
Victor Stinner742da042016-09-07 17:40:12 -070016
17layout:
18
19+---------------+
20| dk_refcnt |
21| dk_size |
22| dk_lookup |
23| dk_usable |
24| dk_nentries |
25+---------------+
26| dk_indices |
27| |
28+---------------+
29| dk_entries |
30| |
31+---------------+
32
33dk_indices is actual hashtable. It holds index in entries, or DKIX_EMPTY(-1)
34or DKIX_DUMMY(-2).
35Size of indices is dk_size. Type of each index in indices is vary on dk_size:
36
37* int8 for dk_size <= 128
38* int16 for 256 <= dk_size <= 2**15
39* int32 for 2**16 <= dk_size <= 2**31
40* int64 for 2**32 <= dk_size
41
dalgarno359143c2019-09-10 10:45:07 +010042dk_entries is array of PyDictKeyEntry. Its size is USABLE_FRACTION(dk_size).
Victor Stinner742da042016-09-07 17:40:12 -070043DK_ENTRIES(dk) can be used to get pointer to entries.
44
45NOTE: Since negative value is used for DKIX_EMPTY and DKIX_DUMMY, type of
46dk_indices entry is signed integer and int16 is used for table which
47dk_size == 256.
48*/
49
Benjamin Peterson7d95e402012-04-23 11:24:50 -040050
51/*
Benjamin Peterson7d95e402012-04-23 11:24:50 -040052The DictObject can be in one of two forms.
Victor Stinner742da042016-09-07 17:40:12 -070053
Benjamin Peterson7d95e402012-04-23 11:24:50 -040054Either:
55 A combined table:
56 ma_values == NULL, dk_refcnt == 1.
57 Values are stored in the me_value field of the PyDictKeysObject.
Benjamin Peterson7d95e402012-04-23 11:24:50 -040058Or:
59 A split table:
60 ma_values != NULL, dk_refcnt >= 1
61 Values are stored in the ma_values array.
Victor Stinner742da042016-09-07 17:40:12 -070062 Only string (unicode) keys are allowed.
63 All dicts sharing same key must have same insertion order.
Benjamin Peterson7d95e402012-04-23 11:24:50 -040064
Victor Stinner742da042016-09-07 17:40:12 -070065There are four kinds of slots in the table (slot is index, and
66DK_ENTRIES(keys)[index] if index >= 0):
67
681. Unused. index == DKIX_EMPTY
69 Does not hold an active (key, value) pair now and never did. Unused can
70 transition to Active upon key insertion. This is each slot's initial state.
71
722. Active. index >= 0, me_key != NULL and me_value != NULL
73 Holds an active (key, value) pair. Active can transition to Dummy or
74 Pending upon key deletion (for combined and split tables respectively).
75 This is the only case in which me_value != NULL.
76
773. Dummy. index == DKIX_DUMMY (combined only)
78 Previously held an active (key, value) pair, but that was deleted and an
79 active pair has not yet overwritten the slot. Dummy can transition to
80 Active upon key insertion. Dummy slots cannot be made Unused again
81 else the probe sequence in case of collision would have no way to know
82 they were once active.
83
844. Pending. index >= 0, key != NULL, and value == NULL (split only)
85 Not yet inserted in split-table.
Benjamin Peterson7d95e402012-04-23 11:24:50 -040086*/
87
Victor Stinner742da042016-09-07 17:40:12 -070088/*
89Preserving insertion order
Benjamin Peterson7d95e402012-04-23 11:24:50 -040090
Victor Stinner742da042016-09-07 17:40:12 -070091It's simple for combined table. Since dk_entries is mostly append only, we can
92get insertion order by just iterating dk_entries.
93
94One exception is .popitem(). It removes last item in dk_entries and decrement
95dk_nentries to achieve amortized O(1). Since there are DKIX_DUMMY remains in
96dk_indices, we can't increment dk_usable even though dk_nentries is
97decremented.
98
99In split table, inserting into pending entry is allowed only for dk_entries[ix]
100where ix == mp->ma_used. Inserting into other index and deleting item cause
101converting the dict to the combined table.
102*/
103
104/* PyDict_MINSIZE is the starting size for any new dict.
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400105 * 8 allows dicts with no more than 5 active entries; experiments suggested
106 * this suffices for the majority of dicts (consisting mostly of usually-small
107 * dicts created to pass keyword arguments).
108 * Making this 8, rather than 4 reduces the number of resizes for most
109 * dictionaries, without any significant extra memory use.
110 */
Victor Stinner742da042016-09-07 17:40:12 -0700111#define PyDict_MINSIZE 8
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400112
Guido van Rossumc0b618a1997-05-02 03:12:38 +0000113#include "Python.h"
Victor Stinnerbcda8f12018-11-21 22:27:47 +0100114#include "pycore_object.h"
Victor Stinner621cebe2018-11-12 16:53:38 +0100115#include "pycore_pystate.h"
Eric Snow96c6af92015-05-29 22:21:39 -0600116#include "dict-common.h"
Victor Stinner990397e2016-09-09 20:22:59 -0700117#include "stringlib/eq.h" /* to get unicode_eq() */
Guido van Rossum4b1302b1993-03-27 18:11:32 +0000118
Larry Hastings61272b72014-01-07 12:41:53 -0800119/*[clinic input]
Larry Hastingsc2047262014-01-25 20:43:29 -0800120class dict "PyDictObject *" "&PyDict_Type"
Larry Hastings61272b72014-01-07 12:41:53 -0800121[clinic start generated code]*/
Larry Hastings581ee362014-01-28 05:00:08 -0800122/*[clinic end generated code: output=da39a3ee5e6b4b0d input=f157a5a0ce9589d6]*/
Larry Hastings44e2eaa2013-11-23 15:37:55 -0800123
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400124
125/*
126To ensure the lookup algorithm terminates, there must be at least one Unused
127slot (NULL key) in the table.
128To avoid slowing down lookups on a near-full table, we resize the table when
129it's USABLE_FRACTION (currently two-thirds) full.
130*/
Guido van Rossum16e93a81997-01-28 00:00:11 +0000131
Tim Peterseb28ef22001-06-02 05:27:19 +0000132#define PERTURB_SHIFT 5
133
Guido van Rossum16e93a81997-01-28 00:00:11 +0000134/*
Tim Peterseb28ef22001-06-02 05:27:19 +0000135Major subtleties ahead: Most hash schemes depend on having a "good" hash
136function, in the sense of simulating randomness. Python doesn't: its most
R David Murray537ad7a2016-07-10 12:33:18 -0400137important hash functions (for ints) are very regular in common
Tim Peterseb28ef22001-06-02 05:27:19 +0000138cases:
Tim Peters15d49292001-05-27 07:39:22 +0000139
R David Murray537ad7a2016-07-10 12:33:18 -0400140 >>>[hash(i) for i in range(4)]
Guido van Rossumdc5f6b22006-08-24 21:29:26 +0000141 [0, 1, 2, 3]
Tim Peters15d49292001-05-27 07:39:22 +0000142
Tim Peterseb28ef22001-06-02 05:27:19 +0000143This isn't necessarily bad! To the contrary, in a table of size 2**i, taking
144the low-order i bits as the initial table index is extremely fast, and there
R David Murray537ad7a2016-07-10 12:33:18 -0400145are no collisions at all for dicts indexed by a contiguous range of ints. So
146this gives better-than-random behavior in common cases, and that's very
147desirable.
Tim Peters15d49292001-05-27 07:39:22 +0000148
Tim Peterseb28ef22001-06-02 05:27:19 +0000149OTOH, when collisions occur, the tendency to fill contiguous slices of the
150hash table makes a good collision resolution strategy crucial. Taking only
151the last i bits of the hash code is also vulnerable: for example, consider
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000152the list [i << 16 for i in range(20000)] as a set of keys. Since ints are
Guido van Rossumdc5f6b22006-08-24 21:29:26 +0000153their own hash codes, and this fits in a dict of size 2**15, the last 15 bits
154 of every hash code are all 0: they *all* map to the same table index.
Tim Peters15d49292001-05-27 07:39:22 +0000155
Tim Peterseb28ef22001-06-02 05:27:19 +0000156But catering to unusual cases should not slow the usual ones, so we just take
157the last i bits anyway. It's up to collision resolution to do the rest. If
158we *usually* find the key we're looking for on the first try (and, it turns
159out, we usually do -- the table load factor is kept under 2/3, so the odds
160are solidly in our favor), then it makes best sense to keep the initial index
161computation dirt cheap.
Tim Peters15d49292001-05-27 07:39:22 +0000162
Tim Peterseb28ef22001-06-02 05:27:19 +0000163The first half of collision resolution is to visit table indices via this
164recurrence:
Tim Peters15d49292001-05-27 07:39:22 +0000165
Tim Peterseb28ef22001-06-02 05:27:19 +0000166 j = ((5*j) + 1) mod 2**i
Tim Peters15d49292001-05-27 07:39:22 +0000167
Tim Peterseb28ef22001-06-02 05:27:19 +0000168For any initial j in range(2**i), repeating that 2**i times generates each
169int in range(2**i) exactly once (see any text on random-number generation for
170proof). By itself, this doesn't help much: like linear probing (setting
171j += 1, or j -= 1, on each loop trip), it scans the table entries in a fixed
172order. This would be bad, except that's not the only thing we do, and it's
173actually *good* in the common cases where hash keys are consecutive. In an
174example that's really too small to make this entirely clear, for a table of
175size 2**3 the order of indices is:
Tim Peters15d49292001-05-27 07:39:22 +0000176
Tim Peterseb28ef22001-06-02 05:27:19 +0000177 0 -> 1 -> 6 -> 7 -> 4 -> 5 -> 2 -> 3 -> 0 [and here it's repeating]
178
179If two things come in at index 5, the first place we look after is index 2,
180not 6, so if another comes in at index 6 the collision at 5 didn't hurt it.
181Linear probing is deadly in this case because there the fixed probe order
182is the *same* as the order consecutive keys are likely to arrive. But it's
183extremely unlikely hash codes will follow a 5*j+1 recurrence by accident,
184and certain that consecutive hash codes do not.
185
186The other half of the strategy is to get the other bits of the hash code
187into play. This is done by initializing a (unsigned) vrbl "perturb" to the
188full hash code, and changing the recurrence to:
189
Tim Peterseb28ef22001-06-02 05:27:19 +0000190 perturb >>= PERTURB_SHIFT;
INADA Naoki267941c2016-10-06 15:19:07 +0900191 j = (5*j) + 1 + perturb;
Tim Peterseb28ef22001-06-02 05:27:19 +0000192 use j % 2**i as the next table index;
193
194Now the probe sequence depends (eventually) on every bit in the hash code,
195and the pseudo-scrambling property of recurring on 5*j+1 is more valuable,
196because it quickly magnifies small differences in the bits that didn't affect
197the initial index. Note that because perturb is unsigned, if the recurrence
198is executed often enough perturb eventually becomes and remains 0. At that
199point (very rarely reached) the recurrence is on (just) 5*j+1 again, and
200that's certain to find an empty slot eventually (since it generates every int
201in range(2**i), and we make sure there's always at least one empty slot).
202
203Selecting a good value for PERTURB_SHIFT is a balancing act. You want it
204small so that the high bits of the hash code continue to affect the probe
205sequence across iterations; but you want it large so that in really bad cases
206the high-order hash bits have an effect on early iterations. 5 was "the
207best" in minimizing total collisions across experiments Tim Peters ran (on
208both normal and pathological cases), but 4 and 6 weren't significantly worse.
209
Guido van Rossumdc5f6b22006-08-24 21:29:26 +0000210Historical: Reimer Behrends contributed the idea of using a polynomial-based
Tim Peterseb28ef22001-06-02 05:27:19 +0000211approach, using repeated multiplication by x in GF(2**n) where an irreducible
212polynomial for each table size was chosen such that x was a primitive root.
213Christian Tismer later extended that to use division by x instead, as an
214efficient way to get the high bits of the hash code into play. This scheme
Guido van Rossum8ce8a782007-11-01 19:42:39 +0000215also gave excellent collision statistics, but was more expensive: two
216if-tests were required inside the loop; computing "the next" index took about
217the same number of operations but without as much potential parallelism
218(e.g., computing 5*j can go on at the same time as computing 1+perturb in the
219above, and then shifting perturb can be done while the table index is being
220masked); and the PyDictObject struct required a member to hold the table's
221polynomial. In Tim's experiments the current scheme ran faster, produced
222equally good collision statistics, needed less code & used less memory.
Thomas Wouters4d70c3d2006-06-08 14:42:34 +0000223
Guido van Rossum4b1302b1993-03-27 18:11:32 +0000224*/
Tim Petersdea48ec2001-05-22 20:40:22 +0000225
Fred Drake1bff34a2000-08-31 19:31:38 +0000226/* forward declarations */
Victor Stinner742da042016-09-07 17:40:12 -0700227static Py_ssize_t lookdict(PyDictObject *mp, PyObject *key,
INADA Naoki778928b2017-08-03 23:45:15 +0900228 Py_hash_t hash, PyObject **value_addr);
Victor Stinner742da042016-09-07 17:40:12 -0700229static Py_ssize_t lookdict_unicode(PyDictObject *mp, PyObject *key,
INADA Naoki778928b2017-08-03 23:45:15 +0900230 Py_hash_t hash, PyObject **value_addr);
Victor Stinner742da042016-09-07 17:40:12 -0700231static Py_ssize_t
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400232lookdict_unicode_nodummy(PyDictObject *mp, PyObject *key,
INADA Naoki778928b2017-08-03 23:45:15 +0900233 Py_hash_t hash, PyObject **value_addr);
Victor Stinner742da042016-09-07 17:40:12 -0700234static Py_ssize_t lookdict_split(PyDictObject *mp, PyObject *key,
INADA Naoki778928b2017-08-03 23:45:15 +0900235 Py_hash_t hash, PyObject **value_addr);
Fred Drake1bff34a2000-08-31 19:31:38 +0000236
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400237static int dictresize(PyDictObject *mp, Py_ssize_t minused);
Tim Petersdea48ec2001-05-22 20:40:22 +0000238
INADA Naoki2aaf98c2018-09-26 12:59:00 +0900239static PyObject* dict_iter(PyDictObject *dict);
240
Benjamin Peterson3c569292016-09-08 13:16:41 -0700241/*Global counter used to set ma_version_tag field of dictionary.
Victor Stinner3b6a6b42016-09-08 12:51:24 -0700242 * It is incremented each time that a dictionary is created and each
243 * time that a dictionary is modified. */
244static uint64_t pydict_global_version = 0;
245
246#define DICT_NEXT_VERSION() (++pydict_global_version)
247
Victor Stinner742da042016-09-07 17:40:12 -0700248/* Dictionary reuse scheme to save calls to malloc and free */
Christian Heimes2202f872008-02-06 14:31:34 +0000249#ifndef PyDict_MAXFREELIST
250#define PyDict_MAXFREELIST 80
251#endif
252static PyDictObject *free_list[PyDict_MAXFREELIST];
253static int numfree = 0;
Victor Stinner742da042016-09-07 17:40:12 -0700254static PyDictKeysObject *keys_free_list[PyDict_MAXFREELIST];
255static int numfreekeys = 0;
Raymond Hettinger43442782004-03-17 21:55:03 +0000256
Serhiy Storchaka1009bf12015-04-03 23:53:51 +0300257#include "clinic/dictobject.c.h"
258
Antoine Pitrou9a812cb2011-11-15 00:00:12 +0100259int
260PyDict_ClearFreeList(void)
Christian Heimes77c02eb2008-02-09 02:18:51 +0000261{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000262 PyDictObject *op;
Victor Stinner742da042016-09-07 17:40:12 -0700263 int ret = numfree + numfreekeys;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000264 while (numfree) {
265 op = free_list[--numfree];
266 assert(PyDict_CheckExact(op));
267 PyObject_GC_Del(op);
268 }
Victor Stinner742da042016-09-07 17:40:12 -0700269 while (numfreekeys) {
270 PyObject_FREE(keys_free_list[--numfreekeys]);
271 }
Antoine Pitrou9a812cb2011-11-15 00:00:12 +0100272 return ret;
273}
274
David Malcolm49526f42012-06-22 14:55:41 -0400275/* Print summary info about the state of the optimized allocator */
276void
277_PyDict_DebugMallocStats(FILE *out)
278{
279 _PyDebugAllocatorStats(out,
280 "free PyDictObject", numfree, sizeof(PyDictObject));
281}
282
283
Antoine Pitrou9a812cb2011-11-15 00:00:12 +0100284void
Victor Stinnerbed48172019-08-27 00:12:32 +0200285_PyDict_Fini(void)
Antoine Pitrou9a812cb2011-11-15 00:00:12 +0100286{
287 PyDict_ClearFreeList();
Christian Heimes77c02eb2008-02-09 02:18:51 +0000288}
289
Victor Stinner742da042016-09-07 17:40:12 -0700290#define DK_SIZE(dk) ((dk)->dk_size)
291#if SIZEOF_VOID_P > 4
Victor Stinner58f7c5a2016-09-08 11:37:36 -0700292#define DK_IXSIZE(dk) \
293 (DK_SIZE(dk) <= 0xff ? \
294 1 : DK_SIZE(dk) <= 0xffff ? \
295 2 : DK_SIZE(dk) <= 0xffffffff ? \
Benjamin Peterson3c569292016-09-08 13:16:41 -0700296 4 : sizeof(int64_t))
Victor Stinner742da042016-09-07 17:40:12 -0700297#else
Victor Stinner58f7c5a2016-09-08 11:37:36 -0700298#define DK_IXSIZE(dk) \
299 (DK_SIZE(dk) <= 0xff ? \
300 1 : DK_SIZE(dk) <= 0xffff ? \
Benjamin Peterson3c569292016-09-08 13:16:41 -0700301 2 : sizeof(int32_t))
Victor Stinner742da042016-09-07 17:40:12 -0700302#endif
Victor Stinner58f7c5a2016-09-08 11:37:36 -0700303#define DK_ENTRIES(dk) \
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700304 ((PyDictKeyEntry*)(&((int8_t*)((dk)->dk_indices))[DK_SIZE(dk) * DK_IXSIZE(dk)]))
Victor Stinner742da042016-09-07 17:40:12 -0700305
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400306#define DK_MASK(dk) (((dk)->dk_size)-1)
307#define IS_POWER_OF_2(x) (((x) & (x-1)) == 0)
308
INADA Naokia7576492018-11-14 18:39:27 +0900309static void free_keys_object(PyDictKeysObject *keys);
310
311static inline void
312dictkeys_incref(PyDictKeysObject *dk)
313{
Victor Stinner49932fe2020-02-03 17:55:05 +0100314#ifdef Py_REF_DEBUG
315 _Py_RefTotal++;
316#endif
INADA Naokia7576492018-11-14 18:39:27 +0900317 dk->dk_refcnt++;
318}
319
320static inline void
321dictkeys_decref(PyDictKeysObject *dk)
322{
323 assert(dk->dk_refcnt > 0);
Victor Stinner49932fe2020-02-03 17:55:05 +0100324#ifdef Py_REF_DEBUG
325 _Py_RefTotal--;
326#endif
INADA Naokia7576492018-11-14 18:39:27 +0900327 if (--dk->dk_refcnt == 0) {
328 free_keys_object(dk);
329 }
330}
331
Victor Stinner742da042016-09-07 17:40:12 -0700332/* lookup indices. returns DKIX_EMPTY, DKIX_DUMMY, or ix >=0 */
Benjamin Peterson73222252016-09-08 09:58:47 -0700333static inline Py_ssize_t
INADA Naokia7576492018-11-14 18:39:27 +0900334dictkeys_get_index(PyDictKeysObject *keys, Py_ssize_t i)
Victor Stinner742da042016-09-07 17:40:12 -0700335{
336 Py_ssize_t s = DK_SIZE(keys);
Victor Stinner71211e32016-09-08 10:52:46 -0700337 Py_ssize_t ix;
338
Victor Stinner742da042016-09-07 17:40:12 -0700339 if (s <= 0xff) {
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700340 int8_t *indices = (int8_t*)(keys->dk_indices);
Victor Stinner208857e2016-09-08 11:35:46 -0700341 ix = indices[i];
Victor Stinner742da042016-09-07 17:40:12 -0700342 }
343 else if (s <= 0xffff) {
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700344 int16_t *indices = (int16_t*)(keys->dk_indices);
Victor Stinner208857e2016-09-08 11:35:46 -0700345 ix = indices[i];
Victor Stinner742da042016-09-07 17:40:12 -0700346 }
Benjamin Peterson3c569292016-09-08 13:16:41 -0700347#if SIZEOF_VOID_P > 4
Serhiy Storchaka473e0e42016-09-10 21:34:43 +0300348 else if (s > 0xffffffff) {
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700349 int64_t *indices = (int64_t*)(keys->dk_indices);
Victor Stinner208857e2016-09-08 11:35:46 -0700350 ix = indices[i];
Victor Stinner742da042016-09-07 17:40:12 -0700351 }
Benjamin Peterson3c569292016-09-08 13:16:41 -0700352#endif
Serhiy Storchaka473e0e42016-09-10 21:34:43 +0300353 else {
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700354 int32_t *indices = (int32_t*)(keys->dk_indices);
Serhiy Storchaka473e0e42016-09-10 21:34:43 +0300355 ix = indices[i];
356 }
Victor Stinner71211e32016-09-08 10:52:46 -0700357 assert(ix >= DKIX_DUMMY);
358 return ix;
Victor Stinner742da042016-09-07 17:40:12 -0700359}
360
361/* write to indices. */
Benjamin Peterson73222252016-09-08 09:58:47 -0700362static inline void
INADA Naokia7576492018-11-14 18:39:27 +0900363dictkeys_set_index(PyDictKeysObject *keys, Py_ssize_t i, Py_ssize_t ix)
Victor Stinner742da042016-09-07 17:40:12 -0700364{
365 Py_ssize_t s = DK_SIZE(keys);
Victor Stinner71211e32016-09-08 10:52:46 -0700366
367 assert(ix >= DKIX_DUMMY);
368
Victor Stinner742da042016-09-07 17:40:12 -0700369 if (s <= 0xff) {
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700370 int8_t *indices = (int8_t*)(keys->dk_indices);
Victor Stinner71211e32016-09-08 10:52:46 -0700371 assert(ix <= 0x7f);
Victor Stinner208857e2016-09-08 11:35:46 -0700372 indices[i] = (char)ix;
Victor Stinner742da042016-09-07 17:40:12 -0700373 }
374 else if (s <= 0xffff) {
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700375 int16_t *indices = (int16_t*)(keys->dk_indices);
Victor Stinner71211e32016-09-08 10:52:46 -0700376 assert(ix <= 0x7fff);
Victor Stinner208857e2016-09-08 11:35:46 -0700377 indices[i] = (int16_t)ix;
Victor Stinner742da042016-09-07 17:40:12 -0700378 }
Benjamin Peterson3c569292016-09-08 13:16:41 -0700379#if SIZEOF_VOID_P > 4
Serhiy Storchaka473e0e42016-09-10 21:34:43 +0300380 else if (s > 0xffffffff) {
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700381 int64_t *indices = (int64_t*)(keys->dk_indices);
Victor Stinner208857e2016-09-08 11:35:46 -0700382 indices[i] = ix;
Victor Stinner742da042016-09-07 17:40:12 -0700383 }
Benjamin Peterson3c569292016-09-08 13:16:41 -0700384#endif
Serhiy Storchaka473e0e42016-09-10 21:34:43 +0300385 else {
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700386 int32_t *indices = (int32_t*)(keys->dk_indices);
Serhiy Storchaka473e0e42016-09-10 21:34:43 +0300387 assert(ix <= 0x7fffffff);
388 indices[i] = (int32_t)ix;
389 }
Victor Stinner742da042016-09-07 17:40:12 -0700390}
391
392
Antoine Pitroua504a7a2012-06-24 21:03:45 +0200393/* USABLE_FRACTION is the maximum dictionary load.
Victor Stinner742da042016-09-07 17:40:12 -0700394 * Increasing this ratio makes dictionaries more dense resulting in more
395 * collisions. Decreasing it improves sparseness at the expense of spreading
396 * indices over more cache lines and at the cost of total memory consumed.
Antoine Pitroua504a7a2012-06-24 21:03:45 +0200397 *
398 * USABLE_FRACTION must obey the following:
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400399 * (0 < USABLE_FRACTION(n) < n) for all n >= 2
400 *
Victor Stinner742da042016-09-07 17:40:12 -0700401 * USABLE_FRACTION should be quick to calculate.
402 * Fractions around 1/2 to 2/3 seem to work well in practice.
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400403 */
Victor Stinner742da042016-09-07 17:40:12 -0700404#define USABLE_FRACTION(n) (((n) << 1)/3)
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400405
Victor Stinner742da042016-09-07 17:40:12 -0700406/* ESTIMATE_SIZE is reverse function of USABLE_FRACTION.
407 * This can be used to reserve enough size to insert n entries without
408 * resizing.
409 */
INADA Naoki92c50ee2016-11-22 00:57:02 +0900410#define ESTIMATE_SIZE(n) (((n)*3+1) >> 1)
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400411
Victor Stinner742da042016-09-07 17:40:12 -0700412/* Alternative fraction that is otherwise close enough to 2n/3 to make
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400413 * little difference. 8 * 2/3 == 8 * 5/8 == 5. 16 * 2/3 == 16 * 5/8 == 10.
414 * 32 * 2/3 = 21, 32 * 5/8 = 20.
415 * Its advantage is that it is faster to compute on machines with slow division.
416 * #define USABLE_FRACTION(n) (((n) >> 1) + ((n) >> 2) - ((n) >> 3))
Victor Stinner742da042016-09-07 17:40:12 -0700417 */
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400418
Victor Stinnera9f61a52013-07-16 22:17:26 +0200419/* GROWTH_RATE. Growth rate upon hitting maximum load.
INADA Naoki5fbc5112018-04-17 15:53:34 +0900420 * Currently set to used*3.
Victor Stinnera9f61a52013-07-16 22:17:26 +0200421 * This means that dicts double in size when growing without deletions,
Raymond Hettinger36f74aa2013-05-17 03:01:13 -0700422 * but have more head room when the number of deletions is on a par with the
INADA Naoki5fbc5112018-04-17 15:53:34 +0900423 * number of insertions. See also bpo-17563 and bpo-33205.
424 *
Raymond Hettinger36f74aa2013-05-17 03:01:13 -0700425 * GROWTH_RATE was set to used*4 up to version 3.2.
426 * GROWTH_RATE was set to used*2 in version 3.3.0
INADA Naoki5fbc5112018-04-17 15:53:34 +0900427 * GROWTH_RATE was set to used*2 + capacity/2 in 3.4.0-3.6.0.
Antoine Pitroua504a7a2012-06-24 21:03:45 +0200428 */
INADA Naoki5fbc5112018-04-17 15:53:34 +0900429#define GROWTH_RATE(d) ((d)->ma_used*3)
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400430
431#define ENSURE_ALLOWS_DELETIONS(d) \
432 if ((d)->ma_keys->dk_lookup == lookdict_unicode_nodummy) { \
433 (d)->ma_keys->dk_lookup = lookdict_unicode; \
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000434 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400435
436/* This immutable, empty PyDictKeysObject is used for PyDict_Clear()
437 * (which cannot fail and thus can do no allocation).
438 */
439static PyDictKeysObject empty_keys_struct = {
Serhiy Storchaka97932e42016-09-26 23:01:23 +0300440 1, /* dk_refcnt */
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400441 1, /* dk_size */
442 lookdict_split, /* dk_lookup */
443 0, /* dk_usable (immutable) */
Victor Stinner742da042016-09-07 17:40:12 -0700444 0, /* dk_nentries */
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700445 {DKIX_EMPTY, DKIX_EMPTY, DKIX_EMPTY, DKIX_EMPTY,
446 DKIX_EMPTY, DKIX_EMPTY, DKIX_EMPTY, DKIX_EMPTY}, /* dk_indices */
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400447};
448
449static PyObject *empty_values[1] = { NULL };
450
451#define Py_EMPTY_KEYS &empty_keys_struct
452
Victor Stinner611b0fa2016-09-14 15:02:01 +0200453/* Uncomment to check the dict content in _PyDict_CheckConsistency() */
454/* #define DEBUG_PYDICT */
455
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200456#ifdef DEBUG_PYDICT
457# define ASSERT_CONSISTENT(op) assert(_PyDict_CheckConsistency((PyObject *)(op), 1))
458#else
459# define ASSERT_CONSISTENT(op) assert(_PyDict_CheckConsistency((PyObject *)(op), 0))
460#endif
Victor Stinner611b0fa2016-09-14 15:02:01 +0200461
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200462
463int
464_PyDict_CheckConsistency(PyObject *op, int check_content)
Victor Stinner611b0fa2016-09-14 15:02:01 +0200465{
Victor Stinner68762572019-10-07 18:42:01 +0200466#define CHECK(expr) \
467 do { if (!(expr)) { _PyObject_ASSERT_FAILED_MSG(op, Py_STRINGIFY(expr)); } } while (0)
468
469 assert(op != NULL);
470 CHECK(PyDict_Check(op));
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200471 PyDictObject *mp = (PyDictObject *)op;
Victor Stinner50fe3f82018-10-26 18:47:15 +0200472
Victor Stinner611b0fa2016-09-14 15:02:01 +0200473 PyDictKeysObject *keys = mp->ma_keys;
474 int splitted = _PyDict_HasSplitTable(mp);
475 Py_ssize_t usable = USABLE_FRACTION(keys->dk_size);
Victor Stinner611b0fa2016-09-14 15:02:01 +0200476
Victor Stinner68762572019-10-07 18:42:01 +0200477 CHECK(0 <= mp->ma_used && mp->ma_used <= usable);
478 CHECK(IS_POWER_OF_2(keys->dk_size));
479 CHECK(0 <= keys->dk_usable && keys->dk_usable <= usable);
480 CHECK(0 <= keys->dk_nentries && keys->dk_nentries <= usable);
481 CHECK(keys->dk_usable + keys->dk_nentries <= usable);
Victor Stinner611b0fa2016-09-14 15:02:01 +0200482
483 if (!splitted) {
484 /* combined table */
Victor Stinner68762572019-10-07 18:42:01 +0200485 CHECK(keys->dk_refcnt == 1);
Victor Stinner611b0fa2016-09-14 15:02:01 +0200486 }
487
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200488 if (check_content) {
489 PyDictKeyEntry *entries = DK_ENTRIES(keys);
490 Py_ssize_t i;
Victor Stinner611b0fa2016-09-14 15:02:01 +0200491
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200492 for (i=0; i < keys->dk_size; i++) {
493 Py_ssize_t ix = dictkeys_get_index(keys, i);
Victor Stinner68762572019-10-07 18:42:01 +0200494 CHECK(DKIX_DUMMY <= ix && ix <= usable);
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200495 }
Victor Stinner611b0fa2016-09-14 15:02:01 +0200496
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200497 for (i=0; i < usable; i++) {
498 PyDictKeyEntry *entry = &entries[i];
499 PyObject *key = entry->me_key;
500
501 if (key != NULL) {
502 if (PyUnicode_CheckExact(key)) {
503 Py_hash_t hash = ((PyASCIIObject *)key)->hash;
Victor Stinner68762572019-10-07 18:42:01 +0200504 CHECK(hash != -1);
505 CHECK(entry->me_hash == hash);
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200506 }
507 else {
508 /* test_dict fails if PyObject_Hash() is called again */
Victor Stinner68762572019-10-07 18:42:01 +0200509 CHECK(entry->me_hash != -1);
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200510 }
511 if (!splitted) {
Victor Stinner68762572019-10-07 18:42:01 +0200512 CHECK(entry->me_value != NULL);
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200513 }
Victor Stinner611b0fa2016-09-14 15:02:01 +0200514 }
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200515
516 if (splitted) {
Victor Stinner68762572019-10-07 18:42:01 +0200517 CHECK(entry->me_value == NULL);
Victor Stinner611b0fa2016-09-14 15:02:01 +0200518 }
519 }
520
521 if (splitted) {
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200522 /* splitted table */
523 for (i=0; i < mp->ma_used; i++) {
Victor Stinner68762572019-10-07 18:42:01 +0200524 CHECK(mp->ma_values[i] != NULL);
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200525 }
Victor Stinner611b0fa2016-09-14 15:02:01 +0200526 }
527 }
Victor Stinner611b0fa2016-09-14 15:02:01 +0200528 return 1;
Victor Stinner68762572019-10-07 18:42:01 +0200529
530#undef CHECK
Victor Stinner611b0fa2016-09-14 15:02:01 +0200531}
Victor Stinner611b0fa2016-09-14 15:02:01 +0200532
533
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400534static PyDictKeysObject *new_keys_object(Py_ssize_t size)
535{
536 PyDictKeysObject *dk;
Victor Stinner742da042016-09-07 17:40:12 -0700537 Py_ssize_t es, usable;
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400538
Victor Stinner742da042016-09-07 17:40:12 -0700539 assert(size >= PyDict_MINSIZE);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400540 assert(IS_POWER_OF_2(size));
Victor Stinner742da042016-09-07 17:40:12 -0700541
542 usable = USABLE_FRACTION(size);
543 if (size <= 0xff) {
544 es = 1;
545 }
546 else if (size <= 0xffff) {
547 es = 2;
548 }
549#if SIZEOF_VOID_P > 4
550 else if (size <= 0xffffffff) {
551 es = 4;
552 }
553#endif
554 else {
555 es = sizeof(Py_ssize_t);
556 }
557
558 if (size == PyDict_MINSIZE && numfreekeys > 0) {
559 dk = keys_free_list[--numfreekeys];
560 }
561 else {
Victor Stinner98ee9d52016-09-08 09:33:56 -0700562 dk = PyObject_MALLOC(sizeof(PyDictKeysObject)
Victor Stinner98ee9d52016-09-08 09:33:56 -0700563 + es * size
564 + sizeof(PyDictKeyEntry) * usable);
Victor Stinner742da042016-09-07 17:40:12 -0700565 if (dk == NULL) {
566 PyErr_NoMemory();
567 return NULL;
568 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400569 }
Victor Stinner49932fe2020-02-03 17:55:05 +0100570#ifdef Py_REF_DEBUG
571 _Py_RefTotal++;
572#endif
INADA Naokia7576492018-11-14 18:39:27 +0900573 dk->dk_refcnt = 1;
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400574 dk->dk_size = size;
Victor Stinner742da042016-09-07 17:40:12 -0700575 dk->dk_usable = usable;
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400576 dk->dk_lookup = lookdict_unicode_nodummy;
Victor Stinner742da042016-09-07 17:40:12 -0700577 dk->dk_nentries = 0;
Gregory P. Smith397f1b22018-04-19 22:41:19 -0700578 memset(&dk->dk_indices[0], 0xff, es * size);
Victor Stinner742da042016-09-07 17:40:12 -0700579 memset(DK_ENTRIES(dk), 0, sizeof(PyDictKeyEntry) * usable);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400580 return dk;
581}
582
583static void
584free_keys_object(PyDictKeysObject *keys)
585{
Victor Stinner742da042016-09-07 17:40:12 -0700586 PyDictKeyEntry *entries = DK_ENTRIES(keys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400587 Py_ssize_t i, n;
Victor Stinner742da042016-09-07 17:40:12 -0700588 for (i = 0, n = keys->dk_nentries; i < n; i++) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400589 Py_XDECREF(entries[i].me_key);
590 Py_XDECREF(entries[i].me_value);
591 }
Victor Stinner742da042016-09-07 17:40:12 -0700592 if (keys->dk_size == PyDict_MINSIZE && numfreekeys < PyDict_MAXFREELIST) {
593 keys_free_list[numfreekeys++] = keys;
594 return;
595 }
Raymond Hettingerce5179f2016-01-31 08:56:21 -0800596 PyObject_FREE(keys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400597}
598
599#define new_values(size) PyMem_NEW(PyObject *, size)
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400600#define free_values(values) PyMem_FREE(values)
601
602/* Consumes a reference to the keys object */
603static PyObject *
604new_dict(PyDictKeysObject *keys, PyObject **values)
605{
606 PyDictObject *mp;
Victor Stinnerc9b7f512013-07-08 22:19:20 +0200607 assert(keys != NULL);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000608 if (numfree) {
609 mp = free_list[--numfree];
610 assert (mp != NULL);
Dong-hee Na1b55b652020-02-17 19:09:15 +0900611 assert (Py_IS_TYPE(mp, &PyDict_Type));
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000612 _Py_NewReference((PyObject *)mp);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000613 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400614 else {
615 mp = PyObject_GC_New(PyDictObject, &PyDict_Type);
616 if (mp == NULL) {
INADA Naokia7576492018-11-14 18:39:27 +0900617 dictkeys_decref(keys);
Zackery Spytz3d07c1e2019-03-23 20:23:29 -0600618 if (values != empty_values) {
619 free_values(values);
620 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400621 return NULL;
622 }
623 }
624 mp->ma_keys = keys;
625 mp->ma_values = values;
626 mp->ma_used = 0;
Victor Stinner3b6a6b42016-09-08 12:51:24 -0700627 mp->ma_version_tag = DICT_NEXT_VERSION();
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200628 ASSERT_CONSISTENT(mp);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000629 return (PyObject *)mp;
Guido van Rossum4b1302b1993-03-27 18:11:32 +0000630}
631
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400632/* Consumes a reference to the keys object */
633static PyObject *
634new_dict_with_shared_keys(PyDictKeysObject *keys)
635{
636 PyObject **values;
637 Py_ssize_t i, size;
638
Victor Stinner742da042016-09-07 17:40:12 -0700639 size = USABLE_FRACTION(DK_SIZE(keys));
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400640 values = new_values(size);
641 if (values == NULL) {
INADA Naokia7576492018-11-14 18:39:27 +0900642 dictkeys_decref(keys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400643 return PyErr_NoMemory();
644 }
645 for (i = 0; i < size; i++) {
646 values[i] = NULL;
647 }
648 return new_dict(keys, values);
649}
650
Yury Selivanovb0a7a032018-01-22 11:54:41 -0500651
652static PyObject *
653clone_combined_dict(PyDictObject *orig)
654{
655 assert(PyDict_CheckExact(orig));
656 assert(orig->ma_values == NULL);
657 assert(orig->ma_keys->dk_refcnt == 1);
658
659 Py_ssize_t keys_size = _PyDict_KeysSize(orig->ma_keys);
660 PyDictKeysObject *keys = PyObject_Malloc(keys_size);
661 if (keys == NULL) {
662 PyErr_NoMemory();
663 return NULL;
664 }
665
666 memcpy(keys, orig->ma_keys, keys_size);
667
668 /* After copying key/value pairs, we need to incref all
669 keys and values and they are about to be co-owned by a
670 new dict object. */
671 PyDictKeyEntry *ep0 = DK_ENTRIES(keys);
672 Py_ssize_t n = keys->dk_nentries;
673 for (Py_ssize_t i = 0; i < n; i++) {
674 PyDictKeyEntry *entry = &ep0[i];
675 PyObject *value = entry->me_value;
676 if (value != NULL) {
677 Py_INCREF(value);
678 Py_INCREF(entry->me_key);
679 }
680 }
681
682 PyDictObject *new = (PyDictObject *)new_dict(keys, NULL);
683 if (new == NULL) {
684 /* In case of an error, `new_dict()` takes care of
685 cleaning up `keys`. */
686 return NULL;
687 }
688 new->ma_used = orig->ma_used;
Victor Stinner0fc91ee2019-04-12 21:51:34 +0200689 ASSERT_CONSISTENT(new);
Yury Selivanovb0a7a032018-01-22 11:54:41 -0500690 if (_PyObject_GC_IS_TRACKED(orig)) {
691 /* Maintain tracking. */
692 _PyObject_GC_TRACK(new);
693 }
Yury Selivanov0b752282018-07-06 12:20:07 -0400694
695 /* Since we copied the keys table we now have an extra reference
Victor Stinner49932fe2020-02-03 17:55:05 +0100696 in the system. Manually call increment _Py_RefTotal to signal that
INADA Naokia7576492018-11-14 18:39:27 +0900697 we have it now; calling dictkeys_incref would be an error as
Yury Selivanov0b752282018-07-06 12:20:07 -0400698 keys->dk_refcnt is already set to 1 (after memcpy). */
Victor Stinner49932fe2020-02-03 17:55:05 +0100699#ifdef Py_REF_DEBUG
700 _Py_RefTotal++;
701#endif
Yury Selivanov0b752282018-07-06 12:20:07 -0400702
Yury Selivanovb0a7a032018-01-22 11:54:41 -0500703 return (PyObject *)new;
704}
705
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400706PyObject *
707PyDict_New(void)
708{
Inada Naokif2a18672019-03-12 17:25:44 +0900709 dictkeys_incref(Py_EMPTY_KEYS);
710 return new_dict(Py_EMPTY_KEYS, empty_values);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400711}
712
Victor Stinner742da042016-09-07 17:40:12 -0700713/* Search index of hash table from offset of entry table */
714static Py_ssize_t
715lookdict_index(PyDictKeysObject *k, Py_hash_t hash, Py_ssize_t index)
716{
Victor Stinner742da042016-09-07 17:40:12 -0700717 size_t mask = DK_MASK(k);
INADA Naoki073ae482017-06-23 15:22:50 +0900718 size_t perturb = (size_t)hash;
719 size_t i = (size_t)hash & mask;
Victor Stinner742da042016-09-07 17:40:12 -0700720
INADA Naoki073ae482017-06-23 15:22:50 +0900721 for (;;) {
INADA Naokia7576492018-11-14 18:39:27 +0900722 Py_ssize_t ix = dictkeys_get_index(k, i);
Victor Stinner742da042016-09-07 17:40:12 -0700723 if (ix == index) {
724 return i;
725 }
726 if (ix == DKIX_EMPTY) {
727 return DKIX_EMPTY;
728 }
INADA Naoki073ae482017-06-23 15:22:50 +0900729 perturb >>= PERTURB_SHIFT;
730 i = mask & (i*5 + perturb + 1);
Victor Stinner742da042016-09-07 17:40:12 -0700731 }
Barry Warsawb2e57942017-09-14 18:13:16 -0700732 Py_UNREACHABLE();
Victor Stinner742da042016-09-07 17:40:12 -0700733}
734
Guido van Rossum4b1302b1993-03-27 18:11:32 +0000735/*
736The basic lookup function used by all operations.
Guido van Rossum16e93a81997-01-28 00:00:11 +0000737This is based on Algorithm D from Knuth Vol. 3, Sec. 6.4.
Guido van Rossum4b1302b1993-03-27 18:11:32 +0000738Open addressing is preferred over chaining since the link overhead for
739chaining would be substantial (100% with typical malloc overhead).
740
Tim Peterseb28ef22001-06-02 05:27:19 +0000741The initial probe index is computed as hash mod the table size. Subsequent
742probe indices are computed as explained earlier.
Guido van Rossum2bc13791999-03-24 19:06:42 +0000743
744All arithmetic on hash should ignore overflow.
Guido van Rossum16e93a81997-01-28 00:00:11 +0000745
Guido van Rossumdc5f6b22006-08-24 21:29:26 +0000746The details in this version are due to Tim Peters, building on many past
Tim Peterseb28ef22001-06-02 05:27:19 +0000747contributions by Reimer Behrends, Jyrki Alakuijala, Vladimir Marangozov and
Guido van Rossumdc5f6b22006-08-24 21:29:26 +0000748Christian Tismer.
Fred Drake1bff34a2000-08-31 19:31:38 +0000749
Victor Stinner742da042016-09-07 17:40:12 -0700750lookdict() is general-purpose, and may return DKIX_ERROR if (and only if) a
Victor Stinnera4348cc2016-09-08 12:01:25 -0700751comparison raises an exception.
Guido van Rossum89d8c602007-09-18 17:26:56 +0000752lookdict_unicode() below is specialized to string keys, comparison of which can
INADA Naoki1b8df102017-02-20 22:48:10 +0900753never raise an exception; that function can never return DKIX_ERROR when key
754is string. Otherwise, it falls back to lookdict().
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400755lookdict_unicode_nodummy is further specialized for string keys that cannot be
756the <dummy> value.
INADA Naoki778928b2017-08-03 23:45:15 +0900757For both, when the key isn't found a DKIX_EMPTY is returned.
Guido van Rossum4b1302b1993-03-27 18:11:32 +0000758*/
Victor Stinnerc7a8f672016-11-15 15:13:40 +0100759static Py_ssize_t _Py_HOT_FUNCTION
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400760lookdict(PyDictObject *mp, PyObject *key,
INADA Naoki778928b2017-08-03 23:45:15 +0900761 Py_hash_t hash, PyObject **value_addr)
Guido van Rossum4b1302b1993-03-27 18:11:32 +0000762{
INADA Naoki778928b2017-08-03 23:45:15 +0900763 size_t i, mask, perturb;
Victor Stinner742da042016-09-07 17:40:12 -0700764 PyDictKeysObject *dk;
INADA Naoki778928b2017-08-03 23:45:15 +0900765 PyDictKeyEntry *ep0;
Tim Peterseb28ef22001-06-02 05:27:19 +0000766
Antoine Pitrou9a234902012-05-13 20:48:01 +0200767top:
Victor Stinner742da042016-09-07 17:40:12 -0700768 dk = mp->ma_keys;
Victor Stinner742da042016-09-07 17:40:12 -0700769 ep0 = DK_ENTRIES(dk);
INADA Naoki778928b2017-08-03 23:45:15 +0900770 mask = DK_MASK(dk);
771 perturb = hash;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000772 i = (size_t)hash & mask;
Victor Stinner742da042016-09-07 17:40:12 -0700773
INADA Naoki778928b2017-08-03 23:45:15 +0900774 for (;;) {
INADA Naokia7576492018-11-14 18:39:27 +0900775 Py_ssize_t ix = dictkeys_get_index(dk, i);
Victor Stinner742da042016-09-07 17:40:12 -0700776 if (ix == DKIX_EMPTY) {
Victor Stinner742da042016-09-07 17:40:12 -0700777 *value_addr = NULL;
778 return ix;
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400779 }
INADA Naoki778928b2017-08-03 23:45:15 +0900780 if (ix >= 0) {
781 PyDictKeyEntry *ep = &ep0[ix];
782 assert(ep->me_key != NULL);
783 if (ep->me_key == key) {
784 *value_addr = ep->me_value;
785 return ix;
Victor Stinner742da042016-09-07 17:40:12 -0700786 }
INADA Naoki778928b2017-08-03 23:45:15 +0900787 if (ep->me_hash == hash) {
788 PyObject *startkey = ep->me_key;
789 Py_INCREF(startkey);
790 int cmp = PyObject_RichCompareBool(startkey, key, Py_EQ);
791 Py_DECREF(startkey);
792 if (cmp < 0) {
793 *value_addr = NULL;
794 return DKIX_ERROR;
795 }
796 if (dk == mp->ma_keys && ep->me_key == startkey) {
797 if (cmp > 0) {
798 *value_addr = ep->me_value;
799 return ix;
Victor Stinner742da042016-09-07 17:40:12 -0700800 }
INADA Naoki778928b2017-08-03 23:45:15 +0900801 }
802 else {
803 /* The dict was mutated, restart */
804 goto top;
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400805 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000806 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000807 }
INADA Naoki778928b2017-08-03 23:45:15 +0900808 perturb >>= PERTURB_SHIFT;
809 i = (i*5 + perturb + 1) & mask;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000810 }
Barry Warsawb2e57942017-09-14 18:13:16 -0700811 Py_UNREACHABLE();
Guido van Rossum4b1302b1993-03-27 18:11:32 +0000812}
813
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400814/* Specialized version for string-only keys */
Victor Stinnerc7a8f672016-11-15 15:13:40 +0100815static Py_ssize_t _Py_HOT_FUNCTION
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400816lookdict_unicode(PyDictObject *mp, PyObject *key,
INADA Naoki778928b2017-08-03 23:45:15 +0900817 Py_hash_t hash, PyObject **value_addr)
Fred Drake1bff34a2000-08-31 19:31:38 +0000818{
Victor Stinner742da042016-09-07 17:40:12 -0700819 assert(mp->ma_values == NULL);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000820 /* Make sure this function doesn't have to handle non-unicode keys,
821 including subclasses of str; e.g., one reason to subclass
822 unicodes is to override __eq__, and for speed we don't cater to
823 that here. */
824 if (!PyUnicode_CheckExact(key)) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400825 mp->ma_keys->dk_lookup = lookdict;
INADA Naoki778928b2017-08-03 23:45:15 +0900826 return lookdict(mp, key, hash, value_addr);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000827 }
Tim Peters15d49292001-05-27 07:39:22 +0000828
INADA Naoki778928b2017-08-03 23:45:15 +0900829 PyDictKeyEntry *ep0 = DK_ENTRIES(mp->ma_keys);
830 size_t mask = DK_MASK(mp->ma_keys);
831 size_t perturb = (size_t)hash;
832 size_t i = (size_t)hash & mask;
833
834 for (;;) {
INADA Naokia7576492018-11-14 18:39:27 +0900835 Py_ssize_t ix = dictkeys_get_index(mp->ma_keys, i);
Victor Stinner742da042016-09-07 17:40:12 -0700836 if (ix == DKIX_EMPTY) {
Victor Stinner742da042016-09-07 17:40:12 -0700837 *value_addr = NULL;
838 return DKIX_EMPTY;
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400839 }
INADA Naoki778928b2017-08-03 23:45:15 +0900840 if (ix >= 0) {
841 PyDictKeyEntry *ep = &ep0[ix];
842 assert(ep->me_key != NULL);
843 assert(PyUnicode_CheckExact(ep->me_key));
844 if (ep->me_key == key ||
845 (ep->me_hash == hash && unicode_eq(ep->me_key, key))) {
846 *value_addr = ep->me_value;
847 return ix;
Victor Stinner742da042016-09-07 17:40:12 -0700848 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400849 }
INADA Naoki778928b2017-08-03 23:45:15 +0900850 perturb >>= PERTURB_SHIFT;
851 i = mask & (i*5 + perturb + 1);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000852 }
Barry Warsawb2e57942017-09-14 18:13:16 -0700853 Py_UNREACHABLE();
Fred Drake1bff34a2000-08-31 19:31:38 +0000854}
855
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400856/* Faster version of lookdict_unicode when it is known that no <dummy> keys
857 * will be present. */
Victor Stinnerc7a8f672016-11-15 15:13:40 +0100858static Py_ssize_t _Py_HOT_FUNCTION
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400859lookdict_unicode_nodummy(PyDictObject *mp, PyObject *key,
INADA Naoki778928b2017-08-03 23:45:15 +0900860 Py_hash_t hash, PyObject **value_addr)
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400861{
Victor Stinner742da042016-09-07 17:40:12 -0700862 assert(mp->ma_values == NULL);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400863 /* Make sure this function doesn't have to handle non-unicode keys,
864 including subclasses of str; e.g., one reason to subclass
865 unicodes is to override __eq__, and for speed we don't cater to
866 that here. */
867 if (!PyUnicode_CheckExact(key)) {
868 mp->ma_keys->dk_lookup = lookdict;
INADA Naoki778928b2017-08-03 23:45:15 +0900869 return lookdict(mp, key, hash, value_addr);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400870 }
INADA Naoki778928b2017-08-03 23:45:15 +0900871
872 PyDictKeyEntry *ep0 = DK_ENTRIES(mp->ma_keys);
873 size_t mask = DK_MASK(mp->ma_keys);
874 size_t perturb = (size_t)hash;
875 size_t i = (size_t)hash & mask;
876
877 for (;;) {
INADA Naokia7576492018-11-14 18:39:27 +0900878 Py_ssize_t ix = dictkeys_get_index(mp->ma_keys, i);
Victor Stinner742da042016-09-07 17:40:12 -0700879 assert (ix != DKIX_DUMMY);
880 if (ix == DKIX_EMPTY) {
Victor Stinner742da042016-09-07 17:40:12 -0700881 *value_addr = NULL;
882 return DKIX_EMPTY;
883 }
INADA Naoki778928b2017-08-03 23:45:15 +0900884 PyDictKeyEntry *ep = &ep0[ix];
885 assert(ep->me_key != NULL);
886 assert(PyUnicode_CheckExact(ep->me_key));
Victor Stinner742da042016-09-07 17:40:12 -0700887 if (ep->me_key == key ||
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400888 (ep->me_hash == hash && unicode_eq(ep->me_key, key))) {
INADA Naokiba609772016-12-07 20:41:42 +0900889 *value_addr = ep->me_value;
Victor Stinner742da042016-09-07 17:40:12 -0700890 return ix;
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400891 }
INADA Naoki778928b2017-08-03 23:45:15 +0900892 perturb >>= PERTURB_SHIFT;
893 i = mask & (i*5 + perturb + 1);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400894 }
Barry Warsawb2e57942017-09-14 18:13:16 -0700895 Py_UNREACHABLE();
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400896}
897
898/* Version of lookdict for split tables.
899 * All split tables and only split tables use this lookup function.
900 * Split tables only contain unicode keys and no dummy keys,
901 * so algorithm is the same as lookdict_unicode_nodummy.
902 */
Victor Stinnerc7a8f672016-11-15 15:13:40 +0100903static Py_ssize_t _Py_HOT_FUNCTION
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400904lookdict_split(PyDictObject *mp, PyObject *key,
INADA Naoki778928b2017-08-03 23:45:15 +0900905 Py_hash_t hash, PyObject **value_addr)
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400906{
Victor Stinner742da042016-09-07 17:40:12 -0700907 /* mp must split table */
908 assert(mp->ma_values != NULL);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400909 if (!PyUnicode_CheckExact(key)) {
INADA Naoki778928b2017-08-03 23:45:15 +0900910 Py_ssize_t ix = lookdict(mp, key, hash, value_addr);
Victor Stinner742da042016-09-07 17:40:12 -0700911 if (ix >= 0) {
INADA Naokiba609772016-12-07 20:41:42 +0900912 *value_addr = mp->ma_values[ix];
Victor Stinner742da042016-09-07 17:40:12 -0700913 }
914 return ix;
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400915 }
Victor Stinner742da042016-09-07 17:40:12 -0700916
INADA Naoki778928b2017-08-03 23:45:15 +0900917 PyDictKeyEntry *ep0 = DK_ENTRIES(mp->ma_keys);
918 size_t mask = DK_MASK(mp->ma_keys);
919 size_t perturb = (size_t)hash;
920 size_t i = (size_t)hash & mask;
921
922 for (;;) {
INADA Naokia7576492018-11-14 18:39:27 +0900923 Py_ssize_t ix = dictkeys_get_index(mp->ma_keys, i);
INADA Naoki778928b2017-08-03 23:45:15 +0900924 assert (ix != DKIX_DUMMY);
Victor Stinner742da042016-09-07 17:40:12 -0700925 if (ix == DKIX_EMPTY) {
Victor Stinner742da042016-09-07 17:40:12 -0700926 *value_addr = NULL;
927 return DKIX_EMPTY;
928 }
INADA Naoki778928b2017-08-03 23:45:15 +0900929 PyDictKeyEntry *ep = &ep0[ix];
930 assert(ep->me_key != NULL);
931 assert(PyUnicode_CheckExact(ep->me_key));
Victor Stinner742da042016-09-07 17:40:12 -0700932 if (ep->me_key == key ||
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400933 (ep->me_hash == hash && unicode_eq(ep->me_key, key))) {
INADA Naokiba609772016-12-07 20:41:42 +0900934 *value_addr = mp->ma_values[ix];
Victor Stinner742da042016-09-07 17:40:12 -0700935 return ix;
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400936 }
INADA Naoki778928b2017-08-03 23:45:15 +0900937 perturb >>= PERTURB_SHIFT;
938 i = mask & (i*5 + perturb + 1);
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400939 }
Barry Warsawb2e57942017-09-14 18:13:16 -0700940 Py_UNREACHABLE();
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400941}
942
Benjamin Petersonfb886362010-04-24 18:21:17 +0000943int
944_PyDict_HasOnlyStringKeys(PyObject *dict)
945{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000946 Py_ssize_t pos = 0;
947 PyObject *key, *value;
Benjamin Petersonf6096542010-11-17 22:33:12 +0000948 assert(PyDict_Check(dict));
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000949 /* Shortcut */
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400950 if (((PyDictObject *)dict)->ma_keys->dk_lookup != lookdict)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000951 return 1;
952 while (PyDict_Next(dict, &pos, &key, &value))
953 if (!PyUnicode_Check(key))
954 return 0;
955 return 1;
Benjamin Petersonfb886362010-04-24 18:21:17 +0000956}
957
Antoine Pitrou3a652b12009-03-23 18:52:06 +0000958#define MAINTAIN_TRACKING(mp, key, value) \
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000959 do { \
960 if (!_PyObject_GC_IS_TRACKED(mp)) { \
961 if (_PyObject_GC_MAY_BE_TRACKED(key) || \
962 _PyObject_GC_MAY_BE_TRACKED(value)) { \
963 _PyObject_GC_TRACK(mp); \
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000964 } \
965 } \
966 } while(0)
Antoine Pitrou3a652b12009-03-23 18:52:06 +0000967
968void
969_PyDict_MaybeUntrack(PyObject *op)
970{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000971 PyDictObject *mp;
972 PyObject *value;
Victor Stinner742da042016-09-07 17:40:12 -0700973 Py_ssize_t i, numentries;
974 PyDictKeyEntry *ep0;
Antoine Pitrou3a652b12009-03-23 18:52:06 +0000975
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000976 if (!PyDict_CheckExact(op) || !_PyObject_GC_IS_TRACKED(op))
977 return;
978
979 mp = (PyDictObject *) op;
Victor Stinner742da042016-09-07 17:40:12 -0700980 ep0 = DK_ENTRIES(mp->ma_keys);
981 numentries = mp->ma_keys->dk_nentries;
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400982 if (_PyDict_HasSplitTable(mp)) {
Victor Stinner742da042016-09-07 17:40:12 -0700983 for (i = 0; i < numentries; i++) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400984 if ((value = mp->ma_values[i]) == NULL)
985 continue;
986 if (_PyObject_GC_MAY_BE_TRACKED(value)) {
Victor Stinner742da042016-09-07 17:40:12 -0700987 assert(!_PyObject_GC_MAY_BE_TRACKED(ep0[i].me_key));
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400988 return;
989 }
990 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +0000991 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400992 else {
Victor Stinner742da042016-09-07 17:40:12 -0700993 for (i = 0; i < numentries; i++) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -0400994 if ((value = ep0[i].me_value) == NULL)
995 continue;
996 if (_PyObject_GC_MAY_BE_TRACKED(value) ||
997 _PyObject_GC_MAY_BE_TRACKED(ep0[i].me_key))
998 return;
999 }
1000 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001001 _PyObject_GC_UNTRACK(op);
Antoine Pitrou3a652b12009-03-23 18:52:06 +00001002}
1003
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001004/* Internal function to find slot for an item from its hash
Victor Stinner3c336c52016-09-12 14:17:40 +02001005 when it is known that the key is not present in the dict.
1006
1007 The dict must be combined. */
INADA Naokiba609772016-12-07 20:41:42 +09001008static Py_ssize_t
INADA Naoki778928b2017-08-03 23:45:15 +09001009find_empty_slot(PyDictKeysObject *keys, Py_hash_t hash)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001010{
INADA Naoki778928b2017-08-03 23:45:15 +09001011 assert(keys != NULL);
Tim Peters6d6c1a32001-08-02 04:15:00 +00001012
INADA Naoki778928b2017-08-03 23:45:15 +09001013 const size_t mask = DK_MASK(keys);
1014 size_t i = hash & mask;
INADA Naokia7576492018-11-14 18:39:27 +09001015 Py_ssize_t ix = dictkeys_get_index(keys, i);
INADA Naoki778928b2017-08-03 23:45:15 +09001016 for (size_t perturb = hash; ix >= 0;) {
INADA Naoki267941c2016-10-06 15:19:07 +09001017 perturb >>= PERTURB_SHIFT;
INADA Naoki778928b2017-08-03 23:45:15 +09001018 i = (i*5 + perturb + 1) & mask;
INADA Naokia7576492018-11-14 18:39:27 +09001019 ix = dictkeys_get_index(keys, i);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001020 }
INADA Naoki778928b2017-08-03 23:45:15 +09001021 return i;
Thomas Wouters4d70c3d2006-06-08 14:42:34 +00001022}
1023
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001024static int
1025insertion_resize(PyDictObject *mp)
1026{
Raymond Hettinger36f74aa2013-05-17 03:01:13 -07001027 return dictresize(mp, GROWTH_RATE(mp));
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001028}
Antoine Pitroue965d972012-02-27 00:45:12 +01001029
1030/*
1031Internal routine to insert a new item into the table.
1032Used both by the internal resize routine and by the public insert routine.
Antoine Pitroue965d972012-02-27 00:45:12 +01001033Returns -1 if an error occurred, or 0 on success.
1034*/
1035static int
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001036insertdict(PyDictObject *mp, PyObject *key, Py_hash_t hash, PyObject *value)
Antoine Pitroue965d972012-02-27 00:45:12 +01001037{
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001038 PyObject *old_value;
INADA Naokiba609772016-12-07 20:41:42 +09001039 PyDictKeyEntry *ep;
Antoine Pitroue965d972012-02-27 00:45:12 +01001040
Serhiy Storchaka753bca32017-05-20 12:30:02 +03001041 Py_INCREF(key);
1042 Py_INCREF(value);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001043 if (mp->ma_values != NULL && !PyUnicode_CheckExact(key)) {
1044 if (insertion_resize(mp) < 0)
Serhiy Storchaka753bca32017-05-20 12:30:02 +03001045 goto Fail;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001046 }
1047
INADA Naoki778928b2017-08-03 23:45:15 +09001048 Py_ssize_t ix = mp->ma_keys->dk_lookup(mp, key, hash, &old_value);
Serhiy Storchaka753bca32017-05-20 12:30:02 +03001049 if (ix == DKIX_ERROR)
1050 goto Fail;
Victor Stinner742da042016-09-07 17:40:12 -07001051
Antoine Pitroud6967322014-10-18 00:35:00 +02001052 assert(PyUnicode_CheckExact(key) || mp->ma_keys->dk_lookup == lookdict);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001053 MAINTAIN_TRACKING(mp, key, value);
Victor Stinner742da042016-09-07 17:40:12 -07001054
1055 /* When insertion order is different from shared key, we can't share
1056 * the key anymore. Convert this instance to combine table.
1057 */
1058 if (_PyDict_HasSplitTable(mp) &&
INADA Naokiba609772016-12-07 20:41:42 +09001059 ((ix >= 0 && old_value == NULL && mp->ma_used != ix) ||
Victor Stinner742da042016-09-07 17:40:12 -07001060 (ix == DKIX_EMPTY && mp->ma_used != mp->ma_keys->dk_nentries))) {
Serhiy Storchaka753bca32017-05-20 12:30:02 +03001061 if (insertion_resize(mp) < 0)
1062 goto Fail;
Victor Stinner742da042016-09-07 17:40:12 -07001063 ix = DKIX_EMPTY;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001064 }
Victor Stinner742da042016-09-07 17:40:12 -07001065
1066 if (ix == DKIX_EMPTY) {
1067 /* Insert into new slot. */
INADA Naokiba609772016-12-07 20:41:42 +09001068 assert(old_value == NULL);
Victor Stinner742da042016-09-07 17:40:12 -07001069 if (mp->ma_keys->dk_usable <= 0) {
1070 /* Need to resize. */
Serhiy Storchaka753bca32017-05-20 12:30:02 +03001071 if (insertion_resize(mp) < 0)
1072 goto Fail;
Victor Stinner742da042016-09-07 17:40:12 -07001073 }
INADA Naoki778928b2017-08-03 23:45:15 +09001074 Py_ssize_t hashpos = find_empty_slot(mp->ma_keys, hash);
INADA Naokiba609772016-12-07 20:41:42 +09001075 ep = &DK_ENTRIES(mp->ma_keys)[mp->ma_keys->dk_nentries];
INADA Naokia7576492018-11-14 18:39:27 +09001076 dictkeys_set_index(mp->ma_keys, hashpos, mp->ma_keys->dk_nentries);
Victor Stinner742da042016-09-07 17:40:12 -07001077 ep->me_key = key;
1078 ep->me_hash = hash;
1079 if (mp->ma_values) {
1080 assert (mp->ma_values[mp->ma_keys->dk_nentries] == NULL);
1081 mp->ma_values[mp->ma_keys->dk_nentries] = value;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001082 }
1083 else {
Victor Stinner742da042016-09-07 17:40:12 -07001084 ep->me_value = value;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001085 }
1086 mp->ma_used++;
Victor Stinner3b6a6b42016-09-08 12:51:24 -07001087 mp->ma_version_tag = DICT_NEXT_VERSION();
Victor Stinner742da042016-09-07 17:40:12 -07001088 mp->ma_keys->dk_usable--;
1089 mp->ma_keys->dk_nentries++;
1090 assert(mp->ma_keys->dk_usable >= 0);
Victor Stinner0fc91ee2019-04-12 21:51:34 +02001091 ASSERT_CONSISTENT(mp);
Victor Stinner742da042016-09-07 17:40:12 -07001092 return 0;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001093 }
Victor Stinner742da042016-09-07 17:40:12 -07001094
Inada Naoki91234a12019-06-03 21:30:58 +09001095 if (old_value != value) {
1096 if (_PyDict_HasSplitTable(mp)) {
1097 mp->ma_values[ix] = value;
1098 if (old_value == NULL) {
1099 /* pending state */
1100 assert(ix == mp->ma_used);
1101 mp->ma_used++;
1102 }
INADA Naokiba609772016-12-07 20:41:42 +09001103 }
Inada Naoki91234a12019-06-03 21:30:58 +09001104 else {
1105 assert(old_value != NULL);
1106 DK_ENTRIES(mp->ma_keys)[ix].me_value = value;
1107 }
1108 mp->ma_version_tag = DICT_NEXT_VERSION();
INADA Naokiba609772016-12-07 20:41:42 +09001109 }
INADA Naokiba609772016-12-07 20:41:42 +09001110 Py_XDECREF(old_value); /* which **CAN** re-enter (see issue #22653) */
Victor Stinner0fc91ee2019-04-12 21:51:34 +02001111 ASSERT_CONSISTENT(mp);
Serhiy Storchaka753bca32017-05-20 12:30:02 +03001112 Py_DECREF(key);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001113 return 0;
Serhiy Storchaka753bca32017-05-20 12:30:02 +03001114
1115Fail:
1116 Py_DECREF(value);
1117 Py_DECREF(key);
1118 return -1;
Antoine Pitroue965d972012-02-27 00:45:12 +01001119}
1120
Inada Naoki2ddc7f62019-03-18 20:38:33 +09001121// Same to insertdict but specialized for ma_keys = Py_EMPTY_KEYS.
1122static int
1123insert_to_emptydict(PyDictObject *mp, PyObject *key, Py_hash_t hash,
1124 PyObject *value)
1125{
1126 assert(mp->ma_keys == Py_EMPTY_KEYS);
1127
1128 PyDictKeysObject *newkeys = new_keys_object(PyDict_MINSIZE);
1129 if (newkeys == NULL) {
1130 return -1;
1131 }
1132 if (!PyUnicode_CheckExact(key)) {
1133 newkeys->dk_lookup = lookdict;
1134 }
1135 dictkeys_decref(Py_EMPTY_KEYS);
1136 mp->ma_keys = newkeys;
1137 mp->ma_values = NULL;
1138
1139 Py_INCREF(key);
1140 Py_INCREF(value);
1141 MAINTAIN_TRACKING(mp, key, value);
1142
1143 size_t hashpos = (size_t)hash & (PyDict_MINSIZE-1);
Dong-hee Nac39d1dd2019-10-11 17:43:11 +09001144 PyDictKeyEntry *ep = DK_ENTRIES(mp->ma_keys);
Inada Naoki2ddc7f62019-03-18 20:38:33 +09001145 dictkeys_set_index(mp->ma_keys, hashpos, 0);
1146 ep->me_key = key;
1147 ep->me_hash = hash;
1148 ep->me_value = value;
1149 mp->ma_used++;
1150 mp->ma_version_tag = DICT_NEXT_VERSION();
1151 mp->ma_keys->dk_usable--;
1152 mp->ma_keys->dk_nentries++;
1153 return 0;
1154}
1155
Thomas Wouters4d70c3d2006-06-08 14:42:34 +00001156/*
luzpaza5293b42017-11-05 07:37:50 -06001157Internal routine used by dictresize() to build a hashtable of entries.
Thomas Wouters4d70c3d2006-06-08 14:42:34 +00001158*/
1159static void
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001160build_indices(PyDictKeysObject *keys, PyDictKeyEntry *ep, Py_ssize_t n)
Thomas Wouters4d70c3d2006-06-08 14:42:34 +00001161{
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001162 size_t mask = (size_t)DK_SIZE(keys) - 1;
1163 for (Py_ssize_t ix = 0; ix != n; ix++, ep++) {
1164 Py_hash_t hash = ep->me_hash;
1165 size_t i = hash & mask;
INADA Naokia7576492018-11-14 18:39:27 +09001166 for (size_t perturb = hash; dictkeys_get_index(keys, i) != DKIX_EMPTY;) {
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001167 perturb >>= PERTURB_SHIFT;
INADA Naoki870c2862017-06-24 09:03:19 +09001168 i = mask & (i*5 + perturb + 1);
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001169 }
INADA Naokia7576492018-11-14 18:39:27 +09001170 dictkeys_set_index(keys, i, ix);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001171 }
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001172}
1173
1174/*
1175Restructure the table by allocating a new table and reinserting all
1176items again. When entries have been deleted, the new table may
1177actually be smaller than the old one.
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001178If a table is split (its keys and hashes are shared, its values are not),
1179then the values are temporarily copied into the table, it is resized as
1180a combined table, then the me_value slots in the old table are NULLed out.
1181After resizing a table is always combined,
1182but can be resplit by make_keys_shared().
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001183*/
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001184static int
Victor Stinner3d3f2642016-12-15 17:21:23 +01001185dictresize(PyDictObject *mp, Py_ssize_t minsize)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001186{
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001187 Py_ssize_t newsize, numentries;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001188 PyDictKeysObject *oldkeys;
1189 PyObject **oldvalues;
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001190 PyDictKeyEntry *oldentries, *newentries;
Tim Peters91a364d2001-05-19 07:04:38 +00001191
Victor Stinner742da042016-09-07 17:40:12 -07001192 /* Find the smallest table size > minused. */
1193 for (newsize = PyDict_MINSIZE;
Victor Stinner3d3f2642016-12-15 17:21:23 +01001194 newsize < minsize && newsize > 0;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001195 newsize <<= 1)
1196 ;
1197 if (newsize <= 0) {
1198 PyErr_NoMemory();
1199 return -1;
1200 }
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001201
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001202 oldkeys = mp->ma_keys;
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001203
1204 /* NOTE: Current odict checks mp->ma_keys to detect resize happen.
1205 * So we can't reuse oldkeys even if oldkeys->dk_size == newsize.
1206 * TODO: Try reusing oldkeys when reimplement odict.
1207 */
1208
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001209 /* Allocate a new table. */
1210 mp->ma_keys = new_keys_object(newsize);
1211 if (mp->ma_keys == NULL) {
1212 mp->ma_keys = oldkeys;
1213 return -1;
1214 }
Victor Stinner3d3f2642016-12-15 17:21:23 +01001215 // New table must be large enough.
1216 assert(mp->ma_keys->dk_usable >= mp->ma_used);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001217 if (oldkeys->dk_lookup == lookdict)
1218 mp->ma_keys->dk_lookup = lookdict;
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001219
1220 numentries = mp->ma_used;
1221 oldentries = DK_ENTRIES(oldkeys);
1222 newentries = DK_ENTRIES(mp->ma_keys);
1223 oldvalues = mp->ma_values;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001224 if (oldvalues != NULL) {
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001225 /* Convert split table into new combined table.
1226 * We must incref keys; we can transfer values.
1227 * Note that values of split table is always dense.
1228 */
1229 for (Py_ssize_t i = 0; i < numentries; i++) {
1230 assert(oldvalues[i] != NULL);
1231 PyDictKeyEntry *ep = &oldentries[i];
1232 PyObject *key = ep->me_key;
1233 Py_INCREF(key);
1234 newentries[i].me_key = key;
1235 newentries[i].me_hash = ep->me_hash;
1236 newentries[i].me_value = oldvalues[i];
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001237 }
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001238
INADA Naokia7576492018-11-14 18:39:27 +09001239 dictkeys_decref(oldkeys);
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001240 mp->ma_values = NULL;
Victor Stinner742da042016-09-07 17:40:12 -07001241 if (oldvalues != empty_values) {
1242 free_values(oldvalues);
1243 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001244 }
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001245 else { // combined table.
1246 if (oldkeys->dk_nentries == numentries) {
1247 memcpy(newentries, oldentries, numentries * sizeof(PyDictKeyEntry));
1248 }
1249 else {
1250 PyDictKeyEntry *ep = oldentries;
1251 for (Py_ssize_t i = 0; i < numentries; i++) {
1252 while (ep->me_value == NULL)
1253 ep++;
1254 newentries[i] = *ep++;
1255 }
1256 }
1257
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001258 assert(oldkeys->dk_lookup != lookdict_split);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001259 assert(oldkeys->dk_refcnt == 1);
Victor Stinner49932fe2020-02-03 17:55:05 +01001260#ifdef Py_REF_DEBUG
1261 _Py_RefTotal--;
1262#endif
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001263 if (oldkeys->dk_size == PyDict_MINSIZE &&
Victor Stinner49932fe2020-02-03 17:55:05 +01001264 numfreekeys < PyDict_MAXFREELIST)
1265 {
INADA Naokia7576492018-11-14 18:39:27 +09001266 keys_free_list[numfreekeys++] = oldkeys;
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001267 }
1268 else {
INADA Naokia7576492018-11-14 18:39:27 +09001269 PyObject_FREE(oldkeys);
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001270 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001271 }
Serhiy Storchakae26e20d2016-10-29 10:50:00 +03001272
1273 build_indices(mp->ma_keys, newentries, numentries);
1274 mp->ma_keys->dk_usable -= numentries;
1275 mp->ma_keys->dk_nentries = numentries;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001276 return 0;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001277}
1278
Benjamin Peterson15ee8212012-04-24 14:44:18 -04001279/* Returns NULL if unable to split table.
1280 * A NULL return does not necessarily indicate an error */
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001281static PyDictKeysObject *
1282make_keys_shared(PyObject *op)
1283{
1284 Py_ssize_t i;
1285 Py_ssize_t size;
1286 PyDictObject *mp = (PyDictObject *)op;
1287
Benjamin Peterson15ee8212012-04-24 14:44:18 -04001288 if (!PyDict_CheckExact(op))
1289 return NULL;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001290 if (!_PyDict_HasSplitTable(mp)) {
1291 PyDictKeyEntry *ep0;
1292 PyObject **values;
1293 assert(mp->ma_keys->dk_refcnt == 1);
1294 if (mp->ma_keys->dk_lookup == lookdict) {
1295 return NULL;
1296 }
1297 else if (mp->ma_keys->dk_lookup == lookdict_unicode) {
1298 /* Remove dummy keys */
1299 if (dictresize(mp, DK_SIZE(mp->ma_keys)))
1300 return NULL;
1301 }
1302 assert(mp->ma_keys->dk_lookup == lookdict_unicode_nodummy);
1303 /* Copy values into a new array */
Victor Stinner742da042016-09-07 17:40:12 -07001304 ep0 = DK_ENTRIES(mp->ma_keys);
1305 size = USABLE_FRACTION(DK_SIZE(mp->ma_keys));
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001306 values = new_values(size);
1307 if (values == NULL) {
1308 PyErr_SetString(PyExc_MemoryError,
1309 "Not enough memory to allocate new values array");
1310 return NULL;
1311 }
1312 for (i = 0; i < size; i++) {
1313 values[i] = ep0[i].me_value;
1314 ep0[i].me_value = NULL;
1315 }
1316 mp->ma_keys->dk_lookup = lookdict_split;
1317 mp->ma_values = values;
1318 }
INADA Naokia7576492018-11-14 18:39:27 +09001319 dictkeys_incref(mp->ma_keys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001320 return mp->ma_keys;
1321}
Christian Heimes99170a52007-12-19 02:07:34 +00001322
1323PyObject *
1324_PyDict_NewPresized(Py_ssize_t minused)
1325{
INADA Naoki92c50ee2016-11-22 00:57:02 +09001326 const Py_ssize_t max_presize = 128 * 1024;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001327 Py_ssize_t newsize;
1328 PyDictKeysObject *new_keys;
INADA Naoki92c50ee2016-11-22 00:57:02 +09001329
Inada Naoki2ddc7f62019-03-18 20:38:33 +09001330 if (minused <= USABLE_FRACTION(PyDict_MINSIZE)) {
Inada Naokif2a18672019-03-12 17:25:44 +09001331 return PyDict_New();
1332 }
INADA Naoki92c50ee2016-11-22 00:57:02 +09001333 /* There are no strict guarantee that returned dict can contain minused
1334 * items without resize. So we create medium size dict instead of very
1335 * large dict or MemoryError.
1336 */
1337 if (minused > USABLE_FRACTION(max_presize)) {
1338 newsize = max_presize;
1339 }
1340 else {
1341 Py_ssize_t minsize = ESTIMATE_SIZE(minused);
Inada Naoki2ddc7f62019-03-18 20:38:33 +09001342 newsize = PyDict_MINSIZE*2;
INADA Naoki92c50ee2016-11-22 00:57:02 +09001343 while (newsize < minsize) {
1344 newsize <<= 1;
1345 }
1346 }
1347 assert(IS_POWER_OF_2(newsize));
1348
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001349 new_keys = new_keys_object(newsize);
1350 if (new_keys == NULL)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001351 return NULL;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001352 return new_dict(new_keys, NULL);
Christian Heimes99170a52007-12-19 02:07:34 +00001353}
1354
Thomas Wouters4d70c3d2006-06-08 14:42:34 +00001355/* Note that, for historical reasons, PyDict_GetItem() suppresses all errors
1356 * that may occur (originally dicts supported only string keys, and exceptions
1357 * weren't possible). So, while the original intent was that a NULL return
Thomas Wouters0e3f5912006-08-11 14:57:12 +00001358 * meant the key wasn't present, in reality it can mean that, or that an error
Thomas Wouters4d70c3d2006-06-08 14:42:34 +00001359 * (suppressed) occurred while computing the key's hash, or that some error
1360 * (suppressed) occurred when comparing keys in the dict's internal probe
1361 * sequence. A nasty example of the latter is when a Python-coded comparison
1362 * function hits a stack-depth error, which can cause this to return NULL
1363 * even if the key is present.
1364 */
Guido van Rossumc0b618a1997-05-02 03:12:38 +00001365PyObject *
Tim Peters1f5871e2000-07-04 17:44:48 +00001366PyDict_GetItem(PyObject *op, PyObject *key)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001367{
Benjamin Peterson8f67d082010-10-17 20:54:53 +00001368 Py_hash_t hash;
Victor Stinner742da042016-09-07 17:40:12 -07001369 Py_ssize_t ix;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001370 PyDictObject *mp = (PyDictObject *)op;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001371 PyThreadState *tstate;
INADA Naokiba609772016-12-07 20:41:42 +09001372 PyObject *value;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001373
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001374 if (!PyDict_Check(op))
1375 return NULL;
1376 if (!PyUnicode_CheckExact(key) ||
Martin v. Löwisd63a3b82011-09-28 07:41:54 +02001377 (hash = ((PyASCIIObject *) key)->hash) == -1)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001378 {
1379 hash = PyObject_Hash(key);
1380 if (hash == -1) {
1381 PyErr_Clear();
1382 return NULL;
1383 }
1384 }
Thomas Wouters4d70c3d2006-06-08 14:42:34 +00001385
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001386 /* We can arrive here with a NULL tstate during initialization: try
1387 running "python -Wi" for an example related to string interning.
1388 Let's just hope that no exception occurs then... This must be
Victor Stinner50b48572018-11-01 01:51:40 +01001389 _PyThreadState_GET() and not PyThreadState_Get() because the latter
Victor Stinner9204fb82018-10-30 15:13:17 +01001390 abort Python if tstate is NULL. */
Victor Stinner50b48572018-11-01 01:51:40 +01001391 tstate = _PyThreadState_GET();
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001392 if (tstate != NULL && tstate->curexc_type != NULL) {
1393 /* preserve the existing exception */
1394 PyObject *err_type, *err_value, *err_tb;
1395 PyErr_Fetch(&err_type, &err_value, &err_tb);
INADA Naoki778928b2017-08-03 23:45:15 +09001396 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &value);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001397 /* ignore errors */
1398 PyErr_Restore(err_type, err_value, err_tb);
Victor Stinner742da042016-09-07 17:40:12 -07001399 if (ix < 0)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001400 return NULL;
1401 }
1402 else {
INADA Naoki778928b2017-08-03 23:45:15 +09001403 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &value);
Victor Stinner742da042016-09-07 17:40:12 -07001404 if (ix < 0) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001405 PyErr_Clear();
1406 return NULL;
1407 }
1408 }
INADA Naokiba609772016-12-07 20:41:42 +09001409 return value;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001410}
1411
Serhiy Storchakaf0b311b2016-11-06 13:18:24 +02001412/* Same as PyDict_GetItemWithError() but with hash supplied by caller.
1413 This returns NULL *with* an exception set if an exception occurred.
1414 It returns NULL *without* an exception set if the key wasn't present.
1415*/
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001416PyObject *
1417_PyDict_GetItem_KnownHash(PyObject *op, PyObject *key, Py_hash_t hash)
1418{
Victor Stinner742da042016-09-07 17:40:12 -07001419 Py_ssize_t ix;
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001420 PyDictObject *mp = (PyDictObject *)op;
INADA Naokiba609772016-12-07 20:41:42 +09001421 PyObject *value;
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001422
Serhiy Storchakaf0b311b2016-11-06 13:18:24 +02001423 if (!PyDict_Check(op)) {
1424 PyErr_BadInternalCall();
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001425 return NULL;
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001426 }
Serhiy Storchakaf0b311b2016-11-06 13:18:24 +02001427
INADA Naoki778928b2017-08-03 23:45:15 +09001428 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &value);
Serhiy Storchakaf0b311b2016-11-06 13:18:24 +02001429 if (ix < 0) {
1430 return NULL;
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001431 }
INADA Naokiba609772016-12-07 20:41:42 +09001432 return value;
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001433}
1434
Guido van Rossum47b9ff62006-08-24 00:41:19 +00001435/* Variant of PyDict_GetItem() that doesn't suppress exceptions.
1436 This returns NULL *with* an exception set if an exception occurred.
1437 It returns NULL *without* an exception set if the key wasn't present.
1438*/
1439PyObject *
1440PyDict_GetItemWithError(PyObject *op, PyObject *key)
1441{
Victor Stinner742da042016-09-07 17:40:12 -07001442 Py_ssize_t ix;
Benjamin Peterson8f67d082010-10-17 20:54:53 +00001443 Py_hash_t hash;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001444 PyDictObject*mp = (PyDictObject *)op;
INADA Naokiba609772016-12-07 20:41:42 +09001445 PyObject *value;
Guido van Rossum47b9ff62006-08-24 00:41:19 +00001446
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001447 if (!PyDict_Check(op)) {
1448 PyErr_BadInternalCall();
1449 return NULL;
1450 }
1451 if (!PyUnicode_CheckExact(key) ||
Martin v. Löwisd63a3b82011-09-28 07:41:54 +02001452 (hash = ((PyASCIIObject *) key)->hash) == -1)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001453 {
1454 hash = PyObject_Hash(key);
1455 if (hash == -1) {
1456 return NULL;
1457 }
1458 }
Guido van Rossum47b9ff62006-08-24 00:41:19 +00001459
INADA Naoki778928b2017-08-03 23:45:15 +09001460 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &value);
Victor Stinner742da042016-09-07 17:40:12 -07001461 if (ix < 0)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001462 return NULL;
INADA Naokiba609772016-12-07 20:41:42 +09001463 return value;
Guido van Rossum47b9ff62006-08-24 00:41:19 +00001464}
1465
Brett Cannonfd074152012-04-14 14:10:13 -04001466PyObject *
1467_PyDict_GetItemIdWithError(PyObject *dp, struct _Py_Identifier *key)
1468{
1469 PyObject *kv;
1470 kv = _PyUnicode_FromId(key); /* borrowed */
1471 if (kv == NULL)
1472 return NULL;
1473 return PyDict_GetItemWithError(dp, kv);
1474}
1475
Serhiy Storchakaa24107b2019-02-25 17:59:46 +02001476PyObject *
1477_PyDict_GetItemStringWithError(PyObject *v, const char *key)
1478{
1479 PyObject *kv, *rv;
1480 kv = PyUnicode_FromString(key);
1481 if (kv == NULL) {
1482 return NULL;
1483 }
1484 rv = PyDict_GetItemWithError(v, kv);
1485 Py_DECREF(kv);
1486 return rv;
1487}
1488
Victor Stinnerb4efc962015-11-20 09:24:02 +01001489/* Fast version of global value lookup (LOAD_GLOBAL).
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001490 * Lookup in globals, then builtins.
Victor Stinnerb4efc962015-11-20 09:24:02 +01001491 *
1492 * Raise an exception and return NULL if an error occurred (ex: computing the
1493 * key hash failed, key comparison failed, ...). Return NULL if the key doesn't
1494 * exist. Return the value if the key exists.
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001495 */
1496PyObject *
1497_PyDict_LoadGlobal(PyDictObject *globals, PyDictObject *builtins, PyObject *key)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001498{
Victor Stinner742da042016-09-07 17:40:12 -07001499 Py_ssize_t ix;
Victor Stinnerb4efc962015-11-20 09:24:02 +01001500 Py_hash_t hash;
INADA Naokiba609772016-12-07 20:41:42 +09001501 PyObject *value;
Victor Stinnerb4efc962015-11-20 09:24:02 +01001502
1503 if (!PyUnicode_CheckExact(key) ||
1504 (hash = ((PyASCIIObject *) key)->hash) == -1)
1505 {
1506 hash = PyObject_Hash(key);
1507 if (hash == -1)
1508 return NULL;
Antoine Pitroue965d972012-02-27 00:45:12 +01001509 }
Victor Stinnerb4efc962015-11-20 09:24:02 +01001510
1511 /* namespace 1: globals */
INADA Naoki778928b2017-08-03 23:45:15 +09001512 ix = globals->ma_keys->dk_lookup(globals, key, hash, &value);
Victor Stinner742da042016-09-07 17:40:12 -07001513 if (ix == DKIX_ERROR)
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001514 return NULL;
INADA Naokiba609772016-12-07 20:41:42 +09001515 if (ix != DKIX_EMPTY && value != NULL)
1516 return value;
Victor Stinnerb4efc962015-11-20 09:24:02 +01001517
1518 /* namespace 2: builtins */
INADA Naoki778928b2017-08-03 23:45:15 +09001519 ix = builtins->ma_keys->dk_lookup(builtins, key, hash, &value);
Victor Stinner742da042016-09-07 17:40:12 -07001520 if (ix < 0)
Victor Stinnerb4efc962015-11-20 09:24:02 +01001521 return NULL;
INADA Naokiba609772016-12-07 20:41:42 +09001522 return value;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001523}
1524
Antoine Pitroue965d972012-02-27 00:45:12 +01001525/* CAUTION: PyDict_SetItem() must guarantee that it won't resize the
1526 * dictionary if it's merely replacing the value for an existing key.
1527 * This means that it's safe to loop over a dictionary with PyDict_Next()
1528 * and occasionally replace a value -- but you can't insert new keys or
1529 * remove them.
1530 */
1531int
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001532PyDict_SetItem(PyObject *op, PyObject *key, PyObject *value)
Antoine Pitroue965d972012-02-27 00:45:12 +01001533{
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001534 PyDictObject *mp;
1535 Py_hash_t hash;
Antoine Pitroue965d972012-02-27 00:45:12 +01001536 if (!PyDict_Check(op)) {
1537 PyErr_BadInternalCall();
1538 return -1;
1539 }
1540 assert(key);
1541 assert(value);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001542 mp = (PyDictObject *)op;
1543 if (!PyUnicode_CheckExact(key) ||
1544 (hash = ((PyASCIIObject *) key)->hash) == -1)
1545 {
Antoine Pitroue965d972012-02-27 00:45:12 +01001546 hash = PyObject_Hash(key);
1547 if (hash == -1)
1548 return -1;
1549 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001550
Inada Naoki2ddc7f62019-03-18 20:38:33 +09001551 if (mp->ma_keys == Py_EMPTY_KEYS) {
1552 return insert_to_emptydict(mp, key, hash, value);
1553 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001554 /* insertdict() handles any resizing that might be necessary */
1555 return insertdict(mp, key, hash, value);
Antoine Pitroue965d972012-02-27 00:45:12 +01001556}
1557
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001558int
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001559_PyDict_SetItem_KnownHash(PyObject *op, PyObject *key, PyObject *value,
1560 Py_hash_t hash)
1561{
1562 PyDictObject *mp;
1563
1564 if (!PyDict_Check(op)) {
1565 PyErr_BadInternalCall();
1566 return -1;
1567 }
1568 assert(key);
1569 assert(value);
Serhiy Storchakab9d98d52015-10-02 12:47:11 +03001570 assert(hash != -1);
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001571 mp = (PyDictObject *)op;
1572
Inada Naoki2ddc7f62019-03-18 20:38:33 +09001573 if (mp->ma_keys == Py_EMPTY_KEYS) {
1574 return insert_to_emptydict(mp, key, hash, value);
1575 }
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001576 /* insertdict() handles any resizing that might be necessary */
1577 return insertdict(mp, key, hash, value);
1578}
1579
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001580static int
INADA Naoki778928b2017-08-03 23:45:15 +09001581delitem_common(PyDictObject *mp, Py_hash_t hash, Py_ssize_t ix,
Antoine Pitrouc06ae202016-12-27 14:34:54 +01001582 PyObject *old_value)
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001583{
Antoine Pitrouc06ae202016-12-27 14:34:54 +01001584 PyObject *old_key;
Antoine Pitroud741ed42016-12-27 14:23:43 +01001585 PyDictKeyEntry *ep;
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001586
INADA Naoki778928b2017-08-03 23:45:15 +09001587 Py_ssize_t hashpos = lookdict_index(mp->ma_keys, hash, ix);
1588 assert(hashpos >= 0);
1589
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001590 mp->ma_used--;
Antoine Pitroud741ed42016-12-27 14:23:43 +01001591 mp->ma_version_tag = DICT_NEXT_VERSION();
1592 ep = &DK_ENTRIES(mp->ma_keys)[ix];
INADA Naokia7576492018-11-14 18:39:27 +09001593 dictkeys_set_index(mp->ma_keys, hashpos, DKIX_DUMMY);
Antoine Pitroud741ed42016-12-27 14:23:43 +01001594 ENSURE_ALLOWS_DELETIONS(mp);
1595 old_key = ep->me_key;
1596 ep->me_key = NULL;
Antoine Pitrouc06ae202016-12-27 14:34:54 +01001597 ep->me_value = NULL;
Antoine Pitroud741ed42016-12-27 14:23:43 +01001598 Py_DECREF(old_key);
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001599 Py_DECREF(old_value);
Antoine Pitroud741ed42016-12-27 14:23:43 +01001600
Victor Stinner0fc91ee2019-04-12 21:51:34 +02001601 ASSERT_CONSISTENT(mp);
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001602 return 0;
1603}
1604
Raymond Hettinger4b74fba2014-05-03 16:32:11 -07001605int
Tim Peters1f5871e2000-07-04 17:44:48 +00001606PyDict_DelItem(PyObject *op, PyObject *key)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001607{
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001608 Py_hash_t hash;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001609 assert(key);
1610 if (!PyUnicode_CheckExact(key) ||
Martin v. Löwisd63a3b82011-09-28 07:41:54 +02001611 (hash = ((PyASCIIObject *) key)->hash) == -1) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001612 hash = PyObject_Hash(key);
1613 if (hash == -1)
1614 return -1;
1615 }
Victor Stinner742da042016-09-07 17:40:12 -07001616
1617 return _PyDict_DelItem_KnownHash(op, key, hash);
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001618}
1619
Serhiy Storchakab9d98d52015-10-02 12:47:11 +03001620int
1621_PyDict_DelItem_KnownHash(PyObject *op, PyObject *key, Py_hash_t hash)
1622{
INADA Naoki778928b2017-08-03 23:45:15 +09001623 Py_ssize_t ix;
Serhiy Storchakab9d98d52015-10-02 12:47:11 +03001624 PyDictObject *mp;
Antoine Pitrouc06ae202016-12-27 14:34:54 +01001625 PyObject *old_value;
Serhiy Storchakab9d98d52015-10-02 12:47:11 +03001626
1627 if (!PyDict_Check(op)) {
1628 PyErr_BadInternalCall();
1629 return -1;
1630 }
1631 assert(key);
1632 assert(hash != -1);
1633 mp = (PyDictObject *)op;
INADA Naoki778928b2017-08-03 23:45:15 +09001634 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &old_value);
Victor Stinner742da042016-09-07 17:40:12 -07001635 if (ix == DKIX_ERROR)
Serhiy Storchakab9d98d52015-10-02 12:47:11 +03001636 return -1;
INADA Naokiba609772016-12-07 20:41:42 +09001637 if (ix == DKIX_EMPTY || old_value == NULL) {
Serhiy Storchakab9d98d52015-10-02 12:47:11 +03001638 _PyErr_SetKeyError(key);
1639 return -1;
1640 }
Victor Stinner78601a32016-09-09 19:28:36 -07001641
1642 // Split table doesn't allow deletion. Combine it.
1643 if (_PyDict_HasSplitTable(mp)) {
1644 if (dictresize(mp, DK_SIZE(mp->ma_keys))) {
1645 return -1;
1646 }
INADA Naoki778928b2017-08-03 23:45:15 +09001647 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &old_value);
Victor Stinner78601a32016-09-09 19:28:36 -07001648 assert(ix >= 0);
1649 }
1650
INADA Naoki778928b2017-08-03 23:45:15 +09001651 return delitem_common(mp, hash, ix, old_value);
Serhiy Storchakab9d98d52015-10-02 12:47:11 +03001652}
1653
Antoine Pitroud741ed42016-12-27 14:23:43 +01001654/* This function promises that the predicate -> deletion sequence is atomic
1655 * (i.e. protected by the GIL), assuming the predicate itself doesn't
1656 * release the GIL.
1657 */
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001658int
1659_PyDict_DelItemIf(PyObject *op, PyObject *key,
1660 int (*predicate)(PyObject *value))
1661{
Antoine Pitroud741ed42016-12-27 14:23:43 +01001662 Py_ssize_t hashpos, ix;
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001663 PyDictObject *mp;
1664 Py_hash_t hash;
Antoine Pitrouc06ae202016-12-27 14:34:54 +01001665 PyObject *old_value;
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001666 int res;
1667
1668 if (!PyDict_Check(op)) {
1669 PyErr_BadInternalCall();
1670 return -1;
1671 }
1672 assert(key);
1673 hash = PyObject_Hash(key);
1674 if (hash == -1)
1675 return -1;
1676 mp = (PyDictObject *)op;
INADA Naoki778928b2017-08-03 23:45:15 +09001677 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &old_value);
Antoine Pitroud741ed42016-12-27 14:23:43 +01001678 if (ix == DKIX_ERROR)
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001679 return -1;
Antoine Pitrouc06ae202016-12-27 14:34:54 +01001680 if (ix == DKIX_EMPTY || old_value == NULL) {
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001681 _PyErr_SetKeyError(key);
1682 return -1;
1683 }
Antoine Pitroud741ed42016-12-27 14:23:43 +01001684
1685 // Split table doesn't allow deletion. Combine it.
1686 if (_PyDict_HasSplitTable(mp)) {
1687 if (dictresize(mp, DK_SIZE(mp->ma_keys))) {
1688 return -1;
1689 }
INADA Naoki778928b2017-08-03 23:45:15 +09001690 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &old_value);
Antoine Pitroud741ed42016-12-27 14:23:43 +01001691 assert(ix >= 0);
1692 }
1693
Antoine Pitrouc06ae202016-12-27 14:34:54 +01001694 res = predicate(old_value);
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001695 if (res == -1)
1696 return -1;
INADA Naoki778928b2017-08-03 23:45:15 +09001697
1698 hashpos = lookdict_index(mp->ma_keys, hash, ix);
1699 assert(hashpos >= 0);
1700
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001701 if (res > 0)
Antoine Pitrouc06ae202016-12-27 14:34:54 +01001702 return delitem_common(mp, hashpos, ix, old_value);
Antoine Pitroue10ca3a2016-12-27 14:19:20 +01001703 else
1704 return 0;
1705}
1706
1707
Guido van Rossum25831651993-05-19 14:50:45 +00001708void
Tim Peters1f5871e2000-07-04 17:44:48 +00001709PyDict_Clear(PyObject *op)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001710{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001711 PyDictObject *mp;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001712 PyDictKeysObject *oldkeys;
1713 PyObject **oldvalues;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001714 Py_ssize_t i, n;
Tim Petersdea48ec2001-05-22 20:40:22 +00001715
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001716 if (!PyDict_Check(op))
1717 return;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001718 mp = ((PyDictObject *)op);
1719 oldkeys = mp->ma_keys;
1720 oldvalues = mp->ma_values;
1721 if (oldvalues == empty_values)
1722 return;
1723 /* Empty the dict... */
INADA Naokia7576492018-11-14 18:39:27 +09001724 dictkeys_incref(Py_EMPTY_KEYS);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001725 mp->ma_keys = Py_EMPTY_KEYS;
1726 mp->ma_values = empty_values;
1727 mp->ma_used = 0;
Victor Stinner3b6a6b42016-09-08 12:51:24 -07001728 mp->ma_version_tag = DICT_NEXT_VERSION();
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001729 /* ...then clear the keys and values */
1730 if (oldvalues != NULL) {
Victor Stinner742da042016-09-07 17:40:12 -07001731 n = oldkeys->dk_nentries;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001732 for (i = 0; i < n; i++)
1733 Py_CLEAR(oldvalues[i]);
1734 free_values(oldvalues);
INADA Naokia7576492018-11-14 18:39:27 +09001735 dictkeys_decref(oldkeys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001736 }
1737 else {
1738 assert(oldkeys->dk_refcnt == 1);
INADA Naokia7576492018-11-14 18:39:27 +09001739 dictkeys_decref(oldkeys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001740 }
Victor Stinner0fc91ee2019-04-12 21:51:34 +02001741 ASSERT_CONSISTENT(mp);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001742}
1743
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001744/* Internal version of PyDict_Next that returns a hash value in addition
1745 * to the key and value.
1746 * Return 1 on success, return 0 when the reached the end of the dictionary
1747 * (or if op is not a dictionary)
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001748 */
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001749int
1750_PyDict_Next(PyObject *op, Py_ssize_t *ppos, PyObject **pkey,
1751 PyObject **pvalue, Py_hash_t *phash)
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001752{
INADA Naokica2d8be2016-11-04 16:59:10 +09001753 Py_ssize_t i;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001754 PyDictObject *mp;
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001755 PyDictKeyEntry *entry_ptr;
1756 PyObject *value;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001757
1758 if (!PyDict_Check(op))
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001759 return 0;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001760 mp = (PyDictObject *)op;
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001761 i = *ppos;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001762 if (mp->ma_values) {
INADA Naokica2d8be2016-11-04 16:59:10 +09001763 if (i < 0 || i >= mp->ma_used)
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001764 return 0;
INADA Naokica2d8be2016-11-04 16:59:10 +09001765 /* values of split table is always dense */
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001766 entry_ptr = &DK_ENTRIES(mp->ma_keys)[i];
INADA Naokica2d8be2016-11-04 16:59:10 +09001767 value = mp->ma_values[i];
1768 assert(value != NULL);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001769 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001770 else {
INADA Naokica2d8be2016-11-04 16:59:10 +09001771 Py_ssize_t n = mp->ma_keys->dk_nentries;
1772 if (i < 0 || i >= n)
1773 return 0;
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001774 entry_ptr = &DK_ENTRIES(mp->ma_keys)[i];
1775 while (i < n && entry_ptr->me_value == NULL) {
1776 entry_ptr++;
1777 i++;
Victor Stinner742da042016-09-07 17:40:12 -07001778 }
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001779 if (i >= n)
1780 return 0;
1781 value = entry_ptr->me_value;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001782 }
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001783 *ppos = i+1;
1784 if (pkey)
1785 *pkey = entry_ptr->me_key;
1786 if (phash)
1787 *phash = entry_ptr->me_hash;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001788 if (pvalue)
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001789 *pvalue = value;
1790 return 1;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001791}
1792
Tim Peters080c88b2003-02-15 03:01:11 +00001793/*
1794 * Iterate over a dict. Use like so:
1795 *
Thomas Wouters4d70c3d2006-06-08 14:42:34 +00001796 * Py_ssize_t i;
Tim Peters080c88b2003-02-15 03:01:11 +00001797 * PyObject *key, *value;
1798 * i = 0; # important! i should not otherwise be changed by you
Neal Norwitz07323012003-02-15 14:45:12 +00001799 * while (PyDict_Next(yourdict, &i, &key, &value)) {
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001800 * Refer to borrowed references in key and value.
Tim Peters080c88b2003-02-15 03:01:11 +00001801 * }
1802 *
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001803 * Return 1 on success, return 0 when the reached the end of the dictionary
1804 * (or if op is not a dictionary)
1805 *
Tim Peters080c88b2003-02-15 03:01:11 +00001806 * CAUTION: In general, it isn't safe to use PyDict_Next in a loop that
Tim Peters67830702001-03-21 19:23:56 +00001807 * mutates the dict. One exception: it is safe if the loop merely changes
1808 * the values associated with the keys (but doesn't insert new keys or
1809 * delete keys), via PyDict_SetItem().
1810 */
Guido van Rossum25831651993-05-19 14:50:45 +00001811int
Martin v. Löwis18e16552006-02-15 17:27:45 +00001812PyDict_Next(PyObject *op, Py_ssize_t *ppos, PyObject **pkey, PyObject **pvalue)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001813{
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03001814 return _PyDict_Next(op, ppos, pkey, pvalue, NULL);
Thomas Wouterscf297e42007-02-23 15:07:44 +00001815}
1816
Eric Snow96c6af92015-05-29 22:21:39 -06001817/* Internal version of dict.pop(). */
1818PyObject *
Serhiy Storchaka42e1ea92017-01-12 19:12:21 +02001819_PyDict_Pop_KnownHash(PyObject *dict, PyObject *key, Py_hash_t hash, PyObject *deflt)
Eric Snow96c6af92015-05-29 22:21:39 -06001820{
Victor Stinner742da042016-09-07 17:40:12 -07001821 Py_ssize_t ix, hashpos;
Eric Snow96c6af92015-05-29 22:21:39 -06001822 PyObject *old_value, *old_key;
1823 PyDictKeyEntry *ep;
Yury Selivanov684ef2c2016-10-28 19:01:21 -04001824 PyDictObject *mp;
1825
1826 assert(PyDict_Check(dict));
1827 mp = (PyDictObject *)dict;
Eric Snow96c6af92015-05-29 22:21:39 -06001828
1829 if (mp->ma_used == 0) {
1830 if (deflt) {
1831 Py_INCREF(deflt);
1832 return deflt;
1833 }
1834 _PyErr_SetKeyError(key);
1835 return NULL;
1836 }
INADA Naoki778928b2017-08-03 23:45:15 +09001837 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &old_value);
Victor Stinner742da042016-09-07 17:40:12 -07001838 if (ix == DKIX_ERROR)
Eric Snow96c6af92015-05-29 22:21:39 -06001839 return NULL;
INADA Naokiba609772016-12-07 20:41:42 +09001840 if (ix == DKIX_EMPTY || old_value == NULL) {
Eric Snow96c6af92015-05-29 22:21:39 -06001841 if (deflt) {
1842 Py_INCREF(deflt);
1843 return deflt;
1844 }
1845 _PyErr_SetKeyError(key);
1846 return NULL;
1847 }
Victor Stinner3b6a6b42016-09-08 12:51:24 -07001848
Victor Stinner78601a32016-09-09 19:28:36 -07001849 // Split table doesn't allow deletion. Combine it.
1850 if (_PyDict_HasSplitTable(mp)) {
1851 if (dictresize(mp, DK_SIZE(mp->ma_keys))) {
1852 return NULL;
1853 }
INADA Naoki778928b2017-08-03 23:45:15 +09001854 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &old_value);
Victor Stinner78601a32016-09-09 19:28:36 -07001855 assert(ix >= 0);
1856 }
1857
INADA Naoki778928b2017-08-03 23:45:15 +09001858 hashpos = lookdict_index(mp->ma_keys, hash, ix);
1859 assert(hashpos >= 0);
Victor Stinner78601a32016-09-09 19:28:36 -07001860 assert(old_value != NULL);
Eric Snow96c6af92015-05-29 22:21:39 -06001861 mp->ma_used--;
Victor Stinner3b6a6b42016-09-08 12:51:24 -07001862 mp->ma_version_tag = DICT_NEXT_VERSION();
INADA Naokia7576492018-11-14 18:39:27 +09001863 dictkeys_set_index(mp->ma_keys, hashpos, DKIX_DUMMY);
Victor Stinner78601a32016-09-09 19:28:36 -07001864 ep = &DK_ENTRIES(mp->ma_keys)[ix];
1865 ENSURE_ALLOWS_DELETIONS(mp);
1866 old_key = ep->me_key;
1867 ep->me_key = NULL;
INADA Naokiba609772016-12-07 20:41:42 +09001868 ep->me_value = NULL;
Victor Stinner78601a32016-09-09 19:28:36 -07001869 Py_DECREF(old_key);
Victor Stinner611b0fa2016-09-14 15:02:01 +02001870
Victor Stinner0fc91ee2019-04-12 21:51:34 +02001871 ASSERT_CONSISTENT(mp);
Eric Snow96c6af92015-05-29 22:21:39 -06001872 return old_value;
1873}
1874
Serhiy Storchaka67796522017-01-12 18:34:33 +02001875PyObject *
Serhiy Storchaka42e1ea92017-01-12 19:12:21 +02001876_PyDict_Pop(PyObject *dict, PyObject *key, PyObject *deflt)
Serhiy Storchaka67796522017-01-12 18:34:33 +02001877{
1878 Py_hash_t hash;
1879
Serhiy Storchaka42e1ea92017-01-12 19:12:21 +02001880 if (((PyDictObject *)dict)->ma_used == 0) {
Serhiy Storchaka67796522017-01-12 18:34:33 +02001881 if (deflt) {
1882 Py_INCREF(deflt);
1883 return deflt;
1884 }
1885 _PyErr_SetKeyError(key);
1886 return NULL;
1887 }
1888 if (!PyUnicode_CheckExact(key) ||
1889 (hash = ((PyASCIIObject *) key)->hash) == -1) {
1890 hash = PyObject_Hash(key);
1891 if (hash == -1)
1892 return NULL;
1893 }
Serhiy Storchaka42e1ea92017-01-12 19:12:21 +02001894 return _PyDict_Pop_KnownHash(dict, key, hash, deflt);
Serhiy Storchaka67796522017-01-12 18:34:33 +02001895}
1896
Eric Snow96c6af92015-05-29 22:21:39 -06001897/* Internal version of dict.from_keys(). It is subclass-friendly. */
1898PyObject *
1899_PyDict_FromKeys(PyObject *cls, PyObject *iterable, PyObject *value)
1900{
1901 PyObject *it; /* iter(iterable) */
1902 PyObject *key;
1903 PyObject *d;
1904 int status;
1905
Victor Stinnera5ed5f02016-12-06 18:45:50 +01001906 d = _PyObject_CallNoArg(cls);
Eric Snow96c6af92015-05-29 22:21:39 -06001907 if (d == NULL)
1908 return NULL;
1909
1910 if (PyDict_CheckExact(d) && ((PyDictObject *)d)->ma_used == 0) {
1911 if (PyDict_CheckExact(iterable)) {
1912 PyDictObject *mp = (PyDictObject *)d;
1913 PyObject *oldvalue;
1914 Py_ssize_t pos = 0;
1915 PyObject *key;
1916 Py_hash_t hash;
1917
Serhiy Storchakac61ac162017-03-21 08:52:38 +02001918 if (dictresize(mp, ESTIMATE_SIZE(PyDict_GET_SIZE(iterable)))) {
Eric Snow96c6af92015-05-29 22:21:39 -06001919 Py_DECREF(d);
1920 return NULL;
1921 }
1922
1923 while (_PyDict_Next(iterable, &pos, &key, &oldvalue, &hash)) {
1924 if (insertdict(mp, key, hash, value)) {
1925 Py_DECREF(d);
1926 return NULL;
1927 }
1928 }
1929 return d;
1930 }
1931 if (PyAnySet_CheckExact(iterable)) {
1932 PyDictObject *mp = (PyDictObject *)d;
1933 Py_ssize_t pos = 0;
1934 PyObject *key;
1935 Py_hash_t hash;
1936
Victor Stinner742da042016-09-07 17:40:12 -07001937 if (dictresize(mp, ESTIMATE_SIZE(PySet_GET_SIZE(iterable)))) {
Eric Snow96c6af92015-05-29 22:21:39 -06001938 Py_DECREF(d);
1939 return NULL;
1940 }
1941
1942 while (_PySet_NextEntry(iterable, &pos, &key, &hash)) {
1943 if (insertdict(mp, key, hash, value)) {
1944 Py_DECREF(d);
1945 return NULL;
1946 }
1947 }
1948 return d;
1949 }
1950 }
1951
1952 it = PyObject_GetIter(iterable);
1953 if (it == NULL){
1954 Py_DECREF(d);
1955 return NULL;
1956 }
1957
1958 if (PyDict_CheckExact(d)) {
1959 while ((key = PyIter_Next(it)) != NULL) {
1960 status = PyDict_SetItem(d, key, value);
1961 Py_DECREF(key);
1962 if (status < 0)
1963 goto Fail;
1964 }
1965 } else {
1966 while ((key = PyIter_Next(it)) != NULL) {
1967 status = PyObject_SetItem(d, key, value);
1968 Py_DECREF(key);
1969 if (status < 0)
1970 goto Fail;
1971 }
1972 }
1973
1974 if (PyErr_Occurred())
1975 goto Fail;
1976 Py_DECREF(it);
1977 return d;
1978
1979Fail:
1980 Py_DECREF(it);
1981 Py_DECREF(d);
1982 return NULL;
1983}
1984
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001985/* Methods */
1986
1987static void
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001988dict_dealloc(PyDictObject *mp)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00001989{
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001990 PyObject **values = mp->ma_values;
1991 PyDictKeysObject *keys = mp->ma_keys;
1992 Py_ssize_t i, n;
INADA Naokia6296d32017-08-24 14:55:17 +09001993
1994 /* bpo-31095: UnTrack is needed before calling any callbacks */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00001995 PyObject_GC_UnTrack(mp);
Jeroen Demeyer351c6742019-05-10 19:21:11 +02001996 Py_TRASHCAN_BEGIN(mp, dict_dealloc)
Benjamin Peterson7d95e402012-04-23 11:24:50 -04001997 if (values != NULL) {
1998 if (values != empty_values) {
Victor Stinner742da042016-09-07 17:40:12 -07001999 for (i = 0, n = mp->ma_keys->dk_nentries; i < n; i++) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002000 Py_XDECREF(values[i]);
2001 }
2002 free_values(values);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002003 }
INADA Naokia7576492018-11-14 18:39:27 +09002004 dictkeys_decref(keys);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002005 }
Victor Stinnerac2a4fe2013-07-16 22:19:00 +02002006 else if (keys != NULL) {
Antoine Pitrou2d169b22012-05-12 23:43:44 +02002007 assert(keys->dk_refcnt == 1);
INADA Naokia7576492018-11-14 18:39:27 +09002008 dictkeys_decref(keys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002009 }
Dong-hee Na1b55b652020-02-17 19:09:15 +09002010 if (numfree < PyDict_MAXFREELIST && Py_IS_TYPE(mp, &PyDict_Type))
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002011 free_list[numfree++] = mp;
2012 else
2013 Py_TYPE(mp)->tp_free((PyObject *)mp);
Jeroen Demeyer351c6742019-05-10 19:21:11 +02002014 Py_TRASHCAN_END
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002015}
2016
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002017
Guido van Rossumc0b618a1997-05-02 03:12:38 +00002018static PyObject *
Guido van Rossum8ce8a782007-11-01 19:42:39 +00002019dict_repr(PyDictObject *mp)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002020{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002021 Py_ssize_t i;
Victor Stinnerf91929b2013-11-19 13:07:38 +01002022 PyObject *key = NULL, *value = NULL;
2023 _PyUnicodeWriter writer;
2024 int first;
Guido van Rossum255443b1998-04-10 22:47:14 +00002025
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002026 i = Py_ReprEnter((PyObject *)mp);
2027 if (i != 0) {
2028 return i > 0 ? PyUnicode_FromString("{...}") : NULL;
2029 }
Guido van Rossum255443b1998-04-10 22:47:14 +00002030
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002031 if (mp->ma_used == 0) {
Victor Stinnerf91929b2013-11-19 13:07:38 +01002032 Py_ReprLeave((PyObject *)mp);
2033 return PyUnicode_FromString("{}");
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002034 }
Tim Petersa7259592001-06-16 05:11:17 +00002035
Victor Stinnerf91929b2013-11-19 13:07:38 +01002036 _PyUnicodeWriter_Init(&writer);
2037 writer.overallocate = 1;
2038 /* "{" + "1: 2" + ", 3: 4" * (len - 1) + "}" */
2039 writer.min_length = 1 + 4 + (2 + 4) * (mp->ma_used - 1) + 1;
Tim Petersa7259592001-06-16 05:11:17 +00002040
Victor Stinnerf91929b2013-11-19 13:07:38 +01002041 if (_PyUnicodeWriter_WriteChar(&writer, '{') < 0)
2042 goto error;
Tim Petersa7259592001-06-16 05:11:17 +00002043
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002044 /* Do repr() on each key+value pair, and insert ": " between them.
2045 Note that repr may mutate the dict. */
2046 i = 0;
Victor Stinnerf91929b2013-11-19 13:07:38 +01002047 first = 1;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002048 while (PyDict_Next((PyObject *)mp, &i, &key, &value)) {
Victor Stinnerf91929b2013-11-19 13:07:38 +01002049 PyObject *s;
2050 int res;
2051
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002052 /* Prevent repr from deleting key or value during key format. */
2053 Py_INCREF(key);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002054 Py_INCREF(value);
Victor Stinnerf97dfd72013-07-18 01:00:45 +02002055
Victor Stinnerf91929b2013-11-19 13:07:38 +01002056 if (!first) {
2057 if (_PyUnicodeWriter_WriteASCIIString(&writer, ", ", 2) < 0)
2058 goto error;
2059 }
2060 first = 0;
2061
2062 s = PyObject_Repr(key);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002063 if (s == NULL)
Victor Stinnerf91929b2013-11-19 13:07:38 +01002064 goto error;
2065 res = _PyUnicodeWriter_WriteStr(&writer, s);
2066 Py_DECREF(s);
2067 if (res < 0)
2068 goto error;
2069
2070 if (_PyUnicodeWriter_WriteASCIIString(&writer, ": ", 2) < 0)
2071 goto error;
2072
2073 s = PyObject_Repr(value);
2074 if (s == NULL)
2075 goto error;
2076 res = _PyUnicodeWriter_WriteStr(&writer, s);
2077 Py_DECREF(s);
2078 if (res < 0)
2079 goto error;
2080
2081 Py_CLEAR(key);
2082 Py_CLEAR(value);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002083 }
Tim Petersa7259592001-06-16 05:11:17 +00002084
Victor Stinnerf91929b2013-11-19 13:07:38 +01002085 writer.overallocate = 0;
2086 if (_PyUnicodeWriter_WriteChar(&writer, '}') < 0)
2087 goto error;
Tim Petersa7259592001-06-16 05:11:17 +00002088
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002089 Py_ReprLeave((PyObject *)mp);
Victor Stinnerf91929b2013-11-19 13:07:38 +01002090
2091 return _PyUnicodeWriter_Finish(&writer);
2092
2093error:
2094 Py_ReprLeave((PyObject *)mp);
2095 _PyUnicodeWriter_Dealloc(&writer);
2096 Py_XDECREF(key);
2097 Py_XDECREF(value);
2098 return NULL;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002099}
2100
Martin v. Löwis18e16552006-02-15 17:27:45 +00002101static Py_ssize_t
Guido van Rossum8ce8a782007-11-01 19:42:39 +00002102dict_length(PyDictObject *mp)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002103{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002104 return mp->ma_used;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002105}
2106
Guido van Rossumc0b618a1997-05-02 03:12:38 +00002107static PyObject *
Antoine Pitrou9ed5f272013-08-13 20:18:52 +02002108dict_subscript(PyDictObject *mp, PyObject *key)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002109{
Victor Stinner742da042016-09-07 17:40:12 -07002110 Py_ssize_t ix;
Benjamin Peterson8f67d082010-10-17 20:54:53 +00002111 Py_hash_t hash;
INADA Naokiba609772016-12-07 20:41:42 +09002112 PyObject *value;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002113
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002114 if (!PyUnicode_CheckExact(key) ||
Martin v. Löwisd63a3b82011-09-28 07:41:54 +02002115 (hash = ((PyASCIIObject *) key)->hash) == -1) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002116 hash = PyObject_Hash(key);
2117 if (hash == -1)
2118 return NULL;
2119 }
INADA Naoki778928b2017-08-03 23:45:15 +09002120 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &value);
Victor Stinner742da042016-09-07 17:40:12 -07002121 if (ix == DKIX_ERROR)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002122 return NULL;
INADA Naokiba609772016-12-07 20:41:42 +09002123 if (ix == DKIX_EMPTY || value == NULL) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002124 if (!PyDict_CheckExact(mp)) {
2125 /* Look up __missing__ method if we're a subclass. */
2126 PyObject *missing, *res;
Benjamin Petersonce798522012-01-22 11:24:29 -05002127 _Py_IDENTIFIER(__missing__);
2128 missing = _PyObject_LookupSpecial((PyObject *)mp, &PyId___missing__);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002129 if (missing != NULL) {
Petr Viktorinffd97532020-02-11 17:46:57 +01002130 res = PyObject_CallOneArg(missing, key);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002131 Py_DECREF(missing);
2132 return res;
2133 }
2134 else if (PyErr_Occurred())
2135 return NULL;
2136 }
Raymond Hettinger69492da2013-09-02 15:59:26 -07002137 _PyErr_SetKeyError(key);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002138 return NULL;
2139 }
INADA Naokiba609772016-12-07 20:41:42 +09002140 Py_INCREF(value);
2141 return value;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002142}
2143
2144static int
Guido van Rossum8ce8a782007-11-01 19:42:39 +00002145dict_ass_sub(PyDictObject *mp, PyObject *v, PyObject *w)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002146{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002147 if (w == NULL)
2148 return PyDict_DelItem((PyObject *)mp, v);
2149 else
2150 return PyDict_SetItem((PyObject *)mp, v, w);
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002151}
2152
Guido van Rossuma9e7a811997-05-13 21:02:11 +00002153static PyMappingMethods dict_as_mapping = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002154 (lenfunc)dict_length, /*mp_length*/
2155 (binaryfunc)dict_subscript, /*mp_subscript*/
2156 (objobjargproc)dict_ass_sub, /*mp_ass_subscript*/
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002157};
2158
Guido van Rossumc0b618a1997-05-02 03:12:38 +00002159static PyObject *
Antoine Pitrou9ed5f272013-08-13 20:18:52 +02002160dict_keys(PyDictObject *mp)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002161{
Antoine Pitrou9ed5f272013-08-13 20:18:52 +02002162 PyObject *v;
2163 Py_ssize_t i, j;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002164 PyDictKeyEntry *ep;
Cheryl Sabellaf66e3362019-04-05 06:08:43 -04002165 Py_ssize_t n, offset;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002166 PyObject **value_ptr;
Guido van Rossuma4dd0112001-04-15 22:16:26 +00002167
Guido van Rossuma4dd0112001-04-15 22:16:26 +00002168 again:
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002169 n = mp->ma_used;
2170 v = PyList_New(n);
2171 if (v == NULL)
2172 return NULL;
2173 if (n != mp->ma_used) {
2174 /* Durnit. The allocations caused the dict to resize.
2175 * Just start over, this shouldn't normally happen.
2176 */
2177 Py_DECREF(v);
2178 goto again;
2179 }
Victor Stinner742da042016-09-07 17:40:12 -07002180 ep = DK_ENTRIES(mp->ma_keys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002181 if (mp->ma_values) {
2182 value_ptr = mp->ma_values;
2183 offset = sizeof(PyObject *);
2184 }
2185 else {
2186 value_ptr = &ep[0].me_value;
2187 offset = sizeof(PyDictKeyEntry);
2188 }
Cheryl Sabellaf66e3362019-04-05 06:08:43 -04002189 for (i = 0, j = 0; j < n; i++) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002190 if (*value_ptr != NULL) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002191 PyObject *key = ep[i].me_key;
2192 Py_INCREF(key);
2193 PyList_SET_ITEM(v, j, key);
2194 j++;
2195 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002196 value_ptr = (PyObject **)(((char *)value_ptr) + offset);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002197 }
2198 assert(j == n);
2199 return v;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002200}
2201
Guido van Rossumc0b618a1997-05-02 03:12:38 +00002202static PyObject *
Antoine Pitrou9ed5f272013-08-13 20:18:52 +02002203dict_values(PyDictObject *mp)
Guido van Rossum25831651993-05-19 14:50:45 +00002204{
Antoine Pitrou9ed5f272013-08-13 20:18:52 +02002205 PyObject *v;
2206 Py_ssize_t i, j;
Benjamin Petersonf0acae22016-09-08 09:50:08 -07002207 PyDictKeyEntry *ep;
Cheryl Sabellaf66e3362019-04-05 06:08:43 -04002208 Py_ssize_t n, offset;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002209 PyObject **value_ptr;
Guido van Rossuma4dd0112001-04-15 22:16:26 +00002210
Guido van Rossuma4dd0112001-04-15 22:16:26 +00002211 again:
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002212 n = mp->ma_used;
2213 v = PyList_New(n);
2214 if (v == NULL)
2215 return NULL;
2216 if (n != mp->ma_used) {
2217 /* Durnit. The allocations caused the dict to resize.
2218 * Just start over, this shouldn't normally happen.
2219 */
2220 Py_DECREF(v);
2221 goto again;
2222 }
Benjamin Petersonf0acae22016-09-08 09:50:08 -07002223 ep = DK_ENTRIES(mp->ma_keys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002224 if (mp->ma_values) {
2225 value_ptr = mp->ma_values;
2226 offset = sizeof(PyObject *);
2227 }
2228 else {
Benjamin Petersonf0acae22016-09-08 09:50:08 -07002229 value_ptr = &ep[0].me_value;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002230 offset = sizeof(PyDictKeyEntry);
2231 }
Cheryl Sabellaf66e3362019-04-05 06:08:43 -04002232 for (i = 0, j = 0; j < n; i++) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002233 PyObject *value = *value_ptr;
2234 value_ptr = (PyObject **)(((char *)value_ptr) + offset);
2235 if (value != NULL) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002236 Py_INCREF(value);
2237 PyList_SET_ITEM(v, j, value);
2238 j++;
2239 }
2240 }
2241 assert(j == n);
2242 return v;
Guido van Rossum25831651993-05-19 14:50:45 +00002243}
2244
Guido van Rossumc0b618a1997-05-02 03:12:38 +00002245static PyObject *
Antoine Pitrou9ed5f272013-08-13 20:18:52 +02002246dict_items(PyDictObject *mp)
Guido van Rossum25831651993-05-19 14:50:45 +00002247{
Antoine Pitrou9ed5f272013-08-13 20:18:52 +02002248 PyObject *v;
2249 Py_ssize_t i, j, n;
Cheryl Sabellaf66e3362019-04-05 06:08:43 -04002250 Py_ssize_t offset;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002251 PyObject *item, *key;
2252 PyDictKeyEntry *ep;
2253 PyObject **value_ptr;
Guido van Rossuma4dd0112001-04-15 22:16:26 +00002254
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002255 /* Preallocate the list of tuples, to avoid allocations during
2256 * the loop over the items, which could trigger GC, which
2257 * could resize the dict. :-(
2258 */
Guido van Rossuma4dd0112001-04-15 22:16:26 +00002259 again:
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002260 n = mp->ma_used;
2261 v = PyList_New(n);
2262 if (v == NULL)
2263 return NULL;
2264 for (i = 0; i < n; i++) {
2265 item = PyTuple_New(2);
2266 if (item == NULL) {
2267 Py_DECREF(v);
2268 return NULL;
2269 }
2270 PyList_SET_ITEM(v, i, item);
2271 }
2272 if (n != mp->ma_used) {
2273 /* Durnit. The allocations caused the dict to resize.
2274 * Just start over, this shouldn't normally happen.
2275 */
2276 Py_DECREF(v);
2277 goto again;
2278 }
2279 /* Nothing we do below makes any function calls. */
Victor Stinner742da042016-09-07 17:40:12 -07002280 ep = DK_ENTRIES(mp->ma_keys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002281 if (mp->ma_values) {
2282 value_ptr = mp->ma_values;
2283 offset = sizeof(PyObject *);
2284 }
2285 else {
2286 value_ptr = &ep[0].me_value;
2287 offset = sizeof(PyDictKeyEntry);
2288 }
Cheryl Sabellaf66e3362019-04-05 06:08:43 -04002289 for (i = 0, j = 0; j < n; i++) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002290 PyObject *value = *value_ptr;
2291 value_ptr = (PyObject **)(((char *)value_ptr) + offset);
2292 if (value != NULL) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002293 key = ep[i].me_key;
2294 item = PyList_GET_ITEM(v, j);
2295 Py_INCREF(key);
2296 PyTuple_SET_ITEM(item, 0, key);
2297 Py_INCREF(value);
2298 PyTuple_SET_ITEM(item, 1, value);
2299 j++;
2300 }
2301 }
2302 assert(j == n);
2303 return v;
Guido van Rossum25831651993-05-19 14:50:45 +00002304}
2305
Larry Hastings5c661892014-01-24 06:17:25 -08002306/*[clinic input]
2307@classmethod
2308dict.fromkeys
Larry Hastings5c661892014-01-24 06:17:25 -08002309 iterable: object
2310 value: object=None
2311 /
2312
Serhiy Storchaka78d9e582017-01-25 00:30:04 +02002313Create a new dictionary with keys from iterable and values set to value.
Larry Hastings5c661892014-01-24 06:17:25 -08002314[clinic start generated code]*/
2315
Larry Hastings5c661892014-01-24 06:17:25 -08002316static PyObject *
2317dict_fromkeys_impl(PyTypeObject *type, PyObject *iterable, PyObject *value)
Serhiy Storchaka78d9e582017-01-25 00:30:04 +02002318/*[clinic end generated code: output=8fb98e4b10384999 input=382ba4855d0f74c3]*/
Larry Hastings5c661892014-01-24 06:17:25 -08002319{
Eric Snow96c6af92015-05-29 22:21:39 -06002320 return _PyDict_FromKeys((PyObject *)type, iterable, value);
Raymond Hettingere33d3df2002-11-27 07:29:33 +00002321}
2322
Raymond Hettinger31017ae2004-03-04 08:25:44 +00002323static int
Victor Stinner742da042016-09-07 17:40:12 -07002324dict_update_common(PyObject *self, PyObject *args, PyObject *kwds,
2325 const char *methname)
Guido van Rossume3f5b9c1997-05-28 19:15:28 +00002326{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002327 PyObject *arg = NULL;
2328 int result = 0;
Raymond Hettinger31017ae2004-03-04 08:25:44 +00002329
Serhiy Storchaka60c3d352017-11-11 16:19:56 +02002330 if (!PyArg_UnpackTuple(args, methname, 0, 1, &arg)) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002331 result = -1;
Serhiy Storchaka60c3d352017-11-11 16:19:56 +02002332 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002333 else if (arg != NULL) {
Serhiy Storchakaf163aea2019-09-25 09:47:00 +03002334 if (PyDict_CheckExact(arg)) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002335 result = PyDict_Merge(self, arg, 1);
Serhiy Storchaka60c3d352017-11-11 16:19:56 +02002336 }
Serhiy Storchaka60c3d352017-11-11 16:19:56 +02002337 else {
Serhiy Storchakaf163aea2019-09-25 09:47:00 +03002338 _Py_IDENTIFIER(keys);
2339 PyObject *func;
2340 if (_PyObject_LookupAttrId(arg, &PyId_keys, &func) < 0) {
2341 result = -1;
2342 }
2343 else if (func != NULL) {
2344 Py_DECREF(func);
2345 result = PyDict_Merge(self, arg, 1);
2346 }
2347 else {
2348 result = PyDict_MergeFromSeq2(self, arg, 1);
2349 }
Serhiy Storchaka60c3d352017-11-11 16:19:56 +02002350 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002351 }
Serhiy Storchaka60c3d352017-11-11 16:19:56 +02002352
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002353 if (result == 0 && kwds != NULL) {
2354 if (PyArg_ValidateKeywordArguments(kwds))
2355 result = PyDict_Merge(self, kwds, 1);
2356 else
2357 result = -1;
2358 }
2359 return result;
Raymond Hettinger31017ae2004-03-04 08:25:44 +00002360}
2361
Victor Stinner91f0d4a2017-01-19 12:45:06 +01002362/* Note: dict.update() uses the METH_VARARGS|METH_KEYWORDS calling convention.
Serhiy Storchaka6969eaf2017-07-03 21:20:15 +03002363 Using METH_FASTCALL|METH_KEYWORDS would make dict.update(**dict2) calls
2364 slower, see the issue #29312. */
Raymond Hettinger31017ae2004-03-04 08:25:44 +00002365static PyObject *
2366dict_update(PyObject *self, PyObject *args, PyObject *kwds)
2367{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002368 if (dict_update_common(self, args, kwds, "update") != -1)
2369 Py_RETURN_NONE;
2370 return NULL;
Tim Peters6d6c1a32001-08-02 04:15:00 +00002371}
2372
Guido van Rossum05ac6de2001-08-10 20:28:28 +00002373/* Update unconditionally replaces existing items.
2374 Merge has a 3rd argument 'override'; if set, it acts like Update,
Tim Peters1fc240e2001-10-26 05:06:50 +00002375 otherwise it leaves existing items unchanged.
2376
2377 PyDict_{Update,Merge} update/merge from a mapping object.
2378
Tim Petersf582b822001-12-11 18:51:08 +00002379 PyDict_MergeFromSeq2 updates/merges from any iterable object
Tim Peters1fc240e2001-10-26 05:06:50 +00002380 producing iterable objects of length 2.
2381*/
2382
Tim Petersf582b822001-12-11 18:51:08 +00002383int
Tim Peters1fc240e2001-10-26 05:06:50 +00002384PyDict_MergeFromSeq2(PyObject *d, PyObject *seq2, int override)
2385{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002386 PyObject *it; /* iter(seq2) */
2387 Py_ssize_t i; /* index into seq2 of current element */
2388 PyObject *item; /* seq2[i] */
2389 PyObject *fast; /* item as a 2-tuple or 2-list */
Tim Peters1fc240e2001-10-26 05:06:50 +00002390
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002391 assert(d != NULL);
2392 assert(PyDict_Check(d));
2393 assert(seq2 != NULL);
Tim Peters1fc240e2001-10-26 05:06:50 +00002394
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002395 it = PyObject_GetIter(seq2);
2396 if (it == NULL)
2397 return -1;
Tim Peters1fc240e2001-10-26 05:06:50 +00002398
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002399 for (i = 0; ; ++i) {
2400 PyObject *key, *value;
2401 Py_ssize_t n;
Tim Peters1fc240e2001-10-26 05:06:50 +00002402
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002403 fast = NULL;
2404 item = PyIter_Next(it);
2405 if (item == NULL) {
2406 if (PyErr_Occurred())
2407 goto Fail;
2408 break;
2409 }
Tim Peters1fc240e2001-10-26 05:06:50 +00002410
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002411 /* Convert item to sequence, and verify length 2. */
2412 fast = PySequence_Fast(item, "");
2413 if (fast == NULL) {
2414 if (PyErr_ExceptionMatches(PyExc_TypeError))
2415 PyErr_Format(PyExc_TypeError,
2416 "cannot convert dictionary update "
2417 "sequence element #%zd to a sequence",
2418 i);
2419 goto Fail;
2420 }
2421 n = PySequence_Fast_GET_SIZE(fast);
2422 if (n != 2) {
2423 PyErr_Format(PyExc_ValueError,
2424 "dictionary update sequence element #%zd "
2425 "has length %zd; 2 is required",
2426 i, n);
2427 goto Fail;
2428 }
Tim Peters1fc240e2001-10-26 05:06:50 +00002429
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002430 /* Update/merge with this (key, value) pair. */
2431 key = PySequence_Fast_GET_ITEM(fast, 0);
2432 value = PySequence_Fast_GET_ITEM(fast, 1);
Serhiy Storchaka753bca32017-05-20 12:30:02 +03002433 Py_INCREF(key);
2434 Py_INCREF(value);
Serhiy Storchakaa24107b2019-02-25 17:59:46 +02002435 if (override) {
2436 if (PyDict_SetItem(d, key, value) < 0) {
Serhiy Storchaka753bca32017-05-20 12:30:02 +03002437 Py_DECREF(key);
2438 Py_DECREF(value);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002439 goto Fail;
Serhiy Storchaka753bca32017-05-20 12:30:02 +03002440 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002441 }
Serhiy Storchakaa24107b2019-02-25 17:59:46 +02002442 else if (PyDict_GetItemWithError(d, key) == NULL) {
2443 if (PyErr_Occurred() || PyDict_SetItem(d, key, value) < 0) {
2444 Py_DECREF(key);
2445 Py_DECREF(value);
2446 goto Fail;
2447 }
2448 }
2449
Serhiy Storchaka753bca32017-05-20 12:30:02 +03002450 Py_DECREF(key);
2451 Py_DECREF(value);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002452 Py_DECREF(fast);
2453 Py_DECREF(item);
2454 }
Tim Peters1fc240e2001-10-26 05:06:50 +00002455
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002456 i = 0;
Victor Stinner0fc91ee2019-04-12 21:51:34 +02002457 ASSERT_CONSISTENT(d);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002458 goto Return;
Tim Peters1fc240e2001-10-26 05:06:50 +00002459Fail:
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002460 Py_XDECREF(item);
2461 Py_XDECREF(fast);
2462 i = -1;
Tim Peters1fc240e2001-10-26 05:06:50 +00002463Return:
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002464 Py_DECREF(it);
2465 return Py_SAFE_DOWNCAST(i, Py_ssize_t, int);
Tim Peters1fc240e2001-10-26 05:06:50 +00002466}
2467
doko@ubuntu.comc96df682016-10-11 08:04:02 +02002468static int
Serhiy Storchakae036ef82016-10-02 11:06:43 +03002469dict_merge(PyObject *a, PyObject *b, int override)
Guido van Rossum05ac6de2001-08-10 20:28:28 +00002470{
Antoine Pitrou9ed5f272013-08-13 20:18:52 +02002471 PyDictObject *mp, *other;
2472 Py_ssize_t i, n;
Victor Stinner742da042016-09-07 17:40:12 -07002473 PyDictKeyEntry *entry, *ep0;
Tim Peters6d6c1a32001-08-02 04:15:00 +00002474
Serhiy Storchakae036ef82016-10-02 11:06:43 +03002475 assert(0 <= override && override <= 2);
2476
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002477 /* We accept for the argument either a concrete dictionary object,
2478 * or an abstract "mapping" object. For the former, we can do
2479 * things quite efficiently. For the latter, we only require that
2480 * PyMapping_Keys() and PyObject_GetItem() be supported.
2481 */
2482 if (a == NULL || !PyDict_Check(a) || b == NULL) {
2483 PyErr_BadInternalCall();
2484 return -1;
2485 }
2486 mp = (PyDictObject*)a;
INADA Naoki2aaf98c2018-09-26 12:59:00 +09002487 if (PyDict_Check(b) && (Py_TYPE(b)->tp_iter == (getiterfunc)dict_iter)) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002488 other = (PyDictObject*)b;
2489 if (other == mp || other->ma_used == 0)
2490 /* a.update(a) or a.update({}); nothing to do */
2491 return 0;
2492 if (mp->ma_used == 0)
2493 /* Since the target dict is empty, PyDict_GetItem()
2494 * always returns NULL. Setting override to 1
2495 * skips the unnecessary test.
2496 */
2497 override = 1;
2498 /* Do one big resize at the start, rather than
2499 * incrementally resizing as we insert new items. Expect
2500 * that there will be no (or few) overlapping keys.
2501 */
INADA Naokib1152be2016-10-27 19:26:50 +09002502 if (USABLE_FRACTION(mp->ma_keys->dk_size) < other->ma_used) {
2503 if (dictresize(mp, ESTIMATE_SIZE(mp->ma_used + other->ma_used))) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002504 return -1;
INADA Naokib1152be2016-10-27 19:26:50 +09002505 }
2506 }
Victor Stinner742da042016-09-07 17:40:12 -07002507 ep0 = DK_ENTRIES(other->ma_keys);
2508 for (i = 0, n = other->ma_keys->dk_nentries; i < n; i++) {
Benjamin Petersona82f77f2015-07-04 19:55:16 -05002509 PyObject *key, *value;
2510 Py_hash_t hash;
Victor Stinner742da042016-09-07 17:40:12 -07002511 entry = &ep0[i];
Benjamin Petersona82f77f2015-07-04 19:55:16 -05002512 key = entry->me_key;
2513 hash = entry->me_hash;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002514 if (other->ma_values)
2515 value = other->ma_values[i];
2516 else
2517 value = entry->me_value;
2518
Benjamin Petersona82f77f2015-07-04 19:55:16 -05002519 if (value != NULL) {
2520 int err = 0;
2521 Py_INCREF(key);
2522 Py_INCREF(value);
Serhiy Storchakaf0b311b2016-11-06 13:18:24 +02002523 if (override == 1)
Benjamin Petersona82f77f2015-07-04 19:55:16 -05002524 err = insertdict(mp, key, hash, value);
Serhiy Storchakaf0b311b2016-11-06 13:18:24 +02002525 else if (_PyDict_GetItem_KnownHash(a, key, hash) == NULL) {
2526 if (PyErr_Occurred()) {
2527 Py_DECREF(value);
2528 Py_DECREF(key);
2529 return -1;
2530 }
2531 err = insertdict(mp, key, hash, value);
2532 }
Serhiy Storchakae036ef82016-10-02 11:06:43 +03002533 else if (override != 0) {
2534 _PyErr_SetKeyError(key);
2535 Py_DECREF(value);
2536 Py_DECREF(key);
2537 return -1;
2538 }
Benjamin Petersona82f77f2015-07-04 19:55:16 -05002539 Py_DECREF(value);
2540 Py_DECREF(key);
2541 if (err != 0)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002542 return -1;
Benjamin Petersona82f77f2015-07-04 19:55:16 -05002543
Victor Stinner742da042016-09-07 17:40:12 -07002544 if (n != other->ma_keys->dk_nentries) {
Benjamin Petersona82f77f2015-07-04 19:55:16 -05002545 PyErr_SetString(PyExc_RuntimeError,
2546 "dict mutated during update");
2547 return -1;
2548 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002549 }
2550 }
2551 }
2552 else {
2553 /* Do it the generic, slower way */
2554 PyObject *keys = PyMapping_Keys(b);
2555 PyObject *iter;
2556 PyObject *key, *value;
2557 int status;
Barry Warsaw66a0d1d2001-06-26 20:08:32 +00002558
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002559 if (keys == NULL)
2560 /* Docstring says this is equivalent to E.keys() so
2561 * if E doesn't have a .keys() method we want
2562 * AttributeError to percolate up. Might as well
2563 * do the same for any other error.
2564 */
2565 return -1;
Barry Warsaw66a0d1d2001-06-26 20:08:32 +00002566
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002567 iter = PyObject_GetIter(keys);
2568 Py_DECREF(keys);
2569 if (iter == NULL)
2570 return -1;
Barry Warsaw66a0d1d2001-06-26 20:08:32 +00002571
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002572 for (key = PyIter_Next(iter); key; key = PyIter_Next(iter)) {
Serhiy Storchakaa24107b2019-02-25 17:59:46 +02002573 if (override != 1) {
2574 if (PyDict_GetItemWithError(a, key) != NULL) {
2575 if (override != 0) {
2576 _PyErr_SetKeyError(key);
2577 Py_DECREF(key);
2578 Py_DECREF(iter);
2579 return -1;
2580 }
2581 Py_DECREF(key);
2582 continue;
2583 }
2584 else if (PyErr_Occurred()) {
Serhiy Storchakae036ef82016-10-02 11:06:43 +03002585 Py_DECREF(key);
2586 Py_DECREF(iter);
2587 return -1;
2588 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002589 }
2590 value = PyObject_GetItem(b, key);
2591 if (value == NULL) {
2592 Py_DECREF(iter);
2593 Py_DECREF(key);
2594 return -1;
2595 }
2596 status = PyDict_SetItem(a, key, value);
2597 Py_DECREF(key);
2598 Py_DECREF(value);
2599 if (status < 0) {
2600 Py_DECREF(iter);
2601 return -1;
2602 }
2603 }
2604 Py_DECREF(iter);
2605 if (PyErr_Occurred())
2606 /* Iterator completed, via error */
2607 return -1;
2608 }
Victor Stinner0fc91ee2019-04-12 21:51:34 +02002609 ASSERT_CONSISTENT(a);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002610 return 0;
Guido van Rossume3f5b9c1997-05-28 19:15:28 +00002611}
2612
Serhiy Storchakae036ef82016-10-02 11:06:43 +03002613int
2614PyDict_Update(PyObject *a, PyObject *b)
2615{
2616 return dict_merge(a, b, 1);
2617}
2618
2619int
2620PyDict_Merge(PyObject *a, PyObject *b, int override)
2621{
2622 /* XXX Deprecate override not in (0, 1). */
2623 return dict_merge(a, b, override != 0);
2624}
2625
2626int
2627_PyDict_MergeEx(PyObject *a, PyObject *b, int override)
2628{
2629 return dict_merge(a, b, override);
2630}
2631
Guido van Rossume3f5b9c1997-05-28 19:15:28 +00002632static PyObject *
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05302633dict_copy(PyDictObject *mp, PyObject *Py_UNUSED(ignored))
Guido van Rossume3f5b9c1997-05-28 19:15:28 +00002634{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002635 return PyDict_Copy((PyObject*)mp);
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00002636}
2637
2638PyObject *
Tim Peters1f5871e2000-07-04 17:44:48 +00002639PyDict_Copy(PyObject *o)
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00002640{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002641 PyObject *copy;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002642 PyDictObject *mp;
2643 Py_ssize_t i, n;
Jeremy Hyltona12c7a72000-03-30 22:27:31 +00002644
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002645 if (o == NULL || !PyDict_Check(o)) {
2646 PyErr_BadInternalCall();
2647 return NULL;
2648 }
Yury Selivanovb0a7a032018-01-22 11:54:41 -05002649
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002650 mp = (PyDictObject *)o;
Yury Selivanovb0a7a032018-01-22 11:54:41 -05002651 if (mp->ma_used == 0) {
2652 /* The dict is empty; just return a new dict. */
2653 return PyDict_New();
2654 }
2655
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002656 if (_PyDict_HasSplitTable(mp)) {
2657 PyDictObject *split_copy;
Victor Stinner742da042016-09-07 17:40:12 -07002658 Py_ssize_t size = USABLE_FRACTION(DK_SIZE(mp->ma_keys));
2659 PyObject **newvalues;
2660 newvalues = new_values(size);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002661 if (newvalues == NULL)
2662 return PyErr_NoMemory();
2663 split_copy = PyObject_GC_New(PyDictObject, &PyDict_Type);
2664 if (split_copy == NULL) {
2665 free_values(newvalues);
2666 return NULL;
2667 }
2668 split_copy->ma_values = newvalues;
2669 split_copy->ma_keys = mp->ma_keys;
2670 split_copy->ma_used = mp->ma_used;
INADA Naokid1c82c52018-04-03 11:43:53 +09002671 split_copy->ma_version_tag = DICT_NEXT_VERSION();
INADA Naokia7576492018-11-14 18:39:27 +09002672 dictkeys_incref(mp->ma_keys);
Victor Stinner742da042016-09-07 17:40:12 -07002673 for (i = 0, n = size; i < n; i++) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002674 PyObject *value = mp->ma_values[i];
2675 Py_XINCREF(value);
2676 split_copy->ma_values[i] = value;
2677 }
Benjamin Peterson7ce67e42012-04-24 10:32:57 -04002678 if (_PyObject_GC_IS_TRACKED(mp))
2679 _PyObject_GC_TRACK(split_copy);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002680 return (PyObject *)split_copy;
2681 }
Yury Selivanovb0a7a032018-01-22 11:54:41 -05002682
2683 if (PyDict_CheckExact(mp) && mp->ma_values == NULL &&
2684 (mp->ma_used >= (mp->ma_keys->dk_nentries * 2) / 3))
2685 {
2686 /* Use fast-copy if:
2687
2688 (1) 'mp' is an instance of a subclassed dict; and
2689
2690 (2) 'mp' is not a split-dict; and
2691
2692 (3) if 'mp' is non-compact ('del' operation does not resize dicts),
2693 do fast-copy only if it has at most 1/3 non-used keys.
2694
Ville Skyttä61f82e02018-04-20 23:08:45 +03002695 The last condition (3) is important to guard against a pathological
Yury Selivanovb0a7a032018-01-22 11:54:41 -05002696 case when a large dict is almost emptied with multiple del/pop
2697 operations and copied after that. In cases like this, we defer to
2698 PyDict_Merge, which produces a compacted copy.
2699 */
2700 return clone_combined_dict(mp);
2701 }
2702
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002703 copy = PyDict_New();
2704 if (copy == NULL)
2705 return NULL;
2706 if (PyDict_Merge(copy, o, 1) == 0)
2707 return copy;
2708 Py_DECREF(copy);
2709 return NULL;
Guido van Rossume3f5b9c1997-05-28 19:15:28 +00002710}
2711
Martin v. Löwis18e16552006-02-15 17:27:45 +00002712Py_ssize_t
Tim Peters1f5871e2000-07-04 17:44:48 +00002713PyDict_Size(PyObject *mp)
Guido van Rossum4199fac1993-11-05 10:18:44 +00002714{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002715 if (mp == NULL || !PyDict_Check(mp)) {
2716 PyErr_BadInternalCall();
2717 return -1;
2718 }
2719 return ((PyDictObject *)mp)->ma_used;
Guido van Rossum4199fac1993-11-05 10:18:44 +00002720}
2721
Guido van Rossumc0b618a1997-05-02 03:12:38 +00002722PyObject *
Tim Peters1f5871e2000-07-04 17:44:48 +00002723PyDict_Keys(PyObject *mp)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002724{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002725 if (mp == NULL || !PyDict_Check(mp)) {
2726 PyErr_BadInternalCall();
2727 return NULL;
2728 }
2729 return dict_keys((PyDictObject *)mp);
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002730}
2731
Guido van Rossumc0b618a1997-05-02 03:12:38 +00002732PyObject *
Tim Peters1f5871e2000-07-04 17:44:48 +00002733PyDict_Values(PyObject *mp)
Guido van Rossum25831651993-05-19 14:50:45 +00002734{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002735 if (mp == NULL || !PyDict_Check(mp)) {
2736 PyErr_BadInternalCall();
2737 return NULL;
2738 }
2739 return dict_values((PyDictObject *)mp);
Guido van Rossum25831651993-05-19 14:50:45 +00002740}
2741
Guido van Rossumc0b618a1997-05-02 03:12:38 +00002742PyObject *
Tim Peters1f5871e2000-07-04 17:44:48 +00002743PyDict_Items(PyObject *mp)
Guido van Rossum25831651993-05-19 14:50:45 +00002744{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002745 if (mp == NULL || !PyDict_Check(mp)) {
2746 PyErr_BadInternalCall();
2747 return NULL;
2748 }
2749 return dict_items((PyDictObject *)mp);
Guido van Rossum25831651993-05-19 14:50:45 +00002750}
2751
Tim Peterse63415e2001-05-08 04:38:29 +00002752/* Return 1 if dicts equal, 0 if not, -1 if error.
2753 * Gets out as soon as any difference is detected.
2754 * Uses only Py_EQ comparison.
2755 */
2756static int
Guido van Rossum8ce8a782007-11-01 19:42:39 +00002757dict_equal(PyDictObject *a, PyDictObject *b)
Tim Peterse63415e2001-05-08 04:38:29 +00002758{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002759 Py_ssize_t i;
Tim Peterse63415e2001-05-08 04:38:29 +00002760
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002761 if (a->ma_used != b->ma_used)
2762 /* can't be equal if # of entries differ */
2763 return 0;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002764 /* Same # of entries -- check all of 'em. Exit early on any diff. */
Victor Stinner742da042016-09-07 17:40:12 -07002765 for (i = 0; i < a->ma_keys->dk_nentries; i++) {
2766 PyDictKeyEntry *ep = &DK_ENTRIES(a->ma_keys)[i];
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002767 PyObject *aval;
2768 if (a->ma_values)
2769 aval = a->ma_values[i];
2770 else
2771 aval = ep->me_value;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002772 if (aval != NULL) {
2773 int cmp;
2774 PyObject *bval;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002775 PyObject *key = ep->me_key;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002776 /* temporarily bump aval's refcount to ensure it stays
2777 alive until we're done with it */
2778 Py_INCREF(aval);
2779 /* ditto for key */
2780 Py_INCREF(key);
Antoine Pitrou0e9958b2012-12-02 19:10:07 +01002781 /* reuse the known hash value */
INADA Naoki778928b2017-08-03 23:45:15 +09002782 b->ma_keys->dk_lookup(b, key, ep->me_hash, &bval);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002783 if (bval == NULL) {
Serhiy Storchaka753bca32017-05-20 12:30:02 +03002784 Py_DECREF(key);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002785 Py_DECREF(aval);
2786 if (PyErr_Occurred())
2787 return -1;
2788 return 0;
2789 }
Dong-hee Na2d5bf562019-12-31 10:04:22 +09002790 Py_INCREF(bval);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002791 cmp = PyObject_RichCompareBool(aval, bval, Py_EQ);
Serhiy Storchaka753bca32017-05-20 12:30:02 +03002792 Py_DECREF(key);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002793 Py_DECREF(aval);
Dong-hee Na2d5bf562019-12-31 10:04:22 +09002794 Py_DECREF(bval);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002795 if (cmp <= 0) /* error or not equal */
2796 return cmp;
2797 }
2798 }
2799 return 1;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002800}
Tim Peterse63415e2001-05-08 04:38:29 +00002801
2802static PyObject *
2803dict_richcompare(PyObject *v, PyObject *w, int op)
2804{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002805 int cmp;
2806 PyObject *res;
Tim Peterse63415e2001-05-08 04:38:29 +00002807
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002808 if (!PyDict_Check(v) || !PyDict_Check(w)) {
2809 res = Py_NotImplemented;
2810 }
2811 else if (op == Py_EQ || op == Py_NE) {
2812 cmp = dict_equal((PyDictObject *)v, (PyDictObject *)w);
2813 if (cmp < 0)
2814 return NULL;
2815 res = (cmp == (op == Py_EQ)) ? Py_True : Py_False;
2816 }
2817 else
2818 res = Py_NotImplemented;
2819 Py_INCREF(res);
2820 return res;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002821}
Tim Peterse63415e2001-05-08 04:38:29 +00002822
Larry Hastings61272b72014-01-07 12:41:53 -08002823/*[clinic input]
Larry Hastings31826802013-10-19 00:09:25 -07002824
2825@coexist
2826dict.__contains__
2827
2828 key: object
2829 /
2830
Serhiy Storchaka78d9e582017-01-25 00:30:04 +02002831True if the dictionary has the specified key, else False.
Larry Hastings61272b72014-01-07 12:41:53 -08002832[clinic start generated code]*/
Larry Hastings31826802013-10-19 00:09:25 -07002833
Guido van Rossumc0b618a1997-05-02 03:12:38 +00002834static PyObject *
Larry Hastingsc2047262014-01-25 20:43:29 -08002835dict___contains__(PyDictObject *self, PyObject *key)
Serhiy Storchaka19d25972017-02-04 08:05:07 +02002836/*[clinic end generated code: output=a3d03db709ed6e6b input=fe1cb42ad831e820]*/
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002837{
Larry Hastingsc2047262014-01-25 20:43:29 -08002838 register PyDictObject *mp = self;
Benjamin Peterson8f67d082010-10-17 20:54:53 +00002839 Py_hash_t hash;
Victor Stinner742da042016-09-07 17:40:12 -07002840 Py_ssize_t ix;
INADA Naokiba609772016-12-07 20:41:42 +09002841 PyObject *value;
Thomas Wouters4d70c3d2006-06-08 14:42:34 +00002842
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002843 if (!PyUnicode_CheckExact(key) ||
Martin v. Löwisd63a3b82011-09-28 07:41:54 +02002844 (hash = ((PyASCIIObject *) key)->hash) == -1) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002845 hash = PyObject_Hash(key);
2846 if (hash == -1)
2847 return NULL;
2848 }
INADA Naoki778928b2017-08-03 23:45:15 +09002849 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &value);
Victor Stinner742da042016-09-07 17:40:12 -07002850 if (ix == DKIX_ERROR)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002851 return NULL;
INADA Naokiba609772016-12-07 20:41:42 +09002852 if (ix == DKIX_EMPTY || value == NULL)
Victor Stinner742da042016-09-07 17:40:12 -07002853 Py_RETURN_FALSE;
2854 Py_RETURN_TRUE;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00002855}
2856
Victor Stinner7dc6a5f2017-01-19 12:37:13 +01002857/*[clinic input]
2858dict.get
2859
2860 key: object
Serhiy Storchaka48088ee2017-01-19 19:00:30 +02002861 default: object = None
Victor Stinner7dc6a5f2017-01-19 12:37:13 +01002862 /
2863
Serhiy Storchaka78d9e582017-01-25 00:30:04 +02002864Return the value for key if key is in the dictionary, else default.
Victor Stinner7dc6a5f2017-01-19 12:37:13 +01002865[clinic start generated code]*/
2866
Guido van Rossumc0b618a1997-05-02 03:12:38 +00002867static PyObject *
Serhiy Storchaka48088ee2017-01-19 19:00:30 +02002868dict_get_impl(PyDictObject *self, PyObject *key, PyObject *default_value)
Serhiy Storchaka78d9e582017-01-25 00:30:04 +02002869/*[clinic end generated code: output=bba707729dee05bf input=279ddb5790b6b107]*/
Barry Warsawc38c5da1997-10-06 17:49:20 +00002870{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002871 PyObject *val = NULL;
Benjamin Peterson8f67d082010-10-17 20:54:53 +00002872 Py_hash_t hash;
Victor Stinner742da042016-09-07 17:40:12 -07002873 Py_ssize_t ix;
Barry Warsawc38c5da1997-10-06 17:49:20 +00002874
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002875 if (!PyUnicode_CheckExact(key) ||
Martin v. Löwisd63a3b82011-09-28 07:41:54 +02002876 (hash = ((PyASCIIObject *) key)->hash) == -1) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002877 hash = PyObject_Hash(key);
2878 if (hash == -1)
2879 return NULL;
2880 }
INADA Naoki778928b2017-08-03 23:45:15 +09002881 ix = (self->ma_keys->dk_lookup) (self, key, hash, &val);
Victor Stinner742da042016-09-07 17:40:12 -07002882 if (ix == DKIX_ERROR)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002883 return NULL;
INADA Naokiba609772016-12-07 20:41:42 +09002884 if (ix == DKIX_EMPTY || val == NULL) {
Serhiy Storchaka48088ee2017-01-19 19:00:30 +02002885 val = default_value;
INADA Naokiba609772016-12-07 20:41:42 +09002886 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002887 Py_INCREF(val);
2888 return val;
Barry Warsawc38c5da1997-10-06 17:49:20 +00002889}
2890
Benjamin Peterson00e98862013-03-07 22:16:29 -05002891PyObject *
2892PyDict_SetDefault(PyObject *d, PyObject *key, PyObject *defaultobj)
Guido van Rossum164452c2000-08-08 16:12:54 +00002893{
Benjamin Peterson00e98862013-03-07 22:16:29 -05002894 PyDictObject *mp = (PyDictObject *)d;
INADA Naoki93f26f72016-11-02 18:45:16 +09002895 PyObject *value;
Benjamin Peterson8f67d082010-10-17 20:54:53 +00002896 Py_hash_t hash;
Guido van Rossum164452c2000-08-08 16:12:54 +00002897
Benjamin Peterson00e98862013-03-07 22:16:29 -05002898 if (!PyDict_Check(d)) {
2899 PyErr_BadInternalCall();
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002900 return NULL;
Benjamin Peterson00e98862013-03-07 22:16:29 -05002901 }
INADA Naoki93f26f72016-11-02 18:45:16 +09002902
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002903 if (!PyUnicode_CheckExact(key) ||
Martin v. Löwisd63a3b82011-09-28 07:41:54 +02002904 (hash = ((PyASCIIObject *) key)->hash) == -1) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002905 hash = PyObject_Hash(key);
2906 if (hash == -1)
2907 return NULL;
2908 }
Inada Naoki2ddc7f62019-03-18 20:38:33 +09002909 if (mp->ma_keys == Py_EMPTY_KEYS) {
2910 if (insert_to_emptydict(mp, key, hash, defaultobj) < 0) {
2911 return NULL;
2912 }
2913 return defaultobj;
2914 }
INADA Naoki93f26f72016-11-02 18:45:16 +09002915
2916 if (mp->ma_values != NULL && !PyUnicode_CheckExact(key)) {
2917 if (insertion_resize(mp) < 0)
2918 return NULL;
2919 }
2920
INADA Naoki778928b2017-08-03 23:45:15 +09002921 Py_ssize_t ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &value);
Victor Stinner742da042016-09-07 17:40:12 -07002922 if (ix == DKIX_ERROR)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002923 return NULL;
INADA Naoki93f26f72016-11-02 18:45:16 +09002924
2925 if (_PyDict_HasSplitTable(mp) &&
INADA Naokiba609772016-12-07 20:41:42 +09002926 ((ix >= 0 && value == NULL && mp->ma_used != ix) ||
INADA Naoki93f26f72016-11-02 18:45:16 +09002927 (ix == DKIX_EMPTY && mp->ma_used != mp->ma_keys->dk_nentries))) {
2928 if (insertion_resize(mp) < 0) {
2929 return NULL;
2930 }
INADA Naoki93f26f72016-11-02 18:45:16 +09002931 ix = DKIX_EMPTY;
2932 }
2933
2934 if (ix == DKIX_EMPTY) {
2935 PyDictKeyEntry *ep, *ep0;
2936 value = defaultobj;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002937 if (mp->ma_keys->dk_usable <= 0) {
Victor Stinner3c336c52016-09-12 14:17:40 +02002938 if (insertion_resize(mp) < 0) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002939 return NULL;
Victor Stinner3c336c52016-09-12 14:17:40 +02002940 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002941 }
INADA Naoki778928b2017-08-03 23:45:15 +09002942 Py_ssize_t hashpos = find_empty_slot(mp->ma_keys, hash);
INADA Naoki93f26f72016-11-02 18:45:16 +09002943 ep0 = DK_ENTRIES(mp->ma_keys);
2944 ep = &ep0[mp->ma_keys->dk_nentries];
INADA Naokia7576492018-11-14 18:39:27 +09002945 dictkeys_set_index(mp->ma_keys, hashpos, mp->ma_keys->dk_nentries);
Benjamin Petersonb1efa532013-03-04 09:47:50 -05002946 Py_INCREF(key);
INADA Naoki93f26f72016-11-02 18:45:16 +09002947 Py_INCREF(value);
2948 MAINTAIN_TRACKING(mp, key, value);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002949 ep->me_key = key;
2950 ep->me_hash = hash;
INADA Naokiba609772016-12-07 20:41:42 +09002951 if (_PyDict_HasSplitTable(mp)) {
INADA Naoki93f26f72016-11-02 18:45:16 +09002952 assert(mp->ma_values[mp->ma_keys->dk_nentries] == NULL);
2953 mp->ma_values[mp->ma_keys->dk_nentries] = value;
Victor Stinner742da042016-09-07 17:40:12 -07002954 }
2955 else {
INADA Naoki93f26f72016-11-02 18:45:16 +09002956 ep->me_value = value;
Victor Stinner742da042016-09-07 17:40:12 -07002957 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -04002958 mp->ma_used++;
Victor Stinner3b6a6b42016-09-08 12:51:24 -07002959 mp->ma_version_tag = DICT_NEXT_VERSION();
INADA Naoki93f26f72016-11-02 18:45:16 +09002960 mp->ma_keys->dk_usable--;
2961 mp->ma_keys->dk_nentries++;
2962 assert(mp->ma_keys->dk_usable >= 0);
2963 }
INADA Naokiba609772016-12-07 20:41:42 +09002964 else if (value == NULL) {
INADA Naoki93f26f72016-11-02 18:45:16 +09002965 value = defaultobj;
2966 assert(_PyDict_HasSplitTable(mp));
2967 assert(ix == mp->ma_used);
2968 Py_INCREF(value);
2969 MAINTAIN_TRACKING(mp, key, value);
INADA Naokiba609772016-12-07 20:41:42 +09002970 mp->ma_values[ix] = value;
INADA Naoki93f26f72016-11-02 18:45:16 +09002971 mp->ma_used++;
2972 mp->ma_version_tag = DICT_NEXT_VERSION();
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00002973 }
INADA Naoki93f26f72016-11-02 18:45:16 +09002974
Victor Stinner0fc91ee2019-04-12 21:51:34 +02002975 ASSERT_CONSISTENT(mp);
INADA Naoki93f26f72016-11-02 18:45:16 +09002976 return value;
Guido van Rossum164452c2000-08-08 16:12:54 +00002977}
2978
Victor Stinner7dc6a5f2017-01-19 12:37:13 +01002979/*[clinic input]
2980dict.setdefault
2981
2982 key: object
Serhiy Storchaka48088ee2017-01-19 19:00:30 +02002983 default: object = None
Victor Stinner7dc6a5f2017-01-19 12:37:13 +01002984 /
2985
Serhiy Storchaka78d9e582017-01-25 00:30:04 +02002986Insert key with a value of default if key is not in the dictionary.
2987
2988Return the value for key if key is in the dictionary, else default.
Victor Stinner7dc6a5f2017-01-19 12:37:13 +01002989[clinic start generated code]*/
2990
Benjamin Peterson00e98862013-03-07 22:16:29 -05002991static PyObject *
Serhiy Storchaka48088ee2017-01-19 19:00:30 +02002992dict_setdefault_impl(PyDictObject *self, PyObject *key,
2993 PyObject *default_value)
Serhiy Storchaka78d9e582017-01-25 00:30:04 +02002994/*[clinic end generated code: output=f8c1101ebf69e220 input=0f063756e815fd9d]*/
Benjamin Peterson00e98862013-03-07 22:16:29 -05002995{
Victor Stinner7dc6a5f2017-01-19 12:37:13 +01002996 PyObject *val;
Benjamin Peterson00e98862013-03-07 22:16:29 -05002997
Serhiy Storchaka48088ee2017-01-19 19:00:30 +02002998 val = PyDict_SetDefault((PyObject *)self, key, default_value);
Benjamin Peterson00e98862013-03-07 22:16:29 -05002999 Py_XINCREF(val);
3000 return val;
3001}
Guido van Rossum164452c2000-08-08 16:12:54 +00003002
3003static PyObject *
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05303004dict_clear(PyDictObject *mp, PyObject *Py_UNUSED(ignored))
Guido van Rossumfb8f1ca1997-03-21 21:55:12 +00003005{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003006 PyDict_Clear((PyObject *)mp);
3007 Py_RETURN_NONE;
Guido van Rossumfb8f1ca1997-03-21 21:55:12 +00003008}
3009
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003010/*[clinic input]
3011dict.pop
3012
3013 key: object
3014 default: object = NULL
3015 /
3016
Serhiy Storchaka279f4462019-09-14 12:24:05 +03003017D.pop(k[,d]) -> v, remove specified key and return the corresponding value.
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003018
3019If key is not found, default is returned if given, otherwise KeyError is raised
3020[clinic start generated code]*/
3021
Guido van Rossumba6ab842000-12-12 22:02:18 +00003022static PyObject *
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003023dict_pop_impl(PyDictObject *self, PyObject *key, PyObject *default_value)
Serhiy Storchaka279f4462019-09-14 12:24:05 +03003024/*[clinic end generated code: output=3abb47b89f24c21c input=eeebec7812190348]*/
Guido van Rossume027d982002-04-12 15:11:59 +00003025{
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003026 return _PyDict_Pop((PyObject*)self, key, default_value);
Guido van Rossume027d982002-04-12 15:11:59 +00003027}
3028
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003029/*[clinic input]
3030dict.popitem
3031
3032Remove and return a (key, value) pair as a 2-tuple.
3033
3034Pairs are returned in LIFO (last-in, first-out) order.
3035Raises KeyError if the dict is empty.
3036[clinic start generated code]*/
3037
Guido van Rossume027d982002-04-12 15:11:59 +00003038static PyObject *
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003039dict_popitem_impl(PyDictObject *self)
3040/*[clinic end generated code: output=e65fcb04420d230d input=1c38a49f21f64941]*/
Guido van Rossumba6ab842000-12-12 22:02:18 +00003041{
Victor Stinner742da042016-09-07 17:40:12 -07003042 Py_ssize_t i, j;
3043 PyDictKeyEntry *ep0, *ep;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003044 PyObject *res;
Guido van Rossumba6ab842000-12-12 22:02:18 +00003045
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003046 /* Allocate the result tuple before checking the size. Believe it
3047 * or not, this allocation could trigger a garbage collection which
3048 * could empty the dict, so if we checked the size first and that
3049 * happened, the result would be an infinite loop (searching for an
3050 * entry that no longer exists). Note that the usual popitem()
3051 * idiom is "while d: k, v = d.popitem()". so needing to throw the
3052 * tuple away if the dict *is* empty isn't a significant
3053 * inefficiency -- possible, but unlikely in practice.
3054 */
3055 res = PyTuple_New(2);
3056 if (res == NULL)
3057 return NULL;
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003058 if (self->ma_used == 0) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003059 Py_DECREF(res);
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003060 PyErr_SetString(PyExc_KeyError, "popitem(): dictionary is empty");
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003061 return NULL;
3062 }
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003063 /* Convert split table to combined table */
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003064 if (self->ma_keys->dk_lookup == lookdict_split) {
3065 if (dictresize(self, DK_SIZE(self->ma_keys))) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003066 Py_DECREF(res);
3067 return NULL;
3068 }
3069 }
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003070 ENSURE_ALLOWS_DELETIONS(self);
Victor Stinner742da042016-09-07 17:40:12 -07003071
3072 /* Pop last item */
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003073 ep0 = DK_ENTRIES(self->ma_keys);
3074 i = self->ma_keys->dk_nentries - 1;
Victor Stinner742da042016-09-07 17:40:12 -07003075 while (i >= 0 && ep0[i].me_value == NULL) {
3076 i--;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003077 }
Victor Stinner742da042016-09-07 17:40:12 -07003078 assert(i >= 0);
3079
3080 ep = &ep0[i];
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003081 j = lookdict_index(self->ma_keys, ep->me_hash, i);
Victor Stinner742da042016-09-07 17:40:12 -07003082 assert(j >= 0);
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003083 assert(dictkeys_get_index(self->ma_keys, j) == i);
3084 dictkeys_set_index(self->ma_keys, j, DKIX_DUMMY);
Victor Stinner742da042016-09-07 17:40:12 -07003085
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003086 PyTuple_SET_ITEM(res, 0, ep->me_key);
3087 PyTuple_SET_ITEM(res, 1, ep->me_value);
Victor Stinner742da042016-09-07 17:40:12 -07003088 ep->me_key = NULL;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003089 ep->me_value = NULL;
Victor Stinner742da042016-09-07 17:40:12 -07003090 /* We can't dk_usable++ since there is DKIX_DUMMY in indices */
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003091 self->ma_keys->dk_nentries = i;
3092 self->ma_used--;
3093 self->ma_version_tag = DICT_NEXT_VERSION();
Victor Stinner0fc91ee2019-04-12 21:51:34 +02003094 ASSERT_CONSISTENT(self);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003095 return res;
Guido van Rossumba6ab842000-12-12 22:02:18 +00003096}
3097
Jeremy Hylton8caad492000-06-23 14:18:11 +00003098static int
3099dict_traverse(PyObject *op, visitproc visit, void *arg)
3100{
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003101 PyDictObject *mp = (PyDictObject *)op;
Benjamin Peterson55f44522016-09-05 12:12:59 -07003102 PyDictKeysObject *keys = mp->ma_keys;
Serhiy Storchaka46825d22016-09-26 21:29:34 +03003103 PyDictKeyEntry *entries = DK_ENTRIES(keys);
Victor Stinner742da042016-09-07 17:40:12 -07003104 Py_ssize_t i, n = keys->dk_nentries;
3105
Benjamin Peterson55f44522016-09-05 12:12:59 -07003106 if (keys->dk_lookup == lookdict) {
3107 for (i = 0; i < n; i++) {
3108 if (entries[i].me_value != NULL) {
3109 Py_VISIT(entries[i].me_value);
3110 Py_VISIT(entries[i].me_key);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003111 }
3112 }
Victor Stinner742da042016-09-07 17:40:12 -07003113 }
3114 else {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003115 if (mp->ma_values != NULL) {
Benjamin Peterson55f44522016-09-05 12:12:59 -07003116 for (i = 0; i < n; i++) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003117 Py_VISIT(mp->ma_values[i]);
3118 }
3119 }
3120 else {
Benjamin Peterson55f44522016-09-05 12:12:59 -07003121 for (i = 0; i < n; i++) {
3122 Py_VISIT(entries[i].me_value);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003123 }
3124 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003125 }
3126 return 0;
Jeremy Hylton8caad492000-06-23 14:18:11 +00003127}
3128
3129static int
3130dict_tp_clear(PyObject *op)
3131{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003132 PyDict_Clear(op);
3133 return 0;
Jeremy Hylton8caad492000-06-23 14:18:11 +00003134}
3135
Guido van Rossum8ce8a782007-11-01 19:42:39 +00003136static PyObject *dictiter_new(PyDictObject *, PyTypeObject *);
Guido van Rossum09e563a2001-05-01 12:10:21 +00003137
Serhiy Storchaka0ce7a3a2015-12-22 08:16:18 +02003138Py_ssize_t
Eric Snow96c6af92015-05-29 22:21:39 -06003139_PyDict_SizeOf(PyDictObject *mp)
Martin v. Löwis00709aa2008-06-04 14:18:43 +00003140{
Victor Stinner742da042016-09-07 17:40:12 -07003141 Py_ssize_t size, usable, res;
Martin v. Löwis00709aa2008-06-04 14:18:43 +00003142
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003143 size = DK_SIZE(mp->ma_keys);
Victor Stinner742da042016-09-07 17:40:12 -07003144 usable = USABLE_FRACTION(size);
3145
Serhiy Storchaka5c4064e2015-12-19 20:05:25 +02003146 res = _PyObject_SIZE(Py_TYPE(mp));
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003147 if (mp->ma_values)
Victor Stinner742da042016-09-07 17:40:12 -07003148 res += usable * sizeof(PyObject*);
Martin v. Loewis4f2f3b62012-04-24 19:13:57 +02003149 /* If the dictionary is split, the keys portion is accounted-for
3150 in the type object. */
3151 if (mp->ma_keys->dk_refcnt == 1)
Victor Stinner98ee9d52016-09-08 09:33:56 -07003152 res += (sizeof(PyDictKeysObject)
Victor Stinner98ee9d52016-09-08 09:33:56 -07003153 + DK_IXSIZE(mp->ma_keys) * size
3154 + sizeof(PyDictKeyEntry) * usable);
Serhiy Storchaka0ce7a3a2015-12-22 08:16:18 +02003155 return res;
Martin v. Loewis4f2f3b62012-04-24 19:13:57 +02003156}
3157
3158Py_ssize_t
3159_PyDict_KeysSize(PyDictKeysObject *keys)
3160{
Victor Stinner98ee9d52016-09-08 09:33:56 -07003161 return (sizeof(PyDictKeysObject)
Victor Stinner98ee9d52016-09-08 09:33:56 -07003162 + DK_IXSIZE(keys) * DK_SIZE(keys)
3163 + USABLE_FRACTION(DK_SIZE(keys)) * sizeof(PyDictKeyEntry));
Martin v. Löwis00709aa2008-06-04 14:18:43 +00003164}
3165
doko@ubuntu.com17210f52016-01-14 14:04:59 +01003166static PyObject *
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05303167dict_sizeof(PyDictObject *mp, PyObject *Py_UNUSED(ignored))
Serhiy Storchaka0ce7a3a2015-12-22 08:16:18 +02003168{
3169 return PyLong_FromSsize_t(_PyDict_SizeOf(mp));
3170}
3171
Raymond Hettinger8f5cdaa2003-12-13 11:26:12 +00003172PyDoc_STRVAR(getitem__doc__, "x.__getitem__(y) <==> x[y]");
3173
Martin v. Löwis00709aa2008-06-04 14:18:43 +00003174PyDoc_STRVAR(sizeof__doc__,
3175"D.__sizeof__() -> size of D in memory, in bytes");
3176
Martin v. Löwis14f8b4c2002-06-13 20:33:02 +00003177PyDoc_STRVAR(update__doc__,
Brett Cannonf2754162013-05-11 14:46:48 -04003178"D.update([E, ]**F) -> None. Update D from dict/iterable E and F.\n\
3179If E is present and has a .keys() method, then does: for k in E: D[k] = E[k]\n\
3180If E is present and lacks a .keys() method, then does: for k, v in E: D[k] = v\n\
3181In either case, this is followed by: for k in F: D[k] = F[k]");
Tim Petersf7f88b12000-12-13 23:18:45 +00003182
Martin v. Löwis14f8b4c2002-06-13 20:33:02 +00003183PyDoc_STRVAR(clear__doc__,
3184"D.clear() -> None. Remove all items from D.");
Tim Petersf7f88b12000-12-13 23:18:45 +00003185
Martin v. Löwis14f8b4c2002-06-13 20:33:02 +00003186PyDoc_STRVAR(copy__doc__,
3187"D.copy() -> a shallow copy of D");
Tim Petersf7f88b12000-12-13 23:18:45 +00003188
Guido van Rossumb90c8482007-02-10 01:11:45 +00003189/* Forward */
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05303190static PyObject *dictkeys_new(PyObject *, PyObject *);
3191static PyObject *dictitems_new(PyObject *, PyObject *);
3192static PyObject *dictvalues_new(PyObject *, PyObject *);
Guido van Rossumb90c8482007-02-10 01:11:45 +00003193
Guido van Rossum45c85d12007-07-27 16:31:40 +00003194PyDoc_STRVAR(keys__doc__,
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003195 "D.keys() -> a set-like object providing a view on D's keys");
Guido van Rossum45c85d12007-07-27 16:31:40 +00003196PyDoc_STRVAR(items__doc__,
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003197 "D.items() -> a set-like object providing a view on D's items");
Guido van Rossum45c85d12007-07-27 16:31:40 +00003198PyDoc_STRVAR(values__doc__,
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003199 "D.values() -> an object providing a view on D's values");
Guido van Rossumb90c8482007-02-10 01:11:45 +00003200
Guido van Rossumc0b618a1997-05-02 03:12:38 +00003201static PyMethodDef mapp_methods[] = {
Larry Hastings31826802013-10-19 00:09:25 -07003202 DICT___CONTAINS___METHODDEF
Serhiy Storchaka62be7422018-11-27 13:27:31 +02003203 {"__getitem__", (PyCFunction)(void(*)(void))dict_subscript, METH_O | METH_COEXIST,
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003204 getitem__doc__},
Serhiy Storchaka62be7422018-11-27 13:27:31 +02003205 {"__sizeof__", (PyCFunction)(void(*)(void))dict_sizeof, METH_NOARGS,
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003206 sizeof__doc__},
Victor Stinner7dc6a5f2017-01-19 12:37:13 +01003207 DICT_GET_METHODDEF
3208 DICT_SETDEFAULT_METHODDEF
Inada Naoki9e4f2f32019-04-12 16:11:28 +09003209 DICT_POP_METHODDEF
3210 DICT_POPITEM_METHODDEF
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05303211 {"keys", dictkeys_new, METH_NOARGS,
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003212 keys__doc__},
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05303213 {"items", dictitems_new, METH_NOARGS,
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003214 items__doc__},
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05303215 {"values", dictvalues_new, METH_NOARGS,
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003216 values__doc__},
Serhiy Storchaka62be7422018-11-27 13:27:31 +02003217 {"update", (PyCFunction)(void(*)(void))dict_update, METH_VARARGS | METH_KEYWORDS,
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003218 update__doc__},
Larry Hastings5c661892014-01-24 06:17:25 -08003219 DICT_FROMKEYS_METHODDEF
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003220 {"clear", (PyCFunction)dict_clear, METH_NOARGS,
3221 clear__doc__},
3222 {"copy", (PyCFunction)dict_copy, METH_NOARGS,
3223 copy__doc__},
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003224 DICT___REVERSED___METHODDEF
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003225 {NULL, NULL} /* sentinel */
Guido van Rossum4b1302b1993-03-27 18:11:32 +00003226};
3227
Thomas Wouters4d70c3d2006-06-08 14:42:34 +00003228/* Return 1 if `key` is in dict `op`, 0 if not, and -1 on error. */
Raymond Hettingerbc0f2ab2003-11-25 21:12:14 +00003229int
3230PyDict_Contains(PyObject *op, PyObject *key)
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00003231{
Benjamin Peterson8f67d082010-10-17 20:54:53 +00003232 Py_hash_t hash;
Victor Stinner742da042016-09-07 17:40:12 -07003233 Py_ssize_t ix;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003234 PyDictObject *mp = (PyDictObject *)op;
INADA Naokiba609772016-12-07 20:41:42 +09003235 PyObject *value;
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00003236
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003237 if (!PyUnicode_CheckExact(key) ||
Martin v. Löwisd63a3b82011-09-28 07:41:54 +02003238 (hash = ((PyASCIIObject *) key)->hash) == -1) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003239 hash = PyObject_Hash(key);
3240 if (hash == -1)
3241 return -1;
3242 }
INADA Naoki778928b2017-08-03 23:45:15 +09003243 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &value);
Victor Stinner742da042016-09-07 17:40:12 -07003244 if (ix == DKIX_ERROR)
3245 return -1;
INADA Naokiba609772016-12-07 20:41:42 +09003246 return (ix != DKIX_EMPTY && value != NULL);
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00003247}
3248
Thomas Wouterscf297e42007-02-23 15:07:44 +00003249/* Internal version of PyDict_Contains used when the hash value is already known */
3250int
Benjamin Peterson8f67d082010-10-17 20:54:53 +00003251_PyDict_Contains(PyObject *op, PyObject *key, Py_hash_t hash)
Thomas Wouterscf297e42007-02-23 15:07:44 +00003252{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003253 PyDictObject *mp = (PyDictObject *)op;
INADA Naokiba609772016-12-07 20:41:42 +09003254 PyObject *value;
Victor Stinner742da042016-09-07 17:40:12 -07003255 Py_ssize_t ix;
Thomas Wouterscf297e42007-02-23 15:07:44 +00003256
INADA Naoki778928b2017-08-03 23:45:15 +09003257 ix = (mp->ma_keys->dk_lookup)(mp, key, hash, &value);
Victor Stinner742da042016-09-07 17:40:12 -07003258 if (ix == DKIX_ERROR)
3259 return -1;
INADA Naokiba609772016-12-07 20:41:42 +09003260 return (ix != DKIX_EMPTY && value != NULL);
Thomas Wouterscf297e42007-02-23 15:07:44 +00003261}
3262
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00003263/* Hack to implement "key in dict" */
3264static PySequenceMethods dict_as_sequence = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003265 0, /* sq_length */
3266 0, /* sq_concat */
3267 0, /* sq_repeat */
3268 0, /* sq_item */
3269 0, /* sq_slice */
3270 0, /* sq_ass_item */
3271 0, /* sq_ass_slice */
3272 PyDict_Contains, /* sq_contains */
3273 0, /* sq_inplace_concat */
3274 0, /* sq_inplace_repeat */
Guido van Rossum0dbb4fb2001-04-20 16:50:40 +00003275};
3276
Guido van Rossum09e563a2001-05-01 12:10:21 +00003277static PyObject *
Tim Peters6d6c1a32001-08-02 04:15:00 +00003278dict_new(PyTypeObject *type, PyObject *args, PyObject *kwds)
3279{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003280 PyObject *self;
Victor Stinnera9f61a52013-07-16 22:17:26 +02003281 PyDictObject *d;
Tim Peters6d6c1a32001-08-02 04:15:00 +00003282
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003283 assert(type != NULL && type->tp_alloc != NULL);
3284 self = type->tp_alloc(type, 0);
Victor Stinnera9f61a52013-07-16 22:17:26 +02003285 if (self == NULL)
3286 return NULL;
Victor Stinnera9f61a52013-07-16 22:17:26 +02003287 d = (PyDictObject *)self;
Victor Stinnerac2a4fe2013-07-16 22:19:00 +02003288
Victor Stinnera9f61a52013-07-16 22:17:26 +02003289 /* The object has been implicitly tracked by tp_alloc */
3290 if (type == &PyDict_Type)
3291 _PyObject_GC_UNTRACK(d);
Victor Stinnerac2a4fe2013-07-16 22:19:00 +02003292
3293 d->ma_used = 0;
Victor Stinner3b6a6b42016-09-08 12:51:24 -07003294 d->ma_version_tag = DICT_NEXT_VERSION();
Victor Stinner742da042016-09-07 17:40:12 -07003295 d->ma_keys = new_keys_object(PyDict_MINSIZE);
Victor Stinnerac2a4fe2013-07-16 22:19:00 +02003296 if (d->ma_keys == NULL) {
3297 Py_DECREF(self);
3298 return NULL;
3299 }
Victor Stinner0fc91ee2019-04-12 21:51:34 +02003300 ASSERT_CONSISTENT(d);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003301 return self;
Tim Peters6d6c1a32001-08-02 04:15:00 +00003302}
3303
Tim Peters25786c02001-09-02 08:22:48 +00003304static int
3305dict_init(PyObject *self, PyObject *args, PyObject *kwds)
3306{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003307 return dict_update_common(self, args, kwds, "dict");
Tim Peters25786c02001-09-02 08:22:48 +00003308}
3309
Tim Peters6d6c1a32001-08-02 04:15:00 +00003310static PyObject *
Guido van Rossum8ce8a782007-11-01 19:42:39 +00003311dict_iter(PyDictObject *dict)
Guido van Rossum09e563a2001-05-01 12:10:21 +00003312{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003313 return dictiter_new(dict, &PyDictIterKey_Type);
Guido van Rossum09e563a2001-05-01 12:10:21 +00003314}
Guido van Rossum59d1d2b2001-04-20 19:13:02 +00003315
Martin v. Löwis14f8b4c2002-06-13 20:33:02 +00003316PyDoc_STRVAR(dictionary_doc,
Ezio Melotti7f807b72010-03-01 04:08:34 +00003317"dict() -> new empty dictionary\n"
Tim Petersa427a2b2001-10-29 22:25:45 +00003318"dict(mapping) -> new dictionary initialized from a mapping object's\n"
Ezio Melotti7f807b72010-03-01 04:08:34 +00003319" (key, value) pairs\n"
3320"dict(iterable) -> new dictionary initialized as if via:\n"
Tim Peters4d859532001-10-27 18:27:48 +00003321" d = {}\n"
Ezio Melotti7f807b72010-03-01 04:08:34 +00003322" for k, v in iterable:\n"
Just van Rossuma797d812002-11-23 09:45:04 +00003323" d[k] = v\n"
3324"dict(**kwargs) -> new dictionary initialized with the name=value pairs\n"
3325" in the keyword argument list. For example: dict(one=1, two=2)");
Tim Peters25786c02001-09-02 08:22:48 +00003326
Guido van Rossumc0b618a1997-05-02 03:12:38 +00003327PyTypeObject PyDict_Type = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003328 PyVarObject_HEAD_INIT(&PyType_Type, 0)
3329 "dict",
3330 sizeof(PyDictObject),
3331 0,
3332 (destructor)dict_dealloc, /* tp_dealloc */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02003333 0, /* tp_vectorcall_offset */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003334 0, /* tp_getattr */
3335 0, /* tp_setattr */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02003336 0, /* tp_as_async */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003337 (reprfunc)dict_repr, /* tp_repr */
3338 0, /* tp_as_number */
3339 &dict_as_sequence, /* tp_as_sequence */
3340 &dict_as_mapping, /* tp_as_mapping */
Georg Brandl00da4e02010-10-18 07:32:48 +00003341 PyObject_HashNotImplemented, /* tp_hash */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003342 0, /* tp_call */
3343 0, /* tp_str */
3344 PyObject_GenericGetAttr, /* tp_getattro */
3345 0, /* tp_setattro */
3346 0, /* tp_as_buffer */
3347 Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_GC |
3348 Py_TPFLAGS_BASETYPE | Py_TPFLAGS_DICT_SUBCLASS, /* tp_flags */
3349 dictionary_doc, /* tp_doc */
3350 dict_traverse, /* tp_traverse */
3351 dict_tp_clear, /* tp_clear */
3352 dict_richcompare, /* tp_richcompare */
3353 0, /* tp_weaklistoffset */
3354 (getiterfunc)dict_iter, /* tp_iter */
3355 0, /* tp_iternext */
3356 mapp_methods, /* tp_methods */
3357 0, /* tp_members */
3358 0, /* tp_getset */
3359 0, /* tp_base */
3360 0, /* tp_dict */
3361 0, /* tp_descr_get */
3362 0, /* tp_descr_set */
3363 0, /* tp_dictoffset */
3364 dict_init, /* tp_init */
3365 PyType_GenericAlloc, /* tp_alloc */
3366 dict_new, /* tp_new */
3367 PyObject_GC_Del, /* tp_free */
Guido van Rossum4b1302b1993-03-27 18:11:32 +00003368};
3369
Victor Stinner3c1e4812012-03-26 22:10:51 +02003370PyObject *
3371_PyDict_GetItemId(PyObject *dp, struct _Py_Identifier *key)
3372{
3373 PyObject *kv;
3374 kv = _PyUnicode_FromId(key); /* borrowed */
Victor Stinner5b3b1002013-07-22 23:50:57 +02003375 if (kv == NULL) {
3376 PyErr_Clear();
Victor Stinner3c1e4812012-03-26 22:10:51 +02003377 return NULL;
Victor Stinner5b3b1002013-07-22 23:50:57 +02003378 }
Victor Stinner3c1e4812012-03-26 22:10:51 +02003379 return PyDict_GetItem(dp, kv);
3380}
3381
Guido van Rossum3cca2451997-05-16 14:23:33 +00003382/* For backward compatibility with old dictionary interface */
3383
Guido van Rossumc0b618a1997-05-02 03:12:38 +00003384PyObject *
Martin v. Löwis32b4a1b2002-12-11 13:21:12 +00003385PyDict_GetItemString(PyObject *v, const char *key)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00003386{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003387 PyObject *kv, *rv;
3388 kv = PyUnicode_FromString(key);
Victor Stinnerfdcbab92013-07-16 22:16:05 +02003389 if (kv == NULL) {
3390 PyErr_Clear();
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003391 return NULL;
Victor Stinnerfdcbab92013-07-16 22:16:05 +02003392 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003393 rv = PyDict_GetItem(v, kv);
3394 Py_DECREF(kv);
3395 return rv;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00003396}
3397
3398int
Victor Stinner3c1e4812012-03-26 22:10:51 +02003399_PyDict_SetItemId(PyObject *v, struct _Py_Identifier *key, PyObject *item)
3400{
3401 PyObject *kv;
3402 kv = _PyUnicode_FromId(key); /* borrowed */
3403 if (kv == NULL)
3404 return -1;
3405 return PyDict_SetItem(v, kv, item);
3406}
3407
3408int
Martin v. Löwis32b4a1b2002-12-11 13:21:12 +00003409PyDict_SetItemString(PyObject *v, const char *key, PyObject *item)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00003410{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003411 PyObject *kv;
3412 int err;
3413 kv = PyUnicode_FromString(key);
3414 if (kv == NULL)
3415 return -1;
3416 PyUnicode_InternInPlace(&kv); /* XXX Should we really? */
3417 err = PyDict_SetItem(v, kv, item);
3418 Py_DECREF(kv);
3419 return err;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00003420}
3421
3422int
Victor Stinner5fd2e5a2013-11-06 18:58:22 +01003423_PyDict_DelItemId(PyObject *v, _Py_Identifier *key)
3424{
3425 PyObject *kv = _PyUnicode_FromId(key); /* borrowed */
3426 if (kv == NULL)
3427 return -1;
3428 return PyDict_DelItem(v, kv);
3429}
3430
3431int
Martin v. Löwis32b4a1b2002-12-11 13:21:12 +00003432PyDict_DelItemString(PyObject *v, const char *key)
Guido van Rossum4b1302b1993-03-27 18:11:32 +00003433{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003434 PyObject *kv;
3435 int err;
3436 kv = PyUnicode_FromString(key);
3437 if (kv == NULL)
3438 return -1;
3439 err = PyDict_DelItem(v, kv);
3440 Py_DECREF(kv);
3441 return err;
Guido van Rossum4b1302b1993-03-27 18:11:32 +00003442}
Guido van Rossum59d1d2b2001-04-20 19:13:02 +00003443
Raymond Hettinger019a1482004-03-18 02:41:19 +00003444/* Dictionary iterator types */
Guido van Rossum59d1d2b2001-04-20 19:13:02 +00003445
3446typedef struct {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003447 PyObject_HEAD
3448 PyDictObject *di_dict; /* Set to NULL when iterator is exhausted */
3449 Py_ssize_t di_used;
3450 Py_ssize_t di_pos;
3451 PyObject* di_result; /* reusable result tuple for iteritems */
3452 Py_ssize_t len;
Guido van Rossum59d1d2b2001-04-20 19:13:02 +00003453} dictiterobject;
3454
3455static PyObject *
Guido van Rossum8ce8a782007-11-01 19:42:39 +00003456dictiter_new(PyDictObject *dict, PyTypeObject *itertype)
Guido van Rossum59d1d2b2001-04-20 19:13:02 +00003457{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003458 dictiterobject *di;
3459 di = PyObject_GC_New(dictiterobject, itertype);
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003460 if (di == NULL) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003461 return NULL;
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003462 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003463 Py_INCREF(dict);
3464 di->di_dict = dict;
3465 di->di_used = dict->ma_used;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003466 di->len = dict->ma_used;
Dong-hee Na24dc2f82019-10-20 05:01:08 +09003467 if (itertype == &PyDictRevIterKey_Type ||
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003468 itertype == &PyDictRevIterItem_Type ||
Dong-hee Na24dc2f82019-10-20 05:01:08 +09003469 itertype == &PyDictRevIterValue_Type) {
3470 if (dict->ma_values) {
3471 di->di_pos = dict->ma_used - 1;
3472 }
3473 else {
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003474 di->di_pos = dict->ma_keys->dk_nentries - 1;
Dong-hee Na24dc2f82019-10-20 05:01:08 +09003475 }
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003476 }
3477 else {
3478 di->di_pos = 0;
3479 }
3480 if (itertype == &PyDictIterItem_Type ||
3481 itertype == &PyDictRevIterItem_Type) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003482 di->di_result = PyTuple_Pack(2, Py_None, Py_None);
3483 if (di->di_result == NULL) {
3484 Py_DECREF(di);
3485 return NULL;
3486 }
3487 }
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003488 else {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003489 di->di_result = NULL;
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003490 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003491 _PyObject_GC_TRACK(di);
3492 return (PyObject *)di;
Guido van Rossum59d1d2b2001-04-20 19:13:02 +00003493}
3494
3495static void
3496dictiter_dealloc(dictiterobject *di)
3497{
INADA Naokia6296d32017-08-24 14:55:17 +09003498 /* bpo-31095: UnTrack is needed before calling any callbacks */
3499 _PyObject_GC_UNTRACK(di);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003500 Py_XDECREF(di->di_dict);
3501 Py_XDECREF(di->di_result);
3502 PyObject_GC_Del(di);
Antoine Pitrou7ddda782009-01-01 15:35:33 +00003503}
3504
3505static int
3506dictiter_traverse(dictiterobject *di, visitproc visit, void *arg)
3507{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003508 Py_VISIT(di->di_dict);
3509 Py_VISIT(di->di_result);
3510 return 0;
Guido van Rossum59d1d2b2001-04-20 19:13:02 +00003511}
3512
Raymond Hettinger6b27cda2005-09-24 21:23:05 +00003513static PyObject *
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05303514dictiter_len(dictiterobject *di, PyObject *Py_UNUSED(ignored))
Raymond Hettinger0ce6dc82004-03-18 08:38:00 +00003515{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003516 Py_ssize_t len = 0;
3517 if (di->di_dict != NULL && di->di_used == di->di_dict->ma_used)
3518 len = di->len;
3519 return PyLong_FromSize_t(len);
Raymond Hettinger0ce6dc82004-03-18 08:38:00 +00003520}
3521
Guido van Rossumb90c8482007-02-10 01:11:45 +00003522PyDoc_STRVAR(length_hint_doc,
3523 "Private method returning an estimate of len(list(it)).");
Raymond Hettinger6b27cda2005-09-24 21:23:05 +00003524
Kristján Valur Jónsson31668b82012-04-03 10:49:41 +00003525static PyObject *
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05303526dictiter_reduce(dictiterobject *di, PyObject *Py_UNUSED(ignored));
Kristján Valur Jónsson31668b82012-04-03 10:49:41 +00003527
3528PyDoc_STRVAR(reduce_doc, "Return state information for pickling.");
3529
Raymond Hettinger6b27cda2005-09-24 21:23:05 +00003530static PyMethodDef dictiter_methods[] = {
Serhiy Storchaka62be7422018-11-27 13:27:31 +02003531 {"__length_hint__", (PyCFunction)(void(*)(void))dictiter_len, METH_NOARGS,
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003532 length_hint_doc},
Serhiy Storchaka62be7422018-11-27 13:27:31 +02003533 {"__reduce__", (PyCFunction)(void(*)(void))dictiter_reduce, METH_NOARGS,
Kristján Valur Jónsson31668b82012-04-03 10:49:41 +00003534 reduce_doc},
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003535 {NULL, NULL} /* sentinel */
Raymond Hettinger0ce6dc82004-03-18 08:38:00 +00003536};
3537
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003538static PyObject*
3539dictiter_iternextkey(dictiterobject *di)
Guido van Rossum213c7a62001-04-23 14:08:49 +00003540{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003541 PyObject *key;
INADA Naokica2d8be2016-11-04 16:59:10 +09003542 Py_ssize_t i;
Antoine Pitrou9ed5f272013-08-13 20:18:52 +02003543 PyDictKeysObject *k;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003544 PyDictObject *d = di->di_dict;
Guido van Rossum213c7a62001-04-23 14:08:49 +00003545
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003546 if (d == NULL)
3547 return NULL;
3548 assert (PyDict_Check(d));
Guido van Rossum2147df72002-07-16 20:30:22 +00003549
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003550 if (di->di_used != d->ma_used) {
3551 PyErr_SetString(PyExc_RuntimeError,
3552 "dictionary changed size during iteration");
3553 di->di_used = -1; /* Make this state sticky */
3554 return NULL;
3555 }
Guido van Rossum2147df72002-07-16 20:30:22 +00003556
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003557 i = di->di_pos;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003558 k = d->ma_keys;
INADA Naokica2d8be2016-11-04 16:59:10 +09003559 assert(i >= 0);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003560 if (d->ma_values) {
INADA Naokica2d8be2016-11-04 16:59:10 +09003561 if (i >= d->ma_used)
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003562 goto fail;
3563 key = DK_ENTRIES(k)[i].me_key;
INADA Naokica2d8be2016-11-04 16:59:10 +09003564 assert(d->ma_values[i] != NULL);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003565 }
3566 else {
INADA Naokica2d8be2016-11-04 16:59:10 +09003567 Py_ssize_t n = k->dk_nentries;
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003568 PyDictKeyEntry *entry_ptr = &DK_ENTRIES(k)[i];
3569 while (i < n && entry_ptr->me_value == NULL) {
3570 entry_ptr++;
3571 i++;
3572 }
3573 if (i >= n)
3574 goto fail;
3575 key = entry_ptr->me_key;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003576 }
Thomas Perl796cc6e2019-03-28 07:03:25 +01003577 // We found an element (key), but did not expect it
3578 if (di->len == 0) {
3579 PyErr_SetString(PyExc_RuntimeError,
3580 "dictionary keys changed during iteration");
3581 goto fail;
3582 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003583 di->di_pos = i+1;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003584 di->len--;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003585 Py_INCREF(key);
3586 return key;
Raymond Hettinger019a1482004-03-18 02:41:19 +00003587
3588fail:
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003589 di->di_dict = NULL;
Serhiy Storchakafbb1c5e2016-03-30 20:40:02 +03003590 Py_DECREF(d);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003591 return NULL;
Guido van Rossum59d1d2b2001-04-20 19:13:02 +00003592}
3593
Raymond Hettinger019a1482004-03-18 02:41:19 +00003594PyTypeObject PyDictIterKey_Type = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003595 PyVarObject_HEAD_INIT(&PyType_Type, 0)
3596 "dict_keyiterator", /* tp_name */
3597 sizeof(dictiterobject), /* tp_basicsize */
3598 0, /* tp_itemsize */
3599 /* methods */
3600 (destructor)dictiter_dealloc, /* tp_dealloc */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02003601 0, /* tp_vectorcall_offset */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003602 0, /* tp_getattr */
3603 0, /* tp_setattr */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02003604 0, /* tp_as_async */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003605 0, /* tp_repr */
3606 0, /* tp_as_number */
3607 0, /* tp_as_sequence */
3608 0, /* tp_as_mapping */
3609 0, /* tp_hash */
3610 0, /* tp_call */
3611 0, /* tp_str */
3612 PyObject_GenericGetAttr, /* tp_getattro */
3613 0, /* tp_setattro */
3614 0, /* tp_as_buffer */
3615 Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_GC,/* tp_flags */
3616 0, /* tp_doc */
3617 (traverseproc)dictiter_traverse, /* tp_traverse */
3618 0, /* tp_clear */
3619 0, /* tp_richcompare */
3620 0, /* tp_weaklistoffset */
3621 PyObject_SelfIter, /* tp_iter */
3622 (iternextfunc)dictiter_iternextkey, /* tp_iternext */
3623 dictiter_methods, /* tp_methods */
3624 0,
Raymond Hettinger019a1482004-03-18 02:41:19 +00003625};
3626
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003627static PyObject *
3628dictiter_iternextvalue(dictiterobject *di)
Raymond Hettinger019a1482004-03-18 02:41:19 +00003629{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003630 PyObject *value;
INADA Naokica2d8be2016-11-04 16:59:10 +09003631 Py_ssize_t i;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003632 PyDictObject *d = di->di_dict;
Raymond Hettinger019a1482004-03-18 02:41:19 +00003633
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003634 if (d == NULL)
3635 return NULL;
3636 assert (PyDict_Check(d));
Raymond Hettinger019a1482004-03-18 02:41:19 +00003637
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003638 if (di->di_used != d->ma_used) {
3639 PyErr_SetString(PyExc_RuntimeError,
3640 "dictionary changed size during iteration");
3641 di->di_used = -1; /* Make this state sticky */
3642 return NULL;
3643 }
Raymond Hettinger019a1482004-03-18 02:41:19 +00003644
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003645 i = di->di_pos;
INADA Naokica2d8be2016-11-04 16:59:10 +09003646 assert(i >= 0);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003647 if (d->ma_values) {
INADA Naokica2d8be2016-11-04 16:59:10 +09003648 if (i >= d->ma_used)
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003649 goto fail;
INADA Naokica2d8be2016-11-04 16:59:10 +09003650 value = d->ma_values[i];
3651 assert(value != NULL);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003652 }
3653 else {
INADA Naokica2d8be2016-11-04 16:59:10 +09003654 Py_ssize_t n = d->ma_keys->dk_nentries;
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003655 PyDictKeyEntry *entry_ptr = &DK_ENTRIES(d->ma_keys)[i];
3656 while (i < n && entry_ptr->me_value == NULL) {
3657 entry_ptr++;
3658 i++;
3659 }
3660 if (i >= n)
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003661 goto fail;
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003662 value = entry_ptr->me_value;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003663 }
Thomas Perlb8311cf2019-04-02 11:30:10 +02003664 // We found an element, but did not expect it
3665 if (di->len == 0) {
3666 PyErr_SetString(PyExc_RuntimeError,
3667 "dictionary keys changed during iteration");
3668 goto fail;
3669 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003670 di->di_pos = i+1;
3671 di->len--;
3672 Py_INCREF(value);
3673 return value;
Raymond Hettinger019a1482004-03-18 02:41:19 +00003674
3675fail:
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003676 di->di_dict = NULL;
Serhiy Storchakafbb1c5e2016-03-30 20:40:02 +03003677 Py_DECREF(d);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003678 return NULL;
Raymond Hettinger019a1482004-03-18 02:41:19 +00003679}
3680
3681PyTypeObject PyDictIterValue_Type = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003682 PyVarObject_HEAD_INIT(&PyType_Type, 0)
3683 "dict_valueiterator", /* tp_name */
3684 sizeof(dictiterobject), /* tp_basicsize */
3685 0, /* tp_itemsize */
3686 /* methods */
3687 (destructor)dictiter_dealloc, /* tp_dealloc */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02003688 0, /* tp_vectorcall_offset */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003689 0, /* tp_getattr */
3690 0, /* tp_setattr */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02003691 0, /* tp_as_async */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003692 0, /* tp_repr */
3693 0, /* tp_as_number */
3694 0, /* tp_as_sequence */
3695 0, /* tp_as_mapping */
3696 0, /* tp_hash */
3697 0, /* tp_call */
3698 0, /* tp_str */
3699 PyObject_GenericGetAttr, /* tp_getattro */
3700 0, /* tp_setattro */
3701 0, /* tp_as_buffer */
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003702 Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_GC, /* tp_flags */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003703 0, /* tp_doc */
3704 (traverseproc)dictiter_traverse, /* tp_traverse */
3705 0, /* tp_clear */
3706 0, /* tp_richcompare */
3707 0, /* tp_weaklistoffset */
3708 PyObject_SelfIter, /* tp_iter */
3709 (iternextfunc)dictiter_iternextvalue, /* tp_iternext */
3710 dictiter_methods, /* tp_methods */
3711 0,
Raymond Hettinger019a1482004-03-18 02:41:19 +00003712};
3713
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003714static PyObject *
3715dictiter_iternextitem(dictiterobject *di)
Raymond Hettinger019a1482004-03-18 02:41:19 +00003716{
Serhiy Storchaka753bca32017-05-20 12:30:02 +03003717 PyObject *key, *value, *result;
INADA Naokica2d8be2016-11-04 16:59:10 +09003718 Py_ssize_t i;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003719 PyDictObject *d = di->di_dict;
Raymond Hettinger019a1482004-03-18 02:41:19 +00003720
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003721 if (d == NULL)
3722 return NULL;
3723 assert (PyDict_Check(d));
Raymond Hettinger019a1482004-03-18 02:41:19 +00003724
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003725 if (di->di_used != d->ma_used) {
3726 PyErr_SetString(PyExc_RuntimeError,
3727 "dictionary changed size during iteration");
3728 di->di_used = -1; /* Make this state sticky */
3729 return NULL;
3730 }
Raymond Hettinger019a1482004-03-18 02:41:19 +00003731
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003732 i = di->di_pos;
INADA Naokica2d8be2016-11-04 16:59:10 +09003733 assert(i >= 0);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003734 if (d->ma_values) {
INADA Naokica2d8be2016-11-04 16:59:10 +09003735 if (i >= d->ma_used)
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003736 goto fail;
3737 key = DK_ENTRIES(d->ma_keys)[i].me_key;
INADA Naokica2d8be2016-11-04 16:59:10 +09003738 value = d->ma_values[i];
3739 assert(value != NULL);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003740 }
3741 else {
INADA Naokica2d8be2016-11-04 16:59:10 +09003742 Py_ssize_t n = d->ma_keys->dk_nentries;
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003743 PyDictKeyEntry *entry_ptr = &DK_ENTRIES(d->ma_keys)[i];
3744 while (i < n && entry_ptr->me_value == NULL) {
3745 entry_ptr++;
3746 i++;
3747 }
3748 if (i >= n)
3749 goto fail;
3750 key = entry_ptr->me_key;
3751 value = entry_ptr->me_value;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003752 }
Thomas Perlb8311cf2019-04-02 11:30:10 +02003753 // We found an element, but did not expect it
3754 if (di->len == 0) {
3755 PyErr_SetString(PyExc_RuntimeError,
3756 "dictionary keys changed during iteration");
3757 goto fail;
3758 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003759 di->di_pos = i+1;
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003760 di->len--;
Serhiy Storchaka753bca32017-05-20 12:30:02 +03003761 Py_INCREF(key);
3762 Py_INCREF(value);
3763 result = di->di_result;
3764 if (Py_REFCNT(result) == 1) {
3765 PyObject *oldkey = PyTuple_GET_ITEM(result, 0);
3766 PyObject *oldvalue = PyTuple_GET_ITEM(result, 1);
3767 PyTuple_SET_ITEM(result, 0, key); /* steals reference */
3768 PyTuple_SET_ITEM(result, 1, value); /* steals reference */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003769 Py_INCREF(result);
Serhiy Storchaka753bca32017-05-20 12:30:02 +03003770 Py_DECREF(oldkey);
3771 Py_DECREF(oldvalue);
Serhiy Storchaka49f5cdd2016-10-09 23:08:05 +03003772 }
3773 else {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003774 result = PyTuple_New(2);
3775 if (result == NULL)
3776 return NULL;
Serhiy Storchaka753bca32017-05-20 12:30:02 +03003777 PyTuple_SET_ITEM(result, 0, key); /* steals reference */
3778 PyTuple_SET_ITEM(result, 1, value); /* steals reference */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003779 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003780 return result;
Raymond Hettinger019a1482004-03-18 02:41:19 +00003781
3782fail:
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003783 di->di_dict = NULL;
Serhiy Storchakafbb1c5e2016-03-30 20:40:02 +03003784 Py_DECREF(d);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003785 return NULL;
Raymond Hettinger019a1482004-03-18 02:41:19 +00003786}
3787
3788PyTypeObject PyDictIterItem_Type = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003789 PyVarObject_HEAD_INIT(&PyType_Type, 0)
3790 "dict_itemiterator", /* tp_name */
3791 sizeof(dictiterobject), /* tp_basicsize */
3792 0, /* tp_itemsize */
3793 /* methods */
3794 (destructor)dictiter_dealloc, /* tp_dealloc */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02003795 0, /* tp_vectorcall_offset */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003796 0, /* tp_getattr */
3797 0, /* tp_setattr */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02003798 0, /* tp_as_async */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003799 0, /* tp_repr */
3800 0, /* tp_as_number */
3801 0, /* tp_as_sequence */
3802 0, /* tp_as_mapping */
3803 0, /* tp_hash */
3804 0, /* tp_call */
3805 0, /* tp_str */
3806 PyObject_GenericGetAttr, /* tp_getattro */
3807 0, /* tp_setattro */
3808 0, /* tp_as_buffer */
3809 Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_GC,/* tp_flags */
3810 0, /* tp_doc */
3811 (traverseproc)dictiter_traverse, /* tp_traverse */
3812 0, /* tp_clear */
3813 0, /* tp_richcompare */
3814 0, /* tp_weaklistoffset */
3815 PyObject_SelfIter, /* tp_iter */
3816 (iternextfunc)dictiter_iternextitem, /* tp_iternext */
3817 dictiter_methods, /* tp_methods */
3818 0,
Guido van Rossum59d1d2b2001-04-20 19:13:02 +00003819};
Guido van Rossumb90c8482007-02-10 01:11:45 +00003820
3821
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003822/* dictreviter */
3823
3824static PyObject *
3825dictreviter_iternext(dictiterobject *di)
3826{
3827 PyDictObject *d = di->di_dict;
3828
3829 if (d == NULL) {
3830 return NULL;
3831 }
3832 assert (PyDict_Check(d));
3833
3834 if (di->di_used != d->ma_used) {
3835 PyErr_SetString(PyExc_RuntimeError,
3836 "dictionary changed size during iteration");
3837 di->di_used = -1; /* Make this state sticky */
3838 return NULL;
3839 }
3840
3841 Py_ssize_t i = di->di_pos;
3842 PyDictKeysObject *k = d->ma_keys;
3843 PyObject *key, *value, *result;
3844
Serhiy Storchaka2e3d8732019-10-23 14:48:08 +03003845 if (i < 0) {
3846 goto fail;
3847 }
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003848 if (d->ma_values) {
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003849 key = DK_ENTRIES(k)[i].me_key;
3850 value = d->ma_values[i];
3851 assert (value != NULL);
3852 }
3853 else {
3854 PyDictKeyEntry *entry_ptr = &DK_ENTRIES(k)[i];
Serhiy Storchaka2e3d8732019-10-23 14:48:08 +03003855 while (entry_ptr->me_value == NULL) {
3856 if (--i < 0) {
3857 goto fail;
3858 }
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003859 entry_ptr--;
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003860 }
3861 key = entry_ptr->me_key;
3862 value = entry_ptr->me_value;
3863 }
3864 di->di_pos = i-1;
3865 di->len--;
3866
Dong-hee Na1b55b652020-02-17 19:09:15 +09003867 if (Py_IS_TYPE(di, &PyDictRevIterKey_Type)) {
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003868 Py_INCREF(key);
3869 return key;
3870 }
Dong-hee Na1b55b652020-02-17 19:09:15 +09003871 else if (Py_IS_TYPE(di, &PyDictRevIterValue_Type)) {
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003872 Py_INCREF(value);
3873 return value;
3874 }
Dong-hee Na1b55b652020-02-17 19:09:15 +09003875 else if (Py_IS_TYPE(di, &PyDictRevIterItem_Type)) {
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003876 Py_INCREF(key);
3877 Py_INCREF(value);
3878 result = di->di_result;
3879 if (Py_REFCNT(result) == 1) {
3880 PyObject *oldkey = PyTuple_GET_ITEM(result, 0);
3881 PyObject *oldvalue = PyTuple_GET_ITEM(result, 1);
3882 PyTuple_SET_ITEM(result, 0, key); /* steals reference */
3883 PyTuple_SET_ITEM(result, 1, value); /* steals reference */
3884 Py_INCREF(result);
3885 Py_DECREF(oldkey);
3886 Py_DECREF(oldvalue);
3887 }
3888 else {
3889 result = PyTuple_New(2);
3890 if (result == NULL) {
3891 return NULL;
3892 }
3893 PyTuple_SET_ITEM(result, 0, key); /* steals reference */
3894 PyTuple_SET_ITEM(result, 1, value); /* steals reference */
3895 }
3896 return result;
3897 }
3898 else {
3899 Py_UNREACHABLE();
3900 }
3901
3902fail:
3903 di->di_dict = NULL;
3904 Py_DECREF(d);
3905 return NULL;
3906}
3907
3908PyTypeObject PyDictRevIterKey_Type = {
3909 PyVarObject_HEAD_INIT(&PyType_Type, 0)
3910 "dict_reversekeyiterator",
3911 sizeof(dictiterobject),
3912 .tp_dealloc = (destructor)dictiter_dealloc,
3913 .tp_flags = Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_GC,
3914 .tp_traverse = (traverseproc)dictiter_traverse,
3915 .tp_iter = PyObject_SelfIter,
3916 .tp_iternext = (iternextfunc)dictreviter_iternext,
3917 .tp_methods = dictiter_methods
3918};
3919
3920
3921/*[clinic input]
3922dict.__reversed__
3923
3924Return a reverse iterator over the dict keys.
3925[clinic start generated code]*/
3926
3927static PyObject *
3928dict___reversed___impl(PyDictObject *self)
3929/*[clinic end generated code: output=e674483336d1ed51 input=23210ef3477d8c4d]*/
3930{
3931 assert (PyDict_Check(self));
3932 return dictiter_new(self, &PyDictRevIterKey_Type);
3933}
3934
Kristján Valur Jónsson31668b82012-04-03 10:49:41 +00003935static PyObject *
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05303936dictiter_reduce(dictiterobject *di, PyObject *Py_UNUSED(ignored))
Kristján Valur Jónsson31668b82012-04-03 10:49:41 +00003937{
Serhiy Storchakabb86bf42018-12-11 08:28:18 +02003938 _Py_IDENTIFIER(iter);
Sergey Fedoseev63958442018-10-20 05:43:33 +05003939 /* copy the iterator state */
3940 dictiterobject tmp = *di;
Kristján Valur Jónsson31668b82012-04-03 10:49:41 +00003941 Py_XINCREF(tmp.di_dict);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04003942
Sergey Fedoseev63958442018-10-20 05:43:33 +05003943 PyObject *list = PySequence_List((PyObject*)&tmp);
Kristján Valur Jónsson31668b82012-04-03 10:49:41 +00003944 Py_XDECREF(tmp.di_dict);
Sergey Fedoseev63958442018-10-20 05:43:33 +05003945 if (list == NULL) {
Kristján Valur Jónsson31668b82012-04-03 10:49:41 +00003946 return NULL;
3947 }
Serhiy Storchakabb86bf42018-12-11 08:28:18 +02003948 return Py_BuildValue("N(N)", _PyEval_GetBuiltinId(&PyId_iter), list);
Kristján Valur Jónsson31668b82012-04-03 10:49:41 +00003949}
3950
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01003951PyTypeObject PyDictRevIterItem_Type = {
3952 PyVarObject_HEAD_INIT(&PyType_Type, 0)
3953 "dict_reverseitemiterator",
3954 sizeof(dictiterobject),
3955 .tp_dealloc = (destructor)dictiter_dealloc,
3956 .tp_flags = Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_GC,
3957 .tp_traverse = (traverseproc)dictiter_traverse,
3958 .tp_iter = PyObject_SelfIter,
3959 .tp_iternext = (iternextfunc)dictreviter_iternext,
3960 .tp_methods = dictiter_methods
3961};
3962
3963PyTypeObject PyDictRevIterValue_Type = {
3964 PyVarObject_HEAD_INIT(&PyType_Type, 0)
3965 "dict_reversevalueiterator",
3966 sizeof(dictiterobject),
3967 .tp_dealloc = (destructor)dictiter_dealloc,
3968 .tp_flags = Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_GC,
3969 .tp_traverse = (traverseproc)dictiter_traverse,
3970 .tp_iter = PyObject_SelfIter,
3971 .tp_iternext = (iternextfunc)dictreviter_iternext,
3972 .tp_methods = dictiter_methods
3973};
3974
Guido van Rossum3ac67412007-02-10 18:55:06 +00003975/***********************************************/
Guido van Rossumb90c8482007-02-10 01:11:45 +00003976/* View objects for keys(), items(), values(). */
Guido van Rossum3ac67412007-02-10 18:55:06 +00003977/***********************************************/
3978
Guido van Rossumb90c8482007-02-10 01:11:45 +00003979/* The instance lay-out is the same for all three; but the type differs. */
3980
Guido van Rossumb90c8482007-02-10 01:11:45 +00003981static void
Eric Snow96c6af92015-05-29 22:21:39 -06003982dictview_dealloc(_PyDictViewObject *dv)
Guido van Rossumb90c8482007-02-10 01:11:45 +00003983{
INADA Naokia6296d32017-08-24 14:55:17 +09003984 /* bpo-31095: UnTrack is needed before calling any callbacks */
3985 _PyObject_GC_UNTRACK(dv);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003986 Py_XDECREF(dv->dv_dict);
3987 PyObject_GC_Del(dv);
Antoine Pitrou7ddda782009-01-01 15:35:33 +00003988}
3989
3990static int
Eric Snow96c6af92015-05-29 22:21:39 -06003991dictview_traverse(_PyDictViewObject *dv, visitproc visit, void *arg)
Antoine Pitrou7ddda782009-01-01 15:35:33 +00003992{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00003993 Py_VISIT(dv->dv_dict);
3994 return 0;
Guido van Rossumb90c8482007-02-10 01:11:45 +00003995}
3996
Guido van Rossum83825ac2007-02-10 04:54:19 +00003997static Py_ssize_t
Eric Snow96c6af92015-05-29 22:21:39 -06003998dictview_len(_PyDictViewObject *dv)
Guido van Rossumb90c8482007-02-10 01:11:45 +00003999{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004000 Py_ssize_t len = 0;
4001 if (dv->dv_dict != NULL)
4002 len = dv->dv_dict->ma_used;
4003 return len;
Guido van Rossumb90c8482007-02-10 01:11:45 +00004004}
4005
Eric Snow96c6af92015-05-29 22:21:39 -06004006PyObject *
4007_PyDictView_New(PyObject *dict, PyTypeObject *type)
Guido van Rossumb90c8482007-02-10 01:11:45 +00004008{
Eric Snow96c6af92015-05-29 22:21:39 -06004009 _PyDictViewObject *dv;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004010 if (dict == NULL) {
4011 PyErr_BadInternalCall();
4012 return NULL;
4013 }
4014 if (!PyDict_Check(dict)) {
4015 /* XXX Get rid of this restriction later */
4016 PyErr_Format(PyExc_TypeError,
4017 "%s() requires a dict argument, not '%s'",
Victor Stinner58ac7002020-02-07 03:04:21 +01004018 type->tp_name, Py_TYPE(dict)->tp_name);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004019 return NULL;
4020 }
Eric Snow96c6af92015-05-29 22:21:39 -06004021 dv = PyObject_GC_New(_PyDictViewObject, type);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004022 if (dv == NULL)
4023 return NULL;
4024 Py_INCREF(dict);
4025 dv->dv_dict = (PyDictObject *)dict;
4026 _PyObject_GC_TRACK(dv);
4027 return (PyObject *)dv;
Guido van Rossumb90c8482007-02-10 01:11:45 +00004028}
4029
Neal Norwitze36f2ba2007-02-26 23:12:28 +00004030/* TODO(guido): The views objects are not complete:
4031
4032 * support more set operations
4033 * support arbitrary mappings?
4034 - either these should be static or exported in dictobject.h
4035 - if public then they should probably be in builtins
4036*/
4037
Guido van Rossumaac530c2007-08-24 22:33:45 +00004038/* Return 1 if self is a subset of other, iterating over self;
4039 0 if not; -1 if an error occurred. */
Guido van Rossumd9214d12007-02-12 02:23:40 +00004040static int
4041all_contained_in(PyObject *self, PyObject *other)
4042{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004043 PyObject *iter = PyObject_GetIter(self);
4044 int ok = 1;
Guido van Rossumd9214d12007-02-12 02:23:40 +00004045
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004046 if (iter == NULL)
4047 return -1;
4048 for (;;) {
4049 PyObject *next = PyIter_Next(iter);
4050 if (next == NULL) {
4051 if (PyErr_Occurred())
4052 ok = -1;
4053 break;
4054 }
4055 ok = PySequence_Contains(other, next);
4056 Py_DECREF(next);
4057 if (ok <= 0)
4058 break;
4059 }
4060 Py_DECREF(iter);
4061 return ok;
Guido van Rossumd9214d12007-02-12 02:23:40 +00004062}
4063
4064static PyObject *
4065dictview_richcompare(PyObject *self, PyObject *other, int op)
4066{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004067 Py_ssize_t len_self, len_other;
4068 int ok;
4069 PyObject *result;
Guido van Rossumaac530c2007-08-24 22:33:45 +00004070
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004071 assert(self != NULL);
4072 assert(PyDictViewSet_Check(self));
4073 assert(other != NULL);
Guido van Rossumd9214d12007-02-12 02:23:40 +00004074
Brian Curtindfc80e32011-08-10 20:28:54 -05004075 if (!PyAnySet_Check(other) && !PyDictViewSet_Check(other))
4076 Py_RETURN_NOTIMPLEMENTED;
Guido van Rossumaac530c2007-08-24 22:33:45 +00004077
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004078 len_self = PyObject_Size(self);
4079 if (len_self < 0)
4080 return NULL;
4081 len_other = PyObject_Size(other);
4082 if (len_other < 0)
4083 return NULL;
Guido van Rossumaac530c2007-08-24 22:33:45 +00004084
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004085 ok = 0;
4086 switch(op) {
Guido van Rossumaac530c2007-08-24 22:33:45 +00004087
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004088 case Py_NE:
4089 case Py_EQ:
4090 if (len_self == len_other)
4091 ok = all_contained_in(self, other);
4092 if (op == Py_NE && ok >= 0)
4093 ok = !ok;
4094 break;
Guido van Rossumaac530c2007-08-24 22:33:45 +00004095
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004096 case Py_LT:
4097 if (len_self < len_other)
4098 ok = all_contained_in(self, other);
4099 break;
Guido van Rossumaac530c2007-08-24 22:33:45 +00004100
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004101 case Py_LE:
4102 if (len_self <= len_other)
4103 ok = all_contained_in(self, other);
4104 break;
Guido van Rossumaac530c2007-08-24 22:33:45 +00004105
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004106 case Py_GT:
4107 if (len_self > len_other)
4108 ok = all_contained_in(other, self);
4109 break;
Guido van Rossumaac530c2007-08-24 22:33:45 +00004110
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004111 case Py_GE:
4112 if (len_self >= len_other)
4113 ok = all_contained_in(other, self);
4114 break;
Guido van Rossumaac530c2007-08-24 22:33:45 +00004115
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004116 }
4117 if (ok < 0)
4118 return NULL;
4119 result = ok ? Py_True : Py_False;
4120 Py_INCREF(result);
4121 return result;
Guido van Rossumd9214d12007-02-12 02:23:40 +00004122}
4123
Raymond Hettingerb0d56af2009-03-03 10:52:49 +00004124static PyObject *
Eric Snow96c6af92015-05-29 22:21:39 -06004125dictview_repr(_PyDictViewObject *dv)
Raymond Hettingerb0d56af2009-03-03 10:52:49 +00004126{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004127 PyObject *seq;
bennorthd7773d92018-01-26 15:46:01 +00004128 PyObject *result = NULL;
4129 Py_ssize_t rc;
Raymond Hettingerb0d56af2009-03-03 10:52:49 +00004130
bennorthd7773d92018-01-26 15:46:01 +00004131 rc = Py_ReprEnter((PyObject *)dv);
4132 if (rc != 0) {
4133 return rc > 0 ? PyUnicode_FromString("...") : NULL;
4134 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004135 seq = PySequence_List((PyObject *)dv);
bennorthd7773d92018-01-26 15:46:01 +00004136 if (seq == NULL) {
4137 goto Done;
4138 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004139 result = PyUnicode_FromFormat("%s(%R)", Py_TYPE(dv)->tp_name, seq);
4140 Py_DECREF(seq);
bennorthd7773d92018-01-26 15:46:01 +00004141
4142Done:
4143 Py_ReprLeave((PyObject *)dv);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004144 return result;
Raymond Hettingerb0d56af2009-03-03 10:52:49 +00004145}
4146
Guido van Rossum3ac67412007-02-10 18:55:06 +00004147/*** dict_keys ***/
Guido van Rossumb90c8482007-02-10 01:11:45 +00004148
4149static PyObject *
Eric Snow96c6af92015-05-29 22:21:39 -06004150dictkeys_iter(_PyDictViewObject *dv)
Guido van Rossumb90c8482007-02-10 01:11:45 +00004151{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004152 if (dv->dv_dict == NULL) {
4153 Py_RETURN_NONE;
4154 }
4155 return dictiter_new(dv->dv_dict, &PyDictIterKey_Type);
Guido van Rossum3ac67412007-02-10 18:55:06 +00004156}
4157
4158static int
Eric Snow96c6af92015-05-29 22:21:39 -06004159dictkeys_contains(_PyDictViewObject *dv, PyObject *obj)
Guido van Rossum3ac67412007-02-10 18:55:06 +00004160{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004161 if (dv->dv_dict == NULL)
4162 return 0;
4163 return PyDict_Contains((PyObject *)dv->dv_dict, obj);
Guido van Rossumb90c8482007-02-10 01:11:45 +00004164}
4165
Guido van Rossum83825ac2007-02-10 04:54:19 +00004166static PySequenceMethods dictkeys_as_sequence = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004167 (lenfunc)dictview_len, /* sq_length */
4168 0, /* sq_concat */
4169 0, /* sq_repeat */
4170 0, /* sq_item */
4171 0, /* sq_slice */
4172 0, /* sq_ass_item */
4173 0, /* sq_ass_slice */
4174 (objobjproc)dictkeys_contains, /* sq_contains */
Guido van Rossum83825ac2007-02-10 04:54:19 +00004175};
4176
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004177// Create an set object from dictviews object.
4178// Returns a new reference.
4179// This utility function is used by set operations.
Guido van Rossum523259b2007-08-24 23:41:22 +00004180static PyObject*
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004181dictviews_to_set(PyObject *self)
Guido van Rossum523259b2007-08-24 23:41:22 +00004182{
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004183 PyObject *left = self;
4184 if (PyDictKeys_Check(self)) {
4185 // PySet_New() has fast path for the dict object.
4186 PyObject *dict = (PyObject *)((_PyDictViewObject *)self)->dv_dict;
4187 if (PyDict_CheckExact(dict)) {
4188 left = dict;
4189 }
4190 }
4191 return PySet_New(left);
4192}
Martin v. Löwisafe55bb2011-10-09 10:38:36 +02004193
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004194static PyObject*
4195dictviews_sub(PyObject *self, PyObject *other)
4196{
4197 PyObject *result = dictviews_to_set(self);
4198 if (result == NULL) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004199 return NULL;
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004200 }
Guido van Rossum523259b2007-08-24 23:41:22 +00004201
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004202 _Py_IDENTIFIER(difference_update);
4203 PyObject *tmp = _PyObject_CallMethodIdOneArg(
4204 result, &PyId_difference_update, other);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004205 if (tmp == NULL) {
4206 Py_DECREF(result);
4207 return NULL;
4208 }
Guido van Rossum523259b2007-08-24 23:41:22 +00004209
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004210 Py_DECREF(tmp);
4211 return result;
Guido van Rossum523259b2007-08-24 23:41:22 +00004212}
4213
Forest Gregg998cf1f2019-08-26 02:17:43 -05004214static int
4215dictitems_contains(_PyDictViewObject *dv, PyObject *obj);
4216
4217PyObject *
Benjamin Peterson025e9eb2015-05-05 20:16:41 -04004218_PyDictView_Intersect(PyObject* self, PyObject *other)
Guido van Rossum523259b2007-08-24 23:41:22 +00004219{
Forest Gregg998cf1f2019-08-26 02:17:43 -05004220 PyObject *result;
4221 PyObject *it;
4222 PyObject *key;
4223 Py_ssize_t len_self;
4224 int rv;
4225 int (*dict_contains)(_PyDictViewObject *, PyObject *);
Martin v. Löwisafe55bb2011-10-09 10:38:36 +02004226
Forest Gregg998cf1f2019-08-26 02:17:43 -05004227 /* Python interpreter swaps parameters when dict view
4228 is on right side of & */
4229 if (!PyDictViewSet_Check(self)) {
4230 PyObject *tmp = other;
4231 other = self;
4232 self = tmp;
4233 }
4234
4235 len_self = dictview_len((_PyDictViewObject *)self);
4236
4237 /* if other is a set and self is smaller than other,
4238 reuse set intersection logic */
Dong-hee Na1b55b652020-02-17 19:09:15 +09004239 if (Py_IS_TYPE(other, &PySet_Type) && len_self <= PyObject_Size(other)) {
Forest Gregg998cf1f2019-08-26 02:17:43 -05004240 _Py_IDENTIFIER(intersection);
4241 return _PyObject_CallMethodIdObjArgs(other, &PyId_intersection, self, NULL);
4242 }
4243
4244 /* if other is another dict view, and it is bigger than self,
4245 swap them */
4246 if (PyDictViewSet_Check(other)) {
4247 Py_ssize_t len_other = dictview_len((_PyDictViewObject *)other);
4248 if (len_other > len_self) {
4249 PyObject *tmp = other;
4250 other = self;
4251 self = tmp;
4252 }
4253 }
4254
4255 /* at this point, two things should be true
4256 1. self is a dictview
4257 2. if other is a dictview then it is smaller than self */
4258 result = PySet_New(NULL);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004259 if (result == NULL)
4260 return NULL;
Guido van Rossum523259b2007-08-24 23:41:22 +00004261
Forest Gregg998cf1f2019-08-26 02:17:43 -05004262 it = PyObject_GetIter(other);
Zackery Spytzb16e3822019-10-13 05:49:05 -06004263 if (it == NULL) {
4264 Py_DECREF(result);
4265 return NULL;
4266 }
Forest Gregg998cf1f2019-08-26 02:17:43 -05004267
Forest Gregg998cf1f2019-08-26 02:17:43 -05004268 if (PyDictKeys_Check(self)) {
4269 dict_contains = dictkeys_contains;
4270 }
4271 /* else PyDictItems_Check(self) */
4272 else {
4273 dict_contains = dictitems_contains;
4274 }
4275
4276 while ((key = PyIter_Next(it)) != NULL) {
4277 rv = dict_contains((_PyDictViewObject *)self, key);
4278 if (rv < 0) {
4279 goto error;
4280 }
4281 if (rv) {
4282 if (PySet_Add(result, key)) {
4283 goto error;
4284 }
4285 }
4286 Py_DECREF(key);
4287 }
4288 Py_DECREF(it);
4289 if (PyErr_Occurred()) {
4290 Py_DECREF(result);
4291 return NULL;
4292 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004293 return result;
Forest Gregg998cf1f2019-08-26 02:17:43 -05004294
4295error:
4296 Py_DECREF(it);
4297 Py_DECREF(result);
4298 Py_DECREF(key);
4299 return NULL;
Guido van Rossum523259b2007-08-24 23:41:22 +00004300}
4301
4302static PyObject*
4303dictviews_or(PyObject* self, PyObject *other)
4304{
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004305 PyObject *result = dictviews_to_set(self);
4306 if (result == NULL) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004307 return NULL;
4308 }
Guido van Rossum523259b2007-08-24 23:41:22 +00004309
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004310 if (_PySet_Update(result, other) < 0) {
4311 Py_DECREF(result);
4312 return NULL;
4313 }
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004314 return result;
Guido van Rossum523259b2007-08-24 23:41:22 +00004315}
4316
4317static PyObject*
4318dictviews_xor(PyObject* self, PyObject *other)
4319{
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004320 PyObject *result = dictviews_to_set(self);
4321 if (result == NULL) {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004322 return NULL;
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004323 }
Guido van Rossum523259b2007-08-24 23:41:22 +00004324
Inada Naoki6cbc84f2019-11-08 00:59:04 +09004325 _Py_IDENTIFIER(symmetric_difference_update);
4326 PyObject *tmp = _PyObject_CallMethodIdOneArg(
4327 result, &PyId_symmetric_difference_update, other);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004328 if (tmp == NULL) {
4329 Py_DECREF(result);
4330 return NULL;
4331 }
Guido van Rossum523259b2007-08-24 23:41:22 +00004332
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004333 Py_DECREF(tmp);
4334 return result;
Guido van Rossum523259b2007-08-24 23:41:22 +00004335}
4336
4337static PyNumberMethods dictviews_as_number = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004338 0, /*nb_add*/
4339 (binaryfunc)dictviews_sub, /*nb_subtract*/
4340 0, /*nb_multiply*/
4341 0, /*nb_remainder*/
4342 0, /*nb_divmod*/
4343 0, /*nb_power*/
4344 0, /*nb_negative*/
4345 0, /*nb_positive*/
4346 0, /*nb_absolute*/
4347 0, /*nb_bool*/
4348 0, /*nb_invert*/
4349 0, /*nb_lshift*/
4350 0, /*nb_rshift*/
Benjamin Peterson025e9eb2015-05-05 20:16:41 -04004351 (binaryfunc)_PyDictView_Intersect, /*nb_and*/
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004352 (binaryfunc)dictviews_xor, /*nb_xor*/
4353 (binaryfunc)dictviews_or, /*nb_or*/
Guido van Rossum523259b2007-08-24 23:41:22 +00004354};
4355
Daniel Stutzbach045b3ba2010-09-02 15:06:06 +00004356static PyObject*
4357dictviews_isdisjoint(PyObject *self, PyObject *other)
4358{
4359 PyObject *it;
4360 PyObject *item = NULL;
4361
4362 if (self == other) {
Eric Snow96c6af92015-05-29 22:21:39 -06004363 if (dictview_len((_PyDictViewObject *)self) == 0)
Daniel Stutzbach045b3ba2010-09-02 15:06:06 +00004364 Py_RETURN_TRUE;
4365 else
4366 Py_RETURN_FALSE;
4367 }
4368
4369 /* Iterate over the shorter object (only if other is a set,
4370 * because PySequence_Contains may be expensive otherwise): */
4371 if (PyAnySet_Check(other) || PyDictViewSet_Check(other)) {
Eric Snow96c6af92015-05-29 22:21:39 -06004372 Py_ssize_t len_self = dictview_len((_PyDictViewObject *)self);
Daniel Stutzbach045b3ba2010-09-02 15:06:06 +00004373 Py_ssize_t len_other = PyObject_Size(other);
4374 if (len_other == -1)
4375 return NULL;
4376
4377 if ((len_other > len_self)) {
4378 PyObject *tmp = other;
4379 other = self;
4380 self = tmp;
4381 }
4382 }
4383
4384 it = PyObject_GetIter(other);
4385 if (it == NULL)
4386 return NULL;
4387
4388 while ((item = PyIter_Next(it)) != NULL) {
4389 int contains = PySequence_Contains(self, item);
4390 Py_DECREF(item);
4391 if (contains == -1) {
4392 Py_DECREF(it);
4393 return NULL;
4394 }
4395
4396 if (contains) {
4397 Py_DECREF(it);
4398 Py_RETURN_FALSE;
4399 }
4400 }
4401 Py_DECREF(it);
4402 if (PyErr_Occurred())
4403 return NULL; /* PyIter_Next raised an exception. */
4404 Py_RETURN_TRUE;
4405}
4406
4407PyDoc_STRVAR(isdisjoint_doc,
4408"Return True if the view and the given iterable have a null intersection.");
4409
Serhiy Storchaka81524022018-11-27 13:05:02 +02004410static PyObject* dictkeys_reversed(_PyDictViewObject *dv, PyObject *Py_UNUSED(ignored));
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01004411
4412PyDoc_STRVAR(reversed_keys_doc,
4413"Return a reverse iterator over the dict keys.");
4414
Guido van Rossumb90c8482007-02-10 01:11:45 +00004415static PyMethodDef dictkeys_methods[] = {
Daniel Stutzbach045b3ba2010-09-02 15:06:06 +00004416 {"isdisjoint", (PyCFunction)dictviews_isdisjoint, METH_O,
4417 isdisjoint_doc},
Serhiy Storchaka62be7422018-11-27 13:27:31 +02004418 {"__reversed__", (PyCFunction)(void(*)(void))dictkeys_reversed, METH_NOARGS,
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01004419 reversed_keys_doc},
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004420 {NULL, NULL} /* sentinel */
Guido van Rossumb90c8482007-02-10 01:11:45 +00004421};
4422
4423PyTypeObject PyDictKeys_Type = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004424 PyVarObject_HEAD_INIT(&PyType_Type, 0)
4425 "dict_keys", /* tp_name */
Eric Snow96c6af92015-05-29 22:21:39 -06004426 sizeof(_PyDictViewObject), /* tp_basicsize */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004427 0, /* tp_itemsize */
4428 /* methods */
4429 (destructor)dictview_dealloc, /* tp_dealloc */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02004430 0, /* tp_vectorcall_offset */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004431 0, /* tp_getattr */
4432 0, /* tp_setattr */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02004433 0, /* tp_as_async */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004434 (reprfunc)dictview_repr, /* tp_repr */
4435 &dictviews_as_number, /* tp_as_number */
4436 &dictkeys_as_sequence, /* tp_as_sequence */
4437 0, /* tp_as_mapping */
4438 0, /* tp_hash */
4439 0, /* tp_call */
4440 0, /* tp_str */
4441 PyObject_GenericGetAttr, /* tp_getattro */
4442 0, /* tp_setattro */
4443 0, /* tp_as_buffer */
4444 Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_GC,/* tp_flags */
4445 0, /* tp_doc */
4446 (traverseproc)dictview_traverse, /* tp_traverse */
4447 0, /* tp_clear */
4448 dictview_richcompare, /* tp_richcompare */
4449 0, /* tp_weaklistoffset */
4450 (getiterfunc)dictkeys_iter, /* tp_iter */
4451 0, /* tp_iternext */
4452 dictkeys_methods, /* tp_methods */
4453 0,
Guido van Rossumb90c8482007-02-10 01:11:45 +00004454};
4455
4456static PyObject *
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05304457dictkeys_new(PyObject *dict, PyObject *Py_UNUSED(ignored))
Guido van Rossumb90c8482007-02-10 01:11:45 +00004458{
Eric Snow96c6af92015-05-29 22:21:39 -06004459 return _PyDictView_New(dict, &PyDictKeys_Type);
Guido van Rossumb90c8482007-02-10 01:11:45 +00004460}
4461
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01004462static PyObject *
Serhiy Storchaka81524022018-11-27 13:05:02 +02004463dictkeys_reversed(_PyDictViewObject *dv, PyObject *Py_UNUSED(ignored))
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01004464{
4465 if (dv->dv_dict == NULL) {
4466 Py_RETURN_NONE;
4467 }
4468 return dictiter_new(dv->dv_dict, &PyDictRevIterKey_Type);
4469}
4470
Guido van Rossum3ac67412007-02-10 18:55:06 +00004471/*** dict_items ***/
Guido van Rossumb90c8482007-02-10 01:11:45 +00004472
4473static PyObject *
Eric Snow96c6af92015-05-29 22:21:39 -06004474dictitems_iter(_PyDictViewObject *dv)
Guido van Rossumb90c8482007-02-10 01:11:45 +00004475{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004476 if (dv->dv_dict == NULL) {
4477 Py_RETURN_NONE;
4478 }
4479 return dictiter_new(dv->dv_dict, &PyDictIterItem_Type);
Guido van Rossum3ac67412007-02-10 18:55:06 +00004480}
4481
4482static int
Eric Snow96c6af92015-05-29 22:21:39 -06004483dictitems_contains(_PyDictViewObject *dv, PyObject *obj)
Guido van Rossum3ac67412007-02-10 18:55:06 +00004484{
Serhiy Storchaka753bca32017-05-20 12:30:02 +03004485 int result;
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004486 PyObject *key, *value, *found;
4487 if (dv->dv_dict == NULL)
4488 return 0;
4489 if (!PyTuple_Check(obj) || PyTuple_GET_SIZE(obj) != 2)
4490 return 0;
4491 key = PyTuple_GET_ITEM(obj, 0);
4492 value = PyTuple_GET_ITEM(obj, 1);
Raymond Hettinger6692f012016-09-18 21:46:08 -07004493 found = PyDict_GetItemWithError((PyObject *)dv->dv_dict, key);
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004494 if (found == NULL) {
4495 if (PyErr_Occurred())
4496 return -1;
4497 return 0;
4498 }
Serhiy Storchaka753bca32017-05-20 12:30:02 +03004499 Py_INCREF(found);
Serhiy Storchaka18b711c2019-08-04 14:12:48 +03004500 result = PyObject_RichCompareBool(found, value, Py_EQ);
Serhiy Storchaka753bca32017-05-20 12:30:02 +03004501 Py_DECREF(found);
4502 return result;
Guido van Rossumb90c8482007-02-10 01:11:45 +00004503}
4504
Guido van Rossum83825ac2007-02-10 04:54:19 +00004505static PySequenceMethods dictitems_as_sequence = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004506 (lenfunc)dictview_len, /* sq_length */
4507 0, /* sq_concat */
4508 0, /* sq_repeat */
4509 0, /* sq_item */
4510 0, /* sq_slice */
4511 0, /* sq_ass_item */
4512 0, /* sq_ass_slice */
4513 (objobjproc)dictitems_contains, /* sq_contains */
Guido van Rossum83825ac2007-02-10 04:54:19 +00004514};
4515
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01004516static PyObject* dictitems_reversed(_PyDictViewObject *dv);
4517
4518PyDoc_STRVAR(reversed_items_doc,
4519"Return a reverse iterator over the dict items.");
4520
Guido van Rossumb90c8482007-02-10 01:11:45 +00004521static PyMethodDef dictitems_methods[] = {
Daniel Stutzbach045b3ba2010-09-02 15:06:06 +00004522 {"isdisjoint", (PyCFunction)dictviews_isdisjoint, METH_O,
4523 isdisjoint_doc},
Serhiy Storchaka62be7422018-11-27 13:27:31 +02004524 {"__reversed__", (PyCFunction)(void(*)(void))dictitems_reversed, METH_NOARGS,
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01004525 reversed_items_doc},
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004526 {NULL, NULL} /* sentinel */
Guido van Rossumb90c8482007-02-10 01:11:45 +00004527};
4528
4529PyTypeObject PyDictItems_Type = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004530 PyVarObject_HEAD_INIT(&PyType_Type, 0)
4531 "dict_items", /* tp_name */
Eric Snow96c6af92015-05-29 22:21:39 -06004532 sizeof(_PyDictViewObject), /* tp_basicsize */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004533 0, /* tp_itemsize */
4534 /* methods */
4535 (destructor)dictview_dealloc, /* tp_dealloc */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02004536 0, /* tp_vectorcall_offset */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004537 0, /* tp_getattr */
4538 0, /* tp_setattr */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02004539 0, /* tp_as_async */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004540 (reprfunc)dictview_repr, /* tp_repr */
4541 &dictviews_as_number, /* tp_as_number */
4542 &dictitems_as_sequence, /* tp_as_sequence */
4543 0, /* tp_as_mapping */
4544 0, /* tp_hash */
4545 0, /* tp_call */
4546 0, /* tp_str */
4547 PyObject_GenericGetAttr, /* tp_getattro */
4548 0, /* tp_setattro */
4549 0, /* tp_as_buffer */
4550 Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_GC,/* tp_flags */
4551 0, /* tp_doc */
4552 (traverseproc)dictview_traverse, /* tp_traverse */
4553 0, /* tp_clear */
4554 dictview_richcompare, /* tp_richcompare */
4555 0, /* tp_weaklistoffset */
4556 (getiterfunc)dictitems_iter, /* tp_iter */
4557 0, /* tp_iternext */
4558 dictitems_methods, /* tp_methods */
4559 0,
Guido van Rossumb90c8482007-02-10 01:11:45 +00004560};
4561
4562static PyObject *
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05304563dictitems_new(PyObject *dict, PyObject *Py_UNUSED(ignored))
Guido van Rossumb90c8482007-02-10 01:11:45 +00004564{
Eric Snow96c6af92015-05-29 22:21:39 -06004565 return _PyDictView_New(dict, &PyDictItems_Type);
Guido van Rossumb90c8482007-02-10 01:11:45 +00004566}
4567
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01004568static PyObject *
4569dictitems_reversed(_PyDictViewObject *dv)
4570{
4571 if (dv->dv_dict == NULL) {
4572 Py_RETURN_NONE;
4573 }
4574 return dictiter_new(dv->dv_dict, &PyDictRevIterItem_Type);
4575}
4576
Guido van Rossum3ac67412007-02-10 18:55:06 +00004577/*** dict_values ***/
Guido van Rossumb90c8482007-02-10 01:11:45 +00004578
4579static PyObject *
Eric Snow96c6af92015-05-29 22:21:39 -06004580dictvalues_iter(_PyDictViewObject *dv)
Guido van Rossumb90c8482007-02-10 01:11:45 +00004581{
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004582 if (dv->dv_dict == NULL) {
4583 Py_RETURN_NONE;
4584 }
4585 return dictiter_new(dv->dv_dict, &PyDictIterValue_Type);
Guido van Rossumb90c8482007-02-10 01:11:45 +00004586}
4587
Guido van Rossum83825ac2007-02-10 04:54:19 +00004588static PySequenceMethods dictvalues_as_sequence = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004589 (lenfunc)dictview_len, /* sq_length */
4590 0, /* sq_concat */
4591 0, /* sq_repeat */
4592 0, /* sq_item */
4593 0, /* sq_slice */
4594 0, /* sq_ass_item */
4595 0, /* sq_ass_slice */
4596 (objobjproc)0, /* sq_contains */
Guido van Rossum83825ac2007-02-10 04:54:19 +00004597};
4598
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01004599static PyObject* dictvalues_reversed(_PyDictViewObject *dv);
4600
4601PyDoc_STRVAR(reversed_values_doc,
4602"Return a reverse iterator over the dict values.");
4603
Guido van Rossumb90c8482007-02-10 01:11:45 +00004604static PyMethodDef dictvalues_methods[] = {
Serhiy Storchaka62be7422018-11-27 13:27:31 +02004605 {"__reversed__", (PyCFunction)(void(*)(void))dictvalues_reversed, METH_NOARGS,
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01004606 reversed_values_doc},
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004607 {NULL, NULL} /* sentinel */
Guido van Rossumb90c8482007-02-10 01:11:45 +00004608};
4609
4610PyTypeObject PyDictValues_Type = {
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004611 PyVarObject_HEAD_INIT(&PyType_Type, 0)
4612 "dict_values", /* tp_name */
Eric Snow96c6af92015-05-29 22:21:39 -06004613 sizeof(_PyDictViewObject), /* tp_basicsize */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004614 0, /* tp_itemsize */
4615 /* methods */
4616 (destructor)dictview_dealloc, /* tp_dealloc */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02004617 0, /* tp_vectorcall_offset */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004618 0, /* tp_getattr */
4619 0, /* tp_setattr */
Jeroen Demeyer530f5062019-05-31 04:13:39 +02004620 0, /* tp_as_async */
Antoine Pitrouf95a1b32010-05-09 15:52:27 +00004621 (reprfunc)dictview_repr, /* tp_repr */
4622 0, /* tp_as_number */
4623 &dictvalues_as_sequence, /* tp_as_sequence */
4624 0, /* tp_as_mapping */
4625 0, /* tp_hash */
4626 0, /* tp_call */
4627 0, /* tp_str */
4628 PyObject_GenericGetAttr, /* tp_getattro */
4629 0, /* tp_setattro */
4630 0, /* tp_as_buffer */
4631 Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_GC,/* tp_flags */
4632 0, /* tp_doc */
4633 (traverseproc)dictview_traverse, /* tp_traverse */
4634 0, /* tp_clear */
4635 0, /* tp_richcompare */
4636 0, /* tp_weaklistoffset */
4637 (getiterfunc)dictvalues_iter, /* tp_iter */
4638 0, /* tp_iternext */
4639 dictvalues_methods, /* tp_methods */
4640 0,
Guido van Rossumb90c8482007-02-10 01:11:45 +00004641};
4642
4643static PyObject *
Siddhesh Poyarekar55edd0c2018-04-30 00:29:33 +05304644dictvalues_new(PyObject *dict, PyObject *Py_UNUSED(ignored))
Guido van Rossumb90c8482007-02-10 01:11:45 +00004645{
Eric Snow96c6af92015-05-29 22:21:39 -06004646 return _PyDictView_New(dict, &PyDictValues_Type);
Guido van Rossumb90c8482007-02-10 01:11:45 +00004647}
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004648
Rémi Lapeyre6531bf62018-11-06 01:38:54 +01004649static PyObject *
4650dictvalues_reversed(_PyDictViewObject *dv)
4651{
4652 if (dv->dv_dict == NULL) {
4653 Py_RETURN_NONE;
4654 }
4655 return dictiter_new(dv->dv_dict, &PyDictRevIterValue_Type);
4656}
4657
4658
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004659/* Returns NULL if cannot allocate a new PyDictKeysObject,
4660 but does not set an error */
4661PyDictKeysObject *
4662_PyDict_NewKeysForClass(void)
4663{
Victor Stinner742da042016-09-07 17:40:12 -07004664 PyDictKeysObject *keys = new_keys_object(PyDict_MINSIZE);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004665 if (keys == NULL)
4666 PyErr_Clear();
4667 else
4668 keys->dk_lookup = lookdict_split;
4669 return keys;
4670}
4671
4672#define CACHED_KEYS(tp) (((PyHeapTypeObject*)tp)->ht_cached_keys)
4673
4674PyObject *
4675PyObject_GenericGetDict(PyObject *obj, void *context)
4676{
4677 PyObject *dict, **dictptr = _PyObject_GetDictPtr(obj);
4678 if (dictptr == NULL) {
4679 PyErr_SetString(PyExc_AttributeError,
4680 "This object has no __dict__");
4681 return NULL;
4682 }
4683 dict = *dictptr;
4684 if (dict == NULL) {
4685 PyTypeObject *tp = Py_TYPE(obj);
4686 if ((tp->tp_flags & Py_TPFLAGS_HEAPTYPE) && CACHED_KEYS(tp)) {
INADA Naokia7576492018-11-14 18:39:27 +09004687 dictkeys_incref(CACHED_KEYS(tp));
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004688 *dictptr = dict = new_dict_with_shared_keys(CACHED_KEYS(tp));
4689 }
4690 else {
4691 *dictptr = dict = PyDict_New();
4692 }
4693 }
4694 Py_XINCREF(dict);
4695 return dict;
4696}
4697
4698int
4699_PyObjectDict_SetItem(PyTypeObject *tp, PyObject **dictptr,
Victor Stinner742da042016-09-07 17:40:12 -07004700 PyObject *key, PyObject *value)
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004701{
4702 PyObject *dict;
4703 int res;
4704 PyDictKeysObject *cached;
4705
4706 assert(dictptr != NULL);
4707 if ((tp->tp_flags & Py_TPFLAGS_HEAPTYPE) && (cached = CACHED_KEYS(tp))) {
4708 assert(dictptr != NULL);
4709 dict = *dictptr;
4710 if (dict == NULL) {
INADA Naokia7576492018-11-14 18:39:27 +09004711 dictkeys_incref(cached);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004712 dict = new_dict_with_shared_keys(cached);
4713 if (dict == NULL)
4714 return -1;
4715 *dictptr = dict;
4716 }
4717 if (value == NULL) {
4718 res = PyDict_DelItem(dict, key);
INADA Naoki2294f3a2017-02-12 13:51:30 +09004719 // Since key sharing dict doesn't allow deletion, PyDict_DelItem()
4720 // always converts dict to combined form.
4721 if ((cached = CACHED_KEYS(tp)) != NULL) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004722 CACHED_KEYS(tp) = NULL;
INADA Naokia7576492018-11-14 18:39:27 +09004723 dictkeys_decref(cached);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004724 }
Victor Stinner3d3f2642016-12-15 17:21:23 +01004725 }
4726 else {
INADA Naoki2294f3a2017-02-12 13:51:30 +09004727 int was_shared = (cached == ((PyDictObject *)dict)->ma_keys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004728 res = PyDict_SetItem(dict, key, value);
INADA Naoki2294f3a2017-02-12 13:51:30 +09004729 if (was_shared &&
4730 (cached = CACHED_KEYS(tp)) != NULL &&
4731 cached != ((PyDictObject *)dict)->ma_keys) {
Victor Stinner3d3f2642016-12-15 17:21:23 +01004732 /* PyDict_SetItem() may call dictresize and convert split table
4733 * into combined table. In such case, convert it to split
4734 * table again and update type's shared key only when this is
4735 * the only dict sharing key with the type.
4736 *
4737 * This is to allow using shared key in class like this:
4738 *
4739 * class C:
4740 * def __init__(self):
4741 * # one dict resize happens
4742 * self.a, self.b, self.c = 1, 2, 3
4743 * self.d, self.e, self.f = 4, 5, 6
4744 * a = C()
4745 */
Benjamin Peterson15ee8212012-04-24 14:44:18 -04004746 if (cached->dk_refcnt == 1) {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004747 CACHED_KEYS(tp) = make_keys_shared(dict);
Victor Stinner742da042016-09-07 17:40:12 -07004748 }
4749 else {
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004750 CACHED_KEYS(tp) = NULL;
4751 }
INADA Naokia7576492018-11-14 18:39:27 +09004752 dictkeys_decref(cached);
Benjamin Peterson15ee8212012-04-24 14:44:18 -04004753 if (CACHED_KEYS(tp) == NULL && PyErr_Occurred())
4754 return -1;
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004755 }
4756 }
4757 } else {
4758 dict = *dictptr;
4759 if (dict == NULL) {
4760 dict = PyDict_New();
4761 if (dict == NULL)
4762 return -1;
4763 *dictptr = dict;
4764 }
4765 if (value == NULL) {
4766 res = PyDict_DelItem(dict, key);
4767 } else {
4768 res = PyDict_SetItem(dict, key, value);
4769 }
4770 }
4771 return res;
4772}
4773
4774void
4775_PyDictKeys_DecRef(PyDictKeysObject *keys)
4776{
INADA Naokia7576492018-11-14 18:39:27 +09004777 dictkeys_decref(keys);
Benjamin Peterson7d95e402012-04-23 11:24:50 -04004778}