blob: 85253123aa593ad1766a97e9e3e6c1c17f31d448 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`random` --- Generate pseudo-random numbers
2================================================
3
4.. module:: random
5 :synopsis: Generate pseudo-random numbers with various common distributions.
6
Raymond Hettinger10480942011-01-10 03:26:08 +00007**Source code:** :source:`Lib/random.py`
Georg Brandl116aa622007-08-15 14:28:22 +00008
9This module implements pseudo-random number generators for various
10distributions.
11
Raymond Hettingerb21dac12010-09-07 05:32:49 +000012For integers, there is uniform selection from a range. For sequences, there is
13uniform selection of a random element, a function to generate a random
14permutation of a list in-place, and a function for random sampling without
15replacement.
Georg Brandl116aa622007-08-15 14:28:22 +000016
17On the real line, there are functions to compute uniform, normal (Gaussian),
18lognormal, negative exponential, gamma, and beta distributions. For generating
19distributions of angles, the von Mises distribution is available.
20
21Almost all module functions depend on the basic function :func:`random`, which
22generates a random float uniformly in the semi-open range [0.0, 1.0). Python
23uses the Mersenne Twister as the core generator. It produces 53-bit precision
24floats and has a period of 2\*\*19937-1. The underlying implementation in C is
25both fast and threadsafe. The Mersenne Twister is one of the most extensively
26tested random number generators in existence. However, being completely
27deterministic, it is not suitable for all purposes, and is completely unsuitable
28for cryptographic purposes.
29
30The functions supplied by this module are actually bound methods of a hidden
31instance of the :class:`random.Random` class. You can instantiate your own
Raymond Hettinger28de64f2008-01-13 23:40:30 +000032instances of :class:`Random` to get generators that don't share state.
Georg Brandl116aa622007-08-15 14:28:22 +000033
34Class :class:`Random` can also be subclassed if you want to use a different
35basic generator of your own devising: in that case, override the :meth:`random`,
Raymond Hettingerafd30452009-02-24 10:57:02 +000036:meth:`seed`, :meth:`getstate`, and :meth:`setstate` methods.
Benjamin Petersond18de0e2008-07-31 20:21:46 +000037Optionally, a new generator can supply a :meth:`getrandbits` method --- this
Georg Brandl116aa622007-08-15 14:28:22 +000038allows :meth:`randrange` to produce selections over an arbitrarily large range.
39
Benjamin Peterson21896a32010-03-21 22:03:03 +000040The :mod:`random` module also provides the :class:`SystemRandom` class which
41uses the system function :func:`os.urandom` to generate random numbers
42from sources provided by the operating system.
Georg Brandl116aa622007-08-15 14:28:22 +000043
Georg Brandl116aa622007-08-15 14:28:22 +000044
Raymond Hettinger10480942011-01-10 03:26:08 +000045Bookkeeping functions:
Georg Brandl116aa622007-08-15 14:28:22 +000046
Raymond Hettingerf763a722010-09-07 00:38:15 +000047.. function:: seed([x], version=2)
Georg Brandl116aa622007-08-15 14:28:22 +000048
Raymond Hettingerf763a722010-09-07 00:38:15 +000049 Initialize the random number generator.
Georg Brandl116aa622007-08-15 14:28:22 +000050
Raymond Hettingerf763a722010-09-07 00:38:15 +000051 If *x* is omitted or ``None``, the current system time is used. If
52 randomness sources are provided by the operating system, they are used
53 instead of the system time (see the :func:`os.urandom` function for details
54 on availability).
Georg Brandl116aa622007-08-15 14:28:22 +000055
Raymond Hettingerf763a722010-09-07 00:38:15 +000056 If *x* is an int, it is used directly.
57
58 With version 2 (the default), a :class:`str`, :class:`bytes`, or :class:`bytearray`
Raymond Hettinger3149f9c2010-09-07 05:35:10 +000059 object gets converted to an :class:`int` and all of its bits are used. With version 1,
Raymond Hettingerf763a722010-09-07 00:38:15 +000060 the :func:`hash` of *x* is used instead.
61
62 .. versionchanged:: 3.2
63 Moved to the version 2 scheme which uses all of the bits in a string seed.
Georg Brandl116aa622007-08-15 14:28:22 +000064
65.. function:: getstate()
66
67 Return an object capturing the current internal state of the generator. This
68 object can be passed to :func:`setstate` to restore the state.
69
Georg Brandl116aa622007-08-15 14:28:22 +000070
71.. function:: setstate(state)
72
73 *state* should have been obtained from a previous call to :func:`getstate`, and
74 :func:`setstate` restores the internal state of the generator to what it was at
75 the time :func:`setstate` was called.
76
Georg Brandl116aa622007-08-15 14:28:22 +000077
Georg Brandl116aa622007-08-15 14:28:22 +000078.. function:: getrandbits(k)
79
Ezio Melotti0639d5a2009-12-19 23:26:38 +000080 Returns a Python integer with *k* random bits. This method is supplied with
Georg Brandl5c106642007-11-29 17:41:05 +000081 the MersenneTwister generator and some other generators may also provide it
Georg Brandl116aa622007-08-15 14:28:22 +000082 as an optional part of the API. When available, :meth:`getrandbits` enables
83 :meth:`randrange` to handle arbitrarily large ranges.
84
Georg Brandl116aa622007-08-15 14:28:22 +000085
86Functions for integers:
87
Georg Brandl116aa622007-08-15 14:28:22 +000088.. function:: randrange([start,] stop[, step])
89
90 Return a randomly selected element from ``range(start, stop, step)``. This is
91 equivalent to ``choice(range(start, stop, step))``, but doesn't actually build a
92 range object.
93
Raymond Hettinger05156612010-09-07 04:44:52 +000094 The positional argument pattern matches that of :func:`range`. Keyword arguments
95 should not be used because the function may use them in unexpected ways.
96
97 .. versionchanged:: 3.2
98 :meth:`randrange` is more sophisticated about producing equally distributed
99 values. Formerly it used a style like ``int(random()*n)`` which could produce
100 slightly uneven distributions.
Georg Brandl116aa622007-08-15 14:28:22 +0000101
102.. function:: randint(a, b)
103
Raymond Hettingerafd30452009-02-24 10:57:02 +0000104 Return a random integer *N* such that ``a <= N <= b``. Alias for
105 ``randrange(a, b+1)``.
Georg Brandl116aa622007-08-15 14:28:22 +0000106
Georg Brandl116aa622007-08-15 14:28:22 +0000107
Georg Brandl55ac8f02007-09-01 13:51:09 +0000108Functions for sequences:
Georg Brandl116aa622007-08-15 14:28:22 +0000109
110.. function:: choice(seq)
111
112 Return a random element from the non-empty sequence *seq*. If *seq* is empty,
113 raises :exc:`IndexError`.
114
115
116.. function:: shuffle(x[, random])
117
118 Shuffle the sequence *x* in place. The optional argument *random* is a
119 0-argument function returning a random float in [0.0, 1.0); by default, this is
120 the function :func:`random`.
121
122 Note that for even rather small ``len(x)``, the total number of permutations of
123 *x* is larger than the period of most random number generators; this implies
124 that most permutations of a long sequence can never be generated.
125
126
127.. function:: sample(population, k)
128
Raymond Hettinger1acde192008-01-14 01:00:53 +0000129 Return a *k* length list of unique elements chosen from the population sequence
130 or set. Used for random sampling without replacement.
Georg Brandl116aa622007-08-15 14:28:22 +0000131
Georg Brandl116aa622007-08-15 14:28:22 +0000132 Returns a new list containing elements from the population while leaving the
133 original population unchanged. The resulting list is in selection order so that
134 all sub-slices will also be valid random samples. This allows raffle winners
135 (the sample) to be partitioned into grand prize and second place winners (the
136 subslices).
137
Guido van Rossum2cc30da2007-11-02 23:46:40 +0000138 Members of the population need not be :term:`hashable` or unique. If the population
Georg Brandl116aa622007-08-15 14:28:22 +0000139 contains repeats, then each occurrence is a possible selection in the sample.
140
141 To choose a sample from a range of integers, use an :func:`range` object as an
142 argument. This is especially fast and space efficient for sampling from a large
143 population: ``sample(range(10000000), 60)``.
144
145The following functions generate specific real-valued distributions. Function
146parameters are named after the corresponding variables in the distribution's
147equation, as used in common mathematical practice; most of these equations can
148be found in any statistics text.
149
150
151.. function:: random()
152
153 Return the next random floating point number in the range [0.0, 1.0).
154
155
156.. function:: uniform(a, b)
157
Benjamin Petersonb58dda72009-01-18 22:27:04 +0000158 Return a random floating point number *N* such that ``a <= N <= b`` for
159 ``a <= b`` and ``b <= N <= a`` for ``b < a``.
Georg Brandl116aa622007-08-15 14:28:22 +0000160
Raymond Hettingerbe40db02009-06-11 23:12:14 +0000161 The end-point value ``b`` may or may not be included in the range
162 depending on floating-point rounding in the equation ``a + (b-a) * random()``.
Benjamin Peterson35e8c462008-04-24 02:34:53 +0000163
Christian Heimesfe337bf2008-03-23 21:54:12 +0000164.. function:: triangular(low, high, mode)
165
Benjamin Petersonb58dda72009-01-18 22:27:04 +0000166 Return a random floating point number *N* such that ``low <= N <= high`` and
Christian Heimescc47b052008-03-25 14:56:36 +0000167 with the specified *mode* between those bounds. The *low* and *high* bounds
168 default to zero and one. The *mode* argument defaults to the midpoint
169 between the bounds, giving a symmetric distribution.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000170
Georg Brandl116aa622007-08-15 14:28:22 +0000171
172.. function:: betavariate(alpha, beta)
173
Benjamin Petersonb58dda72009-01-18 22:27:04 +0000174 Beta distribution. Conditions on the parameters are ``alpha > 0`` and
175 ``beta > 0``. Returned values range between 0 and 1.
Georg Brandl116aa622007-08-15 14:28:22 +0000176
177
178.. function:: expovariate(lambd)
179
Mark Dickinson2f947362009-01-07 17:54:07 +0000180 Exponential distribution. *lambd* is 1.0 divided by the desired
181 mean. It should be nonzero. (The parameter would be called
182 "lambda", but that is a reserved word in Python.) Returned values
183 range from 0 to positive infinity if *lambd* is positive, and from
184 negative infinity to 0 if *lambd* is negative.
Georg Brandl116aa622007-08-15 14:28:22 +0000185
186
187.. function:: gammavariate(alpha, beta)
188
Benjamin Petersonb58dda72009-01-18 22:27:04 +0000189 Gamma distribution. (*Not* the gamma function!) Conditions on the
190 parameters are ``alpha > 0`` and ``beta > 0``.
Georg Brandl116aa622007-08-15 14:28:22 +0000191
192
193.. function:: gauss(mu, sigma)
194
Benjamin Petersonb58dda72009-01-18 22:27:04 +0000195 Gaussian distribution. *mu* is the mean, and *sigma* is the standard
196 deviation. This is slightly faster than the :func:`normalvariate` function
197 defined below.
Georg Brandl116aa622007-08-15 14:28:22 +0000198
199
200.. function:: lognormvariate(mu, sigma)
201
202 Log normal distribution. If you take the natural logarithm of this
203 distribution, you'll get a normal distribution with mean *mu* and standard
204 deviation *sigma*. *mu* can have any value, and *sigma* must be greater than
205 zero.
206
207
208.. function:: normalvariate(mu, sigma)
209
210 Normal distribution. *mu* is the mean, and *sigma* is the standard deviation.
211
212
213.. function:: vonmisesvariate(mu, kappa)
214
215 *mu* is the mean angle, expressed in radians between 0 and 2\*\ *pi*, and *kappa*
216 is the concentration parameter, which must be greater than or equal to zero. If
217 *kappa* is equal to zero, this distribution reduces to a uniform random angle
218 over the range 0 to 2\*\ *pi*.
219
220
221.. function:: paretovariate(alpha)
222
223 Pareto distribution. *alpha* is the shape parameter.
224
225
226.. function:: weibullvariate(alpha, beta)
227
228 Weibull distribution. *alpha* is the scale parameter and *beta* is the shape
229 parameter.
230
231
Raymond Hettinger3cdf8712010-12-02 05:35:35 +0000232Alternative Generator:
Georg Brandl116aa622007-08-15 14:28:22 +0000233
Georg Brandl116aa622007-08-15 14:28:22 +0000234.. class:: SystemRandom([seed])
235
236 Class that uses the :func:`os.urandom` function for generating random numbers
237 from sources provided by the operating system. Not available on all systems.
Raymond Hettinger3cdf8712010-12-02 05:35:35 +0000238 Does not rely on software state, and sequences are not reproducible. Accordingly,
Raymond Hettingerafd30452009-02-24 10:57:02 +0000239 the :meth:`seed` method has no effect and is ignored.
Georg Brandl116aa622007-08-15 14:28:22 +0000240 The :meth:`getstate` and :meth:`setstate` methods raise
241 :exc:`NotImplementedError` if called.
242
Georg Brandl116aa622007-08-15 14:28:22 +0000243
Georg Brandl116aa622007-08-15 14:28:22 +0000244.. seealso::
245
246 M. Matsumoto and T. Nishimura, "Mersenne Twister: A 623-dimensionally
247 equidistributed uniform pseudorandom number generator", ACM Transactions on
248 Modeling and Computer Simulation Vol. 8, No. 1, January pp.3-30 1998.
249
Georg Brandl116aa622007-08-15 14:28:22 +0000250
Raymond Hettinger1fd32a62009-04-01 20:52:13 +0000251 `Complementary-Multiply-with-Carry recipe
Raymond Hettinger9743fd02009-04-03 05:47:33 +0000252 <http://code.activestate.com/recipes/576707/>`_ for a compatible alternative
253 random number generator with a long period and comparatively simple update
Raymond Hettinger1fd32a62009-04-01 20:52:13 +0000254 operations.
Raymond Hettinger435cb0f2010-09-06 23:36:31 +0000255
Raymond Hettinger3cdf8712010-12-02 05:35:35 +0000256
Raymond Hettinger435cb0f2010-09-06 23:36:31 +0000257Notes on Reproducibility
Antoine Pitroue72b5862010-12-12 20:13:31 +0000258------------------------
Raymond Hettinger435cb0f2010-09-06 23:36:31 +0000259
260Sometimes it is useful to be able to reproduce the sequences given by a pseudo
261random number generator. By re-using a seed value, the same sequence should be
262reproducible from run to run as long as multiple threads are not running.
263
264Most of the random module's algorithms and seeding functions are subject to
265change across Python versions, but two aspects are guaranteed not to change:
266
267* If a new seeding method is added, then a backward compatible seeder will be
268 offered.
269
270* The generator's :meth:`random` method will continue to produce the same
271 sequence when the compatible seeder is given the same seed.
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000272
Raymond Hettinger6e353942010-12-04 23:42:12 +0000273.. _random-examples:
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000274
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000275Examples and Recipes
Antoine Pitroue72b5862010-12-12 20:13:31 +0000276--------------------
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000277
Raymond Hettinger3cdf8712010-12-02 05:35:35 +0000278Basic usage::
279
280 >>> random.random() # Random float x, 0.0 <= x < 1.0
281 0.37444887175646646
282
283 >>> random.uniform(1, 10) # Random float x, 1.0 <= x < 10.0
284 1.1800146073117523
285
286 >>> random.randrange(10) # Integer from 0 to 9
287 7
288
289 >>> random.randrange(0, 101, 2) # Even integer from 0 to 100
290 26
291
292 >>> random.choice('abcdefghij') # Single random element
293 'c'
294
295 >>> items = [1, 2, 3, 4, 5, 6, 7]
296 >>> random.shuffle(items)
297 >>> items
298 [7, 3, 2, 5, 6, 4, 1]
299
300 >>> random.sample([1, 2, 3, 4, 5], 3) # Three samples without replacement
301 [4, 1, 5]
302
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000303A common task is to make a :func:`random.choice` with weighted probababilites.
304
305If the weights are small integer ratios, a simple technique is to build a sample
306population with repeats::
307
308 >>> weighted_choices = [('Red', 3), ('Blue', 2), ('Yellow', 1), ('Green', 4)]
309 >>> population = [val for val, cnt in weighted_choices for i in range(cnt)]
310 >>> random.choice(population)
311 'Green'
312
Raymond Hettinger3cdf8712010-12-02 05:35:35 +0000313A more general approach is to arrange the weights in a cumulative distribution
314with :func:`itertools.accumulate`, and then locate the random value with
315:func:`bisect.bisect`::
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000316
317 >>> choices, weights = zip(*weighted_choices)
318 >>> cumdist = list(itertools.accumulate(weights))
319 >>> x = random.random() * cumdist[-1]
320 >>> choices[bisect.bisect(cumdist, x)]
321 'Blue'