blob: 33ab5e5b49fff495be0f76b25313c185ef3dd057 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`random` --- Generate pseudo-random numbers
2================================================
3
4.. module:: random
5 :synopsis: Generate pseudo-random numbers with various common distributions.
6
7
8This module implements pseudo-random number generators for various
9distributions.
10
Éric Araujo6e6cb8e2010-11-16 19:13:50 +000011.. seealso::
12
13 Latest version of the :source:`random module Python source code
14 <Lib/random.py>`
15
Raymond Hettingerb21dac12010-09-07 05:32:49 +000016For integers, there is uniform selection from a range. For sequences, there is
17uniform selection of a random element, a function to generate a random
18permutation of a list in-place, and a function for random sampling without
19replacement.
Georg Brandl116aa622007-08-15 14:28:22 +000020
21On the real line, there are functions to compute uniform, normal (Gaussian),
22lognormal, negative exponential, gamma, and beta distributions. For generating
23distributions of angles, the von Mises distribution is available.
24
25Almost all module functions depend on the basic function :func:`random`, which
26generates a random float uniformly in the semi-open range [0.0, 1.0). Python
27uses the Mersenne Twister as the core generator. It produces 53-bit precision
28floats and has a period of 2\*\*19937-1. The underlying implementation in C is
29both fast and threadsafe. The Mersenne Twister is one of the most extensively
30tested random number generators in existence. However, being completely
31deterministic, it is not suitable for all purposes, and is completely unsuitable
32for cryptographic purposes.
33
34The functions supplied by this module are actually bound methods of a hidden
35instance of the :class:`random.Random` class. You can instantiate your own
Raymond Hettinger28de64f2008-01-13 23:40:30 +000036instances of :class:`Random` to get generators that don't share state.
Georg Brandl116aa622007-08-15 14:28:22 +000037
38Class :class:`Random` can also be subclassed if you want to use a different
39basic generator of your own devising: in that case, override the :meth:`random`,
Raymond Hettingerafd30452009-02-24 10:57:02 +000040:meth:`seed`, :meth:`getstate`, and :meth:`setstate` methods.
Benjamin Petersond18de0e2008-07-31 20:21:46 +000041Optionally, a new generator can supply a :meth:`getrandbits` method --- this
Georg Brandl116aa622007-08-15 14:28:22 +000042allows :meth:`randrange` to produce selections over an arbitrarily large range.
43
Benjamin Peterson21896a32010-03-21 22:03:03 +000044The :mod:`random` module also provides the :class:`SystemRandom` class which
45uses the system function :func:`os.urandom` to generate random numbers
46from sources provided by the operating system.
Georg Brandl116aa622007-08-15 14:28:22 +000047
Georg Brandl116aa622007-08-15 14:28:22 +000048Bookkeeping functions:
49
50
Raymond Hettingerf763a722010-09-07 00:38:15 +000051.. function:: seed([x], version=2)
Georg Brandl116aa622007-08-15 14:28:22 +000052
Raymond Hettingerf763a722010-09-07 00:38:15 +000053 Initialize the random number generator.
Georg Brandl116aa622007-08-15 14:28:22 +000054
Raymond Hettingerf763a722010-09-07 00:38:15 +000055 If *x* is omitted or ``None``, the current system time is used. If
56 randomness sources are provided by the operating system, they are used
57 instead of the system time (see the :func:`os.urandom` function for details
58 on availability).
Georg Brandl116aa622007-08-15 14:28:22 +000059
Raymond Hettingerf763a722010-09-07 00:38:15 +000060 If *x* is an int, it is used directly.
61
62 With version 2 (the default), a :class:`str`, :class:`bytes`, or :class:`bytearray`
Raymond Hettinger3149f9c2010-09-07 05:35:10 +000063 object gets converted to an :class:`int` and all of its bits are used. With version 1,
Raymond Hettingerf763a722010-09-07 00:38:15 +000064 the :func:`hash` of *x* is used instead.
65
66 .. versionchanged:: 3.2
67 Moved to the version 2 scheme which uses all of the bits in a string seed.
Georg Brandl116aa622007-08-15 14:28:22 +000068
69.. function:: getstate()
70
71 Return an object capturing the current internal state of the generator. This
72 object can be passed to :func:`setstate` to restore the state.
73
Georg Brandl116aa622007-08-15 14:28:22 +000074
75.. function:: setstate(state)
76
77 *state* should have been obtained from a previous call to :func:`getstate`, and
78 :func:`setstate` restores the internal state of the generator to what it was at
79 the time :func:`setstate` was called.
80
Georg Brandl116aa622007-08-15 14:28:22 +000081
Georg Brandl116aa622007-08-15 14:28:22 +000082.. function:: getrandbits(k)
83
Ezio Melotti0639d5a2009-12-19 23:26:38 +000084 Returns a Python integer with *k* random bits. This method is supplied with
Georg Brandl5c106642007-11-29 17:41:05 +000085 the MersenneTwister generator and some other generators may also provide it
Georg Brandl116aa622007-08-15 14:28:22 +000086 as an optional part of the API. When available, :meth:`getrandbits` enables
87 :meth:`randrange` to handle arbitrarily large ranges.
88
Georg Brandl116aa622007-08-15 14:28:22 +000089
90Functions for integers:
91
Georg Brandl116aa622007-08-15 14:28:22 +000092.. function:: randrange([start,] stop[, step])
93
94 Return a randomly selected element from ``range(start, stop, step)``. This is
95 equivalent to ``choice(range(start, stop, step))``, but doesn't actually build a
96 range object.
97
Raymond Hettinger05156612010-09-07 04:44:52 +000098 The positional argument pattern matches that of :func:`range`. Keyword arguments
99 should not be used because the function may use them in unexpected ways.
100
101 .. versionchanged:: 3.2
102 :meth:`randrange` is more sophisticated about producing equally distributed
103 values. Formerly it used a style like ``int(random()*n)`` which could produce
104 slightly uneven distributions.
Georg Brandl116aa622007-08-15 14:28:22 +0000105
106.. function:: randint(a, b)
107
Raymond Hettingerafd30452009-02-24 10:57:02 +0000108 Return a random integer *N* such that ``a <= N <= b``. Alias for
109 ``randrange(a, b+1)``.
Georg Brandl116aa622007-08-15 14:28:22 +0000110
Georg Brandl116aa622007-08-15 14:28:22 +0000111
Georg Brandl55ac8f02007-09-01 13:51:09 +0000112Functions for sequences:
Georg Brandl116aa622007-08-15 14:28:22 +0000113
114.. function:: choice(seq)
115
116 Return a random element from the non-empty sequence *seq*. If *seq* is empty,
117 raises :exc:`IndexError`.
118
119
120.. function:: shuffle(x[, random])
121
122 Shuffle the sequence *x* in place. The optional argument *random* is a
123 0-argument function returning a random float in [0.0, 1.0); by default, this is
124 the function :func:`random`.
125
126 Note that for even rather small ``len(x)``, the total number of permutations of
127 *x* is larger than the period of most random number generators; this implies
128 that most permutations of a long sequence can never be generated.
129
130
131.. function:: sample(population, k)
132
Raymond Hettinger1acde192008-01-14 01:00:53 +0000133 Return a *k* length list of unique elements chosen from the population sequence
134 or set. Used for random sampling without replacement.
Georg Brandl116aa622007-08-15 14:28:22 +0000135
Georg Brandl116aa622007-08-15 14:28:22 +0000136 Returns a new list containing elements from the population while leaving the
137 original population unchanged. The resulting list is in selection order so that
138 all sub-slices will also be valid random samples. This allows raffle winners
139 (the sample) to be partitioned into grand prize and second place winners (the
140 subslices).
141
Guido van Rossum2cc30da2007-11-02 23:46:40 +0000142 Members of the population need not be :term:`hashable` or unique. If the population
Georg Brandl116aa622007-08-15 14:28:22 +0000143 contains repeats, then each occurrence is a possible selection in the sample.
144
145 To choose a sample from a range of integers, use an :func:`range` object as an
146 argument. This is especially fast and space efficient for sampling from a large
147 population: ``sample(range(10000000), 60)``.
148
149The following functions generate specific real-valued distributions. Function
150parameters are named after the corresponding variables in the distribution's
151equation, as used in common mathematical practice; most of these equations can
152be found in any statistics text.
153
154
155.. function:: random()
156
157 Return the next random floating point number in the range [0.0, 1.0).
158
159
160.. function:: uniform(a, b)
161
Benjamin Petersonb58dda72009-01-18 22:27:04 +0000162 Return a random floating point number *N* such that ``a <= N <= b`` for
163 ``a <= b`` and ``b <= N <= a`` for ``b < a``.
Georg Brandl116aa622007-08-15 14:28:22 +0000164
Raymond Hettingerbe40db02009-06-11 23:12:14 +0000165 The end-point value ``b`` may or may not be included in the range
166 depending on floating-point rounding in the equation ``a + (b-a) * random()``.
Benjamin Peterson35e8c462008-04-24 02:34:53 +0000167
Christian Heimesfe337bf2008-03-23 21:54:12 +0000168.. function:: triangular(low, high, mode)
169
Benjamin Petersonb58dda72009-01-18 22:27:04 +0000170 Return a random floating point number *N* such that ``low <= N <= high`` and
Christian Heimescc47b052008-03-25 14:56:36 +0000171 with the specified *mode* between those bounds. The *low* and *high* bounds
172 default to zero and one. The *mode* argument defaults to the midpoint
173 between the bounds, giving a symmetric distribution.
Christian Heimesfe337bf2008-03-23 21:54:12 +0000174
Georg Brandl116aa622007-08-15 14:28:22 +0000175
176.. function:: betavariate(alpha, beta)
177
Benjamin Petersonb58dda72009-01-18 22:27:04 +0000178 Beta distribution. Conditions on the parameters are ``alpha > 0`` and
179 ``beta > 0``. Returned values range between 0 and 1.
Georg Brandl116aa622007-08-15 14:28:22 +0000180
181
182.. function:: expovariate(lambd)
183
Mark Dickinson2f947362009-01-07 17:54:07 +0000184 Exponential distribution. *lambd* is 1.0 divided by the desired
185 mean. It should be nonzero. (The parameter would be called
186 "lambda", but that is a reserved word in Python.) Returned values
187 range from 0 to positive infinity if *lambd* is positive, and from
188 negative infinity to 0 if *lambd* is negative.
Georg Brandl116aa622007-08-15 14:28:22 +0000189
190
191.. function:: gammavariate(alpha, beta)
192
Benjamin Petersonb58dda72009-01-18 22:27:04 +0000193 Gamma distribution. (*Not* the gamma function!) Conditions on the
194 parameters are ``alpha > 0`` and ``beta > 0``.
Georg Brandl116aa622007-08-15 14:28:22 +0000195
196
197.. function:: gauss(mu, sigma)
198
Benjamin Petersonb58dda72009-01-18 22:27:04 +0000199 Gaussian distribution. *mu* is the mean, and *sigma* is the standard
200 deviation. This is slightly faster than the :func:`normalvariate` function
201 defined below.
Georg Brandl116aa622007-08-15 14:28:22 +0000202
203
204.. function:: lognormvariate(mu, sigma)
205
206 Log normal distribution. If you take the natural logarithm of this
207 distribution, you'll get a normal distribution with mean *mu* and standard
208 deviation *sigma*. *mu* can have any value, and *sigma* must be greater than
209 zero.
210
211
212.. function:: normalvariate(mu, sigma)
213
214 Normal distribution. *mu* is the mean, and *sigma* is the standard deviation.
215
216
217.. function:: vonmisesvariate(mu, kappa)
218
219 *mu* is the mean angle, expressed in radians between 0 and 2\*\ *pi*, and *kappa*
220 is the concentration parameter, which must be greater than or equal to zero. If
221 *kappa* is equal to zero, this distribution reduces to a uniform random angle
222 over the range 0 to 2\*\ *pi*.
223
224
225.. function:: paretovariate(alpha)
226
227 Pareto distribution. *alpha* is the shape parameter.
228
229
230.. function:: weibullvariate(alpha, beta)
231
232 Weibull distribution. *alpha* is the scale parameter and *beta* is the shape
233 parameter.
234
235
Raymond Hettinger3cdf8712010-12-02 05:35:35 +0000236Alternative Generator:
Georg Brandl116aa622007-08-15 14:28:22 +0000237
Georg Brandl116aa622007-08-15 14:28:22 +0000238.. class:: SystemRandom([seed])
239
240 Class that uses the :func:`os.urandom` function for generating random numbers
241 from sources provided by the operating system. Not available on all systems.
Raymond Hettinger3cdf8712010-12-02 05:35:35 +0000242 Does not rely on software state, and sequences are not reproducible. Accordingly,
Raymond Hettingerafd30452009-02-24 10:57:02 +0000243 the :meth:`seed` method has no effect and is ignored.
Georg Brandl116aa622007-08-15 14:28:22 +0000244 The :meth:`getstate` and :meth:`setstate` methods raise
245 :exc:`NotImplementedError` if called.
246
Georg Brandl116aa622007-08-15 14:28:22 +0000247
Georg Brandl116aa622007-08-15 14:28:22 +0000248.. seealso::
249
250 M. Matsumoto and T. Nishimura, "Mersenne Twister: A 623-dimensionally
251 equidistributed uniform pseudorandom number generator", ACM Transactions on
252 Modeling and Computer Simulation Vol. 8, No. 1, January pp.3-30 1998.
253
Georg Brandl116aa622007-08-15 14:28:22 +0000254
Raymond Hettinger1fd32a62009-04-01 20:52:13 +0000255 `Complementary-Multiply-with-Carry recipe
Raymond Hettinger9743fd02009-04-03 05:47:33 +0000256 <http://code.activestate.com/recipes/576707/>`_ for a compatible alternative
257 random number generator with a long period and comparatively simple update
Raymond Hettinger1fd32a62009-04-01 20:52:13 +0000258 operations.
Raymond Hettinger435cb0f2010-09-06 23:36:31 +0000259
Raymond Hettinger3cdf8712010-12-02 05:35:35 +0000260
Raymond Hettinger435cb0f2010-09-06 23:36:31 +0000261Notes on Reproducibility
262========================
263
264Sometimes it is useful to be able to reproduce the sequences given by a pseudo
265random number generator. By re-using a seed value, the same sequence should be
266reproducible from run to run as long as multiple threads are not running.
267
268Most of the random module's algorithms and seeding functions are subject to
269change across Python versions, but two aspects are guaranteed not to change:
270
271* If a new seeding method is added, then a backward compatible seeder will be
272 offered.
273
274* The generator's :meth:`random` method will continue to produce the same
275 sequence when the compatible seeder is given the same seed.
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000276
Raymond Hettinger6e353942010-12-04 23:42:12 +0000277.. _random-examples:
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000278
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000279Examples and Recipes
280====================
281
Raymond Hettinger3cdf8712010-12-02 05:35:35 +0000282Basic usage::
283
284 >>> random.random() # Random float x, 0.0 <= x < 1.0
285 0.37444887175646646
286
287 >>> random.uniform(1, 10) # Random float x, 1.0 <= x < 10.0
288 1.1800146073117523
289
290 >>> random.randrange(10) # Integer from 0 to 9
291 7
292
293 >>> random.randrange(0, 101, 2) # Even integer from 0 to 100
294 26
295
296 >>> random.choice('abcdefghij') # Single random element
297 'c'
298
299 >>> items = [1, 2, 3, 4, 5, 6, 7]
300 >>> random.shuffle(items)
301 >>> items
302 [7, 3, 2, 5, 6, 4, 1]
303
304 >>> random.sample([1, 2, 3, 4, 5], 3) # Three samples without replacement
305 [4, 1, 5]
306
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000307A common task is to make a :func:`random.choice` with weighted probababilites.
308
309If the weights are small integer ratios, a simple technique is to build a sample
310population with repeats::
311
312 >>> weighted_choices = [('Red', 3), ('Blue', 2), ('Yellow', 1), ('Green', 4)]
313 >>> population = [val for val, cnt in weighted_choices for i in range(cnt)]
314 >>> random.choice(population)
315 'Green'
316
Raymond Hettinger3cdf8712010-12-02 05:35:35 +0000317A more general approach is to arrange the weights in a cumulative distribution
318with :func:`itertools.accumulate`, and then locate the random value with
319:func:`bisect.bisect`::
Raymond Hettinger2fdc7b12010-12-02 02:41:33 +0000320
321 >>> choices, weights = zip(*weighted_choices)
322 >>> cumdist = list(itertools.accumulate(weights))
323 >>> x = random.random() * cumdist[-1]
324 >>> choices[bisect.bisect(cumdist, x)]
325 'Blue'