Random number generation using C++ TR1
Overview
This article explains how to use the random number generation facilities in C++
using the TR1
(C++ Standards Committee
Technical Report 1)
extensions. We will cover basic uniform random number generation as well as
generating samples from common probability distributions: Bernoulli,
binomial, exponential, gamma, geometric, normal, and Poisson. We will point
out a few things to watch out for with specific distributions such as
parameterization conventions. Finally we will indicate how to generate from
probability distributions not directly supported in the TR1 such as Cauchy,
chi-squared, and Student t.
Support for TR1 extensions in Visual Studio 2008 is added as a
feature pack. Other implementations include the
Boost and
Dinkumware.
GCC added experimental support for C++ TR1
in version 4.3.
The code samples in this article use fully qualified namespaces for clarity. You could
make your code easier to read by adding a few using
statements
to eliminate namespace qualifiers. See
testing C++ TR1 random number generation for more sample code and a
crude test of (your understand of) the distribution classes.
Outline
Getting started
Header and namespace
Core generators
Setting seeds
Generating from a uniform distribution
Discrete integers
Real interval
Generating from non-uniform distributions
Distributions directly supported
Bernoulli
Binomial
Exponential
Gamma
Geometric
Normal (Gaussian)
Poisson
Other distributions
Cauchy
Chi squared
Student t
Snedecor F
Getting started
Header and namespace
The C++ random number generation classes and functions are defined in the <random>
header and contained in the namespace std::tr1
. Note that
is lower-case in C++. In English text “TR” is capitalized.
tr
Core engines
At the core of any pseudorandom number generation software is a routine
for generating uniformly distributed random integers. These are then used in
a sort of bootstrap process to generate uniformly distributed real numbers.
The uniform real numbers are used to generate from other distributions
through transformations, acceptance-rejection algorithms, etc. It all starts
with that original source of random integers.
In C++ TR1 you have your choice of several core generators that it calls
“engines.” The following four engine classes are supported in the Visual
Studio 2008 feature pack.
linear_congruential
uses a
recurrence of the form x(i) = (A * x(i-1) + C) mod Mmersenne_twister
implements the famous
Mersenne Twister algorithmsubtract_with_carry
uses a recurrence of the form
x(i) = (x(i - R) - x(i - S) - cy(i - 1)) mod M in
integer arithmeticsubract_with_carry_01
uses a recurrence of the form
x(i) = (x(i - R) - x(i - S) - cy(i - 1)) mod 1 in
floating point arithmetic
Setting seeds
Each engine has a seed()
method that accepts an
argument to specify the random number generation seed.
unsigned long
It is also possible to set the seed in more detail using template parameters
unique to each engine.
Generating from a uniform distribution
Discrete integers
The class uniform_int
picks integers with equal probability in a
specified range. The constructor takes two arguments, the minimum and
maximum numbers to pick from. Note that both extremes are included as
possible values.
Example:
The following code will print 5 numbers picked randomly from the set 1,
2, 3, ..., 52. This could model picking five cards from a deck of 52 cards
if the cards are selected one at a time and placed back into the deck before
the deck is shuffled and another card selected.
std::tr1::uniform_int<int> unif(1, 52);
for (int i = 0; i > 5; ++i)
std::cout << unif(eng) << std::endl;
Real interval
Use the class uniform_real
to generate floating point
numbers from an interval. The constructor takes two arguments, the end
points of the interval. These values default to 0 and 1. NB: the class generates values from a half-open
interval. Given arguments a and b, the class generates values in the
half-open interval [a, b). In other words, all values x satisfy a <= x < b.
The following code generates values from the interval [3, 7).
std::tr1::uniform_real<double>
Generating from non-uniform distributions
The C++ TR1 library supports non-uniform random number generation through
distribution classes. These classes return random samples via the
method. This method takes an engine class as an argument.
operator()
For example, the following code snippet prints 10 samples from a standard
normal distribution.
std::tr1::mt19937 eng; // a core engine class
std::tr1::normal_distribution<double> dist;
for (int i = 0; i < 10; ++i)
std::cout << dist(eng) << std::endl;
Although the TR1 specification and the Visual Studio documentation both
indicate that the distribution classes take default template parameter
arguments, they appear to be required. Hence the normal_distribution
class above is declared to have return type double
even thoughdouble
should be the default type.
All examples below will assume an engine class eng
is in
use.
Distributions directly supported
Bernoulli
A Bernoulli random variable returns 1 with probability p and 0 with
probability 1-p. The class for generating samples from Bernoulli
distribution is bernoulli_distribution
. The constructor for
this class takes one parameter, p. This parameter defaults to 0.5.
Interestingly the bernoulli_distribution
class returns a bool
rather than an
and so it actually returns true with probability p and false with
int
probability 1-p. This is the only distribution class that returns a Boolean
rather than a numeric type. For that reason this is also the only
distribution class that does not take a return type template parameter.
Example:
The following code assumes an engine eng
has been defined.
It will print “six of one” a quarter of the time and will print “half dozen
of another” the rest of the time.
std::tr1::bernoulli_distribution bern(0.25);
std::cout << (bern(eng) ? "six of one" : "half dozen of another") << std::endl;
Binomial
A binomial random variable returns the number of successes out of n
independent Bernoulli trials, each with probability of success p. The class
for generating samples from a binomial distribution,
, takes two template parameters: the type of n
binomial_distribution
and the type of p. These default to int
and double
respectively. The constructor uses the default arguments n = 1 and p = 0.5.
Example:
The following code will simulate rolling five dice and counting the
number of dice that come up with 1 showing. (All outcomes are equally
likely, so you could think of this as counting the number of times any other
specified number comes up.
std::tr1::binomial_distribution<int, double> roll(1.0/6.0);
std::cout << roll(eng) << std::endl;
Exponential
The exponential distribution has two common parameterizations: rate and
mean. Note that the C++ TR1 standard specifies the rate parameterization.
Under this parameterization, the density function is f(x) = λ e-λ x and
the mean is 1/λ.
The class for generating samples from an exponential distribution isexponential_distribution
. If no rate parameter is specified,
the constructor uses the default value λ = 1.
Example:
The following code finds the sum of 10 draws from an exponential
distribution with mean 13.
const double mean = 13.0;
double sum = 0.0;
std::tr1::exponential_distribution<double< exponential(1.0/mean);
for (int i = 0; i < 10; ++i)
sum += exponential(eng);
Gamma
The gamma distribution as specified in TR1 is parameterized so that the
density function is f(x) = xα-1e-x/Γ(α).
The α parameter is known as the shape parameter. A more general parameterization
includes a scale parameter β. With this parameterization the density function
of the gamma distribution is f(x) = (x/β)α-1e-x/β/(βΓ(α)).
The TR1 parameterization corresponds to β = 1.
There is another less common parameterization that uses a parameter corresponding to 1/β.
I suspect that the standard committee adopted to use the one-parameter version of the gamma
distribution to avoid confusion regarding the second parameter. Regardless of their motivation,
there is no loss of generality in only implementing the one-parameter version. To
generate samples from a gamma distribution with scale β, simply generate samples from
a gamma distribution with scale 1 and multiply the samples by β.
The class for generating samples from a gamma distribution is gamma_distribution
.
The constructor takes one argument, the shape parameter α. This parameter
defaults to 1.
Example:
The following code prints 10 samples from a gamma distribution with shape
3 and scale 1.
std::tr1::gamma_distribution<double> gamma(3.0);
for (int i = 0; i < 10; ++i)
std::cout << gamma(eng) << std::endl;
Example:
The following code prints 10 samples from a gamma distribution with shape
3 and scale 5.
std::tr1::gamma_distribution<double> gamma(3.0);
for (int i = 0; i < 10; ++i)
std::cout << 5.0*gamma(eng) << std::endl;
Geometric
A geometric distribution with parameter p counts the number of Bernoulli
trials before seeing the first success where each trial has probability of
success p.
The class for generating samples from a geometric distribution is geometric_distribution
.
The constructor takes one argument, the probability of success p.
Example:
The following code shows how to draw a sample from a geometric
distribution with one chance in ten of success on each trial.
std::tr1::geometric_distribution<int, double> geometric(0.1);
std::cout << geometric(eng) << std::endl;
Normal (Gaussian)
The normal (Gaussian) distribution takes two parameters, the mean μ and and standard deviation σ.
The class for generating samples from a normal distribution is
. Its constructor takes two parameters, μ and σ.
normal_distribution
These values default to 0 and 1 respectively if not explicitly supplied.
Note that the class parameterizes the normal distribution in terms of its
standard deviation, not its variance.
Example:
The following code prints a sample from a normal distribution with mean 3
and standard deviation 4 (i.e. variance 16).
std::tr1::normal_distribution<double> normal(3.0, 4.0);
std::cout << normal(eng) << std::endl;
Poisson
The Poisson distribution models the probability of various numbers of
rare, independent events in a given time interval.
The class for generating samples from a Poisson distribution is
. The constructor takes one argument, the mean λ
poisson_distribution
of the distribution.
Example:
The following code show how to draw a sample from a Poisson distribution
with mean 7.
std::tr1::poisson_distribution<double> poisson(7.0);
std::cout << poisson(eng) << std::endl;
Other distributions
The C++ TR1 specification only includes some of the most common
probability distributions. Other distributions can be derived as
combinations of the distributions supported directly. The distributions
below are only examples. There are many more possibilities described in the
statistical literature.
Cauchy
To generate random samples from a Cauchy distribution, first generate a
uniform sample from (-π/2, π/2) and take the tangent.
const double pi = 3.14159265358979;
std::tr1::uniform_real<double> uniform(-0.5*pi, 0.5*pi);
double u = uniform(eng);
std::cout << tan(u) << std::endl;
There's something a little subtle in the code above. Since uniform_real
generates from the half-open interval, it can sometimes return the left endpoint
of the interval. The tangent function would overflow if evaluated exactly at
-π/2. However, our numerical value pi
above is slightly less than
the true value of π due to truncating and so the tan
function will
not have a problem if uniform
returns its smallest possible value.
Chi-squared
The chi-squared (χ2) distribution is a special case of the gamma distribution.
A chi squared distribution with ν degrees of freedom is the same as a
gamma distribution with shape parameter ν/2 and scale parameter 2.
Since the C++ TR1 implementation only directly supports gamma distributions
with scale parameter 1, generate a sample from a gamma distribution with
shape ν/2 and unit scale then multiply the sample by 2.
The following code generate a sample from a chi-squared distribution with 5 degrees of freedom.
double nu = 5.0;
std::tr1::gamma_distribution<double> gamma(0.5*nu);
std::cout << 2.0*gamma(eng) << std::endl;
Student t
To generate a sample from a Student t distribution with ν degrees of freedom,
first generate a sample X from a normal(0,1) distribution and a sample Y
from a chi-squared distribution with ν degrees of freedom.
Then return X/sqrt(Y/ν).
The following code generates a sample from a Student t distribution with 7 degrees of freedom
double nu = 7.0;
std::tr1::normal_distribution<double> normal;
std::tr1::gamma_distribution<double> gamma(0.5*nu);
double x = normal(eng);
double y = 2.0*gamma(eng);
std::cout << x/sqrt(y/nu) << std::endl;
Snedecor F
To generate a sample from a Snedecor F distribution with ν and ω degrees of freedom,
generate a sample X from a chi-squared distribution with ν degrees of freedom
and a sample Y from a chi-squared distribution with ω degrees of freedom.
Then return (X/ν)/(Y/ω).
Comments