In
probability theory the
expected value (or
expectation value, or
mathematical expectation, or
mean) of a discrete
random variable is the sum of the probability of each possible outcome of the experiment multiplied by the outcome value (or
payoff). Thus, it represents the average amount one "expects" as the outcome of the random trial when identical odds are repeated many times. Note that the value itself may not be
expected in the general sense - the "expected value" itself may be unlikely or even impossible.
Examples
The expected value from the roll of an ordinary six-sided die is
operatorname{E}(X) = frac{1 + 2 + 3 + 4 + 5 + 6}{6} = 3.5
which is not among the possible outcomes.
A common application of expected value is to gambling. For example, an American roulette wheel has 38 places where the ball may land, all equally likely. A winning bet on a single number pays 35-to-1, meaning that the original stake is not lost, and 35 times that amount is won, so you receive 36 times what you've bet. Considering all 38 possible outcomes, the expected value of the profit resulting from a dollar bet on a single number is the sum of what you may lose times the odds of losing and what you will win times the odds of winning, that is,
operatorname{E}(X) = left(-$1 times frac{37}{38} right) + left($35 times frac{1}{38} right) = -$frac{1}{19} approx -$0.0526.
The change in your financial holdings is −$1 when you lose, and $35 when you win. Thus one may expect, on average, to lose about five cents for every dollar bet, and the expected value of a one-dollar bet is $0.9474. In gambling, an event of which the expected value equals the stake (of which the bettor's expected profit is zero) is called a "fair game."
Mathematical definition
In general, if is a random variable defined on a probability space , then the expected value of , denoted , or sometimes , or , is defined as
where the Lebesgue integral is employed. Note that not all random variables have an expected value, since the integral may not exist (e.g., Cauchy distribution). Two variables with the same probability distribution will have the same expected value, if it is defined.
If is a discrete random variable with probability mass function , then the expected value becomes
as in the gambling example mentioned above.
If the probability distribution of admits a probability density function , then the expected value can be computed as
It follows directly from the discrete case definition that if is a constant random variable, i.e. for some fixed real number , then the expected value of is also .
The expected value of an arbitrary function of X, g(X), with respect to the probability density function f(x) is given by the inner product of f and g:
Conventional terminology
- When one speaks of the "expected price", "expected height", etc. one means the expected value of a random variable that is a price, a height, etc.
- When one speaks of the "expected number of attempts needed to get one successful attempt," one might conservatively approximate it as the reciprocal of the probability of success for such an attempt. Cf. expected value of the geometric distribution.
Properties
Constants
The expected value of a constant is equal to the constant itself; i.e., if 'c' is a constant, then E(c) = c
Monotonicity
If
X and
Y are random variables so that
almost surely, then
.
Linearity
The expected value operator (or
expectation operator)
is
linear in the sense that
Combining the results from previous three equations, we can see that -
for any two random variables and (which need to be defined on the same probability space) and any real numbers and .
Iterated expectation
Iterated expectation for discrete random variables
For any two
discrete random variables
one may define the
conditional expectation:
which means that is a function on .
Then the expectation of satisfies
operatorname{E} left(operatorname{E}(X|Y) right)= sumlimits_y operatorname{E}(X|Y=y) cdot operatorname{P}(Y=y) ,
Hence, the following equation holds:
The right hand side of this equation is referred to as the iterated expectation and is also sometimes called the tower rule. This proposition is treated in law of total expectation.
Iterated expectation for continuous random variables
In the
continuous case, the results are completely analogous. The definition of conditional expectation would use inequalities, density functions, and integrals to replace equalities, mass functions, and summations, respectively. However, the main result still holds:
Inequality
If a random variable X is always less than or equal to another random variable Y, the expectation of X is less than or equal to that of Y:
If , then .
In particular, since and , the absolute value of expectation of a random variable is less than or equal to the expectation of its absolute value:
Representation
The following formula holds for any nonnegative real-valued random variable
X (such that
), and positive real number
:
In particular, for , this reduces to:
Non-multiplicativity
In general, the expected value operator is not multiplicative, i.e.
is not necessarily equal to
. If multiplicativity occurs, the
and
variables are said to be
uncorrelated (
independent variables are a notable case of uncorrelated variables). The lack of multiplicativity gives rise to study of
covariance and
correlation.
Functional non-invariance
In general, the expectation operator and
functions of random variables do not
commute; that is
A notable inequality concerning this topic is Jensen's inequality, involving expected values of convex (or concave) functions.
Uses and applications of the expected value
The expected values of the powers of
are called the
moments of
; the
moments about the mean of
are expected values of powers of
. The moments of some random variables can be used to specify their distributions, via their
moment generating functions.
To empirically estimate the expected value of a random variable, one repeatedly measures observations of the variable and computes the arithmetic mean of the results. If the expected value exists, this procedure estimates the true expected value in an unbiased manner and has the property of minimizing the sum of the squares of the residuals (the sum of the squared differences between the observations and the estimate). The law of large numbers demonstrates (under fairly mild conditions) that, as the size of the sample gets larger, the variance of this estimate gets smaller.
In classical mechanics, the center of mass is an analogous concept to expectation. For example, suppose is a discrete random variable with values and corresponding probabilities . Now consider a weightless rod on which are placed weights, at locations along the rod and having masses (whose sum is one). The point at which the rod balances is .
Expected values can also be used to compute the variance, by means of the computational formula for the variance
A very important application of the expectation value is in the field of quantum mechanics. The expectation value of a quantum mechanical operator operating on a quantum state vector is written as . The uncertainty in can be calculated using the formula
.
Expectation of matrices
If
is an
matrix, then the expected value of the matrix is defined as the matrix of expected values:
operatorname{E}(X)
=
operatorname{E}
begin{pmatrix}
x_{1,1} & x_{1,2} & cdots & x_{1,n}
x_{2,1} & x_{2,2} & cdots & x_{2,n}
vdots
x_{m,1} & x_{m,2} & cdots & x_{m,n}
end{pmatrix}
=
begin{pmatrix}
operatorname{E}(x_{1,1}) & operatorname{E}(x_{1,2}) & cdots & operatorname{E}(x_{1,n})
operatorname{E}(x_{2,1}) & operatorname{E}(x_{2,2}) & cdots & operatorname{E}(x_{2,n})
vdots
operatorname{E}(x_{m,1}) & operatorname{E}(x_{m,2}) & cdots & operatorname{E}(x_{m,n})
end{pmatrix}.
This is utilized in covariance matrices.
Computation
It is often useful to update a computed expected value as new data come in. This can be done as follows, where is the -th value, and we use the previous estimate to compute :
operatorname{E}_mathrm{new}(X) = [(count-1) cdot operatorname{E}_mathrm{prev}(X) + new_value]/count
Formula for non-negative integral values
When a random variable takes only values in
we can use the following formula
for computing its expectation:
operatorname{E}(X)=sumlimits_{i=1}^infty P(Xgeq i).
For example, suppose we toss a coin where the probability of heads is
. How many tosses can we expect until the first heads? Let
be this number. Note that we are counting only the tails and not the heads which ends the experiment; in particular, we can have
. The expectation of
may be computed by
. This is because the number of tosses is at least
exactly when the first
tosses yielded tails. This matches the
expectation of a random variable with an
Exponential distribution.
We used the formula for
Geometric progression:
See also
External links