Expectation

Expected Value

Definition 1: If a discrete random variable x has frequency function f(x) then the expected value of the function g(x) is defined as

Expected value of a discrete function

The equivalent for a continuous random variable x is

Expected value of a continuous function

where f(x) is the probability density function. This is the total area between the curve of the function h(x) and the x-axis where h(x) = f(x)g(x).   For those of you familiar with calculus

Expected value using integrals

In order to avoid using calculus, we will restrict ourselves to the discrete case in the rest of this chapter, although all the results shown here for discrete random variables extend to continuous random variables. Click here for more details about how to extend the results presented here to continuous distributions.

Property 1: For any random variables x and y and constant c

  1. E[c] = c
  2. E[cg(x)] = cE[g(x)]
  3. E[g(x) + h(x)] = E[g(x)] + E[h(x)]
  4. E[xy] = E[x] ∙ E[y] if x and y are independent

Proof: (a) – (c) are simple consequences of Definition 1. (d) is a consequence of Property 2 of Discrete Distributions.

Population Mean and Variance

Definition 2: If a random variable x has probability density function f(x) then the (population) mean μ of f(x) is defined as

Mean as expected value

Here the function g(x) in Definition 1 is the identity function g(x) = x.

The (population) variance σ2 is defined as

Variance as expected value

Property 2: The variance can also be expressed as

image272

Proof: By Property 1,

image273

image274

Property 3: For any random variable x and constants a and b

image275

image276

Proof: The first assertion is a consequence of Property 1c, namely

image277

For the second assertion, by Properties 1 and 2, we have

image278

image279

image280

It follows from Property 3 that for any constant b, Mean(b) = b and Var(b) = 0.

Property 4: For random variables x and y

image283

If in addition, x and y are independent, then

image284

Proof: The first assertion follows from Property 1:

image285

For the second assertion, by Properties 1 and 2

image286

image287

image288

But by Property 1d, E[xy] – E[x]E[y] = 0 since x and y are independent, and so

image290

Standardization

Definition 3: For any random variable x with mean μ and standard deviation σ, the standardization z of x is defined by

Standardization

Property 5: The standardization of any random variable has a mean of 0 and a variance of 1.

Proof: Since
image5022

by Property 3 the mean of z is

image5020

and the variance of z is

Excel Function: Excel provides the following function for calculating the value of z from x, μ, and σ:

STANDARDIZE(x, μ, σ) = (x – μ) / σ

Real Statistics Function: The Real Statistics Resource Pack provides the following array function for the array or cell range R1.

XSTANDARDIZE(R1) = STANDARDIZE(R1,AVERAGE(R1),STDEV.S(R1))

Moments

Definition 4: The nth moment around the mean is defined as

nth moment around mean

Click here for more advanced information about moments and related subjects.

Observation: It follows from Definitions 2 and 4 that the variance can be expressed as

image303Skewness and Kurtosis

In Symmetry and Kurtosis we define the skew and kurtosis of a sample. We now define the population equivalents of these concepts as follows.

Definition 5: The (population) skewness is defined as

Skewness formula

The (population) kurtosis is defined as

Kurtosis formula

The 3 in the kurtosis definition is the value of μ4/σ4 for the normal distribution function (see Normal Distribution). Thus the kurtosis of the normal distribution function is 0.

Reference

Wikipedia (2012) Expected value
https://en.wikipedia.org/wiki/Expected_value

Penn State (2021) Mathematical expectation. Introduction to probability theory
https://online.stat.psu.edu/stat414/lesson/8

6 thoughts on “Expectation”

  1. For property 4, it implicitly state that x and y are independent through the proof process.Iit will be clearer if we can have the conditional statement (if only x and y are independent) for the variance addition.

    Reply
    • Sun Kim,
      Thanks for catching this error. As you have stated, the second assertion depends on x and y being independent, as is clear from the proof.
      I have now corrected the webpage. I appreciate your help in making the Real Statistics website clearer and more accurate.
      Charles

      Reply
  2. I was looking for the information on moments, the link given above is not working. Could please provide with the working link. Your explanation and derivation is very intuitive.

    Thanks Regards
    Harsha

    Reply
  3. Hi,
    a little error here
    Sigma/Sigma^2 = 1
    To correct, Signa^2/Sigma^2 = 1
    This web site is awesome. It’s really useful providing many excel add-in functions.
    Thanks, Sir

    Reply
    • Hi Mobb,
      I am very pleased that you like the website.
      I also appreciate your identifying the typo. I have now corrected the error on the referenced webpage. Thanks for catching it.
      Charles

      Reply

Leave a Comment