The goal of this activity is to explain the reasoning behind significance
tests (also called hypothesis tests).
To help explain this reasoning, consider the following scenario:
A trusted friend offers you an investment where you give them $1000 and at the end of each month, your friend pays you your earnings in cash (your $1000 remains in the investment). After each 3-month period, you can continue the investment for another 3 months, or withdrawal your $1000. The monthly earnings are claimed to vary in a normal fashion with a mean return of $15.00 per month and a standard deviation of $4.00 per month. Recognizing that your average annual return will be $180 ($15 * 12 = $180 per year, for an average profit of 18% per year), you consider this a good deal and decide to invest $1000.
Your first three monthly returns, $13.39, $13.29 and $13.64 (so
= $13.44) are all noticably below average.
In accordance with the scientific method outlined above, we have our
original hypothesis, namely that X = your monthly earnings are normally
distributed with a mean of 15 and a standard deviation of 4 (briefly we
say that X is N(15,4) ). We also have a single value of
,
from a sample of size n = 3, namely
= 13.44.
The question we're interested in answering is
We need to figure out how likely values like
=
13.44 are, when X is N(15,4). The problem is that since X (and hence
)
is normal, the probability that
=13.44
is zero (as is the probability that
equals any other single value). We can only check probabilities of
associated with intervals. We need an interval for our probability calculation,
but we only have a single point from which to determine this interval.
There are only two reasonable intervals that we could consider, namely
< 13.44 and
> 13.44. These two intervals have probabilities that sum to 1 (so they
are essentially equivalent, if you know one, the other is easy to find).
Since we think our
value is low, let's calculate P(
<13.44).
That is, that probability that we get an
value equal to ours, or smaller.
Let's summarize our results and define some terminology to help describe
this situation. If we assume that our friend's hypothesis that monthly
returns are distributed in a normal fashion with m
= $15 per month and s = $4 per month are true,
then 3-month returns averaging $13.44 (or less) should happen one out of
every four 3-month periods. We are testing the original assumption
(m = 15) against the alternative that
m
< 15. Because of our alternative, we calculated the probability that
< 13.44, which is called a left-tailed test (we calculate the
area under the density curve to the left of our observed value of
).
We call the probability we calculated, namely P(
< 13.44) the p-value of our test. If our original assumption
is true, then values of
like ours (or lower), will occur roughly once in every four trials. This
means that we have very little, or no evidence that our original assumption
is false (our value is quite typical if we assume that
m
= 15). Based on this, we decide to leave our $1000 invested for another
3-month period.
After 6 months in the investment scheme, our monthly returns have averaged $12.91 (note, n = 6 now). Again, our returns are well below the stated value of $15 per month, but a 6-month average of $12.91 might be just due to the usual variation based on chance.
Your friend is a good one, and you'd like to be quite sure that m
< 15, before you pull your money out of the investment. The p-value
you just calculated isn't that small. It says that there is some evidence
against the assumption that m = 15, but not
a huge amount (
values like ours, or worse, don't happen all that often, but they are not
that rare either). Suppose you wanted to be 95% confident that m
< 15, before you pulled your money out of the investment. This translates
to saying that you will pull your money out, if your
value is in the lowest 5% of all
values (equivalently if your p-value, or probability calculation
is less than 0.05). You want to find the value of
that has 5% of all values to its left (or 95% to the right). That
is, you are looking for the number C, so that P(
< C) = 0.05.