-
What is the difference between a parameter and a statistic?
A parameter is associated with the population, a statistic with a sample.
-
Give an example where using stratified random samples is appropriate.
A company has 35 male and 50 female employees. They need an SRS of size
10 to poll their employees on a newly proposed work project and would like
the sample to match their employment ratios (of male to female) since the
proposal may be viewed differently by the different sexes.
-
Name two types of sampling methods that can give unreliable results.
Volunteer or self-response sampling and convenience sampling (as well
as others).
-
Researchers want to compare how well Miracle Whip Salad Dressing sells
in two different locations in grocery stores: on a display in the front
of the store (nearest the cash registers) and on a display at the back
of the store (away from the cash registers). The 30 Super Valu stores in
Iowa will be randomly assigned to two groups (15 stores each); those which
sell Miracle Whip on a front display and those which sell Miracle
Whip on a back display. No advertisements are used for any of the
stores, but Miracle Whip is sold at the same slightly reduced price in
each Super Valu store for a week. The total number of jars sold from front
displays and back displays will be compared.
-
Is this a study or an experiment? Explain what the difference is.
This is an experiment, actual treatments are being used (putting the
items on displays). In a study, you simply observe, in an experiment, you
subject participants to treatments and measure their responses.
-
What are the response and explanatory variables?
Response = number of jars sold. Explanatory = location (front or back
display).
-
Assume that the 30 stores have been labeled with the numbers from 1 to
30. Use your randInt command to select the two groups (front
display and back display). Explain how you used your randInt
commands to determine these groups.
Use randInt(1,30,15) to select the 15 stores for front
displays. Eliminate any duplicates with a single call to randInt(1,30).
-
In comparing the total sales (number of jars sold) from the front-display
group and the back-display group, a potential problem arises. The Miracle
Whip jars sold may not have come from the displays, but perhaps were picked
up from the normal shelf locations of Miracle Whip in each store. If a
given store sells an above average number of jars of Miracle Whip (from
the normal shelf location), then this store will have a false positive
influence on the group to which it belongs (making either front or back
displays look better). Explain how our design handles this potential problem.
Randomization should divide the "above average" stores evenly into the
2 groups.
-
The total number of jars sold from front and back-display stores for this
week are given in the table below. Calculate the mean number of jars sold
for each group and the standard deviation (of the number of jars sold)
for each group.
Front: mean = 85.67 std dev = 8.37
Back: mean = 77.87 std dev = 10.14
Number of Jars Sold
| Front |
95 |
87 |
72 |
90 |
105 |
84 |
78 |
78 |
77 |
85 |
92 |
90 |
84 |
79 |
89 |
| Back |
70 |
81 |
66 |
93 |
75 |
73 |
87 |
76 |
61 |
80 |
69 |
96 |
79 |
72 |
90 |
-
Past market research indicates that each one of these stores would sell
X jars of Miracle Whip in a week (at this price, from the normal shelf
locations), where X is normally distributed with a mean of 75.2 jars and
a standard deviation of 9.0 jars. Both groups did better than average (most
likely due to the placing of Miracle Whip on a display), but the front-display
stores out-performed both the back-display stores and the usual shelf sales,
by quite a bit. Assuming that the front-display group is a sample of size
15 from a normal distribution with mean 75.2 and standard deviation of
9.0 jars (for a single store), calculate the probability that a sample
of size 15 has an x-bar value greater than or equal to that of the 15 stores
that sold Miracle Whip on a front display.
normalcdf(85.67, 9999, 75.2, 9/sqrt(15) ) = 0.000003313
-
Based on your answer to the last part, do you think selling Miracle Whip
on a front display increases sales or not?
The probability is tiny. I would say that front displays definitely
improve sales over regular shelf locations.
-
Now assume that weekly sales of Miracle Whip (in a single store) have a
mean equal to the mean from the back-display group, and a standard deviation
equal to the standard deviation of the back-display sample. How likely
is it that a sample of size 15 from this distribution has an x-bar value
greater than or equal to that of the front-display group? Do you think
front-displays sell Miracle Whip better than back-displays, or is the difference
just due to chance?
normalcdf(85.67, 9999, 77.87, 10.14/sqrt(15) ) = 0.00144. Again,
the probability that the front display sales are just an "above average"
group from the same distribution as the back display sales is tiny. It
appears as if front displays definitiely out-sell back displays.
-
The Minnesota Twins committed 0 to 4 errors (bumbles or bad throws) in
each of their games in the 1999 season. The probability distribution for
X = number of errors per game is given in the table below.
The above setting describes a binomial random variable X.
-
What is a success?
A cookie with no chips.
-
What are the values of n and p?
n = 12, p = 0.14.
-
What is the probability that you get 2 cookies with no chips?
binompdf(12,.14,2) = 0.2863.
-
What is the expected number (or mean number) of cookies you receive without
chips?
n*p = 12*.14 = 1.68.
-
What is the standard deviation of the number of cookies in a dozen that
have no chips?
sqrt(n*p*(1-p) ) = 1.202.
-
What is the probability that all 12 of your cookies have chips?
This is P( X = 0) = binompdf(12,.14,0) = 0.164.
-
Twenty percent of all M&Ms are red. If you select a random sample of
n
M&Ms and let X = the number that are red, then X is a binomial random
variable with parameters n and p = 0.2.
-
In a sample of size 25, what is the exact probability that X = 4?
binompdf(25, .2, 4) = 0.1867.
-
In a sample of size 25, what is the exact probability that X < 3?
binomcdf(25, .2, 2) = 0.0982.
-
Using the normal approximation to the binomial, what is the approximate
probability that a random sample of 10,000 M&Ms has fewer than 1900
red M&Ms?
normalcdf(-9999, 1900, 10000*.2, sqrt(10000*.2*.8)) = 0.0062.
normalcdf(-9999, 1899, 10000*.2, sqrt(10000*.2*.8)) = 0.0058
(either answer is fine).
-
Assume that the height of six-year old girls is normally distributed with
a mean of 46 inches and a standard deviation of 2.17 inches.
-
How likely is it that a randomly selected six-year old girl is more than
4 feet tall?
normalcdf(48,9999,46,2.17) = 0.178.
-
How likely is it that the average height of 6 randomly selected six-year
old girls is more than 4 feet?
normalcdf(48,9999,46, 2.17 / sqrt(6) ) = 0.012