LAB SESSION 8

LAB SESSION 9

ESTIMATION AND HYPOTHESIS TESTING

INTRODUCTION: Two indispensable statistical decision-making tools for a single parameter are (i)confidence intervals, and (ii) hypothesis tests to investigate theories about parameters. In this lab you will learn how to calculate confidence intervals and perform hypothesis tests (assuming we know sigma) using Excel.

CONFIDENCE INTERVALS

As an introduction, let’s follow Illustration 8.3 in your text.

Begin a new worksheet and generate 40 random integers the range 0 to 9 in column A. You can use either the Random Numbers Table (Table 1) or the Random Number Generation tool of Excel and then use the INT (integer) function to transform to integers in the range 0 to 9:

Choose: Tools > Data Analysis Plus > Random Number Generation > OK

Enter: Number of Variables: 1

Number of Random Numbers: 40

Select: Distribution: Uniform

Enter: Parameters, Between 0 and 9

Output Range: B1 > OK

Enter: in cell A1: =INT(B1)

Click and drag: lower right corner to cell A40

To see the mean, standard deviation and maximum and minimum values for the data set use:

Select: Tools > Data Analysis > Descriptive Statistics > OK

enter input and output range as appropriate, and select Summary Statistics

Column1

Mean	3.9
Standard Error	0.446927
Median	3
Mode	3
Standard Deviation	2.826613
Sample Variance	7.989744
Range	8
Minimum	0
Maximum	8
Sum	156
Count	40

(Your results may be slightly different, since we are using random data.)

Find the 90% confidence interval for the mean of these values:

Choose: Tools > Data Analysis Plus > Z-Estimate: Mean > OK

Enter: Input Range: A1:A40

Standard Deviation (SIGMA): 2.87 > OK

Alpha : .10 > OK

z-Estimate: Mean

		Column 1
Mean		4.9412
Standard Deviation		2.6094
Observations		17
SIGMA		2.87
LCL		3.796232
UCL		6.086121

So the 90% confidence interval for the mean is 3.80 to 6.09.

Find the 95% and 99% confidence intervals for the mean of this same set of data and record the results.

Looking at these three intervals

1. Consider the means obtained from 100 samples of size 40. If these means were used to construct 100 confidence intervals, determine the expected number of times the population mean would be included in one of these intervals.

2. In the 99% confidence interval that you found, the level of significance is 99%. What is the value of a ? What does a represent?

3. In which of these intervals is the maximum error, E, the smallest? What does this mean? In which of these intervals are you being more certain to include the population mean?

HYPOTHESIS TESTING

A standard final examination in an elementary statistics course is designed to produce a mean score of 75 and a standard deviation of 12. The hypothesis you will try to verify is: "This particular statistics class is above average." At the .05 level of significance, test the claim that the following sample scores reflect an above-average class (assuming sigma = 12):

79 79 78 74 82 89 74 75 78 73

74 84 82 66 84 82 82 71 72 83

Enter the data and get a preliminary graphical analysis.

Column1

Mean	78.05
Standard Error	1.251263
Median	78.5
Mode	82
Standard Deviation	5.595816
Sample Variance	31.31316
Range	23
Minimum	66
Maximum	89
Sum	1561
Count	20

Test the hypothesis, "The mean test grade for this class is greater than 75."

Choose: Tools > Data Analysis Plus > Z-Test: Mean > OK

Enter: Input Range: A1:A20 or select cells > OK

Hypothesized mean: 75

Standard Deviation (SIGMA): 12 > OK

Alpha: .05 > OK

The results are as follows:

Z-Test: Mean

			Column 1
Mean			78.05
Standard Deviation			5.5958
Observations			20
Hypothesized Mean			75
SIGMA			12
z Stat			1.1367
P(Z<=z) one-tail			0.1278
z Critical one-tail			1.6449
P(Z<=z) two-tail			0.2556
z Critical two-tail			1.96

Note that the p-values and critical values for both one-tail and two-tail tests are given.

Questions:

1. What are the formal null and alternative hypotheses?

2. What is the value of the test statistic, and what is your decision? Is the mean of this class above “average”?

ASSIGNMENT: Do Exercises 8.30, and 8.129 in your text, and the following two problems.

1. In one region of a city, a random survey of households includes a question about the number of people in the household. The results are given in the accompanying frequency table. Construct the 90% confidence interval for the mean size of all such households. Assume that the sample standard deviation can be used as an estimate of the population standard deviation.

Household size 1 2 3 4 5 6 7

Frequency 15 20 37 23 14 4 2

2. An aeronautical research team collects data on the stall speeds (in knots) of ultralight aircraft. The results are summarized in the accompanying stem-and-leaf plot. Construct the 95% confidence interval for the mean stall speed of all such aircraft. Assume sigma = 1.

MTB > Stem-and-Leaf c1.

Stem-and-leaf of C1 N = 16

Leaf Unit = 0.10

21. | 7 8

22. | 3 4 4 6

23. | 2 2 5 8 9 9

24. | 0 1 3

25. | 2