LAB SESSION 9

ESTIMATION AND HYPOTHESIS TESTING

 

INTRODUCTION: Two indispensable statistical decision-making tools for a single parameter are (i)confidence intervals, and (ii) hypothesis tests to investigate theories about parameters.  In this lab you will learn how to calculate confidence intervals and perform hypothesis tests (assuming we know sigma) using Excel.

 

CONFIDENCE INTERVALS

            As an introduction, let’s follow Illustration 8.3 in your text.

 

Begin a new worksheet and generate 40 random integers the range 0 to 9 in column A.  You can use either the Random Numbers Table (Table 1) or the Random Number Generation tool of Excel and then use the INT (integer) function to transform to integers in the range 0 to 9:

            Choose:  Tools > Data Analysis Plus > Random Number Generation > OK

            Enter:      Number of Variables: 1

                           Number of Random Numbers:  40

            Select:     Distribution:  Uniform

            Enter:      Parameters, Between   0   and   9 

                           Output Range:   B1    > OK

            Enter:     in cell A1:  =INT(B1)

            Click and drag: lower right corner to cell A40

 

To see the mean, standard deviation and maximum and minimum values for the data set use:

            Select:  Tools > Data Analysis > Descriptive Statistics > OK

            enter input and output range as appropriate, and select Summary Statistics

 

Column1

 

 

Mean

3.9

Standard Error

0.446927

Median

3

Mode

3

Standard Deviation

2.826613

Sample Variance

7.989744

Range

8

Minimum

0

Maximum

8

Sum

156

Count

40

 

(Your results may be slightly different, since we are using random data.)


Find the 90% confidence interval for the mean of these values:

            Choose:  Tools > Data Analysis Plus > Z-Estimate: Mean > OK

            Enter:      Input Range:  A1:A40

                           Standard Deviation (SIGMA): 2.87 > OK

                           Alpha :  .10 > OK

 

z-Estimate: Mean

 

 

 

 

 

 

Column 1

Mean

 

4.9412

Standard Deviation

2.6094

Observations

17

SIGMA

 

2.87

LCL

 

3.796232

UCL

 

6.086121

 

So the 90% confidence interval for the mean is 3.80 to 6.09.

 

Find the 95% and 99% confidence intervals for the mean of this same set of data and record the results.

 

 

 

Looking at these three intervals

1.   Consider the means obtained from 100 samples of size 40.  If these means were used to construct 100 confidence intervals, determine the expected number of times the population mean would be included in one of these intervals.

 

2.   In the 99% confidence interval that you found, the level of significance is  99%.  What is the value of a ?  What does a  represent?

 

3.   In which of these intervals is the maximum error, E, the smallest?  What does this mean?  In which of these intervals are you being more certain to include the population mean?

 

 

 

 


HYPOTHESIS TESTING

 

A standard final examination in an elementary statistics course is designed to produce a mean score of 75 and a standard deviation of 12.  The hypothesis you will try to verify is: "This particular statistics class is above average."  At the .05 level of significance, test the claim that the following sample scores reflect an above-average class (assuming sigma = 12):

 

79   79   78   74   82   89   74   75   78   73

74   84   82   66   84   82   82   71   72   83

 

Enter the data and get a preliminary graphical analysis.

 

 

Column1

 

 

Mean

78.05

Standard Error

1.251263

Median

78.5

Mode

82

Standard Deviation

5.595816

Sample Variance

31.31316

Range

23

Minimum

66

Maximum

89

Sum

1561

Count

20

 

 

 


Test the hypothesis, "The mean test grade for this class is greater than 75."

 

                  Choose:  Tools > Data Analysis Plus > Z-Test: Mean > OK

                  Enter:      Input Range:  A1:A20 or select cells > OK

                                  Hypothesized mean:  75

                                  Standard Deviation (SIGMA):  12  > OK

                                  Alpha:  .05 > OK

 

The results are as follows:

 

Z-Test: Mean

 

 

 

 

 

 

 

 

 

Column 1

Mean

 

 

78.05

Standard Deviation

 

5.5958

Observations

 

20

Hypothesized Mean

 

75

SIGMA

 

 

12

z Stat

 

 

1.1367

P(Z<=z) one-tail

 

0.1278

z Critical one-tail

 

1.6449

P(Z<=z) two-tail

 

0.2556

z Critical two-tail

 

1.96

 

Note that the p-values and critical values for both one-tail and two-tail tests are given.

 

Questions:

1.   What are the formal null and alternative hypotheses?

 

2.   What is the value of the test statistic, and what is your decision?  Is the mean of this class above “average”?

 


ASSIGNMENT: Do Exercises 8.30, and 8.129 in your text, and the following two problems.

 

1.   In one region of a city, a random survey of households includes a question about the number of people in the household.  The results are given in the accompanying frequency table.  Construct the 90% confidence interval for the mean size of all such households.  Assume that the sample standard deviation can be used as an estimate of the population standard deviation.

 

Household size                1      2      3        4      5     6     7            

Frequency                    15     20    37     23     14    4     2

 

2.   An aeronautical research team collects data on the stall speeds (in knots) of ultralight aircraft.  The results are summarized in the accompanying stem-and-leaf plot.  Construct the 95% confidence interval for the mean stall speed of all such aircraft.  Assume sigma = 1.

 

MTB > Stem-and-Leaf c1.

 

Stem-and-leaf of C1        N  = 16

Leaf Unit = 0.10

                     

                   21. | 7 8

       22. | 3 4 4 6

       23. | 2 2 5 8 9 9

       24. | 0 1 3

                   25. | 2