Notes for Class 9 of Engineering Statistics

WSU STAT 360
Class Session 9 Summary and Notes. October 27, 2000

Note: There are no new notes for this class session, but please go check the Statistics advisor which I put together in response to class concerns over how to choose a probability distribution to solve any particular problem.

Change of Syllabus

We have decided to alter the class structure according to the following items

I will produce an alternate examination which I will place on this web page by Thursday October 21, 1999. It is a take home exam, obviously, and is due by the beginning of class on Friday October 29. You may use the grade on this examination to replace a lower grade on your first exam.
As I mentioned in class, the examination median score was only 17/50. I was disappointed, as I am sure many of you were, but a little effort will accomplish a lot of improvement I'm sure. We just don't give up.
As far as projects are concerned, I am going to ask for a progress report (verbal) from each or you at each class session. This will keep all of us moving forward, and help me detect any drifting away from our target values...speaking of which here is a detailed analysis of target values from the text

Problem 4.7 done in excruciating detail

What we are given about the problem is as follows.

Our target value for the process is 5.6.
We know, from historical data that the standard deviation of the process is 0.7.
We have just taken a sample of 3 measurements, and the mean of this sample is 5.8.

Part A asks us to form a hypothesis concerning the true mean of the process and test the hypothesis. I began this in class by stating that in hypothesis testing the recently sampled data takes center stage. We begin by forming two hypotheses. Vining expects us to use a two tailed test, but I'll examine both a two tailed test and a one tailed test, and contrast and compare the two.

Consider a two tailed test first. The nominal hypothesis is


H₀: The true mean=5.6

and the alternative is

H_a: The true mean is not 5.6

We shall assume that Y follows a normal distribution and use the standard normal deviate Z=(Y-m)/s/sqrt(3), the null hypothesis is then...


Z₀₂₅ < (Y-m)/s/sqrt(3) < Z₉₇₅

I plan to leave all hypothesis stated in terms of the standard normal deviate. You can translate to the deviate of the problem (Y), but it is simpler to use Z because our tables are for Z. If we substitute 5.8 for Y, and use the appropriate values of Z_x from the table on pages 456 and 457, then the null hypothesis becomes

-1.96 < 0.49 < 1.96

which is true and we thus accept the null hypothesis. In other words there is no evidence at the 5% significance level to believe that the true process mean is other than our target of 5.6

It seems, to me at least, that the two tailed test is a bit silly. We have just measured a sample and found a mean value of 5.8. We ought to be concerned that the true process mean is drifting upward, which means that we ought to apply a one-tailed test. I'll formulate the test hypotheses as follows...


H₀: The true mean=5.6

and the alternative is

H_a: The true mean is greater than 5.6

Once again we shall assume that Y follows a normal distribution and use the standard normal deviate Z=(Y-m)/s/sqrt(3), the null hypothesis is then...


-Infinity < (Y-m)/s/sqrt(3) < Z₉₅

I am only concerned with the upper tail of the distribution in this test because the possibility of the true process mean being less than the target is less consistent with the alternative. If we substitute 5.8 for Y, and use the appropriate values of Z_x from the table on pages 456 and 457. Then the null hypothesis is

-Infinity < 0.49 < 1.65

which is true and we thus accept the null hypothesis once again. In other words there is no evidence at the 5% significance level to believe that the true process mean is higher than our target of 5.6. However, notice that the one-tailed test has placed the rejection region closer to our sample value. In this sense it is a more conservative test. One final thought about this test is worthwhile. Figure 1, below, shows a sampling distribution centered on what we believe to be the current process mean, namely the target value of 5.6. The shaded region in the figure is the one-tailed rejection region and our hypothesis entails determining whether or not the sample mean has entered this region. If it has, we must conclude either that the nominal hypothesis is not true, and we reject it, or that a 1/20 chance occurence has lead us into making a Type I error.

Figure 1. The sampling distribution is centered on the target value and our rejection region for the null hypothesis lies beyond the 95% confidence interval.

Part B asks us to examine the confidence interval on the true mean of the process and make a judgement from it about the current process state. In using confidence intervals the likely value of process mean takes center stage. We begin by forming a symmetric, or two-tailed, confidence interval for the true process mean based on our recent sample mean.

Once again we shall assume that Y follows a normal distribution and use the standard normal deviate Z=(Y-m)/s/sqrt(3), the confidence interval is then...


Z₀₂₅ < (Y-m)/s/sqrt(3) < Z₉₇₅

Instead of leaving this in terms of standard normal deviate I'll transform back to physical units

Y-Z₀₂₅s/sqrt(3) > m > Y-Z₉₇₅s/sqrt(3)

If we substitute 5.8 for Y, and use the appropriate values of Z_x from the table on pages 456 and 457, then the confidence interval on the process mean becomes

6.59 > 5.6 > 5.01

which is true. The confidence interval of process mean contains the target value of 5.6 at the 95% confidence level. In other words there is no evidence at the 95% confidence level to believe that the true process mean is other than our target of 5.6

Yet, once again if we are concerned about the process mean drifting upward it makes sense, at least to me, to use an asymmetric confidence interval.

Once again we shall assume that Y follows a normal distribution and use the standard normal deviate Z=.... Oh Gawd, must I write this again? Then the confidence interval on process mean is...


-Infinity < (Y-m)/s/sqrt(3) < Z₉₅

And once again, instead of leaving this in terms of standard normal deviate I'll transform back to physical units

Infinity > m > Y-Z₉₅s/sqrt(3)

The one sided confidence interval switched from -Infinity to +Infinity because of multiplying through the inequality by the minus sign in front of m. If we substitute 5.8 for Y, and use the appropriate values of Z_x from the table on pages 456 and 457, then the confidence interval is

Infinity > 5.6 > 5.135

This inequality is true and the confidence interval contains the target mean. We conclude there is no evidence at the 95% confidence level to believe that the true process mean is higher than our target of 5.6. Notice once again that the asymmetric confidence interval is closer to our sample mean, and is a more conservative test. Figure 2 shows a sampling distribution centered on the current sample mean. The unshaded region is our asymmetric confidence region. If the target value lies within the shaded area we must conclude that either the current process mean has drifted away from the target, or that a 1/20 chance occurence is making it appear so. The sampling distribution in figure 2 is the same as the one in figure 1, and the analyses are mirror images of one another. Is it any wonder that we arrive at the same conclusions?

Figure 2. The sampling distribution is centered on the current sample mean and hopefully the target value lies within the 95% confidence interval.

Part C asks us to rate the power of our test of significance in its ability to detect a change of process mean from the target value of 5.6 to 5.8. This is a small change considering that the standard error of the sample is 0.7/sqrt(3)=0.404, so we expect the power to be small.

The easiest way to think of the power measure is that it represents the probability of rejecting the nominal hypothesis when, in fact, the mean has drifted away from its nominal value. Of course, the power is related to how far the mean has drifted. We are powerless, so to speak, to detect small changes. I'll work this problem by examining power for both a two-tailed and one-tailed test.

We are going to calculate power for detecting a process mean that equals the sample mean of 5.8. The conditional probability statement is
Power = P(Reject H₀ | process mean=5.8)
which translates in English to the probability of rejecting the nominal hypothesis knowing that the process mean equals 5.8. That is, power equals the probability of being correct.

In a two tailed test we reject the null hypothesis if Y < 4.81 or if Y > 6.39. (The reader can fill in the details from Part A.) Therefore, we find the above probability to be...


Power = {1-P( (6.39-5.8)/0.7/sqrt(3) )} + P( (4.81-5.8)/0.7/sqrt(3) )

or

Power = {1-P(1.46)}+P(-2.45) = {1-.9279}+.0071 = .0792

Indeed there is very little power in this test to detect a change in process mean as small as 0.2. Let me now examine the power of the one-tailed test. Here we reject the nominal hypothesis only if Y > 6.26. Thus,...

Power = {1-P( (6.26-5.8)/0.7/sqrt(3) )} = {1-P(1.14)} = 1-0.8729 = .13

The one-tailed test has nearly twice the power to detect a change in process mean of only 0.2. Of course, this increased power did not come to us for free. It has increased the likelihood of making a Type II error. The person testing the hypothesis has to decide what the ramifications of making Type I versus type II errors is and act accordingly.

Link forward to the next set of class notes for Friday, November 3, 2000