This seems like a useful thing to know, for example, during
sales negotiations: If a buyer proposes 1) to buy a larger ring
with 2/10 more carat instead of a smaller one, and 2) to pay
400 more and throw in an heirloom ring of his that we estimate to
be worth 220, do we go for the deal? Maybe not: Our CI for an additional 2/10
carats is (712, 776), so 400+220=620 falls short of the
range where we think the average price of an additional 2/10 carats
should be.
(We'll get to know other ways of answering such questions
with prediction intervals that might be more satisfactory.)
We explain these quantities in a minute, but first we show where to find them in JMP.
(Ignore other sections such as "Lack of Fit" and "Analysis of Variance" for now.)
These conditions all describe the situation that the observed estimate b1 and the assumed parameter value b1=0 are too far apart to be compatible: b1 is too unlikely to be observed under the assumption b1=0. The yard stick for measuring distance between b1 and anything else is the SE, because the SE measures the uncertainty in b1.
The most confusing among the above conditions for rejection of H0 is in terms of the p-value. Recall: the p-value is a measure of evidence in favor of H0 on a probability scale. In order to reject H0, we want this evidence for H0 to be small, namely, less than 0.05 (a convention corresponding to 0.95 confidence for CIs).
Slides 3-3 and 3-5 show how to test any other slope value also, such as b1=$3,800 for the diamond data. We just check whether b1=3721 and b1=3800 are further apart than two SEs (=82). They are not: |3721-3800|=79 < 2*82=164. Hence an assumed price of 3800 per additional carat could not be rejected based on the data.
Preliminary remark: From now on se = RMSE is our new notation for the estimate of s, the spread of the response values around the true line:
se 1 SE(b1) = --- * -- n1/2 sXWe will never use the above formula for actual calculation of SE(b1) since JMP does that. But we find the formula insightful.
First a reminder: sX is the simple standard deviation of the x-values.
se (x-x)2
SE(yhatx) = --- ( 1 + ----- )1/2
n1/2 sX2
where we use the more intuitive abbreviation se = RMSE. The 95% CI is
Why would we show this arcane formula for SE(yhatx) above?
Not for calculations (JMP does that). Again, it's for a qualitative insight:
The second point is new: SE(yhatx) = se/n1/2 only for x=x.
As x moves away from x, SE(yhatx) grows, that is, the CI widens!
The growth is slow, though, as we can convince ourselves in examples.
In summary, x=x is the fulcrum where the estimated lines
wobble the least under sample-to-sample variability.
Fine print: The above CI holds only if the SRM is correct. If there is undiscovered curvature or heteroscedasticity, the CI for yhatx is not valid, meaning, the coverage of the CI will not be 95%.
Examples: Used Cars data (left) and the Philadelphia Crime data (right)
Note the above two plots do linear fits (Fit Line) to transformed variables that obviously have been computed with JMP formulas. If we do the same thing but on the untransformed variables but let "Fit Special" do the transformation, then the CI band is shown on the plot of the original, untransformed variables:
contains 95% of future observations y at x.
Replacing the truth with estimates (yhat(x) = b0 + b1*x) works ok
if we're not extrapolating outside the range of observed x-values:
will contain about 95% of future observations y at x.
(Recall se = RMSE.)
sY2-se2 R2 = ------- sY2This modification of se is not desirable, which is why the right hand quantity with the actual, unmodified se2 is called adjusted R2, which you recall as the second number in JMP's regression outputs. The adjusted R2 is slightly more realistic as an estimate of R2 for the population (which is a "sample with n=infinity").