Class 4
What you need to have learnt from Class 3.
- Two types of model.
- Parallel lines model: different intercepts - same slopes.
- Non-parallel lines: different intercepts and different slopes.
- Two key facts in understanding the JMP output.
- JMP always makes comparisons to the ``average''
of the groups.
- JMP always leaves one group out - you figure out the missing
difference (easy).
- Non-parallel slopes, an interaction model.
- Interaction. A three variable concept (Y,X1,X2). Generic
description: the impact of X1 on Y depends on the value of X2.
- Tests: the null hypothesis is always that the differences are
zero, that is no difference between the groups. Three types of test:
(a) Slope or intercept differences non-zero. (b) Slope
differences non-zero? (c) Intercept differences non-zero?
- Are any of the slope or intercept differences non-zero. (i.e.
does adding the categorical variable and its interaction buy us
any explanatory power?). Use the partial-F. You have to calculate
this one yourself, see p. 233 of the BulkPack.
- Are any of the slope differences non-zero? Do we need separate
slopes (i.e. do we need an interaction term)? Use the
partial-F as given on the interaction
term in the ``effect test''.
- Are any of the intercept differences non-zero? Given we don't
need interaction, do we need separate
intercepts? Use the partial-F as given on the categorical variable
term in the ``effect test'' from a model excluding the interaction.
- A model with different intercepts and same slopes is OK. A
model with different intercepts and different slopes is OK. A model
with same intercepts but different slopes is not
desirable.
- Our rule: if you have an interaction term in the model (i.e.
different slopes) then make sure you have the variables that make
up the interaction in the model as well (even if they are not
significant).
- We know the rule for calculating the missing group on the
output. It's difference is the number that makes all the
differences sum to zero. What about it's t-statistic and p-value?
Rule of thumb - (so long as the missing group has roughly as many
observations as the included groups and the X-values are similar)
use the standard error from the included groups to calculate an
approximate t-statistic. Alternatively recode the categorical
variable so that the missing group has a coding that comes first in
the alphabet and re-run the regression.
New material for today: ANOVA.
ONEWAY ANOVA
- Objective: compare means (of a Y-variable) across different groups.
Example: Is CEO compensation different between sectors?
- A single continuous Y-variable and one categorical X-variable.
- Recognize: X (the group variable) is categorical.
- Conceptually different from regression.
- Regression usually has a model building and prediction objective.
- ANOVA has a group comparison objective - no model building.
- Two basic questions:
- Are the group means all the same or are some significantly
different? Look in the overall ANOVA table to answer
this. Analysis done from ``Fit Y by X'' button.
- If some are different (first test does not tell you which) use
follow up and refocus question: compare groups to one
another - which ones are significantly different?
Various comparison procedures:
- Compare each pair, one at a time. BAD.
- Compare all pairs at once. GOOD. Tukey.
- Compare each group with best. GOOD. Hsu.
- Critical issue to understand: why is comparing each pair, one
pair at a time BAD? Must read pp. 232-234 in Bulk Pack.
- The procedure which compares each pair, one pair at a time (a
two-sample t-test) fails to take into account the number of
comparisons we are making. If we make a lot of comparisons then
just by chance alone we tend to see something significant. (If we
buy many lottery tickets we tend to win the lottery even though any
single ticket is unlikely to win.) No fishing.
- We want to use a procedure that adjusts for the number of
comparisons that are made and also recognizes that the comparisons
may be data driven. Tukey's and Hsu's do just this. They are
multiple comparison procedures with honest Type I error rates.
(Recall: Type I error - saying there's a difference when really
there is not.)
Honest means that when they declare a 5% error rate, then there is a 5%
chance of one or more errors in the entire set of comparisons
NOT a 5% chance of any particular comparison being wrong.
- Multiple comparison procedures achieve honesty by making it
harder to declare a difference significant.
- Assumptions: p-values only have credibility if assumptions
hold. Check by graphing residuals.
- Independent errors.
- Same variance in each group.
- Approximately normal.
- Dealing with JMP output for multiple comparisons. Two choices
- exactly the same conclusions:
- Use graphical output (circle clicking).
- Use table output (reading numbers).
ANOVA with two X-variables
- Objective: compare means (of a Y-variable) across different
groups and combinations of groups.
- Example: how do gas station average profits depend on
incentive scheme and geographic location?
- A single continuous Y-variable and TWO categorical X-variables
- Recognize: the X-variables are both categorical.
- Two basic models:
- No interaction: the impact of X1 on Y does not depend on the
level of X2.
- Interaction: the impact of X1 on Y depends on the level of X2.
- Practical consequences:
- If NO interaction, then you can investigate the impact of
each X by itself.
- If there is interaction (consider practical importance as
well as statistical significance) then you must consider both X1
and X2 together.
- Key graphic - the profile plot. A graphical diagnostic for
interaction - look for parallel versus non-parallel lines.
- After doing a TWOWAY ANOVA, we often compare different
combinations of the variables by concatenating the two X's into a
single column and
doing multiple comparisons. See p.261 and p.270 of the BulkPack.
- We have the usual assumptions on the errors: independent,
constant variance and approximately normal.
- In JMP we do the TWOWAY ANOVA from the ``fit model''
platform. Residuals can be saved from here. Profile plots are also
obtained via this output.
Examples
Repairs.jmp p235.
Design.jmp p257.
Flextime.jmp p264.
Richard Waterman
Wed Aug 20 16:20:47 EDT 1997