Stat 601, Fall 2001, Class 4

What you need to know from last time

*
Confidence intervals - why make them, how to interpret them
*
Sampling - what makes a good sample, when samples go bad
*
Decision making - dichotomous decision setup

Hypothesis tests on means

*
All todays tests are standard error counters
*
How many standard errors is the null hypothesis mean away from the sample mean
*
If the null hypothesis mean is many standard errors (typically greater than 2) away from the sample mean, then the observed data is not in accordance with the null hypothesis, and we believe the data and reject the null
*
Types of test
*
One sample t-test; testing a single population mean, p.131
*
Two sample t-test; assuming equal variances, p.141.
*
Two sample t-test; NOT assuming equal variances, p.146.
*
Two sample non-parametric tests; NOT assuming approximate normality, p.152. Median + Van der Waerden
*
Assumptions within groups
*
Independence
*
Constant variance
*
Approximately normal

The p-value

*
A measure of the credibility of the null hypothesis
*
Small p-values give evidence against the null
*
In English; the probability that if you did the experiment again and the null hypothesis were true, that you would observe a value of the test statistic as extreme as the one you saw the first time.
*
It picks up the repeatability idea. If something is true (ie the null hypothesis) then you should be able to replicate the observed results. A small p-value says that it would be hard to replicate, hence the small p-value offers evidence against the null

The paired t-test

*
The idea; two repeat observations on the same experimental unit
*
Twins, feet, sales territories, similar firms etc
*
Controls for unwanted variability between subjects
*
Controlling for variability is a very powerful idea. It's like denoising the data, so that the signal is more apparent
*
Essentially a one sample t-test on the differences


Putting ideas together

*
Dealing with confounding variables
*
Review example + introducing correlation

Dealing with confounding variables

*
Marginal association - the relationship between two variables ignoring other possible explanatory variables
*
Partial association - the relationship between two variables having taken into account other explanatory variables
*
Association does NOT imply causation

Example Salary.jmp


Review example - FinMark.jmp

*
Points to note
*
Log transforms to linearize exponential growth
*
Returns remove most time trend, reveal volatility
*
Relative volatility between T-bills and VW-return
*
Good normality properties but a bit fat tailed
*
Return/variance tradeoff in portfolios

Exam discussion

*
Format: similar to homework questions, but relevant output provided
*
Expect to take not more than 2 hours
*
Covered: chapters 1 - 8 in the Case Book
*
Key concepts
*
Descriptive summaries, mean, median, sd, variance
*
The empirical rule
*
The standard error of the mean
*
Quality control charts
*
Confidence intervals
*
Sampling and assumptions
*
Hypothesis tests on means, test statistics and p-values



Subsections


2001-10-05