\(t\)-Tests#
We begin our investigation of statistical hypothesis testing by looking at the \(t\)-tests. Though mathematicians had long understood many of the properties of the normal distribution and how it might be used for data analysis, the \(z\)-test based upon it generated a brittle statistic that required large sample sizes.
The \(t\)-tests generate a robust statistics appropriate for use with much smaller sample sizes than the original \(z\)-tests based upon the normal distribution.
Assumptions#
The assumptions of the \(t\)-test and similar procedures are:
Normality
Independence
Homogeneity of the Variances
The normality assumption requires that the sample data were drawn from a normal (bell-shaped) distribution. The independence assumption asserts that each data point collected was independent of all other data. In other words, if the participants were human beings, they were not allowed to colloborate with, speak to or copy from another research participant. Homogeneity of the variances asserts that the standard deviations of all research samples are equal.
Checking the assumptions can only be accomplished in the cases of normality and homogeneity of the variances. Independence is a research condition requiring exacting data collection standards.
Normality#
We can check the normality assumption by assessing a QQ plot and a density plot. The QQ plot should be roughly linear if the sample data are approximately normally distributed. The density plot will show a bell-shaped distribution if the sample data are approximately normally distributed. The density plot doesn’t have to be perfectly bell-shaped nor perfectly symmetric. Some minor anomalies can exist in the density plot as long as an overall shape is approximately bell-shaped.
Homogeneity of the Variances#
The vital modernaization that the \(t\)-tests have brought to statistics is that the homogeneity of the variances assumption can be violated, at least to some degree, yet the test still retains its accuracy. We will conduct Welch’s \(t\)-test which is the modern variant that focuses upon unequal standard deviations. Still, we need to avoid sharply unequal sample sizes which accentuate the impact of unequal standard deviations. Hence, in this course, we require the ratio of the sample sizes to be no more unequal than \(2:1\).
Hypothesis Testing#
In statistics, hypothesis testing is an umbrella terms covering more than a dozen different statistical tests and procedures. The steps to conducting a hypothesis test are, generally speaking, as follows:
Identify a population parameter and state null and alternative hypotheses about that parameter.
Create a model consistent with the null hypothesis.
Use the model to determine a \(p\)-value (the probability that results as extreme as those we observed would occur by random chance IF the null hypothesis were true).
Based on the p-value, decide whether to reject the null hypothesis in favor of the alternative.
Draw a conclusion in the context of the scenario given.
Types of \(t\)-Tests#
There are three different types of \(t\)-tests:
1-Sample \(t\)-Test
Independent Samples \(t\)-Test
Matched Pairs \(t\)-Test
The independent samples test compares and constrasts the means between two research samples as in a treament vs. control design. The test also can be used for 2-way demographic comparisons such as, for example, comparing young adults (age 25 - 40) to seniors (age 55 - 75) on a measure of high blood pressure.
The matched pairs test compares and constrasts the means between two matched research samples as in a pretest vs posttest design. The matching can occur in studies of twins to determine whether or not genetics factor into a certain ailment. The matching can occur in reaction-time studies where dominant hand reactions are compared to non-dominant hand reactions.
The 1-sample test compares the research sample mean to a population mean. For example, a soft drink bottling company has a machine that can fill thousands of 12 oz. cans of soda each hour. To test it, the company could collect 25 different cans from different batches and find the average number of ounces in the cans. The population mean is clearly 12 oz. The compnay is hoping not to reject the null which would indicate the cans are being filled to some different value, not the 12 oz.