The z-Proportion Test

The \(z\)-Proportion Test#

While the \(z\) procedures have been, in the main, replaced by \(t\) procedures, one area of study still uses the the normal curve: qualitative data with research questions framed as proportions per category.

Uses for \(z\)-Proportion Testing#

The quintessential example of using \(z\)-proportion procedures is political polling where researchers estimate the percentage of votes a candidate is likely to receive in the primaries or general election. As our class formula sheet makes clear, we are estimating the proportion data for the qualitative variable \(p\) using the following conversion to a normal distribution:

\[ z = \frac{\hat p - p_0}{\sqrt{\frac{p_0(1-p_0)}{n}}} \]

where the distribution of \(p\) is approximately \(N\left(p, \sqrt{\frac{p(1-p))}{n}}\right)\).

Example: Voting#

A gun control resolution is being considered for the metro Atlanta area. A pollster wishes to gauge opinion for or against the law in both Fulton and Gwinett counties. Test at the \(\alpha = 0.10\) level given that the data collected are shown in the table below.

	For	Against
Fulton	281	192
Gwinnett	214	229

Hypotheses#

This is a 2-sample proportion test where we test for a difference in the percent of voters who plan to vote for the resolution.

\[\begin{split}\begin{align}H_0 &: p_f = p_g \\ H_a &: p_f \neq p_g\end{align}\end{split}\]

Conduct the Test#

We first proceed with table and a formula. The \(z\) table shows that \(t^* = \pm 1.65\) corresponds to \(\alpha = 0.05\) and \(\alpha = 0.95\) which, since we are conducting a 2-tailed hypothesis test, will be correct. The correct formula for a 2-sample \(z\)-proportion test is given below

\[ z = \frac{\hat p_f - \hat p_g}{\sqrt{\hat p(1-\hat p)\left(\frac{1}{n_f}+\frac{1}{n_g}\right)}} \]

where \(\hat p\) is the pooled (overall) proportion of successes.

\[\begin{split}\begin{align}z &= \frac{\hat p_f - \hat p_g}{\sqrt{\hat p(1-\hat p)\left(\frac{1}{n_f}+\frac{1}{n_g}\right)}}\\&= \frac{0.594 - 0.483}{\sqrt{0.5404(0.4596)\left(\frac{1}{473}+\frac{1}{443}\right)}}\\&= \frac{0.111}{\sqrt{0.2484\left(0.004372\right)}}\implies\\z &\approx 3.3683 \end{align}\end{split}\]

Since \(z = 3.3683 > 1.65 = z^*\), we reject the null.

Using R to do all computations, we have the following:#

Creating the observed data matrix:

obs = matrix(c(281, 214, 192, 229), ncol =2)
obs

281	192
214	229

We set \(\textbf{correct} = \textbf{FALSE}\) to turn off the continuity correction which will force R into a similar computation as we do with formulas and tables:

test <- prop.test(obs, correct = FALSE)
test

	2-sample test for equality of proportions without continuity
	correction

data:  obs
X-squared = 11.35, df = 1, p-value = 0.0007544
alternative hypothesis: two.sided
95 percent confidence interval:
 0.04679303 0.17522769
sample estimates:
   prop 1    prop 2 
0.5940803 0.4830700 

Our calculated test statistic \(z = 3.3683\) should be within rounding error of the square root of R’s calcuated test statistic:

sqrt(test$statistic)

X-squared: 3.36899244646954

We find that it is, and with \(p = 0.000754 < 0.05 = \alpha\), we reject the null.

Reporting Out#

Given that we reject the null (\(p < 0.001\)), we have evidence for a difference in voter preferrence on this referrendum. A higher percentage of Fulton County voters favor the referrendum than do Gwinnett County voters.