Skip to ContentGo to accessibility pageKeyboard shortcuts menu
OpenStax Logo

8.2 A Confidence Interval When the Population Standard Deviation Is Unknown and Small Sample Case

In many cases, the researcher does not know the population standard deviation, σ, of the measure being studied. In these cases, it is common to use the sample standard deviation, s, as an estimate of σ. The normal distribution creates accurate confidence intervals when σ is known, but it is not as accurate when s is used as an estimate. In this case, the Student’s t-distribution is much better. Define a t-score using the following formula:

t=  x -μ s n t=  x -μ s n

The t-score follows the Student’s t-distribution with n – 1 degrees of freedom. The confidence interval under this distribution is calculated with x¯± ( t α 2 ) s n x¯± ( t α 2 ) s n where t α 2 t α 2 is the t-score with area to the right equal to α 2 α 2 , s is the sample standard deviation, and n is the sample size. Use a table, calculator, or computer to find t α 2 t α 2 for a given α.

8.3 A Confidence Interval for A Population Proportion

Some statistical measures, like many survey questions, measure qualitative rather than quantitative data. In this case, the population parameter being estimated is a proportion. It is possible to create a confidence interval for the true population proportion following procedures similar to those used in creating confidence intervals for population means. The formulas are slightly different, but they follow the same reasoning.

Let p′ represent the sample proportion, x/n, where x represents the number of successes and n represents the sample size. Let q′ = 1 – p′. Then the confidence interval for a population proportion is given by the following formula:

p' - Z α p'q' n p p' + Z α p'q' n p'- Z α p'q' n pp'+ Z α p'q' n

8.4 Calculating the Sample Size n: Continuous and Binary Random Variables

Sometimes researchers know in advance that they want to estimate a population mean within a specific margin of error for a given level of confidence. In that case, solve the relevant confidence interval formula for n to discover the size of the sample that is needed to achieve this goal:

n=  Z α 2 σ 2 (x¯-μ)2 n=  Z α 2 σ 2 (x¯-μ)2

If the random variable is binary then the formula for the appropriate sample size to maintain a particular level of confidence with a specific tolerance level is given by

n = Zα2pq e2 n= Zα2pq e2


This book may not be used in the training of large language models or otherwise be ingested into large language models or generative AI offerings without OpenStax's permission.

Want to cite, share, or modify this book? This book uses the Creative Commons Attribution License and you must attribute OpenStax.

Attribution information
  • If you are redistributing all or part of this book in a print format, then you must include on every physical page the following attribution:
    Access for free at
  • If you are redistributing all or part of this book in a digital format, then you must include on every digital page view the following attribution:
    Access for free at
Citation information

© Dec 6, 2023 OpenStax. Textbook content produced by OpenStax is licensed under a Creative Commons Attribution License . The OpenStax name, OpenStax logo, OpenStax book covers, OpenStax CNX name, and OpenStax CNX logo are not subject to the Creative Commons license and may not be reproduced without the prior and express written consent of Rice University.