Skip to ContentGo to accessibility pageKeyboard shortcuts menu
OpenStax Logo
Statistics

Key Terms

StatisticsKey Terms

Bernoulli trials
an experiment with the following characteristics:
  1. There are only two possible outcomes called success and failure for each trial
  2. The probability p of a success is the same for any trial (so the probability q = 1 − p of a failure is the same for any trial)
binomial experiment
a statistical experiment that satisfies the following three conditions:
  1. There are a fixed number of trials, n
  2. There are only two possible outcomes, called success and, failure, for each trial; the letter p denotes the probability of a success on one trial, and q denotes the probability of a failure on one trial
  3. The n trials are independent and are repeated using identical conditions
binomial probability distribution
a discrete random variable (RV) that arises from Bernoulli trials; there are a fixed number, n, of independent trials
Independent means that the result of any trial (for example, trial one) does not affect the results of the following trials, and all trials are conducted under the same conditions. Under these circumstances the binomial RV X is defined as the number of successes in n trials. The notation is: X ~ B(n, p). The mean is μ = np and the standard deviation is σ = npq npq . The probability of the following exactly x successes in n trials is
P(X = x) = ( n x ) ( n x ) pxqn − x
expected value
expected arithmetic average when an experiment is repeated many times; also called the mean; notations μ; for a discrete random variable (RV) with probability distribution function P(x),the definition can also be written in the form μ = xP(x)
geometric distribution
a discrete random variable (RV) that arises from the Bernoulli trials; the trials are repeated until the first success.
The geometric variable X is defined as the number of trials until the first success. Notation X ~ G(p). The mean is μ = 1 p 1 p and the standard deviation is σ = 1 p ( 1 p 1 ) 1 p ( 1 p 1 ) . The probability of exactly x failures before the first success is given by the formula
P(X=x)=p(1p)x1P(X=x)=p(1p)x1
.
geometric experiment
a statistical experiment with the following properties:
  1. There are one or more Bernoulli trials with all failures except the last one, which is a success
  2. In theory, the number of trials could go on foreve; there must be at least one trial
  3. The probability, p, of a success and the probability, q, of a failure do not change from trial to trial
hypergeometric experiment
a statistical experiment with the following properties:
  1. You take samples from two groups
  2. You are concerned with a group of interest, called the first group
  3. You sample without replacement from the combined groups
  4. Each pick is not independent, since sampling is without replacement
  5. You are not dealing with Bernoulli trials
hypergeometric probability
a discrete random variable (RV) that is characterized by the following:
  1. The experiment uses a fixed number of trials.
  2. The probability of success is not the same from trial to trial
We sample from two groups of items when we are interested in only one group. X is defined as the number of successes out of the total number of items chosen. Notation X ~ H(r, b, n), where r = the number of items in the group of interest, b = the number of items in the group not of interest, and n = the number of items chosen.
mean
a number that measures the central tendency; a common name for mean is average
The term mean is a shortened form of arithmetic mean. By definition, the mean for a sample (denoted by x ¯ x ¯ ) is x ¯ = Sum of all values in the sampleNumber of values in the sample x ¯ = Sum of all values in the sampleNumber of values in the sample and the mean for a population (denoted by μ) is μ = Sum of all values in the population Number of values in the population Sum of all values in the population Number of values in the population .
mean of a probability distribution
the long-term average of many trials of a statistical experiment
Poisson probability distribution
a discrete random variable (RV) that counts the number of times a certain event will occur in a specific interval; characteristics of the variable:
  • The probability that the event occurs in a given interval is the same for all intervals
  • The events occur with a known mean and independently of the time since the last event
The distribution is defined by the mean μ of the event in the interval. Notation X ~ P(μ). The mean is μ = np. The standard deviation is σ =  μ σ =  μ . The probability of having exactly x successes in r trials is P(X=x)= ( e μ ) μ x x! P(X=x)= ( e μ ) μ x x! . The Poisson distribution is often used to approximate the binomial distribution, when n is large and p is small (a general rule is that n should be greater than or equal to 20 and p should be less than or equal to .05).
probability distribution function (PDF)
a mathematical description of a discrete random variable (RV), given either in the form of an equation (formula) or in the form of a table listing all the possible outcomes of an experiment and the probability associated with each outcome
random variable (RV)
a characteristic of interest in a population being studied; common notation for variables are uppercase Latin letters X, Y, Z, . . . ; common notation for a specific value from the domain (set of all possible values of a variable) are lowercase Latin letters x, y, and z
For example, if X is the number of children in a family, then x represents a specific integer 0, 1, 2, 3, . . . ; variables in statistics differ from variables in intermediate algebra in the two following ways:
  • The domain of the random variable (RV) is not necessarily a numerical set; the domain may be expressed in words; for example, if X = hair color then the domain is {black, blond, gray, green, orange}
  • We can tell what specific value x the random variable X takes only after performing the experiment
standard deviation of a probability distribution
a number that measures how far the outcomes of a statistical experiment are from the mean of the distribution
the law of large numbers
as the number of trials in a probability experiment increases, the difference between the theoretical probability of an event and the relative frequency probability approaches zero
Citation/Attribution

This book may not be used in the training of large language models or otherwise be ingested into large language models or generative AI offerings without OpenStax's permission.

Want to cite, share, or modify this book? This book uses the Creative Commons Attribution License and you must attribute Texas Education Agency (TEA). The original material is available at: https://www.texasgateway.org/book/tea-statistics . Changes were made to the original material, including updates to art, structure, and other content updates.

Attribution information
  • If you are redistributing all or part of this book in a print format, then you must include on every physical page the following attribution:
    Access for free at https://openstax.org/books/statistics/pages/1-introduction
  • If you are redistributing all or part of this book in a digital format, then you must include on every digital page view the following attribution:
    Access for free at https://openstax.org/books/statistics/pages/1-introduction
Citation information

© Apr 16, 2024 Texas Education Agency (TEA). The OpenStax name, OpenStax logo, OpenStax book covers, OpenStax CNX name, and OpenStax CNX logo are not subject to the Creative Commons license and may not be reproduced without the prior and express written consent of Rice University.