Skip to ContentGo to accessibility pageKeyboard shortcuts menu
OpenStax Logo
Statistics

A | Appendix A Review Exercises (Ch 3–13)

StatisticsA | Appendix A Review Exercises (Ch 3–13)

These review exercises are designed to provide extra practice on concepts learned before a particular chapter. For example, the review exercises for Chapter 3 cover material learned in Chapters 1 and 2.

Chapter 3

Use the following information to answer the next six exercises. In a survey of 100 stocks on NASDAQ, the average percent increase for the past year was 9 percent for NASDAQ stocks.

1. The average increase for all NASDAQ stocks is the —

  1. population
  2. statistic
  3. parameter
  4. sample
  5. variable


2. All of the NASDAQ stocks are —

  1. population
  2. statistics
  3. parameter
  4. sample
  5. variable


3. Nine percent is —

  1. population
  2. statistics
  3. parameter
  4. sample
  5. variable


4. The 100 NASDAQ stocks in the survey are —

  1. population
  2. statistic
  3. parameter
  4. sample
  5. variable


5. The percent increase for one stock in the survey is —

  1. population
  2. statistic
  3. parameter
  4. sample
  5. variable


6. Would the data collected by qualitative, quantitative discrete, or quantitative continuous?

Use the following information to answer the next two exercises. Thirty people spent two weeks around Mardi Gras in New Orleans. Their two-week weight gain is below. Note—a loss is shown by a negative weight gain.

Weight Gain Frequency
–2 3
–1 5
0 2
1 4
4 13
6 2
11 1
Table A1

7. Calculate the following values:

  1. The average weight gain for the two weeks
  2. The standard deviation
  3. The first, second, and third quartiles


8. Construct a histogram and box plot of the data.

Chapter 4

Use the following information to answer the next two exercises. A recent poll concerning credit cards found that 35 percent of respondents use a credit card that gives them a mile of air travel for every dollar they charge. Thirty percent of the respondents charge more than $2,000 per month. Of those respondents who charge more than $2,000, 80 percent use a credit card that gives them a mile of air travel for every dollar they charge.

9. What is the probability that a randomly selected respondent will spend more than $2,000 and use a credit card that gives them a mile of air travel for every dollar they charge?

  1. (.30)(.35)
  2. (.80)(.35)
  3. (.80)(.30)
  4. (.80)


10. Are using a credit card that gives a mile of air travel for each dollar spent and charging more than $2,000 per month independent events?

  1. Yes
  2. No, and they are not mutually exclusive either
  3. No, but they are mutually exclusive
  4. Not enough information given to determine the answer


11. A sociologist wants to know the opinions of employed adult women about government funding for day care. She obtains a list of 520 members of a local business and professional women’s club and mails a questionnaire to 100 of these women selected at random. Sixty-eight questionnaires are returned. What is the population in this study?

  1. All employed adult women
  2. All the members of a local business and professional women’s club
  3. The 100 women who received the questionnaire
  4. All employed women with children


Use the following information to answer the next two exercises. An article from the San Jose Mercury News was concerned with the racial mix of the 1,500 students at Prospect High School in Saratoga, CA. The table summarizes the results. Male and female values are approximate. Suppose one Prospect High School student is randomly selected.

Gender/Ethnic Group White Asian Hispanic Black American Indian
Male 400 468 115 35 16
Female 440 132 140 40 14
Table A2

12. Find the probability that a student is Asian or male.

13. Find the probability that a student is black given that the student is female.

14. A sample of pounds lost, in a certain month, by individual members of a weight reducing clinic produced the following statistics:

  • Mean = 5 lbs
  • Median = 4.5 lbs
  • Mode = 4 lbs
  • Standard deviation = 3.8 lbs
  • First quartile = 2 lbs
  • Third quartile = 8.5 lbs


What is the correct statement?

  1. One fourth of the members lost exactly two pounds.
  2. The middle 50 percent of the members lost from two to 8.5 lbs.
  3. Most people lost 3.5 to 4.5 lbs.
  4. All of the choices above are correct.


15. What does it mean when a data set has a standard deviation equal to zero?

  1. All values of the data appear with the same frequency.
  2. The mean of the data is also zero.
  3. All of the data have the same value.
  4. There are no data to begin with.


16. Which statement describes the illustration?

This is a boxplot. There is no left whisker. The boxplot consists of a box with dashed line at the left edge, and a right whisker.
Figure A1
  1. The mean is equal to the median.
  2. There is no first quartile.
  3. The lowest data value is the median.
  4. The median equals Q 1 + Q 3 2 Q 1 + Q 3 2 .


17. According to a recent article in the San Jose Mercury News the average number of babies born with significant hearing loss—deafness—is approximately 2 per 1,000 babies in a healthy baby nursery. The number climbs to an average of 30 per 1,000 babies in an intensive care nursery. Suppose that 1,000 babies from healthy baby nurseries were randomly surveyed. Find the probability that exactly two babies were born deaf.

18. A friend offers you the following deal: For a $10 fee, you may pick an envelope from a box containing 100 seemingly identical envelopes. However, each envelope contains a coupon for a free gift.

  • Ten of the coupons are for a free gift worth $6.
  • Eighty of the coupons are for a free gift worth $8.
  • Six of the coupons are for a free gift worth $12.
  • Four of the coupons are for a free gift worth $40.


Based upon the financial gain or loss over the long run, should you play the game?

  1. Yes, I expect to come out ahead in money.
  2. No, I expect to come out behind in money.
  3. It doesn’t matter. I expect to break even.


Use the following information to answer the next four exercises. Recently, a nurse commented that when a patient calls the medical advice line claiming to have the flu, the chance that he/she truly has the flu—and not just a nasty cold—is only about 4 percent. Of the next 25 patients calling in claiming to have the flu, we are interested in how many actually have the flu.

19. Define the random variable and list its possible values.

20. State the distribution of X.

21. Find the probability that at least four of the 25 patients actually have the flu.

22. On average, for every 25 patients calling in, how many do you expect to have the flu?

Use the following information to answer the next two exercises. Different types of writing can sometimes be distinguished by the number of letters in the words used. A student interested in this fact wants to study the number of letters of words used by Tom Clancy in his novels. She opens a Clancy novel at random and records the number of letters of the first 250 words on the page.

23. What kind of data was collected?

  1. Qualitative
  2. Quantitative continuous
  3. Quantitative discrete


24. What is the population under study?

Chapter 5

Use the following information to answer the next five exercises. A recent study of mothers of junior high school children in Santa Clara County reported that 76 percent of the mothers are employed in paid positions. Of those mothers who are employed, 64 percent work full-time—more than 35 hours per week—and 36 percent work part-time. However, out of all of the mothers in the population, 49 percent work full-time. The population under study is made up of mothers of junior high school children in Santa Clara County. Let E = employed and F = full-time employment.

25.

  1. Find the percent of all mothers in the population that are not employed.
  2. Find the percent of mothers in the population that are employed part-time.


26. The type of employment is considered to be what type of data?

27. Find the probability that a randomly selected mother works part-time given that she is employed.

28. Find the probability that a randomly selected person from the population will be employed or work full-time.

29. Being employed and working part-time—

  1. mutually exclusive events? Why or why not?
  2. independent events? Why or why not?


Use the following additional information to answer the next two exercises. We randomly pick 10 mothers from the above population. We are interested in the number of the mothers that are employed. Let X = number of mothers that are employed.

30. State the distribution for X.

31. Find the probability that at least six are employed.

32. We expect the statistics discussion board to have, on average, 14 questions posted to it per week. We are interested in the number of questions posted to it per day.

  1. Define X.
  2. What are the values that the random variable may take on?
  3. State the distribution for X.
  4. Find the probability that from 10 to 14—inclusive—questions are posted to the listserv on a randomly picked day.


33. A person invests $1,000 into stock of a company that hopes to go public in one year. The probability that the person will lose all his money after one year, that is, his stock will be worthless, is 35 percent. The probability that the person’s stock will still have a value of $1,000 after one year, that is, no profit and no loss, is 60 percent. The probability that the person’s stock will increase in value by $10,000 after one year, that is, will be worth $11,000, is 5 percent. Find the expected profit after one year.

34. Rachel’s piano cost $3,000. The average cost for a piano is $4,000 with a standard deviation of $2,500. Becca’s guitar cost $550. The average cost for a guitar is $500 with a standard deviation of $200. Matt’s drums cost $600. The average cost for drums is $700 with a standard deviation of $100. Whose cost was lowest when compared to his or her own instrument?

This is a boxplot over a number line  from 0 to 7. The left whisker ranges from minimum, 0, to lower quartile, 2. The box runs from lower quartile, 2, to upper quartile, 5. A dashed line marks the median at 4. The right whisker runs from 5 to maximum value 7.
Figure A2

35. Explain why each statement is either true or false given the box plot in Figure A2.

  1. Twenty-five percent of the data are at most five.
  2. There is the same amount of data from 4–5 as there is from 5–7.
  3. There are no data values of three.
  4. Fifty percent of the data are four.


Using the following information to answer the next two exercises. 64 faculty members were asked the number of cars they owned—including spouse and children’s cars. The results are given in the following graph.

This shows a relative frequency bar graph. The horizontal axis shows the number of cars using whole numbers from 0 to 6. The vertical axis shows relative frequency in units of 0.1 from 0.15 to 0.45. The graph shows the following proportions: 0.075 of responses are 1, 0.15 are 2, 0.45 are 3, 0.25 are 4, and 0.075 of responses are 6.
Figure A3

36. Find the approximate number of responses that were three.

37. Find the first, second, and third quartiles. Use them to construct a box plot of the data.

Use the following information to answer the next three exercises. Table A3 shows data gathered from 15 girls on the Snow Leopard soccer team when they were asked how they liked to wear their hair. Supposed one girl from the team is randomly selected.

Hair Style/Hair Color Blond Brown Black
Ponytail 3 2 5
Plain 2 2 1
Table A3

38. Find the probability that the girl has black hair GIVEN that she wears a ponytail.

39. Find the probability that the girl wears her hair plain OR has brown hair.

40. Find the probability that the girl has blond hair AND that she wears her hair plain.

Chapter 6

Use the following information to answer the next two exercises. X ~ U(3, 13)

41. Explain which of the following are false and which are true.

  1. f(x) = 1 10 1 10 , 3 ≤ x ≤ 13
  2. There is no mode.
  3. The median is less than the mean.
  4. P(x > 10) = P(x ≤ 6)


42. Calculate

  1. the mean,
  2. the median, and
  3. the 65th percentile.


This is a boxplot over a number line  from 0 to 7. The left whisker ranges from minimum, 0, to lower quartile, 2. The box runs from lower quartile, 2, to upper quartile, 5. A dashed line marks the median at 4. The right whisker runs from 5 to maximum value 7.
Figure A4

43. Which of the following is true for the box plot in Figure A4?

  1. Twenty-five percent of the data are at most five.
  2. There is about the same amount of data from 4–5 as there is from 5–7.
  3. There are no data values of three.
  4. Fifty percent of the data are four.


44. If P(G|H) = P(G), then which of the following is correct?

  1. G and H are mutually exclusive events.
  2. P(G) = P(H)
  3. Knowing that H has occurred will affect the chance that G will happen.
  4. G and H are independent events.


45. If P(J) = .3, P(K) = .63, and J and K are independent events, then explain which are correct and which are incorrect.

  1. P(J AND K) = 0
  2. P(J OR K) = .9
  3. P(J OR K) = .72
  4. P(J) ≠ P(J|K)


46. On average, five students from each high school class get full scholarships to four-year colleges. Assume that most high school classes have about 500 students. X = the number of students from a high school class that get full scholarships to four-year schools. Which of the following is the distribution of X?

  1. P(5)
  2. B(500, 5)
  3. Exp ( 1 5 ) ( 1 5 )
  4. N( 5, (.01)(.99) 500 ) N( 5, (.01)(.99) 500 )


Chapter 7

Use the following information to answer the next three exercises. Richard’s Furniture Company delivers furniture from 10 a.m. to 2 p.m. continuously and uniformly. We are interested in how long—in hours—past the 10 a.m. start time that individuals wait for their delivery.

47. X ~ ________

  1. U(0, 4)
  2. U(10, 20)
  3. Exp(2)
  4. N(2, 1)


48. The average wait time is —

  1. one hour
  2. two hours
  3. two and a half hours
  4. four hours


49. Suppose that it is now past noon on a delivery day. The probability that a person must wait at least 1.5 more hours is —

  1. 1 4 1 4
  2. 1 2 1 2
  3. 3 4 3 4
  4. 3 8 3 8


50. Given X ~ Exp ( 1 3 ) ( 1 3 )

  1. Find P(x > 1).
  2. Calculate the minimum value for the upper quartile.
  3. Find P ( x= 1 3 ) ( x= 1 3 )


51.

  • Forty percent of full-time students took four years to graduate.
  • Thirty percent of full-time students took five years to graduate.
  • Twenty percent of full-time students took six years to graduate.
  • Ten percent of full-time students took seven years to graduate.


The expected time for full-time students to graduate is —

  1. four years
  2. four and a half years
  3. five years
  4. five and a half years


52. Which of the following distributions is described by the following example?
Many people can run a short distance of under two miles, but as the distance increases, fewer people can run that far.

  1. binomial
  2. uniform
  3. exponential
  4. normal


53. The length of time to brush one’s teeth is generally thought to be exponentially distributed with a mean of 3 4 3 4 minutes. Find the probability that a randomly selected person brushes his or her teeth less than 3 4 3 4 minutes.

  1. .5
  2. 3 4 3 4
  3. .43
  4. .63


54. Which distribution accurately describes the following situation?
The chance that a teenage boy regularly gives his mother a kiss goodnight is about 20 percent. Fourteen teenage boys are randomly surveyed. Let X = the number of teenage boys that regularly give their mother a kiss goodnight.

  1. B(14,.20)
  2. P(2.8)
  3. N(2.8,2.24)
  4. Exp ( 1 .20 ) ( 1 .20 )


55. A 2008 report on technology use states that approximately 20 percent of U.S. households have never sent an email. Suppose that we select a random sample of fourteen U.S. households. Let X = the number of households in a 2008 sample of 14 households that have never sent an email.

  1. B(14,.20)
  2. P(2.8)
  3. N(2.8,2.24)
  4. Exp ( 1 .20 ) ( 1 .20 )


Chapter 8

Use the following information to answer the next three exercises. Suppose that a sample of 15 randomly chosen people were put on a special weight-loss diet. The amount of weight lost, in pounds, follows an unknown distribution with mean equal to 12 pounds and standard deviation equal to three pounds. Assume that the distribution for the weight loss is normal.

56. To find the probability that the mean amount of weight lost by 15 people is no more than 14 pounds, the random variable should be ________.

  1. number of people who lost weight on the special weight-loss diet
  2. the number of people who were on the diet
  3. the mean amount of weight lost by 15 people on the special weight-loss diet
  4. the total amount of weight lost by 15 people on the special weight-loss diet


57. Find the probability asked for in Question 56.

58. Find the 90th percentile for the mean amount of weight lost by 15 people.

Using the following information to answer the next three exercises. The time of occurrence of the first accident during rush-hour traffic at a major intersection is uniformly distributed between the three hour interval 4 p.m. to 7 p.m. Let X = the amount of time—hours—it takes for the first accident to occur.

59. What is the probability that the time of occurrence is within the first half-hour or the last hour of the period from 4 to 7 p.m.?

  1. It cannot be determined from the information given.
  2. 1 6 1 6
  3. 1 2 1 2
  4. 1 3 1 3


60. The 20th percentile occurs after how many hours?

  1. .20
  2. .60
  3. .50
  4. 1


61. Assume Ramon has kept track of the times for the first accidents to occur for 40 different days. Let C = the total cumulative time. Then C follows which distribution?

  1. U(0,3)
  2. Exp(13)
  3. N(60, 5.477)
  4. N(1.5, .01875)


62. Using the information in Question 61, find the probability that the total time for all first accidents to occur is more than 43 hours.

Use the following information to answer the next two exercises. The length of time a parent must wait for his children to clean their rooms is uniformly distributed in the time interval from one to 15 days.

63. How long must a parent expect to wait for his children to clean their rooms?

  1. 8 days
  2. 3 days
  3. 14 days
  4. 6 days


64. What is the probability that a parent will wait more than six days given that the parent has already waited more than three days?

  1. .5174
  2. .0174
  3. .7500
  4. .2143


Use the following information to answer the next five exercises. Twenty percent of the students at a local community college live in within five miles of the campus. Thirty percent of the students at the same community college receive some kind of financial aid. Of those who live within five miles of the campus, 75 percent receive some kind of financial aid.

65. Find the probability that a randomly chosen student at the local community college does not live within five miles of the campus.

  1. 80 percent
  2. 20 percent
  3. 30 percent
  4. Cannot be determined


66. Find the probability that a randomly chosen student at the local community college lives within five miles of the campus or receives some kind of financial aid.

  1. 50 percent
  2. 35 percent
  3. 27.5 percent
  4. 75 percent


67. Are living in student housing within five miles of the campus and receiving some kind of financial aid mutually exclusive?

  1. Yes
  2. No
  3. Cannot be determined


68. The interest rate charged on the financial aid is ________ data.

  1. Quantitative discrete
  2. Quantitative continuous
  3. Qualitative discrete
  4. Qualitative


69. The following information is about the students who receive financial aid at the local community college.

  • 1st quartile = $250
  • 2nd quartile = $700
  • 3rd quartile = $1,200


These amounts are for the school year. If a sample of 200 students is taken, how many are expected to receive $250 or more?

  1. 50
  2. 250
  3. 150
  4. Cannot be determined


Use the following information to answer the next two exercises. P(A) = .2, P(B) = .3; A and B are independent events.

70. P(A AND B) = —

  1. .5
  2. .6
  3. 0
  4. .06


71. P(A OR B) = —

  1. .56
  2. .5
  3. .44
  4. 1


72. If H and D are mutually exclusive events, P(H) = .25, P(D) = .15, then P(H|D).

  1. 1
  2. 0
  3. .40
  4. .0375


Chapter 9

73. Rebecca and Matt are 14 year old twins. Matt’s height is two standard deviations below the mean for 14 year old boys’ height. Rebecca’s height is .10 standard deviations above the mean for 14 year old girls’ height. Interpret this.

  1. Matt is 2.1 inches shorter than Rebecca.
  2. Rebecca is very tall compared to other 14 year old girls.
  3. Rebecca is taller than Matt.
  4. Matt is shorter than the average 14 year old boy.


74. Construct a histogram of the IPO data (see Appendix C Data Sets).

Use the following information to answer the next three exercises. Ninety homeowners were asked the number of estimates they obtained before having their homes fumigated. Let X = the number of estimates.

x Relative Frequency Cumulative Relative Frequency
1 .3
2 .2
4 .4
5 .1
Table A4

75. Complete the cumulative frequency column.

76. Calculate the sample mean (a), the sample standard deviation (b), and the percent of the estimates that fall at or below four (c).

77. Calculate the median, M, the first quartile, Q1, and the third quartile Q3. Then construct a box plot of the data.

78. The middle 50 percent of the data are between ________ and ________.

Use the following information to answer the next three exercises. Seventy fifth and sixth graders were asked their favorite dinner.

Pizza Hamburgers Spaghetti Fried Shrimp
5th Grader 15 6 9 0
6th Grader 15 7 10 8
Table A5

79. Find the probability that one randomly chosen child is in the 6th grade and prefers fried shrimp.

  1. 32 70 32 70
  2. 8 32 8 32
  3. 8 8 8 8
  4. 8 70 8 70


80. Find the probability that a child does not prefer pizza.

  1. 30 70 30 70
  2. 30 40 30 40
  3. 40 70 40 70
  4. 1


81. Find the probability a child is in the fifth grade given that the child prefers spaghetti.

  1. 9 19 9 19
  2. 9 70 9 70
  3. 9 30 9 30
  4. 19 70 19 70


82. A sample of convenience is a random sample.

  1. True
  2. False


83. A statistic is a number that is a property of the population.

  1. True
  2. False


84. You should always throw out any data that are outliers.

  1. True
  2. False


85. Lee bakes pies for a small restaurant in Felton, CA. She generally bakes 20 pies in a day, on average. Of interest is the number of pies she bakes each day.

  1. Define the random variable X.
  2. State the distribution for X.
  3. Find the probability that Lee bakes more than 25 pies in any given day.


86. Six different brands of Italian salad dressing were randomly selected at a supermarket. The grams of fat per serving are 7, 7, 9, 6, 8, and 5. Assume that the underlying distribution is normal. Calculate a 95 percent confidence interval for the population mean grams of fat per serving of Italian salad dressing sold in supermarkets.

87. Given: uniform, exponential, normal distributions. Match each to a statement below.

  1. mean = median ≠ mode
  2. mean > median > mode
  3. mean = median = mode


Chapter 10

Use the following information to answer the next three exercises. In a survey at Kirkwood Ski Resort the following information was recorded.

0–10 11–20 21–40 40+
Ski 10 12 30 8
Snowboard 6 17 12 5
Table A6

Suppose that one person from Table A6 was randomly selected.

88. Find the probability that the person was a skier or was age 11–20.

89. Find the probability that the person was a snowboarder given he or she was age 21–40.

90. Explain which of the following are true and which are false.

  1. Sport and age are independent events.
  2. Ski and age 11–20 are mutually exclusive events.
  3. P(Ski AND age 21–40) < P(Ski|age 21–40)
  4. P(Snowboard OR age 0–10) < P(Snowboard|age 0–10)


91. The average length of time a person with a broken leg wears a cast is approximately six weeks. The standard deviation is about three weeks. Thirty people who had recently healed from broken legs were interviewed. State the distribution that most accurately reflects total time to heal for the 30 people.

92. The distribution for X is uniform. What can we say for certain about the distribution for X ¯ X ¯ when n = 1?

  1. The distribution for X ¯ X ¯ is still uniform with the same mean and standard deviation as the distribution for X.
  2. The distribution for X ¯ X ¯ is normal with the different mean and a different standard deviation as the distribution for X.
  3. The distribution for X ¯ X ¯ is normal with the same mean but a larger standard deviation than the distribution for X.
  4. The distribution for X ¯ X ¯ is normal with the same mean but a smaller standard deviation than the distribution for X.


93. The distribution for X is uniform. What can we say for certain about the distribution for X X when n = 50?

  1. The distribution for X X is still uniform with the same mean and standard deviation as the distribution for X.
  2. The distribution for X X is normal with the same mean but a larger standard deviation as the distribution for X.
  3. The distribution for X X is normal with a larger mean and a larger standard deviation than the distribution for X.
  4. The distribution for X X is normal with the same mean but a smaller standard deviation than the distribution for X.


Use the following information to answer the next three exercises. A group of students measured the lengths of all the carrots in a five-pound bag of baby carrots. They calculated the average length of baby carrots to be 2.0 inches with a standard deviation of 0.25 inches. Suppose we randomly survey 16 five-pound bags of baby carrots.

94. State the approximate distribution for X ¯ X ¯ , the distribution for the average lengths of baby carrots in 16 five-pound bags. X ¯ X ¯ ~ ________.

95. Explain why we cannot find the probability that one individual randomly chosen carrot is greater than 2.25 inches.

96. Find the probability that x ¯ x ¯ is between 2.0 and 2.25 inches.

Use the following information to answer the next three exercises. At the beginning of the term, the amount of time a student waits in line at the campus store is normally distributed with a mean of five minutes and a standard deviation of two minutes.

97. Find the 90th percentile of waiting time in minutes.

98. Find the median waiting time for one student.

99. Find the probability that the average waiting time for 40 students is at least 4.5 minutes.

Chapter 11

Use the following information to answer the next four exercises. Suppose that the time that owners keep their cars—purchased new—is normally distributed with a mean of seven years and a standard deviation of two years. We are interested in how long an individual keeps his car—purchased new. Our population is people who buy their cars new.

100. Sixty percent of individuals keep their cars at most how many years?

101. Suppose that we randomly survey one person. Find the probability that person keeps his or her car less than 2.5 years.

102. If we are to pick individuals 10 at a time, find the distribution for the mean car length ownership.

103. If we are to pick 10 individuals, find the probability that the sum of their ownership time is more than 55 years.

104. For which distribution is the median not equal to the mean?

  1. Uniform
  2. Exponential
  3. Normal
  4. Student t


105. Compare the standard normal distribution to the Student’s t distribution, centered at zero. Explain which of the following are true and which are false.

  1. As the number surveyed increases, the area to the left of –1 for the Student’s t distribution approaches the area for the standard normal distribution.
  2. As the degrees of freedom decrease, the graph of the Student’s t distribution looks more like the graph of the standard normal distribution.
  3. If the number surveyed is 15, the normal distribution should never be used.


Use the following information to answer the next five exercises. We are interested in the checking account balance of 24-old college students. We randomly survey 16 20-year-old college students. We obtain a sample mean of $640 and a sample standard deviation of $150. Let X = checking account balance of an individual 20-year-old college student.

106. Explain why we cannot determine the distribution of X.

107. If you were to create a confidence interval or perform a hypothesis test for the population mean checking account balance of 20-year-old college students, what distribution would you use?

108. Find the 95 percent confidence interval for the true mean checking account balance of a 20-year-old college student.

109. What type of data is the balance of the checking account considered to be?

110. What type of data is the number of 20-year-olds considered to be?

111. On average, a busy emergency room gets a patient with a shotgun wound about once per week. We are interested in the number of patients with a shotgun wound the emergency room gets per 28 days.

  1. Define the random variable X.
  2. State the distribution for X.
  3. Find the probability that the emergency room gets no patients with shotgun wounds in the next 28 days.


Use the following information to answer the next two exercises. The probability that a certain slot machine will pay back money when a quarter is inserted is .30. Assume that each play of the slot machine is independent from each other. A person puts in 15 quarters for 15 plays.

112. Is the expected number of plays of the slot machine that will pay back money greater than, less than, or the same as the median? Explain your answer.

113. Is it likely that exactly eight of the 15 plays would pay back money? Justify your answer numerically.

114. A game is played with the following rules:

  • It costs $10 to enter.
  • A fair coin is tossed four times.
  • If you do not get four heads or four tails, you lose your $10.
  • If you get four heads or four tails, you get back your $10, plus $30 more.


Over the long run of playing this game, what are your expected earnings?

115.

  • The mean grade on a math exam in Rachel’s class was 74, with a standard deviation of five. Rachel earned an 80.
  • The mean grade on a math exam in Becca’s class was 47, with a standard deviation of two. Becca earned a 51.
  • The mean grade on a math exam in Matt’s class was 70, with a standard deviation of eight. Matt earned an 83.


Find whose score was the best, compared to his or her own class. Justify your answer numerically.

Use the following information to answer the next two exercises. A random sample of 70 compulsive gamblers were asked the number of days they go to casinos per week. The results are given in the following graph.

This shows a relative frequency histogram. The horizontal axis shows the number of days using whole numbers from 1 to 7. The vertical axis shows relative frequency in units of 0.1 from 0.1 to 0.3. The graph shows the following proportions: 0.2 of responses are 1, 0.2 are 2, 0.3 are 3, 0.2 are 5, and 0.1 of responses are 7.
Figure A5

116. Find the number of responses that were five.

117. Find the mean, standard deviation, the median, the first quartile, the third quartile, and the IQR.

118. Based upon research at De Anza College, it is believed that about 19 percent of the student population speaks a language other than English at home. Suppose that a study was done this year to see if that percent has decreased. Ninety-eight students were randomly surveyed with the following results: Fourteen said that they speak a language other than English at home.

  1. State an appropriate null hypothesis.
  2. State an appropriate alternative hypothesis.
  3. Define the random variable, P′.
  4. Calculate the test statistic.
  5. Calculate the p-value.
  6. At the 5 percent level of decision, what is your decision about the null hypothesis?
  7. What is the Type I error?
  8. What is the Type II error?


119. Assume that you are an emergency paramedic called in to rescue victims of an accident. You need to help a patient who is bleeding profusely. The patient is also considered to be a high risk for contracting a blood-borne illness. Assume that the null hypothesis is that the patient does not have the a blood-borne illness. What is a Type I error?

120. It is often said that Californians are more casual than the rest of Americans. Suppose that a survey was done to see if the proportion of Californian professionals that wear jeans to work is greater than the proportion of non-Californian professionals. Fifty of each was surveyed with the following results: Fifteen Californians wear jeans to work and six non-Californians wear jeans to work.
Let C = Californian professional; NC = non-Californian professional

  1. State appropriate null and alternate hypotheses.
  2. Define the random variable.
  3. Calculate the test statistic and p-value.
  4. At the 5 percent significance level, what is your decision?
  5. What is the Type I error?
  6. What is the Type II error?


Use the following information to answer the next two exercises. A group of statistics students have developed a technique that they feel will lower their anxiety level on statistics exams. They measured their anxiety level at the start of the quarter and again at the end of the quarter. Recorded is the paired data in that order: (1,000, 900); (1,200, 1,050); (600, 700); (1,300, 1,100); (1,000, 900); (900, 900).

121. This is a test of (pick the best answer) —

  1. large samples, and independent means
  2. small samples, and independent means
  3. dependent means


122. State the distribution to use for the test.

Chapter 12

Use the following information to answer the next two exercises. A recent survey of U.S. teenagers was answered by 720 teenagers, age 15–18. Six percent of teenagers surveyed said they are planning on going to college in another country. We are interested in the true proportion of U.S. teens, ages 15–18, who are planning on going to college in another country.

123. Find the 95 percent confidence interval for the true proportion of U.S. teens, ages 15–19, who are planning to go to college in another country.

124. The report also stated that the results of the survey are accurate to within ±3.7 percent at the 95 percent confidence level. Suppose that a new study is to be done. It is desired to be accurate to within 2 percent of the 95 percent confidence level. What is the minimum number that should be surveyed?

125. Given X ~ Exp ( 1 3 ) ( 1 3 ) . Sketch the graph that depicts: P(x > 1).

Use the following information to answer the next three exercises. The amount of money a customer spends in one trip to the supermarket is known to have an exponential distribution. Suppose the mean amount of money a customer spends in one trip to the supermarket is $72.

126. Find the probability that one customer spends less than $72 in one trip to the supermarket?

127. Suppose five customers pool their money. How much money altogether would you expect the five customers to spend in one trip to the supermarket in dollars?

128. State the distribution to use if you want to find the probability that the mean amount spent by five customers in one trip to the supermarket is less than $60.

Chapter 13

Use the following information to answer the next two exercises. Suppose that the probability of a drought in any independent year is 20 percent. Out of those years in which a drought occurs, the probability of water rationing is 10 percent. However, in any year, the probability of water rationing is 5 percent.

129. What is the probability of both a drought and water rationing occurring?

130. Out of the years with water rationing, find the probability that there is a drought.

Use the following information to answer the next three exercises.

Apple Pumpkin Pecan
Female 40 10 30
Male 20 30 10
Table A7

131. Suppose that one individual is randomly chosen. Find the probability that the person’s favorite pie is apple or the person is male.

132. Suppose that one male is randomly chosen. Find the probability his favorite pie is pecan.

133. Conduct a hypothesis test to determine if favorite pie type and gender are independent.

Use the following information to answer the next two exercises. Let’s say that the probability that an adult watches the news at least once per week is .60.

134. We randomly survey 14 people. On average, how many people do we expect to watch the news at least once per week?

135. We randomly survey 14 people. Of interest is the number that watch the news at least once per week. State the distribution of X. X ~ ________.

136. The following histogram is most likely to be a result of sampling from which distribution?

This graph is an unlabeled histogram. The distribution is roughly symmetric. There is a single peak in the center of the graph and heights of bars decrease from that point toward each end of the graph.
Figure A6
  1. Chi-square
  2. Geometric
  3. Uniform
  4. Binomial


137. The ages of De Anza evening students is known to be normally distributed with a population mean of 40 and a population standard deviation of six. A sample of six De Anza evening students reported their ages in years as: 28; 35; 47; 45; 30; 50. Find the probability that the mean of six ages of randomly chosen students is less than 35 years. Hint—Find the sample mean.

138. A math exam was given to all the fifth grade children attending Country School. Two random samples of scores were taken. The null hypothesis is that the mean math scores for boys and girls in fifth grade are the same. Conduct a hypothesis test.

n x ¯ x ¯ s2
Boys 55 82 29
Girls 60 86 46
Table A8

139. In a survey of 80 males, 55 had played an organized sport growing up. Of the 70 females surveyed, 25 had played an organized sport growing up. We are interested in whether the proportion for males is higher than the proportion for females. Conduct a hypothesis test.

140. Which of the following is preferable when designing a hypothesis test?

  1. Maximize α and minimize β
  2. Minimize α and maximize β
  3. Maximize α and β
  4. Minimize α and β


Use the following information to answer the next three exercises. One hundred twenty people were surveyed as to their favorite beverage. The results are below.

Beverage/Age 0–9 10–19 20–29 30+ Totals
Milk 14 10 6 0 30
Soda 3 8 26 15 52
Juice 7 12 12 7 38
Totals 24 330 44 22 120
Table A9

141. Are the events of milk and 30+—

  1. independent events? Justify your answer.
  2. mutually exclusive events? Justify your answer.


142. Suppose that one person is randomly chosen. Find the probability that person is 10–19 given that he or she prefers juice.

143. Are Preferred Beverage and Age independent events? Conduct a hypothesis test.

144. Given the following histogram, which distribution is the data most likely to come from?

This graph is an unlabeled histogram. The heights of the bars do not vary much across the distribution.
Figure A7
  1. Uniform
  2. Exponential
  3. Normal
  4. Chi-square


Solutions

Chapter 3

1. C Parameter

2. A Population

3. B Statistic

4. D Sample

5. E Variable

6. quantitative continuous

7.

  1. 2.27
  2. 3.04
  3. –1, 4, 4


8. Answers will vary.

Chapter 4

9. C (.80)(.30)

10. B No, and they are not mutually exclusive either.

11. A All employed adult women

12. .5773

13. .0522

14. B The middle fifty percent of the members lost from 2 to 8.5 lbs.

15. C All of the data have the same value.

16. C The lowest data value is the median.

17. .279

18. B No, I expect to come out behind in money.

19. X = the number of patients calling in claiming to have the flu, who actually have the flu.
X = 0, 1, 2, …25

20. B(25, .04)

21. .0165

22. 1

23. C Quantitative discrete

24. all words used by Tom Clancy in his novels

Chapter 5

25.

  1. 24 percent
  2. 27 percent


26. qualitative

27. .36

28. .7636

29.

  1. no
  2. no


30. B(10, .76)

31. .9330

32.

  1. X = the number of questions posted to the statistics listserv per day.
  2. X = 0, 1, 2,…
  3. X ~ P(2)
  4. 0


33. $150

34. Matt

35.

  1. False
  2. True
  3. False
  4. False


36. 16

37. first quartile: 2
second quartile: 2
third quartile: 3

38. 0.5

39. 7 15 7 15

40. 2 15 2 15

Chapter 6

41.

  1. True
  2. True
  3. False – the median and the mean are the same for this symmetric distribution.
  4. True


42.

  1. 8
  2. 8
  3. P(x < k) = 0.65 = (k – 3) ( 1 10 ) ( 1 10 ) . k = 9.5


43.

  1. False – 3 4 3 4 of the data are at most five.
  2. True – each quartile has 25 percent of the data.
  3. False – that is unknown.
  4. False – 50 percent of the data are four or less.


44. D G and H are independent events.

45.

  1. False – J and K are independent so they are not mutually exclusive which would imply dependency (meaning P(J AND K) is not 0).
  2. False – see answer c.
  3. True – P(J OR K) = P(J) + P(K) – P(J AND K) = P(J) + P(K) – P(J)P(K) = .3 + .6 – (.3)(.6) = .72. Note the P(J AND K) = P(J)P(K) because J and K are independent.
  4. False – J and K are independent so P(J) = P(J|K).


46. A P(5)

Chapter 7

47. A U(0, 4)

48. B 2 hours

49. A 1 4 1 4

50.

  1. .7165
  2. 4.16
  3. 0


51. C 5 years

52. C exponential

53. .63

54. A B(14, .20)

55. A B(14, .20)

Chapter 8

56. C The mean amount of weight lost by 15 people on the special weight-loss diet.

57. .9951

58. 12.99

59. C 1 2 1 2

60. B .60

61. C N(60, 5.477)

62. .9990

63. A eight days

64. C .7500

65. A 80 percent

66. B 35 percent

67. B no

68. B Quantitative continuous

69. C 150

70. D .06

71. C .44

72. B 0

Chapter 9

73. D Matt is shorter than the average 14 year old boy.

74. Answers will vary.

75.

x Relative Frequency Cumulative Relative Frequency
1 .3 .3
2 .2 .2
4 .4 .4
5 .1 .1
Table A10

76.

  1. 2.8
  2. 1.48
  3. 90 percent


77. M = 3; Q1 = 1; Q3 = 4

78. 1 and 4

79. D 8 70 8 70

80. C 40 70 40 70

81. A 9 19 9 19

82. B False

83. B False

84. B False

85.

  1. X = the number of pies Lee bakes every day.
  2. P(20)
  3. .1122


86. CI: (5.25, 8.48)

87.

  1. uniform
  2. exponential
  3. normal


Chapter 10

88. 77 100 77 100

89. 12 42 12 42

90.

  1. False
  2. False
  3. True
  4. False


91. N(180, 16.43)

92. A The distribution for X ¯ X ¯ is still uniform with the same mean and standard deviation as the distribution for X.

93. C The distribution for X X is normal with a larger mean and a larger standard deviation than the distribution for X.

94. N( 2,  .25 16 ) N( 2,  .25 16 )

95. Answers will vary.

96. .5000

97. 7.6

98. 5

99. .9431

Chapter 11

100. 7.5

101. .0122

102. N(7, .63)

103. .9911

104. B exponential

105.

  1. True
  2. False
  3. False


106. Answers will vary.

107. Student’s t with df = 15

108. (560.07, 719.93)

109. quantitative continuous data

110. quantitative discrete data

111.

  1. X = the number of patients with a shotgun wound the emergency room gets per 28 days.
  2. P(4)
  3. .0183


112. greater than

113. no; P(x = 8) = .0348

114. You will lose $5.

115. Becca

116. 14

117. sample mean = 3.2
sample standard deviation = 1.85
median = 3
Q1 = 2
Q3 = 5
IQR = 3

118. d. z = –1.19
e. .1171
f. Do not reject the null hypothesis.

119. We conclude that the patient does have the illness when, in fact, the patient does not.

120. c. z = 2.21; p = .0136
d. Reject the null hypothesis.
e. We conclude that the proportion of Californian professionals that wear jeans to work is greater than the proportion of non-Californian professionals when, in fact, it is not greater.
f. We cannot conclude that the proportion of Californian professionals that wear jeans to work is greater than the proportion of non-Californian professionals when, in fact, it is greater.

121. C dependent means

122. t5

Chapter 12

123. (.0424, .0770)

124. 2,401

125. Check student's solution.

126. .6321

127. $360

128. N( 72,  72 5 ) N( 72,  72 5 )

Chapter 13

129. .02

130. .40

131. 100 140 100 140

132. 10 60 10 60

133. p-value = 0; reject the null hypothesis; conclude that they are dependent events

134. 8.4

135. B(14, .60)

136. D Binomial

137. .3669

138. p-value = .0006; reject the null hypothesis; conclude that the averages are not equal

139. p-value = 0; reject the null hypothesis; conclude that the proportion of males is higher

140. minimize α and β

141.

  1. no
  2. yes, P(M AND 30+) = 0


142. 12 38 12 38

143. no; p-value = 0

144. A uniform

References

Baran, D. (2010). Twenty percent of Americans have never used email. Retrieved from http://www.webguild.org/20080519/20-percent-of-americans-have-never-used-email.

Parade Magazine. (n.d.). Retrieved from https://parade.com/.

San Jose Mercury News. (n.d.). Retrieved from http://www.mercurynews.com/.

Order a print copy

As an Amazon Associate we earn from qualifying purchases.

Citation/Attribution

This book may not be used in the training of large language models or otherwise be ingested into large language models or generative AI offerings without OpenStax's permission.

Want to cite, share, or modify this book? This book uses the Creative Commons Attribution License and you must attribute Texas Education Agency (TEA). The original material is available at: https://www.texasgateway.org/book/tea-statistics . Changes were made to the original material, including updates to art, structure, and other content updates.

Attribution information
  • If you are redistributing all or part of this book in a print format, then you must include on every physical page the following attribution:
    Access for free at https://openstax.org/books/statistics/pages/1-introduction
  • If you are redistributing all or part of this book in a digital format, then you must include on every digital page view the following attribution:
    Access for free at https://openstax.org/books/statistics/pages/1-introduction
Citation information

© Jan 23, 2024 Texas Education Agency (TEA). The OpenStax name, OpenStax logo, OpenStax book covers, OpenStax CNX name, and OpenStax CNX logo are not subject to the Creative Commons license and may not be reproduced without the prior and express written consent of Rice University.