Skip to ContentGo to accessibility pageKeyboard shortcuts menu
OpenStax Logo
Introductory Statistics

Bringing It Together: Homework

Introductory StatisticsBringing It Together: Homework

108.

Santa Clara County, CA, has approximately 27,873 Japanese-Americans. Their ages are as follows:

Age Group Percent of Community
0–17 18.9
18–24 8.0
25–34 22.8
35–44 15.0
45–54 13.1
55–64 11.9
65+ 10.3
Table 2.77
  1. Construct a histogram of the Japanese-American community in Santa Clara County, CA. The bars will not be the same width for this example. Why not? What impact does this have on the reliability of the graph?
  2. What percentage of the community is under age 35?
  3. Which box plot most resembles the information above?
Three box plots with values between 0 and 100.  Plot i has Q1 at 24, M at 34, and Q3 at 53; Plot ii has Q1 at 18, M at 34, and Q3 at 45; Plot iii has Q1 at 24, M at 25, and Q3 at 54.
Figure 2.47
109.

Javier and Ercilia are supervisors at a shopping mall. Each was given the task of estimating the mean distance that shoppers live from the mall. They each randomly surveyed 100 shoppers. The samples yielded the following information.

Javier Ercilia
x¯x¯ 6.0 miles 6.0 miles
ss 4.0 miles 7.0 miles
Table 2.78
  1. How can you determine which survey was correct ?
  2. Explain what the difference in the results of the surveys implies about the data.
  3. If the two histograms depict the distribution of values for each supervisor, which one depicts Ercilia's sample? How do you know?
    This shows two histograms. The first histogram shows a fairly symmetrical distribution with a mode of 6. The second histogram shows a uniform distribution.
    Figure 2.48
  4. If the two box plots depict the distribution of values for each supervisor, which one depicts Ercilia’s sample? How do you know?
    This shows two horizontal boxplots. The first boxplot is graphed over a number line from 0 to 21. The first whisker extends from 0 to 1. The box begins at the first quartile, 1, and ends at the third quartile, 14. A vertical, dashed line marks the median at 6. The second whisker extends from the third quartile to the largest value, 21. The second boxplot is graphed over a number line from 0 to 12.  The first whisker extends from 0 to 4. The box begins at the first quartile, 4, and ends at the third quartile, 9. A vertical, dashed line marks the median at 6. The second whisker extends from the third quartile to the largest value, 12.
    Figure 2.49

Use the following information to answer the next three exercises: We are interested in the number of years students in a particular elementary statistics class have lived in California. The information in the following table is from the entire section.

Number of years Frequency Number of years Frequency
7 1 22 1
14 3 23 1
15 1 26 1
18 1 40 2
19 4 42 2
20 3
Total = 20
Table 2.79
110.

What is the IQR?

  1. 8
  2. 11
  3. 15
  4. 35
111.

What is the mode?

  1. 19
  2. 19.5
  3. 14 and 20
  4. 22.65
112.

Is this a sample or the entire population?

  1. sample
  2. entire population
  3. neither
113.

Twenty-five randomly selected students were asked the number of movies they watched the previous week. The results are as follows:

# of movies Frequency
0 5
1 9
2 6
3 4
4 1
Table 2.80
  1. Find the sample mean x¯x¯.
  2. Find the approximate sample standard deviation, s.
114.

Forty randomly selected students were asked the number of pairs of sneakers they owned. Let X = the number of pairs of sneakers owned. The results are as follows:

X Frequency
1 2
2 5
3 8
4 12
5 12
6 0
7 1
Table 2.81
  1. Find the sample mean x x
  2. Find the sample standard deviation, s
  3. Construct a histogram of the data.
  4. Complete the columns of the chart.
  5. Find the first quartile.
  6. Find the median.
  7. Find the third quartile.
  8. Construct a box plot of the data.
  9. What percent of the students owned at least five pairs?
  10. Find the 40th percentile.
  11. Find the 90th percentile.
  12. Construct a line graph of the data
  13. Construct a stemplot of the data
115.

Following are the published weights (in pounds) of all of the team members of the San Francisco 49ers from a previous year.

177; 205; 210; 210; 232; 205; 185; 185; 178; 210; 206; 212; 184; 174; 185; 242; 188; 212; 215; 247; 241; 223; 220; 260; 245; 259; 278; 270; 280; 295; 275; 285; 290; 272; 273; 280; 285; 286; 200; 215; 185; 230; 250; 241; 190; 260; 250; 302; 265; 290; 276; 228; 265

  1. Organize the data from smallest to largest value.
  2. Find the median.
  3. Find the first quartile.
  4. Find the third quartile.
  5. Construct a box plot of the data.
  6. The middle 50% of the weights are from _______ to _______.
  7. If our population were all professional football players, would the above data be a sample of weights or the population of weights? Why?
  8. Assume the population was the San Francisco 49ers. Find:
    1. the population mean, μ.
    2. the population standard deviation, σ.
    3. the weight that is two standard deviations below the mean.
    4. When Steve Young, quarterback, played football, he weighed 205 pounds. How many standard deviations above or below the mean was he?
  9. That same year, the mean weight for the Dallas Cowboys was 240.08 pounds with a standard deviation of 44.38 pounds. Emmit Smith weighed in at 209 pounds. With respect to his team, who was lighter, Smith or Young? How did you determine your answer?
116.

One hundred teachers attended a seminar on mathematical problem solving. The attitudes of a representative sample of 12 of the teachers were measured before and after the seminar. A positive number for change in attitude indicates that a teacher's attitude toward math became more positive. The 12 change scores are as follows:

3; 8; –1; 2; 0; 5; –3; 1; –1; 6; 5; –2

  1. What is the mean change score?
  2. What is the standard deviation for this population?
  3. What is the median change score?
  4. Find the change score that is 2.2 standard deviations below the mean.
117.

Refer to Figure 2.50 determine which of the following are true and which are false. Explain your solution to each part in complete sentences.

This shows three graphs. The first is a histogram with a mode of 3 and fairly symmetrical distribution between 1 (minimum value) and 5 (maximum value). The second graph is a histogram with peaks at 1 (minimum value) and 5 (maximum value) with 3 having the lowest frequency. The third graph is a box plot. The first whisker extends from 0 to 1. The box begins at the firs quartile, 1, and ends at the third quartile,6. A vertical, dashed line marks the median at 3. The second whisker extends from 6 on.
Figure 2.50
  1. The medians for all three graphs are the same.
  2. We cannot determine if any of the means for the three graphs is different.
  3. The standard deviation for graph b is larger than the standard deviation for graph a.
  4. We cannot determine if any of the third quartiles for the three graphs is different.
118.

In a recent issue of the IEEE Spectrum, 84 engineering conferences were announced. Four conferences lasted two days. Thirty-six lasted three days. Eighteen lasted four days. Nineteen lasted five days. Four lasted six days. One lasted seven days. One lasted eight days. One lasted nine days. Let X = the length (in days) of an engineering conference.

  1. Organize the data in a chart.
  2. Find the median, the first quartile, and the third quartile.
  3. Find the 65th percentile.
  4. Find the 10th percentile.
  5. Construct a box plot of the data.
  6. The middle 50% of the conferences last from _______ days to _______ days.
  7. Calculate the sample mean of days of engineering conferences.
  8. Calculate the sample standard deviation of days of engineering conferences.
  9. Find the mode.
  10. If you were planning an engineering conference, which would you choose as the length of the conference: mean; median; or mode? Explain why you made that choice.
  11. Give two reasons why you think that three to five days seem to be popular lengths of engineering conferences.
119.

A survey of enrollment at 35 community colleges across the United States yielded the following figures:

6414; 1550; 2109; 9350; 21828; 4300; 5944; 5722; 2825; 2044; 5481; 5200; 5853; 2750; 10012; 6357; 27000; 9414; 7681; 3200; 17500; 9200; 7380; 18314; 6557; 13713; 17768; 7493; 2771; 2861; 1263; 7285; 28165; 5080; 11622

  1. Organize the data into a chart with five intervals of equal width. Label the two columns "Enrollment" and "Frequency."
  2. Construct a histogram of the data.
  3. If you were to build a new community college, which piece of information would be more valuable: the mode or the mean?
  4. Calculate the sample mean.
  5. Calculate the sample standard deviation.
  6. A school with an enrollment of 8000 would be how many standard deviations away from the mean?


Use the following information to answer the next two exercises. X = the number of days per week that 100 clients use a particular exercise facility.

x Frequency
0 3
1 12
2 33
3 28
4 11
5 9
6 4
Table 2.82
120.

The 80th percentile is _____

  1. 5
  2. 80
  3. 3
  4. 4
121.

The number that is 1.5 standard deviations BELOW the mean is approximately _____

  1. 0.7
  2. 4.8
  3. –2.8
  4. Cannot be determined
122.

Suppose that a publisher conducted a survey asking adult consumers the number of fiction paperback books they had purchased in the previous month. The results are summarized in the Table 2.83.

# of books Freq. Rel. Freq.
0 18
124
2 24
322
4 15
510
7 5
91
Table 2.83
  1. Are there any outliers in the data? Use an appropriate numerical test involving the IQR to identify outliers, if any, and clearly state your conclusion.
  2. If a data value is identified as an outlier, what should be done about it?
  3. Are any data values further than two standard deviations away from the mean? In some situations, statisticians may use this criteria to identify data values that are unusual, compared to the other data values. (Note that this criteria is most appropriate to use for data that is mound-shaped and symmetric, rather than for skewed data.)
  4. Do parts a and c of this problem give the same answer?
  5. Examine the shape of the data. Which part, a or c, of this question gives a more appropriate result for this data?
  6. Based on the shape of the data which is the most appropriate measure of center for this data: mean, median or mode?
Order a print copy

As an Amazon Associate we earn from qualifying purchases.

Citation/Attribution

This book may not be used in the training of large language models or otherwise be ingested into large language models or generative AI offerings without OpenStax's permission.

Want to cite, share, or modify this book? This book uses the Creative Commons Attribution License and you must attribute OpenStax.

Attribution information
  • If you are redistributing all or part of this book in a print format, then you must include on every physical page the following attribution:
    Access for free at https://openstax.org/books/introductory-statistics/pages/1-introduction
  • If you are redistributing all or part of this book in a digital format, then you must include on every digital page view the following attribution:
    Access for free at https://openstax.org/books/introductory-statistics/pages/1-introduction
Citation information

© Jun 23, 2022 OpenStax. Textbook content produced by OpenStax is licensed under a Creative Commons Attribution License . The OpenStax name, OpenStax logo, OpenStax book covers, OpenStax CNX name, and OpenStax CNX logo are not subject to the Creative Commons license and may not be reproduced without the prior and express written consent of Rice University.