Barbara Illowsky; Susan Dean

1.

Stem	Leaf
1	9 9 9
2	0 1 1 5 5 5 6 6 8 9
3	1 1 2 2 3 4 5 6 7 7 8 8 8 8
4	1 3 3

Table 2.87

3.

Stem	Leaf
2	5 5 6 7 7 8
3	0 0 1 2 3 3 5 5 5 7 7 9
4	1 6 9
5	6 7 7
6	1

Table 2.88

5.

This is a line graph that matches the supplied data. The x-axis shows the number of times people reported visiting a store before making a major purchase, and the y-axis shows the frequency.

Figure 2.53

7.

This is a line graph that matches the supplied data. The x-axis shows the number of TV shows a kid watches each day, and the y-axis shows the frequency.

Figure 2.54

9.

This is a bar graph that matches the supplied data. The x-axis shows the seasons of the year, and the y-axis shows the proportion of birthdays.

Figure 2.55

11.

This is a bar graph that matches the supplied data. The x-axis shows the county high schools, and the y-axis shows the proportion of county students.

Figure 2.56

13.

65

15.

The relative frequency shows the proportion of data points that have each value. The frequency tells the number of data points that have each value.

17.

Answers will vary. One possible histogram is shown below.

On this bar graph the x-axis holds the number of cars sold and the y-axis shoes the frequency. 3 cars selling has a frequency of 14, 4 cars selling has a frequency of 19, 5 cars selling has a frequency of 12, 6 cars selling has a frequency of 9, and seven cars selling has a frequency of 11.

Figure 2.57

19.

Find the midpoint for each class. These will be graphed on the x-axis. The frequency values will be graphed on the y-axis values.

This is a frequency polygon that matches the supplied data. The x-axis shows the depth of hunger, and the y-axis shows the frequency.

Figure 2.58

21.

A graph is shown. The X axis has years 1855-1875. The Y axis show a number of births 40,000 through 130,000 going up by 5,000. There are three lines that indicate the number of males in blue, females, in yellow, and box sexes in a dark blue. The dark blue line with both sexes is the highest bar and it climbs by 90,000 up to119,000. The light blue line representing males begins at 47,000 and climbs to 64,000, and lastly the female line begins on 45,000 and ends on 55,000.

Figure 2.59

23.

The 40^th percentile is 37 years.
The 78^th percentile is 70 years.

25.

Jesse graduated 37^th out of a class of 180 students. There are 180 – 37 = 143 students ranked below Jesse. There is one rank of 37.

x = 143 and y = 1. $\frac{x + .5 y}{n}$ (100) = $\frac{143 + .5 (1)}{180}$ (100) = 79.72. Jesse’s rank of 37 puts him at the 80^th percentile.

27.

For runners in a race, it is more desirable to have a high percentile for speed. A high percentile means a higher speed, which is faster.
40 percent of runners ran at speeds of 7.5 miles per hour or less (slower), and 60 percent of runners ran at speeds of 7.5 miles per hour or more (faster).

29.

When waiting in line at the DMV, the 85^th percentile would be a long wait time compared to the other people waiting. 85 percent of people had shorter wait times than Mina. In this context, Mina would prefer a wait time corresponding to a lower percentile. 85 percent of people at the DMV waited 32 minutes or less. 15 percent of people at the DMV waited 32 minutes or longer.

31.

The manufacturer and the consumer would be upset. This is a large repair cost for the damages, compared to the other cars in the sample. INTERPRETATION: 90 percent of the crash-tested cars had damage repair costs of $1,700 or less; only 10 percent had damage repair costs of $1,700 or more.

33.

You can afford 34 percent of houses. 66 percent of the houses are too expensive for your budget. INTERPRETATION: 34 percent of houses cost $240,000 or less; 66 percent of houses cost $240,000 or more.

35.

4

37.

6 – 4 = 2

39.

6

41.

More than 25 percent of salespersons sell four cars in a typical week. You can see this concentration in the box plot because the first quartile is equal to the median. The top 25 percent and the bottom 25 percent are spread out evenly; the whiskers have the same length.

43.

Mean: 16 + 17 + 19 + 20 + 20 + 21 + 23 + 24 + 25 + 25 + 25 + 26 + 26 + 27 + 27 + 27 + 28 + 29 + 30 + 32 + 33 + 33 + 34 + 35 + 37 + 39 + 40 = 738;

$\frac{738}{27}$ = 27.33

45.

The most frequent lengths are 25 and 27, which occur three times. Mode = 25, 27

47.

4

49.

The data are symmetrical. The median is 3, and the mean is 2.85. They are close, and the mode lies close to the middle of the data, so the data are symmetrical.

51.

The data are skewed right. The median is 87.5, and the mean is 88.2. Even though they are close, the mode lies to the left of the middle of the data, and there are many more instances of 87 than any other number, so the data are skewed right.

53.

When the data are symmetrical, the mean and median are close or the same.

55.

The distribution is skewed right because it looks pulled out to the right.

57.

The mean is 4.1 and is slightly greater than the median, which is 4.

59.

The mode and the median are the same. In this case, both 5.

61.

The distribution is skewed left because it looks pulled out to the left.

63.

Both the mean and the median are 6.

65.

The mode is 12, the median is 13.5, and the mean is 15.1. The mean is the largest.

67.

The mean tends to reflect skewing the most because it is affected the most by outliers.

69.

sampling variability

70.

induced variability

71.

measurement variability

72.

natural variability

73.

s = 34.5

75.

For Fredo: z = $\frac{.158 – .166}{.012}$ = –0.67.

For Karl: z = $\frac{.177 – .189}{.015}$ = –.8.

Fredo’s z score of –.67 is higher than Karl’s z score of –.8. For batting average, higher values are better, so Fredo has a better batting average compared to his team.

77.

$s_{x} = \sqrt{\frac{\sum f m^{2}}{n} - {\bar{x}}^{2}} = \sqrt{\frac{193,157.45}{30} - {79.5}^{2}} = 10.88$
$s_{x} = \sqrt{\frac{\sum f m^{2}}{n} - {\bar{x}}^{2}} = \sqrt{\frac{380,945.3}{101} - {60.94}^{2}} = 7.62$
$s_{x} = \sqrt{\frac{\sum f m^{2}}{n} - {\bar{x}}^{2}} = \sqrt{\frac{440,051.5}{86} - {70.66}^{2}} = 11.14$

79.

Example solution for using the random number generator for the TI-84+ to generate a simple random sample of eight states. Instructions are as follows.
- Number the entries in the table 1–51 (includes Washington, DC; numbered vertically)
- Press MATH
- Arrow over to PRB
- Press 5:randInt(
- Enter 51,1,8)
Eight numbers are generated (use the right arrow key to scroll through the numbers). The numbers correspond to the numbered states (for this example: {47 21 9 23 51 13 25 4}. If any numbers are repeated, generate a different number by using 5:randInt(51,1)). Here, the states (and Washington DC) are {Arkansas, Washington DC, Idaho, Maryland, Michigan, Mississippi, Virginia, Wyoming}.

Corresponding percents are {30.1, 22.2, 26.5, 27.1, 30.9, 34.0, 26.0, 25.1}.

Figure 2.60
Figure 2.61
Figure 2.62

81.

Amount($)	Frequency	Relative Frequency
51–100	5	.08
101–150	10	.17
151–200	15	.25
201–250	15	.25
251–300	10	.17
301–350	5	.08

Table 2.89 Singles

Amount ($)	Frequency	Relative Frequency
100–150	5	.07
201–250	5	.07
251–300	5	.07
301–350	5	.07
351–400	10	.14
401–450	10	.14
451–500	10	.14
501–550	10	.14
551–600	5	.07
601–650	5	.07

Table 2.90 Couples

See Table 2.89 and Table 2.90.
In the following histogram, data values that fall on the right boundary are counted in the class interval, while values that fall on the left boundary are not counted, with the exception of the first interval, where both boundary values are included.

Figure 2.63
In the following histogram, the data values that fall on the right boundary are counted in the class interval, while values that fall on the left boundary are not counted, with the exception of the first interval, where values on both boundaries are included.

Figure 2.64
Compare the two graphs.
1. Answers may vary. Possible answers include the following:
  - Both graphs have a single peak.
  - Both graphs use class intervals with width equal to $50
2. Answers may vary. Possible answers include the following:
  - The couples graph has a class interval with no values
  - It takes almost twice as many class intervals to display the data for couples
3. Answers may vary. Possible answers include the following. The graphs are more similar than different because the overall patterns for the graphs are the same.
Check student's solution.
Compare the graph for the singles with the new graph for the couples:
1. - Both graphs have a single peak
  - Both graphs display six class intervals
  - Both graphs show the same general pattern
2. Answers may vary. Possible answers include the following. Although the width of the class intervals for couples is double that of the class intervals for singles, the graphs are more similar than they are different.
Answers may vary. Possible answers include the following. You are able to compare the graphs interval by interval. It is easier to compare the overall patterns with the new scale on the couples graph. Because a couple represents two individuals, the new scale leads to a more accurate comparison.
Answers may vary. Possible answers include the following. Based on the histograms, it seems that spending does not vary much from singles to individuals who are part of a couple. The overall patterns are the same. The range of spending for couples is approximately double the range for individuals.

83.

c

85.

Answers will vary.

87.

1 – (.02+.09+.19+.26+.18+.17+.02+.01) = .06
.19+.26+.18 = .63
Check student’s solution.
40^th percentile will fall between 30,000 and 40,000

80^th percentile will fall between 50,000 and 75,000
Check student’s solution.

89.

more children; the left whisker shows that 25 percent of the population are children 17 and younger; the right whisker shows that 25 percent of the population are adults 50 and older, so adults 65 and over represent less than 25 percent
62.4 percent

91.

Answers will vary. Possible answer: State University conducted a survey to see how involved its students are in community service. The box plot shows the number of community service hours logged by participants over the past year.
Because the first and second quartiles are close, the data in this quarter is very similar. There is not much variation in the values. The data in the third quarter is much more variable, or spread out. This is clear because the second quartile is so far away from the third quartile.

93.

Each box plot is spread out more in the greater values. Each plot is skewed to the right, so the ages of the top 50 percent of buyers are more variable than the ages of the lower 50 percent.
The black sports car is most likely to have an outlier. It has the longest whisker.
Comparing the median ages, younger people tend to buy the black sports car, while older people tend to buy the white sports car. However, this is not a rule, because there is so much variability in each data set.
The second quarter has the smallest spread. There seems to be only a three-year difference between the first quartile and the median.
The third quarter has the largest spread. There seems to be approximately a 14-year difference between the median and the third quartile.
IQR ~ 17 years
There is not enough information to tell. Each interval lies within a quarter, so we cannot tell exactly where the data in that quarter is are concentrated.
The interval from 31 to 35 years has the fewest data values. Twenty-five percent of the values fall in the interval 38 to 41, and 25 percent fall between 41 and 64. Since 25 percent of values fall between 31 and 38, we know that fewer than 25 percent fall between 31 and 35.

96.

the mean percentage, $\bar{x} = \frac{1,328.65}{50} = 26.75$

98.

The median value is the middle value in the ordered list of data values. The median value of a set of 11 will be the sixth number in order. Six years will have totals at or below the median.

100.

474 FTES

102.

919

104.

mean = 1,809.3
median = 1,812.5
standard deviation = 151.2
first quartile = 1,690
third quartile = 1,935
IQR = 245

106.

Hint: think about the number of years covered by each time period and what happened to higher education during those periods.

108.

For pianos, the cost of the piano is .4 standard deviations BELOW the mean. For guitars, the cost of the guitar is 0.25 standard deviations ABOVE the mean. For drums, the cost of the drum set is 1.0 standard deviations BELOW the mean. Of the three, the drums cost the lowest in comparison to the cost of other instruments of the same type. The guitar costs the most in comparison to the cost of other instruments of the same type.

110.

$\bar{x} = 23.32$
Using the TI 83/84, we obtain a standard deviation of: $s_{x} = 12.95.$
The obesity rate of the United States is 10.58 percent higher than the average obesity rate.
Since the standard deviation is 12.95, we see that 23.32 + 12.95 = 36.27 is the disease percentage that is one standard deviation from the mean. The U.S. disease rate is slightly less than one standard deviation from the mean. Therefore, we can assume that the United States, although 34 percent have the disease, does not have an unusually high percentage of people with the disease.

112.

For graph, check student's solution.
49.7 percent of the community is under the age of 35
Based on the information in the table, graph (a) most closely represents the data.

114.

a

116.

b

117.

1.48
1.12

119.

174, 177, 178, 184, 185, 185, 185, 185, 188, 190, 200, 205, 205, 206, 210, 210, 210, 212, 212, 215, 215, 220, 223, 228, 230, 232, 241, 241, 242, 245, 247, 250, 250, 259, 260, 260, 265, 265, 270, 272, 273, 275, 276, 278, 280, 280, 285, 285, 286, 290, 290, 295, 302
241
205.5
272.5
205.5, 272.5
sample
population
1. 236.34
2. 37.50
3. 161.34
4. .84 standard deviations below the mean
young

121.

true
true
true
false

123.

Enrollment Frequency

1,000–5,000 10

5,000–10,000 16

10,000–15,000 3

15,000–20,000 3

20,000–25,000 1

25,000–30,000 2

Table 2.91
Check student’s solution.
mode
8,628.74
6,943.88
–0.09

125.

a

Enrollment	Frequency
1,000–5,000	10
5,000–10,000	16
10,000–15,000	3
15,000–20,000	3
20,000–25,000	1
25,000–30,000	2

Solutions