Solutions Manual For Statistics For Business and Economics 13th Edition by Mcclave Ibsn 9780134648361

October 12, 2022 | Author: Anonymous | Category: N/A
Share Embed Donate


Short Description

Download Solutions Manual For Statistics For Business and Economics 13th Edition by Mcclave Ibsn 9780134648361...

Description

 

Solutions Manual for Statistics for Business and Economics 13th Edition by McClave IBSN 9780134648361 product/solutions-manual-for-statisti ons-manual-for-statistics-for-business-and-econ cs-for-business-and-economics-13th-editi omics-13th-edition-by-m on-by-mccl ccl Full Download: http://downloadlink.org/ http://downloadlink.org/product/soluti  

Chapter 2  Methods for Describing Sets of Data 2.1

First, we find the frequency of the grade A. The sum of the frequencies for all five grades must be 200. Therefore, subtract the sum of the frequencies of the other four grades from 200. The frequency for grade A is: 200  (36 + 90 + 30 + 28) = 200  184 = 16 To find the relative frequency for for each grade, divide the frequency frequency by the total sample size, 200. The relative frequency for the grade B is 36/200 = .18. The rest of the relative frequencies are found in a similar manner and appear in the table: Grade on Statistics Exam

Frequency

16 36 90 30 28 200

A: 90 100 B: 80   89 C: 65   79 D: 50   64 F: Below 50 Total 2.2

a.

.08 .18 .45 .15 .14 1.00

To find the frequency for each class, count the number of times each letter occurs. The frequencies for the three classes are: Class X Y Z Total

 b.

Relative Frequency

Frequency 8 9 3 20

The relative frequency for each class is found by dividing the frequency by the total sample size. The relative frequency for the class X is 8/20 = .40. The relative frequency for the class Y is 9/20 = .45. The relative frequency for the class Z is 3/20 = .15. Class X Y Z Total

Frequency 8 9 3 20

Relative Frequency .40 .45 .15 1.00

10 Copyright © 2018 Pearson Education, Inc.  

Full all chapters instant download please go to Solutions Manual, Manual, Test Bank site: downloadlink.org downloadlink.org

 

 Methods for Describing Describing Sets of Data

c.

The frequency bar chart is:

9 8 7   y   c   n   e   u   q   e   r    F

6 5 4 3 2 1 0 X

d.

 Y  Class

Z

 

The pie chart for the frequency distribution is: Pie Chart of Class Category X Y Z

Z 15.0%

X 40.0%

Y 45.0%

2.3

107

a.

 pU   

 b.

 pS   

c.

 p R 

d.

.615  360  221.4 , .328 360  118.1 , .057  360  20.5  



174 57 174 10 174

 .615

 

 .328  

 .057

 

Copyright © 2018 Pearson Education, Inc.  

11 

 

12

Chapter 2

e.

Using MINITAB, the pie chart is: Pie Chart of Location Category Urban Suburban Rural

Rural 5.7%

Suburban 32.8%

Urban 61.5%

f.

61.5% of the STEM participants are from urban areas, 32.8% are are from suburban areas, and 5.7% aare re from rural areas.

g.

Using MINITAB, the bar chart is: 70 60 50    t 40   n   e   c   r   e    P 30

20 10 0

Urban

Suburban

Rural

Loc Percent is calculated within all data.

Both charts give the same information. 2.4

a.

According to the pie chart, .760 of the sample currently have.a cable/satellite TV subscription at home. The total number of adults sampled who have a cable/satellite TV subscription at home is 1,521  .760 . 1, 521  180  300  2, 001 . The proportion is 2,001

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

 b.

13 

Using MINITAB, the pie chart is: Pie Chart of Subscribe Category Cable TV Cord cutter

Cord cutter 16.5%

Cable TV 83.5%

2.5

a.

The type of graph is a bar graph.

 b.

The variable measured for each of the robots is type of robotic limbs.

c.

From the graph, the design used the most is the “legs only” design.

d.

The relative frequencies are computed by dividing the frequencies by the total sam sample ple size. The total sample size is n = 106. The relative frequencies for each of the categories ar are: e:

e.

Type of Limbs

Frequency

 None Both Legs ONLY Wheels ONLY Total

15 8 63 20 106

Relative Frequency

15/106 = .142 8 / 106 = .075 63/106 = .594 20/106 = .189 1.000

Using MINITAB, the Pareto diagram is:

.60

.50   y   c   n   e .40   u   q   e   r    F   e .30   v    i    t   a    l   e    R .20

.10

0 Legs

Wheels

None

Both

Type Percent within all data.

 

Copyright © 2018 Pearson Education, Inc.  

 

14

2.6

Chapter 2

a.

Region is qualitative because it is not measured using numbers.

 b.

 p A  P     pUS   

c.

48 150 26

150





.32 ,

 pC   

10 150



.07 ,

 p E   

34 150



.23 ,

 p LA 



29 150



.19 ,

 p ME / A



3 150

 .02

,

.17  

Using MINITAB, the plot is 35

30

25    t 20   n   e   c   r   e    P 15

10

5

0

Asia-Pacific

Canada

Europe

Latin America Middle East/Africa United States

Region Percent is calculated within all data.

2.7

d.

The regions that mos mostt of the top 150 credit card card users serve are Asia-Pacific, Eur Europe, ope, Latin Latin America, and the United States.

a.

Using MINITAB, the pie chart is: Pie Chart of Product Explorer 12.0% Office 24.0%

Category Office Windows Explorer

Windows 64.0%

Explorer had the lowest proportion of security issues with the proportion

Copyright © 2018 Pearson Education, Inc.  

6 50



.12 .

 

 Methods for Describing Describing Sets of Data

 b.

Using MINITAB, the Pareto chart is: 50 40    t   n   e   c   r   e    P

30

20 10 0

  n    i  o   t   c  u   e   e  x   e    o  d   c     e   o  t   m    R  e

  n    i  o   a  t   e  v    l   e   e    g     i   l  e   v    i    P  r

  r  e   s  u    l  o   c   s    i   d   n     i  o   a  t   r  m   f  o    I  n

   i  c  e   r  v   s   e   f      l  o    i  a   n    D  e

  g     i  n   f   o  o   p   S

Bulletins Percent is calculated within all data.

The security bulletin with the highest frequency is Remote code code execution. Microsoft should focus on this repercussion. 2.8

a.

Using MINITAB, the Pareto chart is: 40

30

   t   n   e   c 20   r   e    P

10

0

WLAN/Single

WLAN/Multi

WSN/SINGLE

WSN/Multi

AHN/SINGLE

AHN/Multi

Network/Channel Percent is calculated within all data.

The network type and number of channels that suffered the most number of jamming attacks is WLAN/Single. The network/number of channels type that received the next most number of  jamming attacks is WSN/Single and WLAN/Multi. WLAN/Multi . The network/Number of channels type that suffered the least number of jamming attacks is AHN/Multi.  b.

Using MINITAB, the pie chart is: Pie Chart of Network Category WLAN WSN AHN

AHN 16.3%

WSN 27.5%

 

WLAN 56.3%

Copyright © 2018 Pearson Education, Inc.  

15 

 

16

Chapter 2

The network type that suffered the most jamming attacks is WLAN with more than half. The network type that suffered the least number of jamming attacks is AHN. 2.9

Using MINITAB, the pie chart is: Pie Chart of Degree Category None First Post

Post 10.4%

None 36.9%

First 52.7%

A little of half of the successful candidates had a First (Bachelor’s) degree, while a little more than a third of the successful candidates had no degree. Only about 10% of the successful candidates had graduate degrees. 2.10

Using MINITAB, the bar graphs of the 2 waves is: Chart of Job Status S tatus      A      B      M     o      N      k     r     o      W 90 80    t 70   n   e 60   c 50   r   e 40    P 30 20 10 0

1

     A      B      M     o      N      k     r     o      W

     A      B      M      k     r     o      W

     h     c      S     s     u      B      k     r     o      W     o      N

     A      B      M      k     r     o      W

     h     c      S     s     u      B      k     r     o      W     o      N

     d     a     r      G      k     r     o      W     o      N

     h     c      S

2

     d     a     r      G      k     r     o      W     o      N

     h     c      S

Job Status Panel variable: Wave; Percent with within in all data.

In wave 1, most of those taking the GMAT were working  265 2657 7 / 3244

  

.819 .819 and none had MBA’s. About

20% were not working but were in either a 4-year institution or other graduate school 3244  .181 . In wave 2, almost all were now working  1787  1372 / 32 3 244  .974 . Of those 36  551 / 32 working, more than half had MBA’s 1787 / 1787  1372  .566 . Of those not not working, most were in another graduate school.

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

2.11

17  

Using MINITAB, the Pareto diagram for the data is: Chart of Tenants 50

40

   t   n   e   c   r   e    P

30

20

10

0 Small

SmallStandard

Large Tenants

Major

 Anchor

Percent within all data.

 

Most of the tenants in UK shopping malls are small or small standard. They account for approximately 1, 821  .84 . Very few (less than 1%) of the tenants are anchor 84% of all tenants    711  819 / 1, anchors. s. 2.12

Using MINITAB, the side-by-side bar graphs are:

Chart of Acquisitions No 1980

 Yes 1990 100 75 50 25

   t   n   e   c   r   e    P 100

0

2000

75 50 25 0 No

 Yes

 Acquisitions  Acquisitio ns Panel variable: Year; Year; Percent within all data.

 

proportion of firms having In 1980, very few firms had acquisitions 18 / 1 1,, 96 963  .0 09  . By 1990, the proportion acquisitions increased to 350/ 2,197  .159 . By 2000, the proportion of firms having acquisitions increased ,778  .269 . to 748/ 2,77

Copyright © 2018 Pearson Education, Inc.  

 

18

2.13

Chapter 2

a.

Using MINITAB, the pie chart of the data is: Pie Chart of City Category SF NY LA CH

SF 25.0%

CH 25.9%

LA 16.1% NY 33.0%

 b.

Using MINITAB, the pie chart for San Francisco is: Pie Chart of Rating City = SF Category Excellent Good Bad

Excellent 10.1%

Bad 21.7%

Good 68.1%

c.

Using MINITAB, the bar charts are: Chart of Tweets Excellent

Good

Bad

LA

CH

60 45

  s    t   e   e   w    T    f   o    t   n   e 60   c   r   e    P 45

30 15

NY

SF

30 15 0

Excellent

Good

Bad

Rating Panel variable: City  Percent is calculated within all data.

Copyright © 2018 Pearson Education, Inc.  

0

 

 Methods for Describing Describing Sets of Data

d.

2.14

19 

In all cities, most customers rated the iphone 6 as ‘good’, while very few rated the iphone 6 as excellent.

Using MINITAB, a pie chart of the data is: Pie Chart of Measure Category Big Shows Funds Raised Members Paying visitors Total visitors

Big Shows 20.0%

Total visitors 26.7%

Funds Raised 23.3% Paying visitors 16.7%

Members 13.3%

Since the sizes of the slices are close to each other, it appears that the researcher is correct. There is a large amount of variation within the museum community with regard to performance measurement and evaluation. 2.15

a.

The variable measured by Performark is the length of time it took for each advertiser to respond back.

 b.

The pie chart is: Pie Chart of Response Time

13-59 days 25.6%

Never responded 23.3%

Category Never responded >120 days 60-120 days 13-59 days

>120 days 13.3%

60-120 days 37.8%

c.

00 00  3, 5 57 70 of the advertisers never respond to the sales lead. Twenty-one percent or .2117, 0

d.

The information from the pie chart does not indicate how effective the "bingo cards" are. It just indicates how long it takes advertisers to respond, if at all.

Copyright © 2018 Pearson Education, Inc.  

 

20

2.16

Chapter 2

Using MINITAB, the side-by-side bar graphs are:

Chart of Dive Left  Ahead

Middle

Right

Behind

80 60 40 20

   t   n   e   c   r   e    P 80

0

Tied

60 40 20 0 Left

Middle

Right

Dive Panel variable: Situation; Percent within all data.

 

From the graphs, it appears that if the team is either tied or ahead, the goal-keepers tend to dive either right or left with equal probability, with very few diving in the middle. However, if the team is behind, then the majority of goal-keepers tend to dive div e right (71%). 2.17

a.

Using MINITAB, bar charts for the 3 variables are: Chart of Well Class 120 100 80    t   n   u   o    C

60 40 20 0 Private

Public Well Class

 

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

Chart of Aquifer 200

150    t   n   u   o    C

100

50

0 Bedrock 

Unconsolidated  Aquifer

 

Chart of Detection 160 140 120 100    t   n   u   o    C

80 60 40 20 0 Below Limit

Detect Detection

 b.

 

Using MINITAB, the side-by-side bar chart is: Chart of Detection Below Limit Private

Detect Public

80 70 60    t   n   e   c   r   e    P

50 40 30 20 10 0 Below Limit

Detect Detection

Panel variable: variable: Well Class; Percent with within in all data.

Copyright © 2018 Pearson Education, Inc.  

 

21 

 

22

Chapter 2

c.

Using MINITAB, the side-by-side bar chart is: Chart of Detection Below Limit Bedr ock

Detect Unconsoli

70 60    t   n   e   c   r   e    P

50 40 30 20 10 0 Below Limit

Detect Detection

Panel variable: Aquifer; Percent within all data.

d.

 

From the bar charts charts in parts aa-c, -c, one can inf infer er that most aquifers are bedrock and most levels of ( 2 / 3 ) MTBE were below the limit  . Also the percentages of public wells verses private wells are relatively close. Approximately 80% of private wells are not contaminated, while only about 60% of  public wells are not contaminated. The percentage of contaminated wells is about the same for both types of aquifers (  30%) .

2.18

Using MINITAB, the relative frequency histogram is:

.25

.20   y   c   n   e   u   q .15   e   r    F   e   v    i    t   a .10    l   e    R

.05

0 0

.5

2.5

4.5

6.5

8.5 Class

10.5

12.5

14.5

16.5

 

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

2.19

23 

To find the number of measurements for each measurement class, multiply the relative frequency by the total number of observations, n = 500. The frequency table is: Measurement Class 

.5   2.5 2.5   4.5 4.5   6.5 6.5   8.5 8.5  10.5 10.5  12.5 12.5  14.5 14.5  16.5

Relative Frequency  .10 .15 .25 .20 .05 .10 .10 .05

Frequency  500(.10) = 50 500(.15) = 75 500(.25) = 125 500(.20) = 100 500(.05) = 25 500(.10) = 50 500(.10) = 50 500(.05) = 25

500 Using MINITAB, the frequency histogram is:

140 120 100   y   c   n   e   u   q   e   r    F

80 60 40 20 0 0

.5

2.5

4.5

6.5

8.5 Class

10.5

12.5

14.6

16.5

  2.20

a.

The original data set has 1 + 3 + 5 + 7 + 4 + 3 = 23 observations.

 b.

For the bottom row of the stem-and-leaf display: The stem is 0. The leaves are 0, 1, 2. Assuming that the data are up to two digits, rounded off to the nearest whole number, the numbers in the original data set are 0, 1, and 2.

2.21

c.

Again, assuming that the data ar aree up to two digits, rounded off to the nearest whole number, the dot  plot corresponding to all the data points is:

a.

This is a frequency histogram because the number of observations is graphed for each interval rather than the relative frequency.

 b.

There are 14 measurement classes. Copyright © 2018 Pearson Education, Inc.  

 

24

2.22

Chapter 2

c.

There are 49 measurements in the data set.

a.

The graph is a frequency histogram.

 b.

The quantitative variable summarized in the graph is the fup/fumic ratio.

c.

The proportion of ratios greater than 1 is

8  5 1



416

2.23

2.24

1 81  108

14



.034 .

416 289

.695 .

d.

The proportion of ratios less than .4 is

a.

Since the label on the vertical axis is Percent, this is a relative frequency histogram. We can divide the percents by 100% to get the relative frequencies.

 b.

Summing the percents represented by all of the bars above 100, we get approximately 12%.

a.

Using MINITAB, the stem-and-leaf display and histogram are:

41 6



416



Stem-and-Leaf Stem-and-L eaf Display: SCORE Stem-and-leaf of SCORE Leaf Unit = 1.0 1 1 2 2 3 4 4 4 6 16 26 40 57 88 (47) 60 26

6 7 7 7 7 7 8 8 8 8 8 9 9 9 9 9 10

N

= 195

9 3 6 8

44 6666677777 8888899999 00001111111111 22222222223333333 4444444444444555555555555555555 6666666666666666666677777777 66666666666666666666777777777777777777777777777 7777777777777777777 8888888888999999999999999999999999 00000000000000000000000000

Histogram of SCORE 50

40

  y   c   n 30   e   u   q   e   r    F 20

10

0

72

76

80

84

88

92

96

100

SCORE

Copyright © 2018 Pearson Education, Inc.  

   

 Methods for Describing Describing Sets of Data

 b.

2.25

From the stem-and-leaf display, there are only 6 observations with sanitation scores less than 86. The  proportion of ships with accepted sanitation standards is (195  6) / 195  189 / 195  .97 .

c.

The score of 69 is highlighted in the stem-and-leaf display.

a.

Using MINITAB, a dot plot of the data is: Dotplot of Acquisitions

0

120

240

360 480  Acquisitions

600

720

840

   b.

2.26

a.

25 

By looking at the dot plot, one can conclude that the years 1996-2000 had the highest number of firms with at least one acquisition. The lowest number of acquisitions in that time frame (748) is almost 100 higher than the highest value from the remaining years.

Using MINITAB, a histogram of the current values of the 32 NFL teams is: Histogram of VALUE ($mil) 16 14 12   y 10   c   n   e   u 8   q   e   r

   F 6 4 2 0

1800

2400

3000

3600

VALUE ($mil)

Copyright © 2018 Pearson Education, Inc.  

 

26

Chapter 2

 b.

Using MINITAB, a histogram of the 1-year change in current value for the 32 NFL teams is: Histogram of CHANGE (%) 7 6 5   y   c   e   n 4   u   q   e   r    F 3

2 1 0

20

30

40

50

60

70

CHANGE (%)

c.

Using MINITAB, a histogram of the debt-to-value ratios for the 32 NFL teams is: Histogram of DEBT/VALUE (%) 14 12 10   y   c   n 8   e   u   q   e   r    F 6

4 2 0

10

20

30

40

50

DEBT/VALUE (%)

d.

Using MINITAB, a histogram of the annual reve revenues nues for the 32 NFL teams is: Histogram of REVENUE ($ mil) 18 16 14 12   y   c   n 10   e   u   q   e   r 8    F 6 4 2 0

300

350

400

450

500

550

600

REVENUE ($mil)

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

e.

27  

Using MINITAB, a histogram of the operating incomes for the 32 NFL teams is: Histogram of INCOME ($mil) 12

10

8   y   c   n   e   u   q 6   e   r    F 4

2

0

60

120

180

240

INCOME ($mil)

2.27

f.

For all of the histograms, there is 1 team that has a very high score. score. The Dallas Cowboys have the largest values for current value, annual revenues, and operating income. income. However, the San Francisco 49ers have the highest 1-year change, while the Atlanta Falcons have the highest debt-to-value ratio. All of the graphs except the one showing the 1-Yr Value Changes are skewed to the right.

a.

Using MINITAB, the frequency histograms for 2014 and 2010 SAT mathematics scores are: Histogram of M ATH2014, MATH2010 440

480

520

MATH2014

14

560

600

MATH2010

12 10   y   c   n 8   e   u   q   e   r    F 6 4 2 0

440

480

520

560

600

It appears that the scores have have not changed very much at all. The graphs are very similar.  b.

Using MINITAB, the frequency histogram of the differences is: Histogram of Diff Math 30

25

20   y   c   n   e   u   q 15   e   r    F 10

5

0

-90

-60

-30

0

30

Diff Math

Copyright © 2018 Pearson Education, Inc.  

 

28

Chapter 2

From this graph of the differences, we can see that there are more observations to the right of 0 than to the left of 0. This indicates that, in general, the scores have improved since 2010. c.

2.28

From the graph, the largest improvement score is between 22.5 and 37.5. The actual largest score is 34 and it is associated with Wyoming.

Using MINITAB, the two dot plots are: Dotplot Dotp lot of Arri ve, Depart

 Arrive Depart 108

120

132

144

156

168

Data

  Yes. Most of the numbers of items arriving at the work center per hour are in the 135 to 165 are area. a. Most of the numbers of items departing the work center per hour are in the 110 to 140 area. Because the number of items arriving is larger than the number of items departing, there will probably be some sort of bottleneck. 2.29

Using MINITAB, the stem-and-leaf display is:

Stem-and-Leaf Stem-and-Le af Display: Dioxide Stem-and-leaf of Dioxide Leaf Unit = 0.10 5

0

12234

7 (2) 7 7 5 5 4 4

0 1 1 2 2 3 3 4

55 34

N

= 16

44 3 0000

The highlighted values are values that correspond correspond to water specimens that contain oil. There is a tendency for crude oil to be present in water with lower levels of dioxide as 6 of the lowest 8 specimens with the lowest levels of dioxide contain oil.

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

2.30

a.

29 

Using MINTAB, the histograms of the number of deaths is: Histogram of Deaths 12

10

8   y   c   n   e   u   q   e   r    F

6

4

2

0

0

200

400

600

800

1000

Deaths

 b.

2.31

The interval containing the largest proportion of estimates is 0-50. Almost half of the estimates fall in this interval.

Yes, we would ag agree ree with the statement that honey may be the pre preferable ferable treatment for for the cough cough aand nd sleep difficulty associated with childhood upper respiratory tract infection. For those receiving the honey dosage, 14 of the 35 children (or 40%) had improvement scores of 12 or higher. higher. For those receiving the DM dosage, only 9 of the 33 (or 24%) children had improvement scores of 12 or higher. For those receiving no dosage, only 2 of the 37 children (or 5%) had improvement scores of 12 or higher. In addition, the median improvement score for those receiving the honey dosage was 11, the median for those receiving the DM dosage was 9 and the median for those receiving no dosage was 7.

2.32

a.

Using MINITAB, the stem-and-leaf display is as follows, where the stems are the units place and the leaves are the decimal places:

Stem-and-Leaf Display: Time Stem-and-leaf of Time Leaf Unit = 0.10 (26) 23 15 9 4 2 2 1 1 1

 b.

N

= 49

1 2 3 4

00001122222344444445555679 00001122222344444445555679 11446799 002899 11125

5 6 7 8 9 10

24 8

1

A little more than half (26/49 = .53) of all companies spent less than 2 months mont hs in bankruptcy. Only two of the 49 companies spent more than 6 months in bankruptcy. It appears that, in general, the length of time in bankruptcy for firms using "prepacks" is less than that of firms not using prepacks."

Copyright © 2018 Pearson Education, Inc.  

 

30

Chapter 2

c.

A dot diagram will be used to compare the time in bankruptcy for the three types of "prepack" firms:

Dotplot Dotp lot of Time vs Votes

  s   e    t   o    V

Joint None Prepack  1.2

2.4

3.6

4.8

6.0

7.2

8.4

9.6

Time

  d.

The highlighted times in part a correspond to companies that were reorganized through a leverage  buyout. There does not appear to be any pattern to these points. They appear to be scattered about evenly throughout the distribution of all times.

2.33

Using MINITAB, the histogram of the data is: Histogram of INTTIME 60

50

40   y   c   n   e   u   q   e   r    F

30

20

10 0 0

75

150

225 300 INTTIME

375

450

525

 

This histogram looks very similar to the one shown shown in the problem. Thus, there appears that there was minimal or no collaboration or collusion from within the company. We could conclude that the phishing attack against the organization was not an inside job.

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

2.34

31 

Using MINITAB, the stem-and-leaf display for the data is:

Stem-and-Leaf Display: Time Stem-and-leaf of Time Leaf Unit = 1.0 3 7

N

= 25

3 239 4 3499

(7) 11 6 4 2 1

5 0011469 6 34458  7 13 8 26  9 5  10 2

The numbers in bold represent delivery times associated with customers who subsequently did not place additional orders with the firm. Since there were only 2 customers with delivery times of 68 days or longer that placed additional orders, I would say the maximum tolerable delivery time is about 65 to 67 days. Everyone with delivery times less than 67 days placed additional orders. 2.35

Assume the data are a sample. The sample mean is:  x



 x  3.2  2.5  2.1  3.7  2.8  2.0  16.3  2.717   6

n

6

The median is the average of the middle two numbers when the data are arranged in order (since n = 6 is even). The data arranged in order are: 2.0, 2.1, 2.5, 2.8, 3.2, 3.7. The middle two numbers are 2.5 and 2.8. 2.8. The median is: 2.5  2.8 2 2.36



a.

 x

 b.

 x   



5.3



2

2.65  

 x  85  8.5   10

n

400 16



25  

35

c.

 x  

d.

 x   





45

242 18

.778  

 13.44

 

2.37

The m mean ean and median of a symmetric data set are equal to each other. The mean is large largerr than than the median when the data set is skewed to the right. The mean is less than the median when the data set is skewed to the left. Thus, by comparing the mean and and median, one can determine whether the data set is symmetric, skewed right, or skewed left.

2.38

The median is the middle number once the data have been arranged in order. If n is even, there is not a single middle number. Thus, to compute the median, we take the average of the middle two numbers. If n  is odd, there is a single middle number. The median is this middle number.

Copyright © 2018 Pearson Education, Inc.  

 

32

Chapter 2

A data set with five measurements arranged in order is 1, 3, 5, 6, 8. The median is the middle number, which is 5. A data set with six measurements arranged in in order is 1, 3, 5, 5, 6, 8. The median is the average of the 5  5 10 middle two numbers which is   5. 2 2 2.39

Assume the data are a sample. The mode is the observation that occurs most frequently. For this sample, the mode is 15, which occurs three times. The sample mean is:  x



 x  18  10  15  13  17  15  12  15  18  16  11  160  14.545   11

n

11

The median is the middle number when the data are arranged in order. order. The data arranged in order order are: 10, 11, 12, 13, 15, 15, 15, 16, 17, 18, 18. The middle number is the 6th number, which is 15. 15. 2.40

a.

 x



 x   7    4  15  2.5   6

n

Median =

3  3

Mode = 3



6

3   (mean of 3rd and 4th numbers, after after ordering)

2

 b.

 x



 x   2    4  40  3.08  

c.

 x



 x   51    37  496  49.6  

13 13 n Median = 3 (7th number, after ordering) Mode = 3

n

Median =

10 48  50 2

10



49  (mean of 5th and 6th numbers, after ordering)

Mode = 50 2.41

2.42

a.

For a distribution that is skewed to the left, the mean is less than the median.

 b.

For a distribution that is skewed to the right, the th e mean is greater than the median.

c.

For a symmetric distribution, the mean and median are equal.

a.

The average score for Energy Star is 4.44. The average score is close to 5 meaning the average score is close to ‘very familiar’.

 b.

The median score for Energy Star is 5. At least half of the respondents indicated that they are very familiar with the ecolabel Energy Star.

c.

The mode score for Energy Star is 5. More respondents answered ‘very familiar’ to Energy Star than any other option.

d.

The ecolabel that appears to be most familiar to travelers is Energy Star. Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

2.43

2.44

33 

a.

This statistic represents a population mean because it is computed for every freshman who attended the university in 2015. The average financial aid awarded awarded to freshmen at Harvard Harvard University is $41,555.

 b.

This statistic represents a sample median because it is computed for a sample of alumni. The median salary during early career for alumni of Harvard University is $ $61,400. 61,400. Half of the alumni from Harvard make more than $61,400 during their early career.

a.

The mean is  x 9  ( .1)  ( 1.6)  14.6  16.0  7.7  19.9  9.8  3.2  24.8  17.6  10.7  9.1 140.7    10.82  x  13 13 n The average annualized percentage return on investment for 13 randomly selected stock screeners is 10.82.

 b.

Since the number of observations is odd, the median is the middle number once the data have been arranged in order. The data arranged in orde orderr are:



-1.6 -.1 3.2 7.7 9.0 9.1 9.8 10.7 14.6 16.0 17.6 19.9 24.8 The middle number is 9.8 which is the median. Half of the annualized percentage returns on investment are below 9.8 and half are above 9.8. 2.45

a.

The mean years of experience is x



n x  30  15  1170    25  31073  17.824 .

The average average num number ber

of years of experience is 17.824 years.  b.

To find the median, we first arrange the data in order from lowest to highest: 3 5 6 9 10 10 10 15 20 20 25 25 25 30 30 30 30 Since there are an odd number number of observations, the median is the middle middle number which is 20. Half of interviewees have less than 20 years of experience.

2.46

c.

The mode is 30. More interviewees had 30 years of experience than any other value.

a.

The sample mean is: n

 x

i

 x



i 1

n

  1.72  2.50  2.16      1.95  

20

37.62 20

 1.881  

The sample average surface roughness of the 20 observations is 1.881.  b. 

The median is found as the average of the 10th and 11th observations, once the data have been ordered. The order ordered ed data are: 1.06 1.09 1.19 1.26 1.27 1.40 1.51 1.72 1.95 2.03 2.05 2.13 2.13 2.16 2.24 2.31 2.41 2.50 2.57 2.64

The 10th and 11th observations are 2.03 and 2.05. The median is: 2.03  2.05 2



4.08



2.04  

2 Copyright © 2018 Pearson Education, Inc.  

 

34

Chapter 2

The middle surface roughness measurement measurement is 2.04. Half of the sample measureme measurements nts were less than 2.04 and half were greater than 2.04.

2.47

2.48

2.49

c. 

The data are somewhat skewed to the left. Thus, the median might be a better measure measure of central tendency than the mean. The few small values in the data tend to make the mean smaller than the median.

a.

The mean permeability forThe group A sandstone slices for isr 73.62mD. The average permeability A sandstone is 73.62mD. median permeability fo group A sandstone is 70.45mD. Halffor of group the sandstone slices in group A have permeability less than 70.45mD.

 b.

The mean permeability for group B sandstone slices is 128.54mD. The average permeability for group B sandstone is 128.54mD. The median permeability for group B sandstone is 139.30mD. Half of the sandstone slices in group B have permeability less than 139.30mD.

c.

The mean permeability for group C sandstone slices is 83.07mD. The average permeability for group C sandstone is 83.07mD. The median permeability for group C sandstone is 78.650mD. Half of the sandstone slices in group C have permeability less than 78.65mD.

d.

The mode permeability score for group C sandstone is 70.9. More sandstone slices in group C had  permeability scores of 70.9 than any other value.

e.

Weathering type B appears to result result in faster decay because the mean, mean, median, and mode values fore group B is higher than those for group C.

a.

The mean is 67.755. The statement is accurate.

 b.

The median is 68.000. The statement is accurate.

c.

The mode is 64. The statement is not accurate. A better statement would be: “The most common reported level of support for corporate sustainability for the 992 senior managers was 64.

d.

Since the mean and median are almost the same, the distribution of the 992 support levels sh should ould be fairly symmetric. The histogram in Exercise 2.23 is almost symmetric.

a.

The median is the middle number (18th) once the data have been arranged in order because n = 35 is odd. The honey dos dosage age data arrange arranged d in order are: 4,5,6,8,8,8,8,9,9,9,9,10,10,10,10,10,10,11,11,11,11,12,12,12,12,12,12,13,13,14,15,15,15,15,16 The 18th number is the median = 11.

 b.

The median is the middle number (17 th) once the data have been arranged in order because n = 33 is odd. The DM dosage data arranged in order are: 3,4,4,4,4,4,4,6,6,6,7,7,7,7,7,8,9,9,9,9,9,10,10,10,11,12,12,12,12,12,13,13,15 The 17th number is the median = 9.

c.

The median is the middle number (19th) once the data have been arranged in order because n = 37 is odd. The No dosage data arranged in order are: 0,1,1,1,3,3,4,4,5,5,5,6,6,6,6,7,7,7,7,7,7,7,7,8,8,8,8,8,8,9,9,9,9,10,11,12,12 The 19th number is the median = 7. Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

2.50

35 

d.

Since the median for the Honey dosa dosage ge is larger than the other two, it appears appears that the honey dosage dosage leads to more improvement than the other two treatments.

a.

The mean dioxide level is x



3  .3  0.5  1.3    4.0 16



29 16

 1.81 .

The average dioxide amount is

1.81.  b.

Since the number of observations is even, the median is the average of the middle 2 numbers once the data are arranged in order. order. The data arranged in order order are: 0.1 0.2 0.2 0.3 0.4 0.5 0.5 1.3 1.4 2.4 2.4 3.3 4.0 4.0 4.0 4.0 The median is

1.3  1.4 2



2.7 2

 1.35 .

Half of the dioxide levels are below 1.3 1.35 5 and half are above

1.35. c.

The mode is the number that occurs the most. For this data set the mode is 4.0. The most frequent level of dioxide is 4.0.

d.

Since the number of obs observations ervations is even, the median is the average average of the middle 2 numbers numbers once once the data are arranged in order. order. The data arranged in order order are: 0.1 0.3 1.4 2.4 2.4 3.3 4.0 4.0 4.0 4.0 The median is

e.

2.4  3.3 2



5.7 2



2.85 .

Since the number of obs observations ervations is even, the median is the average average of the middle 2 numbers numbers once once the data are arranged in order. order. The data arranged in order order are: 0.2 0.2 0.4 0.5 0.5 1.3 The median is

f.

0.4  0.5 2



0.9 2



0.45 .

The median level of dioxide when crude oil is present is 0.45. The median level of dioxide when crude oil is not present is 2.85. It is apparent that the level of dioxide is much higher when crude oil is not present.

2.51

a.

Skewed to the right. There will be a few people with very high salaries such as the president and football coach.

 b.

Skewed to the left. On an easy test, most students will have high scores with only a few lo low w scores.

c.

Skewed to the right. On a difficult test, mos mostt students will have low scores with only a few high scores.

d.

Skewed to the right. Most students will have a modera moderate te amount of time studying while a few students might study a long time.

e.

Skewed to the left. Most ccars ars will be rrelatively elatively new with a few much much older.

f.

Skewed to the left. Most sstudents tudents will take the entire time to take the exam while a few might leave early. Copyright © 2018 Pearson Education, Inc.  

 

36

2.52

Chapter 2

a.

The sample means is:  x



 x  3.58  3.48  3.27      1.17  77.07  1.927   40

n

40

The median is found as the 20th and 21st observations, once the data have been ordered. The 20th and 21st observations are 1.75 and 1.76. The median is: 1.75  1.76 2



3.51 2

 1.755  

The mode is the number that occurs the most and is 1.4, which occurs 3 times.  b.

The sample average driving performance index is 1.927. The median driving performance index is 1.755. Half of all driving performance indexes are less than 1.755 and half are higher. The most common driving performance index value is 1.4.

c.

Since the mean is larger than the median, the data are skewed to the right. Using MINITAB, a histogram of the driving performance index values is: i s: Histogram of INDEX  10

8   y   c   n   e   u   q   e   r    F

6

4

2

0 1.5

2.53

2.0

2.5 INDEX 

3.0

3.5

 

For the "Joint exchange offer with prepack" prepack" ffirms, irms, the mean time is 2.6545 months, months, and the median median is 1.5 months. Thus, the average time spent in bankruptcy bankruptcy for "Joint" firms is 2.6545 months, while half of the firms spend 1.5 months or less in bankruptcy. For the "No prefiling vote held" firms, the mean time is 4.2364 months, and the median is 3.2 months. Thus, the average time spent in bankruptcy for "No prefiling vote held" firms is 4.2364 months, while half of the firms spend 3.2 months or less in bankruptcy. For the "Prepack solicitation only" firms, the mean time is 1.8185 months, and the median is 1.4 months. Thus, the average time spent in bankruptcy for "Prepack solicitation only" firms is 1.8185 months, while half of the firms spend 1.4 months or less in bankruptcy. Since the means and medians for the three groups of firms differ quite a bit, it would be unreasonable to use a single number to locate the center of the time in bankruptcy. Three different "centers" sh should ould be used.

2.54

a.

The mean is    

 x  2   1  1  1  62  2.067 . n

30

30

The average number of nuclear power plants per

state for states that have nuclear power plants is 2.067. Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

37  

The median is found by first arranging the data in order from smallest to largest: 1 1 1 1 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 3 3 3 3 4 4 5 6 Since there are an even number of data points, the median is the average of the middle two numbers which is

22 2



2 . Half of the states with nuclear power plants have 2 or fewer plants.

The mode is 1. Most states that have nuclear power plants have just 1.  b.

For regulated states: The mean is   

 x  2   1  1  1  31  1.824 . 30

n

17

The average average number of

nuclear power plants per state for states that have nuclear power plants is 1.824. The median is found by first arranging the data in order from smallest to largest: 1 1 1 1 1 1 1 2 2 2 2 2 2 2 3 3 4 Since there are an odd number of data points, the median is the middle number which is 2. Half of the states with nuclear power plants have 2 or fewer plants. The mode is 1 and 2. Most states that have nuclear power plants have 1 or 2. c.

For deregulated states: The mean is    

 x  1   1  1  1  31  2.385 . n

13

13

The average average number of

nuclear power plants per state for states that have nuclear power plants is 2.385. The median is found by first arranging the data in order from smallest to largest: 1 1 1 1 1 1 2 2 3 3 4 5 6 Since there are an odd number of data points, the median is the middle number which is 2. Half of the states with nuclear power plants have 2 or fewer plants. The mode is 1. Most states that have nuclear power plants have 1. d.

Because the average average number of nuclear power plants in states that are deregulated is greater than the average number of nuclear power plants in states that are regulated, it appears that regulations limits the number of nuclear power plants.

e.

After deleting the largest observation, the mean is    

 x  2   1  1  1  56  1.931 . n

30

29

The average

number of nuclear power plants per state for states that have nuclear power plants is 1.931. The median is found by first arranging the data in order from smallest to largest: 1 1 1 1 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 3 3 3 3 4 4 5 Since there are an odd number of data points, the median is the middle number which is 2. Half of the states with nuclear power plants have 2 or fewer plants. The mode is 1. Most states that have nuclear power plants have just 1. By deleting the largest observation, the mean decrease, but the median and mode remain the same. Copyright © 2018 Pearson Education, Inc.  

 

38

2.55

2.56

Chapter 2

 x  2   1  1  1  49  1.885 .

f.

The trimmed mean is    

a.

extreme values. Due to the "elite" superstars, the salary distribution is skewed to the right. Since this implies that the median is less than the mean, the players' association would want to use the median.

 b.

The owners, by the logic of part a, would want to use the mean.

a.

The primary disadvantage of using the range to compare variability of data sets is that the two data sets can have the same range and be vastly different with respect to data variation. Also, the range is greatly affected by extreme measures. The sample variance is the sum of the squared deviations of the observations from the sample mean divided by the sample size minus 1. The population variance is the sum of the squared deviations of the values from the population mean divided by the population size.

 b.

c.

26

n

26

The trimmed mean is not affected by

The variance of a data set can can never be negative. The variance of a sa sample mple is the sum of the squared   deviations from the mean divided by n  1. The square of any number, positive or negative, is always positive. Thus, the va variance riance will be positive. The variance is usually greater than the standard deviation. However, it is possible for the var variance iance to  be smaller than the standard deviation. If the data are between 0 and 1, the variance will be smaller than the standard deviation. For example, suppose the data set is .8, .7, .9, .5, and .3. The sample mean is:  x



 x  .8  .7  .9  .5  .3  3.2  .64   .5

n

5



 x

The sample variance is:  s 2

2





a.

  s 2

3.2 2 13 5 1

2.28  



.232 4

 .058  

 . 241  

2



  x   n 1

2

22  

n

82

5 5 1



2.3  

 s  

2. 3

 1.52  

 s  

3.619

Range = 6  0 = 6

 s

c.

2

Range = 4  0 = 4  x

 b.

n

n 1

The standard deviation is  s   .058 2.57

  x 

2

 x

2





  x 

2

n



n 1

17 2 63  7 7 1



3.619  

 1.9  

Range = 8  (2) = 10



 x 2

 s

2





  x 

n 1

n

2

302 10 10  1

154  



7.111  

 s  

7. 111  2. 67  

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

d.

Range = 1  (3) = 4



 x

 s 2

a.

 s

2

 s 2



2





2.59

2



2







  x 

2

84  

n

  x 





n

 2

17 2 20 20  1

n

 x   3 1  10  10  4  28   

 b.

2



 

5





5.6





 s  

3.3333 3.

 1.826

 s  

.1868

3.3333  



 .1868

 

 x    3 2

2

2

2

  x 

2

282 5 5 1

226 

n





69.2 4

2



  x 

n 1

 x    8 2

2

1213 

 

n



226

 

2

2

2



4.1593  

2

 10  32  5  1213

 

552

4 1

4



456.75 3

 152.25

square feet

 12.339 feet

6

n

 



2

2

2

2

2



 

  x 

n 1

n

2

(15) 2 6 6 1

59  



21.5 5



2

 (4 4))  ( 3) 3)  1  ( 4) 4)  ( 4 4))

 x  15  2.5  x

2

2

 s   17.3

 17.3  

2

 s

 .432  

 1  10  10  4

1) 3)  1  (4) 4)  (4) 4)  15  x    ( 1)  x    1  (44))  (3)   

 

4

n

 s   152.25

 x

2.211  

 x  55  13.75 feet 

 s 2

4.8889

 

n 1

 x 2

c.



 x   8  10  32  5  55    x

 s  

28

 x 2

 s



n 1

n

4.8889  

18 

a.

 x



100 2 40 40  1

 s

 x

 1.18  

380 

c.



 s   1.395

 1.395  

20 2

10 10  1

2

n 1

  x 

(6.8) 2 17 17  1

25.04 

n

n 1

 x 2

2

  x 

n 1

 x

 b.

2



 x 2.58

39 

4.3  

Copyright © 2018 Pearson Education, Inc.  

 s  

4. 3



2.0736  

2

 59  

 

40

Chapter 2



d.

 x   

 x



1 5



2.60

a.

2



5

1 5



2 5



1 5

6

n



2



4 5



10 5







 x 2

  1   5

2

1  5



2

1  5

2

2  5





2

1  5



2

4  5



2



2 24 4 25

 .96

 



3

  x 

2

n



n 1

24 2 2  25 6 6 1



.2933 5

square re ounc ouncee  .0587 squa

 s  

.0 .058 587 7

.242 422 2 ounce  .2

Range = 42  37 = 5



 x

 s 2

 b.

2





  x 

2

1992 5 5 1

7935 

n



n 1



3.7

 1. 92  

3.7  

 s  

 1,949.25  

949.25  s   1, 94



44.15  

 1,307.84

 s   1, 30 307.84



36.16  

Range = 100  1 = 99



 x 2

 s

2



  x 

2

25,795 

n





n 1

c.

3032 9

9 1

Range = 100  2 = 98



 x

 s 2

2.61



 x  2  1  .33 ounce  x

 s

1



2



  x 

n 1

n

2

2952 8 8 1

20,033  

 

This is one possibility for the two data sets. Data Set 1: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 Data Set 2: 0, 0, 1, 1, 2, 2, 3, 3, 9, 9 The two sets of data dat a above have the same range = llargest argest measurement  smallest measurement = 9  0 = 9. The means for the two data sets are:  x1

 x2





 x  0  1  2  3  4  5  6  7  8  9  45  4.5 n

10

10

 

 x  0  0  1  1 2  2  3  3  9  9  30  3   n

10

10

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data

The dot diagrams for the two data sets are shown below. Dotplot of of x1, x2

x1

0

2

 x

4

6

8

6

8

x2 0

 x

2

4

  2.62

This is one possibility for the two data sets. Data Set 1: 1, 1, 2, 2, 3, 3, 4, 4, 5, 5 Data Set 2: 1, 1, 1, 1, 1, 5, 5, 5, 5, 5  x1

 x2





 x  1  1 2  2  3  3  4  4  5  5  30  3 10

n

10

 

 x  1  1  1  1  1  5  5  5  5  5  30  3   10

n

10

Therefore, the two data sets have have the same mean. The variances for the two data sets are:

 s12

 x 

 



  x 

2

110 

n



n 1

 x

 s22

2

2



  x 

n 1

n

9

2

130  

302 10

9

302 10





20 9

40 9



2.2222  



4.4444  

Copyright © 2018 Pearson Education, Inc.  

41 

 

42

Chapter 2

The dot diagrams for the two data sets are shown below.

Dotplot Dotp lot of x1, x2

x1  x 1

2

3 x2

4

5

4

5

 x 1

2

3

  2.63

a.

Range = 3  0 = 3

 s 2

 b.

 x

2





  x 

2

n

n 1

72 15  5  5 1

 1.3  

 s   1.3

 1.14

 

 s   1.3

 1.14

 

After adding 3 to each of the data points, Range = 6  3 = 3



 x

 s 2

c.



2



  x  n

n 1

2

22 2 5 5 1

102  

 1.3  

After subtracting 4 from each of the data points, Range = 1  (4) = 3 2   x  ( 13) 2 2 39   x  5 n   s 2  5 1 n 1



d.



 1.3  

 s   1.3  1.14  

The range, variance, and standard deviation remain the same when any number is added to or subtracted from each measurement in the data set.

2.64

The ecolabel that had the most variation in the numerical responses is Audubon International because it has the largest standard deviation.

2.65

a.

The range of permeability scores for group A sandstone slices is Range  max  min  122.4  55.2  67.2 .

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data



 x

 b.

The variance of group A sandstone slices is s 2



2

  x 

2

n

562,778  

n 1

7,362.32 100 100  1

43 



209.5292 .

The standard deviation is s   209. 209.5 5292 292  14 14..475 475 . c.

Condition B has the largest range and the largest standard deviation. Thus, condition B has more variable permeability data.

2.66

a.

The range in the difference between the maximum and minimum values. The range  24.8 –  1.6   26.4 . The units of measurement are percents.

 b.

The variance is

 s 2

 x 

2



  x  n

2



n 1

140.7 2 2236.41  13 13  1



2236.4 36.41 1  1522.8 .806 069 9 12



713.6 .60 031 12



59.4669  

The units are square percents.

2.67

2.68

c.

The sta standa ndard rd deviati deviation on is  s  

a.

The range is 155. The statement is accurate.

 b.

The variance is 722.036. The statement is not accurate. A more accurate statement would be: “The variance of the levels of supports for corporate sustainability sustainabilit y for the 992 senior managers is 722.036.”

c.

The standard deviation is 26.871. If the u units nits of measure for the two distributions are are the same, same, then the distribution of support levels for the 992 senior managers has less variation than a distribution with a standard deviation of 50. If the units of measure measure for the second distribution is not known, then we cannot compare the variation in the two distributions by looking at the standard deviations alone.

d.

The standard deviation best describes the variation in the distribution. The range can be greatly affected by extreme measures. measures. The variance is measured measured in square units which is hard to interpr interpret. et. Thus, the standard deviation is the best measure to describe the variation.

a.



7.7115 7.7115 . The units are percents. percents.

The sample variance of the honey dosage group is:

 s

2

 x

2





  x 

2

n



n 1

3752 429535 35-1

The sta standa ndard rd deviat deviation ion is:  s    b.

59.466 59.4669 9



277.142857 34

 8.1512605  

8.1512 8.1512605 605  2.855 2.855  

The sample variance of the DM dosage group is:



 x 2

 s 2





  x 

n 1

n

2

2752 33 33-1

2631



339.33333 32

The standard deviation is:  s   10.60 .604167



 10.604167

 

3.256  

Copyright © 2018 Pearson Education, Inc.  

 

44

Chapter 2

c.

The sample variance of the control group is:



 x

 s 2



2



  x 

2

n

n 1

2412 37 37-1

1881

The sta standa ndard rd deviat deviation ion is:  s   d.

2.69

a.



311.243243 36



8.6456456  

8.6456 8.6456456 456  2.94 2.940 0 

The group with the most variability is the group with the largest standard deviation, which is the DM group. The group with the least variability is the group with the sm smallest allest standard deviation, which is the honey group. The range is the largest observation minus the smallest observation or 6 – 1 = 5.

 x

2 i

The variance is:  s 2





    xi  i n

i

2



n 1

s2

The standard deviation is:  s 

62 2 30 30  1

178 

 

1.7195



 1.7195  

 1.311  

 b.  The largest observation observation is 6. It is deleted ffrom rom the data set. The new range is: 5 – 1 = 4.



 xi2

The variance is:  s 2





   x  i   i n

i

2

 

n 1

The standard deviation is:  s 

s2

56 2 142  29  29  1

1.2094



 1.2094

 

 1.100  

When the largest observation is deleted, the range, variance and standard deviation decrease. c. 

The largest observation is 6 and the smallest smallest is 1. When these two observations are deleted from the data set, the new range is: 5 – 1 = 4.

 x

2 i

The variance is:  s 2





a.

n

i

s2

2



552 28 28  1

141 

  

n 1

The standard deviation is:  s 

2.70

    xi  i

1.2209

 1.2209

 

 1.1049  

When the largest and smallest observations are deleted, the range, variance and standard deviation decrease. A worker's overall time to complete the operation under study is determined by adding the subtasktime averages.

Copyright © 2018 Pearson Education, Inc.  

 

 Methods for Describing Describing Sets of Data Worker A 

The average for subtask 1 is:  x



 x  211  30.14  

The average for subtask 2 is:  x



 x  21  3  

The average for subtask 1 is:  x



 x  213  30.43  

The average for subtask 2 is:  x



 x  29  4.14  

45 

7

n

7 n Worker A's overall time is 30.14 + 3 = 33.14. Worker B 

7

n

7 n Worker B's overall time is 30.43 + 4.14 = 34.57.  b.

Worker A 



 x

2

 s 



  x 

2

n



n 1

Worker B 

 x

2

  x  

2

n

 s 

2112 7 7 1

6455 



n 1

2132 6487  7 7 1



15.8095  3.98  



.9524

 .98

 

c.

The st standard andard deviations deviations rrepresent epresent the amount amount of variability in the time it takes the worker to to comple complete te subtask 1.

d.

Worker A 

 x

2

 s 



  x 

2

n



n 1

212 67  7 7 1



.6667

 .82

 

Worker B 



 x

 s 

e.

2



  x 

n 1

n

2

292 7 7 1

147  



4.4762



2.12  

I wo would uld cho choose ose workers workers similar to w worker orker B to perform perform subtask 1. Worker B has a slightly slightly higher average time on subtask 1 (A:  x    30.14 , B:  x    30.43 ). However, Worker Worker B has a sm smaller aller variability in the time it takes to complete subtask 1 (part b). He or she is more consistent in the time needed to complete the task. I would choose workers similar to Worker A to perform subtask 2. Worker A has a smaller average time on subtask 2 (A:  x    3 , B:  x    4.14 ). Worker A also has a smaller smaller variability in the tim timee needed to complete subtask 2 (part d).

2.71

a.

The unit of measurement of the variable of interest is dollars (the same as the mean and standard deviation). Based on this, the data are quantitative. quantitative. Copyright © 2018 Pearson Education, Inc.  

 

46

Chapter 2

 b.

Since no inform information ation is given about about the shape of the data set, we can only use Che Chebyshev's byshev's Rule. $900 is 2 standard deviations below the mean, and $2100 is 2 standard deviations above the mean. Using Chebyshev's Rule, at least 3/4 of the measurements (or 3/4  200 = 150 measurements) will fall between $900 and $2100. $600 is 3 standard deviations below the mean and $2400 is 3 standard deviations above the mean. Using Chebyshev's Rule, at least 8/9 of the measurements (or 8/9  200  178 measurements) will fall between $600 and $2400. $1200 is 1 standard deviation below the mean and $1800 is 1 standard deviation above the mean. Using Chebyshev's Rule, nothing can be said about the number of measurements that will fall  between $1200 and $1800. $1800. $1500 is equal to the mean and $2100 is 2 standard standard deviations above the mean. Using Chebyshev's Rule, at least 3/4 of the measurements (or 3/4  200 = 150 measurements) will fall between $900 and $2100. It is possible that all of tthe he 150 measuremen measurements ts will be between between $900 and $1500. Thus, nothing can be said about the number of measurements between $1500 and $2100.

2.72

2.73

2.74

Since no information is given about the data set, we can only use Chebyshev's Rule. a.

Nothing can be said about the percentage of measurements which will fall between  x  s  and  x  s .

 b.

At least 3/4 or 75% of the measur measurements ements will fall between x  2s and x  2s .

c.

At least 8/9 or 89% of the measurements will fall between x  3s and x  3s .

According to the Empirical Rule: a.

Approximately 68% of the measurements will be contained in the interval x  s to x  s .

 b.

Approximately 9 95% 5% of the measurements measurements will be contained contained in the int interval erval x  2s to x  2s .

c.

Essentially all the measurement measurementss will be contained in the interval x  3s to x  3s .

a.

 x



 8.24 n x  206 25



 x

 s 2



2



  x 

n 1

n

 

2

206 2 25 25  1

1778  



3.357  

 s  

3.357

 1.83  

 b. Number of Measurements in Interval 

Interval 

Percentage 

 x  s , or (6.41, (6.41, 10.07)

18

18 / 25 25  .72  or 72%

 x  2s , or (4.58, 11.90)

24

24 / 25 25  .96  or 96%

 x  3s , or (2.75, 13.73)

25

25 / 25  1.00 or 100%

Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

c.

d.

47  

The percentages in part b are in agreement with Chebyshev's Rule and agree fairly well with the  percentages given by the Empirica Empiricall Rule.

Range  12  5  7  and  s  

Range 4



7 4

 1.75  

The range approximation provides a satisfactory estimate of  s   1.83  from part a. 2.75

Using Chebyshev's Rule, at lleast east 8/9 of the measurements will fall within 3 standard standard deviations of tthe he mean. Thus, the ran range ge of the data would would be around 6 standard standard deviations. Using the Em Empirical pirical Rule, approximately 95% of the observations are within 2 standard deviations deviations of the mean. Thus, the range of the data would be around 4 standard deviations. We would expect the standard deviation to be somewhere  between Range/6 and and Range/4. For our data, the range  760  135  625 . The

Range 6



625 6

 104.17

and

Range 4



625 4

 156.25 .

Therefore, I would estimate that the standard deviation of the data set is between 104.17 and 156.25. It would not be feasible to have a standard standard deviation of 25. If the standard deviation wer weree 25, the data would span 625/25 = 25 standard deviations. deviations. This would be extremel extremely y unlikely. 2.76

a.

 z 

 z 

2  63  353 30 4  43  353 30

 3



  A score of 263 would be 3 standar standard d deviations below the mean.

deviations above the mean. 3   A score of 443 would be 3 standard deviations

Using Chebyshev’s Rule, at least 8/9 of the observations will be within 3 standard deviations of the mean.  b.

For a mound-shaped, symmetric distribution, distribution, approxi approximately mately 99.7% of the observati observations ons will be wi within thin 3 standard deviations of the mean, using the Empirical Rule. 1  09  184

c.

 z 

 z 

d.

2.77

a.

25 2  59  184 25

 3  



A score of 109 would be 3 standard deviations deviations below the mean.

deviations above the mean. 3   A score of 259 would be 3 standard deviations

Using Chebyshev’s Rule, at least 8/9 of the observations will be within 3 standard deviations of the mean. For a m mound-shaped, ound-shaped, symmetric distribution, approximately 99.7% of tthe he obse observations rvations will be wi within thin 3 standard deviations of the mean, using the Empirical Rule. Because the distribution is skewed, we will use Chebyshev’s Rule. At least 8/9 of the observations will be within 3 standard deviations of the mean:  x A  3s  A  73.  62  31 14 4 .48  73.  62  43.  44   30.18, 117.06  

 b.

Because the di distribution stribution is skewed, skewed, we will use Chebyshev’s Rule. At least 8/9 of of the observations Copyright © 2018 Pearson Education, Inc. 

 

48

Chapter 2

will be within 3 standard deviations of the mean:  x A  3s  A  128 .54  3 21  .97   128 .54  65 .91   62.63, 194.45  

c.

Because the distribution is skewed, we will use Chebyshev’s Rule. At least 8/9 of the observations will be within 3 standard deviations of the mean:  x A  3s  A  83.  07  3 20  .05  83.  07  60.  15   22.92, 143.22  

2.78

d.

Although all the the intervals intervals overla overlap, p, it appears tthat hat weathering weathering group B results in fas faster ter dec decay ay because the sample mean is higher and the upper limit of the interval is much higher than the upper limit for the other two weathering types.

a.

Using MINITAB, MINITAB, the histogram of the data is: Histogram of Wheels 12

10

8   y   c   n   e   u   q   e   r    F

6

4

2

0 1

2

3

4

5

6

7

8

Wheels

 

Since the distribution is skewed to the right, it is not mound-shaped and it is not symmetric.  b.

Using MINIT MINITAB, AB, the results results are: Descriptive Statistics: Wheels  Variable  Wheels

N 28

Mean 3.214

StDev 1.371

Minimum 1.000

Q1 2.000

Median 3.000

Q3 4.000

Maximum 8.000

The mean is 3.214 and the standard deviation is 1.371.

2.79

c.

The interval is:  x  2 s  3.2  14  2(1.3  71)  3.2  14  2.7 42  (0.472, 5.956) .

d.

According to Chebyshev’s Chebyshev’s rrule, ule, at least 75% of the obse observations rvations will fall w within ithin 2 standard deviations of the mean.

e.

According to the Empirical Rule, approximately 95% of the obse observations rvations will fall w within ithin 2 standard deviations of the mean.

f.

Actually, 26 of the 28 or 26/ 26/28 28 = .929 of the observations fall within the interval. interval. This value is cl close ose to the 95% that we would expect with the Empirical Rule.

a.

The interval x  2s will contain at least least 75% of the observations. This interval is  x  2 s  3. 11  2(.6  6)  3. 11  1.3  2  (1.79, 4.43) .

Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

2.80

49 

 b.

No. The value 1.25 does not fall in tthe he interval x  2s . We know that at least least 75% of all observations will fall within 2 standard deviations of the mean. Since 1.25 falls more than 2 standard deviations from the mean, it would not be a likely value to observe.

a.

Since the data are mound-shaped and symmetric, we know from the Empirical Rule that approximately 95% of the observations will will fall within 2 standard deviations of the mean. This interval will be:  x  2 s  3 9  2( 6)  3 9  1 2  (27, 51) .

 b.

We know that approximately .05 .05 of the observations observations will fall outside the rang rangee 27 to 51. Since the distribution of scores is symmetric, we know that half of the .05 or .025 will fall above 51.

c.

We know know from the Empirical Empirical Rule that that approximately approximately 99 99.7% .7% (ess (essentially entially all) of the observations will fall within 3 standard deviations deviations of the mean. This interval is:  x  3 s  3 9  3( 6)  3 9  1 8  (21, 57) . n

 x

i

2.81

a.

The sample mean is:  x



i 1



n

18,482 195

n

 x The sample variance is:  s 2

 i 1

The standard deviation is:  s  s 2  b.

2



94.78  

 n    xi  i 1

2



n 1 



n

 

1,756,550 



24.9254



18,482 2

195  1

195



24.9254  

4.9925  

 x  s   94.  78  4.9   9  (89.79, 9999.77)    x  2 s  94.  78  2( 4.  99)  94.  78  9.9  8  (84.80, 104.76)    x  3 s  94.  78  3(4. 99)  94.  78  14. 97  (79.81, 1 10 09.75)  

c.

19 95)  100%  73.3% . There There are 143 out of 195 observations in the first interval. This is (143 / 1

are 189 out of 195 observations in the second interval. interval. This is (189 / 1 19 95)  100%  96.9% . There are 195)  100%  97.9% . 191 out of 195 observations in the second interval. interval. This is (191 / 19

The percentages for the first 2 intervals are somewhat larger than we would expect using the Empirical Rule. The Empirical Rule indicates indicates that approximately 68% of the observations will fall within 1 standard deviation of the mean. It also indicates that approximately approximately 95% of the observations will fall within 2 standard deviations of the mean. Chebyshev’s Theorem says that at least ¾ or 75% of the observations will fall within 2 standard deviations of the mean and at least 8/9 or 88.9% of the observations will fall within within 3 standard deviations of the mean. It appears that our observed  percentages agree with with Chebyshev’s Theorem bet better ter than the Empirical Empirical Rule. 2.82

Using MINITAB, MINITAB, the descriptive statistics are: Descriptive Statistics: Deaths  Variable Deaths 

N 27

Mean 163.4

StDev 227.4

Minimum 4.0

Q1 29.0

Median 68.0

Q3 184.0

Maximum 955.0

Since the data are not mound-shaped, we will use Chebyshev’s Rule. Most of the observations (8/9) will fall within 3 standard deviations deviations of the mean. This interval is: Copyright © 2018 Pearson Education, Inc. 

 

50

Chapter 2

 x  3 s  163  .4  3( 22 7.4)  163  .4  682   .2  (  518.8, 845.6) . Since no observations observations can be negati negative, ve,

then most observations will fall between 0 and 845.6. 2.83

Using MINITAB, MINITAB, the descriptive statistics are: Descriptive Statistics: Q2  Variable Q2

Q1 No Undecided Yes

N 1 5 30

Mean 2.0000 4.800 3.967

StDev * 0.447 0.850

Minimum 2.0000 4.000 2.000

Q1 * 4.500 3.000

Median 2.0000 5.000 4.000

Q3 * 5.000 5.000

Maximum 2.0000 5.000 5.000

The data for those users who believe there should be national standards is close to being mound-shaped and symmetric. Therefore, we wi will ll use the Empirical Rule. Rule. Approximately 95% of the the observations fall within within 2 standard deviations deviations of the mean. This interval is:   x  2 s  3.9  67  2(.8   5)  3.9  67  1.7  0  (2.267, 5.667)  

2.84

a.

The average ranking for contestants with a first degree who competed for a job with Lord Sugar is 7.796.

 b.

Approximately 95% of the observatio observations ns will fall within 2 stand standard ard deviations of tthe he mean. This interval is:  x  2 s  7.7  96  2(4. 231)  7.7  96  8.4 62  (  .666, 16.258)   Since no observations observations can be

negative, the interval will be 0 to 16.258.

2.85

c.

No. It appe appears ars that just just the the op opposite posite is is true. true. When the prize was a jo job, b, the higher the eeducation ducation level of the contestant, the higher the mean ratting. When the prize was a partnership, the higher the education level of the contestant, the lower mean the rating.

a.

The interval x  2s for the flexed arm group is x  2 s  5 9  3( 4)  5 9  1 2  (47, 71) . The interval for the extended are group is x  2 s  4 3  3( 2)  4 3  6   (37, 49) . We know that at least 8/9 or 88.9% of the observations will fall within 3 standard deviations of the mean using Chebyshev’s Rule. Since these 2 intervals barely overlap, the information supports supports the researchers’ theory. The shoppers from the flexed arm group are more likely to select vice options than the extended arm group.

 b.

The interval x  2s  for the flexed arm group is x  2 s  5 9  2(1  0)  5 9  2 0  (39, 7 79 9) . The 73) . Since these two interval for the extended are group is x  2 s  4 3  2(1  5)  4 3  3 0  (13, 73 intervals overlap almost completely, completely, the information does not support the researcher’s theory. There does not appear to be any difference between the two groups.

2.86

a.

Yes. The distribution of the buy-side analysts is fairly flat and skewed to the right. The distribution of the sell-side analysts is more mound shaped and is not spread out as far as the buy-side distribution. Since the buy-side distribution distribution is more spread out, the variance of the buy-side distribution will be larger than the variance of the sell-side distribution. distribution. Because the buy-side distribution is skewed to the right, the mean will be pulled to the right. Thus, the mean of the buyside distribution will be greater than the mean of the sell-side distribution.

 b.

Since the sell-s sell-side ide distribution is fairly mound-shaped, mound-shaped, we can use the Empiri Empirical cal Rule. The Empirical Rule says that approximately 95% of the observations will fall within 2 standard deviations of the mean. The interval for the sel sell-side l-side distribution would would be: Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

51 

 x  2 s    .05  2(.8   5)    .05  1. 7   (1.75, 1.65)  

Since the buy-side distribution is skewed to the right, we cannot use the Empirical Empirical Rule. Thus, we 2 will use Chebyshev’s Rule. Rule. We know that at least (1 – 1/k  ) will fall within k  standard  standard deviations of 2 the mean. If we choose k    4 , then (1  1 / 4 )  .9375  or 93.75%. This is very cclose lose to 95% requested in the problem. The interval for the buy-side distribution to contain at least 93.75% of the observations would be:  x  4 s  .8  5  4(1. 93)  .8  5  7.7  2  (  6.87, 8.57)    Note: This interval will contain at least  93.75%  93.75% of the observations. It may contain more than 93.75% of the observations. 2.87

Since w wee do not know if the distribution of the heights of the trees iiss mound-s mound-shaped, haped, w wee need to apply  30       3  3    3   0  3 ( 3 ) 3   0 Chebyshev's Rule. Rule. We know and . Therefore,        9   (21, 39) .  According to Chebyshev's Rule, at least 8 / 9  .89 of the tree heights on this piece of land fall within this interval and at most 1 / 9  .11  of the tree heights will fall fall above the interval. However, the buyer will will only 1000  purchase the land if if at least buyer er  .20 of the tree heights are at least 40 feet tall. Therefore, the buy 5000 should not buy the piece of land.

2.88

a.

Since we do not have any idea of the shape of the distribution of SAT-Ma SAT-Math th score changes, we must use Chebyshev’s Theorem. We know that at least 8/9 of the observations will fall within 3 standard deviations of the mean. This interval would be:  x  3 s  1 9  3(6  5)  1 9  19  5   ( 176, 214)  

Thus, for a randomly selected student, we could be pretty sure that this student’s score would be anywhere from 176 points below his/her previous SAT-Math score to 214 points above his/her  previous SAT-Math SAT-Math score.  b.

Since we do not have any idea of tthe he shape of the distribution of SAT-Verbal SAT-Verbal sc score ore changes, we must use Chebyshev’s Theorem. We know that at least 8/9 of the observations will fall within 3 standard deviations of the mean. This interval would be:  x  3 s  7   3( 4  9)  7   14  7   ( 140, 154)  

Thus, for a randomly selected student, we could be pretty sure that this student’s score would be anywhere from 140 points below his/her previous SAT-Verbal score to 154 points above his/her  previous SAT-Verbal SAT-Verbal score. c.

2.89

A change of 140 points on the SAT-Math would be a lit little tle less than 2 standard deviations from the mean. A change of 140 points on the SAT-Verbal would be a little less than 3 standard deviations from the mean. Since the 140 point change for the SAT-Math SAT-Math is not as big big a change as the 140 point on the SAT-Verbal, it would be most likely that the score was a SAT-Math score.

We know



25 and      1 . Therefore,

 2      2 5  2( .1)

25.2)    2 5  .2   ( 24.8, 25

The machine is shut down for adjustment if the contents of two consecutive bags fall more than 2 standard deviations from the mean (i.e., outside the interval (24.8, (24.8, 25.2)). Therefore, the machine was shut down yesterday at 11:30 (25.23 and 25.25 are outside the interval) and again at 4:00 (24.71 and 25.31 are outside the interval).

Copyright © 2018 Pearson Education, Inc. 

 

52

2.90

Chapter 2

a. 

 z



 b. 

 z



c. 

 z

 x  x  s



 x    



2.91

 z

5 90  89

  

2

      x 

50  50 5

  

d. 

40  30



 x  x  s



20  30 4



2  (sample)

 .5



 (population)

0  (population)

 2.5  (sample)

2 standard deviations above the mean.

.5 standard deviations above the mean.

0 standard deviations above the mean.

2.5 standard deviations below the mean.

Using the definition of a percentile:

Percentile 

Percentage Above

Percentage Below

a.

75th

25%

75%

 b.

50th

50%

50%

c.

20th

80%

20%

d.

84th

16%

84%

2.92

QL corresponds to the 25th percentile. QM corresponds to the 50th percentile. QU corresponds to the 75th   percentile.

2.93

We first compute z -scores -scores for each x value. a.

 z



 x    



100  50 25

  

 b.

 z



 x     1  4 

1

  

c.

 x  z



0

  



 z



 3

 x       



 

2 00



 

2

100

  

d.







10  5 3

 1.67

 

 

The above z -scores -scores indicate that the x value in part a lies the greatest distance above the mean and the  x value of part b lies the greatest distance below the mean.

Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

2.94

53 

Since the element 40 has a z -score -score of 2 and 90 has a z -score -score of 3, 2 

40       

and

3

90    

 

  

 

   2  40     3   90         2    40            3   90     40  2   By substitution, 40  2

 3 

9 0  5  



5 0   

   1 0 and



40  2(10)  60   .

Therefore, the population mean is 60 and the standard deviation is 10. 2.95

The mean score of U.S. eighth-graders on a mathematics assessment test is 282. This is the average score. The 25th percentile is 258. This means that 25% of the U.S. eighth-graders score score below 258 on the test and 75% score higher. The 75th percentile is 308. This means that 75% of the U.S. U.S. eighth-grader eighth-graderss score below th 308 on the test and 25% score hig higher. her. The 90  percentile is 329. This means that 90% of the U.S. eighthgraders score below 329 on the test and 10% score higher.

2.96

a.

 z 

 x  x  s

400  353



30

 1.57

  A transformer wit with h 400 sags in a week is 1.57 standard deviations deviations above

the mean.  b. 2.97

2.98

 z 

 x  x  s



100  184 25

 3.36

  A transformer wi with th 100 swells in a week is 3.36 standar standard d deviations

 below the mean. Am mean ean current current salary of $57,000 $57,000 iindicates ndicates that the average current salary of th thee Uni University versity of S South outh Florida graduates is $57,000. At mid-career, half of the University of South Fl Florida orida graduates had a salary less than $48,000 and half had salaries greater than $48,000. At mid-career, 90% of the University of South Florida graduates had salaries under $131,000 and 10% had salaries greater than $131,000. a.

From Exercise 2.81,  x    94.78 and s   4.99 . The z -score -score for an observation of 73 is:  z  

 x  x  s



73  94.78 4.99

 4.36  

This z -score -score indicates that an observation of 73 is 4.36 standard deviations below the mean. Very few observations will be lower than this one.  b.

The z-score for an observation of 91 is:  z  

 x  x  s



91  94.78 4.99

 0.76  

This z -score -score indicates that an observation of 91 is .76 standard deviations below the mean. This score is not an unusual observation in the data set. 2.99

Since the 90th percentile of the the study sample in the subdivision was .00372 m mg/L, g/L, which is less than the USEPA level of .015 mg/L, the water customers in the subdivision are not at risk of drinking water with unhealthy lead levels.

Copyright © 2018 Pearson Education, Inc. 

 

54

Chapter 2

 x  x

155  67.755

2.100

The z -score -score associated with a score of 155 is z  

2.101

The avera average ge ROE is 13.93. The m median edian ROE ROE is 14.86, m meaning eaning 50% 50% of firms have ROE ROE below 14.86. The th th 5  percentile is -19.64 meaning meaning 5% of firms have ROE below –19.64. The 25  percentile is 7.59 meaning 25% of firms have ROE below 7.59. The 75th percentile is 21.32 meaning 75% of firms have ROE below 21.32. The 95th percentile is 38.42 meaning 95% of firms have ROE below 38.42. The standard deviation is 21.65. Most observations will fall within 2s or 43.30 units of mean. The distribution will be somewhat

 3.25 . This score would not be 26.871  s considered a typical level of ssupport. upport. It is 3.25 standard deviations deviations above the mean. Very few observations would be above this value. 

skewed to the left as the 5 th percentile value is much further from the median than the 95 th percentile value.

2.102

a.

Since the data are approximately mound-shaped, we can use the Empirical Rule. On the blue exam, the mean is 53% and the standard deviation is 15%. We know that approximatel approximately y 68% of all students will score score within 1 standard deviat deviation ion of the mean. This interval is:  68 8)    x  s   5 3  1 5  (38, 6

About 95% of all students will score within 2 standard deviations of the mean. This interval is:  x  2 s  5 3  2(1  5)  5 3  3 0  (23, 83)  

About 99.7% of all students will score within 3 standard deviations of the mean. This interval is:  x  3 s  5 3  3(1  5)  5 3  4 5  (8, 98)    b.

Since the data are approximately approximately mound-shaped, we can use the Em Empirical pirical Rule. On the red exam, exam, the mean is 39% 39% and the standard standard deviation is 12% 12%.. We know that approximately 68% of all students will score within 1 standard deviation of the mean. This interval is:  x  s   3 9  1 2  ( 27, 51)  

About 95% of all students will score within 2 standard deviations of the mean. This interval is:  x  2 s  3 9  2(1  2)  3 9  2 4  (15, 63)  

About 99.7% of all students will score within 3 standard deviations of the mean. This interval is:  x  3 s  3 9  3(1  2)  3 9  3 6  (3, 75)

2.103

 

c.

The st student udent would would have been more more likely to have taken tthe he red exam. For the the blue exam, we know that approximately 95% of all all scores will be from 23% to to 83%. The observed 20% score does not fall in this range. For the red exam, we know that approximately 95% of all scores will be from 15% to 63%. The observed 20% score does fall iin n this range. Thus, it is m more ore likely that the student would have taken the red exam.

a.

The z -score -score for Harvard is z  =  = 5.08. This means that Harvard’s Harvard’s productivity score wa wass 5.08 standard deviations above the mean. This is extremely high and extremely extremely unusual.

 b.

The z -score -score for Howard University is z   = = .85. This means that Howard Howard University’s pr productivity oductivity score was .85 standard deviations deviations below the mean. This is not an unusual z -score. -score.

c.

Yes. Other indicators that the distribution distribution is skewed to the the ri right ght aare re tthe he val values ues of the highest and lowest z -scores. -scores. The lowest z -score -score is less than 1 standard deviation below the mean while the highest z -score -score is 5.08 standard deviations above the mean. Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

55 

Using MINITAB, the histogram of the  z -scores -scores is: Histogram of Z-Score 70 60 50   y   c   n   e   u   q   e   r    F

40 30 20 10 0 -1

0

1

2 Z-Score

3

4

5

 

This histogram does imply that the data are skewed to the right. 2.104

a.

From the problem,

 z



 x    

 

 z 





2.7 and

     .5  

x     x     z   

 

 

  

For z  =  = 2.0,  x   2.7  2.0(.5)  3.7   For z  =  = 1.0,  x   2.7  1.0(.5)  2.2   For z  =  = .5,  x   2.7  .5(.5)  2.95   For z  =  = 2.5,  x   2.7  2.5(.5)  1.45    b.

For z  =  = 1.6,  x   2.7  1.6(.5)  1.9  

c.

If w wee as assume sume the distribution distribution of G GPAs PAs is approximately approximately mound-shaped, we can use the Empirical Rule.

From the Empirical Rule, we know that .025 or 2.5% of the students will have GPAs above 3.7 (with z  =  = 2). Thus, the GPA corresponding corresponding to summa cum laude (top (top 2.5%) will be greater than 3.7 z  > (  z   > 2). We know that .16 or 16% of the students will have GPAs above 3.2 (  z  z  =  = 1). Thus, the limit on z  > GPAs for cum laude (top 16%) will be greater than 3.2 (  z   > 1).

Copyright © 2018 Pearson Education, Inc. 

 

56

Chapter 2

We must assume the distribution is mound-shaped. 2.105

Not n necessarily. ecessarily. Because the distribution distribution iiss highly skewed to the right, the st standard andard devia deviation tion is very lar large. ge. Remember that the z -score -score represents the number number of standard deviations a scor scoree is from the mean. If the standard deviation is very large, then the  z -scores -scores for observations somewhat near the mean will appear to  be fairly small. small. If we deleted the schools with the very high prod productivity uctivity scores and recomputed the mean and standard deviation, the standard standard deviation would be much sm smaller. aller. Thus, most of the z -scores -scores would be larger because we would be dividing by a much smaller standard deviation. This would imply a bigger spread among the rest of the schools than the original distribution with the few outliers.

2.106

2.107

To determine if the measurements are outliers, compute the z -score. -score. a. 

 z



 b. 

 z



c. 

 z



d. 

 z



 x  x  s  x  x

 s outlier.  x  x  s  x  x

 s outlier.









65  57 11 21  57 11

72  5 7 11 98  57 11

 .727

  Since the z -score -score is less than 3, this would not be an outlier.

 3.273  

 1.364



Since the z --score score is greater than 3 in absolute value, this would be an

  Since the z -score -score is less than 3, this would not be an outlier.

-score is greater than 3 in absolute value, this would be an 3.727   Since the z -score

The interquartile range is IQR  QU

 QL 

85  60  25 .

The lower inner fence = Q L  1.5( IQR)  60  1.5( 25)  22.5 . The upper inner fence = QU   1.5( IQR )  85  1.5( 25)  122.5 . The lower outer fence = Q L

 3( IQR) 

60  3( 25)  15 .

The upper outer fence = QU   3( IQR)  85  3( 25)  160 . With only this information, the box plot would look something like the following:

*

────────────  + ──────────────────│  │──────  ──────────── 

─┼────┼────┼────┼────┼────┼────┼────┼────┼────┼────┼───  10

20

30

40

50

60

70

80

90

100

110

The whiskers extend to the inner fences unless no data points are that small or that large. The upper inner fence is 122.5. However, the largest data data point is 100, so the whisker st stops ops at 100. The lower inner fence is 22.5. The smallest data point is is 18, so the whisker extends to 22.5. 22.5. Since 18 is between the inner inner and outer fences, it is designated with a *. We do not know if there is any more than one data point below 22.5, so we cannot be sure that the box plot is entirely correct. Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

2.108

a.

Median is approximately 4.

 b. 

QL is approximately 3 (Lower Quartile)

57  

QU is approximately 6 (Upper Quartile)

63  3 

c.

 IQR  QU

d.

The data set is skewed to the right since the right w whisker hisker is longer than the left left,, there is one outlier, and there are two potential outliers.

e.

50% of the m measurements easurements are to the right of the median and 75% are to the lef leftt of the upper quartile.

f.

The upper inner fence is QU   1.5( IQR)  6  1.5(3)  10.5 . The upper outer fence iiss

 QL 

QU   3( IQR)  6  3(3)

 15

. Thus, there are two suspect outliers, outliers, 12 and 13. There is one hi highly ghly

suspect outlier, 16. 2.109

a.

Using MINITAB, MINITAB, the box plots for samples A and B are: Boxplot of Sample A, Sample B

Sample A

Sample B

100

125

150

175

200

Data

 b.

In sample A, tthe he measurement 84 is an outlier. This measurement measurement falls outside the lower oute outerr fence. Lower outer fence = Lower hinge 3( IQR)  150  3(172  150)  150  3(22)  84   Lower inner fence = Lower hinge 1.5( IQR)  150  1.5(22)  117   Upper inner fence = Upper hinge 1.5( IQR)  172  1.5( 22)  205   In addition, 100 may be an outlier. outlier. It lies outside the inne innerr fence. In sample B, 140 and 206 may be outliers. outliers. The point 140 lies outside the iinner nner fence while the point 206 lies right at the inner fence. Lower outer fence = Lower hinge 3( IQR)  168  3(184  169)  168  3(15)  123   Lower inner fence = Lower hinge 1.5( IQR)  168  1.5(15)  145.5   Upper inner fence = Upper hinge 1.5( IQR)  184  1.5(15)  206.5  

Copyright © 2018 Pearson Education, Inc. 

 

58

Chapter 2

2.110

a.

Using MINITAB, MINITAB, the descriptive statistics are: Descriptive Statistics: Acad Rep Score  Variable Acad Rep Score

N 50

Mean 76.42

Minimum 47.00

Q1 64.75

Median 76.00

Q3 89.00

Maximum 100.00

IQR 24.25

The median is 76, the lower quartile is 64.75, and the upper quartile is 89.  b.

 IQR  QU

c.

Using MINITAB, MINITAB, the boxplot is:

 QL 

89  64.75  24.25  

Boxplot of Academic Rep Score

40

d.

50

60

70

Academic Rep Score

80

90

100

Suspect outliers lie between Q L   1.5 IQR  and Q L   3 IQR  or between QU    1.5 IQR  and QU    3 IQR  . Q L

 1.5

 IQR   64.75  1.5  24.25  28.375 ,

Q L

3

 IQR   64.75  3 24.25  8  

QU   1.5 IQR   89  1.5 24.25  125.375 , QU   3 IQR   89  3 2 24 4.25  161.75  

 No scores are less less than 28.375 nor larger than 125.3 125.375. 75. Therefore, there are no outlier outlierss or suspect outliers. 2.111

a.

 z 

 x  x  s



400  353 30

 1.57

  Since the z --score score is less than 2, 400 sags per week would not be

considered unusual.  b.

 z 

 x  x  s



100  184 25

 3.36

  Since the absolute value value of the z -score -score is greater than 3, 100 swells per

week would be considered unusual. 2.112

2.113

a.

The approximate 25th percentile PASI score before treatment is 10. The approximate median before treatment is 15. The approximate approximate 75th percentile PASI PASI score before treatment is 28.

 b.

The approximate 25th percentile PASI score after treatment is 3. The approximate median after treatment is 5. The approxim approximate ate 75th percentile PASI score score after treatment is 7.5.

c.

Since the 75th percentile after treatment is lower than the 25 th percentile before treatment, it appears that the ichthyotherapy is effective in treating psoriasis.

a.

The average per full-time employeewere is $6,563. The median per employee $6,232. Halfexpenditure of all expenditures pe perr employee less than $6,2 $6,232 32 andexpenditure half were greater than is $6,232. The lower quart quartile ile is $5,309. Twenty-five Twenty-five percent of all all expenditures per eemployee mployee were Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

59 

 below $5,309. The upper quartile quartile is $7,216. Seventy-five percent percent of all eexpenditures xpenditures per em employee ployee were below $7,216.

2.114

$7, 21 216  $5, 30 309  $1, 90 907 .

 b.

 IQR  QU

c.

The interquartile range goes from the 25th percentile to the 75th percentile. Thus, .5  .75  .25  of the 1,751 army hospitals have expenses between $5,309 and $7,216.

a.

From the printout,  x    52.334  and s = 9.224.

 QL 

The highest salary is 75 (thousand). The z -score -score is  z  

 x  x  s



75  52.334 9.224



2.46  

Therefore, the highest salary is 2.46 standard deviations above the mean. The lowest salary is 35.0 (thousand). The z -score -score is  z  

 x  x  s



35.0  52.334 9.224

 1.88  

Therefore, the lowest salary is 1.88 standard deviations below the mean. The mean salary offer is 52.33 (thousand). The z -score -score is  z  

 x  x  s



52.33  52.33 .334 9.224





The z -score -score for the mean salary offer is 0 standard deviations from the mean.  No, the highest salary offer is not unusually high. For any distr distribution, ibution, at lea least st 8/9 of the sal salaries aries should have z -scores -scores between 3 and 3. A z -score -score of 2.46 would not be that unusual.

2.115

 b.

Since no salari salaries es are outside the the inner fences, none of them are are suspect or highl highly y suspect outlie outliers. rs.

a.

Using MINITAB, MINITAB, the boxplots for each type of firm are: Boxplot of Time

Joint

  s   e    t   o    V

None

Prepack

0

2

4

6

8

10

Time

 b.

The median bankr bankruptcy uptcy time for No prefiling firms firms is about 3.2. The median bankruptcy bankruptcy time for Copyright © 2018 Pearson Education, Inc. 

 

60

Chapter 2

Joint firms is about 1.5. The median bankruptcy time time for Prepack firms is about 1. 1.4. 4.

2.116

c.

The range of the "Prepack" firms is less than the other tw two, o, whil whilee the range of the "Non "None" e" firm firmss is the largest. The interquartile range of the "Prepack" firms firms is less than the other two, while the interquartile range of the "Joint" firms is larger than the other two.

d.

No. The iinterquartile nterquartile range ffor or the the "P "Prepack" repack" firms is the smallest smallest which corresponds to the sm smallest allest standard deviation. However, the second smallest interquart interquartile ile range corresponds to the "None" firms. The second smallest standard deviation corresponds corresponds to the "Joint" firms.

e.

Yes. There is evidence of two outliers in the "Prepack" firms. These are indicated by tthe he two *'s. There is also evidence of two outliers in the "None" firms. These are indicated by the two *'s.

From Exercise 2.100,  x    67.755  and s   26.87 . Using MINITAB, MINITAB, a boxplot of the data is: Boxplot of Support

0

20

40

60

80

100

120

140

160

Support

From the boxplot, the support level of 155 would be an outlier. From Exercise 2.100, we found the z -score -score   x  x 155  67.755 associated with a score of 155 as  z   -score is greater than 3, the   3.25 . Since this z -score 26.871  s observation 155 is considered an outlier. 2.117

a.

Using MINITAB, MINITAB, the boxplot is: Boxplot of SCORE

70

75

80

85

90

95

100

SCORE

From the boxplot, there appears appears to be 4 outliers: 69, 73, 76, and 78.  b. 

From Exercise 2.81,  x    94.78 and s   4.99 . Since the data are skewed skewed to the left, we will consider consider observations more than 2 standard deviations from the mean to be outliers. An observation with a z score of 2 would have the value:

Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

 z 

 x  x

 

 s

x  94.78

 2  

 

4.99

61 

 2( 4.  99)  x  94. 78  9.9  8  x  94. 78  x   104.76  

An observation with a z -score -score of -2 would have the value:  z 

 x  x

 

 s

   2 

x  94.78

 

4.99

   2( 4.99)  x  94. 78    9.98  x  94. 78  x   84.80  

Observations greater than than 104.76 or less than 84.80 would be considered considered outliers. Using this criterion, the following following observations would be outliers: outliers: 69, 73, 76, and 78.

2.118

c. 

Yes, these methods do not agree exactly. Using the boxplot, 4 observations were identified identified as outliers. Using the z -score -score method, 4 observations were also identified as outliers.

a.

Using MINITAB, MINITAB, the box plot is: Boxplot of Downtime

0

10

20

30

40

50

60

70

Downtime

The median is about 18. The data appear to be skewed to the right right since there are 3 suspect outliers outliers to the right and none to the left. The variability of the data is fairly small because the IQR is fairly fairly  small, approximately 26  10 = 16.  b.

The customers associated with the the suspected outli outliers ers are customer customerss 268, 269, and 264.

c.

In order to find the z -scores, -scores, we must first find the mean and standard deviation. 2

 x





 x

n

 x    x   n  2



815 40



 s   192.90705

 s 2

20.375

n 1

 



 13.89 .89  

The z -scores -scores associated with the suspected outliers are: Customer 268  z  

Customer 269  z  

 49  20.375

13.89  50  20.375

13.89



2.06

  

2.13

 

Copyright © 2018 Pearson Education, Inc. 

24129  815 40 40  1

2

 192.90705

 

 

62

Chapter 2

Customer 264  z  

 64 

20.375

13.89



3.14

 

All the z -scores -scores are greater than than 2. These are unusual values. 2.119

Using MINITAB, MINITAB, the boxplots of the data are: Boxplot of PermA, PermB, PermC

PermA

PermB

PermC

50

75

100

125

150

Data

The descriptive statistics are: Descriptive Statistics: PermA, PermB, PermC  Variable PermA PermB PermC

a.

N 100 100 100

Mean 73.62 128.54 83.07

StDev 14.48 21.97 20.05

Minimum 55.20 50.40 52.20

Q1 62.00 108.65 67.72

Median 70.45 139.30 78.65

Q3 81.42 147.02 95.35

Maximum 122.40 150.00 129.00

IQR 19.42 38.37 27.63

For group A, the suspect outliers are any observations greater than QU   1.5 IQR   81.42  1.51 19 9.42  110.55 or less than Q L   1.5 IQR   62  1.519 19.42  32.87 . There are 3 observations greater than 110.55: 117.3, 118.5, and 122.4.

2.120

 b.

For group B, tthe he suspect outlier outlierss are any obser observations vations greater than than QU   1.5 IQR   147.02  1.5 38.37  204.575 or less than Q L    1.5 IQR   108.65  1.5 38.37  51.095 . There is 1 observation less than 51.095: 50.4.

c.

For group C, the suspect outliers are any observations greater than Q L  1.5  IQR   95.35  1.5  27.63  136.795 or less than Q L    1.5 IQR   67.72  1.5 27.63  26.275 .  No observations are greater tha than n 136.795 or less than 26.275.

d.

For group A, if the outli outliers ers are removed, the the mean will decrease, decrease, the median will sl slightly ightly decreas decrease, e, and the standard deviation will decrease. For group B, if the outlier is removed, the mean will increase, the median will slightly increase, and the standard deviation will decrease.

For Perturbed Intrinsics, but no Perturbed Projections:

n

n



 xi

 x



i 1

n



8.1 5

 n    xi  i 1  2  x  

  1.62  

 s 2





i

n

i 1

n 1

2

 

8.12 15.63  5  5 1

2.508 4



.792



 s  s 2

 .627

 

4.5  1.62

 x  x

The z -score -score corresponding to a value of 4.5 is z    s



3.63  

Copyright © 2018 Pearson Education, Inc. 



.627

 .792  

 

 Methods for Describing Describing Sets of Data

63 

Since this z -score -score is greater than 3, we would consider this an outlier for perturbed intrinsics, but no  perturbed projections. For Perturbed Projections, but no Perturbed Intrinsics:

n

n





 xi

 x



i 1



n

 s  s

2



 n    xi  i 1  2  x  

125.8 5



 s 2

25.16



2



i

n

i 1

The z -score -score corresponding to a value of 4.5 is z  

5



n 1

  46.243  6.800  

3350.1 

 

 x  x  s

125.82 

5 1



4. 5  25. 16 6.800

184.972 4



46.243

 

 3.038  

Since this z -score -score is less than -3, we would consider this an outlier for perturbed projections, but no  perturbed intrinsics. intrinsics. Since the z -score -score corresponding to 4.5 for the perturbed projections, but no perturbed intrinsics is smaller in absolute value than that for perturbed intrinsics, but no perturbed projections, it is more likely that the that the type of camera perturbation is perturbed projections, but no perturbed intrinsics. 2.121

From tthe he stem-andstem-and-leaf leaf display display in Exercise Exercise 2.34, the data are fairly mound-shaped, but skewed somewhat to the right.

The sample mean is x



 x  1493  59.72 .

The sample variance is s 2

25

n

 x 

2



  x 

n 1

The sample standard deviation is s  

n

2



14932 96,885  25 25  1



321.7933 .

321. 321.79 7933 33  17.9 17.938 386 6.

The z -score -score associated with the largest value is z  

 x  x  s



102  59.72



2.36 .

17.9386

This observation is a suspect outlier. The observations associated with the one-time one-time customers are 5 of the largest 7 observations. Thus, repeat customers tend to have shorter delivery times than one-time customers.

Copyright © 2018 Pearson Education, Inc. 

 

64

Chapter 2

2.122

Using MINITAB, MINITAB, a scatterplot of the data is: Scatterplot of Var 2 vs Var 1 14 12 10    2 8   r   a    V 6

4 2 0 0

2

4

6

8

Var 1

2.123

Using MINITAB, the scatterplot is: Scatterplot of Var 2 vs Var 1 18 16 14 12    2 10   r   a    V 8 6 4 2 0 1

2

3

4

5

Var 1

2.124

Using MINITAB, the scatterplot is: Scatterplot of RATIO vs DIAMETER 10.0 9.5 9.0 8.5    O    I    T    A    R 8.0 7.5 7.0 6.5 0

10 100 0

200

300

400

500

600

700

DIAMETER

It appears that as the pipe diameter increases, the ratio of repair to replacement cost increases.

Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

65 

2.125.

From the scatterplot of the data, data, it appe appears ars that as as the num number ber of punishments punishments increases increases,, the avera average ge payoff decreases. Thus, there appears to be a negative linear relationship between punishm punishment ent use and average  payoff. This supports tthe he researchers co conclusion nclusion that “winners” “winners” don’t puni punish”. sh”.

2.126

Using MINITAB, MINITAB, the scatterplot of the data is: Scatterplot of Catch vs Search 7000

6000

   h   c 5000    t   a    C

4000

3000 15

20

25

30

35

Search

There is an apparent negative linear trend between the search frequency and the total catch. As the search frequency increases, the total catch tends to decrease. 2.127

Using MINITAB, MINITAB, a scattergram of the data is: Scatterplot of SLUGPCT vs ELEVATION 0.625 0.600 0.575    T    C 0.550    P    G    U    L    S 0.525 0.500 0.475 0.450 0

1000

2000

3000

4000

5000

6000

ELEVATION

If we include the observation from Denver, then we would say there might be a linear relationship between slugging percentage and elevation. If we eliminated the observation from Denver Denver,, it appears that there might not be a relationship between slugging percentage and elevation.

Copyright © 2018 Pearson Education, Inc. 

 

66

Chapter 2

2.128

Using MINITAB, MINITAB, the scatterplot of the data is: Scatterplot of MATH2014 vs MA TH2010 625 600 575 550    4    1    0    2 525    H    T    A    M 500 475 450

450

475

500

525

550

575

600

625

MATH2010

There appears to be a positive linear trend between the Math SAT scores in 2010 and the Math SAT scores in 2014. As the 2010 Math SAT scores increase, the 2014 Math SAT scores scores also tend to increase. 2.129

Using MINITAB, MINITAB, the scatterplot of the data is: Scatterplot of Number vs Hour 400

350

300   r   e    b   m250   u    N 200

150

100 0

2

4

6

8

10

12

Hour

There appears to be a positive linear trend to the data. As the hours increase, the number of accidents tends to increase. 2.130

Using MINITAB, MINITAB, the scatterplot of the data is: Scatterplot of Mass vs Time 7 6 5 4   s   s   a    M3 2 1 0 0

10

20

30

40

50

60

Time

There is evidence to indicate that the mass of the spill tends to diminish as time increases. As time is getting larger, the mass is decreasing.

Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

2.131

a.

67  

Using MINITAB, MINITAB, a scatterplot of the data is: Scatterplot of Year2 vs Year1 55

50

45    2   r   a   e 40    Y

35

30

20

30

40

50

60

Year1

There is a moderate positive trend to the data. As the scores for Year1 increase, the scores for Year2 also tend to increase.  b.

2.132

From the gr graph, aph, two agencie agenciess that had greater greater than expected PARS PARS evaluati evaluation on scores for Yea Year2 r2 were USAID and State.

Using MINITAB, MINITAB, the scatterplot of the data is: Scatterplot of VALUE vs OPINCOME 4000

3500

3000    E    U    L    A    V 2500

2000

1500 0

50

100

150

200

250

300

OPINCOME

There is a positive trend to the data. As operating income increases, the the 2015value also tends to increase. Since the trend is positive, we would recommend that an NFL executive use operating income to predict a team’s current value. 2.133

a.

Using MINITAB, MINITAB, the scatterplot of the data is: Scatterplot of Ratio vs Age 2000

1750

1500   o 1250    i    t   a    R 1000

750

500 45

50

55

60

65

70

75

80

Age

There appears the CEO’s age.to be a weak, negative relationship between a CEO’s ratio of salary to worker pay and Copyright © 2018 Pearson Education, Inc. 

 

68

Chapter 2

 b.

Using MINIT MINITAB AB the descriptive descriptive statistics are: Descriptive Statistics: Ratio  Variable Ratio

N 40

Mean 641.8

StDev 314.8

Minimum 415.0

Q1 481.0

Median 536.5

Q3 660.8

Maximum 1951.0

IQR 179.8

Using the interquartile range, the highly suspect outliers are any observations greater than QU   3 IQR   660.8  317 179.8  1, 200.2 or less than Q L   3  IQR   481.0  3179.8  58.4 . There are 2 highly suspect outliers: outliers: 1,522 and 1,951. Using the z --score, score, any observation greater than 3 standard deviations above or below the mean are highly suspect outliers. outliers. Three standard devia deviations tions above the mean is:  x  x x  641.8  z     3      3(31  4.8)  x  641  .8  944  .4  x  641  .8  x   1, 586.2   314.8  s Three standard deviations below the mean is:  x  x x  641.8  z       3       3(314.8)  x  641  .8    944.4  x  641  .8  x   302.6   314.8  s Using this method, there there is one highly suspect outlier: outlier: 1,951. c.

Removing the observation 1,951, the scatterplot of the data is: Scatterplot of Ratio-Remove1 vs Age-Remove1 1500

1250    1   e   v   o   m1000   e    R     o    i    t   a    R 750

500

45

50

55

60

65

70

75

80

Age-Remove1

By removing the one highly suspect outlier, the relationship is still negative, but it is a stronger, negative relationship. 2.134

Using MINITAB, MINITAB, a scatterplot of the data is: Scatterplot of ACCURACY vs DISTANCE 75

70

65    Y    C    A    R 60    U    C    C    A 55

50

45 280

290

300

310

320

DISTANCE

Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

69 

Yes, his concern is a valid one. From the scatterplot, scatterplot, there appears to be a fairly strong negative relationship between accuracy and driving distance. As driving distance increases, the driving accuracy tend to decrease. 2.135

One way the bar graph can mislead the viewer is that the vertical vertical axis has been cut off. Instead of starting at 0, the vertical axis starts at 12. Another way the bar graph can mislead the viewer is that as the bars get taller, the widths of the bars also increase.

2.136

a.

If you work for Volkswagon, you would choose to use the median number of deaths because this is much lower than the mean. The data are skewed to the right, right, so the median would probably probably be a  better representation representation of the middl middlee of the dist distribution. ribution.

 b.

If you support an eenvironmental nvironmental watch watch group, you would would choose to use the the mean number of deaths  because this is much much greater than tthe he median. The average number of deaths is much high than tthe he median number of deaths.

2.137

a.

The graph might be misleading because the scales on the vertical axes are different. The left vertical axis ranges from 0 to $120 million. million. The right vertical axis ranges ranges from 0 to $20 billion.

 b.

Using MINIT MINITAB, AB, the redrawn redrawn graph is: Time Series Plot of Craigslist, NewspaperAds 18000

Variable

16000

Craigslist NewspaperAds

14000 12000 10000   a    t   a    D 8000 6000 4000 2000 0 2003

2004

2005

2006

2007

2008

2009

Year

Although the amount of revenue produced by Craigslist has increased dramatically from 2003 to 2009, it is still much smaller than the revenue produced by newspaper ad sales. 2.138

a.

This graph is misleading because it looks like as the days are increasing, the number of barrels collected per day are also increasing. However, the bars are the cumulative number of barrels collected. The cumulative value can never dec decrease. rease.

Copyright © 2018 Pearson Education, Inc. 

 

70

Chapter 2

 b.

Using MINIT MINITAB, AB, the graph of the daily collection collection of oi oill is: Chart of Barrells 2500

2000

  s    l    l 1500   e   r   r   a    B

1000

500

0 May-16

May-17

May-18

May-19 May-20 Day

May-21

May-22

May-23

 

From this graph, it shows that there has not been a steady improvement in the suctioning process. There was an increase for 3 days, then a leveling off for 3 days, then a decrease. 2.139

The relative frequency histogram is: Histogram of Class .20

.15

  y   c   n   e   u   q   e   r    f .10   e   v    i    t   a    l   e    R

.05

0 1.125

2.625

4.125 5.625 Measurement Class

7.125

8.625

2.140

  The m mean ean is sensitive tto o extreme values in a data set. Therefore, the m median edian is preferred tto o the mean when a data set is skewed in one direction or the other.

2.141

a.

 z



 x    



  

 b.

 z



 x    



  

c

d.

 z

 z



 x    



50  60 10 50  50 5 50  40

  

10

  x     

50  40 100

  

 1  



0

 z  

 z  

   1 

 .1  

 z  

 70  60

10  70  50

5

 70  40

10

 z    70  40 100

 1 

 z  



 z 





3

 z  

   .3  

 80  60

10  80  50

5  80 

40

10

 z   80  40 100













 .4

Copyright © 2018 Pearson Education, Inc. 

 

 

 Methods for Describing Describing Sets of Data

2.142

2.143

a.

If we assume that the data are about mound-shaped, then any observation with a z -score -score greater than 3 in absolute value would be considered an outlier. outlier. From Exercise 2.139, 2.139, the z -score -score corresponding to 50 is 1, the z -score -score corresponding to 70 is 1, and the z -score -score corresponding corresponding to 80 is 2. Since none of these z -scores -scores is greater than 3 in absolute value, none would be considered outliers.

 b.

From Exercise 2.139, the z -score -score corresponding to 50 is 2, the z -score -score corresponding to 70 is 2, and the z -score -score corresponding corresponding to 80 is 4. Since the z -score -score corresponding to 80 is greater than 3, 80 would be considered an outlier.

c.

From Exercise 2.139, the z -score -score corresponding to 50 is 1, the z -score -score corresponding to 70 is 3, and the z -score -score corresponding corresponding to 80 is 4. Since the z -scores -scores corresponding to 70 and 80 are greater than or equal to 3, 70 and 80 would be considered outliers.

d.

From Exercise 2.139, the z -score -score corresponding to 50 is .1, the  z -score -score corresponding to 70 is .3, and the z -score -score corresponding corresponding to 80 is .4. Since none of these these z -scores -scores is greater than 3 in absolute value, none would be considered outliers.

a.

 x    13  1  10  3  3  30    x    13 2

 x  x

 b.

c.

5



6

 s

 



2

n

4

2



 x 2

 s 2







 x  49  n

7



 x

2



 s





4





2

  x 

 x    4  6  6  5  6  7  34  

 x



 x  34  n

6

2

2

5.67  

 s

2

 x 

2

3 3 

  x  n

2

2



2

  x 

n 1

n

32

2

6 

2

84.75 3

2





28.25



5.20  

6



28.25

 s  



5. 32  

  2

226



0 3



2

2

37.67  

 s

 

2



37. 67

569 .



6. 14  

0

 s  

62  5 2  6 2  7 2

342 6 6 1

0





 

198  

27

 36  

122 4 4 1



 s  

 

 



2

27

 0  1  10  11  11  15

36 

 

 x    4 2

2

49 2 569  7 7 1



n 1

 

252 4 4 1



4

241 

  

2

n



241

2

n 1



2

 x    1

2

 s 2

5  15

2

288  



108 

2

n

 x    3

2

30 2

 6 6  0 

  x 

 x 2 

 x  12  3 n

2

2

288 

n 1

 

2

2

n 1 n

 x    13

 x  25  6.25

2

 1  10  3  3

  x 





 x    3  3  3  3  12    x

a.

n



 x

2

2

2

 x    1  0  1  10  11  11  15  49  

 x

d.



30

 x    13  6  6  0  25    x

2.144

71 



5.3333 5

Copyright © 2018 Pearson Education, Inc. 

 198



 

1.0667  

 s

 

1.067



1. 03  

 

72

Chapter 2

 x    1  4  (33))  0  (3)3)  (6)6)  9    x    (1) 2

 b.

 x



 x  9  $1.5 6

n

 s   11.5



c.

 x   

 x



3 5



d.





  x  n

42

2

2

2

 ( 3)  0  ( 3)  ( 6)

( 9) 2 6 6 1

71  

n 1

 

2





57.5 5

 11.5

2



71

 

dollars squared

$3.39  



4 5

2



5



1 5



1 16



 2.0625

 x

 

2

  3   5

2

 4   5

2

 2   5

2

1   5

2

1    16 

2

 1.2039

 

 x  2.0625  .4125% 5

n



 x

 s 2

 s 2

 x

2

2

2





 

  x 

2

2.06252 5 5 1

1.2039 

n



n 1



.3531 4

 .0883%

squared

 . 30%  

 s  

.0883

(a)

Range = 7  4 = 3

(b)

Range = $4  ($-6) = $10

(c)

Range =

4 5

%

1 16

%



64 80

%

5 80

%



59 80

%  .7 .7375%  

2.145

The range is found by taking the largest measurement in the data set and subtracting subtracting the smallest measurement. Therefore, it only uses tw two o measurements from the the whole data set. The standard deviation uses every measurement in the data set. Therefore, it takes every measurem measurement ent into account—not just two. The range is affected by extreme values more than the standard deviation.

2.146

    

2.147

Using MINITAB, the scatterplot is:

range 4



20 4





Scatterplot of Var 2 vs Var 1 30

25

   2 20   r   a    V 15

10

100

200

300

400

500

Var 1

Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

2.148

a.

The z -score -score is  z  

 x  x  s



30  39 6

 1.5 .

73 

A score of 30 is 1.5 standard deviations deviations below the mean.

 b.

Since the data are mound-shaped and symmetric and 39 is the mean, mean, .5 of the sam sampled pled drug dealers will have WR scores below 39.

c.

If 5% of the drug dealer dealerss have WR scores scores above 49, then 95% wil willl have WR scores below below 49. Thus, th

49 will be the 95  percentile. 2.149

a.

Using MINITAB, the pie chart is: Pie Chart of Blog/Forum Category Company Employees Third Party Not Identified

Not Identified 15.4%

Company 38.5%

Third Party 11.5%

Employees 34.6%

Companies and Employees represent (38.5 + 34.6 = 73.1) slightly more than 73% of the entities creating blogs/forums. Third parties are the least common entity.  b.

Using Chebyshev’s Rule, at leas leastt 75% of the observations observations wil willl fall withi within n 2 standard deviat deviations ions of the mean.  x  2 s  4.2   5  2(12 .02)  4.2   5  24. 04  (   19.79, 28.29)  or (0, 28.29) since we cannot have a

negative number blogs. c.

We woul would d expect the dist distribution ribution tto o be ske skewed wed to the right. right. We know that we cannot have a negative number of blogs/forums. blogs/f standard deviation deviationbecause below the is a deviation negative number. would assume that there orums. are a fewEven very1large observations themean standard is so big We compared to the mean.

2.150

a.

To find relative frequencies, we divide the frequencies of each category by the total number of incidents. The relative frequencies of the number of incidents for each of the cause categories are:  Management System System Cause Category Engineering & Design Procedures & Practices Management & Oversight Training & Communication Communication TOTAL

 Number of Incidents Incidents

Relative Fr Frequencies equencies

27 24 22 10 83

27 / 83 = .325 24 / 83 = .289 22 / 83 = .265 10 / 83 = .120 1

Copyright © 2018 Pearson Education, Inc. 

 

74

Chapter 2

 b.  The Pareto diagram is: Management Systen Cause Category 35 30 25    t   n   e   c   r   e    P

20 15 10 5 0 Eng&Des

Proc&Pract Mgmt&Over Category

Trn&Comm

  c. 2.151

a.

 

The category the highest relative frequency of incidents is Engineering Engineer ing and Design. The category withwith the lowest relative frequency of incidents is Training and Communication. The relative frequency for each response category is found by dividing the frequency by the total sample size. The relative frequency frequency for the category category “Global Marketing” is 235/2863 = .082. The rest of the relative frequencies are found in a similar manner and are reported in the table.  Area Global Marketing Sales Management Buyer Behavior Relationships Innovation Marketing Strategy Channels/Distribution Channels/Dist ribution

Number 235 494 478 498 398 280 213

Relative Frequenc Frequencies ies 235/2863 = .082 494/2863 = .173 478/2863 = .167 498/2863 = .174 398/2863 = .139 280/2863 = .098 213/2863 = .074

Marketing Research Services TOTAL

131 136 2,863

131/2863 = .046 136/2863 = .048 1.00

Relationships and sales management had the most articles published with 17.4% and 17.3%, respectively. Not far behind was Buyer Behavior with 16.7%. Of the rest of the areas, only innovation had more than 10%.

Copyright © 2018 Pearson Education, Inc. 

 

 Methods for Describing Describing Sets of Data

 b.

75 

Using MINIT MINITAB, AB, the pie chart of the dat dataa is: Pie Chart of Area Services Marketing research 4.8% 4.6%

Global Marketing 8.2%

Channells/Distribution 7.4% Sales Management 17.3% Marketing Strategy 9.8%

Inovation 13.9%

Category Global Marketing Sales Management Buyer Behavior Relationships Inovation Marketing Strategy Channells/Distribution Marketing research Services

Buyer Behavior 16.7%

Relationships 17.4%

The slice for Marketing Research is smaller than the slice for Sales Management because there were fewer articles on Marketing Research than for Sales Management. 2.152

a.

The data are time series data because the numbers of bankruptcies were collected over a  period of 8 quarters. quarters.

 b.

Using MINIT MINITAB, AB, the time time series plot iis: s: Time Series Plot of Bankruptcies 14000 12000 10000   s   e 8000    i   c    t   p   u   r    k 6000   n   a    B 4000 2000 0

0 Q1

Q2

Q3

Q4

Q1

Q2

Q3

Q4

Quarter

2.153

c.

There is a generally decreasing trend in the the number of ban bankruptcies kruptcies as tthe he qua quarters rters increase.

a.

Using MINITAB, the pie chart is: Pie Chart of Drivstar 2 4.1%

5 18.4%

3 17.3%

Category 2 3 4 5

4 60.2%

 b.

The average driver driver’s ’s severity of head injury in head-on head-on collisions is is 603.7. Copyright © 2018 Pearson Education, Inc. 

 

76

Chapter 2

c.

Since the mean and median median are cl close ose in in value, value, the the data data should be fairly symmetric. Thus, we can use the Empirical Rule. We know that about 95% of all observations will fall within 2 standard deviations of the mean. mean. This interval is  x  2 s  603  .7  2(18 5.4)  603  .7  370  .8  (232.9, 974.5)   Most of the head-injury ratings will fall between 232.9 and 974.5.

2.154





d.

The z -score -score would be:  z   x x  408 603.7  1.06    s 185.4   Since the absolute value is not very big, this is not an unusual value to observe.

a.

The data collection method was a survey.

 b.

Since the data were 4 different different categories, the the variable is qualitative. qualitative.

c.

Using MINITAB, MINITAB, a pie chart of the data is: Pie Chart of M ade in USA
View more...

Comments

Copyright ©2017 KUPDF Inc.
SUPPORT KUPDF