Understanding Basic Statistics Solutions 1


 These are my work from taking 'Statistics' class at Hudson County Community College. (Mar 2015-May 2015)


  • Book : Understanding Basic Statistics 
  • ISBN-13:9780495831488, ISBN-10:0495831484
  • For  students, please use these as resources, do not  plagiarize.
  • If you have a question, please leave a comment.
  • Graphs and Formulas may not shown. 

Solutions part 1.
http://jieunkimresume.blogspot.com/2015/05/understanding-basic-statistics.html
Solutions part 2.

http://jieunkimresume.blogspot.com/2015/05/understanding-basic-statistics_21.html


Page 45, #7, Jieun Kim Unit1
 (a) The class width = 25
Min : 236, Max: 360
The Class width, Calculation: (360-236)/5=24.8≈25
Increase the value to 25.
(b) Frequency Table
Class Limits

Lower-Upper
Class Boundaries
Midpoints
Frequency
Relative
Frequency
236-260
235.5-260.5
248
4
0.070
261-285
260.5-285.5
273
9
0.158
286-310
285.5-310.5
298
25
0.438
311-335
310.5-335.5
323
16
0.281
336-360
335.5-360.5
348
3
0.053

(c) Histogram

Class Boundaries
Frequency
235.5-260.5
4
260.5-285.5
9
285.5-310.5
25
310.5-335.5
16
335.5-360.5
3


(d)Relative-Frequency Histogram
X : Finish Times(hours)
Y: Relative Frequency
Class Boundaries
Relative Frequency
235.5-260.5
0.070
260.5-285.5
0.158
285.5-310.5
0.438
310.5-335.5
0.281
335.5-360.5
0.053


(e) Mound-shaped symmetrical histogram




46p, #8.
(a) The class width = 11
min=45
max=109
(109-45)/6=64/6=10/6≈11
Increase the value to 11.
(b) Frequency Table
Class Limits

Lower-Upper
Class Boundaries
Midpoints
Frequency
Relative
Frequency
45-55
44.5-55.5
50
3
0.043 (3/70)
56-66
55.5-66.5
61
7
0.010 (7/70)
67-77
66.5-77.5
72
22
0.314 (22/70)
78-88
77.5-88.5
83
26
0.371 (26/70)
89-99
88.5-99.5
94
9
0.129 (9/70)
100-110
99.5-110.5
105
3
0.043 (3/70)

(c) Histogram

X : Glucose blood levels(mg/100ml), Y : Frequency
Class Boundaries
Frequency
44.5-55.5
3
55.5-66.5
7
66.5-77.5
22
77.5-88.5
26
88.5-99.5
9
99.5-110.5
3


(d) Relative-Frequency Histogram


X : Glucose blood levels(mg/100ml), Y : Relative Frequency

Class Boundaries
Relative
Frequency
44.5-55.5
0.043
55.5-66.5
0.010
66.5-77.5
0.314
77.5-88.5
0.371
88.5-99.5
0.129
99.5-110.5
0.043


(e) Mound-shaped symmetrical histogram


#6 (page 54)
(a) Pareto Chart
- Pareto chart is displayed by the heights of vertical bars, which are arranged in order from highest to lowest. So, the highest number of spearheads is 33, Shannon and the lowest number of spearheads is 8, Blackwater.

 (b) Circle Graph
Bann 19/89≈0.21, 21%,  21% x 360˚= 75.6˚
Blackwater 8/89≈0.09, 9%, 9% x 360˚=32.4˚
Erne 15/89≈0.17, 17%, 17% x 360˚=61.2˚
Shannon 33/89≈0.37, 37%, 37% x 360˚=133.2˚
Barrow 14/89≈0.16, 16%, 16% x 360˚=57.6˚



#12 (page 56)




 #1 (page 60)
(a) Make a stem-and-leaf display.
<Leaves not ordered>
4
4 represents 44years old.
Stem
Leaves
4
4







5
8
2
7
8




6
8
6
6
8
1



7
2
3
0
5
2
3
7
6
8
6
9
4
7
6
5
4
4
9
7
1
1
2
0
3



<Final display with leaves ordered>
4
4 represents 44 years old.
Stem
Leaves
4
4







5
2
7
8
8




6
1
6
6
8
8



7
0
2
2
3
3
5
6
7
8
4
4
4
5
6
6
7
9
9
0
1
1
2
3
7



(b) Their average age is 76years old(sum of their age/32=76) and 69%(22/32) of cowboys' age range is between 70 and 97. So, these data support the quote. Because if the cowboys are drunken, gambling lot, quick to draw and fire their pistols, they die earlier.   

Jieun Kim Unit2-1
#5 (page 81)

2,3,4,5,5
(a) compute the mode, median, and mean.
<Mode>
mode=5. Because the mode of a data set is the value that occurs most frequently. The value,5 occurs the most. So, the value 5 is the mode.
<Median>
median=4.
First, put those numbers in order. 2,3,4,5,5.
These are five number, so the middle number is the third number, 4.

<Mean>
mean=3.8
mean= sum of all entries/number of entries=(2+3+4+5+5)/5=19/5=3.8
(b) If the numbers represented codes for the colors of T-shirts ordered from a catalog, which average(s) would make sense?
Mode makes sense for this case. Because, Two of Code 5 T-shirts are sold. The code 5 T-shirt is popular so far.
(c) If the numbers represented one-way mileages for trails to different lakes, which average(s) would make sense?
Mean makes sense for this case. Because people will compare the distance of the five ways and the mean of distances.
Median makes sense too. Because the median is the central value of an ordered distribution and some people just choose the middle number of the five ways. 
(d) Suppose the numbers represent survey responses from 1 to 5, with 1 = disagree strongly, 2 = disagree, 3=agree, 4 = agree strongly, and 5= agree very strongly. Which averages make sense?  
Mean makes sense for this case. Because it needs mean for sum of all entries to get the mean score of survey responses.

#9 (page 82)
(a)
Mean=3.27
Mean=sum of all entries/number of entrie=36/11≈3.27
sum=36=2+3+1+1+3+4+6+9+3+1+3
Median=3
Numbers in order : 1,1,1,2,3,3,3,3,4,6,9
These are 11 numbers, 6th number is the median. So, number 3 is the median. 
Mode=3
Number 3 occurs most frequently, so number 3 is the mode.
(b)
Mean=4.21
Mean=sum of all entries/number of entries=59/14≈4.21
sum=8+1+1+0+6+7+2+14+3+0+1+13+2+1=59
Median=2
Numbers in order : 0,0,1,1,1,1,2,2,3,6,7,8,13,14
These are 14 numbers, 7th and 8th numbers are the middle pair numbers.
(2+2)/2=2
Mode=1
Number 1 occurs most frequently, so number 1 is the mode.
(c) compare the results of parts (a) and (b).
At the upper canyon,  11 accidents happened and resulted 36 injured people for a 5-year period. The mean is 3.27, the median is 3, and the mode is 3. So, each accident resulted usually 3(rounded) injured.
At the lower canyon,  14 accidents happened and resulted 59 injured people for a 5-year period. The mean is 4.21, the median is 2 and the mode is 1. But at lower canyon, each accident resulted various number of injured people. Sometimes, there was accident, but no one injured. Or there was accident and it was quite serious so it resulted 13 injured or 14 injured.  So, I think the mean can be the best average for the lower canyon accidents. In this case, the lower canyon has one more injured than the upper canyon for each accident. But, if compare median or mode to the upper canyon, it is different. So, the lower canyon' data needs trimming.

(d)
14 x 5%=0.7≈1
I eliminate one data value from the bottom of the list and one from the top. Then take the mean of the remaining 12 entries.
0,0,1,1,1,1,2,2,3,6,7,8,13,14
0
0
1
1
1
1
2
2
3
6
7
8
13
14

0
1
1
1
1
2
2
3
6
7
8
13


sum of the 12 entries=45
mean=45/12=3.75
After I trimmed data of the lower canyon,  mean is 3.75 and the upper canyon's mean is 3.27. So difference is 0.27. So, the lower canyon has 0.27 more injured than the upper canyon.


















#10(page 82)
(a) compute the mean, median, and mode of the ages.
First, I made these ages in order.
21
21
22
22
22
22
23
23
23
23
24
24
24
25
25
25
25
25
25
25
26
26
26
26
27
27
28
28
28
29
29
29
29
29
30
31
31
32
33
37

<mean>
mean=sum  of all entries/number of entries=1050/40=26.25
Mean is 26.25
<median>
For forty numbers, 20th and 21th are the pair of middle numbers. So, 25 and 26 are the pair of middle numbers. To find the value half-way between them, (25+26)/2=51/2=25.5
Median is 25.5
<mode>
Age
21
22
23
24
25
26
27
28
29
30
31
32
33
37
Frequency
2
4
4
3
7
4
2
3
5
1
2
1
1
1

The mode of a data set is the value that occurs most frequently. So the value 25 occurs most frequently.
Mode is 25.

(b) compare the average.
 Mode has the lowest average value because there are 7 foot ball players who are 25 years old. Median is 25.5. I think Mode and Median didn't cover all range of the football player's age. I think that the Mean which has the highest average value is the most accurate average. Because it included 37years old football player and it included all data as well as.  






#12 (page 83)
Lab
Major test
Major test
Final exam
25%
22.5%
22.5%
30%
92
81
93
85

 =87.65=weighted mean.






Jieun Kim Unit 2-2
#5 (page 93)
(a) (i),(ii),(iii). Because (i) has consecutive numbers and The numbers' distance from the mean in (ii) is shorter than the numbers' distance from the mean in (iii). So, I think that standard deviation of (iii) will have the highest value.
(b) The difference in standard deviation between (i) and (ii) is greater than the difference in standard deviation between (ii) and (iii).
(i)Value
Mean
x-Mean
(ii)Value
Mean
X-Mean
(iii)Value
Mean
X-Mean
8
10
2
7
10
3
7
10
3
9
10
1
9
10
1
8
10
2
10
10
0
10
10
0
10
10
0
11
10
1
11
10
1
12
10
2
12
10
2
13
10
3
13
10
3

 Because the difference between 8 in (i) and the mean is 2. The difference between 7 in (ii) and the mean is 3. So, there is difference between these differences (3-2=1). But the difference between 7 in (ii) and the mean is 3. The difference between 7 in (iii) and the mean is also 4. So, there is no difference(3-3=0).  So, I expect that the difference in standard deviation between (i) and (ii) is greater than the difference in standard deviation between (ii) and (iii).












 #7 (page 93)
x:23,17,15,30,25
(a) range=15
Range=Largest value-Smallest value=30-15=15
(b) means 23+17+15+30+25=110
 means 232+172+152+302+252=529+289+225+900+625=2568
(c)
The sample variance
 ==(2568-2420)/4=148/4=37

The sample standard deviation
=









(d) Defining formulas (sample statistic)
The sample variance
==148/(5-1)=148/4=37
Data Value x
Mean 110/5=22
23
22
1
1
17
22
-5
25
15
22
-7
49
30
22
8
64
25
22
3
9
Total 110


Total 148

The sample standard deviation
=

(e)
Data Value x
Mean==110/5=22
23
22
1
1
17
22
-5
25
15
22
-7
49
30
22
8
64
25
22
3
9
Total 110


Total 148

the population variance  ==148/5=29.6
the population standard deviation ==5.44




#6 (page 106)
Arranged data.
3
6
7
14
15
17
18
20
22
22
24
25
25
26
29
31
31
32
42
72

1. The lowest value of the data set = 3
2. Q1=15
3. The median=(22+24)/2=23
4. Q3=31
5. The highest value of the data set=72
Interquartile range=Q3-Q1=31-15=16
(a)
download
(I used website to make a chart) www.mathwarehouse.com







(b)
download
The data of nurses(question5) have a larger range that the clerical staff's data. These two chart's median are both 23. But the median of nurses' data is to the right, the distribution is negatively skewed. The median of staff's data is near the center of the box, the distribution is approximately symmetric.
The middle halves of nurses' data are 8 and 29. The middle halves of staff's data are 15 and 31. Their first middle halves are quite different. But the second middle halves are similar.
The interquartile of nurses' hour is 29-8=21. The interquartile of staff's hour is 31-15=16. The distance to the extreme value of nurses' data is 42-21=21 and the distance to the extreme value of staff's data is 72-16=56. So, the distance to the extreme value of staff's data is much further than the nurses' one. These charts also show that staff's box is further from the extreme value 72.













#7 (page 106)
(a)
1. The lowest value of the data set = 17
2. Q1=21.5
3. The median=24
4. Q3=27
5. The highest value of the data set=38
Interquartile range=Q3-Q1=27-21.5=5.5
(b) The Bachelor's degree percentage of Illinois is 26% and this rate falls into Quartile3. Because 26% is higher than the median and lower than Q3.  

(Section 5.1) #3 (page 164)
(a)  An event A that is certain to occur means that the probability of A is 1.
Let's say that there are 20 students studying Statistics and they all wear a school uniform.
In this case, the probability of wearing a school uniform is P(A)=20/20=1.

(b) An event B that is impossible means that the probability of B is 0.
There are 20 girls attending prom, no one wears a school uniform.
In this case, the probability of wearing a school uniform is P(B)=0/20=0.

#6 (page 165)
(a) -0.41 cannot be the probability of some event. Because a probability formula is
.
0≤Number of outcomes favorable to event≤1 is the only possible range.

(b)1.21 cannot be the probability of some event. The reason is same as answer from (a)
 0≤Number of outcomes favorable to event≤1 is the only possible range. But, the 1.21 is over 1.

(c) 120% cannot be the probability of some event. 120% equals to 1.2 and it has the same reason.
1.2 is not between 0≤Number of outcomes favorable to event≤1.

(d) 0.56 can be the probability of an event. Because it is between 0≤Number of outcomes favorable to event≤1. For example, I threw a coin 100 times and I got a tail 56 times. In this case probability of getting tail is 0.56



 #10 (page 165)
(a) sample space : 1,2,3,4,5,6
These outcomes are equally likely. Because, the shape of dice is a cubic and each side has same length. So, the movement of rolling a dice doesn't affect a outcome.
(b)
P(getting 1)=1/6
P(getting 2)=1/6
P(getting 3)=1/6
P(getting 4)=1/6
P(getting 5)=1/6
P(getting 6)=1/6
 The sum of the probabilities of all simple events in a sample space is 1.
1/6+ 1/6+ 1/6+ 1/6+ 1/6+ 1/6=1
The sum of the probabilities of all simple events in a sample space should be 1. Because, in this case, all the possible outcomes are 6 of sample space. 6/6=1.
(c)
P(getting 1)=1/6
P(getting 2)=1/6
P(getting 3)=1/6
P(getting 4)=1/6
1/6+ 1/6+ 1/6+ 1/6=4/6=2/3≈0.67
(d) Getting 5 or 6 is mutually exclusive events.
P(getting 5)=1/6
P(getting 6)=1/6
1/6+ 1/6=2/6=1/3≈0.33

#13 (page 166)
(a) 58/127 ≈0.457
58 of 127 people enter the store.
(b) 25/58 ≈0.431
25 of 58 people bought something.
(c) 58/127 x 25/58=25/127≈0.197
P(A)=58/127 and P(B)=25/58 are independent, then P(A and B)= P(A)xP(B).
(d)1-(25/58)=33/58≈0.569
Because the complement of P(A) equals to 1-P(A).

 (Section 5.2) #5 (page 180)
(a)P(A and B)=P(A) x P(B)
(b)P(B I A)=P(A and B)/P(A), when P(A)≠0.
(c)P( IB)=P( and B)/P(B), when P(B)≠0.
(d)P(A or B)=P(A)+P(B)
(e)P(  or A)=P( )+P(A)

#8 (page 181)
*Total number of arches in park=288
(a) 111/288≈0.39
(b) 0.28≈81/288 = 30/288 + 33/288 + 18/288
(c) 0.82≈237/288 = 111/288 + 96/288 + 30/288
(d) 0.55≈159/288 = 96/288 + 30/288 + 33/288
(e) 0.0625=18/288
#11 (page 181)
(a) 5/36≈0.139
Possible outcomes of getting a sum of 6:  5
 (1,5) (2,4) (3,3) (4,2) (5,1)
Sample space : 36
 (1,1) (1,2) (1,3) (1,4) (1,5) (1,6)
(2,1) (2,2) (2,3) (2,4) (2,5) (2,6)
(3,1) (3,2) (3,3) (3,4) (3,5) (3,6)
(4,1) (4,2) (4,3) (4,4) (4,5) (4,6)
(5,1) (5,2) (5,3) (5,4) (5,5) (5,6)
(6,1) (6,2) (6,3) (6,4) (6,5) (6,6)
There is no overlap because the colors of dice are different.

(b)3/36=1/12≈0.083
 Possible outcomes of getting a sum of 4 : 3
(1,3) (2,2) (3,1)
Sample space : 36
 (1,1) (1,2) (1,3) (1,4) (1,5) (1,6)
(2,1) (2,2) (2,3) (2,4) (2,5) (2,6)
(3,1) (3,2) (3,3) (3,4) (3,5) (3,6)
(4,1) (4,2) (4,3) (4,4) (4,5) (4,6)
(5,1) (5,2) (5,3) (5,4) (5,5) (5,6)
(6,1) (6,2) (6,3) (6,4) (6,5) (6,6)
There is no overlap because the colors of dice are different.

(c)These are mutually exclusive. Because those outcomes are different and it is not overlapped each other outcomes.
Possible outcomes of getting a sum of 6 : 5 , (1,5) (2,4) (3,3) (4,2) (5,1)
Possible outcomes of getting a sum of 4 : 3 , (1,3) (2,2) (3,1)



#15 (page 182)
(a) The outcomes on the two cards are independent. Because, after drawing the first card, the first card was not replaced. So, it doesn't affect the second drawing.
(b)P(ace on 1st card and king on 2nd)=4/52 x 4/51 = 16/2652=4/663≈0.006033
(c) P(king on 1st card and ace on 2nd)=4/52 x 4/51 = 16/2652=4/663≈0.006033
(d) P(drawing an ace and a king in either order)=16/2652 + 16/2652= 32/2652≈0.012066


 #17 (page 182)
(a) 27/100 + 14/100 + 22/100= 63/100= 0.63
(b) 15/100 + 22/100 + 27/100 + 14/100 = 78/100=0.78
(c) 27/100 + 14/100 = 41/100 = 0.41
(d) 22/100 + 27/100 = 49/100 = 0.49
How would you respond? Because the class limits are different. '13 and order' means that it could be 13, 14, 15, 16, 17 and 18 years old.




 #5 (page 193)
(a) 8 sequences.
WP_20150412_004[1].jpg
(b) 3 sequences. (HHT), (HTH), (THH).
(c) 3/8

#9 (page193)
4x3x2x1=24.
Because there are four choices for the first wire, three choices for the second, two choices for the third and only one for the fourth.


#11 (page 193)
4x3x3=36
36 of different lawn plots she need.
I used the Multiplication rule of counting.

#13 (page194)



* For the questions 18, 25, and 27, I used the Counting rule for combinations.
#18 (page 194)

 #25 (page 194)
==


#27 (page 194)
(a)===
(b)
(c)


Read all of the highlighted information on pages 204-209 as well as complete the suggested Study Guide exercises #1 (page 204), #2 (page 206-207), and #3 (page 209).
Submit your solutions to problems

 #2 (page 210)
(a) Continuous variables. Because the speed is the measurement. 
(b) Discrete variables. Because the age is countable.
(c) Discrete variables. Because the number of book is countable.
(d) Continuous variable. Because the weight is countless number of values in a line interval.
(e) Discrete variables. Because the number of lightning strikes is countable.

#3 (page 210)
(a) The sum of probabilities of all the events in the sample space is 1.
x
0
1
2

P(x)
0.25
0.60
0.15
So, (a) is a valid probability.
(b) The sum of probabilities of all the events in the sample space is more than 1.
x
0
1
2

P(x)
0.25
0.60
0.20
.05
The sum of probabilities of all the events in the sample space must equal 1. So, (b) is not a valid probability.







#7 (page 211)
(a) It is a valid probability distribution. Because, the sum of the percentage is 100%. The sum of probabilities must equal 1 (100%).
(b)
(c) µ=32.3
Midpoint x
10
20
30
40
50
60
Percent of super shoppers
21%
14%
22%
15%
20%
8%

2.1
2.8
6.6
6
10
4.8
µ=2.1+2.8+6.6+6+10+4.8=32.3

(d)  ≈16
  =-
=21+56+198+240+500+288-1043.29=1303-1043.29=259.71
 ==≈16


#11 (page 212)

(a)
P(win)= 
P(not win)==
(b)

Win
Not win
Gain X
$35
$-15
Probability P(x)

0.73
-14.69
Lisa's expected earning is $0.73
Lisa contributed $13.96 to the hiking club.
$0.73+ (-14.69)=-13.96


Read all of the highlighted information on pages 214-229 as well as complete the suggested Study Guide exercises #4 (page215), #5 (page 220-221), #6 (page 227-228), and #7 (page 229).
Submit your solutions to problems
(Section 6.2) #9 (page 223)
(a) = = 0.125
n=3, r=3, P=0.5
(b) = = 0.375
n=3, r=2, p=0.5
(c) P(r≥2)=P(3)+P(2)=0.125+0.375
n=3, r≥2, p=0.5
(d) = = 0.125


#12 (page 224)
(a) n=7, r=0, p=0.1
P(0)=0.478
(b) n=7, r≥1, p=0.1
P(r≥1)=P(1)+P(2)+P(3)+P(4)=0.372+0.124+0.023+0.003=0.522
(c) n=7, r≤2, p=0.1
P(r≤2)=P(0)+P(1)+P(2)=0.478+0.372+0.124=0.974





#13 (page 224)
(a) n=6, r=6, p=0.90
P(6)=0.531
(b) n=6, p=0.90, r=0
P(0)=0.000
(c) n=6, p=0.90, r≥4,  r=4,5,6
P(r≥4)=P(4)+P(5)+P(6)=0.098+0.354+0.531=0.983
(d) n=6, p=0.90 r≤3, r=0,1,2 or 3
P(r≤3)=P(0)+P(1)+P(2)+P(3)=0.000+0.000+0.001+0.015=0.016
















(Section 6.3) #4 (page 231)
(a) P=0.50 is the distribution symmetric.
The expected value
=np=10x0.5=5
The distribution (n=10, p=0.5, 0≤r≤10) is centered over 5.
(b) The distribution for small values of p skewed right.
(c) The distribution for large values of p skewed left.
 #5 (page 231)
(a)
 Symmetric distribution
(b)
 Skewed right
(c)
 Skewed left
(d) Based on the histograms, both charts are binomial probability distributions. So, (b) is skewed to right and (c) is skewed to left.


(e)  If the probability of success is p=0.73, the distribution is skewed to left. Because p=0.73 is between p=0.70 and p=0.75 and n=5, p=0.70 is also skewed to left.

#6 (page 231)
(a)
 
(b) µ=np=8x0.01=0.08
(c) q=1-p=1-0.01=0.99
(d)0.28
=npq=8x0.01x0.99=0.0792
==0.28









#7 (page 235)
(a)
(b)
r≥6, n=10, p=0.55
P(6)+P(7)+P(8)+P(9)+P(10)=0.238+0.166+0.076+0.021+0.003=0.504
(c)
The expected number of claims=µ=np=10x0.55=5.5
The standard deviation ≈1.57
=npq=10x0.55x0.45=2.475
q=1-p=1-0.55=0.45
=≈1.57

#9 (page 235)
(a) n=16, p=0.5, r≥12
P(12)+P(13)+P(14)+P(15)+P(16)=0.028+0.009+0.002+0.000+0.000=0.039
(b) n=16, p=0.5, r≤7
P(7)+P(6)+P(5)+P(4)+P(3)+P(2)+P(1)+P(0)=0.175+0.122+0.067+0.028+0.009+0.002+0.000+0.000=0.403
(c) µ=np=16x0.5=8


#11 (page 235)
(a)

(b)
n=10, p=0.25, r≤1
P(0)+P(1)=0.056+0.188=0.244

n=10, p=0.75, r≥1
P(1)+P(2)+P(3)+P(4)+P(5)+P(6)+P(7)+P(8)+P(9)+P(10)
=0.000+0.003+0.016+0.058+0.146+0.250+0.282+0.188+0.056=0.999

(c) µ=np=10x0.75=7.5

(d) ≈1.37
=npq=10x0.75x0.25=1.875
=≈1.37


(Section 7.1) #5 (page 249)                                                                                   Jieun Kim 5-1
5.
(a) 49.85%
Because the normal curve is symmetric and has bell-shaped with a single peak. So, the percentage of the area under the normal curve is 49.85%, a half of total percentage.
(34%+13.5%+2.35%=49.85%)
(b) 68%
µ - σ = 34%, µ + σ = 34%, 34%+34%=68%
(c) 99.7%
µ - 3σ = 49.85%, µ + 3σ = 49.85%, 49.85% + 49.85% = 99.7%










#7 (page249)
µ=65, σ=2.5
µ - 3σ
µ - 2σ
µ - σ
µ
µ + σ
µ + 2σ
µ + 3σ
57.5
60
62.5
65
67.5
70
72.5
Between µ - 3σ and µ - 2σ : 2.35%
Between µ - 2σ and µ - σ : 13.5%
Between µ - σ and µ : 34%
Between µ  and µ + σ : 34%
Between µ + σ and µ + 2σ : 13.5%
Between µ + 2σ and µ + 3σ : 2. 35%
(a) 49.85 % (34%+13.5%+2.35%=49.85%)
(b) 49.85% (2.35%+13.5%+34%=49.85%)
(c) 68% (34%+34%=68%)
(d) 95% (13.5%+34%+34%+13.5%=95%)
#10 (page250)
µ=7.6, σ=0.4
µ - 3σ
µ - 2σ
µ - σ
µ
µ + σ
µ + 2σ
µ + 3σ
6.4
6.8
7.2
7.6
8
8.4
8.8
Between µ - 3σ and µ - 2σ : 2.35%
Between µ - 2σ and µ - σ : 13.5%
Between µ - σ and µ : 34%
Between µ  and µ + σ : 34%
Between µ + σ and µ + 2σ : 13.5%
Between µ + 2σ and µ + 3σ : 2. 35%
(a) 15.85% (13.5+2.35%=15.85%)
(b) 83.85% (2.35%+13.5%+34%+34%=83.85%)
(c) About 135 cups
850cups x 0.1585 = 134.7 ≈135
(Section 7.2) #7(page 259)
(a) Robert, Juan, Linda. Because their z score is above 0.
(b) Joel. Because z score 0 is the mean.
(c) Susan, Jan. Because their z score is below 0.
(d) µ=150, σ=20
Name
Robert
Juan
Susan
Joel
Jan
Linda
z
1.10
1.70
-2.00
0.00
-0.80
1.60
X=zσ+ µ
172=1.10x20+150
184=1.70x20+150
110=-2.00 x20+150
150=0.00 x20+150
134=-0.80 x20+150
182=1.60 x20+150
x
172
184
110
150
134
182
 
 #9 (page 259)
(a) -1 < z
(b) z <-2
(c) -2.67 < z < 2.33
4.5 < x
(4.5-4.8)/0.3 < z
-0.3/0.3 < z
-1 < z

x < 4.2
z < (4.2-4.8)/0.3
z < -0.6/0.3
z <-2
4.0 < x < 5.5
(4.0-4.8)/0.3 < z < (5.5-4.8)/0.3
-0.8/0.3 < z < 0.7/0.3
-2.67 < z < 2.33

(d) x < 4.368
(e) 5.184 < x 
(f) 4.125<x<4.5
z < -1.44
x < -1.44 x 0.3 + 4.8
x < 4.368
1.28 < z
1.28 x 0.3 + 4.8 < x
5.184 < x 
-2.25<z<-1.00
-2.25x0.3+4.8<x<-1.00x0.3+4.8
4.125<x<4.5

(g) A female had an RBC count of 5.9 or higher would be considered unusually high because z score is greater than 3 and the percentage of z≥3.67 is out of 99.7% of area. So, an RBC count of 5.9 or higher is unusually high.
x≥5.9
z≥(5.9-4.8)/0.3
z≥3.67




 #13 (page260)
z=-1.32
Area to the left of -1.32 = 0.0934

#25 (page 260)
Between z=0.32 and z=1.92
Area of Between z=0.32 and z=1.92
0.9726-0.6255=0.3471



 #37(page 260)
P(z≥-1.20)=1.000-P(z≤-1.20)=1.000-0.1151=0.8849

 #43 (page 260)
P(0 ≤ z ≤ 1.62)=P(z≤ 1.62)-P(z≤0)=0.9474-0.5000=0.4474






 #47 ( page 260)

P(-0.45≤ z ≤ 2.73)=P(z≤2.73)-P(z≤-0.45)=0.9968-0.3264=0.6704


 (Section7.3) #11 (page 270)                                                                                    Jieun Kim 5-2
P(x≥30)=P(z≥)=P(z≥)=P(z≥2.94)=1-0.9984=0.0016
#15 (page271)
6%=0.0600
The closest area is 0.0594. This is to the left of z=-1.56.
#21 (page271)
82%=0.8200
1-0.8200=0.1800
The closest area is 0.1788. This is to the right of z=-0.92.
#25 (page271)
(a)
P(x>60)=P(z>)=P(z>-1)=1-0.1587=0.8413
(b)
P(x<110)=P(z<)=P(z<)=P(z<1)=1-0.8413=0.1587
(c)
P(60<x<110)=(a)-(b)=0.8413-0.1587=0.6826
Apply    to prove it.
1-0.6826=0.3174
0.3174/2=0.1587
(d)
P(x>140)=P(z>)=P(z>)=P(z>2.2)=1-0.9868=0.0132

 #27 (page 271)
(a)
P(x<3.0)=P(z<)=P(z<)=P(z<-2.33)=0.0099
(b)
P(x>7.0)=P(z>)=P(z>)=P(z>2.11)=1-0.9826=0.0174
(c)
P(3.0<x<7.0)=1-0.0099-0.0174=0.9727

(Section7.5) #9 (page 287)
(a)
=µ=15
=σ/=14/=14/7=2
P(15≤≤17)=P(≤z≤)=P(0≤z≤1)=0.8413-0.5000=0.3413
(b)
=µ=15
=σ/=14/=14/8=1.75
P(15≤≤17)=P(≤z≤)=P(0≤z≤1.14)=0.8729-0.5000=0.3729
(c)
z of (a) = < z of (b) =
Because dividing smaller number results larger number.

#11 (page 287)
(a)
P(x<74.5)=P(z<)=P(z<)=P(z<-0.63)=0.2643
(b)
P(z<)=P(z<)=P(z<)=P(z<-2.78)=0.0027
(c)
I will be suspicious of the loader because the probability of the weight of coal in one car was less than 74.5 tons is approximately 26%. But the probability of the weight of coal in 20 cars was less than 74.5 tons is only 0.27%. So, I won't be suspicious of the second case.


#13 (page 287)
(a)
P(x<40)=P(z<)=P(z<)=P(z<-1.80)=0.0359
(b) P(<40)=P(z<)=P(z<)=P(z<)=P(z<-2.54)=0.0055
(c) P(<40)=P(z<)=P(z<)=P(z<)=P(z<-3.11)=0.0009
(d) P(<40)=P(z<)=P(z<)=P(z<)=P(z<-4.03)=0.0000
(e) The probabilities decreased as n increased. If more samples has lower blood glucose than 40, that is very serious case and it barely happens based on the probabilities of (a), (b), (c) and (d). In addition, the opposite side of the normal curve also happens very rare, because the probability is also close to 0.0000.


(Section7.6) #5 (page 294)
(a)
µ=np=200 x 0.88= 176
σ===≈4.60
P(x≥50)=P(z≥)=P(z≥)=P(z≥-27.39)=1.0000

(b)
µ=np=200 x 0.09= 18
σ===≈4.05
P(x≥50)=P(z≥)=P(z≥)=P(z≥7.90)=0.0000








#7 (page 295)
(a)
P(x≥15)=P(z≥)=P(z≥)=P(z≥-2.25)=1-0.0122=0.9878
(b)
P(x≥30)=P(z≥)=P(z≥)=P(z≥0.72)=1-0.7642=0.2358
(c)
P(25<x<35)=P(z<)-P(z<)=P(z<-P(z<)=P(z<1.71)-P(z<-0.27)=0.9564-0.3936=0.5628
(d)
P(x>40)= P(z>)=P(z>)= P(z>2.71)=1-0.9966=0.0034

 #9 (page 295)
n=66, p=0.80, q=1-0.80=0.20
µ=np=66x0.80=52.8
σ===≈3.25
(a)
P(x≥47)=P(z≥)=P(z≥)=P(z≥-1.78)=1-0.0375=0.9625
(b)
P(x≤58)=P(z≤)=P(z≤)=P(z≤1.6)=0.9452


n=66, p=0.20, q=1-0.20=0.80
µ=np=66x0.20=13.2
σ===≈3.25
(c)
P(x≥15)=P(z≥)=P(z≥)=P(z≥0.55)=1-0.7088=0.2912
(d)
P(x<10)=P(z<)=P(z<)P(z<-0.98)=0.1635



Jieun Kim 6-1

 (Section8.1) #11 (page 316)
(a)
Critical value of 80%=1.28
E=1.28=1.28x0.085≈0.11
-E<µ<+E
3.15-0.11<µ<3.15+0.11
3.04gram < µ < 3.26gram
(b) The distribution of bird's weight should be normal curve and it requires standard deviation(σ) and sample size.
(c) The level of confidence is 80% and with this, I can get an interval that includes these bird' s weight distribution.  It also means that there is 80% chance that the confidence interval is one of the intervals that contains the population average weight of birds.  
(d) E=1.28×=0.08
1.28x0.33x=0.08
=0.08=0.19
=
=
n=10000/361=27.7≈28






#13 (page 317)
(a)
Critical value of 99%=2.58
E==2.58=2.58x1.12≈2.88
-E<µ<+E
37.5-2.88<µ<37.5+2.88
34.62<µ<40.38
(b) It needs sample size which is larger than 30 and the standard deviation(σ)
(c) 99% of confidence level means, there is 99% chance that the confidence interval is one of the intervals that includes the population average blood plasma level.
(d) E=2.58×=2.50
=2.507.50=0.13=
=
=
n=10000/169=59.17≈60









#16 (page 317)
Critical value of 90%=1.645
(a)
E=1.645=1.645x=1.645x2521.61≈4148
-E<µ<+E
50340-4148<µ<50340+4148
46192<µ<54488
(b)
E=1.645=1.645=1.645x1606.56=2642.79≈2643
-E<µ<+E
50340-2643<µ<50340+2643
47697<µ<52983
(c)
E=1.645=1.645=1.645x719.82=1184.11≈1184
-E<µ<+E
50340-1184<µ<50340+1184
49156<µ<51524
(d)
As the standard deviation decrease, the margin of error also decrease.
σ :16920>10780>4830
E :4148>2643>1184
(e)
As the standard deviation decrease, the length of a 90% confidence interval also decrease.
The length of interval : 8296>5286>2368


(Section 8.2) #1 (page 326)
d.f.=n-1=18-1=17
=2.110

#3 (page326)
d.f.=n-1=22-1=21
=1.721

#11 (page 327)
(a)
sample mean=11450/9=1272.22≈1272
x
1189
1271
1267
1272
1268
1316
1275
1317
1275
1413721
1615441
1605289
1617984
1607824
1731856
1625625
1734489
1625625
=14577854
==131102500
=====1363.75
s==36.92≈37
(b)
d.f.=n-1=9-1=8
E==1.860x37/=1.860x37/3=1.860x12.33=22.94≈23
1272-23<µ<1272+23
1249<µ<1295


#13 (page 327)
(a)
x
68
104
128
122
60
64
4624
10816
16384
14884
3600
4096

=54404
==298116
=====943.8≈944
s=≈30.7
=546/6=91
(b)E==1.301=1.301x12.53=16.3
91-16.3<µ<91+16.3
74.7<µ<107.3



#15 (page 327)
x
9.3
8.8
10.1
8.9
9.4
9.8
10.0
9.9
11.2
12.1
86.49
77.44
102.01
79.21
88.36
96.04
100
98.01
125.44
146.41

(a)
=999.41
==9900
===≈1.046
s=≈1.02
=99.5/10=9.95
(b)
E==4.781x=4.781x0.32=1.53
9.95-1.53<µ<9.95+1.53
8.42< µ<11.48
(c)
All the values are above 6mg/dl and 99% confidence interval is also above 6mg.dl. So, these people are not anymore issue with a calcium deficiency. 





Solutions part 1.
http://jieunkimresume.blogspot.com/2015/05/understanding-basic-statistics.html
Solutions part 2.

http://jieunkimresume.blogspot.com/2015/05/understanding-basic-statistics_21.html


Comments