The following measurements were recorded for the drying time, in hours, of a certain brand of latex paint.
3.4 | 2.5 | 4.8 | 2.9 | 3.6 |
2.8 | 3.3 | 5.6 | 3.7 | 2.8 |
4.4 | 4.0 | 5.2 | 3.0 | 4.8 |
Assume that the measurements are a simple random sample.
(a) What is the sample size for the above sample?
(b) Calculate the sample mean for these data.
(c) Calculate the sample median.
(d) Plot the data by way of a dot plot.
(e) Compute the 20% trimmed mean for the above data set.
(f) Is the sample mean for these data more or less descriptive as a center of location than the trimmed mean?
Solution:
Part A. Sample size, n is the total number of measurements.
n=15 \ \qquad \ \color{DarkOrange} \left( \text{Answer} \right)
Part B. The sample mean, \bar x is computed as follows:
\begin{align*} \bar x & = \sum_{i=1}^{n}\frac{x_{i}}{n} \\ & = \frac{3.4+2.5+4.8+2.9+3.6+2.8+3.3+5.6+3.7+2.8+4.4+4.0+5.2+3.0+4.8}{15} \\ & = \frac{56.8}{15} \\ & = 3.79 \ \text{hours} \ \qquad \ \color{DarkOrange} \left( \text{Answer} \right) \end{align*}
Part C. The sample median, \tilde x is the number at the middle of the arranged measurements in increasing magnitude. There are 15 measurements, n=15. If we arranged the data in increasing magnitude, the median is the measurement in the middle.
2.5, \ 2.8, \ 2.8, \ 2.9, \ 3.0, \ 3.3, \ 3.4, \ \underset{\color{Blue} \text{middle number}}{3.6}, \ 3.7, \ 4.0, \ 4.4, \ 4.8, \ 4.8, \ 5.2, \ 5.6
The middle number is 3.6. That is
\tilde x= 3.6 \ \text{hours} \ \qquad \ \color{DarkOrange} \left( \text{Answer} \right)
Part D. This dot plot has been created using the Statistical Software Rguroo
Part E. The 20% trimmed mean means the average of the measurements left after removing 20% highest and 20% lowest data. This means we remove the 3 highest and 3 lowest numbers. Therefore, the data becomes
\ 2.9, \ 3.0, \ 3.3, \ 3.4, \ 3.6, \ 3.7, \ 4.0, \ 4.4, \ 4.8,
The 20% trimmed mean, \bar x _{tr20} is
\begin{align*} \bar x _{tr20} & = \frac{2.9+3.0+ \cdots+4.8}{9} \\ \bar x _{tr20} & = \frac{33.1}{9} \\ \bar x _{tr20} & = 3.678 \ \text{hours} \ \qquad \ \color{DarkOrange} \left( \text{Answer} \right) \end{align*}
Part F. The sample mean for these data is \bar x = 3.79 \ \text{hours} while the 20% trimmed mean is \bar x _{tr20} = 3.678 \ \text{hours}. Seems like the two means are not really far from each other, but because of the elimination of the extreme values, we can treat the trimmed mean as a better descriptive mean.