The mean (or average) of observations is the sum of the values of all the observations divided by the total number of observations.
Where:
This method is used when the numerical values of \(x_i\) and \(f_i\) are large.
Where:
This method is used when all the \(d_i\)'s have a common factor.
Where:
The marks obtained by 30 students of Class X in a Mathematics paper:
| Marks obtained (xi) | 10 | 20 | 36 | 40 | 50 | 56 | 60 | 70 | 72 | 80 | 88 | 92 | 95 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Number of students (fi) | 1 | 1 | 3 | 4 | 3 | 2 | 4 | 4 | 1 | 1 | 2 | 3 | 1 |
Solution:
We calculate \(f_i x_i\) for each class:
| Marks obtained (xi) | Number of students (fi) | fixi |
|---|---|---|
| 10 | 1 | 10 |
| 20 | 1 | 20 |
| 36 | 3 | 108 |
| 40 | 4 | 160 |
| 50 | 3 | 150 |
| 56 | 2 | 112 |
| 60 | 4 | 240 |
| 70 | 4 | 280 |
| 72 | 1 | 72 |
| 80 | 1 | 80 |
| 88 | 2 | 176 |
| 92 | 3 | 276 |
| 95 | 1 | 95 |
| Total | \(\sum f_i = 30\) | \(\sum f_i x_i = 1779\) |
Now, \(\bar{x} = \frac{\sum f_i x_i}{\sum f_i} = \frac{1779}{30} = 59.3\)
Therefore, the mean marks obtained is 59.3.
Percentage distribution of female teachers in primary schools:
| Percentage of female teachers | 15-25 | 25-35 | 35-45 | 45-55 | 55-65 | 65-75 | 75-85 |
|---|---|---|---|---|---|---|---|
| Number of States/U.T. | 6 | 11 | 7 | 4 | 4 | 2 | 1 |
Solution:
First, we find the class marks (\(x_i\)) and then calculate using all three methods:
| Class Interval | fi | xi | di = xi - 50 | ui = (xi - 50)/10 | fixi | fidi | fiui |
|---|---|---|---|---|---|---|---|
| 15-25 | 6 | 20 | -30 | -3 | 120 | -180 | -18 |
| 25-35 | 11 | 30 | -20 | -2 | 330 | -220 | -22 |
| 35-45 | 7 | 40 | -10 | -1 | 280 | -70 | -7 |
| 45-55 | 4 | 50 | 0 | 0 | 200 | 0 | 0 |
| 55-65 | 4 | 60 | 10 | 1 | 240 | 40 | 4 |
| 65-75 | 2 | 70 | 20 | 2 | 140 | 40 | 4 |
| 75-85 | 1 | 80 | 30 | 3 | 80 | 30 | 3 |
| Total | 35 | 1390 | -360 | -36 |
Direct Method: \(\bar{x} = \frac{\sum f_i x_i}{\sum f_i} = \frac{1390}{35} = 39.71\)
Assumed Mean Method: \(\bar{x} = a + \frac{\sum f_i d_i}{\sum f_i} = 50 + \frac{-360}{35} = 39.71\)
Step Deviation Method: \(\bar{x} = a + \left( \frac{\sum f_i u_i}{\sum f_i} \right) \times h = 50 + \left( \frac{-36}{35} \right) \times 10 = 39.71\)
Therefore, the mean percentage of female teachers is 39.71%.
For grouped data, the mode is calculated using the formula:
Where:
Family size distribution in 20 households:
| Family size | 1-3 | 3-5 | 5-7 | 7-9 | 9-11 |
|---|---|---|---|---|---|
| Number of families | 7 | 8 | 2 | 2 | 1 |
Solution:
The maximum class frequency is 8, and the class corresponding to this frequency is 3-5. So, the modal class is 3-5.
Now:
Using the formula:
\(\text{Mode} = 3 + \left( \frac{8 - 7}{2 \times 8 - 7 - 2} \right) \times 2 = 3 + \frac{2}{7} = 3.286\)
Therefore, the mode of the data is 3.286.
Marks distribution of 30 students (from Example 1):
| Class interval | 10-25 | 25-40 | 40-55 | 55-70 | 70-85 | 85-100 |
|---|---|---|---|---|---|---|
| Number of students | 2 | 3 | 7 | 6 | 6 | 6 |
Solution:
The maximum number of students (7) have got marks in the interval 40-55, so the modal class is 40-55.
Now:
Using the formula:
\(\text{Mode} = 40 + \left( \frac{7 - 3}{14 - 6 - 3} \right) \times 15 = 52\)
From Example 1, the mean marks is 62.
So, the maximum number of students obtained 52 marks, while on average a student obtained 62 marks.
For grouped data, the median is calculated using the formula:
Where:
Marks distribution of 53 students:
| Marks | Number of students | Cumulative frequency |
|---|---|---|
| 0-10 | 5 | 5 |
| 10-20 | 3 | 8 |
| 20-30 | 4 | 12 |
| 30-40 | 3 | 15 |
| 40-50 | 3 | 18 |
| 50-60 | 4 | 22 |
| 60-70 | 7 | 29 |
| 70-80 | 9 | 38 |
| 80-90 | 7 | 45 |
| 90-100 | 8 | 53 |
Solution:
Here \(n = 53\), so \(\frac{n}{2} = 26.5\)
The class whose cumulative frequency is greater than and nearest to 26.5 is 60-70 (cumulative frequency = 29). So, the median class is 60-70.
Now:
Using the formula:
\(\text{Median} = 60 + \left( \frac{26.5 - 22}{7} \right) \times 10 = 60 + \frac{45}{7} = 66.4\)
So, about half the students have scored marks less than 66.4, and the other half have scored marks more than 66.4.
Heights of 51 girls:
| Height (in cm) | Number of girls |
|---|---|
| Less than 140 | 4 |
| Less than 145 | 11 |
| Less than 150 | 29 |
| Less than 155 | 40 |
| Less than 160 | 46 |
| Less than 165 | 51 |
Solution:
First, we convert this to a frequency distribution:
| Class intervals | Frequency | Cumulative frequency |
|---|---|---|
| Below 140 | 4 | 4 |
| 140-145 | 7 | 11 |
| 145-150 | 18 | 29 |
| 150-155 | 11 | 40 |
| 155-160 | 6 | 46 |
| 160-165 | 5 | 51 |
Here \(n = 51\), so \(\frac{n}{2} = 25.5\)
The median class is 145-150 (cumulative frequency = 29)
Now:
Using the formula:
\(\text{Median} = 145 + \left( \frac{25.5 - 11}{18} \right) \times 5 = 145 + \frac{72.5}{18} = 149.03\)
So, the median height of the girls is 149.03 cm.
Number of plants in 20 houses:
| Number of plants | 0-2 | 2-4 | 4-6 | 6-8 | 8-10 | 10-12 | 12-14 |
|---|---|---|---|---|---|---|---|
| Number of houses | 1 | 2 | 1 | 5 | 6 | 2 | 3 |
Solution:
We'll use the direct method:
| Class Interval | fi | xi | fixi |
|---|---|---|---|
| 0-2 | 1 | 1 | 1 |
| 2-4 | 2 | 3 | 6 |
| 4-6 | 1 | 5 | 5 |
| 6-8 | 5 | 7 | 35 |
| 8-10 | 6 | 9 | 54 |
| 10-12 | 2 | 11 | 22 |
| 12-14 | 3 | 13 | 39 |
| Total | 20 | 162 |
\(\bar{x} = \frac{\sum f_i x_i}{\sum f_i} = \frac{162}{20} = 8.1\)
Therefore, the mean number of plants per house is 8.1.
I used the direct method because the values are small and calculations are straightforward.
Daily wages of 50 workers:
| Daily wages (in ₹) | 500-520 | 520-540 | 540-560 | 560-580 | 580-600 |
|---|---|---|---|---|---|
| Number of workers | 12 | 14 | 8 | 6 | 10 |
Solution:
We'll use the step deviation method with \(a = 550\) and \(h = 20\):
| Class Interval | fi | xi | ui = (xi - 550)/20 | fiui |
|---|---|---|---|---|
| 500-520 | 12 | 510 | -2 | -24 |
| 520-540 | 14 | 530 | -1 | -14 |
| 540-560 | 8 | 550 | 0 | 0 |
| 560-580 | 6 | 570 | 1 | 6 |
| 580-600 | 10 | 590 | 2 | 20 |
| Total | 50 | -12 |
\(\bar{x} = a + \left( \frac{\sum f_i u_i}{\sum f_i} \right) \times h = 550 + \left( \frac{-12}{50} \right) \times 20 = 550 - 4.8 = 545.2\)
Therefore, the mean daily wage is ₹545.20.
Ages of patients admitted in a hospital:
| Age (in years) | 5-15 | 15-25 | 25-35 | 35-45 | 45-55 | 55-65 |
|---|---|---|---|---|---|---|
| Number of patients | 6 | 11 | 21 | 23 | 14 | 5 |
Solution:
Mode: The modal class is 35-45 (highest frequency = 23)
\(l = 35\), \(h = 10\), \(f_1 = 23\), \(f_0 = 21\), \(f_2 = 14\)
\(\text{Mode} = 35 + \left( \frac{23 - 21}{46 - 21 - 14} \right) \times 10 = 35 + \frac{20}{11} = 36.8\)
So, the modal age is 36.8 years.
Mean: Using direct method:
| Class Interval | fi | xi | fixi |
|---|---|---|---|
| 5-15 | 6 | 10 | 60 |
| 15-25 | 11 | 20 | 220 |
| 25-35 | 21 | 30 | 630 |
| 35-45 | 23 | 40 | 920 |
| 45-55 | 14 | 50 | 700 |
| 55-65 | 5 | 60 | 300 |
| Total | 80 | 2830 |
\(\bar{x} = \frac{2830}{80} = 35.375\)
So, the mean age is 35.375 years.
The mode and mean are different because the distribution is not perfectly symmetrical.
Distribution of 100 families by their expenditure:
| Expenditure (in ₹) | 1000-1500 | 1500-2000 | 2000-2500 | 2500-3000 | 3000-3500 | 3500-4000 | 4000-4500 | 4500-5000 |
|---|---|---|---|---|---|---|---|---|
| Number of families | 24 | 40 | 33 | 28 | 30 | 22 | 16 | 7 |
Solution:
First, we calculate cumulative frequencies:
| Class Interval | fi | Cumulative frequency |
|---|---|---|
| 1000-1500 | 24 | 24 |
| 1500-2000 | 40 | 64 |
| 2000-2500 | 33 | 97 |
| 2500-3000 | 28 | 125 |
| 3000-3500 | 30 | 155 |
| 3500-4000 | 22 | 177 |
| 4000-4500 | 16 | 193 |
| 4500-5000 | 7 | 200 |
Here \(n = 200\), so \(\frac{n}{2} = 100\)
The median class is 2000-2500 (cumulative frequency = 97)
Now:
Using the formula:
\(\text{Median} = 2000 + \left( \frac{100 - 64}{33} \right) \times 500 = 2000 + \frac{18000}{33} = 2545.45\)
So, the median expenditure is ₹2545.45.