Image Credit: Pixabay.com |
Mean/Average
It is defined as the ratio of sum of all observations to the number of observations.Mean | = | Sum of all observations | |
Number of observations |
͞x
| = |
fi xi
| |
fi |
fi × xi = Sum of all observations
fi = Total number of Observations
↪ In most of real life situations, data is usually so large that to make a meaningful study it needs to be condensed as grouped data.
↪ In Grouped frequency distribution, observations are classified into class intervals of same widths.
↪ By convention, the common observation belongs to the higher class, i.e., 10 belongs to the class interval 10-20 (and not to 0-10).
↪ The number of observations in each class is called Class frequency.
↪ It is assumed that the frequency of each class interval is centered around its mid-point. So the mid-point (or class mark) of each class can be chosen to represent the observations falling in the class.
Class-mark = (Upper limit + Lower limit)/2
Direct Method
The class marks serve as xi’s in this method. For the ith class interval, the frequency fi corresponds to the class mark xi. Now, the mean can be computed in the same manner as in case of ungrouped data.
͞x
| = |
fi xi
| |
fi |
↪ This method gives an approximate mean because of the mid-point assumption.
↪ Remember, when this formula is used
(i) For Ungrouped frequency distribution,
xi = ith observation
fi = frequency of the ith observation.
(ii) For Grouped frequency distribution,
xi = class mark of the ith class interval
fi = frequency of the ith class interval.
Ex - Find the mean for given data
͞x
| = |
fi xi
| |
fi |
Assumed Mean Method
Sometimes when the numerical values of xi (class mark) and fi are large, finding the product of xi and fi becomes tedious and time consuming. We can't change the fi’s, but we can change each xi to a smaller number, so that our calculations become easy. We can achieve this by subtracting a fixed number from each of these xi’s.↪ The first step is to choose one among the xi’s as the assumed mean, and denote it by ‘a’. We may take ‘a’ to be that xi which lies in the center of x1, x2, . . ., xn.
So, in previous example, we can choose a = 47.5 or a = 62.5. Let us choose a = 47.5.
↪ The next step is to find the difference between a and each of the xi’s, that is, the deviation (di) of ‘a’ from each of the xi’s i.e.,
di = xi – a
= xi – 47.5
↪ The third step is to find the product of di with the corresponding fi, and take the sum of all the fidi’s (Σfidi).
↪ Then the mean of the deviations,
͞d
| = |
fi di
| |
fi |
Mean of deviations,
͞d
| = |
fi di
| |
fi |
= |
fi (xi -a)
| ||
fi |
= |
fi xi
|
-
|
fi a
| ||||||||
fi | fi |
͞d
| = |
͞x - a
|
fi
| ||||||||
fi |
͞x
| = |
a +
|
fidi
| ||||||||
fi |
For previous example, we can write mean deviation table as following (a = 47.5)
͞x = 47.5 + 435/30
47.5 + 14.5 = 62
Therefore, the mean of the marks obtained by the students is 62.
Step-deviation method
↪ In previous example, if we find the mean by taking each of xi (i.e., 17.5, 32.5,and so on) as ‘a’, then the mean determined in each case is the same, i.e., 62.So, we can say that the the value of the mean obtained does not depend on the choice of ‘a’.
↪ We can also observe that deviations are common multiples of the class size i.e., the values in Column 4 are all multiples of 15. So, if we divide the values in the entire Column 4 by 15, we would get smaller numbers to multiply with fi. (Here, 15 is the class size of each class interval.)
↪ Let
ui
| = |
xi −a
| |
h |
↪ Then, Mean of reduced deviations,
͞u
| = |
fi ui
| |
fi |
↪ Now, ͞x can be find as following
͞u
| = |
fi ui
| |
fi |
= |
fi (xi -a)/h
| ||
fi |
x̄ = a + hū
x̄ = |
a + h
|
fi ui
| |
fi |
x̄ = 47.5 + 15×29/30
= 47.5 + 14.5 = 62
So, the mean marks obtained by a student is 62.
The method discussed above is called the Step-deviation method.
Note :
↪ the step-deviation method will be convenient to apply if all the di’s have a common factor (=h).
↪ The mean obtained by all the three methods is the same (an approximate mean).
↪ The assumed mean method and step-deviation method are just simplified forms of the direct method. Calculation is simplified by reducing xi .
↪ The choice of method to be used depends on the numerical values of xi and fi. If xi and fi are sufficiently small, then the direct method is an appropriate choice. If xi and fi are numerically large numbers, then we can go for the assumed mean method or step-deviation method. If the class sizes are unequal, and xi are large numerically, we can still apply the step-deviation method by taking h to be a suitable divisor of all the di’s.
↪ The formula x̄ = a + hū still holds if a and h are not as given above (i.e., a = xi & h = class size), but are any non-zero numbers such that ui = (xi − a)/h .
Measure of Central Tendency for Ungrouped Data | |