Mean and Standard deviation
Problems with Solutions

Mean and standard deviation problems along with their solutions at the bottom of the page are presented. Problems related to data sets as well as grouped data are discussed.

Problems

  1. Consider the following three data sets A, B and C.
    A = {9,10,11,7,13}
    B = {10,10,10,10,10}
    C = {1,1,10,19,19}
    a) Calculate the mean of each data set.
    b) Calculate the standard deviation of each data set.
    c) Which set has the largest standard deviation?
    d) Is it possible to answer question c) without calculations of the standard deviation?

  2. A given data set has a mean ? and a standard deviation ?.
    a) What are the new values of the mean and the standard deviation if the same constant k is added to each data value in the given set?Explain.
    b) What are the new values of the mean and the standard deviation if each data value of the set is multiplied by the same constant k?Explain.

  3. If the standard deviation of a given data set is equal to zero, what can we say about the data values included in the given data set?

  4. The frequency table of the monthly salaries of 20 people is shown below.
    salary(in $)frequency
    35005
    40008
    42005
    43002

    a) Calculate the mean of the salaries of the 20 people.
    b) Calculate the standard deviation of the salaries of the 20 people.

  5. The following table shows the grouped data, in classes, for the heights of 50 people.
    height (in cm) - classesfrequency
    ( 120 , 130 ] 2
    ( 130 , 140 ]5
    ( 140 , 150 ]25
    ( 150 , 160 ]10
    ( 160 , 170 ]8

    a) Calculate the mean of the salaries of the 20 people.
    b) Calculate the standard deviation of the salaries of the 20 people.



Solutions

    1. mean of Data set A = (9+10+11+7+13)/5 = 10
      mean of Data set B = (10+10+10+10+10)/5 = 10
      mean of Data set C = (1+1+10+19+19)/5 = 10

    2. Standard Deviation Data set A
      = √[ ( (9-10)2+(10-10)2+(11-10)2+(7-10)2+(13-10)2 )/5 ] = 2
      Standard Deviation Data set B
      = √[ ( (10-10)2+(10-10)2+(10-10)2+(10-10)2+(10-10)2 )/5 ] = 0
      Standard Deviation Data set C
      = √[ ( (1-10)2+(1-10)2+(10-10)2+(19-10)2+(19-10)2 )/5 ] = 8.05
    3. Data set C has the largest standard deviation.
    4. Yes, since data Set C has data values that are further away from the mean compared to sets A and B.


    1. We limit the discussion to a data set with 3 values for simplicity, but the conclusions are true for any data set with quantitative data.
      Let x, y and z be the data values making a data set.
      The mean ? = (x + y + z) / 3
      The standard deviation ? = √[ ((x - ?)2 + (y - ?)2 + (z - ?)2)/3 ]
      We now add a constant k to each data value and calculate the new mean ?'.
      ?' = ((x + k) + (y + k) + (z + k)) / 3 = (x + y + z) / 3 + 3k/3 = ? + k
      We now calculate the new mean standard deviation ?'.
      ?' = √[ ((x + k - ?')2 +(y + k - ?')2+(z + k - ?')2)/3 ]
      Note that x + k - ?' = x + k - ? - k = x - ?
      also y + k - ?' = y + k - ? - k = y - ? and z + k - ?' = z + k - ? - k = z - ?
      Therefore ?' = √[ ((x - ?)2 +(y - ?)2+(z - ?)2)/3 ] = ?
      If we add the same constant k to all data values included in a data set, we obtain a new data set whose mean is the mean of the original data set PLUS k. The standard deviation does not change.
    2. We now multiply all data values by a constant k and calculate the new mean ?' and the new standard deviation ?'.
      ?' = (kx + ky + kz) / 3 = k?
      ?' = √[ ((kx - k?)2 +(ky - k?)2+(kz - k?)2)/3 ] = |k| ?
      If we multiply all data values included in a data set by a constant k, we obtain a new data set whose mean is the mean of the original data set TIMES k and standard deviation is the standard deviation of the original data set TIMES the absolute value of k.


    1. Again, we limit the discussion to a data set with 4 values for simplicity, but the conclusions are true for any data set with quantitative data.
      Let x, y, z and w be the data values making a data set with mean ?.
      The standard deviation ? = √[ ((x - ?)2 + (y - ?)2 + (z - ?)2 + (w - ?)2)/3 ]
      Let ? = 0, hence
      √[ ((x - ?)2 + (y - ?)2 + (z - ?)2 + (w - ?)2)/3 ] = 0
      Which gives
      (x - ?)2 + (y - ?)2 + (z - ?)2 + (w - ?)2 = 0
      All terms in the equation are positive and therefore, the above equation is equivalent to
      (x - ?)2 = 0, (y - ?)2 = 0, (z - ?)2 = 0 and (w - ?)2 = 0.
      Which gives
      x = y = z = w = ? : all data values in the set with ? = 0 are equal.


    1. Let xi be the i th salary and fi be the corresponding frequency.
      mean of grouped data = ? = (?xi*fi) / ?fi
      = (3500*5 + 4000*8 + 4200*5 + 4300*2) /(5 + 8 + 5 + 2)
      = $3955
      b) standard deviation of grouped data = √[ (?(xi-?)2*fi) / ?fi ]
      = √[ (5*(3500-3955)2+8*(4000-3955)2+5*(4200-3955)2+2*(4300-3955)2) /(20) ]
      = 282 (rounded to the nearest unit)


    1. We first find the midpoints of the given classes.
      height (in cm) - classesmidpointfrequency
      ( 120 , 130 ](120+130) ÷ 2 = 1252
      ( 130 , 140 ](130+140) ÷ 2 = 1355
      ( 140 , 150 ](140+150) ÷ 2 = 14525
      ( 150 , 160 ](150+160) ÷ 2 = 15510
      ( 160 , 170 ](160+170) ÷ 2 = 1658

      Let mi be the midpoint of the i th clss and fi be the corresponding frequency.
      mean of grouped data = ? = (?mi*fi) / ?fi
      = (125*2 + 135*5 + 145*25 + 155*10 + 165*8) /(2+5+25+10+8)
      = 148.4
      b) standard deviation of grouped data = √[ (?(mi-?)2*fi) / ?fi ]
      = √[ (2*(125-148.4)2+5*(135-148.4)2+25*(145-148.4)2+10*(155-148.4)2+8*(165-148.4)2) /(50) ]
      = 9.9

More References and links

elementary statistics and probabilities.
Home Page