Measures of Spread

  • Range: Max - min
  • Variance: roughly the average squared deviation from the mean
    • It is calculated using
      $s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}$
    • Sample variance is denoted $s^2$
    • Population variance is denoted $\sigma^2$
  • Standard Deviation: roughly the average deviation around the mean; it has the same units as the data
    • It is the square root of the variance:
      $s = \sqrt{s^2}$
    • Sample sd is denoted $s$
    • Population sd is denoted $\sigma$
  • Inter-quartile range (IQR): range of the middle 50% of the data, distance between the first quartile (25th percentile) and third quartile (75th percentile)
    • IQR = Q3 - Q1
    • Example:
      [Figure: example of interquartile range]
      The value of the IQR itself (12 in this example) is not very informative on its own, but it is very useful when comparing two distributions. The IQR is a more reliable measure of spread in sample data than the range (the difference between the maximum and the minimum) because it doesn't rely on the end points, which may be unusual observations or potential outliers. Instead, it measures the variability of the bulk of the data around the center.
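
A minimal Python sketch of the measures above, using only the standard-library `statistics` module. The helper name `spread_summary` and both data lists are made up for illustration. Quartile conventions differ between textbooks and tools, so the IQR computed here (with the default method of `statistics.quantiles`) may not match a hand calculation exactly.

```python
import statistics


def spread_summary(data):
    """Return common measures of spread for a list of numbers (illustrative helper)."""
    # quantiles(..., n=4) returns the three quartile cut points: Q1, median, Q3.
    q1, _median, q3 = statistics.quantiles(data, n=4)
    return {
        "range": max(data) - min(data),
        "sample_variance": statistics.variance(data),        # divides by n - 1
        "population_variance": statistics.pvariance(data),   # divides by n
        "sample_sd": statistics.stdev(data),                 # sqrt of sample variance
        "population_sd": statistics.pstdev(data),            # sqrt of population variance
        "iqr": q3 - q1,
    }


# Hypothetical data: the second list replaces the largest value with an outlier.
clean = [2, 4, 4, 4, 5, 5, 7, 9]
with_outlier = [2, 4, 4, 4, 5, 5, 7, 90]

for name, values in [("clean", clean), ("with outlier", with_outlier)]:
    summary = spread_summary(values)
    print(name, "range:", summary["range"], "IQR:", summary["iqr"])
```

For these two lists the range jumps from 7 to 88 once the outlier is introduced, while the IQR stays at 2.5, which is the sense in which the IQR does not depend on the end points.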
