What do you understand by dispersion ? Discuss the methods of measuring dispersion.

Website can be closed on 12th to 14th Jan 2025 due to server maintainance work.


Dispersion refers to the extent to which data points in a dataset vary or spread out from the central value (mean, median, or mode). Understanding dispersion is crucial because it provides insights into the variability of data, indicating how much the data points differ from each other and from the central tendency. A low dispersion indicates that data points are close to each other, while a high dispersion suggests a wide range of values.


  1. Understanding Variability: It helps to understand the degree of variation in the data.
  2. Comparing Datasets: Different datasets can be compared based on their variability, which is vital in fields like finance, quality control, and research.
  3. Statistical Inference: Dispersion measures are critical in making inferences about populations from sample data.
  4. Decision Making: It aids in risk assessment and decision-making by understanding the potential variability in data.

There are several methods to measure dispersion, including the following:

    • Definition: The range is the difference between the highest and lowest values in a dataset.
    • Formula:
      Range = Maximum Value – Minimum Value
    • Example:
      Dataset: 10, 15, 20, 25, 30
      Range = 30 – 10 = 20
    • Advantages:
      • Easy to calculate and understand.
    • Disadvantages:
      • Sensitive to extreme values (outliers), which can skew the range.

      • Definition: Variance measures the average of the squared differences from the mean. It quantifies how much the data points deviate from the mean.
      • Formula: For a population: σ² = Σ(X – μ)² / N For a sample: s² = Σ(X – X̄)² / (n – 1) Where:
        • σ² = population variance
        • s² = sample variance
        • X = individual data point
        • μ = population mean
        • X̄ = sample mean
        • N = population size
        • n = sample size
      • Example:
        Dataset: 10, 15, 20
        Mean = (10 + 15 + 20) / 3 = 15
        Variance = [(10 – 15)² + (15 – 15)² + (20 – 15)²] / 3
        = [25 + 0 + 25] / 3 = 16.67
      • Advantages:
        • Considers all data points and their deviations.
      • Disadvantages:
        • Not in the same units as the data, making interpretation difficult.

        • Definition: Standard deviation is the square root of the variance and provides a measure of dispersion in the same units as the data.
        • Formula:
          For a population:
          σ = √(σ²)
          For a sample:
          s = √(s²)
        • Example:
          Continuing with the previous example,
          Standard Deviation = √16.67 ≈ 4.08
        • Advantages:
          • Easier to interpret as it is in the same unit as the original data.
        • Disadvantages:
          • Still sensitive to outliers.

          • Definition: The interquartile range measures the range of the middle 50% of data, which is less affected by outliers.
          • Formula: IQR = Q₃ – Q₁ Where:
            • Q₁ = First quartile (25th percentile)
            • Q₃ = Third quartile (75th percentile)
          • Example:
            Dataset: 10, 15, 20, 25, 30
            Q₁ = 15 and Q₃ = 25
            IQR = 25 – 15 = 10
          • Advantages:
            • Robust against outliers and provides a better measure of spread for skewed data.
          • Disadvantages:
            • Does not consider the full range of data.

            • Definition: The mean absolute deviation measures the average of the absolute differences between each data point and the mean.
            • Formula:
              MAD = Σ|X – X̄| / n
            • Example:
              Dataset: 10, 15, 20
              Mean = 15
              MAD = (|10 – 15| + |15 – 15| + |20 – 15|) / 3
              = (5 + 0 + 5) / 3 = 3.33
            • Advantages:
              • Provides a clearer understanding of dispersion in the same unit as the data.
            • Disadvantages:
              • Less commonly used than standard deviation and variance.

            MeasureDescriptionSensitivity to OutliersUsefulness
            RangeDifference between highest and lowest valuesHighQuick summary of spread
            VarianceAverage of squared deviations from the meanHighUnderstanding data variability
            Standard DeviationSquare root of varianceHighInterpretable measure of variability
            Interquartile Range (IQR)Range of the middle 50% of dataLowRobust measure for skewed distributions
            Mean Absolute Deviation (MAD)Average of absolute differences from the meanModerateIntuitive measure of average deviation

            Dispersion is a critical concept in statistics that helps to understand the variability of data. By measuring dispersion through various methods, such as range, variance, standard deviation, interquartile range, and mean absolute deviation, statisticians can gain insights into the distribution and spread of data points. Each method has its strengths and weaknesses, and the choice of which measure to use depends on the nature of the dataset and the specific analysis requirements.


            1. Gupta, S. C., & Kapoor, V. K. (2014). Fundamentals of Mathematical Statistics. Sultan Chand & Sons.
            2. Spiegel, M. R., & Stephens, L. J. (2018). Statistics. McGraw-Hill Education.
            3. Wallis, W. A., & Roberts, R. C. (2020). Statistical Analysis and Data Display. Springer.

            Leave a Comment

            Your email address will not be published. Required fields are marked *

            Scroll to Top