Title Image: Normal Distribution

Normal Distribution and the Empirical Rule

Yashmeet Singh · · 3 minute read

Normal Distribution

We covered Data Distributions and Density Curves in recent posts (see here and here).

Now we turn our attention to Normal Distribution - a particular type of distribution used widely in Statistics and Machine Learning.

You can recognize normal distribution by some of its signature features. Let’s explore these features using an example.

The weight of newborn babies is normally distributed. Suppose the mean weight is 7.75 pounds with a standard deviation of 0.85. Then the probability density curve for newborn weights will look like this:

Normal Distribution for Weights of Newborn Babies

The x-axis has the newborn weights in increasing order. The curve’s height shows the probability (on the y-axis) corresponding to the weight.

Since newborn weight follows a normal distribution, this curve will exhibit some unique properties:

  • Mean, median, and mode are the same value and lie at the center (red line).
  • Symmetrical: If you split the curve by the mean, the left and the right sides are mirror images of each other.
  • Bell Shaped: The curve has a peak at the mean. The probability drops off as you move away from the mean in either direction.
  • The curve never touches the x-axis. No matter how far from the center, every value has a probability greater than 0.
  • The total area under the curve equals 1. That’s because it represents all possible weight measurements and their probabilities.

Empirical Rule

You’ll notice that the majority of the values in FIG 1 are clustered around the mean.

When a variable follows a normal distribution, almost all of its values occur within three standard deviations from the mean.

To be more precise, 68% of the values fall within one standard deviation, 95% within two, and 99.7% within three standard deviations from the mean.

This is known as the Empirical Rule or 68-95-99.7 Rule.

Let’s apply it to the newborn weights.

The weight 6.90 pounds is one standard deviation below the mean. And 8.60 pounds is one standard deviation above the mean.

Thus, the empirical rule dictates that 68% of newborn babies will weigh between 6.90 and 8.60 pounds. Below figure shows this information graphically:

Normal Distribution and Empirical Rule - graph showing 68% observations within one standard deviation of the mean


Similarly, 95% of the babies will weigh between 6.05 and 9.45 pounds (two standard deviations from the mean):

Normal Distribution and Empirical Rule - graph showing 95% observations within two standard deviations of the mean


Finally, the weights of almost all (99.7%) newborns will fall between 5.20 and 10.30 pounds (three standard deviations from the mean):

Normal Distribution and Empirical Rule - graph showing 99.7% observations within three standard deviations of the mean


Using the Empirical Rule

Let’s say you want to know:

How many newborns weigh between 7.75 pounds (mean) and 8.60 pounds (one standard deviation above the mean)?

We know from FIG 2 that 68% of newborns weigh between one standard deviation below and above the mean. And the weight range in the question above is the right half of the shaded area in FIG 2.

Since the normal distribution is symmetrical about the mean, the left and right half will have equal area and thus equal percentages.

Thus, 34% of the newborns will weigh between 7.75 pounds and 8.60 pounds, as the below figure shows (in the darker blue shade):

Normal Distribution and Empirical Rule - split by the mean and percentage halved

You can use the empirical rule and symmetry property to solve a variety of probability problems.

For example, you could calculate probabilities for intervals of one standard deviation as you move away from the mean. The below plot shows the results:

Normal Distribution: Percentages for intervals of one standard deviation

As expected, the probability drops dramatically as the distance from the mean increases.

Summary

This post covered the basics of Normal Distribution. It also introduced Empirical Rule and showed how you could use it.

Check out the next post, where we’ll use Python and SciPy to generate and plot Normal Distributions.