Find study content
Learning Materials

Discover learning materials by subject, university or textbook.

Explanations
All Subjects

Anthropology

Archaeology

Architecture

Art and Design

Bengali

Biology

Business Studies

Chemistry

Chinese

Combined Science

Computer Science

Economics

Engineering

English

English Literature

Environmental Science

French

Geography

German

Greek

History

Hospitality and Tourism

Human Geography

Japanese

Italian

Law

Macroeconomics

Marketing

Math

Media Studies

Medicine

Microeconomics

Music

Nursing

Nutrition and Food Science

Physics

Politics

Polish

Psychology

Religious Studies

Sociology

Spanish

Sports Sciences

Translation
Features
Features

Discover all of these amazing features with a free account.

Flashcards

StudySmarter AI

Notes

Study Plans

Study Sets

Exams
What’s new?

Flashcards
Study your flashcards with three learning modes.

Study Sets
All of your learning materials stored in one place.

Notes
Create and edit notes or documents.

Study Plans
Organise your studies and prepare for exams.
Resources
Discover

All the hacks around your studies and career - in one place.

Find a job

Student Deals

Magazine

Mobile App
Featured

Magazine
Trusted advice for anyone who wants to ace their studies & career.

Job Board
The largest student job board with the most exciting opportunities.

StudySmarter Deals
Verified student deals from top brands.

Our App
Discover our mobile app to take your studies anywhere.

Learning Materials

Features

Discover

Empirical Rule

Q: What is the empirical rule formula?

The empirical rule does not have a formula but it does state that for normally distributed data sets, 68% of observations fall within one standard deviation of the mean, 95% of observations fall within two standard deviations of the mean, and 99.7% of observations fall within three standard deviations of the mean.

Q: What is the empirical rule in simple terms?

In its simplest terms, the empirical rule states that virtually all data in a normally distributed data set falls within three standard deviations of the mean.

Q: What is the empirical rule for 95%?

According to the empirical rule, 95% of all observations in a normally distributed data set fall within two standard deviations of the mean.

Q: What is the empirical rule example?

If the average lifespan of a dog is 12 years (i.e mean) and the standard deviation of the mean is 2 years, and if you want to know the probability of the dog living more than 14 years, you will use the empirical rule.

Suppose that you have a set of data that is approximately normally distributed. Suppose, also, that you know the standard deviation of the data set. Is there much you can discern about the data from this information? Well, as a matter of fact, there is quite a bit, thanks to the empirical rule.

Get started

+ Add tag
Immunology
Cell Biology
Mo

What is StudySmarter?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

How does StudySmarter help me study more efficiently?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

Where can I find more explanations like this?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

What's smart about StudySmarter's flashcards?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

Can I create my own content on StudySmarter?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

How does spaced repetition work in StudySmarter flashcards?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

What can you do with flashcards in StudySmarter?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

Is StudySmarter a science-based learning platform?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

How do StudySmarter's smart learning plans support your exam prep?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

Can you create your own study sets in StudySmarter?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

What is StudySmarter?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

How does StudySmarter help me study more efficiently?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

Where can I find more explanations like this?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

What's smart about StudySmarter's flashcards?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

Can I create my own content on StudySmarter?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

How does spaced repetition work in StudySmarter flashcards?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

What can you do with flashcards in StudySmarter?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

Is StudySmarter a science-based learning platform?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

How do StudySmarter's smart learning plans support your exam prep?

Show Answer

+ Add tag
Immunology
Cell Biology
Mo

Can you create your own study sets in StudySmarter?

Show Answer

Fact Checked Content
Last Updated: 15.01.2023
6 min reading time

Content creation process designed by
Content cross-checked by
Content quality checked by

The empirical rule can be used to judge the likelihood of certain values in a dataset, as well as to check for outliers in your data set and much more. What is the empirical rule, and how does it relate to normal distributions and standard deviations?

Definition of the Empirical Rule

The Empirical rule goes by several names, Sometimes it is called the \(95 \%\) rule, the three-sigma rule, or the \(68\)-\(95\)-\(99.7\) rule.

It is usually called the empirical rule as it is a rule informed by many observations of data sets, not a logical or definitive mathematical proof.

The empirical rule is a statistical rule based on observations that show almost all data in a normal data distribution fall within three standard deviations of the mean.

Where do the other names come from? Well, there's even more that the empirical rule can tell you, and the clues are in the names. It's all about percentages, and standard deviation.

Empirical Rule Percentages

As mentioned previously, one of the names for the empirical rule is the \(68\)-\(95\)-\(99.7\) rule. This name is actually quite telling when we look at the empirical rule in full. It states

For a set of normally distributed data, approximately \(68\%\) of observations fall within one standard deviation of the mean, approximately \(95\%\) of observations fall within two standard deviations of the mean, and approximately \(99.7\%\) of observations fall within three standard deviations of the mean.

\(68\%\), \(95\%\), \(99.7\%\), get it?

If you remember those three percentages, then you can use them to infer all sorts of normally distributed data sets.

But wait a minute, it's also sometimes called the three-sigma rule, why on earth is that?

Well, the symbol for standard deviation is sigma, \(\sigma\). It is sometimes called the three-sigma rule because it states that almost all observations fall within three sigmas of the mean.

It is a standard convention to consider any observations that lie outside of these three sigmas as outliers. This means that they are not typically expected observations, and aren't indicative of the overall trend. In some applications, the bar for what is considered an outlier might be explicitly stated to be something else, but three sigmas is a good rule of thumb.

Let's take a look at what all of this looks like when put into a graph.

Empirical Rule Normal Distribution Graph

Take the following normal distribution with a mean of \(m\) and a standard deviation of \(\sigma\) as an example.

The Empirical Rule Simple normal distribution graph StudySmarter Fig. 1. Normal Distribution Curve.

It's possible to divide it up according to the empirical rule.

The Empirical Rule A normal distribution graph divided up according to the empirical rule StudySmarter Fig. 2. The empirical rule.

This graphical representation really demonstrates the main takeaways we can make of the empirical rule. It's very clear to see that virtually all observations fall within three standard deviations of the mean. There may very occasionally be outliers, but these are exceedingly rare.

The biggest chunk is clearly the middle \(-\sigma\) to \(\sigma\), just as the empirical rule states.

You may be thinking, 'great this rule seems useful, I'm gonna use it all the time!' But beware, and be careful. The empirical rule only holds true for data that is normally distributed.

Empirical Rule Examples

Let's take a look at some examples to see how we can put all of this into practice.

(1) The heights of all the female pupils in a class are measured. The data is found to be approximately normally distributed, with a mean height of \(5ft\,2\) and a standard deviation of \(2\, in\). There are \(12\) female pupils in the class.

(a) Using the empirical rule, roughly how many of the pupils are between \(5ft\,2\) and \(5ft\,4\)?

(b) Using the empirical rule, roughly how many of the pupils are between \(4ft\,8\) and \(5ft\)?

(c) One pupil is a height of \(5ft\,9\), can this pupil be considered an outlier?

Solution:

(a) \(5ft\,4\) is the mean plus one standard deviation. The empirical rule states that \(68\%\) of observations will fall within one standard deviation of the mean. Since the question is only concerned with the upper half of this interval, it will be \(34\%\). Therefore

\[0.34 \cdot 12 = 4.08\]

The number of female pupils in the class with a height between \(5ft\,2\) and \(5ft\,4\) is \(4\).

(b) \(4ft\,8\) is the mean minus two standard deviations, and \(5ft\) is the mean minus one standard deviation. According to the empirical rule, \(95\%\) of observations fall within two standard deviations of the mean, and \(68\%\) of observations fall within one standard deviation of the mean.

Since the question is only concerned with the lower halves of these intervals, they become \(47.5\%\) and \(34\%\) respectively. The interval we are looking for is the difference between these two.

\[47.5\% - 34\% = 13.5\%\]

Therefore

\[0.135 \cdot 12 = 1.62\]

The number of female pupils in the class with a height between \(4ft\,8\) and \(5ft\) is \(1\).

(c) \(5ft\,9\) is over \(3\) standard deviations greater than the mean, therefore this pupil can be considered an outlier.

(2) An ecologist records the population of foxes in a forest every year for ten years. He finds that on average there are \(150\) foxes living in the forest in a given year in that period, with a standard deviation of \(15\) foxes. The data is roughly normally distributed.

(a) According to the empirical rule, what range of population size could be expected over the ten years?

(b) Which of the following would be considered outlying population values?

\[ 100, \space 170, \space 110, \space 132 \]

Answer:

(a) According to the empirical rule, any observation not within three standard deviations of the mean is usually considered an outlier. Therefore our range is

\[ \mu - 3\sigma < P < \mu + 3\sigma\]

\[150 - 3 \cdot 15 < P < 150+ 3 \cdot 15\]

\[150-45 < P < 150+45\]

\[105 < P < 195\]

(b) \(100\) is the only one not within three standard deviations of the mean, therefore it is the only outlier.

Empirical Rule - Key takeaways

The empirical rule states that for normally distributed data sets, \(68\%\) of observations fall within one standard deviation of the mean, \(95\%\) of observations fall within two standard deviations of the mean, and \(99.7\%\) of observations fall within three standard deviations of the mean.
It is also known as the \(68\%\)-\(95\%\)-\(99.7\%\) rule, the three-sigma rule, and the \(95\%\) rule.
Usually, any observation not within three standard deviations of the mean can be considered an outlier.

Already have an account? Log in

Frequently Asked Questions about Empirical Rule

What is the empirical rule formula?

The empirical rule does not have a formula but it does state that for normally distributed data sets, 68% of observations fall within one standard deviation of the mean, 95% of observations fall within two standard deviations of the mean, and 99.7% of observations fall within three standard deviations of the mean.

What is the empirical rule in simple terms?

In its simplest terms, the empirical rule states that virtually all data in a normally distributed data set falls within three standard deviations of the mean.

What is the empirical rule for 95%?

According to the empirical rule, 95% of all observations in a normally distributed data set fall within two standard deviations of the mean.

Why is the Empirical Rule important in statistics?

The empirical rule can be used to judge the likelihood of certain values in a dataset, as well as to check for outliers in your data set.

What is the empirical rule example?

If the average lifespan of a dog is 12 years (i.e mean) and the standard deviation of the mean is 2 years, and if you want to know the probability of the dog living more than 14 years, you will use the empirical rule.

Save Article

How we ensure our content is accurate and trustworthy?

At StudySmarter, we have created a learning platform that serves millions of students. Meet the people who work hard to deliver fact based content as well as making sure it is verified.

Content Creation Process:

Lily Hulatt is a Digital Content Specialist with over three years of experience in content strategy and curriculum design. She gained her PhD in English Literature from Durham University in 2022, taught in Durham University’s English Studies Department, and has contributed to a number of publications. Lily specialises in English Literature, English Language, History, and Philosophy.

Get to know Lily

Content Quality Monitored by:

Gabriel Freitas is an AI Engineer with a solid experience in software development, machine learning algorithms, and generative AI, including large language models’ (LLMs) applications. Graduated in Electrical Engineering at the University of São Paulo, he is currently pursuing an MSc in Computer Engineering at the University of Campinas, specializing in machine learning topics. Gabriel has a strong background in software engineering and has worked on projects involving computer vision, embedded AI, and LLM applications.

Get to know Gabriel

Discover learning materials with the free StudySmarter app

About StudySmarter

StudySmarter is a globally recognized educational technology company, offering a holistic learning platform designed for students of all ages and educational levels. Our platform provides learning support for a wide range of subjects, including STEM, Social Sciences, and Languages and also helps students to successfully master various tests and exams worldwide, such as GCSE, A Level, SAT, ACT, Abitur, and more. We offer an extensive library of learning materials, including interactive flashcards, comprehensive textbook solutions, and detailed explanations. The cutting-edge technology and tools we provide help students create their own learning materials. StudySmarter’s content is not only expert-verified but also regularly updated to ensure accuracy and relevance.

Learn more

StudySmarter Editorial Team

Team Math Teachers