|
|
Variance for Binomial Distribution

How many times has it happened to you that no matter how hard you study, the questions on the exam are the ones you didn't get to study?

Mockup Schule

Explore our app and discover over 50 million learning materials for free.

Variance for Binomial Distribution

Illustration

Lerne mit deinen Freunden und bleibe auf dem richtigen Kurs mit deinen persönlichen Lernstatistiken

Jetzt kostenlos anmelden

Nie wieder prokastinieren mit unseren Lernerinnerungen.

Jetzt kostenlos anmelden
Illustration

How many times has it happened to you that no matter how hard you study, the questions on the exam are the ones you didn't get to study?

Suppose your teacher provided a list of \(300\) exercises in preparation for the final exam. The teacher assures you that the exam will have \(10\) questions, and they will be taken from the list provided.

Although you prepared well in advance, you only managed to solve \(200\) exercises. What the probability is that the teacher will choose \(10\) questions that you have solved?

This type of question can be answered using the binomial distribution, and in this article you will learn more about it.

What is a binomial distribution?

A binomial distribution is a discrete probability distribution used to calculate the probability of observing a certain number of successes in a finite number of Bernoulli trials. A Bernoulli trial is a random experiment where you can only have two possible outcomes that are mutually exclusive, one of which is called success and the other failure.

If \(X\) is a binomial random variable with \(X\sim \text{B}(n,p)\), then the probability of getting exactly \(x\) successes in \(n\) independent Bernoulli trials is given by the probability mass function:

\[P(X=x)={n\choose{x}}p^x(1-p)^{n-x}\]

for \(x=0,1,2,\dots , n\), where

\[\displaystyle {n\choose{x}}=\frac{n!}{x!(n-x)!}\]

are known as the binomial coefficient.

Visit our article Binomial Distribution for more details about this distribution.

Let's look at an example to see how to calculate the probabilities in a binomial distribution.

Suppose you are going to take a multiple choice test with \(10\) questions, where each question has \(5\) possible answers, but only \(1\) option is correct. If you had to guess randomly on each question.

a) What is the probability that you would guess exactly \(4\) correct?

b) What is the probability that you would guess \(2\) or less correctly?

c) What is the probability that you would guess \(8\) or more correctly?

Solution:First, let's note that there are \(10\) questions, so \(n=10\). Now, since each question has \(5\) choices and only \(1\) is correct, the probability of getting the correct one is \(\dfrac{1}{5}\), so \(p=\dfrac{1}{5}\). Therefore,

\[1-p=1-\dfrac{1}{5}=\frac{4}{5} .\]

a) The probability of getting exactly \(4\) correct is given by

\[\begin{align} P(X=4)&={10\choose{4}}\left(\frac{1}{5}\right)^4\left(\frac{4}{5}\right)^{6} \\ &\approx 0.088. \end{align}\]

b) The probability of getting \(2\) or less correct is given by

\[\begin{align} P(X\leq 2)&=P(X=0)+P(X=1)+P(X=2) \\ &= {10\choose{0}} \left(\frac{1}{5}\right)^0\left(\frac{4}{5}\right)^{10}+{10\choose{1}}\left(\frac{1}{5}\right)^1\left(\frac{4}{5}\right)^{9}\\ &\quad +{10\choose{2}}\left(\frac{1}{5}\right)^2\left(\frac{4}{5}\right)^{8} \\ &\approx 0.678.\end{align}\]

c) The probability of getting \(8\) or more correct is given by \[\begin{align} P(X\geq 8)&=P(X=8)+P(X=9)+P(X=10) \\ &= {10\choose{8}} \left(\frac{1}{5}\right)^8\left(\frac{4}{5}\right)^{2}+{10\choose{9}}\left(\frac{1}{5}\right)^9\left(\frac{4}{5}\right)^{1} \\ & \quad+{10\choose{10}}\left(\frac{1}{5}\right)^{10}\left(\frac{4}{5}\right)^{0} \\ &\approx 0.00008.\end{align}\]

In other words, guessing the answers is a very bad test strategy if that is all you are going to do!

Derivation of mean and variance of binomial distribution

Note that a binomial variable \(X\) is the sum of \(n\) independent Bernoulli trials with the same probability of success \(p\), that means \(X=X_1+X_2+\ldots+X_n\), where each \(X_i\) is a Bernoulli variable. Using this, let's see how to derive the formulas for the mean and variance.

Derivation of mean of binomial distribution

To calculate the expected value of \(X\), from the above you have

\[\text{E}(X)=\text{E}(X_1+X_2+\ldots+X_n),\]

as the expected value is linear

\[\text{E}(X_1+X_2+\ldots+X_n)=\text{E}(X_1)+\text{E}(X_2)+\ldots+\text{E}(X_n).\]

Finally, recall that for a Bernoulli variable \(Y\) with probability of success \(q\), the expected value is \(q\). Thus,

\[\text{E}(X_1)+\text{E}(X_2)+\ldots+\text{E}(X_n)=\underbrace{p+p+\ldots+p}_{n\text{ times}}=np.\]

Putting everything together, you have the previously mentioned formula

\[\text{E}(X)=np.\]

Derivation of variance of binomial distribution

To calculate the variance of \(X\), you have

\[\text{Var}(X)=\text{Var}(X_1+X_2+\ldots+X_n),\]

using that the variance is additive for independent variables

\[\begin{align} \text{Var}(X_1+X_2+\ldots+X_n)&=\text{Var}(X_1)+\text{Var}(X_2) \\ &\quad +\ldots+\text{Var}(X_n). \end{align}\]

Again, recall that for a Bernoulli variable \(Y\), with probability of success \(q\), the variance is \(q(1-q)\). Then,

\[\begin{align} \text{Var}(X) &= \text{Var}(X_1)+\text{Var}(X_2)+\ldots+\text{Var}(X_n)\\ &= \underbrace{p(1-p)+p(1-p)+\ldots+p(1-p)}_{n\text{ times}} \\ & =np(1-p).\end{align}\]

Putting it all together,

\[\text{Var}(X)=np(1-p).\]

Mean and standard deviation for a binomial distribution

In the previous section you saw that the mean of the binomial distribution is

\[\text{E}(X)=np,\]

and the variance is

\[\text{Var}(X)=np(1-p).\]

To obtain the standard deviation, \(\sigma\), of the binomial distribution, just take the square root of the variance, so

\[\sigma = \sqrt{np(1-p) }.\]

Formula for mean of binomial distribution

The mean of a variable is the average value expected to be observed when an experiment is performed multiple times.

If \(X\) is a binomial random variable with \(X\sim \text{B}(n,p)\), then the expected value or mean of \(X\) is given by \[\text{E}(X)=\mu=np.\]

Formula for variance of a binomial distribution

The variance of a variable is a measure of how different the values are from the mean.

If \(X\) is a binomial random variable with \(X\sim \text{B}(n,p)\), then:

  • The variance of \(X\) is given by \[\text{Var}(X)=\sigma^2=np(1-p).\]

  • The standard deviation of \(X\) is the square root of the variance and is given by \[\sigma=\sqrt{np(1-p)}.\]

For a more detailed explanation of these concepts, please review our article Mean and Variance of Discrete Probability Distributions.

Examples of mean and variance of binomial distribution

Let's look at some examples, starting with a classic one.

Let \(X\) be a random variable such that \(X\sim \text{B}(10,0.3)\). Find the mean \(\text{E}(X)\) and the variance \(\text{Var}(X)\).

Solution:

Using the formula for the mean, you have

\[\text{E}(X)=np=(10)(0.3)=3.\]

For the variance you have

\[\text{Var}(X)=np(1-p) =(10)(0.3)(0.7)=2.1.\]

Let's take another example.

Let \(X\) be a random variable such that \(X\sim \text{B}(12,p)\) and \(\text{Var}(X)=2.88\). Find the two possible values of \(p\).

Solution:

From the variance formula, you have

\[\text{Var}(X)=np(1-p)=2.88.\]Since you know \(n=12\), substituting it in the above equation gives

\[12p(1-p)=2.88,\]

which is the same as

\[p(1-p)=0.24\]

or

\[p^2-p+0.24=0.\]

Note that you now have a quadratic equation, so using the quadratic formula you get that the solutions are \(p=0.4\) and \(p=0.6\).

The previous example shows that you can have two different binomial distributions with the same variance!

Finally, note that by using the mean and variance of a variable, you can recover its distribution.

Let \(X\) be a random variable such that \(X\sim \text{B}(n,p)\), with \(\text{E}(X)=3.6\) and \(\text{Var}(X)=2.88\).

Find the values of \(n\) and \(p\).

Solution:

Recall that by the formulas of the mean and variance

\[\text{E}(X)=np=3.6\]

and

\[\text{Var}(X)=np(1-p)=2.88.\]

From here, substituting you have

\[3.6(1-p)=2.88,\]

which implies that

\[1-p=\frac{2.88}{3.6}=0.8.\]

Therefore, \(p=0.2\) and again, from the formula of the mean, you have

\[n=\frac{3.6}{0.2}=18.\]

So the original distribution is \(X\sim \text{B}(18,0.8)\).

Mean and Variance of Binomial Distribution - Key takeaways

  • If \(X\) is a binomial random variable with \(X\sim \text{B}(n,p)\). Then, \[P(X=x)={n\choose{x}}p^x(1-p)^{n-x}\]for \(x=0,1,2,\dots,n\) where \[\displaystyle {n\choose{x}}=\frac{n!}{x!(n-x)!}\]

  • If \(X\sim \text{B}(n,p)\), then the expected value or mean of \(X\) is \(\text{E}(X)=\mu=np\).

  • If \(X\sim \text{B}(n,p)\), then the variance is \(\text{Var}(X)=\sigma^2=np(1-p) \) and the standard deviation is \(\sigma=\sqrt{np(1-p)}\).

Frequently Asked Questions about Variance for Binomial Distribution

The mean of a variable is the average value expected to be observed when an experiment is performed multiple times. In a binomial distribution, the mean is equal to np.

If X is a binomial random variable such that X~B(n,p). Then, the mean is given by E(X)=np, and the variance is given by Var(X)=np(1-p).

No, they can't be equal. Since the mean is given by np and the variance by np(1-p), then for np to be equal to np(1-p), necessarily 1-p=1, which means that p=0. This means that the experiment only fails and therefore does not follow a binomial distribution.

The variance of a variable is a measure of how different the values are from the mean. In a binomial distribution, the mean is equal to np(1-p).

If X is a binomial variable, i.e., X~B(n,p), then the mean is E(X)=np and the variance is Var(X)=np(1-p), so they are related by Var(X)=(1-p)E(X).
If Y is a Poisson variable, i.e, Y~Poi(λ), then the mean is E(Y)=λ and the variance is Var(Y)=λ, so the mean and the variance are the same.

Test your knowledge with multiple choice flashcards

If \(X\) is a random variable with \(X\sim \text{B}(n,p)\). What is the value of the mean?

If \(X\) is a random variable with \(X\sim \text{B}(n,p)\). What is the value of the variance?

If \(X\) is a random variable with \(X\sim \text{B}(n,p)\). What is the value of the standard deviation?

Next
More about Variance for Binomial Distribution

Join over 22 million students in learning with our StudySmarter App

The first learning app that truly has everything you need to ace your exams in one place

  • Flashcards & Quizzes
  • AI Study Assistant
  • Study Planner
  • Mock-Exams
  • Smart Note-Taking
Join over 22 million students in learning with our StudySmarter App Join over 22 million students in learning with our StudySmarter App

Sign up to highlight and take notes. It’s 100% free.

Entdecke Lernmaterial in der StudySmarter-App

Google Popup

Join over 22 million students in learning with our StudySmarter App

Join over 22 million students in learning with our StudySmarter App

The first learning app that truly has everything you need to ace your exams in one place

  • Flashcards & Quizzes
  • AI Study Assistant
  • Study Planner
  • Mock-Exams
  • Smart Note-Taking
Join over 22 million students in learning with our StudySmarter App