Normal Approximation to Binomial Distribution
Introduction
In this section, we will discuss the shape of Binomial Distribution, and take a look at how they change as we tweak some of its paramaters, such as the number of trials or the probability of success. We will also talk about the fact that when the number of trials increases, the shape of the Binomial actually starts looking closer and closer to a full Normal Distribution. So, for such situations, we are going to use methods we've learned to calculate Normal Probability to approximate Binomial Probabilities.
The Shape of Binomial Distribution
Suppose we have a binomial random variable with the probability of success is 0.25 (25%).
This is what the distribution looks like when n = 10.
- Each bar represents a potential outcome.
- With 10 trials, the number of successes could range anywhere from 0 to 10 and therefore we have 11 bar here.
- Height of the bars represents the likelihood of these outcomes. For example, the probability of zero successes can be calculated as ten failures = 0.7510 = 0.056, which is the height of this bar.
When n = 20, we see a change in the center of the distribution, which is expected since n × p is now different. The distribution is looking much less skewed.
Increasing the sample size further to 50, the distribution looks even more symetric, and much smoother, and increasing the size even further to 100, the distribution looks no different than the normal distribution.
Normal Approximation to Binomial
How can we tell if the shape of the binomial is going to be unimodal, symetric, and closely follow the normal distribution?
The rule of thumb is the success-failure condition.
Success-failure rule: A binomial distribution meet the following conditions closely following a normal distribution.
- np ≥ 10: at least 10 expected successes;
- n(1-p) ≥ 10: 10 expected failures;
And in cases where it does we can approximate the binomial distribution with the normal, where the parameters of the normal distribution are calculated as the mean and standard deviation of the binomial.
Binomial(n, p) ~ Normal(μ, σ) where μ = np and σ = sqrt(np(1 - p))
In addition, there is a 0.5 adjustment to make the probabilities calculated the exact probabilities from the binomial distribution.
Example
What is the minimum required n for a binomial distribution with p = 0.25 to closely follow a normal distribution?
We know that it must meet the following conditions:
n × 0.25 ≥ 10 and n × (1 - 0.25) ≥ 10
So, for both above, we need to find the maximum of n since that is going to be the minimum requried sample size. Then, we found n ≥ 40, that is we need the sample size n at least 40
References & Resources
- N/A
Latest Post
- Dependency injection
- Directives and Pipes
- Data binding
- HTTP Get vs. Post
- Node.js is everywhere
- MongoDB root user
- Combine JavaScript and CSS
- Inline Small JavaScript and CSS
- Minify JavaScript and CSS
- Defer Parsing of JavaScript
- Prefer Async Script Loading
- Components, Bootstrap and DOM
- What is HEAD in git?
- Show the changes in Git.
- What is AngularJS 2?
- Confidence Interval for a Population Mean
- Accuracy vs. Precision
- Sampling Distribution
- Working with the Normal Distribution
- Standardized score - Z score
- Percentile
- Evaluating the Normal Distribution
- What is Nodejs? Advantages and disadvantage?
- How do I debug Nodejs applications?
- Sync directory search using fs.readdirSync