binomial distribution - Everything2.com

by mandrax

Sun May 13 2001 at 19:30:27

A statistical function to calculate the probability of getting n successes out of N attempts in a mutually exclusive, dichotomous, independent, random function with the probability of p.

Well, that's a load of gibberish, isn't it? Allow me to explain:

Dichotomous means that it has two possible outcomes, usually labelled success and failure.
Mutually exclusive means that in every case, only one of these two outcomes may occur.
Independent means that the result of one case is in no way dependent on the result in any other case in that series.
Random should be pretty self evident.

This can be described mathematically by the following function:


           N!
P(n) = ---------- pⁿ(1-p)^N-n
       n!(N - n)!

      |_________|
           |
           |
           |
           |
  ------------------------------------------------
  | Side note: This is the binomial coefficient  |
  | of N and n, or "N choose n"                  |
  ------------------------------------------------

P(n) is the probability of achieving exactly n successes. p is the probability of success in one single case. N is the number of trials.

An example: If I flip a coin seven times, what is the probability that it will land heads up twice? This gives us the following values:
N = 7
n = 2
p = 0.5 (it's a 50/50 chance of heads/tails)
Insert these values into the function:

           7!
P(2) = ---------- 0.5²(1-0.5)^7-2 = 0.1640625
       2!(7 - 2)!

The probability is approximately 16%.

This function is often used to calculate the probability to achieve at least m successes in N tries. This can be done in the following way:

 N
---
\   P(n_i)
/
---
i=m

In the previous example, if I had wanted to calculate the probability to achieve at least 5 heads up, here's the formula:

 7
---
\   P(n_i) = P(5) + P(6) + P(7)
/
---
i=5

This will cause a distribution not entirely unlike the normal distribution (the gaussian distribution).

I like it!

2 C!s

(idea)

by inciteful

Thu Mar 14 2002 at 3:15:18

The similarity of the binomial and normal distributions is very important. Calculating the chances that a binomial random variable, with large N, will be less that something around, say, N/2 requires a lot of calculation.

The normal distribution can be used as a much easier-to-calculate approximation. Set up the variable with mu equaling n*p and sigma-squared equaling n*p*(1-p) and you have an easier way to find your answer. This is considered to be a very good approximation as long as sigma-squared is greater than 10.

I like it!

(thing)

by BrianShader

Sun Feb 16 2003 at 9:24:40

I feel the above formula needs a little more explanation. Say we have a random variable X, where X~Bin(N,p). This means we are counting the number of successes in N trials, where each trial has a probability p of success. We also define q as 1-p, in other words the probability of failure. We are interested in the probability that X takes the particular value x. To calculate this, we use:

P(X = x) = ^NC_n x pⁿ x q^N-n

You see we have three terms here. First, the term pⁿ is the probability of n success. Simple enough. Now if we have n successes, we must have N - n failures to make up the N trials. So we have the term q^N-n for the probability of N-n failures. We multiply these two probabilities together because we need both of the events to occur.

Now the term ^NC_n. Perhaps you recognise the C as the Choose function, but perhaps not. Put simply, ^aC_b is the number of ways of choosing b objects from a, where the order in which you pick doesn't matter. Remember, when we have n successes and N - n failures, they can happen in any order; we don't mind. Therefore the Choose term effectively represents the number of different orders in which the successes and failures can occur.

Note that we could equally have chosen the number of failures, and put ^NC_{N - n} at the front. However, this is exactly the same! By choosing N - n failures, you automatically choose n successes. You can see this in symmetry of Pascal's Triangle, but that's another story. Conventionally, we use the one that is shorter to write. Mathematicians like brevity.

I like it!

Poisson distribution	Dichotomous	normal distribution	Gaussian Distribution
Central Limit Theorem	binomial coefficient	statistics	probability distribution
probability	Gorgeous George	Chernoff bounds	binostat
Binomial Theorem	Neyman-Pearson Lemma	Lemma	mutually exclusive
monopoly	hypergeometric distribution	hypothesis test	Everything Statistics - September 29, 2001 (3)
negative binomial distribution	Events leading to the Massacre at Wounded Knee	continuity correction	Normal Curve