Hypergeometric Probability Calculator

Use this tool to calculate the probability of drawing a specific number of successes in a sample without replacement from a finite population.

Calculate Hypergeometric Probability

Total number of items in the population. Must be a positive integer.
Total number of "success" items within the population. Must be a non-negative integer, K ≤ N.
Number of items drawn in the sample. Must be a non-negative integer, n ≤ N.
Desired number of "success" items in the sample. Must be a non-negative integer, k ≤ n and k ≤ K.

Calculation Results

Hypergeometric Probability P(X=k)

0.0000

Intermediate Values:

  • Combinations (K choose k): 0
  • Combinations (N-K choose n-k): 0
  • Total Favorable Combinations: 0
  • Total Possible Combinations (N choose n): 0

The hypergeometric probability calculation determines the likelihood of drawing exactly 'k' successes in a sample of size 'n', from a population of 'N' items containing 'K' successes. All values are unitless counts.

Probability Distribution Chart

This chart displays the probability mass function (PMF) for all possible 'k' values given the current N, K, and n inputs.

Full Probability Distribution Table

Hypergeometric Probability Distribution for Current Inputs
Number of Successes (k) P(X=k) Cumulative P(X≤k)

This table shows the probability for each possible number of successes (k) in the sample, along with the cumulative probability.

A) What is Hypergeometric Probability?

Hypergeometric probability is a fundamental concept in statistics that helps us calculate the likelihood of selecting a specific number of "successes" when drawing items from a finite population without replacement. Unlike the binomial distribution, where each draw is independent and returns the item to the population, the hypergeometric distribution accounts for the fact that each item drawn changes the composition of the remaining population.

This makes it particularly useful in scenarios where the population size is small or when sampling significantly impacts the remaining pool of items. For example, if you're drawing cards from a deck, once a card is drawn, it's gone, and the probabilities for subsequent draws change. This is a classic probability calculation scenario for the hypergeometric distribution.

Who Should Use This Calculator?

Common Misunderstandings

A frequent point of confusion is distinguishing hypergeometric probability from binomial probability. The key difference lies in the sampling method:

Always remember that if the population is finite and items are not replaced, hypergeometric distribution is the appropriate model.

B) Hypergeometric Probability Formula and Explanation

The formula for calculating hypergeometric probability, denoted as P(X=k), is as follows:

P(X=k) = [ (K C k) * ((N-K) C (n-k)) ] / (N C n)

Where 'C' represents the combinations function (read as "N choose K"), calculated as C(a, b) = a! / (b! * (a-b)!). Let's break down each variable:

Variable Meaning Unit Typical Range
N Population Size Unitless (count) Any positive integer (e.g., 10 to 1,000,000)
K Number of Successes in Population Unitless (count) 0 to N
n Sample Size Unitless (count) 0 to N
k Number of Successes in Sample Unitless (count) max(0, n+K-N) to min(n, K)
P(X=k) Hypergeometric Probability Unitless (probability) 0 to 1

Explanation of the Formula Components:

The ratio of "favorable combinations" to "total possible combinations" yields the hypergeometric probability P(X=k).

C) Practical Examples of Hypergeometric Probability

Let's illustrate hypergeometric probability with a couple of real-world scenarios:

Example 1: Defective Products in a Batch

An electronics company manufactures a batch of 100 circuit boards. Due to a known issue, 10 of these boards are defective. A quality control inspector randomly selects a sample of 15 boards for testing. What is the probability that exactly 2 of the selected boards are defective?

  • Inputs:
    • Population Size (N) = 100 (total circuit boards)
    • Number of Successes in Population (K) = 10 (defective boards)
    • Sample Size (n) = 15 (boards selected for testing)
    • Number of Successes in Sample (k) = 2 (desired defective boards in sample)
  • Units: All inputs are unitless counts.
  • Calculation:
    • (10 C 2) = 45
    • (90 C 13) = 1,460,049,600
    • (100 C 15) = 25,333,847,131,500
    • P(X=2) = (45 * 1,460,049,600) / 25,333,847,131,500 ≈ 0.00259
  • Result: The probability of finding exactly 2 defective boards in the sample is approximately 0.00259 (or 0.259%).

Example 2: Drawing Cards from a Deck

Imagine a standard deck of 52 playing cards. If you draw 5 cards randomly without replacement, what is the probability that exactly 3 of them are hearts?

  • Inputs:
    • Population Size (N) = 52 (total cards in a deck)
    • Number of Successes in Population (K) = 13 (total hearts in a deck)
    • Sample Size (n) = 5 (cards drawn)
    • Number of Successes in Sample (k) = 3 (desired hearts in the hand)
  • Units: All inputs are unitless counts.
  • Calculation:
    • (13 C 3) = 286
    • (39 C 2) = 741 (number of ways to choose 2 non-hearts from 39)
    • (52 C 5) = 2,598,960
    • P(X=3) = (286 * 741) / 2,598,960 ≈ 0.0815
  • Result: The probability of drawing exactly 3 hearts in a 5-card hand is approximately 0.0815 (or 8.15%).

D) How to Use This Hypergeometric Probability Calculator

Our online hypergeometric probability calculator is designed for ease of use. Follow these simple steps to get your results:

  1. Enter Population Size (N): Input the total number of items in your population. This must be a positive integer.
  2. Enter Number of Successes in Population (K): Input the total count of "success" items within your population. This must be a non-negative integer and less than or equal to N.
  3. Enter Sample Size (n): Input the number of items you are drawing in your sample. This must be a non-negative integer and less than or equal to N.
  4. Enter Number of Successes in Sample (k): Input the specific number of "success" items you wish to find in your sample. This must be a non-negative integer, less than or equal to n, and less than or equal to K.
  5. Click "Calculate Probability": The calculator will instantly process your inputs and display the hypergeometric probability P(X=k).
  6. Interpret Results: The primary result shows the exact probability. Intermediate values are also provided for transparency. The chart and table visually represent the full probability distribution for all possible 'k' values given your N, K, and n inputs.
  7. Reset for New Calculations: Use the "Reset" button to clear all fields and start a new calculation with default values.

Unit Handling: For hypergeometric probability, all inputs (N, K, n, k) represent counts and are therefore unitless. The resulting probability P(X=k) is also unitless, expressed as a value between 0 and 1.

E) Key Factors That Affect Hypergeometric Probability

The probability calculated by the hypergeometric distribution is sensitive to changes in any of its four input parameters. Understanding how each factor influences the outcome is crucial for accurate interpretation.

F) Hypergeometric Probability FAQ

Q: What is the main difference between hypergeometric and binomial distribution?

A: The main difference is the sampling method. Hypergeometric distribution applies when sampling is done without replacement from a finite population, meaning each item drawn changes the probabilities for subsequent draws. Binomial distribution applies when sampling is done with replacement (or from an infinite population), meaning each trial is independent.

Q: Are there any units involved in hypergeometric probability calculations?

A: No, all inputs (Population Size N, Successes in Population K, Sample Size n, Successes in Sample k) are unitless counts. The output probability P(X=k) is also a unitless value between 0 and 1.

Q: What are the valid ranges for the input variables?

A:

  • N (Population Size): Must be a positive integer.
  • K (Successes in Population): Must be a non-negative integer, K ≤ N.
  • n (Sample Size): Must be a non-negative integer, n ≤ N.
  • k (Successes in Sample): Must be a non-negative integer, k ≤ n AND k ≤ K. Also, k must be at least max(0, n + K - N).

Q: When would hypergeometric probability be 0?

A: P(X=k) would be 0 if it's impossible to draw 'k' successes. This happens if:

  • You try to draw more successes than available in the population (k > K).
  • You try to draw more successes than your sample size allows (k > n).
  • You try to draw fewer successes than implied by drawing all non-successes (k < n + K - N).

Q: Can I use this calculator for cumulative probabilities (e.g., P(X ≤ k))?

A: This calculator directly computes P(X=k). To find cumulative probabilities like P(X ≤ k), you would need to sum the individual probabilities P(X=i) for all valid 'i' from 0 up to 'k'. Our probability distribution table provides cumulative probabilities to assist with this.

Q: What is the expected value of a hypergeometric distribution?

A: The expected number of successes in a hypergeometric distribution is E(X) = n * (K / N). This represents the average number of successes you would expect to find in your sample over many repeated trials.

Q: How does the finite population correction factor relate to this?

A: The hypergeometric distribution inherently accounts for the "finite population correction" because it models sampling without replacement from a finite pool. In other contexts, a correction factor is sometimes applied to adjust formulas designed for infinite populations when used with finite ones.

Q: What are some real-world applications of hypergeometric probability?

A: Beyond quality control and card games, it's used in genetic analysis (e.g., probability of inheriting certain alleles), ecological studies (e.g., capture-recapture methods for population estimation), and even lottery odds calculations.

G) Related Tools and Internal Resources

Explore other statistical and mathematical tools on our site to deepen your understanding:

🔗 Related Calculators