# Mathematics | Hypergeometric Distribution model

Hypergeometric Distribution Model is used for estimating the number of faults initially resident in a program at the beginning of the test or debugging process based on the hypergeometric distribution. Let be the cumulative number of errors already detected so far by , and let be the number of newly detected errors by time .

**Assumptions:**

- A program initially contains m faults when the test phase starts.
- A test is defined as a number of test instances which are couples of input data and output data. In other words, the collection of test operations performed in a day or a week is called a test instance. The test instances are denoted by for i = 1, 2, . . ., n.
- Detected faults are not removed between test instances.

Therefore, from the latter assumption, the same faults can be experienced at several test instances. Let be the number of faults experienced by test instance . It should be noted that some of the faults may be those that are already counted in , and the remaining Wi faults account for the newly detected faults.

If is an observed instance of , then we can see that . Each fault can be classified into one of two categories:

- Newly discovered faults
- Rediscovered faults

If we assume that the number of newly detected faults follows a hypergeometric distribution, then the probability of obtaining exactly newly detected faults among faults is,

where

and

for all i. Since is assumed to be hypergeometrically distributed, the expected number of newly detected faults during the interval is,

and the expected value of is given by,

where

## Recommended Posts:

- Mathematics | Probability Distributions Set 5 (Poisson Distribution)
- Mathematics | Probability Distributions Set 1 (Uniform Distribution)
- Mathematics | Probability Distributions Set 3 (Normal Distribution)
- Mathematics | Probability Distributions Set 2 (Exponential Distribution)
- Mathematics | Probability Distributions Set 4 (Binomial Distribution)
- Software Engineering | Pham-Nordmann-Zhang Model (PNZ model)
- Difference between Waterfall model and Incremental model
- Computer Network | TCP/IP Model
- DBMS | Enhanced ER Model
- Devices used in each layer of TCP/IP model
- Software Engineering | COCOMO Model
- Software Engineering | COCOMO II Model
- Computer Network | Layers of OSI Model
- Database Management System | ER Model
- Software Engineering | SDLC V-Model

If you like GeeksforGeeks and would like to contribute, you can also write an article using contribute.geeksforgeeks.org or mail your article to contribute@geeksforgeeks.org. See your article appearing on the GeeksforGeeks main page and help other Geeks.

Please Improve this article if you find anything incorrect by clicking on the "Improve Article" button below.