Absolute and Relative Frequency in Pandas
Last Updated :
20 Apr, 2020
Frequency is the number of occurrences of an outcome in a given sample. It can be expressed in two ways: as an absolute frequency or as a relative frequency.
1. Absolute Frequency:
It is the number of observations in a particular category. It is always a non-negative integer, i.e. it takes discrete values.
Example:
The following data record whether each student in a class passed or failed a Mathematics exam.
P, P, F, P, F, P, P, F, F, P, P, P
where, P = Passed and F = Failed.
Solution:
From the given data we can say that,
There are 8 students who passed the exam
There are 4 students who failed the exam
Implementation in Python:
Let the results of the 12 students be encoded numerically, with Pass (P) as 1 and Fail (F) as 0.
P, P, F, P, F, P, P, F, F, P, P, P
1, 1, 0, 1, 0, 1, 1, 0, 0, 1, 1, 1
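import pandas as pd
# Exam results encoded as 1 = Pass, 0 = Fail
data = [1, 1, 0, 1, 0, 1, 1, 0, 0, 1, 1, 1]
# value_counts() returns the absolute frequency of each distinct value
counts = pd.Series(data).value_counts()
print(counts)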
Output:
1    8
0    4
dtype: int64
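The 1/0 encoding is not strictly required for counting. As a minimal sketch, assuming the results are kept as the original P/F labels (the names `results` and `absolute_freq` are illustrative, not from the article), `value_counts()` can be applied to the labels directly:

```python
import pandas as pd

# Raw results as labels: P = Passed, F = Failed (same data as above)
results = ["P", "P", "F", "P", "F", "P", "P", "F", "F", "P", "P", "P"]

# value_counts() also works on strings, so no numeric encoding is needed
absolute_freq = pd.Series(results).value_counts()
print(absolute_freq)

# Individual counts can be looked up by label
print("Passed:", absolute_freq["P"])
print("Failed:", absolute_freq["F"])
```

Keeping the labels makes the printed result easier to read, while the numeric encoding can be more convenient if the values are later used in calculations.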
2. Relative Frequency:
It is the fraction of observations in the data set that fall into a particular category, i.e. the absolute frequency divided by the total number of observations. It is a value between 0 and 1 and is often expressed as a percentage. Consider the same example of passed and failed students in the Mathematics exam. Then,
relative frequency of passed students = 8 / (8 + 4) ≈ 0.667 = 66.7 %
relative frequency of failed students = 4 / (8 + 4) ≈ 0.333 = 33.3 %
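import pandas as pd
# Exam results encoded as 1 = Pass, 0 = Fail
data = [1, 1, 0, 1, 0, 1, 1, 0, 0, 1, 1, 1]
# Divide each absolute count by the total number of observations
counts = pd.Series(data).value_counts()
print(counts / len(data))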
Output:
1    0.666667
0    0.333333
dtype: float64
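As an alternative to dividing by `len(data)`, pandas can compute the relative frequencies itself through the `normalize=True` argument of `value_counts()`. The sketch below uses that argument and, as an illustration only, collects both kinds of frequency in one table; the column names `absolute`, `relative`, and `percent` are chosen here and are not part of the article.

```python
import pandas as pd

# Exam results encoded as 1 = Pass, 0 = Fail
data = [1, 1, 0, 1, 0, 1, 1, 0, 0, 1, 1, 1]
series = pd.Series(data)

# Absolute frequency: raw counts per category
absolute = series.value_counts()

# Relative frequency: normalize=True returns fractions that sum to 1
relative = series.value_counts(normalize=True)

# Combine both into one summary table, with the percentage rounded to 1 decimal
summary = pd.DataFrame({
    "absolute": absolute,
    "relative": relative,
    "percent": (relative * 100).round(1),
})
print(summary)
```

Using `normalize=True` avoids a separate division step and keeps the denominator tied to the same series that was counted.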