Open In App

Python | Pandas Index.drop_duplicates()

Pandas Index.drop_duplicates() function return Index with duplicate values removed in Python.

Syntax of Pandas Index.drop_duplicates()

Syntax: Index.drop_duplicates(labels, errors=’raise’) 



Parameters : keep : {‘first’, ‘last’, False}

  • ‘first’ : Drop duplicates except for the first occurrence.(default)
  • ‘last’ : Drop duplicates except for the last occurrence. 
  • False : Drop all duplicates. 

Returns : deduplicated: Index



Examples of Index.drop_duplicates()

The function provides the flexibility to choose which duplicate value to be retained. We can drop all duplicate values from the list or leave the first/last occurrence of the duplicated values.

Example 1: Use Index.drop_duplicates() function to drop all the occurrences of the duplicate value. Let’s drop all occurrences of duplicate values in the Index except the first occurrence. 




# importing pandas as pd
import pandas as pd
 
# Creating the Index
idx = pd.Index([10, 11, 5, 5, 22, 5, 3, 11])
 
# drop all duplicate occurrences of the
# labels and keep the first occurrence
idx.drop_duplicates(keep ='first')
print(idx)

Output:

  

Example 2: Use Index.drop_duplicate() function to drop all duplicate occurrences of the label. Do not keep any duplicated values in the Index. 




# importing pandas as pd
import pandas as pd
 
# Creating the Index
idx = pd.Index([10, 11, 5, 5, 22, 5, 3, 11])
 
# drop all duplicate occurrences of the labels
idx.drop_duplicates(keep=False)
 
# Print the Index
idx

Output:

  


Article Tags :