Open In App

Python | Altering duplicate values from given list

Many times we deal with the lists having identical numbers as a sequence and we wish to just keep the 1st occurrence of element and substituting all the occurrences with the different number. Let’s discuss certain ways in which this can be done. 

Method #1 : Using list comprehension + enumerate() This task can be achieved using the list comprehension for traversal and checking for element occurrence and index checking can be done using enumerate function. 






# Python3 code to demonstrate
# Altering duplicated
# using list comprehension + enumerate()
 
# initializing list
test_list = [2, 2, 3, 3, 3, 3, 4, 4, 5, 5, 5]
 
# printing original list
print("The original list : " + str(test_list))
 
# using list comprehension + enumerate()
# Altering duplicated
res = [False if (ele in test_list[ :idx]) else ele
             for idx, ele in enumerate(test_list)]
 
# print result
print("The altered duplicate list is : " + str(res))

Output : 

The original list : [2, 2, 3, 3, 3, 3, 4, 4, 5, 5, 5] The altered duplicate list is : [2, False, 3, False, False, False, 4, False, 5, False, False]



Time Complexity: O(n*n), where n is the length of the input list. This is because we’re using list comprehension + enumerate() which has a time complexity of O(n*n) in the worst case.
Auxiliary Space: O(n), as we’re using additional space res other than the input list itself with the same size of input list

  Method #2 : Using itertools.groupby() + list comprehension This particular task can also be performed using a combination of above function, using groupby function to get the groups of different elements and list comprehension to alter duplicates. 




# Python3 code to demonstrate
# Altering duplicated
# using itertools.groupby() + list comprehension
from itertools import groupby
 
# initializing list
test_list = [2, 2, 3, 3, 3, 3, 4, 4, 5, 5, 5]
 
# printing original list
print("The original list : " + str(test_list))
 
# using itertools.groupby() + list comprehension
# Altering duplicated
res = [val for key, grp in groupby(test_list)
       for val in [key] + [False] * (len(list(grp))-1)]
 
# print result
print("The altered duplicate list is : " + str(res))

Output : 

The original list : [2, 2, 3, 3, 3, 3, 4, 4, 5, 5, 5] The altered duplicate list is : [2, False, 3, False, False, False, 4, False, 5, False, False]

 Method #3 : Using dictionary

Initialize an empty result list and a dictionary to track seen values. If the value has been seen before, append False to the result list
 




# Initialize the list of values
test_list = [2, 2, 3, 3, 3, 3, 4, 4, 5, 5, 5]
 
# Print the original list
print("Original list:", test_list)
 
# Initialize an empty result list and a dictionary to track seen values
res = []
seen = {}
 
# Iterate through the list of values
for ele in test_list:
    # If the value has not been seen before, append it to the result list
    # and add it to the dictionary
    if ele not in seen:
        res.append(ele)
        seen[ele] = True
    # If the value has been seen before, append False to the result list
    else:
        res.append(False)
 
# Print the altered list
print("Altered list:", res)
#This code is contributed by Edula Vinay Kumar Reddy

Output
Original list: [2, 2, 3, 3, 3, 3, 4, 4, 5, 5, 5]
Altered list: [2, False, 3, False, False, False, 4, False, 5, False, False]

Time complexity: O(n)
Auxiliary space: O(n)

Using numpy:

Algorithm:




import numpy as np
 
# Initializing list
test_list = [2, 2, 3, 3, 3, 3, 4, 4, 5, 5, 5]
 
# Printing original list
print("The original list : " + str(test_list))
 
# Creating numpy array
arr = np.array(test_list)
 
# Getting unique values and their indices
unique, indices = np.unique(arr, return_index=True)
 
# Creating a boolean array with the same size as input list
res = np.zeros_like(arr, dtype=bool)
 
# Setting the first occurrence of each unique value to True
res[indices] = True
res = np.where(res, arr, False)
res = [False if val == 0 else val for val in res] # replace 0 with False
 
# Printing the altered duplicate list
print("The altered duplicate list is : ", res)

Output:

The original list : [2, 2, 3, 3, 3, 3, 4, 4, 5, 5, 5]
The altered duplicate list is : [2, False, 3, False, False, False, 4, False, 5, False, False]

Time complexity: O(nlogn), where n is the length of the input list. This is because np.unique() function has a time complexity of O(nlogn) in the worst case.
Auxiliary space: O(n), as we’re using additional space for the numpy array and the boolean array, both with the same size as the input list.


Article Tags :