Let us see how to shuffle the rows of a DataFrame. We will be using the sample() method of the pandas module to randomly shuffle DataFrame rows in Pandas. Algorithm :
- Import the pandas and numpy modules.
- Create a DataFrame.
- Shuffle the rows of the DataFrame using the sample() method with the parameter frac as 1, it determines what fraction of total instances need to be returned.
- Print the original and the shuffled DataFrames.
python
# import the modules import pandas as pd
import numpy as np
# create a DataFrame ODI_runs = { 'name' : [ 'Tendulkar' , 'Sangakkara' , 'Ponting' ,
'Jayasurya' , 'Jayawardene' , 'Kohli' ,
'Haq' , 'Kallis' , 'Ganguly' , 'Dravid' ],
'runs' : [ 18426 , 14234 , 13704 , 13430 , 12650 ,
11867 , 11739 , 11579 , 11363 , 10889 ]}
df = pd.DataFrame(ODI_runs)
# print the original DataFrame print ("Original DataFrame :")
print (df)
# shuffle the DataFrame rows df = df.sample(frac = 1 )
# print the shuffled DataFrame print ("\nShuffled DataFrame:")
print (df)
|
Output :