SciPy stats.binned_statistic_2d() function | Python

stats.binned_statistic_2d(arr1, arr2, values, statistic='mean', bins=10, range=None) function computes a binned statistic for two-dimensional data.
It works like histogram2d. A histogram divides the plane into bins and counts the number of points in each bin; this function instead computes the sum, mean, median, count or another statistic of the values whose (arr1, arr2) points fall in each bin.
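
The relationship to histogram2d can be checked directly. The sketch below uses made-up random data with a fixed seed (an assumption for reproducibility, not part of the original examples) and compares the 'count' statistic with the counts from np.histogram2d on the same bins.

```python
import numpy as np
from scipy import stats

# Illustrative data only: 100 random points with a fixed seed.
rng = np.random.default_rng(0)
x = rng.random(100)
y = rng.random(100)
values = rng.random(100)

# 'count' ignores the contents of `values` and only counts points per bin.
counts = stats.binned_statistic_2d(x, y, values, statistic='count', bins=[4, 4])

# The plain 2-D histogram over the same 4 x 4 grid.
hist, x_edges, y_edges = np.histogram2d(x, y, bins=[4, 4])

same_counts = np.array_equal(counts.statistic, hist)
print("counts match histogram2d:", same_counts)

# Any other statistic uses the same grid but aggregates `values` instead.
sums = stats.binned_statistic_2d(x, y, values, statistic='sum', bins=[4, 4])
print("total of binned sums:", sums.statistic.sum())
```

The 'sum' call shows the difference: the grid is the same, but each cell holds an aggregate of values rather than a point count.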

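Parameters :
arr1 : [array_like] Input array to be binned along the first dimension (named x in SciPy's signature).
arr2 : [array_like] Input array to be binned along the second dimension (named y in SciPy's signature).
values : [array_like] Data on which the statistic is computed. It must have the same length as arr1 and arr2.
statistic : Statistic to compute {'mean', 'count', 'median', 'sum', or a function}. Default is 'mean'.
bins : [int, [int, int], array_like or [array, array]] If bins is an int, it is the number of equal-width bins used in both dimensions (10 by default). Two ints give the number of bins per dimension. An array, or a pair of arrays, gives the bin edges explicitly.
range : [(2, 2) array_like] Lower and upper bin limits for each dimension, [[xmin, xmax], [ymin, ymax]]. If not provided, each dimension runs from the minimum to the maximum of its data. Points outside the range are not tallied.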
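
As a small sketch of the bins, range and statistic options described above (the four data points are invented for illustration), the calls below use explicit bin edges, then an integer bin count with a range, then a callable statistic.

```python
import numpy as np
from scipy import stats

# Four hand-picked points so the bin contents are easy to follow.
x = np.array([0.1, 0.4, 0.6, 0.9])
y = np.array([0.2, 0.3, 0.7, 0.8])
values = np.array([1.0, 2.0, 3.0, 4.0])

# bins as explicit edges: two bins per axis, split at 0.5.
edges = np.array([0.0, 0.5, 1.0])
by_edges = stats.binned_statistic_2d(x, y, values, statistic='sum',
                                     bins=[edges, edges])
print(by_edges.statistic)

# bins as an int plus range: the same grid, built from [[xmin, xmax], [ymin, ymax]].
by_range = stats.binned_statistic_2d(x, y, values, statistic='sum',
                                     bins=2, range=[[0.0, 1.0], [0.0, 1.0]])
print(by_range.x_edge, by_range.y_edge)

# statistic as a function: it receives the 1-D array of values in each bin.
by_max = stats.binned_statistic_2d(x, y, values, statistic=np.max,
                                   bins=[edges, edges])
print(by_max.statistic[0, 0], by_max.statistic[1, 1])
```

Here the first two points land in the lower-left bin and the last two in the upper-right bin, so the 'sum' grid holds 3 and 7 on the diagonal and 0 elsewhere.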

Returns : The statistic for each bin (a 2-D array), the bin edges along the first and second dimensions, and the bin number assigned to each input point.
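
The bin numbers are the least obvious part of the result. The sketch below, on invented data, unpacks the result and decodes the flat bin numbers. It relies on SciPy's documented convention that each dimension has two extra outlier bins, so a 2 x 2 grid is numbered over a 4 x 4 shape and in-range indices start at 1. The expand_binnumbers keyword is a real SciPy parameter that the signature above does not list.

```python
import numpy as np
from scipy import stats

# One point per bin of a 2 x 2 grid over the unit square.
x = np.array([0.1, 0.4, 0.6, 0.9])
y = np.array([0.2, 0.7, 0.3, 0.8])
values = np.array([10.0, 20.0, 30.0, 40.0])
bounds = [[0.0, 1.0], [0.0, 1.0]]

# The result is a named tuple, so it can be unpacked directly.
flat = stats.binned_statistic_2d(x, y, values, statistic='mean',
                                 bins=[2, 2], range=bounds)
statistic, x_edge, y_edge, binnumber = flat
print(binnumber)

# Decode the flat numbers: each axis has bins + 2 slots (outliers included).
ix, iy = np.unravel_index(binnumber, (2 + 2, 2 + 2))

# expand_binnumbers=True returns the per-axis indices directly, shape (2, N).
expanded = stats.binned_statistic_2d(x, y, values, statistic='mean',
                                     bins=[2, 2], range=bounds,
                                     expand_binnumbers=True)
print(expanded.binnumber)

# Subtract 1 to turn bin indices into positions in the statistic array.
per_point = statistic[ix - 1, iy - 1]
print(per_point)
```

The same arithmetic explains the output of Code #1: with 5 bins per axis the shape is 7 x 7, so the first point, in bin 2 along both axes, gets number 2 * 7 + 2 = 16.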

Code #1 :




# stats.binned_statistic_2d() method
import numpy as np
from scipy import stats

# Random coordinates, so the numbers differ on every run
x = np.random.rand(10)
y = np.random.rand(10)

# Values attached to each (x, y) point
z = np.arange(10)

print("x : \n", x)
print("\ny : \n", y)
print("\nz : \n", z)

# count: number of points that fall in each of the 5 x 5 bins
print("\nbinned_statistic_2d for count :",
      stats.binned_statistic_2d(x, y, values=z,
                                statistic='count', bins=[5, 5]))


Output :

x :
[0.31218238 0.86791445 0.42763346 0.79798587 0.91361299 0.09005856
0.54419846 0.18973948 0.67016378 0.8083121 ]

y :
[0.35959238 0.69265819 0.18751529 0.98863414 0.97810927 0.24054104
0.76764562 0.60635485 0.61551806 0.63884672]

z :
[0 1 2 3 4 5 6 7 8 9]

binned_statistic_2d for count : BinnedStatistic2dResult(statistic=array([[1., 0., 1., 0., 0.],
[0., 1., 0., 0., 0.],
[1., 0., 0., 1., 0.],
[0., 0., 1., 0., 0.],
[0., 0., 1., 1., 2.]]), x_edge=array([0.09005856, 0.25476945, 0.41948033, 0.58419122, 0.74890211,
0.91361299]), y_edge=array([0.18751529, 0.34773906, 0.50796283, 0.6681866, 0.82841037,
0.98863414]), binnumber=array([16, 39, 22, 40, 40, 8, 25, 10, 31, 38], dtype=int64))

 
Code #2 :




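# stats.binned_statistic_2d() method
import numpy as np
from scipy import stats

# Random coordinates, so the numbers differ on every run.
# The output shown below appears to come from the same x, y and z as Code #1.
x = np.random.rand(10)
y = np.random.rand(10)
z = np.arange(10)

# mean: average of z in each of the 5 x 5 bins; empty bins give nan
print("\nbinned_statistic_2d for mean :",
      stats.binned_statistic_2d(x, y, values=z,
                                statistic='mean', bins=[5, 5]))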


Output :

binned_statistic_2d for mean : BinnedStatistic2dResult(statistic=array([[5., nan, 7., nan, nan],
[nan, 0., nan, nan, nan],
[2., nan, nan, 6., nan],
[nan, nan, 8., nan, nan],
[nan, nan, 9., 1., 3.5]]), x_edge=array([0.09005856, 0.25476945, 0.41948033, 0.58419122, 0.74890211,
0.91361299]), y_edge=array([0.18751529, 0.34773906, 0.50796283, 0.6681866, 0.82841037,
0.98863414]), binnumber=array([16, 39, 22, 40, 40, 8, 25, 10, 31, 38], dtype=int64))
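
The nan entries in the mean output mark bins that contain no points. The sketch below, on three invented points, shows one way to handle them: detect the empty bins with np.isnan, fill them with a placeholder, or use nan-aware reductions. The fill value 0.0 is an arbitrary choice for illustration.

```python
import numpy as np
from scipy import stats

# Three points that occupy only two of the four bins.
x = np.array([0.1, 0.2, 0.9])
y = np.array([0.1, 0.2, 0.9])
values = np.array([1.0, 3.0, 5.0])

res = stats.binned_statistic_2d(x, y, values, statistic='mean',
                                bins=[2, 2], range=[[0.0, 1.0], [0.0, 1.0]])
print(res.statistic)

# Boolean mask of bins that received no points.
empty = np.isnan(res.statistic)
print("empty bins:", int(empty.sum()))

# Replace nan with a placeholder (0.0 here, chosen arbitrarily).
filled = np.where(empty, 0.0, res.statistic)
print(filled)

# Or skip empty bins when reducing over the grid.
mean_of_bin_means = np.nanmean(res.statistic)
print(mean_of_bin_means)
```

Filling with 0.0 is convenient for plotting but changes later averages, so the nan-aware reductions are usually the safer choice for further computation.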



Last Updated : 18 Feb, 2019