Python | Pandas series.cummax() to find Cumulative maximum of a series
Python is a great language for doing data analysis, primarily because of the fantastic ecosystem of data-centric Python packages. Pandas is one of those packages and makes importing and analyzing data much easier.
Series.cummax() is used to find Cumulative maximum of a series. In cumulative maximum, the length of returned series is same as input series and every element is equal to the greater one between current element and previous element.
Syntax: Series.cummax(axis=None, skipna=True)
axis: 0 or ‘index’ for row wise operation and 1 or ‘columns’ for column wise operation.
skipna: Skips NaN addition for elements after the very next one if True.
Return type: Series
In this example, a series is created from a Python list. The list also contains a Null value and the
skipna parameter is kept default, that is True.
0 3.0 1 4.0 2 NaN 3 7.0 4 7.0 5 7.0 dtype: float64
Explanation: Cummax is comparison of current value with previous value. The first element is always equal to first of caller series.
3 4 (4>3) NaN (Since NaN cannot be compared to integer values) 7 (7>4) 7 (7>2) 7 (7>0)
Example #2: Keeping
skipna = False
In this example, a series is created just like in the above example. But the skipna parameter is kept False. Hence NULL values won’t be ignored and it would be compared every time on it’s occurrence.
0 9.0 1 9.0 2 33.0 3 NaN 4 NaN 5 NaN 6 NaN 7 NaN dtype: float64
Explanation: Just like in the above example, maximum of current and previous values were stored at every position until NaN occurred. Since NaN compared with anything returns NaN and skipna parameter is kept False, the cumulative maximum after its occurrence is NaN due to comparison of all the values with NaN.