pandas.core.groupby.GroupBy.median

Compute median of groups, excluding missing values.

For multiple groupings, the result index will be a MultiIndex.

Returns

Series or DataFrame

Median of values within each group.

Limitations

  • This function may run slower than the equivalent Pandas* call on the user's system. Users should weigh the cost of staying in the JIT region with this function against falling back to interpreter mode.

Examples

Compute median of groups, excluding missing values.
import pandas as pd
from numba import njit


@njit
def df_groupby_median():
    df = pd.DataFrame({'A': [1, 2, 3, 1, 2, 3, 3, 3, 2],
                       'B': [0, 1, 5, 0, 3, 4, 3, 2, 4],
                       'C': [1, 2, 3, 4, 5, 6, 7, 8, 9]})
    out_df = df.groupby('A').median()

    # Expect DataFrame of
    # {'B': [0.0, 3.0, 3.5], 'C': [2.5, 5.0, 6.5]} with index=[1, 2, 3]
    return out_df


print(df_groupby_median())
$ python ./dataframe/groupby/dataframe_groupby_median.py
     B    C
1  0.0  2.5
2  3.0  5.0
3  3.5  6.5
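
The two behaviours described at the top of this page (missing values are excluded, and multiple groupings produce a MultiIndex) can be sketched with plain Pandas* in interpreter mode. This is an illustrative sketch only: the data and the names `single` and `multi` are made up for the example, and it does not show whether the JIT-compiled version accepts the same inputs.

```python
import numpy as np
import pandas as pd

# Illustrative data (not from the original example); 'C' has one missing value.
df = pd.DataFrame({'A': [1, 1, 2, 2, 2],
                   'B': ['x', 'x', 'y', 'y', 'z'],
                   'C': [1.0, np.nan, 3.0, 5.0, 7.0]})

# Single grouping key: the NaN in group A == 1 is skipped,
# so the median of that group is taken over [1.0] only.
single = df.groupby('A')['C'].median()

# Two grouping keys: the result is indexed by a MultiIndex of (A, B) pairs.
multi = df.groupby(['A', 'B'])['C'].median()

print(single)
print(multi)
```

Here `single` is indexed by the distinct values of `A`, while `multi` is indexed by each observed `(A, B)` combination.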

See also

Series.groupby

Group Series using a mapper or by a Series of columns.

DataFrame.groupby

Group DataFrame using a mapper or by a Series of columns.