Pandas Tutorial Pandas References

Pandas DataFrame - plot.box() function



A box plot also known as a whisker plot is a method for graphically describing groups of numerical data through their quartiles. The box is drawn from first quartile (Q1) to third quartile (Q3) , indicating position of the median in the dataset. The whiskers extend from the box to show the range of the data.

Box Plot

The DataFrame.plot.box() functions makes makes a box and whisker plot of the DataFrame columns.

Syntax

DataFrame.plot.box(by=None)

Parameters

by Optional. Specify columns in the DataFrame to group by as str or sequence.

Return Value

Returns matplotlib.axes.Axes or numpy.ndarray of them.

Example: box plot example

In the example below, a DataFrame df is created. A box plot of is created based on the data of df.

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

#providing seed for reproducibility
#of the result
np.random.seed(10)

df = pd.DataFrame(np.random.randn(10, 3),
  index = pd.date_range('1/1/2018', periods=10),
  columns = ['col_A', 'col_B', 'col_C']
)

#displaying the DataFrame
print(df,"\n")
#creating the plot
df.plot.box()

#displaying the plot
plt.show()

The output of the above code will be:

               col_A     col_B     col_C
2018-01-01  1.331587  0.715279 -1.545400
2018-01-02 -0.008384  0.621336 -0.720086
2018-01-03  0.265512  0.108549  0.004291
2018-01-04 -0.174600  0.433026  1.203037
2018-01-05 -0.965066  1.028274  0.228630
2018-01-06  0.445138 -1.136602  0.135137
2018-01-07  1.484537 -1.079805 -1.977728
2018-01-08 -1.743372  0.266070  2.384967
2018-01-09  1.123691  1.672622  0.099149
2018-01-10  1.397996 -0.271248  0.613204 
Box Plot

❮ Pandas DataFrame - Functions