Pandas Tutorial Pandas References

Pandas DataFrame - plot.bar() function



A bar plot (or bar chart) is a graph that presents categorical data with rectangular bars with heights or lengths proportional to the values that they represent. The bars can be plotted vertically or horizontally.

The DataFrame.plot.bar() functions makes a vertical bar plot. A bar plot shows comparisons among discrete categories. One axis of the plot shows the specific categories being compared, and the other axis represents a measured value.

Syntax

DataFrame.plot.bar(x=None, y=None, color)

Parameters

x Optional. Specify label or position. Allows plotting of one column versus another. If not specified, the index of the DataFrame is used.
y Optional. Specify label or position. Allows plotting of one column versus another. If not specified, all numerical columns are used.
colors Optional. Specify color for each of the DataFrame’s columns as str, array-like, or dict. Possible values are:
  • A single color string referred to by name, RGB or RGBA code. For example - 'red' or '#a98d19'.
  • A sequence of color strings referred to by name, RGB or RGBA. It will be used for each column recursively. For example - ['green','yellow']. Each column's bar will be filled in green or yellow, alternatively.
  • A dict of the form {column namecolor}, so that each column will be colored accordingly. For example - {'columnA': 'green', 'columnB': 'red'}.

Return Value

Returns matplotlib.axes.Axes or an ndarray with one matplotlib.axes.Axes per column when subplots=True.

Example: bar plot on single column

In the example below, a DataFrame NumberOfStudents is created. A bar plot is created containing information about number of student studying different languages for year 2016.

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

NumberOfStudents = pd.DataFrame({
  "2016": [50, 55, 60, 70, 70, 60],
  "2017": [60, 65, 70, 80, 65, 55],
  "2018": [50, 65, 75, 100, 70, 60]},
  index= ["Java", "C#", "PHP", "Python", "C++", "C"]
)

#displaying the DataFrame
print(NumberOfStudents,"\n")
#creating the plot
NumberOfStudents.plot.bar(y=['2016'], rot=0)

#displaying the plot
plt.show()

The output of the above code will be:

        2016  2017  2018
Java      50    60    50
C#        55    65    65
PHP       60    70    75
Python    70    80   100
C++       70    65    70
C         60    55    60 
Bar Plot

Example: bar plot on multiple columns

In the example below the bar plot is drawn for specified columns (2016, 2018).

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

NumberOfStudents = pd.DataFrame({
  "2016": [50, 55, 60, 70, 70, 60],
  "2017": [60, 65, 70, 80, 65, 55],
  "2018": [50, 65, 75, 100, 70, 60]},
  index= ["Java", "C#", "PHP", "Python", "C++", "C"]
)

#displaying the DataFrame
print(NumberOfStudents,"\n")
#creating the plot
NumberOfStudents.plot.bar(y = ['2016', '2018'], rot=0)

#displaying the plot
plt.show()

The output of the above code will be:

        2016  2017  2018
Java      50    60    50
C#        55    65    65
PHP       60    70    75
Python    70    80   100
C++       70    65    70
C         60    55    60 
Bar Plot

Example: stacked bar plot

By using stacked=True parameter, a stack bar plot can be created. Consider the example below:

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

NumberOfStudents = pd.DataFrame({
  "2016": [50, 55, 60, 70, 70, 60],
  "2017": [60, 65, 70, 80, 65, 55],
  "2018": [50, 65, 75, 100, 70, 60]},
  index= ["Java", "C#", "PHP", "Python", "C++", "C"]
)

#displaying the DataFrame
print(NumberOfStudents,"\n")
#creating the plot
NumberOfStudents.plot.bar(rot=0, stacked=True)

#displaying the plot
plt.show()

The output of the above code will be:

        2016  2017  2018
Java      50    60    50
C#        55    65    65
PHP       60    70    75
Python    70    80   100
C++       70    65    70
C         60    55    60 
Bar Plot

Example: splitting the plot column wise

By using subplots=True parameter, we can split the plots column wise. Consider the example below:

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

NumberOfStudents = pd.DataFrame({
  "2016": [50, 55, 60, 70, 70, 60],
  "2017": [60, 65, 70, 80, 65, 55],
  "2018": [50, 65, 75, 100, 70, 60]},
  index= ["Java", "C#", "PHP", "Python", "C++", "C"]
)

#displaying the DataFrame
print(NumberOfStudents,"\n")
#creating the plot
NumberOfStudents.plot.bar(y=['2016','2017'], 
                          rot=0, subplots=True)

#displaying the plot
plt.show()

The output of the above code will be:

        2016  2017  2018
Java      50    60    50
C#        55    65    65
PHP       60    70    75
Python    70    80   100
C++       70    65    70
C         60    55    60 
Bar Plot

❮ Pandas DataFrame - Functions