RESEARCH

The basics of using Matplotlib and Seaborn for data visualization

24 JANUARY 2025
Mark Sikaundi - Data Scientist and AI Researcher.

Share this post

A new generation of African talent brings cutting-edge AI to scientific challenges

Let's go through the basics of using Matplotlib and Seaborn for data visualization in Python. We'll cover how to create basic plots such as line plots, bar plots, and scatter plots.

Introduction

Data visualization is an essential skill for data scientists and machine learning engineers. It helps in understanding the data and communicating insights to stakeholders. In this article, we'll cover the basics of using Matplotlib and Seaborn for data visualization in Python.

What is Matplotlib?

Matplotlib is a popular data visualization library in Python. It provides a wide range of plotting functions to create various types of plots such as line plots, bar plots, scatter plots, and more. Matplotlib is highly customizable and allows you to create publication-quality plots.

What is Seaborn?

Seaborn is a statistical data visualization library built on top of Matplotlib. It provides a high-level interface for creating attractive and informative statistical graphics. Seaborn simplifies the process of creating complex plots and provides a wide range of built-in themes and color palettes.

Installing Matplotlib and Seaborn

You can install Matplotlib and Seaborn using pip, the Python package manager. Run the following commands in your terminal:

pip install matplotlib seaborn

Importing Matplotlib and Seaborn

To use Matplotlib and Seaborn in your Python script, you need to import the libraries. Here's how you can import them:

import matplotlib.pyplot as plt
import seaborn as sns

Creating a Line Plot

A line plot is a type of plot that displays data points connected by straight lines. It is useful for visualizing trends and patterns in data. Here's how you can create a line plot using Matplotlib:

import matplotlib.pyplot as plt
import numpy as np

# Sample data
x = np.linspace(0, 10, 100)
y = np.sin(x)

# Line plot
plt.plot(x, y)
plt.show()

How to create a line plot using Matplotlib

To create a line plot using Matplotlib, you need to import the library and use the plt.plot() function. Here's an example:

import matplotlib.pyplot as plt
import numpy as np

# Sample data
x = np.linspace(0, 10, 100)
y = np.sin(x)

# Line plot
plt.plot(x, y)
plt.show()

Creating Basic Plots

import matplotlib.pyplot as plt
import seaborn as sns
import numpy as np
import pandas as pd

# Sample data
x = np.linspace(0, 10, 100)
y = np.sin(x)

# Matplotlib line plot
plt.figure(figsize=(8, 6))
plt.plot(x, y, label='sin(x)')
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.title('Line Plot using Matplotlib')
plt.legend()
plt.show()

# Seaborn line plot
data = pd.DataFrame({'x': x, 'y': y})
plt.figure(figsize=(8, 6))
sns.lineplot(data=data, x='x', y='y', label='sin(x)')
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.title('Line Plot using Seaborn')
plt.legend()
plt.show()

Creating Bar Plots

import matplotlib.pyplot as plt
import seaborn as sns
import numpy as np
import pandas as pd

# Sample data
x = ['A', 'B', 'C', 'D', 'E']
y = [10, 20, 30, 40, 50]

# Matplotlib bar plot
plt.figure(figsize=(8, 6))
plt.bar(x, y)
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.title('Bar Plot using Matplotlib')
plt.show()

# Seaborn bar plot
data = pd.DataFrame({'x': x, 'y': y})
plt.figure(figsize=(8, 6))
sns.barplot(data=data, x='x', y='y')
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.title('Bar Plot using Seaborn')
plt.show()

Creating Scatter Plots

import matplotlib.pyplot as plt
import seaborn as sns
import numpy as np

# Sample data
x = np.random.rand(100)
y = np.random.rand(100)

# Matplotlib scatter plot
plt.figure(figsize=(8, 6))
plt.scatter(x, y)
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.title('Scatter Plot using Matplotlib')
plt.show()

# Seaborn scatter plot
data = pd.DataFrame({'x': x, 'y': y})
plt.figure(figsize=(8, 6))
sns.scatterplot(data=data, x='x', y='y')
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.title('Scatter Plot using Seaborn')
plt.show()

Conclusion

In this article, we covered the basics of using Matplotlib and Seaborn for data visualization in Python. We created line plots, bar plots, and scatter plots using both libraries. Data visualization is an essential skill for data scientists and machine learning engineers, as it helps in understanding the data and communicating insights to stakeholders.

Explore more with ExamplesLupleg Community