Data ScienceIntermediate

Data Analysis with Python & Pandas: A Practical Guide

Aisha Nair

Published: Feb 20, 2026

4 steps 45 min

From raw CSV to actionable insights — learn data cleaning, exploration, and visualization with Pandas and Matplotlib.

PythonPandasDataMatplotlib

Loading and Exploring Data

Start with pd.read_csv() to load your dataset. Use df.head(), df.info(), and df.describe() to understand shape, types, and statistical summary.

explore.py

import pandas as pd

df = pd.read_csv('data.csv')
print(df.shape)       # rows × columns
print(df.info())      # dtypes + nulls
print(df.describe())  # statistical summary

Cleaning Missing Values

Use df.isnull().sum() to find gaps. Drop rows with dropna() or fill them with fillna(). Choose based on how much data you can afford to lose.

Warning

Dropping rows when missing values exceed 20% of a column will heavily bias your analysis. Consider imputation instead.

Grouping & Aggregation

groupby() is the most powerful tool in Pandas. Combine it with agg() to compute multiple statistics per group in a single pass.

aggregate.py

result = df.groupby('category').agg(
    total_sales=('revenue', 'sum'),
    avg_order=('revenue', 'mean'),
    order_count=('order_id', 'count')
).reset_index()

Visualisation with Matplotlib

Use Matplotlib for quick plots and Seaborn for statistical visualisations. Always label axes and titles — a chart without context is useless in a report.

Pro Tip

For datasets over 1M rows, consider switching from Pandas to Polars — it runs on Rust and is 5–10x faster for most operations.

Aisha Nair

Data scientist and Python educator.

Deep tech stories, delivered weekly.

Join 15,000+ Indian developers and creators receiving our curated newsletter every Sunday morning.

No spam. Only high-quality content. Unsubscribe anytime.

Data Analysis with Python & Pandas: A Practical Guide

Loading and Exploring Data

Cleaning Missing Values

Grouping & Aggregation

Visualisation with Matplotlib

More Tutorials

Build a REST API with Node.js & Express from Scratch

Complete Guide to Next.js App Router & Server Components

Top AI Tools Every Developer Should Use in 2026

Deep tech stories, delivered weekly.

Data Analysis with Python & Pandas: A Practical Guide

Loading and Exploring Data

Cleaning Missing Values

Grouping & Aggregation

Visualisation with Matplotlib

More Tutorials

Build a REST API with Node.js & Express from Scratch

Complete Guide to Next.js App Router & Server Components

Top AI Tools Every Developer Should Use in 2026

Deep tech stories, delivered weekly.