Pandas Notes

The document provides detailed notes on the basics of using pandas, focusing on creating DataFrames, inspecting data, checking structure, and performing aggregations. It includes real-life examples, concise definitions, syntax breakdowns, full working code, and key takeaways for each topic. The notes aim to help users effectively analyze and manipulate data using pandas.

Uploaded by

raghavendramamidala92

Pandas Notes

Here are detailed notes on the pandas basics we’ve covered. Each section follows
the same five-point pattern, with very simple examples and full code you can run.

1. Creating a DataFrame
1. Real-Life Example
You run a roadside tea stall and note today’s sales of two items:

Cups of tea sold

Packets of biscuits sold

You write it on paper:

Tea – 30 cups
Biscuits – 20 packets

To analyse sales, you want this in a neat computer table.

2. Concise Definition
A DataFrame is pandas’ way to store table data (rows × columns), just like an
Excel sheet.

3. Syntax Breakdown

pd.DataFrame(data,          # your raw data (dict or list of dicts)
             columns=None,  # (optional) list of column names
             index=None)    # (optional) list of row labels

data: often a dict of equal-length lists, e.g. {'item': [...], 'count': [...]}

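columns: for dict data, picks which keys appear and in what order (a name that is not in the data becomes a column of NaN); for a list of lists, it supplies the column names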

index: lets you label rows (e.g. dates or IDs)
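A minimal sketch of how the two optional arguments behave, reusing the tea-stall data. The row labels 'row_a' and 'row_b' and the extra 'price' column are made up for illustration.

```python
import pandas as pd

data = {'item': ['Tea', 'Biscuits'], 'sold': [30, 20]}

# columns= selects and orders keys from the dict; index= labels the rows
# ('row_a' and 'row_b' are hypothetical labels)
df = pd.DataFrame(data, columns=['sold', 'item'], index=['row_a', 'row_b'])
print(df)

# A name that is not a key in the dict ('price' here) gives an all-NaN column
df_extra = pd.DataFrame(data, columns=['item', 'sold', 'price'])
print(df_extra)

# For a list of lists, columns= supplies the column names
df_rows = pd.DataFrame([['Tea', 30], ['Biscuits', 20]], columns=['item', 'sold'])
print(df_rows)
```

With dict data, columns= does not rename anything. To rename columns after creation, df.rename(columns={...}) is the usual tool.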

4. Full, Working Code

import pandas as pd

# 1) Raw sales data as a dictionary

data = {
'item': ['Tea', 'Biscuits'],
'sold': [30, 20]
}

# 2) Create the DataFrame

df = pd.DataFrame(data)

# 3) Show the table

print(df)

Output:

       item  sold
0       Tea    30
1  Biscuits    20

5. Three Key Takeaways

1. Dict → Table: A dict of lists becomes a neat table.

2. Check with print(df): Always look at your table right after creating it.

3. Flexible Input: You can also start from a list of records ( [{'item':'Tea','sold':30}, …] ).
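As a rough illustration of the third takeaway, the sketch below builds the same tea-stall table from a list of records and from a dict of lists, then compares the two. The variable names are chosen for this example only.

```python
import pandas as pd

# One dict per row ("list of records")
records = [
    {'item': 'Tea', 'sold': 30},
    {'item': 'Biscuits', 'sold': 20},
]
df_from_records = pd.DataFrame(records)

# One list per column ("dict of lists")
df_from_dict = pd.DataFrame({'item': ['Tea', 'Biscuits'], 'sold': [30, 20]})

# Both inputs describe the same table
print(df_from_records.equals(df_from_dict))
```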

2. Inspecting with head()

1. Real-Life Example
You have a big list of student marks. You want to peek at the first few entries to
confirm you loaded them correctly.

2. Concise Definition
df.head(n) returns the first n rows of your DataFrame (default n=5).

3. Syntax Breakdown

df.head(n)

df: your DataFrame

.head: the method that returns the top rows

(n): number of rows to show (optional; default = 5)
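The breakdown above can be sketched with a small throwaway table (the column name 'n' and its 8 rows are assumptions for this example). It also shows two behaviours the notes do not mention: an n larger than the table returns every row, and a negative n returns everything except the last |n| rows.

```python
import pandas as pd

df = pd.DataFrame({'n': range(1, 9)})   # hypothetical 8-row table

first_five = df.head()             # no argument: default n=5
first_two = df.head(2)             # explicit n
all_rows = df.head(100)            # n larger than the table: every row comes back
all_but_last_three = df.head(-3)   # negative n: drop the last 3 rows

print(len(first_five), len(first_two), len(all_rows), len(all_but_last_three))
```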

4. Full, Working Code

import pandas as pd

data = {
    'student': ['Amit', 'Bina', 'Chirag', 'Deepa', 'Esha', 'Farhan'],
    'marks': [85, 90, 78, 92, 88, 75]
}
df = pd.DataFrame(data)

# Show the first 3 students

print(df.head(3))

Output:

  student  marks
0    Amit     85
1    Bina     90
2  Chirag     78

5. Three Key Takeaways

1. Quick Peek: head() avoids scrolling through hundreds of rows.

2. Default = 5: Without n, head() returns the first 5 rows.

3. Errors Show Early: If your data header is wrong, you spot it immediately.

3. Checking Structure (shape, columns, dtypes)

1. Real-Life Example
You have a guest list for a family function with names, ages, and gifts they bring.
You want to know:

How many guests?

What columns do you have?

Are ages stored as numbers or text?

2. Concise Definition
df.shape → returns (rows, columns)

df.columns → lists the column names

df.dtypes → shows each column’s data type (for example int64, float64, object)

3. Syntax Breakdown

df.shape # no (), returns a tuple like (10, 3)

df.columns # no (), returns an Index of column names
df.dtypes # no (), returns a Series of column:data_type
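Because these three are attributes rather than methods, their values can be used directly in ordinary Python. A small sketch on the guest-list data, with variable names chosen for this example:

```python
import pandas as pd

df = pd.DataFrame({
    'name': ['Ravi', 'Sara', 'Manoj'],
    'age': [28, 25, 30],
    'gift': ['Flowers', 'Chocolates', 'Book'],
})

# shape is a plain tuple, so it can be unpacked
n_rows, n_cols = df.shape

# columns is an Index; list() turns it into an ordinary list
column_names = list(df.columns)

# dtypes is a Series keyed by column name
age_dtype = df.dtypes['age']

print(n_rows, n_cols, column_names, age_dtype)
```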

4. Full, Working Code

import pandas as pd

guests = {
'name': ['Ravi', 'Sara', 'Manoj'],
'age': [28, 25, 30],
'gift': ['Flowers','Chocolates','Book']
}
df = pd.DataFrame(guests)

# Check structure
print("Shape :", df.shape)
print("Columns :", df.columns)
print("Data types:", df.dtypes, sep="\n")

Output:

Shape : (3, 3)
Columns : Index(['name', 'age', 'gift'], dtype='object')
Data types:
name    object
age      int64
gift    object
dtype: object

5. Three Key Takeaways

1. Know Size: shape tells you exactly how many rows and columns.

2. See Fields: columns avoids guessing field names.

3. Type Safety: dtypes lets you catch “ages as text” before you do math.
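To make the "ages as text" case concrete, here is a hedged sketch in which the ages arrive as strings (an assumption for this example, as if they had been typed into a form). The dtype check reveals the problem, and pd.to_numeric is one way to fix it.

```python
import pandas as pd

# Ages stored as text by mistake (hypothetical input)
df = pd.DataFrame({'name': ['Ravi', 'Sara'], 'age': ['28', '25']})

# Check the type before doing any math
numeric_before = pd.api.types.is_numeric_dtype(df['age'])

# Convert the text to numbers
df['age'] = pd.to_numeric(df['age'])
numeric_after = pd.api.types.is_numeric_dtype(df['age'])

total_age = df['age'].sum()
print(numeric_before, numeric_after, total_age)
```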

4. Aggregation with agg()

1. Real-Life Example
You track daily sales of two sweets at your mithai shop. After a week, you want:

Total sweets sold

Average price you charged

Day with maximum laddoos sold

Instead of manual sums, you use pandas to tell you in one step.

2. Concise Definition
df.aggregate() (or df.agg()) computes summary numbers (sum, mean, max, min) across the entire DataFrame.

Combined with groupby() , it does the same per category (e.g., per sweet type).

3. Syntax Breakdown

# Overall summary
df.agg({'sold':'sum', 'price':'mean'})

# By category
df.groupby('sweet').agg({
'sold':['sum','max'],
'price':['mean','min']
})

df: your table

.agg / .aggregate: the summary function

groupby('col'): first split rows by that column

func dict/list: choose which statistics you want
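The dict-of-lists form above produces two-level column headers. As an alternative that the notes do not cover, pandas also accepts "named aggregation", where each output column gets a flat name. The sketch below uses the mithai-shop data; the output names total_sold, max_sold, and avg_price are made up for this example.

```python
import pandas as pd

df = pd.DataFrame({
    'day': ['Mon', 'Tue', 'Wed', 'Thu', 'Fri', 'Sat', 'Sun'],
    'sweet': ['laddoo', 'laddoo', 'gulab', 'laddoo', 'gulab', 'gulab', 'laddoo'],
    'sold': [10, 12, 8, 15, 10, 9, 11],
    'price': [20, 20, 25, 20, 25, 25, 20],
})

# Named aggregation: output_name=(input_column, function)
per_sweet = df.groupby('sweet').agg(
    total_sold=('sold', 'sum'),
    max_sold=('sold', 'max'),
    avg_price=('price', 'mean'),
)
print(per_sweet)
```

Flat names like these are often easier to use in later steps than two-level headers, at the cost of writing one line per statistic.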

4. Full, Working Code

import pandas as pd

data = {
'day': ['Mon','Tue','Wed','Thu','Fri','Sat','Sun'],
'sweet': ['laddoo','laddoo','gulab','laddoo','gulab','gulab','laddoo'],
'sold': [10, 12, 8, 15, 10, 9, 11],
'price': [20, 20, 25, 20, 25, 25, 20]
}
df = pd.DataFrame(data)

# 1) Overall summary
print(df.agg({'sold':'sum','price':'mean'}))

# 2) Summary by sweet
print(df.groupby('sweet').agg({'sold':['sum','max'],'price':['mean','min']}))
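The code above reports the maximum number sold, but not the day it happened, which the real-life example also asks for. One possible way to get it, sketched here with idxmax() (an approach not used in the notes themselves):

```python
import pandas as pd

df = pd.DataFrame({
    'day': ['Mon', 'Tue', 'Wed', 'Thu', 'Fri', 'Sat', 'Sun'],
    'sweet': ['laddoo', 'laddoo', 'gulab', 'laddoo', 'gulab', 'gulab', 'laddoo'],
    'sold': [10, 12, 8, 15, 10, 9, 11],
    'price': [20, 20, 25, 20, 25, 25, 20],
})

# Keep only the laddoo rows
laddoo = df[df['sweet'] == 'laddoo']

# idxmax() gives the row label of the largest value; .loc fetches that row
best_row = laddoo.loc[laddoo['sold'].idxmax()]
best_day = best_row['day']
best_count = int(best_row['sold'])

print(best_day, best_count)
```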

5. Three Key Takeaways

1. One-Step Summary: .agg() gives totals and averages in one command.

2. Compare Groups: groupby() + agg() shows stats per category (like laddoo vs gulab).

3. Customizable: Pass your own list or dict of functions, e.g. .agg(['min','max','mean']), or even your own Python function.
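A short sketch of the third takeaway on the same mithai-shop data: first a list of built-in function names, then a custom function. The helper value_range is hypothetical and written only for this example.

```python
import pandas as pd

df = pd.DataFrame({
    'sweet': ['laddoo', 'laddoo', 'gulab', 'laddoo', 'gulab', 'gulab', 'laddoo'],
    'sold': [10, 12, 8, 15, 10, 9, 11],
})

# A list of function names gives one result per name
summary = df['sold'].agg(['min', 'max', 'mean'])
print(summary)


def value_range(values):
    """Hypothetical helper: gap between the largest and smallest value."""
    return values.max() - values.min()


# A custom function works the same way, here applied per sweet
range_per_sweet = df.groupby('sweet')['sold'].agg(value_range)
print(range_per_sweet)
```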

End of Notes
Keep practicing with your own data every day!
