0% found this document useful (0 votes)

35 views9 pages

Pandas Tutorial

Pandas is a Python library for data manipulation and analysis, utilizing Series and DataFrames for efficient structured data handling. It provides functionalities for reading various data formats, data cleaning, manipulation, grouping, and merging. This tutorial covers essential operations that form the foundation of data analysis workflows using Pandas.

Uploaded by

otj7w

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views9 pages

Pandas Tutorial

Uploaded by

otj7w

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Pandas Tutorial

### Pandas Overview

Pandas is a Python library designed for data manipulation and
analysis. It provides powerful, flexible data structures-Series and
DataFrames-for working with structured data efficiently.

---

## 1. DataFrames and Series

### Series
A Series is a one-dimensional array-like object that can hold data of
any type (integers, strings, floats, etc.), along with an associated
index. It is similar to a column in a spreadsheet or a dictionary where
keys are the index.

Example:
```python
import pandas as pd

data = [10, 20, 30, 40]

index = ['A', 'B', 'C', 'D']
series = pd.Series(data, index=index)

print(series)
```
Output:
```
A 10
B 20
C 30
D 40
dtype: int64
```

### DataFrame
A DataFrame is a two-dimensional, tabular data structure with labeled
rows and columns, akin to a spreadsheet. It is essentially a collection
of Series sharing the same index.

Example:
```python
data = {
'Name': ['Alice', 'Bob', 'Charlie'],
'Age': [25, 30, 35],
'Salary': [50000, 60000, 70000]
}

df = pd.DataFrame(data)
print(df)
```
Output:
```
Name Age Salary
0 Alice 25 50000
1 Bob 30 60000
2 Charlie 35 70000
```

---

## 2. Reading Data

Pandas makes it easy to read and write data in various formats like
CSV, Excel, JSON, SQL, and more.

### Reading CSV Files

```python
df = pd.read_csv('data.csv') # Reads data from a CSV file
```

### Reading Excel Files

```python
df = pd.read_excel('data.xlsx', sheet_name='Sheet1')
```

### Reading JSON Files

```python
df = pd.read_json('data.json')
```

---

## 3. Data Cleaning

Data cleaning involves preparing raw data by handling

inconsistencies or errors.

### Dropping Rows/Columns

```python
df = df.drop(columns=['UnnecessaryColumn'])
df = df.dropna() # Drops rows with missing values
```

### Renaming Columns

```python
df = df.rename(columns={'OldName': 'NewName'})
```

### Replacing Values

```python
df['ColumnName'] = df['ColumnName'].replace({'OldValue':
'NewValue'})
```

### Changing Data Types

```python
df['Age'] = df['Age'].astype(int) # Converts to integer type
```

---

## 4. Data Manipulation

### Selecting Data

- By column name:
```python
df['ColumnName']
```
- By multiple columns:
```python
df[['Column1', 'Column2']]
```
- By condition:
```python
df[df['Age'] > 30]
```

### Adding New Columns

```python
df['NewColumn'] = df['Column1'] + df['Column2']
```
### Sorting Data
```python
df = df.sort_values(by='Age', ascending=True)
```

---

## 5. Handling Missing Data

Pandas provides tools to detect and handle missing data effectively.

### Detecting Missing Data

```python
df.isnull() # Returns a DataFrame of True/False for missing values
df.isnull().sum() # Counts missing values for each column
```

### Filling Missing Data

- Fill with a specific value:
```python
df['ColumnName'] = df['ColumnName'].fillna(0)
```
- Fill with column mean/median/mode:
```python
df['ColumnName'] =
df['ColumnName'].fillna(df['ColumnName'].mean())
```
### Dropping Missing Data
```python
df = df.dropna() # Drops rows with missing values
```

---

## 6. Grouping Data

Grouping allows you to aggregate data based on one or more keys.

### Group By
```python
grouped = df.groupby('Category')
```

### Aggregate Functions

```python
grouped['ColumnName'].mean() # Computes the mean for each
group
grouped['ColumnName'].sum() # Computes the sum for each group
```

### Multiple Aggregations

```python
df.groupby('Category').agg({'Column1': 'mean', 'Column2': 'sum'})
```

---

## 7. Merging Data

Pandas provides several methods to merge or join datasets.

### Merging DataFrames

```python
merged_df = pd.merge(df1, df2, on='common_column')
```

### Join Types

- Inner Join (default):
Matches rows with keys in both DataFrames.
- Outer Join:
Includes all rows, filling missing values with NaN.
```python
pd.merge(df1, df2, on='common_column', how='outer')
```
- Left Join:
Includes all rows from the left DataFrame.
```python
pd.merge(df1, df2, on='common_column', how='left')
```
- Right Join:
Includes all rows from the right DataFrame.
```python
pd.merge(df1, df2, on='common_column', how='right')
```

### Concatenating DataFrames

Combine rows or columns of DataFrames:
```python
pd.concat([df1, df2], axis=0) # Stacks rows
pd.concat([df1, df2], axis=1) # Combines columns
```

---

### Summary
Pandas is a versatile tool that allows efficient handling of structured
data. Whether you're cleaning messy data, performing calculations,
or preparing data for visualization, Pandas is your go-to library in
Python. Each operation-reading, cleaning, manipulating, grouping,
and merging-forms the foundation of data analysis workflows.

Fanuc 21i Model A - Alarm List
No ratings yet
Fanuc 21i Model A - Alarm List
53 pages
E-Commerce: Kenneth C. Laudon Carol Guercio Traver
No ratings yet
E-Commerce: Kenneth C. Laudon Carol Guercio Traver
51 pages
Modules 9 - 12 Group Exam Answers PART 1
No ratings yet
Modules 9 - 12 Group Exam Answers PART 1
2 pages
Introduction To Pandas in Data Analytics
No ratings yet
Introduction To Pandas in Data Analytics
12 pages
Python Programming For Data Science
No ratings yet
Python Programming For Data Science
36 pages
Python Unit 3 4
No ratings yet
Python Unit 3 4
92 pages
Pandas
No ratings yet
Pandas
2 pages
FDS Module 2 Notes
No ratings yet
FDS Module 2 Notes
24 pages
Pandas Tutorial
No ratings yet
Pandas Tutorial
7 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
5 pages
Pandas
No ratings yet
Pandas
4 pages
Mypnotes
No ratings yet
Mypnotes
3 pages
Content Pandas Cheat Sheet
No ratings yet
Content Pandas Cheat Sheet
9 pages
What Is Pandas
No ratings yet
What Is Pandas
9 pages
Introduction To Pandas Programming 2
No ratings yet
Introduction To Pandas Programming 2
3 pages
DAP 3 Module
No ratings yet
DAP 3 Module
62 pages
Python Pandas Tutorial For Beginners
No ratings yet
Python Pandas Tutorial For Beginners
203 pages
Data Handling Module
No ratings yet
Data Handling Module
10 pages
Pandas
No ratings yet
Pandas
9 pages
learnPandas
No ratings yet
learnPandas
37 pages
Pandas
No ratings yet
Pandas
13 pages
Pandas
No ratings yet
Pandas
94 pages
Pandas Introduction: What Is Python Pandas Used For?
No ratings yet
Pandas Introduction: What Is Python Pandas Used For?
28 pages
Python 2.1.3
No ratings yet
Python 2.1.3
6 pages
Pandas Notes
No ratings yet
Pandas Notes
3 pages
Pandas For Python Pro Level Cheat Sheet
No ratings yet
Pandas For Python Pro Level Cheat Sheet
14 pages
Python 2.1.2
No ratings yet
Python 2.1.2
7 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
2 pages
Pandas CheatSheet
No ratings yet
Pandas CheatSheet
18 pages
Loki Temp PPT Pandas 2
No ratings yet
Loki Temp PPT Pandas 2
31 pages
Pandas
No ratings yet
Pandas
26 pages
Pandas Library
No ratings yet
Pandas Library
6 pages
Pandas Notes
No ratings yet
Pandas Notes
4 pages
Unit IV
No ratings yet
Unit IV
49 pages
ML Unit-2 Notes
No ratings yet
ML Unit-2 Notes
17 pages
Exp3 Python
No ratings yet
Exp3 Python
15 pages
JOINS
No ratings yet
JOINS
10 pages
Pandas
No ratings yet
Pandas
13 pages
Python Interviews
No ratings yet
Python Interviews
154 pages
Usage of NumPy For Numerical Data in Detail
No ratings yet
Usage of NumPy For Numerical Data in Detail
52 pages
Data Analysis With Python
No ratings yet
Data Analysis With Python
60 pages
DevOps Session 3 Pandas
No ratings yet
DevOps Session 3 Pandas
33 pages
Pandas
No ratings yet
Pandas
50 pages
Pandas For Data Science
No ratings yet
Pandas For Data Science
42 pages
Data Wrangling With Python and Pandas
No ratings yet
Data Wrangling With Python and Pandas
7 pages
Advanced Analytic Techniques
No ratings yet
Advanced Analytic Techniques
2 pages
Pandas
No ratings yet
Pandas
8 pages
Pandas Notes
No ratings yet
Pandas Notes
6 pages
Pandas programs
No ratings yet
Pandas programs
2 pages
Python Notes by Prof T
No ratings yet
Python Notes by Prof T
10 pages
Pandas Library
No ratings yet
Pandas Library
15 pages
Exercise 3
No ratings yet
Exercise 3
12 pages
Unit 3
No ratings yet
Unit 3
10 pages
Pandas Roadmap
No ratings yet
Pandas Roadmap
6 pages
Introduction To Pandas For Data Analysis
No ratings yet
Introduction To Pandas For Data Analysis
6 pages
Pandas
No ratings yet
Pandas
25 pages
Pandas
No ratings yet
Pandas
25 pages
Pandas Notes Design
No ratings yet
Pandas Notes Design
5 pages
NumPy and Pandas
No ratings yet
NumPy and Pandas
12 pages
Pandas Notes
No ratings yet
Pandas Notes
8 pages
CHP 8 Pandas
No ratings yet
CHP 8 Pandas
49 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Project Name Delivery Date Test Design Date Designed by Browsers Android Versions iOS Versions Carrier Information (If Needed)
No ratings yet
Project Name Delivery Date Test Design Date Designed by Browsers Android Versions iOS Versions Carrier Information (If Needed)
26 pages
Year 1 Exponentials and Logarithsm Unit Test 1.1
No ratings yet
Year 1 Exponentials and Logarithsm Unit Test 1.1
7 pages
Customer Workflow CF en
No ratings yet
Customer Workflow CF en
262 pages
Data Structure and Algorithms Sylabus Sta Rita
No ratings yet
Data Structure and Algorithms Sylabus Sta Rita
15 pages
Unit 4
No ratings yet
Unit 4
215 pages
MegaProject Report Final-1
No ratings yet
MegaProject Report Final-1
39 pages
Golden Master Pedal Manual
No ratings yet
Golden Master Pedal Manual
17 pages
Chapter 2-3 PHP Working With Strings
No ratings yet
Chapter 2-3 PHP Working With Strings
47 pages
STM32G0B1xB/xC/xE Device Errata
No ratings yet
STM32G0B1xB/xC/xE Device Errata
22 pages
Openvms Cluster
No ratings yet
Openvms Cluster
354 pages
Poverty Alleviation Through Information and Communication Technology and Its Implication Towards Local Economy (FINAL)
No ratings yet
Poverty Alleviation Through Information and Communication Technology and Its Implication Towards Local Economy (FINAL)
221 pages
Audio Technica AT LP60XUSB Product Information
No ratings yet
Audio Technica AT LP60XUSB Product Information
1 page
Urgent Registration-Integer TelecomB - Tech EC/EE/EEE 2024 Batch GU/GCE
No ratings yet
Urgent Registration-Integer TelecomB - Tech EC/EE/EEE 2024 Batch GU/GCE
2 pages
History of Video Games - Wikipedia
No ratings yet
History of Video Games - Wikipedia
74 pages
Parça 9
No ratings yet
Parça 9
2 pages
All School - Avenues OPEN
No ratings yet
All School - Avenues OPEN
1 page
YUNZII AL66 Function Card 240408
No ratings yet
YUNZII AL66 Function Card 240408
4 pages
Week 03
No ratings yet
Week 03
28 pages
Chapter 2 Hands-On MIS Application Problem Statement: Excel Tutorials Links
No ratings yet
Chapter 2 Hands-On MIS Application Problem Statement: Excel Tutorials Links
2 pages
Roblox Shirt Emo White Skin - Google Search
No ratings yet
Roblox Shirt Emo White Skin - Google Search
1 page
Call For Papers New v6
No ratings yet
Call For Papers New v6
2 pages
Acceptable Usage Policy
No ratings yet
Acceptable Usage Policy
5 pages
Ex No 5
No ratings yet
Ex No 5
9 pages
ScaleProtect Competitive Battlecard - DELL
No ratings yet
ScaleProtect Competitive Battlecard - DELL
2 pages
MFL41037104 42lg3000-Za
No ratings yet
MFL41037104 42lg3000-Za
32 pages
Vacon NX All in One Application Manual DPD00903A E
No ratings yet
Vacon NX All in One Application Manual DPD00903A E
248 pages
Cyber Security Awareness - 2019
No ratings yet
Cyber Security Awareness - 2019
3 pages

Pandas Tutorial

Uploaded by

Pandas Tutorial

Uploaded by

Pandas Tutorial

### Pandas Overview

## 1. DataFrames and Series

data = [10, 20, 30, 40]

### Reading CSV Files

### Reading Excel Files

### Reading JSON Files

Data cleaning involves preparing raw data by handling

### Dropping Rows/Columns

### Renaming Columns

### Replacing Values

### Changing Data Types

### Selecting Data

### Adding New Columns

## 5. Handling Missing Data

Pandas provides tools to detect and handle missing data effectively.

### Detecting Missing Data

### Filling Missing Data

Grouping allows you to aggregate data based on one or more keys.

### Aggregate Functions

### Multiple Aggregations

Pandas provides several methods to merge or join datasets.

### Merging DataFrames

### Join Types

### Concatenating DataFrames

You might also like