Session - 7 Data Operations in A File Using Pandas

Uploaded by

mouneshyatham99

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views3 pages

Session - 7 Data Operations in A File Using Pandas

Uploaded by

mouneshyatham99

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Session-7: Data file operations using pandas

Aim: develop a python program that reads data from a CSV file and applies various
operations using the Pandas library.

Software requirement: Python

Program:

import pandas as pd
# Read data from a CSV file (replace 'data.csv' with your file path)
df = pd.read_csv('data.csv')
# Display the first few rows of the DataFrame
print("First 5 rows:")
print(df.head())
# Basic statistics
print("\nSummary Statistics:")
print(df.describe())
# Filtering data
filtered_df = df[df['Age'] > 25]
# Sorting data
sorted_df = df.sort_values(by='Age', ascending=False)
# Grouping and aggregation
grouped_df = df.groupby('Department')['Salary'].mean()
# Adding a new column
df['Salary Increased'] = df['Salary'] * 1.1
# Save the modified DataFrame to a new CSV file
df.to_csv('modified_data.csv', index=False)
# Pivot table
pivot_table = df.pivot_table(index='Department', columns='Gender', values='Salary',
aggfunc='mean')

# Display the results

print("\nFiltered DataFrame:")
print(filtered_df)
print("\nSorted DataFrame:")
print(sorted_df)
print("\nGrouped DataFrame:")
print(grouped_df)
print("\nDataFrame with Added Column:")
print(df)
print("\nPivot Table:")
print(pivot_table)
# This program reads data from a CSV file, applies various operations like filtering,
sorting,
# grouping, adding a new column, and creating a pivot table using Pandas. Make sure
to
# replace 'data.csv' with the path to your CSV file, and adjust the operations as
needed # for your specific data and requirements.

Theory: Reading data from a .doc file directly using Pandas can be a bit challenging
since Pandas is primarily designed to work with structured data like CSV, Excel, and
databases. However, you can convert the data from a .doc file to a format that Pandas
can handle, such as text or CSV, and then perform operations on it. Here's an example
of how to do that using the python-docx library to read data from a Word document:
First, you'll need to install the python-docx library:
pip install python-docx
Now, let's create a Python program that reads data from a Word document, converts it
to a DataFrame, and applies some operations using Pandas:

import pandas as pd
from docx import Document

# Read data from a Word document (replace 'document.docx' with your file path)
document = Document('document.docx')
# Extract text from the Word document
text = []
for paragraph in document.paragraphs:
text.append(paragraph.text)
# Create a DataFrame from the extracted text
df = pd.DataFrame({'Text': text})
# Display the first few rows of the DataFrame
print("First 5 rows:")
print(df.head())
# Basic statistics
print("\nSummary Statistics:")
print(df.describe())
# Filter data
filtered_df = df[df['Text'].str.contains('keyword')]
# Save the filtered DataFrame to a new CSV file
filtered_df.to_csv('filtered_data.csv', index=False)
# Display the filtered DataFrame
print("\nFiltered DataFrame:")
print(filtered_df)
Result: A python program developed to read data from a CSV file and applies various
operations using the Pandas library.

Light Designer - V8 - Paradigm 2.1.2
100% (1)
Light Designer - V8 - Paradigm 2.1.2
49 pages
Employee Data Analysis System (Ip Class 12) (2024-25)
No ratings yet
Employee Data Analysis System (Ip Class 12) (2024-25)
30 pages
Employee Data Analysis System (Ip Class Xii)
No ratings yet
Employee Data Analysis System (Ip Class Xii)
26 pages
File Handling
No ratings yet
File Handling
6 pages
Experiment No 3 Importing and Exporting Data in Python Using Pandas Student
No ratings yet
Experiment No 3 Importing and Exporting Data in Python Using Pandas Student
6 pages
Chapter5 3CSVFile
No ratings yet
Chapter5 3CSVFile
7 pages
05 Data Loading, Storage and Wrangling-1
No ratings yet
05 Data Loading, Storage and Wrangling-1
22 pages
20 Pandas Codes To Master Data Analysis
No ratings yet
20 Pandas Codes To Master Data Analysis
3 pages
CSV New
No ratings yet
CSV New
4 pages
Pandas 1
No ratings yet
Pandas 1
64 pages
Experiment 678910
No ratings yet
Experiment 678910
12 pages
Introduction To Pandas Programming 1
No ratings yet
Introduction To Pandas Programming 1
2 pages
Project IP 2023
No ratings yet
Project IP 2023
16 pages
Fds Unit - III
No ratings yet
Fds Unit - III
58 pages
Lecture 21 Working With Pandas
No ratings yet
Lecture 21 Working With Pandas
11 pages
Pandas
No ratings yet
Pandas
35 pages
SET-2 Python Practical (3-5
No ratings yet
SET-2 Python Practical (3-5
4 pages
III Unit Fds
No ratings yet
III Unit Fds
24 pages
Python & MySQL For Data Analysis
No ratings yet
Python & MySQL For Data Analysis
45 pages
Basic Operations With CSV Files: CSV (Comma Separated Values) May Be A Simple File Format Accustomed To
No ratings yet
Basic Operations With CSV Files: CSV (Comma Separated Values) May Be A Simple File Format Accustomed To
7 pages
Importing Data Cheat Sheet Python For Data Science: Pickled Files Exploring Your Data
No ratings yet
Importing Data Cheat Sheet Python For Data Science: Pickled Files Exploring Your Data
1 page
PP Manual Exp No. 07
No ratings yet
PP Manual Exp No. 07
9 pages
Unit6 - Working With Data
No ratings yet
Unit6 - Working With Data
29 pages
Week 2 Laboratory Activity
No ratings yet
Week 2 Laboratory Activity
7 pages
BTech 5 CSE Data Analytics Using Python Unit 4 Notes
No ratings yet
BTech 5 CSE Data Analytics Using Python Unit 4 Notes
25 pages
Pandas Basics Guide
No ratings yet
Pandas Basics Guide
4 pages
7.2 - Data Frame Basics - mp4
No ratings yet
7.2 - Data Frame Basics - mp4
3 pages
learnPandas
No ratings yet
learnPandas
37 pages
Practical (XI) I (4)
No ratings yet
Practical (XI) I (4)
6 pages
DSBDAL
No ratings yet
DSBDAL
87 pages
Pandas Notes
No ratings yet
Pandas Notes
2 pages
Pandas
No ratings yet
Pandas
4 pages
Lab Manual 5
No ratings yet
Lab Manual 5
5 pages
CSV Files
No ratings yet
CSV Files
24 pages
Python Data Science 101
100% (1)
Python Data Science 101
41 pages
Pandas Dataframe All Operations 1735471870
No ratings yet
Pandas Dataframe All Operations 1735471870
4 pages
EDA - Session-1 - Basic Dataframe Opertaions-1
No ratings yet
EDA - Session-1 - Basic Dataframe Opertaions-1
7 pages
How To Perform Common Excel Commands in Python: Reading The Data
No ratings yet
How To Perform Common Excel Commands in Python: Reading The Data
3 pages
Python Series Day20
No ratings yet
Python Series Day20
7 pages
Pandas I Notes 06 - June 20
No ratings yet
Pandas I Notes 06 - June 20
13 pages
Prac 1
No ratings yet
Prac 1
5 pages
Chapter Notes - Data Handling Using Pandas DataFrame
No ratings yet
Chapter Notes - Data Handling Using Pandas DataFrame
16 pages
Cheat Sheet: The Pandas Dataframe Object I: Preliminaries Get Your Data Into A Dataframe
No ratings yet
Cheat Sheet: The Pandas Dataframe Object I: Preliminaries Get Your Data Into A Dataframe
12 pages
Pandas 1
No ratings yet
Pandas 1
2 pages
CSV File Handling
No ratings yet
CSV File Handling
8 pages
Introduction To Pandas
No ratings yet
Introduction To Pandas
27 pages
CSV Comma Separated Values
No ratings yet
CSV Comma Separated Values
7 pages
Dav 2 Unit
No ratings yet
Dav 2 Unit
55 pages
Prac 1
No ratings yet
Prac 1
5 pages
14oct Pandas 2024
No ratings yet
14oct Pandas 2024
13 pages
L CsvReadWrite
No ratings yet
L CsvReadWrite
10 pages
INFORMATIC Complete Project
No ratings yet
INFORMATIC Complete Project
27 pages
Pandas - Programs
No ratings yet
Pandas - Programs
22 pages
Lesson 23 Notes - Pandas Reading Data
No ratings yet
Lesson 23 Notes - Pandas Reading Data
17 pages
Pandas Introduction: What Is Python Pandas Used For?
No ratings yet
Pandas Introduction: What Is Python Pandas Used For?
28 pages
Data Representation
No ratings yet
Data Representation
13 pages
What Is Pandas
No ratings yet
What Is Pandas
9 pages
Ainotes
No ratings yet
Ainotes
5 pages
Chesta-Draksha Project File
No ratings yet
Chesta-Draksha Project File
57 pages
Scala Data Analysis Cookbook (new): Navigate the world of data analysis, visualization, and machine learning with over 100 hands-on Scala recipes
From Everand
Scala Data Analysis Cookbook (new): Navigate the world of data analysis, visualization, and machine learning with over 100 hands-on Scala recipes
Arun Manivannan
No ratings yet
Firebase Storage for Angular: A reliable file upload solution for your applications
From Everand
Firebase Storage for Angular: A reliable file upload solution for your applications
Abdelfattah Ragab
No ratings yet
Cancel Statement in Cobol
No ratings yet
Cancel Statement in Cobol
2 pages
Turning The World Inside Out and 174 Other Simpl
No ratings yet
Turning The World Inside Out and 174 Other Simpl
2 pages
CSS Float
No ratings yet
CSS Float
2 pages
CS6100: Topics in Design and Analysis of Algorithms: Delaunay Triangulation John Augustine
No ratings yet
CS6100: Topics in Design and Analysis of Algorithms: Delaunay Triangulation John Augustine
25 pages
Fortigate 1100E Series: Data Sheet
No ratings yet
Fortigate 1100E Series: Data Sheet
6 pages
HITAG Classification Scorpio PDF
No ratings yet
HITAG Classification Scorpio PDF
2 pages
Chapter 12-Domain Analysis
No ratings yet
Chapter 12-Domain Analysis
79 pages
Object Oriented Programming
No ratings yet
Object Oriented Programming
81 pages
Megha Sood HCL Agile Coach Resume
No ratings yet
Megha Sood HCL Agile Coach Resume
2 pages
Autopart Documentation
No ratings yet
Autopart Documentation
47 pages
Database Management System LAB Final Q2
No ratings yet
Database Management System LAB Final Q2
22 pages
Commview For Wifi Technical Specifications: Wireless Network Monitor and Analyzer
No ratings yet
Commview For Wifi Technical Specifications: Wireless Network Monitor and Analyzer
1 page
Table of Contents - Best of Game Programming Gems
No ratings yet
Table of Contents - Best of Game Programming Gems
5 pages
Premiere Product Excersice Answers
No ratings yet
Premiere Product Excersice Answers
7 pages
Black Hawk Down Server Manager Documentation
No ratings yet
Black Hawk Down Server Manager Documentation
18 pages
RBE-Revolution by Education
No ratings yet
RBE-Revolution by Education
6 pages
Manual MyCloudPR4100 PDF
No ratings yet
Manual MyCloudPR4100 PDF
125 pages
B 12xcucsag PDF
No ratings yet
B 12xcucsag PDF
340 pages
Meet XL
No ratings yet
Meet XL
50 pages
Charisma Medical Software
No ratings yet
Charisma Medical Software
8 pages
Tukaram Sabale Resume
No ratings yet
Tukaram Sabale Resume
2 pages
D23 B.Tech CSE
No ratings yet
D23 B.Tech CSE
50 pages
Insert Tab MCQ
No ratings yet
Insert Tab MCQ
8 pages
Installation and Operation Manual: Ac Smartstart® Ac Power Distribution Units (Pdu)
No ratings yet
Installation and Operation Manual: Ac Smartstart® Ac Power Distribution Units (Pdu)
50 pages
Eco System Notes
100% (1)
Eco System Notes
15 pages
Unlock Recruitment Efficiency With Resume Parser Software: A Comprehensive Guide
No ratings yet
Unlock Recruitment Efficiency With Resume Parser Software: A Comprehensive Guide
9 pages
Linux Basic Usage
No ratings yet
Linux Basic Usage
20 pages
Arcsight Info
No ratings yet
Arcsight Info
34 pages
Relativity - Mass Printing
No ratings yet
Relativity - Mass Printing
2 pages

Session - 7 Data Operations in A File Using Pandas

Uploaded by

Session - 7 Data Operations in A File Using Pandas

Uploaded by

Session-7: Data file operations using pandas

Software requirement: Python

# Display the results

You might also like