Pandas library — notes
Code for reading, cleaning, and plotting wireline well-log data
import pandas as pd
import matplotlib.pyplot as plt
import re

# --- Read the raw log file ---
# NOTE(review): the PDF export truncated this line at "enc"; the open() call
# presumably passed an encoding argument — "utf-8" assumed, confirm against
# the original notebook.
with open(r"D:\Python data analysis\PVT properties\Log - G 02 - W 04.TXT",
          encoding="utf-8") as file:
    content = file.read()

# Extract the "WIRELINE WELL LOGS" section (everything from the header on).
wireline_start = content.find("WIRELINE WELL LOGS")
if wireline_start == -1:
    # Previously a missing header silently produced garbage (find() == -1);
    # fail loudly instead.
    raise ValueError("'WIRELINE WELL LOGS' section not found in log file")
wireline_text = content[wireline_start:]
lines = wireline_text.strip().splitlines()

# Keep only data rows: lines whose first non-space character is a digit
# (the depth value).
data_lines = [line for line in lines if re.match(r'^\d', line.strip())]

# Convert the text rows into a DataFrame.
# Column list reconstructed from the printed output table (11 columns);
# the PDF export truncated the original literal after "AZIM".
columns = ["DEPTH", "GR", "SP", "Rt", "Rxo", "DEN", "SONIC", "CNL",
           "DIP", "AZIM", "MUDLOG"]
# Fields are separated by runs of two or more spaces.
data = [re.split(r'\s{2,}', line.strip()) for line in data_lines]
df_log = pd.DataFrame(data, columns=columns)

# Cast every column except the textual MUDLOG description to float.
numeric_cols = df_log.columns.drop("MUDLOG")
df_log[numeric_cols] = df_log[numeric_cols].astype(float)

# --- Lithology interpretation from the gamma-ray curve ---
# NOTE(review): the PDF truncated the original lambda ("x < 25 el…"). A
# 75 API cutoff is the conventional sand/shale threshold and is the only
# value consistent with the printed output (GR ≈ 32 → SANDSTONE,
# GR ≈ 92 → SHALE) — confirm against the original notebook.
df_log["LITHOLOGY"] = df_log["GR"].apply(
    lambda x: "SANDSTONE" if x < 75 else "SHALE")

# Display the cleaned table (notebook-style echo).
df_log
# --- Plot the four interpreted log tracks side by side ---
fig, axes = plt.subplots(nrows=1, ncols=4, figsize=(12, 10), sharey=True)
depth = df_log["DEPTH"]

# Track 1 — Gamma Ray, colour-coded by interpreted lithology.
# NOTE(review): plotting each lithology subset as a single line draws
# straight segments across depth gaps between beds; switch to a scatter
# or masked plot if that matters visually.
for litho, color in zip(["SHALE", "SANDSTONE"], ["green", "orange"]):
    subset = df_log[df_log["LITHOLOGY"] == litho]
    axes[0].plot(subset["GR"], subset["DEPTH"], label=litho, color=color)
axes[0].set_xlabel("GR (API)")
axes[0].set_xlim(0, 150)
axes[0].set_title("Gamma Ray")
axes[0].grid()
axes[0].legend()

# Track 2 — true resistivity (Rt) on a logarithmic x-axis.
axes[1].semilogx(df_log["Rt"], depth, color="red")
axes[1].set_xlabel("Rt (ohm.m)")
axes[1].set_xlim(0.2, 200)
axes[1].set_title("Resistivity")
axes[1].grid()

# Track 3 — bulk density.
axes[2].plot(df_log["DEN"], depth, color="brown")
axes[2].set_xlabel("DEN (g/cc)")
axes[2].set_xlim(1.9, 2.7)
axes[2].set_title("Density")
axes[2].grid()

# Track 4 — neutron porosity.
axes[3].plot(df_log["CNL"], depth, color="blue")
axes[3].set_xlabel("CNL (%)")
axes[3].set_xlim(0, 45)
axes[3].set_title("Neutron Porosity")
axes[3].grid()

# Shared depth axis: label once and invert so depth increases downward.
axes[0].set_ylabel("Depth (ft)")
axes[0].invert_yaxis()

plt.tight_layout()
plt.suptitle("Interpreted Wireline Log with Lithology", fontsize=14, y=1.02)
plt.show()
Output of the code (the cleaned DataFrame):
DEPTH GR SP Rt Rxo DEN SONIC CNL DIP AZIM MUDLOG
0 5745.0 91.8 -23.0 3.6 6.8 2.1 104.4 37.8 15.1 312.4 SHALE
1 5745.5 94.9 -20.3 3.7 6.5 2.1 107.6 39.8 17.0 313.1 SHALE
2 5746.0 94.5 -22.3 4.9 9.3 2.1 106.6 37.3 18.0 306.3 SHALE
3 5746.5 92.8 -20.1 3.8 6.8 2.1 106.8 39.1 19.4 320.0 SHALE
4 5747.0 93.6 -21.9 2.2 4.1 2.1 105.0 39.1 17.5 300.9 SHALE
... ... ... ... ... ... ... ... ... ... ... ...
1622 6556.0 32.8 -17.7 4.1 6.8 2.3 79.7 12.3 15.6 307.4 SANDSTONE
1623 6556.5 32.7 -19.0 4.1 7.0 2.4 78.4 12.1 20.0 325.8 SANDSTONE
1624 6557.0 32.6 -22.1 3.9 7.4 2.3 82.9 11.5 20.6 303.2 SANDSTONE
1625 6557.5 35.7 -20.4 4.1 7.4 2.5 83.0 11.5 9.9 328.7 SANDSTONE
1626 6558.0 32.4 -22.2 4.0 7.4 2.4 81.1 11.9 19.9 309.0 SANDSTONE
1627 rows × 11 columns
Some useful functions of the pandas library
# 🐼 Pandas Essential Functions with Purpose
# NOTE: reference snippets only — `df`, `df1`, and `df2` must be defined
# DataFrames before any of these lines can run; return values are discarded.
# Importing pandas
import pandas as pd
# ------------------------------
# 🧾Basic DataFrame Exploration
# ------------------------------
df.head(n=5) # Shows first n rows (default is 5)
df.tail(n=5) # Shows last n rows
df.shape # Returns (rows, columns)
df.columns # Lists column names
df.index # Lists index range or values
df.info() # Summary: columns, non-nulls, datatypes
df.describe() # Descriptive stats for numeric columns
# ------------------------------
# 🧹Data Selection & Filtering
# ------------------------------
df['column'] # Select a single column
df[['col1', 'col2']] # Select multiple columns
df.loc[row, col] # Label-based selection
df.iloc[row, col] # Index-based selection
df[df['col'] > value] # Conditional filtering
# ------------------------------
# 🧼Data Cleaning
# ------------------------------
df.isnull() # Detect missing values
df.isnull().sum() # Count of missing values per column
df.dropna() # Remove missing rows
df.fillna(value) # Fill missing with value
df.duplicated() # Check for duplicate rows
df.drop_duplicates() # Remove duplicates
df.replace(a, b) # Replace values
# ------------------------------
# 🧱Data Manipulation
# ------------------------------
df.sort_values('col') # Sort by column
df.rename(columns={'old':'new'}) # Rename columns
df.set_index('col') # Set column as index
df.reset_index() # Reset index to default
df['new'] = df['col1'] + df['col2'] # Create new column
df.drop(['col1', 'col2'], axis=1) # Drop columns
# ------------------------------
# 🧮Aggregation & Grouping
# ------------------------------
df.sum() # Column-wise sum
df.mean() # Column-wise mean
df.count() # Non-NA count
df.groupby('col') # Group by column
df.groupby('col').mean() # Group and aggregate
# ------------------------------
# 🧰 Merging & Joining
# ------------------------------
pd.concat([df1, df2]) # Concatenate vertically
pd.merge(df1, df2, on='key') # Merge on common column
df1.join(df2, how='left') # Join by index
# ------------------------------
# 📤Input / Output
# ------------------------------
pd.read_csv('file.csv') # Load CSV
df.to_csv('file.csv') # Save to CSV
pd.read_excel('file.xlsx') # Load Excel file
df.to_excel('file.xlsx') # Save to Excel
# ------------------------------
# 📊 Visualization (with matplotlib)
# ------------------------------
df.plot() # Basic line plot
df['col'].hist() # Histogram
df.plot.scatter(x='col1', y='col2') # Scatter plot
# ------------------------------
# 🧪 Other Useful Functions
# ------------------------------
df.value_counts() # Frequency of unique values
df.nunique() # Count of unique values per column
df.apply(function) # Apply a function to each column/row
df.astype('int') # Change data type