0% found this document useful (0 votes)

69 views12 pages

Data Analysis Using Python

The document analyzes employee data from a CSV file with 24 entries. It performs tasks like finding number of employees by governorate, department metrics like average age and salary, filtering data for specific departments, and calculating bonuses based on hire date. Visualizations are also planned to represent the data.

Uploaded by

talithasyahda.ts

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

69 views12 pages

Data Analysis Using Python

Uploaded by

talithasyahda.ts

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

~import libraries
In [1]: import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
import datetime
%matplotlib inline
sns.set()

In [2]: Hr = pd.read_csv(r'C:\Users\compucity\Downloads\Book1.csv')
Hr.head()
Out[2]:
E_ID Name Age Address Telephone Salary Department Hire Date

0 11 Aleya 46 Cairo 4218483 3000 Account 24-02-03

1 9 Hassan 25 Cairo 3578283 2000 Sales 21-01-06

2 15 Ramy 57 Alex 3674313 5000 Computer 21-03-00

3 18 Ola 28 Milan 4186473 5000 Sales 04-02-07

4 22 Zeiad 29 Milan 3642303 2000 Sales 01-03-98

task of Data
1- Find the number of employees in each governorate
2- Find the number of employees in each Department
3- Average age of employees in each department
4- Average Salary of employees in each department
5- Retrieving the data of employees who work in the computer department only,
as well as the rest of the employees in other departments
6- Find the number of employees in each department who work in Cairo Governorate only
7- Search for the employee who receives the highest salary and retrieve his complete data+
8- Number of employees by department in each governorate
9- Bonus ... Based on Hire Date
Hire Date1 >= -1-2005 5% of Salary
1-1-2003 10%
1-1-2000 15%
1-1-1995 20%
1-1-1990 25%
Else 30%
Based on Hire Date

10 - Find some suitable graph for the data

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 1/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [4]: Hr.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 24 entries, 0 to 23
Data columns (total 8 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 E_ID 24 non-null int64
1 Name 24 non-null object
2 Age 24 non-null int64
3 Address 24 non-null object
4 Telephone 24 non-null int64
5 Salary 24 non-null int64
6 Department 24 non-null object
7 Hire Date 24 non-null object
dtypes: int64(4), object(4)
memory usage: 1.6+ KB

In [5]: Hr.dtypes

Out[5]: E_ID int64

Name object
Age int64
Address object
Telephone int64
Salary int64
Department object
Hire Date object
dtype: object

In [6]: # Total isnull data

Hr.isnull().sum()

Out[6]: E_ID 0
Name 0
Age 0
Address 0
Telephone 0
Salary 0
Department 0
Hire Date 0
dtype: int64

In [7]: # duplicated
Hr.duplicated().sum()
Out[7]: 0

In [8]: Hr['Address'].value_counts()

Out[8]: Cairo 8
Alex 6
Giza 5
Milan 2
Alexandria 1
Alixandria 1
milan 1
Name: Address, dtype: int64

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 2/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [9]: Hr['Department'].value_counts()

Out[9]: Sales 9
Computer 8
Account 7
Name: Department, dtype: int64

In [10]: # AVERAGE Age

Hr.groupby('Department')['Age'].mean()
Out[10]: Department
Account 46.142857
Computer 41.625000
Sales 33.222222
Name: Age, dtype: float64

In [11]: # AVERAGE Salary of Department

Hr.groupby('Department')['Salary'].mean().round()

Out[11]: Department
Account 2143.0
Computer 4375.0
Sales 2778.0
Name: Salary, dtype: float64

In [12]: Hr[Hr['Department']== "Computer"]

Out[12]: E_ID Name Age Address Telephone Salary Department Hire Date

2 15 Ramy 57 Alex 3674313 5000 Computer 21-03-00

5 12 Salwa 34 Alexandria 4090443 4000 Computer 06-12-05

8 27 Yousef 46 Alixandria 3706323 7000 Computer 10-04-03

16 58 Neveen 43 Alex 3834363 4000 Computer 29-06-03

19 61 Yasser 37 Cairo 3962403 3000 Computer 17-09-06

21 79 Maged 33 Cairo 3930393 6000 Computer 28-08-98

22 90 Ahmed 28 Cairo 4250493 4000 Computer 16-03-90

23 94 Dina 55 Cairo 4282503 2000 Computer 05-04-99

In [13]: Hr[Hr['Department']== "Sales"]

Out[13]: E_ID Name Age Address Telephone Salary Department Hire Date

1 9 Hassan 25 Cairo 3578283 2000 Sales 21-01-06

3 18 Ola 28 Milan 4186473 5000 Sales 04-02-07

4 22 Zeiad 29 Milan 3642303 2000 Sales 01-03-98

6 24 Ali 24 Giza 4154463 1000 Sales 15-01-91

7 25 Tahany 39 Alex 3546273 3000 Sales 01-01-00

10 35 Mahmoud 57 Giza 3610293 4000 Sales 10-02-99

12 48 Wagdy 24 Cairo 4058433 5000 Sales 16-11-07

14 55 Samah 38 milan 4026423 1000 Sales 27-10-01

15 57 Rawan 35 Alex 3994413 2000 Sales 07-10-02

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 3/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [14]: Hr[Hr['Department']== "Account"]

Out[14]: E_ID Name Age Address Telephone Salary Department Hire Date

0 11 Aleya 46 Cairo 4218483 3000 Account 24-02-03

9 29 Khaled 29 Alex 3770343 3000 Account 20-05-01

11 47 Talaat 48 Giza 3866373 3000 Account 19-07-01

13 54 Samy 55 Giza 3738333 1000 Account 30-04-04

17 62 Amr 44 Cairo 4122453 2000 Account 26-12-95

18 65 Hala 53 Alex 3802353 2000 Account 09-06-90

20 71 Radwa 48 Giza 3898383 1000 Account 08-08-05

In [15]: # Find employees who work in Alexandria in the computer department

Hr[Hr['Department']== "Computer"]['Address']== "Alex"
Out[15]: 2 True
5 False
8 False
16 True
19 False
21 False
22 False
23 False
Name: Address, dtype: bool

In [16]: # Find the number of employees in each department who work in Cairo Governorate only
Hr[Hr['Address']=='Cairo']['Department'].value_counts()

Out[16]: Computer 4
Account 2
Sales 2
Name: Department, dtype: int64

In [36]: # Search for the employee who receives the highest salary and retrieve his complete data
Hr[Hr['Salary']==Hr['Salary']].max()[['Name']+['Address']+['Department']+['Age']+['Salary']
Out[36]: Name Zeiad
Address milan
Department Sales
Age 57
Salary 7000
dtype: object

In [37]: # Search for the employee who receives the min salary and retrieve his complete data
Hr[Hr['Salary']==Hr['Salary']].min()[['Name']+['Address']+['Department']+['Age']+['Salary']
Out[37]: Name Ahmed
Address Alex
Department Account
Age 24
Salary 1000
dtype: object

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 4/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [18]: # Number of employees by department in each governorate

Hr.groupby('Address')['Department'].value_counts()
Out[18]: Address Department
Alex Account 2
Computer 2
Sales 2
Alexandria Computer 1
Alixandria Computer 1
Cairo Computer 4
Account 2
Sales 2
Giza Account 3
Sales 2
Milan Sales 2
milan Sales 1
Name: Department, dtype: int64

#Two different ways to find the solution and how to deal with history to do the calculation

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 5/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [14]: def an (Bonus):

#for r in range(len(Hr['Hire Date'])):

hire_date = datetime.datetime.strptime(Hr['Hire Date'], '%d-%m-%y')
if hire_date >= datetime.datetime(2005, 1, 1):
Hr['Bonus'] = 5/100 * Hr['Salary']
elif hire_date >= datetime.datetime(2003, 1, 1):
Hr['Bonus']= 10/100 * Hr['Salary']
elif hire_date >= datetime.datetime(2000, 1, 1):
Hr['Bonus'] = 15/100 * Hr['Salary']
elif hire_date >= datetime.datetime(1995, 1, 1):
Hr['Bonus'] = 20/100 * Hr['Salary']
elif hire_date >= datetime.datetime(1990, 1, 1):
Hr['Bonus'] = 25/100 * Hr['Salary']
else :
Hr['Bonus']= 30/100 * Hr['Salary']

#Hr['Bonus'] = Hr.apply(an, axis=1)

print(Hr['Bonus'])

0 300.0
1 100.0
2 750.0
3 250.0
4 400.0
5 200.0
6 250.0
7 450.0
8 700.0
9 450.0
10 800.0
11 450.0
12 250.0
13 100.0
14 150.0
15 300.0
16 400.0
17 400.0
18 500.0
19 150.0
20 50.0
21 1200.0
22 1000.0
23 400.0
Name: Bonus, dtype: float64

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 6/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [7]: def an(row):

hire_date = datetime.datetime.strptime(row['Hire Date'], '%d-%m-%y')

if hire_date >= datetime.datetime(2005, 1, 1):
return 5/100 * row['Salary']
elif hire_date >= datetime.datetime(2003, 1, 1):
return 10/100 * row['Salary']
elif hire_date >= datetime.datetime(2000, 1, 1):
return 15/100 * row['Salary']
elif hire_date >= datetime.datetime(1995, 1, 1):
return 20/100 * row['Salary']
elif hire_date >= datetime.datetime(1990, 1, 1):
return 25/100 * row['Salary']
else:
return 30/100 * row['Salary']

Hr['Bonus'] = Hr.apply(an, axis=1)

print(Hr['Bonus'])
0 300.0
1 100.0
2 750.0
3 250.0
4 400.0
5 200.0
6 250.0
7 450.0
8 700.0
9 450.0
10 800.0
11 450.0
12 250.0
13 100.0
14 150.0
15 300.0
16 400.0
17 400.0
18 500.0
19 150.0
20 50.0
21 1200.0
22 1000.0
23 400.0
Name: Bonus, dtype: float64

In [9]: # Spreadsheet after adding the increment column

Hr.head()
Out[9]:
E_ID Name Age Address Telephone Salary Department Hire Date Bonus

0 11 Aleya 46 Cairo 4218483 3000 Account 24-02-03 300.0

1 9 Hassan 25 Cairo 3578283 2000 Sales 21-01-06 100.0

2 15 Ramy 57 Alex 3674313 5000 Computer 21-03-00 750.0

3 18 Ola 28 Milan 4186473 5000 Sales 04-02-07 250.0

4 22 Zeiad 29 Milan 3642303 2000 Sales 01-03-98 400.0

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 7/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [30]: # boxplot mean of age

sns.boxplot(x=Hr['Age'],palette='YlGn')
Out[30]: <AxesSubplot:xlabel='Age'>

In [18]: xa = sns.countplot(x=Hr['Department'],palette='PuBu')
for bar in xa.containers:
xa.bar_label(bar)

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 8/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [22]: xa = sns.countplot(x=Hr['Address'],palette='Set1')
for bar in xa.containers:
xa.bar_label(bar)

In [19]: plt.figure(figsize=(10,5))
sns.countplot(x='Department',hue='Address',data=Hr,palette='hsv')
Out[19]: <AxesSubplot:xlabel='Department', ylabel='count'>

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 9/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [95]: # Age with Department

#Comparing ages with salaries in different governorates
plt.figure(figsize=(8,5))
sns.boxplot(x=Hr['Address'],hue=Hr['Department'],y=Hr['Age'],palette='Set1')
Out[95]: <AxesSubplot:xlabel='Address', ylabel='Age'>

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 10/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [96]: #Compare ages with salaries in different departments

sns.jointplot(x='Salary',y='Age',hue='Department',data=Hr)
Out[96]: <seaborn.axisgrid.JointGrid at 0x2043cd52d30>

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 11/12
3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

In [41]: #Compare ages with salaries in different Address

sns.jointplot(x='Salary',y='Age',hue='Address',data=Hr)
Out[41]: <seaborn.axisgrid.JointGrid at 0x1a1bade56d0>

In [ ]:

localhost:8889/notebooks/mahmoud1/Untitled11.ipynb 12/12

6 Fusion GL 22 Rapid Implementation Etree
No ratings yet
6 Fusion GL 22 Rapid Implementation Etree
27 pages
Acta Paediatrica - 2017 - Horowitz Kraus - Brain Connectivity in Children Is Increased by The Time They Spend Reading Books
No ratings yet
Acta Paediatrica - 2017 - Horowitz Kraus - Brain Connectivity in Children Is Increased by The Time They Spend Reading Books
9 pages
Introducing Defaulting and Validations For Redwood With VB Express Mode in Cloud HCM v2
No ratings yet
Introducing Defaulting and Validations For Redwood With VB Express Mode in Cloud HCM v2
22 pages
CXML Invoices
No ratings yet
CXML Invoices
36 pages
Gram Panchayat Atlas 2016 PDF
50% (2)
Gram Panchayat Atlas 2016 PDF
527 pages
Frequently Asked Interview Questions in Oracle Financials Functional
No ratings yet
Frequently Asked Interview Questions in Oracle Financials Functional
123 pages
Inv Convert Body
No ratings yet
Inv Convert Body
24 pages
Chapter 06 Test Records For Retail Banking PDF
100% (1)
Chapter 06 Test Records For Retail Banking PDF
11 pages
Oracle EBS Tables
100% (2)
Oracle EBS Tables
13 pages
Introduction To Theories of Neurological Rehabilitation
80% (5)
Introduction To Theories of Neurological Rehabilitation
30 pages
National and Regional ITS Architectures
No ratings yet
National and Regional ITS Architectures
74 pages
PES IUP Brochure
No ratings yet
PES IUP Brochure
14 pages
Install Oracle Developer Suite On Win7
No ratings yet
Install Oracle Developer Suite On Win7
7 pages
PM PDF
No ratings yet
PM PDF
81 pages
Using Rxi To Build A Fixed Assets Report
No ratings yet
Using Rxi To Build A Fixed Assets Report
17 pages
Use Only: Oracle® Hyperion Planning 11.1.1: Create and Manage Applications
No ratings yet
Use Only: Oracle® Hyperion Planning 11.1.1: Create and Manage Applications
338 pages
Creating A CRUD Form On A REST Service With APEX 18
No ratings yet
Creating A CRUD Form On A REST Service With APEX 18
25 pages
SAML Setup
No ratings yet
SAML Setup
3 pages
Create Cross Validation Rules
No ratings yet
Create Cross Validation Rules
12 pages
1Z0-1055-23 (Latest-Final) - 240531 - 131712
No ratings yet
1Z0-1055-23 (Latest-Final) - 240531 - 131712
41 pages
Anil OIC Resume
No ratings yet
Anil OIC Resume
1 page
Oracle - Receivables: Date - 1 MAR 2006 1 OF 5
No ratings yet
Oracle - Receivables: Date - 1 MAR 2006 1 OF 5
5 pages
Sales Rep Creation Sample Code N
No ratings yet
Sales Rep Creation Sample Code N
2 pages
Learn SQL Tutorial - Javatpoint
100% (1)
Learn SQL Tutorial - Javatpoint
13 pages
Aimdocuments
No ratings yet
Aimdocuments
6 pages
Unable To View Value Set Values in Oracle EBS R12.2
No ratings yet
Unable To View Value Set Values in Oracle EBS R12.2
2 pages
You Can Track and Eliminate Profit in Inventory v4
100% (2)
You Can Track and Eliminate Profit in Inventory v4
72 pages
JD Edwards World Product Costing and Manufacturing Accounting A91 Guide
100% (1)
JD Edwards World Product Costing and Manufacturing Accounting A91 Guide
306 pages
AME - Oracle AP Invoice Approval Setups and Process Training Manual
No ratings yet
AME - Oracle AP Invoice Approval Setups and Process Training Manual
55 pages
What Is Oracle Fusion Cloud
No ratings yet
What Is Oracle Fusion Cloud
16 pages
Steps To Setup Set of Books
No ratings yet
Steps To Setup Set of Books
12 pages
r12 Insert Delete Pricelist Line
No ratings yet
r12 Insert Delete Pricelist Line
6 pages
E Bus Tax Queries
100% (2)
E Bus Tax Queries
9 pages
416 Bi Publisher Blob Image
No ratings yet
416 Bi Publisher Blob Image
5 pages
FA Query
No ratings yet
FA Query
11 pages
Aim Methodology For Oracle Ebusiness Suite: Author: Abhijit Ray
No ratings yet
Aim Methodology For Oracle Ebusiness Suite: Author: Abhijit Ray
32 pages
Oracle Order Management Implementation Manual
No ratings yet
Oracle Order Management Implementation Manual
912 pages
Duplicate Inv Num Personalization
No ratings yet
Duplicate Inv Num Personalization
3 pages
TA040 Application Architecture
No ratings yet
TA040 Application Architecture
11 pages
R12 Financial Upgrade Training
No ratings yet
R12 Financial Upgrade Training
5 pages
AR Technical
No ratings yet
AR Technical
7 pages
Fa Formula PDF
No ratings yet
Fa Formula PDF
38 pages
Erp Asset Revaluation
No ratings yet
Erp Asset Revaluation
20 pages
BI Publisher RTF Running Total Report
No ratings yet
BI Publisher RTF Running Total Report
5 pages
Oracle Blog
No ratings yet
Oracle Blog
5 pages
Fi Aa
No ratings yet
Fi Aa
12 pages
Microsoft Dynamics AX 2012 R3 Preview Final English 040914
No ratings yet
Microsoft Dynamics AX 2012 R3 Preview Final English 040914
25 pages
Basic Setups
No ratings yet
Basic Setups
19 pages
BOM Tables and Query
No ratings yet
BOM Tables and Query
3 pages
r12 Module List
No ratings yet
r12 Module List
2 pages
OTBI Training Guide - Updated
No ratings yet
OTBI Training Guide - Updated
26 pages
12954
No ratings yet
12954
1 page
How To Delete AR Cash Receipt
No ratings yet
How To Delete AR Cash Receipt
11 pages
HCM - Let's Talk Tech Role of VB Express Mode and Business Rules in Redwood Uptake
No ratings yet
HCM - Let's Talk Tech Role of VB Express Mode and Business Rules in Redwood Uptake
28 pages
Oracle Applications - Query To Get The Approved Cost Budget Version Detail On Project Accounting
0% (1)
Oracle Applications - Query To Get The Approved Cost Budget Version Detail On Project Accounting
3 pages
Training Manual-Fsg PDF
No ratings yet
Training Manual-Fsg PDF
29 pages
Oracle Fusion Troubleshooting Positive Pay File in Payments (Doc ID 1386162.1)
No ratings yet
Oracle Fusion Troubleshooting Positive Pay File in Payments (Doc ID 1386162.1)
8 pages
Erp Periodic Mass
No ratings yet
Erp Periodic Mass
10 pages
Costing 141001195435 Phpapp02
100% (1)
Costing 141001195435 Phpapp02
34 pages
How To Set Up A Modifier To Discount Lines Based On The Accumulated Ordered Quantity
100% (1)
How To Set Up A Modifier To Discount Lines Based On The Accumulated Ordered Quantity
5 pages
Peoplesoft Commonly Used Query Tables
No ratings yet
Peoplesoft Commonly Used Query Tables
5 pages
Python Assignment-2
No ratings yet
Python Assignment-2
3 pages
SQL & Python Interview Q&A
No ratings yet
SQL & Python Interview Q&A
7 pages
CORE_PYTHON_QUESTION
No ratings yet
CORE_PYTHON_QUESTION
1 page
Earth and Life Science Week 5
No ratings yet
Earth and Life Science Week 5
10 pages
River Reaches
No ratings yet
River Reaches
1 page
VGG322419 6uflwa Evervision
No ratings yet
VGG322419 6uflwa Evervision
28 pages
Max Jammer
No ratings yet
Max Jammer
3 pages
5 LTR Blow Molding Machine Details
No ratings yet
5 LTR Blow Molding Machine Details
3 pages
Pulsator 551 Plus
No ratings yet
Pulsator 551 Plus
2 pages
Right Layout
No ratings yet
Right Layout
13 pages
Power Plant Lecture Notes - CHAPTER-4 STEAM Turbine: October 2014
No ratings yet
Power Plant Lecture Notes - CHAPTER-4 STEAM Turbine: October 2014
42 pages
Practical Datesheet
No ratings yet
Practical Datesheet
6 pages
New Spiral Periodic Table of The
No ratings yet
New Spiral Periodic Table of The
5 pages
Ubuntu Cloud Installer
No ratings yet
Ubuntu Cloud Installer
13 pages
Traffic Flow Prediction Models A Review of Deep Learning Techniques
No ratings yet
Traffic Flow Prediction Models A Review of Deep Learning Techniques
25 pages
Maintenance Presentation PDF
No ratings yet
Maintenance Presentation PDF
12 pages
Tender Due Date PL No Qty Unit Place of Delivary Tender No PL Description SRL No Eligibility Criteria / Other Terms
No ratings yet
Tender Due Date PL No Qty Unit Place of Delivary Tender No PL Description SRL No Eligibility Criteria / Other Terms
9 pages
(Ebook PDF) Statistics For Political Analysis: Understanding The Numbers Revised Edition Full Chapter Instant Download
No ratings yet
(Ebook PDF) Statistics For Political Analysis: Understanding The Numbers Revised Edition Full Chapter Instant Download
44 pages
Digital Wattmeter: Instruction Manual
No ratings yet
Digital Wattmeter: Instruction Manual
13 pages
MODULE 2 - Types of Language Assessments
100% (4)
MODULE 2 - Types of Language Assessments
4 pages
FINAL12
No ratings yet
FINAL12
11 pages
Layout and Stick Diagram
No ratings yet
Layout and Stick Diagram
70 pages
HR Audit Checklist 2025
No ratings yet
HR Audit Checklist 2025
13 pages
Casual Inference Project
No ratings yet
Casual Inference Project
30 pages
Soil of Sundarban Delta Is Rich in Sodium, Potassium, Silicate and Phosphorus
No ratings yet
Soil of Sundarban Delta Is Rich in Sodium, Potassium, Silicate and Phosphorus
2 pages
Climatology (Unit-7)
No ratings yet
Climatology (Unit-7)
12 pages
Meteorologist For A Day Project-Revised
No ratings yet
Meteorologist For A Day Project-Revised
1 page
Modern Primary Mathematics 6B
No ratings yet
Modern Primary Mathematics 6B
98 pages
Obstetrics and Gynaecology An Evidencebased Text For MRCOG 2E Edition 2
No ratings yet
Obstetrics and Gynaecology An Evidencebased Text For MRCOG 2E Edition 2
313 pages

Data Analysis Using Python

Uploaded by

Data Analysis Using Python

Uploaded by

3/27/24, 11:20 PM Untitled11 - Jupyter Notebook

0 11 Aleya 46 Cairo 4218483 3000 Account 24-02-03

1 9 Hassan 25 Cairo 3578283 2000 Sales 21-01-06

2 15 Ramy 57 Alex 3674313 5000 Computer 21-03-00

3 18 Ola 28 Milan 4186473 5000 Sales 04-02-07

4 22 Zeiad 29 Milan 3642303 2000 Sales 01-03-98

Out[5]: E_ID int64

In [6]: # Total isnull data

In [10]: # AVERAGE Age

In [11]: # AVERAGE Salary of Department

In [12]: Hr[Hr['Department']== "Computer"]

2 15 Ramy 57 Alex 3674313 5000 Computer 21-03-00

5 12 Salwa 34 Alexandria 4090443 4000 Computer 06-12-05

8 27 Yousef 46 Alixandria 3706323 7000 Computer 10-04-03

16 58 Neveen 43 Alex 3834363 4000 Computer 29-06-03

19 61 Yasser 37 Cairo 3962403 3000 Computer 17-09-06

21 79 Maged 33 Cairo 3930393 6000 Computer 28-08-98

22 90 Ahmed 28 Cairo 4250493 4000 Computer 16-03-90

23 94 Dina 55 Cairo 4282503 2000 Computer 05-04-99

In [13]: Hr[Hr['Department']== "Sales"]

1 9 Hassan 25 Cairo 3578283 2000 Sales 21-01-06

3 18 Ola 28 Milan 4186473 5000 Sales 04-02-07

4 22 Zeiad 29 Milan 3642303 2000 Sales 01-03-98

6 24 Ali 24 Giza 4154463 1000 Sales 15-01-91

7 25 Tahany 39 Alex 3546273 3000 Sales 01-01-00

10 35 Mahmoud 57 Giza 3610293 4000 Sales 10-02-99

12 48 Wagdy 24 Cairo 4058433 5000 Sales 16-11-07

14 55 Samah 38 milan 4026423 1000 Sales 27-10-01

15 57 Rawan 35 Alex 3994413 2000 Sales 07-10-02

In [14]: Hr[Hr['Department']== "Account"]

0 11 Aleya 46 Cairo 4218483 3000 Account 24-02-03

9 29 Khaled 29 Alex 3770343 3000 Account 20-05-01

11 47 Talaat 48 Giza 3866373 3000 Account 19-07-01

13 54 Samy 55 Giza 3738333 1000 Account 30-04-04

17 62 Amr 44 Cairo 4122453 2000 Account 26-12-95

18 65 Hala 53 Alex 3802353 2000 Account 09-06-90

20 71 Radwa 48 Giza 3898383 1000 Account 08-08-05

In [15]: # Find employees who work in Alexandria in the computer department

In [18]: # Number of employees by department in each governorate

In [14]: def an (Bonus):

#for r in range(len(Hr['Hire Date'])):

#Hr['Bonus'] = Hr.apply(an, axis=1)

In [7]: def an(row):

In [9]: # Spreadsheet after adding the increment column

0 11 Aleya 46 Cairo 4218483 3000 Account 24-02-03 300.0

1 9 Hassan 25 Cairo 3578283 2000 Sales 21-01-06 100.0

2 15 Ramy 57 Alex 3674313 5000 Computer 21-03-00 750.0

3 18 Ola 28 Milan 4186473 5000 Sales 04-02-07 250.0

4 22 Zeiad 29 Milan 3642303 2000 Sales 01-03-98 400.0

In [30]: # boxplot mean of age

In [95]: # Age with Department

In [96]: #Compare ages with salaries in different departments

In [41]: #Compare ages with salaries in different Address

You might also like