0% found this document useful (0 votes)

9 views5 pages

Data Science Python All Units

The document outlines a comprehensive curriculum for Data Science using Python, divided into five units covering topics such as Python programming, file handling, object-oriented programming, NumPy, and data manipulation with Pandas. Each unit includes practical examples and exercises on key concepts like data types, exception handling, data cleaning, and visualization techniques. The curriculum aims to equip learners with essential skills for data analysis and manipulation.

Uploaded by

oneshotsejee

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views5 pages

Data Science Python All Units

Uploaded by

oneshotsejee

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

DATA SCIENCE USING PYTHON - COMPLETE UNITS

UNIT 1: INTRODUCTION TO DATA SCIENCE AND PYTHON PROGRAMMING

1. Implement basic Python programs for reading input from console.

name = input("Enter your name: ")

print("Hello", name)

2. Perform Creation, indexing, slicing, concatenation and repetition operations on

Python built-in datatypes: Strings, List, Tuples, Dictionary, Set

s = "Python"
print(s[1:4])
l = [1,2,3]
print(l[0:2] + l*2)
t = (1,2,3)
print(t[1:])
d = {'a':1}
print(d['a'])
s = {1,2}
s.add(3)

3. Solve problems using decision and looping statements.

x=5
if x > 0:
print("Positive")
for i in range(3):
print(i)

4. Apply Python built-in data types: Strings, List, Tuples, Dictionary. Set and their
methods to solve any given problem

print("hi".upper())
l = [1,2]; l.append(3)
t = (1,2,3)
print(t.count(2))
d = {'a':1}; print(d.get('a'))
s = {1}; s.add(2)

5. Handle numerical operations using math and random number functions

import math, random
print(math.sqrt(25))
print(random.randint(1, 10))

6. Create user-defined functions with different types of function arguments.

def greet(name, msg="Hi"):

print(msg, name)
greet("Nitesh")

UNIT 2: FILE, EXCEPTION HANDLING AND OOP

1. Create packages and import modules from packages.

from mypkg import module1

module1.say_hello()

2. Perform File manipulations- open, close, read, write, append and copy from one file
to another.

with open("f1.txt", "w") as f: f.write("Hi")

with open("f1.txt") as f: data = f.read()
with open("f2.txt", "w") as f: f.write(data)

3. Handle Exceptions using Python Built-in Exceptions

try:
x = 1/0
except ZeroDivisionError:
print("Cannot divide by zero")

4. Solve problems using Class declaration and Object creation.

class A:
def __init__(self, x): self.x = x
a = A(5)
print(a.x)

5. Implement OOP concepts like Data hiding and Data Abstraction

class Test:
def __init__(self): self.__val = 10
def get(self): return self.__val
t = Test()
print(t.get())

6. Solve any real-time problem using inheritance concept.

class Animal:
def speak(self): print("Sound")
class Dog(Animal):
def speak(self): print("Bark")
d = Dog()
d.speak()

UNIT 3: INTRODUCTION TO NUMPY

1. Create NumPy arrays from Python Data Structures, Intrinsic NumPy objects and
Random Functions.

import numpy as np
np.array([1,2,3])
np.arange(5)
np.random.rand(2)

2. Manipulation of NumPy arrays- Indexing, Slicing, Reshaping. Joining and Splitting.

a = np.array([[1,2],[3,4]])
print(a[1,1])
print(a.reshape(4,1))
print(np.hstack([a,a]))

3. Computation on NumPy arrays using Universal Functions and Mathematical

methods.

a = np.array([1,2,3])
print(np.mean(a), np.sum(a), np.sqrt(a))

4. Import a CSV file and perform various Statistical and Comparison operations on
rows/columns.

data = np.genfromtxt("data.csv", delimiter=",", skip_header=1)

print(np.mean(data, axis=0))

5. Load an image file and do crop and flip operation using NumPy Indexing.

from imageio import imread

img = imread("img.jpg")
crop = img[100:200,100:200]
flip = img[::-1]

UNIT 4: DATA MANIPULATION WITH PANDAS

1. Create Pandas Series and DataFrame from various inputs.
import pandas as pd
s = pd.Series([1,2,3])
df = pd.DataFrame({'A':[1,2]})

2. Import any CSV file to Pandas DataFrame and perform the following:

df = pd.read_csv("file.csv")
print(df.head(10))
print(df.tail(10))

(b) Get the shape, index and column details

print(df.shape, df.index, df.columns)

(c) Select/Delete the records (rows)/columns based on conditions.

print(df[df['Age'] > 20])

df.drop(columns=['Name'])

(d) Perform ranking and sorting operations.

df['Rank'] = df['Score'].rank()
df.sort_values(by='Age')

(e) Do required statistical operations on the given columns.

print(df.describe())
print(df['Score'].mean())

(f) Find the count and uniqueness of the given categorical values.

print(df['Gender'].value_counts())
print(df['Gender'].unique())

(g) Rename single/multiple columns.

df.rename(columns={'Name':'Full Name'}, inplace=True)

UNIT 5: DATA CLEANING, PREPARATION AND VISUALIZATION

1. Import any CSV file to Pandas DataFrame

import pandas as pd
df = pd.read_csv("data.csv")

(a) Handle missing data by detecting and dropping/ filling missing values.

df.isnull().sum()
df.dropna()
df.fillna(0)
df['Age'].fillna(df['Age'].mean())

(b) Transform data using apply() and map() method.

df['col'] = df['col'].apply(lambda x: x*2)

df['gender'] = df['gender'].map({'M':'Male','F':'Female'})

(c) Detect and filter outliers.

Q1 = df['col'].quantile(0.25)
Q3 = df['col'].quantile(0.75)
IQR = Q3 - Q1
df[(df['col'] < Q1 - 1.5*IQR) | (df['col'] > Q3 + 1.5*IQR)]

(d) Perform Vectorized String operations on Pandas Series.

df['Name'].str.upper()
df['Email'].str.contains('@gmail')

(e) Visualize data using Line Plots. Bar Plots, Histograms, Density Plots and Scatter
Plots.

import matplotlib.pyplot as plt

df['Marks'].plot(kind='line')
plt.show()

Phantom LUTs
No ratings yet
Phantom LUTs
10 pages
Buber, Martin - I and Thou PDF
92% (12)
Buber, Martin - I and Thou PDF
127 pages
TU 515R Technical Specification
No ratings yet
TU 515R Technical Specification
24 pages
Python Programming For Data Analysis
No ratings yet
Python Programming For Data Analysis
6 pages
Python For Data Science
No ratings yet
Python For Data Science
5 pages
Numpy Merged
No ratings yet
Numpy Merged
59 pages
Data Processing With Python and R
No ratings yet
Data Processing With Python and R
6 pages
Python Lab PRG
No ratings yet
Python Lab PRG
20 pages
Python Basics by A K Singh
No ratings yet
Python Basics by A K Singh
3 pages
Python Notes
No ratings yet
Python Notes
10 pages
Python Programming Syallabus
No ratings yet
Python Programming Syallabus
3 pages
DL Lab1
No ratings yet
DL Lab1
4 pages
IDS Syllabus
No ratings yet
IDS Syllabus
5 pages
Foundation of Data Science Lab Manual Full
No ratings yet
Foundation of Data Science Lab Manual Full
8 pages
Python by Example Book 2 (Data Manipulation and Analysis)
No ratings yet
Python by Example Book 2 (Data Manipulation and Analysis)
105 pages
ML Lab File Vijay Kumar
No ratings yet
ML Lab File Vijay Kumar
27 pages
Question-Bank Python
No ratings yet
Question-Bank Python
3 pages
AIML Lab Manual
No ratings yet
AIML Lab Manual
39 pages
22am901 Data Science Using Python Unit 2
No ratings yet
22am901 Data Science Using Python Unit 2
116 pages
Python Sylllabus
No ratings yet
Python Sylllabus
4 pages
Python Programming Changing
No ratings yet
Python Programming Changing
3 pages
DS-DS Lab-1
No ratings yet
DS-DS Lab-1
4 pages
Fdsa Lab Manual Final
No ratings yet
Fdsa Lab Manual Final
70 pages
Machine Learning Codes
No ratings yet
Machine Learning Codes
30 pages
Univelcity Data Science Curriculum
No ratings yet
Univelcity Data Science Curriculum
7 pages
Document
No ratings yet
Document
16 pages
Dsa Record-1
No ratings yet
Dsa Record-1
153 pages
OCS353-Data Science Fundamentals Manual 1
No ratings yet
OCS353-Data Science Fundamentals Manual 1
34 pages
Python Syllabus
No ratings yet
Python Syllabus
4 pages
Python - Data Science Lecture 1
No ratings yet
Python - Data Science Lecture 1
55 pages
All Unit Question Bank
No ratings yet
All Unit Question Bank
4 pages
Revision Questions
No ratings yet
Revision Questions
19 pages
PDS Question Bank
No ratings yet
PDS Question Bank
1 page
FDS Lab
No ratings yet
FDS Lab
43 pages
ML Lab File Vijay Kumar
No ratings yet
ML Lab File Vijay Kumar
16 pages
Data Science Online Training Course Content 1626830873
No ratings yet
Data Science Online Training Course Content 1626830873
26 pages
DAP Lab Manual
No ratings yet
DAP Lab Manual
20 pages
Foundations of Data Science - Syllabus
No ratings yet
Foundations of Data Science - Syllabus
4 pages
Syllabus 017032391 Python-1 Sem-3 Batch-21-22
No ratings yet
Syllabus 017032391 Python-1 Sem-3 Batch-21-22
7 pages
Python Syllabuss
No ratings yet
Python Syllabuss
4 pages
DS Lab Programs
No ratings yet
DS Lab Programs
47 pages
Data Science Book1
No ratings yet
Data Science Book1
9 pages
Python Lab Manual
No ratings yet
Python Lab Manual
17 pages
Syllabus CD163
No ratings yet
Syllabus CD163
1 page
PDSP 1
No ratings yet
PDSP 1
15 pages
DSP U1
No ratings yet
DSP U1
89 pages
Detailed 100 Slides Presentation v2
No ratings yet
Detailed 100 Slides Presentation v2
21 pages
3rd EXPERIMENT
No ratings yet
3rd EXPERIMENT
13 pages
Data Science 21CSS303T Question Bank Unit - 1: Part-A
No ratings yet
Data Science 21CSS303T Question Bank Unit - 1: Part-A
2 pages
DataScience - ML DEEP LEARNING - LPEI - 120 Days
No ratings yet
DataScience - ML DEEP LEARNING - LPEI - 120 Days
8 pages
Ds - With - Pyton - FinalPrint
No ratings yet
Ds - With - Pyton - FinalPrint
28 pages
227C4A Data Science
No ratings yet
227C4A Data Science
2 pages
Gujarat Technological University: Overview of Python and Data Structures
No ratings yet
Gujarat Technological University: Overview of Python and Data Structures
4 pages
Python - Final
No ratings yet
Python - Final
3 pages
Data Science Using Python Lab Manual
No ratings yet
Data Science Using Python Lab Manual
68 pages
Pythonmanual
No ratings yet
Pythonmanual
21 pages
DSP U2
No ratings yet
DSP U2
172 pages
Data Science Lab Exp Lis
No ratings yet
Data Science Lab Exp Lis
72 pages
Learninng Plan
No ratings yet
Learninng Plan
6 pages
Laboratory Manual UE24CS1204
No ratings yet
Laboratory Manual UE24CS1204
47 pages
How To Use AI Prompt Analytics
No ratings yet
How To Use AI Prompt Analytics
21 pages
Summary 1
No ratings yet
Summary 1
10 pages
Front Page Pds
No ratings yet
Front Page Pds
3 pages
Wa0003
No ratings yet
Wa0003
4 pages
Theory of Structures - SEM IX - Long Span Structures
No ratings yet
Theory of Structures - SEM IX - Long Span Structures
99 pages
KL-21B Fiber Cleaver Manual
No ratings yet
KL-21B Fiber Cleaver Manual
3 pages
The Kemetic Tree of Life
No ratings yet
The Kemetic Tree of Life
1 page
ENGL 110: Introduction To Academic Writing
No ratings yet
ENGL 110: Introduction To Academic Writing
7 pages
Ahu Catalogue
No ratings yet
Ahu Catalogue
96 pages
Boundary Layer Notes PDF
No ratings yet
Boundary Layer Notes PDF
10 pages
Welding 101 For Hobbyists (And Nerds!) - Practical Engineering
No ratings yet
Welding 101 For Hobbyists (And Nerds!) - Practical Engineering
6 pages
Modular Midterm Activities 2
No ratings yet
Modular Midterm Activities 2
8 pages
Lesson Plan
100% (3)
Lesson Plan
2 pages
Tribox FRP Enclosures - 2023
No ratings yet
Tribox FRP Enclosures - 2023
8 pages
McCarthy, Patsy and Hatcher, Caroline (2002) 'Selling Your Ideas'
No ratings yet
McCarthy, Patsy and Hatcher, Caroline (2002) 'Selling Your Ideas'
22 pages
Corpus-Based Studies of Translational Chinese in EnglishChinese Translation
100% (1)
Corpus-Based Studies of Translational Chinese in EnglishChinese Translation
15 pages
Elements of Dance
100% (1)
Elements of Dance
4 pages
Remedial Teaching
No ratings yet
Remedial Teaching
12 pages
Lect 1 Intro
No ratings yet
Lect 1 Intro
10 pages
1.1 Introduction and Historical Landmarks in The Development of MicrobiologyPowerpoint Presentation
No ratings yet
1.1 Introduction and Historical Landmarks in The Development of MicrobiologyPowerpoint Presentation
24 pages
Transportationupdated
No ratings yet
Transportationupdated
40 pages
Suction Distance Calculation
No ratings yet
Suction Distance Calculation
2 pages
Notice of Awardssss 22 23 Updated
No ratings yet
Notice of Awardssss 22 23 Updated
82 pages
7 Demountable Flare Rev A
No ratings yet
7 Demountable Flare Rev A
3 pages
Compact NSX Lv438803
No ratings yet
Compact NSX Lv438803
3 pages
Weight Steel
No ratings yet
Weight Steel
128 pages
BlueSky 6.0 Release Notes
No ratings yet
BlueSky 6.0 Release Notes
3 pages
Computer Terminology and History
No ratings yet
Computer Terminology and History
15 pages
Jackson 7.19 Homework Problem Solution
No ratings yet
Jackson 7.19 Homework Problem Solution
7 pages
Local Knowledge, Global Goals
100% (1)
Local Knowledge, Global Goals
48 pages
Pre-Test - Performing The Engagement
No ratings yet
Pre-Test - Performing The Engagement
2 pages

Data Science Python All Units

Uploaded by

Data Science Python All Units

Uploaded by

DATA SCIENCE USING PYTHON - COMPLETE UNITS

UNIT 1: INTRODUCTION TO DATA SCIENCE AND PYTHON PROGRAMMING

name = input("Enter your name: ")

2. Perform Creation, indexing, slicing, concatenation and repetition operations on

3. Solve problems using decision and looping statements.

5. Handle numerical operations using math and random number functions

6. Create user-defined functions with different types of function arguments.

def greet(name, msg="Hi"):

UNIT 2: FILE, EXCEPTION HANDLING AND OOP

from mypkg import module1

with open("f1.txt", "w") as f: f.write("Hi")

3. Handle Exceptions using Python Built-in Exceptions

4. Solve problems using Class declaration and Object creation.

5. Implement OOP concepts like Data hiding and Data Abstraction

6. Solve any real-time problem using inheritance concept.

UNIT 3: INTRODUCTION TO NUMPY

2. Manipulation of NumPy arrays- Indexing, Slicing, Reshaping. Joining and Splitting.

3. Computation on NumPy arrays using Universal Functions and Mathematical

data = np.genfromtxt("data.csv", delimiter=",", skip_header=1)

from imageio import imread

UNIT 4: DATA MANIPULATION WITH PANDAS

(b) Get the shape, index and column details

print(df.shape, df.index, df.columns)

(c) Select/Delete the records (rows)/columns based on conditions.

print(df[df['Age'] > 20])

(d) Perform ranking and sorting operations.

(e) Do required statistical operations on the given columns.

(g) Rename single/multiple columns.

df.rename(columns={'Name':'Full Name'}, inplace=True)

UNIT 5: DATA CLEANING, PREPARATION AND VISUALIZATION

(b) Transform data using apply() and map() method.

df['col'] = df['col'].apply(lambda x: x*2)

(c) Detect and filter outliers.

(d) Perform Vectorized String operations on Pandas Series.

import matplotlib.pyplot as plt

You might also like