Python For Data Science Cheat Sheet
Importing Data
Learn Python for Data Science Interactively

Excel Spreadsheets
>>> file = 'urbanpop.xlsx'
>>> data = pd.ExcelFile(file)
>>> df_sheet2 = data.parse('1960-1966',
                           skiprows=[0],
                           names=['Country',
                                  'AAM: War(2002)'])
>>> df_sheet1 = data.parse(0,
                           parse_cols=[0],
                           skiprows=[0],
                           names=['Country'])
To access the sheet names, use the sheet_names attribute:
>>> data.sheet_names

Pickled Files
>>> import pickle
>>> with open('pickled_fruit.pkl', 'rb') as file:
        pickled_data = pickle.load(file)

HDF5 Files
>>> import h5py
>>> filename = 'H-H1_LOSC_4_v1-815411200-4096.hdf5'
>>> data = h5py.File(filename, 'r')
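The pickled_fruit.pkl file above is assumed to already exist. A minimal round trip that first creates and then reloads such a pickle (the file name and dict contents here are made up) looks like:

```python
import os
import pickle
import tempfile

fruit = {"apples": 3, "pears": 2}                          # made-up contents
path = os.path.join(tempfile.gettempdir(), "pickled_fruit.pkl")

with open(path, "wb") as f:     # pickle is a binary format: write bytes
    pickle.dump(fruit, f)

with open(path, "rb") as f:     # 'rb' mode, as in the snippet above
    pickled_data = pickle.load(f)

print(pickled_data == fruit)    # the round trip preserves the dict
```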
Importing Data in Python
Most of the time, you'll use either NumPy or pandas to import
your data:
>>> import numpy as np
>>> import pandas as pd

Help
>>> np.info(np.ndarray.dtype)
>>> help(pd.read_csv)

SAS Files
>>> from sas7bdat import SAS7BDAT
>>> with SAS7BDAT('urbanpop.sas7bdat') as file:
        df_sas = file.to_data_frame()

Stata Files
>>> data = pd.read_stata('urbanpop.dta')

Matlab Files
>>> import scipy.io
>>> filename = 'workspace.mat'
>>> mat = scipy.io.loadmat(filename)
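help() only prints to the screen; to capture the same documentation text as a string, the standard-library pydoc module can be used. A self-contained sketch on the built-in len (chosen only so nothing external is needed):

```python
import pydoc

# pydoc.render_doc returns the text that help(len) would print
doc = pydoc.render_doc(len, renderer=pydoc.plaintext)
print(doc.splitlines()[0])   # title line naming the documented object
```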
Text Files

Plain Text Files
>>> filename = 'huck_finn.txt'
>>> file = open(filename, mode='r')   Open the file for reading
>>> text = file.read()                Read a file's contents
>>> print(file.closed)                Check whether the file is closed
>>> file.close()                      Close the file
>>> print(text)
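huck_finn.txt is assumed to be on disk; the same open/read/close sequence works on any text file, for example one written to a temporary directory first (the file name and contents here are invented):

```python
import os
import tempfile

path = os.path.join(tempfile.gettempdir(), "huck_finn_demo.txt")
with open(path, "w") as f:                 # create a stand-in text file
    f.write("first line\nsecond line\n")

file = open(path, mode="r")                # open the file for reading
text = file.read()                         # read the whole contents
file.close()                               # close the file
print(file.closed)                         # -> True
```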
Using the context manager with
>>> with open('huck_finn.txt', 'r') as file:
        print(file.readline())   Read a single line
        print(file.readline())
        print(file.readline())

Relational Databases
>>> from sqlalchemy import create_engine
>>> engine = create_engine('sqlite:///Northwind.sqlite')
Use the table_names() method to fetch a list of table names:
>>> table_names = engine.table_names()

Querying Relational Databases
>>> con = engine.connect()
>>> rs = con.execute("SELECT * FROM Orders")
>>> df = pd.DataFrame(rs.fetchall())
>>> df.columns = rs.keys()
>>> con.close()
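The snippets above assume an existing Northwind.sqlite database. The same connect/execute/fetch pattern can be exercised end to end with the standard-library sqlite3 module against a throwaway in-memory table (table and column names here are invented):

```python
import sqlite3

con = sqlite3.connect(":memory:")            # no database file needed
con.execute("CREATE TABLE Orders (OrderID INTEGER, Item TEXT)")
con.executemany("INSERT INTO Orders VALUES (?, ?)",
                [(1, "chai"), (2, "syrup")])

cur = con.execute("SELECT * FROM Orders")
cols = [d[0] for d in cur.description]       # column names of the result
rows = cur.fetchall()                        # all result rows as tuples
con.close()
print(cols, rows)
```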
Using the context manager with
>>> with engine.connect() as con:
        rs = con.execute("SELECT OrderID FROM Orders")
        df = pd.DataFrame(rs.fetchmany(size=5))
        df.columns = rs.keys()

Querying relational databases with pandas
>>> df = pd.read_sql_query("SELECT * FROM Orders", engine)

Exploring Dictionaries

Accessing Elements with Functions
>>> print(mat.keys())         Print dictionary keys
>>> for key in data.keys():   Print dictionary keys
        print(key)
>>> pickled_data.values()     Return dictionary values
>>> print(mat.items())        Return items as a list of (key, value) tuple pairs

Accessing Data Items with Keys
>>> for key in data['meta'].keys():   Explore the HDF5 structure
        print(key)
    Description
    DescriptionURL
    Detector
    Duration
    GPSstart
    Observatory
    Type
    UTCstart
>>> print(data['meta']['Description'].value)   Retrieve the value for a key
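mat, data and pickled_data above only exist once the corresponding files have loaded; the same keys()/values()/items() calls can be tried on any plain dict (the contents below are made up):

```python
# stand-in for the dict that scipy.io.loadmat would return
mat = {"__header__": "MATLAB 5.0", "x": [1, 2, 3]}

print(list(mat.keys()))     # dictionary keys
print(list(mat.values()))   # dictionary values
print(list(mat.items()))    # (key, value) tuple pairs
```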
Table Data: Flat Files

Importing Flat Files with numpy
Files with one data type
>>> filename = 'mnist.txt'
>>> data = np.loadtxt(filename,
                      delimiter=',',   String used to separate values
                      skiprows=2,      Skip the first 2 lines
                      usecols=[0,2],   Read the 1st and 3rd column
                      dtype=str)       The type of the resulting array
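np.loadtxt also accepts any file-like object, so the call above can be tried without mnist.txt on disk (the sample rows below are invented):

```python
import numpy as np
from io import StringIO

raw = StringIO("col_a,col_b,col_c\n"   # header line 1
               "ints,ints,ints\n"      # header line 2
               "1,2,3\n"
               "4,5,6\n")
data = np.loadtxt(raw,
                  delimiter=",",   # values separated by commas
                  skiprows=2,      # skip the two header lines
                  usecols=[0, 2])  # keep the 1st and 3rd column
print(data)   # [[1. 3.] [4. 6.]]
```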
Navigating Your FileSystem

Magic Commands
!ls      List directory contents of files and directories
%cd ..   Change current working directory
%pwd     Return the current working directory path
Files with mixed data types
>>> filename = 'titanic.csv'
>>> data = np.genfromtxt(filename,
                         delimiter=',',   String used to separate values
                         names=True,      Look for column header
                         dtype=None)
>>> data_array = np.recfromcsv(filename)
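With names=True, np.genfromtxt returns a structured array whose columns are addressable by name. A self-contained run on invented data:

```python
import numpy as np
from io import StringIO

raw = StringIO("name,age\nalice,30\nbob,25\n")
data = np.genfromtxt(raw,
                     delimiter=",",
                     names=True,        # first row supplies column names
                     dtype=None,        # infer a type per column
                     encoding="utf-8")  # decode strings, not bytes
print(data.dtype.names)   # ('name', 'age')
print(data["age"])        # [30 25]
```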
Importing Flat Files with pandas
>>> filename = 'winequality-red.csv'
>>> data = pd.read_csv(filename,
                       nrows=5,         Number of rows of file to read
                       header=None,     Row number to use as col names
                       sep='\t',        Delimiter to use
                       comment='#',     Character to split comments
                       na_values=[""])  String to recognize as NA/NaN

Exploring Your Data

NumPy Arrays
>>> data_array.dtype    Data type of array elements
>>> data_array.shape    Array dimensions
>>> len(data_array)     Length of array

pandas DataFrames
>>> df.head()      Return first DataFrame rows
>>> df.tail()      Return last DataFrame rows
>>> df.index       Describe index
>>> df.columns     Describe DataFrame columns
>>> df.info()      Info on DataFrame
>>> data_array = data.values   Convert a DataFrame to a NumPy array

os Library
>>> import os
>>> path = "/usr/tmp"
>>> wd = os.getcwd()          Store the name of the current directory in a string
>>> os.listdir(wd)            Output the contents of the directory in a list
>>> os.chdir(path)            Change the current working directory
>>> os.rename("test1.txt",    Rename a file
              "test2.txt")
>>> os.remove("test1.txt")    Delete an existing file
>>> os.mkdir("newdir")        Create a new directory
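pd.read_csv likewise accepts file-like objects, so the parameters above can be exercised without winequality-red.csv (the sample rows below are invented):

```python
import pandas as pd
from io import StringIO

raw = StringIO("# source: demo file\n"
               "7.4\t0.70\n"
               "7.8\t?\n")
df = pd.read_csv(raw,
                 sep="\t",          # tab-delimited values
                 header=None,       # no header row in the data
                 comment="#",       # drop lines starting with '#'
                 na_values=["?"])   # read '?' as NaN
print(df.shape)   # (2, 2)
```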