Lecture 1 - Introduction to Data Science
Lecture 1 - Introduction to Data Science
Science
ANUP APREM
Big Data Phenomenon
• We are collecting and storing data at an unprecedented
rate.
• Examples: –
• YouTube, Facebook, MOOCs, news sites.
• Credit cards transactions and Amazon purchases.
• Transportation data (Google Maps, Waze, Uber)
• Gene expression data and protein interaction assays. –
• Maps and satellite data.
• Large hadron collider and surveying the sky.
• Phone call records and speech recognition results.
• Video game worlds and user actions.
Data Science
• What to do with all this data?
• Too much data to search through manually
• But there is valuable information in the data
• How can we use it for fun, profit, and/or greater good
• Process of extracting information from raw data is called data analysis.
• Interface to databases
• SQL, NoSQL
Course Project: Identify a suitable data problem, obtain the dataset, create a
database and perform data visualization on the problem
Proposal due: One week after Midterm
Course Project due: Last but one week of class (one week for evaluation/viva)
Acknowledgement
• Couse developed in 2021 through British Council Going Global
Exploratory Grant in partnership with Oxford Brookes University, UK