Subject code- 13305 Semester - III
Subject title - Introduction to Database, Big Data and SQL
4 credits
Objectives:
1. To enable learners to develop knowledge in database field.
2. To enable the learner to identify, develop and apply problem solving skills.
3. Provide the learner with a comprehensive platform for career development, innovation and
further study.
Course Outcome –
1. Overview of data with its meaning, scope, and key features of database concepts
2. Understand and use of relational database in financial industry
3. Analyse the data using programming language like SQL
4. Understand the concepts of Hadoop components, its application for big data
5. Apply Database concept in financial problem.
Unit Topic No. of
lectures
1 Data Management and Storage: 6
Types of data
Database-Advantages and Applications , Types of
database - Structured and Unstructured database
Relational databases and SQL
NoSQL databases – Types of NoSQL DB (MongoDB,
Cassandra, etc.)
Hadoop Distributed File System (HDFS) and data
replication
2 Introduction to DBMS 12
Data Models – Definition , types of data models
Database users ,System Structure , Data base
administrator – Basic concepts, Difference between
DBMS and RDBMS
Mapping constraints ,Keys ,ER Diagram – Work
Entity sets – Design of an ER Database Schema –
reduction of ER Schema to tables
3 SQL(Structured Query Language) 15
Types of languages - DDL, DML – other SQL
features
Aggregate functions
Null values –queries Modification of databases using
join Relations
4 Introduction to Big data 12
Introduction to big data, its uses and applications
Characteristics, the relevance of big data
Evolution of big data
Data storage and processing challenges
Introduction to distributed computing and parallel
processing
Vs of big data, case studies
Intelligence vs big data
Operational vs Analytical Big data
Types and need for big data Analytics
5 Big data using Hadoop 15
Introduction to Hadoop
Features of Hadoop, synergies to business
Overview of Hadoop systems, its versions
Hadoop vs SQL, RDBMS vs Hadoop
Hadoop architecture, components
Apache Spark and its ecosystem.
Challenges in the use of big data
Benefits of big data with respect to operational
segments
Total 60
Reference Books:
1. Database System Concepts by Avi Silberschatz ,Henry F. Korth,S. Sudarshan McGraw-Hill
2. SQL, PL/SQL – The Programming Language of Oracle by Ivan Bayross
3. SeemaAcharya, SubhasiniChellappan, "Big Data Analytics" Wiley 2015.
4. Jay Liebowitz, “Big Data and Business Analytics” Auerbach Publications, CRC press
(2013)
Teaching Methodology/Pedagogy-
Lectures using PPTs, Demo of oracle software packages through examples,hands on
programming language using SQL. assignments, homework, practice problems