0% found this document useful (0 votes)
33 views

1-Big Data Systems, Programming and Management

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
33 views

1-Big Data Systems, Programming and Management

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Symbiosis International (Deemed University)

COURSE DETAILS

Name of the Programme : Master of Science (Computer Application)


Semester : 2
Course Name : Big Data: Systems, Programming and
Management
Course Code : T3527
No. of Credit : 4

Learning Objectives:
This course aims to deliver the knowledge about Big Data and associated
technologies. At the end of the course, students will possess the skills
necessary for utilizing tools to handle big data and able to apply the analytics
techniques on a variety of applications.

Pre-requisites:
 Good knowledge of RDBMS and Software Engineering.

Course Outline:
Module Topic
No.
1 Big Data Overview: -
What is Big data? Why Big-Data? Why Are Big Data Systems
Different? Explain unstructured data, industry examples of big
data, web analytics, big data and marketing, fraud and big data,
risk and big data 5 ,credit risk management, big data and
algorithmic trading, big data and healthcare, big data in
medicine, advertising and big data
2 Introduction to big data technologies: -
Symbiosis International (Deemed University)

introduction to Hadoop, open source technologies, cloud and


big data mobile business intelligence, Crowd sourcing analytics
,inter and trans firewall analytics, Case Studies
3 NoSQL:
Introduction to NoSQL, what is NoSQL? Benefits of NoSQL over
RDBMS , Types of NoSQL Databases
4 Hadoop – Introduction:
introduction to Hadoop Hadoop Architecture, MapReduce,
Hadoop distributed File System -HDFS Overview , HDFS
Operations, Hadoop - Command , Practical Sessions
5 MapReduce – Introduction
What is MapReduce? MapReduce – Algorithm, How MapReduce
is used? MapReduce - Hadoop Implementation - anatomy of
Map Reduce job run , classic Map-reduce , YARN , failures in
classic Map-reduce and YARN , job scheduling , shuffle and sort
, task execution , MapReduce types , input formats , output
format Map Reduce application for word counting on Hadoop
cluster
6 Hadoop Related Tools:
Hbase,data model and implementations, Hbase clients ,Hbase
examples – praxis. Cassandra Cassandra data model,
cassandra examples, cassandra clients, Hadoop integration.
Pig, Grunt, pig data model, Pig Latin, developing and testing Pig
Latin scripts. Hive , data types and file formats , HiveQL data
definition , HiveQL data manipulation – HiveQL queries
Symbiosis International (Deemed University)

Books Recommended:
1. Michael Minelli, MichelleChambers, and Ambiga Dhiraj, "Big Data, Big
Analytics: Emerging Business Intelligence and Analytic Trends for
Today's Businesses", Wiley, 2013.
2. Big-Data Black Book, DT Editorial Services, Wily India
3. P. J. Sadalage and M. Fowler, "NoSQL Distilled: A Brief Guide to the
Emerging World of Polyglot Persistence", Addison-Wesley Professional,
2012.
4. Tom White, "Hadoop: The Definitive Guide", Third Edition, O'Reilley,
2012.
5. Eric Sammer, "Hadoop Operations", O'Reilley, 2012.
6. E. Capriolo, D. Wampler, and J. Rutherglen, "Programming Hive",
O'Reilley, 2012.
7. Lars George, "HBase: The Definitive Guide", O'Reilley, 2011.
8. Eben Hewitt, "Cassandra: The Definitive Guide", O'Reilley, 2010.
9. Alan Gates, "Programming Pig", O'Reilley, 2011.

You might also like