0% found this document useful (0 votes)
26 views5 pages

CCS334 Bda

The document outlines a course on Big Data Analytics, covering key topics such as big data understanding, NoSQL data management, Hadoop basics, MapReduce applications, and Hadoop-related tools. It includes course objectives, outcomes, a list of experiments, and required software. The course aims to equip students with practical skills in big data technologies and analytics tools.

Uploaded by

rajdmice
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
26 views5 pages

CCS334 Bda

The document outlines a course on Big Data Analytics, covering key topics such as big data understanding, NoSQL data management, Hadoop basics, MapReduce applications, and Hadoop-related tools. It includes course objectives, outcomes, a list of experiments, and required software. The course aims to equip students with practical skills in big data technologies and analytics tools.

Uploaded by

rajdmice
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

CCS334 – BIG DATA ANALYTICS

AIM OF OBJECTIVES:

 To understand big data.


 To learn and use NoSQL big data management.
 To learn Map Reduce analytics using Hadoop and related tools.
 To work with map-reduce applications.
 To understand the usage of Hadoop-related tools for Big Data Analytics.

UNIT I: UNDERSTANDING BIG DATA


Introduction to big data – convergence of key trends – unstructured data – industry examples of big data –
web analytics – big data applications– big data technologies – introduction to Hadoop – open source
technologies – cloud and big data – mobile business intelligence – Crowd sourcing analytics – inter and
trans firewall analytics.

UNIT II: NOSQL DATA MANAGEMENT


Introduction to NoSQL – aggregate data models – key-value and document data models – relationships –
graph databases – schemaless databases – materialized views – distribution models – master-slave
replication – consistency – Cassandra – Cassandra data model – Cassandra examples – Cassandra clients.

UNIT III: BASICS OF HADOOP


Data format – analyzing data with Hadoop – scaling out – Hadoop streaming – Hadoop pipes – design of
Hadoop distributed file system (HDFS) – HDFS concepts – Java interface – data flow – Hadoop I/O – data
integrity – compression – serialization – Avro – file-based data structures Cassandra – Hadoop integration.

UNIT IV: MAP REDUCE APPLICATIONS


MapReduce workflows – unit tests with MRUnit – test data and local tests – anatomy of MapReduce job
run – classic Map-reduce – YARN – failures in classic Map-reduce and YARN – job scheduling – shuffle
and sort – task execution – MapReduce types – input formats – output formats.

UNIT V: HADOOP RELATED TOOLS


Hbase – data model and implementations – Hbase clients – Hbase examples – praxis. Pig – Grunt – pig
data model – Pig Latin – developing and testing Pig Latin scripts. Hive – data types and file formats –
HiveQL data definition – HiveQL data manipulation – HiveQL queries.
COURSE OUTCOMES:
After the completion of this course, students will be able to:
CO1:Describe big data and use cases from selected business domains.
CO2:Explain NoSQL big data management.
CO3:Install, configure, and run Hadoop and HDFS.
CO4:Perform map-reduce analytics using Hadoop.
CO5:Use Hadoop-related tools such as HBase, Cassandra, Pig, and Hive for big data analytics

TEXT BOOK:

1. Michael Minelli, Michelle Chambers, and AmbigaDhiraj, “Big Data, Big Analytics: Emerging
Business Intelligence and Analytic Trends for Today’s Businesses”, Wiley, 2013.
2. Eric Sammer, “Hadoop Operations”, O’Reilley, 2012.
3. Sadalage, Pramod J. “NoSQL distilled”, 2013

REFERENCES:

1. E. Capriolo, D. Wampler, and J. Rutherglen, “Programming Hive”, O’Reilley, 2012.


2. Lars George, “HBase: The Definitive Guide”, O’Reilley, 2011.
3. Eben Hewitt, “Cassandra: The Definitive Guide”, O’Reilley, 2010.
4. Alan Gates, “Programming Pig”, O’Reilley, 2011.

LIST OF EXPERIMENTS: 30
PERIODS
1. Downloading and installing
Hadoop; Understanding
different Hadoop modes.
Startup scripts,
Configuration files.
2. Hadoop Implementation of
file management tasks, such
as Adding files and
directories, retrieving files and
Deleting files
3. Implement of Matrix
Multiplication with Hadoop
Map Reduce
4. Run a basic Word Count
Map Reduce program to
understand Map Reduce
Paradigm.
5. Installation of Hive along
with practice examples.
7. Installation of HBase,
Installing thrift along with
Practice examples
8. Practice importing and
exporting data from various
databases.
LIST OF EXPERIMENTS: 30 PERIODS

1. Downloading and installing Hadoop; Understanding different Hadoop modes. Startup scripts,
Configuration files.

2. Hadoop Implementation of file management tasks, such as Adding files and directories, retrieving files
and Deleting files

3. Implement of Matrix Multiplication with Hadoop Map Reduce

4. Run a basic Word Count Map Reduce program to understand Map Reduce Paradigm.

5. Installation of Hive along with practice examples.

6. Installation of HBase, Installing thrift along with Practice examples

7. Practice importing and exporting data from various databases.

Software Requirements:
Cassandra, Hadoop, Java, Pig,
Hive and HBase.
TEXT BOOKS:
TOTAL: 60 PERIODS
SOFTWARE REQUIREMENTS:

Cassandra, Hadoop, Java, Pig, Hive and HBase. TOTAL: 60 PERIODS

CO’s- PO’s & PSO’s MAPPING

CO’ PO’s PSO’s


s 1 2 3 4 5 6 7 8 9 10 11 12 1 2
1 3 3 3 3 3 - - - 2 2 3 1 1 3
2 3 3 2 3 2 - - - 2 2 3 3 2 3
3 3 3 3 2 3 - - - 2 2 1 2 2 3
4 2 3 3 3 3 - - - 2 2 3 2 3 3
5 3 3 3 3 3 - - - 3 1 3 2 3 2
AVG 2.8 3 2.8 2.8 2.8 - - - 2.2 1.8 2.6 2 2.2 2.8

1 - LOW, 2 - MEDIUM, 3 - HIGH, ‘-' - NO CORRELATION

You might also like