Hadoop
Hadoop
Hadoop
INTRODUCTION
Hadoop is an open-source software framework
for storing and processing large sets of data. It
provides massive storage for any kind of data,
enormous processing power, and the ability to
handle virtually limitless concurrent tasks or
jobs.
As we all know Hadoop is a framework written in Java that utilizes
a large cluster of commodity hardware to maintain and store big
size data. Hadoop works on MapReduce Programming Algorithm
that was introduced by Google. Today lots of Big Brand Companies
are using Hadoop in their Organization to deal with big data, eg.
Facebook, Yahoo, Netflix, eBay, etc.
YARN is a Framework on which MapReduce works. It processes job requests and manages
cluster resources.
YARN contains:
1. Resource Manager: The use of Resource Manager is to manage all the resources that are
made available for running a Hadoop cluster.
2. Node Manager: Handles the nodes and monitors the resources.
3. Application Manager: works as an interface between resource and node manager.
4. Container: It holds collection of multiple physical resources.