ProgrammingHadoop ApacheConUS08
ProgrammingHadoop ApacheConUS08
Hadoop Map-Reduce
Programming, Tuning & Debugging
Arun C Murthy
Yahoo! CCDI
acm@yahoo-inc.com
ApacheCon US 2008
Existential angst: Who am I?
• Yahoo!
– Grid Team (CCDI)
• Apache Hadoop
– Developer since April 2006
– Core Committer (Map-Reduce)
– Member of the Hadoop PMC
Hadoop - Overview
• Hadoop includes:
– Distributed File System - distributes data
– Map/Reduce - distributes application
• Open source from Apache
• Written in Java
• Runs on
– Linux, Mac OS/X, Windows, and Solaris
– Commodity hardware
Distributed File System