github764/bigdata-notebook

 
 

This branch updates the storm-kafka-poc with code changes for Apache Storm version 1.0. For other projects, switch to the master branch.

Hadoop and ML repository

A repository for all my Hadoop and machine learning code.

Visit my blog at www.vishnuviswanath.com

Contents

  1. Spark ML, Streaming, SQL and GraphX
  2. Flink Streaming
  3. Storm-Kafka streaming application POC
  4. Flume custom source and config files
  5. Hadoop MapReduce (old API): joins, custom types, etc.
  6. Solutions for Kaggle problems using NumPy or GraphLab

Languages

  • Scala 50.9%
  • Java 35.9%
  • Python 11.4%
  • Dockerfile 1.1%
  • Shell 0.7%