NoSQL Database
Objectives
After completing this lesson, you should be able to:
• Describe NoSQL Database characteristics
• Differentiate NoSQL from RDBMS and HDFS
What is a NoSQL Database?
• A NoSQL (originally "non-SQL" or "non-relational")
database provides a mechanism for storing and retrieving
data that is modeled by means other than the tabular
relations used in relational databases.
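As a minimal sketch of what "modeled by means other than tabular relations" can look like, the following Python snippet keeps one nested record under a single key. A plain dict stands in for the database, and the key format and field names are illustrative assumptions, not any product's API.

```python
# A plain dict stands in for the database (an assumption for illustration).
store = {}

# Storage: one self-contained, nested record is written under a single key.
# The key format "order:1001" and the field names are hypothetical.
store["order:1001"] = {
    "customer": "alice",
    "items": [
        {"product": "pen", "qty": 2},
        {"product": "ink", "qty": 1},
    ],
}

# Retrieval: a single lookup by key returns the whole record,
# with no rows or columns to reassemble.
order = store["order:1001"]
total_qty = sum(item["qty"] for item in order["items"])
print(total_qty)
```

The record carries its own structure, so the items travel with the order instead of living in a separate table.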
RDBMS Compared to NoSQL
RDBMS                                  | NoSQL
High-value, high-density, complex data | Low-value, low-density, simple data
Complex data relationships             | Very simple relationships
Joins                                  | Avoids joins
Schema-centric, structured data        | Unstructured or semi-structured data
Designed to scale up                   | Distributed storage and processing
Well-defined standards                 | Standards not yet evolved
Database-centric                       | Application- and developer-centric
High security                          | Minimal or no security
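The "Joins" versus "Avoids joins" and "Schema-centric" versus "semi-structured" rows can be illustrated with a small Python sketch. Lists and dicts stand in for tables and collections, and every name in it (customers, orders, order_docs, gift_note) is a hypothetical chosen for the example.

```python
# Relational style: normalized rows, combined at query time with a join.
customers = [{"id": 1, "name": "alice"}, {"id": 2, "name": "bob"}]
orders = [
    {"id": 10, "customer_id": 1, "total": 25.0},
    {"id": 11, "customer_id": 2, "total": 40.0},
]


def join_orders(customers, orders):
    """Hash join: index customers by id, then match each order to its customer."""
    by_id = {c["id"]: c for c in customers}
    return [
        {"order": o["id"], "name": by_id[o["customer_id"]]["name"]}
        for o in orders
    ]


# NoSQL style: customer data is copied into each order record, so no join
# is needed, and records in the same collection may carry different fields.
order_docs = {
    10: {"customer": {"name": "alice"}, "total": 25.0},
    11: {"customer": {"name": "bob"}, "total": 40.0, "gift_note": "Happy birthday"},
}

joined = join_orders(customers, orders)
embedded = [
    {"order": key, "name": doc["customer"]["name"]}
    for key, doc in order_docs.items()
]
print(joined == embedded)
```

Both approaches return the same answer here; the trade-off is that the embedded form duplicates customer data in exchange for reading each order with a single lookup.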
HDFS Compared to NoSQL
HDFS                  | NoSQL
File system           | Database
No inherent structure | Simple data structure
Bulk storage          | Fast access to specific records
Write once, read many | Read, write, delete, update
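The difference in access patterns can be sketched with two toy Python classes. These are in-memory stand-ins written for this comparison only; the class and method names are assumptions and do not correspond to the HDFS API or to any NoSQL product's API.

```python
class AppendOnlyFile:
    """Toy model of write-once, read-many bulk storage."""

    def __init__(self):
        self._lines = []

    def append(self, line):
        # Data is only ever added; existing lines are never changed.
        self._lines.append(line)

    def scan(self, predicate):
        # Finding a record means reading through the whole file.
        return [line for line in self._lines if predicate(line)]


class KeyedStore:
    """Toy model of a database with record-level access by key."""

    def __init__(self):
        self._records = {}

    def put(self, key, value):
        # Covers both write (new key) and update (existing key).
        self._records[key] = value

    def get(self, key):
        return self._records.get(key)

    def delete(self, key):
        self._records.pop(key, None)


log = AppendOnlyFile()
log.append("u1,alice")
log.append("u2,bob")
found = log.scan(lambda line: line.startswith("u2,"))

db = KeyedStore()
db.put("u2", "bob")
db.put("u2", "robert")      # update in place
updated = db.get("u2")      # direct access to one record
db.delete("u2")
after_delete = db.get("u2")
```

The file model has no way to change or remove "u2,bob" once written, while the keyed model supports all four operations on a single record.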
Examples of NoSQL databases
Accumulo, Cassandra, Druid, HBase, Vertica, Apache
CouchDB, ArangoDB, BaseX, Clusterpoint, Couchbase,
Cosmos DB, IBM Domino, MarkLogic, MongoDB,
OrientDB, Qizx, RethinkDB, Aerospike, Apache Ignite,
ArangoDB, Couchbase, Dynamo, FairCom c-treeACE,
FoundationDB, InfinityDB, MemcacheDB, MUMPS, Oracle
NoSQL Database, OrientDB, Redis, Riak, Berkeley DB,
SDBM/Flat File dbm, AllegroGraph, ArangoDB,
InfiniteGraph, Apache Giraph, MarkLogic, Neo4J,
OrientDB, Virtuoso, ArangoDB, Couchbase,
FoundationDB, InfinityDB, MarkLogic, OrientDB, Cosmos
DB
Types and examples
Wide column: Accumulo, Cassandra, Scylla, HBase.
Document: Apache CouchDB, ArangoDB, BaseX,
Clusterpoint, Couchbase, Cosmos DB, eXist-db, IBM Domino,
MarkLogic, MongoDB, OrientDB, Qizx, RethinkDB
Key–value: Aerospike, Apache Ignite, ArangoDB, Berkeley
DB, Couchbase, Dynamo, FoundationDB, InfinityDB,
MemcacheDB, MUMPS, Oracle NoSQL Database, OrientDB,
Redis, Riak, SciDB, SDBM/Flat File dbm, ZooKeeper
Graph: AllegroGraph, ArangoDB, InfiniteGraph, Apache
Giraph, MarkLogic, Neo4J, OrientDB, Virtuoso
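To make the four types concrete, the sketch below holds roughly the same information ("alice follows bob") in each model using plain Python structures. The layouts are simplified assumptions for illustration; real products differ in detail, and none of the names come from a specific database.

```python
import json

# Key-value: an opaque value looked up by a single key.
key_value = {"user:1": b'{"name": "alice"}'}

# Document: a nested, self-describing record that can be queried by field.
document = {"_id": 1, "name": "alice", "follows": [2]}

# Wide column: row key -> column family -> column name -> value.
wide_column = {
    "user:1": {
        "profile": {"name": "alice"},
        "follows": {"user:2": True},
    }
}

# Graph: nodes plus explicit, labeled edges between them.
nodes = {1: {"name": "alice"}, 2: {"name": "bob"}}
edges = [(1, "FOLLOWS", 2)]


def followed_by(node_id):
    """Traverse FOLLOWS edges that start at node_id and return target names."""
    return [
        nodes[dst]["name"]
        for src, label, dst in edges
        if src == node_id and label == "FOLLOWS"
    ]


# The key-value store cannot look inside the value; the application decodes it.
name_from_kv = json.loads(key_value["user:1"])["name"]
print(name_from_kv, followed_by(1))
```

The same facts fit every model; what changes is which lookups are cheap: by key, by field, by row and column, or by relationship.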
Apache Hadoop Ecosystem
Hadoop partial list of Associated Projects
Hadoop Core Components:
• HDFS (Storage)
• MapReduce (Processing)
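The MapReduce processing model can be sketched in a few lines of single-process Python. This is a word count written for illustration under simplifying assumptions; it does not use the Hadoop API, and the function names are hypothetical. In Hadoop, the input would live in HDFS and the phases would run in parallel across a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


counts = word_count(["big data", "Big storage"])
print(counts)
```

Because each map call sees only one line and each reduce call sees only one key, the work can be split across many machines without the functions needing to know about each other.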
Summary
In this lesson, you should have learned how to:
• Describe NoSQL Database characteristics
• Differentiate NoSQL from RDBMS and HDFS