0% found this document useful (0 votes)

31 views47 pages

Chapter 3 NoSQL Database

Uploaded by

thedeveloper333

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views47 pages

Chapter 3 NoSQL Database

Uploaded by

thedeveloper333

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 47

Introduction To NoSQL Database

AIDS – B.E – BDA

Dr. Pooja K Revankar
Assistant Professor,
Dept. of Computer Science and Engg.,
SIES Graduate School of Technology

1
Dr. Pooja K R
Agenda
• Introduction to NoSQL

• Limitations of Relational Database

• What is NoSQL

• Business Drivers of NoSQL

• NoSQL Data Architecture Patterns

• NoSQL solution for big data

• Choosing distribution models

2
Dr. Pooja K R
Introduction to NoSQL Databases

•A database Management System provides the mechanism to store

and retrieve the data.

•There are different kinds of database Management Systems:

1. RDBMS (Relational Database Management Systems)

2. OLAP (Online Analytical Processing)

3. NoSQL (Not only SQL)

3
Dr. Pooja K R
Different SQL Databases

4
Dr. Pooja K R
What is NoSQL?

NoSQL is a set of concepts that allows the rapid and

efficient processing of data sets with a focus on
performance, reliability, and agility.

5
Dr. Pooja K R
Limitations of Relational databases
•Need to define structure and schema of data first and then
only we can process the data.

•Provides consistency and integrity of data by

enforcing ACID properties.

•Most of the applications store their data in JSON format.

•RDBMS don’t provide you a better way of performing

operations such as create, insert, update, delete etc on this
data.

6
Dr. Pooja K R
Advantages of NoSQL

•High scalability

•High Availability

7
Dr. Pooja K R
RDBMS Vs NoSQL
• RDBMS: It is a structured data that provides more functionality but
gives less performance.

• NoSQL: Structured or semi structured data, less functionality and high

performance.

8
Dr. Pooja K R
NOSQL DATABASES

9
Dr. Pooja K R
What is NoSQL?
• More than rows in tables

• Free of joins

• Schema-free

• Works on many processors

• Uses shared-nothing commodity computers

• Supports linear scalability

• Innovative

10
Dr. Pooja K R
NoSQL Database Categories

•Document Database

•Key value stores

•Graph store

•Wide column stores

11
Dr. Pooja K R
NoSQL Data Architecture Patterns

12
Dr. Pooja K R
NOSQL BUSINESS DRIVERS

 VOLUME

 VELOCITY

 VARIABILITY

 AGILITY

13
Dr. Pooja K R
What is the CAP Theorem?

CAP theorem is also called brewer's theorem. It states that

is impossible for a distributed data store to offer more than
two out of three guarantees:

1. Consistency
2. Availability
3. Partition Tolerance

14
Dr. Pooja K R
BASE Properties

15
Dr. Pooja K R
BASE Properties

NoSQL relies upon a softer model known as the BASE model(instead of

ACID properties)

 Basically Available: Guarantees the availability of the data . There

will be a response to any request (can be failure too).

 Soft state: The state of the system could change over time.

 Eventual consistency: The system will eventually become

consistent once it stops receiving input.

16
Dr. Pooja K R
NoSQL Database Categories

•Document Database

•Key value stores

•Graph store

•Wide column stores

17
Dr. Pooja K R
NoSQL Data Architecture Patterns

18
Dr. Pooja K R
Data Models
NoSQL databases are classified in four major data
models :

19
Dr. Pooja K R
Key-value
 Simplest NOSQL databases

 The main idea is the use of a hash table

 Access data (values) by strings called keys

 Data has no required format

 Data model: (key, value) pairs

 Key maps to a BLOB(Binary Large Object)

 Example of Key-value store DataBase : Redis,

Dynamodb, Riak, Memcache etc.
20
Dr. Pooja K R
Operations using KEY VALUE STORE

• Get(key)

• Put (key, value)

• Multi-get(Key1, Key2,….Keyn)

• Delete(key)

21
Dr. Pooja K R
KEY VALUE STORE PROS

 Any data type in value field

 Consistent

 Returned values on queries can be used to convert into lists,

data frames etc.

 Scalable

 Reliable

 Key can be synthetic or auto generated

22
Dr. Pooja K R
KEY VALUE STORE CONS

 No indexes are made on values.

 Do not provide traditional DBMS capabilities ,such as ACID

properties when multiple transactions are executed
simultaneously.

 No queries on values.

 Maintaining unique keys is a problem if volume is large.

23
Dr. Pooja K R
Key Value Stores

24
Dr. Pooja K R
Key Value Stores

25
Dr. Pooja K R
Document-Based Store NoSQL

•In this type of database, the record and its associated data are stored
in a single document.

•So this model is not completely unstructured but it is a kind of Semi-

structured data.

•The difference between a document and Key value pair is that in

document type storage is that in this type some kind of encoding is
provided while storing the data in documents.

• It can be XML encoding or JSON encoding.

•The below example shows a document that can be stored in a

document database but with a different encoding.

26
Dr. Pooja K R
DOCUMENT STORES
 The central concept of a document-oriented database is the notion
of a document.

 Documents in a document store are roughly equivalent to the

programming concept of an object.

 They are not required to adhere to a standard schema, nor will

they have all the same sections, slots, parts or keys.

 Generally, programs using objects have many different types of

objects, and those objects often have many optional fields.

 Every object, even those of the same class, can look very different.

 Document stores are similar in that they allow different types of

documents in a single store, allow the fields within them to be
optional, and often allow them to be encoded using different
encoding systems.

27
Dr. Pooja K R
DOCUMENT STORES

JSON DOCUMENT XML DOCUMENT

28
Dr. Pooja K R
DOCUMENT STORES

29
Dr. Pooja K R
Document-Based Store NoSQL
•The document type is mostly used for CMS systems, blogging
platforms, real-time analytics & e-commerce applications. It should not
use for complex transactions which require multiple operations or
queries against varying aggregate structures.

•Amazon SimpleDB, CouchDB, MongoDB, Riak, Lotus Notes,

MongoDB, are popular Document originated DBMS systems.

30
Dr. Pooja K R
Example:

•The difference between conventional databases and document-based

databases is that data here is not stored in tables like conventional
databases but are stored in documents.

•The examples of databases using the above data model are MongoDB
and Couchbase.

•These types of databases are used extensively especially in big data

analysis.
31
Dr. Pooja K R
COLUMN ORIENTED DATABASES
 Column-oriented databases primarily work on columns and every column is treated

individually.

 Values of a single column are stored contiguously.

 Column stores data in column specific files.

 In Column stores, query processors work on columns too.

 All data within each column data file have the same type which makes it ideal for

compression.

 Column stores can improve the performance of queries as it can access specific

column data.

 High performance on aggregation queries (e.g. COUNT, SUM, AVG, MIN, MAX).

 Works on data warehouses and business intelligence, customer relationship

management (CRM), Library card catalogs etc.

32
 Example of Column-oriented databases : BigTable, Cassandra, SimpleDB etc
Dr. Pooja K R
COLUMN-ORIENTED DATABASE

33
Dr. Pooja K R
GRAPH DATABASES
 A graph database stores data in a graph.

 It is capable of elegantly representing any kind of data in a highly

accessible way.
 A graph database is a collection of nodes and edges.

 Each node represents an entity (such as a student or business)

and each edge represents a connection or relationship between
two nodes.

 Every node and edge is defined by a unique identifier.

 Each node knows its adjacent nodes.

 As the number of nodes increases, the cost of a local step (or hop)
remains the same.
 Index for lookups.
 Example of Graph databases: OrientDB, Neo4J, Titan.etc.
38
Dr. Pooja K R
GRAPH STORES

39
Dr. Pooja K R
GRAPH STORES

40
Dr. Pooja K R
Analyzing big data with a shared-nothing architecture

41
Dr. Pooja K R
Analyzing big data with a shared-nothing architecture

42
Dr. Pooja K R
Analyzing big data with a shared-nothing architecture

•A shared nothing architecture (SN) is a distributed computing

architecture in which each node is independent and self-sufficient, and
there is no single point of contention across the system.

•More specifically, none of the nodes share memory or disk storage.

•People typically contrast SN with systems that keep a large amount of

centrally-stored state information, whether in a database, an application
server, or any other similar single point of contention.

43
Dr. Pooja K R
Analyzing big data with a shared-nothing architecture
•The advantages of SN architecture versus a central entity that controls
the network (a controller-based architecture) include eliminating any
single point of failure, allowing self-healing capabilities and providing an
advantage with offering non-disruptive upgrades.

•Shared nothing is popular for web development because of its

scalability.

•SN system can scale almost infinitely simply by adding nodes in the
form of inexpensive computers, since there is no single bottleneck to
slow the system down.

•A SN system typically partitions its data among many nodes on

different databases (assigning different computers to deal with different
users or queries),
• It may require every node to maintain its own copy of the application's
data, using some kind of coordination protocol. This is often referred to
as database sharding.
44
Dr. Pooja K R
Choosing distribution models: master-slave versus peer-to-peer

45
Dr. Pooja K R
Master-slave versus peer-to-peer

• In master-slave configuration where all incoming database requests

(reads or writes) are sent to a single master node and redistributed
from there.

•The master node is called the NameNode in Hadoop.

• This node keeps a database of all the other nodes in the cluster and
the rules for distributing requests to each node.

• In the peer-to-peer model stores all the information about the cluster
on each node in the cluster.

•If any node crashes, the other nodes can take over and processing
can continue.

46
Dr. Pooja K R
Choosing distribution models: master-slave versus peer-to-
peer
• Peer-to-peer systems distribute the responsibility of the master to
each node in the cluster.
• In this situation, testing is much easier since you can remove any
node in the cluster and the other nodes will continue to function.
•The disadvantage of peer-to-peer networks is that there’s an
increased complexity and communication overhead that must occur for
all nodes to be kept up to date with the cluster status.

47
Dr. Pooja K R
Master Slave Distribution Model
•With a master-slave distribution model, the role of managing the
cluster is done on a single master node.
•This node can run on specialized hardware such as RAID drives to
lower the probability that it crashes.
•The cluster can also be configured with a standby master that’s
continually updated from the master node.
•The challenge with this option is that it’s difficult to test the standby
master without jeopardizing the health of the cluster.
•Failure of the standby master to take over from the master node is a
real concern for high-availability operations.

48
Dr. Pooja K R
NoSQL systems to handle big data problems

49
Dr. Pooja K R
Case Study:

• Google maps stores GIS in Bigtable

• Storing analytical information in BigTables
•References:
• https://dzone.com/articles/what-nosql

50
Dr. Pooja K R
Thank You!
(poojakr@sies.edu.in)

51
Dr. Pooja K R

Online Teacher Transfer
100% (7)
Online Teacher Transfer
58 pages
Lecture 6 - NoSQL
No ratings yet
Lecture 6 - NoSQL
28 pages
NOsql Presentation
No ratings yet
NOsql Presentation
20 pages
NOSQL Lecture 1 Notes
No ratings yet
NOSQL Lecture 1 Notes
31 pages
Lecture 3.1.2
No ratings yet
Lecture 3.1.2
47 pages
Unit 2
No ratings yet
Unit 2
26 pages
Lecture 1 - NoSQL
No ratings yet
Lecture 1 - NoSQL
31 pages
NoSql 2024 Assign2
No ratings yet
NoSql 2024 Assign2
189 pages
Lecture 1
No ratings yet
Lecture 1
31 pages
MongoDB Slides Until ClassTest
No ratings yet
MongoDB Slides Until ClassTest
221 pages
Big Data Unit 3
No ratings yet
Big Data Unit 3
374 pages
Unit 2
No ratings yet
Unit 2
65 pages
Unit Ii - Nosql Databases
No ratings yet
Unit Ii - Nosql Databases
112 pages
NoSQL Tutorial - New
No ratings yet
NoSQL Tutorial - New
10 pages
DBMS Lecture13 NoSQL
No ratings yet
DBMS Lecture13 NoSQL
31 pages
Cs 620 / Dasc 600 Introduction To Data Science & Analytics: Lecture 6-Nosql
No ratings yet
Cs 620 / Dasc 600 Introduction To Data Science & Analytics: Lecture 6-Nosql
31 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
29 pages
Module 1 Introduction
No ratings yet
Module 1 Introduction
9 pages
NoSQL Notes
No ratings yet
NoSQL Notes
11 pages
Unit VI - 1
No ratings yet
Unit VI - 1
31 pages
No SQL
No ratings yet
No SQL
12 pages
Nosql Tricks
No ratings yet
Nosql Tricks
34 pages
DBMS - Unit 5 (NoSQL Databases)
No ratings yet
DBMS - Unit 5 (NoSQL Databases)
35 pages
Unit 2 Handouts
No ratings yet
Unit 2 Handouts
11 pages
Module 5 - NoSQL Databases
No ratings yet
Module 5 - NoSQL Databases
33 pages
Full Stack UNIT3
No ratings yet
Full Stack UNIT3
57 pages
Unit II No-SQL DB Managment
No ratings yet
Unit II No-SQL DB Managment
33 pages
Chapter14 BigData&NoSQLDatabases
No ratings yet
Chapter14 BigData&NoSQLDatabases
39 pages
BD Unit 4
No ratings yet
BD Unit 4
45 pages
Learning Guide 2.1 - CloudDatabase - NOSQL PDF
No ratings yet
Learning Guide 2.1 - CloudDatabase - NOSQL PDF
44 pages
NoSQL D
No ratings yet
NoSQL D
26 pages
BDA Unit-2
No ratings yet
BDA Unit-2
30 pages
Bda Module-2
No ratings yet
Bda Module-2
32 pages
BIG Data 2
No ratings yet
BIG Data 2
18 pages
Nosql Databases Unit-1
No ratings yet
Nosql Databases Unit-1
16 pages
Full Stack-Unit-Iii
No ratings yet
Full Stack-Unit-Iii
56 pages
BDT Unit-Ii
No ratings yet
BDT Unit-Ii
13 pages
Bda Unit-2
No ratings yet
Bda Unit-2
29 pages
Module 1
No ratings yet
Module 1
69 pages
Big Data Analytics Unit-2
No ratings yet
Big Data Analytics Unit-2
30 pages
Ca23301-Full Stack Web Development Unit-III
No ratings yet
Ca23301-Full Stack Web Development Unit-III
61 pages
Introduction To: Nosql
No ratings yet
Introduction To: Nosql
27 pages
No SQL
No ratings yet
No SQL
109 pages
NGD Chap1
No ratings yet
NGD Chap1
22 pages
Module 2
No ratings yet
Module 2
100 pages
Features of Nosql: Non-Relational
No ratings yet
Features of Nosql: Non-Relational
7 pages
Nosql Module 1
No ratings yet
Nosql Module 1
23 pages
Unit 3 NoSQL
No ratings yet
Unit 3 NoSQL
98 pages
Overview of NoSQL
No ratings yet
Overview of NoSQL
17 pages
No SQL
No ratings yet
No SQL
38 pages
Types of NoSQL Databases - GeeksforGeeks
No ratings yet
Types of NoSQL Databases - GeeksforGeeks
9 pages
Wa0004.
No ratings yet
Wa0004.
43 pages
2 Big Data Analytics-Hadoop R21 A7902 ABP
No ratings yet
2 Big Data Analytics-Hadoop R21 A7902 ABP
16 pages
Unit 1 Mangodb
No ratings yet
Unit 1 Mangodb
57 pages
Module 3 Bigdata Analytics
No ratings yet
Module 3 Bigdata Analytics
19 pages
Big Data Analysis
No ratings yet
Big Data Analysis
9 pages
Bcse302l Dbms Module-7 Nosql
No ratings yet
Bcse302l Dbms Module-7 Nosql
30 pages
Bda - Unit 2
No ratings yet
Bda - Unit 2
30 pages
Learn SQL in 24 Hours
From Everand
Learn SQL in 24 Hours
Alex Nordeen
5/5 (4)
DBMS MASTER: Become Pro in Database Management System
From Everand
DBMS MASTER: Become Pro in Database Management System
Ummed Singh
No ratings yet
Databases: System Concepts, Designs, Management, and Implementation
From Everand
Databases: System Concepts, Designs, Management, and Implementation
Jonathan Rigdon
No ratings yet
CC104 Rev
No ratings yet
CC104 Rev
8 pages
Business Intelligence-LO1
No ratings yet
Business Intelligence-LO1
10 pages
Database Administration Help Topics For Printing
No ratings yet
Database Administration Help Topics For Printing
312 pages
Siebel Application Architecture
No ratings yet
Siebel Application Architecture
6 pages
SQL Interview Questions With Theory Answers
No ratings yet
SQL Interview Questions With Theory Answers
26 pages
Sem 620
No ratings yet
Sem 620
22 pages
Topic 2-Database-Create-Read-Update-Delete-PK-FK
No ratings yet
Topic 2-Database-Create-Read-Update-Delete-PK-FK
41 pages
Structured and Unstructured Data
No ratings yet
Structured and Unstructured Data
3 pages
Nav2013 Enus Cssol 13
No ratings yet
Nav2013 Enus Cssol 13
66 pages
Unit 3
100% (1)
Unit 3
30 pages
Unit 3-6
No ratings yet
Unit 3-6
34 pages
Unit 4 BDTT
No ratings yet
Unit 4 BDTT
23 pages
CCE Detailed Syllabus
No ratings yet
CCE Detailed Syllabus
106 pages
History of DBMS
No ratings yet
History of DBMS
3 pages
Module2 Chapter4
No ratings yet
Module2 Chapter4
17 pages
ICT Assignment For F4
No ratings yet
ICT Assignment For F4
7 pages
Database Functions
No ratings yet
Database Functions
3 pages
Test Prep Chapter 3 - ISTN212
No ratings yet
Test Prep Chapter 3 - ISTN212
35 pages
Relational Database Design by ER - To-Relational Mapping
No ratings yet
Relational Database Design by ER - To-Relational Mapping
16 pages
M - SC - (IT) Batch 2019 (10-06-2020)
No ratings yet
M - SC - (IT) Batch 2019 (10-06-2020)
82 pages
Project 1 Phase
No ratings yet
Project 1 Phase
11 pages
Nongatebook
100% (1)
Nongatebook
2,245 pages
Dmbs Notes
No ratings yet
Dmbs Notes
151 pages
Chapter 1-1.1
No ratings yet
Chapter 1-1.1
22 pages
Computer Science AKU EB
No ratings yet
Computer Science AKU EB
15 pages
Important Computer Questions For RSMSSB Informatics Assistant Exam Set 1 1
No ratings yet
Important Computer Questions For RSMSSB Informatics Assistant Exam Set 1 1
5 pages
OOAD Chapter 5 Programming Style
No ratings yet
OOAD Chapter 5 Programming Style
36 pages
Get and Post
No ratings yet
Get and Post
4 pages
Syllabus
No ratings yet
Syllabus
12 pages

Chapter 3 NoSQL Database

Uploaded by

Chapter 3 NoSQL Database

Uploaded by

Introduction To NoSQL Database

AIDS – B.E – BDA

• Limitations of Relational Database

• Business Drivers of NoSQL

• NoSQL Data Architecture Patterns

• NoSQL solution for big data

• Choosing distribution models

•A database Management System provides the mechanism to store

•There are different kinds of database Management Systems:

1. RDBMS (Relational Database Management Systems)

2. OLAP (Online Analytical Processing)

3. NoSQL (Not only SQL)

NoSQL is a set of concepts that allows the rapid and

•Provides consistency and integrity of data by

•Most of the applications store their data in JSON format.

•RDBMS don’t provide you a better way of performing

• NoSQL: Structured or semi structured data, less functionality and high

• Works on many processors

• Uses shared-nothing commodity computers

• Supports linear scalability

•Key value stores

•Wide column stores

CAP theorem is also called brewer's theorem. It states that

NoSQL relies upon a softer model known as the BASE model(instead of

 Basically Available: Guarantees the availability of the data . There

 Eventual consistency: The system will eventually become

•Key value stores

•Wide column stores

 The main idea is the use of a hash table

 Access data (values) by strings called keys

 Data has no required format

 Data model: (key, value) pairs

 Key maps to a BLOB(Binary Large Object)

 Example of Key-value store DataBase : Redis,

• Put (key, value)

 Any data type in value field

 Returned values on queries can be used to convert into lists,

 Key can be synthetic or auto generated

 No indexes are made on values.

 Do not provide traditional DBMS capabilities ,such as ACID

 Maintaining unique keys is a problem if volume is large.

•So this model is not completely unstructured but it is a kind of Semi-

•The difference between a document and Key value pair is that in

• It can be XML encoding or JSON encoding.

•The below example shows a document that can be stored in a

 Documents in a document store are roughly equivalent to the

 They are not required to adhere to a standard schema, nor will

 Generally, programs using objects have many different types of

 Document stores are similar in that they allow different types of

JSON DOCUMENT XML DOCUMENT

•Amazon SimpleDB, CouchDB, MongoDB, Riak, Lotus Notes,

•The difference between conventional databases and document-based

•These types of databases are used extensively especially in big data

 Values of a single column are stored contiguously.

 Column stores data in column specific files.

 In Column stores, query processors work on columns too.

 Works on data warehouses and business intelligence, customer relationship

management (CRM), Library card catalogs etc.

 It is capable of elegantly representing any kind of data in a highly

 Each node represents an entity (such as a student or business)

 Every node and edge is defined by a unique identifier.

 Each node knows its adjacent nodes.

•A shared nothing architecture (SN) is a distributed computing

•More specifically, none of the nodes share memory or disk storage.

•People typically contrast SN with systems that keep a large amount of

•Shared nothing is popular for web development because of its

•A SN system typically partitions its data among many nodes on

• In master-slave configuration where all incoming database requests

•The master node is called the NameNode in Hadoop.

• Google maps stores GIS in Bigtable

You might also like