Comparison Between NoSQL and RDBMS
Comparison Between NoSQL and RDBMS
Sqoop is used to -import data from a relational database management system (RDBMS) into the
Hadoop Distributed File System(HDFS),
1) Sqoop examines the database to gather the necessary metadata for the data to be imported.
2) Map-only Hadoop job : Transfers the actual data using the metadata.
Sqoop will use database name for the directory or the user can specify any alternative directory
where the files should be populated, By default these files contain comma delimited.with new lines
separating different records.
Data export from the cluster works in a similar fashion. The export is done in two steps :
2) Map-only Hadoop job to write the data to the database. Sqoop divides the input data set into
splits, then uses individual map tasks to push the splits to the database.
Illustrate the Key-Value Based Data Architecture Pattern with Example & Mention Its
Limitation
The simplest way to implement a schema-less data store is to use key-value pairs. The data
store characteristics are high performance, scalability and flexibility. Data retrieval is fast in
key-value pairs data store. A simple string called, key maps to a large data string or BLOB
(Basic Large Object). Key-value store accesses use a primary key for accessing the values.
Therefore, the store can be easily scaled up for very large data. The concept is similar to a
hash table where a unique key points to a particular item(s) of data. Figure 3.4 shows key
value pairs architectural pattern and example of students' database as key-value pairs
MongoDB Features