Data_Management_System_Important_Question_1685003411175

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 4

1.

Assume you have been assigned to a team that will be creating an inventory tracking
system. Your boss has requested each team leader to provide a basic work plan to
the next meeting as part of the project kick-off. These work plans will be examined
at that meeting to establish the overall project schedule, budget, labour needs, and
software requirements.
For the time being, you have been required to present a work plan that specifies the
phases of data design and contains the following information for each step as the
team leader for the data design team:
Data Design Phases
Input and Output phases
Issues and Challenges

2. You are tasked with creating an ER diagram for a new database for a fictional
company. The database is intended to track information about employees,
departments, and projects. The following information has been gathered from the
company's management: An employee is identified by a unique employee ID and has
a name, address, and phone number. An employee is assigned to a single
department, and a department has a unique department ID, name, and location. A
project has a unique project ID, a name, a description, and a start and end date. An
employee can be assigned to multiple projects, and a project can have multiple
employees assigned to it. Each employee assigned to a project may have a specific
role in that project, such as project manager, developer, or tester.

Create an ER diagram for this database.

3. You are a data analyst at a large retail company. The company has recently
implemented a NoSQL database to handle their large and rapidly growing data sets.
The company has also implemented Hadoop for processing and analysing big data.
The NoSQL database contains customer information, including their name, age,
gender, location, and purchase history. The purchase history includes the date of
purchase, the product purchased, and the quantity purchased. The Hadoop
implementation includes a cluster of nodes that are used to analyse the data in the
NoSQL database. The data is processed using Hadoop MapReduce jobs, which are
designed to identify trends in customer behaviour and help the company better
target their marketing efforts.

How is NoSQL different from traditional relational databases?


How does the NoSQL database help the retail company handle their large data sets?
How is Hadoop used in the retail company's data analysis process?
What are some benefits of using Hadoop to process big data?
What types of MapReduce jobs might the retail company use to analyse their
customer data?
How might the retail company use the insights gained from analysing their customer
data to improve their business?
What are some potential challenges the retail company might face with their NoSQL
and Hadoop implementation?
How might the retail company address these challenges?
4. DHL takes pleasure in having up-to-date information on the status and location of
each dispatched item. The firm relies on a company-wide information system to do
this. The heart of the company's product tracking information system is sent
products. Shipped objects are identified by their unique item number, weight,
dimensions, insurance amount, destination, and ultimate delivery date. Shipped
products are received and entered into the corporate system at a single retail
location. Retail centres are distinguished by their type, unique ID, and location.
Shipped products are transported to their destination using one or more
conventional corporate transportation activities (i.e., flights, truck deliveries). These
transportation events are distinguished by a distinct scheduleNumber, a type (for
example, airline, truck), and a deliveryRoute.

Please draw an Entity Relationship diagram that depicts this information about the
company's system. Make sure to include identifiers and cardinality limitations.

5. A multinational corporation is looking to improve its supply chain management and


has collected data on various aspects of its operations such as inventory levels,
shipping times, and delivery reliability. The company wants to use this data to
identify bottlenecks in its supply chain and make data-driven improvements.

What steps would you take to analyse the data collected by the corporation to
improve its supply chain management?

Include the use of big data management tools and technologies in your

Explain each step-in detail Provide examples and use case scenarios to support your
explanation.

6. You have been assigned to create a data model for an e-commerce website. The
website needs to store information about customers, products, orders, and reviews.
The following information has been gathered:

Customers have a unique customer ID, name, email address, and shipping address.
Products have a unique product ID, name, description, price, and category.
Orders have a unique order ID, customer ID, order date, and total amount.
Each order can contain multiple products, and each product can be part of multiple
orders.
Customers can write reviews for products, and each review has a unique review ID,
customer ID, product ID, rating, and comments.

7. Production tracking is essential in many production situations (for example, the


pharmaceutical business, children's toys, and so on). The ER diagram below collects
critical information in production tracking. The ER diagram depicts the links
between manufacturing lots (or batches), specific production units, and raw
materials.
a) Please create a relational database structure from the ER diagram. Make sure to
include main keys and limits on referential integrity.
b) Please identify an attribute in the following ER diagram that may represent a
composite attribute and explain why/how it may do so.
c) The present ER diagram has the following connection, “raw materials are
utilized in 0 to many lots.” Please explain how the meaning changes in the
context of manufacturing if the minimal cardinality is changed to "1" (i.e., the
connection becomes "raw materials are utilised in 1 to many quantities.")
d) There are various instances of data redundancy in the ER diagram/relational
database structure. Please provide one example of a data redundancy problem.

8. A social media platform wants to implement a recommendation system to suggest


relevant content to its users based on their preferences and interactions. How can a
database management system support this functionality by efficiently storing and
retrieving user data and analysing relationships between users and content?

9. Certain University wants to develop a database system to manage its course


registration process. The university offers multiple programs, each consisting of
several courses. Each course has a unique course code, title, and credit hours.
Students can enrol in multiple courses, and each student has a unique student ID,
name, and email address. Additionally, each student can belong to a specific
program. Based on this scenario, please create an Entity-Relationship (ER) diagram
representing the database schema for the course registration system. Identify the
entities, their attributes, and the relationships between them.
10. A social media platform is experiencing exponential growth in the number of users
and posts. The platform wants to ensure scalability and flexibility in handling user-
generated content while maintaining high performance. They are considering
adopting a NoSQL database for their system.

a. How would the choice of a NoSQL database benefit the social media platform
in terms of scalability and handling the increasing volume of user-generated
content? The social media platform wants to implement a feature that allows
users to tag their posts with multiple keywords or categories.
b. How can a NoSQL database model accommodate this requirement compared
to a traditional relational database?

11. A retail company operates multiple stores across different regions and wants to
leverage big data analytics to gain insights into customer behavior and optimize its
marketing strategies. The company has access to various data sources, including
point-of-sale transactions, customer loyalty programs, website traffic logs, and social
media interactions.

Question:
What steps would you take to analyse the company's big data and derive
actionable insights for improving its marketing strategies? Explain each step
in detail, including the use of relevant big data management tools and
technologies.

12. A technology consulting firm has been approached by a company operating a large-
scale data analytics platform. The company collects and processes vast amounts of
structured and unstructured data from various sources, such as IoT devices, social
media, and customer interactions. They want to improve their data analytics
capabilities and require a suitable database option.

Which database option (SQL, NoSQL, or NewSQL) would you recommend for this
data analytics platform and why? Justify your choice, considering the specific
requirements and challenges of handling large-scale data analytics.

Please provide a detailed explanation of your recommendation, highlighting the


advantages and relevance of the chosen database option in addressing the company's
needs effectively

You might also like