BDA CW Chapter 1

Download as pdf or txt
Download as pdf or txt
You are on page 1of 3

BDA CW Chapter 1: 5-10M

1. Explain the 5 Vs of Big Data and give two examples of big data case studies. Indicate which Vs are
satisfied by these case studies. [PYQ, IA1 Characteristics only]

5 Vs of Big Data:

1. Volume: Refers to the massive amount of data generated every second. Example: Social media
platforms like Twitter produce terabytes of data daily.
2. Velocity: Describes the speed at which data is generated and processed. Example: Real-time data
from IoT devices such as smart thermostats.
3. Variety: Indicates the different types of data (structured, semi-structured, and unstructured).
Example: Text, images, videos, and transactional data.
4. Veracity: Represents the quality or trustworthiness of the data. Example: Cleaning noisy data in
healthcare datasets for analysis.
5. Value: Emphasizes extracting meaningful insights and actionable intelligence from data.
Example: Retail companies analyzing customer purchase behavior to boost sales.

Case Studies:

1. E-commerce (e.g., Amazon):


o Volume: Large-scale transactional data from millions of users.
o Velocity: Real-time data processing for personalized recommendations.
o Variety: Data from clicks, searches, reviews, and payments.
o Value: Helps in targeted marketing and inventory management.
2. Healthcare (e.g., Predicting Diseases):
o Volume: Huge patient data from hospitals and devices.
o Variety: Includes medical records, scans, and genomic data.
o Veracity: High accuracy required for diagnosis and treatment.
o Value: Enables early detection of diseases and personalized treatments.

2. What is Big Data? What are the types of Big Data?

Big Data refers to large and complex datasets that cannot be managed, processed, or analyzed using
traditional data processing tools and techniques. It includes data generated from various sources like
social media, sensors, transactions, and more, characterized by the 5 Vs (Volume, Velocity, Variety,
Veracity, and Value). Big Data helps organizations derive meaningful insights and make data-driven
decisions.

Types of Big Data:

1. Structured Data:
o Organized and stored in a predefined schema like rows and columns in a database.
o Example: Transactional data in relational databases, such as sales records.
2. Semi-Structured Data:
o Partially organized but lacks a fixed schema.
o Example: XML, JSON files, email metadata (sender, recipient, timestamp).
3. Unstructured Data:
o Data that doesn’t follow a specific format or organization.
o Example: Text files, images, videos, social media posts.

3. Differentiate between Traditional Data and Big Data. [IA1]

4. List the uses of Big Data.


1. Business Analytics: Analyzes customer behavior to improve products and services.
2. Healthcare: Enables personalized medicine and predictive healthcare analytics.
3. Finance and Banking: Detects fraud and manages financial risk through data
analysis.
4. Retail: Offers personalized recommendations and optimizes inventory management.
5. Supply Chain and Logistics: Predicts demand and optimizes transportation routes.
6. Telecommunications: Optimizes network traffic and predicts customer churn.
7. Manufacturing: Anticipates equipment failures with predictive maintenance.
8. Energy and Utilities: Manages smart grids and predicts energy production from
renewables.
9. Government and Public Sector: Enhances public safety and disaster response
through predictive analytics.
10. Education: Personalizes learning experiences and monitors student performance.
5. How Big Data Analytics can be useful in development of Digital India? [PYQ]

Big Data Analytics (BDA) plays a crucial role in achieving the goals of the Digital India initiative by
driving data-driven decisions, improving service delivery, and fostering innovation. Below are specific
ways BDA can contribute:

1. Improved Governance and Policy Making

• Data-Driven Governance: Analyze large datasets from multiple sources to identify trends and
issues for better policymaking.
• Smart Cities: Manage traffic, waste, and energy more efficiently using real-time analytics from
IoT sensors and surveillance systems.

2. Enhanced Public Service Delivery

• E-Governance Portals: Analyze user behavior to optimize portals like DigiLocker and UMANG
for better user experience.
• Targeted Schemes: Use demographic and behavioral data to identify beneficiaries for programs
like Jan Dhan Yojana and Ayushman Bharat.

3. Agriculture and Rural Development

• Smart Farming: Analyze weather patterns, soil data, and crop yields to provide actionable
insights to farmers.
• Financial Inclusion: Use analytics to evaluate the impact of schemes like PM-Kisan and offer
microloans efficiently.

4. Healthcare Transformation

• Predictive Analysis: Use health records to predict disease outbreaks and optimize resource
allocation.
• Telemedicine: Improve teleconsultation services by analyzing patient data and healthcare trends.

5. Education and Skill Development

• Personalized Learning: Leverage analytics in platforms like SWAYAM to provide customized


learning experiences.
• Skill Gap Analysis: Analyze job market trends to align skill development programs like PMKVY
with industry needs.

6. Boosting Economy and Employment

• Startup Ecosystem: Use data to identify industry gaps and support startups with relevant insights.
• E-commerce and Digital Payments: Monitor transaction patterns to improve platforms like UPI
and ensure secure payments.

You might also like