Bid Data Analytics - Google Docs
Bid Data Analytics - Google Docs
ig Datarefers to extremely large sets of data thatare too complex and massive to be handled
B
by traditional data processing tools.These datasets encompass structured data (e.g.,
databases), unstructured data (e.g., text, images), and semi-structured data (e.g., JSON files).
Big Data is used in many fields—like tracking customer behavior, predicting weather, detecting
fraud, and more.
Example:
singBig Datain the right way helps companies understandthings better, make smarter
U
decisions, and plan more effectively. They use advanced methods like predicting future trends
or studying how users behave to get useful information from large amounts of data.
But working with Big Data isn’t easy. There are many challenges, like:
o deal with all this, companies need special tools and technologies made to handle big and
T
complex data.
The key characteristic features of Big Data are often described using the 5 Vs:
1. V
olume: The huge amount of data generated every day.This can be in terabytes,
petabytes, or more. Example: billions of transactions on an e-commerce site daily.
2. V
elocity: The speed at which data is created, processed,and analyzed. Data may need
to be processed in real-time or near real-time. Example: real-time social media posts or
stock market transactions happening every second.
3. V
ariety: The different types and sources of data,such as text, images, videos, sensor
data, social media posts, etc. Example: customer reviews (text), product images (visual
data), and sensor data from devices (structured data).
4. V
eracity: The trustworthiness or quality of the data.With large datasets, it's important to
ensure that the data is accurate and reliable. Example: ensuring the accuracy of
financial transaction data.
5. V
alue: The insights or benefits that can be derivedfrom analyzing the data. It’s not just
about having big data but making it useful. Without value, data is just noise. Example:
analyzing shopping patterns to improve sales.
1. U
nderstanding Customers: Big stores like Amazon andWalmart use data to know
what people buy, how often they buy, and what they like. This helps them offer better
products and deals.
2. P
roduct Suggestions: Websites show suggestions basedon what you searched
before. For example, if you look for bed covers, you’ll see ads for bed covers next time.
3. S
mart Traffic: Traffic cameras and GPS help find roadswith less traffic, so people can
reach faster and save fuel.
4. F
light Safety: Planes have sensors that check thingslike speed and weather. This helps
keep flights safe and tells when parts need repair.
5. S
elf-Driving Cars: Cars use sensors and cameras tosee the road, other cars, and
obstacles. They use this data to drive safely on their own.
6. V
oice Assistants: Tools like Siri or Google Assistantuse your location, time, and
weather data to answer your questions, like if you need an umbrella.
7. H
ealth Monitoring: Hospitals use sensors to keep trackof patients’ health, like
heartbeat or blood pressure, and warn doctors if something is wrong.
8. O
nline Learning: Learning websites show ads to peoplewho search for videos or topics
related to their courses.
9. S
aving Electricity: Smart meters check how much poweris used and suggest the best
time to use heavy machines to save on electricity bills.
10.Movies and Music: Apps like Netflix and Spotify check what you watch or listen to and
suggest similar shows or songs you might like.
1. P
rivacy Issues:Collecting vast amounts of personaldata—such as browsing habits,
health records, and location information—can lead to privacy violations if not properly
managed.
2. S
ecurity Threats:Large datasets can be a target forhackers. Unauthorized access or
data breaches can expose sensitive information, leading to financial and reputational
damage.
3. D
ata Quality Problems:If the data is incorrect, outdated,or incomplete, it can lead to
wrong conclusions and poor decisions.
4. H
igh Cost of Technology:Big Data systems requirepowerful infrastructure and tools,
which can be expensive to set up and maintain.
5. C
omplexity in Handling:Managing and analyzing huge, varied data from different
sources needs skilled people and advanced tools, which not all organizations have.
6. L
egal and Compliance Issues:Misuse or mishandlingof data can lead to violations of
data protection laws, resulting in legal penalties and loss of public trust.
7. B
ias and Misuse:If data is biased or misinterpreted,it can lead to unfair decisions,
such as in hiring or loan approvals.
1. L
imited Data Capacity:Traditional systems can't handlelarge volumes of data (Big
Data). They work well only with small to medium-sized datasets.
2. O
nly Structured Data Support:They mostly handle structureddata (tables and rows)
and can't deal well with unstructured data like images, videos, or social media posts.
3. S
low Processing:Traditional systems process datain batches and can be very slow
with large or real-time data.
4. S
calability Issues:These systems can't easily growor expand. Adding more data often
slows them down or needs expensive upgrades.
5. E
xpensive for Big Data:Upgrading hardware and softwareto manage big data in a
traditional system can be very costly.
6. L
ack of Flexibility:It's hard to make quick changesor updates in traditional systems
when new types of data or new business needs arise.
7. D
ata Integration Problems:Combining data from multiplesources (like websites,
sensors, and social media) is difficult with conventional systems.
eb data
W
Web datarefers to any type of data that is generated,stored, or exchanged over the World
Wide Web. It includes both structured and unstructured information collected from websites,
online platforms, and internet-based services.
1. S
tructured Web Data: Organized in a predefined manner,typically found in databases
or spreadsheets. Examples: Product listings on e-commercesites, job postings, or
financial data tables.
2. S
emi-Structured Web Data: Doesn't conform strictlyto structured formats but contains
tags or markers to separate data elements.Examples:HTML pages, XML files, JSON
responses from APIs.
3. U
nstructured Web Data: Lacks a specific format orstructure, making it more
challenging to process.Examples: Blog posts, socialmedia comments, images, videos.
● Used in web analytics, digital marketing, sentiment analysis, and data mining
● Enables recommendation systems, personalized ads, and market trends analysis