Fundamentals of Data Analytics

Download as pdf or txt
Download as pdf or txt
You are on page 1of 39

Fundamentals of Data Analytics

Unit 1
Fundamentals of
Data Analytics
Fundamentals of Data Analytics

Disclaimer
The content is curated from online/offline resources and used for educational purpose only
Fundamentals of Data Analytics
Fundamentals of Data Analytics

Learning Objectives
• Why we need to learn Data analytics
• Introduction to Data Analytics
• Types of Analytics
• Components of a Modern Data
Ecosystem
• Key Steps in a Data Analysis Process
• Data Analytics vs. Data Analysis
• Data Storage Terminologies
• Introduction to Power BI
• Working with Power BI.
• Database Integration using Python
Fundamentals of Data Analytics

Why we need to learn Data Analytics ?


Fundamentals of Data Analytics

Why we need to learn Data Analytics ?


Fundamentals of Data Analytics

Data Type
Fundamentals of Data Analytics

What is Data Analytics ?


• The term "data analytics" describes the methods
used to analyze data in order to increase
productivity and financial gain.
• it is the procedure of deriving valuable, useful
insights from the analysis of raw data, which are
then used to guide and inform wise business
decisions.
• In order to analyze different behavioral patterns,
data is extracted from a variety of sources, cleaned
up, and categories.
Fundamentals of Data Analytics

Types of Analytics

Descriptive Analytics Diagnostic Analytics Predictive Analytics Prescriptive Analytics


Fundamentals of Data Analytics

Descriptive Analytics
• The simplest type of analytics is descriptive
analytics, which serves as the base for all other
types.
• It enables you to quickly summarise what occurred
or is happening by drawing trends from the raw
data.
• When used in business, it gives the analyst a view
of important metrics and measurements used by
the company.
Fundamentals of Data Analytics

Diagnostic Analytics
• This type compares concurrent trends or movements,
reveals correlations between variables, and, when
possible, establishes causal relationships.
• The question of why something happened can now be
answered by comparing historical data to other data.
• Diagnostic analytics offers in-depth perceptions into a
specific issue.
Fundamentals of Data Analytics

Predictive Analytics
• The use of predictive analytics provides an answer to the
question, "What might happen in the future?" by making
predictions about present trends or events.
• It makes use of the outcomes of descriptive and
diagnostic analytics to find clusters and exceptions as well
as make predictions about future trends.
Fundamentals of Data Analytics

Prescriptive Analytics

• Prescriptive analytics assists in providing


recommendations for action.
• Prescriptive analytics helps by making suggestions
for taking action.
• Prescriptive analytics is a type of advanced analytics
that offers many benefits, such as sophisticated
analysis based on deep learning or machine
learning.
Fundamentals of Data Analytics

Types of File Formats

Delimited text file formats

Microsoft Excel Open XML Spreadsheet, or XLSX

Extensible Markup Language, or XML

Portable Document Format, or PDF

JavaScript Object Notation, or JSON


Fundamentals of Data Analytics

Sources of Data

Relational Databases APIs and Web Services Data Streams, and Feeds.

Flat files and XML Datasets Web Scraping


Fundamentals of Data Analytics

Data Ecosystem
• A data ecosystem is a collection of business applications
and infrastructure that is used to gather and analyze data.
• It enables businesses to develop better marketing, pricing,
and business plans by helping them better understand their
customers.
• All the programming languages, algorithms, applications,
and underlying infrastructure that are used to gather,
process, and store data are collectively referred to as the
"data ecosystem".
Fundamentals of Data Analytics

Different Components of a Modern Data Ecosystem

Sensing

Storage Collection

COMPONENT
OF A DATA
ECOSYSTEM

Analysis Wrangling
Fundamentals of Data Analytics

Key Players in the Data Ecosystem

Data Engineer Data Scientists Business Intelligence


Analysts

Data Analysts Business Analysts


Fundamentals of Data Analytics

Data Analysis
Data analysis is the process of cleaning, transforming, and
processing raw data in order to extract actionable, relevant
information that assists businesses in making informed
decisions. The procedure reduces the risks associated with
decision-making by providing useful insights and statistics,
which are frequently presented in charts, images, tables, and
graphs.

Click here

Reference link
Fundamentals of Data Analytics

Key Steps in a Data Analysis Process

Defining the Collecting Cleaning the Analyzing Sharing your


question the data data the data results
Fundamentals of Data Analytics

Data Analytics vs. Data Analysis

Data Analytics Data Analysis

Data analytics is a type of traditional or Data analysis is a subset of analytics that is


generic analytics that is used in businesses to used in businesses to evaluate data and gain
make data-driven decisions. insights.
It has one or more users and is typically used It entailed defining the data, investigating it,
for data collection and inspection. cleaning it up, and modifying it to produce a
useful result.
It is divided into several stages, such as data In order to process data, raw data must first
collection and business data inspection. be defined in a meaningful way before
relevant information can be extracted from it
via data cleaning and conversion.
It uses a variety of technologies to process It analyses data using a variety of tools,
data, including Tableau, Python, Excel, including SPARK, Google Fusion tables,
Google Analytics, and others. Node XL, Excel, and others.
Click here

Reference link
Fundamentals of Data Analytics

Database
• Database is the collection of organized data that is
structured and stored electronically on a computer
system.
• Databases can store data in the form of tables
depending upon the type of database. The
database's primary goal is to store a huge amount of
data.

Examples of some databases: MySQL, Oracle,


MongoDB, PostgreSQL, SQL Server, etc.

Click here

Reference link
Fundamentals of Data Analytics

Data Marts
• A data mart is a simple type of data warehouse that
focuses on a single subject or business line, such
as sales, finance, or marketing.
• Data marts, by definition, collect data from fewer
sources than data warehouses. Internal operating
systems, a central data warehouse, and external
data can all be used as data mart sources.
Fundamentals of Data Analytics

Data Lakes
• A data lake is a central repository for vast amounts
of data in its native, raw format.
• In contrast to a hierarchical data warehouse, which
stores data in files or folders, a data lake stores data
using a flat design and object storage.‍‍
• Object storage tags data and assigns it a unique
identity, making it easier to identify and retrieve data
across regions and improving speed.
• Data lakes enable numerous applications to use
data by using low-cost object storage and open
formats.
Fundamentals of Data Analytics

What is Data Wrangling


Data exploration, transformation, validation, and availability are all steps in the iterative process of
data wrangling, also referred to as data munging, which makes the data available for a reliable and
insightful analysis.

Discovery Transformation Validation Publishing


Fundamentals of Data Analytics

Popular Data Analysis Tools


Fundamentals of Data Analytics

Python and Database Integration


What is MongoDB ?
A document-oriented NoSQL database solution called MongoDB offers strong scalability and flexibility in
addition to a robust querying system. You can create a variety of database applications quickly using
MongoDB and Python. Therefore, MongoDB is for you if your Python application requires a database that
is as adaptable as the language itself.
Fundamentals of Data Analytics

Python and Database Integration


SQL vs NoSQL

Property SQL Databases NoSQL Databases


Data model Relational Nonrelational

Structure Table-based, with columns and rows Document based, key-value


pairs, graph, or wide-column
Query language Structured Query Language (SQL) Varies from database to
database
ACID Supported Supported, depending on
transactions the specific NoSQL
database
Ability to add Need to alter the schema first Possible without disturbing
new properties anything
Fundamentals of Data Analytics

Python and Database Integration


MongoDB Database Basic Terminology

• A MongoDB Database can be called as the


container for all the collections.
• Collection is a bunch of MongoDB documents. It is
similar to tables in RDBMS.
• Document is made of fields. It is similar to a tuple in
RDBMS, but it has dynamic schema here.
Documents of the same collection need not have the
same set of fields.
Fundamentals of Data Analytics

Lab 1 - Demonstrating python integration with MongoDB


Fundamentals of Data Analytics

CRUD Operation in MongoDB


• CRUD stands for Create, Read, Update, Delete.

• So these operations when performed on the database, help us to view, search or modify data in our
database.

• C: Create a database and collection in Database

• R: Retrieving the document from the collection

• U: Updating document in collection

• D: Dropping/Deleting collection and Database


Fundamentals of Data Analytics

Lab 2 – CRUD Operation of MongoDB Using Python


Fundamentals of Data Analytics

Summary
• Implementing data analytics into the business model means companies can help reduce costs by
identifying more efficient ways of doing business. A company can also use data analytics to make
better business decisions.
• Power BI allows users to create interactive dashboards and reports that make it easy to understand
and communicate large amounts of data. This can help organizations make better decisions and
identify new opportunities.
• PyMongo is the official MongoDB driver for synchronous Python applications. You have learn how to
connect and use MongoDB from your Python application and performed various crud operations.
Fundamentals of Data Analytics

Quiz
1) To glean insights from the data, many analysts and data scientists rely on _________________.

a) Data warehouse
b) Data visualization
c) Data mining
d) All of the above

Answer: b) Data visualization


Fundamentals of Data Analytics

Quiz
2. Data Analysis is a process of________________

a) Data Cleaning
b) Transforming of data
c) Inspecting data
d) All of the above

Answer: d) All of the above


Fundamentals of Data Analytics

Quiz
3. Which is not a type analytics

a) Predictive
b) Diagnostic
c) Prescriptive
d) Perspective

Answer: d) Perspective
Fundamentals of Data Analytics

Quiz
4. Data Analytics uses ___ to get insights from data.

a) Statistical figures
b) Numerical aspects
c) Statistical methods
d) None of the mentioned above

Answer: c) Statistical methods


Fundamentals of Data Analytics

Reference
• https://bootcamp.berkeley.edu/resources/coding/learn-data-analytics/introduction-to-data-analytics/
• https://data-flair.training/blogs/power-bi-tutorial/
• https://www.w3schools.com/python/python_mysql_getstarted.asp
• https://www.tutorialspoint.com/python/python_database_access.htm
• https://intellipaat.com/blog/tutorial/data-analytics-tutorial/data-analytics-lifecycle/#no3
• https://online.hbs.edu/blog/post/types-of-data-analysis
• https://www.tutorialspoint.com/power_bi/power_bi_introduction.htm
Fundamentals of Data Analytics

Thank you...!

You might also like