Data Analyst Associate (Da-100) : Module - 2
Data Analyst Associate (Da-100) : Module - 2
Data Analyst Associate (Da-100) : Module - 2
By
Mr. PARVEEZULLA
NAAC COORDINATOR
Christ Academy Institute for Advanced Studies
DA (100)
Data:
In the context of AI (Artificial Intelligence), "data" refers to the raw
information or input that is used to train, test, and improve machine learning
models. Data is a crucial component in the field of AI, as machine learning
algorithms rely on large amounts of relevant and diverse data to learn patterns
and make predictions or decisions.
DA (100)
Data Analysis:
Data analysis is the process of identifying, cleaning, transforming, and
modeling data to discover meaningful and useful information. The data is
then crafted into a story through reports for analysis to support the critical
decision-making process.
DA (100)
5.
4. Cognitive
Prescriptive Analysis
3.
Analysis
Predictive
2.
Analysis
Diagnostic
1.
Analysis
Descriptive
Analysis
DA (100)
Descriptive Analytics:
• Descriptive analytics help answer questions about what has happened based on
historical data.
• Descriptive analytics techniques summarize large datasets to describe outcomes to
stakeholders.
• By developing key performance indicators (KPIs), these strategies can help track the
success or failure of key objectives. Metrics such as return on investment (ROI) are used
in many industries, and specialized metrics are developed to track performance in
specific industries.
• An example of descriptive analytics is generating reports to provide a view of an
organization's sales and financial data
DA (100)
Diagnostic Analytics:
Diagnostic analytics help answer questions about why events happened. Diagnostic
analytics techniques supplement basic descriptive analytics, and they use the findings
from descriptive analytics to discover the cause of these events. Then, performance
indicators are further investigated to discover why these events improved or became
worse. Generally, this process occurs in three steps:
1. Identify anomalies in the data. These anomalies might be unexpected changes in a
metric or a particular market.
2. Collect data that's related to these anomalies.
3. Use statistical techniques to discover relationships and trends that explain these
anomalies
DA (100)
Predictive Analytics:
Predictive analytics Predictive analytics help answer questions about what will happen in
the future. Predictive analytics techniques use historical data to identify trends and
determine if they're likely to recur. Predictive analytical tools provide valuable insight into
what might happen in the future. Techniques include a variety of statistical and machine
learning techniques such as neural networks, decision trees, and regression.
DA (100)
Prescriptive Analytics:
Prescriptive analytics help answer questions about which actions should be taken to
achieve a goal or target. By using insights from prescriptive analytics, organizations can
make data driven decisions. This technique allows businesses to make informed decisions
in the face of uncertainty. Prescriptive analytics techniques rely on machine learning as
one of the strategies to find patterns in large datasets. By analyzing past decisions and
events, organizations can estimate the likelihood of different outcomes.
DA (100)
Cognitive Analytics:
Cognitive analytics attempt to draw inferences from existing data and patterns, derive
conclusions based on existing knowledge bases, and then add these findings back into the
knowledge base for future inferences, a self-learning feedback loop. Cognitive analytics
help you learn what might happen if circumstances change and determine how you might
handle these situations.
DA (100)
Roles in Data:
Business Analyst:
While some similarities exist between a data analyst and business analyst, the key
differentiator between the two roles is what they do with data. A business analyst is closer
to the business and is a specialist in interpreting the data that comes from the
visualization. Often, the roles of data analyst and business analyst could be the
responsibility of a single person
DA (100)
Data Analyst:
A data analyst enables businesses to maximize the value of their data assets through
visualization and reporting tools such as Microsoft Power BI. Data analysts are responsible
for profiling, cleaning, and transforming data. Their responsibilities also include designing
and building scalable and effective data models, and enabling and implementing the
advanced analytics capabilities into reports for analysis. A data analyst works with the
pertinent stakeholders to identify appropriate and necessary data and reporting
requirements, and then they are tasked with turning raw data into relevant and
meaningful insights.
DA (100)
Data Engineer:
Data engineers provision and set up data platform technologies that are on-premises and
in the cloud. They manage and secure the flow of structured and unstructured data from
multiple sources. The data platforms that they use can include relational databases,
nonrelational databases, data streams, and file stores. Data engineers also ensure that
data services securely and seamlessly integrate across data platforms.
Both database administrators and business intelligence professionals can transition to a
data engineer role; they need to learn the tools and technology that are used to process
large amounts of data.
DA (100)
Data Scientist:
Data scientists perform advanced analytics to extract value from data. Their work can vary
from descriptive analytics to predictive analytics. Descriptive analytics evaluate data
through a process known as exploratory data analysis (EDA). Predictive analytics are used
in machine learning to apply modeling techniques that can detect anomalies or patterns.
These analytics are important parts of forecast models. Descriptive and predictive
analytics are only partial aspects of data scientists' work. Some data scientists might work
in the realm of deep learning, performing iterative experiments to solve a complex data
problem by using customized algorithm
DA (100)
Database Administrator:
A database administrator implements and manages the operational aspects of cloud-
native and hybrid data platform solutions that are built on Microsoft Azure data services
and Microsoft SQL Server. A database administrator is responsible for the overall
availability and consistent performance and optimizations of the database solutions. They
work with stakeholders to identify and implement the policies, tools, and processes for
data backup and recovery plans. The role of a database administrator is different from the
role of a data engineer. A database administrator monitors and manages the overall health
of a database and the hardware that it resides on, whereas a data engineer is involved in
the process of data wrangling, in other words,
2. Model Data in Power BI
a) Load Data:
•Open Power BI Desktop.
•Click on "Get Data" to connect to your data source, such as Excel, SQL
Server, SharePoint, or others.
•Load the necessary tables or datasets into Power BI.
b) Data Transformation:
• In the "Home" tab, use the "Transform Data" option to open the Power
Query Editor.
• Perform data cleaning, filtering, and transformation tasks using the Power
Query Editor. You can remove duplicates, rename columns, change data
types, and more.
2. Model Data in Power BI
Power BI data modeling is the process of creating a relationship
between common columns of multiple tables.
2. Model Data in Power BI
c) Create Relationships:
•If your dataset involves multiple tables, establish relationships between them.
Relationships help connect data from different tables and enable more
sophisticated analysis.
•Navigate to the "Model" view and use the "Manage Relationships" option to
create and manage relationships between tables.
d) Calculated Columns:
•Add calculated columns to your tables using Data Analysis Expressions
(DAX). These columns allow you to create new data based on calculations
involving existing columns.
•You can create a calculated column by selecting the table, then choosing
"New Column" in the Modeling tab.
2. Model Data in Power BI
e) Measures:
• Create measures using DAX to perform calculations at an aggregate level.
Measures are often used for calculations like sums, averages, and other
aggregations.
•Use the "New Measure" option in the Modeling tab to define DAX measures.
f) Hierarchies:
•Create hierarchies to organize data in a more structured manner. Hierarchies
are useful for drilling down into detailed data.
•You can create hierarchies by selecting columns and using the "New
Hierarchy" option in the Modeling tab.
2. Model Data in Power BI
g) Data Categorization:
Assign appropriate data categories to your columns. For example, designate
date columns as Date, text columns as Text, etc. This helps Power BI
understand the nature of your data for better visualization and analysis.
h) Sort and Format:
Configure the sorting and formatting of your data to enhance the visual appeal
and clarity of your reports. This can be done in the Modeling tab.
i) Data Summarization:
Define summarization methods for your data fields. For example, you can
choose to summarize a numeric field as a sum, average, minimum, or
maximum.
2. Model Data in Power BI
j) Testing and Validation:
After making changes, preview the results and ensure that your data looks as
expected. Use the "Close & Apply" option to apply the changes and return to
the report view.
3. Visualise Data in Power BI
a) Connect to Data:
•Launch Power BI Desktop.
•Click on "Get Data" to connect to your data source. Power BI supports
various data sources like Excel, SQL Server, SharePoint, etc.
•Load the data into Power BI.
b) Data Modeling:
•After loading data, you might need to do some data modeling. You can create
relationships between tables, create calculated columns, and perform other
data transformation tasks.
3. Visualise Data in Power BI
c) Data Visualization:
•Once your data is loaded and modeled, go to the "Report" view.
•On the right side, you'll see a Fields pane. Drag and drop fields from your
data tables into the values, axis, or legend areas to start creating visualizations.