0% found this document useful (0 votes)

4 views8 pages

Data Science Interview Questions2

The document explains logistic regression, decision tree construction, and various metrics for evaluating models like RMSE, MSE, accuracy, precision, and recall. It also discusses the concept of stationary time-series data and the collaborative filtering algorithm used for recommendations on platforms like Amazon. Additionally, it includes a programming task for generating a FizzBuzz output.

Uploaded by

Ankit Kamble

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views8 pages

Data Science Interview Questions2

Uploaded by

Ankit Kamble

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

2. How is logistic regression done?

Logistic regression measures the relationship between the

dependent variable (our label of what we want to predict) and
one or more independent variables (our features) by
estimating probability using its underlying logistic function
(sigmoid).
The image shown below depicts how logistic regression works:

The formula and graph for the sigmoid function are as shown:

3. Explain the steps in making a decision

tree.
1. Take the entire data set as input
2. Calculate entropy of the target variable, as well as the
predictor attributes
3. Calculate your information gain of all attributes (we gain
information on sorting different objects from each other)
4. Choose the attribute with the highest information gain as
the root node
5. Repeat the same procedure on every branch until the
decision node of each branch is finalized
For example, let's say you want to build a decision tree to
decide whether you should accept or decline a job offer. The
decision tree for this case is as shown:
It is clear from the decision tree that an offer is accepted if:
 Salary is greater than $50,000
 The commute is less than an hour
• • Incentives are offered

8. In your choice of language,

write a program that prints
the numbers ranging from
one to 50.

But for multiples of three, print "Fizz" instead of

the number, and for the multiples of five, print
"Buzz." For numbers which are multiples of both
three and five, print "FizzBuzz"

The code is shown below:

Note that the range mentioned is 51, which means
zero to 50. However, the range asked in the
question is one to 50. Therefore, in the above
code, you can include the range as (1,51).

The output of the above code is as shown:

15. How do you find RMSE and
MSE in a linear regression
model?

RMSE and MSE are two of the most common

measures of accuracy for a linear
regression model.

RMSE indicates the Root Mean Square Error.

MSE indicates the Mean Square Error.

19. How can time-series data

be declared as stationery?

It is stationary when the variance and mean of the

series are constant with time.

Here is a visual example:

In the first graph, the variance is constant with
time. Here, X is the time factor and Y is the
variable. The value of Y goes through the same
points all the time; in other words, it is stationary.

In the second graph, the waves get bigger, which

means it is non-stationary and the variance is
changing with time.

20. How can you calculate

accuracy using a confusion
matrix?

Consider this confusion matrix:

You can see the values for total data, actual
values, and predicted values.

The formula for accuracy is:

Accuracy = (True Positive + True Negative) / Total

Observations

= (262 + 347) / 650

= 609 / 650

= 0.93

As a result, we get an accuracy of 93 percent.

21. Write the equation and

calculate the precision and
recall rate.

Consider the same confusion matrix used in the

previous question.
Precision = (True positive) / (True Positive + False
Positive)

= 262 / 277

= 0.94

Recall Rate = (True Positive) / (Total Positive +

False Negative)

= 262 / 288

= 0.90

22. 'People who bought this

also bought…'
recommendations seen on
Amazon are a result of which
algorithm?

The recommendation engine is accomplished with

collaborative filtering. Collaborative filtering
explains the behavior of other users and their
purchase history in terms of ratings, selection, etc.

The engine makes predictions on what might

interest a person based on the preferences of other
users. In this algorithm, item features are
unknown.

For example, a sales page shows that a certain

number of people buy a new phone and also buy
tempered glass at the same time. Next time, when
a person buys a phone, he or she may see a
recommendation to buy tempered glass as well.

Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
4.5/5 (2)
Machine Learning Lab Manual 06
100% (1)
Machine Learning Lab Manual 06
8 pages
What Are The Differences Between Supervised and Unsupervised Learning?
No ratings yet
What Are The Differences Between Supervised and Unsupervised Learning?
22 pages
DATA SCIENCE iNTERVIEW QUESTION
No ratings yet
DATA SCIENCE iNTERVIEW QUESTION
42 pages
What Are The Differences Between Supervised and Unsupervised Learning?
No ratings yet
What Are The Differences Between Supervised and Unsupervised Learning?
21 pages
DSBDL - Write - Ups - 4 To 7
No ratings yet
DSBDL - Write - Ups - 4 To 7
11 pages
Data Science and Machine Learning - Interview Questions
No ratings yet
Data Science and Machine Learning - Interview Questions
185 pages
Data Science Interview Questions
No ratings yet
Data Science Interview Questions
50 pages
Classification Algorithms
100% (2)
Classification Algorithms
23 pages
DS Unit 4
No ratings yet
DS Unit 4
13 pages
Top 90+ Data Science Interview Questions and Answers (2024)
No ratings yet
Top 90+ Data Science Interview Questions and Answers (2024)
38 pages
SemVII MachineLearning
No ratings yet
SemVII MachineLearning
22 pages
ML Unit Ii
No ratings yet
ML Unit Ii
30 pages
Logistic Regression 5
No ratings yet
Logistic Regression 5
61 pages
Forecasting and Learning Theory
No ratings yet
Forecasting and Learning Theory
46 pages
Chapter 6 Supervised Learning
No ratings yet
Chapter 6 Supervised Learning
6 pages
Machine Learning
No ratings yet
Machine Learning
133 pages
Data Science Interview Questions
No ratings yet
Data Science Interview Questions
31 pages
Algorithms
No ratings yet
Algorithms
5 pages
Lecture Material 11
No ratings yet
Lecture Material 11
14 pages
ML Model Paper 2 Solution
No ratings yet
ML Model Paper 2 Solution
15 pages
Aiml Unit 3
No ratings yet
Aiml Unit 3
9 pages
Part A Assignment - No - 5 PDF
No ratings yet
Part A Assignment - No - 5 PDF
8 pages
Week 4 Q&A
No ratings yet
Week 4 Q&A
7 pages
Machine Learning
No ratings yet
Machine Learning
33 pages
Model Evaluation
No ratings yet
Model Evaluation
80 pages
Machine Learning Study Experiment
No ratings yet
Machine Learning Study Experiment
5 pages
ML Short
No ratings yet
ML Short
11 pages
v0_ML
No ratings yet
v0_ML
53 pages
Lecture 7 Classification
No ratings yet
Lecture 7 Classification
33 pages
M2 - Supervised Machine Learning
No ratings yet
M2 - Supervised Machine Learning
79 pages
Unit2 ML Notes
No ratings yet
Unit2 ML Notes
19 pages
MLRS Assignment 1 24070146008 Sreemanth Mannem
No ratings yet
MLRS Assignment 1 24070146008 Sreemanth Mannem
12 pages
Broadly, There Are 3 Types of Machine Learning Algorithms.
No ratings yet
Broadly, There Are 3 Types of Machine Learning Algorithms.
33 pages
AIMLB-PGP-2025-Session-8
No ratings yet
AIMLB-PGP-2025-Session-8
52 pages
PID5108657
No ratings yet
PID5108657
8 pages
Unit V - Big Data Programming
No ratings yet
Unit V - Big Data Programming
22 pages
ML-classification Models
No ratings yet
ML-classification Models
27 pages
Machine Learning
No ratings yet
Machine Learning
41 pages
Linear Regression
No ratings yet
Linear Regression
89 pages
Ch-2 Supervised Machine Learning
No ratings yet
Ch-2 Supervised Machine Learning
48 pages
Logistic Regression
No ratings yet
Logistic Regression
21 pages
Machine Learning Model Evaluation
No ratings yet
Machine Learning Model Evaluation
11 pages
Supervised Learning
No ratings yet
Supervised Learning
24 pages
Assignment 4 Reportdocx
No ratings yet
Assignment 4 Reportdocx
10 pages
Machine Learing Algorithms
No ratings yet
Machine Learing Algorithms
13 pages
ML 04 Validation Regularization
No ratings yet
ML 04 Validation Regularization
57 pages
Ds Module 4
No ratings yet
Ds Module 4
73 pages
ML Merged
No ratings yet
ML Merged
51 pages
Paper1 Lite
No ratings yet
Paper1 Lite
18 pages
Types of Regression
No ratings yet
Types of Regression
8 pages
Day.12 Logistic Regression
No ratings yet
Day.12 Logistic Regression
8 pages
Supervised Learning - Basics
No ratings yet
Supervised Learning - Basics
115 pages
A Detailed Analysis of The Supervised Machine Learning Algorithms
No ratings yet
A Detailed Analysis of The Supervised Machine Learning Algorithms
5 pages
Lecture2_MCQ_Guide
No ratings yet
Lecture2_MCQ_Guide
8 pages
ML DL NLP Definitions
No ratings yet
ML DL NLP Definitions
22 pages
ML Classifiers
No ratings yet
ML Classifiers
48 pages
ML Unit-3 - RTU
No ratings yet
ML Unit-3 - RTU
20 pages
MC Learning
No ratings yet
MC Learning
4 pages
Accuracy Assessment and Confusion Matrix
No ratings yet
Accuracy Assessment and Confusion Matrix
23 pages
Lecture 1
No ratings yet
Lecture 1
15 pages
1 Complete SQL For Data Science Cheatsheet
No ratings yet
1 Complete SQL For Data Science Cheatsheet
3 pages
Top 40 Artificial Intelligence Interview Questions Episode 97 of
No ratings yet
Top 40 Artificial Intelligence Interview Questions Episode 97 of
27 pages
Fine-Structure Constant - Wikipedia
No ratings yet
Fine-Structure Constant - Wikipedia
16 pages
This Is A List of Scientific Laws Named After of Eponyms, See Eponym
No ratings yet
This Is A List of Scientific Laws Named After of Eponyms, See Eponym
13 pages
Unit 7 PDF
No ratings yet
Unit 7 PDF
16 pages
Unit1 PDF
No ratings yet
Unit1 PDF
9 pages
EXERCISE 8 Spss
No ratings yet
EXERCISE 8 Spss
2 pages
Decision Tree
No ratings yet
Decision Tree
3 pages
Dimensionality Reduction
No ratings yet
Dimensionality Reduction
9 pages
Ordinal Regression
No ratings yet
Ordinal Regression
4 pages
Group 8 - Sales Forecast Assignment
No ratings yet
Group 8 - Sales Forecast Assignment
7 pages
Q1: Total: Chi-Square Test
No ratings yet
Q1: Total: Chi-Square Test
3 pages
SurveyData 3
No ratings yet
SurveyData 3
49 pages
Univariate Data, 2155
No ratings yet
Univariate Data, 2155
4 pages
Neural Networks and Their Statistical Application
No ratings yet
Neural Networks and Their Statistical Application
41 pages
Kabir Khan 1147 - 4
No ratings yet
Kabir Khan 1147 - 4
4 pages
Zayyad Chapter Three Edited
No ratings yet
Zayyad Chapter Three Edited
4 pages
Chapter 10 Exercise Solutions: PM N F MNMP F
No ratings yet
Chapter 10 Exercise Solutions: PM N F MNMP F
27 pages
Malhotra16 Tif
No ratings yet
Malhotra16 Tif
11 pages
Modeling Change: Kristin Sainani PH.D
No ratings yet
Modeling Change: Kristin Sainani PH.D
34 pages
Econ 232 RE and FE Estimation Using Stata PDF
No ratings yet
Econ 232 RE and FE Estimation Using Stata PDF
5 pages
1.10 Simple Linear Regression
No ratings yet
1.10 Simple Linear Regression
9 pages
Sample Size (N) P 0.05 4 5 6 7 8 9 10 11 12 13 14 15: Spearman Rank-Order Coefficient of Correlation Page 1 of 2
No ratings yet
Sample Size (N) P 0.05 4 5 6 7 8 9 10 11 12 13 14 15: Spearman Rank-Order Coefficient of Correlation Page 1 of 2
2 pages
January 26, 2013 Chih‐Ping Chou 周志秉
No ratings yet
January 26, 2013 Chih‐Ping Chou 周志秉
42 pages
2024-2025 S2 SB Assignment
No ratings yet
2024-2025 S2 SB Assignment
3 pages
Regression in R & Python Paper
No ratings yet
Regression in R & Python Paper
38 pages
Psychological Assessment Prefinals
No ratings yet
Psychological Assessment Prefinals
6 pages
Examining Relationships Regression Facts
No ratings yet
Examining Relationships Regression Facts
10 pages
Econometrics Chapter 14, 15 & 16 PPT Slides
100% (2)
Econometrics Chapter 14, 15 & 16 PPT Slides
113 pages
Aditya Surya Pratama 36B - Tugas Statistik
No ratings yet
Aditya Surya Pratama 36B - Tugas Statistik
22 pages
Durbin Watson Statistic - Overview, How To Calculate and Interpret
No ratings yet
Durbin Watson Statistic - Overview, How To Calculate and Interpret
5 pages
Diabetes EDA and Kears Modeling
No ratings yet
Diabetes EDA and Kears Modeling
26 pages
2019 Syllabus Empirical Methods in Finance
No ratings yet
2019 Syllabus Empirical Methods in Finance
8 pages
Chapter 11: Simple Linear Regression
No ratings yet
Chapter 11: Simple Linear Regression
57 pages
Improve Quality and Efficiency of Textile Process Using Data-Driven
No ratings yet
Improve Quality and Efficiency of Textile Process Using Data-Driven
13 pages
MMW - Module 4-1
100% (1)
MMW - Module 4-1
80 pages

Data Science Interview Questions2

Uploaded by

Data Science Interview Questions2

Uploaded by

2. How is logistic regression done?

Logistic regression measures the relationship between the

3. Explain the steps in making a decision

8. In your choice of language,

But for multiples of three, print "Fizz" instead of

The code is shown below:

The output of the above code is as shown:

RMSE and MSE are two of the most common

RMSE indicates the Root Mean Square Error.

MSE indicates the Mean Square Error.

19. How can time-series data

It is stationary when the variance and mean of the

Here is a visual example:

In the second graph, the waves get bigger, which

20. How can you calculate

Consider this confusion matrix:

The formula for accuracy is:

Accuracy = (True Positive + True Negative) / Total

= (262 + 347) / 650

As a result, we get an accuracy of 93 percent.

21. Write the equation and

Consider the same confusion matrix used in the

Recall Rate = (True Positive) / (Total Positive +

22. 'People who bought this

The recommendation engine is accomplished with

The engine makes predictions on what might

For example, a sales page shows that a certain

You might also like