Data Science and Machine Learning - MCQ
2. A graph consists of a?
A. non-empty set of vertices
B. empty set of vertices
C. Both A and B
D. None of the above
Ans : A
Explanation: The Handshaking Lemma: in a graph, the sum of the degrees of all the
vertices is equal to twice the number of edges.
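The Handshaking Lemma can be checked on a small graph. The sketch below uses a hypothetical edge list (not from the question set) and counts degrees directly.

```python
# Hypothetical undirected graph given as a list of edges (vertex pairs).
edges = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "d")]

# Each edge contributes 1 to the degree of both of its endpoints.
degree = {}
for u, v in edges:
    degree[u] = degree.get(u, 0) + 1
    degree[v] = degree.get(v, 0) + 1

degree_sum = sum(degree.values())
print(degree_sum, 2 * len(edges))  # both values are 8
```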
Explanation: If multiple edges between the same pair of vertices are allowed in a graph, it
is called a multigraph.
7. A graph in which there is a closed trail that includes every edge
of the graph is known as?
A. Hamiltonian Graphs
B. Euler Graphs
C. Planar graph
D. Directed Graph
Ans : B
Explanation: A connected graph is called an Euler graph if there is a closed trail which
includes every edge of the graph.
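A standard way to test for such a closed trail is Euler's theorem: a connected undirected graph has one exactly when every vertex has even degree. The sketch below applies that test; the function name and the sample graphs are illustrative assumptions.

```python
from collections import defaultdict

def has_euler_circuit(edges):
    """True if the undirected graph is connected (ignoring isolated
    vertices) and every vertex has even degree."""
    adj = defaultdict(list)
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    if not adj:
        return False
    # Every vertex must have even degree.
    if any(len(neighbours) % 2 for neighbours in adj.values()):
        return False
    # Depth-first search to confirm all vertices with edges are connected.
    start = next(iter(adj))
    seen = {start}
    stack = [start]
    while stack:
        node = stack.pop()
        for nxt in adj[node]:
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return len(seen) == len(adj)

triangle = [(1, 2), (2, 3), (3, 1)]   # all degrees are 2
path = [(1, 2), (2, 3)]               # end vertices have degree 1
print(has_euler_circuit(triangle), has_euler_circuit(path))
```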
8. In a 7-node directed cyclic graph, the number of Hamiltonian cycles
is
A. 180
B. 720
C. 360
D. 540
Ans : C
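The value 360 matches (n-1)!/2 for n = 7, which is the number of distinct Hamiltonian cycles in a complete undirected graph on n vertices. Reading the question as that count is an assumption. The sketch below checks the formula by brute-force enumeration.

```python
from itertools import permutations
from math import factorial

def count_hamiltonian_cycles_complete(n):
    """Count distinct Hamiltonian cycles in the complete undirected
    graph on n vertices (n >= 3) by enumeration."""
    cycles = set()
    # Fix vertex 0 as the starting point to ignore rotations.
    for rest in permutations(range(1, n)):
        # A cycle and its reverse traversal are the same undirected cycle.
        cycles.add(min(rest, rest[::-1]))
    return len(cycles)

n = 7
by_enumeration = count_hamiltonian_cycles_complete(n)
by_formula = factorial(n - 1) // 2
print(by_enumeration, by_formula)  # both values are 360
```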
Answer:b
Explanation: Number of ways in which every vertex can be connected to each
other is nC2.
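As a quick check, nC2 equals n(n-1)/2, the number of distinct vertex pairs. The value n = 5 below is only an example.

```python
from math import comb

n = 5  # example number of vertices
pairs = comb(n, 2)                  # nC2: ways to choose 2 vertices out of n
same_by_formula = n * (n - 1) // 2  # equivalent closed form
print(pairs, same_by_formula)       # both values are 10
```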
Answer:yes
Explanation: In a regular graph, degrees of all the vertices are equal. In the given
graph the degree of every vertex is 3.
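The graph referred to is not reproduced here, so the sketch below uses the complete graph on 4 vertices as a stand-in (an assumption); it is also regular with every vertex of degree 3.

```python
from itertools import combinations

def degrees(edges):
    """Return a dict mapping each vertex to its degree."""
    deg = {}
    for u, v in edges:
        deg[u] = deg.get(u, 0) + 1
        deg[v] = deg.get(v, 0) + 1
    return deg

def is_regular(edges):
    """True if every vertex that appears in an edge has the same degree."""
    return len(set(degrees(edges).values())) == 1

k4_edges = list(combinations(range(4), 2))  # stand-in graph: K4
print(is_regular(k4_edges), degrees(k4_edges))
```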
Answer:b
Explanation: The given statement is the definition of regular graphs.
2. Clustering is a-
A. Supervised learning
B. Unsupervised learning
C. Reinforcement learning
D. None
3. Which of the following clustering algorithms suffers from the problem of convergence
at local optima?
A. K-means clustering
B. Hierarchical clustering
C. Diverse clustering
D. All of the above
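K-means (option A) is the usual example of convergence to a local optimum, because the result depends on the initial centers. The sketch below runs a minimal 1-D version of Lloyd's algorithm on hypothetical data with two different initializations; the data and starting centers are assumptions chosen to show the effect.

```python
def kmeans_1d(points, centers, max_iter=100):
    """Minimal Lloyd's algorithm on 1-D data. Returns (centers, sse)."""
    centers = list(centers)
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster.
        new_centers = [
            sum(c) / len(c) if c else centers[i] for i, c in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    # Sum of squared distances from each point to its nearest center.
    sse = sum(min((p - c) ** 2 for c in centers) for p in points)
    return centers, sse

points = [0, 1, 10, 11, 20, 21]
bad_centers, bad_sse = kmeans_1d(points, [0, 1, 10])     # poor start
good_centers, good_sse = kmeans_1d(points, [0, 10, 20])  # well-spread start
print(bad_centers, bad_sse)
print(good_centers, good_sse)
```

Both runs converge, but the poorly initialized run stops at a much higher sum of squared errors.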
14. Which of the following is a method of choosing the optimal number of clusters for
k-means?
A. cross-validation
B. the silhouette method
C. the elbow method
D. All of the above
16. Which of the following clustering algorithms follows a top to bottom approach?
A. K-means
B. Divisive
C. Agglomerative
D. None
17. Which algorithm does not require a dendrogram?
A. K-means
B. Divisive
C. Agglomerative
D. All of the above
18. Which of the following clustering algorithms suffers from the problem of convergence
at local optima?
A. Takes each data point as an individual cluster
B. Goes on making clusters until it reaches an optimal number of clusters
C. Follows a top to bottom approach
D. All of the above
A. Nominal
B. Ordinal
C. Spatial
D. All of the Above
A. Generality
B. Dimensionality
C. Resolution
D. All of the Above
23. The correct way of preprocessing the data should be-
A. Imputation -> feature scaling -> training
B. Feature scaling -> imputation -> training
C. Feature scaling -> label encoding -> training
D. None
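A common ordering is to impute missing values before scaling (option A), since scaling statistics such as the minimum and maximum cannot be computed reliably over missing entries. The sketch below shows the two steps on a hypothetical feature column, using mean imputation and min-max scaling as example choices.

```python
raw = [2.0, None, 4.0, 6.0]  # hypothetical feature column with a missing value

# Step 1: imputation. Fill missing entries with the mean of observed values.
observed = [x for x in raw if x is not None]
mean = sum(observed) / len(observed)
imputed = [mean if x is None else x for x in raw]

# Step 2: feature scaling. Min-max scale the imputed column to [0, 1].
lo, hi = min(imputed), max(imputed)
scaled = [(x - lo) / (hi - lo) for x in imputed]

print(imputed)  # [2.0, 4.0, 4.0, 6.0]
print(scaled)   # [0.0, 0.5, 0.5, 1.0]
```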
27. Which is the correct order for preprocessing in Natural Language
Processing?
A. tokenization -> stemming -> lemmatization
B. lemmatization -> tokenization -> stemming
C. stemming -> tokenization -> lemmatization
D. None
1. Linear regression is- Sensitive to outliers. The regression line changes due to
outliers.
2. Which of the following evaluation metrics can be used for regression?
Regression gives continuous output, so we use mean squared error (MSE) as the
evaluation metric. The rest are used in classification.
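For reference, MSE is the average of the squared differences between true and predicted values. The values below are hypothetical.

```python
def mean_squared_error(y_true, y_pred):
    """Average of squared differences between true and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

y_true = [3.0, 5.0, 2.0]   # hypothetical targets
y_pred = [2.0, 5.0, 4.0]   # hypothetical predictions
mse = mean_squared_error(y_true, y_pred)
print(mse)  # (1 + 0 + 4) / 3
```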
3. If you fit 2 degree polynomial in linear regression- A low-degree polynomial
has a chance of underfitting; higher-degree polynomials have a chance of overfitting.
4. What will happen when you increase the size of training data?
Bias increases and Variance decreases
5. Which methods are used to find the best fit line in linear regression?
Least Square Error
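For a single feature, the least squares line has a closed form: slope = Sxy / Sxx and intercept = mean(y) - slope * mean(x). The sketch below applies it to hypothetical points that lie exactly on y = 2x + 1.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]   # hypothetical data on the line y = 2x + 1
slope, intercept = fit_line(xs, ys)
print(slope, intercept)  # 2.0 1.0
```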
11. Suppose you have observed that your data has an exponential growth tendency.
Then what regression model should you use- Polynomial regression
12. Can we perform linear regression with a neural network? Ans: yes
13. If you get poor accuracy using a simple linear regression model, what will be the
cause behind it- A. The data was not linear
15. What is a support vector? A data point that lies closest to the decision boundary (the distance between the two boundaries is the margin)
16. What is a kernel? A function that maps the value from one dimension to the other
CLASSIFICATION
A. Logistic regression
B. Linear regression
C. Polynomial regression
D. None
A classifier-
Suppose your classification model predicted true for a class whose actual value
was false. Then this is a- False positive
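The four outcomes of a binary classifier can be tallied as below. The labels are hypothetical; the false positive is the case where the prediction is true and the actual value is false.

```python
def confusion_counts(actual, predicted):
    """Count true/false positives and negatives for boolean labels."""
    counts = {"TP": 0, "FP": 0, "FN": 0, "TN": 0}
    for a, p in zip(actual, predicted):
        if p and a:
            counts["TP"] += 1
        elif p and not a:
            counts["FP"] += 1   # predicted true, actually false
        elif a:
            counts["FN"] += 1   # predicted false, actually true
        else:
            counts["TN"] += 1
    return counts

actual = [True, False, False, True, False]      # hypothetical ground truth
predicted = [True, True, False, False, False]   # hypothetical predictions
counts = confusion_counts(actual, predicted)
print(counts)  # {'TP': 1, 'FP': 1, 'FN': 1, 'TN': 2}
```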
What is the most widely used distance metric in KNN? Euclidean distance
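Euclidean distance is the straight-line distance between two points. The sketch below computes it and uses it for a 1-nearest-neighbour lookup on hypothetical training points.

```python
from math import sqrt

def euclidean(p, q):
    """Straight-line distance between two points of equal dimension."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

d = euclidean((0, 0), (3, 4))
print(d)  # 5.0

# Hypothetical labelled training points for a 1-nearest-neighbour lookup.
train = [((1.0, 1.0), "a"), ((5.0, 5.0), "b")]
query = (2.0, 1.5)
nearest_point, nearest_label = min(train, key=lambda item: euclidean(item[0], query))
print(nearest_label)  # a
```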
Which of the following is the best algorithm for text classification? Naive Bayes
What is a kernel? A function that maps the value from one dimension to the other
Why is Naive Bayes called naive? Because its assumption that the features are independent may or may not be true
How does a decision tree work? Maximizes the information gain and minimizes the
entropy
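Entropy and information gain for a candidate split can be computed as below. The labels are hypothetical: one split separates the classes perfectly, the other does not separate them at all.

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())

def information_gain(parent, children):
    """Parent entropy minus the size-weighted entropy of the child nodes."""
    total = len(parent)
    weighted = sum(len(ch) / total * entropy(ch) for ch in children)
    return entropy(parent) - weighted

parent = ["yes"] * 4 + ["no"] * 4
perfect_split = [["yes"] * 4, ["no"] * 4]
useless_split = [["yes", "yes", "no", "no"], ["yes", "yes", "no", "no"]]

gain_perfect = information_gain(parent, perfect_split)
gain_useless = information_gain(parent, useless_split)
print(gain_perfect, gain_useless)  # 1.0 0.0
```

A tree builder would prefer the first split because it has the higher gain.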
Suppose you have a dataset that is randomly distributed. What will be the best
algorithm for that dataset? Decision tree
Which pair of the algorithms are similar in operation? Decision tree and Random
forest
Which metric is not used for evaluating classification models? Mean absolute
error
REINFORCEMENT LEARNING
A. Unsupervised learning
B. Supervised learning
C. Reward-based learning
D. None
A. Topic modeling
B. Recommendation system
C. Pattern recognition
D. Image classification
A. Reinforcement algorithm
B. Supervised algorithm
C. Unsupervised algorithm
D. None
You have a task which is to show relevant ads to target users. Which algorithm
should you use for this task?
A. K means clustering
B. Naive Bayes
C. Support vector machine
D. Upper confidence bound
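Upper confidence bound (option D) is a multi-armed bandit strategy often used for ad selection. The sketch below is a minimal UCB1 loop; the deterministic click function, in which only ad 2 is ever clicked, is a hypothetical stand-in for real user feedback.

```python
from math import log, sqrt

def ucb1(reward_fn, n_arms, rounds):
    """Run UCB1 for `rounds` steps; return how often each arm was chosen."""
    counts = [0] * n_arms
    totals = [0.0] * n_arms
    for t in range(1, rounds + 1):
        if t <= n_arms:
            arm = t - 1  # play every arm once to initialise the estimates
        else:
            # Choose the arm with the highest mean reward plus exploration bonus.
            arm = max(
                range(n_arms),
                key=lambda i: totals[i] / counts[i] + sqrt(2 * log(t) / counts[i]),
            )
        counts[arm] += 1
        totals[arm] += reward_fn(arm)
    return counts

def click(ad):
    """Hypothetical feedback: only ad 2 is clicked."""
    return 1.0 if ad == 2 else 0.0

counts = ucb1(click, n_arms=3, rounds=100)
print(counts)
```

The selection counts concentrate on ad 2, while the other ads are still tried occasionally because of the exploration bonus.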
A. Supervised learning
B. Unsupervised learning
C. Reinforcement learning
D. All of the above
Thompson sampling is a-
A. Probabilistic algorithm
B. Based on Bayes inference rule
C. Reinforcement learning algorithm
D. All of the above
A. Sentiment analysis
B. Information extraction
C. Information retrieval
D. Machine translation
A. Self-driving cars
B. Pattern recognition
C. Natural language processing
D. All of the above
Which of the following statements is not correct?
A. Deep learning
B. Machine learning
C. Data mining
D. None
A. Regression problems
B. Classification problems
C. Clustering problems
D. All of the above
For an image classification task, which of the following deep learning algorithms
is best suited?
A. 1D CNN
B. 2D CNN
C. LSTM
D. None