BDA Quiz 2 Help

Download as pdf or txt
Download as pdf or txt
You are on page 1of 4

QUIZ – 2 Answers

1) Support vectors are the data points that lie closest to the decision surface.

True

============================================================================

2) The graphical representation of an SNA is made up of links and _____

Nodes

============================================================================

3) Data visualization helps in making large data set coherent. True/false

True

============================================================================

4) Which of the following statements is true regarding binary logistic regression?

The dependent variable consists of two categories

============================================================================

5) ------------ is not a functionality of datamining

Selection and interpretation

============================================================================

6) ______________ is/are the way/s to represent uncertainty.

All (?)

============================================================================

7) Where can Bayes Rule be used?

Answering probabilistic query

============================================================================

8) Advantages of Kernel func in SVM

All of the above

============================================================================
9) __________predicts future trends & behaviors, allowing business managers to make
proactive,knowledge-driven decisions

Data Mining

============================================================================

10) Study the chart and answer the questions: Pie chart given here represents the domestic
expenditure of a family in percent. Study the chart and answer the following questions if the total
monthly income of the family is Rs. 33,650 House rent per month is _________

6057

============================================================================

11) Which among the following statements is false regarding decision trees

Decision trees will never suffer from overfitting

============================================================================

12) If an ANN receives n inputs x1, x2, x3… xn with weights w1, w2, w3… wn.The weighted sum ------ is
calculated and passed to an activation function

Σ wi * xi

============================================================================

13) If A, B are two sets of items, and A⊆ B. Which of the following statement is always true?

support(A) >= support(B)

============================================================================

14) Which among the following statements is false regarding Artificial Neural Networks

ANN can handle only categorical values

============================================================================

15) TDM in text mining:

Term Document Matrix

============================================================================

16) Considering the K-means algorithm, if points (-3, 2), (-4, 2), and (-2, -1) are the only points which
are assigned to the first cluster now, what is the new centroid for this cluster
(-3,1)

============================================================================

17) Which of the following is true about Naive Bayes

Assumes that all the features in a dataset are independent

============================================================================

18) SVM less effective when

The data is noisy and contains overlapping points

============================================================================

19) The splitting criteria used in C4.5 algorithm is

Information Gain

============================================================================

20) Match the following

a-3, b-1, c-2, d-4

============================================================================

21) Real world application of SVM

Text and hypertext categorization

============================================================================

22) ____ is also known as Market Basket Analysis

Association Rule Mining

============================================================================

23) Given the regression equation Y=22.4 - 53.9X, then 22.4 is the ------- while -53.9 is the ---- of the
regression line

Intercept, slope

============================================================================

24) Applications of Text mining

All
============================================================================

25) The main purpose for structure mining is to extract previously unknown relationships between

Web pages

============================================================================

26) _____ can be used to describe nodes that contain the most amount of information about a
network

Degree centrality

============================================================================

27) Best choice of number of clusters

============================================================================

28) In regression analysis, if all the values fall on the same straight line and the line has a positive
slope then , correlation coefficient ‘r’ is

r = +1

============================================================================

29) Confusion matrix predictive accuracy

0.806

============================================================================

30) Which of the following statements regarding k-means clustering is wrong

It is suitable for discovering cluster shapes that are not hyperellipsoids

============================================================================

You might also like