Long Quiz 3

Which of the following is ALWAYS TRUE in evaluating a decision tree?
I. Sanity checks by validating the decision with decision experts will determine if the decision
rules are sound
II. Overfitting is a result of too few attributes.
Market Basket Analysis is another term used to refer to?
The following are applications of association rules EXCEPT

Which of the following is TRUE about clustering?
The most informative method is chosen using an entropy based-methods. Entropy is a measure
of what characteristics of the attribute?
The following is always true about the k-means algorithm EXCEPT
Which of the following is TRUE about classification?
I. Classification is a supervised learning method.
II. In classification, labels of training dataset are predetermined.

Which of the following is ALWAYS TRUE about unit of measurement?
I. To remedy the issue on unit measurement, rescaling of the attributes may be done by dividing
each attribute value by its standard deviation.
II. The choice for the unit of measurement of a particular object is important because it directly
affects the cluster membership of the data points.
Assuming 1,000 transactions, with {bread,wine} appearing in 450 of them, {wine} appearing in
600, and {bread} appearing in 500, then the lift (bread wine) is equal to
The output variable in a decision tree is either subscribed or not subscribes. If

P(subscribed)=0.2, what is the base entropy of the decision tree?
Which of the following decision tree algorithm is based on GINI diversity index?
Which of the following is ALWAYS TRUE about considerations regarding the implementation of
k-means?
I. K-means can handle all types of variables.
II. The k-means algorithm is sensitive to the starting positions of the initial centroid.
Which of the following are possible questions that association rules can address?
Assuming 1,000 transactions, with {bread,wine} appearing in 450 of them, {wine} appearing in
600, and {bread} appearing in 500, then the confidence (bread wine) is equal to
Mathematically, this is the percent of transactions that contain a particular itemset.
Refer to the figure below, give the number of leaves and the maximum depth of the decision
tree?
The following is an application of clustering EXCEPT
Which of the following is ALWAYS TRUE about the considerations regarding the object
attributes when performing cluster analysis?
I. On the choice of which attributes to use, it is important to understand what attributes will be
known at the time a new object will be assigned to a cluster.
II. Whenever possible and based on the data, it is best to have as many attributes as possible.
This approach to improve apriori algorithm add new candidate item sets only when all of their
subsets are estimated to be frequent.
Which of the following is ALWAYS TRUE about association rules?

I. Association rule is descriptive and not a predictive method.
II. The goal in the association rule is to discover interesting relationships hidden in a large
dataset.
Which of the following is TRUE about clustering technique?
I. Clustering is often used for exploratory data analysis.
II. Clustering methods find the similarities between objects according to the object attributes
and group the similar objects into clusters.
The following is a result from when running means on a particular dataset on 620 high school
students with attributes regarding their grades on English, Math and Science. Based on this
result, to which cluster does a student belong whose grade for English, Math and Science are
75, 70 and 72 respectively?
Which of the following is TRUE about apriori algorithm?
I. Apriori algorithm uses the downward closure property.
II. Apriori algorithm utilizes ‘pruning’ to control the exponential growth of candidate itemsets.
Which of the following is TRUE about unsupervised learning?
I. Unsupervised learning refers to the problem of finding hidden structures within unlabeled
data.
II. Decision tree technique is considered unsupervised.

Which of the following is TRUE about decision trees?
I. Classification trees are utilized for continuous response variable.
II. A decision tree can be converted into a set of decision rules.

Long Quiz 3

Uploaded by

Copyright:

Available Formats

Long Quiz 3

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Long Quiz 3

Uploaded by

Copyright:

Available Formats

Which of the following is ALWAYS TRUE in evaluating a decision tree?

II. Overfitting is a result of too few attributes.

Market Basket Analysis is another term used to refer to?

The following are applications of association rules EXCEPT

Which of the following is TRUE about classification?

I. Classification is a supervised learning method.

II. In classification, labels of training dataset are predetermined.

The output variable in a decision tree is either subscribed or not subscribes. If

I. K-means can handle all types of variables.

Which of the following is ALWAYS TRUE about association rules?

Which of the following is TRUE about clustering technique?

I. Clustering is often used for exploratory data analysis.

I. Apriori algorithm uses the downward closure property.

II. Decision tree technique is considered unsupervised.

I. Classification trees are utilized for continuous response variable.

II. A decision tree can be converted into a set of decision rules.

You might also like