MCQ Collection
Data Mining MCQs
Practice Data Mining questions with answers and explanations.
Choose an option to check your answer.
Correct Answer: A. {A, B, C} must be infrequent
Explanation:
A superset cannot have more support than its subset.
Because {A, B} is below threshold, any superset containing it is also below threshold.
Choose an option to check your answer.
Correct Answer: A. An item label, a count, and links within the tree structure
Explanation:
The count records how many transactions share the represented prefix.
Links connect nodes to parents, children, and same-item occurrences.
Choose an option to check your answer.
Correct Answer: A. A model that assigns observations to predefined classes
Explanation:
A classifier learns a decision rule from labeled examples.
It then predicts class labels for new observations.
Choose an option to check your answer.
Correct Answer: A. It focuses on positive-case retrieval and is sensitive to false positives
Explanation:
ROC curves can appear optimistic when true negatives dominate.
Precision-recall analysis emphasizes performance on the minority class.
Choose an option to check your answer.
Correct Answer: A. Splits depend on order and thresholds rather than distance magnitude
Explanation:
Multiplying a feature by a positive constant changes threshold values but not ordering.
Distance-based domination is therefore not a concern.
Choose an option to check your answer.
Correct Answer: A. An unseen feature-class combination makes the estimated likelihood zero
Explanation:
Multiplying by a zero likelihood makes the entire class posterior zero.
Smoothing assigns small nonzero probabilities to unseen events.
Choose an option to check your answer.
Correct Answer: A. High bias and dominance by broad class prevalence
Explanation:
Large neighborhoods smooth local structure and may underfit.
The majority class can overwhelm minority patterns.
Choose an option to check your answer.
Correct Answer: B. {A, C} is frequent
Explanation:
Every subset of a frequent itemset must meet minimum support.
The subset may have greater support than the full itemset.
Choose an option to check your answer.
Correct Answer: B. To link all nodes containing the same item
Explanation:
Header links allow efficient traversal of an item's occurrences across the tree.
They are used to construct conditional pattern bases.
Choose an option to check your answer.
Correct Answer: B. Fitting the model's parameters or decision rules
Explanation:
The model learns patterns from training examples.
Performance must be assessed on separate data to estimate generalization.
Choose an option to check your answer.
Correct Answer: B. Preserving approximately the same class proportions in each split
Explanation:
Stratification reduces accidental class imbalance between partitions.
It is especially useful for small or imbalanced datasets.
Choose an option to check your answer.
Correct Answer: B. It may favor attributes with many distinct values
Explanation:
A high-cardinality attribute can create many pure but tiny partitions.
Gain ratio or regularization can reduce this preference.