MCQ Collection
Data Mining MCQs
Practice Data Mining questions with answers and explanations.
Choose an option to check your answer.
A.
Combining two databases
B.
Converting a continuous attribute into intervals or categories
C.
Removing all categories
D.
Estimating class probabilities
Show Answer
Correct Answer: B. Converting a continuous attribute into intervals or categories
Explanation:
Discretization replaces numeric ranges with bins such as low, medium, and high.
It can simplify patterns but may lose detail.
Choose an option to check your answer.
A.
An itemset occurring k times only
B.
An itemset containing exactly k items
C.
A transaction with k customers
D.
A rule with k conclusions
Show Answer
Correct Answer: B. An itemset containing exactly k items
Explanation:
The value k refers to the number of distinct items in the set.
A three-itemset contains exactly three items.
Choose an option to check your answer.
A.
Support of X ∪ Y
B.
Confidence
C.
Lift
D.
Leverage
Show Answer
Correct Answer: B. Confidence
Explanation:
Confidence uses the antecedent support in its denominator.
Therefore confidence(X → Y) can differ from confidence(Y → X).
Choose an option to check your answer.
A.
Generate all association rules
B.
Find frequent one-itemsets
C.
Build a classification tree
D.
Cluster the transactions
Show Answer
Correct Answer: B. Find frequent one-itemsets
Explanation:
Apriori begins by counting individual item frequencies.
Only frequent one-itemsets can participate in larger frequent sets.
Choose an option to check your answer.
A.
To increase every numeric value
B.
To convert all variables to text
C.
To detect and correct errors, inconsistencies, and missing values
D.
To guarantee a balanced class distribution
Show Answer
Correct Answer: C. To detect and correct errors, inconsistencies, and missing values
Explanation:
Data cleaning improves reliability before modeling.
Poor-quality inputs can create misleading patterns and unstable models.
Choose an option to check your answer.
A.
Creating bins with equal numbers of observations
B.
Using one bin per unique value
C.
Dividing the numeric range into intervals of the same width
D.
Sorting categories alphabetically
Show Answer
Correct Answer: C. Dividing the numeric range into intervals of the same width
Explanation:
Equal-width binning partitions the value range uniformly.
It is simple but can create sparse bins for skewed data.
Choose an option to check your answer.
A.
The number of items not in the set
B.
The confidence of every rule from the set
C.
The number of transactions containing the itemset
D.
The number of classes in the dataset
Show Answer
Correct Answer: C. The number of transactions containing the itemset
Explanation:
Support count is an absolute frequency.
It shows how many transactions contain all items in the itemset.
Choose an option to check your answer.
A.
A frequent itemset with no subsets
B.
An itemset appearing once
C.
A frequent itemset with no proper superset having the same support
D.
A rule whose confidence is one
Show Answer
Correct Answer: C. A frequent itemset with no proper superset having the same support
Explanation:
Closed itemsets compress frequent-pattern output without losing support information.
Any itemset sharing support with a larger superset is not closed.
Choose an option to check your answer.
A.
Creating random class labels
B.
Generating synthetic transactions
C.
Creating possible k-itemsets from frequent (k-1)-itemsets
D.
Selecting a test set
Show Answer
Correct Answer: C. Creating possible k-itemsets from frequent (k-1)-itemsets
Explanation:
The join step combines compatible frequent itemsets.
A subsequent prune step removes candidates with infrequent subsets.
Choose an option to check your answer.
A.
Nominal scale
B.
Ordinal scale
C.
Ratio scale
D.
Interval scale
Show Answer
Correct Answer: D. Interval scale
Explanation:
Interval scales support meaningful differences but not meaningful ratios.
Temperature in Celsius is a common example because zero is arbitrary.
Choose an option to check your answer.
A.
To discover useful and previously unknown patterns in data
B.
To store data without analysis
C.
To replace all database systems
D.
To guarantee causal conclusions
Show Answer
Correct Answer: A. To discover useful and previously unknown patterns in data
Explanation:
Data mining applies computational and statistical methods to reveal actionable structure in data.
Its focus is knowledge discovery rather than simple storage or reporting.
Choose an option to check your answer.
A.
Ratio scale
B.
Nominal scale
C.
Ordinal scale
D.
Interval scale
Show Answer
Correct Answer: A. Ratio scale
Explanation:
Ratio variables have equal intervals and an absolute zero point.
Examples include weight, distance, and transaction amount.