MCQ Collection
Data Visualization MCQs
Practice Data Visualization questions with answers and explanations.
Choose an option to check your answer.
A.
A default claim evaluated using sample evidence
B.
The conclusion that must always be accepted
C.
A chart with no data
D.
The sample statistic itself
Show Answer
Correct Answer: A. A default claim evaluated using sample evidence
Explanation:
The null often states no difference, no effect, or a specified parameter value.
A test assesses whether the data are sufficiently inconsistent with it.
Choose an option to check your answer.
A.
To minimize within-cluster squared distances to cluster centroids
B.
To maximize the number of clusters automatically
C.
To predict a labeled response
D.
To estimate a survival function
Show Answer
Correct Answer: A. To minimize within-cluster squared distances to cluster centroids
Explanation:
K-means assigns observations to the nearest centroid and updates centroids iteratively.
Its objective is the within-cluster sum of squares.
Choose an option to check your answer.
A.
The average pairwise distance between observations in two clusters
B.
The minimum value in each cluster
C.
The correlation of cluster labels
D.
The variance of the full dataset only
Show Answer
Correct Answer: A. The average pairwise distance between observations in two clusters
Explanation:
Average linkage balances the nearest- and farthest-pair approaches.
It uses all cross-cluster pairs in its dissimilarity calculation.
Choose an option to check your answer.
A.
A fair coin outcome coded as heads or tails
B.
A positive variable produced by multiplying many independent factors
C.
A fixed constant with no variation
D.
A signed error symmetric around zero
Show Answer
Correct Answer: B. A positive variable produced by multiplying many independent factors
Explanation:
Multiplicative effects become additive on the logarithmic scale.
A normal model on the log scale therefore yields a lognormal variable.
Choose an option to check your answer.
A.
By adding the endpoint values only
B.
By calculating the area under the density between a and b
C.
By multiplying the mean by the variance
D.
By counting category labels
Show Answer
Correct Answer: B. By calculating the area under the density between a and b
Explanation:
For a continuous variable, interval probability equals an integral of the density.
Graphically, it is the area beneath the curve over the interval.
Choose an option to check your answer.
A.
Both display only categorical counts
B.
Both estimate distribution density and have total area approximately one
C.
Both require equal-width categories
D.
Both show exact event probabilities at points
Show Answer
Correct Answer: B. Both estimate distribution density and have total area approximately one
Explanation:
A density histogram is piecewise constant, while KDE is smooth.
Their common scaling allows them to be overlaid meaningfully.
Choose an option to check your answer.
A.
Only differences between group means
B.
The strength of a monotonic relationship using ranks
C.
The probability of survival
D.
The number of clusters in a dendrogram
Show Answer
Correct Answer: B. The strength of a monotonic relationship using ranks
Explanation:
Spearman correlation replaces values with ranks and assesses monotonic association.
It is less sensitive to extreme magnitudes and can capture nonlinear monotonic patterns.
Choose an option to check your answer.
A.
A second version of the same dataset
B.
The competing claim supported when evidence contradicts the null
C.
A requirement that the sample be normal
D.
The probability of a Type I error
Show Answer
Correct Answer: B. The competing claim supported when evidence contradicts the null
Explanation:
The alternative describes the effect or difference of scientific interest.
It may be two-sided or specify a direction.
Choose an option to check your answer.
A.
A response label for every observation
B.
The number of clusters k
C.
A time-to-event variable
D.
A null hypothesis
Show Answer
Correct Answer: B. The number of clusters k
Explanation:
Standard k-means requires the desired number of clusters as an input.
Methods such as elbow or silhouette analysis can help evaluate choices.
Choose an option to check your answer.
A.
By calculating the overall mean
B.
By cutting the tree at an appropriate height
C.
By setting every branch length to zero
D.
By sorting leaves alphabetically
Show Answer
Correct Answer: B. By cutting the tree at an appropriate height
Explanation:
A horizontal cut separates branches into distinct groups.
The height or desired cluster count determines the resulting partition.
Choose an option to check your answer.
A.
It becomes a discrete PMF
B.
It becomes uniform
C.
It becomes normally distributed under the model
D.
It loses all variation
Show Answer
Correct Answer: C. It becomes normally distributed under the model
Explanation:
The defining property of a lognormal variable is normality after logging.
This can simplify modeling and visualization.
Choose an option to check your answer.
A.
No, density values are probabilities
B.
Only for discrete variables
C.
Yes, if the total area under the curve still equals 1
D.
Only when the sample size is one
Show Answer
Correct Answer: C. Yes, if the total area under the curve still equals 1
Explanation:
Density is probability per unit of measurement, not probability itself.
A narrow distribution can have a peak above one while maintaining unit area.