MCQ Collection
Data Visualization MCQs
Practice Data Visualization questions with answers and explanations.
Choose an option to check your answer.
A.
Correlations cannot be colored
B.
The number of pairwise comparisons grows rapidly with the number of variables
C.
Every coefficient becomes identical
D.
Matrices can contain only two variables
Show Answer
Correct Answer: B. The number of pairwise comparisons grows rapidly with the number of variables
Explanation:
With p variables, there are p(p-1)/2 distinct pairwise correlations.
Ordering, clustering, and selective annotation can improve readability.
Choose an option to check your answer.
A.
Significant results are always caused by bias
B.
A very small effect can be statistically detectable in a large sample
C.
Practical significance can only be negative
D.
P-values directly measure cost
Show Answer
Correct Answer: B. A very small effect can be statistically detectable in a large sample
Explanation:
A test addresses evidence against a null value, not whether the effect matters in practice.
Effect sizes and domain consequences should also be reported.
Choose an option to check your answer.
A.
K-means cannot process numeric data
B.
The algorithm will create clusters even when no meaningful grouping exists
C.
It always returns only one cluster
D.
Its centroids are exact population parameters
Show Answer
Correct Answer: B. The algorithm will create clusters even when no meaningful grouping exists
Explanation:
Partitioning is guaranteed for any chosen k, but substantive clusters are not.
Stability, visualization, and domain relevance should be evaluated.
Choose an option to check your answer.
A.
Subject knowledge changes the sample size
B.
A fitted model never needs validation
C.
A statistically close fit may still imply impossible or inappropriate values
D.
All distributions have the same support
Show Answer
Correct Answer: C. A statistically close fit may still imply impossible or inappropriate values
Explanation:
Support, mechanisms, and plausible tail behavior depend on the real process.
A model should be both empirically adequate and scientifically sensible.
Choose an option to check your answer.
A.
A separate probability mass that must equal one
B.
A regression line through all points
C.
A kernel function centered at the observation
D.
A category label with no numeric position
Show Answer
Correct Answer: C. A kernel function centered at the observation
Explanation:
KDE forms the estimate by averaging kernels centered at the sample values.
Nearby kernels overlap to create a smooth curve.
Choose an option to check your answer.
A.
It can only be negative
B.
It always lies between -1 and 1
C.
Its magnitude depends on the variables' units and scales
D.
It ignores paired observations
Show Answer
Correct Answer: C. Its magnitude depends on the variables' units and scales
Explanation:
Changing a measurement unit changes covariance numerically.
Correlation standardizes covariance and is therefore easier to compare.
Choose an option to check your answer.
A.
One variable has missing labels
B.
The sample contains repeated values
C.
A third variable influences both variables and can create or distort their relationship
D.
The covariance is measured in squared units
Show Answer
Correct Answer: C. A third variable influences both variables and can create or distort their relationship
Explanation:
A confounder provides an alternative explanation for an observed relationship.
Stratification, design, and modeling are needed to assess it.
Choose an option to check your answer.
A.
The probability that the null is true
B.
The chosen sample size only
C.
A quantitative measure of the magnitude of a difference or relationship
D.
The number of chart annotations
Show Answer
Correct Answer: C. A quantitative measure of the magnitude of a difference or relationship
Explanation:
Effect sizes describe how large an observed phenomenon is.
They complement p-values and help assess practical importance.
Choose an option to check your answer.
A.
It puts all observations into one final cluster and stops
B.
It selects labeled training examples
C.
It treats each observation as its own cluster
D.
It estimates a regression slope
Show Answer
Correct Answer: C. It treats each observation as its own cluster
Explanation:
Agglomerative clustering begins with singleton clusters.
It repeatedly merges the closest clusters until a hierarchy is formed.
Choose an option to check your answer.
A.
A lognormal model
B.
An exponential model
C.
A positive-support empirical model
D.
An untransformed normal model
Show Answer
Correct Answer: D. An untransformed normal model
Explanation:
A normal distribution has support over the entire real line.
For positive skewed data, this may assign unrealistic probability to negative outcomes.
Choose an option to check your answer.
A.
A dendrogram kernel only
B.
A categorical bar kernel
C.
A survival censoring kernel
D.
A Gaussian kernel
Show Answer
Correct Answer: D. A Gaussian kernel
Explanation:
Gaussian kernels are popular because they are smooth and mathematically convenient.
Other kernels usually produce similar results when bandwidth is chosen well.
Choose an option to check your answer.
A.
From 0 to 1 only
B.
From negative infinity to positive infinity
C.
From 0 to the sample size
D.
From -1 to 1
Show Answer
Correct Answer: D. From -1 to 1
Explanation:
Pearson correlation standardizes linear association to a fixed range.
The sign gives direction and the magnitude gives linear strength.