MCQ Collection
Big Data Analytics MCQs
Practice Big Data Analytics questions with answers and explanations.
Choose an option to check your answer.
Correct Answer: A. A list of blocks stored on a DataNode
Explanation:
Block reports help the NameNode maintain accurate block-location metadata.
They complement frequent lightweight heartbeats.
Choose an option to check your answer.
Correct Answer: A. A temporary state in which namespace changes are restricted while block status is assessed
Explanation:
At startup, the NameNode waits for enough block reports and replicas.
It exits safe mode when the file system is considered sufficiently available.
Choose an option to check your answer.
Correct Answer: A. To define how input data is split and converted into records
Explanation:
InputFormat creates InputSplits and RecordReaders.
It adapts different file structures to the key-value processing model.
Choose an option to check your answer.
Correct Answer: A. It reruns the map task, possibly on another node
Explanation:
Map input remains in HDFS and can be read again.
The scheduler chooses an available location for the retry.
Choose an option to check your answer.
Correct Answer: A. A mapper that emits each input record unchanged
Explanation:
Identity mapping preserves the input key and value.
It can be useful when only shuffle, sort, or reduce logic is needed.
Choose an option to check your answer.
Correct Answer: B. Structured, integrated data optimized for analytics and reporting
Explanation:
Warehouses organize curated data for consistent business analysis.
They commonly use schema-on-write and dimensional models.
Choose an option to check your answer.
Correct Answer: B. Placing replicas across racks to tolerate rack or network failures
Explanation:
Failures can affect an entire rack or switch.
Cross-rack replication improves resilience while balancing network cost.
Choose an option to check your answer.
Correct Answer: B. Gracefully removing a DataNode while replicating its blocks elsewhere
Explanation:
Decommissioning preserves the target replication factor before the node leaves.
It is safer than abruptly shutting down a storage node.
Choose an option to check your answer.
Correct Answer: B. Converts an InputSplit into input key-value records for the mapper
Explanation:
The RecordReader interprets the bytes within a split.
For text input, it commonly emits byte offsets and lines.
Choose an option to check your answer.
Correct Answer: B. Successful mapper outputs remain available on their local nodes for refetching
Explanation:
The new reducer attempt retrieves the same intermediate partitions again.
Lost mapper output may require only the affected mappers to be rerun.
Choose an option to check your answer.
Correct Answer: B. A reducer that emits grouped input records without additional transformation
Explanation:
Identity reduction can preserve framework grouping and sorting results.
It may also serve as a simple pass-through stage.
Choose an option to check your answer.
Correct Answer: C. Consistency
Explanation:
Consistency means all clients observe a single current value.
Under network partitions, systems may trade this against availability.