Cloudera CCD-410 Cloudera Certified Developer for Apache Hadoop (CCDH) Practice Test 2 | Practice Test / Quiz / MCQs

2. Your cluster's HDFS block size is 64 MB. You have a directory containing 100 plain text files, each of which is 100 MB in size. The InputFormat for your job is TextInputFormat. Determine how many Mappers will run.

640
100
200
64
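As a rough sketch of the arithmetic behind this question: assuming the split size equals the HDFS block size and each file is split on its own (a split never spans two files under TextInputFormat), the number of map tasks is the number of files times the number of blocks each file spans. The function name `estimate_map_tasks` is hypothetical, and the sketch ignores Hadoop's small tolerance for a slightly oversized final split.

```python
import math


def estimate_map_tasks(num_files, file_size_mb, block_size_mb):
    """Estimate map tasks, assuming one split per block and per-file splitting."""
    # A 100 MB file with 64 MB blocks spans two blocks: 64 MB + 36 MB.
    splits_per_file = math.ceil(file_size_mb / block_size_mb)
    return num_files * splits_per_file


print(estimate_map_tasks(100, 100, 64))  # 200
```

The same function shows why files smaller than one block still cost one map task each: `estimate_map_tasks(100, 10, 64)` returns 100.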

3. Identify the MapReduce v2 (MRv2 / YARN) daemon responsible for launching application containers and monitoring application resource usage.

ApplicationMaster
ApplicationMasterService
ResourceManager
NodeManager

4. Which best describes what the map method accepts and emits?

It accepts a single key-value pair as input and emits a single key and list of corresponding values as output.
It accepts a single key-value pair as input and can emit any number of key-value pairs as output, including zero.
It accepts a list of key-value pairs as input and can emit only one key-value pair as output.
It accepts a single key-value pair as input and can emit only one key-value pair as output.

5. For each input key-value pair, mappers can emit:

As many intermediate key-value pairs as designed, as long as all the keys have the same type and all the values have the same type.
As many intermediate key-value pairs as designed, but they cannot be of the same type as the input key-value pair.
One intermediate key-value pair, of a different type.
As many intermediate key-value pairs as designed. There are no restrictions on the types of those key-value pairs (i.e., they can be heterogeneous).

6. You've written a MapReduce job that will process 500 million input records and generate 500 million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a significant amount of intermediate data that it needs to transfer between mappers and reducers, which is a potential bottleneck. A custom implementation of which interface is most likely to reduce the amount of intermediate data transferred across the network?

WritableComparable
Combiner
OutputFormat
Partitioner
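To make the idea of shrinking intermediate data concrete, here is a minimal simulation in plain Python rather than Hadoop code. It assumes a word-count style job in which map output is a list of `(word, 1)` pairs and local aggregation sums the counts for each key on the map side before anything crosses the network. The names `map_output` and `combine` are hypothetical.

```python
from collections import Counter


def map_output(records):
    """Emit one (word, 1) pair per word, as a word-count mapper would."""
    return [(word, 1) for line in records for word in line.split()]


def combine(pairs):
    """Sum values per key locally, before the shuffle."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return sorted(totals.items())


records = ["a b a", "b a a"]
raw = map_output(records)
combined = combine(raw)

# Six pairs leave the mapper without local aggregation, two with it.
print(len(raw), len(combined))
```

Local aggregation like this is only safe when the operation is associative and commutative (sums, counts, maxima), since the framework may apply it zero, one, or several times.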

7. In a MapReduce job, you want each of your input files processed by a single map task. How do you configure a MapReduce job so that a single map task processes each input file regardless of how many blocks the input file occupies?

Increase the parameter that controls minimum split size in the job configuration.
Set the number of mappers equal to the number of input files you want to process.
Write a custom MapRunner that iterates over all key-value pairs in the entire file.
Write a custom FileInputFormat and override the method isSplitable to always return false.

8. Assuming default settings, which best describes the order of data provided to a reducer's reduce method:

Both the keys and values passed to a reducer always appear in sorted order.
The keys given to a reducer aren't in a predictable order, but the values associated with those keys always are.
The keys given to a reducer are in sorted order but the values associated with each key are in no predictable order.
Neither keys nor values are in any predictable order.
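The default sort and shuffle can be pictured with a small simulation, again in plain Python rather than Hadoop code. It assumes each mapper's output is a list of `(key, value)` pairs, groups values by key, and sorts only the keys. The function name `shuffle` is hypothetical, and the fixed iteration order over mappers is an assumption of the sketch: on a real cluster the arrival order of values depends on which map tasks finish first.

```python
from collections import defaultdict


def shuffle(map_outputs):
    """Group values by key and sort the keys; values keep arrival order."""
    grouped = defaultdict(list)
    for pairs in map_outputs:
        for key, value in pairs:
            grouped[key].append(value)
    # Only the keys are sorted; nothing sorts the value lists.
    return [(key, grouped[key]) for key in sorted(grouped)]


reducer_input = shuffle([[("b", 2), ("a", 9)], [("a", 1), ("b", 7)]])
print(reducer_input)  # [('a', [9, 1]), ('b', [2, 7])]
```

Getting sorted values as well requires extra work in the job itself, commonly a secondary sort built on a composite key.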

9. You write a MapReduce job to process 100 files in HDFS. Your MapReduce algorithm uses TextInputFormat: the mapper applies a regular expression over input values and emits key-value pairs with the key consisting of the matching text, and the value containing the filename and byte offset. Determine the difference between setting the number of reducers to one and setting the number of reducers to zero.

There is no difference in output between the two settings.
With zero reducers, all instances of matching patterns are gathered together in one file on HDFS. With one reducer, instances of matching patterns are stored in multiple files on HDFS.
With zero reducers, instances of matching patterns are stored in multiple files on HDFS. With one reducer, all instances of matching patterns are gathered together in one file on HDFS.
With zero reducers, no reducer runs and the job throws an exception. With one reducer, instances of matching patterns are stored in a single file on HDFS.

10. A combiner reduces:

The number of output files a reducer must produce.
The number of values across different keys in the iterator supplied to a single reduce method call.
The amount of intermediate data that must be transferred between the mapper and reducer.
The number of input files a mapper must process.

11. How are keys and values presented and passed to the reducers during a standard sort and shuffle phase of MapReduce?

Keys are presented to a reducer in random order; values for a given key are not sorted.
Keys are presented to a reducer in sorted order; values for a given key are not sorted.
Keys are presented to a reducer in random order; values for a given key are sorted in ascending order.
Keys are presented to a reducer in sorted order; values for a given key are sorted in ascending order.

12. Which process describes the lifecycle of a Mapper?

The JobTracker calls the TaskTracker's configure() method, then its map() method and finally its close() method.
The TaskTracker spawns a new Mapper to process each key-value pair.
The JobTracker spawns a new Mapper to process all records in a single file.
The TaskTracker spawns a new Mapper to process all records in a single input split.

13. How are keys and values presented and passed to the reducers during a standard sort and shuffle phase of MapReduce?

Keys are presented to a reducer in sorted order; values for a given key are not sorted.
Keys are presented to a reducer in random order; values for a given key are sorted in ascending order.
Keys are presented to a reducer in sorted order; values for a given key are sorted in ascending order.
Keys are presented to a reducer in random order; values for a given key are not sorted.

14. A client application creates an HDFS file named foo.txt with a replication factor of 3. Identify which best describes the file access rules in HDFS if the file has a single block that is stored on data nodes A, B and C.

The file can be accessed if at least one of the data nodes storing the file is available.
The file will be marked as corrupted if data node B fails during the creation of the file.
Each data node locks the local file to prohibit concurrent readers and writers of the file.
Each data node stores a copy of the file in the local file system with the same name as the HDFS file.

15. Identify which best defines a SequenceFile.

A SequenceFile contains a binary encoding of an arbitrary number of heterogeneous Writable objects.
A SequenceFile contains a binary encoding of an arbitrary number of homogeneous Writable objects.
A SequenceFile contains a binary encoding of an arbitrary number of WritableComparable objects, in sorted order.
A SequenceFile contains a binary encoding of an arbitrary number of key-value pairs. Each key must be the same type. Each value must be the same type.

16. You want to run Hadoop jobs on your development workstation for testing before you submit them to your production cluster. Which mode of operation in Hadoop allows you to most closely simulate a production cluster while using a single machine?

Run all the nodes in your production cluster as virtual machines on your development workstation.
Run the DataNode, TaskTracker, NameNode and JobTracker daemons on a single machine.
Run the hadoop command with the -jt local and the -fs file:/// options.
Run simldooop, the Apache open-source software for simulating Hadoop clusters.

17. Your client application submits a MapReduce job to your Hadoop cluster. Identify the Hadoop daemon on which the Hadoop framework will look for an available slot to schedule a MapReduce operation.

TaskTracker
NameNode
DataNode
JobTracker

19. To process input key-value pairs, your mapper needs to load a 512 MB data file into memory. What is the best way to accomplish this?

Serialize the data file, insert it in the JobConf object, and read the data into memory in the configure method of the mapper.
Place the data file in the DistributedCache and read the data into memory in the configure method of the mapper.
Place the data file in the DataCache and read the data into memory in the configure method of the mapper.
Place the data file in the DistributedCache and read the data into memory in the map method of the mapper.

20. A combiner reduces:

The amount of intermediate data that must be transferred between the mapper and reducer.
The number of input files a mapper must process.
The number of values across different keys in the iterator supplied to a single reduce method call.
The number of output files a reducer must produce.

21. Given a directory of files with the following structure: line number, tab character, string.

Example:
1	abialkjfjkaoasdfjksdlkjhqweroij
2	kadfjhuwqounahagtnbvaswslmnbfgy
3	kjfteiomndscxeqalkzhtopedkfsikj

You want to send each line as one record to your Mapper. Which InputFormat should you use to complete the line: conf.setInputFormat(____.class); ?

SequenceFileAsTextInputFormat
SequenceFileInputFormat
BDBInputFormat
KeyValueFileInputFormat
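The record layout described in this question (a key, a tab, then the rest of the line) can be illustrated with a short plain-Python sketch. It splits each line at the first tab into a key and a value, which is the behavior of Hadoop's `KeyValueTextInputFormat`; when the line contains no separator, the whole line becomes the key and the value is empty. The helper name `parse_key_value_line` is hypothetical.

```python
def parse_key_value_line(line, separator="\t"):
    """Split a text line into (key, value) at the first separator."""
    # partition() leaves the value empty when the separator is absent.
    key, _, value = line.rstrip("\n").partition(separator)
    return key, value


record = parse_key_value_line("1\tabialkjfjkaoasdfjksdlkjhqweroij\n")
print(record)  # ('1', 'abialkjfjkaoasdfjksdlkjhqweroij')
```

With plain TextInputFormat, by contrast, the key would be the byte offset of the line and the value would be the entire line, including the line number and the tab.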

22. Analyze each scenario below and identify which best describes the behavior of the default partitioner.

The default partitioner assigns key-value pairs to reducers based on an internal random number generator.
The default partitioner implements a round-robin strategy, shuffling the key-value pairs to each reducer in turn. This ensures an even partition of the key space.
The default partitioner computes the hash of the key and divides that value modulo the number of reducers. The result determines the reducer assigned to process the key-value pair.
The default partitioner computes the hash of the key. Hash values between specific ranges are associated with different buckets, and each bucket is assigned to a specific reducer.
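Hash-based partitioning of the kind described in these options can be sketched in a few lines of plain Python. Hadoop's `HashPartitioner` takes the key's `hashCode()`, masks off the sign bit, and takes the remainder modulo the number of reduce tasks. In the sketch below, `zlib.crc32` stands in for `hashCode()` as an assumption (Python's built-in `hash` for strings varies between runs), and the function name `partition` is hypothetical.

```python
import zlib


def partition(key, num_reducers):
    """Pick a reducer index from a stable hash of the key."""
    key_hash = zlib.crc32(key.encode("utf-8"))  # stand-in for hashCode()
    # Mask to a non-negative 31-bit value, then take the remainder.
    return (key_hash & 0x7FFFFFFF) % num_reducers


for word in ["apple", "banana", "apple"]:
    print(word, partition(word, 4))
```

Because the result depends only on the key and the reducer count, every occurrence of a key lands on the same reducer, regardless of which mapper emitted it.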

23. What types of algorithms are difficult to express in MapReduce v1 (MRv1)?

Algorithms that require applying the same mathematical function to large numbers of individual binary records.
Large-scale graph algorithms that require one-step link traversal.
Relational operations on large amounts of structured and semi-structured data.
Algorithms that require global, shared state.

24. To process input key-value pairs, your mapper needs to load a 512 MB data file into memory. What is the best way to accomplish this?

Place the data file in the DataCache and read the data into memory in the configure method of the mapper.
Place the data file in the DistributedCache and read the data into memory in the map method of the mapper.
Serialize the data file, insert it in the JobConf object, and read the data into memory in the configure method of the mapper.
Place the data file in the DistributedCache and read the data into memory in the configure method of the mapper.

25. What data does a Reducer's reduce method process?

All data for a given key, regardless of which mapper(s) produced it.
All data produced by a single mapper.
All data for a given value, regardless of which mapper(s) produced it.
All the data in a single input file.

25 Questions

Without work one finishes nothing. - Ralph Waldo Emerson
© 2026 Fatskills.com

All trademarks, logos and brand names are the property of their respective owners. All company, product and service names used in this website are for identification purposes only. Use of these names, trademarks and brands does not imply endorsement.
