You want to analyze a large collection of images. You want to store this data in HDFS and process it with MapReduce, but you also want your data analysts and data scientists to be able to process the data directly from HDFS with an interpreted, high-level programming language such as Python. Which format should you use to store this data in HDFS?

This question is part of a full practice quiz (53 questions in total): Cloudera CCD-410 Cloudera Certified Developer for Apache Hadoop (CCDH) Practice Test 2.

