Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Amazon Exam BDS-C00 Topic 5 Question 70 Discussion

Actual exam question for Amazon's BDS-C00 exam
Question #: 70
Topic #: 5
[All BDS-C00 Questions]

A large grocery distributor receives daily depletion reports from the field in the form of gzip archives of CSV files uploading to Amazon S3. The files range from 500MB to 5GB. These files are processes daily by an EMR job.

Recently it has been observed that the file sizes vary, and the EMR jobs take too long. The distributor needs to tune and optimize the data processing workflow with this limited information to improved the performance of the EMR job.

Which recommendation should an administrator provide?

Show Suggested Answer Hide Answer
Suggested Answer: A

Contribute your Thoughts:

Using bzip2 or Snappy could be a good option to reduce the file sizes and potentially improve the processing time. Gzip is a common compression format, but there might be better alternatives for these large files.
upvoted 0 times
...
Alaine
17 days ago
Reducing the HDFS block size to increase the number of task processors seems like a logical solution, but I'm not sure if that's the best approach here. The file sizes are quite large, so the overhead of managing more tasks might outweigh the benefits.
upvoted 0 times
...
Albina
23 days ago
I think decompressing the gzip archives and storing the data as CSV files would be the best option.
upvoted 0 times
...
Anjelica
25 days ago
I disagree, I believe we should use bzip2 or Snappy instead of gzip for the archives.
upvoted 0 times
...
Edwin
27 days ago
I think we should reduce the HDFS block size to increase task processors.
upvoted 0 times
...

Save Cancel
az-700  pass4success  az-104  200-301  200-201  cissp  350-401  350-201  350-501  350-601  350-801  350-901  az-720  az-305  pl-300  

Warning: Cannot modify header information - headers already sent by (output started at /pass.php:70) in /pass.php on line 77