Independence Day Deal! Unlock 25% OFF Today – Limited-Time Offer - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Amazon Exam BDS-C00 Topic 3 Question 113 Discussion

Actual exam question for Amazon's BDS-C00 exam
Question #: 113
Topic #: 3
[All BDS-C00 Questions]

A customer has a machine learning workflow that consist of multiple quick cycles of reads-writes-reads on Amazon S3. The customer needs to run the workflow on EMR but is concerned that the reads in subsequent cycles will miss new data critical to the machine learning from the prior cycles.

How should the customer accomplish this?

Show Suggested Answer Hide Answer
Suggested Answer: B

Contribute your Thoughts:

Burma
1 months ago
Option A - the 'turn it on and forget it' approach. Way better than option D, the 'make up config settings as you go' approach.
upvoted 0 times
Glen
1 days ago
A) Turn on EMRFS consistent view when configuring the EMR cluster
upvoted 0 times
...
Freida
7 days ago
B) Use AWS Data Pipeline to orchestrate the data processing cycles
upvoted 0 times
...
Erasmo
13 days ago
A) Turn on EMRFS consistent view when configuring the EMR cluster
upvoted 0 times
...
...
Claudia
1 months ago
Hadoop.s3.consistency = true? That's a new one to me. I'd stick with the tried and true option A.
upvoted 0 times
Lashunda
2 days ago
Option A sounds like the best choice to ensure consistency in your data processing cycles.
upvoted 0 times
...
...
Olene
2 months ago
Setting Hadoop.data.consistency = true might work, but I'm not sure if that applies specifically to S3 data. Option A is probably safer.
upvoted 0 times
...
Amber
2 months ago
AWS Data Pipeline could work, but that adds an extra layer of complexity. I'd go with the simpler option A.
upvoted 0 times
Margurite
9 days ago
I think option A might be more straightforward for the customer.
upvoted 0 times
...
Evan
15 days ago
C) Set Hadoop.data.consistency = true in the core-site.xml file
upvoted 0 times
...
Gaynell
1 months ago
That sounds like a good idea, it should help with the data consistency.
upvoted 0 times
...
Deane
1 months ago
A) Turn on EMRFS consistent view when configuring the EMR cluster
upvoted 0 times
...
...
Kaycee
2 months ago
Option A seems like the most straightforward approach. Consistent view should help ensure the reads in subsequent cycles see the latest data.
upvoted 0 times
Thurman
18 days ago
I agree, consistency is key for the machine learning process to work effectively.
upvoted 0 times
...
Larae
23 days ago
A) Turn on EMRFS consistent view when configuring the EMR cluster
upvoted 0 times
...
Diego
28 days ago
Yes, that should help with ensuring the machine learning workflow sees the latest data.
upvoted 0 times
...
Carla
1 months ago
A) Turn on EMRFS consistent view when configuring the EMR cluster
upvoted 0 times
...
Bettye
2 months ago
That sounds like a good idea to make sure the reads are consistent.
upvoted 0 times
...
Gracia
2 months ago
A) Turn on EMRFS consistent view when configuring the EMR cluster
upvoted 0 times
...
...
Terry
2 months ago
I'm not sure, but I think option B) Use AWS Data Pipeline could also help in orchestrating the data processing cycles efficiently.
upvoted 0 times
...
Roxane
2 months ago
I agree with Elenora. EMRFS consistent view ensures that the subsequent cycles will not miss new data.
upvoted 0 times
...
Elenora
2 months ago
I think the customer should choose option A) Turn on EMRFS consistent view when configuring the EMR cluster.
upvoted 0 times
...

Save Cancel
az-700  pass4success  az-104  200-301  200-201  cissp  350-401  350-201  350-501  350-601  350-801  350-901  az-720  az-305  pl-300  

Warning: Cannot modify header information - headers already sent by (output started at /pass.php:70) in /pass.php on line 77