You trained a model on data stored in a Cloud Storage bucket. The model needs to be retrained frequently in Vertex AI Training using the latest data in the bucket. Data preprocessing is required prior to retraining. You want to build a simple and efficient near-real-time ML pipeline in Vertex AI that will preprocess the data when new data arrives in the bucket. What should you do?
Cloud Run can be triggered on new data arrivals, which makes it ideal for near-real-time processing. The function then initiates the Vertex AI Pipeline for preprocessing and storing features in Vertex AI Feature Store, aligning with the retraining needs. Cloud Scheduler (Option A) is suitable for scheduled jobs, not event-driven triggers. Dataflow (Option C) is better suited for batch processing or ETL rather than ML preprocessing pipelines.
Mike
28 days agoDoug
1 months agoErnest
1 months agoCarri
2 days agoWynell
16 days agoDalene
18 days agoAmie
1 months agoJoesph
22 days agoJolanda
23 days agoCharlesetta
27 days agoCassie
1 months agoVirgina
1 months agoCherrie
2 months agoEve
25 days agoHan
27 days agoTammara
1 months agoTula
2 months agoNieves
2 months agoJavier
1 months agoDallas
2 months agoJackie
2 months ago