Independence Day Deal! Unlock 25% OFF Today – Limited-Time Offer - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Google Exam Professional Machine Learning Engineer Topic 4 Question 74 Discussion

Actual exam question for Google's Professional Machine Learning Engineer exam
Question #: 74
Topic #: 4
[All Professional Machine Learning Engineer Questions]

You are developing an ML model to predict house prices. While preparing the data, you discover that an important predictor variable, distance from the closest school, is often missing and does not have high variance. Every instance (row) in your data is important. How should you handle the missing data?

Show Suggested Answer Hide Answer
Suggested Answer: A

Sampled Shapley is a fast and scalable approximation of the Shapley value, which is a game-theoretic concept that measures the contribution of each feature to the model prediction. Sampled Shapley is suitable for online prediction requests, as it can return feature attributions with minimal latency. The path count parameter controls the number of samples used to estimate the Shapley value, and a lower value means faster computation. Integrated Gradients is another explanation method that computes the average gradient along the path from a baseline input to the actual input. Integrated Gradients is more accurate than Sampled Shapley, but also more computationally intensive. Therefore, it is not recommended for online prediction requests, especially with a high path count. Prediction drift is the change in the distribution of feature values or labels over time. It can affect the performance and accuracy of the model, and may require retraining or redeploying the model. Vertex AI Model Monitoring allows you to monitor prediction drift on your deployed models and endpoints, and set up alerts and notifications when the drift exceeds a certain threshold. You can specify an email address to receive the notifications, and use the information to retrigger the training pipeline and deploy an updated version of your model. This is the most direct and convenient way to achieve your goal. Training-serving skew is the difference between the data used for training the model and the data used for serving the model. It can also affect the performance and accuracy of the model, and may indicate data quality issues or model staleness. Vertex AI Model Monitoring allows you to monitor training-serving skew on your deployed models and endpoints, and set up alerts and notifications when the skew exceeds a certain threshold. However, this is not relevant to the question, as the question is about the feature attributions of the model, not the data distribution.Reference:

Vertex AI: Explanation methods

Vertex AI: Configuring explanations

Vertex AI: Monitoring prediction drift

Vertex AI: Monitoring training-serving skew


Contribute your Thoughts:

Lawanda
1 months ago
Ah, the classic 'replace with zeros' approach. It's like trying to fix a leaky faucet with duct tape - it just doesn't work!
upvoted 0 times
...
Vivan
1 months ago
Replacing missing values with zeros? That's about as useful as a chocolate teapot. We need a better solution here.
upvoted 0 times
Mari
8 days ago
In that case, applying feature crossing with another column might be a good alternative.
upvoted 0 times
...
Lorita
16 days ago
But what if the missing values are not predictable?
upvoted 0 times
...
Hyun
17 days ago
I think predicting the missing values using linear regression could be a better approach.
upvoted 0 times
...
Omega
20 days ago
Agreed, replacing missing values with zeros is not a good idea.
upvoted 0 times
...
...
Izetta
1 months ago
Predicting the missing values using linear regression? Hmm, I'm not sure that's a good idea if the variable doesn't have high variance.
upvoted 0 times
Delpha
5 days ago
User2: User1, that's a good suggestion. It could help capture the relationship between the missing variable and the other column.
upvoted 0 times
...
Clorinda
10 days ago
User1: Maybe you could try applying feature crossing with another column that does not have missing values.
upvoted 0 times
...
Antonette
11 days ago
C) Predict the missing values using linear regression.
upvoted 0 times
...
Hubert
20 days ago
B) Apply feature crossing with another column that does not have missing values.
upvoted 0 times
...
Vincenza
1 months ago
A) Delete the rows that have missing values.
upvoted 0 times
...
...
Whitley
2 months ago
Ooh, feature crossing! Sounds fancy. But won't that just mask the problem instead of actually solving it?
upvoted 0 times
Darrel
12 days ago
C) Predict the missing values using linear regression.
upvoted 0 times
...
Fernanda
26 days ago
Ooh, feature crossing! Sounds fancy. But won't that just mask the problem instead of actually solving it?
upvoted 0 times
...
Barabara
1 months ago
B) Apply feature crossing with another column that does not have missing values.
upvoted 0 times
...
Basilia
2 months ago
A) Delete the rows that have missing values.
upvoted 0 times
...
...
Ellen
2 months ago
Deleting rows with missing values? That's like throwing the baby out with the bathwater. We need to preserve every data point!
upvoted 0 times
Stephaine
1 months ago
User 2: How about predicting the missing values using linear regression?
upvoted 0 times
...
Percy
1 months ago
User 1: I agree, we can't just delete rows with missing values.
upvoted 0 times
...
...
Virgina
2 months ago
I think deleting the rows with missing values is the best option to ensure accurate predictions.
upvoted 0 times
...
Johna
2 months ago
I disagree, I believe we should apply feature crossing with another column that does not have missing values.
upvoted 0 times
...
Socorro
3 months ago
I think we should predict the missing values using linear regression.
upvoted 0 times
...

Save Cancel
az-700  pass4success  az-104  200-301  200-201  cissp  350-401  350-201  350-501  350-601  350-801  350-901  az-720  az-305  pl-300  

Warning: Cannot modify header information - headers already sent by (output started at /pass.php:70) in /pass.php on line 77