CompTIA Exam DY0-001 Topic 5 Question 4 Discussion

Actual exam question for CompTIA's DY0-001 exam

Question #: 4
Topic #: 5

A data scientist is developing a model to predict the outcome of a vote for a national mascot. The choice is between tigers and lions. The full data set represents feedback from individuals representing 17 professions and 12 different locations. The following rank aggregation represents 80% of the data set:

Which of the following is the most likely concern about the model's ability to predict the outcome of the vote?

AInterpolated data

BExtrapolated data

CIn-sample data

DOut-of-sample data

Show Suggested Answer

Suggested Answer: D

The aggregated feedback covers only 80% of respondents, mostly from a few professions and locations, so the model hasn't ''seen'' the remaining 20% (and those underrepresented groups). Its performance on those unseen subsets (out-of-sample data) is therefore the primary concern for how well it will predict the actual vote.

by Ruth at Jun 11, 2025, 06:48 PM

Limited Time Offer

25%

Off

Get Premium DY0-001 Questions as Interactive Web-Based Practice Test or PDF

Contribute your Thoughts:

Submit Cancel

Currently there are no comments in this discussion, be the first to comment!