Exam Professional Machine Learning Engineer topic 1 question 184 discussion - ExamTopics


A retail company seeks the best data splitting approach for sales prediction using Vertex AI, choosing between manual, default, chronological, and random splits.
AI Summary available β€” skim the key points instantly. Show AI Generated Summary
Show AI Generated Summary

You work for a retail company. You have a managed tabular dataset in Vertex AI that contains sales data from three different stores. The dataset includes several features, such as store name and sale timestamp. You want to use the data to train a model that makes sales predictions for a new store that will open soon. You need to split the data between the training, validation, and test sets. What approach should you use to split the data?

  • A. Use Vertex AI manual split, using the store name feature to assign one store for each set
  • B. Use Vertex AI default data split
  • C. Use Vertex AI chronological split, and specify the sales timestamp feature as the time variable
  • D. Use Vertex AI random split, assigning 70% of the rows to the training set, 10% to the validation set, and 20% to the test set
Show Suggested Answer Hide Answer
Suggested Answer: C πŸ—³οΈ

Was this article displayed correctly? Not happy with what you see?

Tabs Reminder: Tabs piling up in your browser? Set a reminder for them, close them and get notified at the right time.

Try our Chrome extension today!


Share this article with your
friends and colleagues.
Earn points from views and
referrals who sign up.
Learn more

Facebook

Save articles to reading lists
and access them on any device


Share this article with your
friends and colleagues.
Earn points from views and
referrals who sign up.
Learn more

Facebook

Save articles to reading lists
and access them on any device