Exam Professional Machine Learning Engineer topic 1 question 140 discussion - ExamTopics


AI Summary Hide AI Generated Summary

Problem: Protecting Sensitive Customer Data in ML Models

The scenario involves a retailer using sensitive customer data (AGE, IS_EXISTING_CUSTOMER, LATITUDE_LONGITUDE, SHIRT_SIZE) for training machine learning models. The question focuses on the best method to secure this data before providing it to the data science team.

Options and Suggested Solution:

Four options are presented:

  • A. Tokenize all fields using hashed dummy values. This is the suggested answer.
  • B. Use principal component analysis (PCA) to reduce the four fields to one vector.
  • C. Coarsen the data (quantiles for AGE, rounding LATITUDE_LONGITUDE).
  • D. Remove all sensitive data fields.

The suggested solution is A, which advocates for tokenizing the data using hashed dummy values to replace real values, thereby protecting sensitive information while allowing model training.

Sign in to unlock more AI features Sign in with Google

You work for a retailer that sells clothes to customers around the world. You have been tasked with ensuring that ML models are built in a secure manner. Specifically, you need to protect sensitive customer data that might be used in the models. You have identified four fields containing sensitive data that are being used by your data science team: AGE, IS_EXISTING_CUSTOMER, LATITUDE_LONGITUDE, and SHIRT_SIZE. What should you do with the data before it is made available to the data science team for training purposes?

  • A. Tokenize all of the fields using hashed dummy values to replace the real values.
  • B. Use principal component analysis (PCA) to reduce the four sensitive fields to one PCA vector.
  • C. Coarsen the data by putting AGE into quantiles and rounding LATITUDE_LONGTTUDE into single precision. The other two fields are already as coarse as possible.
  • D. Remove all sensitive data fields, and ask the data science team to build their models using non-sensitive data.
Show Suggested Answer Hide Answer
Suggested Answer: A πŸ—³οΈ

Was this article displayed correctly? Not happy with what you see?

Tabs Reminder: Tabs piling up in your browser? Set a reminder for them, close them and get notified at the right time.

Try our Chrome extension today!


Share this article with your
friends and colleagues.
Earn points from views and
referrals who sign up.
Learn more

Facebook

Save articles to reading lists
and access them on any device


Share this article with your
friends and colleagues.
Earn points from views and
referrals who sign up.
Learn more

Facebook

Save articles to reading lists
and access them on any device