Exam Professional Machine Learning Engineer topic 1 question 176 discussion - ExamTopics


A food company seeks the most efficient method to preprocess BigQuery sales data for TensorFlow model training in Vertex AI, choosing between Spark/Dataproc, in-place BigQuery SQL, TensorFlow preprocessing, or a Dataflow pipeline.
AI Summary available — skim the key points instantly. Show AI Generated Summary
Show AI Generated Summary

You work for a food product company. Your company’s historical sales data is stored in BigQuery.You need to use Vertex AI’s custom training service to train multiple TensorFlow models that read the data from BigQuery and predict future sales. You plan to implement a data preprocessing algorithm that performs mm-max scaling and bucketing on a large number of features before you start experimenting with the models. You want to minimize preprocessing time, cost, and development effort. How should you configure this workflow?

  • A. Write the transformations into Spark that uses the spark-bigquery-connector, and use Dataproc to preprocess the data.
  • B. Write SQL queries to transform the data in-place in BigQuery.
  • C. Add the transformations as a preprocessing layer in the TensorFlow models.
  • D. Create a Dataflow pipeline that uses the BigQuerylO connector to ingest the data, process it, and write it back to BigQuery.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Was this article displayed correctly? Not happy with what you see?


Share this article with your
friends and colleagues.

Facebook



Share this article with your
friends and colleagues.

Facebook