Exam Professional Machine Learning Engineer topic 1 question 202 discussion - ExamTopics


A startup is migrating its PySpark data science workloads from on-premises to Google Cloud, and this article explores the most cost-effective initial migration step.
AI Summary available β€” skim the key points instantly. Show AI Generated Summary
Show AI Generated Summary

You work for a startup that has multiple data science workloads. Your compute infrastructure is currently on-premises, and the data science workloads are native to PySpark. Your team plans to migrate their data science workloads to Google Cloud. You need to build a proof of concept to migrate one data science job to Google Cloud. You want to propose a migration process that requires minimal cost and effort. What should you do first?

  • A. Create a n2-standard-4 VM instance and install Java, Scala, and Apache Spark dependencies on it.
  • B. Create a Google Kubernetes Engine cluster with a basic node pool configuration, install Java, Scala, and Apache Spark dependencies on it.
  • C. Create a Standard (1 master, 3 workers) Dataproc cluster, and run a Vertex AI Workbench notebook instance on it.
  • D. Create a Vertex AI Workbench notebook with instance type n2-standard-4.
Show Suggested Answer Hide Answer
Suggested Answer: C πŸ—³οΈ

Was this article displayed correctly? Not happy with what you see?


Share this article with your
friends and colleagues.

Facebook



Share this article with your
friends and colleagues.

Facebook