gcp-professional-machine-learning-engineer-study-guide

Professional Machine Learning Engineer Prep:

The Professional Machine Learning Engineer (PMLE) certification tests your ability to build, deploy, and monitor ML solutions using Gemini Enterprise Agent Platform, BigQuery ML, and Model Garden. This guide maps every domain and objective to verified Google Cloud documentation.

You can also explore more GCP certification study guides on the GCP category page to keep building your skills.

Google Machine Learning Engineer Materials:

Coursera	Google Cloud Machine Learning Engineer Professional Certificate
Udemy	Google Cloud Professional Machine Learning Engineer
Whizlabs	Certified Professional Machine Learning Engineer

Section 1: Architecting low-code AI solutions (~13% of the exam)

1.1 Developing ML models using BigQuery ML or AutoML on Gemini Enterprise Agent Platform. Considerations include:

Building models in BigQuery ML or Agent Platform AutoML (e.g., classification, regression, forecasting, and clustering) based on the business problem

Introduction to ML in BigQuery

Gemini Enterprise Agent Platform for BigQuery users

AutoML training overview

AutoML beginner’s guide

1.2 Building AI solutions using Google Cloud AI APIs or foundational models. Considerations include:

Evaluating and selecting the appropriate model for a given task from Gemini Enterprise Agent Platform Model Garden

Overview of Model Garden

Overview of models on Agent Platform

Use models in Model Garden

Building applications using industry-specific APIs (e.g., Document AI API, Vision API, and Translate API)

Document AI overview

Cloud Vision API documentation

Overview of the Cloud Translation API

Building solutions and tuning models for specific use cases (e.g., Gemini, Imagen, Veo, and models as a service in Model Garden)

Generate images with Gemini

Generate videos with Veo on Gemini Enterprise Agent Platform

Introduction to tuning

Use models in Model Garden

Optimizing Gemini-based applications for cost, latency, and availability

Monitor models

Model versions and lifecycle

Overview of Model Garden

Section 2: Collaborating within and across teams to manage data and models (~16% of the exam)

2.1 Exploring and preprocessing data for ML. Considerations include:

Organizing and exploring different data types (e.g., tabular, text, and images) for efficient experimenting, training, and serving

AutoML training overview

Train and use your own models

About Feature Store on Gemini Enterprise Agent Platform

Choosing the right tool for data preprocessing based on scale and complexity (e.g., BigQuery [SQL], Dataflow, Apache Spark, and in-memory Python frameworks)

Introduction to ML in BigQuery

Dataflow overview

Managed Service for Apache Spark documentation

Creating and consolidating features in Gemini Enterprise Agent Platform Feature Store

About Feature Store on Gemini Enterprise Agent Platform

Introduction to feature management in Gemini Enterprise Agent Platform

Ensuring data privacy and handling sensitive information (e.g., personally identifiable information [PII])

Sensitive Data Protection overview

De-identifying sensitive data

Classification, redaction, and de-identification

2.2 Model prototyping using notebooks (e.g., Gemini Enterprise Agent Platform Workbench and Colab Enterprise). Considerations include:

Applying collaboration and security best practices when setting up and running notebook environments

Notebooks

Choose a notebook solution

Introduction to Agent Platform Workbench

Developing models in Agent Platform Workbench or Colab Enterprise notebooks using common frameworks (e.g., PyTorch, sklearn, and JAX)

Choose a notebook solution

Introduction to Agent Platform Workbench

Train and use your own models

Using a variety of foundational and open-source models in Model Garden to create model prototypes in notebook environments

Overview of Model Garden

Use models in Model Garden

Overview of self-deployed models

2.3 Tracking and running ML experiments. Considerations include:

Choosing the appropriate Google Cloud environment for development and experimentation (e.g., Experiments on Gemini Enterprise Agent Platform, Gemini Enterprise Agent Platform Pipelines, and Kubeflow Pipelines) given the framework

Introduction to Gemini Enterprise Agent Platform Experiments

Introduction to Gemini Enterprise Agent Platform Pipelines

Build a pipeline

Evaluating predictive and gen AI solutions (e.g., model evaluation metrics and LLM-as-a-judge)

Gen AI evaluation service overview

Agent evaluation

Gen AI evaluation service API

Tracking and comparing model artifacts, versions, and lineage (e.g., Experiments on Agent Platform and Gemini Enterprise Agent Platform ML Metadata)

Introduction to Gemini Enterprise Agent Platform Experiments

Introduction to Gemini Enterprise Agent Platform ML Metadata

Data model and resources

Section 3: Scaling prototypes into ML models (~21% of the exam)

3.1 Building models given the task considering cost, complexity, latency, and scalability. Considerations include:

Choosing the model type (e.g., ARIMA, DNN, and LLM)

Introduction to ML in BigQuery

Train and use your own models

Tabular Workflows on Agent Platform

Choosing the product (e.g., Agent Platform AutoML, BigQuery ML, and Agent Platform Pipelines)

Train and use your own models

Introduction to ML in BigQuery

Introduction to Gemini Enterprise Agent Platform Pipelines

Choosing the deployment strategy

Deploy a model to an endpoint

Train and use your own models

Modeling techniques given interpretability requirements

Introduction to Vertex Explainable AI

Introduction to Model Monitoring

3.2 Training models. Considerations include:

Organizing training data (e.g., tabular, text, speech, images, and videos) on Google Cloud (e.g., Cloud Storage and BigQuery)

Gemini Enterprise Agent Platform serverless training overview

Introduction to ML in BigQuery

Ingesting structured and unstructured data from various sources into training pipelines

Introduction to Gemini Enterprise Agent Platform Pipelines

Dataflow overview

Model training using different software development kits (SDKs) (e.g., Agent Platform custom training, Kubeflow on Google Kubernetes Engine [GKE], Agent Platform AutoML, and Tabular Workflows) and organizing training on Google Cloud

Gemini Enterprise Agent Platform serverless training overview

Tabular Workflows on Agent Platform

Training classes

Train and use your own models

Troubleshooting ML model training failures

Gemini Enterprise Agent Platform serverless training overview

Distributed training

Hyperparameter tuning

Overview of hyperparameter tuning

Create a hyperparameter tuning job

Fine-tuning foundational models from Agent Platform and Model Garden and when tuning should be considered

Introduction to tuning

About supervised fine-tuning for Gemini models

Use models in Model Garden

3.3 Choosing appropriate hardware for training. Considerations include:

Evaluation of compute and accelerator options (e.g., CPU, GPU, and TPU)

Configure compute resources for Gemini Enterprise Agent Platform serverless training

Gemini Enterprise Agent Platform serverless training overview

Understanding the options for distributed training on GPUs and TPUs using data and model parallelism strategies

Distributed training

Configure compute resources for Gemini Enterprise Agent Platform serverless training

Section 4: Serving and scaling models (~20% of the exam)

4.1 Serving models. Considerations include:

Deploying models for batch and online inference using appropriate services (e.g., Agent Platform, Model Garden, Cloud Run, and GKE)

Overview of getting inferences on Agent Platform

Get batch inferences from a custom trained model

Deploy a model to an endpoint

Packaging and serving models from different frameworks (e.g., PyTorch and XGBoost) using prebuilt and custom containers

Prebuilt containers for inference and explanation

Use a custom container for inference

Custom inference routines

Organizing and versioning models in Gemini Enterprise Agent Platform Model Registry

Introduction to Model Registry

BigQuery ML and Model Registry

Model versions and lifecycle

Implementing model rollout strategies (e.g., A/B testing and canary deployments) to compare model versions

Deploy a model to an endpoint

Deploy a model by using the Google Cloud console

Developing solutions for inference preprocessing and postprocessing

Custom inference routines

Use a custom container for inference

4.2 Scaling online model serving. Considerations include:

Managing and serving features using Agent Platform Feature Store

About Feature Store on Gemini Enterprise Agent Platform

Introduction to feature management in Gemini Enterprise Agent Platform

Deploying models to public and private endpoints

Deploy a model to an endpoint

Deploy a model by using the Google Cloud console

Choosing appropriate hardware (e.g., CPU, GPU, TPU, and edge)

Configure compute resources for Gemini Enterprise Agent Platform serverless training

Prebuilt containers for inference and explanation

Scaling the serving backend based on the throughput (e.g., Gemini Enterprise Agent Platform Inference and containerized serving)

Monitor models

Overview of getting inferences on Agent Platform

Tuning ML models for training and serving in production

Introduction to tuning

Distributed training

Overview of hyperparameter tuning

Section 5: Automating and orchestrating ML pipelines (~18% of the exam)

5.1 Developing end-to-end ML pipelines. Considerations include:

Validating data and models

Introduction to Gemini Enterprise Agent Platform Pipelines

Introduction to Gemini Enterprise Agent Platform ML Metadata

Building and orchestrating pipelines using managed or unmanaged services and from templates or custom solutions (e.g., Agent Platform Pipelines, Managed Service for Apache Airflow, and Ray on Gemini Enterprise Agent Platform)

Introduction to Gemini Enterprise Agent Platform Pipelines

Managed Service for Apache Airflow

Ray on Agent Platform overview

Build a pipeline

Ensuring consistent data preprocessing between training and serving

Perform feature engineering with the TRANSFORM clause

Introduction to Model Monitoring

5.2 Automating model retraining. Considerations include:

Determining an appropriate retraining policy

Introduction to Model Monitoring

MLOps on Gemini Enterprise Agent Platform

Deploying models in continuous integration, continuous delivery, and continuous training (CI/CD/CT) pipelines (e.g., Cloud Build)

Overview of Cloud Build

MLOps on Gemini Enterprise Agent Platform

Introduction to Gemini Enterprise Agent Platform Pipelines

Section 6: Monitoring AI solutions (~13% of the exam)

6.1 Identifying risks to AI solutions. Considerations include:

Building secure AI systems by protecting against unintentional exploitation and leaks of data or models (e.g., data exfiltration, malicious prompting, and sharing sensitive data with LLMs) using the appropriate security tool (e.g., Regex, safety filters, and Model Armor)

Integrate Model Armor with Gemini Enterprise Agent Platform

Sensitive Data Protection overview

Aligning with responsible AI practices (e.g., monitoring for bias)

Gemini image generation and responsible AI

Overview of models on Agent Platform

Model explainability on Agent Platform (e.g., Agent Platform Inference)

Introduction to Vertex Explainable AI

Introduction to Model Monitoring

Monitor feature attribution skew and drift

6.2 Monitoring, testing, and troubleshooting AI solutions. Considerations include:

Configuring and using Model Monitoring on Gemini Enterprise Agent Platform to establish continuous evaluation metrics for production models

Introduction to Model Monitoring

Gemini Enterprise Agent Platform Model Monitoring for batch inferences

Monitoring for common issues (e.g., training-serving skew, data drift, concept drift, and feature attribution drift)

Introduction to Model Monitoring

Monitor feature attribution skew and drift

Monitoring, testing, and evaluating gen AI solutions

Gen AI evaluation service overview

Agent evaluation

Monitor models

Machine Learning Engineer – Final Thoughts

This PMLE study guide walked through all six exam domains, from low-code AI solutions and data collaboration to scaling, serving, pipeline automation, and monitoring. Working through each objective and its official documentation will build the practical, hands-on knowledge the exam expects.

You can also explore more GCP certification study guides on the GCP category page to keep building your skills. Have a question or tip? Leave a comment below.

Receive Updates on Google Machine Learning Exam

Want to be notified as soon as I post? Subscribe to the RSS feed / leave your email address in the subscribe section. Share the article to your social networks with the below links so it can benefit others.

Google Cloud Professional Machine Learning Engineer Study Guide

Professional Machine Learning Engineer Prep:

Google Machine Learning Engineer Materials:

Section 1: Architecting low-code AI solutions (~13% of the exam)

1.1 Developing ML models using BigQuery ML or AutoML on Gemini Enterprise Agent Platform. Considerations include:

Building models in BigQuery ML or Agent Platform AutoML (e.g., classification, regression, forecasting, and clustering) based on the business problem

Performing feature engineering or selection using BigQuery ML

Generating predictions using BigQuery ML

Training models using Agent Platform AutoML

Fine-tuning Gemini models using BigQuery

1.2 Building AI solutions using Google Cloud AI APIs or foundational models. Considerations include:

Evaluating and selecting the appropriate model for a given task from Gemini Enterprise Agent Platform Model Garden

Building applications using industry-specific APIs (e.g., Document AI API, Vision API, and Translate API)

Building solutions and tuning models for specific use cases (e.g., Gemini, Imagen, Veo, and models as a service in Model Garden)

Optimizing Gemini-based applications for cost, latency, and availability

Section 2: Collaborating within and across teams to manage data and models (~16% of the exam)

2.1 Exploring and preprocessing data for ML. Considerations include:

Organizing and exploring different data types (e.g., tabular, text, and images) for efficient experimenting, training, and serving

Choosing the right tool for data preprocessing based on scale and complexity (e.g., BigQuery [SQL], Dataflow, Apache Spark, and in-memory Python frameworks)

Creating and consolidating features in Gemini Enterprise Agent Platform Feature Store

Ensuring data privacy and handling sensitive information (e.g., personally identifiable information [PII])

2.2 Model prototyping using notebooks (e.g., Gemini Enterprise Agent Platform Workbench and Colab Enterprise). Considerations include:

Applying collaboration and security best practices when setting up and running notebook environments

Developing models in Agent Platform Workbench or Colab Enterprise notebooks using common frameworks (e.g., PyTorch, sklearn, and JAX)

Using a variety of foundational and open-source models in Model Garden to create model prototypes in notebook environments

2.3 Tracking and running ML experiments. Considerations include:

Choosing the appropriate Google Cloud environment for development and experimentation (e.g., Experiments on Gemini Enterprise Agent Platform, Gemini Enterprise Agent Platform Pipelines, and Kubeflow Pipelines) given the framework

Evaluating predictive and gen AI solutions (e.g., model evaluation metrics and LLM-as-a-judge)

Tracking and comparing model artifacts, versions, and lineage (e.g., Experiments on Agent Platform and Gemini Enterprise Agent Platform ML Metadata)

Section 3: Scaling prototypes into ML models (~21% of the exam)

3.1 Building models given the task considering cost, complexity, latency, and scalability. Considerations include:

Choosing the model type (e.g., ARIMA, DNN, and LLM)

Choosing the product (e.g., Agent Platform AutoML, BigQuery ML, and Agent Platform Pipelines)

Choosing the deployment strategy

Modeling techniques given interpretability requirements

3.2 Training models. Considerations include:

Organizing training data (e.g., tabular, text, speech, images, and videos) on Google Cloud (e.g., Cloud Storage and BigQuery)

Ingesting structured and unstructured data from various sources into training pipelines

Model training using different software development kits (SDKs) (e.g., Agent Platform custom training, Kubeflow on Google Kubernetes Engine [GKE], Agent Platform AutoML, and Tabular Workflows) and organizing training on Google Cloud

Troubleshooting ML model training failures

Hyperparameter tuning

Fine-tuning foundational models from Agent Platform and Model Garden and when tuning should be considered

3.3 Choosing appropriate hardware for training. Considerations include:

Evaluation of compute and accelerator options (e.g., CPU, GPU, and TPU)

Understanding the options for distributed training on GPUs and TPUs using data and model parallelism strategies

Section 4: Serving and scaling models (~20% of the exam)

4.1 Serving models. Considerations include:

Deploying models for batch and online inference using appropriate services (e.g., Agent Platform, Model Garden, Cloud Run, and GKE)

Packaging and serving models from different frameworks (e.g., PyTorch and XGBoost) using prebuilt and custom containers

Organizing and versioning models in Gemini Enterprise Agent Platform Model Registry

Implementing model rollout strategies (e.g., A/B testing and canary deployments) to compare model versions

Developing solutions for inference preprocessing and postprocessing

4.2 Scaling online model serving. Considerations include:

Managing and serving features using Agent Platform Feature Store

Deploying models to public and private endpoints

Choosing appropriate hardware (e.g., CPU, GPU, TPU, and edge)

Scaling the serving backend based on the throughput (e.g., Gemini Enterprise Agent Platform Inference and containerized serving)

Tuning ML models for training and serving in production

Section 5: Automating and orchestrating ML pipelines (~18% of the exam)

5.1 Developing end-to-end ML pipelines. Considerations include:

Validating data and models

Building and orchestrating pipelines using managed or unmanaged services and from templates or custom solutions (e.g., Agent Platform Pipelines, Managed Service for Apache Airflow, and Ray on Gemini Enterprise Agent Platform)

Ensuring consistent data preprocessing between training and serving

5.2 Automating model retraining. Considerations include:

Determining an appropriate retraining policy

Deploying models in continuous integration, continuous delivery, and continuous training (CI/CD/CT) pipelines (e.g., Cloud Build)

Section 6: Monitoring AI solutions (~13% of the exam)

6.1 Identifying risks to AI solutions. Considerations include:

Building secure AI systems by protecting against unintentional exploitation and leaks of data or models (e.g., data exfiltration, malicious prompting, and sharing sensitive data with LLMs) using the appropriate security tool (e.g., Regex, safety filters, and Model Armor)

Aligning with responsible AI practices (e.g., monitoring for bias)

Model explainability on Agent Platform (e.g., Agent Platform Inference)

6.2 Monitoring, testing, and troubleshooting AI solutions. Considerations include:

Configuring and using Model Monitoring on Gemini Enterprise Agent Platform to establish continuous evaluation metrics for production models

Monitoring for common issues (e.g., training-serving skew, data drift, concept drift, and feature attribution drift)

Monitoring, testing, and evaluating gen AI solutions

Machine Learning Engineer – Final Thoughts

Receive Updates on Google Machine Learning Exam

Share the Google Machine Learning Study Guide in Your Network

You may also like

GCP Professional Cloud Architect Exam Study Guide