Vertex AI Prediction and Model Monitoring Guide
Training a model is only one part of machine learning. After a model is trained and validated, it must serve predictions. Then the team must monitor whether the model continues to behave well in production. This guide explains batch prediction, online prediction, and model monitoring in simple terms. Quick Answer Use batch prediction when you need many predictions at once and do not need an instant response. Use online prediction when an application needs a fast prediction through an endpoint. Use model monitoring to detect changes in production data, training-serving skew, drift, and behavior that may reduce model quality. ...