As machine learning teams grow, feature engineering becomes harder to manage. Different teams may create similar features, calculate them differently, or struggle to serve the same values during training and prediction.

Vertex AI Feature Store helps organize, reuse, and serve machine learning features.

Quick Answer

Vertex AI Feature Store is a managed feature repository for storing, sharing, and serving feature values. It helps teams reuse features, manage feature freshness, and reduce training-serving skew.

Key Takeaways

  • A feature store centralizes machine learning features.
  • Feature stores help teams reuse features across models.
  • Entity types represent business objects such as users, products, accounts, or movies.
  • Features are measurable attributes of those entities.
  • Features can be served for batch training or low-latency online prediction.

Why Feature Stores Exist

Feature stores solve practical ML team problems:

  • features are hard to find,
  • similar features are built many times,
  • training and serving calculations can drift,
  • production systems need low-latency features,
  • teams need feature history,
  • governance and reuse become difficult.

A feature store gives teams a shared place to define, store, discover, and serve features.

Core Concepts

ConceptMeaning
Feature storeTop-level container for feature resources
Entity typeA type of object, such as user, product, account, or movie
EntityOne instance of an entity type, such as a specific user
FeatureA measurable attribute, such as average rating or account age
Feature valueThe value of a feature for a specific entity at a point in time
IngestionImporting feature values into the store
ServingExporting feature values for training or prediction

Example Data Model

Imagine a movie recommendation workflow.

Entity type:

  • movie

Features:

  • average rating,
  • genre,
  • release year,
  • watch count,
  • recent popularity,
  • content rating.

The feature store can keep feature values over time so models can train with historical data and serve current values later.

Feature Ingestion

Feature ingestion means loading feature values into the feature store.

Sources may include:

  • BigQuery tables,
  • Cloud Storage files,
  • streaming pipelines,
  • batch feature engineering jobs.

Before ingestion, teams should confirm:

  • entity IDs are present,
  • data types match feature definitions,
  • timestamps are available where needed,
  • source columns have clear names,
  • sensitive data is approved for use.

Batch Serving And Online Serving

Feature serving means retrieving feature values for model use.

Serving typeBest for
Batch servingTraining datasets, offline scoring, large exports
Online servingLow-latency predictions in applications

Batch serving is useful when creating training examples or running large prediction jobs. Online serving is useful when an application needs current feature values quickly.

Training-Serving Skew

Training-serving skew happens when features are calculated differently during training and prediction.

Examples:

  • training uses yesterday’s definition, serving uses today’s definition,
  • missing values are handled differently,
  • categories are encoded differently,
  • timestamp logic changes,
  • production data arrives late.

A feature store helps reduce this risk by centralizing feature definitions and serving patterns.

When To Use A Feature Store

Use a feature store when:

  • multiple models reuse the same features,
  • multiple teams build ML systems,
  • online prediction needs low-latency feature lookup,
  • training and serving consistency matters,
  • feature freshness must be tracked,
  • feature ownership and discovery are becoming difficult.

You may not need a feature store for a small one-off model with simple static data.

FAQ

What is Vertex AI Feature Store?

Vertex AI Feature Store is a managed repository for storing, sharing, and serving machine learning features for training and prediction workflows.

Why use a feature store?

A feature store helps teams reuse features, reduce training-serving skew, manage feature values over time, and serve features for batch or online prediction.

Bottom Line

Vertex AI Feature Store is useful when feature reuse, consistency, serving, and governance become important. It helps teams turn feature engineering from scattered code into managed ML infrastructure.