As machine learning teams grow, feature engineering becomes harder to manage. Different teams may create similar features, calculate them differently, or struggle to serve the same values during training and prediction.
Vertex AI Feature Store helps organize, reuse, and serve machine learning features.
Quick Answer
Vertex AI Feature Store is a managed feature repository for storing, sharing, and serving feature values. It helps teams reuse features, manage feature freshness, and reduce training-serving skew.
Key Takeaways
- A feature store centralizes machine learning features.
- Feature stores help teams reuse features across models.
- Entity types represent business objects such as users, products, accounts, or movies.
- Features are measurable attributes of those entities.
- Features can be served for batch training or low-latency online prediction.
Why Feature Stores Exist
Feature stores solve practical ML team problems:
- features are hard to find,
- similar features are built many times,
- training and serving calculations can drift,
- production systems need low-latency features,
- teams need feature history,
- governance and reuse become difficult.
A feature store gives teams a shared place to define, store, discover, and serve features.
Core Concepts
| Concept | Meaning |
|---|---|
| Feature store | Top-level container for feature resources |
| Entity type | A type of object, such as user, product, account, or movie |
| Entity | One instance of an entity type, such as a specific user |
| Feature | A measurable attribute, such as average rating or account age |
| Feature value | The value of a feature for a specific entity at a point in time |
| Ingestion | Importing feature values into the store |
| Serving | Exporting feature values for training or prediction |
Example Data Model
Imagine a movie recommendation workflow.
Entity type:
- movie
Features:
- average rating,
- genre,
- release year,
- watch count,
- recent popularity,
- content rating.
The feature store can keep feature values over time so models can train with historical data and serve current values later.
Feature Ingestion
Feature ingestion means loading feature values into the feature store.
Sources may include:
- BigQuery tables,
- Cloud Storage files,
- streaming pipelines,
- batch feature engineering jobs.
Before ingestion, teams should confirm:
- entity IDs are present,
- data types match feature definitions,
- timestamps are available where needed,
- source columns have clear names,
- sensitive data is approved for use.
Batch Serving And Online Serving
Feature serving means retrieving feature values for model use.
| Serving type | Best for |
|---|---|
| Batch serving | Training datasets, offline scoring, large exports |
| Online serving | Low-latency predictions in applications |
Batch serving is useful when creating training examples or running large prediction jobs. Online serving is useful when an application needs current feature values quickly.
Training-Serving Skew
Training-serving skew happens when features are calculated differently during training and prediction.
Examples:
- training uses yesterday’s definition, serving uses today’s definition,
- missing values are handled differently,
- categories are encoded differently,
- timestamp logic changes,
- production data arrives late.
A feature store helps reduce this risk by centralizing feature definitions and serving patterns.
When To Use A Feature Store
Use a feature store when:
- multiple models reuse the same features,
- multiple teams build ML systems,
- online prediction needs low-latency feature lookup,
- training and serving consistency matters,
- feature freshness must be tracked,
- feature ownership and discovery are becoming difficult.
You may not need a feature store for a small one-off model with simple static data.
Related AI Charcha Reading
- Feature Engineering for Machine Learning
- Feature Engineering With Keras and BigQuery ML
- How to Build Generative AI Apps in Azure with Microsoft Foundry
FAQ
What is Vertex AI Feature Store?
Vertex AI Feature Store is a managed repository for storing, sharing, and serving machine learning features for training and prediction workflows.
Why use a feature store?
A feature store helps teams reuse features, reduce training-serving skew, manage feature values over time, and serve features for batch or online prediction.
Bottom Line
Vertex AI Feature Store is useful when feature reuse, consistency, serving, and governance become important. It helps teams turn feature engineering from scattered code into managed ML infrastructure.