infra/website/docs/blog/feast-mlflow-native-integration.md
Feast manages your features. MLflow tracks your experiments. But between the two, there has always been a manual gap.
When a data scientist trains a model, the features that shaped it are retrieved from Feast, but MLflow has no idea which features were used, which feature service they belong to, or what entity DataFrame produced the training set. The result is a familiar set of problems:
driver_hourly_stats?" — grep through repos and ask around.Teams have tried to close this gap with manual mlflow.log_param("features", ...) calls, custom wrappers, or convention-based tagging. These approaches are fragile, inconsistent, and the first thing to break when someone new joins the team.
Starting with Feast v0.62, the Feast–MLflow integration is native and zero-code. Add an mlflow: block to your feature_store.yaml, and every feature retrieval inside an active MLflow run is automatically tagged with the features, feature views, feature service, entity count, and retrieval duration.
project: driver_ranking
registry: data/registry.db
provider: local
online_store:
type: sqlite
path: data/online_store.db
mlflow:
enabled: true
tracking_uri: http://127.0.0.1:5000
That's it. No decorators, no wrappers, no import mlflow scattered through your training code.
When mlflow.enabled: true and an active MLflow run exists, Feast hooks into get_historical_features() and get_online_features() at the end of each call and writes structured metadata to the run:
| Tag | Example |
|---|---|
feast.project | driver_ranking |
feast.retrieval_type | historical |
feast.feature_service | driver_activity_v1 |
feast.feature_views | driver_hourly_stats |
feast.feature_refs | driver_hourly_stats:conv_rate, driver_hourly_stats:acc_rate |
feast.entity_count | 200 |
feast.feature_count | 5 |
feast.job_submission_sec | 0.43 (metric) |
Even if features are passed as a list of refs rather than a FeatureService object, Feast auto resolves the matching feature service from the registry. The resolution is cached with a 5-minute TTL, so there is no registry overhead on every call.
store.mlflow APIThe integration surfaces through a single property on FeatureStore:
from feast import FeatureStore
store = FeatureStore(".")
with store.mlflow.start_run(run_name="v1_training"):
# Auto-logged: feature refs, feature views, entity count, duration
training_df = store.get_historical_features(
features=store.get_feature_service("driver_activity_v1"),
entity_df=entity_df,
).to_df()
model = train(training_df)
# Saves feast_features.json alongside the model artifact
store.mlflow.log_model(model, "model")
train_run_id = store.mlflow.active_run_id
# Propagates feast.feature_service to the model version
store.mlflow.register_model(f"runs:/{train_run_id}/model", "driver_model")
# Prediction: links back to the training run
with store.mlflow.start_run(run_name="batch_prediction"):
model = store.mlflow.load_model("models:/driver_model/1")
features = store.get_online_features(
features=store.get_feature_service("driver_activity_v1"),
entity_rows=[{"driver_id": 1001}],
)
predictions = model.predict(...)
store.mlflow is lazy-initialized on first access. When MLflow is not installed or enabled is false, it returns None — so existing code that doesn't use MLflow is unaffected.
This is the capability that closes the loop between experiment tracking and production serving. Given any registered model URI, Feast can tell you exactly which feature service it needs:
fs_name = store.mlflow.resolve_features("models:/driver_model/1")
# Returns: "driver_activity_v1"
Resolution follows a precise chain:
feast.feature_service (set by register_model)feast.feature_service (set by auto-logging)feast_features.json artifact to ensure the feature service projections match the features the model was actually trained onIf there is a mismatch : say someone renamed a feature in the service after training — resolve_features() raises FeastMlflowModelResolutionError with a clear diff. No silent serving skew.
This enables a powerful production pattern: your serving pipeline doesn't hardcode feature names. It resolves them from the model:
fs_name = store.mlflow.resolve_features(f"models:/driver_model/production")
features = store.get_online_features(
features=store.get_feature_service(fs_name),
entity_rows=request_entities,
)
Promote a new model version that uses different features, and the serving pipeline auto-adapts.
<div style="display: flex; justify-content: center; margin: 2rem 0;"> </div>When auto_log_entity_df: true, the integration saves the entity DataFrame as a Parquet artifact on every historical retrieval. Later, you can reconstruct the exact training inputs:
entity_df = store.mlflow.get_training_entity_df(run_id="abc123")
with store.mlflow.start_run(run_name="retrain_v2"):
new_df = store.get_historical_features(
features=store.get_feature_service("driver_activity_v1"),
entity_df=entity_df,
).to_df()
Even without entity DataFrame archival, Feast always logs metadata : row count, column names, date range, or the SQL query — so you have an audit trail of what went into the model.
<div style="display: flex; justify-content: center; margin: 2rem 0;"> </div>When log_operations: true, feast apply and feast materialize are logged to a dedicated MLflow experiment ({project}-feast-ops). These are self-contained runs : they don't require a user-initiated active run:
mlflow:
enabled: true
log_operations: true
ops_experiment_suffix: "-feast-ops"
Apply runs record which feature views, feature services, and entities were created, updated, or deleted. Materialize runs record the feature views, date range, and duration. This gives platform teams a time-series audit trail of every registry and materialization change.
<div style="display: flex; justify-content: center; margin: 2rem 0;"> </div>For teams that use MLflow's dataset tracking, the integration provides an explicit API:
store.mlflow.log_training_dataset(
df=training_df,
dataset_name="driver_training_v1",
source="feast.get_historical_features",
)
This uses mlflow.data.from_pandas and mlflow.log_input to register the DataFrame as a dataset input on the active run.
The integration provides two ways to access MLflow, depending on your preference:
store.mlflow — explicit, multi-store safestore = FeatureStore(".")
store.mlflow.start_run(run_name="training")
store.mlflow.log_model(model, "model")
store.mlflow only exposes Feast-enhanced methods. For raw MLflow access, use the escape hatches:
store.mlflow.client # MlflowClient instance
store.mlflow.mlflow # raw mlflow module
feast.mlflow — drop-in module replacementimport feast.mlflow
feast.mlflow.start_run(run_name="training") # Feast-enhanced
feast.mlflow.log_params({"lr": "0.01"}) # passthrough to mlflow
feast.mlflow.log_model(model, "model") # Feast-enhanced
feast.mlflow auto-discovers the most recently created FeatureStore. For Feast-specific methods (like log_model, register_model, resolve_features), it uses the enhanced version. For everything else (log_params, set_tag, MlflowClient, ...), it delegates to raw mlflow. One import, both worlds.
The Feast UI automatically surfaces MLflow data when the integration is enabled. Three new API endpoints power the UI:
| Endpoint | What it shows |
|---|---|
/api/mlflow-runs | All Feast-tagged runs with linked registered models |
/api/mlflow-feature-usage | Per-feature-view: run count, last used, associated models |
/api/mlflow-feature-models | Reverse index: feature ref to registered models |
The feature view detail page shows MLflow training run count, last-used date, and a table of registered models that depend on the view. The registry graph visualization draws edges from feature services through MLflow runs to registered models.
When MLflow is not enabled, these endpoints return empty responses and the UI components are hidden — no visual noise for users who don't use MLflow.
<div style="display: flex; gap: 1rem; justify-content: center; margin: 2rem 0;"> </div>| Option | Type | Default | Description |
|---|---|---|---|
enabled | bool | false | Master switch |
tracking_uri | string | (env/default) | MLflow tracking URI |
auto_log | bool | true | Auto-tag runs on retrieval |
auto_log_entity_df | bool | false | Save entity DataFrame as artifact |
entity_df_max_rows | int | 100000 | Skip artifact for large DataFrames |
log_operations | bool | false | Log apply/materialize to ops experiment |
ops_experiment_suffix | string | "-feast-ops" | Ops experiment name suffix |
Install Feast with MLflow support:
pip install feast[mlflow]
Add the mlflow: block to your feature_store.yaml, start an MLflow tracking server, and run your training code. Features are automatically linked to experiments from the first retrieval.
We'd love to hear how you're using (or plan to use) the Feast–MLflow integration. Reach out on Slack or GitHub — issues and PRs welcome!