metadata-ingestion/docs/sources/fabric-data-factory/README.md
Microsoft Fabric Data Factory is a cloud-based data integration service within the Microsoft Fabric platform. Learn more in the official Microsoft Fabric Data Factory documentation.
The DataHub integration for Fabric Data Factory covers pipeline and orchestration entities such as workspaces, data pipelines, and activities. It also captures table-level lineage and supports stateful deletion detection.
| Fabric Data Factory Concept | DataHub Entity | Notes |
|---|---|---|
| Workspace | Container (subtype: Fabric Workspace) | Top-level organizational unit |
| Data Pipeline | DataFlow | Orchestration pipeline containing activities |
| Activity | DataJob | Individual task within a pipeline (Copy, Lookup, Spark, etc.) |
| Pipeline Run | DataProcessInstance | Execution record for a pipeline run |
| Activity Run | DataProcessInstance | Execution record for an individual activity within a pipeline |
| Connection | (resolved to external Dataset) | Used for lineage resolution to datasets on external platforms |

    Platform (fabric-data-factory)
    └── Workspace (Container)
        └── Data Pipeline (DataFlow)
            └── Activity (DataJob)
                ├── Pipeline Run (DataProcessInstance)
                └── Activity Run (DataProcessInstance)
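
To make the nesting above concrete, here is a minimal sketch of how a Data Pipeline (DataFlow) and one of its Activities (DataJob) relate at the URN level. It uses DataHub's general `urn:li:dataFlow:(...)` and `urn:li:dataJob:(...)` URN shapes and builds them as plain strings. The workspace, pipeline, and activity names are hypothetical, and the flow ID layout (`<workspace>.<pipeline>`) and the `PROD` cluster value are assumptions for illustration, not necessarily what the connector emits.

```python
# Illustrative only: the ID scheme below is an assumption, not the connector's
# documented behavior. Only the general DataFlow/DataJob URN shapes are relied on.
PLATFORM = "fabric-data-factory"


def data_flow_urn(flow_id: str, cluster: str = "PROD") -> str:
    """Build a DataFlow URN: urn:li:dataFlow:(<orchestrator>,<flow_id>,<cluster>)."""
    return f"urn:li:dataFlow:({PLATFORM},{flow_id},{cluster})"


def data_job_urn(flow_urn: str, job_id: str) -> str:
    """Build a DataJob URN, which embeds the URN of its parent DataFlow."""
    return f"urn:li:dataJob:({flow_urn},{job_id})"


# Hypothetical workspace, pipeline, and activity names.
flow = data_flow_urn("sales-workspace.daily_load")
job = data_job_urn(flow, "CopyOrders")

print(flow)
print(job)
```

Because the DataJob URN contains the full DataFlow URN, an Activity is always addressable relative to its Data Pipeline, which mirrors the parent-child relationship in the hierarchy.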