metadata-ingestion/docs/sources/fabric-onelake/README.md
Microsoft Fabric OneLake is a storage and lakehouse platform. Learn more in the official Microsoft Fabric OneLake documentation.
The DataHub integration for Microsoft Fabric OneLake covers workspace, lakehouse, and warehouse containers, table datasets with schema metadata, and view datasets with view definitions and view-to-table lineage parsed from the view SQL. It also extracts query usage statistics from the SQL Analytics Endpoint's queryinsights views, and captures stateful deletion detection.
| Microsoft Fabric | DataHub Entity | Notes |
|---|---|---|
| Workspace | Container (subtype: Fabric Workspace) | Top-level organizational unit |
| Lakehouse | Container (subtype: Fabric Lakehouse) | Contains schemas and tables |
| Warehouse | Container (subtype: Fabric Warehouse) | Contains schemas and tables |
| Schema | Container (subtype: Fabric Schema) | Logical grouping within lakehouse/warehouse |
| Table | Dataset | Tables within schemas |
| View | Dataset (subtype: View) | Lakehouse and Warehouse views; lineage extracted from view definition via SQL parsing |
Platform (fabric-onelake)
└── Workspace (Container)
├── Lakehouse (Container)
│ └── Schema (Container)
│ └── Table/View (Dataset)
└── Warehouse (Container)
└── Schema (Container)
└── Table/View (Dataset)
The Fabric REST API does not expose tenant-level endpoints. To represent tenant-level organization in DataHub, set the platform_instance configuration field to your tenant identifier (e.g., "contoso-tenant"). This will be included in all container and dataset URNs, effectively grouping all workspaces under the specified platform instance/tenant.