README

metadata-ingestion/docs/sources/fabric-onelake/README.md

Overview

Microsoft Fabric OneLake is a storage and lakehouse platform. Learn more in the official Microsoft Fabric OneLake documentation.

The DataHub integration for Microsoft Fabric OneLake covers file and lakehouse metadata entities such as datasets, paths, and containers. Depending on the capabilities of the module in use, it can also capture lineage, usage, profiling, ownership, tags, and stateful deletion detection.

Concept Mapping

| Microsoft Fabric | DataHub Entity | Notes |
| --- | --- | --- |
| Workspace | Container (subtype: Fabric Workspace) | Top-level organizational unit |
| Lakehouse | Container (subtype: Fabric Lakehouse) | Contains schemas and tables |
| Warehouse | Container (subtype: Fabric Warehouse) | Contains schemas and tables |
| Schema | Container (subtype: Fabric Schema) | Logical grouping within lakehouse/warehouse |
| Table | Dataset | Tables within schemas |

Hierarchy Structure

Platform (fabric-onelake)
└── Workspace (Container)
    ├── Lakehouse (Container)
    │   └── Schema (Container)
    │       └── Table (Dataset)
    └── Warehouse (Container)
        └── Schema (Container)
            └── Table/View (Dataset)
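
The hierarchy above can be modelled as a simple parent-linked tree. The following is a minimal sketch for illustration only, not the connector's implementation: the `Node` class, the `path_from_workspace` helper, and the example names (`sales-ws`, `bronze`, `dbo`, `orders`) are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Node:
    """One level of the Fabric hierarchy (hypothetical helper type)."""

    name: str
    kind: str  # "Workspace", "Lakehouse", "Warehouse", "Schema", "Table", or "View"
    parent: Optional["Node"] = None

    @property
    def entity(self) -> str:
        # Tables and views map to Datasets; every other level maps to a Container.
        return "Dataset" if self.kind in ("Table", "View") else "Container"


def path_from_workspace(node: Node) -> List[str]:
    """Walk parent links upward, then return the path ordered from the workspace down."""
    parts: List[str] = []
    current: Optional[Node] = node
    while current is not None:
        parts.append(f"{current.kind}:{current.name} ({current.entity})")
        current = current.parent
    return list(reversed(parts))


# Example names below are made up for illustration.
workspace = Node("sales-ws", "Workspace")
lakehouse = Node("bronze", "Lakehouse", workspace)
schema = Node("dbo", "Schema", lakehouse)
table = Node("orders", "Table", schema)

for line in path_from_workspace(table):
    print(line)
```

Each printed line corresponds to one level of the tree, so a table always appears with its schema, its lakehouse or warehouse, and its workspace as containers above it.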

Platform Instance as Tenant

The Fabric REST API does not expose tenant-level endpoints. To represent tenant-level organization in DataHub, set the platform_instance configuration field to your tenant identifier (e.g., "contoso-tenant"). The value is included in every container and dataset URN, so all workspaces are grouped under that platform instance, which stands in for the tenant.
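
To illustrate how a platform instance ends up in a dataset URN, here is a minimal sketch that follows DataHub's general dataset URN layout, in which the platform instance is prefixed to the dataset name. The `dataset_urn` helper is hypothetical, and the dotted name `sales-ws.bronze.dbo.orders` is an assumption made for readability; the names or identifiers the connector actually emits may differ.

```python
from typing import Optional


def dataset_urn(
    platform: str,
    name: str,
    platform_instance: Optional[str] = None,
    env: str = "PROD",
) -> str:
    """Build a dataset URN string (hypothetical helper, not the connector's code)."""
    # When a platform instance is set, it is prefixed to the dataset name.
    qualified = f"{platform_instance}.{name}" if platform_instance else name
    return f"urn:li:dataset:(urn:li:dataPlatform:{platform},{qualified},{env})"


# The dataset name below is an assumed, illustrative layout.
without_instance = dataset_urn("fabric-onelake", "sales-ws.bronze.dbo.orders")
with_instance = dataset_urn(
    "fabric-onelake",
    "sales-ws.bronze.dbo.orders",
    platform_instance="contoso-tenant",
)

print(without_instance)
print(with_instance)
```

Because the instance is part of the URN itself, changing platform_instance after an initial ingestion produces different URNs for the same tables, so it is worth choosing the tenant identifier before the first run.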