metadata-ingestion/docs/sources/fabric-onelake/fabric-onelake_pre.md
The fabric-onelake module ingests metadata from Fabric Onelake into DataHub. It is intended for production ingestion workflows and module-specific capabilities are documented below.
:::tip Quick Start
Workspace.Read.All and workspace accessfabric-onelake_recipe.yml as a templatedatahub ingest -c fabric-onelake_recipe.yml
:::Azure Authentication
Fabric Concepts
Before running ingestion, ensure network connectivity to the source, valid authentication credentials, and read permissions for metadata APIs required by this module.
The connector supports multiple Azure authentication methods:
| Method | Best For | Configuration |
|---|---|---|
| Service Principal | Production environments | authentication_method: service_principal |
| Managed Identity | Azure-hosted deployments (VMs, AKS, App Service) | authentication_method: managed_identity |
| Azure CLI | Local development | authentication_method: cli (run az login first) |
| DefaultAzureCredential | Flexible environments | authentication_method: default |
For service principal setup, see Register an application with Microsoft Entra ID.
The connector requires read-only access to Fabric workspaces and their contents. The authenticated identity (service principal, managed identity, or user) must have:
Workspace-Level Permissions:
API Permissions: The service principal or user must have the following Microsoft Entra API permissions:
Workspace.Read.All (delegated) - Required to list and read workspace metadataWorkspace.ReadWrite.All (delegated) - Provides read and write accessToken Audiences: The connector uses two different token audiences depending on the operation:
https://api.fabric.microsoft.com): Uses Power BI API scope (https://analysis.windows.net/powerbi/api/.default) for listing workspaces, lakehouses, warehouses, and basic table metadatahttps://onelake.table.fabric.microsoft.com): Uses Storage audience (https://storage.azure.com/.default) for accessing schemas and tables in schemas-enabled lakehousesThe connector automatically handles both token audiences. For schemas-enabled lakehouses, it will use OneLake Delta Table APIs with Storage audience tokens. For schemas-disabled lakehouses, it uses the standard Fabric REST API.
OneLake Data Access Permissions: For schemas-enabled lakehouses, you may also need OneLake data access permissions:
Note: The connector automatically detects whether a lakehouse has schemas enabled and uses the appropriate API endpoint and token audience. No additional configuration is required.
For detailed information on permissions, see:
For Service Principal:
Workspace.Read.AllFor Managed Identity: