metadata-ingestion/docs/sources/confluence/confluence_pre.md
:::caution Not Supported with Remote Executor

This source is available as a private beta feature on DataHub Cloud. Note that running the connector using the Remote Executor is not yet supported.

:::
The Confluence source ingests pages and spaces from Confluence workspaces (Cloud or Data Center) as DataHub Document entities, optionally generating embeddings for semantic search.
Optional semantic search support with sensible defaults:
See Semantic Search Configuration for detailed setup and advanced options.
Supports smart incremental updates via stateful ingestion:
Create an API token:
You'll need:
- Your Confluence Cloud URL (e.g., https://your-domain.atlassian.net/wiki)

Create a Personal Access Token:
You'll need:
- Your Confluence Data Center URL (e.g., https://confluence.company.com)

Note: For Data Center, you can also use username/password, but Personal Access Tokens are recommended.
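To make the difference between the two credential types concrete, here is a minimal Python sketch of how each is typically presented to the Confluence REST API. It is not the connector's implementation. The helper names `build_auth_headers` and `spaces_url` are hypothetical, and the sketch assumes that a Cloud API token is paired with your Atlassian account email under HTTP Basic auth, that a Data Center Personal Access Token is sent as a Bearer token, and that the instance is served at the base URL with no extra context path.

```python
import base64


def build_auth_headers(deployment: str, token: str, email: str = "") -> dict:
    """Return HTTP headers for a Confluence REST call (hypothetical helper)."""
    if deployment == "cloud":
        if not email:
            raise ValueError("a Cloud API token must be paired with an account email")
        # Assumption: Cloud uses HTTP Basic auth with the API token as the password.
        raw = f"{email}:{token}".encode("utf-8")
        return {"Authorization": "Basic " + base64.b64encode(raw).decode("ascii")}
    if deployment == "datacenter":
        # Assumption: Data Center Personal Access Tokens are sent as Bearer tokens.
        return {"Authorization": f"Bearer {token}"}
    raise ValueError(f"unknown deployment: {deployment!r}")


def spaces_url(base_url: str) -> str:
    """Build the space-listing endpoint URL, useful for a quick credential check."""
    return base_url.rstrip("/") + "/rest/api/space"
```

Sending a GET request to `spaces_url(...)` with these headers (for example via `urllib.request`) is one way to check that the token works before running ingestion. A 401 or 403 response usually points to a wrong token or missing permissions.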
The API credentials must have:
If you want semantic search capabilities, configure an embedding provider in your DataHub instance.
Supported providers include Cohere (API key) and AWS Bedrock (IAM roles). The connector will use sensible defaults for chunking and embedding configuration.
See Semantic Search Configuration for detailed provider setup and configuration options.
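For intuition about what "chunking" means here, the sketch below splits page text into overlapping word windows before embedding. It is a generic illustration only. The function name `chunk_words`, the word-based splitting, and the default sizes are assumptions, not the connector's actual algorithm or defaults.

```python
def chunk_words(text: str, chunk_size: int = 200, overlap: int = 20) -> list[str]:
    """Split text into word windows of chunk_size, sharing `overlap` words between neighbors."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    words = text.split()
    step = chunk_size - overlap  # how far each window advances
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last window already reached the end of the text
    return chunks
```

Each chunk would then be sent to the configured embedding provider. The overlap keeps a sentence that straddles a boundary from being searchable only as two fragments.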