metadata-ingestion/docs/sources/flink/flink_pre.md
The flink module ingests metadata from Apache Flink into DataHub. It connects to the Flink JobManager REST API to extract jobs, execution plans, and run history. When a SQL Gateway URL is provided, it resolves SQL/Table API table references to their actual platforms (Kafka, Postgres, Iceberg, Paimon, etc.) via catalog introspection.
In order to ingest metadata from Apache Flink, you will need:
DESCRIBE CATALOG requires Flink 1.20+)| Capability | API | Required Access |
|---|---|---|
| Job metadata, run history | JobManager REST API (/v1/jobs) | Read access to the REST API |
| Platform-resolved lineage | SQL Gateway REST API (/v1/sessions) | Session creation and SQL execution |
NOTE: If your Flink cluster uses authentication (bearer token or basic auth), provide credentials in the
connectionconfig. The same credentials are used for both the JobManager and SQL Gateway APIs.