metadata-ingestion/docs/sources/informatica/README.md
Informatica Intelligent Data Management Cloud (IDMC) is a cloud-native data integration and management platform. Learn more in the official Informatica documentation.
The DataHub integration for Informatica covers projects and folders as containers; Mapping Tasks as DataFlows with a transform DataJob per task; Taskflows as DataFlows with a single orchestrate DataJob that chains the step order via inputDatajobs; and resolves table-level lineage across the data estate from mapping source/target connections. It also supports ownership extraction and stateful deletion detection.
| Source Concept | DataHub Concept | Notes |
|---|---|---|
"informatica" | Data Platform | |
| Project | Container | SubType "Project" |
| Folder | Container | SubType "Folder" |
| Taskflow | DataFlow + one orchestrate DataJob | SubTypes "Taskflow" / "Taskflow Orchestration"; the orchestrate sits at the end of the chain with inputDatajobs = [last MT] |
| Mapping Task | DataFlow + inner transform DataJob | SubTypes "Mapping Task" / "Task Logic"; MTs chain to each other via inputDatajobs in Taskflow step order |
| Mapping | not emitted as a standalone entity | Only Mapping Tasks (runnable schedules) are emitted; the Mapping reference is surfaced via customProperties on the Task |
| Mapplet | not emitted | Internal sub-mappings included in other mappings; skipped |
| Source/Target | Dataset | Upstream/downstream lineage; external dataset URNs receive a minimal Status stub so they resolve in lineage search |