metadata-ingestion/docs/sources/sagemaker/README.md
Sagemaker is a machine learning platform. Learn more in the official Sagemaker documentation.
The DataHub integration for Sagemaker covers ML entities such as models, features, and related lineage metadata. It also captures table-level lineage and stateful deletion detection.
While the specific concept mapping is still pending, this shows the generic concept mapping in DataHub.
| Source Concept | DataHub Concept | Notes |
|---|---|---|
| Platform/account/project scope | Platform Instance, Container | Organizes assets within the platform context. |
| Core technical asset (for example table/view/topic/file) | Dataset | Primary ingested technical asset. |
| Schema fields / columns | SchemaField | Included when schema extraction is supported. |
| Ownership and collaboration principals | CorpUser, CorpGroup | Emitted by modules that support ownership and identity metadata. |
| Dependencies and processing relationships | Lineage edges | Available when lineage extraction is supported and enabled. |