Back to Datahub

README

metadata-ingestion/docs/sources/s3/README.md

1.5.0.31.4 KB
Original Source

Overview

Amazon S3 is a storage and lakehouse platform. Learn more in the official Amazon S3 documentation.

The DataHub integration for Amazon S3 covers file/lakehouse metadata entities such as datasets, paths, and containers. Depending on module capabilities, it can also capture features such as lineage, usage, profiling, ownership, tags, and stateful deletion detection.

Concept Mapping

Source ConceptDataHub ConceptNotes
"s3"Data Platform
s3 object / Folder containing s3 objectsDataset
s3 bucketContainerSubtype S3 bucket
s3 folderContainerSubtype Folder