Back to Datahub

README

metadata-ingestion/docs/sources/s3/README.md

1.6.01.3 KB
Original Source

Overview

Amazon S3 is a storage and lakehouse platform. Learn more in the official Amazon S3 documentation.

The DataHub integration for Amazon S3 covers file/lakehouse metadata entities such as datasets, paths, and containers. It also captures data profiling, tags, and stateful deletion detection.

Concept Mapping

Source ConceptDataHub ConceptNotes
"s3"Data Platform
s3 object / Folder containing s3 objectsDataset
s3 bucketContainerSubtype S3 bucket
s3 folderContainerSubtype Folder