Back to Datahub

README

metadata-ingestion/docs/sources/pinecone/README.md

1.6.01.2 KB
Original Source

Overview

Pinecone is a managed vector database platform used to store, index, and query high-dimensional vector embeddings for AI and machine learning applications. Learn more in the official Pinecone documentation.

The DataHub integration for Pinecone extracts metadata about indexes, namespaces, and vector collections, including inferred schemas from vector metadata fields.

Concept Mapping

Source ConceptDataHub ConceptNotes
Pinecone AccountPlatform InstanceOrganizes assets within the platform context.
IndexContainer (PINECONE_INDEX)Top-level organizational unit storing vectors.
NamespaceContainer (PINECONE_NAMESPACE)Logical partition within an index.
Vector CollectionDatasetRepresents the collection of vectors in a namespace.
Metadata FieldsSchemaFieldInferred from sampled vector metadata.