Back to Cocoindex

Text Embedding (v1) 🔍

examples/text_embedding/README.md

1.0.5863 B
Original Source

Text Embedding (v1) 🔍

This example embeds local markdown files, stores the chunks + embeddings in Postgres (pgvector), and provides a simple semantic-search query demo.

Prerequisites

  • A running Postgres with the pgvector extension. If you don't have one, start a local instance with the compose file in this repo:

    sh
    docker compose -f ../../dev/postgres.yaml up -d
    
  • POSTGRES_URL set, e.g.

    sh
    export POSTGRES_URL="postgres://cocoindex:cocoindex@localhost/cocoindex"
    

Run

Install deps:

sh
pip install -e .

Build/update the index (writes rows into Postgres). Either of the following works:

sh
cocoindex update main

or

sh
python main.py

Query:

sh
python main.py query "what is self-attention?"

Note: this example does not create a vector index; queries will do a sequential scan.