Back to Cocoindex

Text Embedding with Turbopuffer (v1)

examples/text_embedding_turbopuffer/README.md

1.0.6989 B
Original Source

Text Embedding with Turbopuffer (v1)

This example embeds local markdown files, stores the chunks + embeddings in a Turbopuffer namespace, and provides a simple semantic-search query demo.

Prerequisites

Copy .env.example to .env and fill in your Turbopuffer API key:

sh
cp .env.example .env
# then edit .env and set TURBOPUFFER_API_KEY=tpuf_...

The example loads variables from .env automatically via python-dotenv. TURBOPUFFER_REGION defaults to gcp-us-central1 if you don't change it.

Run

Install deps:

sh
pip install -e .

Build/update the index (writes rows into Turbopuffer). Pick one of the two modes:

  • Catch-up run — scan sources, sync changes, exit:

    sh
    cocoindex update main
    
  • Live run — catch up, then keep watching for file changes (the source declares live=True in main.py):

    sh
    cocoindex update -L main
    

Query:

sh
python main.py "what is self-attention?"