examples/code_embedding_lancedb/README.md
This example extracts code chunks from local Python, Rust, TOML, and Markdown files, stores the chunks and their vector embeddings in LanceDB, and provides a simple semantic search demo for code.
All data is stored in the ./lancedb_data/ directory in your project folder. This directory is created automatically on first run. To move the index, just copy the directory.
To start fresh, simply delete the ./lancedb_data/ directory and re-run the indexing.
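If you would rather reset the store from a script than delete the directory by hand, a minimal sketch could look like the following. The helper name `reset_store` is hypothetical and not part of this example's code; the only thing taken from this README is the `./lancedb_data` path.

```python
import shutil
from pathlib import Path


def reset_store(data_dir: str = "./lancedb_data") -> bool:
    """Delete the local LanceDB directory.

    Hypothetical helper (not part of the example). Returns True if a
    directory was removed and False if there was nothing to delete.
    """
    path = Path(data_dir)
    if not path.exists():
        return False
    shutil.rmtree(path)  # removes the directory and everything inside it
    return True


if __name__ == "__main__":
    removed = reset_store()
    print("removed" if removed else "nothing to remove")
```

After a reset, re-run the indexing step so the directory is created again.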
Install dependencies:
pip install -e .
Build/update the index (writes rows into LanceDB). Pick one of the two modes:
Catch-up run — scan sources, sync changes, exit:
cocoindex update main
Live run — catch up, then keep watching for file changes (the source declares live=True in main.py):
cocoindex update -L main
Query:
python main.py "embedding"
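To give an intuition for what a semantic query does, here is a toy sketch of ranking stored chunks by similarity to a query vector. It is not the code in main.py: in the example, LanceDB performs the vector search, and the similarity metric it uses is not stated here. Cosine similarity, the function names, and the three-dimensional vectors below are all assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query_vec, rows, top_k=3):
    """Return the top_k (score, text) pairs, highest similarity first."""
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in rows]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Made-up chunks with made-up embeddings; real embeddings have many more dimensions.
rows = [
    ("def embed(text): ...", [0.9, 0.1, 0.0]),
    ("[package] name = ...", [0.0, 0.2, 0.9]),
    ("# Embedding notes", [0.7, 0.3, 0.1]),
]

results = search([1.0, 0.0, 0.0], rows, top_k=2)
for score, text in results:
    print(f"{score:.3f}  {text}")
```

The query text is embedded into the same vector space as the chunks, so chunks about related topics score higher even when they share no exact keywords with the query.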
Copy .env.example to .env and fill in the blanks — it is loaded automatically when you run the example:
cp .env.example .env
LanceDB is an embedded local store, so there are no required secrets — the file documents the optional LANCEDB_URI override.
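As a rough illustration of how an optional override like this typically behaves, the sketch below resolves the store location from the environment and falls back to the local directory. The function name `resolve_lancedb_uri` is hypothetical, and whether main.py uses exactly this default or lookup logic is an assumption; only the `LANCEDB_URI` variable name and the `./lancedb_data` directory come from this README.

```python
import os

# Assumed default, taken from the directory this README describes.
DEFAULT_URI = "./lancedb_data"


def resolve_lancedb_uri(env=None):
    """Return LANCEDB_URI if it is set and non-blank, otherwise the local default.

    Hypothetical helper; `env` can be any mapping and defaults to os.environ.
    """
    env = os.environ if env is None else env
    value = env.get("LANCEDB_URI", "").strip()
    return value or DEFAULT_URI


if __name__ == "__main__":
    print(resolve_lancedb_uri())
```

Leaving the variable unset or blank in .env keeps everything in the local directory.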