examples/code_embedding_lancedb/README.md
This example extracts code chunks from local Python, Rust, TOML, and Markdown files, stores the chunks and their vector embeddings in LanceDB, and provides a simple semantic search demo over them.
All data is stored in the ./lancedb_data/ directory in your project folder. This directory is created automatically on first run. To move the data elsewhere, just copy the directory.
To start fresh, simply delete the ./lancedb_data/ directory and re-run the indexing.
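If you prefer to script the reset rather than delete the directory by hand, something like the following could work. It is a minimal sketch using only the Python standard library; the helper name `reset_index` is hypothetical and not part of this example's code, and the default path assumes you run it from the project folder.

```python
import shutil
from pathlib import Path


def reset_index(data_dir: str = "./lancedb_data") -> bool:
    """Delete the LanceDB data directory so the next indexing run starts fresh.

    Returns True if a directory was removed, False if there was nothing to delete.
    `reset_index` is a hypothetical helper, not something shipped with the example.
    """
    path = Path(data_dir)
    if not path.is_dir():
        # Nothing has been indexed yet (or the data was already removed).
        return False
    shutil.rmtree(path)
    return True
```

Calling `reset_index()` and then re-running the indexing step should recreate the directory, since it is created automatically on first run.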
Install dependencies:
pip install -e .
Build/update the index (writes rows into LanceDB). Pick one of the two modes:
Catch-up run — scan sources, sync changes, exit:
cocoindex update main
Live run — catch up, then keep watching for file changes (the source declares live=True in main.py):
cocoindex update -L main
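If you want to trigger either mode from a Python script (for example in a task runner), the two commands above can be wrapped as sketched below. The function names `build_update_command` and `run_update` are hypothetical; the only CLI arguments used are the ones shown above, and the sketch assumes the `cocoindex` executable is on your PATH.

```python
import subprocess


def build_update_command(live: bool = False, app: str = "main") -> list[str]:
    """Build the argument list for a catch-up run, or a live run when live=True."""
    cmd = ["cocoindex", "update"]
    if live:
        # -L keeps the process running and watching for file changes.
        cmd.append("-L")
    cmd.append(app)
    return cmd


def run_update(live: bool = False) -> int:
    """Run the update command and return its exit code (hypothetical wrapper)."""
    return subprocess.run(build_update_command(live), check=False).returncode
```

Note that a live run does not exit on its own, so `run_update(live=True)` would block until the process is stopped.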
Query:
python main.py "embedding"
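To give a rough idea of what a semantic query does, the sketch below ranks stored chunks by similarity between the query embedding and each chunk embedding. It is a toy illustration in pure Python, not the code in main.py: it assumes cosine similarity as the metric, the vectors are made up, and in the real example LanceDB performs the vector search and an embedding model produces the vectors.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vec, rows, k=5):
    """Return the k (score, text) pairs most similar to query_vec, best first.

    `rows` is a list of (text, embedding) pairs, standing in for table rows.
    """
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in rows]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Made-up 2-dimensional embeddings, for illustration only.
rows = [
    ("def embed(text): ...", [1.0, 0.0]),
    ("fn parse_toml() {}", [0.0, 1.0]),
    ("# Embedding notes", [0.8, 0.6]),
]

for score, text in top_k([1.0, 0.0], rows, k=2):
    print(f"{score:.3f}  {text}")
```

Real embeddings have hundreds of dimensions, and a vector database avoids scanning every row, but the ranking idea is the same.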