examples/data_science/llama/README.md
Simple Llama (generative AI) build and run with Devbox.
git clone https://github.com/jetify-com/devbox.gitcd devbox/examples/data_science/llama/devbox shellllama that you can use to run the built llama.cpp.devbox run get_modeldevbox run llamaThis example downloads vicuna-7b model. You can change it to download another Llama model by editing the devbox.json
devbox run llama runs the llama binary with a "hello world" prompt. To change that you can edit the prompt in devbox.json or once in devbox shell, run
llama -m ./models/vic7B/ggml-vic7b-q5_0.bin -n 512 -p "your custom prompt"
For more details on llama inference parameters refer to llama.cpp docs. Note that, instead of running ./main you can run llama inside devbox shell.