cookbook/90_models/llama_cpp/README.md
Note: Fork and clone this repository if needed.
Run your chat model using Llama CPP. The examples below use the ggml-org/gpt-oss-20b-GGUF model, so make sure it is downloaded and that the server is reachable at http://127.0.0.1:8080/v1.
Command to run GPT-OSS-20B:
llama-server -hf ggml-org/gpt-oss-20b-GGUF --ctx-size 0 --jinja -ub 2048 -b 2048
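Before running the examples, it can help to confirm that the server is actually answering at the address above. The following is a minimal sketch using only the Python standard library. It assumes the server exposes the OpenAI-compatible `GET /v1/models` endpoint and returns a JSON object with a `data` list. The helper names `models_url` and `list_models` are hypothetical and not part of this repository.

```python
import json
import urllib.error
import urllib.request
from typing import List, Optional

# Address taken from this README; adjust if llama-server runs elsewhere.
BASE_URL = "http://127.0.0.1:8080/v1"


def models_url(base_url: str) -> str:
    """Build the model-listing URL from an OpenAI-style base URL."""
    return base_url.rstrip("/") + "/models"


def list_models(base_url: str = BASE_URL, timeout: float = 2.0) -> Optional[List[str]]:
    """Return the model ids reported by the server, or None if it is unreachable.

    Assumption: the response is shaped like {"data": [{"id": "..."}, ...]}.
    """
    try:
        with urllib.request.urlopen(models_url(base_url), timeout=timeout) as resp:
            payload = json.load(resp)
    except (urllib.error.URLError, OSError, ValueError):
        # Connection refused, timeout, HTTP error, or a non-JSON body.
        return None
    if not isinstance(payload, dict):
        return None
    return [str(m.get("id", "")) for m in payload.get("data", []) if isinstance(m, dict)]


if __name__ == "__main__":
    models = list_models()
    if models is None:
        print(f"No server reachable at {BASE_URL}")
    else:
        print(f"Server is up, models: {models}")
```

If the script reports that no server is reachable, check that `llama-server` is still running and listening on port 8080 before trying the cookbook scripts.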
Create a virtual environment and install the dependencies:
python3 -m venv ~/.venvs/aienv
source ~/.venvs/aienv/bin/activate
uv pip install -U ddgs openai agno
Run the examples:
python cookbook/90_models/llama_cpp/basic_stream.py
python cookbook/90_models/llama_cpp/basic.py
python cookbook/90_models/llama_cpp/tool_use_stream.py
python cookbook/90_models/llama_cpp/tool_use.py
python cookbook/90_models/llama_cpp/structured_output.py