docs/running-models-locally/ollama.mdx
## Setting up Ollama

1. Browse the available models at [ollama.com/search](https://ollama.com/search).
2. Select a model and copy its run command:

   ```bash
   ollama run [model-name]
   ```

3. Open your terminal and run the command. For example:

   ```bash
   ollama run llama2
   ```

Your model is now ready to use within Cline.
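To confirm the download finished, you can list the models installed on your machine:

```bash
# Show all locally available models and their sizes
ollama list
```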
## Configuring Cline

Open VS Code and point Cline at your local Ollama server. Set the base URL to `http://localhost:11434/` (the default; usually no need to change).
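To verify the server is reachable at that address, you can request the base URL directly; a running Ollama instance answers with a short status message:

```bash
# Prints "Ollama is running" when the server is up on the default port
curl http://localhost:11434/
```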
## Recommended model

For the best experience with Cline, use Qwen 2.5 Coder 32B. It provides strong coding capabilities and reliable tool use for local development. To download it:
```bash
ollama pull qwen2.5-coder:32b
```
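If the 32B variant is too large for your hardware, the same model is published in smaller sizes. The tags below are examples; check the model's page on ollama.com for the current list:

```bash
# Smaller variants of Qwen 2.5 Coder (verify tag names on ollama.com)
ollama pull qwen2.5-coder:14b
ollama pull qwen2.5-coder:7b
```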
Other capable models include:

- `mistral-small:latest` - good balance of performance and speed
- `codellama:34b-code` - optimized for coding tasks

## Compact prompts

For better performance with local models, enable compact prompts in Cline's settings. This reduces the prompt size by 90% while maintaining core functionality.
Navigate to Cline Settings → Features → Use Compact Prompt and toggle it on.
## Troubleshooting

If Cline can't connect to Ollama:

1. Verify the Ollama server is running. It normally starts with the desktop app after installation; otherwise start it manually with `ollama serve`.
2. Check that the base URL in Cline's settings matches the address Ollama is listening on (`http://localhost:11434/` by default).
3. Make sure the model you selected in Cline has actually been pulled locally.
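You can also exercise the API directly from the terminal to rule out a Cline-side issue. This is a minimal sketch that assumes you have pulled `llama2`; substitute any model you have installed:

```bash
# Send a minimal, non-streaming generation request to the local Ollama API
curl http://localhost:11434/api/generate -d '{
  "model": "llama2",
  "prompt": "Say hello",
  "stream": false
}'
```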
Need more info? Read the Ollama Docs.