packages/kilo-docs/pages/code-with-ai/features/speech-to-text.md
{% callout type="warning" title="🧪 Experimental Feature" %} Voice Transcription / speech-to-text (STT) is currently in experimental status. Expect potential issues and changes as the feature matures. {% /callout %}
Kilo Code now includes experimental support for voice input in the chat interface. This feature allows you to dictate your messages using speech-to-text (STT) technology powered by OpenAI's Whisper API.
Voice transcription requires two components to be set up:
FFmpeg is required for audio capture and processing. Install it for your platform:
macOS:
brew install ffmpeg
Linux (Ubuntu/Debian):
sudo apt update
sudo apt install ffmpeg
Windows: Download from ffmpeg.org/download.html and add to your system PATH.
Voice transcription uses OpenAI's Whisper API for speech recognition. You need an OpenAI API configuration in Kilo Code:
Voice transcription is an experimental feature that must be enabled:
Once configured and enabled, a microphone button will appear in the chat input area:
The feature includes real-time audio level visualization and voice activity detection to automatically detect when you're speaking.
Microphone button not appearing:
Transcription errors:
This feature is currently experimental and may have limitations: