Imagine creating an app just by describing what you want out loud, like you’re talking to a friend. That’s the magic of vibe coding: turning natural language into working code with the help of an AI agent. And while typing a prompt gets the job done, saying it out loud hits different 🔥 The new Speech MCP server has quite literally entered the chat.
<!--truncate-->

In a recent Wild Goose Case livestream, hosts Ebony Louis and Adewale Abati were joined by Max Novich from Block's AI tools team, who demonstrated an exciting new extension - the Speech MCP server.
During the livestream, Max built an entire web application using only voice commands - no keyboard or mouse required. The result was a vibrant, animated webpage with 3D effects, synthwave aesthetics, and interactive elements, all created through natural conversation with Goose.
<iframe class="aspect-ratio" src="https://www.youtube.com/embed/Zey9GHyXlHY?start=437&end=752" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>

Speech MCP is an open source MCP server that enables voice interaction with AI agents like Goose. What makes it special is that it runs entirely locally on your machine, making it:
Local Speech Processing: Uses two main models:
Voice Selection: Choose from 54 different voices with varying characteristics and personalities
Multi-Speaker Narration: Generate and play conversations between multiple voices
Audio Transcription: Convert audio/video content to text with timestamps and speaker detection
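
To make "text with timestamps and speaker detection" a little more concrete, here is a rough sketch of how such a transcript could be represented and printed. This is not Speech MCP's actual output format: the `Segment` class, its field names, and the sample lines are all invented for illustration.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcribed span of audio (hypothetical shape, not Speech MCP's schema)."""
    start: float   # seconds from the start of the recording
    end: float     # seconds from the start of the recording
    speaker: str   # label assigned by speaker detection
    text: str      # what was said


def format_timestamp(seconds: float) -> str:
    """Render a number of seconds as MM:SS, dropping fractions of a second."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


def render(segments) -> str:
    """Turn a list of segments into one readable line per segment."""
    return "\n".join(
        f"[{format_timestamp(s.start)} - {format_timestamp(s.end)}] {s.speaker}: {s.text}"
        for s in segments
    )


if __name__ == "__main__":
    # Made-up sample data, only here to show the shape of the output.
    sample = [
        Segment(0.0, 4.2, "Speaker 1", "Make the title glow."),
        Segment(4.2, 9.8, "Speaker 2", "Adding a neon glow now."),
    ]
    print(render(sample))
```

Keeping the speaker label and the time range on every segment is what would let a tool search a recording by who said something and when.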
During the demonstration, Max showcased several impressive capabilities:
Voice-Controlled Development:
System Integration:
Natural Interaction:
To try the Speech MCP server yourself:
Install the required audio library (PortAudio):

```bash
# For macOS
brew install portaudio

# For Linux (Debian/Ubuntu package name shown; on Fedora: dnf install portaudio)
apt-get install portaudio19-dev
```
Then install the extension directly in Goose using the one-click deep link.
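
If audio doesn't work after setup, one quick sanity check is whether the system's dynamic loader can see PortAudio at all. Here is a minimal sketch using only Python's standard library. `find_portaudio` is a helper name made up for this post, not part of Speech MCP, and a negative result isn't conclusive: a library installed under a prefix that isn't on the default search path (as can happen with some Homebrew setups) may be missed.

```python
import ctypes.util


def find_portaudio():
    """Return the PortAudio library name or path the loader can find, or None."""
    return ctypes.util.find_library("portaudio")


if __name__ == "__main__":
    location = find_portaudio()
    if location:
        print(f"PortAudio found: {location}")
    else:
        print("PortAudio not found on the default search path; "
              "try installing it with your package manager.")
```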
The Speech MCP server is open-source and welcomes contributions. You can also connect with Max on Discord for questions and collaboration.
Voice interaction with AI agents like Goose, which have the power and tools to act on instructions, provides a different kind of vibe that makes the future feel closer than ever. Whether you're interested in vibe coding or accessibility improvements, or just want to feel a bit more like Tony Stark while getting Goose to pull a J.A.R.V.I.S., the Speech MCP server offers a glimpse into the future of human-AI collaboration - and it's available today.
<head> <meta property="og:title" content="Vibe Coding with Goose and the Speech MCP" /> <meta property="og:type" content="article" /> <meta property="og:url" content="https://goose-docs.ai/blog/2025/03/28/vibe-coding-with-goose" /> <meta property="og:description" content="Explore the new Speech MCP server that enables voice-controlled coding and natural conversation with your AI agent." /> <meta property="og:image" content="https://goose-docs.ai/assets/images/vibe-coding-b2efeed37ea43f4773da5f1ff96f4184.png" /> <meta name="twitter:card" content="summary_large_image" /> <meta property="twitter:domain" content="goose-docs.ai" /> <meta name="twitter:title" content="Vibe Coding with Goose and the Speech MCP" /> <meta name="twitter:description" content="Explore the new Speech MCP server that enables voice-controlled coding and natural conversation with your AI agent." /> <meta name="twitter:image" content="https://goose-docs.ai/assets/images/vibe-coding-b2efeed37ea43f4773da5f1ff96f4184.png" /> </head>