
Hush removes competing voices, background noise, and audio interference from real-time calls so your voice AI agents always hear what matters.
Hush is an open-source noise suppression model developed by weya AI that strips background noise, competing voices, and audio interference from real-time calls. Designed specifically for Voice AI pipelines, Hush cleans chaotic phone audio at the source, turning noisy input into clean, ASR-ready speech. It processes each 10 ms audio frame in under 1 ms on standard CPUs, keeping conversations flowing without latency. At just 8 MB, the model is lightweight enough to deploy in your own cloud or data center, and it ranked in the top 5 speech-enhancement models on Hugging Face’s Audio-to-Audio leaderboard at launch.
Hush processes each 10 ms audio frame in under 1 ms on standard CPUs, so calls stay fast with no GPUs needed. This makes it practical to run at scale without expensive hardware upgrades.
The model isolates the main caller and pushes background talk, TV noise, and other competing voices aside. ASR systems hear the person who matters, reducing errors from overlapping speech.
Hush handles traffic, office buzz, fans, and street sounds, keeping calls understandable even in the worst everyday noise. It was trained on over 10,000 hours of real-world noisy audio, including overlapping speakers and tough environments.
Hush fixes the call signal at the source, turning chaotic phone audio into clean, ASR-ready speech.
Most Voice AI failures come from bad audio, not bad models. Hush addresses this directly by cleaning the input before it reaches your speech recognition pipeline. Its open-source nature means you can deploy it freely, inspect the code, and integrate it into existing stacks without vendor lock-in. The combination of tiny model size (8 MB), CPU-only real-time performance, and proven leaderboard ranking makes it a practical foundation for any Voice AI system.
You build or operate Voice AI agents that handle real-world phone calls — especially in noisy environments like busy streets, cafes, or open offices. Hush is also worth exploring if you want to reduce ASR errors, improve agent comprehension, or clean call recordings for compliance without adding GPU costs. Its open-source license and lightweight footprint make it a low-risk addition to any audio pipeline.
Other tools you might consider
The Supercut MCP gives your AI/coding assistants permission-aware access to recordings, including semantic search, transcripts, frames, comments, reactions, and more.
Give your AI agent a product flow to record. Slideshot drives your web app through MCP, captures the walkthrough, and returns a polished demo video and GIF with zooms, cursor motion, and custom intro animation. No manual recording or editing.
Point MCP Bridge at any REST, GraphQL, SOAP, or gRPC API. It auto-generates MCP tool definitions with typed schemas, auth, rate limiting, and response processing. Your LLM agents call enterprise APIs through one standard interface.
The world can't build compute fast enough to keep up with AI demand. So we took a different path. ZeroGPU is AI infrastructure powered by small language models running on a hybrid edge network reusing compute that already exists. Not every task needs a frontier model. Our purpose-built, edge-optimized models run 10x faster, 50% cheaper and offload 70–80% of production tasks to small models with frontier-level accuracy.
Loading comments…
Maker
neon_dev
Project Info
Product Keywords