Google Gemini 3.1 Flash Live moves voice agents from speech-to-text to direct speech-to-speech, cutting latency and boosting ...
Chief among these features is Kairos, a persistent daemon that can operate in the background even when the Claude Code ...
Abstract: Deep learning (DL) libraries have become increasingly popular and their quality assurance is also gaining significant attention. Although many fault detection techniques have been proposed, ...
Small and fast: only 123M parameters. High-quality voice cloning: state-of-the-art performance in speaker similarity, intelligibility, and naturalness. Multi-lingual: support Chinese and English.
#3 Winner of Best Use of Zoom API at Stanford TreeHacks 2025! An AI-powered meeting assistant that captures video, audio and textual context from Zoom calls using multimodal RAG. WhisperVoice is a ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results