Google DeepMind has released Gemini 3.5 Live Translate, bringing near real-time, natural-sounding voice translation to three major Google platforms. The feature integrates into Google AI Studio for developers, Google Translate for general users, and Google Meet for remote collaboration. The emphasis on naturalness, not just speed, marks a meaningful step forward for AI-powered multilingual communication.
Cohere has partnered with Mila, the Quebec AI Institute, to improve the representation of Quebec French (Québécois) and its cultural nuances in AI. The collaboration aims to address the European French bias in current models by leveraging Cohere's multilingual capabilities and Mila's research expertise. This initiative will help deliver more culturally accurate AI solutions for Quebec's public and private sectors.
This page aggregates all technology-focused articles on the Cohere blog. Cohere is an enterprise-focused AI company, and its technical content primarily covers its Command LLM family, its Embed and Rerank models, and practical RAG implementation guides. It serves as a key resource for developers and enterprise architects tracking Cohere's technical evolution.
Cohere has partnered with RWS, a global leader in translation and localization services, to deliver high-performance AI language intelligence for enterprises. The collaboration integrates Cohere's multilingual models (such as Command R) into RWS's platforms to provide culturally accurate translations. The partnership focuses on secure, enterprise-grade deployment and advanced multilingual Retrieval-Augmented Generation (RAG).
Mistral AI introduces Voxtral, a speech understanding model family with 24B and 3B variants under Apache 2.0. The models support long-context transcription, audio Q&A, summarization, multilingual detection, and function calling from voice. Mistral says Voxtral is competitive across transcription and audio understanding benchmarks, with API access starting at $0.001 per minute and local downloads available on Hugging Face.
Mistral AI introduced Voxtral TTS, its first text-to-speech model, focused on realistic multilingual voice generation. The 4B-parameter model supports nine languages, quick voice adaptation from short references, and low-latency streaming for voice agents. Mistral says human evaluations show stronger naturalness than ElevenLabs Flash v2.5, with API access, Studio testing, Le Chat access, and open weights on Hugging Face.
Mistral AI introduced Voxtral TTS, its first text-to-speech model, targeting natural multilingual voice generation across nine languages. The 4B-parameter model supports voice adaptation from short references, emotional expressiveness, dialect handling, and low-latency streaming. It is available through the API, Mistral Studio, and Le Chat, with open weights on Hugging Face under a non-commercial CC BY-NC 4.0 license.
Harvey and ElevenLabs announced a partnership to bring ElevenLabs Text to Speech and Speech to Text into Harvey’s legal AI platform. The first phase will let Harvey deliver spoken answers in almost any language or dialect. Planned future features include multilingual voice translation, voice mode, spoken trial simulations, tone customization, and related voice capabilities.
ElevenLabs introduced Scribe v2 Realtime, a low-latency speech-to-text model built for live transcription, voice agents, meeting assistants, and real-time captions. The company says it transcribes in under 150 ms across several major languages and supports 90 languages in total. Key features include automatic language detection, voice activity detection (VAD), manual commit, text conditioning, multiple audio formats, API access, ElevenLabs Agents integration, and enterprise compliance options.
Based only on the title, this ElevenLabs Blog post likely discusses multilingual diplomacy during Poland’s presidency of the Council of the EU. It may involve voice, translation, or audio workflows, but the original text is unavailable, so specific claims cannot be verified. The main signal is that AI voice tools are being positioned for public-sector and international communication use cases.