Latest in AI

Showing:ttsDevelopersClear ×

🔥 Trending today

anthropic4 open-source3 amazon3 ai-regulation2 government-policy2 export-controls2 geopolitics2 privacy2 python-packaging2 webassembly2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

TTS Benchmark Revamped with Objective Standards and Blind ELO Voting (46 Models)
r/LocalLLaMA top day5 days agoBenchmark
Reddit user UkieTechie has revamped their TTS benchmark platform with objective scoring standards and live blind voting, now covering 46 speech synthesis models. Hosted on Hugging Face Space, the arena lets users vote on audio quality without knowing the model name, generating a dynamic ELO leaderboard. The project is open-source on GitHub and welcomes community submissions of new models.
Community Discussion: Local Installation and Multilingual Training for Kokoro TTS
r/LocalLLaMA top day6 days agoCommentary
A LocalLLaMA subreddit post discusses challenges with Kokoro TTS's multilingual performance on cloud APIs. The author is seeking community advice on how to install Kokoro locally and train/fine-tune it for Brazilian Portuguese to achieve more natural-sounding speech.
Gemini 3.1 Flash TTS：具備精準控制力與表現力的下一代 AI 語音模型★ 80
Google DeepMind Blog60 days agoRelease
Google DeepMind has officially released its latest generation speech synthesis model, "Gemini 3.1 Flash TTS," designed to bring revolutionary expressiveness…
Hugging Face 推出 TTS Arena：用社群盲測群眾外包評測語音合成模型★ 75
Hugging Face Blog838 days agoNew Tool
Hugging Face recently announced the launch of "TTS Arena" (Text-to-Speech Arena), a brand-new open-source platform specifically designed for evaluating…
使用 🤗 Transformers 優化 Bark 語音生成模型★ 75
Hugging Face Blog1,040 days agoTutorial
Bark is an innovative text-to-audio model developed by the team at Suno. It can generate not only high-quality, multilingual speech, but also background music…
Microsoft SpeechT5 登陸 Hugging Face：語音合成、辨識與轉換的多功能統一模型★ 75
Hugging Face Blog1,222 days agoRelease
Microsoft's SpeechT5 model has been officially integrated into Hugging Face's Transformers library. This represents a significant advancement in the field of…