Hugging Face Blog | Jul 23, 2024, 12:00 AM

Meta Releases Llama 3.1: Flagship Open-Source 405B, 70B, and 8B Models with Multilingual Support and a 128K Long Context Window

Original: Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

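Meta has officially released the Llama 3.1 family: 8B, 70B, and the flagship 405B, the first model in the series able to rival top closed-source models. This release raises the context window to 128k and strengthens multilingual capabilities. Hugging Face has rolled out full ecosystem support alongside it, covering Transformers integration, TGI inference optimization, TRL fine-tuning, and FP8 quantization, which lowers the barrier to deploying the 405B model.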
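To make the FP8 point concrete, here is a rough back-of-the-envelope sketch of weight storage at 16-bit versus 8-bit precision. It assumes the nominal parameter counts (8B, 70B, 405B), 2 bytes per parameter for BF16 and 1 byte for FP8, and it ignores the KV cache, activations, and runtime overhead, so real deployments need more memory than these figures. The function name `weight_memory_gb` is illustrative and does not come from the original article.

```python
# Rough weight-memory estimate (assumption: nominal parameter counts,
# no KV cache, activations, or framework overhead included).
BYTES_PER_PARAM = {"bf16": 2, "fp8": 1}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Approximate storage for model weights in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for name, params in [("8B", 8e9), ("70B", 70e9), ("405B", 405e9)]:
        bf16 = weight_memory_gb(params, "bf16")
        fp8 = weight_memory_gb(params, "fp8")
        print(f"Llama 3.1 {name}: ~{bf16:.0f} GB in BF16, ~{fp8:.0f} GB in FP8")
```

Under these assumptions, halving the bytes per parameter halves the weight footprint, which is why 8-bit quantization matters most for the 405B model.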

Meta's Llama 3.1 is a major milestone for open-source AI. The most notable model is the 405B (405 billion parameter) version, the first open-source model capable of competing with top-tier commercial models such as GPT-4o and Claude 3.5 Sonnet in domains including commonsense reasoning, mathematics, and translation. Llama 3.1 also updates the existing 8B and 70B models. The upgrade has three major highlights: a 128k context window (up from the previous 8k limit), multilingual support (8 languages officially supported), and significantly improved tool use. Crucially, Meta revised its license terms to allow developers to use outputs from Llama 3.1 to train and improve other models, which should accelerate distillation and innovation within the open-source community.

Summaries are AI-generated; the original article is authoritative.