Spread Your Wings: Falcon 180B, with 180 Billion Parameters, Is Officially Released
Original: Spread Your Wings: Falcon 180B is here
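The UAE's Technology Innovation Institute (TII) has released Falcon 180B, a new open-source large language model with 180 billion parameters, trained on 3.5 trillion tokens from the RefinedWeb dataset. The model ranks near the top of the Hugging Face Open LLM Leaderboard, outperforming LLaMA 2 70B and approaching Google's PaLM-2. Its sheer size, however, places very high demands on hardware: inference requires at least 640GB of GPU memory (roughly eight A100 80GB GPUs).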
The Technology Innovation Institute (TII) in Abu Dhabi, UAE has officially released Falcon 180B, currently the largest openly accessible large language model on Hugging Face. The model has 180 billion parameters and was trained on 3.5 trillion tokens from the RefinedWeb dataset, another major milestone for the open-source AI community.
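To put the 640GB hardware figure in context, the sketch below estimates how much memory the weights alone occupy at different numeric precisions. The parameter count and the 80GB per-GPU capacity come from the summary; the bytes-per-parameter table and both helper functions are illustrative assumptions, and the estimate ignores activations, the KV cache, and framework overhead.

```python
import math

PARAMS = 180e9        # parameter count stated in the summary
GPU_MEMORY_GB = 80    # capacity of one A100 80GB

# Assumed storage width per parameter (standard sizes, not taken from the article).
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "int8": 1, "int4": 0.5}


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Memory for the weights only, in decimal gigabytes."""
    return params * bytes_per_param / 1e9


def min_gpus_for_weights(params: float, bytes_per_param: float,
                         gpu_memory_gb: float = GPU_MEMORY_GB) -> int:
    """Smallest number of GPUs whose combined memory holds the weights."""
    return math.ceil(weight_memory_gb(params, bytes_per_param) / gpu_memory_gb)


if __name__ == "__main__":
    for dtype, width in BYTES_PER_PARAM.items():
        gb = weight_memory_gb(PARAMS, width)
        gpus = min_gpus_for_weights(PARAMS, width)
        print(f"{dtype:>8}: {gb:6.0f} GB of weights, at least {gpus} x A100 80GB")
```

At 16-bit precision the weights come to about 360GB, which already exceeds four 80GB cards. The quoted 640GB (eight cards) is larger than this lower bound, presumably because real inference also needs working memory beyond the weights and eight-GPU nodes are a common configuration.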
Original article: Hugging Face Blog. Summaries are AI-generated; the original article is authoritative.