Replicate Blog, Jul 23, 2024, 12:00 AM

Run Meta Llama 3.1 405B with an API: A Guide to Cloud Deployment on Replicate

Original: Run Meta Llama 3.1 405B with an API


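Meta has released Llama 3.1 405B, its most powerful open-source model, and Replicate promptly announced full support for running it through an API. Developers can integrate the model through Replicate's cloud platform with very low latency and a single line of code, without provisioning expensive GPU infrastructure themselves. The service supports a 128k context length and offers features such as structured output, substantially lowering the barrier for enterprises and developers who want to use top-tier open-source AI.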

On July 23, 2024, Meta released Llama 3.1 405B, one of the most powerful open-source large language models available. On multiple benchmarks its performance is competitive with leading closed-source commercial models such as GPT-4o and Claude 3.5 Sonnet. To let developers try and integrate the model as quickly as possible, the cloud AI hosting platform Replicate launched API access to Llama 3.1 405B at the same time.
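As a rough illustration of what calling the model through Replicate might look like, here is a minimal sketch using the `replicate` Python client. The model identifier `meta/meta-llama-3.1-405b-instruct` and the input fields `max_tokens` and `temperature` are assumptions that should be checked against the model's page on Replicate. The helper names `build_input` and `generate` are hypothetical and exist only for this example.

```python
from typing import Any, Callable, Dict, Iterable, Optional

# Assumed model identifier; confirm it on the model's Replicate page.
MODEL = "meta/meta-llama-3.1-405b-instruct"


def build_input(prompt: str, max_tokens: int = 512,
                temperature: float = 0.6) -> Dict[str, Any]:
    """Assemble the input payload (field names are assumptions)."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    return {
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }


def generate(prompt: str,
             run: Optional[Callable[..., Iterable[str]]] = None) -> str:
    """Send the prompt to the hosted model and return the joined text.

    `run` can be injected for offline testing; by default the real
    client is imported lazily so the module loads without it installed.
    """
    if run is None:
        import replicate  # pip install replicate; reads REPLICATE_API_TOKEN
        run = replicate.run
    chunks = run(MODEL, input=build_input(prompt))
    # Language models on Replicate typically return text in pieces.
    return "".join(chunks)
```

With the `REPLICATE_API_TOKEN` environment variable set, `print(generate("Explain the 128k context window in one sentence."))` would send a single request to the hosted model. Keeping the payload builder separate from the network call makes it easy to check the request shape without spending API credits.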


Summaries are AI-generated; the original article is authoritative.