Hugging Face BlogSep 23, 2025, 12:00 AMimportant 80

Smol2Operator:用於電腦操作(Computer Use)的輕量級 GUI 代理後訓練指南與模型

Original: Smol2Operator: Post-Training GUI Agents for Computer Use

### Background and Challenge: The Rise of Local "Computer Use" With Anthropic's introduction of Computer Use and the development of various…

Hugging Face 發表 Smol2Operator,這是一套針對「電腦操作(Computer Use)」設計的後訓練 GUI 代理方案。基於輕量級視覺語言模型(如 SmolVLM),透過特定的監督微調(SFT)與強化學習,使其能精準識別螢幕元素並執行點擊、輸入等操作。此項目開源了模型權重與訓練方法,讓開發者能在消費級硬體上部署隱私安全、低延遲的本地 GUI 代理。

### Background and Challenge: The Rise of Local "Computer Use"

Full summary

Free shows the 3-line summary; Pro unlocks the full deep summary (~300 words) so you never have to click through.

See Pro plans →

Want the original English / full article?

Read on Hugging Face Blog →

Summaries are AI-generated; the original article is authoritative.