Latest in AI

Showing:fine-tuningResearchersClear ×

🔥 Trending today

anthropic6 export-controls4 model-access3 amazon3 national-security2 open-source2 ai-regulation2 government-policy2 enterprise-ai2 compliance2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

微調 olmOCR 打造高保真度 OCR 引擎★ 75
Hugging Face Blog418 days agoTutorial
### Background With the proliferation of vision-language models (VLMs), using VLMs for document OCR (e.g., converting PDFs to Markdown) has become mainstream…
Hugging Face 經典 NLP 課程正式轉型為 LLM 課程：迎向大語言模型時代的全面升級★ 85
Hugging Face Blog437 days agoTutorial
Hugging Face's "NLP Course" has long been a must-read classic for developers and researchers worldwide looking to enter the fields of Transformers and natural…
使用 Sentence Transformers 訓練與微調 Reranker 重排模型教學★ 80
Hugging Face Blog445 days agoTutorial
When building RAG (Retrieval-Augmented Generation) systems, relying solely on vector embeddings for semantic search is often not precise enough. To improve…
Open R1 第三次更新：Hugging Face 釋出開源推理模型與 GRPO 訓練優化細節★ 85
Hugging Face Blog459 days agoRelease
Since its launch, Hugging Face's Open R1 project has been dedicated to replicating the reasoning capabilities of DeepSeek-R1 in a fully open-source manner. In…
Hugging Face 釋出 vid_ds_scripts：一站式構建影片生成高品質資料集★ 75
Hugging Face Blog487 days agoNew Tool
With the rise of open-source video generation models such as LTX-Video, HunyuanVideo, and CogVideoX, building high-quality training datasets has become the…
如何在 AWS 上部署與微調 DeepSeek 模型：Hugging Face 官方指南★ 85
Hugging Face Blog500 days agoTutorial
As DeepSeek-R1 swept through the AI landscape on the strength of its powerful reasoning capabilities, how to safely and efficiently deploy and fine-tune these…
Hugging Face 推出 Synthetic Data Generator：用自然語言輕鬆構建 AI 訓練資料集★ 82
Hugging Face Blog545 days agoNew Tool
Hugging Face launched a brand-new "Synthetic Data Generator" in December 2024 — a web-based, no-code tool designed to allow anyone to create high-quality AI…
投資於效能：利用 LLM 洞察微調小型模型 — CFM 案例研究★ 75
Hugging Face Blog558 days agoBusiness
This case study from Hugging Face details how quantitative asset management firm Capital Fund Management (CFM) has optimized its investment and research…
Replicate 大幅提升 FLUX 微調速度，並將優化技術開源★ 75
Replicate Blog565 days agoRelease
The AI cloud hosting platform Replicate has announced a major fine-tuning speed optimization for FLUX.1, currently the most popular open-source image…
Argilla 2.4 發布：在 Hugging Face Hub 上免程式碼輕鬆構建微調與評估數據集★ 75
Hugging Face Blog587 days agoRelease
The open-source data curation and annotation platform Argilla has officially released version 2.4, with the core of this update being deep integration with…
Llama 3.2 正式支援 Keras：跨框架輕鬆微調與部署 Meta 最新輕量級模型★ 75
Hugging Face Blog601 days agoRelease
Meta's Llama 3.2 release includes lightweight 1B and 3B text models designed specifically for edge computing and mobile devices. These models have now been…
修正梯度累積：解決 LLM 微調中常被忽視的數學偏差★ 85
Hugging Face Blog606 days agoTutorial
### The Mathematical Flaw in Traditional Gradient Accumulation Gradient accumulation is an extremely common technique in deep learning. When VRAM is limited…
微調 LLM 至 1.58-bit：讓極限模型量化變得簡單★ 85
Hugging Face Blog634 days agoTutorial
The deployment of large language models (LLMs) has long faced a dual bottleneck of VRAM capacity and memory bandwidth. Microsoft previously introduced the…
透過 Flash Attention 2 的 Packing 技術提升 Hugging Face 訓練效率★ 80
Hugging Face Blog662 days agoTutorial
When fine-tuning or pre-training large language models (LLMs), the sequence lengths of input data are typically uneven. The traditional approach is to use…
Replicate Intelligence #11：微調 FLUX.1、生成式影像遊戲與元宇宙的新願景★ 75
Replicate Blog667 days agoRelease
This edition of Replicate Intelligence #11 compiles major recent technical breakthroughs and application trends in the generative AI space, focusing primarily…
LAVE：在 Docmatix 上使用 LLM 進行零樣本 VQA 評估——我們還需要微調嗎？★ 75
Hugging Face Blog689 days agoPaper
### Background and Challenges Document Visual Question Answering (DocVQA) is an important application of multimodal AI, requiring models to simultaneously…
Meta 推出 Llama 3.1：405B、70B 與 8B 旗艦開源模型，支援多語言與 128K 超長上下文★ 95
Hugging Face Blog691 days agoRelease
Meta's Llama 3.1 represents a major milestone in the open-source AI landscape. The most notable model is the 405B (405 billion parameter) version — the first…
如何利用 distilabel 打造 Argilla 2.0 專屬聊天機器人★ 75
Hugging Face Blog698 days agoTutorial
In the AI field, quickly building a chatbot that can accurately answer questions about a specific domain or newly released software has always been a major…
Replicate Intelligence #7：資料整理與資料生成的重要性
Replicate Blog702 days agoCommentary
In the current wave of generative AI, the industry's attention is gradually shifting from "fine-tuning model architectures" to "improving data quality." Issue…
NuminaMath 如何贏得首屆 AIMO 進步獎（AI 數學奧林匹亞）並宣佈完整開源★ 80
Hugging Face Blog703 days agoRelease
### Background and Achievement The AI Mathematical Olympiad (AIMO) Progress Prize aims to advance AI models capable of solving Olympiad-level mathematical…
視覺語言模型（VLM）的偏好最佳化指南：使用 TRL 進行 DPO 微調★ 75
Hugging Face Blog704 days agoTutorial
As vision-language models (VLMs) are increasingly applied to multimodal tasks, how to make these models produce outputs that better align with human…
Hugging Face 推出全新資料集搜尋與篩選功能，大幅提升數據檢索效率★ 70
Hugging Face Blog706 days agoRelease
Hugging Face's official blog announced in July 2024 the launch of new "Dataset Search and Filtering Features," aimed at addressing the pain point of precisely…
微調 Microsoft Florence-2：微軟頂尖視覺語言模型實戰指南★ 80
Hugging Face Blog720 days agoTutorial
Microsoft open-sourced Florence-2 in June 2024 — a vision-language model (VLM) based on a sequence-to-sequence architecture. Despite its compact size (the Base…
NVIDIA H100 GPU 即將登陸 Replicate：支援更快速的模型推理與訓練
Replicate Blog732 days agoRelease
The official blog of Replicate, the popular AI model hosting and deployment platform, has announced that NVIDIA H100 Tensor Core GPUs will soon be officially…
Hugging Face 推出 RLOO 演算法：降低記憶體消耗，讓強化學習重回 RLHF 主流★ 80
Hugging Face Blog732 days agoRelease
In recent years, methods such as Direct Preference Optimization (DPO) have become mainstream for large language model (LLM) alignment, as they eliminate the…
Diffusers 正式支援 Stable Diffusion 3：更強大的圖像生成與記憶體優化★ 80
Hugging Face Blog732 days agoRelease
Hugging Face's official blog announced that its diffusers library now officially supports Stable Diffusion 3 (SD3), the latest release from Stability AI. SD3…
Replicate Intelligence #3：Garden State Llama、實用 LLM 指南與即時影像生成
Replicate Blog737 days agoCommentary
This issue of Replicate Intelligence #3 brings curated content on three core themes for developers and AI enthusiasts: 1. **Garden State Llama**: This is a…
使用 Sentence Transformers 訓練與微調嵌入模型 (Embedding Models)★ 80
Hugging Face Blog747 days agoRelease
The official Hugging Face blog introduces a major update to the Sentence Transformers library (v3.0), centered on the launch of the new…
Google 推出 PaliGemma：結合 SigLIP 與 Gemma 的開源視覺語言模型★ 80
Hugging Face Blog761 days agoRelease
Google has officially launched PaliGemma, a powerful yet lightweight open-source Vision-Language Model (VLM). The release of PaliGemma represents a significant…
StarCoder2-Instruct：完全透明且具備寬鬆授權的程式碼生成自我對齊技術★ 75
Hugging Face Blog776 days agoRelease
### Background and Challenges In the field of code generation, instruction tuning is the key to improving a model's practical utility and alignment with human…

← PreviousPage 2Next →

Latest in AI

微調 olmOCR 打造高保真度 OCR 引擎★ 75

Hugging Face 經典 NLP 課程正式轉型為 LLM 課程：迎向大語言模型時代的全面升級★ 85

使用 Sentence Transformers 訓練與微調 Reranker 重排模型教學★ 80

Open R1 第三次更新：Hugging Face 釋出開源推理模型與 GRPO 訓練優化細節★ 85

Hugging Face 釋出 vid_ds_scripts：一站式構建影片生成高品質資料集★ 75

如何在 AWS 上部署與微調 DeepSeek 模型：Hugging Face 官方指南★ 85

Hugging Face 推出 Synthetic Data Generator：用自然語言輕鬆構建 AI 訓練資料集★ 82

投資於效能：利用 LLM 洞察微調小型模型 — CFM 案例研究★ 75

Replicate 大幅提升 FLUX 微調速度，並將優化技術開源★ 75

Argilla 2.4 發布：在 Hugging Face Hub 上免程式碼輕鬆構建微調與評估數據集★ 75

Llama 3.2 正式支援 Keras：跨框架輕鬆微調與部署 Meta 最新輕量級模型★ 75

修正梯度累積：解決 LLM 微調中常被忽視的數學偏差★ 85

微調 LLM 至 1.58-bit：讓極限模型量化變得簡單★ 85

透過 Flash Attention 2 的 Packing 技術提升 Hugging Face 訓練效率★ 80

Replicate Intelligence #11：微調 FLUX.1、生成式影像遊戲與元宇宙的新願景★ 75

LAVE：在 Docmatix 上使用 LLM 進行零樣本 VQA 評估——我們還需要微調嗎？★ 75

Meta 推出 Llama 3.1：405B、70B 與 8B 旗艦開源模型，支援多語言與 128K 超長上下文★ 95

如何利用 distilabel 打造 Argilla 2.0 專屬聊天機器人★ 75

Replicate Intelligence #7：資料整理與資料生成的重要性

NuminaMath 如何贏得首屆 AIMO 進步獎（AI 數學奧林匹亞）並宣佈完整開源★ 80

視覺語言模型（VLM）的偏好最佳化指南：使用 TRL 進行 DPO 微調★ 75

Hugging Face 推出全新資料集搜尋與篩選功能，大幅提升數據檢索效率★ 70

微調 Microsoft Florence-2：微軟頂尖視覺語言模型實戰指南★ 80

NVIDIA H100 GPU 即將登陸 Replicate：支援更快速的模型推理與訓練

Hugging Face 推出 RLOO 演算法：降低記憶體消耗，讓強化學習重回 RLHF 主流★ 80

Diffusers 正式支援 Stable Diffusion 3：更強大的圖像生成與記憶體優化★ 80

Replicate Intelligence #3：Garden State Llama、實用 LLM 指南與即時影像生成

使用 Sentence Transformers 訓練與微調嵌入模型 (Embedding Models)★ 80

Google 推出 PaliGemma：結合 SigLIP 與 Gemma 的開源視覺語言模型★ 80

StarCoder2-Instruct：完全透明且具備寬鬆授權的程式碼生成自我對齊技術★ 75