Latest in AI

Showing:fine-tuningClear ×

🔥 Trending today

anthropic6 export-controls4 model-access3 amazon3 national-security2 open-source2 ai-regulation2 government-policy2 enterprise-ai2 compliance2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Hugging Face 推出 RLOO 演算法：降低記憶體消耗，讓強化學習重回 RLHF 主流★ 80
Hugging Face Blog732 days agoRelease
In recent years, methods such as Direct Preference Optimization (DPO) have become mainstream for large language model (LLM) alignment, as they eliminate the…
NVIDIA H100 GPU 即將登陸 Replicate：支援更快速的模型推理與訓練
Replicate Blog732 days agoRelease
The official blog of Replicate, the popular AI model hosting and deployment platform, has announced that NVIDIA H100 Tensor Core GPUs will soon be officially…
Diffusers 正式支援 Stable Diffusion 3：更強大的圖像生成與記憶體優化★ 80
Hugging Face Blog732 days agoRelease
Hugging Face's official blog announced that its diffusers library now officially supports Stable Diffusion 3 (SD3), the latest release from Stability AI. SD3…
Replicate Intelligence #3：Garden State Llama、實用 LLM 指南與即時影像生成
Replicate Blog737 days agoCommentary
This issue of Replicate Intelligence #3 brings curated content on three core themes for developers and AI enthusiasts: 1. **Garden State Llama**: This is a…
使用 Sentence Transformers 訓練與微調嵌入模型 (Embedding Models)★ 80
Hugging Face Blog747 days agoRelease
The official Hugging Face blog introduces a major update to the Sentence Transformers library (v3.0), centered on the launch of the new…
Google 推出 PaliGemma：結合 SigLIP 與 Gemma 的開源視覺語言模型★ 80
Hugging Face Blog761 days agoRelease
Google has officially launched PaliGemma, a powerful yet lightweight open-source Vision-Language Model (VLM). The release of PaliGemma represents a significant…
StarCoder2-Instruct：完全透明且具備寬鬆授權的程式碼生成自我對齊技術★ 75
Hugging Face Blog776 days agoRelease
### Background and Challenges In the field of code generation, instruction tuning is the key to improving a model's practical utility and alignment with human…
歡迎 Llama 3：Meta 全新開源大型語言模型正式登陸 Hugging Face★ 95
Hugging Face Blog787 days agoRelease
Meta officially released Llama 3, the next generation of its open-source large language models, on April 18, 2024. The initial release includes two parameter…
Ryght 攜手 Hugging Face 專家支援，賦能醫療與生命科學領域的 AI 轉型之旅
Hugging Face Blog789 days agoBusiness
This case study details how biomedical AI startup Ryght leveraged Hugging Face's Expert Support service to overcome the many challenges of deploying generative…
Hugging Face 推出 Idefics2：強大的 8B 開源視覺語言模型★ 80
Hugging Face Blog790 days agoRelease
Hugging Face has announced the launch of Idefics2, the next generation of its open-source Vision Language Model (VLM). With 8 billion (8B) parameters, this…
GaLore：在消費級硬體上訓練大型語言模型的突破性技術★ 85
Hugging Face Blog816 days agoRelease
As the parameter counts of large language models (LLMs) have skyrocketed, the hardware requirements for training and fine-tuning these models have risen…
在 NVIDIA DGX Cloud 上輕鬆使用 H100 GPU 訓練 Hugging Face 模型★ 75
Hugging Face Blog818 days agoRelease
Hugging Face has announced a deep partnership with NVIDIA to directly integrate NVIDIA DGX Cloud services into the Hugging Face platform. This collaboration…
在 Hugging Face 中微調 Gemma 模型★ 80
Hugging Face Blog842 days agoTutorial
After Google released the Gemma family of open-source models (including 2B and 7B parameter versions), Hugging Face promptly published this practical…
歡迎 Gemma：Google 全新開源大語言模型登陸 Hugging Face★ 85
Hugging Face Blog844 days agoRelease
Google has officially released a new family of open-source large language models called "Gemma" — a series of lightweight, state-of-the-art open-source models…
使用 🤗 Transformers 微調 W2V2-BERT 以進行低資源語音辨識 (ASR)★ 75
Hugging Face Blog877 days agoTutorial
This technical blog post from Hugging Face provides a detailed walkthrough of how to use the `transformers` library to fine-tune Meta's open-source W2V2-BERT…
使用直接偏好最佳化 (DPO) 方法對 LLM 進行偏好微調 (Preference Tuning)★ 80
Hugging Face Blog878 days agoTutorial
This technical blog post from Hugging Face takes an in-depth look at the latest techniques in "preference tuning," with a particular focus on **Direct…
使用 Unsloth 與 🤗 TRL 讓 LLM 微調速度提升 2 倍★ 80
Hugging Face Blog886 days agoRelease
Hugging Face's official blog announced a partnership with the Unsloth team to integrate Unsloth's efficient fine-tuning technology directly into Hugging Face's…
全世界的 LoRA 訓練腳本聯合起來！Hugging Face 推出全新 SDXL LoRA 進階訓練腳本★ 75
Hugging Face Blog894 days agoNew Tool
Hugging Face's official blog published a post titled "LoRA training scripts of the world, unite!" announcing the release of a powerful new advanced SDXL LoRA…
使用開源模型複製你的聲音：Replicate 推出 RVC 微調 API★ 75
Replicate Blog921 days agoRelease
AI cloud hosting platform Replicate has officially announced support for fine-tuning with RVC (Retrieval-based Voice Conversion). This new feature allows…
比較大語言模型性能：深入探討使用 LoRA 微調 RoBERTa、Llama 2 與 Mistral 進行災難推特分析★ 75
Hugging Face Blog950 days agoTutorial
This Hugging Face blog post takes an in-depth look at how to use LoRA (Low-Rank Adaptation) to fine-tune three models of different architectures and scales for…
Personal Copilot：訓練專屬於你的程式碼助手★ 75
Hugging Face Blog961 days agoTutorial
In everyday development, tools like GitHub Copilot dramatically improve productivity, but for enterprises or individual developers, general-purpose models may…
深入剖析：使用 PPO 進行 RLHF 的 N 個關鍵實作細節★ 85
Hugging Face Blog964 days agoTutorial
This technical blog post from Hugging Face takes an in-depth look at the critical "implementation details" that are routinely glossed over in academic papers…
在 Replicate 上微調 MusicGen，輕鬆生成任何風格的音樂
Replicate Blog975 days agoNew Tool
AI cloud deployment and runtime platform Replicate has announced official support for fine-tuning Meta's open-source music generation model MusicGen. This new…
使用 TRL 透過 DDPO 微調 Stable Diffusion 模型★ 75
Hugging Face Blog989 days agoRelease
Hugging Face published a blog post introducing how to use the DDPO (Denoising Diffusion Policy Optimization) algorithm within the TRL (Transformer…
非工程師指南：如何訓練專屬的 LLaMA 2 聊天機器人
Hugging Face Blog990 days agoTutorial
This official guide from Hugging Face is designed for readers without a technical background. It provides a detailed walkthrough of how to use Hugging Face's…
使用 PyTorch FSDP 高效微調 Llama 2 70B：解決 CPU 記憶體不足的實務指南★ 72
Hugging Face Blog1,005 days agoTutorial
When fine-tuning massively large open-source models like Llama 2 70B — with its 70 billion parameters — developers frequently encounter a bottleneck that goes…
Hugging Face SafeCoder 對決閉源程式碼助手：企業級私有化部署的安全性優勢★ 70
Hugging Face Blog1,007 days agoRelease
In today's software development workflows, AI coding assistants have become a critical tool for boosting developer productivity. However, for many enterprises…
Replicate 大幅優化微調模型冷啟動時間，現在只需不到一秒即可完成載入★ 75
Replicate Blog1,012 days agoRelease
AI cloud hosting platform Replicate has announced a major technical breakthrough for fine-tuned models: the "cold boot" time for fine-tuned models has been…
Hugging Face 推出 SafeCoder：專為企業設計的自託管安全程式碼生成助手★ 75
Hugging Face Blog1,027 days agoRelease
### Background and Enterprise Pain Points The widespread adoption of AI coding assistants like GitHub Copilot has significantly boosted developer productivity…
使用 DPO 微調 Llama 2：Hugging Face TRL 實作指南★ 80
Hugging Face Blog1,041 days agoTutorial
### Background and Pain Points Traditional RLHF (Reinforcement Learning from Human Feedback), while achieving enormous success with models like ChatGPT…

← PreviousPage 3Next →

Latest in AI

Hugging Face 推出 RLOO 演算法：降低記憶體消耗，讓強化學習重回 RLHF 主流★ 80

NVIDIA H100 GPU 即將登陸 Replicate：支援更快速的模型推理與訓練

Diffusers 正式支援 Stable Diffusion 3：更強大的圖像生成與記憶體優化★ 80

Replicate Intelligence #3：Garden State Llama、實用 LLM 指南與即時影像生成

使用 Sentence Transformers 訓練與微調嵌入模型 (Embedding Models)★ 80

Google 推出 PaliGemma：結合 SigLIP 與 Gemma 的開源視覺語言模型★ 80

StarCoder2-Instruct：完全透明且具備寬鬆授權的程式碼生成自我對齊技術★ 75

歡迎 Llama 3：Meta 全新開源大型語言模型正式登陸 Hugging Face★ 95

Ryght 攜手 Hugging Face 專家支援，賦能醫療與生命科學領域的 AI 轉型之旅

Hugging Face 推出 Idefics2：強大的 8B 開源視覺語言模型★ 80

GaLore：在消費級硬體上訓練大型語言模型的突破性技術★ 85

在 NVIDIA DGX Cloud 上輕鬆使用 H100 GPU 訓練 Hugging Face 模型★ 75

在 Hugging Face 中微調 Gemma 模型★ 80

歡迎 Gemma：Google 全新開源大語言模型登陸 Hugging Face★ 85

使用 🤗 Transformers 微調 W2V2-BERT 以進行低資源語音辨識 (ASR)★ 75

使用直接偏好最佳化 (DPO) 方法對 LLM 進行偏好微調 (Preference Tuning)★ 80

使用 Unsloth 與 🤗 TRL 讓 LLM 微調速度提升 2 倍★ 80

全世界的 LoRA 訓練腳本聯合起來！Hugging Face 推出全新 SDXL LoRA 進階訓練腳本★ 75

使用開源模型複製你的聲音：Replicate 推出 RVC 微調 API★ 75

比較大語言模型性能：深入探討使用 LoRA 微調 RoBERTa、Llama 2 與 Mistral 進行災難推特分析★ 75

Personal Copilot：訓練專屬於你的程式碼助手★ 75

深入剖析：使用 PPO 進行 RLHF 的 N 個關鍵實作細節★ 85

在 Replicate 上微調 MusicGen，輕鬆生成任何風格的音樂

使用 TRL 透過 DDPO 微調 Stable Diffusion 模型★ 75

非工程師指南：如何訓練專屬的 LLaMA 2 聊天機器人

使用 PyTorch FSDP 高效微調 Llama 2 70B：解決 CPU 記憶體不足的實務指南★ 72

Hugging Face SafeCoder 對決閉源程式碼助手：企業級私有化部署的安全性優勢★ 70

Replicate 大幅優化微調模型冷啟動時間，現在只需不到一秒即可完成載入★ 75

Hugging Face 推出 SafeCoder：專為企業設計的自託管安全程式碼生成助手★ 75

使用 DPO 微調 Llama 2：Hugging Face TRL 實作指南★ 80