Latest in AI

Showing:fine-tuningResearchersClear ×

🔥 Trending today

anthropic5 open-source3 amazon3 export-controls3 national-security2 model-access2 ai-regulation2 government-policy2 geopolitics2 privacy2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

歡迎 Llama 3：Meta 全新開源大型語言模型正式登陸 Hugging Face★ 95
Hugging Face Blog787 days agoRelease
Meta officially released Llama 3, the next generation of its open-source large language models, on April 18, 2024. The initial release includes two parameter…
Ryght 攜手 Hugging Face 專家支援，賦能醫療與生命科學領域的 AI 轉型之旅
Hugging Face Blog789 days agoBusiness
This case study details how biomedical AI startup Ryght leveraged Hugging Face's Expert Support service to overcome the many challenges of deploying generative…
Hugging Face 推出 Idefics2：強大的 8B 開源視覺語言模型★ 80
Hugging Face Blog790 days agoRelease
Hugging Face has announced the launch of Idefics2, the next generation of its open-source Vision Language Model (VLM). With 8 billion (8B) parameters, this…
GaLore：在消費級硬體上訓練大型語言模型的突破性技術★ 85
Hugging Face Blog816 days agoRelease
As the parameter counts of large language models (LLMs) have skyrocketed, the hardware requirements for training and fine-tuning these models have risen…
在 NVIDIA DGX Cloud 上輕鬆使用 H100 GPU 訓練 Hugging Face 模型★ 75
Hugging Face Blog818 days agoRelease
Hugging Face has announced a deep partnership with NVIDIA to directly integrate NVIDIA DGX Cloud services into the Hugging Face platform. This collaboration…
在 Hugging Face 中微調 Gemma 模型★ 80
Hugging Face Blog842 days agoTutorial
After Google released the Gemma family of open-source models (including 2B and 7B parameter versions), Hugging Face promptly published this practical…
歡迎 Gemma：Google 全新開源大語言模型登陸 Hugging Face★ 85
Hugging Face Blog844 days agoRelease
Google has officially released a new family of open-source large language models called "Gemma" — a series of lightweight, state-of-the-art open-source models…
使用 🤗 Transformers 微調 W2V2-BERT 以進行低資源語音辨識 (ASR)★ 75
Hugging Face Blog877 days agoTutorial
This technical blog post from Hugging Face provides a detailed walkthrough of how to use the `transformers` library to fine-tune Meta's open-source W2V2-BERT…
使用直接偏好最佳化 (DPO) 方法對 LLM 進行偏好微調 (Preference Tuning)★ 80
Hugging Face Blog878 days agoTutorial
This technical blog post from Hugging Face takes an in-depth look at the latest techniques in "preference tuning," with a particular focus on **Direct…
使用 Unsloth 與 🤗 TRL 讓 LLM 微調速度提升 2 倍★ 80
Hugging Face Blog886 days agoRelease
Hugging Face's official blog announced a partnership with the Unsloth team to integrate Unsloth's efficient fine-tuning technology directly into Hugging Face's…
比較大語言模型性能：深入探討使用 LoRA 微調 RoBERTa、Llama 2 與 Mistral 進行災難推特分析★ 75
Hugging Face Blog950 days agoTutorial
This Hugging Face blog post takes an in-depth look at how to use LoRA (Low-Rank Adaptation) to fine-tune three models of different architectures and scales for…
深入剖析：使用 PPO 進行 RLHF 的 N 個關鍵實作細節★ 85
Hugging Face Blog964 days agoTutorial
This technical blog post from Hugging Face takes an in-depth look at the critical "implementation details" that are routinely glossed over in academic papers…
在 Replicate 上微調 MusicGen，輕鬆生成任何風格的音樂
Replicate Blog975 days agoNew Tool
AI cloud deployment and runtime platform Replicate has announced official support for fine-tuning Meta's open-source music generation model MusicGen. This new…
使用 TRL 透過 DDPO 微調 Stable Diffusion 模型★ 75
Hugging Face Blog989 days agoRelease
Hugging Face published a blog post introducing how to use the DDPO (Denoising Diffusion Policy Optimization) algorithm within the TRL (Transformer…
使用 PyTorch FSDP 高效微調 Llama 2 70B：解決 CPU 記憶體不足的實務指南★ 72
Hugging Face Blog1,005 days agoTutorial
When fine-tuning massively large open-source models like Llama 2 70B — with its 70 billion parameters — developers frequently encounter a bottleneck that goes…
Hugging Face SafeCoder 對決閉源程式碼助手：企業級私有化部署的安全性優勢★ 70
Hugging Face Blog1,007 days agoRelease
In today's software development workflows, AI coding assistants have become a critical tool for boosting developer productivity. However, for many enterprises…
Replicate 大幅優化微調模型冷啟動時間，現在只需不到一秒即可完成載入★ 75
Replicate Blog1,012 days agoRelease
AI cloud hosting platform Replicate has announced a major technical breakthrough for fine-tuned models: the "cold boot" time for fine-tuned models has been…
Hugging Face 推出 SafeCoder：專為企業設計的自託管安全程式碼生成助手★ 75
Hugging Face Blog1,027 days agoRelease
### Background and Enterprise Pain Points The widespread adoption of AI coding assistants like GitHub Copilot has significantly boosted developer productivity…
使用 DPO 微調 Llama 2：Hugging Face TRL 實作指南★ 80
Hugging Face Blog1,041 days agoTutorial
### Background and Pain Points Traditional RLHF (Reinforcement Learning from Human Feedback), while achieving enormous success with models like ChatGPT…
在 Replicate 上微調 Llama 2 模型教學
Replicate Blog1,060 days agoTutorial
The well-known cloud AI hosting platform Replicate has announced official support for fine-tuning Meta's open-source large language model Llama 2. This service…
Llama 2 正式登場！已在 Hugging Face 開源上架並全面支援生態系★ 95
Hugging Face Blog1,062 days agoRelease
Meta and Microsoft jointly announced Llama 2, a new generation of open-source large language models. Compared to the original Llama, Llama 2 increases training…
Hugging Face 的開源文本生成與 LLM 生態系全景指南★ 85
Hugging Face Blog1,063 days agoRelease
This official Hugging Face blog post systematically maps out the complete ecosystem it has built around open-source large language models (LLMs). As…
在 Intel CPU 上微調 Stable Diffusion 模型
Hugging Face Blog1,066 days agoTutorial
In the era of booming generative AI, fine-tuning large image generation models like Stable Diffusion has generally been considered the exclusive domain of…
在 Habana Gaudi2 上加速視覺語言模型：BridgeTower 實作指南
Hugging Face Blog1,081 days agoTutorial
This technical blog post from Hugging Face details how to accelerate the vision-language model (VLM) "BridgeTower" on Intel's Habana Gaudi2 deep learning…
微調 MMS Adapter 模型：為低資源語言打造專屬語音辨識 (ASR)★ 70
Hugging Face Blog1,091 days agoTutorial
Meta's MMS (Massively Multilingual Speech) project, released in 2023, extends speech technology to over 1,000 languages, covering automatic speech recognition…
Falcon 系列開源模型正式登陸 Hugging Face 生態系統★ 75
Hugging Face Blog1,105 days agoRelease
The Falcon series of large language models (including Falcon-40B and Falcon-7B), developed by Abu Dhabi's Technology Innovation Institute (TII), have…
Hugging Face 整合 bitsandbytes、4-bit 量化與 QLoRA，讓大型語言模型更親民★ 90
Hugging Face Blog1,117 days agoRelease
This official Hugging Face blog post introduces a deep integration with the `bitsandbytes` library, formally adding 4-bit quantization support to…
使用 InstructPix2Pix 對 Stable Diffusion 進行指令微調 (Instruction-tuning)★ 70
Hugging Face Blog1,118 days agoTutorial
This Hugging Face blog post provides an in-depth exploration of how to use InstructPix2Pix technology to apply instruction tuning to Stable Diffusion, enabling…
使用 StarCoder 打造程式助手：StarChat Alpha 正式推出
Hugging Face Blog1,132 days agoRelease
Hugging Face has announced the launch of StarChat Alpha, a conversational AI assistant designed specifically for programming. The model is based on StarCoder…
使用 TensorFlow 與 TPU 透過 🤗 Transformers 訓練語言模型★ 70
Hugging Face Blog1,144 days agoTutorial
This technical guide from Hugging Face provides a detailed walkthrough of how to efficiently train language models by combining TensorFlow, the Hugging Face…

← PreviousPage 3Next →

Latest in AI

歡迎 Llama 3：Meta 全新開源大型語言模型正式登陸 Hugging Face★ 95

Ryght 攜手 Hugging Face 專家支援，賦能醫療與生命科學領域的 AI 轉型之旅

Hugging Face 推出 Idefics2：強大的 8B 開源視覺語言模型★ 80

GaLore：在消費級硬體上訓練大型語言模型的突破性技術★ 85

在 NVIDIA DGX Cloud 上輕鬆使用 H100 GPU 訓練 Hugging Face 模型★ 75

在 Hugging Face 中微調 Gemma 模型★ 80

歡迎 Gemma：Google 全新開源大語言模型登陸 Hugging Face★ 85

使用 🤗 Transformers 微調 W2V2-BERT 以進行低資源語音辨識 (ASR)★ 75

使用直接偏好最佳化 (DPO) 方法對 LLM 進行偏好微調 (Preference Tuning)★ 80

使用 Unsloth 與 🤗 TRL 讓 LLM 微調速度提升 2 倍★ 80

比較大語言模型性能：深入探討使用 LoRA 微調 RoBERTa、Llama 2 與 Mistral 進行災難推特分析★ 75

深入剖析：使用 PPO 進行 RLHF 的 N 個關鍵實作細節★ 85

在 Replicate 上微調 MusicGen，輕鬆生成任何風格的音樂

使用 TRL 透過 DDPO 微調 Stable Diffusion 模型★ 75

使用 PyTorch FSDP 高效微調 Llama 2 70B：解決 CPU 記憶體不足的實務指南★ 72

Hugging Face SafeCoder 對決閉源程式碼助手：企業級私有化部署的安全性優勢★ 70

Replicate 大幅優化微調模型冷啟動時間，現在只需不到一秒即可完成載入★ 75

Hugging Face 推出 SafeCoder：專為企業設計的自託管安全程式碼生成助手★ 75

使用 DPO 微調 Llama 2：Hugging Face TRL 實作指南★ 80

在 Replicate 上微調 Llama 2 模型教學

Llama 2 正式登場！已在 Hugging Face 開源上架並全面支援生態系★ 95

Hugging Face 的開源文本生成與 LLM 生態系全景指南★ 85

在 Intel CPU 上微調 Stable Diffusion 模型

在 Habana Gaudi2 上加速視覺語言模型：BridgeTower 實作指南

微調 MMS Adapter 模型：為低資源語言打造專屬語音辨識 (ASR)★ 70

Falcon 系列開源模型正式登陸 Hugging Face 生態系統★ 75

Hugging Face 整合 bitsandbytes、4-bit 量化與 QLoRA，讓大型語言模型更親民★ 90

使用 InstructPix2Pix 對 Stable Diffusion 進行指令微調 (Instruction-tuning)★ 70

使用 StarCoder 打造程式助手：StarChat Alpha 正式推出

使用 TensorFlow 與 TPU 透過 🤗 Transformers 訓練語言模型★ 70