Replicate published their technical newsletter "Replicate Intelligence #5," with this issue focusing on major breakthroughs and real-world applications in the…
### Background In the current development of large language models (LLMs), high-quality alignment data (such as the preference data required for RLHF and DPO)…
In this case study, Prezi — the well-known company behind the non-linear presentation software of the same name — shares how it is embracing the "multimodal…
As large language models (LLMs) have made tremendous strides in code generation, the long-standing industry gold standard — the HumanEval benchmark — has…
Replicate has published its technical newsletter, Replicate Intelligence #4, summarizing recent major developments in the AI field as well as the latest…
In the era of large language models (LLMs), the VRAM of a single GPU is often insufficient to hold models with tens of billions of parameters. To overcome this…
The official blog of Replicate, the popular AI model hosting and deployment platform, has announced that NVIDIA H100 Tensor Core GPUs will soon be officially…
In recent years, methods such as Direct Preference Optimization (DPO) have become mainstream for large language model (LLM) alignment, as they eliminate the…
Hugging Face's official blog announced that its diffusers library now officially supports Stable Diffusion 3 (SD3), the latest release from Stability AI. SD3…
Hugging Face has announced the launch of a new Hugging Face Embedding container (Deep Learning Container, DLC) designed specifically for Amazon SageMaker. This…
Hugging Face's official blog published an article titled "Making sense of this mess," announcing a comprehensive redesign of the official documentation for its…
This issue of Replicate Intelligence #3 brings curated content on three core themes for developers and AI enthusiasts: 1. **Garden State Llama**: This is a…
Hugging Face has partnered with independent AI evaluation organization Artificial Analysis to officially launch the "Text to Image Leaderboard & Arena." This…
Hugging Face has announced the launch of "NPC-Playground," a 3D interactive sandbox environment designed to showcase and test non-player characters (NPCs)…
Hugging Face, in collaboration with Intel, has announced official support for "Assisted Generation" (also commonly known as Speculative Decoding) on Intel…
Hugging Face officially released a major security update announcement on May 31, 2024. The company stated that its security team had detected signs of…
Replicate's technical newsletter, Replicate Intelligence #2, takes a deep dive into three of the most hotly discussed trends in the open-source AI community…
This official Hugging Face blog post takes an in-depth look at how to benchmark Text Generation Inference (TGI), Hugging Face's open-source LLM inference and…
The official Hugging Face blog introduces a major update to the Sentence Transformers library (v3.0), centered on the launch of the new…
As large language models (LLMs) become increasingly prevalent in software development and automated workflows, their "dual-use" risks in the cybersecurity…
The Technology Innovation Institute (TII) of Abu Dhabi has officially released a new open-source model family on Hugging Face — Falcon 2 11B. This model, with…
This Replicate technical digest (Intelligence #1) compiles three of the most talked-about technical breakthroughs and open-source projects in the AI community…
AI model hosting platform Replicate published a security advisory on May 23, 2024, disclosing a "Shared Network Vulnerability" affecting its multi-tenant…
Hugging Face has announced official support for AWS Inferentia2 (Inf2) instances within its hosted Inference Endpoints service. This update gives developers…
During Microsoft Build 2024, Hugging Face announced a further strategic collaboration with Microsoft, aimed at providing developers with a more seamless…
Hugging Face and Dell Technologies have announced the launch of the "Dell Enterprise Hub," a new solution designed for enterprise on-premise AI deployment. As…
With the explosive growth of generative AI, demand for high-performance GPUs has reached an unprecedented level. To break hardware monopolies and reduce AI…
Hugging Face has officially announced the launch of the new "Spaces Dev Mode," an important feature designed to improve AI application development efficiency…
During the inference process of large language models (LLMs), the self-attention mechanism needs to store the Key and Value vectors of historical tokens (i.e…
Hugging Face has announced the launch of the "Open Arabic LLM Leaderboard," an important initiative aimed at advancing Arabic natural language processing (NLP)…