Latest in AI

Showing:ResearchersCohereClear ×

🔥 Trending today

anthropic7 export-controls4 model-access3 spacex3 amazon3 national-security2 open-source2 governance2 ai-policy2 ai-regulation2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Why MoE Models Benefit More from Speculative Decoding
Cohere Blog2 days agoBenchmark
Cohere analyzes why speculative decoding behaves differently on Mixture-of-Experts models than on dense LLMs. Its benchmarks show MoE speedups can peak at moderate batch sizes because sparse expert routing keeps verification bandwidth-bound. The post also finds that temporal expert overlap and fixed overhead amortization make multi-token verification cheaper than simple worst-case models predict.
Production-Ready W4A8: vLLM Integration and Quality Recovery Techniques
Cohere Blog2 days agoTutorial
Cohere’s post appears to explain how W4A8 quantization can be prepared for production inference through vLLM integration. From the title, the focus is likely on deployment mechanics and techniques for recovering model quality after aggressive quantization. Because no article body is available, specific benchmarks, supported models, implementation steps, and measured quality gains cannot be confirmed.
The Future of Work Debate Has an Evidence Problem
Cohere Blog3 days agoCommentary
Cohere’s post appears to frame the future-of-work debate as limited by weak or incomplete evidence. Based on the title alone, its likely focus is not a product announcement but a commentary on how claims about AI’s workplace impact should be evaluated. The central takeaway is that policymakers, employers, and researchers should avoid overconfident predictions without better data.
Cohere Releases North Mini Code: Open-Source Agentic Coding Model
r/LocalLLaMA top day4 days agoRelease
Cohere has released North Mini Code 1.0, its first open-source agentic coding model, under the permissive Apache 2.0 license. The model has 30 billion total parameters but activates only 3 billion at inference time, suggesting a sparse architecture optimized for efficiency. It scores 33.4 on the Artificial Analysis Coding Index, positioned as competitive among models of comparable size, and is available on Hugging Face.
Cohere Launches North Mini Code: A Lightweight Model for Code Tasks
Cohere Blog4 days agoRelease
Cohere has introduced North Mini Code, a smaller, code-specialized variant of its North model family designed for developer use cases. The mini model prioritizes low latency and cost efficiency while retaining strong code completion, debugging, and explanation capabilities. This follows the industry trend of pairing flagship models with lightweight alternatives for high-frequency API usage in enterprise and individual developer contexts.
Unsloth releases GGUF version of Cohere North-Mini-Code 1.0 (30B A3B MoE) on Hugging Face
r/LocalLLaMA top day4 days agoRelease
Unsloth uploaded a GGUF version of Cohere's North-Mini-Code 1.0 to Hugging Face, making local inference possible for this 30B A3B MoE coding-focused model. The poster links the release to llama.cpp PR #24260, suggesting new architecture support may be required. No benchmarks or test results have been shared yet; this is an early community resource post.
Releasing Cohere North Mini Code
r/LocalLLaMA top day5 days agoRelease
Cohere’s Jay Alammar announced the official release of North Mini Code after early community feedback from r/LocalLLaMA. Weights are available on Hugging Face, including an fp8 version, and the model can be tried for free through OpenCode. For vLLM deployment, Cohere recommends using vLLM main for now and installing cohere_melody for accurate response parsing, while noting community requests for quantization and llama.cpp support.
Cohere North Mini Code 1.0
r/LocalLLaMA top day5 days agoRelease
CohereLabs’ North Mini Code 1.0 appears to have moved from early access to final release, with weights available on Hugging Face. The Reddit post describes it as a 30B A3B coding model. Its Artificial Analysis overall score of 28 trails Qwen 3.6 35B at 43, but its coding index score of 33 is close to Qwen’s 35 and above Gemma 4 26B’s 22.
Introducing North Mini Code: Cohere's First Model For Developers
Hugging Face Blog5 days agoRelease
Cohere officially introduces North Mini Code, the first model in its North lineup explicitly aimed at developers rather than enterprise API customers. The 'Mini' designation signals a compact, cost-efficient model suited for IDE integrations, CLI tools, and real-time code completion. This marks a strategic expansion for Cohere beyond B2B into the broader developer tooling ecosystem.
Omi Med STT v1: Open-Weight Medical ASR Fine-Tuned from Parakeet 0.6B★ 72
r/LocalLLaMA top day5 days agoRelease
Omi Health’s founder says he fine-tuned NVIDIA Parakeet TDT 0.6B v2 for clinical speech and released Omi Med STT v1 under CC-BY-4.0. The runtime supports Mac, Windows, and Linux, auto-selecting MLX, NeMo, or GGUF/parakeet.cpp backends. In the author’s held-out medical benchmark, it reports 2.37% medical-WER and 145× realtime on local A10 compute.
Cohere and Mila Partner to Advance Quebec French Language and Culture in AI
Cohere Blog6 days agoBusiness
Cohere has partnered with Mila, the Quebec AI Institute, to improve the representation of Quebec French (Québécois) and its cultural nuances in AI. The collaboration aims to address the European French bias in current models by leveraging Cohere's multilingual capabilities and Mila's research expertise. This initiative will help deliver more culturally accurate AI solutions for Quebec's public and private sectors.
Cohere Healthcare & Life Sciences: Transforming Clinical Research and Medical Data Retrieval with Enterprise AI
Cohere Blog6 days agoBusiness
Cohere highlights its enterprise AI solutions tailored for the healthcare and life sciences sectors. By utilizing its Command, Embed, and Rerank models, Cohere enables medical institutions and pharmaceutical companies to securely retrieve and analyze complex clinical data. This accelerates drug discovery, streamlines clinical trials, and improves administrative efficiency while ensuring strict regulatory compliance.
Introducing Cohere Transcribe: A New State-of-the-Art in Open-Source Speech Recognition★ 80
Cohere Blog6 days agoRelease
Cohere has announced "Cohere Transcribe," a new state-of-the-art open-source speech recognition model. Designed to deliver highly accurate and efficient speech-to-text capabilities, it represents Cohere's expansion into open-source audio AI. The model aims to challenge existing industry benchmarks like OpenAI's Whisper by offering superior multilingual performance.
Cohere Blog: Technology Tag Page Overview
Cohere Blog6 days agoCommentary
This page aggregates all technology-focused articles on the Cohere blog. As an enterprise-focused AI company, Cohere's technical content primarily covers its Command LLM family, industry-leading Embed and Rerank models, and practical RAG implementation guides. It serves as a key resource for developers and enterprise architects tracking Cohere's technical evolution.
Cohere Highlights Thomas Euyang as Research Visual Storyteller
Cohere Blog6 days agoBusiness
Cohere highlights the role of Thomas Euyang, a Research Visual Storyteller at the company. His work focuses on translating complex machine learning research and LLM concepts into intuitive, engaging visual narratives. This spotlight underscores the growing importance of design and visual communication in making advanced AI research accessible to developers and the public.
Co/plot: Supporting the Research Process Through Visualization
Cohere Blog6 days agoNew Tool
Cohere has announced "Co/plot," a tool dedicated to supporting the research process through advanced visualization. It aims to help researchers and developers better understand complex data structures, model behaviors, and research workflows. This launch highlights Cohere's expanding focus on building practical developer and researcher tools that complement their core LLM and embedding models.
Cohere's Commitment to Open Science and Collaborative AI Research
Cohere Blog6 days agoCommentary
Cohere's Open Science initiative, primarily driven by its non-profit research lab Cohere For AI (C4AI), focuses on democratizing AI research. By releasing open-weights models like Aya and fostering global research collaborations, Cohere aims to bridge the gap in multilingual AI representation. This approach highlights their commitment to community-driven, accessible AI development.
Cohere Product Launch Blog Hub
Cohere Blog6 days agoRelease
This link directs to Cohere's official "Product Launch" blog category. It serves as a centralized hub aggregating all major product announcements, including the Command LLM series, Embed models, Rerankers, and developer platform updates. It is a key resource for tracking Cohere's enterprise AI advancements.
Cohere Official Research Blog and Technical Publications
Cohere Blog6 days agoCommentary
The Cohere Research blog serves as the central hub for the company's academic papers and technical breakthroughs. It covers key areas including advanced Retrieval-Augmented Generation (RAG), multilingual embeddings, and robust tool-use capabilities for enterprise agents. This is a key resource for understanding the foundational technology behind Cohere's models.
LLM Research Papers: The 2026 List (January to May)
Ahead of AI (Raschka)8 days agoPaper
Sebastian Raschka compiles a curated reference list of LLM papers he bookmarked from January through May 2026. The list is not comprehensive, but organized around topics useful for future articles, lectures, code examples, and research work. Public sections emphasize reasoning, RL, efficient inference, long context, agent systems, tool use, coding agents, diffusion language models, and serving infrastructure.
RSI is the new AGI — and it’s just as hard to pin down
TechCrunch AI17 days agoCommentary
TechCrunch reports that recursive self-improvement, or RSI, is becoming a new AI industry fixation, much like AGI. Researchers and startups including Recursive Superintelligence, Auto-Research, AutoScientist, and Disarray are exploring ways for AI systems to automate parts of AI research. But experts caution that AI-assisted research is not the same as fully autonomous self-improvement, especially while models still struggle with long-term self-direction and verification.

Latest in AI

Why MoE Models Benefit More from Speculative Decoding

Production-Ready W4A8: vLLM Integration and Quality Recovery Techniques

The Future of Work Debate Has an Evidence Problem

Cohere Releases North Mini Code: Open-Source Agentic Coding Model

Cohere Launches North Mini Code: A Lightweight Model for Code Tasks

Unsloth releases GGUF version of Cohere North-Mini-Code 1.0 (30B A3B MoE) on Hugging Face

Releasing Cohere North Mini Code

Cohere North Mini Code 1.0

Introducing North Mini Code: Cohere's First Model For Developers

Omi Med STT v1: Open-Weight Medical ASR Fine-Tuned from Parakeet 0.6B★ 72

Cohere and Mila Partner to Advance Quebec French Language and Culture in AI

Cohere Healthcare & Life Sciences: Transforming Clinical Research and Medical Data Retrieval with Enterprise AI

Introducing Cohere Transcribe: A New State-of-the-Art in Open-Source Speech Recognition★ 80

Cohere Blog: Technology Tag Page Overview

Cohere Highlights Thomas Euyang as Research Visual Storyteller

Co/plot: Supporting the Research Process Through Visualization

Cohere's Commitment to Open Science and Collaborative AI Research

Cohere Product Launch Blog Hub

Cohere Official Research Blog and Technical Publications

LLM Research Papers: The 2026 List (January to May)

RSI is the new AGI — and it’s just as hard to pin down