Latest in AI

Showing:agentic-workflowsResearchersClear ×

🔥 Trending today

anthropic6 export-controls4 model-access3 amazon3 national-security2 open-source2 ai-regulation2 government-policy2 enterprise-ai2 compliance2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

DeepSeek v4 Coding Scores Clash With Broader Frontier Benchmarks
r/LocalLLaMA top day3 days agoCommentary
A Reddit post questions why DeepSeek v4 can rank near the top of coding leaderboards while CAISI reportedly places it about eight months behind the US frontier. The author argues that both views may be compatible because coding benchmarks measure a narrow, heavily optimized slice of capability. For local users, the bigger question is how quantized DeepSeek v4 variants perform in real agent workflows, tool calls, cybersecurity, and abstract reasoning.
Gemini 3.5 發布：具備行動力的前沿智能，主打複雜代理型工作流 (Agentic Workflows)★ 85
Google DeepMind Blog29 days agoRelease
Google DeepMind has announced the launch of its next-generation AI model, Gemini 3.5, positioned as "frontier intelligence with action." This announcement…