Latest in AI

Showing:math-reasoningStudentsClear ×

🔥 Trending today

anthropic6 export-controls4 model-access3 amazon3 national-security2 open-source2 ai-regulation2 government-policy2 enterprise-ai2 compliance2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

ChatGPT vs Doubao on Gaokao Math
量子位 QbitAI6 days agoBenchmark
The article appears to test ChatGPT and Doubao on Chinese Gaokao math problems. Since the original text is unavailable, the exact questions, prompts, scores, and winner cannot be verified. It should be treated as a media-style AI capability comparison rather than a rigorous, reproducible benchmark.
An OpenAI model solved a famous math problem that stumped humans for 80 years
Ars Technica AI13 days agoCommentary
Ars Technica reports that an unspecified OpenAI model solved a famous math problem that had stumped humans for roughly 80 years. The article aims to explain the solution more clearly than OpenAI's own account. The provided excerpt does not identify the problem, model, proof steps, validation process, or degree of human involvement, so the scope of the reported breakthrough cannot be assessed from it alone.
DeepMath：結合 smolagents 打造的輕量級數學推理 Agent★ 75
Hugging Face Blog192 days agoRelease
### Background and Challenge Large language models (LLMs) frequently encounter "hallucinations" or calculation errors when handling complex mathematical…