The article appears to test ChatGPT and Doubao on Chinese Gaokao math problems. Since the original text is unavailable, the exact questions, prompts, scores, and winner cannot be verified. It should be treated as a media-style AI capability comparison rather than a rigorous, reproducible benchmark.
Ars Technica reports that an unspecified OpenAI model solved a famous math problem that had stumped humans for roughly 80 years. The article aims to explain the solution more clearly than OpenAI's own account. The provided excerpt does not identify the problem, model, proof steps, validation process, or degree of human involvement, so the scope of the reported breakthrough cannot be assessed from it alone.
### Background and Challenge Large language models (LLMs) frequently encounter "hallucinations" or calculation errors when handling complex mathematical…