News

The updated version of DeepSeek-R1 tied for first place with Google’s Gemini-2.5 and Anthropic’s Claude Opus 4 on the WebDev Arena leaderboard, which evaluates large language models (LLMs) on ...
This time, it was the X announcement that Codex, its AI coding agent, is now available for Plus tier users. In other words, you no longer have to spend $200/mo to get Codex's programming help.
Ambience Healthcare on Tuesday announced a new medical coding AI model that outperforms doctors by 27%. The company trained the new model using OpenAI's reinforcement fine-tuning technology.
Ambience announces OpenAI-powered medical coding model that outperforms physicians By Ashley Capoot, CNBC • Published May 27, 2025 • Updated on May 27, 2025 at 1:03 pm ...
Claude Opus 4 is Anthropic's most powerful model to date, and it is the world's best coding model with a 72.5 percent score on SWE-bench and 43.2 percent score on Terminal-bench.
Anthropic says Opus 4 leads industry benchmarks for coding tasks, achieving 72.5 percent on SWE-bench and 43.2 percent on Terminal-bench, calling it "the world's best coding model." ...
Anthropic Releases Claude 4, ‘the World’s Best Coding Model’ The new AI could be a game-changer for entrepreneurs who want to develop complex apps but don’t have a software background.
Anthropic kicked off its first-ever Code with Claude conference today with the announcement of a new frontier AI system. The company is calling Claude Opus 4 the best coding model in the world.
AI startup Mistral has announced a new AI model focused on coding: Devstral. The company claims it's competitive on at least one programming benchmark.