Coding Reasoning Maths

OpenAI o3-Mini Review & Performance Tested : Coding, Math and Logical Reasoning

Whether it’s automating tedious coding tasks, solving complex logic puzzles, or even weighing in on ethical dilemmas, AI tools like OpenAI’s o3-Mini promise to make our lives easier. But let’s be ...

VentureBeat

Microsoft’s GRIN-MoE AI model takes on coding and math, beating competitors in key benchmarks

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Microsoft has unveiled a groundbreaking artificial intelligence model, ...

Geeky Gadgets

New ChatGPT-o1-mini excels at STEM, especially math and coding

OpenAI has also today released its the ChatGPT-o1-mini AI large language model, designed to be a cost-effective alternative to the o1-preview while maintaining strong performance in reasoning tasks.

NextBigFuture

OpenAI Releases O3 Model With High Performance and High Cost

OpenaI o3 sets new records in several key areas, particularly in reasoning, coding and mathematical problem-solving. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task in ...

TechSpot

OpenAI's newest o3 and o4-mini models excel at coding and math – but hallucinate more often

A hot potato: OpenAI's latest artificial intelligence models, o3 and o4-mini, have set new benchmarks in coding, math, and multimodal reasoning. Yet, despite these advancements, the models are drawing ...

Morning Overview on MSN

An open-source AI model from China just matched OpenAI’s best at a third of the cost — forcing the world’s biggest labs to slash their prices

In January 2025, a Hangzhou-based AI lab called DeepSeek dropped a reasoning model that, by its own benchmarks, went ...

TMCnet

Hide inaccessible results

OpenAI o3-Mini Review & Performance Tested : Coding, Math and Logical Reasoning

Microsoft’s GRIN-MoE AI model takes on coding and math, beating competitors in key benchmarks

New ChatGPT-o1-mini excels at STEM, especially math and coding

OpenAI Releases O3 Model With High Performance and High Cost

OpenAI's newest o3 and o4-mini models excel at coding and math – but hallucinate more often

An open-source AI model from China just matched OpenAI’s best at a third of the cost — forcing the world’s biggest labs to slash their prices

Logical Intelligence Tops Leading AI Verification Benchmarks as Verified Code Generation Nears Reality with Aleph

Google revamps Gemini 2.5 Pro again, claiming superiority in coding and math

DeepSeek says its R1 update rivals ChatGPT o3 and Gemini 2.5 Pro in performing math, coding and logic

Imandra’s new AI coding assistant CodeLogician uses ‘reasoning’ to guarantee the accuracy of its code