OpenAI on Thursday launched its reply to Google’s spectacular Gemini 3 Professional mannequin–GPT-5.2—and by the seems of some head-to-head benchmark check scores, it seems like a winner. The brand new mannequin took the best rating on quite a lot of benchmark assessments overlaying coding, math, science, instrument use, and imaginative and prescient. (Benchmarks ought to, after all, be mixed with real-world use to inform the entire story. However nonetheless . . .)
OpenAI says GPT-5.2, which is a reasoning mannequin, achieved expert-level efficiency scores by itself GDPval benchmark, which evaluates efficiency on 44 actual skilled duties together with issues like spreadsheet creation, doc drafting, presentation constructing, and extra.
GPT-5.2 topped Gemini 3 Professional on the SWE-Bench Professional benchmark (software program engineering duties) with a rating of 55.6% (versus Gemini 3 Professional’s 43.3%). It achieved an 86.2% on the ARC-AGI-1 summary reasoning benchmark, in comparison with Gemini 3 Professional’s 75% rating. It scored a 92.4% on the GPQA Diamond benchmark (science questions), in contrast with Gemini 3 Professional’s 91.9% rating.
The brand new mannequin is available in three variants. GPT-5.2 Prompt is nice for searching for data and how-tos, skill-building and research, and profession steering. GPT-5.2 Pondering is nice for more durable skilled duties like spreadsheet formatting and slideshow creation. GPT-5.2 Professional, the corporate says, takes longer to generate solutions however is its “smartest and most reliable” mannequin for producing correct solutions in advanced domains like programming.
For the various builders that are actually growing brokers, OpenAI says GPT-5.2 with reasoning is its strongest providing but, bringing “important enhancements throughout basic intelligence, long-context understanding, agentic tool-calling, and imaginative and prescient.”
OpenAI reportedly pushed to launch GPT-5.2 earlier than the tip of the yr in order that it might counter the discharge of Google’s Gemini 3. The corporate launched GPT-5 in August, heralding it as the subsequent main leap ahead in its AI analysis. GPT-5 was a “system” of fashions, utilizing a “router” to direct the best queries to specialised fashions. It’s referring to GPT-5.2 as a “unified system that mechanically chooses the best way to reply based mostly on process complexity.”
The GPT-5.2 mannequin’s elevated capability for processing and reasoning about multi-modal enter (audio, video, photographs, textual content, and so on.) is important, as a result of Google Gemini 3 does this very properly.
For instance, the brand new mannequin was requested to investigate the options of a picture of a circuit board after which establish and label all of the small parts. OpenAI says GPT-5.2 did this with much more element and accuracy than its earlier GPT-5.1 mannequin might. When reasoning is launched, the mannequin could possibly diagnose issues in mechanical techniques by recognizing the visible indicators.
All three variants of GPT-5.2 can be found in ChatGPT right now, beginning with paid subscribers and accessible to builders by the API. Microsoft, a serious investor in OpenAI, says it’s bringing GPT-5.2 to Microsoft 365 Copilot and Copilot Studio customers worldwide right now.
In associated information, OpenAI additionally introduced that it had struck a licensing deal with Disney that may enable Sora 2 customers to make use of Disney characters in photographs they generate and share utilizing the app. As well as, Disney will make a $1-billion fairness funding in OpenAI, with an choice to buy extra fairness sooner or later.

