Rivian

OpenAI's GPT-5.2 Drops with Math Boosts, Disney Ties, and Leaked Image Tech – Runway Gen-4.5 Steals the Video Show

Dec 12, 2025

Christopher Shaun Godwin

Author

Even as the AI news cycle eases into holiday mode, this week delivered a torrent of updates. OpenAI led the charge with GPT-5.2, a Disney megadeal, potential image model leaks, and a new standards push for AI agents. Runway rolled out Gen-4.5, topping video benchmarks, while Rivian teased ambitious autonomy plans.

GPT-5.2: Sharper Math, Bigger Context, Incremental Gains

OpenAI launched ChatGPT-5.2 after a slight delay, addressing complaints that its predecessor, GPT-5.1, was faltering on accuracy. Early benchmarks spotlight improvements in math, science, and coding, with the model claiming top spots internally against GPT-5.1.

Key specs include a 400,000-token context window (about 300,000 words) and 128,000-token output limit. API pricing sits at $1.75 per million input tokens and $14 per million output tokens, aligning with competitors.

On SWE-bench Pro for software engineering, GPT-5.2 hits 55.6% – up from 50.8% on GPT-5.1, edging Claude Opus 4.5 (52%) and surpassing Gemini 3 Pro (43.3%). Science tasks show dominant gains over GPT-5.1, though external comparisons remain sparse. Hallucinations may be tamed, but real-world tests are pending.

Disney Pumps $1B into OpenAI for IP-Powered Sora Magic

In a surprise move, Disney is reportedly investing $1 billion in OpenAI, granting access to its vast IP library. Expect Disney characters in Sora video generations and native image tools. This could enable personalized Disney+ shorts, like AI-crafted Moana clips, blending generative AI with streaming.

Leaked OpenAI Image Models: Celeb Selfies and Code-Rendering Prowess

Rumors swirled around codenamed “Chestnut” and “Hazelnut,” purportedly GPT-5.2 companions tested on arenas like Design Arena. Leaks reveal strong world knowledge (researching prompts), photoreal celeb selfies rivaling top tools, and crisp text/code rendering – from whiteboard slogans to JSON overlays on PlayStation controllers.

Comparisons to current GPT image gen highlight leaps: fewer proportion errors, better teeth/hair, though subtle AI tells linger in eyes and skin. Celebrity group shots look convincingly real at a glance, signaling relaxed safeguards on real faces.

Agentic AI Foundation: Industry Unites for Interoperable Agents

OpenAI, Anthropic, and Block launched the Agentic AI Foundation under the Linux Foundation, backed by Google, Microsoft, Amazon, Bloomberg, and Cloudflare. The goal: standardize AI agents for seamless cross-app operation, safety, and reliability.

As agents handle emails, bookings, and troubleshooting, fragmented builds risk silos. This neutral body ensures plug-and-play compatibility, akin to universal electrical standards, preventing vendor lock-in.

Runway Gen-4.5: Benchmark King with Physics and Prompt Mastery

Runway began deploying Gen-4.5, hailed for “state-of-the-art” motion, physics, and adherence. It leads global text-to-video charts, simulating weight, fluid dynamics, consistent faces, and nuanced emotions – sans audio.

Hands-on tests impressed:

Glass sphere on marble stairs: Realistic bounces, water splashes, refractions – near-perfect prompt match.
Rainy street walker: Umbrella physics, subtle smile, neon backlighting, handheld jitters nailed.
Anime explorer: Stylized but background wonky; consistency holds for foreground.
Barista latte pour: Swirling milk, steam, blurred patrons, authentic smile – macro details shine.
Neon alley chase: Drone spotlight, sparks, reflections solid; minor physics/camera hiccups in 5-second clip.

Prompt fidelity stands out, though rivals like Veo 3.1 edge on realism and sound integration.

Quick Hits: Models, Integrations, and Controversies

Open Models Surge: Mistral’s open-weight Devstral 2 rivals DeepSeek v3.2 for local coding (72.2% benchmarks). Zhipu AI’s GLM-4.6V (tool-calling vision) and Qwen’s Omni Flash upgrade (human-like voices, personality tweaks) compete fiercely.
OpenAI “Ads” Faux Pas: Shopping suggestions mimicked ads; paused for refinement with user controls.
ChatGPT + Adobe: Free Acrobat, Express, Photoshop edits via connectors – early tests show promise but limitations.
Meta Snaps Limitless Pendant: Always-on audio recorder now under Meta, raising privacy flags.
Alibaba’s Qwen Image2LoRA: One-shot LoRAs from images for style/character replication (e.g., Studio Ghibli vibes).

Rivian Accelerates AI Autonomy

At Rivian’s AI & Autonomy Day, highlights included custom silicon (Nvidia-hybrid), phased self-driving (hands-free to unsupervised Level 4 by 2027-28), integrated LiDAR, and a voice assistant syncing calendar/texts/car controls (“Warm the seats, skip passenger”).

Test drives showed reliable city navigation, though interventions needed.

McDonald’s AI Ad Backlash: Fatigue Hits Peak

A fully AI-generated McDonald’s spot – grumpy holiday mishaps – drew ire for “slop” from a deep-pocketed giant. Amid social media AI overload, viewers crave human craft over cheap gen-AI, urging hybrids: real talent augmented sparingly.

This week’s releases underscore AI’s maturation: specialized leaps, ethical guardrails, and ecosystem bridges. Stay tuned – the firehose persists.