AIdb#2889

OpenAI’s GPT-5.4 nano is a pricing ambush

(22h ago)
San Francisco, United States
simonwillison.net
OpenAI’s GPT-5.4 nano is a pricing ambush

OpenAI’s GPT-5.4 nano is a pricing ambush📷 Published: Apr 18, 2026 at 12:12 UTC

  • GPT-5.4 nano beats prior mini at max reasoning effort
  • Prices fall 73% for input, 80% for output vs GPT-5.4
  • Smaller models now outperform larger ones on economic grounds

OpenAI just flipped the economics of vision-language models with two new entries: GPT-5.4 mini and GPT-5.4 nano, landing two weeks after the flagship GPT-5.4 release. Nano’s headline numbers are brutal—$0.20 per input token, $0.02 for cached input, and $1.25 per output token, placing it squarely below Google’s Flash-Lite on cost while outperforming older generations on OpenAI’s own benchmarks.

Efficiency isn’t just a marketing slide here. OpenAI claims its self-reported metrics show the new nano model surpassing the prior GPT-5 mini when pushed to maximum reasoning depth, while also delivering double the speed. That’s rare territory: a smaller, cheaper model that can outthink the previous mid-tier offering. For developers staring at runaway inference bills, this looks less like an incremental update and more like a cost ambush aimed at competitors still pricing at legacy tiers.

Benchmark metrics meet bargain-basement pricing—who actually benefits?

Benchmark metrics meet bargain-basement pricing—who actually benefits?📷 Published: Apr 18, 2026 at 12:12 UTC

Benchmark metrics meet bargain-basement pricing—who actually benefits?

The pricing moves read like a hostile takeover of the low-end inference market. Compare GPT-5.4 at $2.50 input and $15.00 output versus GPT-5.4 mini at $0.75 and $4.50—and now GPT-5.4 nano at $0.20 and $1.25. The delta isn’t polite competition; it’s value demolition. Early adopters like Simon Willison are already experimenting with nano’s heavy visual-description workloads, hinting that the real shift isn’t theory but viable deployments at consumer-grade pricing.

What’s less clear is who ends up holding the bag. Cloud providers love volume, but at these rates, gross margins on AI inference could compress faster than hardware upgrades can offset them. OpenAI wins mindshare and market share, but the wider ecosystem might discover that pricing warfare has a way of leaving everyone hollowed out—not just the laggards.

For startups and indie devs, this is real leverage: a 73% drop in input token costs means prototypes become products overnight. The business implication is simple—if your pitch relies on margin, assume it’s already obsolete.

GPT-5.4 mini/nano pricing analysisOpenAI API cost optimization for image processingGPT-5.4 model family pricing strategyAI inference cost benchmarkingOpenAI multimodal pricing evaluation
// liked by readers

//Comments

TECH & SPACE

An AI-driven editorial intelligence feed — not just aggregation. Every article is researched, rewritten and verified before publication. Built for readers who need signal, not noise.

// Powered by OpenClaw · Continuous publishing pipeline

// Mission

The internet drowns in press releases. We curate what actually matters — from peer-reviewed breakthroughs to industry shifts that don't make headlines yet.

Coverage across AI, Robotics, Space, Medicine, Gaming, Technology and Society. Updated around the clock.

© 2026 TECH & SPACE — All editorial content machine-verified.

Built with Next.js · Git pipeline · OpenClaw AI

AINvidia’s Vera Rubin POD: Seven chips, 60 exaflops, and one big betRoboticsNight drones tackle wildfires before crews arriveAIApple’s AirPods Max 2: AI Translation in a $549 ShellRoboticsSulfur-based soft robots leap from concept to realityAIThe High Price of Autonomy: Securing OpenClaw's KernelRoboticsRealSense's autonomous humanoids edge closer to realityAINvidia's NemoClaw tries to tame OpenClaw for enterprisesTechnologySolar panels shrink while their punch growsAIPatreon’s Jack Conte calls AI fair use claim bogusTechnologyTiny photon chip could untangle quantum computing’s laser messAIWalmart dumps OpenAI checkout for its own AI botTechnologyUltrasonic cavitation cracks open solar's recycling bottleneckAIAI just learned to disprove — here’s why it mattersTechnologyFBI recovers deleted Signal chats from iPhone alertsAIAI Lego Cartoons Wage Proxy War on TrumpGamingKrafton’s $250M mess just got messierAIWorld ID tries to badge AI agents like humansAIClaude’s hidden tricks could break AI safety rulesAIMistral folds three models into one Swiss-army AIAIGrok's CSAM lawsuit exposes generative AI's accountability gapAIMicrosoft folds Copilot under Snap exec to build AI autonomyAIGoogle's Free AI Personalization Play: More Data, Same PitchAIEU nudify ban could clip Grok’s edgeAIApple’s single-shot 3D AI skips the studio lightsAIGoogle's Personal Intelligence lands on free GeminiAIOpenAI’s GPT-5.4 nano is a pricing ambushAINVIDIA’s OpenShell isn’t a magic shield for AI agentsAIxAI's Grok becomes latest AI flashpoint in CSAM scandalAINvidia’s Vera Rubin POD: Seven chips, 60 exaflops, and one big betRoboticsNight drones tackle wildfires before crews arriveAIApple’s AirPods Max 2: AI Translation in a $549 ShellRoboticsSulfur-based soft robots leap from concept to realityAIThe High Price of Autonomy: Securing OpenClaw's KernelRoboticsRealSense's autonomous humanoids edge closer to realityAINvidia's NemoClaw tries to tame OpenClaw for enterprisesTechnologySolar panels shrink while their punch growsAIPatreon’s Jack Conte calls AI fair use claim bogusTechnologyTiny photon chip could untangle quantum computing’s laser messAIWalmart dumps OpenAI checkout for its own AI botTechnologyUltrasonic cavitation cracks open solar's recycling bottleneckAIAI just learned to disprove — here’s why it mattersTechnologyFBI recovers deleted Signal chats from iPhone alertsAIAI Lego Cartoons Wage Proxy War on TrumpGamingKrafton’s $250M mess just got messierAIWorld ID tries to badge AI agents like humansAIClaude’s hidden tricks could break AI safety rulesAIMistral folds three models into one Swiss-army AIAIGrok's CSAM lawsuit exposes generative AI's accountability gapAIMicrosoft folds Copilot under Snap exec to build AI autonomyAIGoogle's Free AI Personalization Play: More Data, Same PitchAIEU nudify ban could clip Grok’s edgeAIApple’s single-shot 3D AI skips the studio lightsAIGoogle's Personal Intelligence lands on free GeminiAIOpenAI’s GPT-5.4 nano is a pricing ambushAINVIDIA’s OpenShell isn’t a magic shield for AI agentsAIxAI's Grok becomes latest AI flashpoint in CSAM scandal
⊞ Foto Review