Is GPT-5.5 available on Smart AIPI?

Yes. GPT-5.5 is live on Smart AIPI right now, accessible via model ID 'gpt-5.5' in any API call. It was added the same day OpenAI released it.

How much does GPT-5.5 cost on Smart AIPI?

Smart AIPI is the cheapest way to use GPT-5.5 on the market. We charge 75% less than OpenAI's direct API — $1.25 per 1M input tokens and $7.50 per 1M output tokens, versus OpenAI's $5 and $30.

What makes GPT-5.5 better than GPT-5.4?

GPT-5.5 posts higher scores on every major benchmark — 82.7% on Terminal-Bench 2.0 (vs 75.1%), 84.9% on GDPval, 78.7% on OSWorld-Verified, 84.4% on BrowseComp, 51.7% on FrontierMath Tier 1–3, and 95.0% on ARC-AGI-1. It also uses fewer tokens to complete the same Codex tasks and matches GPT-5.4 per-token latency in real-world serving.

Is GPT-5.5 cheaper on OpenAI direct or Smart AIPI?

Smart AIPI. OpenAI direct charges $5/1M input + $30/1M output. Smart AIPI charges $1.25/1M input + $7.50/1M output — a 75% reduction for the exact same model.

Can I use GPT-5.5 with Codex CLI, OpenCode, Cursor, Cline, or Roo Code?

Yes. Set your base URL to https://api.smartaipi.com/v1 and use model 'gpt-5.5' in any OpenAI-compatible tool. No code changes required.

Does GPT-5.5 support reasoning effort levels?

Yes. GPT-5.5 supports low, medium, and high reasoning effort plus reasoning summaries, same as GPT-5.4. For agentic coding and long-horizon tasks, high effort is recommended.

What is the context window for GPT-5.5?

GPT-5.5 supports up to a 400K context window in Codex mode and a 1M context window in the API. Smart AIPI forwards the full context to the upstream Codex backend.

Do I need to change anything to use GPT-5.5 instead of GPT-5.4?

No. Swap the model ID from 'gpt-5.4' to 'gpt-5.5' in your existing request. Same endpoint, same API key, same parameter shape.

Is GPT-5.5 better than Claude Opus 4.7 or Gemini 3.1 Pro?

Yes on most benchmarks, but not all. GPT-5.5 outperforms Claude Opus 4.7 on Terminal-Bench 2.0 (82.7% vs 69.4%), GDPval (84.9% vs 80.3%), BrowseComp (84.4% vs 79.3%), and FrontierMath Tier 1–3 (51.7% vs 43.8%). Claude Opus 4.7 leads on SWE-Bench Pro (64.3% vs 58.6%), and Gemini 3.1 Pro edges GPT-5.5 on BrowseComp (85.9% vs 84.4%) and ARC-AGI-1 (98.0% vs 95.0%).

When did GPT-5.5 launch?

OpenAI released GPT-5.5 on 2026-04-23. Smart AIPI added it to the pool the same day, with no downtime or config changes required by users.

GPT-5.5 Is Here: OpenAI's Smartest Model, Available Now on Smart AIPI

TL;DR: GPT-5.5 is live on Smart AIPI today — the same day OpenAI released it. Use model gpt-5.5 in any API call. It beats GPT-5.4 on every major benchmark — coding, reasoning, agentic computer use, math, and scientific research — and Smart AIPI is the cheapest way to use it: 75% off OpenAI direct pricing.

OpenAI just released GPT-5.5, their smartest and most intuitive model yet. We added it to Smart AIPI the same day. Use model ID gpt-5.5 in any chat completion or responses API call and you're using it — no config changes, no redeployment, same API key.

Below: the benchmark numbers that matter, how it compares to GPT-5.4, Claude Opus 4.7, and Gemini 3.1 Pro, and why Smart AIPI is the cheapest place to use it.

The Benchmarks: GPT-5.5 Sets New State of the Art

GPT-5.5 improves on GPT-5.4 across every major benchmark. The gains are especially strong in agentic coding, computer use, knowledge work, and early scientific research — areas where progress depends on reasoning across context and taking action over time.

Benchmark (what it measures)	GPT-5.5	GPT-5.4	Claude Opus 4.7	Gemini 3.1 Pro
Terminal-Bench 2.0 (command-line workflows)	82.7%	75.1%	69.4%	68.5%
SWE-Bench Pro (real-world GitHub issues)	58.6%	57.7%	64.3%	54.2%
Expert-SWE (long-horizon coding tasks)	73.1%	68.5%	—	—
GDPval (knowledge work, wins or ties)	84.9%	83.0%	80.3%	67.3%
OSWorld-Verified (computer use)	78.7%	75.0%	78.0%	—
BrowseComp (agentic browsing)	84.4%	82.7%	79.3%	85.9%
Toolathlon (tool use)	55.6%	54.6%	—	48.8%
FrontierMath Tier 1–3 (advanced mathematics)	51.7%	47.6%	43.8%	36.9%
FrontierMath Tier 4 (hardest math problems)	35.4%	27.1%	22.9%	16.7%
CyberGym (cybersecurity)	81.8%	79.0%	73.1%	—
ARC-AGI-1 (abstract reasoning)	95.0%	93.7%	93.5%	98.0%
ARC-AGI-2 (harder abstract reasoning)	85.0%	73.3%	75.8%	77.1%
BixBench (bioinformatics research)	80.5%	74.0%	—	—

The standout numbers: 82.7% on Terminal-Bench 2.0 (a 7.6-point jump over GPT-5.4 and 13+ points ahead of Claude Opus 4.7), 84.9% on GDPval knowledge work, 85.0% on ARC-AGI-2 (up from 73.3%), and 35.4% on FrontierMath Tier 4, the hardest math tier in the table, up from 27.1% and about 1.5x Claude Opus 4.7's 22.9%.
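If you want to check gaps like these yourself, a few lines of Python are enough. This is only a sketch: the scores are copied from the table above, while the dictionary keys and the helper names point_gap and score_ratio are made up for illustration.

```python
# Scores (percent) copied from the benchmark table above.
# The model keys are illustrative labels, not API model IDs.
SCORES = {
    "Terminal-Bench 2.0": {"gpt-5.5": 82.7, "gpt-5.4": 75.1, "claude-opus-4.7": 69.4},
    "ARC-AGI-2": {"gpt-5.5": 85.0, "gpt-5.4": 73.3, "claude-opus-4.7": 75.8},
    "FrontierMath Tier 4": {"gpt-5.5": 35.4, "gpt-5.4": 27.1, "claude-opus-4.7": 22.9},
}


def point_gap(benchmark: str, model: str, baseline: str) -> float:
    """Percentage-point gap between two models, rounded to one decimal."""
    row = SCORES[benchmark]
    return round(row[model] - row[baseline], 1)


def score_ratio(benchmark: str, model: str, baseline: str) -> float:
    """How many times the baseline score the model reaches, rounded to two decimals."""
    row = SCORES[benchmark]
    return round(row[model] / row[baseline], 2)


if __name__ == "__main__":
    for name in SCORES:
        print(name, point_gap(name, "gpt-5.5", "gpt-5.4"), "points over GPT-5.4")
    print("Tier 4 ratio vs Claude Opus 4.7:",
          score_ratio("FrontierMath Tier 4", "gpt-5.5", "claude-opus-4.7"))
```

Point gaps and ratios answer different questions, so it helps to look at both before quoting a headline number.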

What Actually Changed

Three things developers will notice right away:

Stronger agentic coding. GPT-5.5 carries more of the work itself. It holds context across large systems, reasons through ambiguous failures, checks assumptions with tools, and carries changes through the surrounding codebase. An NVIDIA engineer with early access said "losing access to GPT-5.5 feels like I've had a limb amputated." On Expert-SWE (long-horizon tasks with a median 20-hour expert completion time), GPT-5.5 hits 73.1% vs GPT-5.4's 68.5%.
Fewer tokens for the same work. GPT-5.5 is more token-efficient than GPT-5.4 on Codex tasks: it reaches 82.7% on Terminal-Bench 2.0 and 73.1% on Expert-SWE while using fewer tokens per task. That means the raw per-token price is only half the story, because your effective cost per finished task is lower still.
Same latency. Larger, more capable models are typically slower to serve, but GPT-5.5 matches GPT-5.4 per-token latency in real-world serving. You get a smarter model without paying for it in wall-clock time.

Beyond raw benchmarks, senior engineers testing the model have reported a step change: GPT-5.5 catches issues in advance, predicts testing and review needs without explicit prompting, and resolves complex merges in a single shot. One CEO described it as "the first coding model I've used that has serious conceptual clarity."

Smart AIPI Is the Cheapest Way to Use GPT-5.5

OpenAI's direct API pricing for GPT-5.5 is $5 per 1M input tokens and $30 per 1M output tokens. That's a 2x jump over GPT-5.4's direct pricing, reflecting the capability leap.

Through Smart AIPI, you pay:

Pricing (per 1M tokens)	OpenAI direct	Smart AIPI	You save
Input tokens	$5.00	$1.25	75%
Cached input	$0.50	$0.125	75%
Output tokens	$30.00	$7.50	75%

A typical Codex-style agentic request with 50K uncached input tokens, 20K cached input tokens, and 8K output tokens costs about $0.125 through Smart AIPI versus $0.50 direct from OpenAI at the rates above. Across a day of 500 such requests, that's $62.50 vs $250, a $187.50 daily difference for the exact same model running on the exact same OpenAI infrastructure.
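To run the same math on your own traffic, here is a small sketch that applies the per-1M rates from the pricing table to a single request. It assumes cached tokens are billed at the cached-input rate on top of the uncached input tokens; the function name request_cost and the provider keys are hypothetical.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing table above.
RATES = {
    "openai_direct": {
        "input": Decimal("5.00"),
        "cached_input": Decimal("0.50"),
        "output": Decimal("30.00"),
    },
    "smart_aipi": {
        "input": Decimal("1.25"),
        "cached_input": Decimal("0.125"),
        "output": Decimal("7.50"),
    },
}

MILLION = Decimal(1_000_000)


def request_cost(provider: str, uncached_input: int, cached_input: int, output: int) -> Decimal:
    """Cost in USD of one request, given token counts for each billing bucket."""
    rates = RATES[provider]
    total = (
        uncached_input * rates["input"]
        + cached_input * rates["cached_input"]
        + output * rates["output"]
    )
    return total / MILLION


if __name__ == "__main__":
    # Assumed workload: 50K uncached input, 20K cached input, 8K output tokens.
    for provider in RATES:
        per_request = request_cost(provider, 50_000, 20_000, 8_000)
        print(provider, "per request:", per_request, "per 500 requests:", per_request * 500)
```

Decimal is used instead of float so that small per-request amounts add up without rounding drift when multiplied across a day of traffic.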

No other provider offers GPT-5.5 at this price. OpenRouter passes through OpenAI direct pricing plus a margin. Google Vertex and Azure OpenAI charge the full direct rate. Smart AIPI aggregates ChatGPT Business subscription capacity and passes the savings to you.

How to Use GPT-5.5 on Smart AIPI

It's already live. Just set your model parameter:

Chat Completions API

curl https://api.smartaipi.com/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.5",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Responses API

curl https://api.smartaipi.com/v1/responses \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.5",
    "input": "Refactor this function for clarity and add tests"
  }'

With High Reasoning Effort

curl https://api.smartaipi.com/v1/responses \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.5",
    "input": "Design a caching strategy for a high-read API...",
    "reasoning": {"effort": "high", "summary": "auto"}
  }'
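If you'd rather call the endpoint from Python than curl, the sketch below builds the same Responses request using only the standard library. The helper name build_responses_request is hypothetical, and the check on the effort value simply mirrors the low, medium, and high levels mentioned in this post; the request is built but not sent, so you can inspect it first.

```python
import json
import urllib.request

BASE_URL = "https://api.smartaipi.com/v1"
VALID_EFFORTS = ("low", "medium", "high")


def build_responses_request(api_key: str, prompt: str,
                            effort: str = "high",
                            model: str = "gpt-5.5") -> urllib.request.Request:
    """Build (but do not send) a POST to /responses with a reasoning effort level."""
    if effort not in VALID_EFFORTS:
        raise ValueError(f"effort must be one of {VALID_EFFORTS}, got {effort!r}")
    payload = {
        "model": model,
        "input": prompt,
        "reasoning": {"effort": effort, "summary": "auto"},
    }
    return urllib.request.Request(
        f"{BASE_URL}/responses",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_responses_request("YOUR_API_KEY", "Design a caching strategy for a high-read API...")
    print(req.full_url)
    print(req.data.decode("utf-8"))
    # To actually send it (needs a real key and network access):
    # with urllib.request.urlopen(req) as resp:
    #     print(json.load(resp))
```

Most teams will use an OpenAI-compatible SDK instead; the point here is only that the payload shape matches the curl example above.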

Codex CLI

Update your ~/.codex/config.toml:

model = "gpt-5.5"

Cursor, Cline, Roo Code, OpenCode, Aider

Point the tool at Smart AIPI's base URL and set the model to gpt-5.5:

OPENAI_BASE_URL=https://api.smartaipi.com/v1
OPENAI_API_KEY=sk-your-smart-aipi-key
OPENAI_MODEL=gpt-5.5
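For your own scripts, one way to reuse those same three variables is to read them at startup and fall back to the Smart AIPI defaults. This is a sketch, not something any of the tools above ship: load_client_settings is a hypothetical helper, and whether a given tool reads each of these variables itself depends on the tool.

```python
import os

DEFAULT_BASE_URL = "https://api.smartaipi.com/v1"
DEFAULT_MODEL = "gpt-5.5"


def load_client_settings(env=None) -> dict:
    """Read base URL, API key, and model from environment-style variables."""
    env = os.environ if env is None else env
    api_key = env.get("OPENAI_API_KEY")
    if not api_key:
        raise RuntimeError("OPENAI_API_KEY is not set")
    return {
        # Strip a trailing slash so paths like /responses can be appended safely.
        "base_url": env.get("OPENAI_BASE_URL", DEFAULT_BASE_URL).rstrip("/"),
        "api_key": api_key,
        "model": env.get("OPENAI_MODEL", DEFAULT_MODEL),
    }


if __name__ == "__main__":
    # Pass a plain dict here to try it without touching your real environment.
    settings = load_client_settings({"OPENAI_API_KEY": "sk-your-smart-aipi-key"})
    print(settings["base_url"], settings["model"])
```

Failing fast on a missing key tends to be easier to debug than a 401 from the first request.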

Context Window and Advanced Features

Context window: 400K tokens in Codex mode, 1M tokens via the Responses API.
Reasoning effort: low, medium, high. For agentic and long-horizon work, high effort is recommended — it's where GPT-5.5 pulls furthest ahead of GPT-5.4.
Streaming: supported on both chat completions and responses APIs. Use "stream": true.
Prompt caching: Smart AIPI preserves prompt cache hits upstream. Typical sustained workloads hit 85–95% cache rates on gpt-5.5, so your effective input cost is closer to $0.125 per 1M than $1.25.
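To see what a cache hit rate does to your input bill, you can blend the two Smart AIPI input rates quoted in this post. The sketch below assumes the hit rate is the fraction of input tokens served from cache; blended_input_rate is a hypothetical helper name.

```python
def blended_input_rate(cache_hit_rate: float,
                       uncached_rate: float = 1.25,
                       cached_rate: float = 0.125) -> float:
    """Effective USD per 1M input tokens for a given cache hit fraction (0.0 to 1.0)."""
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0.0 and 1.0")
    return cache_hit_rate * cached_rate + (1.0 - cache_hit_rate) * uncached_rate


if __name__ == "__main__":
    for hit_rate in (0.85, 0.90, 0.95):
        print(f"{hit_rate:.0%} cache hits: ${blended_input_rate(hit_rate):.4f} per 1M input tokens")
```

Under that assumption, an 85 to 95% hit rate works out to roughly $0.18 to $0.29 per 1M input tokens, which is indeed much closer to the cached rate than to the uncached one.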

What About GPT-5.5 Mini, Nano, and Pro?

OpenAI hasn't released gpt-5.5-mini, gpt-5.5-nano, or gpt-5.5-pro on the Codex backend yet. When they go live we'll add them the same day and publish a follow-up. For now, only the base gpt-5.5 model is available.

If you're using one of the older gpt-5-nano or gpt-5.4-nano aliases in Roo Code, Cline, or similar agent frameworks, Smart AIPI transparently routes those to gpt-5.4-mini for reliability — no action needed on your end.

Get Started

Free credits included. Every new account gets $5 in free credits. Sign up at smartaipi.com/signup, create an API key, and start using GPT-5.5 immediately. No credit card required.

Sign up at smartaipi.com/signup (free credits, no credit card)
Create an API key in the dashboard
Set your base URL to https://api.smartaipi.com/v1
Use model gpt-5.5

Same API, same OpenAI-compatible shape, same tools — now with the smartest model OpenAI has ever shipped, at 75% off.