terminal
howtonotcode.com
business

Responses API

Platform

OpenAI o4-mini is a generative pre-trained transformer model created by OpenAI. On April 16, 2025, the o4-mini model was released to all ChatGPT users (including free-tier users) as well as via the Chat Completions API and Responses API. Additionally, OpenAI introduced the o4-mini-high model, which was made available exclusively to paid-tier ChatGPT users. The high model was a slower model which had a higher reasoning effort to produce more accurate responses. Unlike earlier models, o4-mini is c

article 1 story calendar_today First seen: 2026-02-24 update Last seen: 2026-02-24 menu_book Wikipedia

Stories

Showing 1-1 of 1

OpenAI speeds up agent backends with Responses API WebSockets and gpt‑realtime‑1.5

OpenAI shipped a faster path for real-time, tool-calling agents by adding WebSockets to the Responses API and upgrading its voice model to gpt-realtime-1.5. OpenAI reports the new [gpt-realtime-1.5](https://the-decoder.com/openai-ships-api-upgrades-targeting-voice-reliability-and-agent-speed-for-developers/) improves number/letter transcription (~10%), logical audio tasks (~5%), and instruction following (~7%), while the Responses API now supports [WebSockets](https://the-decoder.com/openai-ships-api-upgrades-targeting-voice-reliability-and-agent-speed-for-developers/) so agents stream state and tool calls without resending full context, yielding a claimed 20–40% speedup on complex graphs. For productionization, OpenAI’s docs emphasize hardened patterns—capability encapsulation via [Skills](https://developers.openai.com/api/docs/guides/tools-skills/) and secure prompting/tooling per [Cybersecurity checks](https://developers.openai.com/api/docs/guides/safety-checks/cybersecurity)—while the cookbook on [long‑horizon Codex tasks](https://developers.openai.com/cookbook/examples/codex/long_horizon_tasks/) remains relevant for workflows that still need multi‑hour execution. Ecosystem notes: the Python SDK [v2.24.0](https://github.com/openai/openai-python/releases/tag/v2.24.0) adds a new API “phase” enum; community threads flag rough edges like fine‑tune inconsistencies between Chat vs. Responses with GPT‑4o, transient 401s on vector store creation, and disappearing service‑account keys (linkable via the OpenAI forum).

calendar_today 2026-02-24
openai gpt-realtime-15 responses-api realtime-api openai-python