ANTHROPIC
30 days · UTC
Synchronizing with global intelligence nodes...
Claude Opus 4.7 ships: big coding gains, higher-res vision, and a tokenizer change that hits your bill
Anthropic released Claude Opus 4.7, a GA model with major coding, vision, and instruction-following gains plus a tokenizer change that affects costs. ...
Anthropic launches Claude Design: chat-to-canvas prototypes with code handoff
Anthropic launched Claude Design, a chat-plus-canvas workspace that turns prompts and your brand system into shareable prototypes, decks, and handoff-...
Agentic coding moves from hype to ops: evals, observability, and resilience land across the stack
A cluster of releases and guides tightens the nuts and bolts of running coding agents in production. Promptfoo’s guide breaks down why agent evals di...
Multi-model AI solidifies around OpenAI-compatible gateways as Mozilla debuts a sovereign client
Teams are coalescing around OpenAI-compatible APIs and multi-model gateways, with a fresh push toward self-hosted, sovereign AI clients. A DEV piece ...
OpenAI turns Codex into a multi‑agent superapp with background computer control
OpenAI expanded Codex from a coding helper into a multi‑agent, do‑the‑work app with background computer control, a built‑in browser, memory, and autom...
Claude Code ships native CLI, tighter sandboxing, and a desktop redesign for parallel agent work
Anthropic pushed rapid Claude Code updates and a desktop redesign that tighten security, speed up reviews, and make multi-session agent work practical...
Anthropic decouples agent internals with Managed Agents, while MCP and measured skills shape production patterns
Anthropic introduced a decoupled Managed Agents service that stabilizes agent interfaces while letting harnesses and sandboxes evolve. Anthropic’s ne...
Anthropic ships Claude Opus 4.7: steadier coding, higher‑res vision, stricter prompts
Anthropic released Claude Opus 4.7 with steadier long-run coding, higher‑resolution vision, and stricter instruction following. Opus 4.7 tightens exe...
Claude’s “computer use” makes desktop UI a first-class automation surface
Anthropic’s Claude now runs real desktop workflows by seeing your screen and controlling your mouse and keyboard. According to [WebProNews](https://w...
Anthropic’s Managed Agents: stable interfaces for long-horizon AI work
Anthropic details how Claude Managed Agents split agent brain and hands behind stable session, harness, and sandbox interfaces. In this engineering d...
Claude Code desktop rework ships with cloud-hosted Routines; v2.1.110 adds tracing hooks and sturdier ops
Anthropic rebuilt Claude Code around parallel orchestration and previewed cloud-hosted Routines that run on a schedule or via API triggers. The redes...
GitHub tightens Copilot Pro access; Copilot CLI ships clarity, /ask, and security fixes
GitHub paused new Copilot Pro trials and tightened usage limits while shipping Copilot CLI updates that improve clarity, ergonomics, and security. Gi...
AI agents just got real: autonomy is near, but ops and unit economics will decide who wins
AI agents are moving from flashy demos to production, and the bottlenecks are reliability, orchestration, and unit economics. The big labs are steeri...
Build dependable document QA: production RAG patterns, the right long‑context model, and safer behavior shaping
If you’re shipping document QA, combine a solid RAG spine with model choice tuned for structure and tactics that stabilize behavior. A deep, opiniona...
Anthropic’s Managed Agents land: decouple your agent stack, fix your harness, and stop burning retries
Anthropic introduced Managed Agents, a decoupled service for long-horizon agent work, highlighting why harness design and memory hygiene now matter mo...
Anthropic launches Project Glasswing, using unreleased Claude Mythos to harden critical software with industry partners
Anthropic unveiled Project Glasswing, a defense-focused program using its unreleased Claude Mythos model to find and fix critical software vulnerabili...
SWE-bench scores are spiking, but variant mix-ups make the leaderboard noisy for real-world tool choices
Vendors are touting big SWE-bench jumps, but versions differ and scores alone won’t pick your coding copilot. SWE-bench measures fail-to-pass bug fix...
Anthropic launches Claude Managed Agents: stable interfaces for long‑running AI work
Anthropic introduced Claude Managed Agents, a hosted service that decouples an agent’s reasoning, control loop, and execution into stable, swappable i...
OpenAI drops ChatGPT Pro to $100 and leans into Codex for power users
OpenAI repositioned ChatGPT Pro at $100 per month with bigger Codex allocations, turning up the heat on Anthropic for developer wallets. According to...
Anthropic launches Project Glasswing, giving controlled access to Claude Mythos for vulnerability discovery
Anthropic formed Project Glasswing and is withholding its Claude Mythos Preview model for controlled, defensive use after it found thousands of high‑s...
AI security pivots to defense: restricted LLMs, risky code assistants, and practical guardrails
Vendors are shifting from open access to locked-down, defense-first AI as code assistants prove easy to abuse. A report says OpenAI is prepping a res...
Anthropic previews Claude Mythos and launches Project Glasswing to weaponize defense against zero‑days
Anthropic previewed Claude Mythos and launched Project Glasswing, claiming the model can autonomously find high‑severity bugs across major OSes and br...
Anthropic launches Claude Managed Agents: production-grade agent orchestration as a service
Anthropic launched Claude Managed Agents, a hosted stack that runs long-lived, tool-using AI agents with sandboxing, tracing, and scoped permissions.