terminal
howtonotcode.com
Reddit logo

Reddit

Platform

Reddit is a platform for users to share content and engage in discussions.

article 21 storys calendar_today First seen: 2026-02-03 update Last seen: 2026-02-24 open_in_new Website menu_book Wikipedia

Resources

Links to check for updates: homepage, feed, or git repo.

home Homepage

rss_feed Feed

Stories

Showing 1-20 of 21

Copilot CLI locks down MCP; Skills mature; watch VS Code and licensing gotchas

GitHub Copilot’s latest CLI releases tighten Model Context Protocol access and add workflow polish, while teams see editor and licensing edge cases worth planning for. Copilot CLI v0.0.416 adds enforcement to block third‑party MCP servers when policy disallows them and improves help, streaming counters, terminal status layout, and undo confirmations, while v0.0.415 brought agent model selection, a plan approval menu with curated actions, an env loader, a show_file tool, and quality fixes like UTF‑8 BOM handling and MCP UI polish ([0.0.416](https://github.com/github/copilot-cli/releases/tag/v0.0.416), [0.0.415](https://github.com/github/copilot-cli/releases/tag/v0.0.415), [all releases](https://github.com/github/copilot-cli/releases)). For security‑minded orgs, this pairs with growing scrutiny of what MCP unlocks inside enterprises, from querying internal systems to chaining multi‑step actions—governance and allowlists now matter in practice ([Scalekit’s analysis](https://www.scalekit.com/blog/github-copilot-mcp-enterprise-security-governance)). On the usability front, VS Code Insiders is iterating on a model picker with search, context‑window details, and contextual quick‑pick dialogs, while Copilot in VS Code is adding deeper C++/CMake awareness for richer assistance ([Insiders discussion](https://www.reddit.com/r/GithubCopilot/comments/1rct0g9/new_in_vs_code_insiders_model_picker_and/), [InfoWorld coverage](https://www.infoworld.com/article/4136164/microsoft-brings-c-plus-plus-smarts-to-github-copilot-in-visual-studio-code.html)). Teams should also track known rough edges like Copilot chat sessions not updating without reinstall and license entitlement desync between business and personal seats ([VS Code issue](https://github.com/microsoft/vscode/issues/297226), [GitHub community thread](https://github.com/orgs/community/discussions/187874)). For repeatable DevOps/SRE workflows, “Skills” provide on‑demand, reusable AI runbooks that load progressively and bundle scripts/templates, making it easier to standardize safe automation alongside MCP‑backed tools ([Skills walkthrough](https://dev.to/pwd9000/github-copilot-skills-reusable-ai-workflows-for-devops-and-sres-caf)).

calendar_today 2026-02-24
github-copilot copilot-cli github visual-studio-code microsoft

AI IDEs go agentic: Cursor "demos" and Windsurf Cascade

AI IDEs are shifting from code suggestions to autonomous agents that run, test, and showcase changes, led by Cursor’s new demo-first experience and Windsurf’s Cascade engine. Cursor now emphasizes "demos, not diffs," with agents that can run the software they build and send video evidence of their changes ([YouTube](https://www.youtube.com/watch?v=XbZvC4KTH68&pp=ygURQ3Vyc29yIElERSB1cGRhdGU%3D)). Meanwhile, Windsurf’s agentic Cascade engine promises project-aware, multi-file edits on a familiar VS Code foundation with simple onboarding and settings import ([TechCompanyNews guide](https://www.techcompanynews.com/how-to-use-windsurf-step-by-step-guide-for-beginners/)). The direction is clear: AI IDEs are moving from inline suggestions to autonomous, runnable workflows. Operational maturity remains a concern: users report surprise auto-updates ([automatic updater](https://forum.cursor.com/t/cursor-automatic-updater/152697)), Windows update failures ([Windows updates failing](https://forum.cursor.com/t/updates-on-windows-are-failing-still-antivirus/152819)), and visibility issues before approval in a recent build ([v2.5.20 diffs visibility](https://forum.cursor.com/t/modified-code-changes-not-visible-before-approval-cursor-v2-5-20/152760)), alongside UI changes like replacing "Keep All" with auto-approve ([discussion](https://forum.cursor.com/t/the-loss-of-keep-all-the-addition-of-auto-approve/152780)). Community threads also cite rate limits even on paid plans ([Reddit](https://www.reddit.com/r/cursor/comments/1rdfk9p/what_would_make_you_switch_from_cursor_to_another/)) and a practical auth fix for a Windsurf codex plugin by clearing a local token file ([Reddit fix](https://www.reddit.com/r/codex/comments/1rdddu3/windsurf_codex_plugin_issue/)). Teams are sketching an "AI builder stack" that pairs an agentic IDE with project tracking, instant deploy previews, and AI QA to close the loop from change to validation ([HackerNoon](https://hackernoon.com/the-ai-builder-stack-linear-cursor-vercel-and-qatech?source=rss)). New native entrants like macOS-focused G-Rump hint at a widening field and specialization opportunities ([Swift forums](https://forums.swift.org/t/g-rump-a-native-macos-ai-coding-agent-looking-for-early-feedback/84953)).

calendar_today 2026-02-24
cursor windsurf codeium visual-studio-code linear

Windsurf ships new models, Linux ARM64, and enterprise hooks

Windsurf rolled out new frontier coding models, full Linux ARM64 support, and enterprise-grade Cascade Hooks while community feedback spotlights its transparent crediting versus rivals' opaque limits. Windsurf’s latest updates add Gemini 3.1 Pro, Claude Sonnet 4.6, GLM-5, Minimax M2.5, and GPT-5.3-Codex-Spark with time-limited credit multipliers, plus quality-of-life fixes and features like automatic Plan→Code switching, skills loading from .agents/skills, tracked rules in post_cascade_response, and diff zones auto-closing on commit; importantly, it now provides full Linux ARM64 deb/rpm packages and enterprise cloud config for Cascade Hooks with Devin service key auth, as detailed in the [Windsurf changelog](https://windsurf.com/changelog). A power user’s comparison underscores cost control and predictability: they favored Windsurf’s clear credit model over Cursor/Claude Code’s rate-limit surprises, keeping GitHub Copilot Pro+ for predictable premium requests while continuing to code primarily in Windsurf, per this [Reddit write-up](https://www.reddit.com/r/windsurf/comments/1r9b58e/i_almost_left_windsurf/).

calendar_today 2026-02-20
windsurf gemini-31-pro claude-sonnet-46 glm-5 minimax-m25

Early signals on OpenAI Codex: agent workflows, throughput tips, and hype to filter

OpenAI's Codex is surfacing in community posts as an agent-oriented coding tool for building and running code, with early demos and throughput tips alongside hype about a 'GPT-5.3 Codex'. Builders are sharing hands-on experiences, including a zero-code 2D game built with Codex agent skills and CLI, which hints at agentic patterns and composable skills for programming tasks ([demo thread](https://community.openai.com/t/show-2d-game-built-using-codex-and-agent-skills-zero-code/1374319)). For heavier usage, a discussion on throughput scaling covers considerations for parallelism and high-volume AI builder workloads ([throughput thread](https://community.openai.com/t/codex-throughput-scaling-for-heavy-ai-builder-workloads/1374316)), and another thread explores orchestrating subagents for subtasks to mitigate model fatigue ([subagent thread](https://community.openai.com/t/model-fatigue-how-to-ask-codex-to-run-a-subagent-for-a-subtask/1374247)). Sentiment is mixed: an OpenAI community post voices strong skepticism about LLMs and Codex reliability ([skeptic thread](https://community.openai.com/t/codex-and-llms-in-general-are-a-big-fat-lie/1374390)), while viral chatter on Reddit and X touts a "GPT-5.3 Codex" replacing developers—claims that are unverified and likely overstated ([Reddit](https://www.reddit.com/r/AISEOInsider/comments/1r6c0zq/gpt53_codex_ai_coding_model_just_replaced_half_of/), [X post](https://x.com/elmd_/status/2023473911728611425)).

calendar_today 2026-02-17
openai codex gpt-53-codex agents code-generation

Agentic coding meets reality: benchmarks expose gaps, runtime tracing narrows them

New evidence shows LLMs still struggle with production-grade observability and cross-cutting tasks, but agentic workflows augmented with runtime facts significantly improve reliability and speed. An independent SRE benchmark, [OTelBench](https://www.freep.com/press-release/story/145971/quesma-releases-otelbench-independent-benchmark-reveals-frontier-llms-struggle-with-real-world-sre-tasks/), finds frontier models pass only 29% of OpenTelemetry instrumentation tasks across 11 languages, with context propagation as a key failure mode despite much higher scores on coding-only tests. In contrast, Syncause boosted SWE-bench Verified fixes to 83.4% by adding dynamic tracing “Runtime Facts” to the Live-SWE-agent with Gemini 3 Pro, detailing methods and open-sourcing trajectories and code in their [blog](https://syn-cause.com/blog/swe-bench-verified-83) and [repo](https://github.com/Syncause/syncause-swebench). Complementing this, new research on cross-domain workflow generation proposes a decompose–recompose–decide method that surpasses 20-iteration refinement baselines in a single pass, reducing latency and cost for agentic orchestration ([paper](https://arxiv.org/html/2602.11114v1)). For hands-on adoption, the open-source [DeepCode](https://github.com/HKUDS/DeepCode) project provides multi-agent “Text2Backend” capabilities to prototype structured, telemetry-aware coding agents.

calendar_today 2026-02-12
quesma otelbench opentelemetry google-gemini-3-pro syncause

Copilot CLI stabilizes for long sessions as IDEs move to agentic, team‑scoped AI

GitHub Copilot CLI’s latest update focuses on memory reductions and long‑session stability while IDE workflows and AI agents mature around team‑level customization and modernization tasks. GitHub Copilot CLI v0.0.410 ships broad stability improvements—fixing high memory usage under rapid logging, reducing streaming overhead, improving long‑session compaction, and adding ergonomic shell features like Ctrl+Z suspend/resume, Page Up/Down scrolling, repo‑level validation toggles, and an IDE status indicator when connected ([release notes](https://github.com/github/copilot-cli/releases)). The momentum aligns with a wider agentic shift: The New Stack frames VS Code as a “multi‑agent command center” for developers ([coverage](https://thenewstack.io/vs-code-becomes-multi-agent-command-center-for-developers/)), and Microsoft’s Copilot App Modernization details AI agents that assess, upgrade, containerize, and deploy .NET/Java apps to Azure in days ([deep dive](https://itnext.io/how-microsoft-is-using-ai-agents-to-turn-8-month-app-modernizations-into-days-a-technical-deep-8340a33513e7)). For IDE standardization, JetBrains/Android Studio Copilot customizations support workspace‑scoped settings committed under .github so teams can share constraints and conventions across projects ([guide](https://www.telefonica.com/en/communication-room/blog/github-copilot-android-studio-customization/)); also watch cost dynamics—one report shows OpenCode using far more credits than Copilot CLI for the same prompt, warranting usage instrumentation and policy checks ([user report](https://www.reddit.com/r/GithubCopilot/comments/1r2fhs2/opencode_vs_github_copilot_cli_huge_credit_usage/)).

calendar_today 2026-02-12
github-copilot-cli github visual-studio-code android-studio jetbrains

Claude Code praised for reasoning; Cursor flagged for auto commit co-authoring

Teams evaluating AI coding agents report Claude Code excels at complex reasoning and speed, while some Cursor users are seeing auto-added co-author lines in Git commits that may violate repo policies. A practitioner review contrasts Windsurf, Cursor, and Claude Code, highlighting trade-offs like reasoning, UI, multi-agents, debugging, and price [I spent the last month rotating between Windsurf, Cursor, and Claude Code](https://www.reddit.com/r/ClaudeCode/comments/1qzkwav/i_spent_the_last_month_rotating_between_windsurf/)[^1]. A separate report flags Cursor adding "Co-authored-by: Cursor cursoragent@cursor.com" to commit messages by default [Cursor is signing commit messages now?](https://www.reddit.com/r/cursor/comments/1r05m6l/cursor_is_signing_commit_messages_now/)[^2], and an enterprise take explains why teams keep switching among Copilot, Cursor, and Claude Code [Copilot, Cursor, Claude Code: why enterprises can’t stop switching](https://jpcaparas.medium.com/copilot-cursor-claude-code-why-enterprises-cant-stop-switching-dd4ca0f93262?source=rss-8af100df272------2)[^3]. [^1]: Adds: first-hand comparison of strengths/weaknesses across Claude Code, Cursor, and Windsurf. [^2]: Adds: concrete example of governance/compliance risk from agent-altered commit metadata. [^3]: Adds: enterprise lens on vendor/agent switching dynamics and decision factors.

calendar_today 2026-02-10
cursor claude-code windsurf github-copilot github

Collab-first AI IDEs: Dropstone's Share Chat vs single-player agents

Collaborative AI coding workspaces like Dropstone’s Share Chat are challenging single‑user AI IDEs by letting PMs and engineers co-edit live contexts to push production-grade changes faster while preserving review control. [Dropstone’s Share Chat 3.0.5](https://medium.com/@epicprogrammer/the-23-minute-feature-how-dropstones-share-chat-is-breaking-the-ai-coding-hierarchy-9d6e4f93b303)[^1] contrasts with single-player agents by sharing a live reasoning+code state for real-time review/edits, targeting the “70% wall.” A practitioner comparison highlights day-to-day tradeoffs of [Cursor, Windsurf, and Claude Code](https://www.reddit.com/r/ClaudeCode/comments/1qzkwav/i_spent_the_last_month_rotating_between_windsurf/)[^2] [^1]: Adds: Explains Share Chat 3.0.5, live workspace links, and the “70% wall” with a concrete end-to-end example and workflow details. [^2]: Adds: Hands-on pros/cons and pricing context across Cursor, Windsurf, and Claude Code, including model access and collaboration features.

calendar_today 2026-02-09
dropstone blankline cursor windsurf claude-code

Operationalizing Claude Code: auto-memory, agent teams, and gateway observability

Claude Code’s new auto-memory and emerging multi-agent workflows, plus Vercel AI Gateway routing, help teams standardize AI coding while keeping usage observable and controllable. Auto-memory persists per-project notes in MEMORY.md, can be disabled via an env var, and has minimal official docs; see this [Reddit breakdown](https://www.reddit.com/r/ClaudeCode/comments/1qzmofn/how_claude_code_automemory_works_official_feature/)[^1] and [Anthropic memory docs](https://code.claude.com/docs/en/memory#manage-auto-memory)[^2]. To scale operationally, route traffic through [Vercel AI Gateway](https://vercel.com/docs/ai-gateway/coding-agents/claude-code)[^3], bootstrap standards with the [Ultimate Guide repo](https://github.com/FlorianBruniaux/claude-code-ultimate-guide)[^4] or this [toolkit](https://medium.com/@ashfaqbs/the-claude-code-toolkit-mastering-ai-context-for-production-ready-development-036d702f83d7)[^5], and evaluate multi-agent “Agent Teams” shown here [demo](https://www.youtube.com/watch?v=-1K_ZWDKpU0&pp=ygUSQ2xhdWRlIENvZGUgdXBkYXRl)[^6]. [^1]: Adds: Practical explanation of auto-memory behavior, 200-line limit, MEMORY.md path, and disable flag. [^2]: Adds: Official entry point for managing auto-memory. [^3]: Adds: Step-by-step config to route Claude Code via AI Gateway with observability and Claude Code Max support. [^4]: Adds: Comprehensive templates, CLAUDE.md patterns, hooks, and release-tracking for team standards. [^5]: Adds: Production-ready rules/agents methodology across common backend/data stacks. [^6]: Adds: Visual walkthrough of new multi-agent/Agent Teams workflows.

calendar_today 2026-02-09
claude-code anthropic vercel-ai-gateway claude-code-max agent-teams

Cursor updates spark security alerts, memory leak, and commit co-authoring

Community reports indicate recent Cursor updates introduced security alerts, memory leaks, and unexpected commit metadata behavior, prompting some teams to favor alternatives like Claude Code. Multiple users report a Cursor update triggering Microsoft Defender malware alerts, a <cursor_commands> memory leak, and unwanted 'Co-authored-by: Cursor' lines in Git commits ([Defender alert thread](https://forum.cursor.com/t/microsoft-defender-detects-cursor-update-as-trojan-win32-wacatac-b-ml-and-trojan-script-wacatac/151306)[^1], [memory leak report](https://forum.cursor.com/t/cursor-commands-memory-leak/151286)[^2], [commit co-author complaint](https://www.reddit.com/r/cursor/comments/1r05m6l/cursor_is_signing_commit_messages_now/)[^3]). A comparative review favors Claude Code for daily work and notes Cursor’s strengths but flags stability and pricing concerns ([tool comparison](https://www.reddit.com/r/ClaudeCode/comments/1qzkwav/i_spent_the_last_month_rotating_between_windsurf/)[^4], [Cursor rules regression](https://forum.cursor.com/t/cursor-rules-not-working-anymore/151255)[^5]). [^1]: Adds: community report of Defender flagging a recent Cursor update as Trojan. [^2]: Adds: user-reported memory leak in <cursor_commands> after update. [^3]: Adds: complaint about Cursor auto-adding 'Co-authored-by' lines in Git commits. [^4]: Adds: hands-on comparison praising Claude Code and outlining Cursor pros/cons. [^5]: Adds: report that Cursor Rules stopped working after an update.

calendar_today 2026-02-09
cursor claude-code microsoft-defender github git

Copilot model selection guidance with quota and UI gotchas

Microsoft outlines how to choose Copilot models by task while users report quota friction and a missing Edit mode after recent updates. A Microsoft guide maps everyday, lightweight, deep‑reasoning, and agentic tasks to specific Copilot model types and flags enterprise considerations like premium request multipliers [Choosing the Right Model in GitHub Copilot](https://techcommunity.microsoft.com/blog/azuredevcommunityblog/choosing-the-right-model-in-github-copilot-a-practical-guide-for-developers/4491623)[^1]. Meanwhile, community threads flag a disappearing Copilot Edit mode after the latest chat extension update and pain around non‑rolling premium request quotas (e.g., 300 Pro / 1,500 Pro+) [Github Copilot Edit mode gone after latest update?](https://github.com/microsoft/vscode/issues/293826)[^2] [Copilot premium requests to roll over to the next month](https://github.com/orgs/community/discussions/186654)[^3], with additional confusion from a recent Pro+ subscriber report [Bought Copilot Pro+ 2 hours ago, haven't use anything and ...](https://www.reddit.com/r/GithubCopilot/comments/1r07185/bought_copilot_pro_2_hours_ago_havent_use/)[^4]. [^1]: Adds: Developer-focused model selection guidance and enterprise usage considerations (multipliers) from Microsoft. [^2]: Adds: Report that Copilot Edit mode vanished after updating Copilot Chat Extension 0.37.1 on VS Code 1.109. [^3]: Adds: User feedback on lack of premium request rollover and stated quota numbers (300 Pro / 1,500 Pro+). [^4]: Adds: Anecdotal Pro+ subscription/usage confusion visible in VS Code.

calendar_today 2026-02-09
github-copilot microsoft github visual-studio-code openai

Opus 4.6 Agent Teams vs GPT-5.3 Codex: multi‑agent coding arrives for real SDLC work

Anthropic's Claude Opus 4.6 brings multi-agent "Agent Teams" and a 1M-token context while OpenAI's GPT-5.3-Codex counters with faster, stronger agentic coding, together signaling a step change in AI-assisted development. Opus 4.6 adds team-based parallelization in Claude Code, long‑context retrieval gains, adaptive reasoning/effort controls, and Office sidebars, with pricing unchanged [Data Points](https://www.deeplearning.ai/the-batch/claude-opus-4-6-pushes-the-envelope/)[^1] and launch coverage framing initial benchmark leads at release [AI Collective](https://aicollective.substack.com/p/the-brief-anthropics-opus-46-agent)[^2]. OpenAI’s GPT‑5.3‑Codex posts top results on SWE‑Bench Pro and Terminal‑Bench 2.0 and helped debug its own training pipeline [Data Points](https://www.deeplearning.ai/the-batch/claude-opus-4-6-pushes-the-envelope/)[^3], while practitioners surface Claude Code’s new Auto‑Memory behavior/controls for safer long‑running projects [Reddit](https://www.reddit.com/r/ClaudeCode/comments/1qzmofn/how_claude_code_automemory_works_official_feature/)[^4] and Anthropic leaders say AI now writes nearly all their internal code [India Today](https://www.indiatoday.in/technology/news/story/anthropic-says-ai-writing-nearly-100-percent-code-internally-claude-basically-writes-itself-now-2865644-2026-02-09)[^5]. [^1]: Adds: Opus 4.6 features (1M context), long‑context results, adaptive/effort/compaction API controls, and unchanged pricing. [^2]: Adds: Agent Teams in Claude Code, Office (Excel/PowerPoint) sidebars, 1M context, and benchmark framing at launch. [^3]: Adds: GPT‑5.3‑Codex benchmarks, 25% speedup, availability, and self‑use in OAI’s training/deployment pipeline. [^4]: Adds: Concrete Auto‑Memory details (location, 200‑line cap) and disable flag for policy compliance. [^5]: Adds: Real‑world claim of near‑100% AI‑written internal code at Anthropic, indicating mature SDLC use.

calendar_today 2026-02-09
anthropic openai claude-opus-46 claude-code gpt-53-codex

Pin Claude Code CLI to the stable channel for reliability on Windows

Switching Claude Code CLI to the stable update channel can resolve recent 'latest' channel issues reported on Windows. A user reports that running `claude doctor` to check the channel, switching it to `stable`, and then `claude update` fixed major Windows bugs in the CLI ([Reddit PSA](https://www.reddit.com/r/ClaudeAI/comments/1qxs2jk/psa_claude_code_cli_has_a_stable_update_channel/))[^1]. Pinning your CLI channel also improves reproducibility across dev machines and CI. [^1]: Adds: user-verified workaround with commands and context about 'latest' vs 'stable' channels on Windows.

calendar_today 2026-02-07
claude-code-cli anthropic windows cli version-pinning

User flags degraded Claude Opus 4.6 behavior and higher credit burn in Windsurf vs Claude Code

A Reddit report describes noticeably worse results and more credit burn when using Claude Opus 4.6 through Windsurf compared to running the same model via Claude Code directly. The post details unnecessary back-and-forth, confrontational replies, and 2×–4× credit multipliers in [this thread](https://www.reddit.com/r/windsurf/comments/1qxpcfd/is_anyone_else_getting_really_frustrated_with/)[^1]. [^1]: Adds: First-hand comparison of Windsurf vs Claude Code behavior, including examples and credit multipliers.

calendar_today 2026-02-07
windsurf claude-opus-46 claude-code claude-opus ai-coding-assistants

Mixture-of-Models routing tops single LLMs on SWE-Bench via task specialization

A lightweight Mixture-of-Models router that assigns issues to semantic clusters and routes to the historically strongest model per cluster hit 75.6% on SWE-Bench, edging past single-model baselines (~74%) by exploiting complementary strengths rather than defaulting to the top aggregate model [Reddit summary](https://www.reddit.com/r/LocalLLaMA/comments/1qvm0ft/mixtureofmodels_routing_beats_single_llms_on/)[^1]. The authors share a methodology write-up and an open-source framework so teams can reproduce the gating approach without test-time search or repo execution [methodology blog](https://nordlyslabs.com/blog/hypernova)[^2] and [framework code](https://github.com/Nordlys-Labs/nordlys)[^3]. [^1]: Highlights task-level specialization on SWE-Bench and the routing approach with reported results. [^2]: Details the clustering, per-model success statistics, and routing mechanism. [^3]: Provides the open-source implementation for building a MoM router.

calendar_today 2026-02-04
nordlys nordlys-labs swe-bench mixture-of-models model-routing

Claude Code 2.1.x lands practical speedups and governed multi‑agent workflows

Anthropic pushed a rapid series of Claude Code 2.1 updates (v2.1.26–v2.1.31) that cut RAM on session resume, add page‑level PDF reads, support MCP servers without dynamic registration, enable PR‑based session bootstraps, and ship many reliability fixes [Reddit summary](https://www.reddit.com/r/ClaudeAI/comments/1qvgdc5/claude_code_v21262130_what_changed/)[^1] and [official v2.1.31 notes](https://github.com/anthropics/claude-code/releases/tag/v2.1.31)[^2]. Practitioners also highlight 2.1’s skill hot‑reload, lifecycle hooks, and forked sub‑agents as a foundation for governed, observable multi‑agent workflows—positioning Claude Code as a lightweight "agent OS" for real projects [deep dive](https://medium.com/@richardhightower/build-agent-skills-faster-with-claude-code-2-1-release-6d821d5b8179)[^3]. [^1]: Adds: community changelog for v2.1.26–30 covering performance, MCP, GitHub/PR workflows, and PDF handling. [^2]: Adds: official v2.1.31 fixes (PDF lockups, sandbox FS errors, streaming temperature override, tool routing prompts, provider labels) and hard limits (100 pages, 20MB). [^3]: Adds: perspective on skill hot‑reload, lifecycle hooks, and forked sub‑agents enabling governed multi‑agent patterns.

calendar_today 2026-02-04
claude-code anthropic mcp-model-context-protocol github slack

Windsurf: user-reported slowness vs multi-file edit workflow

Developers report Windsurf sessions stalling for nearly an hour and missing features (e.g., image input on some models), eroding confidence in day-to-day use [Reddit post](https://www.reddit.com/r/windsurf/comments/1qvn8l7/done_with_windsurf_it_used_to_be_my_daily_driver/)[^1]. If you still pilot it, center usage on the documented multi-file edit flow that coordinates atomic cross-file refactors with previews and rollback, noting prerequisites (Windsurf IDE, Cascade AI, .windsurf directory, Git) [SkillKit guide](https://skillkit.io/zh/skills/claude-code/windsurf-multi-file-editing)[^2]. [^1]: Adds: user-reported reliability issues (slowness, hangs, missing image support) and perceived drop in responsiveness from the team. [^2]: Adds: concrete workflow for coordinated multi-file edits with atomic changes, preview, rollback, and required setup.

calendar_today 2026-02-04
windsurf cascade-ai gpt-52-codex-high kimi-k25 skillkit

Claude Code goes multi-agent with Swarm; plugins surge, outage underscores ops readiness

Anthropic has officially made Claude Code a multi-agent orchestrator with Swarm mode, turning one assistant into a team lead that plans and delegates to specialist agents, while also introducing task‑oriented plugins (including a legal plugin) and the no‑code Cowork, signaling a shift from model to workflow owner [What is Swarm](https://www.atcyrus.com/stories/what-is-claude-code-swarm-feature)[^1] and [legal plugin + Cowork](https://legaltechnology.com/2026/02/03/anthropic-unveils-claude-legal-plugin-and-causes-market-meltdown/)[^2]. Early adopters report compressing months of ops work into a weekend—site audits, DNS/AWS cleanups, and mass WordPress updates—using Claude Code automations, but a brief Claude API outage shows the need for fallbacks and resilience [real‑world wins](https://authorautomations.com/p/things-i-did-with-claude-code-this)[^3] and [outage recap](https://www.theverge.com/news/873093/claude-code-down-outage-anthropic)[^4]. For safe adoption, standardize native installs and REPL health checks, and design plugins with explicit context resets, file‑based state, and recovery logic for long‑horizon tasks [install/REPL best practices](https://dev.to/cristiansifuentes/conversational-development-with-claude-code-part-3-installing-trusting-and-operating-the-tool-2ekp)[^5] and [context/state lessons](https://www.reddit.com/r/ClaudeAI/comments/1quuxkj/technical_lessons_while_building_a_trilogy_of/)[^6]. [^1]: Adds: Deep dive on Swarm mode’s orchestration model (team lead, specialist agents, task board, TeammateTool ops). [^2]: Adds: Overview of Anthropic’s new plugins and Cowork; legal plugin capabilities and strategic shift to workflow ownership. [^3]: Adds: Concrete automation outcomes (Ghost audits, Cloudflare DNS cleanup, AWS cost hygiene, WordPress fleet updates) using Claude Code. [^4]: Adds: Report of the Feb 3 outage impacting Claude APIs and Claude Code; duration and impact context. [^5]: Adds: Production-grade install guidance (native installer), REPL health commands (doctor, status, login) for operational trust. [^6]: Adds: Practical patterns for context management, subagents, and file-based state/recovery across sessions.

calendar_today 2026-02-03
anthropic claude-code claude claude-cowork photoprism

OpenAI Codex ships macOS app with parallel agents, Plan mode, and higher limits

OpenAI released a macOS Codex app that runs parallel agent threads for long‑running work with built‑in Git/worktrees, skills, automations, and temporarily higher rate limits across app/CLI/IDE for paid tiers ([Codex changelog](https://developers.openai.com/codex/changelog/)[^1]). The latest release enables Plan mode by default, stabilizes personality config, supports loading skills from .agents/skills, and surfaces runtime metrics for diagnostics ([v0.94.0 release](https://github.com/openai/codex/releases/tag/rust-v0.94.0)[^2]). OpenAI is positioning Codex for autonomous, multi‑threaded, complex tasks vs. Claude Code, citing 1M monthly users and 20x growth since August, while community reports mention a large context window (unconfirmed) ([Sources newsletter](https://sources.news/p/openai-takes-aim-at-anthropics-coding)[^3], [Reddit thread](https://www.reddit.com/r/OpenAI/comments/1qu7hii/openai_just_massdeployed_codex_to_every_surface/)[^4]). [^1]: Official feature overview and rate-limit details. [^2]: Release notes (Plan mode default, skills folder support, personality, metrics). [^3]: Press briefing recap with positioning vs. Claude Code and usage stats. [^4]: Community summary noting "trinity" surfaces and context-size claim (unverified).

calendar_today 2026-02-03
openai codex chatgpt anthropic claude-code

Rumor: Anthropic 'Claude Image' hinted by beta tester

A beta tester post suggests Anthropic may be preparing a release called "Claude Image"; treat this as unconfirmed and monitor for an official announcement via trusted channels like company blogs or press.[Reddit thread](https://www.reddit.com/r/singularity/comments/1quromm/beta_tester_hints_at_new_anthropic_release_claude/)[^1] [^1]: Adds: single-source rumor thread claiming an early beta tester hint; no official confirmation or technical details.

calendar_today 2026-02-03
anthropic claude claude-image reddit sdlc