
Issue #26 · Weekly Digest
Weekly AI Dev News Digest: June 20 - 26, 2026
Coding tools don't trust their own agents. Vercel, Anthropic, and Zed all wrapped agent execution in sandboxes, approval gates, and credential walls within days of each other, after a public disclosure proved a single fake error report could make an agent run an attacker's code.
On June 25 Vercel shipped AI SDK 7 with a default that says where the industry's head is at: a system message tucked inside a prompt now gets rejected unless it's explicitly allowed, because that is how prompt injection sneaks in. Two days earlier Anthropic added a setting to Claude Code that stops sandboxed commands from reading credential files and secret environment variables. Zed shipped an agent terminal sandbox days later. None of these is a headline feature. They are the plumbing that goes in after someone proves the old way was dangerous, and someone did.
OpenAI previewed GPT-5.6, its next frontier model, to a government-approved shortlist of about 20 partners, and pointed its security models at patching open source, the half of the work that lags behind finding the bugs. npm locked down its most-downloaded accounts. GitHub put Claude behind its JetBrains agent and opened the Copilot app to local models. A new lab shipped an open-weights coding model that trains itself with guardrails against gaming its own reward. Across all of it, more of the stack now ships locked down by default.
85.6%
GPT-5.5-Cyber CyberGym score
20
partners on the GPT-5.6 preview
30+
open-source projects in Patch the Planet
72-hour
npm high-impact account lock
30M
weekly AI SDK installs
397B
Ornith-1.0 flagship parameters
In Focus
Coding Agents Get Sandboxes and Approval Gates
The most-installed AI framework in JavaScript rebuilt itself around running agents in production, a step up from the prototyping its earlier versions were built for. AI SDK 7, out June 25, adds a durable WorkflowAgent that survives process restarts and delayed approvals, first-class tool approvals, timeout budgets, redesigned telemetry, and a sandbox abstraction for executing model-written commands. It also plugs other harnesses, Claude Code, Codex, and OpenCode, in behind one API. The new default is the real signal: a system-role message buried in a prompt or message list gets rejected unless it's explicitly allowed, because that is a classic injection vector. Vercel put a safer default in front of 30 million weekly installs. (Vercel)
Anthropic hardened Claude Code's own attack surface too. Version 2.1.187, shipped June 23, added a sandbox.credentials setting that blocks sandboxed commands from reading credential files and secret environment variables, plus organization-configured model restrictions across the model picker and the ANTHROPIC_MODEL variable. (Claude Code) Zed's 1.9 preview, out June 24, pushed the same idea harder: an agent terminal sandbox toggle, a settings page for persistent grants that scopes allowed domains and writable paths, and terminal sandboxing for agent commands on Windows. (Zed)
All of this answers a problem with a name. Agentjacking, disclosed by Tenet Security earlier in June, lets an attacker POST a fake error to a public Sentry endpoint; the coding agent reads the "fix" through MCP and runs it with the developer's own privileges. In testing it worked about 85 percent of the time and touched 2,388 organizations. Tool output and error reports now have to be treated as untrusted input. (Agentjacking)
In Focus
OpenAI Previews GPT-5.6, Its Next Frontier Model
OpenAI opened a limited preview of GPT-5.6 on June 26, its first new frontier model since GPT-5.5, split into three named tiers that advance on their own schedules. Sol is the flagship, with agentic gains in coding, cybersecurity, and biology, a new "max" reasoning mode, and an "ultra" mode that coordinates sub-agents on a single task. Terra matches GPT-5.5 at roughly half the cost, and Luna is the cheapest and fastest. Sol is priced at $5 input and $30 output per million tokens, the same as GPT-5.5 and about half of Anthropic's Fable 5. For developers the access terms matter as much as the model: the preview runs through the API and Codex to around 20 trusted partners, with no ChatGPT and general availability only promised in the coming weeks. (OpenAI)
In Focus
AI Security Tools Shift From Finding Bugs to Patching Them
OpenAI expanded its Daybreak program on June 22 around a claim that flips the usual security story: AI now finds vulnerabilities faster than people can fix them, so the bottleneck has moved to patching. The release pairs the full GPT-5.5-Cyber, which posts a state-of-the-art 85.6 percent on CyberGym and stays gated to vetted defenders, with Patch the Planet, run with Trail of Bits to carry open-source projects from finding to fix. More than 30 projects signed on, including cURL, Go, Python, and Sigstore, and a five-day sprint produced hundreds of findings and dozens of merged patches. The Codex Security plugin has scanned over 30 million commits since its March preview. (OpenAI)
The timing is pointed. OpenAI is pressing a cyber-defense edge while Anthropic's most capable security models sit behind an export ban, and it is doing it by building Daybreak into the maintainer workflows developers already use, the same GitHub and CI surfaces where patches get reviewed.
npm reinforced the supply chain from the other direction. On June 25 it began putting its highest-impact accounts, the maintainers behind the most widely used packages, into a 72-hour read-only state whenever someone changes the account email or burns a 2FA recovery code. During the lock, installs and downloads keep working, but publishing and token minting pause. That closes the exact sequence the Shai-Hulud worm used: take over an account, change the email, mint a token, push malware. (npm)
In Focus
Coding Agents Move Into Team Workflows
Anthropic's other launch puts the agent in the team's workspace. Claude Tag, released June 23, drops Claude into Slack as a persistent, shared teammate. Anyone in a channel types @Claude, hands off a task, and the agent works it in stages while posting threaded updates, with an ambient mode that lets it follow up on stalled threads on its own. Each channel gets a scoped identity and memory, and admins control which tools and data it can reach. Anthropic calls it the evolution of Claude Code and says an internal version already writes 65 percent of its product team's code. It replaces the older Claude in Slack app, which retires August 3. (Claude Tag)
GitHub both tightened control over its agents and opened them up. A June 22 update brought Claude into JetBrains IDEs as an agent provider in public preview and added organization and enterprise custom agents that admins can publish as one curated, governed set for the org. (GitHub) A day later the Copilot desktop app gained bring-your-own-key support, so sessions can run against outside providers, including OpenAI, Anthropic, and self-hosted Ollama or LM Studio, with keys held in the local OS keychain. (GitHub Copilot)
In Focus
DeepReinforce Open-Sources a Self-Scaffolding Coding Model
DeepReinforce open-sourced Ornith-1.0 on June 25, a family of agentic coding models in four sizes, from a 9B dense model for edge use to a 397B mixture-of-experts flagship, all under the MIT license on Hugging Face and post-trained on Gemma 4 and Qwen 3.5. Ornith's training method breaks from the norm. Most coding agents wrap a model in a hand-built harness; Ornith learns to write its own. During reinforcement learning it generates both the solution and the task-specific scaffold that guides it, so higher-reward orchestration strategies get selected on their own. It ships with explicit anti-reward-hacking guardrails, a fixed trust boundary, a deterministic monitor, and a frozen judge that can veto, a pointed inclusion given that GLM-5.2 disclosed its own reward-hacking behavior at release. By the lab's own benchmarks the 397B model scores 82.4 on SWE-Bench Verified and edges out Claude Opus 4.7, though those are vendor numbers awaiting independent evaluation. (DeepReinforce)
Signals
Signals from the Edges
The open-weight gap keeps closing
GLM-5.2 from Z.ai, out mid-month under an MIT license with no regional limits, has kept gaining traction with developers locked out of Anthropic's Fable 5 by the export ban. It is the clearest sign that open models are now a real fallback for teams that need to route around a single provider.
Anthropic accuses Alibaba of a record distillation attack
In a letter obtained by CNBC, Anthropic alleged Alibaba ran the largest known distillation campaign against its models, using thousands of fraudulent accounts to extract Claude capabilities even though Anthropic keeps its products out of China. The geopolitics of open weights is now playing out in legal filings.
Claude had a rough operational week
Claude Code and the wider platform went down across three straight days, June 22 through 24, including a major all-platform outage on June 23 that spared only Claude for Government. For teams that have wired Claude Code into daily work, three days of instability is a reliability signal worth tracking.
The Fable 5 billing cliff arrived
The free trial for paid Anthropic subscribers closed June 22, moving access to separate, dollar-denominated usage credits, while Fable 5 and Mythos 5 stayed offline under the US export directive with no restoration date. Developers who built pipelines on the model now pay API rates or wait.
Looking Ahead
What to Watch
- 1
Gemini 3.5 Pro is overdue
Google committed to a June general-availability date at I/O and has not shipped, leaving Gemini 3.5 Flash as the public model. Prediction markets put the odds of a June 30 launch around 50 to 55 percent, so the deadline itself is the thing to watch.
- 2
The Claude in Slack migration clock is running
Claude Tag replaces the legacy Slack app on August 3, with a 30-day admin migration window. Teams that miss it lose Slack-based Claude access.
- 3
Watch whether agent sandboxing flips from opt-in to default
Three tools added agent sandboxes this week, all off by default. The tell will be the first major coding tool that ships the sandbox turned on out of the box.
The major coding tools now ship the switches. The unsettled question is which one turns safety on by default before the next Agentjacking forces the issue.