Posts / agentic-coding

Claude's Wild June: Kill-Switches, Billing Retreats, and Google Burning Its Own Community

June was a lot. Not in a good way, mostly. A few genuinely useful things shipped, but the month will be remembered for a US export control order that took down Claude globally for 18 days, a billing change that nearly caused a mass exodus, and Google deciding to close-source a project that 6,000 community contributors had spent a year building. Let’s take these in order of how much they made me stare at the ceiling.

The Fable 5 situation is the one I keep coming back to. Anthropic launched Claude Fable 5 on June 9. Three days later, the Commerce Department invoked export control powers and mandated a shutdown for all foreign nationals. Anthropic, apparently unable to filter users by nationality in real time across AWS Bedrock, Google Cloud, Microsoft Foundry, and its own APIs, did the only technically feasible thing: shut it down for everyone. All integrations. Universal. For 18 days.

The controls were lifted on June 30. Access started coming back July 1.

I want to be clear that I’m not blaming Anthropic for the shutdown itself. The regulatory situation put them in an impossible position. What I’m reflecting on is what it revealed about my own assumptions. I have pipelines that name a specific model identifier. That’s it. One model, named explicitly, no fallback. June proved that’s not a minor oversight; it’s an operational risk that now has a concrete, documented precedent. A fallbackModel config pointing at Opus 4.8 is a weekend afternoon of work. I’m doing it this week.

The contract language point from the post-mortems is also worth sitting with. Most enterprise templates apparently relied on vague force majeure clauses rather than anything specific to government-mandated model suspension. That’s now a named risk. If you’re in a regulated environment, this is worth a conversation with legal before the next quarter.

The billing reversal is the other story. Starting June 15, Anthropic was going to separate Agent SDK usage, claude -p, Claude Code GitHub Actions, and third-party agent authentication into a separate credit pool: $20 for Pro, $100 for Max 5x. Billed at API rates. Community analysis put the effective price increase somewhere between 12x and 175x depending on workload.

Then on June 15, the day it was supposed to go live, Anthropic pulled it. The only public statement was “Nothing changes for now.”

Which is, you know, reassuring in the way that someone saying “I’ll think about it” is reassuring.

The underlying economics haven’t changed. Anthropic’s own tooling is optimised for prompt cache hit rates; third-party agents bypass that and process context from scratch on every call. Boris Cherny, who runs Claude Code at Anthropic, described it as “really hard to do sustainably.” He’s right. The pause exists because OpenAI is reportedly cutting API prices and Anthropic has IPO paperwork in flight. Those are competitive pressures, not a change of heart about the economics.

If you’re running production automation on a subscription plan, the architecture that was always intended for that use case is a direct API key with spend controls. The billing split will return. Building against that expectation now is just sensible.

Claude Sonnet 5 actually shipped something worth getting excited about, which felt like good news after the first two items. The Terminal-Bench 2.1 score is 80.4%, beating Opus 4.8’s 74.6%. That’s the first time a mid-tier Anthropic model has outright beaten the flagship on a terminal agentic coding benchmark. For the SWE-bench Pro variant, the one using actively-maintained real repositories with no training-data leakage, Sonnet 5 sits at 63.2% versus Opus’s 69.2%. A six-point gap at roughly half the cost.

For most of what I actually do in Claude Code, iterative refactors, test generation, PR reviews, that gap doesn’t decide anything. The launch pricing is $2/$10 per million tokens through August 31, which is genuinely attractive.

Here’s the trap though: the new tokeniser maps the same input to 1.0 to 1.35 times more tokens than previous versions. The introductory pricing is partially compensating for that inflation. After September 1, when it reverts to $3/$15, some workloads will cost roughly 30% more than they did on Sonnet 4.6. The window is real but measure your actual token consumption before assuming cost neutrality.

The Claude Code shipping updates are less dramatic but the ones I keep thinking about are the auto-mode safety changes. Destructive git commands, git reset --hard, git clean -fd, git stash drop, are now blocked unless you explicitly asked for them. Same with terraform destroy, pulumi destroy, cdk destroy. Previously you’d handle this in CLAUDE.md or hook configuration. Now it’s structural.

I’ve had moments where an agent confidently destroyed something I didn’t intend to destroy. It’s not fun. This is the right call.

The Artifacts feature is interesting for a different reason: it turns session output into a live-updating shareable page. When the session updates the artifact, anyone with the link sees it refresh. For async collaboration on incident pages or PR walkthroughs, that’s a genuinely different model than exporting a document. I want to try it on the next production incident.

The Google situation is the one that leaves the worst taste. Google accepted over 6,000 merged pull requests from community contributors to the open-source Gemini CLI over nearly a year. Then on June 18 they restricted Gemini CLI to enterprise customers only and replaced it with Antigravity CLI, which is closed-source, for everyone else. The justification, that Antigravity CLI shares an agent harness with Antigravity 2.0 ensuring future improvements propagate automatically, is architecturally coherent. The execution, done to a community that had invested significant time and effort, is something else.

Between December 2025 and March 2026, Google cut Antigravity’s usage quotas four times without advance notice. One developer documented going from 300 million input tokens per week to under 9 million at the same Pro subscription tier.

If you’re deep in Google Cloud, Firebase, BigQuery, Android, Antigravity 2.0 is probably worth evaluating seriously. The deep integrations with that stack are real. For everyone else, it’s a third option at best, and one I’d give 60 days before revisiting.

One genuinely useful thing in Cursor this month: the Bugbot update cut average review time from five minutes to around 90 seconds, found 10% more bugs per run, at 22% lower cost. There’s also a /review command that runs Bugbot before you open a pull request. 90 seconds is fast enough that this becomes a pre-push step rather than a PR-stage afterthought. That’s a meaningful change in how you use it.

The Team MCP distribution is also worth looking at if you’re coordinating shared integrations. Admin configures once, distributes across cloud agents, IDE, and CLI. No per-developer setup. That removes a friction point that was genuinely deterring MCP adoption on teams.

June felt like a month where the infrastructure of this stuff, the contracts, the billing models, the community relationships, got stress-tested in ways that the benchmark charts don’t capture. The models are getting better. The surrounding machinery is still being figured out, often at the expense of the people using it.

I don’t have a tidy way to end that observation.