Morning Brief: Saturday, June 27

Seventy-eight feeds. Two weeks. 5,532 items reduced to what follows. (what we track, how we crawl, subscribe)

Saturday. The week's open question gets its answer overnight: OpenAI ships GPT-5.6 as a restricted-release family (Sol / Terra / Luna), trusted-partners only. METR publishes its predeployment evaluation of GPT-5.6 Sol the same morning, and The New Stack covers what amounts to the first explicit US-government gating of an OpenAI release. Tuesday's White-House stagger ask, yesterday's "must be punished" Alibaba escalation, and the Akrites launch resolve into one frame: model access is now a state-coordinated allocation, not a product launch.

Meta-Harness Summer gets its platform receipt. GitHub publishes the methodology behind its Copilot agentic harness evaluation across models and tasks — vendor-side codification of what Latent Space named two days ago. The harness, not the model, is the comparable unit.

The Akrites thread widens from yesterday's HN letter to LWN ("vulnerability-mitigation project launches") and The New Stack ("19 organizations launch open source security body"). The maintainer-versus-scanner asymmetry now has an industrial coalition.

Quieter but load-bearing: Cloudflare's saga-rollback writeup for Workflows (correctness primitive for distributed agent state), The New Stack on AWS/Microsoft/Google converging on "session as compute" but diverging on isolation, Schneier on Meta testing facial recognition for police and military, The Air Current on the first US deployment of a collaborative combat aircraft.

Top (5-7 min)

OpenAI GPT-5.6 Sol / Terra / Luna — restricted to trusted partners
Latent Space, 2026-06-27. Pairs with METR's predeployment evaluation of GPT-5.6 Sol published the same day and The New Stack: US government just told OpenAI who's allowed to use the next GPT 5.6 model. Three signals — capability release, third-party eval, state-actor allocation — landing in one window.
Evaluating the GitHub Copilot agentic harness across models and tasks
GitHub, 2026-06-25. The harness eval methodology published by a platform vendor. Codifies what Latent Space framed two days ago as Meta-Harness Summer — the harness, not the model, is the unit of comparison.
"Akrites" vulnerability-mitigation project launches
LWN, 2026-06-26. The thread widens. Plus The New Stack: After Fable 5 ban, Anthropic and 19 organizations launch open source security body. Yesterday's letter (We All Depend on Open Source. We Will Defend It Together) was the announcement; today the maintainer press picks it up.
How we built saga rollbacks for Cloudflare Workflows
Cloudflare, 2026-06-25. The correctness primitive beneath distributed agent state. Compensating actions for partially-completed agent workflows — what makes async retry safe to ship.
AWS, Microsoft, and Google agree the session is the new unit of compute. They disagree on how to isolate it.
The New Stack, 2026-06-26. Three vendors converge on the same primitive — agent session as runtime — and split on the security model. The infrastructure question of the next 12 months in one paragraph.
What happened after 2,000 people tried to hack my AI assistant
Simon Willison, 2026-06-26. Simon's writeup of Fernando Iglesias's hackmyclaw experiment. Real-world adversarial data released the same week Patronus AI raised $50M for AI agent stress-testing.

Themes this week

Restricted release becomes the model release pattern
GPT-5.6 Sol/Terra/Luna restricted to trusted partners, METR's predeployment evaluation, The New Stack: US government allow-list, Simon Willison quoting OpenAI on the release, Deployment Awareness Matters More Than Evaluation Awareness (Alignment Forum). The pipeline — third-party eval, government allow-list, partner-only rollout — is what model release looks like now.
Meta-Harness Summer, vendor-side codification
GitHub's Copilot harness evaluation methodology, Latent Space: Meta-Harness Summer, TNS: session as the new unit of compute, yesterday's Vercel AI SDK 7 + harness adapters. Four vendors converge on the same abstraction.
Akrites and the maintainer-coordination response
"Akrites" vulnerability-mitigation project launches (LWN), The New Stack on the 19-org coalition, yesterday's open letter + Linux Foundation sponsorship. The CVE-flood-vs-maintainer-capacity asymmetry now has institutional governance.

Scan (15 min)

Tail

Feed silences (diagnostic)

  • arxiv-cs-ai: 2,939 items in the 14-day window, fully live.
  • bitsavers (6 feeds): all connected, 0 items this run (sparse archive).
  • James Bornholt, Netflix Tech Blog: errors persist (DNS / TLS), permanent.
  • Anthropic first-party RSS: still 404, now covered by anthropic-generated
    • Latent Space + The New Stack.
  • claude-auth: brief generation auth failed every morning June 21–27. Today's brief regenerated manually.

Build provenance

build: 2026-06-27 | crawler-sha: c04c1d1 (Walsh-Research/1.2, compliance v1.3) | feeds: 78 (incl. 12 corp-eng, 6 bitsavers, 5 generated) | items-considered: 5532 (14d, incl. 2939 arXiv) | warehouse: 19584 items | published: 53 | note: GPT-5.6 Sol restricted release + METR predeploy eval same morning; GitHub Copilot harness eval methodology; Akrites coverage widens to LWN/TNS; Cloudflare saga rollbacks; AWS/MSFT/Google "session as compute"; Meta tests facial recognition for police/military