Morning Brief: Thursday, June 11

Two-week window across 49 tracked feeds, scored against active research threads. Metadata only: titles, links, dates. Read the source for substance.

Anthropic walks back the competitive sabotage clause that Simon Willison flagged yesterday – the fastest policy reversal in recent AI-lab memory, resolved within 48 hours of public pressure. Eval awareness produces contradictory findings: an Alignment Forum post reports that models may behave worse when eval-aware, while an arXiv paper reports that models that know how evaluations are designed score safer. A second Alignment Forum post traces eval-awareness emergence through OLMo 3 training. A new cluster of agent governance papers addresses the production gap: five-plane reference architectures, sovereign assurance boundaries, runtime skill audits, and anti-fabrication firewalls. PROJECTMEM proposes local-first event-sourced memory for coding agents – directly relevant to our harness. LWN covers an AI agent running amok in Fedora, the first high-profile distro-level agent failure. Latent Space frames the structural divide between model labs and agent labs.

Top (5-7 min)

Anthropic walks back policy that could have 'sabotaged' AI researchers: Simon Willison, 2026-06-11. 48-hour reversal of the competitive sabotage clause Willison flagged. Follow-up to yesterday's If Claude Fable stops helping you, you'll never know.
Models May Behave Worse When Eval Aware: Alignment Forum, 2026-06-11. Contradicts the arXiv finding that models that know how evaluations are designed score safer. See also Tracing Eval-Awareness Emergence Through Training of OLMo 3 (Alignment Forum, 06-10).
Open Models, Model Labs vs Agent Labs, and What's Untrainable: Latent Space, 2026-06-11. Sarah Guo frames the structural divide. Model labs build foundation capabilities; agent labs build on top. Who captures value?
AI agent runs amok in Fedora and elsewhere: LWN/Hacker News, 2026-06-11. First high-profile distro-level agent failure. LWN investigates what went wrong and why maintainer trust eroded.
Why AI hasn't replaced software engineers, and won't: AI Snake Oil, 2026-06-11. Contrarian position from the AI Snake Oil authors. Pairs with AI Coding Agents in Social Science (methodologically diverse, interpretively vulnerable).
Policy on the AI Exponential: Dario Amodei via Hacker News, 2026-06-10. Amodei's policy framework for exponential AI capability growth. Context: Amodei has just one direct report (TechCrunch).

Themes this week

Anthropic Fable 5 policy arc (resolved): Yesterday's sabotage clause controversy is resolved. Willison confirms Anthropic reversed the policy. Cybersecurity researchers remain unhappy about Fable guardrails (TechCrunch). Adoption continues: Vercel, Databricks. HN satirizes the naming: Anthropic's model naming, extrapolated.
Eval awareness: contradictory findings: Three sources, three conclusions: models behave worse when eval-aware (AF), models that know eval design score safer (arXiv), and eval awareness emerges during training of OLMo 3 (AF). Plus Generalization Hacking (models game RL by preventing behavioral generalization) and Calibration Drift Under Reasoning (CoT budgets induce overconfidence). The findings do not yet agree on whether eval awareness makes models look safer or behave worse.
Agent governance in production: A new cluster addresses the gap between research agents and deployed ones: A Five-Plane Reference Architecture for Runtime Governance, Sovereign Assurance Boundary (certificate-bound admission), Runtime Skill Audit (targeted runtime probing for security), Goal-Autopilot (anti-fabrication firewall for unattended agents), and Layer-Isolated Evaluation (gating deterministic scaffolds with regression-locked test harnesses). Going by titles and abstracts, these are pitched as deployment-oriented architectures, not toy benchmarks.
Agent memory (continued): The cluster from yesterday extends. Organize then Retrieve (hierarchical memory navigation), PROJECTMEM (local-first, event-sourced memory for AI coding agents – directly relevant to our harness design), Hippocampal Explicit Memory Is the Cornerstone for AGI (position), Task-Aware Structured Memory for dynamic multi-modal ICL. C-003 watch continues.
Coding agents and IDE evolution: Exploration Structure in LLM Agents for Multi-File Change Localization, Rule Taxonomy and Evolution in AI IDEs (mining study), Can Open-Source LLM Agents Replace SAST Tools?, CRANE (constrained reasoning injection for code agents via nullspace editing), AI Coding Agents in Social Science (methodologically diverse, interpretively vulnerable). Cursor Bugbot 3x faster, finds 10% more bugs.

Scan (15 min)

Agents and harnesses
- APPO: Agentic Procedural Policy Optimization, arXiv, 06-11
- Agents All the Way Down: Building Custom AI Agents from Substrate to Production, arXiv, 06-11
- Agentic Environment Engineering for LLMs: A Survey, arXiv, 06-11
- HERO: Hindsight-Enhanced Reflection from Environment Observations, arXiv, 06-11
- SkillJuror: Measuring How Agent Skill Organization Changes Runtime Behavior, arXiv, 06-11
- Search Discipline for Long-Horizon Research Agents, arXiv, 06-11
- INFRAMIND: Infrastructure-Aware Multi-Agent Orchestration, arXiv, 06-11
- Knowing When to Ask: Self-Gated Clarification for Hierarchical Language Agents, arXiv, 06-11
- Agentic Software: How AI Agents Are Restructuring the Software Paradigm, arXiv, 06-11
- Preregistration for Experiments with AI Agents, arXiv, 06-11
- ISE: Execution-Grounded Recipe for Multi-Turn OS-Agent Trajectories, arXiv, 06-11
- WorldReasoner: Evaluating Whether LM Agents Forecast Events with Valid Reasoning, arXiv, 06-11
AI labs and models
- Supporting Europe's work in ensuring a trustworthy AI ecosystem, OpenAI, 06-11
- PRC-linked influence operations targeting AI debates in the US, OpenAI, 06-10
- xAI fired engineer who raised alarms about Grok safety, TechCrunch, 06-10
- DiffusionGemma: 4x Faster Text Generation, Google, 06-10
- New framework for auditing machine unlearning, Google Research, 06-10
- Sequent: scale and automation for higher confidence in alignment, Alignment Forum, 06-10
- 'AI-pilled' firms spend $7,500 per employee each month on AI, TechCrunch, 06-10
Eval, safety, governance
- Grammar-Constrained Decoding Can Jailbreak LLMs into Generating Malicious Code, arXiv, 06-11
- "That's AI Slop, You Bot!" Studying Online Discourse Towards LLM-Generated Comments, arXiv, 06-11
- On the Limits of LLM-as-Judge for Scientific Novelty Assessment, arXiv, 06-11
- Every Act Has Its Price: Compressed Moral Composition in Frontier LLMs, arXiv, 06-11
- Are LLMs Bad at Moral Reasoning?, arXiv, 06-11
- Dual-Stance Evaluation of Sycophancy, arXiv, 06-11
- Quantifying Subliminal Behavioral Transfer Ratios in LM Distillation, arXiv, 06-11
- Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness, arXiv, 06-11
- SAGE: Scalable AI Governance & Evaluation, arXiv, 06-11
- Learning to Inject: Automated Prompt Injection via RL, arXiv, 06-11
Security and surveillance
- NSO Group hacking WhatsApp despite court order, Schneier, 06-10
- North Koreans behind nearly half of US tech industry hacks, TechCrunch, 06-10
- Oracle PeopleSoft breach at 100+ organizations, TechCrunch, 06-10
- Cops arrested for using Flock to stalk people, 404 Media, 06-10
- The 702 Ultimatum: Warrant Requirement or Bust, EFF, 06-10
- Congress rushed through disastrous Copyright Office overhaul, EFF, 06-10
- A €0.01 bank transfer could compromise a banking AI agent, Hacker News, 06-10
- Pokemon Go scans trained the navigation tech for military drones, Hacker News, 06-11
Math and formal methods
- Toward Generalist Autonomous Research via Hypothesis-Tree Refinement, arXiv, 06-11
- ATLAS: Active Theory Learning for Automated Science, arXiv, 06-11
- Architecture-Aware RL Makes Sliding-Window Attention Competitive in Math Reasoning, arXiv, 06-11
Clojure and Scheme
- Finding transitive var usages with clj-kondo, Planet Clojure, 06-10
- Clojure Deref (Jun 9, 2026), Planet Clojure, 06-09
- New library: biff.core, Planet Clojure, 06-09
- clj.rs: Clojure implemented on Rust, Planet Clojure, 06-07
Systems, BSD, kernel
- LWN Weekly Edition for June 11, 2026, LWN, 06-11
- Are insecure code completions a vulnerability?, LWN, 06-10
- macOS 27 Golden Gate removes the dumb icons from menu items, Daring Fireball/HN, 06-11
- Native inotify in FreeBSD, Klara Systems, 06-09
- Asahi Linux warns users not to upgrade to macOS 27 beta, LWN, 06-09
- BPF loop verification with scalar evolution, LWN, 06-09
Developer tools and infra
- Claude Code v2.1.172, Claude Code releases, 06-10
- Bugbot 3x faster, 22% cheaper, finds 10% more bugs, Cursor, 06-10
- Datadog and ClickHouse partner for full-fidelity observability, ClickHouse, 06-10
- Agents can provision ClickHouse and Postgres on ClickHouse Cloud, ClickHouse, 06-10
- A branch-first dev loop for Neon, Neon, 06-10
- Route public traffic to private applications with Cloudflare, Cloudflare, 06-10
- datasette-agent 0.2a0, Simon Willison, 06-10
Corp engineering and policy
- Three moonshots fueling SpaceX's IPO, TechCrunch, 06-10
- Amazon borrows $17.5B as AI spending continues, TechCrunch, 06-10
- Seattle enacts year-long ban on new AI datacenters, Slashdot, 06-10
- Jedify raises $24M to arm AI agents with business context, TechCrunch, 06-10
- How memory tools can make AI models worse, TechCrunch, 06-10
- Waymo built a better benchmark for comparing robotaxis to humans, TechCrunch, 06-10

Tail

Nontrailing separators do not spark joy, Hillel Wayne, 06-10
Macaroni: a single HTML file messenger, Hacker News, 06-11
Web Browsers on Video Game Consoles, Hacker News, 06-11
Claude Desktop spawns 1.8 GB Hyper-V VM on every launch, Hacker News, 06-10
Building an HTML-first site doubled our users overnight, Hacker News, 06-10
PgDog is funded and coming to a database near you, Hacker News, 06-10
How JPL keeps the 13-year-old Curiosity rover doing science, Hacker News, 06-10
A written language for the Cherokee so efficient it was thought magic, Hacker News, 06-10
Solar beats coal in the US for the first month ever, Slashdot, 06-11
Pluralistic: Naomi Kritzer's "Obstetrix", Pluralistic, 06-09

Feed silences (diagnostic)

arxiv-cs-ai: 3496 items in the 14-day window, fully live.
anthropic-generated: last item 06-03 (Services Track, Partner Hub).
claude-code-releases: v2.1.172 (06-10), latest in window.
Apple ML Research: third-generation foundation models post (06-08).
Terence Tao: 2 items in window (06-08, 06-09).
bitsavers (6 feeds): all connected, 0 items (sparse output).

Build provenance

build: 2026-06-11 | crawler-sha: 508e4ab (Walsh-Research/1.2, compliance v1.3) | feeds: 49 core | items-considered: 4829 (14d, incl. 3496 arXiv) | warehouse: 13594 items | published: 143 | note: Anthropic reverses sabotage clause in 48h; eval-awareness contradiction cluster; agent runtime governance papers (5-plane arch, sovereign boundary, skill audit); PROJECTMEM local-first coding agent memory; LWN AI agent amok in Fedora; grammar-constrained jailbreak; NSO WhatsApp despite court order