Research Ecosystem: Morning Brief

Two-week window across the tracked feeds (60 core feeds this run: agents, evals, interp, formal methods, surveillance, BSD, Clojure/Scheme, SDR, aviation), scored against active research threads. Metadata only: titles, links, dates. Read the source for substance. (what we track, how we crawl)

The arXiv cs.AI firehose is back after the weekend lull (654 items in window, the largest single source); yesterday's brief shed it as Sunday-quiet. The four generated gap-fill feeds (DeepMind, Cursor, Claude Code, Anthropic) remain in with second-hand provenance flagged inline; DeepMind has now aged out of the 14-day window (last post 05-01).

Top (5-7 min)

No honor among (ad-tech) thieves
Pluralistic, 2026-05-25. Doctorow on ad-tech firms defrauding each other, the surveillance-economy underbelly that the adversarial ad-tech thread tracks. Fresh today and the cleanest statement of how the incentives rot from the inside.
Claude Compliance API support with Cloudflare CASB
Cloudflare, 2026-05-21. Agent governance shipped as a CASB product, paired with Claude Managed Agents. The commercial mirror of the Walsh-Research compliance contract, from the infra seat rather than the spec seat.
Agentic software development hypothesis
Marc Brooker, 2026-05-20. States the agentic-SDLC claim as something falsifiable rather than assumed. The CPRR move applied to the loudest claim in the field this quarter.
Frontier Risk Report (February to March 2026)
METR, 2026-05-19. Capability elicitation and risk evaluation from a named eval org. Primary material for the eval-under-constraints thread, the empirical counterweight to lab self-reporting.
Vega: zero-knowledge proofs for digital identity in the age of AI
Microsoft Research, 2026-05-21. ZK identity proofs pitched as the answer to agent-era verification. The constructive counterpart to the age-verification critique below; worth holding the two side by side.

Themes this week

Every model lab is now an agent lab
Latent Space says it outright; OpenAI's feed is wall-to-wall Codex (Gartner leader, Codex from anywhere); Cursor ships Composer 2.5; Microsoft ships MagenticLite and Fara. The product surface has converged on agents, which puts the weight on the harness, sandbox, and compliance layers rather than the model.
Identity and surveillance get an infrastructure layer
the same week brings Doctorow on ad-tech fraud, an FTC active-listening settlement, a Markup win on data brokers, and Microsoft's ZK identity proposal. The critique and the build-out are now arguing over the same substrate.
Eval realism is the contested ground
Alignment Forum argues for evaluating model behaviors and that risk reports must address deployment-time spread; METR ships a frontier risk report; Microsoft adds SocialReasoning-Bench. The move is from leaderboard deltas to whether an evaluation measures behavior that shows up under deployment.
Agent-build claims meet scrutiny
booster framing (Latent Space, OpenAI) meets empirical pushback (AI Snake Oil on the $916 OS, Torvalds on kernel bugs) and falsifiable claims (Brooker's hypothesis).

Scan (15 min)

Tail

Feed silences (diagnostic)

  • arxiv-cs-ai: recovered. 276 items this run, 654 in the 14-day window, the single largest source. Yesterday's brief shed it as Sunday-quiet; the weekday announcement batch is back.
  • deepmind-blog (generated): aged out of the window. Latest post is 05-01, now outside 14 days, so the DeepMind gap-fill contributed nothing this run. Monthly cadence; lean on first-party labs until the next post.
  • Dan Luu, James Bornholt, Netflix Tech Blog: errors this run (XML parse deep in the archive feed; host-side connection; TLS path). Bornholt and Netflix are transient and left in place to re-check; Dan Luu stays demoted.
  • harvard-seas: 0 items again (the Localist API returned no events this run).
  • Logic Magazine: still demoted (feed URL serves a Netlify 404 page). No longer crawled.
  • Generated feeds cursor-blog, claude-code-releases, anthropic-generated current via 304; deepmind-blog stale (see above). Third-party LLM-scraped RSS for publishers with no first-party feed; provenance second-hand, on a C-004 stability watch.

Build provenance

build: 2026-05-25 | crawler-sha: 44e3db1 (Walsh-Research/1.2, compliance v1.2) | feeds: 60 core (incl. 4 generated gap-fill, 1 aged out) | items-considered: 1059 (14d, incl. 654 arXiv) | published: 49 | note: arXiv firehose recovered post-weekend; DeepMind generated feed aged out of window; 3 feed errors (Dan Luu/Bornholt/Netflix)