← ArchiveAbout
Morning Digest
Thursday, May 14 · ~5 min read
📖 Read In Depth
New Mythos checkpoint shows continued improvement: “On a 32-step corporate network attack we estimate takes a human expert ~20 hours, this checkpoint completes the full attack in 6 /10 attempts.”
Anthropic's Mythos security AI model can now complete a full 32-step corporate network attack (estimated at ~20 human-expert hours) in 6 out of 10 attempts autonomously. This is a concrete capability benchmark that cuts through AI hype — the kind of empirical result that matters for understanding where AI agents actually stand on hard, real-world tasks.
reddit/r/singularity
Why the Bombing of Iran Tied the U.S. More Closely to China
The Iran war has drained US weapons stockpiles, creating a structural dependency on China for rare-earth minerals needed to rebuild. This is a sharp piece on how a military conflict reshapes geopolitical leverage — directly relevant to the semiconductor supply chain and the broader US-China technology competition that affects Xinyu's world.
nyt/Business
The Emacsification of Software
An essay arguing that modern software is trending toward 'Emacsification' — highly extensible, programmable environments where the tool is also the substrate. Given Xinyu's interest in software architecture and building from scratch to understand systems, this likely engages with deep questions about where the IDE/agent/tool boundary is heading.
hn/Best Stories
Deterministic Fully-Static Whole-Binary Translation Without Heuristics
A paper on deterministic, fully-static whole-binary translation without heuristics — essentially lifting arbitrary binaries to an IR for analysis or recompilation with mathematical guarantees rather than pattern matching. For someone who builds algorithms from scratch to understand them, this is the kind of systems/CS theory crossover that's both intellectually interesting and practically significant for security and compatibility work.
hn/Best Stories
Leaving GitHub for Forgejo
A developer's detailed account of migrating from GitHub to Forgejo (a Gitea fork), covering the technical and philosophical reasons — GitHub's use of public repos to train Copilot being a central concern. Gets into the actual mechanics and tradeoffs, not just the politics.
hn/Best Stories
I moved my digital stack to Europe
A developer walks through migrating their entire digital infrastructure to EU-based providers amid growing concern about US data sovereignty under the current political climate. The 983-point HN score and the comments from EU government contractors suggest this is capturing a real, accelerating market shift worth understanding.
hn/Best Stories
Why China Isn’t Worried A.I. Will Replace Its Workers
Ross Douthat examines why China isn't worried about AI displacing its workers — a fundamental difference in how the two superpowers conceptualize AI's economic role. The divergence in incentive structures and policy frameworks is directly shaping the global AI race in ways that matter to anyone building in the field.
nyt/Top Stories
🎬 Check It Out
The Handmaiden….wow.
An enthusiastic r/TrueFilm thread on Park Chan-wook's *The Handmaiden* (2016), treating it as his masterpiece for its structural brilliance and layered twists. Available on various streaming platforms. If you haven't seen it, this is a strong recommendation — it's a film that rewards exactly the kind of analytical attention Xinyu applies to systems.
reddit/r/TrueFilm
⚡ FYI
Mozilla Used Anthropic’s Mythos to Find and Fix 271 Bugs in Firefox
Mozilla used Anthropic's Mythos AI to find and fix 271 bugs in Firefox — a significant real-world deployment of AI-assisted security research at scale on a major open-source codebase. Pairs directly with the Mythos checkpoint story above, but this is the applied/production angle worth knowing about.
reddit/r/singularity
A.I. and Humans Battle It Out in a Cybersecurity Showdown
In a national cybersecurity competition, AI agents performed respectably on their own but humans augmented by AI outperformed both. The nuanced finding — AI helps most when the human knows how to direct it — is a useful data point on where AI agency actually stands in adversarial, real-world settings.
nyt/Technology
CERT is releasing six CVEs for serious security vulnerabilities in dnsmasq
CERT is releasing six CVEs for serious vulnerabilities in dnsmasq, which is embedded in routers, IoT devices, Android, and countless Linux systems worldwide. If you run any networked infrastructure or embedded systems, you'll want to patch quickly — the attack surface here is enormous.
hn/Best Stories
Figure AI livestream: watch a team of humanoid robots running a full 8-hour shift at human performance levels, fully autonomous.
Figure AI ran a full 8-hour autonomous shift with a team of humanoid robots at claimed human performance levels, with a live stream. There are skeptical observations (possible teleoperator handoffs noted by other users) worth cross-referencing, but if even partially validated, it's a meaningful milestone in robot deployment at scale.
reddit/r/singularity
Can Some Very Tiny Particles Cool the Planet? One Tech Company Says Yes.
A startup called Stardust Solutions claims its engineered particles can reflect sunlight to cool the planet without environmental harm, and is moving toward deployment. The governance question — whether private companies should be able to unilaterally alter Earth's climate — is genuinely unresolved and increasingly urgent.
nyt/Business
Generated by Daily Digest · Powered by your config, not an algorithm