|
📖 Read In Depth
|
Gemini 3.5 Flash
Google released Gemini 3.5 Flash at Google I/O. The HN thread includes a comment inferring model architecture/parameter counts from TPU 8i specs, and a pricing analysis showing a notable 3x cost jump over the prior Flash generation — raising real questions about whether Google is moving upmarket or just inflating prices. Worth reading alongside the HN comments for the technical signal.
hn/Best Stories
|
Google changes its search box
Google overhauled its search box for the first time in 25 years: it now uses Gemini AI to handle longer queries, adds video generation, and is pushing AI Mode past 1B monthly users. The HN discussion cuts to the core tension: if Google summarizes your content for users, why let Googlebot crawl at all? The 'Google Zero' traffic scenario is no longer theoretical.
hn/Best Stories
|
Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks
An open-source reliability layer for self-hosted LLM tool-calling that uses guardrails (retry nudges, step enforcement, error recovery, VRAM-aware context management) to boost an 8B model from 53% to 99% on agentic tasks. Directly relevant to anyone building agentic systems from scratch — the gap between raw model performance and production reliability is a real engineering problem.
hn/Best Stories
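To make the "retry nudge" idea concrete, here is a minimal sketch of what such a guardrail could look like. It is not Forge's actual API: `call_with_retry_nudges`, `flaky_model`, and the message format are hypothetical names invented for illustration, and the "model" is a stub that fails once and then complies.

```python
import json
from typing import Callable


def call_with_retry_nudges(
    model: Callable[[list], str],
    prompt: str,
    required_keys: set,
    max_attempts: int = 3,
) -> dict:
    """Ask `model` for a JSON tool call; on bad output, append a nudge and retry.

    Hypothetical helper for illustration only, not taken from Forge.
    """
    messages = [prompt]
    for _ in range(max_attempts):
        raw = model(messages)
        try:
            call = json.loads(raw)
        except json.JSONDecodeError as exc:
            # Nudge: tell the model what went wrong instead of failing the task.
            messages.append(f"Your last reply was not valid JSON ({exc.msg}). Reply with JSON only.")
            continue
        if not isinstance(call, dict):
            messages.append("Reply with a single JSON object.")
            continue
        missing = required_keys - set(call)
        if missing:
            messages.append(f"Missing keys: {sorted(missing)}. Resend the full tool call.")
            continue
        return call
    raise RuntimeError(f"no valid tool call after {max_attempts} attempts")


def flaky_model(messages: list) -> str:
    """Stub standing in for an LLM: chatty on the first try, valid after one nudge."""
    if len(messages) == 1:
        return "sure, calling the tool now"
    return '{"tool": "search", "args": {"q": "forge"}}'


result = call_with_retry_nudges(flaky_model, "Find the Forge repo", {"tool", "args"})
print(result["tool"])
```

The design point is that a malformed reply becomes feedback for the next attempt rather than a hard failure, which is one plausible way a reliability layer can close the gap between raw and guarded success rates.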
|
CISA Admin Leaked AWS GovCloud Keys on GitHub
A CISA administrator leaked AWS GovCloud credentials on GitHub — an embarrassing and serious incident from the agency responsible for federal cybersecurity. Krebs typically has strong sourcing and technical detail. Worth reading for the specifics of how it happened and what it reveals about credential hygiene even at the highest levels.
hn/Best Stories
|
China Wants A.I. to Flourish, but Not at the Expense of Jobs
Chinese courts are issuing precedent-setting rulings to protect workers displaced by AI — a genuinely different policy response than anything happening in the US or EU. This is an early signal of how governments might use labor law rather than AI regulation as the primary lever, with implications for how AI deployment strategies will diverge across geographies.
nyt/Business
|
Everything in C is undefined behavior
A technically substantive deep dive into undefined behavior in C, a reliably interesting topic for anyone who builds algorithms from scratch. UB is one of the most counterintuitive aspects of systems programming, and this kind of article typically offers the precise, under-discussed analysis that rewards careful reading.
hn/Best Stories
|
How Iran Gained Leverage in the War
An analytical piece on how Iran used 'triangular coercion' (attacking Gulf states and closing the Strait of Hormuz) to gain leverage despite being outmatched militarily. This is the kind of strategic analysis that connects geopolitics, energy markets, and global supply chains in ways that affect everything from chip supply to data center energy costs.
nyt/Top Stories
|
|
⚡ FYI
|
Cerebras is running a trillion parameter model (Kimi K2.6) at 1000 tokens/s
Cerebras is serving Kimi K2.6 (a trillion-parameter model) at 1,000 tokens/second. This is a meaningful hardware milestone — it reframes what 'fast inference' means at frontier scale and is relevant context for thinking about inference chip competition and the role of non-NVIDIA architectures.
reddit/r/singularity
|
Meta Begins Laying Off 8,000 Employees Amid A.I. Transformation
Meta is laying off 8,000 employees as it restructures toward an AI-first model. Coming the same week as Google I/O and Karpathy's Anthropic move, this is part of a broader industry narrative: large tech companies are explicitly trading headcount for AI capability investment, and the job market for non-AI roles is getting squeezed.
nyt/Technology
|
Incident Report: Railway Blocked by Google Cloud (Resolved)
Railway was blocked by a Google Cloud account action, causing a significant outage. The HN thread captures the recurring pattern of GCP's ergonomic UX masking brittle account management — 'It has been 0 days since GCP has taken down a startup.' A useful data point for infrastructure and vendor risk reasoning.
hn/Best Stories
|
Mini Shai-Hulud Strikes Again: 314 npm Packages Compromised
314 npm packages were compromised in a new supply chain attack. Supply chain security in the npm ecosystem remains a chronic, structural problem — worth knowing about if you're consuming open-source dependencies in any production context.
hn/Best Stories
|
The Growing Anxiety Over AI, Jobs and the Future
A DealBook roundup on AI-driven layoff anxiety, covering both the Meta news and broader signals from commencement speeches and surveys. The macro labor story is real and accelerating — useful context for understanding the political and social backlash that will shape AI regulation.
nyt/Business
|
How Google Is Starting to Win the A.I. Race
A consumer-focused take on how Google's Gemini has caught up to and arguably surpassed ChatGPT in everyday utility after a rocky start. Skippable if you're following the technical coverage, but useful for understanding how the narrative around the AI race is shifting in mainstream perception.
nyt/Technology
|
The Best of ‘S.N.L.’ Season 51: Sensitive Strippers and Regretful Moms
SNL season 51 highlights, including the Will Ferrell finale. Included for completeness — skip unless you want the cultural temperature check on how late-night is processing the current political moment.
nyt/Arts
|