The hottest Compute Infrastructure Substack posts right now

And their main takeaways

@adlrocha - Auto-research: The Lab that runs while you sleep

@adlrocha Weekly Newsletter • 64 implied HN points • 13 Mar 26

🕹 Technology Compute Infrastructure

A simple edit-evaluate-keep loop lets autonomous agents run short experiments and find real improvements by iterating quickly on a single editable training file and a fast proxy metric like validation bits-per-byte.
Many small agents running on varied hardware can share discoveries via gossip protocols and turn idle or distributed GPUs into a decentralized research swarm that accelerates optimizations collectively.
Picking the right evaluation and reward function is the hard part—designing clean, fast proxies and constraints (research taste) will matter more than raw execution in many fields, especially where feedback is slow or noisy.

On Dwarkesh Patel's 2026 Podcast With Dario Amodei

Don't Worry About the Vase • 4032 implied HN points • 16 Feb 26

🕹 Technology Compute Infrastructure

AI capabilities are advancing very fast, especially in coding, and it’s plausible that extremely powerful ‘genius’ systems in data centers could appear within a few years.
Despite expecting rapid technical progress, AI companies are deliberately cautious about buying massive compute and are prioritizing profitability to avoid overextending and failing.
Policy and geopolitics matter a lot: there’s strong support for export controls, international coordination, and clearer governance to manage risks and competition, while alignment and existential risk concerns are getting less attention in practice.

AI #155: Welcome to Recursive Self-Improvement

Don't Worry About the Vase • 3225 implied HN points • 12 Feb 26

🕹 Technology Compute Infrastructure

AI capabilities are accelerating rapidly, with new model releases improving agentic coding, in-context continual learning, and media generation so fast that benchmarks and measurement struggle to keep up.
These advances are already reshaping economies and work: automation and agentic tools threaten many jobs, trigger volatile market reactions, and push companies toward new monetization and product strategies like ads and verticalized offerings.
Safety, alignment, and governance remain urgent unresolved problems; researchers are worried or leaving, red lines get crossed, and connecting powerful models to real-world systems (labs, agents, surveillance) creates legal and existential risks we aren’t yet managing.

45 Thoughts About Agents

Am I Stronger Yet? • 846 implied HN points • 02 Mar 26

🕹 Technology Compute Infrastructure

AI agents are the fastest-moving layer of the AI stack and are accelerating capabilities through rapid software updates and user-driven experimentation. They make ambitious tasks feasible and are already changing what people can build and how quickly.
Getting real value from agents means reshaping workflows: pick agent-shaped tasks, give very clear success criteria, and have agents check their own work or use separate checkers to avoid endless revision loops. Good prompts and orchestration often save far more time than fixing sloppy outputs.
Widespread agent use will create big productivity gains and new kinds of risk at the same time — think compute limits, safety tradeoffs, and the possibility of autonomous or rogue agents — so adoption will bring fast cultural change and new policy questions.

Gas town

benn.substack • 1431 implied HN points • 30 Jan 26

🕹 Technology Compute Infrastructure

Gas Town imagines AI as a sprawling factory of agents that spawn more agents to write, test, and fix code, producing enormous and fast but often messy output. Progress there is driven by throughput and relentless experimentation, so lots of work is wasted as part of the process.
This speed-first, industrialized approach fuels hype and frantic product churn but is unsustainable: it creates feature bloat, enormous compute and financial waste, and most of the many experiments and startups will fail. The result is not utopia but anxiety, short lifecycles, and uneven value creation.
All that frantic online building can distract from real-world problems that need people in the streets and communities on the ground. Individuals face a choice between staying locked into endless 'vibe coding' or stepping away to do tangible, local work that actually helps neighbors.

Get a weekly roundup of the best Substack posts, by hacker news affinity:

AI #151: While Claude Coworks

Don't Worry About the Vase • 2777 implied HN points • 15 Jan 26

🕹 Technology Compute Infrastructure

AI systems are advancing fast and being built into many real products. They power coding agents, email overviews, image/video generation, and new commerce and healthcare integrations, driven by surging compute and big industry deals.
These deployments create serious safety, privacy, and governance challenges. Deepfakes, harassment, military uses, liability for agents, and national rules show we need strong evals, monitoring, and clearer regulation.
The economic and labor impact is large but uncertain. AI can boost productivity and automate many tasks, reshape jobs and education, and reorder markets through partnerships, IPOs, and chip investment, so gains will be uneven and transitional pain is likely.

Zhipu and MiniMax IPO

ChinaTalk • 800 implied HN points • 19 Jan 26

🕹 Technology Compute Infrastructure

Zhipu is selling model-as-a-service to businesses and public-sector clients while MiniMax is a consumer-focused, multimodal company whose companion apps drive huge user counts but low per-user revenue.
Neither firm owns massive training farms; both rely on external cloud/GPU providers, with MiniMax explicitly using a light-asset, outsourced model and Zhipu increasingly buying cloud services.
Each company frames AGI and safety to match its strategy—Zhipu leans on LLM research and safety commitments, MiniMax pushes multimodality and companion use—while big‑tech and state investors, cross‑ownership, and regulatory/legal risks shape their commercial prospects.

The All-Star Chinese AI Conversation of 2026

ChinaTalk • 696 implied HN points • 13 Jan 26

🕹 Technology Compute Infrastructure

China has huge AI talent and a vibrant open-source scene, but real gaps remain — especially around compute supply, chip/lithography production, and the broader software ecosystem, so the leadership gap with top US labs may not be shrinking as it seems.
The next paradigm will come from agents, native multimodal sensory integration, and much better memory/continual learning, plus hardware-software co-design; these advances are what will let AI handle long, real-world tasks and drive strong productivity gains for businesses.
China’s odds of becoming the global AI leader in 3–5 years hinge on fixing structural issues: more domestic compute or chip breakthroughs, a mature To‑B market that will pay for productivity, a stronger risk-taking culture for paradigm-shifting research, and wider education so people can actually use AI effectively.

Weekly Top Picks #112: Besides Moltbook

The Algorithmic Bridge • 244 implied HN points • 03 Feb 26

🕹 Technology Compute Infrastructure

Building and running frontier AI models is extremely expensive and they depreciate quickly, so firms often only barely break even because R&D and rapid model turnover eat profits.
Who’s winning the AI race depends on what you measure: Chinese players like DeepSeek are taking market share and publishing new scaling advances, but the overall picture is mixed and some elite researchers are pessimistic.
Privacy and governance are lagging—interactions with AI are frequently monitored, and internal safety conflicts at big labs can paradoxically accelerate competition instead of slowing it.

The Sequence Radar #775: Last Week in AI: Tokens, Throughput, and Trillions

TheSequence • 63 implied HN points • 21 Dec 25

🕹 Technology Compute Infrastructure

Massive funding and infrastructure bets are setting the rules: the companies that can industrialize models into cheap, reliable global services will win more than those with just the fanciest research demos.
Engineering focus has shifted to throughput, latency, and long-context agentic capabilities, with new models and hardware optimized to move lots of tokens through multi-step workflows at predictable cost.
Generative outputs and developer workflows are becoming iterative and productized — image editing in chat and tightened data/observability loops make AI a usable creative IDE, while enterprise platforms race to own the data plane and production tooling.

UX Roundup: How People Use AI | AI Does User Research | Founding Designers | Prompt Engineering | AI Agent Use Cases | Electricity Fuels AI | GPT 5.2 Beats Humans

Jakob Nielsen on UX • 23 implied HN points • 15 Dec 25

🕹 Technology Compute Infrastructure

Workers are already using AI a lot — often secretly — so product design must support both automation and collaboration, teach prompting, and give users control (especially for creative workflows that need canvas-style UIs and curator tools).
AI can run and analyze large-scale interviews, turning qualitative insights into quantifiable themes and making researchers into orchestrators, but agent behavior and user needs change over time so longitudinal usability studies are essential.
Simple persona prompts don’t improve factual accuracy, yet models and costs are improving rapidly — cutting task costs and enabling AI to outperform experts on many half-day tasks — so designs and infrastructure (including power capacity) must evolve quickly.

The Sequence AI of the Week #777: Thinking Fast, Thinking Cheap: Thinking Fast, Thinking Cheap: The Nemotron 3 Blueprint

TheSequence • 14 implied HN points • 24 Dec 25

🕹 Technology Compute Infrastructure

NVIDIA launched the Nemotron 3 family (Nano, Super, and Ultra), establishing a new baseline for open-weight AI and moving into the reasoning-model race.
The models use a hybrid Mamba-Transformer Mixture-of-Experts design, and Nemotron 3 Nano achieves a new state-of-the-art for the 30B parameter class, showing strong efficiency and performance.
This release signals a shift away from brute-force dense Transformers toward more architecture-efficient, cost-effective models that matter for enterprises and researchers.

What We Talk About When We Talk About AI

Center for Veb Account Research Newsletter • 3 implied HN points • 12 Dec 25

🕹 Technology Compute Infrastructure

AI is best understood as a set of decision‑making tools that 'satisfice' — they search for good‑enough solutions in complex models instead of finding perfect mathematical optima like operations research.
AI tools expand a user or organization's administrative capacity by enabling new actions and complex modeling, but they can be brittle and depend heavily on training data and organizational process; the financial hype or stock valuations around AI are distinct from its practical usefulness.
Intelligence and consciousness are not the same: systems can perform many cognitive tasks and even be 'general' in the sense of producing and using satisficing models without being conscious, so task performance alone doesn't imply subjective experience.