The hottest Compute Substack posts right now

And their main takeaways
Category
Top Technology Topics
SemiAnalysis • 45763 implied HN points • 05 Feb 26
  1. Claude Code proves agentic AI works in practice by reading environments, planning multi‑step tasks, and executing them so people can ask for outcomes instead of writing code; this shift is already making "vibe coding" and long‑horizon automation real.
  2. The cost of usable AI intelligence is collapsing, so agents can cheaply automate many information workflows and threaten seat‑based SaaS moats, BI, analytics, and lots of back‑office knowledge work.
  3. Anthropic’s agent stack and model advances are driving rapid revenue and compute growth, while big cloud players—especially Microsoft—face a hard choice between allocating GPUs to grow Azure or prioritizing Copilot to defend Office, either of which risks their long‑term position.
New World Same Humans • 28 implied HN points • 22 Mar 26
  1. World models can simulate physical reality and let us run thousands of virtual experiments in parallel, speeding up tasks like robot training, materials testing, and drug discovery.
  2. By turning compute and energy into synthetic time, these simulations can compress years of real-world processes into hours or minutes, acting as a powerful lever on time.
  3. The main challenge will be managing and interpreting the huge volume of simulated outcomes, so we’ll need better tools or machine assistance to surface useful insights and decide what to explore.
Big Technology • 6755 implied HN points • 27 Feb 26
  1. AI training is shifting heavily toward reinforcement learning, which teaches models to complete real tasks instead of just predicting text.
  2. Task-based training needs detailed simulated environments and far more compute because models must try many steps to learn workflows like banking or booking.
  3. Reinforcement learning often doesn’t generalize well, so models are likely to specialize and diverge, with different systems becoming better at different kinds of tasks.
One Useful Thing • 2565 implied HN points • 12 Mar 26
  1. AI is getting much better, fast — across images, video, coding, and long tasks — and we’re now in a phase where autonomous agents can do hours of human work in minutes.
  2. Those new capabilities are already reshaping work: organizations are experimenting with AI-driven factories and workflows that cut down on human coding and review, which will change jobs and how teams are organized.
  3. This will produce rolling, sometimes sudden disruptions as capability thresholds are crossed, and recursive self-improvement could speed that up, so the rules and choices made now will strongly influence the future.
Astral Codex Ten • 12044 implied HN points • 12 Feb 26
  1. A compute-centered forecasting approach correctly captured that AI progress has largely tracked available compute and scaling laws, which explains much of the recent boom.
  2. The main error was underestimating algorithmic progress and effective compute growth (including longer training runs and test-time compute), so systems became far more powerful each year than the model assumed and pushed timelines much earlier.
  3. Forecasts are still useful but hinge on a few sensitive parameters, so you need proper sensitivity analysis and humility — uncertainty can cut both ways and make outcomes riskier than naive skepticism assumes.
Get a weekly roundup of the best Substack posts, by hacker news affinity:
Democratizing Automation • 688 implied HN points • 24 Feb 26
  1. Distillation — using a stronger model’s outputs as synthetic training data — is a routine, cost‑effective way to improve models and can give big gains on specific skills, but its benefits are uneven and often hard to integrate properly.
  2. Some labs reportedly ran large-scale distillation campaigns that generated hundreds of billions of synthetic tokens, which can meaningfully boost post-training performance for agentic behavior and coding, but that data alone usually can’t replace on-policy RL and heavy in-house training.
  3. Public accusations about illicit distillation have raised geopolitical and policy tensions, yet fully preventing distillation via distributed API access is practically very hard, so model providers must weigh open APIs against locking down capabilities.
TheSequence • 126 implied HN points • 15 Mar 26
  1. AI is rapidly shifting from chat assistants to autonomous, persistent workers that can plan, act, and even modify their own code, enabling self-improving research loops and agentic code review.
  2. Multi-agent frameworks and locally hosted persistent agents are spreading quickly, letting individuals automate complex workflows while also creating serious security and governance risks when agents gain deep system access.
  3. Massive capital is pouring into compute and new model paradigms — gigawatt-scale GPU factories and billion-dollar bets on grounded "world models" — alongside releases like multimodal embeddings that make retrieval and agent memory far more powerful.
Interconnected • 555 implied HN points • 16 Jan 26
  1. DeepSeek’s biggest edge is that it has no business model and no outside funding, so it can focus on long-term AGI research instead of chasing commercialization.
  2. Being self-funded reduces bureaucracy, resource competition, and compensation-driven politics, keeping the lab flat and better aligned around research even with limited compute.
  3. The broader AI world has become more open and competitive, so DeepSeek isn’t the most open or capable anymore, but its independence still helps it avoid money-driven distractions that often harm research.
State of the Future • 29 implied HN points • 27 Feb 26
  1. AI builders expect rapid, widespread disruption of white‑collar work, so societies will need to adapt fast to avoid big economic and employment shocks.
  2. The next big gains will come from orchestration, not just bigger chips or models — combining diverse hardware and specialised components will be a key competitive edge.
  3. Models and models' outputs are now attackable and competitive assets, so security and new architectures (many small agents checking each other) are becoming essential to reduce errors and theft.
ChinaTalk • 252 implied HN points • 14 Jan 26
  1. Compute power and scaling laws are the fulcrum of modern AI breakthroughs. Having more compute gives the U.S. time, not permanent safety, unless it pairs that lead with energy capacity, enforcement, and fast government adoption.
  2. Inventing frontier models isn’t enough — national security wins require integrating those models into military and intelligence workflows. Without a deliberate effort (a 'Rickover for AI') to operationalize AI, a country can invent the technology and still lose to an opponent that better applies it.
  3. AI is reshaping cyber operations by automating vulnerability discovery and accelerating intrusions, while also boosting defensive tools. The balance of power will come down to who best deploys AI across both offense and defense and who embeds defensive checks into software development.
Not Boring by Packy McCormick • 82 implied HN points • 06 Feb 26
  1. Leading labs released much smarter models this week—one general reasoning model and one focused on coding—and teams are using agent workflows to speed up research and engineering.
  2. Smarter models mean a surge in demand for inference compute, data centers, and energy, which is prompting massive CapEx plans from cloud and hardware companies.
  3. Breakthroughs are happening across fields: cultured brain cells can control drones, Waymo just raised huge funding while scaling many autonomous rides, and AI tools are being adopted and monetized far faster than prior technologies.
Import AI • 1238 implied HN points • 15 Jan 24
  1. Today's AI systems struggle with word-image puzzles like REBUS, highlighting issues with abstraction and generalization.
  2. Chinese researchers have developed high-performing language models similar to GPT-4, showing advancements in the field, especially in Chinese language processing.
  3. Language models like GPT-3.5 and 4 can already automate writing biological protocols, hinting at the potential for AI systems to accelerate scientific experimentation.
Jakob Nielsen on UX • 116 implied HN points • 13 Jan 26
  1. 2026 is the Integration Era: AI stops being a party trick and gets embedded into work and products through autonomous agents, generative UIs, and multimodal/physical capabilities. User experience and agent management, not raw model IQ, become the primary business differentiators.
  2. A compute-driven two-tier world will emerge: persistent shortages and costly inference mean premium subscribers get powerful, multimodal agents while most people use weaker, eco-models. This forces tiered pricing, compute-aware product design, and widens professional and economic divides.
  3. Human roles shift toward judgment, oversight, and trust work: people will focus on setting goals, auditing agent decisions, designing guardrails, and training via apprenticeships. New risks like AI-powered dark patterns will create demand for defensive agents, governance, and stronger UX ethics.
Interconnected • 92 implied HN points • 06 Jan 26
  1. Right now the US is judged to be slightly ahead of China in the AI competition, scored like a halftime football game (USA 29, China 25).
  2. The analysis breaks the competition into five stacked layers — energy, infrastructure capacity, chips/compute, foundational models, and applications — and scores each layer separately.
  3. Those layer-by-layer scores reveal trade-offs (for example, China scores higher on energy while the US leads on other layers), so who wins depends on which parts of the stack matter most.
SemiAnalysis • 7576 implied HN points • 27 Sep 23
  1. Eroom's Law and Moore's Law are critical in Semiconductors and Drug Research, analyzing time, money, and output.
  2. Healthcare, a $4 trillion industry, lags behind in technological progress driven by Moore's Law.
  3. Illumina acquisition by Nvidia could bridge the gap in genomics, addressing bottlenecks and enabling full-stack healthcare solutions.
TheSequence • 21 implied HN points • 05 Feb 26
  1. For years AI advanced by scaling up pre-training—more data, bigger models, and huge GPU time to bake capabilities into fixed weights.
  2. Test-time compute flips that idea by letting models use extra computation during inference to reason, plan, backtrack, and self-correct—basically "letting the model think."
  3. The big implication is that model performance depends not just on training compute but also on how much compute is allowed at inference, changing tradeoffs for how we build and deploy AI.
Alex's Personal Blog • 65 implied HN points • 18 Dec 25
  1. OpenAI is chasing enormous amounts of funding to buy more compute because limited GPUs are constraining both research and product growth, and that compute race is driving huge investment into chip makers and related firms.
  2. China says it has an operational EUV prototype, and if it turns that into production it could break ASML’s chokehold on high-end lithography and shift chipmaking power away from Taiwan and its partners.
  3. Political and corporate money are merging in odd ways, exemplified by a Trump-linked media company pairing with a fusion firm backed by big tech, showing that access to capital and government influence is reshaping deal logic beyond pure business sense.
TheSequence • 63 implied HN points • 11 Dec 25
  1. Modern AI depends on massive matrix multiplications run on GPUs, and much of its progress has come from scaling up models and GPU clusters.
  2. This brute-force scaling is hitting diminishing returns because it consumes huge amounts of energy and hardware, making further improvements increasingly costly.
  3. Researchers and startups are exploring radically different hardware—like analog chips, photonics, neuromorphic designs, and quantum systems—to build more efficient AI computers and move beyond GPUs.
The Asianometry Newsletter • 3130 implied HN points • 26 Apr 23
  1. AI models are growing in size, straining the current hardware's ability to support them.
  2. The memory wall problem arises due to limitations in memory capacity and processing speed.
  3. To address AI hardware challenges, innovative solutions like Compute-in-Memory are being explored.
The Product Channel By Sid Saladi • 6 implied HN points • 12 Feb 26
  1. Elon plans to run AI data centers in orbit, using Starship launches and much stronger solar power to make large-scale GPU compute cheaper and uncoupled from Earth grid limits.
  2. The main bottleneck for AI isn’t algorithms anymore but infrastructure — especially electricity and power delivery — so any AI product strategy must account for compute and energy constraints.
  3. The frontier model race and commercialization are accelerating: Anthropic and OpenAI shipped major new models with big long-context and coding gains, while platforms add ads and multi-model checks to fund and improve real-world use.
Technically • 18 implied HN points • 25 Nov 25
  1. To make AI smarter, we need more computers, especially powerful GPUs. The more compute power we have, the better AI models we can create.
  2. Building more data centers is required for this extra compute power, but our current power grid can't handle the demand. This could lead to problems as AI grows.
  3. Big tech companies are investing in nuclear power plants because renewable energy alone can't keep up with the energy needs of AI data centers.
Axis of Ordinary • 39 implied HN points • 12 Jan 24
  1. AI advancements showcased in different domains like video models, AI glasses for the visually impaired, and AI-powered cough tracking apps.
  2. Exciting developments in astronomy with potential signs of life on exoplanets.
  3. Innovation in computing with faster nanotechnology, graphene spintronics, and Silicon Photonics breaking bandwidth limitations.
Subsack • 4 implied HN points • 09 Dec 25
  1. Markets are dynamic, adversarial environments that force AI to adapt under uncertainty, making them a stronger real‑world benchmark than static puzzles. They test whether knowledge survives contact with reality, not just pattern recognition.
  2. Building an AI that works in markets demands new capabilities — sample efficiency, continual learning without catastrophic forgetting, long‑term memory, deep multimodal world models, and game‑theoretic strategic reasoning. Those constraints push research beyond today’s scale‑and‑transformer centric approach.
  3. Economic AGI offers a clear monetisation path: outperforming markets, running prediction markets, or allocating capital can directly convert intelligence into revenue. That revenue can make labs financially sustainable and fund further AGI research.
Pivotal • 1 HN point • 20 May 23
  1. Data and compute values have changed, affecting software and data business models.
  2. The data explosion in the decade led to new successful business models downstream.
  3. AI impact on data and compute leads to increased data value and the need for new tools and ecosystem in the AI-first world.
DYNOMIGHT INTERNET NEWSLETTER • 1 HN point • 06 Mar 23
  1. Using scaling laws can help predict how much better language models will get with more computational power or data.
  2. The majority of the error in language models comes from limited data, rather than limited model size.
  3. To improve language models significantly, more data and compute are needed, but there may be a limit to how much more can be added with current technology.