The hottest Machine Learning Substack posts right now

And their main takeaways
Category
Top Business Topics
Don't Worry About the Vase • 1792 implied HN points • 10 Jul 25
  1. Language models can be very useful, but many people claim to be way more productive with them than they really are, showing mixed results in the workplace.
  2. Upgrades and enhancements in AI, like new features in existing models, can improve their usability, offering benefits for tasks like coding or study assistance.
  3. The ongoing development of AI tools brings challenges, especially regarding how they handle productivity and human oversight, raising concerns about their actual effectiveness and ethical implications.
Democratizing Automation • 292 implied HN points • 14 Dec 25
  1. Open models made a dramatic jump in 2025, matching closed models on many benchmarks and becoming realistic options for real-world deployments beyond just privacy or fine-tuning.
  2. A few breakout releases — notably DeepSeek R1, Qwen 3, and Kimi K2 — had outsized influence, driving wider adoption and encouraging more open licensing from major labs, especially in China.
  3. The ecosystem exploded in scale and variety, with thousands of new models uploaded monthly, clear specialist niches and a public tiering of makers, leaving open models established and poised for further growth in 2026.
The Algorithmic Bridge • 318 implied HN points • 15 Dec 25
  1. Two leading AI figures are pursuing opposite goals: one is focused on building and containing a possible future superintelligence, while the other is building practical tutor-like agents for today’s use cases.
  2. Their stark disagreement, despite similar training and prestige, shows that even top experts don’t agree on AI’s ultimate path or timeline.
  3. That deep uncertainty extends across industry, academia, and investors, producing fragmented, independent bets instead of a coordinated plan for the future.
The ML Engineer Insights • 359 implied HN points • 22 Jun 24
  1. Building a strong foundation in machine learning fundamentals and staying updated with the latest research are crucial for success as a Machine Learning Engineer.
  2. Playing to your strengths, such as data and feature engineering, modeling, and deployment scalability, is key. Seek help in areas where you're less experienced.
  3. Focus on aligning your work with business goals, understanding trade-offs, ROI, and embracing experimentation. Continuous learning, networking, and mentorship are invaluable.
Marcus on AI • 4189 implied HN points • 09 Jan 25
  1. AGI, or artificial general intelligence, is not expected to be developed by 2025. This means that machines won't be as smart as humans anytime soon.
  2. The release of GPT-5, a new AI model, is also uncertain. Even experts aren't sure if it will be out this year.
  3. There is a trend of people making overly optimistic predictions about AI. It's important to be realistic about what technology can achieve right now.
Get a weekly roundup of the best Substack posts, by hacker news affinity:
Vasu’s Newsletter • 78 implied HN points • 25 Jan 26
  1. Each token creates query, key, and value vectors so it can ask what it needs, match that against other tokens, and gather useful information.
  2. Tokens compare their query to every key to get raw scores, convert those scores to attention weights with softmax, and use the weights to take a weighted sum of value vectors to produce a new contextual vector.
  3. Self-attention makes token meanings contextual (helping with pronouns, disambiguation, and long-range links), and models use multiple attention heads plus feed-forward layers to capture different relation patterns and refine each token's representation.
In My Tribe • 197 implied HN points • 21 Dec 25
  1. AI can run many human-like interviews and assessments cheaply and reliably, letting organizations collect richer open-ended responses at scale.
  2. Even when AI succeeds technically, the firms that build models might not capture the value—competition can erode profits and create financial risks even as enterprise usage and integration grow.
  3. Whoever controls the data, algorithms, and coordination networks gains real decision-making power, and AI’s fast adaptability could outpace human retraining and reshape many jobs.
Marcus on AI • 4663 implied HN points • 24 Nov 24
  1. Scaling laws in AI aren't as reliable as people once thought. They're more like general ideas that can change, rather than hard rules.
  2. The new approach to scaling, which focuses on how long you train a model, can be costly and doesn't always work better for all problems.
  3. Instead of just trying to make existing models bigger or longer-lasting, the field needs fresh ideas and innovations to improve AI.
Democratizing Automation • 150 implied HN points • 05 Jan 26
  1. Several major open models and updates landed at year-end — releases from NVIDIA, Arcee, LLM360, Zhipu and others noticeably pushed open-model capabilities higher.
  2. The community trend is toward bigger and Mixture-of-Experts (MoE) architectures, multi-token prediction, and openly releasing training data and checkpoints, which should speed progress and reproducibility.
  3. Important tradeoffs remain: some models excel on specific tasks like UI or coding but can be slower or weaker on very long-context workloads, and even larger, more capable variants are promised in 2026.
Alex Ghiculescu's Newsletter • 135 implied HN points • 19 Jan 26
  1. AI labs will focus on coding agents, with most development effort and revenue moving toward models that write software.
  2. Keeping up with rapidly improving AI coding tools will be the main challenge for software companies; engineering teams will need to learn new workflows and roll them out across people with different skills and enthusiasm.
  3. New techniques will close agents' domain-knowledge gaps so models can understand real codebases and make decisions, and those same solutions will boost many other AI applications.
The Algorithmic Bridge • 286 implied HN points • 12 Dec 25
  1. A clear set of twenty specific predictions about how AI will develop in 2026 is presented.
  2. The piece reviews results from 2025 predictions and commits to being more specific and accountable to improve forecasting accuracy.
  3. Full access to the detailed content is behind a subscription paywall, though a 7-day free trial is offered.
TheSequence • 112 implied HN points • 25 Jan 26
  1. Serving models (inference) is now the main battleground, drawing huge funding as startups race to make model serving boring, reliable, and infinitely scalable.
  2. New kernel-level tricks are cutting recomputation and memory waste: RadixAttention reuses KV cache blocks like an LRU to avoid recomputing prefixes, and PagedAttention pages KV memory so GPUs can pack many more requests without VRAM fragmentation.
  3. Latency and per-turn cost now define product quality, causing a split in the stack between orchestration/hardware layers that manage scale and kernel teams that squeeze every FLOP to make models fast and cheap.
Marcus on AI • 3161 implied HN points • 17 Feb 25
  1. AlphaGeometry2 is a specialized AI designed specifically for solving tough geometry problems, unlike general chatbots that tackle various types of questions. This means it's really good at what it was built for, but not much else.
  2. The system's impressive 84% success rate comes with a catch: it only achieves this after converting problems into a special math format first. Without this initial help, the success rate drops significantly.
  3. While AlphaGeometry2 shows promising advancements in AI problem-solving, it still struggles with many basic geometry concepts, highlighting that there's a long way to go before it can match high school students' understanding in geometry.
next big thing • 37 implied HN points • 12 Feb 26
  1. Greatness exists in distinct layers, and the gap between each level can be enormous — someone who’s great at one level can be thoroughly outclassed by the next.
  2. Many systems follow a power-law pattern where a tiny number of people, companies, or places capture most of the attention, wealth, or returns.
  3. AI, especially models that can help build and improve themselves, is accelerating that concentration, so a small set of firms is likely to pull much farther ahead.
Don't Worry About the Vase • 1299 implied HN points • 23 Jul 25
  1. OpenAI's ChatGPT Agent can now perform tasks like managing your calendar or shopping for groceries. It uses a combination of web browsing, research skills, and conversational abilities to help users with more complex requests.
  2. Although the ChatGPT Agent shows promise and can do some tasks well, like spreadsheet work, it still faces limitations. For now, it feels more like a helpful assistant rather than a full replacement for humans in many tasks.
  3. Safety is a top priority with the new capabilities of the ChatGPT Agent. OpenAI is taking steps to prevent misuse and ensure that the technology is used responsibly, especially in sensitive areas like biology and chemistry.
Don't Worry About the Vase • 3852 implied HN points • 30 Dec 24
  1. OpenAI's new model, o3, shows amazing improvements in reasoning and programming skills. It's so good that it ranks among the top competitive programmers in the world.
  2. o3 scored impressively on challenging math and coding tests, outperforming previous models significantly. This suggests we might be witnessing a breakthrough in AI capabilities.
  3. Despite these advances, o3 isn't classified as AGI yet. While it excels in certain areas, there are still tasks where it struggles, keeping it short of true general intelligence.
Data Science Weekly Newsletter • 159 implied HN points • 25 Jul 24
  1. AI models can break down when trained on data that is generated by other models. This can cause problems in how well they work.
  2. There is scientific research about the history of Italian filled pasta. It shows that most types likely came from a single area in northern Italy.
  3. There are new resources and guides available for improving predictive modeling with tabular data. These can help you build better models by focusing on how data is represented.
Encyclopedia Autonomica • 19 implied HN points • 09 Oct 24
  1. Using Transformer Agents 2.0 is a step up from traditional methods. They can handle multi-step tasks better and have memory to store information as they work.
  2. Setting up and building a basic ReAct Agent is straightforward. You only need to install some packages and create the agent using selected models and tools.
  3. You can orchestrate multiple agents together for more complex tasks. By combining different agents, you can enhance their capabilities and improve the results of your searches or queries.
Marcus on AI • 3952 implied HN points • 08 Dec 24
  1. Generative AI struggles with understanding complex relationships between objects in images. It sometimes produces physically impossible results or gets details wrong when asked to create images from text.
  2. Recent improvements in AI models, like DALL-E3, show only slight progress in handling specifications related to parts of objects. It can still mislabel parts or fail to follow more complex requests.
  3. AI systems need to improve their ability to check and confirm that generated images match the prompts given by users. This may require new technologies for better understanding between language and visuals.
Mindful Modeler • 639 implied HN points • 23 Apr 24
  1. Different machine learning models exhibit varying behaviors when extrapolating features, influenced by their inductive biases.
  2. Inductive biases in machine learning influence the learning algorithm's direction, excluding certain functions or preferring specific forms.
  3. Understanding inductive biases can lead to more creative and data-friendly modeling practices in machine learning.
Data Science Weekly Newsletter • 1418 implied HN points • 19 Jan 24
  1. Good data visualization is important. Some types of graphs can be misleading, and it's better to avoid them.
  2. In healthcare, it's not just about having advanced technology like AI. The real focus should be on getting effective results from these technologies.
  3. Netflix released a lot of data about what people watched in 2023. Analyzing this can help us understand trends in streaming better.
The Algorithmic Bridge • 3344 implied HN points • 21 Jan 25
  1. DeepSeek, a Chinese AI company, has quickly created competitive AI models that are open-source and cheap. This challenges the idea that the U.S. has a clear lead in AI technology.
  2. Their new model, R1, is comparable to OpenAI's best models, showcasing that they can produce high-quality AI without the same resources. It suggests they might be using innovative methods to build these models efficiently.
  3. DeepSeek’s approach also includes letting their model learn on its own without much human guidance, raising questions about what future AI could look like and how it might think differently than humans.
Enterprise AI Trends • 168 implied HN points • 30 Dec 25
  1. Meta's acquisition of Manus rescues a fast-growing but unprofitable startup and rewards its founders and investors, while adding geopolitical and competitive implications.
  2. Because Manus relied heavily on Anthropic's Claude, the deal creates strategic tension — Meta could replace Claude in Manus's agent loop and become a direct competitor to Anthropic.
  3. The purchase highlights a bigger industry debate: Meta is betting that agent scaffolding and tools — not just foundational models — hold the most value, a stance that could reshape AI strategy and competition.
Mindful Modeler • 419 implied HN points • 28 May 24
  1. Statistical modeling involves modeling distributions and assuming relationships between features and the target with a few interpretable parameters.
  2. Distributions shape the hypothesis space by restricting the range of models compatible with specific distributions like a zero-inflated Poisson distribution.
  3. Parameterization in statistical modeling simplifies estimation, interpretation, and inference of model parameters by making them more interpretable and allowing for confidence intervals.
The Security Industry • 35 implied HN points • 17 Feb 26
  1. AI development is accelerating fast, with new models that feel like a qualitative leap and are even being used to build the next generation of models.
  2. The AI security market has exploded into hundreds of companies, including many focused on automating SOC work, and it has attracted substantial venture funding.
  3. AI security is becoming a standard part of organizational defenses, and soon it will no longer make sense to treat it as a separate category because every vendor will have AI-driven security features.
Graphs For Science • 105 implied HN points • 10 Jan 26
  1. A strong theme is practical engineering: many books show how to turn LLM demos into working agents using RAG, embeddings, knowledge graphs, tool use, and prompt patterns to make outputs more reliable and auditable.
  2. There’s a clear focus on hands-on playbooks and trade-offs—quick-starts, checklists, code examples, and patterns for prototyping, retrieval, latency/cost decisions, multi-agent orchestration, and production concerns.
  3. The collection balances technical how-to guidance with broader perspectives on responsible use, human uniqueness, organizational strategy, and interdisciplinary science, highlighting ethics, norms for academics, and big-picture questions about life and intelligence.
Vesuvius Challenge • 98 implied HN points • 13 Jan 26
  1. The team has digitally unwrapped about 70% of the lower region of PHerc. 172 using a new automated pipeline that's over 10× faster than fully manual methods, though humans still must fix sheet‑switch errors.
  2. The unwrapped area covers roughly 7 meters by 14 cm and gives semi‑continuous surfaces with readable ink mainly on outer wraps and fragments; the upper ~30% is too mangled to unwrap reliably and the 7.9 µm scan resolution limits legibility compared with clearer 2.4 µm rescans.
  3. Help is needed to improve surface extraction (to reduce sheet switches), strengthen ink detection in hard inner regions, and make the pipeline more scalable and user‑friendly—there's an ongoing Kaggle challenge for surface detection.
Democratizing Automation • 195 implied HN points • 18 Dec 25
  1. The publication grew a lot this year and became a much more influential source of cutting‑edge AI analysis, reaching millions of pageviews and a much larger audience.
  2. Reinforcement learning, reasoning models, and open‑model ecosystems were the central technical themes, and major initiatives were launched to advance American open models and research infrastructure.
  3. Output hit practical limits after a year of high volume, so the focus is shifting to higher‑value work: prioritizing quality over quantity, investing in key projects, and using more open models going forward.
next big thing • 141 implied HN points • 01 Jan 26
  1. Autonomous, end-to-end AI agents will move from being copilots to pilots, owning whole workflows and delivering outcomes rather than just answering prompts.
  2. Persistent memory, proactive behavior, and on-device inference will make AI feel like a personal companion and unlock a wave of new consumer products, generative media, and personalized experiences.
  3. AI will start showing up in the bottom line, driving real deployments, new pricing models, hardware launches, and a surge of IPOs and M&A, while human-heavy AI services get exposed if they can’t prove machine-driven margins.
Dev Interrupted • 56 implied HN points • 03 Feb 26
  1. AI has erased the blank-page problem and speeds up code generation, but those upstream gains are being lost to chaotic code reviews, testing, and integration unless teams build proper infrastructure.
  2. Agentic tools that can control your local machine (like OpenClaw/Moltbot) show huge power but create major security and governance risks, so most organizations won’t give them autonomous control yet.
  3. The economics of software are shifting: survival favors substrate-efficient tools and firms with unique data or "insight compression," and the current "dark flow" of vibe coding can make teams feel faster while actually introducing hidden bugs, so risk-aware pipelines and better testing are essential.
Metacritic Capital • 6 implied HN points • 10 Mar 26
  1. AI training and inference costs are falling rapidly, with practical community optimizations already cutting costs by large orders of magnitude.
  2. Cheaper models let you run far more reasoning tokens, and that extra compute predictably improves performance; reinforcement learning with verifiable rewards can crystallize those gains.
  3. Falling costs combined with inference-time scaling and agent swarms create a feedback loop that can drive recursive self-improvement, so investors should expect faster capability growth and significant economic and safety implications.
TheSequence • 84 implied HN points • 29 Jan 26
  1. Reasoning comes from the interaction loop with the environment, not just from the model itself.
  2. Current LLMs act like fast, shallow 'System 1' pattern matchers, so they need agentic feedback loops to produce real-world reasoning and agency.
  3. The next frontier is designing the agentic loop and environment (the "new hidden layer") rather than only scaling model parameters.
The Palindrome • 4 implied HN points • 14 Mar 26
  1. Machine learning means training predictive models from data. The core setup uses a dataset, a parametric model (a hypothesis), and a loss function to measure how well the model fits the data.
  2. A model approximates the true input–output relation and depends on both its parameters and the training data (often written h(x; w, D)). Models can be deterministic or probabilistic and belong to different families like generative or discriminative.
  3. Which learning paradigm you use depends on what inputs, outputs, and labels are available — the main paradigms are supervised, unsupervised, semi‑supervised, and reinforcement learning. In supervised learning you have input–label pairs and the goal is to learn the mapping from x to y.
Redwood Research blog • 285 HN points • 17 Jun 24
  1. Achieving a 50% accuracy on the ARC-AGI dataset using GPT-4o involved generating a large number of Python programs and selecting the correct ones based on examples.
  2. Key approaches included meticulous step-by-step reasoning prompts, revision of program implementations, and feature engineering for better grid representations.
  3. Further improvements in performance were noted to be possible by increasing runtime compute, following clear scaling laws, and fine-tuning GPT models for better understanding of grid representations.
TheSequence • 84 implied HN points • 28 Jan 26
  1. Two new commercial companies from the vLLM and SGLang teams—Inferact and RadixArk—raised huge funding and are positioning themselves as major players in the inference stack.
  2. The focus is shifting from building bigger models to improving inference unit economics, so the software that manages memory, scheduling, and kernels is now the main battleground.
  3. Serving models efficiently is bottlenecked by scarce VRAM and the KV cache tax, because asynchronous and unpredictable inference patterns drive up cost and complexity.
TheSequence • 49 implied HN points • 12 Feb 26
  1. Evaluation moved from informal "vibe checks" to using stronger LLMs to automatically grade weaker models' outputs.
  2. That single-pass LLM-as-judge approach powered benchmarks like MT-Bench and Chatbot Arena, but simple intuitive judgments are becoming insufficient.
  3. The field is shifting to agent-as-a-judge, where evaluations need multi-step reasoning engines and dynamic, agentic judging instead of static benchmarks.
VuTrinh. • 319 implied HN points • 08 Jun 24
  1. LinkedIn processes around 4 trillion events every day, using Apache Beam to unify their streaming and batch data processing. This helps them run pipelines more efficiently and save development time.
  2. By switching to Apache Beam, LinkedIn significantly improved their performance metrics. For example, one pipeline's processing time went from over 7 hours to just 25 minutes.
  3. Their anti-abuse systems became much faster with Beam, reducing the time taken to identify abusive actions from a day to just 5 minutes. This increase in efficiency greatly enhances user safety and experience.
Software Bits Newsletter • 103 implied HN points • 05 Jan 26
  1. Transform hard problems into easier ones by moving to a different domain, doing the simpler computation there, and (if needed) transforming the result back; this is worth it when the transform cost plus the easier computation is less than solving the original problem.
  2. Use well-known transforms to fix numerical and computational issues: log-space turns tiny-product underflow into stable sums (use the log-sum-exp trick to add probabilities safely), Fourier turns convolution into cheap pointwise multiplication, and embeddings or kernels lift data so linear methods work.
  3. Always check that a transform preserves what you need and that the round-trip cost is justified; the best algorithms exploit problem structure by finding the space where the computation becomes simple.
Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 119 implied HN points • 29 Jul 24
  1. Agentic applications are AI systems that can perform tasks and make decisions on their own, using advanced models. They can adapt their actions based on user input and the environment.
  2. OpenAgents is a platform designed to help regular users interact with AI agents easily. It includes different types of agents for data analysis, web browsing, and integrating daily tools.
  3. For these AI agents to work well, they need to be user-friendly, quick, and handle mistakes gracefully. This is important to ensure that everyone can use them, not just tech experts.