The hottest Machine Learning Substack posts right now

And their main takeaways

The Sequence Chat: The Transition that Changes Everything. From Pretraining to Post-Training in Foundation Models

TheSequence • 56 implied HN points • 04 Dec 24

🕹 Technology Machine Learning

The transition from pretraining to post-training in AI models is a big deal. This change helps improve how AI can reason and learn from data.
New models like DeepSeek's R1 and Alibaba's QwQ are now using this transition to become smarter and more effective. They can solve complex problems better than before.
The shift is moving away from older methods like reinforcement learning from human feedback (RLHF). Instead, new approaches are being developed that promise to make AI work even better.

A simple test for AI comprehension

Logos • 19 implied HN points • 21 Jan 24

🕹 Technology Machine Learning

The author tests AI's understanding using a guessing game. The AI struggled and often made mistakes, which raises questions about its comprehension.
LLMs act like children by mimicking language without true understanding. They can say the right words but might not grasp the ideas behind them.
The argument suggests that while LLMs can analyze complex topics, their understanding is shallow compared to human comprehension.

How do transformers work?+Design a Multi-class Sentiment Analysis for Customer Reviews

The ZenMode • 134 HN points • 04 Feb 24

🕹 Technology Machine Learning

Transformers are crucial in AI for tasks like natural language processing.
The encoder dissects the input text and uncovers hidden connections, while the decoder crafts the output.
Transformers employ layers like self-attention, multi-head attention, and masked self-attention for processing text.

Launch of Conformal Prediction Book 🦫

Mindful Modeler • 59 implied HN points • 14 Feb 23

🚌 Education Machine Learning

Conformal prediction can be combined with any uncertainty quantification method you already use, making it versatile and not restrictive.
Conformal prediction is model-agnostic, meaning you can implement it without changing your existing models or user interface.
One of the key advantages of conformal prediction is its coverage guarantee: prediction sets contain the true outcome at a rate you choose, making it a practical and useful addition to predictive modeling.
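
As a rough illustration of the coverage idea, here is a minimal sketch of split conformal prediction for regression. It is not taken from the book: the `model` callable, the calibration numbers, and the helper names `conformal_quantile` and `predict_interval` are all assumptions made for this example, and absolute residuals are just one possible choice of nonconformity score.

```python
import math

def conformal_quantile(scores, alpha):
    """Return the split-conformal quantile of the calibration scores."""
    n = len(scores)
    # Finite-sample corrected rank: ceil((n + 1) * (1 - alpha)).
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        return math.inf  # too few calibration points for this alpha
    return sorted(scores)[rank - 1]

def predict_interval(model, x, q):
    """Wrap a point prediction in a symmetric interval of half-width q."""
    y_hat = model(x)
    return (y_hat - q, y_hat + q)

# Hypothetical pre-trained model; any callable works, which is the
# model-agnostic part.
model = lambda x: 2.0 * x

# Held-out calibration data (made-up numbers for illustration).
calib_x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0]
calib_y = [2.1, 3.8, 6.3, 7.9, 10.4, 11.7, 14.2, 16.5, 17.6]

# Nonconformity score: absolute residual on the calibration set.
scores = [abs(y - model(x)) for x, y in zip(calib_x, calib_y)]
q = conformal_quantile(scores, alpha=0.25)  # target: about 75% coverage
lower, upper = predict_interval(model, 10.0, q)
```

The model itself is never retrained or modified; only its residuals on held-out data are used, which is why the technique can sit on top of an existing pipeline.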

Open source libraries for Text and Time Series

Gradient Flow • 99 implied HN points • 25 Aug 22

🕹 Technology Machine Learning

Consider incorporating tools built on transformer-based language models, like BERTopic, PolyFuzz, and KeyBERT, in NLP pipelines for text analysis.
Explore new open source libraries like Merlion, Nixtla, Kats, and Greykite for time series analysis and modeling.
Learn about AI toolkits like Ray AI Runtime (AIR) that unify ML libraries, facilitating scaled machine learning workloads with minimal code.

Popular LLM Datasets

The Beep • 19 implied HN points • 21 Jan 24

🕹 Technology Machine Learning

Datasets are crucial for training machine learning models, including language models. They help the model learn patterns and make predictions.
Popular sources for datasets include Project Gutenberg and Common Crawl, which provide large amounts of text data for training language models.
Instruction tuning datasets are used to adapt pre-trained models for specific tasks. These help the model perform better in given situations or instructions.

Who Gets to Compute?

Technically Optimistic • 19 implied HN points • 19 Jan 24

🕹 Technology Machine Learning

Training large language models (LLMs) has a high barrier to entry because of the cost of resources like talent, data, power, and computing. This could lead to a situation where only big tech companies control AI, but there's hope for more diversity with smaller models.
Direct Preference Optimization (DPO) is a potential game-changer in training LLMs as it skips the need for a costly reward model, reducing the barrier to entry for creating new models and potentially allowing for more diverse players in AI development.
While DPO may make training large language models more accessible and less costly, it skips an important step involving human feedback that helps iron out biases and improve understanding of how these systems work, possibly hindering explainability efforts.
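
To make the "no separate reward model" point concrete, here is a minimal sketch of the DPO loss for a single preference pair. It assumes you already have summed log-probabilities of the chosen and rejected responses under the policy being trained and under a frozen reference model; the function name, argument names, and the `beta` default are choices made for this example, not taken from the post.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) response pair."""
    # How much more the policy likes each response than the reference does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    # Implicit reward margin between the two responses.
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(margin)) written as log(1 + exp(-margin)).
    return math.log1p(math.exp(-margin))

# Policy identical to the reference: no preference learned yet.
baseline = dpo_loss(-5.0, -5.0, -5.0, -5.0)

# Policy has shifted probability toward the chosen response.
improved = dpo_loss(-4.0, -6.0, -5.0, -5.0)
```

The preference signal enters directly through the log-probability ratios, so the only models involved are the policy and its frozen reference copy.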

Identifying unmaintained open source packages at scale

Once a Maintainer • 5 implied HN points • 20 Nov 25

🕹 Technology Machine Learning

Open source packages can become abandoned when original developers lose interest, meaning they might not get important updates or security fixes.
To find abandoned packages, you can look at factors like how often the package has updates, the activity of commits, and what maintainers say about the package.
Machine learning models can help predict whether a package might be abandoned by combining various factors like release frequency, maintainer communication, and community engagement.

The Bitter Lesson

Artificial Fintelligence • 20 implied HN points • 26 Jun 25

🕹 Technology Machine Learning

Over time, methods that use more computing power will usually do better than those that don't. It's important to think about how to use more compute in AI.
In the short term, adding human knowledge can help achieve good results quickly, but it's often not a good long-term strategy. Relying too much on human input can stall advancement.
Real success in AI comes from focusing on general improvements that can scale, rather than chasing quick wins with expert knowledge. This approach is harder but pays off in the long run.

Claiming LLMs are merely "next token predictors" is a fundamental misunderstanding

The Future of Life • 19 implied HN points • 18 Jan 24

🕹 Technology Machine Learning

LLMs are more than just next-token predictors. They use complex internal algorithms that let them understand and create language beyond simple predictions.
Token prediction, the process that powers LLMs, is just a tool that leads to their true capabilities. These systems can evolve and learn in many sophisticated ways.
Understanding LLMs isn't easy because their full potential is still a mystery. What limits them could be anything from their training methods to the data they learn from.

Data Prepare of Basic Retrieval Augmented Generation

The Beep • 19 implied HN points • 18 Jan 24

🕹 Technology Machine Learning

Retrieval Augmented Generation (RAG) helps combine general language models with specific domain knowledge. It acts like a plugin that makes models smarter about particular topics.
To prepare data for RAG, you need to load, split, and create vector stores from your documents. This process helps in organizing and retrieving relevant information efficiently.
Using RAG can improve the accuracy of responses from language models. By providing context from relevant documents, you can reduce errors and make the information shared more reliable.
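
The load, split, and vector-store steps can be sketched in a few lines. This is a toy illustration, not the post's code: the documents are made up, `embed` is a bag-of-words stand-in for a real embedding model, and the "vector store" is a plain Python list rather than a real vector database.

```python
import math
import re
from collections import Counter

def split_text(text, chunk_size=8):
    """Split a document into chunks of at most chunk_size words."""
    words = text.split()
    return [" ".join(words[i:i + chunk_size])
            for i in range(0, len(words), chunk_size)]

def embed(text):
    """Toy embedding: word counts (a real system would use a model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def build_store(documents, chunk_size=8):
    """Load documents, split them, and store (chunk, vector) pairs."""
    return [(chunk, embed(chunk))
            for doc in documents
            for chunk in split_text(doc, chunk_size)]

def retrieve(store, query, k=1):
    """Return the k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(store, key=lambda item: cosine(q, item[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]

# Hypothetical domain documents.
documents = [
    "Refunds are issued within 14 days of purchase. Contact support to start a refund.",
    "Shipping takes three to five business days. Express shipping is available at checkout.",
]
store = build_store(documents)
top = retrieve(store, "shipping business days", k=1)
```

The retrieved chunks would then be placed in the prompt as context for the language model, which is the step that grounds its answer in the domain documents.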

Edge 451: Is One Teacher Enough? Understanding Multi-Teacher Distillation

TheSequence • 56 implied HN points • 26 Nov 24

🕹 Technology Machine Learning

Using multiple teachers in distillation is better than just one. This method helps combine different areas of knowledge, making the student model more powerful.
Each teacher can focus on a specific type of knowledge, like understanding features or responses. This specialization leads to a more balanced learning process.
Although this approach might be more expensive to implement, it creates a stronger and less biased model overall.
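
One simple way to combine several teachers is to average their softened output distributions and train the student to match the average. The sketch below assumes that response-based setup; the logits, weights, temperature, and function names are all made up for illustration, and feature-based variants would look different.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax, shifted for numerical stability."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def average_teachers(teacher_logits, weights, temperature=2.0):
    """Weighted average of the teachers' softened distributions."""
    probs = [softmax(logits, temperature) for logits in teacher_logits]
    n_classes = len(probs[0])
    return [sum(w * p[i] for w, p in zip(weights, probs))
            for i in range(n_classes)]

def distillation_loss(student_logits, target_probs, temperature=2.0):
    """KL divergence from the teacher target to the student distribution."""
    student = softmax(student_logits, temperature)
    return sum(t * math.log(t / s)
               for t, s in zip(target_probs, student) if t > 0)

# Two hypothetical teachers that disagree on a 3-class example.
teachers = [[4.0, 1.0, 0.0], [1.0, 3.0, 0.0]]
target = average_teachers(teachers, weights=[0.5, 0.5])

# An untrained student with flat logits.
loss = distillation_loss([0.0, 0.0, 0.0], target)
```

The extra cost mentioned above shows up here as one forward pass per teacher for every training example.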

OpenAI isn’t Exciting Any Longer

Sector 6 | The Newsletter of AIM • 39 implied HN points • 27 Jun 23

🕹 Technology Machine Learning

OpenAI is losing talented employees to Google, indicating a shift in the competitive landscape of AI.
Some former OpenAI staff are unhappy with leadership, feeling that the company's vision is too focused on ChatGPT.
There are concerns about the lack of direction at OpenAI, with rumors about the CEO's understanding of the business being superficial.

The Anatomy of the Least Squares Method, Part Four

The Palindrome • 5 implied HN points • 17 Nov 25

🕹 Technology Machine Learning

You can use the least-squares method to understand and analyze regression models well. It's a handy tool for data scientists.
Large language models like GPT-2 aren't as complex as they seem. A basic understanding of math can help you learn how they work.
Using Python to model LLMs allows you to see how the math applies in real time. Following along with code can really boost your learning.

Catechizing the Bots, Part 2: Reinforcement Learning and Fine-Tuning With RLHF

jonstokes.com • 206 implied HN points • 10 Jun 23

🕹 Technology Machine Learning

Reinforcement learning is a technique that helps models learn over time from rewards and penalties ("pleasure and pain") in their environment.
Human feedback plays a crucial role in fine-tuning language models by providing ratings that indicate how a model's output impacts users' feelings.
To train models effectively, a preference model can be used to emulate human responses and provide feedback without the need for extensive human involvement.

Edge 364: About COSP and USP: Two New LLM Reasoning Methods Built by Google Research

TheSequence • 133 implied HN points • 25 Jan 24

🕹 Technology Machine Learning

Two new LLM reasoning methods, COSP and USP, have been developed by Google Research to enhance common sense reasoning capabilities in language models.
Prompt generation is crucial for LLM-based applications, and techniques like few-shot setup have reduced the need for large amounts of data to fine-tune models.
Models with robust zero-shot performance can eliminate the need for manual prompt generation, but may produce weaker results because they operate without specific guidance.

Could language models change language?

The Counterfactual • 119 implied HN points • 22 Jul 22

🕹 Technology Machine Learning

Language is shaped by how we use it, and machine learning models might influence our language by suggesting words or phrases. Over time, these suggestions could change the way we communicate.
The widespread use of predictive text and language models could either slow down language change by promoting similar expressions, or lead to new and unexpected language innovations.
We could see personalized language models that adapt to individual users, potentially changing how we write and understand language, and reducing the need for clarity in communication.

The CHAT Stack, GPT-4, And The Near-Term Future Of Software

jonstokes.com • 237 implied HN points • 15 Mar 23

🕹 Technology Machine Learning

Developers will build apps on top of ChatGPT and similar models to create interactive and knowledgeable AI assistants.
The CHAT stack approach involves Context, History, API, and Token window, and describes how software applications will operate in the near future.
GPT-4 introduces an enlarged token window, improved control surfaces, and better ability to follow human instructions.

Thinking Time

New World Same Humans • 42 implied HN points • 26 Jan 25

🕹 Technology Machine Learning

Giving AI more time to think can greatly improve its performance, just like it helps humans think better. This 'thinking time' could be key in advancing artificial intelligence.
Being busy doesn't always mean you're being productive; it's important to take breaks and allow space for creative thinking. Sometimes the best ideas come when you're not actively working.
To truly innovate, focus on depth and originality instead of just producing a lot of work. It's about finding valuable insights that add to the conversation, rather than just adding to the noise.

AI is Still Hot Stuff

More Than Moore • 186 implied HN points • 03 Aug 23

🕹 Technology Machine Learning

Artificial Intelligence remains a popular and well-funded field.
Tenstorrent secures another $100 million with customers onboard.
The post is exclusively available for paid subscribers.

Quant Letter: June 2025, Week-1

The Parlour • 21 implied HN points • 04 Jun 25

💰 Finance Machine Learning

New methods are being developed to test asset pricing anomalies, showing that different paths on the same dataset can lead to similar outcomes. This means we need to be cautious about our assumptions in finance.
Deep reinforcement learning is being used to improve risk management in life insurance. This method helps in making better decisions about profits and losses related to different risk factors.
Large language models struggle with accuracy in specialized fields due to lack of specific training data. To improve their performance, fine-tuning techniques are essential.

Understanding Google's GPT-Killer- The Revolutionary Pathways Architecture [Storytime Saturdays]

Technology Made Simple • 39 implied HN points • 19 Feb 23

🕹 Technology Machine Learning

Google's Bard is designed to be more versatile than ChatGPT, with a unique model architecture called Pathways.
Google's approach includes training a single model for multiple tasks, working with different modalities like images and text, and using sparse activation to specialize network parts.
The Pathways architecture sets Google apart by enabling their AI models to handle a wide range of tasks, making them cost-effective and versatile.

Unearthing Datasets Preparation for LLM

The Beep • 19 implied HN points • 11 Jan 24

🕹 Technology Machine Learning

Good datasets are really important for training large language models (LLMs). If the data isn't well prepared, the model won't perform well.
To prepare a dataset, you need to gather data, clean it up, and then convert it into a format the model can understand. Each step is crucial.
While training LLMs, it's important to think about issues like data bias and privacy. These issues can affect how well the model works and who it might unfairly impact.

Weekly Top Picks #60

The Algorithmic Bridge • 127 implied HN points • 29 Jan 24

🕹 Technology Machine Learning

Google's Bard chatbot ties with GPT-4 in rankings
AI-generated Taylor Swift deepfakes can have a global impact
MIT study reveals AI job automation is not cost-effective

Google ships it: Gemma open LLMs and Gemini backlash

Democratizing Automation • 118 implied HN points • 22 Feb 24

🕹 Technology Machine Learning

Google released Gemma, an open-weight model, which introduces new standards with 7 billion parameters and has unique architecture choices.
The Gemma model addresses training issues with a unique pretraining annealing method, REINFORCE for fine-tuning, and a high capacity model.
Google faced backlash for image generations from its Gemini series, highlighting the complexity in ensuring multimodal RLHF and safety fine-tuning in AI models.

🏆 Context Engineering > Prompts: What I Learned Building AI Products That Actually Work

The Product Channel By Sid Saladi • 16 implied HN points • 20 Jul 25

🕹 Technology Machine Learning

Context engineering is key for making AI products work well. It's about providing the right information to the AI so it can solve problems effectively.
The four important steps in context engineering are: writing for memory, selecting relevant info, compressing data to fit limits, and isolating different contexts.
Using context engineering helps improve how AI understands tasks and delivers better results by managing the information it uses.

Key Components to Understand the LLM Models

The Beep • 19 implied HN points • 07 Jan 24

🕹 Technology Machine Learning

Large language models (LLMs) like Llama 2 and GPT-3 use transformer architecture to process and generate text. This helps them understand and predict words based on previous context.
Emergent abilities in LLMs allow them to learn new tasks with just a few examples. This means they can adapt quickly without needing extensive training.
Techniques like Sliding Window Attention help LLMs manage long texts more efficiently by restricting each token's attention to a fixed-size window of nearby tokens, making it easier to focus on relevant information.
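
A small sketch of the attention mask behind Sliding Window Attention may help. It assumes the causal variant used in decoder-style models, where each token sees itself and a fixed number of preceding tokens; the function name and the sizes are chosen for this example only.

```python
def sliding_window_mask(seq_len, window):
    """mask[i][j] is True when token i may attend to token j."""
    return [[(j <= i) and (i - j < window) for j in range(seq_len)]
            for i in range(seq_len)]

mask = sliding_window_mask(seq_len=6, window=3)

# Print one row per query token: "x" = may attend, "." = masked out.
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))
```

Each row has at most `window` allowed positions, so the attention cost grows with sequence length times window size rather than with the square of the sequence length.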

Active Prompting with Chain-of-Thought for Large Language Models

Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 19 implied HN points • 05 Jan 24

🕹 Technology Machine Learning

AI can help improve language models by using a four-step process: estimating uncertainty, selecting uncertain questions, annotating them, and making final inferences. This helps ensure better answers.
Using human annotations along with AI makes the training data clearer and reduces confusion. It allows us to focus on the most important information for the models.
Companies can benefit from this approach by streamlining how they handle data. It promotes a more organized way of discovering, designing, and developing data.
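
The first two steps of that process, estimating uncertainty and selecting uncertain questions, can be sketched as follows. This assumes uncertainty is measured as disagreement among several sampled answers to the same question, which is one possible metric; the sampled answers are hard-coded stand-ins for real model outputs, and the function names are hypothetical.

```python
def disagreement(answers):
    """Fraction of distinct answers among the samples for one question."""
    return len(set(answers)) / len(answers)

def select_most_uncertain(sampled, n):
    """Return the n question ids whose sampled answers disagree the most."""
    ranked = sorted(sampled, key=lambda q: disagreement(sampled[q]),
                    reverse=True)
    return ranked[:n]

# Hypothetical answers from sampling the model four times per question.
sampled = {
    "q1": ["12", "12", "12", "12"],  # model is consistent
    "q2": ["7", "9", "7", "8"],      # some disagreement
    "q3": ["3", "4", "5", "6"],      # every sample differs
}

# These questions would go to human annotators for worked examples.
to_annotate = select_most_uncertain(sampled, n=2)
```

The annotated examples for the selected questions are then used as the chain-of-thought exemplars at inference time, so human effort goes where the model is least sure.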

DataFrame

davidj.substack • 35 implied HN points • 20 Feb 25

🕹 Technology Machine Learning

Polars Cloud allows for scaling across multiple machines, making it easier to handle large datasets than using just a single machine. This helps in processing data faster and more efficiently.
Polars is simpler to use compared to Pandas and often performs better, especially when transforming data for machine learning tasks. It supports familiar methods that many users already know.
Unlike SQL, which runs well on cloud services, using Pandas and R for large-scale transformations has been challenging. The new Polars Cloud aims to bridge this gap, providing more scalable solutions.

Shift-Left Analytics

SUP! Hubert’s Substack • 50 implied HN points • 22 Nov 24

🕹 Technology Machine Learning

Shift-left analytics means doing analysis early in the data process. This helps in getting insights faster and making quick decisions.
It focuses on checking data quality right away, so only reliable data is used. This leads to more accurate insights and avoids problems caused by bad data.
Collaboration between teams is encouraged in this approach. By working together from the start, everyone can ensure their analyses are useful and aligned with business goals.

Looking back on AI in 2024

Generating Conversation • 46 implied HN points • 19 Dec 24

🕹 Technology Machine Learning

AI companies need to show clear value to succeed. This means saving money or making profits, not just improving productivity.
Building customer trust is key for AI products. Letting customers test and experience the product firsthand is often more effective than complicated evaluation tools.
User experience with AI tools is really important. Good AI needs to be easy and enjoyable to use, which is a challenge that still needs solving.

ChatGPT is Lanley, not Spock (The Era of Machine Learning 1)

From the New World • 204 implied HN points • 02 May 23

🕹 Technology Machine Learning

Large language models exhibit more empathetic behavior than rationalist behavior.
AI can mimic human empathy and adapt to various social circumstances.
AI assistance, like ChatGPT, can improve productivity and customer sentiment in various industries.

How to Create Good Documentation in Software Engineering and Tech[Technique Tuesdays]

Technology Made Simple • 59 implied HN points • 19 Oct 22

🕹 Technology Machine Learning

Good documentation in software engineering is crucial as it provides clarity to the team about goals and work done, enhancing productivity.
Key pillars of good documentation include having a vision for the company and products, outlining resource/situational constraints, detailing data sources and processing, tracking projects in progress, sharing actual code, and establishing ownership.
Benefits of good documentation in tech include aligning teams, clarifying vision and plans, reducing onboarding time, and promoting asynchronicity in an increasingly remote working environment.

GroupBy #16: Uber's Anomaly Detection & Alerting System, many layers of data lineage

VuTrinh. • 19 implied HN points • 02 Jan 24

🕹 Technology Machine Learning

Uber has developed an anomaly detection system called uVitals, which helps identify issues before they become major problems. It analyzes data patterns to catch anomalies early.
Data modeling is essential for creating structured databases that allow for better analysis and comparisons. It's important for data projects to have clear designs.
As the field of data engineering evolves, new roadmaps and resources are emerging to guide professionals in developing necessary skills. Staying updated can help engineers advance their careers.

And AI took that personally

networked • 215 implied HN points • 22 Mar 23

🕹 Technology Machine Learning

Artificial intelligence is the revolutionary technology that crypto tried and failed to be.
Many of today's popular AI products are effectively loss leaders, not fully-fledged solutions.
AI will often be mindlessly stapled onto legacy formats, creating unoriginal implementations.

How Meta’s (Facebook) challenge to GPT-3 will affect you [Storytime Saturdays]

Technology Made Simple • 79 implied HN points • 16 Jul 22

🕹 Technology Machine Learning

Meta (Facebook) released, for free, a language model that challenges GPT-3, impacting the AI industry.
This move challenges the traditional big tech practices and could lead to more open-source contributions.
The competition among big tech companies for dominance can benefit consumers and drive innovation in the tech industry.

Scalable Embedding based retrieval for target side value

Recommender systems • 23 implied HN points • 17 May 25

🕹 Technology Machine Learning

Scalability is key for embedding-based recommendation systems, especially when dealing with billions of users. Finding effective ways to limit the search can help manage this challenge.
It’s important to deliver value not just to viewers but also to the recommended targets, as this can improve user retention. Balancing recommendations for both sides can create a better experience.
Using advanced algorithms can help ensure viewers don’t get overwhelmed with too many recommendations while also making sure that every target gets the attention they need. This balance is crucial for effective recommendations.

AI Roundup 097: Model Mayhem

Artificial Ignorance • 46 implied HN points • 13 Dec 24

🕹 Technology Machine Learning

Google has launched new AI models such as Gemini 2.0, which can create text, images, and audio quickly. They also introduced tools to summarize video content and help users with web tasks.
OpenAI released several features, including a text-to-video model named Sora for paying users. They also improved ChatGPT's digital editing tool and added new voice capabilities for video interactions.
Meta and other companies are also advancing in AI with new models that are cheaper yet still effective and tools for watermarking AI-generated videos, showing that competition in AI is heating up.

Why reward models are key for alignment

Democratizing Automation • 110 implied HN points • 14 Feb 24

🕹 Technology Machine Learning

Reward models provide a unique way to assess language models without relying on traditional prompting and computation limits.
Constructing comparisons with reward models helps identify biases and viewpoints, aiding in understanding language model representations.
Generative reward models offer a simple way to classify preferences in tasks like LLM evaluation, providing clarity and performance benefits in the RL setting.

Edge 377: LLM Reasoning with Reinforced Fine-Tuning

TheSequence • 105 implied HN points • 12 Mar 24

🕹 Technology Machine Learning

Reinforced Fine-Tuning (ReFT) is a method for enhancing the reasoning of large language models (LLMs).
ByteDance introduced the concept of ReFT to address limitations in supervised fine-tuning approaches.
Guardrails AI is a comprehensive framework designed to guide the behavior of LLM applications.