The hottest Machine Learning Substack posts right now

And their main takeaways

Refactoring BabyAGI - Code Quality and LLMs

Laszlo’s Newsletter • 32 implied HN points • 28 Apr 23

🕹 Technology Machine Learning

Refactoring an early implementation of autonomous agents based on LLMs with clean architecture principles
When analyzing a legacy codebase, focus on finding the main entry point and understanding variable usage
Consider moving external dependencies into their own classes and introducing a 'Task' class to improve code structure

Unveiling Backdoors, Optimized LLMs, and Spurious Patterns in AI

HackerPulse Dispatch • 8 implied HN points • 15 Nov 24

🕹 Technology Machine Learning

Backdoors can be secretly added to machine learning models. These backdoors let bad actors change how the model makes decisions without being noticed.
Large Language Models (LLMs) are helpful for tuning model settings to make them work better. They can suggest and adjust configurations based on past performance.
Understanding spurious patterns in data is important. These patterns can confuse models and lead to mistakes, which is crucial for developing responsible AI systems.

Infinite Barnacle

Cybernetic Forests • 19 implied HN points • 13 Feb 22

🕹 Technology Machine Learning

Memories and data are distinct - photographs capture data, while memories hold fragments of experiences.
Technology can transform memories into new data - a machine can create new pictures from a collection of images.
Generative images challenge the concept of memory - creating variations that may not accurately reflect the original experience.

Data Science Weekly - Issue 451

Data Science Weekly Newsletter • 19 implied HN points • 14 Jul 22

🕹 Technology Machine Learning

Many people believe that data scientists today often do tasks very similar to data analysts. They're not just creating charts; there's a concern that their work lacks deeper statistical analysis.
There's a lively debate about what it means to be a data scientist. While some argue the role has become too diluted, others believe that practical application in companies differs from academic definitions.
Data science is evolving, with new techniques and applications emerging, like the importance of understanding datasets and using principles from various fields to improve intelligence in AI.

AIM turns 10!

Sector 6 | The Newsletter of AIM • 19 implied HN points • 23 May 22

🕹 Technology Machine Learning

AIM has been around for ten years, showing significant growth in analytics and technology. It's impressive how much the industry has evolved in that time.
The rise of data science and AI/ML has changed the business landscape. People are now recognizing the importance of these fields more than ever.
One major success of AIM is its role in establishing analytics as a key tech stack in the industry. They have helped people understand the value of data in decision-making.

Get a weekly roundup of the best Substack posts, by hacker news affinity:

Data Science Weekly - Issue 450

Data Science Weekly Newsletter • 19 implied HN points • 07 Jul 22

🕹 Technology Machine Learning

AI forecasting contests help predict future progress and improve forecasting skills. It’s important to evaluate predictions against actual outcomes to see how accurate forecasters are.
Analytics engineering has become a popular job choice, shifting from being less desired to highly sought after. This change reflects the growing need for skilled professionals in data analytics.
High-quality machine translation is now possible for low-resource languages through models like NLLB-200. This will make information more accessible to speakers of these languages worldwide.

Faster Fine-Tuning, Smarter Agents, and Real-World Vision-Language Wins

ppdispatch • 2 implied HN points • 08 Aug 25

🕹 Technology Machine Learning

A new method called Model Stock can fine-tune AI models using just two models instead of many. This saves resources and still performs really well on tasks.
OpenMed NER offers high performance for biomedical tasks by using smart training without needing to use a lot of data or power, making it fast and eco-friendly.
The SEAgent is a computer-use agent that learns on its own through experience, which helps it improve without needing extra training data, making software interaction smoother.

Quant Letter: February 2024, Week-2

The Parlour • 17 implied HN points • 14 Feb 24

🕹 Technology Machine Learning

Using Autoencoder architectures in Statistical Arbitrage can simplify strategy development and improve returns compared to traditional methods.
A new method, Causal-NECOVaR, provides reliable risk predictions for financial risk analysis regardless of market shocks and systemic changes.
The Merton investment-consumption problem is expanded to incorporate transaction costs and stochastic differential utility in Portfolio Optimization for a better understanding of parameter combinations.

Data Science Weekly - Issue 449

Data Science Weekly Newsletter • 19 implied HN points • 30 Jun 22

🕹 Technology Machine Learning

Machine learning exercises can deepen your understanding of concepts like linear algebra and optimization. Practicing these can help you think critically about model building.
Ethical AI development toolkits play a crucial role in shaping how companies approach ethics in technology. It's important to recognize the gaps between what these toolkits suggest and the real work involved in implementing ethical practices.
Recent studies on adaptive optimizers show that models can go through phases of overfitting before suddenly generalizing very well. Understanding this 'grokking' phenomenon can help refine training processes for better performance.

Am I still relevant as a programmer now that there's AI?

Andrew's Substack • 33 HN points • 14 Mar 23

🕹 Technology Machine Learning

Computers can now do impressive things with AI like answer questions and generate code or art.
Concerns exist over the relevance of human programmers in the future due to advancements in AI technology.
To remain pertinent, focus on developing human skills, learn about AI, and stay updated on software development practices involving AI.

How AI generates text, visually explained 📝

Year 2049 • 6 implied HN points • 18 Jan 25

🕹 Technology Machine Learning

AI generates text by analyzing patterns in data, similar to how a DJ mixes music. This means it learns from examples to create new content.
Understanding how AI learns helps us see its strengths and weaknesses, like how it can sometimes be biased.
The next episode will focus on how AI creates images, which is another interesting aspect of how AI works.

Data Science Weekly - Issue 448

Data Science Weekly Newsletter • 19 implied HN points • 23 Jun 22

🕹 Technology Machine Learning

Machine learning can help the IRS process a huge amount of tax data more efficiently, improving enforcement actions on tax compliance.
Denoising Diffusion Probabilistic Models are showing great success in generating images and audio, making them popular in creative AI applications like DALL-E 2.
Training and developing skills in SQL can greatly enhance your data handling abilities, leading to better opportunities in data analysis and engineering.

Fine-tune an open source LLM from Postgres data in 5 minutes

Database Engineering by Sort • 15 implied HN points • 27 Mar 24

🕹 Technology Machine Learning

Fine-tuning an open source language model is now super easy and can be done in just five minutes. This makes it accessible for more people to customize LLMs for their needs.
You can use data from a Postgres database to create a product catalog that the fine-tuned LLM can answer questions about. This can help with tasks like customer support and product information.
With tools like Together.ai, you can quickly set up fine-tuning and chat with your customized LLM. It's great for building chatbots and enhancing user interactions.

AGI Doom and the Drake equation

I Am Not a Robot • 31 HN points • 07 Apr 23

🕹 Technology Machine Learning

The AGI doom scenario requires multiple rare conditions to happen simultaneously.
Current AI technology limitations make the worst case scenario less likely.
It's important to understand the technology and gaps to assess the likelihood of AGI doom.

Machine Teaching in a World of Vibeworkers

Modern Data Democracy • 3 implied HN points • 29 May 25

🕹 Technology Machine Learning

AI can either make users feel like they are just passengers in a car or empower them to learn and grow. We should think about how we design user experiences with this in mind.
Instead of just using technology to make tasks easier, we should focus on teaching users and helping them gain knowledge and understanding.
Designers have a responsibility to create AI tools that elevate people, instead of just making them dependent. Let's aim for user growth, not just convenience.

Data Science Weekly - Issue 447

Data Science Weekly Newsletter • 19 implied HN points • 16 Jun 22

🕹 Technology Machine Learning

Natural language processing is getting better, but it's important to remember that it's just imitating consciousness, not actually having it.
Scaling AI models may improve performance, but there are limits due to the quality of the data they learn from.
Emerging techniques like optical neural networks are being developed to speed up image classification significantly.

AI/ML courses galore!

Sector 6 | The Newsletter of AIM • 19 implied HN points • 25 Apr 22

🕹 Technology Machine Learning

Andrew Ng has updated his popular machine learning course, which is launching in June 2022. It's created with Stanford Online and DeepLearning.ai.
The original machine learning course by Ng has seen about 5 million enrollments since it started on Coursera in 2012.
There are many AI/ML courses available, showing a growing interest in these technologies.

Can LLMs earn $1M freelancing?

HackerPulse Dispatch • 5 implied HN points • 21 Feb 25

🕹 Technology Machine Learning

AI models are being tested to see if they can earn a million dollars through freelancing. But it turns out many of them struggle with real-world tasks.
A new video model can create high-quality videos from text descriptions. It uses advanced techniques to improve video quality and generation.
Small AI models can perform better when they are trained on easier tasks instead of trying to learn from more complex ones.

Papers I've read this week: vision language models

Artificial Fintelligence • 8 implied HN points • 28 Oct 24

🕹 Technology Machine Learning

Vision language models (VLMs) are simplifying how we extract text from images. Unlike older software, modern VLMs make this process much easier and faster.
There are several ways to combine visual and text data in VLMs. Most recent models prefer a straightforward approach of merging image features with text instead of using complex methods.
Training a VLM involves using a good vision encoder and a pretrained language model. This combination seems to work well without any major drawbacks.

Data Science Weekly - Issue 446

Data Science Weekly Newsletter • 19 implied HN points • 09 Jun 22

🕹 Technology Machine Learning

The history of AI in literature shows how machines have been involved in writing since the 19th century. It's fascinating to see how far technology has come in helping with creative tasks.
Jupyter Notebooks are versatile tools for data scientists, used for more than just coding. They can creatively combine text, visuals, and code to make data exploration easier.
Using machine learning with small data sets can be tricky, but there are effective techniques to make it work. Smaller datasets can still yield valuable insights with the right approaches.

Learn from Experiences of Experts - Running Trustworthy A/B Test

Machine Learning Diaries • 7 implied HN points • 27 Nov 24

🕹 Technology Machine Learning

A/B tests are important for businesses because they help test ideas and make informed decisions. Many companies have seen significant revenue increases by using A/B tests.
It's crucial to define the right performance metrics for A/B tests to ensure long-term success. Focus on metrics that show real customer engagement, not just short-term results.
Pay close attention to statistical principles when running A/B tests. Misunderstanding p-values and making hasty conclusions can lead to incorrect results and poor decisions.

Quant Letter: October 2023, Week 2

The Parlour • 21 implied HN points • 12 Oct 23

💰 Finance Machine Learning

The post is about a quantitative finance newsletter for October 2023, Week 2.
A recently published thesis discusses Deep RL for Portfolio Allocation, showing the potential of deep reinforcement learning in enhancing portfolio allocation methods.
Readers can subscribe to Machine Learning & Quant Finance for more content and a 7-day free trial.

Data Science Weekly - Issue 445

Data Science Weekly Newsletter • 19 implied HN points • 02 Jun 22

🕹 Technology Machine Learning

There's a new set of best practices for safely using large language models, aiming to help the industry work together responsibly.
We are using less agricultural land now, even though we're producing more food, which is good for both us and nature.
Qualitative research is important in AI. It helps us ask the right questions and understand how AI affects society beyond just numbers.

How the field of "AI" got like this

Apperceptive (moved to buttondown) • 20 implied HN points • 02 Nov 23

🕹 Technology Machine Learning

The field of AI can be hostile to individuals who are not white men, which hinders progress and innovation.
The history of AI showcases past failures and the subsequent shift towards more practical, engineering-focused approaches like machine learning.
Success in the AI field is heavily reliant on performance advancements on known benchmarks, emphasizing practical engineering solutions.

Data Science Weekly - Issue 444

Data Science Weekly Newsletter • 19 implied HN points • 26 May 22

🕹 Technology Machine Learning

Operationalizing machine learning models is important. There are key differences between how ML is used in research and in real-world applications, and understanding these can improve system design.
DALL-E and similar AI models show that composition in AI can produce unexpected and enjoyable results. This is a fun way to think about how AI works with semantics, even if it doesn't always make sense.
Data can sometimes lead to worse decisions. It's essential to think critically about how we use data rather than just relying on it blindly.

Data Science Weekly - Issue 443

Data Science Weekly Newsletter • 19 implied HN points • 19 May 22

🕹 Technology Machine Learning

Data scientists should improve their software development skills by learning about project structure, testing, reproducibility, and version control.
AI-generated artwork may not be considered true art because it lacks the communication and consciousness involved in traditional art creation.
Using optimized tools like DuckDB can enhance the data processing experience by making it faster and easier to work with large datasets.

Vector Database: History and Basic Concept

The Beep • 2 HN points • 08 Feb 24

🕹 Technology Machine Learning

Vector databases help store and manage embedding vectors effectively. This is important for improving how AI finds and retrieves information.
The concept of vector databases has been around for a long time, dating back to the 1990s. They have evolved from early uses in semantic models to current advanced techniques.
Various algorithms have been developed to convert digital items into vectors and to streamline searching within these vectors. This makes it easier for AI to understand and process data.

⚡ One-step Diffusion & 1 Million FPS Simulations

ppdispatch • 8 implied HN points • 11 Oct 24

🕹 Technology Machine Learning

A new technology called Differential Transformer helps improve language understanding by reducing noise and focusing on the important context, making it better for tasks that need long-term memory.
GPUDrive is an advanced driving simulator that works really fast, allowing training of AI agents in complex driving situations, speeding up their learning process significantly.
One-step Diffusion is a new method for creating images quickly without losing quality, making it much faster than traditional methods while still producing great results.

Data Science Weekly - Issue 442

Data Science Weekly Newsletter • 19 implied HN points • 12 May 22

🕹 Technology Machine Learning

Splitting data into training, testing, and validation sets is crucial for building effective machine learning models. It helps ensure that we evaluate our models properly.
Bandit algorithms can improve recommender systems by balancing exploration of new items and exploitation of known user preferences. This way, they can discover hidden gems instead of just repeating popular choices.
Protecting machine learning models and their intellectual property is important, and best practices are still evolving. It's useful to stay updated on strategies to safeguard your work in this fast-changing field.

Prompt-Based Feature Engineering Part 1: Generative AI Generates Data

nick’s datastack • 1 HN point • 24 Apr 24

🕹 Technology Machine Learning

Generative AI can generate data, impacting workflows and pipelines significantly.
Using LLMs for prompt-based feature engineering can save time and effort compared to traditional methods like manual data searching and merging.
While LLMs in data pipelines may feel magical, it's important to be cautious of potential inaccuracies due to the probabilistic nature of AI outputs.

Data Science Weekly - Issue 440

Data Science Weekly Newsletter • 19 implied HN points • 05 May 22

🕹 Technology Machine Learning

Meta AI is sharing a big language model, OPT-175B, to help others learn about new technology. This model has 175 billion parameters and is based on publicly available data.
Handling harmful text in data science is a tricky issue. Researchers are looking for ways to address this challenge while still making progress in natural language processing.
There are many resources and courses available for learning data science and machine learning. These include guides for using Python and R, plus access to various data visualization tools.

Writing my master's thesis in public

Santiago and the ML Models • 4 HN points • 17 Mar 23

🚌 Education Machine Learning

Writing a master's thesis can be overwhelming, but sharing the process publicly can make it less lonely.
Choosing the right thesis topic and finding a supportive supervisor are crucial steps.
For a thesis project, replicating testing frameworks, analyzing datasets, and implementing new methods are significant tasks.

Can LLaMA approve credit card applications? (Part 1)

followfox.ai’s Newsletter • 4 HN points • 03 May 23

🕹 Technology Machine Learning

LLaMA models of size 13B or above might be better than random chance at evaluating credit card approvals.
Smaller LLaMA models (7B) didn't show improvement over random chance.
Instruction-finetuning didn't significantly enhance model performance.

Auto-Optimized Prompts, AI Text Detection, and Parametric RAG

HackerPulse Dispatch • 5 implied HN points • 31 Jan 25

🕹 Technology Machine Learning

LLM-AutoDiff can make AI workflows more efficient by automatically optimizing prompts, leading to better performance without the need for manual work.
Racing for superintelligence might cause more problems than it solves, making cooperation between nations a better option.
Combining reinforcement learning with transformers can create AI that adapts and solves new problems effectively over time.

Data Science Weekly - Issue 440

Data Science Weekly Newsletter • 19 implied HN points • 28 Apr 22

🕹 Technology Machine Learning

AI is getting smarter, but we need a better way to understand how it makes decisions. A common language with AI could help us communicate our questions and concerns.
Creating more synthetic data can help when there's not enough real data for training models. Techniques like data augmentation can help make our data better.
Making data more accessible can solve big problems for society. If we can use available data properly, it can lead to more health and happiness for everyone.

[In case you missed it] Data Science Weekly - Issue 439

Data Science Weekly Newsletter • 19 implied HN points • 24 Apr 22

🕹 Technology Machine Learning

Building a recommendation system is challenging. It requires careful planning and execution to serve users quickly and efficiently.
Understanding different probability distributions is essential in data science. They help us make better predictions and understand the variability in our data.
Contrastive learning is an important method for training machine learning models. Recent advances in this area can improve how we represent data and solve complex problems.

Faster LLMs, Safer Chains of Thought, and Image Tokenization Reinvented

ppdispatch • 2 implied HN points • 18 Jul 25

🕹 Technology Machine Learning

There's a new book that helps people understand deep learning in a clear way. It covers important topics like neural networks and how they work.
A new technique called Chain-of-Thought Monitorability may help keep AI safe by watching how AI reasons with language. But it’s still seen as a bit weak and needs more work.
Researchers found that recent improvements in AI reasoning might not be genuine. They suggest that better ways to check AI's performance are needed to ensure it really understands and isn't just memorizing data.

Data Science Weekly - Issue 439

Data Science Weekly Newsletter • 19 implied HN points • 21 Apr 22

🕹 Technology Machine Learning

Building recommendation systems requires careful planning and quick processing to handle live requests effectively. It's not just about creating a model but also about deploying it at scale.
Contrastive learning is a powerful technique in machine learning that helps in improving model performance. New insights in this area can lead to better model training and application.
Understanding different probability distributions is crucial in data science. It helps in modeling data accurately and predicting outcomes better.

Data Science Weekly - Issue 438

Data Science Weekly Newsletter • 19 implied HN points • 14 Apr 22

🕹 Technology Machine Learning

The Modern Data Stack is becoming crucial for handling data, with many tools available to improve the way businesses work with data. It helps users understand how to start using these tools effectively.
DeepMind's AlphaFold is revolutionizing biology by accurately predicting protein shapes. This technology is changing how researchers approach biological problems.
There are better ways to visualize SQL joins than using Venn diagrams. New methods like the checkered flag diagram can make understanding joins easier and clearer.

Axial Discovery - Machine learning and DNA-encoded libraries

Axial • 29 implied HN points • 13 Feb 23

🔬 Science Machine Learning

DNA-encoded libraries (DEL) use unique DNA barcodes to screen chemical compounds efficiently.
Machine learning helps map out structure-activity relationships in DELs for virtual screening.
Challenges in DELs include improving chemical diversity, developing better filters for virtual screening, and expanding screening criteria for more accurate models.