The hottest Data science Substack posts right now

And their main takeaways

Chain-Of-Knowledge Prompting

Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 19 implied HN points • 22 Nov 23

🕹 Technology Data science

Chain-Of-Knowledge (CoK) prompting is a useful technique for complex reasoning tasks. It helps make AI responses more accurate by using structured facts.
Creating effective prompts using CoK requires careful construction of evidence and may involve human input. This is important for ensuring the quality and reliability of the information AI generates.
The CoK approach aims to reduce errors or 'hallucinations' in AI responses. It offers a more transparent way to build prompts and enhances the overall reasoning ability of AI systems.

Why Talking Models are not going to take your jobs [Math Mondays]

Technology Made Simple • 39 implied HN points • 29 Nov 22

🕹 Technology Data science

Models processing inputs use vectors to represent features, not replacing people
Comparing similarity between data points helps models generate answers efficiently
Big models have limitations in working with new inputs and face engineering challenges at scale

The Sequence Radar #486 : The Amazing AlphaGeometry2 Now Achieved Gold Medalist in Math Olympiads

TheSequence • 28 implied HN points • 09 Feb 25

🕹 Technology Data science

AlphaGeometry2 has become a top performer in solving geometry problems, even surpassing human math Olympiad gold medalists. It can handle tough geometry concepts and has a better understanding of different math problems compared to its predecessor.
The latest improvements in AlphaGeometry2 include an enhanced symbolic engine and a wider range of mathematical language features. This allows it to solve more complex geometry problems efficiently.
AI is getting closer to matching or even exceeding human capabilities in competitive mathematics. This success in geometry could lead to similar advancements in other scientific fields like physics and chemistry.

The Long Game 169: AI Investment Thesis, Peter Attia, Earth AI, Science-Based Lifting

The Long Game by Mehdi Yacoubi • 3 implied HN points • 19 Nov 25

🕹 Technology Data science

Longevity works best when you focus on basics—build muscle, move often, eat and sleep reasonably well—and avoid turning health into constant self-surveillance that makes you feel fragile.
The AI app market is unstable because foundational model providers can rapidly absorb app features, so most startups either need to generate quick cash, aim to be acquired, or specialize in niches with unique atom-level data, hardware, or heavy enterprise integration.
Real competitive advantage comes from controlling the full loop: huge, cleaned datasets, continent-scale multimodal models, and cheap execution that ties AI to real-world testing, and founders should build from conviction rather than chasing what’s currently fundable.

AI Week That Was

Sector 6 | The Newsletter of AIM • 39 implied HN points • 19 Mar 23

🕹 Technology Data science

Alpaca 7B is a new AI model introduced by Stanford that performs well, similar to OpenAI's models, but is smaller and cheaper to use.
The AI landscape is buzzing with exciting developments and new models, making it an interesting time for AI enthusiasts.
The week highlights a range of impressive AI technologies, signaling that there's much more innovation to come in this field.

Get a weekly roundup of the best Substack posts, by hacker news affinity:

Attention Explained: When to use Self, Graph, and Target-Aware Attention

Recommender systems • 16 implied HN points • 25 May 25

🕹 Technology Data science

Self-attention helps summarize a list of information, making it easier to find what's most relevant, like recent videos you watched.
Graph attention looks at how items in a network relate to each other, like understanding social connections in a network.
Target-aware attention checks how relevant certain items are based on your past choices or queries, helping improve recommendations.

How Should Large Language Models Be Evaluated?

Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 19 implied HN points • 06 Nov 23

🕹 Technology Data science

When evaluating large language models (LLMs), it's important to define what you're trying to achieve. Know the problems you're solving so you can measure success and failure.
Choosing the right data is crucial for evaluating LLMs. You'll need to think about what data to use and how it will be delivered in your application.
The process of evaluation can be automated or involve human input. Deciding how to implement this process is key to building effective LLM applications.

Predicting earthquakes

The Works in Progress Newsletter • 11 implied HN points • 16 Jul 25

🔬 Science Data science

Scientists estimate that a major earthquake can occur in the American West Coast, causing massive destruction and loss of life. Planning for these events is crucial, given the high number of residents in these areas today.
Funding for earthquake prediction is very limited, focusing mostly on understanding where earthquakes might happen rather than when. There is a big need for more resources to develop better warning systems.
Using advanced technology and data sharing can significantly improve earthquake prediction. A centralized lab focusing on research and collaboration could potentially provide better warning times and save lives.

Reasoning Models, visually explained 🤔

Year 2049 • 11 implied HN points • 17 Jul 25

🕹 Technology Data science

Reasoning models take time to think through problems step-by-step, unlike standard LLMs that give quick answers. This helps them break down complex questions and find better solutions.
While reasoning models can work better for complex problems, they might fail on simpler ones and can overthink too much. Sometimes, basic LLMs are faster and more accurate.
Choosing the right AI model for your task is important. Not every problem needs a reasoning model, so understanding their strengths and limitations can help set realistic expectations.

OpenAI Deep Research Explains Itself

From the New World • 26 implied HN points • 06 Feb 25

🕹 Technology Data science

AI hardware has evolved significantly, from early specialized chips to powerful GPUs and TPUs. These advancements make training AI models much faster and more efficient.
The design of algorithms, especially with transformers, has greatly improved AI's ability to understand and generate language. These models can now learn complex patterns that were hard to capture before.
Building and maintaining large AI systems requires careful planning and practices. Companies need efficient workflows and monitoring systems to manage data, hardware, and software effectively.

The Most Common Data Science Interview Mistake

inexactscience • 39 implied HN points • 14 Mar 23

🚌 Education Data science

One big mistake in data science interviews is jumping to solutions too quickly. It's important to first understand the problem before trying to solve it.
Asking questions during the interview can show your insight and help you gather essential information. It helps to clarify the business context and what needs to be addressed.
Finding a balance is key. You want to ask enough questions to understand the issue without getting stuck in overthinking. A good candidate knows when to seek clarification and when to respond directly.

July Newsletter

RSS DS+AI Section • 11 implied HN points • 01 Jul 25

🕹 Technology Data science

Data science and AI are constantly evolving, with new research and developments happening every month. It's important to stay updated on these changes.
Ethical considerations like bias and privacy are ongoing challenges in the AI field. Engaging in discussions about these topics is crucial for responsible technology use.
There are many practical applications and resources available for those wanting to enhance their skills in data science and AI. Exploring tutorials and job opportunities can help grow your knowledge and career.

AI, is it Logic or Magic?

The Novice • 19 implied HN points • 26 Oct 23

🕹 Technology Data science

AI is based on statistics and massive data processing, not magic.
AI mimics human-like thought processes through algorithms and machine learning techniques.
Understanding AI involves complex details and processes beyond human perception.

The dbt meta tag

Data Thoughts • 59 implied HN points • 25 Nov 22

🕹 Technology Data science

The dbt meta tag helps document important info about data models. It's a simple way to keep track of data governance like ownership and sensitivity.
Many companies have used the dbt meta tag to enhance their products. Some of these companies have received significant venture capital funding because of these improvements.
Documenting tools and their funding related to the dbt meta tag can inspire others. It shows how small features can lead to big opportunities.

99% of people just get AI wrong...

do clouds feel vertigo? • 39 implied HN points • 25 Mar 23

🕹 Technology Data science

Microsoft claims that GPT-4 shows potential for Artificial General Intelligence, but some critics doubt its transparency and reliability, feeling it's more of a marketing claim than factual science.
Generative AI models can produce creative outputs but shouldn't be judged like traditional knowledge tools. They often generate believable yet false information, showcasing a need for a different evaluation standard.
As AI technology evolves, the cost to create content is decreasing, which raises questions about who will really profit from it and how existing knowledge can be effectively leveraged in this new landscape.

"Algorithmic entombment", explore-exploit trade-offs, and serendipity

The Counterfactual • 59 implied HN points • 04 Oct 22

🕹 Technology Data science

Recommendation systems can help us find new favorites but also risk making our choices repetitive. If we're only shown what we already like, we might miss out on discovering exciting new things.
There's a balance between exploring new options and sticking to what we know. Too much of either can lead to boredom or discomfort, so it’s important to mix both approaches in our choices.
Serendipity, or those happy accidents that lead to great moments, can be lost with strict recommendation systems. Sometimes the best experiences come from unexpected encounters, not just from things we already enjoy.

Updated: Emerging RAG & Prompt Engineering Architectures for LLMs

Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 19 implied HN points • 18 Oct 23

🕹 Technology Data science

Large Language Models (LLMs) rely on both input and output data that are unstructured and conversational. This means they process language in a natural, free-flowing manner.
Fine-tuning LLMs has become less popular because it requires a lot of specific training and can get outdated. Using contextual prompts at the right time is a better way to improve their accuracy.
New tools are emerging that test different LLMs against prompts instead of just tweaking prompts for one LLM. This helps in finding the best model suited for different tasks.

Edge 445: A New Series About Knowledge Distillation

TheSequence • 35 implied HN points • 05 Nov 24

🕹 Technology Data science

Knowledge distillation helps make large AI models smaller and cheaper. This is important for using AI on devices like smartphones.
A key goal of this process is to keep the accuracy of the original model while reducing its size.
The series will include reviews of research papers and discussions on frameworks like Google's Data Commons that support factual knowledge in AI.

Temporal degradation framework and other ideas

Santiago and the ML Models • 19 implied HN points • 05 Jun 23

🔬 Science Data science

The author is working on a Temporal Model Degradation Framework for AI models.
They have implemented an experiment with early results showing model performance degradation over time.
The author plans to conduct a Continuous Retraining Experiment to test if continuous retraining can prevent model degradation.

Premium JC update

The Jolly Contrarian • 19 implied HN points • 14 Aug 23

🚌 Education Data science

Premium JC update includes progress on premiumizing ISDA and Equity Derivatives Definitions
Consolidated anatomy of emissions trading documentation is in the works under ISDA, EFET, and IETA
JC Essays explore themes like form versus substance, system redundancy, and pace layering

The Long Game 170: Genomics, Embryo Selection, Expectations, Taking Yourself Seriously

The Long Game by Mehdi Yacoubi • 2 implied HN points • 04 Dec 25

🕹 Technology Data science

Embryo selection is extremely high-stakes, so companies must have honest marketing and solid science. If you see fake reviews, copied research, or basic methodological errors, be very skeptical and don't trust them with decisions about future children.
Set deliberately low expectations so small improvements feel like wins and bad news feels normal. Controlling your expectations reduces unnecessary suffering and helps you appreciate progress.
Stop waiting for life to happen and take yourself seriously by choosing a direction and acting on it. Real progress comes from responsibility, risk, and doing more than what feels safe.

The Camel Principle: Why Adding Zero is the Most Powerful Trick in Mathematics

The Palindrome • 1 implied HN point • 12 Jan 26

🚌 Education Data science

The camel principle is the idea that you can add zero in clever ways to transform problems, and that tiny trick can unlock big simplifications.
Adding zero is essential because it helps rewrite expressions, simplify derivations, and connect different methods across mathematics and machine learning.
A practical workshop can teach these foundations by building linear regression from scratch, covering vectors, vectorized code, optimization, and gradient descent with notebooks and recordings for practice.

The Birth of Baby Llama

Sector 6 | The Newsletter of AIM • 19 implied HN points • 25 Jul 23

🕹 Technology Data science

Andrej Karpathy worked on a fun project to create a smaller version of the Llama 2 model called Baby Llama. It's designed to run on a single computer.
The Baby Llama can load and use the models released by Meta, making it more accessible for users.
Karpathy shared that the performance is promising, with potential for faster processing speeds on a cloud setup.

How the CIA Writes Python

Luminotes • 28 implied HN points • 15 Dec 24

🕹 Technology Data science

The CIA has a unique Python style guide, focusing on clarity and readability, with special rules for exceptions, globals, and list comprehensions.
They use specific tools like PyCharm for development and have a custom setup for installing Python and managing packages within secure environments.
There are no strict rules governing coding practices; instead, individuals make choices based on their preferences and the limitations of their working conditions.

Bayesian Thinking for Software Engineering [Math Mondays]

Technology Made Simple • 59 implied HN points • 03 May 22

🕹 Technology Data science

Bayes Theorem allows us to update beliefs based on evidence, crucial for software developers making decisions.
Bayesian Thinking is implicit in many decisions we make, and recognizing its importance can prevent fallacies.
Learning Bayesian Thinking involves understanding intuition behind the math, using resources like StatsQuest and 3Blue1Brown.

The AI Supernova

Perspective Agents • 24 implied HN points • 15 Jan 25

🕹 Technology Data science

AI is changing how we work and learn. Jobs will focus more on things like emotional intelligence and problem-solving instead of routine tasks.
There is a big gap between those who understand and use AI effectively and those who don't. This gap can lead to businesses being left behind if they don't adapt.
Whether it's through simulations or understanding people's feelings, human touch will always matter. Genuine moments of connection can outshine machines, even if they seem perfect.

What's a vector database?

Technically • 34 implied HN points • 21 Oct 24

🕹 Technology Data science

A vector database is a special storage for data used in AI. It helps store numbers that represent different types of information like text or images.
To make AI models smarter, they need to use unique data from companies. This helps tailor responses and improve accuracy.
There are ways to enhance AI models with unique data, like fine-tuning them or using a method called Retrieval Augmented Generation (RAG) to include important information in prompts.

Gradient Flow #46: Smarter Language Models; Data Engineering Trends

Gradient Flow • 99 implied HN points • 04 Nov 21

🕹 Technology Data science

Data scientists should transition into social scientists in addition to being computer scientists.
The report presents insights from a global online survey of 372 respondents on data engineering trends and challenges.
Information on improvements in large language models, modernizing data integration, and the importance of data quality is shared in the podcast.

Edge 453: Distillation Across Different Modalities

TheSequence • 28 implied HN points • 03 Dec 24

🕹 Technology Data science

Cross-modal distillation allows one model to teach another model that works with a different type of data. This means you can share knowledge even if the models are processing images, text, or something else entirely.
This method can be really helpful when there's not much paired data available. It helps improve the learning process in situations where gathering data might be difficult.
Hugging Face’s Gradio lets developers create AI applications for the web easily. It's a neat tool that helps bring AI to everyday use in a user-friendly way.

The DeepSeek drama, visually explained 🐳

Year 2049 • 22 implied HN points • 28 Jan 25

🕹 Technology Data science

The actual cost to train DeepSeek R1 is unknown, but it’s likely higher than the reported $5.6 million for its base model, DeepSeek V3.
DeepSeek used a different training method called Reinforcement Learning, which lets the model improve itself based on rewards, unlike OpenAI's supervised learning approach.
DeepSeek R1 is open-source and much cheaper to use for developers and businesses, challenging the idea that expensive hardware is necessary for AI model training.

Gradient Flow #45: Top Places to Work for Data Scientists; Model Serving; Tuning Language Models

Gradient Flow • 99 implied HN points • 14 Oct 21

🕹 Technology Data science

Top Places to Work for Data Scientists offers lists for different career stages
Improving zero-shot performance of language models through instruction tuning
Ray Serve showing 3X serving speed up and becoming popular for model serving

AI Roundup 093: Diminishing returns

Artificial Ignorance • 29 implied HN points • 15 Nov 24

🕹 Technology Data science

Big AI companies are realizing that just making their models bigger doesn't always improve their performance. They're facing challenges because the quality of training data is more important than simply using more computing power.
AI companies need to create new ways to measure performance since the old benchmarks are outdated. This lack of standard testing makes it hard for people to compare how different AI models stack up against each other.
AI-generated art is becoming more popular and accepted in the market. A recent artwork sold for a lot of money, showing that people are starting to appreciate creations made by AI, even though it raises questions about what creativity really means.

Awarding the amazing autosegmentation work from 2024

Vesuvius Challenge • 21 implied HN points • 24 Jan 25

🕹 Technology Data science

Two teams were awarded for their amazing work on automating scroll segmentation. They worked really hard, using only a few hours of human help to get impressive results.
The new methods focus on breaking down the task into smaller parts, like surface prediction and fitting, making it easier and faster to recover lost texts from ancient scrolls.
Even though there are still challenges, the community is excited about the progress and future plans, like getting better at detecting ink on more scrolls.

November Newsletter

RSS DS+AI Section • 29 implied HN points • 01 Nov 24

🕹 Technology Data science

Data science and AI are constantly evolving, with new research and developments being released regularly. It's important to stay updated on these changes to understand their implications.
Ethics, bias, and regulation in AI continue to be hot topics. Discussions around how to handle these challenges are crucial for the responsible use of AI technologies.
There are many practical applications and resources available for those interested in implementing AI. Tips and how-to guides can help individuals and organizations make better use of these technologies.

GPT-4 Was the Biggest Disappointment Ever

Sector 6 | The Newsletter of AIM • 19 implied HN points • 30 Jun 23

🕹 Technology Data science

GPT-4 is seen as disappointing compared to expectations. People hoped for more detailed information, but it was not provided.
OpenAI's decision to keep model specifics secret may have led to letdowns. Transparency could have changed many opinions about its performance.
The head of OpenAI hinted that users should prepare for disappointment, which matched how many felt after experiencing GPT-4.

May Progress Prizes and Updates to Tooling

Vesuvius Challenge • 9 implied HN points • 13 Jun 25

🕹 Technology Data science

The Vesuvius Challenge team is improving their tools for handling scroll data. They're making it easier for people to process large datasets without needing advanced tech skills.
Philip Allgaier made significant updates to the VC3D tool, including fixing memory issues and making it easier to install and use. This will help users have a smoother experience.
New features like freehand drawing and better options for data analysis have been added, which will boost productivity for those working with the VC3D tool.

Newsletter #13: StructGPT

Decoding Coding • 19 implied HN points • 25 May 23

🕹 Technology Data science

StructGPT helps large language models (LLMs) work better with structured data like graphs and databases. It converts this complex data into a simpler format that LLMs can understand.
There are three key tasks that StructGPT can do: answer questions based on knowledge graphs, process data tables, and perform text-to-SQL queries. Each task has its own specific steps.
The method focuses on linearizing raw data so that LLMs can process it more effectively. This allows LLMs to handle a wider variety of tasks more efficiently.

How to get into tech as a behavioral scientist

The Kahneman Bot • 19 implied HN points • 13 Feb 23

💼 Business Data science

To get into tech as a behavioral scientist, consider starting in a junior PM role, transferring internally, working at a startup, or starting your own company.
Before transitioning into tech, make sure you enjoy building software and understand how tech teams work.
Experienced behavioral scientists can enter tech by joining a big tech company as a researcher, rebranding as a data scientist, or joining a tech company that values behavioral science as part of its IP.

Newsletter 21: To keepdims or not to keepdims!

Decoding Coding • 1 HN point • 19 Jul 24

🕹 Technology Data science

Understanding the 'keepdims' parameter in tensor operations is important for getting correct results in PyTorch. If you set 'keepdims' to True, the dimensions are preserved, which helps with broadcasting correctly.
When summing tensors, if 'keepdims' is False, it can lead to incorrect calculations because the tensor's shape changes. This can result in dividing values incorrectly, leading to unexpected outputs.
It's crucial to be careful with tensor shapes and broadcasting rules in machine learning models. Even a small oversight can cause models to produce wrong predictions, so always double-check these details.

Newsletter #12: System Design for Machine Learning - Part II

Decoding Coding • 19 implied HN points • 18 May 23

🕹 Technology Data science

Airbnb uses a special tool called Zipline for feature engineering in their Customer Lifetime Value model, which helps them pick and create over 150 features needed for predictions.
Chicisimo built a recommendation system based on user data, which includes both objective and subjective features, to give personalized fashion advice using their Social Fashion Graph.
Case studies provide valuable lessons in applying frameworks to real-world projects, showing that you need both a good framework and experience from past projects to succeed.