The hottest Data science Substack posts right now

And their main takeaways

The Journey of Vadim Fedotov: From Professional Athlete to Entrepreneur in Data-Driven Health Optimization

The Healthtech Initiative • 0 implied HN points • 13 Dec 24

🏥 Health & Wellness Data science

Vadim Fedotov turned his experience as a basketball player and entrepreneur into a passion for health optimization. He realized that traditional medicine often focuses on treating illness rather than promoting better health.
His company, Bioniq, offers personalized health solutions based on data and user feedback. The goal is to create effective supplements that meet individual needs without unnecessary complexity.
Vadim highlighted the importance of focusing on the first 1,000 customers who believe in your product. These early advocates can be crucial for a startup's success and help build a strong community.

🐿️ The Squirrelmobile - The Insanely Profitabe Tech Newsletter

Squirrel Squadron Substack • 0 implied HN points • 17 Dec 24

🕹 Technology Data science

Graphs can help visualize motion and speed, making concepts like calculus easier to understand. It's fun to relate math to real-life activities, like driving a car.
Machine learning improves by tweaking weights to reduce errors, similar to adjusting software for better performance. It's like steering a computer program to make it better.
To build successful software, focus on small, frequent changes and measure how well they improve things. This method can lead to big wins in product development.

🐿️ Platitudinous - The Insanely Profitable Tech Newsletter

Squirrel Squadron Substack • 0 implied HN points • 17 Dec 24

🕹 Technology Data science

When looking at CVs, it's important to see what candidates did and why it mattered. Focus on real impact instead of fancy buzzwords.
Many candidates use vague phrases that sound good but don't tell you anything meaningful. Look for specific results they achieved and how they benefited customers.
A strong CV should show clear business results, like increasing sales or cutting costs. If it doesn’t do that, it might not be worth considering.

o3 AI model surpasses ARC-AGI benchmark

philsiarri • 0 implied HN points • 26 Dec 24

🕹 Technology Data science

OpenAI's new o3 AI model scored 85% on the ARC-AGI benchmark, which shows it can solve problems like a human. This score is higher than the last best AI score of 55%.
The ARC-AGI test checks how well an AI can handle new challenges using little information, which is important for general intelligence. This breakthrough raises questions about how close AI is to being as smart as humans.
Although the o3 model shows great promise, there are still doubts. Not enough details have been shared, and scientists want to test it more to see how well it can adapt in different situations.

A New Way to Guide LLM Reasoning

What The Heck • 0 implied HN points • 15 Jan 25

🕹 Technology Data science

An algorithm can help guide LLM reasoning to generate correct answers more often. It uses a method similar to Monte Carlo Tree Search to improve outcomes.
By sampling different reasoning steps and keeping track of which ones lead to correct answers, we can better inform the LLMs on how to approach problems.
Having a feedback model to suggest better reasoning steps can enhance the overall performance of LLMs, making them more effective in generating accurate answers.

Get a weekly roundup of the best Substack posts, by hacker news affinity:

On Reason

domsteil • 0 implied HN points • 27 Jan 25

🕹 Technology Data science

Intelligence grows through a system of rewards and lessons learned over time. It’s not just about finding the one right answer but refining our understanding step by step.
Using principles like blame and reward helps us learn better, whether it's cooking, driving lessons, or training AI. This process shows us how to improve and adapt in different situations.
AI can become more flexible and powerful by training with specific tasks. By experimenting and learning from mistakes, we can develop smarter AI systems that can tackle a variety of tasks.

February RSS AI and Data Science newsletter - anything to contribute?

RSS DS+AI Section • 0 implied HN points • 18 Jan 25

🕹 Technology Data science

The next newsletter for AI and Data Science will come out in early February. It’s a good chance to stay updated.
You can contribute to the newsletter if you have announcements, meetups, or jobs to share. Just reach out directly instead of replying to the email.
Make sure to send your contributions to the specified email to ensure they are included.

The Art of Forgetting: How AI Learns to Understand Rather Than Memorize

Nano Thoughts • 0 implied HN points • 27 Jan 25

🕹 Technology Data science

AI can struggle with memorization instead of understanding, similar to how students might remember specific math problems without grasping the general concept. When AI memorizes examples too closely, it can't apply knowledge to new situations.
Techniques like regularization help AI focus on important patterns rather than get lost in details. This is like training athletes under various conditions to build real skills instead of just practicing one way.
Understanding how to forget unimportant information is crucial for both AI and human intelligence. The best learning doesn't come from remembering everything, but from knowing which patterns are worth keeping.

Quant Letter: February 2025, Week-3

The Parlour • 0 implied HN points • 19 Feb 25

💰 Finance Data science

Using data from US corporate bond holdings can help predict credit risk better than traditional ratings. It means more real-time information for making investment decisions.
A new investment strategy called Betting Against Bad Beta is introduced. This strategy aims to improve how investors can bet against stocks with poor performance.
Machine learning is becoming more important in finance, especially for analysis and predicting risks. This technology helps make smarter investment choices.

Generative Agents 2.0

Gonzo ML • 0 implied HN points • 24 Feb 25

🕹 Technology Data science

Researchers successfully created AI agents that can simulate 1,052 real people with about 85% accuracy. This means the AI can closely mimic how real people would respond in various situations.
The study highlights the importance of interviews over surveys, as they provide deeper insights into people’s behaviors and thoughts, allowing the AI to generate better follow-up questions and responses.
These AI agents have potential uses in social science research. They could help predict public reactions to policy changes or simulate behavioral responses, leading to new methods of understanding human decision-making.

s1: Simple test-time scaling

Gonzo ML • 0 implied HN points • 12 Feb 25

🕹 Technology Data science

A new model called s1-32B was created by using a small dataset of 1,000 question-answer pairs focused on reasoning. This cost about $25 to train, which is quite affordable.
The method of controlling how much the model thinks during tests allows for better performance. They used a strategy called budget forcing to ensure the model generates the right amount of information.
This approach showed that it's possible to achieve high-quality results with less data and resources, suggesting a promising path for future AI developments.

Reasoning Environments, LLM Memory, and Financial Interpretability

ppdispatch • 0 implied HN points • 06 Jun 25

🕹 Technology Data science

Reasoning Gym offers new ways to train models so they can get better at logic and math. It's like a gym for AI where they can practice and improve their skills.
New techniques are helping us understand how large language models work in finance. This makes it easier to spot problems and ensure they follow rules.
Research shows that language models like GPT memorize data before they start to understand it better. They can store a certain amount of information before they have to generalize.

Online event, 4pm, This Wednesday, Federated Learning for data scientists and statisticians

RSS DS+AI Section • 0 implied HN points • 09 Jun 25

🕹 Technology Data science

There's an online talk about federated learning happening this Wednesday at 4 PM. It's a great chance to learn from experts in the field.
The talk will explain how federated learning is different from traditional analysis. You'll find out what it means for the future of data science.
Participants will also discuss the challenges of federated analytics and how it works today. It's a good opportunity to think about new possibilities in data analysis.

SQL in Hex

Expand Mapping with Mike Morrow • 0 implied HN points • 14 Jul 25

🕹 Technology Data science

You can choose how SQL query results are stored in Hex, either in memory or in the database. This affects how quickly you can run follow-up queries.
There are two types of SQL commands in Hex: one that queries directly from the database and another that queries from a local in-memory dataframe. This choice can impact how your data is used.
Hex allows you to chain SQL queries, which makes handling complex tasks easier. However, you need to be aware of where each query pulls data from to avoid surprises.

Why ChatGPT‑5 Feels Smarter in Some Ways — and Confusing in Others

philsiarri • 0 implied HN points • 18 Aug 25

🕹 Technology Data science

ChatGPT-5 can handle longer chats and understand both text and images, making it more versatile.
Despite improvements, it sometimes makes mistakes in complex tasks or gives wrong answers even if they sound good.
Users appreciate friendlier responses and expect better reliability as OpenAI makes updates to the model.

December Newsletter

RSS DS+AI Section • 0 implied HN points • 01 Dec 25

🕹 Technology Data science

Data science and AI are constantly evolving, with new technologies and tools emerging regularly. Keeping up with these changes is important for anyone interested in the field.
Ethics in AI is a major topic right now. It's essential to discuss bias, regulation, and the moral implications of using AI in our lives.
There are many opportunities to get involved in data science communities, whether through volunteering or participating in discussions. Joining these groups can help shape the future of data science.

Quant Letter: December 2025, Week-1

The Parlour • 0 implied HN points • 04 Dec 25

💰 Finance Data science

Open-source satellite imagery can be used to create a global census of residential buildings to better measure climate risk and its impacts on housing and financial stability.
Recent quantitative research is applying remote sensing and data-driven techniques to map built environments and inform climate and risk modeling.
Full articles and curated analyses are often behind a subscription paywall, but short free trials can give temporary access to the full archives.

January RSS AI and Data Science newsletter - anything to contribute?

RSS DS+AI Section • 0 implied HN points • 23 Dec 25

🕹 Technology Data science

A new RSS AI and Data Science newsletter will be sent out in early January.
Contributions are welcome, such as announcements, meetups, publications, and jobs.
Please send contributions directly by email rather than replying to the newsletter message.

February RSS AI and Data Science newsletter - anything to contribute?

RSS DS+AI Section • 0 implied HN points • 23 Jan 26

🕹 Technology Data science

The Royal Statistical Society AI and Data Science newsletter will be published in early February.
Contributions are invited, including announcements, meetups, publications, and job listings.
Please send items directly to [email protected] rather than replying to the email.

Econ 196: WEEK 1: Introduction, & the Very Longest Run Shape of Human Population History

Brad DeLong's Grasping Reality • 0 implied HN points • 02 Jan 26

🚌 Education Data science

The course is a quantitative, long-run global economic history class that teaches data-science literacy (including Python) to analyze population and income trends.
Grades are intentionally generous but contingent on showing up, doing pre-class work, and participating—skip or zone out and you lose that privilege.
Expect weekly short writing assignments, background readings, small data exercises, and optional Thursday Zoom sessions, with all logistics and materials posted on the course site.

How They Built The Top Health AI: Flo Health

The Healthtech Initiative • 0 implied HN points • 02 Mar 26

🕹 Technology Data science

Small, autonomous teams that own their entire stack unlocked velocity and scale, while splitting functions (like mobile and backend) slowed delivery.
Only use AI when it truly outperforms simple rules—reserve models for cycle prediction, symptom analysis, personalization, and fine-tune on women’s health data to reduce bias and improve safety.
Build the core competitive advantage (the health AI and data flywheel) and buy everything else, using wearable time-series models to proactively predict conditions and power growth.