The hottest Modeling Substack posts right now

And their main takeaways

Everyone Loves The Idea Of AI, But Not The Reality

High ROI Data Science • 615 implied HN points • 06 Oct 24

🕹 Technology AI Data Business Modeling Training Platforms

Many businesses love the idea of AI but find it hard to put into practice. It often looks easy on paper, but the reality is very different when trying to make it work.
Data is really important for AI to work well. Companies need good data to build effective AI products, and often, they realize this too late after facing challenges.
AI projects often fail because businesses don’t fully understand what they need to achieve. Companies should focus on solving real problems rather than just using the latest technology.

Entrevista a Grady Booch (Interview with Grady Booch)

Érase una vez un algoritmo... • 39 implied HN points • 27 Oct 24

🕹 Technology Software Engineering AI Development Ethics Modeling Computer Science

Grady Booch is a key figure in software engineering, known for co-creating the Unified Modeling Language (UML), which helps developers visualize software systems. His work has changed how we think about software design.
He emphasizes the ongoing evolution in software engineering due to changes like AI and mobile technology. Adaptation and continuous learning are essential for success in this field.
Booch advocates for ethics in technology development, stressing the need for education and accountability among tech leaders to ensure responsible use of AI and other emerging technologies.

N-1 Block A

Soviet Space Substack • 178 implied HN points • 12 Oct 24

🕹 Technology Aerospace Engineering Modeling

The N1-3L rocket has a complex engine system, with different engines numbered for clarity. Understanding these details is crucial for analyzing the rocket's design and performance.
Grid fins are an important feature of the N1 rocket, providing enhanced control during high-speed flights. Their design has evolved over time to improve stability and effectiveness.
There were various design changes made to the Block A of the N1 rocket to improve its function and control. These updates were likely based on lessons learned from previous flight tests.

DeepSeek and the Future of AI Competition with Miles Brundage

ChinaTalk • 948 implied HN points • 25 Jan 25

🕹 Technology AI Modeling Competition Trade policy Innovation

DeepSeek's R1 model shows that AI competition is heating up between the U.S. and China. It is similar to OpenAI's model but was developed quickly, closing the gap.
DeepSeek's efficiency is driven by export controls, which limit its access to advanced chips. With more chips, its AI capabilities would improve.
Open-sourcing AI models has its benefits, but governments need to be careful. They should ensure the technology is not misused while still allowing some level of open collaboration.

From Theory to Practice: Inductive Biases in Machine Learning

Mindful Modeler • 639 implied HN points • 23 Apr 24

🕹 Technology Machine Learning Algorithms Data Bias Modeling

Different machine learning models exhibit varying behaviors when extrapolating features, influenced by their inductive biases.
Inductive biases in machine learning influence the learning algorithm's direction, excluding certain functions or preferring specific forms.
Understanding inductive biases can lead to more creative and data-friendly modeling practices in machine learning.
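
A minimal sketch of the extrapolation point above, assuming two toy learners that are not taken from the post: a least-squares line and a 1-nearest-neighbor predictor. The names fit_line and fit_nearest_neighbor are hypothetical. Both fit the training points exactly, yet they disagree far outside the training range because of their different inductive biases.

```python
def fit_line(xs, ys):
    """Least-squares line: assumes the trend continues outside the data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


def fit_nearest_neighbor(xs, ys):
    """1-nearest neighbor: repeats the target of the closest training point."""
    pairs = list(zip(xs, ys))
    return lambda x: min(pairs, key=lambda p: abs(p[0] - x))[1]


# Toy training data on the range 0..4 (illustrative values only).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [2.0 * x + 1.0 for x in xs]

linear = fit_line(xs, ys)
nearest = fit_nearest_neighbor(xs, ys)

# Query far outside the training range.
linear_at_10 = linear(10.0)    # keeps following the fitted slope
nearest_at_10 = nearest(10.0)  # stays flat at the last observed target
print(linear_at_10, nearest_at_10)
```

Neither answer is "right" in general; which one is preferable depends on what is known about the data-generating process.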

Statistical modeling seen through inductive biases

Mindful Modeler • 419 implied HN points • 28 May 24

🔬 Science Statistics Modeling Machine Learning

Statistical modeling involves modeling distributions and assuming relationships between features and the target with a few interpretable parameters.
Distributions shape the hypothesis space by restricting the range of models compatible with specific distributions like a zero-inflated Poisson distribution.
Parameterization in statistical modeling simplifies estimation, interpretation, and inference of model parameters by making them more interpretable and allowing for confidence intervals.
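
As a rough illustration of how a distributional assumption restricts the hypothesis space, the sketch below writes out the probability mass function of a zero-inflated Poisson distribution. The parameter names lam (Poisson rate) and pi_zero (probability of a structural zero) and the numeric values are assumptions chosen for the example, not taken from the post.

```python
import math


def poisson_pmf(k, lam):
    """P(K = k) for a Poisson distribution with rate lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def zero_inflated_poisson_pmf(k, lam, pi_zero):
    """Mixture: with probability pi_zero the count is a structural zero,
    otherwise it is drawn from Poisson(lam)."""
    base = (1.0 - pi_zero) * poisson_pmf(k, lam)
    return pi_zero + base if k == 0 else base


lam, pi_zero = 2.0, 0.3  # illustrative values
p_zero_poisson = poisson_pmf(0, lam)
p_zero_zip = zero_inflated_poisson_pmf(0, lam, pi_zero)

# The probabilities still sum to one (the tail beyond 60 is negligible here).
total = sum(zero_inflated_poisson_pmf(k, lam, pi_zero) for k in range(60))
print(p_zero_poisson, p_zero_zip, total)
```

The whole model is described by two interpretable parameters, which is the kind of restriction the takeaways above refer to.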

No, Sora has not “learned physics”

Marcus on AI • 3596 implied HN points • 02 Mar 24

🕹 Technology AI Physics Modeling

Sora is not a reliable source for understanding how the world works, as it focuses more on how things look visually.
Sora's videos often depict objects behaving in ways that defy physics or biology, indicating a lack of understanding of physical entities.
The inconsistencies in Sora's videos highlight the difference between image sequence prediction and actual physics, emphasizing that Sora is more about predicting images than modeling real-world objects.

How to make use of inductive biases

Mindful Modeler • 219 implied HN points • 04 Jun 24

🔬 Science Machine Learning Interpretability Forecasting Modeling

Inductive biases play a crucial role in model robustness, interpretability, and leveraging domain knowledge.
Choosing inherently interpretable models can enhance model understandability by restricting the hypothesis space of the learning algorithm.
By selecting inductive biases that reflect the data-generating process, models can better align with reality and improve performance.

How I made peace with quantile regression

Mindful Modeler • 778 implied HN points • 16 Jan 24

🔬 Science Statistics Machine Learning Estimation Modeling

Quantile regression can be understood through the lens of loss optimization, specifically with the pinball loss function.
In machine learning, quantile regression is essentially regression with the pinball loss, which penalizes the absolute difference between actual and predicted values with different weights for under- and over-predictions.
The asymmetry of the pinball loss function, controlled by the parameter tau, dictates how models should handle under- and over-predictions, making quantile regression a tool to optimize different quantiles of a distribution.
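
A small sketch of the pinball loss described above, assuming the common convention that under-predictions are weighted by tau and over-predictions by 1 - tau. The function names and the toy data are hypothetical. Searching for the constant prediction with the lowest mean loss recovers the corresponding quantile of the data.

```python
def pinball_loss(y_true, y_pred, tau):
    """Pinball loss for one observation at quantile level tau in (0, 1)."""
    diff = y_true - y_pred
    # Under-prediction (diff >= 0) costs tau per unit,
    # over-prediction (diff < 0) costs (1 - tau) per unit.
    return tau * diff if diff >= 0 else (tau - 1.0) * diff


def mean_pinball(ys, prediction, tau):
    """Average pinball loss of a single constant prediction."""
    return sum(pinball_loss(y, prediction, tau) for y in ys) / len(ys)


def best_constant(ys, tau):
    """Brute-force search over the observed values for the best constant."""
    return min(ys, key=lambda q: mean_pinball(ys, q, tau))


ys = list(range(1, 100))  # toy targets 1..99

best_median = best_constant(ys, tau=0.5)  # symmetric loss -> median
best_q90 = best_constant(ys, tau=0.9)     # under-predictions cost 9x more
print(best_median, best_q90)
```

Raising tau makes under-prediction more expensive, so the minimizer moves toward the upper tail.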

Scaling Laws Meet Economics, but Adoption is still Accelerating

Mule’s Musings • 333 implied HN points • 19 Dec 24

🕹 Technology AI Economics Modeling Hardware Innovation

Economics are very important when it comes to scaling tech, and while costs are rising, tools like ChatGPT are still becoming more popular. Understanding the balance of cost and usage is crucial.
Scaling laws are changing, and relying solely on large pre-trained models may not be the best strategy anymore. Businesses might need to explore smaller models or alternative methods to improve efficiency and reduce costs.
Adoption of AI technologies is still growing rapidly, which shows that despite challenges, many people are eager to use and integrate these tools into their lives.

No Free Dessert in Machine Learning

Mindful Modeler • 399 implied HN points • 20 Feb 24

🔬 Science Machine Learning Generalization Modeling Data

Generalization in machine learning is essential for a model to perform well on unseen data.
There are different types of generalization in machine learning: from training data to unseen data, from training data to application, and from sample data to a larger population.
The No Free Lunch theorem in machine learning highlights that generalization always requires assumptions and effort; further generalization never comes for free.

2024 Interconnects year in review

Democratizing Automation • 229 implied HN points • 31 Dec 24

🕹 Technology AI Policy Open Source Modeling Evaluation

In 2024, AI continued to be the hottest topic, with major changes expected from OpenAI's new model. This shift will affect how AI is developed and used in the future.
Writing regularly helped to clarify key AI ideas and track their importance. The focus areas included reinforcement learning, open-source AI, and new model releases.
The landscape of open-source AI is changing, with fewer players and increased restrictions, which could impact its growth and collaboration opportunities.

The Sequence Knowledge #478: Speculative RAG is a More Efficient Form of RAG

TheSequence • 147 implied HN points • 28 Jan 25

🕹 Technology AI Software Research Modeling Innovation

Speculative RAG uses two models to improve results. One model specializes in creating content, while the other checks and verifies it.
This new approach makes the overall system more efficient and accurate than traditional methods.
Understanding how Speculative RAG works can help enhance AI technologies and their applications.

GraphRAG Analysis, Part 1: How Indexing Elevates Knowledge Graph Performance in RAG

AI Encoder: Parsing Signal from Hype • 70 HN points • 09 Jul 24

🕹 Technology Analysis Research Evaluation Modeling Metrics

Knowledge graphs do not significantly impact context retrieval in RAG, as all methods showed similar context relevancy scores.
Neo4j with its own index improved answer relevancy and faithfulness compared to Neo4j without indexing and FAISS, showcasing the importance of effective indexing for precise content retrieval in RAG applications.
Developers need to consider the trade-offs between ROI constraints and performance improvements when deciding to use GraphRAG, especially in high-precision applications that require accurate answers.

to kaggle, or not to kaggle

Mindful Modeler • 379 implied HN points • 13 Feb 24

🕹 Technology Machine Learning Modeling Competitions Data science AI

There are conflicting views on Kaggle - some see it as a playground while others believe it produces top machine learning results.
Participating in Kaggle competitions can be beneficial to learn core supervised machine learning concepts.
The decision to focus on Kaggle competitions should depend on how much daily tasks align with Kaggle-style work.

Moving Past RLHF: In 2025 We Will Transition from Preference Tuning to Reward Optimization in Foundation Models

TheSequence • 189 implied HN points • 29 Dec 24

🕹 Technology Artificial Intelligence Machine Learning Neural Networks Modeling Data science

Artificial intelligence is moving from preference tuning to reward optimization for better alignment with human values. This change aims to improve how models respond to our needs.
Preference tuning has its limits because it can't capture all the complexities of human intentions. Researchers are exploring new reward models to address these limitations.
Recent models like OpenAI's o3 and Tülu 3 showcase this evolution, showing how AI can become more effective and nuanced in understanding and generating language.

How to sell bread with quantile regression

Mindful Modeler • 339 implied HN points • 23 Jan 24

🕹 Technology Machine Learning Data science Modeling Predictions

Quantile regression can be used for robust modeling to handle outliers and predict tail behavior, helping in scenarios where underestimation or overestimation leads to loss.
It is important to choose quantile regression when predicting specific quantiles, such as upper quantiles, for scenarios like bread sales where under or overestimating can have financial impacts.
Quantile regression can also be utilized for uncertainty quantification, and combining it with conformal prediction can improve coverage, making it useful for understanding and managing uncertainty in predictions.

OpenAI's moat

TechTalks • 334 implied HN points • 15 Jan 24

🕹 Technology AI Open Source Modeling Monetization Innovation

OpenAI is building new protections to safeguard its generative AI business from open-source models.
OpenAI is reinforcing network effects around ChatGPT with features like the GPT Store and user engagement strategies.
Reducing costs and preparing for future innovations, such as creating its own device, are part of OpenAI's strategy to stay competitive.

The Future of Prompt Engineering

Gradient Flow • 559 implied HN points • 04 May 23

🕹 Technology AI NLP Modeling Data Analysis Tools

NLP pipelines are shifting to include large language models (LLMs) for accuracy and user-friendliness.
Effective prompt engineering is crucial for crafting useful input prompts tailored to generative AI models.
Future prompt engineering tools need to be interoperable, transparent, and capable of handling diverse data types for collaboration and model sharing.

A Pragmatic View of Uncertainty in Machine Learning

Mindful Modeler • 359 implied HN points • 06 Jun 23

🕹 Technology Machine Learning Uncertainty Modeling Prediction Calibration

Predictions from machine learning models carry uncertainty, commonly split into aleatoric uncertainty (inherent randomness) and epistemic uncertainty (limited knowledge).
Defining and distinguishing between aleatoric and epistemic uncertainty is a complex task influenced by deterministic and random factors.
Conformal prediction methods capture both aleatoric and epistemic uncertainty, providing prediction intervals reflecting model uncertainty.

Defending machine learning in a room full of old-school statisticians

Mindful Modeler • 299 implied HN points • 27 Jun 23

🕹 Technology Machine Learning Statistics Modeling

Be mindful of your modeling mindset and be open to exploring other modeling cultures beyond your current beliefs.
Recognize that differences in modeling mindsets are deeply rooted in culture and background, influencing how individuals approach statistical modeling.
Interpretability remains a significant concern for modelers, especially in the context of machine learning advancements, although progress has been made in providing tools for better understanding models.

The Most Amazing Week in Gen AI Releases

TheSequence • 84 implied HN points • 15 Dec 24

🕹 Technology AI Software Innovation Research Modeling

Several major tech companies like OpenAI, Google, and Microsoft launched new AI models in a single week. This shows how quickly AI technology is progressing.
OpenAI's Sora model allows users to create videos from text descriptions, but it has some limitations. It's an exciting step for video generation!
Google's Gemini 2.0 has improved capabilities, allowing it to handle more complex tasks and interact more effectively with users.

The Social Network (Part 2)

Logging the World • 279 implied HN points • 13 Apr 23

🔬 Science Networks Statistics Social media Modeling

Real social networks exhibit more complex behaviors than simple mathematical models can capture.
The structure of social media follower counts differs significantly from the Erdős–Rényi network model, with some users having orders of magnitude more followers than others.
Later network models like the Barabási–Albert model better represent the dynamics of online social networks like Twitter, where heavy-tailed distributions of follower counts emerge.
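
The contrast between the two network models can be sketched with a small simulation. This is a simplified illustration under stated assumptions, not the post's code: the graph size, the seed, and the helper names are hypothetical, and the preferential-attachment generator is a basic textbook variant. Both graphs have a mean degree of about 4, but only the preferential-attachment graph grows large hubs.

```python
import random


def erdos_renyi_degrees(n, p, rng):
    """Each pair of nodes is linked independently with probability p."""
    degrees = [0] * n
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < p:
                degrees[i] += 1
                degrees[j] += 1
    return degrees


def barabasi_albert_degrees(n, m, rng):
    """Each new node links to m distinct existing nodes, chosen with
    probability proportional to their current degree."""
    degrees = [0] * n
    endpoints = []  # every node appears once per edge it belongs to
    # Seed graph: m + 1 fully connected nodes.
    for i in range(m + 1):
        for j in range(i + 1, m + 1):
            degrees[i] += 1
            degrees[j] += 1
            endpoints += [i, j]
    for new_node in range(m + 1, n):
        targets = set()
        while len(targets) < m:
            # Sampling from the endpoint list is degree-proportional.
            targets.add(rng.choice(endpoints))
        for target in targets:
            degrees[new_node] += 1
            degrees[target] += 1
            endpoints += [new_node, target]
    return degrees


rng = random.Random(0)
n = 1000
ba_degrees = barabasi_albert_degrees(n, m=2, rng=rng)
er_degrees = erdos_renyi_degrees(n, p=4 / (n - 1), rng=rng)

print("largest hub, Barabasi-Albert:", max(ba_degrees))
print("largest hub, Erdos-Renyi:", max(er_degrees))
```

In the random-pairing graph, degrees cluster tightly around the mean, while the rich-get-richer rule produces a few nodes with far more links than the rest.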

The Statistician Who Loved Machine Learning

Mindful Modeler • 279 implied HN points • 23 May 23

🕹 Technology Machine Learning Statistics Modeling Algorithms

Leo Breiman emphasized the importance of both data modeling culture and algorithmic modeling culture in statistical modeling.
Breiman advocated for being problem-focused over solution-focused, encouraging modelers to choose the appropriate mindset based on the task at hand.
Understanding various modeling mindsets, such as statistical inference and machine learning, is crucial for effective modeling.

how and why to lie with spreadsheets

Dan Davies - "Back of Mind" • 235 implied HN points • 07 Jun 23

💼 Business Finance Management Decision-making Modeling

Using spreadsheets to manipulate numbers is common in business and finance.
Understanding how to manipulate spreadsheet results can indicate a deep understanding of the business.
Spreadsheets are a tool to present arguments, not the arguments themselves.

Should we stop interpreting ML models because XAI methods are imperfect?

Mindful Modeler • 199 implied HN points • 31 Oct 23

🕹 Technology Machine Learning Interpretability Neural Networks Modeling

Don't let a pursuit of perfection in interpreting ML models hinder progress. It's important to be pragmatic and make decisions even in the face of imperfect methods.
Consider the balance of benefits and risks when interpreting ML models. Imperfect methods can still provide valuable insights despite their limitations.
While aiming for improvements in interpretability methods, it's practical to use the existing imperfect methods that offer a net benefit in practice.

Five Ideas I'll use in my optimization class after listening to Gurobi's Tobias Achterberg

Mike Talks AI • 216 implied HN points • 05 Oct 23

🚌 Education Optimization Machine Learning Modeling Algorithms

Mixed-integer programs (MIPs) are a powerful general-purpose tool for problem-solving.
Using tools like ChatGPT could potentially make optimization models more accessible.
Commercial optimization solvers are often superior to open-source ones due to resources and detailed engineering.

In Defense of Tech Trees

Atlas of Wonders and Monsters • 424 implied HN points • 18 Aug 23

🕹 Technology Gaming Innovation History Analysis Modeling

Tech trees serve as a powerful metaphor for various phenomena across time.
Tech trees are a game concept that discretize progress and provide rewards, making them fun.
Tech trees are more about analyzing past technologies than predicting future ones.

Explore Your Modeling Mindset With A Quiz

Mindful Modeler • 179 implied HN points • 20 Jun 23

🕹 Technology Modeling Web Development Machine Learning Programming

Modeling assumptions affect how the model can be used. For instance, causal considerations lead to causal claims.
Revisiting and understanding our modeling assumptions can help us tackle problems more effectively, beyond our usual mindset.
Creating simple static websites can be made easier with tools like GPT-4, especially if you have some understanding of HTML, CSS, and JavaScript.

Getting Started with LoRAs + Vodka Photorealism

followfox.ai’s Newsletter • 176 implied HN points • 15 Jun 23

🕹 Technology AI Training Modeling

The post discusses getting started with LoRAs and creating a photorealistic LoRA for Vodka models.
It includes steps like downloading and using a LoRA, training the first LoRA, and finally fine-tuning a custom LoRA for photorealistic results.
The process involves using specific tools, datasets, and parameters to train LoRAs, and explores possibilities for creating high-quality, realistic images.

Can you explain GPT with ... GPT?

Mindful Modeler • 199 implied HN points • 16 May 23

🕹 Technology Neural Networks Interpretability Modeling Language Models AI Ethics

OpenAI experimented with using GPT-4 to interpret the functionality of neurons in GPT-2, showcasing a unique approach to understanding neural networks.
The process involved analyzing activations for various input texts, selecting specific texts to explain neuron activations, and evaluating the accuracy of these explanations.
Interpreting complex models like LLMs with other complex models, such as using GPT-4 to understand GPT-2, presents challenges but offers a method to evaluate and improve interpretability.

From bare-bones to holistic machine learning

Mindful Modeler • 159 implied HN points • 08 Aug 23

🕹 Technology Machine Learning Modeling Interpretability Data Tools

Machine learning can range from simple, bare-bones tasks to more complex, holistic approaches.
In bare-bones machine learning, the modeling choices are defined, making it about the model's performance and tuning.
Holistic machine learning involves designing the model to connect with the larger context, considering factors like uncertainty, interpretability, and shifts in distribution.

Week #3: Conformal Prediction For Regression

Mindful Modeler • 279 implied HN points • 03 Jan 23

🔬 Science Regression Modeling

In regression, conformal prediction can turn point predictions into prediction intervals with coverage guarantees for future observations.
Two common approaches to creating prediction intervals are starting from point predictions and starting from non-conformal intervals produced by quantile regression.
Conformalized mean regression and conformalized quantile regression are two techniques to generate prediction intervals in regression models.
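
The first approach (starting from point predictions) can be sketched as split conformal prediction with absolute residuals as the conformity score. This is a minimal illustration under stated assumptions, not the post's code: the synthetic data, the simple least-squares model, and all function names are hypothetical.

```python
import math
import random


def fit_line(xs, ys):
    """Least-squares line used as the point predictor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


def conformal_quantile(scores, alpha):
    """The ceil((n + 1) * (1 - alpha))-th smallest calibration score."""
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        return math.inf  # too few calibration points for this alpha
    return sorted(scores)[k - 1]


rng = random.Random(0)


def sample(n):
    """Synthetic data: linear trend plus Gaussian noise."""
    xs = [rng.uniform(0, 10) for _ in range(n)]
    ys = [3.0 * x + rng.gauss(0, 1) for x in xs]
    return xs, ys


train_x, train_y = sample(200)
cal_x, cal_y = sample(200)      # held out from training
test_x, test_y = sample(1000)

model = fit_line(train_x, train_y)

# Conformity scores: absolute residuals on the calibration set.
scores = [abs(y - model(x)) for x, y in zip(cal_x, cal_y)]
half_width = conformal_quantile(scores, alpha=0.1)


def predict_interval(x):
    """Point prediction widened by the calibrated half-width."""
    center = model(x)
    return center - half_width, center + half_width


covered = sum(abs(y - model(x)) <= half_width for x, y in zip(test_x, test_y))
coverage = covered / len(test_x)
print("half-width:", half_width, "empirical coverage:", coverage)
```

With alpha = 0.1 the empirical coverage on fresh data should land near 90%; the guarantee holds on average over calibration sets and assumes the calibration and test data are exchangeable.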

Tang Shiping predicts Trump "highly likely" to win - probability more than 60%

Pekingnology • 56 implied HN points • 03 Nov 24

🇺🇸 U.S. Politics Elections Forecasting Political Analysis Modeling Public Opinion

A professor predicts that Donald Trump has a greater than 60% chance of winning the 2024 U.S. presidential election. This prediction is based on computer simulations rather than traditional polling.
The simulations suggest Trump will likely win key states like Michigan, Ohio, and Florida, while Harris is expected to win states like Georgia and Arizona.
The forecasting method used is known as Agent-Based Modeling, which combines real data about voters and economic conditions to make predictions rather than relying on expert opinions.

Novaxia vs Bigpharmia

Logging the World • 199 implied HN points • 04 Nov 22

🔬 Science Health Graphs Vaccines Modeling Data Analysis

Understand the impact of vaccines on disease spread: Novaxia and Bigpharmia are examples of two scenarios showing how vaccines can affect the spread of a disease differently.
Graphs help visualize data trends: Using different types of graphs can show how disease spread changes over time and the effectiveness of interventions like vaccines.
Consider the importance of logarithmic scales: Logarithmic scales can provide a different perspective on data trends, allowing for better understanding of the impact of interventions like vaccines.

SNT for Dummies

Gordian Knot News • 131 implied HN points • 06 Jan 24

🔬 Science Radiation Health Modeling Cancer Compensation

The SNT model is crucial for understanding radiation harm.
How the dose rate varies over time is important for realistic radiation harm models.
SNT is more accurate than LNT for assessing radiation harm, and it is easy to implement.

Understanding Different Uncertainty Mindsets

Mindful Modeler • 179 implied HN points • 24 Jan 23

🔬 Science Probability Modeling Uncertainty Machine Learning Statistics

Understanding the fundamental difference between Bayesian and frequentist interpretations of probability is crucial for grasping uncertainty quantification techniques.
Conformal prediction offers prediction regions with a frequentist interpretation, similar to confidence intervals in linear regression models.
Conformal prediction shares similarities with the evaluation requirements and mindset of supervised machine learning, emphasizing the importance of separate calibration and ground truth data.

In two days: Incremental Documentation for Your Database, Wednesday, Feb 7, 2024 19:00 CET

Minimal Modeling • 101 implied HN points • 05 Feb 24

🕹 Technology Data Management Documentation Modeling Collaboration

Event on Wednesday, February 7, 2024, 19:00 CET about Incremental Documentation for Databases.
The Minimal Modeling approach focuses on a lightweight tabular format for the data catalog.
Benefits include reduced onboarding time, better communication, and cost savings.

Preventing model exfiltration with upload limits

Redwood Research blog • 19 implied HN points • 08 May 24

🕹 Technology Security Compression Modeling Encryption

Preventing model exfiltration can be crucial for security; setting upload limits can be a simple yet effective way to protect large model weights from being stolen.
Implementing compression schemes for model generations can significantly reduce the amount of data that needs to be uploaded, providing an additional layer of protection against exfiltration.
Limiting uploads, tracking and controlling data flow from data centers, and restricting access to model data are practical approaches to making exfiltration of model weights harder for attackers.
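
The upload-limit idea can be sketched as a simple byte budget. This is a hypothetical illustration rather than the post's design: the class name, the 1 TB weight size, and the 10% cap are all assumptions chosen for the example. The point is that if the total outbound allowance is well below the size of the weights, a full copy cannot leave through that channel.

```python
class UploadBudget:
    """Tracks cumulative outbound bytes against a fixed cap."""

    def __init__(self, cap_bytes):
        self.cap_bytes = cap_bytes
        self.used_bytes = 0

    def try_upload(self, num_bytes):
        """Return True and record the upload if it fits under the cap."""
        if num_bytes < 0:
            raise ValueError("num_bytes must be non-negative")
        if self.used_bytes + num_bytes > self.cap_bytes:
            return False  # refuse: this would exceed the total allowance
        self.used_bytes += num_bytes
        return True


# Hypothetical numbers: 1 TB of weights, total upload cap at 10% of that.
WEIGHTS_BYTES = 1_000_000_000_000
budget = UploadBudget(cap_bytes=WEIGHTS_BYTES // 10)

normal_traffic_ok = budget.try_upload(50_000_000_000)  # 50 GB of outputs
exfiltration_ok = budget.try_upload(WEIGHTS_BYTES)     # a full weight copy
print(normal_traffic_ok, exfiltration_ok, budget.used_bytes)
```

Compressing legitimate outputs, as the takeaways mention, lets the cap be set lower without blocking normal traffic.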

The Key to AI in 2024 is Specialization

Dana Blankenhorn: Facing the Future • 59 implied HN points • 08 Jan 24

🔬 Science AI Biochemistry Modeling Research Specialization

Specialization is key to the future of AI in 2024.
AI will increase demand for scientists in various specialized fields.
Automation and AI technologies can support scientists in tasks like research, visualization, and prediction.