The hottest ML Substack posts right now

And their main takeaways

Wanna Bet?

I'll Keep This Short • 5 implied HN points • 11 Apr 23

Prediction markets can help gain subject matter expertise.
Precise forecasting requires precisely defined questions.
Viral topics attract more participation in prediction markets.

ML for Robots: Hybrid Learned vs End-to-End Learned

General Robots • 4 HN points • 27 Feb 23

🕹 Technology ML Robotics AI Data Research

Using deep convolutional neural networks for perception is a good idea in robotics.
For specific robot tasks, consider a Hybrid Learned approach combining ML perception with classical robotics.
For a general-purpose robot that can respond to any request, End-to-End learning may be more suitable.

The Economics of Building ML Products in the LLM Era

ScaleDown • 3 HN points • 06 Apr 23

🕹 Technology AI ML APIs Data science Economics

LLM APIs are changing how AI products are developed and who can develop them
Building applications using APIs first allows for quick market entry and low initial costs
Finetuning and eventually moving to custom models can reduce costs and improve system trust

Interview Session: Design a ChatGPT

The ZenMode • 3 HN points • 12 Feb 23

🕹 Technology AI ML Data processing Infrastructure Model optimization

ChatGPT is a large language model trained by OpenAI to generate human-like text responses.
Designing a ChatGPT-like system involves components such as data processing, model training, inference, and deployment.
Making such a system scalable involves horizontal scaling, load balancing, caching, and monitoring.

The AI Report #5: GPT-4 aces MIT.. or not

The AI Report • 1 HN point • 19 Jun 23

🕹 Technology AI ML Speech Synthesis Research

The MIT fiasco exposed problems with how AI models are evaluated
The ML community uses various prompting techniques, such as chaining prompts
Using an AI model to both solve and evaluate a problem can lead to biased results

Interactive Mutation Browser

The Century of Biology • 1 HN point • 23 Apr 23

🕹 Technology AI Web Software Programming ML

AI has revolutionized workflows in research, writing, and coding.
Challenges in academic publication push researchers to share new tools and research results in alternative ways.
Exploring new UI designs for protein language models can open up exciting possibilities for biologists.

🥟 Chao-Down #253 Sam Altman looks to raise billions for AI chip factories, AI-generated products are coming to video podcast advertising, Amazon's struggles with making a next-gen "remarkable" Alexa

Chaos Theory • 0 implied HN points • 22 Jan 24

🕹 Technology AI Ethics Models ML Research

Sam Altman is looking to raise billions for AI chip factories.
AI-generated products are coming to video podcast advertising.
Amazon is facing challenges in making a next-gen 'remarkable' Alexa.

Underdog joins the fight

ML Under the Hood • 0 implied HN points • 05 Oct 23

🕹 Technology AI Cloud Models ML

Anthropic partners with Amazon in a $4B deal, offering access to the second-best LLM through an API on AWS Bedrock
Cloudflare introduces Workers AI to run low-power LLMs worldwide, aiming for data localization compliance
Mistral AI releases a powerful 7B model with Apache 2.0 license, outperforming larger models and providing true open-source capability

Heard the news? LLaMa is open source now. In a way...

ML Under the Hood • 0 implied HN points • 05 Mar 23

🕹 Technology ML OpenAI Product Development

LLaMa, a new GPT-3-class model from Meta, is now open source for researchers to access and try out.
Running the LLaMa model requires decent hardware, such as an Nvidia RTX 3090 with 24 GB of VRAM, plus ordinary system RAM.
OpenAI has introduced GPT-3.5 Turbo, which is not only better but also 10 times cheaper to use.

Today’s Top 5 HN posts

NeurIPS's Big ML Week

Sector 6 | The Newsletter of AIM • 0 implied HN points • 12 Dec 21

🕹 Technology AI ML Conferences Research Innovation

NeurIPS 2021 was a major conference for machine learning and AI, showcasing the latest research in the field.
Over 9,000 papers were submitted, showing a huge interest and activity in machine learning.
Google and Microsoft were the top contributors, reflecting their strong involvement in advancing AI technology.

Validating Low-Confidence LLM Generation

Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 0 implied HN points • 11 Jan 24

🕹 Technology AI ML Data Computing Language

A new method can find and fix mistakes in language models as they create text. This means fewer wrong or silly sentences when they're generating responses.
First, the system checks for uncertainty in the generated sentences to spot potential errors. If it sees something is likely wrong, it can pull in correct information from reliable sources to fix it.
This process not only helps fix single errors, but it can also stop those mistakes from spreading to the next sentences, making the overall output much more accurate.
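
The detect-then-fix loop described above can be sketched roughly as follows. This is a minimal illustration, not the method from the post: using the lowest token probability as the uncertainty signal, the 0.5 threshold, and the `lookup` callback standing in for retrieval from a reliable source are all assumptions, and the function names are hypothetical.

```python
from typing import Callable, List, Optional, Tuple

# Each sentence is paired with the probabilities the model assigned to its tokens.
Sentence = Tuple[str, List[float]]


def flag_low_confidence(sentences: List[Sentence], threshold: float = 0.5) -> List[int]:
    """Return indices of sentences whose least-confident token is below the threshold.

    Assumption: minimum token probability is used as the uncertainty signal.
    """
    return [
        i for i, (_, probs) in enumerate(sentences)
        if probs and min(probs) < threshold
    ]


def validate_and_repair(
    sentences: List[Sentence],
    lookup: Callable[[str], Optional[str]],
    threshold: float = 0.5,
) -> List[str]:
    """Walk sentences in order and replace flagged ones with a retrieved correction.

    `lookup` is a hypothetical stand-in for retrieval; it returns a corrected
    sentence, or None when nothing better is available.
    """
    repaired: List[str] = []
    for text, probs in sentences:
        if probs and min(probs) < threshold:
            fixed = lookup(text)
            repaired.append(fixed if fixed is not None else text)
        else:
            repaired.append(text)
    return repaired
```

Fixing each sentence before moving to the next is what keeps an early mistake from feeding into later sentences; in a real system the repaired text would be passed back to the model as context for the next generation step.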

Improving Text Embeddings with LLM Generated Synthetic Data

Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 0 implied HN points • 03 Jan 24

🕹 Technology AI Data ML Embeddings Synthetic Data

Synthetic data can be used to create high-quality text embeddings without needing human-labeled data. This means you can generate lots of useful training data more easily.
This study shows that it's possible to create diverse synthetic data by applying different techniques to various language and task categories. This helps improve the quality of text understanding across many languages.
Using large language models like GPT-4 for generating synthetic data can save time and effort. However, it’s also important to understand the limitations and ensure data quality for the best results.

Hot Topics #18 (Feb. 14, 2023)

The Merge • 0 implied HN points • 14 Feb 23

🕹 Technology ML Bioinformatics Code generation

Machine learning model predicts activation energies of hydrogen atom transfer in proteins
CodeBERTScore evaluates code generation using pretrained models
SWARM parallelism offers efficient communication for training large models

Hot Topics #19 (Feb. 21, 2023)

The Merge • 0 implied HN points • 22 Feb 23

🕹 Technology ML Robotics Models Optimization Algorithms

Molecular optimization using multi-objective Bayesian optimization and GFlowNets.
Discovery of a simple and effective optimization algorithm, Lion, for deep neural network training.
DreamerV3 algorithm based on world models outperforms previous approaches in various domains.

Hot Topics #22 (Apr. 3, 2023)

The Merge • 0 implied HN points • 03 Apr 23

🕹 Technology ML Optimization Language Models Machine Learning Robotics

Fast Imitation of Skills from Humans (FISH) can train robots with less than a minute of demonstrations.
Regularization and Lipschitz regularization are key in Optimal Transport-Based Distributionally Robust Optimization.
Chain of Hindsight technique helps align language models with human preferences by training on feedback sequences.

Subjective AI/ML Digest: April II

Boris Again • 0 implied HN points • 23 Apr 23

🕹 Technology AI ML Research Applications Risks

Language models used to create believable simulations
New open-source language models with large parameter counts and datasets
Innovative AI projects like ChatGPT and DINOv2 making advancements

Hot Topics #23 (May 2, 2023)

The Merge • 0 implied HN points • 02 May 23

🕹 Technology ML Protein engineering

Boosted Prompt Ensembles can enhance large language models' performance for reasoning
Large language models like ChatGPT can excel in relevance ranking for Information Retrieval tasks
Autonomous driving systems can be trained efficiently using deep RL without simulation or expert demonstrations

MythBusting LLMs: From GPU-rich Dreams to GPT-4's Gleam!

ScaleDown • 0 implied HN points • 19 Sep 23

🕹 Technology AI ML Computing Resources Innovation

Building your own GPT-4 equivalent is not easy
Having an LLM won't automatically give you a competitive edge
Having more GPUs doesn't always mean better outcomes

Top 5 HN Posts of the day • 0 implied HN points • 29 Apr 24
