The hottest AI/ML Substack posts right now

And their main takeaways

💥 Tech Talks Weekly #29

Tech Talks Weekly • 79 implied HN points • 30 Aug 24

This week features new talks from 11 conferences, including GopherCon UK 2024 and PyCon US 2024. It's a great way to catch up on the latest in tech from experts in the field.
The Tech Talks Weekly newsletter provides a convenient way to stay updated without the clutter of platforms like YouTube. You can watch talks at your own pace and reduce FOMO.
Readers are encouraged to share the newsletter and provide feedback through a form. This helps improve the content and build a better community around technology discussions.

Advanced Prompt Engineering

Deep (Learning) Focus • 609 implied HN points • 08 May 23

🕹 Technology AI/ML Deep Learning Prompt engineering Information Retrieval

LLMs can solve complex problems by breaking them into smaller steps using chain-of-thought (CoT) prompting.
Automatic prompt engineering techniques, like gradient-based search, provide a way to optimize language model prompts based on data.
Simple techniques like self-consistency and generated knowledge can be powerful for improving LLM performance in reasoning tasks.
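As a rough illustration of the self-consistency idea mentioned above, the sketch below takes several sampled chain-of-thought completions for one question and returns the most common final answer. The sampling step is left out: the `sampled` list stands in for whatever an LLM would return, and the "Answer: ..." suffix convention and the function names are assumptions made for this example, not something taken from the post.

```python
import re
from collections import Counter
from typing import List, Optional


def extract_answer(completion: str) -> Optional[str]:
    """Return the text after a trailing 'Answer:' marker, or None if absent.

    The 'Answer:' convention is an assumption for this sketch.
    """
    match = re.search(r"Answer:\s*(.+?)\s*$", completion.strip())
    return match.group(1) if match else None


def self_consistent_answer(completions: List[str]) -> Optional[str]:
    """Majority-vote over the final answers of several reasoning paths."""
    answers = [a for a in map(extract_answer, completions) if a is not None]
    if not answers:
        return None
    # most_common(1) returns the answer with the highest vote count.
    return Counter(answers).most_common(1)[0][0]


# Hypothetical completions, as if sampled with temperature > 0.
sampled = [
    "3 boxes of 4 is 12, plus 2 loose is 14. Answer: 14",
    "4 + 4 + 4 = 12, and 12 + 2 = 14. Answer: 14",
    "3 times 4 is 12, minus 2 is 10. Answer: 10",
]
print(self_consistent_answer(sampled))  # prints 14
```

The vote only helps when the reasoning paths differ, so in practice the completions would come from repeated sampling rather than greedy decoding.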

RAG is more than just vectors

Tribal Knowledge • 11 HN points • 17 Jul 24

🕹 Technology AI/ML Data Storage APIs Data retrieval

RAG provides context to an LLM by fetching data from various sources, not just vector databases. It can use any data store to enhance the language model's predictions.
Context for an LLM can include system prompts, chat history, RAG, fine-tuning, and more. Any way to turn information into text can improve LLM performance.
RAG can work with vectors, but it's not limited to them. By enabling the LLM to call functions, it can fetch data from a variety of sources beyond vectors, like relational or graph databases.
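To make the point about non-vector retrieval concrete, here is a minimal sketch in which a function call requested by the model is served from a relational store (an in-memory SQLite database) and the result is placed in the prompt as context. The `orders` table, the `get_orders` tool, and the shape of the `call` dictionary are all hypothetical; no real LLM API is invoked, and the tool call is written by hand where a model would normally produce it.

```python
import sqlite3
from typing import Callable, Dict


def make_db() -> sqlite3.Connection:
    """Create a small in-memory database with a hypothetical orders table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, status TEXT)"
    )
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?, ?)",
        [(1, "ada", "shipped"), (2, "ada", "processing"), (3, "bob", "delivered")],
    )
    conn.commit()
    return conn


def get_orders(conn: sqlite3.Connection, customer: str) -> str:
    """Retrieval step: plain SQL, rendered as text for the model's context."""
    rows = conn.execute(
        "SELECT id, status FROM orders WHERE customer = ? ORDER BY id",
        (customer,),
    ).fetchall()
    return "\n".join(f"order {oid}: {status}" for oid, status in rows)


# Registry of functions the model is allowed to call (names are assumptions).
TOOLS: Dict[str, Callable[..., str]] = {"get_orders": get_orders}


def build_prompt(conn: sqlite3.Connection, tool_call: dict, question: str) -> str:
    """Run the requested tool and prepend its output to the question."""
    tool = TOOLS[tool_call["name"]]
    context = tool(conn, **tool_call["arguments"])
    return f"Context:\n{context}\n\nQuestion: {question}"


conn = make_db()
# In a real system this dictionary would be produced by the model.
call = {"name": "get_orders", "arguments": {"customer": "ada"}}
prompt = build_prompt(conn, call, "Where are my orders?")
print(prompt)
```

The same pattern would apply to a graph database or an HTTP API: only the body of the tool function changes, while the step that turns results into text for the prompt stays the same.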

Self Supervised Learning (SSL)

Dubverse Black • 98 implied HN points • 09 Aug 23

🕹 Technology AI/ML Research Papers Image Recognition

Self Supervised Learning (SSL) is a way to train models using synthetic labels generated from the data itself.
SSL can be applied in domains such as NLP, speech, and vision, using techniques like masked language modeling (MLM), language modeling (LM), VICReg, autoencoders, and VAEs.
SSL lets models learn powerful data representations inexpensively, which can then be used for tasks like transfer learning and fine-tuning.
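The idea of labels generated from the data itself can be sketched with a simplified masked language modeling setup: some tokens are hidden, and the hidden tokens become the training targets. This is only an illustration under stated assumptions; the `[MASK]` string, the `make_mlm_example` name, and the masking rate are choices made here, and real MLM recipes usually add refinements such as sometimes keeping or randomly replacing the selected token.

```python
import random
from typing import List, Optional, Tuple

MASK = "[MASK]"  # placeholder token; the exact string is an assumption


def make_mlm_example(
    tokens: List[str], mask_prob: float = 0.15, seed: int = 0
) -> Tuple[List[str], List[Optional[str]]]:
    """Build (inputs, labels) for masked language modeling.

    Each token is masked independently with probability mask_prob.
    labels holds the original token at masked positions and None elsewhere,
    so no human annotation is needed.
    """
    rng = random.Random(seed)  # local RNG keeps the example reproducible
    inputs: List[str] = []
    labels: List[Optional[str]] = []
    for tok in tokens:
        if rng.random() < mask_prob:
            inputs.append(MASK)
            labels.append(tok)
        else:
            inputs.append(tok)
            labels.append(None)
    return inputs, labels


tokens = "the cat sat on the mat".split()
inputs, labels = make_mlm_example(tokens, mask_prob=0.5, seed=1)
print(inputs)
print(labels)
```

A model trained to predict `labels` from `inputs` has to use the surrounding tokens, which is how the representation is learned without manual labeling.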

XGen, a 7B LLM trained on up to 8K sequence length from Salesforce

MLOps Newsletter • 78 implied HN points • 05 Aug 23

🕹 Technology AI/ML Weather Climate Models Libraries

ClimaX is a deep learning model designed for weather and climate tasks like forecasting temperature and predicting extreme weather events.
XGen is a 7B LLM trained on up to 8K sequence length, achieving state-of-the-art results on benchmarks like MMLU and HumanEval and on QA tasks.
The GPT-4 API from OpenAI provides easy access to a powerful language model capable of generating text, translating languages, and answering questions.

GroupBy #27: Balancing HDFS DataNodes in the Uber DataLake, How Figma’s databases team lived to tell the scale

VuTrinh. • 19 implied HN points • 19 Mar 24

🕹 Technology Data Engineering Infrastructure Software Development AI/ML Web Technologies

Balancing your data infrastructure is key for efficiency and reliability. Companies like Uber face challenges in maintaining this balance as they scale up their data needs.
Figma's database team has successfully handled a massive growth in data since 2020, showing that scaling can lead to new technical challenges but also growth opportunities.
Optimizing data pipelines can save significant costs. Techniques to reduce data shuffling in processes like Apache Spark can help make data handling more efficient.

More Super Models is All We Need

TheSequence • 77 implied HN points • 18 Feb 24

🕹 Technology AI/ML Tech Releases ML Research Real World ML AI Radar

Last week saw the release of five major foundation models in the generative AI space, each from a different tech giant, showcasing innovative advancements in various areas like text-to-video generation and multilingual support.
These new models are not only significant for the future of generative AI applications but also highlight the unique innovations and contributions made by different companies in the AI field.
The continuous evolution and release of these super models are driving progress and setting new standards in the field of generative AI, pushing boundaries and inspiring further advancements.

Money of the Future

Random Minds by Katherine Brodsky • 42 implied HN points • 06 Oct 23

💰 Finance Cryptocurrencies Digital Currency Blockchain AI/ML

Financial transactions are evolving, with a shift towards digital currencies.
Cryptocurrencies offer potential for increased privacy and security but face challenges with adoption and ease of use.
Future financial security may rely on biometric data and quantum encryption for heightened protection.

How to split the pie: Shaping Web3 Advertising market under fair competition

Oleksii Sidorov • 36 implied HN points • 01 Mar 23

🕹 Technology Web3 AI/ML Advertising Data Algorithms

A wide network is crucial for efficient optimization in advertising.
Having advanced algorithms can give a competitive edge in Web3 advertising.
The Web3 advertising market is likely to have multiple strong players rather than a monopoly.

Issue #42

Infra Weekly Newsletter • 13 implied HN points • 04 Apr 23

🕹 Technology Infrastructure AI/ML Cloud Computing Databases Programming

GitHub's RSA SSH host private key was briefly exposed, prompting GitHub to replace it.
Tech leaders like Elon Musk are calling for caution in advancing AI beyond human level.
Consider using Postgres as a graph database, and explore tools that bring OpenAI GPT into PostgreSQL.

An Incentive to Label

Olshansky's Newsletter • 6 HN points • 02 Apr 23

🕹 Technology AI/ML Blockchain AR Tech Solutions

High-quality labels are crucial for the success of language models like GPT.
Labeling tasks take significant time and effort to ensure quality annotations.
Blockchain-based solutions can incentivize better labeling by penalizing low-quality work.

MAD, China, and the Semiconductor Showdown (Part 1)

East Wind • 3 HN points • 20 Mar 23

🕹 Technology AI/ML Venture Capital Semiconductors Geopolitics Investments

The US leads in dollars deployed across the MAD Index, followed by China.
China invests in 'deep tech' areas of the tech ecosystem with fewer companies.
The US adopts a laissez-faire approach to VC investing, while China focuses on core tech categories.

TTW Extra #5 🔥: Must-watch 2024 QCon talks

Tech Talks Weekly • 0 implied HN points • 04 Jun 24

🕹 Technology Software Engineering AI/ML DevOps Frontend Backend Leadership

QCon talks cover a wide range of software engineering topics, including backend, frontend, AI, and DevOps. These talks are great for anyone looking to learn more about tech trends.
A curated list of 35 must-watch talks from QCon London and San Francisco includes interesting topics like how Netflix uses Java and scaling with Amazon DynamoDB. These videos can help you understand real-world applications of technology.
If you subscribe, you'll get a weekly email with new talks from over 100 conferences. This is an easy way to stay updated on tech without the clutter of YouTube.

On Chat GPT Dumbness, Trustbit Benchmarks and ML Product Labs

ML Under the Hood • 0 implied HN points • 10 Sep 23

🕹 Technology AI/ML Benchmarks Guides Language Models

ChatGPT is not getting dumber; it is misunderstood when instructions aren't clear.
LLM Benchmark: A new model has surpassed ChatGPT 3.5 on Enterprise Workloads.
ML Product Labs offers two new guides for building products with LLM technology.

The Key Aspect of Problem Framing in Building AI/ML Applications

CodeLink’s Substack • 0 implied HN points • 31 Jan 24

🕹 Technology AI/ML

Clearly define the problem you want to solve to set the project direction.
Identify and outline the desired outcomes to avoid overcomplexity.
Set performance measurements, benchmarks, and evaluation metrics to track progress and guide development.

Designing Better Evaluations of Generative Models

Tom’s Substack • 0 implied HN points • 11 Nov 23

🕹 Technology AI/ML Evaluation Generative models Red-Teaming

Evaluation of models should focus on selecting the best performing model, giving confidence in AI outputs, identifying safety and ethical issues, and providing actionable insights for improvement.
Standard evaluation approaches face challenges like broad performance metrics, data leakage from benchmarks, and lack of contextual understanding.
To improve evaluations, embrace human-centered evaluation methods and red-teaming to understand user perceptions, uncover vulnerabilities, and ensure models are safe and effective.

Example-Driven Development

m3 | music, medicine, machine learning • 0 implied HN points • 17 Aug 23

🕹 Technology AI/ML Programming Tools Development Machine Learning

Providing a wider range of examples to ChatGPT helps in generating more natural-sounding outputs.
A local plugin for ChatGPT can access local files and supply them as context, which makes collaboration easier.
Example-driven development with LLMs is useful for identifying relevant context, mimicking input characteristics, and making connections between different types of files.

Why now?

Miguel’s Substack • 0 implied HN points • 01 May 23

🕹 Technology AI/ML Community Newsletter Tutorials Social media

Realize that you are not defined by your job.
It's important to pursue what truly interests you.
Building a supportive community is crucial for growth and learning.

AI Partnerships Advance Industrial Automation

Exponential Industry • 0 implied HN points • 28 Jan 24

🕹 Technology AI/ML Manufacturing Robotics Semiconductors Energy Storage

AI partnerships are advancing industrial automation by improving quality, throughput, and worker safety.
Businesses are investing in new technologies like sensors, robotics, 3D printing, and AI to enhance manufacturing processes.
Government initiatives like Made Smarter are driving tech investments in SMEs for industry growth and sustainability.

Inside India’s Biggest Data Engineering Summit

Sector 6 | The Newsletter of AIM • 0 implied HN points • 03 Jun 24

🕹 Technology Data science AI/ML Software Engineering Innovation

The Data Engineering Summit in Bengaluru was a huge success, with over 1,000 attendees and more than 50 speakers from the AI and analytics community.
Key topics of discussion included software deployment architectures and frameworks for using data in business, highlighting the importance of these technologies.
Attendees showed lots of enthusiasm for the discussions and innovative ideas that were shared at the event, demonstrating a vibrant interest in data engineering.