TheSequence $5 / month

TheSequence Substack focuses on the latest trends and innovations in AI, covering open-source LLMs, generative AI advancements, and multimodal generative AI. It discusses new research, frameworks, and tools, highlighting their impact on software development and on the efficiency and capabilities of AI applications.

Artificial Intelligence · Generative AI · Open Source AI Models · Language Models · Machine Learning Frameworks · AI Research · AI Applications in Software Development · Multimodal Generative AI

The hottest Substack posts of TheSequence

And their main takeaways
21 implied HN points 23 Jan 25
  1. Investing early in AI involves backing technical founders before they even start their company. It's about helping them develop their ideas and getting them the right support as they launch.
  2. Building a startup in the AI space should always begin with creating a great product, no matter how much money you have. It's important to focus on getting user feedback and refining your offering rather than spending excessively.
  3. AI security is becoming crucial as tech evolves. Companies need to be proactive in protecting against AI-driven cyber threats, and there are opportunities for startups to innovate in this space by securing AI implementations in various industries.
140 implied HN points 06 Mar 24
  1. BabyAGI project focuses on autonomous agents and AI enhancements for task execution, planning, and reasoning over time.
  2. Challenges in adopting autonomous agents include human behavior changes and enabling AI access to tools for task execution.
  3. Future generative AI trends include AI integration across various industries, increased passive AI usage, and automation of workflows with AI workers.
140 implied HN points 29 Feb 24
  1. OpenAI's Sora is a groundbreaking text-to-video model that can create high-quality videos up to a minute long.
  2. The release of Sora has caused a lot of excitement and discussion in the generative AI community and media outlets.
  3. While OpenAI has not revealed extensive technical details about Sora, the model includes some clever engineering optimizations.
28 implied HN points 03 Dec 24
  1. Cross-modal distillation allows one model to teach another model that works with a different type of data. This means you can share knowledge even if the models are processing images, text, or something else entirely.
  2. This method can be really helpful when there's not much paired data available. It helps improve the learning process in situations where gathering data might be difficult.
  3. Hugging Face’s Gradio lets developers create AI applications for the web easily. It's a neat tool that helps bring AI to everyday use in a user-friendly way; a minimal demo is sketched after this list.
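Gradio's interface API is small enough that a working demo fits in a few lines. A minimal sketch, where classify() is just a stand-in for whatever model you actually want to expose:

```python
# A minimal Gradio demo: wrap a Python function in a web UI.
# classify() is a placeholder for a real model call.
import gradio as gr

def classify(text: str) -> str:
    # Toy "model": flag text that mentions AI.
    return "AI-related" if "ai" in text.lower() else "other"

demo = gr.Interface(fn=classify, inputs="text", outputs="text",
                    title="Tiny text classifier")

if __name__ == "__main__":
    demo.launch()  # serves the app locally in the browser
```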
35 implied HN points 05 Nov 24
  1. Knowledge distillation helps make large AI models smaller and cheaper. This is important for using AI on devices like smartphones; the core distillation loss is sketched after this list.
  2. A key goal of this process is to keep the accuracy of the original model while reducing its size.
  3. The series will include reviews of research papers and discussions on frameworks like Google's Data Commons that support factual knowledge in AI.
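The core of the technique is compact: the student is trained against the teacher's temperature-softened output distribution as well as the hard labels. A minimal PyTorch sketch of that loss, with the temperature and weighting chosen arbitrarily for illustration:

```python
# Classic knowledge-distillation loss: blend cross-entropy on hard labels
# with a KL term on temperature-softened teacher logits.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature: float = 2.0, alpha: float = 0.5):
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)      # teacher's softened distribution
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    kd = F.kl_div(soft_student, soft_targets, reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)                         # standard hard-label loss
    return alpha * kd + (1 - alpha) * ce

# Example shapes: batch of 8, 10 classes.
s = torch.randn(8, 10)          # student logits
t = torch.randn(8, 10)          # teacher logits (would come from the large, frozen model)
y = torch.randint(0, 10, (8,))  # ground-truth labels
print(distillation_loss(s, t, y))
```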
294 implied HN points 26 Apr 23
  1. Semantic Kernel enables developers to create AI applications using large language models without writing complex code or training custom models.
  2. Memory systems and data connectors play a crucial role in enhancing productivity and efficiency in LLM-based applications.
  3. Hybrid programming, mixing natural language with traditional programming languages, can automate tasks like creating educational content and contract Q&A, producing results faster and with fewer errors; the pattern is sketched after this list.
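The hybrid-programming idea is easier to see in code than to describe. Below is a generic sketch of the pattern, not Semantic Kernel's actual API: a natural-language prompt template is treated as a callable "function" and mixed with ordinary Python; complete() is a hypothetical stand-in for whatever LLM client you use.

```python
# Generic sketch of "hybrid programming": a prompt template used as a function
# alongside regular code. complete() is a hypothetical LLM call, not a real API.
from textwrap import dedent

def complete(prompt: str) -> str:
    raise NotImplementedError("plug in your LLM client here")

def make_quiz(topic: str, n_questions: int) -> str:
    # The natural-language "function body"...
    prompt = dedent(f"""
        Write {n_questions} short quiz questions about {topic},
        each followed by its answer on the next line.
    """).strip()
    # ...invoked from conventional code, whose result can be validated,
    # post-processed, or fed into the next step of a normal program.
    return complete(prompt)

# quiz = make_quiz("transformer attention", 3)
```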
133 implied HN points 25 Jan 24
  1. Two new LLM reasoning methods, COSP and USP, have been developed by Google Research to enhance common sense reasoning capabilities in language models.
  2. Prompt generation is crucial for LLM-based applications, and few-shot setups have reduced the amount of data needed to adapt models to a task.
  3. Models with strong zero-shot performance can eliminate the need for manual prompt generation, but may produce weaker results because they operate without task-specific guidance; the consistency-based idea is roughly sketched after this list.
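The rough idea behind consistency-based self-adaptive prompting can be sketched in a few lines: sample several zero-shot answers, keep one the model is consistent about, and prepend it as a demonstration for the final query. This is only an illustration of the idea, not Google's implementation; generate() is a hypothetical sampling call.

```python
# Rough illustration of consistency-based prompt bootstrapping:
# sample zero-shot answers, keep the most self-consistent one as a pseudo-demo.
from collections import Counter

def generate(prompt: str, n: int) -> list[str]:
    raise NotImplementedError("hypothetical: sample n completions from an LLM")

def build_prompt_with_pseudo_demo(demo_question: str, target_question: str) -> str:
    answers = generate(f"Q: {demo_question}\nA:", n=8)    # zero-shot samples
    best, votes = Counter(answers).most_common(1)[0]      # majority answer
    if votes < 4:                                         # too inconsistent: stay zero-shot
        return f"Q: {target_question}\nA:"
    # Otherwise reuse the consistent answer as an in-context demonstration.
    return f"Q: {demo_question}\nA: {best}\n\nQ: {target_question}\nA:"
```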
98 implied HN points 07 Mar 24
  1. SGLang is a new open-source project from UC Berkeley designed to make interactions with Large Language Models (LLMs) faster and easier to manage.
  2. SGLang integrates backend runtime systems with frontend languages to provide better control over LLMs, aiming to optimize the processes involved in working with these models.
  3. The framework, created by the LMSYS team, offers optimizations that can speed up LLM inference by up to 5x; its frontend style is sketched after this list.
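SGLang's frontend expresses a generation program as a decorated Python function. A sketch modeled on the project's documented examples; treat names and exact signatures as assumptions that may differ across versions:

```python
# Sketch of an SGLang frontend program, modeled on the project's examples;
# names and signatures are approximate and may vary by version.
import sglang as sgl

@sgl.function
def qa(s, question):
    s += sgl.system("You are a concise assistant.")
    s += sgl.user(question)
    s += sgl.assistant(sgl.gen("answer", max_tokens=128))

# With a running SGLang server as backend:
# sgl.set_default_backend(sgl.RuntimeEndpoint("http://localhost:30000"))
# state = qa.run(question="What does SGLang optimize?")
# print(state["answer"])
```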
98 implied HN points 22 Feb 24
  1. Knowledge augmentation is crucial in LLM-based applications with new techniques constantly evolving to enhance LLMs by providing access to external tools or data.
  2. Exploring the concept of augmenting LLMs with other LLMs involves merging general-purpose anchor models with specialized ones to unlock new capabilities, such as combining code understanding with language generation.
  3. The process of combining different LLMs might require additional training or fine-tuning of the models, but can be hindered by computational costs and data privacy concerns.
91 implied HN points 11 Mar 24
  1. Traditional software development practices like automation and testing suites are valuable when evaluating Large Language Models (LLMs) for AI applications.
  2. Different types of evaluations, including judgment return types and sources, are important for assessing LLMs effectively.
  3. A robust evaluation process for LLM applications involves interactive, batch offline, and monitoring online stages to support rapid iteration cycles and performance improvements; a minimal batch-offline loop is sketched after this list.
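The batch-offline stage in particular looks a lot like an ordinary test suite. A minimal sketch, assuming a hypothetical run_app() call standing in for the LLM application and a tiny hand-written eval set:

```python
# Minimal batch-offline evaluation loop: run the app over a fixed eval set
# and record a judgment per case. run_app() is a hypothetical stand-in.
def run_app(prompt: str) -> str:
    raise NotImplementedError("call your LLM application here")

EVAL_SET = [
    # (input, judge function returning a boolean judgment)
    ("What is 2 + 2?", lambda out: "4" in out),
    ("Name the capital of France.", lambda out: "paris" in out.lower()),
]

def evaluate():
    results = []
    for prompt, judge in EVAL_SET:
        output = run_app(prompt)
        results.append({"prompt": prompt, "output": output, "passed": bool(judge(output))})
    passed = sum(r["passed"] for r in results)
    print(f"{passed}/{len(results)} cases passed")
    return results
```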
217 implied HN points 10 Apr 23
  1. Using a semantic cache can improve LLM application performance by reducing retrieval times and API call expenses.
  2. Caching LLM responses can enhance scalability by reducing the load on the LLM service and improving user experience by reducing network latency.
  3. GPTCache is an open-source semantic cache designed for storing LLM responses efficiently and offers various customization options; the general pattern is sketched after this list.
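The underlying pattern is simple: embed the incoming query, look for a previously answered query whose embedding is close enough, and only call the LLM on a miss. A generic sketch of that pattern (not GPTCache's actual API), with embed() and call_llm() as hypothetical stand-ins:

```python
# Generic semantic cache: reuse a cached answer when a new query's embedding
# is close enough to a previously seen one. embed()/call_llm() are placeholders.
import numpy as np

def embed(text: str) -> np.ndarray:
    raise NotImplementedError("hypothetical: return an embedding vector")

def call_llm(prompt: str) -> str:
    raise NotImplementedError("hypothetical: call the underlying LLM")

class SemanticCache:
    def __init__(self, threshold: float = 0.9):
        self.threshold = threshold
        self.entries: list[tuple[np.ndarray, str]] = []   # (embedding, cached response)

    def query(self, prompt: str) -> str:
        q = embed(prompt)
        for vec, response in self.entries:                # linear scan; a vector index scales better
            cos = float(np.dot(q, vec) / (np.linalg.norm(q) * np.linalg.norm(vec)))
            if cos >= self.threshold:
                return response                           # cache hit: skip the API call
        response = call_llm(prompt)                       # cache miss
        self.entries.append((q, response))
        return response
```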
84 implied HN points 25 Feb 24
  1. Google released Gemma, a family of small open-source language models based on the architecture of its Gemini model. Gemma is designed to be more accessible and easier to work with than larger models; a short loading example follows this list.
  2. Open-source efforts in generative AI, like Gemma, are gaining traction with companies like Google and Microsoft investing in smaller, more manageable models. This shift aims to make advanced AI models more widely usable and customizable.
  3. The rise of small language models (SLMs) like Gemma showcases a growing movement towards more efficient and specialized AI solutions. Companies are exploring ways to make AI technology more practical and adaptable for various applications.
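Loading Gemma through Hugging Face transformers is about as short as it gets. A sketch, assuming the google/gemma-2b checkpoint is the one you want and that you have accepted its license on the Hub, which gated models may require:

```python
# Load a small open model (here Gemma 2B) with Hugging Face transformers and
# generate a short completion. Access to the weights may require accepting
# the model license on the Hub first.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/gemma-2b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("Small language models are useful because", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=30)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```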
84 implied HN points 19 Feb 24
  1. The event offers real-world insights from engineering leaders on ML model deployment and best practices.
  2. Participants can engage in sponsor-free knowledge sharing sessions with peers, focusing on in-depth discussions.
  3. Attendees have the opportunity to network with a diverse group of AI and ML engineers, including industry veterans and emerging leaders.
203 implied HN points 06 Apr 23
  1. Alpaca is a language model from Stanford University that can follow instructions and is smaller than GPT-3.5.
  2. Instruction-following models like GPT-3.5 have issues with false information, social stereotypes, and toxic language.
  3. Academic research on instruction-following models is challenging due to limited availability of models similar to closed-source ones like OpenAI's text-davinci-003.
77 implied HN points 03 Mar 24
  1. Genie by Google DeepMind can create 2D video games from text, opening doors to interactive environments in simulations, gaming, and robotics.
  2. BitNet b1.58, a ternary-weight (1.58-bit) model from Microsoft Research and the University of Chinese Academy of Sciences, offers cost-efficient, high-performance training and inference for Large Language Models (LLMs); its weight quantization is sketched after this list.
  3. The pace of research in generative AI is rapid, leading to groundbreaking advancements like Genie and BitNet b1.58.
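A quick sketch of the absmean ternary quantization the BitNet b1.58 paper describes: scale each weight tensor by its mean absolute value, then round and clip to {-1, 0, +1}. Details such as where the scale is reapplied at inference are simplified here.

```python
# Simplified absmean ternary quantization in the spirit of BitNet b1.58:
# weights are scaled by their mean absolute value and rounded to {-1, 0, +1}.
import torch

def ternary_quantize(w: torch.Tensor, eps: float = 1e-5):
    scale = w.abs().mean()                                 # per-tensor absmean scale
    w_q = torch.clamp(torch.round(w / (scale + eps)), -1, 1)
    return w_q, scale                                      # dequantize as w_q * scale

w = torch.randn(4, 4)
w_q, scale = ternary_quantize(w)
print(w_q)                               # entries are in {-1.0, 0.0, 1.0}
print((w_q * scale - w).abs().mean())    # rough reconstruction error
```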
70 implied HN points 14 Mar 24
  1. Time series forecasting is crucial in various fields like retail, finance, manufacturing, healthcare, and more, despite lagging behind other areas in AI development.
  2. Google has introduced TimesFM, a pretrained model with 200M parameters trained on over 100 billion time-series data points, aiming to advance forecasting accuracy.
  3. TimesFM will soon be accessible in Vertex AI, showcasing a shift toward leveraging pretrained models for time series forecasting.
77 implied HN points 18 Feb 24
  1. Last week saw the release of five major foundation models in the generative AI space, each from a different tech giant, showcasing innovative advancements in various areas like text-to-video generation and multilingual support.
  2. These new models are not only significant for the future of generative AI applications but also highlight the unique innovations and contributions made by different companies in the AI field.
  3. The continuous evolution and release of these super models are driving progress and setting new standards in the field of generative AI, pushing boundaries and inspiring further advancements.
182 implied HN points 03 Apr 23
  1. Vector similarity search is essential for recommendation systems, image search, and natural language processing.
  2. Vector search involves finding the vectors most similar to a query vector using measures such as L1 distance, L2 distance, and cosine similarity.
  3. Common vector search strategies include linear search, space partitioning, quantization, and hierarchical navigable small worlds (HNSW); the metrics and a brute-force search are sketched after this list.
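The similarity measures themselves, plus a brute-force linear search, fit in a few lines of NumPy; the partitioning, quantization, and HNSW strategies exist precisely to avoid this O(n) scan at scale:

```python
# Brute-force vector search with the three common (dis)similarity measures.
import numpy as np

def l1(a, b):     return np.abs(a - b).sum(axis=-1)                  # Manhattan distance
def l2(a, b):     return np.linalg.norm(a - b, axis=-1)              # Euclidean distance
def cosine(a, b): return (a @ b.T) / (np.linalg.norm(a) * np.linalg.norm(b, axis=-1))

rng = np.random.default_rng(0)
index = rng.normal(size=(10_000, 128))   # 10k stored vectors
query = rng.normal(size=(128,))

# Linear search: score every vector, take the top-k. Fine for small corpora;
# space partitioning, quantization, or HNSW take over when this gets too slow.
k = 5
top_k = np.argsort(l2(query, index))[:k]          # k smallest distances
print(top_k, l2(query, index)[top_k])
```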
14 implied HN points 29 Nov 24
  1. SmallCon is a free online conference for people interested in Generative AI. It's a great opportunity to learn from experts in the field.
  2. The conference will feature talks and discussions from big companies like Meta and DoorDash. Attendees will get insights on the latest trends and technologies in AI.
  3. You can register now to save your spot and gain knowledge on building effective AI models and applications. It's a chance to learn how to make the most out of small AI models.
56 implied HN points 18 Mar 24
  1. The Global Generative AI Landscape 2024 report by AIport offers insights into 107 international companies developing 128 generative models, expanding beyond typical American and European focus.
  2. The study covers six continents and more countries than previous similar projects, providing a comprehensive analysis of the global GenAI landscape.
  3. The report is reader-friendly and showcases how international companies are driving GenAI development, highlighting the widespread impact across various regions.
56 implied HN points 17 Mar 24
  1. Google DeepMind created a new model, SIMA, that can navigate any 3D environment by following language instructions.
  2. SIMA can translate abstract instructions into mouse and keyboard actions for navigating different 3D worlds.
  3. This AI breakthrough has implications for embodied AI environments, simulations, and other areas requiring physical tasks.