The driving factor limiting context window size is the quadratic scaling of self-attention in transformers: attention compares every token against every other token, so compute and memory grow with the square of the sequence length.
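A minimal sketch of single-head self-attention (not any particular library's implementation) makes the quadratic cost concrete: the score matrix holds one entry per pair of tokens, so doubling the sequence length quadruples it.

```python
import numpy as np

def self_attention(x: np.ndarray) -> np.ndarray:
    """Single-head self-attention over x of shape (n, d)."""
    n, d = x.shape
    q, k, v = x, x, x                      # identity projections, for brevity
    scores = q @ k.T / np.sqrt(d)          # shape (n, n): the quadratic term
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ v                     # shape (n, d)

for n in (1_000, 2_000, 4_000):
    # 1e6 -> 4e6 -> 16e6 score entries as the context length doubles
    print(n, "tokens ->", n * n, "attention scores")
```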
New research explores alternative mechanisms such as Hyena operators, state space models, and hierarchical attention to improve context window efficiency.
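To illustrate why such alternatives scale better, here is a toy diagonal state space model recurrence; the parameters are made-up values, not taken from any published model. It processes the sequence in a single pass, so time is linear in n and it never forms an n-by-n matrix.

```python
import numpy as np

def ssm_scan(x: np.ndarray, a: np.ndarray, b: np.ndarray, c: np.ndarray) -> np.ndarray:
    """Toy diagonal SSM: h_t = a * h_{t-1} + b * x_t, y_t = c . h_t.
    One pass over the sequence: O(n) time, O(state) memory."""
    n = x.shape[0]
    h = np.zeros_like(a)
    y = np.empty(n)
    for t in range(n):
        h = a * h + b * x[t]   # elementwise (diagonal A) state update
        y[t] = c @ h           # scalar readout
    return y

rng = np.random.default_rng(0)
state = 8
y = ssm_scan(rng.normal(size=1024),
             rng.uniform(0.5, 0.99, size=state),  # stable decay rates
             rng.normal(size=state),
             rng.normal(size=state))
print(y.shape)  # (1024,) produced without any quadratic score matrix
```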
For effective LLM performance, careful context curation and retrieval systems matter more than simply increasing the context window size.
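A minimal sketch of the curation idea, using a hypothetical corpus and simple bag-of-words vectors in place of a real embedding model: retrieve only the top-k most relevant chunks and place those in the prompt, rather than stuffing everything into a huge context window.

```python
from collections import Counter
import math

def bow(text: str) -> Counter:
    """Bag-of-words vector; a real system would use a learned embedding model."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the query."""
    q = bow(query)
    return sorted(chunks, key=lambda c: cosine(q, bow(c)), reverse=True)[:k]

# Hypothetical corpus: only the relevant chunks reach the model's context.
chunks = [
    "Self-attention compares every pair of tokens.",
    "State space models process sequences with a linear-time recurrence.",
    "The team meeting is scheduled for Thursday.",
]
context = "\n".join(retrieve("why does attention scale quadratically", chunks))
print(context)  # curated context, far smaller than the full corpus
```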