CPU-only LLM (Vicuna-13B) with Web Search
4 HN points • 16 May 23 • 🕹 Technology • AI • Python
Experiments with running large language models on CPU, with no GPU required. llama.cpp enables real-time inference on CPU, trading speed for the ability to run larger models when more RAM is available. The post also integrates the Google Search API to retrieve search results and weave them into the LLM's responses; a sketch of that pipeline follows below.
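The post itself doesn't include code, but a minimal sketch of the described pipeline might look like this, assuming the llama-cpp-python bindings for CPU inference and the Google Custom Search JSON API for retrieval. The model filename, environment variable names, and prompt format are illustrative placeholders, not the post's actual setup.

```python
import os
import requests
from llama_cpp import Llama  # pip install llama-cpp-python

# Placeholders: the original post does not publish its exact paths or keys.
MODEL_PATH = "models/vicuna-13b.Q4_K_M.gguf"  # hypothetical quantized model file
GOOGLE_API_KEY = os.environ["GOOGLE_API_KEY"]
GOOGLE_CSE_ID = os.environ["GOOGLE_CSE_ID"]   # Custom Search Engine id


def google_search(query: str, num: int = 3) -> list[dict]:
    """Fetch the top results from the Google Custom Search JSON API."""
    resp = requests.get(
        "https://www.googleapis.com/customsearch/v1",
        params={"key": GOOGLE_API_KEY, "cx": GOOGLE_CSE_ID, "q": query, "num": num},
        timeout=10,
    )
    resp.raise_for_status()
    return resp.json().get("items", [])


def answer_with_search(question: str) -> str:
    # Load the model on CPU; n_ctx sized to hold the search snippets.
    llm = Llama(model_path=MODEL_PATH, n_ctx=2048)

    # Stuff retrieved titles and snippets into the prompt as context.
    snippets = "\n".join(
        f"- {item['title']}: {item['snippet']}" for item in google_search(question)
    )
    prompt = (
        "Use the web search results below to answer the question.\n\n"
        f"Search results:\n{snippets}\n\n"
        f"Question: {question}\nAnswer:"
    )
    out = llm(prompt, max_tokens=256, stop=["Question:"])
    return out["choices"][0]["text"].strip()


if __name__ == "__main__":
    print(answer_with_search("What is llama.cpp?"))
```

Stuffing snippets directly into the prompt is the simplest form of retrieval augmentation; a larger context window (`n_ctx`) lets more results fit at the cost of more RAM, which mirrors the RAM-for-capability trade-off the post describes.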
Coming soon
0 implied HN points • 14 May 23 • 🎭 Culture • Media
Gary Linscott's Substack newsletter is coming soon; you can subscribe at garylinscott.substack.com.