Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 39 implied HN points • 18 Mar 24
- Long context windows (LCWs) and retrieval-augmented generation (RAG) serve different purposes and won’t replace each other. LCWs work well when asking multiple questions at once, while RAG is better for separate inquiries.
- Using LCWs can get really expensive because they involve processing a lot of data at once. In contrast, RAG uses smaller, focused data chunks, which helps keep costs down.
- Research shows that LLMs perform better when important information is at the start or end of a long context. So, relying only on LCWs can lead to problems since crucial details may get overlooked.