RAG

Retrieval-Augmented Generation (RAG): LLMs retrieve relevant chunks from uploaded documents at each query.The Kaparthy LLM Wiki

Characteristics

  • Knowledge rediscovered from scratch every question.
  • No accumulation or persistence.
  • Examples: NotebookLM, ChatGPT file uploads, most RAG systems.

Contrast with LLM Wiki

  • RAG: Ephemeral retrieval.
  • LLM Wiki: Proactive maintenance of persistent wiki.

From 10-AI-Concepts

  • Fixes LLM knowledge cutoff & hallucinations: Retrieve relevant docs (vector DB) into prompt for grounded answers.
  • Vector databases: Store embeddings for semantic similarity search (e.g. “refund policy” finds “return”/“money back”).