RAG

Retrieval-Augmented Generation (RAG): at each query, relevant chunks are retrieved from uploaded documents and passed to the LLM as context (The Karpathy LLM Wiki).

Characteristics

RAG optimizes LLM output by referencing authoritative knowledge bases outside training data without retraining.
Mitigates challenges such as hallucinations, static knowledge cut-offs, and lack of source attribution.
Pipelines involve three steps: indexing external data as embeddings, retrieving relevant information via vector search, and augmenting the LLM prompt with the retrieved text.
Cost-effective alternative to fine-tuning for injecting domain-specific or real-time data.
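
The three pipeline steps listed above (embed, retrieve, augment) can be sketched in a few lines of Python. This is a minimal illustration, not a production design: the bag-of-words `embed` function is a toy stand-in for a real embedding model, the in-memory list stands in for a vector database, and all function names and sample chunks are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: term counts (assumption: stands in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(chunks):
    """Step 1: create external data by embedding every document chunk."""
    return [(chunk, embed(chunk)) for chunk in chunks]


def retrieve(index, query, k=2):
    """Step 2: return the k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]


def augment_prompt(query, retrieved):
    """Step 3: place the retrieved chunks in the prompt so the LLM can cite them."""
    context = "\n".join(f"[{i + 1}] {c}" for i, c in enumerate(retrieved))
    return (
        "Answer using only the context below and cite sources by number.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )


# Hypothetical sample documents, already split into chunks.
chunks = [
    "The refund window is 30 days from the delivery date.",
    "Support is available by email on weekdays.",
    "Shipping to Canada takes five to seven business days.",
]
index = build_index(chunks)
query = "How many days is the refund window?"
top = retrieve(index, query, k=1)
prompt = augment_prompt(query, top)
print(prompt)
```

The resulting `prompt` string is what would be sent to the LLM. Because the knowledge lives in the index and not in the model weights, updating it only requires re-running `build_index` on new chunks, which is why no retraining is needed.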