Term

Retrieval-augmented generation (RAG)

Retrieving relevant documents at inference time and conditioning the model's generation on them.

Also known as: RAG

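RAG remains the dominant pattern for grounding LLM output in private or recent data, meaning data the model did not see during training. Modern variants blend retrieval into the agent loop, so the model can retrieve whenever a step needs it, rather than running retrieval once at the start.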
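
The retrieve-then-generate pattern described above can be sketched as follows. This is a minimal illustration, not a reference implementation: the word-overlap scorer stands in for a real retriever (typically embedding or BM25 search), and `generate` is a hypothetical callable standing in for whatever LLM client is in use. All function names here are assumptions made for the example.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score(query, doc):
    """Toy relevance score: number of query tokens that also occur in doc."""
    q = Counter(tokenize(query))
    d = Counter(tokenize(doc))
    return sum(min(q[t], d[t]) for t in q)


def retrieve(query, corpus, k=2):
    """Return up to k documents with a non-zero score, best first."""
    ranked = sorted(corpus, key=lambda doc: score(query, doc), reverse=True)
    return [doc for doc in ranked[:k] if score(query, doc) > 0]


def build_prompt(query, passages):
    """Condition generation on the retrieved passages by placing them in the prompt."""
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )


def rag_answer(query, corpus, generate, k=2):
    """Single-shot RAG: retrieve once, then generate.

    `generate` is a hypothetical stand-in for an LLM call (prompt -> text).
    """
    passages = retrieve(query, corpus, k)
    return generate(build_prompt(query, passages))


if __name__ == "__main__":
    corpus = [
        "The refund window is 30 days.",
        "Our office is in Lisbon.",
        "Shipping takes 5 business days.",
    ]
    # Echo the prompt instead of calling a model, so the sketch runs offline.
    print(rag_answer("How long is the refund window?", corpus, generate=lambda p: p))
```

In an agent-loop variant, `retrieve` would instead be exposed to the model as a tool it can call at any step, possibly several times with reformulated queries, rather than being invoked once before generation.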