Retrieval-Augmented Generation (RAG)¶

Description¶

A variation of Prompt Enrichment where a search engine is used to add additional knowledge to a prompt to increase response output.

RAG is often used as a lower-cost method for including local knowledge with general knowledge.

Many RAG designs focus on the use of a Vector Database to find text that is similar to a given prompt.

RAG models where the parametric memory is a pre-trained seq2seq model and the non-parametric memory is a dense vector index of Wikipedia.