What is Retrieval-Augmented Generation? - Higress Technical Glossary

📖 Definition

RAG is a technique that combines information retrieval with text generation. It first retrieves relevant documents from a knowledge base, then uses the retrieval results as context input to the LLM to generate answers, effectively reducing model hallucinations and providing up-to-date information.

🔗 How Higress Uses This

Higress AI Gateway can integrate with vector databases, supporting knowledge retrieval routing and cache optimization in RAG scenarios to improve retrieval-augmented generation efficiency.

💡 Examples

1 Enterprise knowledge base Q&A systems use RAG to ensure answer accuracy
2 Customer service systems retrieve product documents through RAG to answer user questions
3 RAG can combine real-time data sources to provide the latest information

🔄 Related Terms

Large Language Model

Embedding

❓ FAQ

What is Retrieval-Augmented Generation?