Looking for the plugin's configuration parameters? You can find them in the AI RAG Injector configuration reference doc.
The AI RAG Injector plugin simplifies the creation of retrieval-augmented generation (RAG) pipelines by automatically injecting content from a vector database of choice on the existing requests.
This plugin provides the following benefits:
- Improves productivity and accelerates creating RAG pipelines, as you don’t have to build the semantic association.
- Lets you lock down sensitive vector databases, so that developers don’t have direct access. AI Gateway becomes the client, instead of the developer applications.
- Enables building RAG pipelines in more places, even in places where connectivity to the vector database was originally not possible.
How it works
- You configure the AI RAG Injector plugin via the Kong Admin API or decK, setting up the RAG content to send to the vector database.
- When a request reaches the AI Gateway, the plugin generates embeddings for request prompts, then queries the vector database for the top-k most similar embeddings.
- The plugin injects the retrieved content from the vector search result into the request body, and forwards the request to the upstream service.
Get started with the AI RAG Injector plugin
- AI Gateway quickstart: Set up AI Proxy
- Configuration reference
- Basic configuration example
- Learn how to use the plugin
All AI Gateway plugins
- AI Proxy
-
AI Proxy Advanced Available with Kong Gateway Enterprise subscription - Contact Sales
- AI Request Transformer
- AI Response Transformer
-
AI Semantic Cache Available with Kong Gateway Enterprise subscription - Contact Sales
-
AI Semantic Prompt Guard Available with Kong Gateway Enterprise subscription - Contact Sales
-
AI Rate Limiting Advanced Available with Kong Gateway Enterprise subscription - Contact Sales
-
AI Azure Content Safety Available with Kong Gateway Enterprise subscription - Contact Sales
- AI Prompt Template
- AI Prompt Guard
- AI Prompt Decorator