
AI Proxy Advanced
Configuration
Hide Child Parameters
Hide Child Parameters
Specifies in which cases an upstream response should be failover to the next target. Each option in the array is equivalent to the function of http://nginx.org/en/docs/http/ngx_http_proxy_module.html#proxy_next_upstream
Allowed values:errorhttp_403http_404http_429http_500http_502http_503http_504invalid_headernon_idempotenttimeout
Default:error, timeout
Hide Child Parameters
Hide Child Parameters
Set this if you are using an AWS provider (Bedrock) and you are authenticating using static IAM User credentials. Setting this will override the AWS_ACCESS_KEY_ID environment variable for this plugin instance.
This field is encrypted.
This field is referenceable.
Set this if you are using an AWS provider (Bedrock) and you are authenticating using static IAM User credentials. Setting this will override the AWS_SECRET_ACCESS_KEY environment variable for this plugin instance.
This field is encrypted.
This field is referenceable.
If azure_use_managed_identity is set to true, and you need to use a different user-assigned identity for this LLM instance, set the client ID.
This field is referenceable.
If azure_use_managed_identity is set to true, and you need to use a different user-assigned identity for this LLM instance, set the client secret.
This field is encrypted.
This field is referenceable.
If azure_use_managed_identity is set to true, and you need to use a different user-assigned identity for this LLM instance, set the tenant ID.
This field is referenceable.
Set this field to the full JSON of the GCP service account to authenticate, if required. If null (and gcp_use_service_account is true), Kong will attempt to read from environment variable GCP_SERVICE_ACCOUNT
.
This field is encrypted.
This field is referenceable.
If AI model requires authentication via Authorization or API key header, specify its name here.
This field is referenceable.
Specify the full auth header value for ‘header_name’, for example ‘Bearer key’ or just ‘key’.
This field is encrypted.
This field is referenceable.
If AI model requires authentication via query parameter, specify its name here.
This field is referenceable.
Specify the full parameter value for ‘param_name’.
This field is encrypted.
This field is referenceable.
Hide Child Parameters
Hide Child Parameters
Set this if you are using an AWS provider (Bedrock) and you are authenticating using static IAM User credentials. Setting this will override the AWS_ACCESS_KEY_ID environment variable for this plugin instance.
This field is encrypted.
This field is referenceable.
Set this if you are using an AWS provider (Bedrock) and you are authenticating using static IAM User credentials. Setting this will override the AWS_SECRET_ACCESS_KEY environment variable for this plugin instance.
This field is encrypted.
This field is referenceable.
If azure_use_managed_identity is set to true, and you need to use a different user-assigned identity for this LLM instance, set the client ID.
This field is referenceable.
If azure_use_managed_identity is set to true, and you need to use a different user-assigned identity for this LLM instance, set the client secret.
This field is encrypted.
This field is referenceable.
If azure_use_managed_identity is set to true, and you need to use a different user-assigned identity for this LLM instance, set the tenant ID.
This field is referenceable.
Set this field to the full JSON of the GCP service account to authenticate, if required. If null (and gcp_use_service_account is true), Kong will attempt to read from environment variable GCP_SERVICE_ACCOUNT
.
This field is encrypted.
This field is referenceable.
If AI model requires authentication via Authorization or API key header, specify its name here.
This field is referenceable.
Specify the full auth header value for ‘header_name’, for example ‘Bearer key’ or just ‘key’.
This field is encrypted.
This field is referenceable.
If AI model requires authentication via query parameter, specify its name here.
This field is referenceable.
Specify the full parameter value for ‘param_name’.
This field is encrypted.
This field is referenceable.
Hide Child Parameters
The model’s operation implementation, for this provider.
Allowed values:audio/v1/audio/speechaudio/v1/audio/transcriptionsaudio/v1/audio/translationsimage/v1/images/editsimage/v1/images/generationsllm/v1/assistantsllm/v1/batchesllm/v1/chatllm/v1/completionsllm/v1/embeddingsllm/v1/filesllm/v1/responsespreserverealtime/v1/realtime
Hide Child Parameters
Hide Child Parameters
Hide Child Parameters
Cluster addresses to use for Redis connections when the redis
strategy is defined. Defining this field implies using a Redis Cluster. The minimum length of the array is 1 element.
>= 1 characters
Show Child Parameters
Limits the total number of opened connections for a pool. If the connection pool is full, connection queues above the limit go into the backlog queue. If the backlog queue is full, subsequent connect operations fail and return nil
. Queued operations (subject to set timeouts) resume once the number of connections in the pool is less than keepalive_pool_size
. If latency is high or throughput is low, try increasing this value. Empirically, this value is larger than keepalive_pool_size
.
>= 0<= 2147483646
The size limit for every cosocket connection pool associated with every remote server, per worker process. If neither keepalive_pool_size
nor keepalive_backlog
is specified, no pool is created. If keepalive_pool_size
isn’t specified but keepalive_backlog
is specified, then the pool uses the default value. Try to increase (e.g. 512) this value if latency is high or throughput is low.
Default:256
>= 1<= 2147483646
Password to use for Redis connections. If undefined, no AUTH commands are sent to Redis.
This field is encrypted.
This field is referenceable.
Sentinel node addresses to use for Redis connections when the redis
strategy is defined. Defining this field implies using a Redis Sentinel. The minimum length of the array is 1 element.
>= 1 characters
Show Child Parameters
Sentinel password to authenticate with a Redis Sentinel instance. If undefined, no AUTH commands are sent to Redis Sentinels.
This field is encrypted.
This field is referenceable.
Sentinel username to authenticate with a Redis Sentinel instance. If undefined, ACL authentication won’t be performed. This requires Redis v6.2.0+.
This field is referenceable.
If set to true, verifies the validity of the server SSL certificate. If setting this parameter, also configure lua_ssl_trusted_certificate
in kong.conf
to specify the CA (or server) certificate used by your Redis server. You may also need to configure lua_ssl_verify_depth
accordingly.
Default:false
Username to use for Redis connections. If undefined, ACL authentication won’t be performed. This requires Redis v6.0.0+. To be compatible with Redis v5.x.y, you can set it to default
.
This field is referenceable.
If set, the plugin will activate only for requests where the specified has been authenticated. (Note that some plugins can not be restricted to consumers this way.). Leave unset for the plugin to activate regardless of the authenticated Consumer.
If set, the plugin will activate only for requests where the specified consumer group has been authenticated. (Note that some plugins can not be restricted to consumers groups this way.). Leave unset for the plugin to activate regardless of the authenticated Consumer Groups
A list of the request protocols that will trigger this plugin. The default value, as well as the possible values allowed on this field, may change depending on the plugin type. For example, plugins that only work in stream mode will only support tcp and tls.
Allowed values:grpcgrpcshttphttpswswss
Default:grpc, grpcs, http, https, ws, wss
If set, the plugin will only activate when receiving requests via the specified route. Leave unset for the plugin to activate regardless of the route being used.