Intro to LLM Gateway

An LLM Gateway provides a unified interface to manage your organization's LLM usage. Below are the key features that enhance functionality and security:

Unified API: Access multiple LLM providers through a single OpenAI compatible interface, requiring no code changes.
API Key Security: Secure and centralized management of credentials.
Governance & Control: Set limits, enforce access controls, and apply content filtering to manage how LLMs are used within your organization.
Rate Limiting: Implement measures to prevent abuse and ensure fair usage across users.
Observability: Track and analyse usage, costs, latency, and overall performance.
Cost Management: Monitor spending and configure budget alerts to keep expenses under control.
Audit Trails: Maintain logs of all interactions with LLMs to support compliance and auditing requirements.

LLM Playground

The LLM Playground is a UI for the LLM Gateway where you can interact with the various models you've added. Below is an overview of the features:

Support for multiple model types
1. Chat Models
2. Completion Models
3. Embedding Models
Image Upload: Upload images for image captioning or visual question answering. This is only available for models that support images such as GPT-4o
Model Comparison: Compare responses from different completion models to evaluate their performance.
System Prompts: Use predefined system prompts to guide model behaviour.System prompt inform how the model should respond. Sample system prompt - Be clear, concise, and polite in your responses. Avoid sharing any sensitive or personal information.

Speed features of LLM Gateway

Near-Zero Overhead: Just 3-5 ms added latency
Optimised Backend: Built with performant Node.js framework and optimised config caching
Smart Routing: Minimal processing overhead
Edge Ready: Deploy close to your apps