Jan is an open source ChatGPT-alternative that runs 100% offline on your computer. It’s a desktop application designed for privacy-conscious users who want to chat with AI without sending their data to external servers. Jan can run powerful local models like Llama3, Gemma, and Mistral directly on your device, while also providing the flexibility to connect to cloud AI providers when needed.
Monitor your Jan AI usage through TrueFoundry’s metrics tab:With TrueFoundry’s AI gateway, you can monitor and analyze:
Performance Metrics: Track key latency metrics like Request Latency, Time to First Token (TTFS), and Inter-Token Latency (ITL) with P99, P90, and P50 percentiles
Cost and Token Usage: Gain visibility into your application’s costs with detailed breakdowns of input/output tokens and the associated expenses for each model
Usage Patterns: Understand how your application is being used with detailed analytics on user activity, model distribution, and team-based usage
Hybrid Usage: Monitor the balance between local model usage and cloud model requests through TrueFoundry
Rate limit and Load balancing: Set up rate limiting, load balancing and fallback for your models