Quick Start
Get started with Lepton AI in under 2 minutes:Add Provider in Model Catalog
Before making requests, add Lepton AI to your Model Catalog:- Go to Model Catalog → Add Provider
- Select Lepton AI
- Enter your Lepton API key
- Name your provider (e.g.,
lepton)
Complete Setup Guide
See all setup options and detailed configuration instructions
Lepton AI Capabilities
Chat Completions
Generate chat completions with Lepton’s serverless models:Speech-to-Text
Transcribe audio using Lepton’s Whisper models:Streaming
Enable streaming for real-time responses:Supported Models
Lepton AI provides serverless access to various models:| Model | Description |
|---|---|
| llama-3.1-8b | Llama 3.1 8B model |
| llama-3-8b-sft-v1 | Fine-tuned Llama 3 |
| whisper-large-v3 | Speech-to-text |
Next Steps
Gateway Configs
Add fallbacks, load balancing, and more
Observability
Monitor and trace your Lepton requests
Prompt Library
Manage and version your prompts
Metadata
Add custom metadata to requests
SDK Reference
Complete Portkey SDK documentation

