Portkey provides a robust and secure gateway for integrating various Large Language Models (LLMs) into your applications, including Deepinfra’s hosted models. With Portkey, you can take advantage of features like fast AI gateway routing, observability, prompt management, and more, while securely managing your API keys through the Model Catalog.

Quick Start

Get Deepinfra working in 3 steps:
# 1. Install: pip install portkey-ai
# 2. Add a @deepinfra provider in the Model Catalog
# 3. Use it:

from portkey_ai import Portkey

portkey = Portkey(api_key="PORTKEY_API_KEY")

response = portkey.chat.completions.create(
    model="@deepinfra/nvidia/Nemotron-4-340B-Instruct",
    messages=[{"role": "user", "content": "Say this is a test"}]
)

print(response.choices[0].message.content)
Tip: You can also set provider="@deepinfra" in Portkey() and use just model="nvidia/Nemotron-4-340B-Instruct" in the request.
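
A minimal sketch of that alternative, pinning the provider once on the client so each request names only the model:

from portkey_ai import Portkey

# Provider is fixed at the client level; requests omit the @deepinfra prefix.
portkey = Portkey(
    api_key="PORTKEY_API_KEY",
    provider="@deepinfra",
)

response = portkey.chat.completions.create(
    model="nvidia/Nemotron-4-340B-Instruct",
    messages=[{"role": "user", "content": "Say this is a test"}],
)

print(response.choices[0].message.content)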

Add Provider in Model Catalog

  1. Go to Model Catalog → Add Provider
  2. Select Deepinfra
  3. Choose existing credentials or create new ones by entering your Deepinfra API key
  4. Name your provider (e.g., deepinfra-prod); this slug is used in the sketch after these steps
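
Once the provider is saved, requests can reference it by its slug prefixed with @. A short sketch, assuming the example name deepinfra-prod from step 4; substitute whatever you named yours:

from portkey_ai import Portkey

portkey = Portkey(api_key="PORTKEY_API_KEY")

# "deepinfra-prod" is the example slug from step 4, not a fixed value.
response = portkey.chat.completions.create(
    model="@deepinfra-prod/meta-llama/Meta-Llama-3.1-405B-Instruct",
    messages=[{"role": "user", "content": "Say this is a test"}],
)

print(response.choices[0].message.content)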

Complete Setup Guide →

See all setup options, code examples, and detailed instructions

Supported Models

Deepinfra hosts a wide range of open-source models for text generation. View the complete list:

Deepinfra Models

Browse all available models on Deepinfra

Popular models include:
  • nvidia/Nemotron-4-340B-Instruct
  • meta-llama/Meta-Llama-3.1-405B-Instruct
  • Qwen/Qwen2.5-72B-Instruct
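
Any of these can be swapped into the quick-start call by changing the model string. As a sketch, the Portkey SDK also accepts the OpenAI-style stream=True option, which yields tokens incrementally; the model choice here is illustrative:

from portkey_ai import Portkey

portkey = Portkey(api_key="PORTKEY_API_KEY")

# stream=True returns OpenAI-style chunks instead of one full response.
stream = portkey.chat.completions.create(
    model="@deepinfra/Qwen/Qwen2.5-72B-Instruct",  # any model from the list above
    messages=[{"role": "user", "content": "Write a haiku about gateways"}],
    stream=True,
)

for chunk in stream:
    # The final chunk may carry no content, so guard before printing.
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)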

Next Steps

For complete SDK documentation:

SDK Reference

Complete Portkey SDK documentation