OpenAI-Compatible
Use the same SDKs and switch only the base URL.
Platform
Permute keeps model access, billing, and request history in one place.
Use the same SDKs and switch only the base URL.
Plans include text and image usage each month.
Direct API usage and connected app usage can be shown separately.
Request history is available for 30 days.
Overview
Permute is built for teams that want one API, one billing flow, and one account.
Permute runs as its own inference product with its own accounts, billing, and support.
Use stable model names, one billing flow, and one console.
The public product stays focused on keys, plans, models, billing, and status.
Principles
Drop-In Endpoint
Use one OpenAI-compatible endpoint for text and image requests, with billing, usage, and account access managed in the same console.
from openai import OpenAI
client = OpenAI(
base_url="https://api.permute.sh/v1",
api_key="pm_..."
)
response = client.chat.completions.create(
model="deepseek-v4-flash",
messages=[{"role": "user", "content": "Ship it"}]
)Model Catalog
Text and image generation ship together in v1, with service health surfaced through the console and status page.
Enterprise
Need a model that is not listed? For enterprise volume, Permute can host requested models on dedicated infrastructure, handle maintenance, and provide an API endpoint for your workload.
For enterprise plans or custom model hosting, contact our sales team.