Now in beta · smile4u.in

All AI.
One API.

One SDK. Smart routing across OpenAI, Anthropic, Gemini, and Llama via Groq. Automatic failover. Cost optimisation. Zero lock-in.

Integrates withAnthropicOpenAIGeminiLlama
uniq_demo.py
from uniq import AI
 
ai = AI(api_key="uq_••••••••••••••••")
 
response = ai.complete(
"Explain transformer architecture",
model="auto", # routes to best provider
)
 
print(response.text)
 
# ✓ Provider : anthropic → claude-sonnet-4-6
# ✓ Latency : 1.24s │ Tokens: 342 │ $0.0014
Connected to smile4u.in
Python 3.10+·uniq-ai 0.1.0
5+
Providers
50+
Models
< 50ms
Routing overhead
99.9%
Uptime SLA

Platform

Everything you need,
nothing you don't.

Built for developers who want the simplest possible interface to the most powerful AI models — without sacrificing control.

Smart Routing

Auto-classify

Automatically classifies your prompt — code, math, analysis, translation — and routes to the model with the strongest performance for that task. No config required.

Automatic Failover

< 50ms switch

If a provider is down, rate-limiting, or times out, the request is instantly retried against the next healthy provider in your fallback chain — fully transparent to your app.

Cost Optimisation

Save up to 80%

Routes to the cheapest model capable of handling the request. The routing engine balances latency, quality, and per-token cost in real time across all configured providers.

Observability

Per-key tracking

Every call logged with provider, model, token counts, latency, and cost. Per-request traces, hourly latency percentiles, and live charts in the dashboard.

API Key Management

Instant revoke

Generate scoped uq_* keys with RPM limits, daily token quotas, and monthly spend caps. Revoke compromised credentials instantly without redeploying.

SDK-First Design

pip install uniq-ai

Typed Python SDK with sync and async clients, streaming, tool calling, and exponential-backoff retries baked in. Drop-in compatible with the OpenAI client format.

RAG Pipeline

BM25 search

Ingest documents, search with BM25 ranking, and ground LLM answers in your own data. The /rag CLI command indexes files in seconds — no vector DB required.

Benchmarking

MMLU · HumanEval

Run MMLU, HumanEval, and GSM8K suites against any provider/model combination. CI regression checks and a public leaderboard included out of the box.

Cron Jobs

every 5m

Schedule recurring LLM tasks — summaries, reports, alerts — using a simple cron expression. Results are stored and accessible via API or the dashboard.

Orgs & Teams

OIDC / SSO

Create organisations, invite members, assign roles, and enforce per-org monthly budget limits. SSO via OIDC/SAML for enterprise teams.

VS Code + MCP

Native extension

Inline AI completion straight in your editor via the VS Code extension. MCP server exposes the full API to Claude Desktop and any MCP-compatible client.

Slack Integration

Multi-turn chat

Add the uniq bot to any Slack channel. Multi-turn conversations, file uploads, and slash commands — all routed through your configured providers and keys.

Pricing

Simple, transparent pricing.

Start free. Scale when you're ready. No hidden fees, no egress charges.

Free

For side projects and exploration.

$0/ mo
Start for free
  • 1,000 requests / month
  • All 5 providers
  • Smart routing
  • API key management
  • RAG pipeline
  • Usage analytics (7 days)
  • Community support
  • Auto failover
  • Cost optimisation
  • Cron jobs
  • Orgs & Teams

Pro

For teams with production workloads.

$99/ mo
Get Pro
  • 500,000 requests / month
  • All providers + Ollama local
  • Smart routing + custom rules
  • Auto failover + cost optimisation
  • RAG, Bench, Cron, Fine-tune
  • Orgs & Teams
  • Usage analytics (90 days)
  • Rate limit + spend controls
  • 99.9% uptime SLA
  • Priority support
  • Unlimited requests
  • SSO / SAML
  • Audit logs

Enterprise

For large-scale, compliance-critical deployments.

Custom
Talk to us
  • Unlimited requests
  • All providers + private models
  • Custom routing logic
  • Dedicated infrastructure
  • Unlimited analytics retention
  • SSO / SAML
  • Audit logs
  • 99.99% uptime SLA
  • Dedicated support + SLA

All plans include API key management and the full Python SDK. Prices in USD. Cancel anytime.