Now in beta · smile4u.in

All AI.
One API.

One SDK. Smart routing across OpenAI, Anthropic, Gemini, and Llama via Groq. Automatic failover. Cost optimisation. Zero lock-in.

Get started for free · View docs · See how it works

Integrates with Anthropic, OpenAI, Gemini, Llama

uniq_demo.py

from uniq import AI
 
ai = AI(api_key="uq_••••••••••••••••")
 
response = ai.complete(
    "Explain transformer architecture",
    model="auto",  # routes to best provider
)
 
print(response.text)
 
# ✓ Provider : anthropic → claude-sonnet-4-6
# ✓ Latency  : 1.24s   │  Tokens: 342  │  $0.0014

Python 3.10+ · uniq-ai 0.1.0

Providers

50+ Models

< 50ms Routing overhead

99.9% Uptime SLA

Platform

Everything you need,
nothing you don't.

Built for developers who want the simplest possible interface to the most powerful AI models — without sacrificing control.

Smart Routing

Auto-classify

Automatically classifies your prompt — code, math, analysis, translation — and routes to the model with the strongest performance for that task. No config required.
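
To make the classify-then-route idea concrete, here is a minimal sketch. It is not uniq's actual classifier: the keyword lists, category names, and model labels are hypothetical placeholders, and a production router would presumably rely on a trained classifier and measured benchmark scores rather than keyword overlap.

```python
import re

# Hypothetical keyword hints per task category (illustrative only).
CATEGORY_HINTS = {
    "code": {"function", "bug", "python", "compile", "refactor"},
    "math": {"solve", "integral", "equation", "prove", "derivative"},
    "translation": {"translate", "french", "spanish", "german"},
}

# Hypothetical category -> model table; the names are placeholders.
ROUTES = {
    "code": "model-strong-at-code",
    "math": "model-strong-at-math",
    "translation": "model-strong-at-translation",
    "analysis": "general-purpose-model",
}


def classify(prompt: str) -> str:
    """Return the category whose hint words overlap the prompt the most."""
    words = set(re.findall(r"[a-z]+", prompt.lower()))
    best, best_hits = "analysis", 0  # fall back to general analysis
    for category, hints in CATEGORY_HINTS.items():
        hits = len(words & hints)
        if hits > best_hits:
            best, best_hits = category, hits
    return best


def route(prompt: str) -> str:
    """Map a prompt to a model label via its category."""
    return ROUTES[classify(prompt)]
```

With these tables, `route("Fix the bug in this Python function")` selects the code entry, while a prompt that matches no hints falls back to the general-purpose entry.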

Automatic Failover

< 50ms switch

If a provider is down, rate-limited, or timing out, the request is immediately retried against the next healthy provider in your fallback chain, transparently to your app.
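
A fallback chain can be sketched as a loop over providers that moves on whenever a call raises. This is an illustration under stated assumptions, not the service's implementation: `complete_with_failover`, `AllProvidersFailed`, and the two stand-in providers are hypothetical names, and health checks are reduced to "the call raised an exception".

```python
from typing import Callable, Sequence


class AllProvidersFailed(Exception):
    """Raised when every provider in the chain has failed."""


def complete_with_failover(
    prompt: str,
    chain: Sequence[tuple[str, Callable[[str], str]]],
) -> tuple[str, str]:
    """Try each (name, call) pair in order; return (provider_name, text)."""
    errors = []
    for name, call in chain:
        try:
            return name, call(prompt)
        except Exception as exc:  # outage, rate limit, timeout, ...
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


# Stand-in providers for demonstration; real ones would make network calls.
def flaky_provider(prompt: str) -> str:
    raise TimeoutError("simulated timeout")


def healthy_provider(prompt: str) -> str:
    return f"echo: {prompt}"
```

Calling `complete_with_failover("hi", [("primary", flaky_provider), ("backup", healthy_provider)])` returns the backup's answer, and the caller never sees the simulated timeout.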

Cost Optimisation

Save up to 80%

Routes to the cheapest model capable of handling the request. The routing engine balances latency, quality, and per-token cost in real time across all configured providers.
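
One simple way to read "cheapest model capable of handling the request" is: filter by quality and latency constraints, then take the lowest price. The sketch below does exactly that. All prices, quality scores, and latencies are invented for illustration, and the real routing engine may weigh these factors differently.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelInfo:
    name: str
    usd_per_1k_tokens: float  # hypothetical price
    quality: float            # hypothetical score in [0, 1]
    p50_latency_s: float      # hypothetical median latency


def cheapest_capable(models, min_quality, max_latency_s):
    """Return the lowest-priced model meeting both constraints, or None."""
    eligible = [
        m for m in models
        if m.quality >= min_quality and m.p50_latency_s <= max_latency_s
    ]
    return min(eligible, key=lambda m: m.usd_per_1k_tokens, default=None)


# Made-up catalogue used only to exercise the function.
CATALOGUE = [
    ModelInfo("small", 0.0002, 0.60, 0.4),
    ModelInfo("medium", 0.0010, 0.80, 0.9),
    ModelInfo("large", 0.0100, 0.95, 2.0),
]
```

Asking for quality of at least 0.75 within 1.5 seconds selects `medium` here: `small` is cheaper but below the quality bar, and `large` is too slow.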

Observability

Per-key tracking

Every call logged with provider, model, token counts, latency, and cost. Per-request traces, hourly latency percentiles, and live charts in the dashboard.
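
Hourly latency percentiles can be derived from per-call logs with a few lines of code. The sketch below assumes each record is a `(epoch_seconds, latency_seconds)` pair and uses the nearest-rank percentile definition. Both the record shape and the choice of definition are assumptions, not a description of the dashboard's internals.

```python
import math
from collections import defaultdict


def percentile(values, pct):
    """Nearest-rank percentile: smallest value with at least pct% of data at or below it."""
    if not values:
        raise ValueError("no values")
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def hourly_percentiles(records, pcts=(50, 95)):
    """Group (epoch_seconds, latency_s) records by hour and compute percentiles."""
    buckets = defaultdict(list)
    for ts, latency in records:
        buckets[int(ts // 3600)].append(latency)
    return {
        hour: {p: percentile(lats, p) for p in pcts}
        for hour, lats in sorted(buckets.items())
    }
```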

API Key Management

Instant revoke

Generate scoped uq_* keys with RPM limits, daily token quotas, and monthly spend caps. Revoke compromised credentials instantly without redeploying.
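
The limits described above could be enforced with a small per-key state object. The following is an in-memory sketch assuming a sliding 60-second window for RPM and a running spend total. `KeyLimits` and its methods are hypothetical names, daily token quotas are omitted for brevity, and a real service would keep this state in shared storage.

```python
from collections import deque


class KeyLimits:
    """Per-key RPM limit and monthly spend cap (in-memory sketch)."""

    def __init__(self, rpm_limit: int, monthly_cap_usd: float):
        self.rpm_limit = rpm_limit
        self.monthly_cap_usd = monthly_cap_usd
        self.spent_usd = 0.0
        self.revoked = False
        self._recent = deque()  # timestamps of requests in the last 60 s

    def allow(self, now: float) -> bool:
        """Return True and record the request if the key may be used at time `now`."""
        if self.revoked or self.spent_usd >= self.monthly_cap_usd:
            return False
        while self._recent and now - self._recent[0] >= 60:
            self._recent.popleft()  # drop requests older than the window
        if len(self._recent) >= self.rpm_limit:
            return False
        self._recent.append(now)
        return True

    def record_cost(self, usd: float) -> None:
        """Add the cost of a completed call to the running monthly total."""
        self.spent_usd += usd
```

Setting `revoked = True` takes effect on the very next `allow` call, which is the property that makes revocation possible without redeploying the application.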

SDK-First Design

pip install uniq-ai

Typed Python SDK with sync and async clients, streaming, tool calling, and exponential-backoff retries baked in. Drop-in compatible with the OpenAI client format.
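
For readers unfamiliar with exponential-backoff retries, here is a generic sketch of the technique. It does not show the SDK's own retry code: `retry_with_backoff` and its parameters are hypothetical, and the delay doubling and 10% jitter are one common choice among several.

```python
import random
import time


def retry_with_backoff(call, max_attempts=4, base_delay=0.5, max_delay=8.0,
                       sleep=time.sleep):
    """Call `call()` until it succeeds, doubling the wait after each failure."""
    for attempt in range(max_attempts):
        try:
            return call()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            delay = min(max_delay, base_delay * 2 ** attempt)
            sleep(delay + random.uniform(0, delay / 10))  # small jitter
```

The `sleep` parameter exists so tests can pass a stub instead of actually waiting.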

RAG Pipeline

BM25 search

Ingest documents, search with BM25 ranking, and ground LLM answers in your own data. The /rag CLI command indexes files in seconds — no vector DB required.
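
BM25 ranks documents by term overlap with the query, weighted by how rare each term is and normalised by document length, which is why no embedding model or vector store is needed. The sketch below implements the standard Okapi BM25 formula with common default parameters (`k1=1.5`, `b=0.75`). The tokenizer and function names are assumptions for illustration, and the function expects a non-empty document list.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def bm25_rank(query, docs, k1=1.5, b=0.75):
    """Return document indices sorted by descending BM25 score for `query`."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n
    # Document frequency: number of documents containing each term.
    df = Counter(term for t in tokenized for term in set(t))
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return sorted(range(n), key=lambda i: scores[i], reverse=True)
```

The top-ranked documents would then be placed in the prompt as context so the model's answer is grounded in them.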

Benchmarking

MMLU · HumanEval

Run MMLU, HumanEval, and GSM8K suites against any provider/model combination. CI regression checks and a public leaderboard included out of the box.
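
A CI regression check typically compares a fresh benchmark score with a stored baseline and fails the build if the score drops by more than a tolerance. The sketch below assumes exact-match scoring, which suits multiple-choice or final-answer suites. HumanEval instead scores by running generated code against tests, so it would need a different scorer. The function names and the default tolerance are hypothetical.

```python
def accuracy(predictions, references):
    """Fraction of predictions that exactly match the reference answer."""
    if len(predictions) != len(references):
        raise ValueError("length mismatch")
    if not references:
        raise ValueError("empty benchmark")
    correct = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return correct / len(references)


def check_regression(current, baseline, tolerance=0.01):
    """Return True if `current` has not dropped more than `tolerance` below `baseline`."""
    return current >= baseline - tolerance
```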

Cron Jobs

every 5m

Schedule recurring LLM tasks — summaries, reports, alerts — using a simple cron expression. Results are stored and accessible via API or the dashboard.
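
An "every 5m" schedule corresponds to the cron expression `*/5 * * * *`. The sketch below shows how a scheduler might test whether a five-field expression fires at a given minute. It is a simplified matcher that supports only `*`, `*/n`, and comma-separated integers, and it requires all five fields to match, whereas standard cron treats day-of-month and day-of-week as alternatives when both are restricted. The function names are hypothetical.

```python
from datetime import datetime


def _field_matches(field: str, value: int, minimum: int = 0) -> bool:
    """Match one cron field supporting `*`, `*/n`, and comma-separated integers."""
    for part in field.split(","):
        if part == "*":
            return True
        if part.startswith("*/"):
            # Steps count from the start of the field's range.
            if (value - minimum) % int(part[2:]) == 0:
                return True
        elif int(part) == value:
            return True
    return False


def cron_matches(expr: str, when: datetime) -> bool:
    """Return True if the five-field cron expression fires at minute `when`."""
    minute, hour, day, month, weekday = expr.split()
    return (
        _field_matches(minute, when.minute)
        and _field_matches(hour, when.hour)
        and _field_matches(day, when.day, minimum=1)
        and _field_matches(month, when.month, minimum=1)
        # cron counts Sunday as 0; isoweekday() counts it as 7
        and _field_matches(weekday, when.isoweekday() % 7)
    )
```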

Orgs & Teams

OIDC / SSO

Create organisations, invite members, assign roles, and enforce per-org monthly budget limits. SSO via OIDC/SAML for enterprise teams.

VS Code + MCP

Native extension

Inline AI completion straight in your editor via the VS Code extension. MCP server exposes the full API to Claude Desktop and any MCP-compatible client.

Slack Integration

Multi-turn chat

Add the uniq bot to any Slack channel. Multi-turn conversations, file uploads, and slash commands — all routed through your configured providers and keys.

Pricing

Simple, transparent pricing.

Start free. Scale when you're ready. No hidden fees, no egress charges.

Free

For side projects and exploration.

$0 / mo

Start for free

1,000 requests / month
All 5 providers
Smart routing
API key management
RAG pipeline
Usage analytics (7 days)
Community support
Not included:
Auto failover
Cost optimisation
Cron jobs
Orgs & Teams

Builder

For developers shipping real products.

$29 / mo

Start building

50,000 requests / month
All 5 providers
Smart routing + auto failover
Cost optimisation
RAG + Benchmarking
Cron jobs
VS Code extension
Usage analytics (30 days)
Email support
Not included:
Orgs & Teams
SSO
Audit logs

Pro

For teams with production workloads.

$99 / mo

Get Pro

500,000 requests / month
All providers + Ollama local
Smart routing + custom rules
Auto failover + cost optimisation
RAG, Bench, Cron, Fine-tune
Orgs & Teams
Usage analytics (90 days)
Rate limit + spend controls
99.9% uptime SLA
Priority support
Not included:
Unlimited requests
SSO / SAML
Audit logs

Enterprise

For large-scale, compliance-critical deployments.

Custom

Talk to us

Unlimited requests
All providers + private models
Custom routing logic
Dedicated infrastructure
Unlimited analytics retention
SSO / SAML
Audit logs
99.99% uptime SLA
Dedicated support + SLA

All plans include API key management and the full Python SDK. Prices in USD. Cancel anytime.
