Intelligent Gateway
Semantic Routing Engine
Analyze the complexity of prompts in real-time. Automatically direct conversational tasks to Flash models, and logic puzzles or code generation to advanced reasoning models.
Equivalent Failovers
Define equivalent backup model clusters (e.g. mapping GPT-4o to Claude 3.5 Sonnet). If your primary endpoint experiences provider outages or rate limits, the gateway fails over instantly.
Telemetry Optimizer
Continuously monitor request latency and success metrics. The gateway dynamically logs response speeds to memory, automatically preferring the fastest provider in yourallowed policy chain.
App, Dev, & Agent Keys
Issue API keys explicitly classified for human developers, backend apps, or autonomous AI agents to easily separate usage metrics and permissions.
Analytics & Monitoring
Real-Time Debugger
Inspect live request streams instantly with premium logs. Trace token counts (in/out), request costs, latency, and detailed prompt payloads with inline failover logs.
Shared Live Telemetry
Easily share read-only access to specific key logs and performance metrics with developers via secure, public telemetry links.
Unified Credentials Tracking
Track usage, spending splits, and tokens across Apps, Developers, and Agents in real-time from a single dashboard.
Policies & Budgets
Budget Controls
Define policy guardrails that bind developer API keys to strict daily or monthly consumption budgets to prevent run-away costs.
TPM & RPM Limits
Enforce custom Tokens-Per-Minute (TPM) and Requests-Per-Minute (RPM) rate limits at the key level to prevent rate-limit errors from providers.
Quick Start Snippets
Michi fits directly into your existing OpenAI SDK setup. You only need to change thebaseURL and provide your Michi API Key.
