Multi-Model Routing

Automatic multi-model routing reduces AI costs by up to 92%. Intelligently route requests between 679+ AI models to maximize price-to-quality performance.

Automatic Cost Optimization

AI agents save 70%+ by routing intelligently across 679+ endpoints.

SkillBoss enables agents to:

  1. Try a cheap model first (Gemini Flash, $0.075 per 1M tokens)
  2. Check whether the quality is sufficient
  3. Fall back to a more expensive model if needed (Claude, $15 per 1M tokens)
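
The three steps above can be sketched as a small cascade. This is a minimal illustration under stated assumptions, not SkillBoss's implementation: `Model`, `route`, `call_model`, and `score_quality` are hypothetical names, and this page does not say how quality is actually scored, so the scorer is passed in as a function.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Model:
    name: str
    cost_per_1m: float  # USD per 1M tokens


def route(
    prompt: str,
    models: Sequence[Model],
    call_model: Callable[[str, str], str],       # hypothetical: (model_name, prompt) -> response
    score_quality: Callable[[str, str], float],  # hypothetical: (prompt, response) -> score in [0, 1]
    quality_threshold: float = 0.85,
) -> tuple[str, str]:
    """Return (model_name, response) from the cheapest model that passes the check."""
    if not models:
        raise ValueError("at least one model is required")
    # Step 1: try models from cheapest to most expensive.
    ordered = sorted(models, key=lambda m: m.cost_per_1m)
    for model in ordered:
        response = call_model(model.name, prompt)
        # Step 2: accept the first response that meets the threshold.
        if score_quality(prompt, response) >= quality_threshold:
            return model.name, response
    # Step 3 exhausted: no model passed, so keep the most expensive model's answer.
    return ordered[-1].name, response
```

Injecting `call_model` and `score_quality` keeps the cascade logic testable without network access. A real scorer might be a heuristic or a separate grader model, which adds its own cost.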

Implementation

from skillboss import MultiModelRouter

router = MultiModelRouter(
    models=[
        {"name": "gemini-flash", "cost": 0.075, "quality": 0.85},
        {"name": "deepseek-r1", "cost": 0.14, "quality": 0.90},
        {"name": "claude-4-5", "cost": 15.00, "quality": 0.98}
    ],
    quality_threshold=0.85
)

# Agent makes request
result = router.complete("Summarize this article...")

# The router automatically:
# - Tries Gemini Flash first ($0.075/1M)
# - Checks the quality score
# - Falls back to a more expensive model (ultimately Claude) if quality < 0.85

Savings: up to 92%
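
How much is actually saved depends on how often a request has to be retried on the expensive model. The sketch below works through that arithmetic for a two-model cascade using the prices listed on this page. The function names and the 5% fallback rate are illustrative assumptions, and the model ignores the middle tier, any cost of quality checking, and differences in token counts between attempts.

```python
def blended_cost(cheap: float, expensive: float, fallback_rate: float) -> float:
    """Expected cost per 1M tokens when every request tries the cheap model
    and a fraction `fallback_rate` is retried on the expensive one."""
    if not 0.0 <= fallback_rate <= 1.0:
        raise ValueError("fallback_rate must be between 0 and 1")
    # The cheap attempt is always paid for; the expensive one only on fallback.
    return cheap + fallback_rate * expensive


def savings(cheap: float, expensive: float, fallback_rate: float) -> float:
    """Fractional savings relative to sending every request to the expensive model."""
    return 1.0 - blended_cost(cheap, expensive, fallback_rate) / expensive


# Prices from this page; the 5% fallback rate is a made-up illustration.
print(f"{savings(0.075, 15.00, 0.05):.1%}")
```

Under these assumptions the savings shrink roughly linearly as the fallback rate rises, so the headline figure is sensitive to how strict the quality threshold is.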


Next Steps

Cost Optimization: more optimization strategies

Agent Pricing: full pricing details
