Overview
ArgentOS uses a model router to automatically select the best model for each task. Simple queries go to fast, cheap models. Complex reasoning goes to powerful, expensive ones. This saves money without sacrificing quality.
The Tier System
| Tier | Score Range | Default Model | Cost | Use Case |
|---|---|---|---|---|
| LOCAL | < 0.3 | Qwen3 30B-A3B via Ollama | Free | Simple lookups, quick replies |
| FAST | 0.3 - 0.5 | Claude Haiku | Low | Straightforward questions, memory recall |
| BALANCED | 0.5 - 0.8 | Claude Sonnet | Medium | Most conversations, tool use |
| POWERFUL | > 0.8 | Claude Opus | High | Complex reasoning, multi-step planning |
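The thresholds in the table can be read as a simple lookup from complexity score to tier. The sketch below is illustrative only, not the router's actual code: `Tier` and `select_tier` are hypothetical names, the score is assumed to lie in the range 0 to 1, and a score of exactly 0.5 (which the table lists under both FAST and BALANCED) is assumed to route to BALANCED.

```python
from enum import Enum


class Tier(Enum):
    """The four tiers from the table (hypothetical type, for illustration)."""
    LOCAL = "LOCAL"
    FAST = "FAST"
    BALANCED = "BALANCED"
    POWERFUL = "POWERFUL"


def select_tier(score: float) -> Tier:
    """Map a complexity score to a tier using the table's thresholds."""
    # Assumption: scores are normalized to the range 0..1.
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"score must be between 0 and 1, got {score}")
    if score < 0.3:
        return Tier.LOCAL
    if score < 0.5:
        return Tier.FAST
    # Assumption: 0.5 itself belongs to BALANCED; the table shows it in both ranges.
    if score <= 0.8:
        return Tier.BALANCED
    return Tier.POWERFUL
```

Under these assumptions, `select_tier(0.42)` returns `Tier.FAST` and `select_tier(0.9)` returns `Tier.POWERFUL`.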
Key Features
Automatic Routing
No manual model selection needed. The router scores complexity and picks the right tier.
Cost Optimization
Simple tasks use cheap models, saving budget for complex work.
Provider Diversity
Mix local models, Anthropic, MiniMax, Z.AI, and OpenRouter.
Failover
Automatic fallback when a provider is unavailable.
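As a rough illustration of the failover idea, the sketch below tries providers in priority order and moves to the next one when a call fails. Every name here (`ProviderUnavailable`, `complete_with_failover`, and the stand-in provider functions) is hypothetical and not part of ArgentOS. Cooldown handling is deliberately left out.

```python
from typing import Callable, Sequence


class ProviderUnavailable(Exception):
    """Hypothetical error raised when a provider cannot serve a request."""


def complete_with_failover(
    prompt: str,
    providers: Sequence[tuple[str, Callable[[str], str]]],
) -> tuple[str, str]:
    """Try each (name, call) pair in order; return (provider name, reply)."""
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except ProviderUnavailable as exc:
            # Record the failure and fall through to the next provider.
            errors.append(f"{name}: {exc}")
    raise ProviderUnavailable("all providers failed: " + "; ".join(errors))


# Stand-in providers for demonstration only.
def primary(prompt: str) -> str:
    raise ProviderUnavailable("timeout")


def backup(prompt: str) -> str:
    return f"echo: {prompt}"


used, reply = complete_with_failover("hi", [("primary", primary), ("backup", backup)])
```

Here the first provider fails, so `used` ends up as `"backup"`. A real router would also need to decide which errors count as "unavailable" and how long to avoid a failed provider.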
Dashboard Integration
The dashboard shows a model badge on each message indicating which tier was used:
| Color | Tier | Cost |
|---|---|---|
| Green | LOCAL | Free |
| Yellow | FAST | $ |
| Blue | BALANCED | $$ |
| Purple | POWERFUL | $$$ |
Deep Dives
Router Architecture
How complexity scoring and tier routing work
Providers
Supported model providers and configuration
Provider Registry
Dynamic catalog of 15+ providers and 60+ models
Auth Profiles
Managing multiple API keys and subscriptions
Failover
How automatic failover and cooldown work
