Skip to main content

Overview

ArgentOS uses a model router to automatically select the best model for each task. Simple queries go to fast, cheap models. Complex reasoning goes to powerful, expensive ones. This saves money without sacrificing quality.

The Tier System

TierScore RangeDefault ModelCostUse Case
LOCAL< 0.3Qwen3 30B-A3B via OllamaFreeSimple lookups, quick replies
FAST0.3 - 0.5Claude HaikuLowStraightforward questions, memory recall
BALANCED0.5 - 0.8Claude SonnetMediumMost conversations, tool use
POWERFUL> 0.8Claude OpusHighComplex reasoning, multi-step planning
The model router scores each incoming message on a 0-1 complexity scale and routes it to the corresponding tier.

Key Features

Automatic Routing

No manual model selection needed. The router scores complexity and picks the right tier.

Cost Optimization

Simple tasks use cheap models, saving budget for complex work.

Provider Diversity

Mix local models, Anthropic, MiniMax, Z.AI, and OpenRouter.

Failover

Automatic fallback when a provider is unavailable.

Dashboard Integration

The dashboard shows a model badge on each message indicating which tier was used:
ColorTierCost
GreenLOCALFree
YellowFAST$
BlueBALANCED$$
PurplePOWERFUL$$$

Deep Dives

Router Architecture

How complexity scoring and tier routing work

Providers

Supported model providers and configuration

Provider Registry

Dynamic catalog of 15+ providers and 60+ models

Auth Profiles

Managing multiple API keys and subscriptions

Failover

How automatic failover and cooldown work