`cortex qm` / `cortex mqm`

The Model Quartermaster is an adaptive model-selection engine that observes LLM calls across sessions, computes six weighted signal scores, and fuses them to predict the optimal model for each task. cortex mqm provides model-level analysis across all sessions; the legacy cortex qm tool-prediction variant is also available for tool-level analysis.

Usage

cortex qm <subcommand> [options]
cortex mqm <subcommand> [options]

Six Prediction Signals (MQM)

Signal	Description
Trajectory	Recent model usage patterns and sequences
Episodic	Similar conversation context matching
Historical	Past performance data for task categories
Cost	Cost efficiency optimization across models
Quality	Expected quality based on model capabilities
Reflection	Per-turn reflection feedback integration

`cortex qm` Subcommands

Subcommand	Description
`patterns`	List learned patterns
`weights`	Show current signal weights
`stats`	Display usage statistics
`decisions`	Show prediction decisions for a session
`trace`	Trace the prediction chain for a turn
`dashboard`	Rich visual dashboard with accuracy bars and top predictions
`accuracy`	Show prediction accuracy metrics
`reset`	Reset quartermaster state for a session
`reset-all`	Reset all quartermaster data

Options

Option	Subcommand	Description
`--limit, -n`	`patterns`	Limit number of patterns shown
`--session, -s`	`decisions`, `dashboard`, `accuracy`	Filter by session ID
`--limit, -n`	`decisions`	Limit number of decisions shown

`cortex mqm` Subcommands

Subcommand	Description
`stats`	Model-level usage statistics
`decisions`	Model-level prediction decisions
`weights`	Model-level signal weights
`accuracy`	Accuracy metrics across all sessions
`dashboard`	Model-level dashboard
`reset`	Reset model-level quartermaster
`reset-all`	Reset all model-level data

Options

Option	Subcommand	Description
`--limit, -n`	`decisions`	Limit number of decisions
`--hours, -h`	`accuracy`	Time window for accuracy calculation

Prediction Confidence Levels

Confidence	Action
≥ 85%	Enforce — Override model selection (safe operations only)
65–84%	Suggest — Recommend model to the agent
< 65%	Defer — Let the agent decide

Active Mode

The Quartermaster requires 50 observations before entering active prediction mode. Before this threshold, it operates in learning-only mode, collecting data without making predictions.

Reinforcement Learning

After each prediction, the Quartermaster evaluates correctness:

Reward (EMA α = 0.15): Increase signal weights for correct predictions
Punishment (EMA α = 0.25): Decrease weights for incorrect predictions
Convergence: Weights stabilize after ~200–500 observations

Dashboard Output

The dashboard subcommand provides the richest output:

Accuracy bars per signal
Current signal weights visualization
Top models by prediction accuracy
Session and model-level trends
Confidence distribution histograms

Examples

# View prediction dashboard for a session
cortex qm dashboard -s sess_abc123

# Show learned patterns
cortex qm patterns --limit 20

# Display signal weights
cortex qm weights

# Trace prediction for a specific turn
cortex qm trace 42

# Show accuracy metrics
cortex qm accuracy -s sess_abc123

# Model-level accuracy over last 24 hours
cortex mqm accuracy -h 24

# Reset all quartermaster data
cortex mqm reset-all

cortex qm / cortex mqm

Usage

Six Prediction Signals (MQM)

cortex qm Subcommands

Options

cortex mqm Subcommands

Options

Prediction Confidence Levels

Active Mode

Reinforcement Learning

Dashboard Output

Examples

`cortex qm` / `cortex mqm`

`cortex qm` Subcommands

`cortex mqm` Subcommands