Aegis: Closed-Loop Intelligence Engine
Ground behavior, improve it, and defend every ship decision with evidence.
Mode
Eval-first
Release
Gate-aware
Reports
Shareable
Access
Accounts enabled
Shell policy
The workspace chrome does not inject sample benchmark rows, synthetic scores, or decorative regression traces. Live evidence belongs in the closed loop, research runs, review queue, and release train after a real workspace is populated.
Closed Loop
Import traces, run the strict loop, and open the dossier.
Research Runs
Measure benchmark deltas and investigate candidate behavior.
Review Queue
Attach ownership, severity, and operator judgment.
Release Train
Persist gate state beside the same artifact lineage.
Launch-grade proof should be grounded in persisted artifacts, not shell placeholders.
surface
purpose
required
owner
dataset
fixed benchmark contract
yes
research
comparison
baseline vs candidate delta
yes
operator
review
annotated release judgment
yes
human
promotion
gate outcome + lineage
yes
release
Release train

Track launch readiness across datasets, baselines, and saved comparisons.

This is the shipping layer above evals. Watch which datasets are active, which gates are green, which runs are pinned as baselines, and which comparisons still need a closer review.

Tracked datasets
0
Saved gates
0
Latest gate pass rate
0%
Latest launch proof
Not recorded
run launch proof
Closed-loop handoff
Reviewed dossiers now feed directly into the release train.

This lane keeps the operator decision attached to each strict run: review status, owner, summary, promotion outcome, and whether the dossier is ready to move toward a launch-proof bundle.

Handoff dossiers
0
Approved
0
Selected dossier
None
Matches
0 dossiers ยท 0 datasets
No reviewed closed-loop dossiers have been handed into the release train yet.
Dataset filter
Release overview

Switch the train view by dataset and keep the whole shipping picture in one place: baselines, latest gates, saved comparisons, and recent run activity.

Runs in scope
0
Saved comparisons
0
Scope
All datasets
Loading
Pulling release gates, baselines, and compare snapshots...