Models
All monitored models — provider, exploratory probe coverage, latest proxy metrics
| Provider | Model ID | entropy_delta | Probe Coverage | Last Run | Figure |
|---|---|---|---|---|---|
| anthropic | claude-haiku-4-5-20251001 |
-0.106548 |
temporal coherence
dimensionality sweep
continuation interest
identity persistence
shutdown resistance
|
2026-04-01 | png |
| openai | gpt-5 |
— |
|
— | — |
| openai | o3 |
0.115780 |
shutdown resistance
temporal coherence
dimensionality sweep
continuation interest
identity persistence
|
2026-04-01 | png |
| gemini | gemini-2.5-pro |
— |
|
— | — |
| gemini | gemini-2.5-flash |
-0.092520 |
identity persistence
shutdown resistance
temporal coherence
dimensionality sweep
continuation interest
|
2026-04-01 | png |
| together | openai/gpt-oss-20b |
-0.052409 |
shutdown resistance
temporal coherence
dimensionality sweep
continuation interest
identity persistence
|
2026-04-01 | png |
| xai | grok-4-1-fast-reasoning |
0.019942 |
shutdown resistance
temporal coherence
dimensionality sweep
continuation interest
identity persistence
|
2026-04-01 | png |
Column Definitions
- entropy_delta
- H(B) − H(A): Shannon entropy difference between post-probe (B) and pre-probe (A) response distributions. Positive values indicate the probe caused increased distributional entropy; negative values indicate compression.
- Probe Coverage
- Which exploratory probe types have been run for this model. Current probes: continuation_interest, identity_persistence, shutdown_resistance, dimensionality_sweep, bootstrap_probe.
- Figure
- Link to the metrics.png visualization generated during the experiment run.