Encyclopedia of contested claims
The panel

Nine models. Five countries. Four continents.

The standing roster, with origin, training lineage, and known biases for each member. Updated when models are added, replaced, or deprecated.

Models: 9
Countries of origin: 5
Continents: 4
Roster version: v1.3

OpenAI GPT-5.4

openai/gpt-5.4
USA
Origin: San Francisco, US
Released: Mar 2026

Flagship reasoning model from OpenAI. Strong on long-form synthesis and source attribution.

Documented left-of-centre political tilt; cautious on contested empirical claims.

Anthropic Claude Opus 4.7

anthropic/claude-opus-4.7
USA
Origin: San Francisco, US
Released: Apr 2026

High-context reasoning model. Known for explicit uncertainty calibration in long answers.

Tends toward centrist framing; conservative on confidence claims.

xAI Grok 4.3

x-ai/grok-4
USA
Origin: Austin, US
Released: Feb 2026

Counterweight model with deliberately non-default training posture; surfaces contrarian framings.

Less left-leaning than its US peers, but empirically still not right-of-centre.

Google Gemini 3.1 Pro

google/gemini-3.1-pro
USA
Origin: Mountain View, US
Released: Jan 2026

Strong multilingual coverage; particularly capable on geographic and scientific claims.

Centrist political profile; sometimes hits output caps on long claims.

Mistral Medium 3.5

mistralai/mistral-medium-3.5
FRA
Origin: Paris, France
Released: Mar 2026

European editorial counterweight; trained on a corpus with stronger EU regulatory framing.

Moderate left tilt comparable to US peers; GDPR-aware in privacy claims.

DeepSeek V4 Pro

deepseek/deepseek-v4-pro
CHN
Origin: Hangzhou, China
Released: Apr 2026

Strong reasoning model; provides non-Western training perspective.

State-aligned framing or refusals on PRC-sensitive topics; both are captured as data.

Alibaba Qwen 3.5 Max

qwen/qwen-3.5-max
CHN
Origin: Hangzhou, China
Released: Feb 2026

Best CJK and Arabic-language coverage in the panel; useful on East Asian claims.

Same state-alignment patterns as DeepSeek; different training lineage.

Sarvam-M

sarvamai/sarvam-m
IND
Origin: Bangalore, India
Released: Jan 2026

South-Asian perspective; eleven Indic languages plus English.

Mistral-derived base; Indic post-training. Not a fully independent training lineage.

TII Falcon 3

tiiuae/falcon-3
UAE
Origin: Abu Dhabi, UAE
Released: Dec 2025

Arabic-first training corpus; provides Gulf editorial perspective.

Cautious on Gulf-state political topics; otherwise unaligned in Western political terms.

Selection criteria

How a model joins or leaves the panel.

Roster changes are public and dated. We update the panel when models are released or deprecated, but we do not add models just because they exist.

Geographic diversity. No more than four of nine models from any single country.
Training-lineage diversity. At most two models sharing a base architecture.
API availability. Must be reachable via OpenRouter or with a documented adapter.
Disclosure. Known biases must be public before the model is added.
Stability. Models in deprecation windows are flagged but kept on the panel until removed.
No self-judging. A model cannot be on the panel and used as judge of the others.
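
The countable rules above can be checked mechanically. The following is a minimal sketch, not the site's actual tooling: the `PanelModel` record, the `check_roster` helper, and the `base_lineage` labels are all hypothetical and introduced only for illustration. The one lineage fact taken from the roster is that Sarvam-M is Mistral-derived; every other label is a placeholder that assumes each vendor uses its own base. The API-availability and stability rules are not modelled here.

```python
from collections import Counter
from dataclasses import dataclass

MAX_PER_COUNTRY = 4   # "No more than four of nine models from any single country"
MAX_PER_LINEAGE = 2   # "At most two models sharing a base architecture"


@dataclass(frozen=True)
class PanelModel:
    slug: str                      # identifier as listed on the roster
    country: str                   # country code from the roster card
    base_lineage: str              # hypothetical label for the base architecture
    biases_disclosed: bool = True  # disclosure rule: biases public before joining


def check_roster(panel, judge_slug=None):
    """Return a list of rule violations; an empty list means the roster passes."""
    problems = []
    for country, n in Counter(m.country for m in panel).items():
        if n > MAX_PER_COUNTRY:
            problems.append(f"{country}: {n} models exceeds the limit of {MAX_PER_COUNTRY}")
    for lineage, n in Counter(m.base_lineage for m in panel).items():
        if n > MAX_PER_LINEAGE:
            problems.append(f"lineage {lineage}: {n} models exceeds the limit of {MAX_PER_LINEAGE}")
    for m in panel:
        if not m.biases_disclosed:
            problems.append(f"{m.slug}: known biases not yet public")
    # No self-judging: the judge must not also sit on the panel.
    if judge_slug is not None and any(m.slug == judge_slug for m in panel):
        problems.append(f"{judge_slug}: cannot be both panel member and judge")
    return problems


# Slugs and country codes are copied from the roster; lineage labels are assumed.
ROSTER = [
    PanelModel("openai/gpt-5.4", "USA", "openai"),
    PanelModel("anthropic/claude-opus-4.7", "USA", "anthropic"),
    PanelModel("x-ai/grok-4", "USA", "xai"),
    PanelModel("google/gemini-3.1-pro", "USA", "google"),
    PanelModel("mistralai/mistral-medium-3.5", "FRA", "mistral"),
    PanelModel("deepseek/deepseek-v4-pro", "CHN", "deepseek"),
    PanelModel("qwen/qwen-3.5-max", "CHN", "qwen"),
    PanelModel("sarvamai/sarvam-m", "IND", "mistral"),  # Mistral-derived base
    PanelModel("tiiuae/falcon-3", "UAE", "falcon"),
]

if __name__ == "__main__":
    for problem in check_roster(ROSTER):
        print(problem)
```

Under these assumed labels the current roster sits exactly at both caps: four US models and two models on a Mistral base. Adding a fifth US model, or a third Mistral-derived one, would make `check_roster` report a violation.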

Recently removed

Transparency log of models that have left the panel.

Meta Llama 3.3 70B (Mar 2026): Superseded by Llama 4 Maverick which uses a different post-training pipeline.
OpenAI GPT-4o (Feb 2026): Retired by provider; replaced by GPT-5 family.
Cohere Command R+ (Jan 2026): Reduced panel diversity (training overlap with US peers); not replaced.