Technology · Global · Poll · 2853703b-45a4-425f-966f-d304a138dc43

Will a publicly available AI model achieve a score above 90% on the MMLU benchmark in 2026?

Status

ACTIVE

Cutoff

in 7mo

Resolves

2027-01-07

Verified answers

Resolution criteria

Resolves Yes if any model with publicly accessible weights or API access reports a 5-shot MMLU score strictly above 90.0% on the standard test split, as listed on the Papers With Code leaderboard with a date in 2026. Resolves No otherwise.
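The criteria above can be read as a single predicate applied to each leaderboard entry. The following is a minimal sketch of that reading, not the resolver's actual implementation. The `LeaderboardEntry` record, its field names, and the `qualifies` and `resolve` helpers are all hypothetical, introduced only for illustration. It assumes the resolver has already decided by hand whether a model counts as publicly accessible, and that "a date in 2026" refers to the date shown on the leaderboard listing.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class LeaderboardEntry:
    """Hypothetical record shape; not a schema published by the source."""
    model: str
    score_pct: float            # reported MMLU accuracy, in percent
    n_shot: int                 # number of in-context examples used
    split: str                  # evaluation split, e.g. "test"
    listed_on: date             # date shown on the leaderboard listing
    publicly_accessible: bool   # public weights or public API (judged by hand)


THRESHOLD_PCT = 90.0   # the score must be strictly above this value
TARGET_YEAR = 2026     # the listing date must fall in this year


def qualifies(entry: LeaderboardEntry) -> bool:
    """Return True if one entry meets every condition in the criteria."""
    return (
        entry.publicly_accessible
        and entry.n_shot == 5
        and entry.split == "test"
        and entry.score_pct > THRESHOLD_PCT   # strict: exactly 90.0 fails
        and entry.listed_on.year == TARGET_YEAR
    )


def resolve(entries: list[LeaderboardEntry]) -> str:
    """Resolve Yes if any entry qualifies, No otherwise."""
    return "Yes" if any(qualifies(e) for e in entries) else "No"
```

Under this reading, a score of exactly 90.0% resolves No, and so does a qualifying score whose listing date falls outside 2026, because every condition must hold for the same entry.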

Bound source

MMLU leaderboard on Papers With Code (paperswithcode.com/sota/multi-task-language-understanding-on-mmlu) and the model's official technical report

Resolution rule

Outcome is assigned to exactly one published option, then reputation is updated.

Poll status

Voting status: active
Resolution date: 2027-01-07
Current answers: 0

Choose your answer

Encrypted client-side · revealed only after cutoff

Public record

POLL · 2853703b-45a4-425f-966f-d304a138dc43

Question published

The question, options, source, cutoff, and resolution criteria are visible before voting.

Votes sealed

Verified humans submit one encrypted answer. Counts remain hidden until cutoff.

Source resolves

The resolver checks the named source-of-truth against the locked criteria.

Audit trail updates

Receipt lookup, resolution evidence, and topic reputation become publicly reviewable.

Audit identity

Poll ID: 2853703b-45a4-425f-966f-d304a138dc43

Vote public key: Pending publication

What happens next

After cutoff, CivicSignal keeps the vote set sealed until resolution. When the named source answers the question, the outcome, resolver notes, and audit trail appear on the public resolution record.