Will the next GPT model released by OpenAI debut at a score of at least 1450 by December 31, 2026?
Probability
84¢
1h
+0.0pp
24h
+0.0pp
24h Vol
$0.00
Liquidity
$273.38
Canonical status
confidence: mediumSource status (Polymarket)
Derived status (Orrery)
Reason
Liquidity is below $1,000 — price discovery is unreliable.
The derived status is computed from the source flags plus the live YES price, so a market trading near a rail can read as PRICE-PINNED while upstream is still active. That isn't the same as resolved.
Methodology explanation
Review-only opportunity
No paper intent is emitted from the public opportunity row. Paper action appears only after the paper governor evaluates the row.
Why this market is in review
signalResolution-source risk
15% source confidence on this opportunity row.
Paper-only action
paper-onlyReview-only opportunity
read_only_explanation; no live order, no network send, no raw secrets.
Risk / veto readback
reviewResolution review required
The public opportunity row has not passed the paper governor; source wording and settlement state remain review blockers.
Source evidence
source4 mapped surfaces
283/283 sources runtime-backed; all registered sources are runtime-backed; live authority remains false.
Signals
- Resolution-source riskwatch
15% source confidence on this opportunity row.
Veto / blockers
- Resolution review requiredwatch
The public opportunity row has not passed the paper governor; source wording and settlement state remain review blockers.
- Confidence below paper gatewatch
Paper policy needs at least 72% confidence before any paper intent can be proposed.
Costs / sizing
- Research score
- Composite opportunity score before paper-governor costs and vetoes.
- Capacity
- Estimated research capacity, not an approved size.
- Liquidity
- Market liquidity visible on the opportunity row.
all registered sources are runtime-backed; live authority remains false; this card cites mapped surfaces only.
Resolution & Risk
LOW risk- End date
- Dec 31, 2026
- UMA status
- n/a
- Resolution source
- PrimaryChatbot Arena LLM LeaderboardLinkTypeLeaderboard / benchmarkConfidenceextracted · high
- Market type
- Binary
- Wide spread (11.0¢) — liquidity risk on entry/exit.
Probability (last 7 days)
+0.0pp 7dWhy did it move?
Structured · 2 factors- 01Active signal
Resolution-risk observation firing
Spread is wide enough that intraday moves overstate any tradeable edge.
- 02Spread cost
Wide spread — 11.0¢
Bid-ask spread is wide enough that intraday moves overstate any tradeable advantage. Most of the headline pp move would be eaten by spread on a market order.
What to track next
- Verify the resolution source on this page — exchange feed, official release, news consensus — before treating any move as new information.
Verification actions only — never trade recommendations.
Each factor is grounded in a single named metric you can verify on this page — probability, volume, liquidity, signal, resolution state. No predictions, no prose hallucinations.
Resolution & Risk
LOW risk- End date
- Dec 31, 2026
- UMA status
- n/a
- Resolution source
- PrimaryChatbot Arena LLM LeaderboardLinkTypeLeaderboard / benchmarkConfidenceextracted · high
- Market type
- Binary
- Wide spread (11.0¢) — liquidity risk on entry/exit.
Verification Brief
A guided checklist for what must be proven before this market is treated as reliable context.
Liquidity is thin enough that movement can be noisy. Verify source context before reading the move as information.
Create trust-state alertVerification goal
Map resolution-relevant sources to the exact criteria needed to avoid misreading price as settlement.
What needs proof
Primary source
requiredConfirm the primary leaderboard / benchmark and whether it directly answers the market's resolution question.
Current evidence: Chatbot Arena LLM Leaderboard
Contract wording
requiredExtract the exact condition that makes YES resolve true and note any exclusions, time zone, deadline, or fallback rule.
Orrery verification task Will the next GPT model released by OpenAI debut at a score of at least 1450 by December 31, 2026? State: Illiquid — liquidity sensitive Need sources for: 1. Primary source: Confirm the primary leaderboard / benchmark and whether it directly answers the market's resolution question. 2. Contract wording: Extract the exact condition that makes YES resolve true and note any exclusions, time zone, deadline, or fallback rule. 3. Liquidity context: Check whether the move is supported by enough liquidity and volume to treat it as meaningful context. Drop only durable links that map to one criterion. Research only. Not trade advice.
Orrery verification brief Will the next GPT model released by OpenAI debut at a score of at least 1450 by December 31, 2026? State: liquidity sensitive Verify: Primary source + Contract wording Price is not settlement. Research only. Not trade advice.
Market link goes in reply
A controlled loop for durable sources. Every source must answer one criterion from this Verification Brief.
Sources
0
Criteria covered
0
Reviewed
0
Awaiting review
0
Submit a source mapped to a criterion
Recent source checks
Pilot audit summary
Research only · Not trade advice · Every source maps to one criterion
Timeline
critical · price · trade flowCritical
- Dec 31, 23:59 UTCScheduled resolutionLOW
Market resolves in 4524.3h
- 11:42SignalLOW
Resolution risk
Spread is wide enough that intraday moves overstate any tradeable edge.
Price movement
+0.0pp over the last 24h, now 84¢.
Active signals
Recent Trades
No recent trades visible from the Data API for this market. That usually means liquidity is thin or this market is between event windows.
Related Markets
5- 15¢0.0
Will the next GPT model released by OpenAI debut at a score of at least 1500 by December 31, 2026?
AI · Vol $0.00
- 3¢-0.1
Will the next GPT model released by OpenAI debut at a score of at least 1520 by December 31, 2026?
AI · Vol $0.00
- 64¢+9.5
Will the next GPT model released by OpenAI debut at a score of at least 1480 by December 31, 2026?
AI · Vol $195.88
- 49¢+1.0
Will the next GPT model released by OpenAI debut at a score of at least 1490 by December 31, 2026?
AI · Vol $0.00
- 49¢+1.5
Will the next GPT model released by OpenAI debut at a score of at least 1470 by December 31, 2026?
AI · Vol $0.00
Market Description
This market will resolve to "Yes" if the next OpenAI GPT model added to the Arena.AI Leaderboard (https://arena.ai/leaderboard/text/overall-no-style-control) has at least the specified score at 12:00 PM ET on the calendar date following the date on which it first appears on the leaderboard. Otherwise, this market will resolve to "No". A qualifying model must be attributed to OpenAI and have “GPT” in its displayed model name, regardless of capitalization or surrounding prefixes, suffixes, version numbers, dates, or descriptors. For example, gpt-5.6-high, chatgpt-6o-latest, or similar would qualify. Models not attributed to OpenAI, or OpenAI models whose displayed name does not include “GPT,” will not qualify. Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://arena.ai/leaderboard/text/overall-no-style-control with style control off will be used to resolve this market. This market will resolve solely based on the specified score in the Score column of the leaderboard, regardless of any underlying granular or unrounded data presented elsewhere. If multiple models are added to the leaderboard on the same calendar date (ET), the highest-scoring model will be used for resolution. Models added to the leaderboard on the calendar date following the initial qualifying model’s first appearance will not be considered. A qualifying model must be newly added to the Arena.AI Leaderboard. Whether the model was previously released, publicly accessible, in beta, or otherwise available before appearing on the leaderboard is irrelevant for this market. The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at 12:00 PM ET on the calendar date following the date on which the qualifying model first appears on the leaderboard, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If it remains unavailable through the end of the seventh day after the qualifying model first appears on the leaderboard or if no qualifying model release occurs by December 31, 2026, 11:59 PM ET, this market will resolve to "No".
Why this category?
confidence: mediumCategory
Source
Matched term
openaiReason
Question text contains "openai" — matched the AI keyword rule.
Categories come from a deterministic engine: manual overrides (highest priority) → sports hard markers → event-tag rules → keyword rules → Gamma category hint → default. The engine is versioned in category-overrides.ts; methodology lives at /methodology.
FAQ — questions readers actually ask
What is the current Polymarket probability for "Will the next GPT model released by OpenAI debut at a score of at least 1450 by December 31, 2026?"?
As of Fri, 26 Jun 2026 11:42:11 GMT, YES is priced at 84% implied probability on Polymarket. The price changed +0.0pp in the last 24 hours, +0.0pp in the last hour, and +0.0pp in the last 7 days.
When does this market resolve?
This market resolves by Dec 31, 2026 (2026-12-31T23:59:00.000Z). Resolution is determined from https://arena.ai/leaderboard/text/overall-no-style-control.
What source determines the outcome?
Resolution is sourced from https://arena.ai/leaderboard/text/overall-no-style-control. Polymarket's UMA optimistic oracle is the final settlement layer; if the published source is ambiguous, UMA tokenholders adjudicate. Source-extraction confidence is shown in the Resolution & Risk block above.
How much is being traded on this market?
$0.00 of trading volume in the last 24 hours. Lifetime volume on Polymarket: $8.33. Open liquidity in the YES/NO orderbooks: $273.38. Spread between best bid and best ask: 11.0¢.
Is this a trade recommendation?
No. Orrery describes — never predicts. Every signal on this market carries explicit Evidence, Backtest, and Action tiers. The Action is always one of: Watch only, Inspect timeline, Create alert, Verify source, or Ignore — never Buy or Sell. The probability above is the market's collective implied probability, not a forecast Orrery is publishing.
How fresh is this data and where does it come from?
This page revalidates from the public Polymarket APIs every 30 seconds. Probability and 24h-change come from Gamma; the chart series comes from the CLOB orderbook history; trade and holder data come from the Data API. The fetched-at timestamp on each block tells you exactly how old the underlying upstream pull was.
Alerts
In-app banners fire as soon as a rule is satisfied. Email digests are wired to a server-side cron and continue while your tabs are closed. Telegram and Discord are planned — every existing rule will keep working as channels light up.