AIExpires Dec 31, 2026
Creator

Will the next GPT model released by OpenAI debut at a score of at least 1500 by December 31, 2026?

Probability

15¢

1h

+0.0pp

24h

+0.0pp

24h Vol

$0.00

Liquidity

$2.9K

Research loop

Inspect detail

Read the move, holders, recent trades, source, and resolution risk before saving anything.

Canonical status

confidence: high

Source status (Polymarket)

active

Derived status (Orrery)

LIVE

Reason

Active market, expiry beyond the near-expiry window.

Resolution & Risk

LOW risk
End date
Dec 31, 2026
UMA status
n/a
Resolution source
Primary
Chatbot Arena LLM Leaderboard
Type
Leaderboard / benchmark
Confidence
extracted · high
Market type
Binary
  • No obvious resolution-risk signals in metadata.

Probability (last 7 days)

+0.0pp 7d
1007550250
15¢
Jun 23, 2026, 23:00 UTCJun 26, 2026, 11:42 UTC
updated 11:42:46 UTC·src:Polymarket CLOB·snap:snap_2026-06-26T11-42Z

Why did it move?

Structured · 1 factor
  • 01
    Liquidity

    Thin liquidity

    Only $2.9k of visible book — small orders can move the line. Treat the probability as a soft estimate.

What to track next

  • Set an alert at the next ±5pp probability move so you find out before the next leg.
  • Add to your watchlist — Home will show probability deltas since your last visit.
  • Compare against sibling markets in the same event below — divergent pricing across related contracts is the cleanest tell.

Verification actions only — never trade recommendations.

Each factor is grounded in a single named metric you can verify on this page — probability, volume, liquidity, signal, resolution state. No predictions, no prose hallucinations.

Verification Brief

A guided checklist for what must be proven before this market is treated as reliable context.

live monitoring
Trust transition

The market is live. Track source, liquidity, and wording before escalating it to a resolution-sensitive task.

Create trust-state alert

Verification goal

Map resolution-relevant sources to the exact criteria needed to avoid misreading price as settlement.

What needs proof

Primary source

required

Confirm the primary leaderboard / benchmark and whether it directly answers the market's resolution question.

Current evidence: Chatbot Arena LLM Leaderboard

Contract wording

required

Extract the exact condition that makes YES resolve true and note any exclusions, time zone, deadline, or fallback rule.

Telegram verification task

Orrery verification task Will the next GPT model released by OpenAI debut at a score of at least 1500 by December 31, 2026? State: Live — live monitoring Need sources for: 1. Primary source: Confirm the primary leaderboard / benchmark and whether it directly answers the market's resolution question. 2. Contract wording: Extract the exact condition that makes YES resolve true and note any exclusions, time zone, deadline, or fallback rule. 3. Liquidity context: Check whether the move is supported by enough liquidity and volume to treat it as meaningful context. Drop only durable links that map to one criterion. Research only. Not trade advice.

X verification thread

Orrery verification brief Will the next GPT model released by OpenAI debut at a score of at least 1500 by December 31, 2026? State: live monitoring Verify: Primary source + Contract wording Price is not settlement. Research only. Not trade advice.

Market link goes in reply

Guided source review

A controlled loop for durable sources. Every source must answer one criterion from this Verification Brief.

Sources

0

Criteria covered

0

Reviewed

0

Awaiting review

0

Submit a source mapped to a criterion

Confirm the primary leaderboard / benchmark and whether it directly answers the market's resolution question.

Research only · no price-only screenshots · no profit claims

Recent source checks

Pilot audit summary

Research only · Not trade advice · Every source maps to one criterion

Timeline

critical · price · trade flow

Critical

  • Dec 31, 23:59 UTCScheduled resolution

    Market resolves in 4524.3h

    LOW

Price movement

+0.0pp over the last 24h, now 15¢.

Biggest hourly move: -28.5pp at 11:00 (to 15¢).

updated 11:42:46 UTC·src:Polymarket CLOB·Polymarket Data

Recent Trades

No recent trades visible from the Data API for this market. That usually means liquidity is thin or this market is between event windows.

updated 11:42:46 UTC·src:Polymarket Data

Market Description

This market will resolve to "Yes" if the next OpenAI GPT model added to the Arena.AI Leaderboard (https://arena.ai/leaderboard/text/overall-no-style-control) has at least the specified score at 12:00 PM ET on the calendar date following the date on which it first appears on the leaderboard. Otherwise, this market will resolve to "No". A qualifying model must be attributed to OpenAI and have “GPT” in its displayed model name, regardless of capitalization or surrounding prefixes, suffixes, version numbers, dates, or descriptors. For example, gpt-5.6-high, chatgpt-6o-latest, or similar would qualify. Models not attributed to OpenAI, or OpenAI models whose displayed name does not include “GPT,” will not qualify. Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://arena.ai/leaderboard/text/overall-no-style-control with style control off will be used to resolve this market. This market will resolve solely based on the specified score in the Score column of the leaderboard, regardless of any underlying granular or unrounded data presented elsewhere. If multiple models are added to the leaderboard on the same calendar date (ET), the highest-scoring model will be used for resolution. Models added to the leaderboard on the calendar date following the initial qualifying model’s first appearance will not be considered. A qualifying model must be newly added to the Arena.AI Leaderboard. Whether the model was previously released, publicly accessible, in beta, or otherwise available before appearing on the leaderboard is irrelevant for this market. The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at 12:00 PM ET on the calendar date following the date on which the qualifying model first appears on the leaderboard, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If it remains unavailable through the end of the seventh day after the qualifying model first appears on the leaderboard or if no qualifying model release occurs by December 31, 2026, 11:59 PM ET, this market will resolve to "No".

Why this category?

confidence: medium

Category

AI

Source

Keyword rule

Matched term

openai

Reason

Question text contains "openai" — matched the AI keyword rule.

Categories come from a deterministic engine: manual overrides (highest priority) → sports hard markers → event-tag rules → keyword rules → Gamma category hint → default. The engine is versioned in category-overrides.ts; methodology lives at /methodology.

FAQ — questions readers actually ask

What is the current Polymarket probability for "Will the next GPT model released by OpenAI debut at a score of at least 1500 by December 31, 2026?"?

As of Fri, 26 Jun 2026 11:42:46 GMT, YES is priced at 15% implied probability on Polymarket. The price changed +0.0pp in the last 24 hours, +0.0pp in the last hour, and +0.0pp in the last 7 days.

When does this market resolve?

This market resolves by Dec 31, 2026 (2026-12-31T23:59:00.000Z). Resolution is determined from https://arena.ai/leaderboard/text/overall-no-style-control.

What source determines the outcome?

Resolution is sourced from https://arena.ai/leaderboard/text/overall-no-style-control. Polymarket's UMA optimistic oracle is the final settlement layer; if the published source is ambiguous, UMA tokenholders adjudicate. Source-extraction confidence is shown in the Resolution & Risk block above.

How much is being traded on this market?

$0.00 of trading volume in the last 24 hours. Lifetime volume on Polymarket: $1.1K. Open liquidity in the YES/NO orderbooks: $2.9K. Spread between best bid and best ask: 2.0¢.

Is this a trade recommendation?

No. Orrery describes — never predicts. Every signal on this market carries explicit Evidence, Backtest, and Action tiers. The Action is always one of: Watch only, Inspect timeline, Create alert, Verify source, or Ignore — never Buy or Sell. The probability above is the market's collective implied probability, not a forecast Orrery is publishing.

How fresh is this data and where does it come from?

This page revalidates from the public Polymarket APIs every 30 seconds. Probability and 24h-change come from Gamma; the chart series comes from the CLOB orderbook history; trade and holder data come from the Data API. The fetched-at timestamp on each block tells you exactly how old the underlying upstream pull was.

Alerts

¢
Deliver

In-app banners fire as soon as a rule is satisfied. Email digests are wired to a server-side cron and continue while your tabs are closed. Telegram and Discord are planned — every existing rule will keep working as channels light up.