AIExpires
Creator

Will the next Claude Opus 4.8 model added to the Arena Leaderboard debut at a score between 1510 and 1520?

Probability

16¢

1h

-2.5pp

24h

+10.5pp

24h Vol

$2.39

Liquidity

$228.62

Research loop

Inspect detail

Read the move, holders, recent trades, source, and resolution risk before saving anything.

Canonical status

confidence: medium

Source status (Polymarket)

active

Derived status (Orrery)

ILLIQUID

Reason

Liquidity is below $1,000 — price discovery is unreliable.

The derived status is computed from the source flags plus the live YES price, so a market trading near a rail can read as PRICE-PINNED while upstream is still active. That isn't the same as resolved.

Resolution & Risk

LOW risk
End date
UMA status
n/a
Resolution source
Primary
Chatbot Arena LLM Leaderboard
Type
Leaderboard / benchmark
Confidence
extracted · high
Market type
Binary
  • Wide spread (23.0¢) — liquidity risk on entry/exit.

Probability (last 7 days)

+0.0pp 7d
1007550250
12¢
May 28, 2026, 20:00 UTCJun 3, 2026, 00:57 UTC
updated 00:57:44 UTC·src:Polymarket CLOB·snap:snap_2026-06-03T00-57Z

Why did it move?

Structured · 2 factors
  • 01
    Price move

    Up 11pp over 24h

    Now 16¢; -2.5pp in the last hour.

  • 02
    Spread cost

    Wide spread — 23.0¢

    Bid-ask spread is wide enough that intraday moves overstate any tradeable advantage. Most of the headline pp move would be eaten by spread on a market order.

What to track next

  • Set an alert if probability drops back below 11¢ — that's where this move would be reversing.
  • Add to your watchlist — Home will show probability deltas since your last visit.
  • Compare against sibling markets in the same event below — divergent pricing across related contracts is the cleanest tell.

Verification actions only — never trade recommendations.

Each factor is grounded in a single named metric you can verify on this page — probability, volume, liquidity, signal, resolution state. No predictions, no prose hallucinations.

Timeline

critical · price · trade flow

Price movement

+10.5pp over the last 24h, now 16¢.

Biggest hourly move: +36.5pp at May 31, 17:00 UTC (to 42¢).

Show top 8 of 42 hourly moves
  • May 31, 22:00 UTC · +22.0pp → 28¢
  • May 31, 20:00 UTC · +22.5pp → 28¢
  • May 31, 18:00 UTC · +29.5pp → 35¢
  • May 31, 17:00 UTC · +36.5pp → 42¢
  • May 31, 16:00 UTC · +32.0pp → 38¢
  • May 31, 15:00 UTC · +22.5pp → 37¢
  • May 31, 11:00 UTC · -21.5pp → 7¢
  • May 31, 09:00 UTC · -22.0pp → 7¢
updated 00:57:44 UTC·src:Polymarket CLOB·Polymarket Data

Recent Trades

No recent trades visible from the Data API for this market. That usually means liquidity is thin or this market is between event windows.

updated 00:57:44 UTC·src:Polymarket Data

Market Description

This market will resolve according to the score the next Claude Opus 4.8 model added to the Arena.AI Leaderboard (arena.ai/leaderboard/text) has at 12:00 PM ET on the calendar date following the date on which it first appears on the leaderboard. Otherwise, this market will resolve to "No". If the relevant score falls exactly between two brackets, this market will resolve to the higher bracket. Any Claude model newly added to the leaderboard and labeled as "Opus 4.8" may qualify (e.g., claude-opus-4.8, claude-opus-4.8-thinking, claude-opus-4.8-preview, or similar). Claude models labeled only as Sonnet, Haiku, or another non-Opus 4.8 variant will not qualify. Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market. This market will resolve solely based on the specified score in the Score column of the leaderboard, regardless of any underlying granular or unrounded data presented elsewhere. If multiple qualifying models are added to the leaderboard on the same calendar date (ET), the highest-scoring model will be used for resolution. Models added to the leaderboard on the calendar date following the initial qualifying model’s first appearance will not be considered. A qualifying model must be newly added to the Arena.AI Leaderboard. Whether the model was previously released, publicly accessible, in beta, or otherwise available before appearing on the leaderboard is irrelevant for this market. The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at 12:00 PM ET on the calendar date following the date on which the qualifying model first appears on the leaderboard, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If it remains unavailable through the end of the seventh day after the qualifying model first appears on the leaderboard or if no qualifying model release occurs by June 30, 2026, 11:59 PM ET, this market will resolve to the lowest bracket.

Why this category?

confidence: medium

Category

AI

Source

Keyword rule

Matched term

claude

Reason

Question text contains "claude" — matched the AI keyword rule.

Categories come from a deterministic engine: manual overrides (highest priority) → sports hard markers → event-tag rules → keyword rules → Gamma category hint → default. The engine is versioned in category-overrides.ts; methodology lives at /methodology.

FAQ — questions readers actually ask

What is the current Polymarket probability for "Will the next Claude Opus 4.8 model added to the Arena Leaderboard debut at a score between 1510 and 1520?"?

As of Wed, 03 Jun 2026 00:57:44 GMT, YES is priced at 16% implied probability on Polymarket. The price changed +10.5pp in the last 24 hours, -2.5pp in the last hour, and +0.0pp in the last 7 days.

When does this market resolve?

Resolution date is not yet set on Polymarket. Settlement source when posted: the market description on Polymarket.

What source determines the outcome?

Resolution is sourced from the market description on Polymarket. Polymarket's UMA optimistic oracle is the final settlement layer; if the published source is ambiguous, UMA tokenholders adjudicate. Source-extraction confidence is shown in the Resolution & Risk block above.

How much is being traded on this market?

$2.39 of trading volume in the last 24 hours. Lifetime volume on Polymarket: $2.0K. Open liquidity in the YES/NO orderbooks: $228.62. Spread between best bid and best ask: 23.0¢.

Is this a trade recommendation?

No. Orrery describes — never predicts. Every signal on this market carries explicit Evidence, Backtest, and Action tiers. The Action is always one of: Watch only, Inspect timeline, Create alert, Verify source, or Ignore — never Buy or Sell. The probability above is the market's collective implied probability, not a forecast Orrery is publishing.

How fresh is this data and where does it come from?

This page revalidates from the public Polymarket APIs every 30 seconds. Probability and 24h-change come from Gamma; the chart series comes from the CLOB orderbook history; trade and holder data come from the Data API. The fetched-at timestamp on each block tells you exactly how old the underlying upstream pull was.

Alerts

¢
Deliver

In-app banners fire as soon as a rule is satisfied. Email digests are wired to a server-side cron and continue while your tabs are closed. Telegram and Discord are planned — every existing rule will keep working as channels light up.