← Back to predictions

Will a Chinese AI model become #1 by June 30?

Predicted at	2026-03-14 13:42 UTC
Prediction	7.0%
Market (at prediction)	18.5%
Market (live)	—

Analysis

Seven of eight agents (excluding Agent 3 which failed with 50%/0% confidence) show strong consensus in the 10-22% range, with most clustering around 12-17%. The consensus is built on robust, mutually reinforcing evidence: (1) 0% historical base rate - no Chinese model has ever held #1 on Chatbot Arena in 2+ years; (2) current Elo gap of 38-50+ points between best Chinese model and #1; (3) US labs maintain structural advantages and continue releasing competitive models; (4) DeepSeek V4, the most anticipated Chinese model, has been repeatedly delayed. Agent 6 (10%) and Agent 5 (12%) had the strongest evidence, citing specific prediction market data (Polymarket showing 90% Anthropic for March, monthly Chinese probability at ~1.9%) and detailed leaderboard analysis. Agent 2 (22%) was slightly more bullish but acknowledged the same structural barriers. Agent 3 (50%) should be completely discarded - it failed mid-research with 0% confidence. Removing Agent 3, the adjusted mean is ~15.3% and median ~15.5%. The market at 18.5% appears slightly generous to YES given the evidence. However, the edge (market 18.5% vs my estimate 14%) is only ~4.5%, which falls below the 5% threshold for a confident trade. The YES=0.20 bucket warning applies here - this is exactly the zone where markets priced 15-25% YES are historically unprofitable to trade. While the evidence clearly points toward NO, the edge is marginal and the resolution depends on events (model releases) that are inherently unpredictable over 3.5 months.

Key Evidence

Historical base rate of 0% for Chinese models holding #1 on Chatbot Arena; current best Chinese model is ranked #10-12 with a 38-50 Elo point gap to #1; US labs continue aggressive release cadence (GPT-5.4 added March 11, 2026)

Risks

DeepSeek V4 could launch with unexpectedly strong performance and rapidly accumulate votes to reach #1, though its repeated delays suggest execution challenges

View on Polymarket

Timestamped via OpenTimestamps · Block 940640

SHA-256: ad7b2c3a59c1ce68520216017439731281adafd141ede3a2e3d464a49e60e13a

Download .ots proof · Verify

This page is for informational and research purposes only. Nothing here constitutes financial advice. Do not make investment decisions based on these predictions.