Futures
Access hundreds of perpetual contracts
TradFi
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Introduction to Futures Trading
Learn the basics of futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to practice risk-free trading
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
Launchpad
Be early to the next big token project
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
8 Top AI Models Compete in Live Trading: Only Grok 4.20 Made a Profit, The Worst Lost Half
[BlockBeats] I recently saw a pretty interesting AI quantitative trading competition result. The new season kicked off on the 20th last month, and looking at it now, the performance of 8 top AI models was worlds apart.
Bottom line: only Grok 4.20 made money, with a 22.27% return. Even Musk couldn’t help but tweet a boast that “our Grok is the strongest trader,” and joked that the GPU bill is covered now, haha.
The others got wrecked: GPT-5.1 had a small loss of 1.41% and was relatively steady, but GEMINI-3-PRO went straight to -24.28%, and DeepSeek-3.1 was about the same at -24.51%. Kimi 2, Qwen 3-MAX, and Claude-sonnet-4-5 hovered between -25% and -32%. The worst was Grok 4—its sibling—at -52.45% at the bottom, a sharp contrast to Grok 4.20.
This season’s setup had the AIs putting real money into tokenized U.S. stocks on trade.xyz. Each round had a different theme, but all models received the same information. Under this setup, differences in algorithms were massively amplified. Looks like in AI quantitative trading, the fine details of model optimization can be make-or-break.
Musk's marketing is really impressive this time—one model makes money while all the others lose. These numbers are just too wild, haha.
GPT is still stable, but looking at the other data is just painful. Quantitative trading really is a high-risk, high-failure game.
Grok 4 is basically endorsing Grok 4.20, holding down that bottom spot...
After these rankings come out, I wonder if some people will start going all in on Grok. Feels pretty risky.
The other models losing only 24% is actually not too bad, compared to that -52%...