• New Chat
  • Leaderboard
  • Search
Terms of UsePrivacy Policy
Start Voting
Overview
Agent
Start Voting
Agent

Min
Max

Min
Max

Min
Max

Min
Max

Code Arena | Image-to-WebDev

View overall rankings across AI models on their ability to generate websites from images and screenshots, alongside agentic coding workflows that involve multi-step reasoning and tool use.

May 14, 2026
31,859 votes
23 models
Rank by
Rank Spread
1
13
Anthropic
claude-opus-4-7-thinking
Anthropic · Proprietary
1581+15/-15
2,075$5 / $251M
2
16
Anthropic
claude-sonnet-4-6
Anthropic · Proprietary
1557+13/-13
3,158$3 / $151M
3
16
Anthropic
claude-opus-4-7
Anthropic · Proprietary
1556+14/-14
2,377$5 / $251M
4
28
Anthropic
claude-opus-4-6-thinking
Anthropic · Proprietary
1538+13/-13
2,997$5 / $251M
5
28
gpt-5.5-xhigh (codex-harness)
OpenAI · Proprietary
1537+15/-15
1,816N/AN/A
6
28
Anthropic
claude-opus-4-6
Anthropic · Proprietary
1534+13/-13
3,043$5 / $251M
7
48
kimi-k2.6
Moonshot · Modified MIT
1522+17/-17
1,451$0.95 / $4262.1K
8
48
gpt-5.5-high (codex-harness)
OpenAI · Proprietary
1519+15/-15
1,965N/AN/A
9
911
gemini-3.1-pro-preview
Google · Proprietary
1490+12/-12
3,597$2 / $121M
10
911
gpt-5.5 (codex-harness)
OpenAI · Proprietary
1489+15/-15
1,935N/AN/A
11
915
qwen3.6-plus
Alibaba · Proprietary
1467+13/-13
2,602$0.33 / $1.951M
12
1118
gemini-3-pro
Google · Proprietary
1453+20/-20
1,091$2 / $121M
13
1117
gemini-3-flash
Google · Proprietary
1447+10/-10
4,435$0.50 / $31M
14
1119
gpt-5.3-codex (codex-harness)
OpenAI · Proprietary
1441+14/-14
2,506$1.75 / $14400K
15
1119
kimi-k2.5-thinking
Moonshot · Modified MIT
1440+16/-16
1,740$0.60 / $3N/A
16
1219
gpt-5.4
OpenAI · Proprietary
1435+18/-18
1,220$2.50 / $151.1M
17
1420
gemini-3-flash (thinking-minimal)
Google · Proprietary
1421+10/-10
4,369$0.50 / $31M
18
1220
gpt-5.1-high
OpenAI · Proprietary
1421+20/-20
1,112$1.25 / $10400K
19
1320
kimi-k2.5-instant
Moonshot · Modified MIT
1415+20/-20
1,093$0.38 / $2.02262.1K
20
1720
grok-4.3
xAI · Proprietary
1396+21/-21
965$1.25 / $2.501M
21
2122
gpt-5.1
OpenAI · Proprietary
1344+19/-19
1,264$1.25 / $10400K
22
2122
gemini-3.1-flash-lite-preview
Google · Proprietary
1329+13/-13
3,742$0.25 / $1.501M
23
2323
gemini-2.5-pro
Google · Proprietary
1276+19/-19
1,185$1.25 / $101M

Remove Style Control Leaderboard Plots

Fraction of Model A Wins for All Non-tied A vs. B Battles

Confidence Intervals on Model Strength (via Bootstrapping)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Battle Count for Each Combination of Models (without Ties)

USE CASES

  • Chat with AI
  • Build Apps & Websites
  • Write & Edit Text
  • Search the Web
  • Generate Images
  • Generate Videos
  • Chose any model
  • Compare Models Side by Side

LEADERBOARD RANKINGS

  • Overall
  • Agent
  • Text
  • WebDev
  • Image-to-WebDev
  • Text to Image
  • Image Edit
  • Text to Video
  • Image to Video
  • Video Edit
  • Vision
  • Document
  • Search

COMPANY

  • About Us
  • How It Works
  • Blog
  • Careers
  • Changelog
  • Help Center
  • FAQ

LEGAL

  • Terms
  • Privacy
  • Cookies

FOLLOW

  • X
  • LinkedIn
  • YouTube
  • Discord

© Arena Intelligence 2026