Svelte Bench

60 Questions of Svelte (LLM Judged)

Coding · ↑ Higher is better · 7 models tested · Test size: 60

Top Performing Models

Models with the best performance on Svelte Bench

#1 GPT 5 Mini (OpenAI): 75.2 accuracy percentage
#2 GPT 5 Mini High (OpenAI): 65.8 accuracy percentage
#3 Qwen 3 Max (Alibaba): 54.5 accuracy percentage

[Chart: Model Performance. 60 Questions of Svelte (LLM Judged)]

[Chart: Price vs Performance. How model cost relates to performance on this benchmark]

Model Details for Svelte Bench

Complete performance breakdown for all models tested on Svelte Bench

Model                     Provider          Accuracy (%)   Other values (column breaks lost in source)
Claude 4.5 Sonnet         Anthropic         53.5           13.162.0707.1200,000
GLM-4.6                   Zhipu AI (Z.ai)   49.7           8.952.0701.9200,000
Qwen 3 Coder 480B A35B    Alibaba (Qwen)    52.0           12.630.01500.64262,000
Qwen 3 Max                Alibaba (Qwen)    54.5           6.739.5540.38256,000
GPT 5 Mini                OpenAI            75.2           2.343.2413.46400,000
GPT 5 Mini High           OpenAI            65.8           3.442.7474.35400,000
Grok Code Fast 1          xAI (Grok)        45.8           15.837.7840.93256,000
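
As a quick consistency check, the accuracy scores listed for each model can be sorted to reproduce the top-three ranking shown earlier. The sketch below is illustrative only: the dictionary is copied by hand from the leading score on each model's row, and the `rank` helper is a hypothetical name, not part of the benchmark's tooling.

```python
# Accuracy percentages copied from the per-model rows above
# (the leading value of each row; transcribed by hand).
scores = {
    "Claude 4.5 Sonnet": 53.5,
    "GLM-4.6": 49.7,
    "Qwen 3 Coder 480B A35B": 52.0,
    "Qwen 3 Max": 54.5,
    "GPT 5 Mini": 75.2,
    "GPT 5 Mini High": 65.8,
    "Grok Code Fast 1": 45.8,
}


def rank(scores):
    """Return (model, score) pairs sorted from highest to lowest score."""
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


ranking = rank(scores)
for place, (model, score) in enumerate(ranking, start=1):
    print(f"#{place} {model}: {score}")
```

Sorting on the score alone is enough here because no two models share the same accuracy value.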

Benchmark Information

Type: custom
Units: accuracy percentage
Max Score: 100
Test Size: 60