Benchmark results
Raw OGS-Bench result documents, one per model. Each file conforms to benchmark-result.schema.json and is what the leaderboard is built from.
| File | Size |
|---|---|
baseline.json | 976 KB |
claude-haiku-4-5.json | 1417 KB |
claude-opus-4-7.json | 1373 KB |
claude-sonnet-4-6.json | 1468 KB |
gemini-2.5-flash-lite.json | 1362 KB |
gemini-2.5-flash.json | 1374 KB |
gemini-2.5-pro.json | 1361 KB |
gpt-4.1-mini.json | 1370 KB |
gpt-4.1-nano.json | 1327 KB |
gpt-4.1.json | 1395 KB |
gpt-5.4-mini.json | 1354 KB |
gpt-5.4-nano.json | 1417 KB |
gpt-5.4.json | 1396 KB |