AutoBench LLM Leaderboard

Interactive leaderboard for AutoBench, where LLMs rank LLMs' responses. Includes performance, cost, and latency metrics.Data updated on April 25, 2025.

More info for this benchmark run: AutoBench Run 2 Results. If you want to know more about AutoBench: AutoBench Release.

Overall Model Performance

Models ranked by AutoBench score. Lower cost ($ Cents) and latency (s) are better.

llama-4-Maverick-17B-128E-Instruct-FP8
4.57
0.793
36.57
223.47