Weights

Workload preset, Input tokens / run, Cached input tokens / run, Output tokens / run, Runs

Separates pure token-dollar efficiency from premium model preference.

Imported rows drive rankings; researched sources show the next best places to improve coverage and cost signal.

Each benchmark family shows current rows, pricing coverage, source status, and whether it is active in the score.

On the chart, higher and farther left means more benchmark performance per token dollar.

Select up to four rows to compare efficiency, score, and projected workload cost.
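
The page does not state how projected workload cost is computed. As a minimal sketch, assuming it is the per-run token counts from the workload fields multiplied by per-million-token prices and then by the number of runs, it could look like the following. The names Workload, Pricing, and projected_cost are hypothetical, and so is the assumption that cached input tokens are counted separately from regular input tokens.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    input_tokens: int         # input tokens per run
    cached_input_tokens: int  # cached input tokens per run (assumed separate from input_tokens)
    output_tokens: int        # output tokens per run
    runs: int                 # number of runs in the workload


@dataclass
class Pricing:
    # Hypothetical prices in dollars per one million tokens.
    input_per_m: float
    cached_input_per_m: float
    output_per_m: float


def projected_cost(workload: Workload, pricing: Pricing) -> float:
    """Return the projected dollar cost of the whole workload (assumed formula)."""
    per_run = (
        workload.input_tokens * pricing.input_per_m
        + workload.cached_input_tokens * pricing.cached_input_per_m
        + workload.output_tokens * pricing.output_per_m
    ) / 1_000_000
    return per_run * workload.runs


if __name__ == "__main__":
    w = Workload(input_tokens=1_000_000, cached_input_tokens=500_000,
                 output_tokens=200_000, runs=10)
    p = Pricing(input_per_m=2.0, cached_input_per_m=0.5, output_per_m=8.0)
    print(f"Projected workload cost: ${projected_cost(w, p):.2f}")
```

Under this assumption, changing the workload preset only rescales the token counts, so two models can swap places in cost when one is cheap on input tokens and the other is cheap on output tokens.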

Default rank is adjusted benchmark points per projected token dollar.
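
The default rank can be illustrated with a short sketch. The exact adjustment is not given on the page, so this assumes that "adjusted" means the composite score scaled by the fraction of benchmarks a model covers when the sparse-coverage penalty is on, and that efficiency is those adjusted points divided by the projected cost. Row, adjusted_points, efficiency, and rank_rows are hypothetical names, not the page's actual implementation.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Row:
    model: str
    composite: float         # composite benchmark score
    benchmarks_covered: int  # number of benchmarks with a result for this model
    profile_cost: float      # projected workload cost in dollars, assumed > 0


def adjusted_points(row: Row, total_benchmarks: int, penalize_sparse: bool = True) -> float:
    """Scale the composite score by coverage when the penalty is on (assumed rule)."""
    if not penalize_sparse:
        return row.composite
    return row.composite * (row.benchmarks_covered / total_benchmarks)


def efficiency(row: Row, total_benchmarks: int, penalize_sparse: bool = True) -> float:
    """Adjusted benchmark points per projected token dollar."""
    return adjusted_points(row, total_benchmarks, penalize_sparse) / row.profile_cost


def rank_rows(rows: List[Row], total_benchmarks: int, penalize_sparse: bool = True) -> List[Row]:
    """Sort rows from most to least efficient."""
    return sorted(
        rows,
        key=lambda r: efficiency(r, total_benchmarks, penalize_sparse),
        reverse=True,
    )


if __name__ == "__main__":
    rows = [
        Row("model-a", composite=80.0, benchmarks_covered=4, profile_cost=40.0),
        Row("model-b", composite=60.0, benchmarks_covered=2, profile_cost=10.0),
        Row("model-c", composite=70.0, benchmarks_covered=4, profile_cost=20.0),
    ]
    for position, row in enumerate(rank_rows(rows, total_benchmarks=4), start=1):
        print(position, row.model, round(efficiency(row, 4), 2))
```

With the penalty on, a cheap model that reports only half of the benchmarks loses half of its points, which is why it can fall below a costlier model with full coverage.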

Compare	Rank	Model	Composite	Evidence	Quality	Benchmarks	Published cost	Profile cost	Cost / point	Efficiency