OpenRouter Free Models - Who is Tallest?

by @querulous-deer

Height Comparison / Transitive Logic

This benchmark evaluates a model's ability to reason transitively and to keep a consistent internal model of relationships. Each task presents a sequential chain of comparisons (e.g., "A is taller than B, B is taller than C"), and the model must reconstruct the full ordering to identify the extreme (tallest or shortest). This tests structural comprehension rather than simple text retrieval.
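The reasoning the task asks for can be sketched as a small ordering problem. The example below is only an illustration, not the benchmark's own grader: the `(taller, shorter)` pair format, the function name `height_order`, and the sample names are all assumptions made here, since the actual prompts are not shown on this page. It uses the standard-library `graphlib` module (Python 3.9+).

```python
from graphlib import TopologicalSorter


def height_order(comparisons):
    """Return names ordered tallest-first.

    `comparisons` is a list of (taller, shorter) pairs. This is a
    hypothetical input format chosen for illustration; it is not taken
    from the benchmark itself.
    """
    sorter = TopologicalSorter()
    for taller, shorter in comparisons:
        # Declare `taller` as a predecessor of `shorter`, so taller
        # people come first in the resulting order.
        sorter.add(shorter, taller)
    # Raises graphlib.CycleError if the statements contradict each other.
    return list(sorter.static_order())


# The facts are deliberately given out of order, as a prompt might do.
facts = [("B", "C"), ("A", "B"), ("C", "D")]
order = height_order(facts)
tallest, shortest = order[0], order[-1]
print(order, tallest, shortest)
```

When the comparisons form a single chain, as in the example sentence above, the ordering is unique, so the first and last entries are the tallest and shortest. If the statements only define a partial order, several orderings are valid and the extremes may not be determined.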

Frontier View

Legend: Best in Class, Overpriced, Your Selection

Leaderboard

Position	User	Model Name	Config	Score	Avg Cost / 1M req	Quality


