by @querulous-deer
This benchmark evaluates model performance on tool calling for the ClawHub: session-logs skill.
Best in Class
Overpriced
Your Selection
Click on the model to make selection
| Config | Total Cost | Quality | Username | Position | |
|---|---|---|---|---|---|
| arcee-ai/trinity-mini:free | $0.0000 | 50.0% | @querulous-deer | #1 | |
| qwen/qwen3-next-80b-a3b-instru... | $0.0000 | 35.7% | @querulous-deer | #2 | |
| meta-llama/llama-3.1-405b-inst... | $0.0000 | 35.7% | @querulous-deer | #2 | |
| meta-llama/llama-3.3-70b-instr... | $0.0000 | 21.4% | @querulous-deer | #4 | |
| qwen/qwen3-vl-235b-a22b-instru... | $0.0048 | 21.4% | @querulous-deer | #4 | |
| google/gemini-2.5-flash | $0.0071 | 14.3% | @querulous-deer | #6 | |
| xiaomi/mimo-v2-flash:free | $0.0000 | 7.1% | @querulous-deer | #7 | |
| openai/gpt-oss-20b:free | $0.0000 | 0.0% | @querulous-deer | #8 | |
| openai/gpt-oss-120b:free | $0.0000 | 0.0% | @querulous-deer | #8 | |
| moonshotai/kimi-k2:free | $0.0000 | 0.0% | @querulous-deer | #8 | |
| mistralai/mistral-small-3.1-24... | $0.0000 | 0.0% | @querulous-deer | #8 | |
| meta-llama/llama-3.2-3b-instru... | $0.0000 | 0.0% | @querulous-deer | #8 | |
| z-ai/glm-4.5-air:free | $0.0000 | 0.0% | @querulous-deer | #8 | |
| x-ai/grok-4.1-fast | $0.0248 | 0.0% | @querulous-deer | #8 | |
| mistralai/mistral-large | $0.0431 | 0.0% | @querulous-deer | #8 | |
| arcee-ai/virtuoso-large | $0.0000 | 0.0% | @querulous-deer | #8 |
| Total Cost | Quality | |
|---|---|---|
| arcee-ai/trinity-mini:free | $0.0000 | 50.0% |
| qwen/qwen3-next-80b-a3b-instru... | $0.0000 | 35.7% |
| meta-llama/llama-3.1-405b-inst... | $0.0000 | 35.7% |
| meta-llama/llama-3.3-70b-instr... | $0.0000 | 21.4% |
| qwen/qwen3-vl-235b-a22b-instru... | $0.0048 | 21.4% |
| google/gemini-2.5-flash | $0.0071 | 14.3% |
| xiaomi/mimo-v2-flash:free | $0.0000 | 7.1% |
| openai/gpt-oss-20b:free | $0.0000 | 0.0% |
| openai/gpt-oss-120b:free | $0.0000 | 0.0% |
| moonshotai/kimi-k2:free | $0.0000 | 0.0% |
| mistralai/mistral-small-3.1-24... | $0.0000 | 0.0% |
| meta-llama/llama-3.2-3b-instru... | $0.0000 | 0.0% |
| z-ai/glm-4.5-air:free | $0.0000 | 0.0% |
| x-ai/grok-4.1-fast | $0.0248 | 0.0% |
| mistralai/mistral-large | $0.0431 | 0.0% |
| arcee-ai/virtuoso-large | $0.0000 | 0.0% |
Loading prompt execution data...