by @querulous-deer
This benchmark evaluates model performance on tool calling for the ClawHub: gcalcli-calendar skill.
| Config | Total Cost | Quality | Username | Position |
|---|---|---|---|---|
| qwen/qwen3-vl-235b-a22b-instru... | $0.0085 | 84.6% | @querulous-deer | #1 |
| xiaomi/mimo-v2-flash:free | $0.0000 | 73.1% | @querulous-deer | #2 |
| qwen/qwen3-next-80b-a3b-instru... | $0.0000 | 69.2% | @querulous-deer | #3 |
| moonshotai/kimi-k2:free | $0.0000 | 61.5% | @querulous-deer | #4 |
| z-ai/glm-4.5-air:free | $0.0000 | 61.5% | @querulous-deer | #4 |
| mistralai/mistral-large | $0.0799 | 61.5% | @querulous-deer | #4 |
| google/gemini-2.5-flash | $0.0122 | 61.5% | @querulous-deer | #4 |
| openai/gpt-oss-20b:free | $0.0000 | 58.3% | @querulous-deer | #8 |
| x-ai/grok-4.1-fast | $0.0165 | 57.7% | @querulous-deer | #9 |
| z-ai/glm-4.7 | $0.0226 | 53.8% | @querulous-deer | #10 |
| meta-llama/llama-3.3-70b-instr... | $0.0000 | 50.0% | @querulous-deer | #11 |
| meta-llama/llama-3.1-405b-inst... | $0.0000 | 50.0% | @querulous-deer | #11 |
| arcee-ai/trinity-mini:free | $0.0000 | 34.6% | @querulous-deer | #13 |
| openai/gpt-oss-120b:free | $0.0000 | 30.8% | @querulous-deer | #14 |
| qwen/qwen-2.5-vl-7b-instruct:f... | $0.0000 | 0.0% | @querulous-deer | #15 |
| mistralai/mistral-small-3.1-24... | $0.0000 | 0.0% | @querulous-deer | #15 |
| meta-llama/llama-3.2-3b-instru... | $0.0000 | 0.0% | @querulous-deer | #15 |
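
The Position values in the table look consistent with standard competition ranking on Quality: tied scores share a position and the next position skips ahead (four entries at #4, then #8). The sketch below reproduces that numbering from the Quality column. This is an inference from the table, not the site's documented method, and `competition_rank` is a hypothetical helper name.

```python
def competition_rank(scores):
    """Return 1-based ranks, highest score first.

    Ties share the rank of their first occurrence in the sorted order,
    and the following rank skips past them ("1, 2, 2, 4" style).
    """
    ordered = sorted(scores, reverse=True)
    first_position = {}
    for index, score in enumerate(ordered, start=1):
        # Record only the first (best) position seen for each score.
        first_position.setdefault(score, index)
    return [first_position[score] for score in scores]


# Quality percentages copied from the table, in table order.
quality = [84.6, 73.1, 69.2, 61.5, 61.5, 61.5, 61.5, 58.3, 57.7,
           53.8, 50.0, 50.0, 34.6, 30.8, 0.0, 0.0, 0.0]

positions = competition_rank(quality)
print(positions)
```

Under this assumption, Total Cost plays no part in tie-breaking: the four models at 61.5% all land at #4 whether they cost $0.0000 or $0.0799.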