by @querulous-deer
This benchmark evaluates model performance on tool calling for the ClawHub: elevenlabs skill.
| Config | Total Cost | Quality | Username | Position |
|---|---|---|---|---|
| meta-llama/llama-3.3-70b-instr... | $0.0000 | 76.5% | @querulous-deer | #1 |
| qwen/qwen3-vl-235b-a22b-instru... | $0.0132 | 73.5% | @querulous-deer | #2 |
| meta-llama/llama-3.1-405b-inst... | $0.0000 | 44.1% | @querulous-deer | #3 |
| mistralai/mistral-large | $0.1258 | 38.2% | @querulous-deer | #4 |
| arcee-ai/trinity-mini:free | $0.0000 | 26.5% | @querulous-deer | #5 |
| x-ai/grok-4.1-fast | $0.0320 | 26.5% | @querulous-deer | #5 |
| z-ai/glm-4.5-air:free | $0.0000 | 17.6% | @querulous-deer | #7 |
| qwen/qwen3-next-80b-a3b-instru... | $0.0000 | 17.6% | @querulous-deer | #7 |
| xiaomi/mimo-v2-flash:free | $0.0000 | 14.7% | @querulous-deer | #9 |
| openai/gpt-oss-20b:free | $0.0000 | 11.8% | @querulous-deer | #10 |
| moonshotai/kimi-k2:free | $0.0000 | 5.9% | @querulous-deer | #11 |
| google/gemini-2.5-flash | $0.0188 | 2.9% | @querulous-deer | #12 |
| qwen/qwen-2.5-vl-7b-instruct:f... | $0.0000 | 0.0% | @querulous-deer | #13 |
| openai/gpt-oss-120b:free | $0.0000 | 0.0% | @querulous-deer | #13 |
| mistralai/mistral-small-3.1-24... | $0.0000 | 0.0% | @querulous-deer | #13 |
| meta-llama/llama-3.2-3b-instru... | $0.0000 | 0.0% | @querulous-deer | #13 |
| anthropic/claude-sonnet-4.5 | $0.2836 | 0.0% | @querulous-deer | #13 |
| openai/gpt-5.2 | $0.1830 | 0.0% | @querulous-deer | #13 |
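
The Position values in the table are consistent with standard competition ranking on Quality: configs with equal Quality share a position, and the next position skips ahead (#5, #5, #7). The page does not state how positions are computed, so the following is only a minimal sketch under that assumption; the function name `competition_rank` is hypothetical and the scores are copied from the table.

```python
def competition_rank(scores):
    """Return 1-based positions; equal scores share a position and the next one skips."""
    ordered = sorted(scores, reverse=True)
    first_position = {}
    for position, score in enumerate(ordered, start=1):
        # Keep only the first (best) position seen for each distinct score.
        first_position.setdefault(score, position)
    return [first_position[score] for score in scores]


# Quality percentages in table order (assumed to be the only ranking input).
quality = [76.5, 73.5, 44.1, 38.2, 26.5, 26.5, 17.6, 17.6, 14.7,
           11.8, 5.9, 2.9, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]

positions = competition_rank(quality)
print(positions)
```

Under this assumption, cost plays no role in the ordering: the two configs at 26.5% share #5 even though one is free and the other cost $0.0320.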