Cost mode:

Category: Social & Promotional Content · Rail: absolute · Typical I/O: 6155→11077 tokens

Models

Frontier on this task: Kimi K2.6 at 8.26 / 10. Quality bar at 95%: 7.85.

024681095% barKimi K2.6$0.050155/call0% cheaperClaude Sonnet 4.6$0.184620/call-268% cheaperClaude Opus 4.7$0.307700/call-513% cheaperQwen 3.5 Flash$0.003065/call94% cheaperQwen 3.6 Plus$0.023601/call53% cheaperHaiku 4.5$0.061540/call-23% cheaperDeepSeek V4 Flash$0.003963/call92% cheaperDeepSeek V4 Pro$0.049258/call2% cheaperGemini 3 Flash Preview$0.036308/call28% cheaperGemini 3.1 Flash Lite$0.018154/call64% cheaperGemini 3.1 Pro Preview$0.145234/call-190% cheaperMiniMax M2.5$0.015139/call70% cheaperGPT-5.4 mini$0.054463/call-9% cheaperGPT-5.4 nano$0.015077/call70% cheaperGPT-5.5$0.363085/call-624% cheaper

point-estimate floor (CI low) · upper CI (less certain) · Bars sorted by blended cost; cheapest qualifier first. Greyed rows are MEDIUM+ models whose point estimate clears the bar but whose CI low does not.

Cost breakdown

ModelQualitySampleBlended cost / callSavings vs bestMode
Kimi K2.6 best Moonshot AI8.26 / 10 CI [7.96, 8.55]n=100 · high$0.050155(anchor)batch
Claude Sonnet 4.6 Anthropic7.85 / 10 CI [7.55, 8.15]n=100 · high$0.184620batch
Claude Opus 4.7 Anthropic8.18 / 10 CI [7.90, 8.45]n=100 · high$0.307700batch

Typical call shape for this task: 6155 input tokens → 11077 output tokens, EMA-tracked from production traffic. Blended cost = (in × in_price + out × out_price), rounded to 6 decimals.