
At a glance

Good enough on 12 of 35 tasks at the 90% bar. Cheapest qualifier on 0 tasks. Does not qualify on any task in one category: Content Summarization & Synthesis.

Provider: Anthropic
Model name: claude-sonnet-4-6
Qualifies on: 12 / 35 tasks (at 90% bar)
Cheapest qualifier on: 0 tasks

Cost vs quality across all tasks

[Figure: scatter plot of quality score (0–10, x-axis) against blended cost / call (y-axis, ticks at $0.00064, $0.00306, $0.01467, $0.07030, $0.33686), one point per task. Points marked (anchor): Claim-Referenced Analyst Writing (pooled), LLM Prompt Adaptation, synthesis_analysis, Trading Recommendation. Per-task quality and cost values are listed under Per-task breakdown.]

Legend: qualifies at 90% bar · doesn't qualify · ★ this model is the best on that task. Lower and further right means cheaper and higher quality. The y-axis is log-scaled.
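
The five cost tick labels on the chart ($0.00064, $0.00306, $0.01467, $0.07030, $0.33686) appear to be evenly spaced on a logarithmic scale between the lowest and highest per-call costs in the per-task table ($0.000639 and $0.336858). A minimal sketch of that spacing, assuming five geometrically spaced ticks; the function name `log_ticks` is illustrative and not taken from the dashboard:

```python
import math


def log_ticks(lo: float, hi: float, count: int = 5) -> list[float]:
    """Return `count` values evenly spaced on a log scale from lo to hi."""
    if lo <= 0 or hi <= lo or count < 2:
        raise ValueError("need 0 < lo < hi and at least two ticks")
    # Equal steps in log space correspond to a constant ratio between ticks.
    step = (math.log(hi) - math.log(lo)) / (count - 1)
    return [math.exp(math.log(lo) + i * step) for i in range(count)]


# Endpoints assumed to be the cheapest and most expensive cost / call values.
ticks = log_ticks(0.000639, 0.336858)
print([f"${t:.5f}" for t in ticks])
```

Each tick is roughly 4.79 times the previous one, which is why equal vertical distances on the chart correspond to equal cost ratios and not equal dollar differences.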

Per-task breakdown

| Category | Task | Quality | Confidence | Cost / call | vs best | Qualifies @ 90% |
| --- | --- | --- | --- | --- | --- | --- |
| Structured Data & Fact Extraction | claim_extraction (autogenerated) | 6.71 | high · n=100 | $0.008574 | -1467% | no |
| Structured Data & Fact Extraction | Generic TOC Extraction | 0.00 | low · n=0 | $0.005250 | | no |
| Structured Data & Fact Extraction | region_identification (autogenerated) | 9.22 | high · n=25 | $0.012538 | -84% | yes |
| Structured Data & Fact Extraction | S1 TOC extraction | 0.00 | low · n=0 | $0.020472 | | no |
| Structured Data & Fact Extraction | structured_output_extraction (autogenerated) | 8.65 | medium · n=91 | $0.024918 | -712% | no |
| Financial Analysis & Trading Decisions | onboarding_prospect_analysis (autogenerated) | 0.00 | low · n=0 | $0.013776 | | no |
| Financial Analysis & Trading Decisions | SEC Filling Analysis | 7.49 | medium · n=100 | $0.036777 | 49% | no |
| Financial Analysis & Trading Decisions | sec-s1-chunk-analysis (autogenerated) | 7.84 | medium · n=81 | $0.060492 | 47% | no |
| Financial Analysis & Trading Decisions | synthesis_analysis (autogenerated) | 8.95 | high · n=29 | $0.065295 | best | yes |
| Financial Analysis & Trading Decisions | Trading Recommendation | 8.79 | high · n=86 | $0.138282 | best | yes |
| Infrastructure & Utility | claim_refinement (autogenerated) | 6.34 | medium · n=100 | $0.008758 | -321% | no |
| Infrastructure & Utility | image_prompt_generation (autogenerated) | 8.13 | ranked · n=100 | $0.016078 | 40% | no |
| Infrastructure & Utility | LLM Prompt Adaptation | 9.10 | ranked · n=100 | $0.044344 | best | yes |
| Infrastructure & Utility | markdown_newline_repair (autogenerated) | 0.00 | low · n=0 | $0.336858 | | no |
| Infrastructure & Utility | metadata_paragraph_improvement (autogenerated) | 5.30 | medium · n=90 | $0.003222 | -296% | no |
| Infrastructure & Utility | onboarding_chapter_prompt_generation (autogenerated) | 8.58 | ranked · n=75 | $0.218102 | 50% | no |
| Infrastructure & Utility | query_validation (autogenerated) | 8.12 | medium · n=100 | $0.005330 | -3054% | no |
| Infrastructure & Utility | Translation | 7.31 | high · n=90 | $0.016206 | 49% | no |
| Long-form Content Generation | author_soul_generation (autogenerated) | 9.18 | ranked · n=84 | $0.071880 | 40% | yes |
| Long-form Content Generation | Claim-Referenced Analyst Writing (pooled) | 8.98 | high · n=84 | $0.052576 | best | yes |
| Long-form Content Generation | onboarding_chapter_generation (autogenerated) | 0.00 | low · n=0 | $0.096564 | | no |
| Long-form Content Generation | section_generation (autogenerated) | 8.79 | medium · n=6 | $0.018748 | -288% | yes |
| Long-form Content Generation | Substack Newsletter (pooled) | 8.89 | low · n=5 | $0.033012 | 40% | no |
| Long-form Content Generation | theme_generation (autogenerated) | 0.00 | low · n=0 | $0.031644 | | no |
| Relevance, Classification & Matching | at_content_domain_suggest (autogenerated) | 0.00 | low · n=0 | $0.008716 | | no |
| Relevance, Classification & Matching | Author Matching | 7.70 | high · n=100 | $0.021486 | 13% | no |
| Relevance, Classification & Matching | Language Detection | 9.99 | ranked · n=82 | $0.000639 | -471% | yes |
| Relevance, Classification & Matching | subreddit_selection (autogenerated) | 5.03 | high · n=63 | $0.011856 | 44% | no |
| Relevance, Classification & Matching | subreddit_vetting (autogenerated) | 0.00 | low · n=0 | $0.018075 | | no |
| Relevance, Classification & Matching | topic_client_matching (autogenerated) | 6.78 | high · n=100 | $0.057693 | -1770% | no |
| Relevance, Classification & Matching | vetted_site_selection (autogenerated) | 7.80 | low · n=1 | $0.069537 | | no |
| Relevance, Classification & Matching | x_post_selection (autogenerated) | 8.41 | ranked · n=100 | $0.009684 | 41% | yes |
| Social & Promotional Content | activity_promo_generation (autogenerated) | 8.50 | medium · n=63 | $0.006924 | 40% | yes |
| Social & Promotional Content | auto_reddit_post_generation (autogenerated) | 7.85 | high · n=100 | $0.184620 | -268% | yes |
| Social & Promotional Content | Social Post Promo (pooled) | 7.87 | high · n=94 | $0.010824 | -7859% | no |
| Social & Promotional Content | x_com_messages_for_promotion (autogenerated) | 0.00 | low · n=0 | $0.096711 | | no |
| Content Summarization & Synthesis | content-summarization (autogenerated) | 6.62 | high · n=100 | $0.025866 | -1161% | no |
| Content Summarization & Synthesis | Direct Browse Content Synthesis | 0.00 | low · n=0 | $0.007539 | | no |
| Content Summarization & Synthesis | executive_summary_generation (autogenerated) | 9.49 | low · n=2 | $0.107124 | | no |
| Content Summarization & Synthesis | synthesis_of_titles_for_publication (autogenerated) | 7.82 | high · n=68 | $0.018027 | -80% | no |
| Topic Organization & Clustering | ps_section_reassignment (autogenerated) | 0.00 | low · n=0 | $0.026166 | | no |
| Topic Organization & Clustering | Topic Discovery Clustering (pooled) | 6.05 | high · n=100 | $0.183104 | 40% | no |
| Topic Organization & Clustering | topic_cluster_naming (autogenerated) | 8.20 | ranked · n=94 | $0.020378 | 42% | yes |
| Topic Organization & Clustering | topic_clustering_assign_sections (autogenerated) | 8.13 | high · n=100 | $0.003898 | 41% | no |
| Topic Organization & Clustering | topic_sequence_determination (autogenerated) | 0.00 | low · n=0 | $0.020612 | | no |
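
The at-a-glance figures can be cross-checked against the table. The sketch below assumes that any row whose Qualifies @ 90% entry is anything other than "no" counts as qualifying (this is an inference from the table, not something the page states); the names `qualifying`, `total_qualifying`, and `empty_categories` are illustrative only.

```python
# Tasks per category whose "Qualifies @ 90%" entry is not "no".
# Assumption: a row that is not marked "no" qualifies at the 90% bar.
qualifying = {
    "Structured Data & Fact Extraction": ["region_identification"],
    "Financial Analysis & Trading Decisions": [
        "synthesis_analysis",
        "Trading Recommendation",
    ],
    "Infrastructure & Utility": ["LLM Prompt Adaptation"],
    "Long-form Content Generation": [
        "author_soul_generation",
        "Claim-Referenced Analyst Writing (pooled)",
        "section_generation",
    ],
    "Relevance, Classification & Matching": [
        "Language Detection",
        "x_post_selection",
    ],
    "Social & Promotional Content": [
        "activity_promo_generation",
        "auto_reddit_post_generation",
    ],
    "Content Summarization & Synthesis": [],
    "Topic Organization & Clustering": ["topic_cluster_naming"],
}

# Total number of qualifying tasks across all categories.
total_qualifying = sum(len(tasks) for tasks in qualifying.values())

# Categories in which no task qualifies.
empty_categories = [name for name, tasks in qualifying.items() if not tasks]

print(total_qualifying)   # 12
print(empty_categories)   # ['Content Summarization & Synthesis']
```

Under that assumption the tally reproduces the summary: 12 qualifying tasks, with Content Summarization & Synthesis as the only category with none.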