Cost mode:

At a glance

Good enough on 14/35 tasks at the 95% bar. Cheapest qualifier on 1 task. Doesn't qualify on any: Content Summarization & Synthesis.

Provider
Anthropic
Model name
claude-opus-4-7
Qualifies on
14 / 35 tasks (at 90% bar)
Cheapest qualifier on
1 tasks

Cost vs quality across all tasks

$0.00537$0.01717$0.05491$0.17558$0.561430246810Quality score (0–10)Blended cost / callactivity_promo_generation autogenerated — quality 8.65, cost $0.011540 (anchor)at_content_domain_suggest autogenerated — quality 0.00, cost $0.014528Author Matching — quality 7.17, cost $0.035810author_soul_generation autogenerated — quality 9.53, cost $0.119800 (anchor)auto_reddit_post_generation autogenerated — quality 8.18, cost $0.307700claim_extraction autogenerated — quality 7.04, cost $0.014290claim_refinement autogenerated — quality 6.66, cost $0.014598Claim-Referenced Analyst Writing (pooled) — quality 8.82, cost $0.087628content-summarization autogenerated — quality 6.96, cost $0.043110Direct Browse Content Synthesis — quality 0.00, cost $0.012565executive_summary_generation autogenerated — quality 8.74, cost $0.178540Generic TOC Extraction — quality 0.00, cost $0.008750image_prompt_generation autogenerated — quality 8.57, cost $0.026798 (anchor)LLM Prompt Adaptation — quality 8.83, cost $0.073908markdown_newline_repair autogenerated — quality 1.50, cost $0.561430metadata_paragraph_improvement autogenerated — quality 4.48, cost $0.005370onboarding_chapter_generation autogenerated — quality 0.00, cost $0.160940onboarding_chapter_prompt_generation autogenerated — quality 8.98, cost $0.363502onboarding_prospect_analysis autogenerated — quality 0.00, cost $0.022960ps_section_reassignment autogenerated — quality 0.00, cost $0.043610region_identification autogenerated — quality 9.15, cost $0.020898Relevance Scoring (POST) — quality 0.00, cost $0.008750Relevance Scoring (Topic Report) — quality 0.00, cost $0.008750Relevance Scoring (X Post) — quality 0.00, cost $0.008750S1 TOC extraction — quality 0.00, cost $0.034120SEC Filling Analysis — quality 7.92, cost $0.061295sec-s1-chunk-analysis autogenerated — quality 8.19, cost $0.100820section_generation autogenerated — quality 9.06, cost $0.031248Social Post Promo (pooled) — quality 7.76, cost $0.018040structured_output_extraction autogenerated — quality 9.03, cost $0.041530subreddit_selection autogenerated — quality 5.24, cost $0.019760subreddit_vetting autogenerated — quality 0.00, cost $0.030125Substack Newsletter (pooled) — quality 9.51, cost $0.055020 (anchor)synthesis_analysis autogenerated — quality 8.77, cost $0.108825synthesis_of_titles_for_publication autogenerated — quality 8.09, cost $0.030045theme_generation autogenerated — quality 0.00, cost $0.052740Topic Discovery Clustering (pooled) — quality 8.77, cost $0.305172 (anchor)topic_client_matching autogenerated — quality 7.40, cost $0.096155topic_cluster_naming autogenerated — quality 7.88, cost $0.033962topic_sequence_determination autogenerated — quality 0.00, cost $0.034352Trading Recommendation — quality 8.75, cost $0.230470vetted_site_selection autogenerated — quality 0.00, cost $0.115895x_com_messages_for_promotion autogenerated — quality 0.00, cost $0.161185x_post_selection autogenerated — quality 8.31, cost $0.016140

qualifies at 90% bar · doesn't qualify · ★ this model is the best on that task. Lower + further right = cheaper + higher quality. Y-axis is log-scaled.

Per-task breakdown

CategoryTaskQualityConfidenceCost / callvs bestQualifies @ 90%
Structured Data & Fact Extractionclaim_extraction autogenerated7.04high · n=100$0.014290-2512%no
Structured Data & Fact ExtractionGeneric TOC Extraction0.00low · n=0$0.008750no
Structured Data & Fact Extractionregion_identification autogenerated9.15ranked · n=25$0.020898-207%
Structured Data & Fact ExtractionS1 TOC extraction0.00low · n=0$0.034120no
Structured Data & Fact Extractionstructured_output_extraction autogenerated9.03ranked · n=83$0.041530-1254%no
Financial Analysis & Trading Decisionsonboarding_prospect_analysis autogenerated0.00low · n=0$0.022960no
Financial Analysis & Trading DecisionsSEC Filling Analysis7.92high · n=100$0.06129516%no
Financial Analysis & Trading Decisionssec-s1-chunk-analysis autogenerated8.19high · n=81$0.10082012%no
Financial Analysis & Trading Decisionssynthesis_analysis autogenerated8.77high · n=25$0.108825-67%
Financial Analysis & Trading DecisionsTrading Recommendation8.75ranked · n=86$0.230470-67%
Infrastructure & Utilityclaim_refinement autogenerated6.66medium · n=100$0.014598-602%no
Infrastructure & Utilityimage_prompt_generation autogenerated8.57ranked · n=100$0.026798best
Infrastructure & UtilityLLM Prompt Adaptation8.83ranked · n=100$0.073908-67%
Infrastructure & Utilitymarkdown_newline_repair autogenerated1.50low · n=1$0.561430no
Infrastructure & Utilitymetadata_paragraph_improvement autogenerated4.48medium · n=90$0.005370-560%no
Infrastructure & Utilityonboarding_chapter_prompt_generation autogenerated8.98high · n=75$0.36350216%
Long-form Content Generationauthor_soul_generation autogenerated9.53ranked · n=84$0.119800best
Long-form Content GenerationClaim-Referenced Analyst Writing (pooled)8.82high · n=100$0.087628-67%
Long-form Content Generationonboarding_chapter_generation autogenerated0.00low · n=0$0.160940no
Long-form Content Generationsection_generation autogenerated9.06high · n=6$0.031248-547%
Long-form Content GenerationSubstack Newsletter (pooled)9.51ranked · n=5$0.055020best
Long-form Content Generationtheme_generation autogenerated0.00low · n=0$0.052740no
Relevance, Classification & Matchingat_content_domain_suggest autogenerated0.00low · n=0$0.014528no
Relevance, Classification & MatchingAuthor Matching7.17high · n=100$0.035810-45%no
Relevance, Classification & MatchingRelevance Scoring (POST)0.00low · n=0$0.008750-119%no
Relevance, Classification & MatchingRelevance Scoring (Topic Report)0.00low · n=0$0.008750-5369%no
Relevance, Classification & MatchingRelevance Scoring (X Post)0.00low · n=0$0.008750no
Relevance, Classification & Matchingsubreddit_selection autogenerated5.24high · n=63$0.0197607%no
Relevance, Classification & Matchingsubreddit_vetting autogenerated0.00low · n=0$0.030125no
Relevance, Classification & Matchingtopic_client_matching autogenerated7.40high · n=100$0.096155-3016%no
Relevance, Classification & Matchingvetted_site_selection autogenerated0.00low · n=0$0.115895no
Relevance, Classification & Matchingx_post_selection autogenerated8.31high · n=100$0.0161402%
Social & Promotional Contentactivity_promo_generation autogenerated8.65medium · n=63$0.011540best
Social & Promotional Contentauto_reddit_post_generation autogenerated8.18high · n=100$0.307700-513%
Social & Promotional ContentSocial Post Promo (pooled)7.76medium · n=94$0.018040-13165%no
Social & Promotional Contentx_com_messages_for_promotion autogenerated0.00low · n=0$0.161185no
Content Summarization & Synthesiscontent-summarization autogenerated6.96high · n=100$0.043110-2001%no
Content Summarization & SynthesisDirect Browse Content Synthesis0.00low · n=0$0.012565no
Content Summarization & Synthesisexecutive_summary_generation autogenerated8.74low · n=2$0.178540no
Content Summarization & Synthesissynthesis_of_titles_for_publication autogenerated8.09high · n=68$0.030045-199%no
Topic Organization & Clusteringps_section_reassignment autogenerated0.00low · n=0$0.043610no
Topic Organization & ClusteringTopic Discovery Clustering (pooled)8.77high · n=98$0.305172best
Topic Organization & Clusteringtopic_cluster_naming autogenerated7.88ranked · n=94$0.0339623%no
Topic Organization & Clusteringtopic_sequence_determination autogenerated0.00low · n=0$0.034352no