Cost mode:

At a glance

Good enough on 15/35 tasks at the 95% bar. Cheapest qualifier on 1 task. Doesn't qualify on any: Financial Analysis & Trading Decisions, Content Summarization & Synthesis.

Provider
DeepSeek
Model name
deepseek-v4-pro
Qualifies on
15 / 35 tasks (at 90% bar)
Cheapest qualifier on
1 tasks

Cost vs quality across all tasks

$0.00062$0.00243$0.00960$0.03791$0.149680246810Quality score (0–10)Blended cost / callactivity_promo_generation autogenerated — quality 8.58, cost $0.002951at_content_domain_suggest autogenerated — quality 0.00, cost $0.008597Author Matching — quality 8.94, cost $0.024626 (anchor)author_living_check autogenerated — quality 9.31, cost $0.008554 (anchor)author_soul_generation autogenerated — quality 9.16, cost $0.017647auto_reddit_post_generation autogenerated — quality 7.51, cost $0.049258claim_extraction autogenerated — quality 7.45, cost $0.006793claim_refinement autogenerated — quality 6.76, cost $0.007247Claim-Referenced Analyst Writing (pooled) — quality 8.73, cost $0.042515content-summarization autogenerated — quality 7.51, cost $0.016344executive_summary_generation autogenerated — quality 9.61, cost $0.077440Generic TOC Extraction — quality 0.00, cost $0.003480image_prompt_generation autogenerated — quality 8.43, cost $0.009871Language Detection — quality 9.95, cost $0.000616LLM Prompt Adaptation — quality 8.73, cost $0.022865markdown_newline_repair autogenerated — quality 0.00, cost $0.083560metadata_paragraph_improvement autogenerated — quality 7.89, cost $0.001879onboarding_chapter_generation autogenerated — quality 0.00, cost $0.046514onboarding_chapter_prompt_generation autogenerated — quality 8.52, cost $0.105862onboarding_prospect_analysis autogenerated — quality 0.00, cost $0.008302ps_section_reassignment autogenerated — quality 0.00, cost $0.010170region_identification autogenerated — quality 9.10, cost $0.006699S1 TOC extraction — quality 0.00, cost $0.012206SEC Filling Analysis — quality 7.61, cost $0.018858sec-s1-chunk-analysis autogenerated — quality 8.04, cost $0.040490section_generation autogenerated — quality 8.81, cost $0.009403Social Post Promo (pooled) — quality 8.23, cost $0.004962structured_output_extraction autogenerated — quality 9.43, cost $0.008549subreddit_selection autogenerated — quality 4.87, cost $0.010626subreddit_vetting autogenerated — quality 9.30, cost $0.005436Substack Newsletter (pooled) — quality 9.20, cost $0.008206synthesis_analysis autogenerated — quality 8.28, cost $0.066482synthesis_of_titles_for_publication autogenerated — quality 8.07, cost $0.011301theme_generation autogenerated — quality 0.00, cost $0.026257Topic Discovery Clustering (pooled) — quality 7.42, cost $0.149677topic_client_matching autogenerated — quality 8.14, cost $0.038350topic_cluster_naming autogenerated — quality 8.27, cost $0.021492topic_clustering_assign_sections autogenerated — quality 7.99, cost $0.004188topic_sequence_determination autogenerated — quality 0.00, cost $0.013417Trading Recommendation — quality 7.70, cost $0.070294Translation — quality 7.79, cost $0.008824vetted_site_selection autogenerated — quality 0.00, cost $0.052308x_com_messages_for_promotion autogenerated — quality 9.60, cost $0.088194x_post_selection autogenerated — quality 8.04, cost $0.005345

qualifies at 90% bar · doesn't qualify · ★ this model is the best on that task. Lower + further right = cheaper + higher quality. Y-axis is log-scaled.

Per-task breakdown

CategoryTaskQualityConfidenceCost / callvs bestQualifies @ 90%
Structured Data & Fact Extractionclaim_extraction autogenerated7.45high · n=100$0.006793-1142%
Structured Data & Fact ExtractionGeneric TOC Extraction0.00low · n=0$0.003480no
Structured Data & Fact Extractionregion_identification autogenerated9.10high · n=27$0.0066992%
Structured Data & Fact ExtractionS1 TOC extraction0.00low · n=0$0.012206no
Structured Data & Fact Extractionstructured_output_extraction autogenerated9.43high · n=89$0.008549-179%
Financial Analysis & Trading Decisionsonboarding_prospect_analysis autogenerated0.00low · n=0$0.008302no
Financial Analysis & Trading DecisionsSEC Filling Analysis7.61high · n=100$0.01885874%no
Financial Analysis & Trading Decisionssec-s1-chunk-analysis autogenerated8.04medium · n=96$0.04049065%no
Financial Analysis & Trading Decisionssynthesis_analysis autogenerated8.28high · n=34$0.066482-2%no
Financial Analysis & Trading DecisionsTrading Recommendation7.70medium · n=82$0.07029449%no
Infrastructure & Utilityclaim_refinement autogenerated6.76medium · n=100$0.007247-249%no
Infrastructure & Utilityimage_prompt_generation autogenerated8.43ranked · n=100$0.00987163%
Infrastructure & UtilityLLM Prompt Adaptation8.73ranked · n=100$0.02286548%
Infrastructure & Utilitymarkdown_newline_repair autogenerated0.00low · n=0$0.083560no
Infrastructure & Utilitymetadata_paragraph_improvement autogenerated7.89high · n=98$0.001879-131%no
Infrastructure & Utilityonboarding_chapter_prompt_generation autogenerated8.52high · n=79$0.10586276%no
Infrastructure & UtilityTranslation7.79high · n=100$0.00882472%no
Long-form Content Generationauthor_soul_generation autogenerated9.16ranked · n=97$0.01764785%
Long-form Content GenerationClaim-Referenced Analyst Writing (pooled)8.73high · n=66$0.04251519%
Long-form Content Generationonboarding_chapter_generation autogenerated0.00low · n=0$0.046514no
Long-form Content Generationsection_generation autogenerated8.81high · n=7$0.009403-95%
Long-form Content GenerationSubstack Newsletter (pooled)9.20low · n=4$0.00820685%no
Long-form Content Generationtheme_generation autogenerated0.00low · n=0$0.026257no
Relevance, Classification & Matchingat_content_domain_suggest autogenerated0.00low · n=0$0.008597no
Relevance, Classification & MatchingAuthor Matching8.94ranked · n=100$0.024626best
Relevance, Classification & Matchingauthor_living_check autogenerated9.31ranked · n=71$0.008554best
Relevance, Classification & MatchingLanguage Detection9.95high · n=89$0.000616-450%
Relevance, Classification & Matchingsubreddit_selection autogenerated4.87high · n=67$0.01062650%no
Relevance, Classification & Matchingsubreddit_vetting autogenerated9.30low · n=1$0.005436no
Relevance, Classification & Matchingtopic_client_matching autogenerated8.14ranked · n=100$0.038350-1143%
Relevance, Classification & Matchingvetted_site_selection autogenerated0.00low · n=0$0.052308no
Relevance, Classification & Matchingx_post_selection autogenerated8.04ranked · n=100$0.00534567%no
Social & Promotional Contentactivity_promo_generation autogenerated8.58high · n=67$0.00295174%
Social & Promotional Contentauto_reddit_post_generation autogenerated7.51high · n=100$0.0492582%no
Social & Promotional ContentSocial Post Promo (pooled)8.23high · n=96$0.004962-3549%
Social & Promotional Contentx_com_messages_for_promotion autogenerated9.60low · n=1$0.088194no
Content Summarization & Synthesiscontent-summarization autogenerated7.51high · n=100$0.016344-696%no
Content Summarization & Synthesisexecutive_summary_generation autogenerated9.61low · n=3$0.077440no
Content Summarization & Synthesissynthesis_of_titles_for_publication autogenerated8.07high · n=73$0.011301-13%no
Topic Organization & Clusteringps_section_reassignment autogenerated0.00low · n=0$0.010170no
Topic Organization & ClusteringTopic Discovery Clustering (pooled)7.42high · n=100$0.14967751%no
Topic Organization & Clusteringtopic_cluster_naming autogenerated8.27ranked · n=77$0.02149239%
Topic Organization & Clusteringtopic_clustering_assign_sections autogenerated7.99high · n=100$0.00418837%no
Topic Organization & Clusteringtopic_sequence_determination autogenerated0.00low · n=0$0.013417no