At a glance

Good enough on 10/35 tasks at the 90% bar. Cheapest qualifier on 5 tasks. Doesn't qualify on any task in: Financial Analysis & Trading Decisions, Content Summarization & Synthesis, Social & Promotional Content.

Provider: DeepSeek
Model name: deepseek-v4-flash
Qualifies on: 10 / 35 tasks (at 90% bar)
Cheapest qualifier on: 5 tasks

Cost vs quality across all tasks

[Figure: scatter plot with one point per task. X-axis: quality score (0–10). Y-axis: blended cost / call, from $0.00005 to $0.01204. claim_extraction and topic_client_matching are marked as anchors. Per-task quality and cost values are the same as in the per-task breakdown.]

Legend: qualifies at 90% bar · doesn't qualify · ★ this model is the best on that task. Lower and further right means cheaper and higher quality. The y-axis is log-scaled.

Per-task breakdown

| Category | Task | Quality | Confidence | Cost / call | vs best | Qualifies @ 90% |
|---|---|---|---|---|---|---|
| Structured Data & Fact Extraction | claim_extraction autogenerated | 7.46 | high · n=100 | $0.000547 | best | yes |
| Structured Data & Fact Extraction | Generic TOC Extraction | 0.00 | low · n=0 | $0.000280 | | no |
| Structured Data & Fact Extraction | region_identification autogenerated | 8.51 | low · n=27 | $0.000539 | 92% | no |
| Structured Data & Fact Extraction | S1 TOC extraction | 0.00 | low · n=0 | $0.000982 | | no |
| Structured Data & Fact Extraction | structured_output_extraction autogenerated | 9.75 | ranked · n=88 | $0.000688 | 78% | yes |
| Financial Analysis & Trading Decisions | onboarding_prospect_analysis autogenerated | 0.00 | low · n=0 | $0.000668 | | no |
| Financial Analysis & Trading Decisions | SEC Filling Analysis | 7.68 | medium · n=100 | $0.001517 | 98% | no |
| Financial Analysis & Trading Decisions | sec-s1-chunk-analysis autogenerated | 8.22 | high · n=94 | $0.003258 | 97% | no |
| Financial Analysis & Trading Decisions | synthesis_analysis autogenerated | 7.61 | medium · n=34 | $0.005349 | 92% | no |
| Infrastructure & Utility | claim_refinement autogenerated | 7.01 | high · n=100 | $0.000583 | 72% | no |
| Infrastructure & Utility | image_prompt_generation autogenerated | 8.35 | ranked · n=100 | $0.000794 | 97% | yes |
| Infrastructure & Utility | LLM Prompt Adaptation | 8.52 | high · n=100 | $0.001840 | 96% | no |
| Infrastructure & Utility | markdown_newline_repair autogenerated | 0.00 | low · n=0 | $0.006723 | | no |
| Infrastructure & Utility | metadata_paragraph_improvement autogenerated | 6.79 | medium · n=98 | $0.000151 | 81% | no |
| Infrastructure & Utility | onboarding_chapter_prompt_generation autogenerated | 8.00 | high · n=79 | $0.008518 | 98% | no |
| Infrastructure & Utility | query_generation autogenerated | 6.25 | medium · n=69 | $0.000744 | 97% | no |
| Infrastructure & Utility | query_validation autogenerated | 8.11 | medium · n=100 | $0.000260 | -54% | no |
| Infrastructure & Utility | Translation | 7.67 | high · n=100 | $0.000710 | 98% | no |
| Long-form Content Generation | author_soul_generation autogenerated | 9.26 | ranked · n=97 | $0.001420 | 99% | yes |
| Long-form Content Generation | Claim-Referenced Analyst Writing (pooled) | 7.83 | medium · n=44 | $0.003421 | 93% | no |
| Long-form Content Generation | onboarding_chapter_generation autogenerated | 0.00 | low · n=0 | $0.003742 | | no |
| Long-form Content Generation | section_generation autogenerated | 7.13 | medium · n=7 | $0.000757 | 84% | no |
| Long-form Content Generation | Substack Newsletter (pooled) | 8.49 | low · n=4 | $0.000660 | 99% | no |
| Long-form Content Generation | theme_generation autogenerated | 0.00 | low · n=0 | $0.002113 | | no |
| Relevance, Classification & Matching | at_content_domain_suggest autogenerated | 0.00 | low · n=0 | $0.000692 | | no |
| Relevance, Classification & Matching | Author Matching | 8.63 | ranked · n=100 | $0.001981 | 92% | yes |
| Relevance, Classification & Matching | author_living_check autogenerated | 8.87 | high · n=76 | $0.000688 | 92% | yes |
| Relevance, Classification & Matching | Language Detection | 9.94 | high · n=91 | $0.000050 | 55% | yes |
| Relevance, Classification & Matching | subreddit_selection autogenerated | 4.73 | ranked · n=67 | $0.000855 | 96% | no |
| Relevance, Classification & Matching | subreddit_vetting autogenerated | 0.00 | low · n=0 | $0.000437 | | no |
| Relevance, Classification & Matching | topic_client_matching autogenerated | 8.24 | ranked · n=100 | $0.003086 | best | yes |
| Relevance, Classification & Matching | vetted_site_selection autogenerated | 5.50 | low · n=1 | $0.004209 | | no |
| Relevance, Classification & Matching | x_post_selection autogenerated | 8.43 | ranked · n=100 | $0.000430 | 97% | yes |
| Social & Promotional Content | auto_reddit_post_generation autogenerated | 7.04 | high · n=100 | $0.003963 | 92% | no |
| Social & Promotional Content | Social Post Promo (pooled) | 7.74 | high · n=96 | $0.000399 | -193% | no |
| Social & Promotional Content | x_com_messages_for_promotion autogenerated | 0.00 | low · n=0 | $0.007096 | | no |
| Content Summarization & Synthesis | content-summarization autogenerated | 7.12 | high · n=100 | $0.001315 | 36% | no |
| Content Summarization & Synthesis | executive_summary_generation autogenerated | 6.16 | low · n=2 | $0.006231 | | no |
| Content Summarization & Synthesis | synthesis_of_titles_for_publication autogenerated | 8.01 | high · n=61 | $0.000909 | 91% | no |
| Topic Organization & Clustering | ps_section_reassignment autogenerated | 0.00 | low · n=0 | $0.000818 | | no |
| Topic Organization & Clustering | Topic Discovery Clustering (pooled) | 4.11 | medium · n=100 | $0.012043 | 96% | no |
| Topic Organization & Clustering | topic_cluster_naming autogenerated | 7.67 | high · n=91 | $0.001729 | 95% | no |
| Topic Organization & Clustering | topic_clustering_assign_sections autogenerated | 8.40 | high · n=100 | $0.000337 | 95% | yes |
| Topic Organization & Clustering | topic_sequence_determination autogenerated | 0.00 | low · n=0 | $0.001080 | | no |
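
The headline figures in "At a glance" can be cross-checked against the breakdown with a short tally. The sketch below is illustrative only: the `QUALIFIERS` mapping and the `summarize` helper are hypothetical names, and it assumes that every row not marked `no` in the last column qualifies at the 90% bar.

```python
# Qualifying tasks per category, transcribed from the per-task breakdown.
# Assumption: a row that is not marked "no" qualifies at the 90% bar.
QUALIFIERS = {
    "Structured Data & Fact Extraction": [
        "claim_extraction",
        "structured_output_extraction",
    ],
    "Financial Analysis & Trading Decisions": [],
    "Infrastructure & Utility": ["image_prompt_generation"],
    "Long-form Content Generation": ["author_soul_generation"],
    "Relevance, Classification & Matching": [
        "Author Matching",
        "author_living_check",
        "Language Detection",
        "topic_client_matching",
        "x_post_selection",
    ],
    "Social & Promotional Content": [],
    "Content Summarization & Synthesis": [],
    "Topic Organization & Clustering": ["topic_clustering_assign_sections"],
}


def summarize(qualifiers):
    """Return (total qualifying tasks, categories with no qualifying task)."""
    total = sum(len(tasks) for tasks in qualifiers.values())
    empty = [category for category, tasks in qualifiers.items() if not tasks]
    return total, empty


total, empty = summarize(QUALIFIERS)
print(f"Qualifying tasks: {total}")
for category in empty:
    print(f"No qualifier in: {category}")
```

Under that assumption the tally gives 10 qualifying tasks and the same three categories with no qualifier that the summary names.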