Cost mode:

At a glance

Good enough on 9/35 tasks at the 95% bar. Cheapest qualifier on 9 tasks. Doesn't qualify on any: Financial Analysis & Trading Decisions, Content Summarization & Synthesis, Long-form Content Generation.

Provider
Alibaba Cloud (DashScope)
Model name
qwen3.5-flash
Qualifies on
9 / 35 tasks (at 90% bar)
Cheapest qualifier on
9 tasks

Cost vs quality across all tasks

$0.00009$0.00026$0.00080$0.00244$0.007460246810Quality score (0–10)Blended cost / callactivity_promo_generation autogenerated — quality 8.46, cost $0.000092at_content_domain_suggest autogenerated — quality 0.00, cost $0.000206Author Matching — quality 7.04, cost $0.000436author_living_check autogenerated — quality 7.63, cost $0.000409author_soul_generation autogenerated — quality 8.41, cost $0.001225auto_reddit_post_generation autogenerated — quality 6.77, cost $0.003065claim_extraction autogenerated — quality 6.91, cost $0.000238claim_refinement autogenerated — quality 7.28, cost $0.000237Claim-Referenced Analyst Writing (pooled) — quality 7.19, cost $0.001441content-summarization autogenerated — quality 7.45, cost $0.000805executive_summary_generation autogenerated — quality 6.49, cost $0.003129Generic TOC Extraction — quality 0.00, cost $0.000160image_prompt_generation autogenerated — quality 8.30, cost $0.000507LLM Prompt Adaptation — quality 8.45, cost $0.001489markdown_newline_repair autogenerated — quality 0.00, cost $0.005725metadata_paragraph_improvement autogenerated — quality 7.78, cost $0.000104onboarding_chapter_generation autogenerated — quality 0.00, cost $0.003312onboarding_chapter_prompt_generation autogenerated — quality 7.34, cost $0.007463onboarding_prospect_analysis autogenerated — quality 0.00, cost $0.000437ps_section_reassignment autogenerated — quality 0.00, cost $0.000367query_generation autogenerated — quality 7.89, cost $0.000345query_validation autogenerated — quality 8.63, cost $0.000169 (anchor)Relevance Scoring (POST) — quality 4.49, cost $0.000160Relevance Scoring (Topic Report) — quality 8.48, cost $0.000160 (anchor)Relevance Scoring (X Post) — quality 7.40, cost $0.000160S1 TOC extraction — quality 0.00, cost $0.000653SEC Filling Analysis — quality 7.18, cost $0.001237sec-s1-chunk-analysis autogenerated — quality 8.07, cost $0.001835section_generation autogenerated — quality 8.48, cost $0.000635Social Post Promo (pooled) — quality 8.35, cost $0.000136 (anchor)structured_output_extraction autogenerated — quality 9.78, cost $0.000374subreddit_vetting autogenerated — quality 0.00, cost $0.000287Substack Newsletter (pooled) — quality 8.68, cost $0.000561synthesis_analysis autogenerated — quality 7.66, cost $0.001501synthesis_of_titles_for_publication autogenerated — quality 7.85, cost $0.000563theme_generation autogenerated — quality 0.00, cost $0.000853Topic Discovery Clustering (pooled) — quality 7.21, cost $0.004984topic_cluster_naming autogenerated — quality 7.54, cost $0.000453topic_clustering_assign_sections autogenerated — quality 8.65, cost $0.000085topic_sequence_determination autogenerated — quality 0.00, cost $0.000633Trading Recommendation — quality 6.46, cost $0.004665Translation — quality 7.94, cost $0.000534vetted_site_selection autogenerated — quality 7.07, cost $0.001988x_post_selection autogenerated — quality 8.38, cost $0.000103

qualifies at 90% bar · doesn't qualify · ★ this model is the best on that task. Lower + further right = cheaper + higher quality. Y-axis is log-scaled.

Per-task breakdown

CategoryTaskQualityConfidenceCost / callvs bestQualifies @ 90%
Structured Data & Fact Extractionclaim_extraction autogenerated6.91ranked · n=100$0.00023856%no
Structured Data & Fact ExtractionGeneric TOC Extraction0.00low · n=0$0.000160no
Structured Data & Fact ExtractionS1 TOC extraction0.00low · n=0$0.000653no
Structured Data & Fact Extractionstructured_output_extraction autogenerated9.78high · n=100$0.00037488%
Financial Analysis & Trading Decisionsonboarding_prospect_analysis autogenerated0.00low · n=0$0.000437no
Financial Analysis & Trading DecisionsSEC Filling Analysis7.18medium · n=100$0.00123798%no
Financial Analysis & Trading Decisionssec-s1-chunk-analysis autogenerated8.07ranked · n=100$0.00183598%no
Financial Analysis & Trading Decisionssynthesis_analysis autogenerated7.66high · n=40$0.00150198%no
Financial Analysis & Trading DecisionsTrading Recommendation6.46medium · n=100$0.00466597%no
Infrastructure & Utilityclaim_refinement autogenerated7.28high · n=100$0.00023789%no
Infrastructure & Utilityimage_prompt_generation autogenerated8.30ranked · n=100$0.00050798%
Infrastructure & UtilityLLM Prompt Adaptation8.45ranked · n=100$0.00148997%no
Infrastructure & Utilitymarkdown_newline_repair autogenerated0.00low · n=0$0.005725no
Infrastructure & Utilitymetadata_paragraph_improvement autogenerated7.78high · n=100$0.00010487%no
Infrastructure & Utilityonboarding_chapter_prompt_generation autogenerated7.34high · n=100$0.00746398%no
Infrastructure & Utilityquery_generation autogenerated7.89high · n=95$0.00034598%no
Infrastructure & Utilityquery_validation autogenerated8.63medium · n=100$0.000169best
Infrastructure & UtilityTranslation7.94ranked · n=100$0.00053498%
Long-form Content Generationauthor_soul_generation autogenerated8.41ranked · n=100$0.00122599%no
Long-form Content GenerationClaim-Referenced Analyst Writing (pooled)7.19medium · n=91$0.00144197%no
Long-form Content Generationonboarding_chapter_generation autogenerated0.00low · n=0$0.003312no
Long-form Content Generationsection_generation autogenerated8.48medium · n=8$0.00063587%no
Long-form Content GenerationSubstack Newsletter (pooled)8.68low · n=6$0.00056199%no
Long-form Content Generationtheme_generation autogenerated0.00low · n=0$0.000853no
Relevance, Classification & Matchingat_content_domain_suggest autogenerated0.00low · n=0$0.000206no
Relevance, Classification & MatchingAuthor Matching7.04high · n=100$0.00043698%no
Relevance, Classification & Matchingauthor_living_check autogenerated7.63medium · n=92$0.00040995%no
Relevance, Classification & MatchingRelevance Scoring (POST)4.49high · n=100$0.00016096%no
Relevance, Classification & MatchingRelevance Scoring (Topic Report)8.48high · n=60$0.000160best
Relevance, Classification & MatchingRelevance Scoring (X Post)7.40low · n=14$0.000160no
Relevance, Classification & Matchingsubreddit_vetting autogenerated0.00low · n=0$0.000287no
Relevance, Classification & Matchingvetted_site_selection autogenerated7.07low · n=2$0.001988no
Relevance, Classification & Matchingx_post_selection autogenerated8.38ranked · n=100$0.00010399%
Social & Promotional Contentactivity_promo_generation autogenerated8.46high · n=85$0.00009299%
Social & Promotional Contentauto_reddit_post_generation autogenerated6.77high · n=100$0.00306594%no
Social & Promotional ContentSocial Post Promo (pooled)8.35ranked · n=100$0.000136best
Content Summarization & Synthesiscontent-summarization autogenerated7.45high · n=100$0.00080561%no
Content Summarization & Synthesisexecutive_summary_generation autogenerated6.49low · n=2$0.003129no
Content Summarization & Synthesissynthesis_of_titles_for_publication autogenerated7.85high · n=93$0.00056394%no
Topic Organization & Clusteringps_section_reassignment autogenerated0.00low · n=0$0.000367no
Topic Organization & ClusteringTopic Discovery Clustering (pooled)7.21medium · n=100$0.00498498%no
Topic Organization & Clusteringtopic_cluster_naming autogenerated7.54high · n=100$0.00045399%no
Topic Organization & Clusteringtopic_clustering_assign_sections autogenerated8.65ranked · n=100$0.00008599%
Topic Organization & Clusteringtopic_sequence_determination autogenerated0.00low · n=0$0.000633no