
At a glance

Good enough on 14/35 tasks at the 90% bar. Cheapest qualifier on 7 tasks. Doesn't qualify on any task in: Content Summarization & Synthesis.

Provider: Alibaba Cloud (DashScope)
Model name: qwen3.6-plus
Qualifies on: 14 / 35 tasks (at 90% bar)
Cheapest qualifier on: 7 tasks
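
The page does not spell out how "qualifies at the 90% bar" or "cheapest qualifier" is computed. The following is a minimal sketch under one plausible reading: a model qualifies on a task when its quality is at least 90% of the best quality any model reached on that task, and the cheapest qualifier is the qualifying model with the lowest cost per call. The `Result` type, the function names, the model names, and all numbers are hypothetical and purely illustrative; they are not taken from this report.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    """One model's outcome on a single task (hypothetical structure)."""
    model: str
    quality: float        # quality score on a 0-10 scale
    cost_per_call: float  # blended cost per call, in USD


def qualifiers(results, bar=0.90):
    """Return results whose quality is at least `bar` times the best quality.

    Assumption: the "bar" is relative to the best score on the task.
    """
    best = max(r.quality for r in results)
    return [r for r in results if r.quality >= bar * best]


def cheapest_qualifier(results, bar=0.90):
    """Return the qualifying result with the lowest cost per call."""
    return min(qualifiers(results, bar), key=lambda r: r.cost_per_call)


# Illustrative, made-up numbers for one task.
results = [
    Result("model_a", 9.0, 0.0200),
    Result("model_b", 8.4, 0.0011),
    Result("model_c", 7.9, 0.0005),
]

print([r.model for r in qualifiers(results)])
print(cheapest_qualifier(results).model)
```

With these made-up inputs the threshold is 0.90 × 9.0 = 8.1, so `model_a` and `model_b` qualify and `model_b` is the cheapest qualifier, even though `model_c` is cheaper overall.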

Cost vs quality across all tasks

Figure: scatter plot with one point per task, showing quality score (0–10) on the X-axis against blended cost / call on the Y-axis (ticks at $0.00081, $0.00235, $0.00678, $0.01955, $0.05642). Points labeled "(anchor)": claim_refinement, metadata_paragraph_improvement, section_generation, structured_output_extraction. Quality and cost for every task are listed under "Per-task breakdown".

Legend: qualifies at 90% bar · doesn't qualify · ★ this model is the best on that task. Lower and further right means cheaper and higher quality. The Y-axis is log-scaled.
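
On a log-scaled axis, evenly spaced ticks differ by a constant ratio rather than a constant step. As a rough sketch, assuming the cost axis carries five ticks spanning the lowest and highest per-call costs in the per-task breakdown ($0.000814 and $0.056416), the tick values can be computed as follows. The function name and the choice of five ticks are assumptions for illustration, not a description of how the chart was actually rendered.

```python
def log_spaced_ticks(lo, hi, count):
    """Return `count` values from lo to hi with a constant ratio between neighbors."""
    if lo <= 0 or hi <= lo or count < 2:
        raise ValueError("need 0 < lo < hi and at least two ticks")
    ratio = (hi / lo) ** (1 / (count - 1))
    return [lo * ratio ** i for i in range(count)]


# Lowest and highest cost per call from the per-task breakdown.
ticks = log_spaced_ticks(0.000814, 0.056416, 5)
labels = [f"${t:.5f}" for t in ticks]
print(labels)
```

Under these assumptions the labels come out as $0.00081, $0.00235, $0.00678, $0.01955 and $0.05642, each roughly 2.9 times the previous one.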

Per-task breakdown

| Category | Task | Quality | Confidence | Cost / call | vs best | Qualifies @ 90% |
|---|---|---|---|---|---|---|
| Structured Data & Fact Extraction | claim_extraction (autogenerated) | 6.42 | high · n=100 | $0.002054 | -276% | no |
| Structured Data & Fact Extraction | Generic TOC Extraction | 0.00 | low · n=0 | $0.001300 | | no |
| Structured Data & Fact Extraction | S1 TOC extraction | 0.00 | low · n=0 | $0.005154 | | no |
| Structured Data & Fact Extraction | structured_output_extraction (autogenerated) | 9.84 | ranked · n=100 | $0.003067 | best | yes |
| Financial Analysis & Trading Decisions | onboarding_prospect_analysis (autogenerated) | 0.00 | low · n=0 | $0.003463 | | no |
| Financial Analysis & Trading Decisions | SEC Filling Analysis | 7.74 | high · n=100 | $0.009450 | 87% | no |
| Financial Analysis & Trading Decisions | sec-s1-chunk-analysis (autogenerated) | 8.15 | ranked · n=100 | $0.014955 | 87% | no |
| Financial Analysis & Trading Decisions | synthesis_analysis (autogenerated) | 8.79 | ranked · n=40 | $0.014724 | 77% | yes |
| Financial Analysis & Trading Decisions | Trading Recommendation | 8.22 | medium · n=100 | $0.035572 | 74% | no |
| Infrastructure & Utility | claim_refinement (autogenerated) | 8.08 | high · n=100 | $0.002079 | best | yes |
| Infrastructure & Utility | image_prompt_generation (autogenerated) | 8.34 | ranked · n=100 | $0.004030 | 85% | yes |
| Infrastructure & Utility | LLM Prompt Adaptation | 8.89 | ranked · n=100 | $0.011387 | 74% | yes |
| Infrastructure & Utility | markdown_newline_repair (autogenerated) | 1.50 | low · n=1 | $0.043455 | | no |
| Infrastructure & Utility | metadata_paragraph_improvement (autogenerated) | 8.58 | ranked · n=100 | $0.000814 | best | yes |
| Infrastructure & Utility | onboarding_chapter_prompt_generation (autogenerated) | 8.27 | ranked · n=100 | $0.056416 | 87% | no |
| Infrastructure & Utility | query_generation (autogenerated) | 8.04 | high · n=95 | $0.002930 | 87% | no |
| Infrastructure & Utility | query_validation (autogenerated) | 7.67 | high · n=100 | $0.001339 | -692% | no |
| Infrastructure & Utility | Translation | 7.76 | high · n=100 | $0.004132 | 87% | no |
| Long-form Content Generation | author_soul_generation (autogenerated) | 8.70 | ranked · n=100 | $0.009284 | 92% | no |
| Long-form Content Generation | Claim-Referenced Analyst Writing (pooled) | 8.53 | high · n=59 | $0.012542 | 76% | yes |
| Long-form Content Generation | onboarding_chapter_generation (autogenerated) | 0.00 | low · n=0 | $0.025000 | | no |
| Long-form Content Generation | section_generation (autogenerated) | 9.09 | ranked · n=8 | $0.004831 | best | yes |
| Long-form Content Generation | Substack Newsletter (pooled) | 9.09 | ranked · n=6 | $0.004258 | 92% | yes |
| Long-form Content Generation | theme_generation (autogenerated) | 0.00 | low · n=0 | $0.007507 | | no |
| Relevance, Classification & Matching | at_content_domain_suggest (autogenerated) | 0.00 | low · n=0 | $0.001983 | | no |
| Relevance, Classification & Matching | Author Matching | 7.81 | ranked · n=100 | $0.004674 | 81% | no |
| Relevance, Classification & Matching | author_living_check (autogenerated) | 8.52 | high · n=92 | $0.003296 | 61% | no |
| Relevance, Classification & Matching | Relevance Scoring (POST) | 4.73 | high · n=100 | $0.001300 | 68% | no |
| Relevance, Classification & Matching | Relevance Scoring (Topic Report) | 7.91 | medium · n=60 | $0.001300 | -712% | no |
| Relevance, Classification & Matching | Relevance Scoring (X Post) | 7.08 | low · n=14 | $0.001300 | | no |
| Relevance, Classification & Matching | subreddit_selection (autogenerated) | 4.56 | ranked · n=84 | $0.002763 | 87% | no |
| Relevance, Classification & Matching | subreddit_vetting (autogenerated) | 0.00 | low · n=0 | $0.002272 | | no |
| Relevance, Classification & Matching | topic_client_matching (autogenerated) | 7.67 | ranked · n=100 | $0.014279 | -363% | no |
| Relevance, Classification & Matching | vetted_site_selection (autogenerated) | 0.00 | low · n=0 | $0.016832 | | no |
| Relevance, Classification & Matching | x_post_selection (autogenerated) | 8.37 | ranked · n=100 | $0.001066 | 94% | yes |
| Social & Promotional Content | activity_promo_generation (autogenerated) | 8.48 | ranked · n=85 | $0.000816 | 93% | yes |
| Social & Promotional Content | auto_reddit_post_generation (autogenerated) | 7.57 | medium · n=100 | $0.023601 | 53% | no |
| Social & Promotional Content | Social Post Promo (pooled) | 8.20 | ranked · n=100 | $0.001254 | -822% | yes |
| Content Summarization & Synthesis | executive_summary_generation (autogenerated) | 7.50 | low · n=1 | $0.026125 | | no |
| Content Summarization & Synthesis | synthesis_of_titles_for_publication (autogenerated) | 8.06 | ranked · n=93 | $0.004504 | 55% | no |
| Topic Organization & Clustering | ps_section_reassignment (autogenerated) | 0.00 | low · n=0 | $0.003146 | | no |
| Topic Organization & Clustering | Topic Discovery Clustering (pooled) | 7.46 | high · n=100 | $0.043578 | 86% | no |
| Topic Organization & Clustering | topic_cluster_naming (autogenerated) | 8.20 | ranked · n=100 | $0.004549 | 87% | yes |
| Topic Organization & Clustering | topic_clustering_assign_sections (autogenerated) | 8.73 | high · n=100 | $0.000865 | 87% | yes |
| Topic Organization & Clustering | topic_sequence_determination (autogenerated) | 0.00 | low · n=0 | $0.005119 | | no |