Cost mode:

At a glance

Good enough on 12/35 tasks at the 95% bar. Cheapest qualifier on 1 task. Doesn't qualify on any: Financial Analysis & Trading Decisions, Long-form Content Generation.

Provider
Gemini
Model name
gemini-3.1-pro-preview
Qualifies on
12 / 35 tasks (at 90% bar)
Cheapest qualifier on
1 tasks

Cost vs quality across all tasks

$0.00045$0.00222$0.01097$0.05416$0.267410246810Quality score (0–10)Blended cost / callactivity_promo_generation autogenerated — quality 8.33, cost $0.005024at_content_domain_suggest autogenerated — quality 0.00, cost $0.006101Author Matching — quality 8.59, cost $0.014381author_living_check autogenerated — quality 8.87, cost $0.020280author_soul_generation autogenerated — quality 8.25, cost $0.057132auto_reddit_post_generation autogenerated — quality 7.58, cost $0.145234claim_extraction autogenerated — quality 6.42, cost $0.006320claim_refinement autogenerated — quality 6.37, cost $0.006397Claim-Referenced Analyst Writing (pooled) — quality 8.01, cost $0.038590content-summarization autogenerated — quality 5.22, cost $0.019861Direct Browse Content Synthesis — quality 0.00, cost $0.005546executive_summary_generation autogenerated — quality 4.50, cost $0.080386Generic TOC Extraction — quality 0.00, cost $0.004000image_prompt_generation autogenerated — quality 7.61, cost $0.012401Language Detection — quality 10.01, cost $0.000450LLM Prompt Adaptation — quality 8.63, cost $0.035037markdown_newline_repair autogenerated — quality 0.00, cost $0.267414metadata_paragraph_improvement autogenerated — quality 7.42, cost $0.002504onboarding_chapter_generation autogenerated — quality 0.00, cost $0.076924onboarding_chapter_prompt_generation autogenerated — quality 7.55, cost $0.173588onboarding_prospect_analysis autogenerated — quality 0.00, cost $0.010655ps_section_reassignment autogenerated — quality 0.00, cost $0.019362query_generation autogenerated — quality 8.67, cost $0.009015query_validation autogenerated — quality 6.79, cost $0.004119region_identification autogenerated — quality 7.62, cost $0.009862Relevance Scoring (POST) — quality 5.69, cost $0.004000 (anchor)Relevance Scoring (Topic Report) — quality 8.03, cost $0.004000Relevance Scoring (X Post) — quality 5.91, cost $0.004000S1 TOC extraction — quality 0.00, cost $0.015859SEC Filling Analysis — quality 7.72, cost $0.029078sec-s1-chunk-analysis autogenerated — quality 7.23, cost $0.046014section_generation autogenerated — quality 7.93, cost $0.014864Social Post Promo (pooled) — quality 7.46, cost $0.007720structured_output_extraction autogenerated — quality 9.63, cost $0.018874subreddit_selection autogenerated — quality 5.18, cost $0.008503subreddit_vetting autogenerated — quality 0.00, cost $0.013984Substack Newsletter (pooled) — quality 8.77, cost $0.026200synthesis_analysis autogenerated — quality 7.28, cost $0.045304synthesis_of_titles_for_publication autogenerated — quality 8.25, cost $0.013859theme_generation autogenerated — quality 0.00, cost $0.023098Topic Discovery Clustering (pooled) — quality 7.25, cost $0.134085topic_client_matching autogenerated — quality 7.88, cost $0.043936topic_cluster_naming autogenerated — quality 7.56, cost $0.013996topic_clustering_assign_sections autogenerated — quality 8.61, cost $0.002663topic_sequence_determination autogenerated — quality 0.00, cost $0.015751Trading Recommendation — quality 7.09, cost $0.109451Translation — quality 8.02, cost $0.012715vetted_site_selection autogenerated — quality 0.00, cost $0.051790x_com_messages_for_promotion autogenerated — quality 8.10, cost $0.069070x_post_selection autogenerated — quality 8.31, cost $0.006560

qualifies at 90% bar · doesn't qualify · ★ this model is the best on that task. Lower + further right = cheaper + higher quality. Y-axis is log-scaled.

Per-task breakdown

CategoryTaskQualityConfidenceCost / callvs bestQualifies @ 90%
Structured Data & Fact Extractionclaim_extraction autogenerated6.42high · n=100$0.006320-1055%no
Structured Data & Fact ExtractionGeneric TOC Extraction0.00low · n=0$0.004000no
Structured Data & Fact Extractionregion_identification autogenerated7.62medium · n=25$0.009862-45%no
Structured Data & Fact ExtractionS1 TOC extraction0.00low · n=0$0.015859no
Structured Data & Fact Extractionstructured_output_extraction autogenerated9.63high · n=90$0.018874-515%
Financial Analysis & Trading Decisionsonboarding_prospect_analysis autogenerated0.00low · n=0$0.010655no
Financial Analysis & Trading DecisionsSEC Filling Analysis7.72ranked · n=100$0.02907860%no
Financial Analysis & Trading Decisionssec-s1-chunk-analysis autogenerated7.23high · n=83$0.04601460%no
Financial Analysis & Trading Decisionssynthesis_analysis autogenerated7.28medium · n=28$0.04530431%no
Financial Analysis & Trading DecisionsTrading Recommendation7.09high · n=78$0.10945121%no
Infrastructure & Utilityclaim_refinement autogenerated6.37high · n=100$0.006397-208%no
Infrastructure & Utilityimage_prompt_generation autogenerated7.61ranked · n=100$0.01240154%no
Infrastructure & UtilityLLM Prompt Adaptation8.63ranked · n=100$0.03503721%no
Infrastructure & Utilitymarkdown_newline_repair autogenerated0.00low · n=0$0.267414no
Infrastructure & Utilitymetadata_paragraph_improvement autogenerated7.42high · n=93$0.002504-208%no
Infrastructure & Utilityonboarding_chapter_prompt_generation autogenerated7.55ranked · n=82$0.17358860%no
Infrastructure & Utilityquery_generation autogenerated8.67ranked · n=71$0.00901560%
Infrastructure & Utilityquery_validation autogenerated6.79medium · n=100$0.004119-2337%no
Infrastructure & UtilityTranslation8.02ranked · n=92$0.01271560%
Long-form Content Generationauthor_soul_generation autogenerated8.25ranked · n=89$0.05713252%no
Long-form Content GenerationClaim-Referenced Analyst Writing (pooled)8.01ranked · n=72$0.03859027%no
Long-form Content Generationonboarding_chapter_generation autogenerated0.00low · n=0$0.076924no
Long-form Content Generationsection_generation autogenerated7.93high · n=5$0.014864-208%no
Long-form Content GenerationSubstack Newsletter (pooled)8.77high · n=5$0.02620052%no
Long-form Content Generationtheme_generation autogenerated0.00low · n=0$0.023098no
Relevance, Classification & Matchingat_content_domain_suggest autogenerated0.00low · n=0$0.006101no
Relevance, Classification & MatchingAuthor Matching8.59ranked · n=100$0.01438142%
Relevance, Classification & Matchingauthor_living_check autogenerated8.87ranked · n=67$0.020280-137%
Relevance, Classification & MatchingLanguage Detection10.01ranked · n=84$0.000450-302%
Relevance, Classification & MatchingRelevance Scoring (POST)5.69high · n=100$0.004000best
Relevance, Classification & MatchingRelevance Scoring (Topic Report)8.03low · n=43$0.004000-2400%no
Relevance, Classification & MatchingRelevance Scoring (X Post)5.91low · n=7$0.004000no
Relevance, Classification & Matchingsubreddit_selection autogenerated5.18ranked · n=60$0.00850360%no
Relevance, Classification & Matchingsubreddit_vetting autogenerated0.00low · n=0$0.013984no
Relevance, Classification & Matchingtopic_client_matching autogenerated7.88ranked · n=100$0.043936-1324%
Relevance, Classification & Matchingvetted_site_selection autogenerated0.00low · n=0$0.051790no
Relevance, Classification & Matchingx_post_selection autogenerated8.31ranked · n=100$0.00656060%
Social & Promotional Contentactivity_promo_generation autogenerated8.33ranked · n=61$0.00502456%
Social & Promotional Contentauto_reddit_post_generation autogenerated7.58ranked · n=100$0.145234-190%no
Social & Promotional ContentSocial Post Promo (pooled)7.46ranked · n=97$0.007720-5576%no
Social & Promotional Contentx_com_messages_for_promotion autogenerated8.10low · n=1$0.069070no
Content Summarization & Synthesiscontent-summarization autogenerated5.22ranked · n=100$0.019861-868%no
Content Summarization & SynthesisDirect Browse Content Synthesis0.00low · n=0$0.005546no
Content Summarization & Synthesisexecutive_summary_generation autogenerated4.50low · n=1$0.080386no
Content Summarization & Synthesissynthesis_of_titles_for_publication autogenerated8.25ranked · n=69$0.013859-38%
Topic Organization & Clusteringps_section_reassignment autogenerated0.00low · n=0$0.019362no
Topic Organization & ClusteringTopic Discovery Clustering (pooled)7.25high · n=100$0.13408556%no
Topic Organization & Clusteringtopic_cluster_naming autogenerated7.56ranked · n=92$0.01399660%no
Topic Organization & Clusteringtopic_clustering_assign_sections autogenerated8.61high · n=99$0.00266360%
Topic Organization & Clusteringtopic_sequence_determination autogenerated0.00low · n=0$0.015751no