Cost mode:

At a glance

Good enough on 5/35 tasks at the 95% bar. Cheapest qualifier on 1 task. Doesn't qualify on any: Financial Analysis & Trading Decisions, Structured Data & Fact Extraction, Content Summarization & Synthesis, Long-form Content Generation, Social & Promotional Content, Topic Organization & Clustering.

Provider
Gemini
Model name
gemini-3-flash-preview
Qualifies on
5 / 35 tasks (at 90% bar)
Cheapest qualifier on
1 tasks

Cost vs quality across all tasks

$0.00011$0.00055$0.00274$0.01353$0.066850246810Quality score (0–10)Blended cost / callactivity_promo_generation autogenerated — quality 8.16, cost $0.001256at_content_domain_suggest autogenerated — quality 0.00, cost $0.001525Author Matching — quality 8.34, cost $0.003595author_living_check autogenerated — quality 8.52, cost $0.005070author_soul_generation autogenerated — quality 8.26, cost $0.014283auto_reddit_post_generation autogenerated — quality 6.21, cost $0.036308claim_extraction autogenerated — quality 6.71, cost $0.001580claim_refinement autogenerated — quality 7.73, cost $0.001599Claim-Referenced Analyst Writing (pooled) — quality 6.45, cost $0.009648content-summarization autogenerated — quality 4.75, cost $0.004965Direct Browse Content Synthesis — quality 3.50, cost $0.001386executive_summary_generation autogenerated — quality 7.56, cost $0.020096Generic TOC Extraction — quality 0.00, cost $0.001000image_prompt_generation autogenerated — quality 7.60, cost $0.003100Language Detection — quality 10.01, cost $0.000112 (anchor)LLM Prompt Adaptation — quality 7.54, cost $0.008759markdown_newline_repair autogenerated — quality 0.00, cost $0.066854metadata_paragraph_improvement autogenerated — quality 7.71, cost $0.000626onboarding_chapter_generation autogenerated — quality 0.00, cost $0.019231onboarding_chapter_prompt_generation autogenerated — quality 6.12, cost $0.043397onboarding_prospect_analysis autogenerated — quality 0.00, cost $0.002664ps_section_reassignment autogenerated — quality 0.00, cost $0.004840query_generation autogenerated — quality 8.29, cost $0.002254query_validation autogenerated — quality 8.29, cost $0.001030region_identification autogenerated — quality 8.06, cost $0.002466S1 TOC extraction — quality 0.00, cost $0.003965SEC Filling Analysis — quality 7.02, cost $0.007270sec-s1-chunk-analysis autogenerated — quality 7.51, cost $0.011504section_generation autogenerated — quality 7.90, cost $0.003716Social Post Promo (pooled) — quality 7.42, cost $0.001930structured_output_extraction autogenerated — quality 8.46, cost $0.004718subreddit_selection autogenerated — quality 5.00, cost $0.002126subreddit_vetting autogenerated — quality 0.00, cost $0.003496Substack Newsletter (pooled) — quality 8.02, cost $0.006550synthesis_analysis autogenerated — quality 6.71, cost $0.011326synthesis_of_titles_for_publication autogenerated — quality 7.84, cost $0.003465theme_generation autogenerated — quality 0.00, cost $0.005774Topic Discovery Clustering (pooled) — quality 5.62, cost $0.033521topic_client_matching autogenerated — quality 7.38, cost $0.010984topic_cluster_naming autogenerated — quality 7.87, cost $0.003499topic_clustering_assign_sections autogenerated — quality 7.93, cost $0.000666topic_sequence_determination autogenerated — quality 0.00, cost $0.003938Trading Recommendation — quality 6.54, cost $0.027363Translation — quality 7.81, cost $0.003179vetted_site_selection autogenerated — quality 7.40, cost $0.012948x_com_messages_for_promotion autogenerated — quality 0.00, cost $0.017268x_post_selection autogenerated — quality 8.32, cost $0.001640

qualifies at 90% bar · doesn't qualify · ★ this model is the best on that task. Lower + further right = cheaper + higher quality. Y-axis is log-scaled.

Per-task breakdown

CategoryTaskQualityConfidenceCost / callvs bestQualifies @ 90%
Structured Data & Fact Extractionclaim_extraction autogenerated6.71high · n=100$0.001580-189%no
Structured Data & Fact ExtractionGeneric TOC Extraction0.00low · n=0$0.001000no
Structured Data & Fact Extractionregion_identification autogenerated8.06high · n=25$0.00246664%no
Structured Data & Fact ExtractionS1 TOC extraction0.00low · n=0$0.003965no
Structured Data & Fact Extractionstructured_output_extraction autogenerated8.46medium · n=88$0.004718-54%no
Financial Analysis & Trading Decisionsonboarding_prospect_analysis autogenerated0.00low · n=0$0.002664no
Financial Analysis & Trading DecisionsSEC Filling Analysis7.02ranked · n=100$0.00727090%no
Financial Analysis & Trading Decisionssec-s1-chunk-analysis autogenerated7.51high · n=83$0.01150490%no
Financial Analysis & Trading Decisionssynthesis_analysis autogenerated6.71medium · n=28$0.01132683%no
Financial Analysis & Trading DecisionsTrading Recommendation6.54ranked · n=78$0.02736380%no
Infrastructure & Utilityclaim_refinement autogenerated7.73ranked · n=100$0.00159923%
Infrastructure & Utilityimage_prompt_generation autogenerated7.60ranked · n=100$0.00310088%no
Infrastructure & UtilityLLM Prompt Adaptation7.54ranked · n=100$0.00875980%no
Infrastructure & Utilitymarkdown_newline_repair autogenerated0.00low · n=0$0.066854no
Infrastructure & Utilitymetadata_paragraph_improvement autogenerated7.71high · n=93$0.00062623%no
Infrastructure & Utilityonboarding_chapter_prompt_generation autogenerated6.12high · n=82$0.04339790%no
Infrastructure & Utilityquery_generation autogenerated8.29high · n=71$0.00225490%no
Infrastructure & Utilityquery_validation autogenerated8.29high · n=100$0.001030-509%
Infrastructure & UtilityTranslation7.81ranked · n=92$0.00317990%
Long-form Content Generationauthor_soul_generation autogenerated8.26ranked · n=89$0.01428388%no
Long-form Content GenerationClaim-Referenced Analyst Writing (pooled)6.45high · n=55$0.00964882%no
Long-form Content Generationonboarding_chapter_generation autogenerated0.00low · n=0$0.019231no
Long-form Content Generationsection_generation autogenerated7.90high · n=5$0.00371623%no
Long-form Content GenerationSubstack Newsletter (pooled)8.02medium · n=5$0.00655088%no
Long-form Content Generationtheme_generation autogenerated0.00low · n=0$0.005774no
Relevance, Classification & Matchingat_content_domain_suggest autogenerated0.00low · n=0$0.001525no
Relevance, Classification & MatchingAuthor Matching8.34ranked · n=100$0.00359585%no
Relevance, Classification & Matchingauthor_living_check autogenerated8.52ranked · n=67$0.00507041%no
Relevance, Classification & MatchingLanguage Detection10.01ranked · n=84$0.000112best
Relevance, Classification & Matchingsubreddit_selection autogenerated5.00high · n=60$0.00212690%no
Relevance, Classification & Matchingsubreddit_vetting autogenerated0.00low · n=0$0.003496no
Relevance, Classification & Matchingtopic_client_matching autogenerated7.38ranked · n=100$0.010984-256%no
Relevance, Classification & Matchingvetted_site_selection autogenerated7.40low · n=1$0.012948no
Relevance, Classification & Matchingx_post_selection autogenerated8.32ranked · n=100$0.00164090%
Social & Promotional Contentactivity_promo_generation autogenerated8.16ranked · n=61$0.00125689%no
Social & Promotional Contentauto_reddit_post_generation autogenerated6.21ranked · n=100$0.03630828%no
Social & Promotional ContentSocial Post Promo (pooled)7.42ranked · n=97$0.001930-1319%no
Social & Promotional Contentx_com_messages_for_promotion autogenerated0.00low · n=0$0.017268no
Content Summarization & Synthesiscontent-summarization autogenerated4.75ranked · n=100$0.004965-142%no
Content Summarization & SynthesisDirect Browse Content Synthesis3.50low · n=1$0.001386no
Content Summarization & Synthesisexecutive_summary_generation autogenerated7.56low · n=2$0.020096no
Content Summarization & Synthesissynthesis_of_titles_for_publication autogenerated7.84ranked · n=69$0.00346565%no
Topic Organization & Clusteringps_section_reassignment autogenerated0.00low · n=0$0.004840no
Topic Organization & ClusteringTopic Discovery Clustering (pooled)5.62ranked · n=100$0.03352189%no
Topic Organization & Clusteringtopic_cluster_naming autogenerated7.87ranked · n=92$0.00349990%no
Topic Organization & Clusteringtopic_clustering_assign_sections autogenerated7.93high · n=99$0.00066690%no
Topic Organization & Clusteringtopic_sequence_determination autogenerated0.00low · n=0$0.003938no