
At a glance

Good enough on 2/35 tasks at the 90% bar. Cheapest qualifier on 0 tasks. Doesn't qualify on any task in these categories: Financial Analysis & Trading Decisions, Structured Data & Fact Extraction, Content Summarization & Synthesis, Long-form Content Generation, Topic Organization & Clustering, Infrastructure & Utility.

Provider: OpenAI
Model name: gpt-5.4-mini
Qualifies on: 2 / 35 tasks (at 90% bar)
Cheapest qualifier on: 0 tasks

Cost vs quality across all tasks

[Scatter plot: quality score (0–10) on the x-axis against blended cost per call on the y-axis (ticks at $0.00017, $0.00083, $0.00412, $0.02032, $0.10028), one point per task. Each point's quality and cost appear in the per-task breakdown.]

Legend: qualifies at 90% bar · doesn't qualify · ★ this model is the best on that task. Lower and further right means cheaper and higher quality. The y-axis is log-scaled.
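To make the log-scaled cost axis concrete, here is a minimal sketch of how evenly spaced ticks on a log scale can be computed. It assumes the axis runs from the cheapest task ($0.000169, Language Detection) to the most expensive one ($0.100280, markdown_newline_repair) with five ticks. The function name `log_ticks` is hypothetical and is not taken from the dashboard's code.

```python
import math

def log_ticks(lo, hi, count):
    """Return `count` values evenly spaced on a log scale from lo to hi."""
    step = (math.log(hi) - math.log(lo)) / (count - 1)
    return [math.exp(math.log(lo) + i * step) for i in range(count)]

# Assumed axis range: min and max cost per call among the tasks on this page.
ticks = log_ticks(0.000169, 0.100280, 5)
labels = [f"${t:.5f}" for t in ticks]
print(labels)
```

Under these assumptions the labels come out as $0.00017, $0.00083, $0.00412, $0.02032 and $0.10028, matching the tick values on the chart's cost axis. Each tick is roughly 4.9 times the previous one, so equal vertical distances correspond to equal cost ratios rather than equal dollar differences.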

Per-task breakdown

| Category | Task | Quality | Confidence | Cost / call | vs best | Qualifies @ 90% |
|---|---|---|---|---|---|---|
| Structured Data & Fact Extraction | claim_extraction (autogenerated) | 3.70 | high · n=100 | $0.002370 | -333% | no |
| Structured Data & Fact Extraction | Generic TOC Extraction | 0.00 | low · n=0 | $0.001500 | | no |
| Structured Data & Fact Extraction | region_identification (autogenerated) | 6.70 | low · n=22 | $0.003698 | 46% | no |
| Structured Data & Fact Extraction | S1 TOC extraction | 0.00 | low · n=0 | $0.005947 | | no |
| Structured Data & Fact Extraction | structured_output_extraction (autogenerated) | 7.96 | medium · n=91 | $0.007078 | -131% | no |
| Financial Analysis & Trading Decisions | onboarding_prospect_analysis (autogenerated) | 0.00 | low · n=0 | $0.003996 | | no |
| Financial Analysis & Trading Decisions | SEC Filling Analysis | 5.74 | medium · n=100 | $0.010904 | 85% | no |
| Financial Analysis & Trading Decisions | sec-s1-chunk-analysis (autogenerated) | 7.63 | high · n=85 | $0.017255 | 85% | no |
| Financial Analysis & Trading Decisions | synthesis_analysis (autogenerated) | 6.43 | low · n=29 | $0.016989 | 74% | no |
| Financial Analysis & Trading Decisions | Trading Recommendation | 7.55 | high · n=81 | $0.041044 | 70% | no |
| Infrastructure & Utility | claim_refinement (autogenerated) | 6.95 | medium · n=100 | $0.002399 | -15% | no |
| Infrastructure & Utility | image_prompt_generation (autogenerated) | 8.13 | ranked · n=100 | $0.004650 | 83% | no |
| Infrastructure & Utility | LLM Prompt Adaptation | 6.03 | medium · n=100 | $0.013139 | 70% | no |
| Infrastructure & Utility | markdown_newline_repair (autogenerated) | 0.00 | low · n=0 | $0.100280 | | no |
| Infrastructure & Utility | metadata_paragraph_improvement (autogenerated) | 6.95 | medium · n=91 | $0.000939 | -15% | no |
| Infrastructure & Utility | onboarding_chapter_prompt_generation (autogenerated) | 8.06 | high · n=82 | $0.065096 | 85% | no |
| Infrastructure & Utility | Translation | 7.25 | medium · n=90 | $0.004768 | 85% | no |
| Long-form Content Generation | author_soul_generation (autogenerated) | 8.66 | ranked · n=93 | $0.021424 | 82% | no |
| Long-form Content Generation | Claim-Referenced Analyst Writing (pooled) | 7.83 | high · n=98 | $0.014471 | 72% | no |
| Long-form Content Generation | onboarding_chapter_generation (autogenerated) | 0.00 | low · n=0 | $0.028846 | | no |
| Long-form Content Generation | section_generation (autogenerated) | 8.47 | medium · n=6 | $0.005574 | -15% | no |
| Long-form Content Generation | Substack Newsletter (pooled) | 7.39 | low · n=4 | $0.009825 | 82% | no |
| Long-form Content Generation | theme_generation (autogenerated) | 0.00 | low · n=0 | $0.008662 | | no |
| Relevance, Classification & Matching | at_content_domain_suggest (autogenerated) | 0.00 | low · n=0 | $0.002288 | | no |
| Relevance, Classification & Matching | Author Matching | 6.42 | high · n=100 | $0.005393 | 78% | no |
| Relevance, Classification & Matching | Language Detection | 9.99 | ranked · n=85 | $0.000169 | -51% | yes |
| Relevance, Classification & Matching | Relevance Scoring (POST) | 2.56 | medium · n=100 | $0.001500 | 62% | no |
| Relevance, Classification & Matching | Relevance Scoring (Topic Report) | 5.51 | low · n=44 | $0.001500 | -838% | no |
| Relevance, Classification & Matching | Relevance Scoring (X Post) | 4.92 | low · n=12 | $0.001500 | | no |
| Relevance, Classification & Matching | subreddit_selection (autogenerated) | 4.33 | high · n=62 | $0.003189 | 85% | no |
| Relevance, Classification & Matching | subreddit_vetting (autogenerated) | 0.00 | low · n=0 | $0.005244 | | no |
| Relevance, Classification & Matching | topic_client_matching (autogenerated) | 5.04 | medium · n=100 | $0.016476 | -434% | no |
| Relevance, Classification & Matching | vetted_site_selection (autogenerated) | 0.00 | low · n=0 | $0.019421 | | no |
| Relevance, Classification & Matching | x_post_selection (autogenerated) | 7.53 | high · n=99 | $0.002460 | 85% | no |
| Social & Promotional Content | activity_promo_generation (autogenerated) | 8.62 | ranked · n=64 | $0.001884 | 84% | yes |
| Social & Promotional Content | auto_reddit_post_generation (autogenerated) | 6.69 | high · n=100 | $0.054463 | -9% | no |
| Social & Promotional Content | Social Post Promo (pooled) | 7.34 | ranked · n=100 | $0.002895 | -2029% | no |
| Social & Promotional Content | x_com_messages_for_promotion (autogenerated) | 0.00 | low · n=0 | $0.025901 | | no |
| Content Summarization & Synthesis | content-summarization (autogenerated) | 6.20 | medium · n=100 | $0.007448 | -263% | no |
| Content Summarization & Synthesis | Direct Browse Content Synthesis | 5.17 | low · n=39 | $0.002080 | | no |
| Content Summarization & Synthesis | executive_summary_generation (autogenerated) | 3.00 | low · n=1 | $0.030145 | | no |
| Content Summarization & Synthesis | synthesis_of_titles_for_publication (autogenerated) | 7.66 | high · n=67 | $0.005197 | 48% | no |
| Topic Organization & Clustering | ps_section_reassignment (autogenerated) | 0.00 | low · n=0 | $0.007261 | | no |
| Topic Organization & Clustering | Topic Discovery Clustering (pooled) | 6.63 | high · n=100 | $0.050282 | 84% | no |
| Topic Organization & Clustering | topic_cluster_naming (autogenerated) | 7.97 | high · n=89 | $0.005248 | 85% | no |
| Topic Organization & Clustering | topic_clustering_assign_sections (autogenerated) | 7.42 | medium · n=100 | $0.000999 | 85% | no |
| Topic Organization & Clustering | topic_sequence_determination (autogenerated) | 0.00 | low · n=0 | $0.005907 | | no |
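
The headline figures can be cross-checked against the breakdown with a short tally. This sketch is one possible reading, not the dashboard's own logic. It assumes the denominator of 35 counts only tasks with at least one evaluated sample (n > 0), which the page does not state. The per-category counts are hand-tallied from the rows above, and the two qualifying rows are taken to be Language Detection and activity_promo_generation, the only rows not marked "no".

```python
# (category, rows in breakdown, rows with n=0, rows qualifying at the 90% bar)
# Counts are hand-tallied from the per-task breakdown; treat them as an assumption.
tallies = [
    ("Structured Data & Fact Extraction", 5, 2, 0),
    ("Financial Analysis & Trading Decisions", 5, 1, 0),
    ("Infrastructure & Utility", 7, 1, 0),
    ("Long-form Content Generation", 6, 2, 0),
    ("Relevance, Classification & Matching", 11, 3, 1),
    ("Social & Promotional Content", 4, 1, 1),
    ("Content Summarization & Synthesis", 4, 0, 0),
    ("Topic Organization & Clustering", 5, 2, 0),
]

total_rows = sum(rows for _, rows, _, _ in tallies)
unsampled = sum(empty for _, _, empty, _ in tallies)
sampled = total_rows - unsampled  # assumed meaning of the "35" denominator
qualifying = sum(q for _, _, _, q in tallies)
no_qualifier = [name for name, _, _, q in tallies if q == 0]

print(f"{total_rows} rows, {unsampled} with n=0, {sampled} with samples")
print(f"Qualifies on {qualifying} / {sampled}")
print("Categories with no qualifying task:", len(no_qualifier))
```

Under that reading the 47 rows reduce to 35 sampled tasks, 2 of which qualify, and six categories have no qualifying task, which agrees with the summary at the top of the page.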