Cost mode:

At a glance

Good enough on 3/35 tasks at the 95% bar. Cheapest qualifier on 2 tasks. Doesn't qualify on any: Financial Analysis & Trading Decisions, Long-form Content Generation, Social & Promotional Content, Topic Organization & Clustering, Infrastructure & Utility.

Provider
OpenAI
Model name
gpt-5.4-nano
Qualifies on
3 / 35 tasks (at 90% bar)
Cheapest qualifier on
2 tasks

Cost vs quality across all tasks

$0.00005$0.00023$0.00113$0.00561$0.027810246810Quality score (0–10)Blended cost / callat_content_domain_suggest autogenerated — quality 0.00, cost $0.000617Author Matching — quality 5.08, cost $0.001440author_soul_generation autogenerated — quality 8.83, cost $0.005944auto_reddit_post_generation autogenerated — quality 6.50, cost $0.015077claim_extraction autogenerated — quality 2.18, cost $0.000647claim_refinement autogenerated — quality 6.78, cost $0.000654Claim-Referenced Analyst Writing (pooled) — quality 7.12, cost $0.003947content-summarization autogenerated — quality 8.92, cost $0.002052 (anchor)Direct Browse Content Synthesis — quality 1.65, cost $0.000568executive_summary_generation autogenerated — quality 6.50, cost $0.008263Generic TOC Extraction — quality 0.00, cost $0.000412image_prompt_generation autogenerated — quality 6.89, cost $0.001282Language Detection — quality 9.98, cost $0.000046LLM Prompt Adaptation — quality 5.62, cost $0.003641markdown_newline_repair autogenerated — quality 0.00, cost $0.027812metadata_paragraph_improvement autogenerated — quality 7.70, cost $0.000259onboarding_chapter_generation autogenerated — quality 0.00, cost $0.008006onboarding_chapter_prompt_generation autogenerated — quality 7.93, cost $0.018063onboarding_prospect_analysis autogenerated — quality 0.00, cost $0.001102ps_section_reassignment autogenerated — quality 0.00, cost $0.001984region_identification autogenerated — quality 7.38, cost $0.001024S1 TOC extraction — quality 0.00, cost $0.001641SEC Filling Analysis — quality 5.14, cost $0.003022sec-s1-chunk-analysis autogenerated — quality 8.16, cost $0.004744section_generation autogenerated — quality 8.37, cost $0.001546Social Post Promo (pooled) — quality 7.25, cost $0.000785structured_output_extraction autogenerated — quality 9.76, cost $0.001944subreddit_selection autogenerated — quality 3.93, cost $0.000865subreddit_vetting autogenerated — quality 6.00, cost $0.001447Substack Newsletter (pooled) — quality 7.71, cost $0.002725synthesis_analysis autogenerated — quality 6.55, cost $0.004575synthesis_of_titles_for_publication autogenerated — quality 7.83, cost $0.001432theme_generation autogenerated — quality 0.00, cost $0.002360Topic Discovery Clustering (pooled) — quality 4.71, cost $0.013709topic_client_matching autogenerated — quality 4.53, cost $0.004530topic_cluster_naming autogenerated — quality 7.36, cost $0.001410topic_clustering_assign_sections autogenerated — quality 7.90, cost $0.000268topic_sequence_determination autogenerated — quality 0.00, cost $0.001625Trading Recommendation — quality 7.86, cost $0.011377Translation — quality 5.92, cost $0.001319vetted_site_selection autogenerated — quality 0.00, cost $0.005315x_com_messages_for_promotion autogenerated — quality 0.00, cost $0.007022x_post_selection autogenerated — quality 6.60, cost $0.000659

qualifies at 90% bar · doesn't qualify · ★ this model is the best on that task. Lower + further right = cheaper + higher quality. Y-axis is log-scaled.

Per-task breakdown

CategoryTaskQualityConfidenceCost / callvs bestQualifies @ 90%
Structured Data & Fact Extractionclaim_extraction autogenerated2.18high · n=100$0.000647-18%no
Structured Data & Fact ExtractionGeneric TOC Extraction0.00low · n=0$0.000412no
Structured Data & Fact Extractionregion_identification autogenerated7.38low · n=22$0.00102485%no
Structured Data & Fact ExtractionS1 TOC extraction0.00low · n=0$0.001641no
Structured Data & Fact Extractionstructured_output_extraction autogenerated9.76high · n=88$0.00194437%
Financial Analysis & Trading Decisionsonboarding_prospect_analysis autogenerated0.00low · n=0$0.001102no
Financial Analysis & Trading DecisionsSEC Filling Analysis5.14medium · n=100$0.00302296%no
Financial Analysis & Trading Decisionssec-s1-chunk-analysis autogenerated8.16ranked · n=85$0.00474496%no
Financial Analysis & Trading Decisionssynthesis_analysis autogenerated6.55low · n=29$0.00457593%no
Financial Analysis & Trading DecisionsTrading Recommendation7.86high · n=81$0.01137792%no
Infrastructure & Utilityclaim_refinement autogenerated6.78medium · n=100$0.00065469%no
Infrastructure & Utilityimage_prompt_generation autogenerated6.89high · n=100$0.00128295%no
Infrastructure & UtilityLLM Prompt Adaptation5.62medium · n=100$0.00364192%no
Infrastructure & Utilitymarkdown_newline_repair autogenerated0.00low · n=0$0.027812no
Infrastructure & Utilitymetadata_paragraph_improvement autogenerated7.70high · n=91$0.00025968%no
Infrastructure & Utilityonboarding_chapter_prompt_generation autogenerated7.93ranked · n=81$0.01806396%no
Infrastructure & UtilityTranslation5.92medium · n=90$0.00131996%no
Long-form Content Generationauthor_soul_generation autogenerated8.83ranked · n=93$0.00594495%no
Long-form Content GenerationClaim-Referenced Analyst Writing (pooled)7.12high · n=96$0.00394792%no
Long-form Content Generationonboarding_chapter_generation autogenerated0.00low · n=0$0.008006no
Long-form Content Generationsection_generation autogenerated8.37low · n=6$0.00154668%no
Long-form Content GenerationSubstack Newsletter (pooled)7.71low · n=4$0.00272595%no
Long-form Content Generationtheme_generation autogenerated0.00low · n=0$0.002360no
Relevance, Classification & Matchingat_content_domain_suggest autogenerated0.00low · n=0$0.000617no
Relevance, Classification & MatchingAuthor Matching5.08medium · n=100$0.00144094%no
Relevance, Classification & MatchingLanguage Detection9.98ranked · n=85$0.00004659%
Relevance, Classification & Matchingsubreddit_selection autogenerated3.93high · n=62$0.00086596%no
Relevance, Classification & Matchingsubreddit_vetting autogenerated6.00low · n=1$0.001447no
Relevance, Classification & Matchingtopic_client_matching autogenerated4.53medium · n=100$0.004530-47%no
Relevance, Classification & Matchingvetted_site_selection autogenerated0.00low · n=0$0.005315no
Relevance, Classification & Matchingx_post_selection autogenerated6.60medium · n=99$0.00065996%no
Social & Promotional Contentauto_reddit_post_generation autogenerated6.50high · n=100$0.01507770%no
Social & Promotional ContentSocial Post Promo (pooled)7.25high · n=100$0.000785-477%no
Social & Promotional Contentx_com_messages_for_promotion autogenerated0.00low · n=0$0.007022no
Content Summarization & Synthesiscontent-summarization autogenerated8.92ranked · n=100$0.002052best
Content Summarization & SynthesisDirect Browse Content Synthesis1.65low · n=2$0.000568no
Content Summarization & Synthesisexecutive_summary_generation autogenerated6.50low · n=1$0.008263no
Content Summarization & Synthesissynthesis_of_titles_for_publication autogenerated7.83high · n=67$0.00143286%no
Topic Organization & Clusteringps_section_reassignment autogenerated0.00low · n=0$0.001984no
Topic Organization & ClusteringTopic Discovery Clustering (pooled)4.71medium · n=100$0.01370996%no
Topic Organization & Clusteringtopic_cluster_naming autogenerated7.36medium · n=89$0.00141096%no
Topic Organization & Clusteringtopic_clustering_assign_sections autogenerated7.90medium · n=100$0.00026896%no
Topic Organization & Clusteringtopic_sequence_determination autogenerated0.00low · n=0$0.001625no