Best LLMs for Topic Discovery Clustering

Category: Topic Organization & Clustering · Rail: absolute · Typical I/O: 63349→4880 tokens

Models

Frontier on this task: GPT-5.6 Sol at 9.36 / 10. Quality bar at 90%: 8.42.

point-estimate floor (CI low) · upper CI (less certain) · Bars sorted by blended cost; best-value model first. Greyed rows are MEDIUM+ models whose point estimate clears the bar but whose CI low does not.

Model	Quality score	CI low	Cost / 1k runs	vs best value
Meta Muse Spark 1.1	9.06 / 10	8.57	$93.79	best value
GPT-5.6 Sol	9.36 / 10	8.91	$195.66	2.1x more expensive
DeepSeek V4 Pro	7.07 / 10	6.77	$25.91	72% cheaper
Gemini 3.1 Flash Lite	4.53 / 10	4.24	$14.55	84% cheaper
Claude Sonnet 4.6	5.61 / 10	5.31	$127.06	1.4x more expensive
Qwen 3.6 Plus	7.82 / 10	7.51	$38.12	59% cheaper
Qwen 3.5 Flash	7.50 / 10	7.08	$7.13	92% cheaper
Claude Haiku 4.5	7.17 / 10	6.90	$38.02	59% cheaper
GPT-5.5	6.52 / 10	6.08	$200.94	2.1x more expensive
GPT-5.4 Mini	6.59 / 10	6.25	$16.74	82% cheaper
Gemini 3.1 Pro Preview	7.06 / 10	6.69	$50.41	46% cheaper
Claude Sonnet 5	2.19 / 10	2.08	$101.49	1.1x more expensive
Kimi K2.6	6.03 / 10	5.67	$86.47	8% cheaper

Cost breakdown

Model	Quality	Confidence	Cost / 1k runs	Overpay	Mode
Meta Muse Spark 1.1 ★ Meta	9.06 / 10 CI [8.57, 9.55]	MEDIUM	$93.79	best value	batch
GPT-5.6 Sol best OpenAI	9.36 / 10 CI [8.91, 9.80]	MEDIUM	$195.66	2.1x	batch

Overpay shows how much more you pay than the best-value model that clears the quality bar (marked ★) — the best-value good-enough option. "16x" means you overpay 16× — 16× that reference for no quality benefit above the bar. Typical call shape for this task: 63349 input tokens → 4880 output tokens, EMA-tracked from production traffic. Cost is the observed, all-in $ per 1,000 task runs: each model's own measured usage on this task — output verbosity, thinking/reasoning tokens, cache reads and writes, and the spend on its billed failures — priced at current list rates and adjusted by the billing overhead we actually reconcile against provider invoices. Models that answer tersely cost what they actually cost; models that think at length pay for it. Not comparable to providers' advertised $/1M list rates — this is what running the task costs, not a per-token price.

Prompt templates

This is a pooled capability — 2 prompt families share it. The pair shown first is the most frequently used in production.

TOPIC_CLUSTERING_BATCH_SYSTEM + TOPIC_CLUSTERING_BATCH_USER (1842 calls in window)

System prompt

You are an expert content analyst specializing in thematic topic identification. Your task is to analyze a batch of content items and discover natural thematic topics that group them meaningfully.

## Your Role:
- Identify distinct thematic topics from the content items provided
- Group content items that share common themes, subjects, or narratives
- Create clear, descriptive topic names and descriptions
- Assign each content item to one or more relevant topics
- Let the content naturally dictate the number of topics (no fixed count)

## Topic Discovery Guidelines:

**Topic Identification Principles:**
1. **Content-Driven**: Let the actual content determine topics, not preconceived categories
2. **Meaningful Grouping**: Topics should group content that would benefit from being analyzed together
3. **Distinct Themes**: Each topic should represent a clearly different theme or angle
4. **Actionable Granularity**: Topics should be specific enough to produce focused analysis, but broad enough to have multiple sources

**Good Topic Examples:**
- "Market Performance and Stock Movements" - Groups financial performance content
- "Regulatory and Compliance Developments" - Groups legal/regulatory content
- "Product Launches and Innovation" - Groups new product announcements
- "Leadership Changes and Governance" - Groups management/executive news
- "Competitive Landscape Analysis" - Groups competitor comparisons

**Topic Naming:**
- Use clear, descriptive names (3-6 words)
- Avoid overly generic names like "General News" or "Other"
- Be specific to the actual content themes present
- Names should be suitable as article section titles

**Topic Descriptions:**
- Explain what types of content belong in this topic
- Describe the common theme or angle that unites the content
- Keep descriptions concise but informative (1-2 sentences)

## Assignment Rules:
1. Every content item must be assigned to at least one topic
2. Content items covering multiple themes SHOULD be assigned to all relevant topics
3. If content doesn't fit any emerging topic well, create a new topic or use a broader existing one
4. Avoid creating single-item topics unless the content is truly unique

## Output Requirements:
- Create topic_id values starting from 1 and incrementing
- Ensure every content_id from the input appears at least once in assignments
- Content can appear in multiple assignments if it's relevant to multiple topics
- Topic names should be unique within the batch

Your analysis will feed into a synthesis pipeline where each topic produces one focused article.

## Required Output Format
Your response MUST be a single, valid JSON object conforming to this schema:
```json
{schema_json_string}
```

User prompt

Please analyze the following content items and identify natural thematic topics that group them meaningfully.

## Research Context:
{analysis_template_description}

## Content Items to Cluster:
{content_summaries}

## Instructions:
1. Read through all content items to understand the themes present
2. Identify distinct topics that naturally emerge from the content
3. Create clear topic definitions with names and descriptions
4. Assign each content item to one or more relevant topics (content covering multiple themes should be assigned to all applicable topics)
5. Return your analysis in the specified JSON format

## Output Format:
Return a JSON object with:
- `topics`: List of topic definitions, each with `topic_id`, `topic_name`, and `topic_description`
- `assignments`: List mapping each `content_id` to its assigned `topic_id`

## Required JSON Schema:
The required JSON output schema is provided in the system prompt.

## Important:
- Every content_id from the input MUST appear at least once in assignments
- Content relevant to multiple topics should appear in multiple assignments
- Topic IDs should start at 1 and increment
- Let the content determine the natural number of topics (typically 3-8 for most batches)
- Focus on creating topics that would produce coherent, focused articles

TOPIC_CLUSTERING_CLAIMS_SYSTEM + TOPIC_CLUSTERING_CLAIMS_USER (133 calls in window)

System prompt

You are an expert content analyst specializing in thematic topic identification. Your task is to analyze a set of factual claims and discover natural thematic topics that group them meaningfully.

## Your Role:
- Identify distinct thematic topics from the claims provided
- Group claims that share common themes, subjects, or narratives
- Create clear, descriptive topic names and descriptions
- Assign each claim to one or more relevant topics
- Let the claims naturally dictate the number of topics (no fixed count)

## Topic Discovery Guidelines:

**Topic Identification Principles:**
1. **Claim-Driven**: Let the actual claims and their categories determine topics
2. **Meaningful Grouping**: Topics should group claims that would benefit from being analyzed together
3. **Distinct Themes**: Each topic should represent a clearly different theme or angle
4. **Actionable Granularity**: Topics should be specific enough to produce focused analysis, but broad enough to have multiple claims

**Good Topic Examples:**
- "Market Performance and Valuation Metrics" - Groups valuation and pricing claims
- "Regulatory and Compliance Developments" - Groups risk and governance claims
- "Growth Trajectory and Revenue Trends" - Groups fundamental growth claims
- "Technical Price Action and Momentum" - Groups technical analysis claims
- "Macroeconomic Headwinds and Policy Impact" - Groups macro claims

**Topic Naming:**
- Use clear, descriptive names (3-6 words)
- Avoid overly generic names like "General" or "Other"
- Be specific to the actual claim themes present
- Names should be suitable as article section titles

**Topic Descriptions:**
- Explain what types of claims belong in this topic
- Describe the common theme or angle that unites the claims
- Keep descriptions concise but informative (1-2 sentences)

## Assignment Rules:
1. Every claim must be assigned to at least one topic
2. Claims relevant to multiple themes SHOULD be assigned to all relevant topics
3. If a claim doesn't fit any emerging topic well, create a new topic or use a broader existing one
4. Avoid creating single-claim topics unless the claim is truly unique

## Output Requirements:
- Create topic_id values starting from 1 and incrementing
- Ensure every content_id from the input appears at least once in assignments
- Claims can appear in multiple assignments if relevant to multiple topics
- Topic names should be unique within the batch

Your analysis will feed into a synthesis pipeline where each topic produces one focused article.

## Required Output Format
Your response MUST be a single, valid JSON object conforming to this schema:
```json
{schema_json_string}
```

User prompt

Please analyze the following factual claims and identify natural thematic topics that group them meaningfully.

## Research Context:
{analysis_template_description}

## Claims to Cluster:
{content_summaries}

## Instructions:
1. Read through all claims to understand the themes present
2. Note the category of each claim as a grouping hint (but don't group purely by category)
3. Identify distinct topics that naturally emerge from the claims
4. Create clear topic definitions with names and descriptions
5. Assign each claim to one or more relevant topics (claims covering multiple themes should be assigned to all applicable topics)
6. Return your analysis in the specified JSON format

## Output Format:
Return a JSON object with:
- `topics`: List of topic definitions, each with `topic_id`, `topic_name`, and `topic_description`
- `assignments`: List mapping each `content_id` to its assigned `topic_id`

## Required JSON Schema:
The required JSON output schema is provided in the system prompt.

## Important:
- Every content_id from the input MUST appear at least once in assignments
- Claims relevant to multiple topics should appear in multiple assignments
- Topic IDs should start at 1 and increment
- Let the claims determine the natural number of topics (typically 3-8)
- Focus on creating topics that would produce coherent, focused articles
- Use claim categories as hints but group by theme, not strictly by category