Best LLMs for Financial Analysis & Trading Decisions — DTP Benchmark
Read business/financial docs deeply enough to make forward-looking judgments about opportunity and risk.
Read business/financial docs deeply enough to make forward-looking judgments about opportunity and risk.
Task-by-task breakdown
onboarding_prospect_analysis autogenerated
No model has reached MEDIUM confidence yet — accumulating evidence.
SEC Filling Analysis
| Rank | Model | Quality | Cost / call | vs best |
|---|---|---|---|---|
| 1 | GPT-5.5 best | 8.47 | $0.072695 | — |
sec-s1-chunk-analysis autogenerated
| Rank | Model | Quality | Cost / call | vs best |
|---|---|---|---|---|
| 1 | Kimi K2.6 | 8.54 | $0.034047 | 70% |
| 2 | GPT-5.5 best | 8.84 | $0.115035 | — |
synthesis_analysis autogenerated
| Rank | Model | Quality | Cost / call | vs best |
|---|---|---|---|---|
| 1 | Qwen 3.6 Plus | 8.79 | $0.014724 | 77% |
| 2 | Claude Sonnet 4.6 best | 8.95 | $0.065295 | — |
| 3 | Claude Opus 4.7 | 8.77 | $0.108825 | -67% |
Trading Recommendation
| Rank | Model | Quality | Cost / call | vs best |
|---|---|---|---|---|
| 1 | Kimi K2.6 | 8.51 | $0.074631 | 46% |
| 2 | Claude Sonnet 4.6 best | 8.79 | $0.138282 | — |
| 3 | Claude Opus 4.7 | 8.75 | $0.230470 | -67% |
| 4 | GPT-5.5 | 8.73 | $0.273628 | -98% |