Pricing
Start Free. Scale When You're Ready.
Credit-based pricing that scales with your usage. No hidden fees. No contracts.
Free
forever free
Try blind evaluation and see PeerLM in action.
Get Started Free
- 200 signup credits
- Pay-as-you-go at $0.20/credit
- Standard + Advanced models
- 1 seat
- 7-day report retention
- Shareable reports (PeerLM branded)
- Community support
Not included:
- Custom eval criteria
- API & MCP access
- Premium & Frontier models
Pro
per workspace
For teams running serious evaluations across use cases.
Upgrade to Pro
- 1,000 eval credits/month
- Standard + Advanced + Premium models
- Unlimited seats
- 90-day report retention
- Shareable reports (co-branded)
- Custom eval criteria
- API & MCP access
- Email support
- Extra credits at $0.10/credit
Not included:
- SSO / SAML
Enterprise
from $500/month
For organizations that need unlimited evaluations, Frontier models, and dedicated support.
Start with a Free Managed Trial
See a sample benchmark report →
- Everything in Pro
- Unlimited eval credits
- All model tiers incl. Frontier
- 1-year report retention
- White-label reports
- API & MCP access + webhooks
- SSO / SAML
- Dedicated support & SLA
- Free managed trial included
- Volume pricing
All plans billed monthly. No contracts — cancel anytime.
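As a rough illustration of the per-credit rates listed above, the sketch below estimates what usage beyond a plan's included credits would cost. It assumes included credits are consumed before the per-credit rate applies, and it leaves out the Pro subscription price, which is not stated here. The function name `extra_credit_cost_cents` is hypothetical and not part of any PeerLM API. Note that the Free plan's 200 credits are signup credits, while the Pro allowance of 1,000 is per month.

```python
def extra_credit_cost_cents(credits_used, included_credits, rate_cents_per_credit):
    """Cost in cents of credits used beyond the included allowance.

    Assumption: included credits are spent first, and every credit after
    that is billed at the plan's per-credit rate.
    """
    extra = max(0, credits_used - included_credits)
    return extra * rate_cents_per_credit


# Free plan: 200 signup credits, then $0.20 (20 cents) per credit.
free_cents = extra_credit_cost_cents(290, included_credits=200, rate_cents_per_credit=20)

# Pro plan: 1,000 credits per month, then $0.10 (10 cents) per extra credit.
pro_cents = extra_credit_cost_cents(1090, included_credits=1000, rate_cents_per_credit=10)

print(f"Free, 290 credits used: ${free_cents / 100:.2f}")   # $18.00
print(f"Pro, 1,090 credits used: ${pro_cents / 100:.2f}")   # $9.00
```

Working in whole cents avoids floating-point rounding when multiplying by rates such as $0.10.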
How eval credits work
Credits scale with the models you use and how much they generate. Each model is assigned a tier based on its cost — the more expensive the model, the more credits it uses.
- Standard: 1x multiplier (GPT-4o-mini, DeepSeek V3)
- Advanced: 1x multiplier (GPT-4o, Claude Sonnet 4.6)
- Premium: 2x multiplier (GPT-5.2, Claude Opus 4.6)
- Frontier: 3x multiplier (o1-pro, o3-pro)
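Credit cost = SUM(model multipliers) × system prompts × test prompts × samples per prompt. The sum runs over every response model and every evaluator model in the run; the examples below work out with one system prompt and one sample per prompt.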
- Quick comparison: 3 prompts, 2 Standard responses, 1 Standard eval = 9 credits
- Mid-range eval: 5 prompts, 1 Std + 1 Adv + 1 Prem response, 1 Adv eval = 25 credits
- Full benchmark: 10 prompts, 2 Adv + 1 Prem + 1 Frontier response, 2 evals = 90 credits
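The credit formula can be checked against the three examples with a short sketch. It is an illustration under stated assumptions, not PeerLM's billing code. Evaluator models are counted in the multiplier sum alongside response models, and each example uses one system prompt and one sample per prompt. The Full benchmark's two evaluators are assumed to be 1x-tier models, since the page does not state their tiers. The name `credit_cost` is hypothetical.

```python
# Tier multipliers as listed in the "How eval credits work" section.
TIER_MULTIPLIER = {"standard": 1, "advanced": 1, "premium": 2, "frontier": 3}


def credit_cost(response_tiers, eval_tiers, test_prompts,
                system_prompts=1, samples_per_prompt=1):
    """Credit cost = SUM(model multipliers) x system prompts x test prompts x samples.

    Assumption: the multiplier sum covers both response and evaluator models.
    """
    all_tiers = list(response_tiers) + list(eval_tiers)
    multiplier_sum = sum(TIER_MULTIPLIER[tier] for tier in all_tiers)
    return multiplier_sum * system_prompts * test_prompts * samples_per_prompt


# Quick comparison: 3 prompts, 2 Standard responses, 1 Standard eval.
quick = credit_cost(["standard", "standard"], ["standard"], test_prompts=3)

# Mid-range eval: 5 prompts, Std + Adv + Prem responses, 1 Advanced eval.
mid_range = credit_cost(["standard", "advanced", "premium"], ["advanced"], test_prompts=5)

# Full benchmark: 10 prompts, 2 Adv + 1 Prem + 1 Frontier responses, 2 evals.
# The evaluator tiers are an assumption (both taken as 1x models).
full = credit_cost(["advanced", "advanced", "premium", "frontier"],
                   ["standard", "standard"], test_prompts=10)

print(quick, mid_range, full)  # 9 25 90
```

Raising `system_prompts` or `samples_per_prompt` above 1 multiplies the total accordingly. For example, running the Quick comparison with 2 samples per prompt would cost 18 credits under the same assumptions.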