Pricing
You save money every single time. At $0.05 per 1M saved tokens, compression always costs a fraction of what your LLM charges — so every API call is cheaper with us.
$0.05per 1M saved tokens
You only pay for done work
You send in 500 tokens. We remove 200 redundant ones and return 300 compressed tokens. You only pay for the 200 tokens we removed — at $0.05 per 1M tokens.
Available models
All models are priced the same — pick the one that fits your needs.
| Model | Released |
|---|---|
| bear-1 | November 2025 |
| bear-1.1 | January 2026 |
| bear-1.2 | February 2026 |
Example
Acme Corp sends 10B tokens per week to Gemini 3 Pro Preview ($2 / 1M input tokens) and achieves 75% compression with bear-1.1.
Estimate your savings
Adjust the inputs to match your usage.
Performance benchmarks
Compression doesn't just save money — it improves accuracy and latency.
More benchmarks coming soon
We are evaluating compression across additional domains and model families. Results will be published here as they are completed.
Start compressingStart saving on your LLM costs
Free to try. No credit card required.