# The Token Company

> LLM input compression API. Reduce tokens by 66%, cut AI costs by 3x, improve accuracy.

The Token Company builds compression models (bear-1, bear-1.1, bear-1.2) that remove low-signal tokens from LLM prompts before they reach the model. With the noise stripped out, the LLM spends its attention on the tokens that actually matter, which is why compressed prompts often score higher than uncompressed ones. One API call compresses your input; you then pass the compressed text to any LLM. The free tier includes up to 1B processed tokens per month. Backed by Y Combinator.
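The headline figures (66% fewer tokens, roughly 3x lower cost) are related by simple arithmetic. The sketch below is illustrative only: it assumes cost scales linearly with input token count and ignores output tokens and any fixed fees. The function name `cost_reduction_factor` is hypothetical and not part of any Token Company API.

```python
def cost_reduction_factor(token_reduction: float) -> float:
    """Return how many times cheaper input becomes after compression.

    token_reduction is the fraction of input tokens removed (0.66 for 66%).
    Assumption: cost is proportional to input tokens only.
    """
    if not 0.0 <= token_reduction < 1.0:
        raise ValueError("token_reduction must be in [0, 1)")
    # Keeping (1 - r) of the tokens means paying (1 - r) of the cost,
    # so the cost shrinks by a factor of 1 / (1 - r).
    return 1.0 / (1.0 - token_reduction)


factor = cost_reduction_factor(0.66)
print(f"{factor:.2f}x")  # prints "2.94x"
```

Under these assumptions a 66% reduction gives about 2.94x, which rounds to the quoted 3x. Real savings depend on how much of a bill comes from input tokens.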

## Key Links

- [Home](https://thetokencompany.com)
- [Pricing](https://thetokencompany.com/pricing)
- [Blog](https://thetokencompany.com/blog)
- [Benchmarks](https://thetokencompany.com/benchmarks)
- [Contact](https://thetokencompany.com/contact)
- [Careers](https://thetokencompany.com/careers)
- [Privacy Policy](https://thetokencompany.com/privacy)
- [Data Residency](https://thetokencompany.com/data-residency)

## Blog Posts

- [Pax Historia Case Study](https://thetokencompany.com/blog/pax-historia): Customer processing 193B tokens/mo improved quality with compression in a 268K-vote blind arena
- [bear-1.1 Release](https://thetokencompany.com/blog/bear-1-1): Improved accuracy preservation and faster compression
- [bear-1 Launch](https://thetokencompany.com/blog/bear-1): First LLM input compression model

## Benchmarks

- [FinanceBench](https://thetokencompany.com/benchmarks/financebench): Accuracy evaluation on real-world financial documents