# The Token Company > LLM input compression API. Reduce tokens by 66%, cut AI costs by 3x, improve accuracy. The Token Company builds compression models (bear-1, bear-1.1, bear-1.2) that remove low-signal tokens from LLM prompts before they reach the model. By stripping noise, the LLM spends its attention on the tokens that actually matter — which is why compressed prompts often score higher than uncompressed ones. One API call compresses your input; you pass the compressed text to any LLM. Free tier includes up to 1B processed tokens per month. Backed by Y Combinator. ## Key Links - [Home](https://thetokencompany.com) - [Pricing](https://thetokencompany.com/pricing) - [Blog](https://thetokencompany.com/blog) - [Benchmarks](https://thetokencompany.com/benchmarks) - [Contact](https://thetokencompany.com/contact) - [Careers](https://thetokencompany.com/careers) - [Privacy Policy](https://thetokencompany.com/privacy) - [Data Residency](https://thetokencompany.com/data-residency) ## Blog Posts - [Pax Historia Case Study](https://thetokencompany.com/blog/pax-historia): 193B tokens/mo customer improved quality with compression in a 268K-vote blind arena - [bear-1.1 Release](https://thetokencompany.com/blog/bear-1-1): Improved accuracy preservation and faster compression - [bear-1 Launch](https://thetokencompany.com/blog/bear-1): First LLM input compression model ## Benchmarks - [FinanceBench](https://thetokencompany.com/benchmarks/financebench): Accuracy evaluation on real-world financial documents