Otso Veisterä
Founder & CEO, The Token Company
Otso founded The Token Company to solve the problem he kept seeing across AI teams: most of the tokens you send to an LLM don't contribute to the answer. The company builds learned compression models — the latest is bear-2 — that remove redundant tokens before they reach your LLM, cutting costs while preserving or improving output quality.
Before starting The Token Company, Otso worked at the intersection of ML infrastructure and product engineering. He writes about LLM cost optimization, compression, and the practical side of shipping AI products at scale.
Articles
One of the biggest token consumers globally improved quality by removing context bloat
Pax Historia ran a 268K-vote model arena. Compressed models scored higher and A/B tests showed +5% purchase lift.
Helonic: cutting inference costs while maintaining quality
How Helonic uses token compression to reduce LLM API costs without sacrificing output quality.