
The API for AI phone calls
Independent AI evaluations lab
Make AI phone calls from a single API call.
We build independent, contamination-proof benchmarks that measure real-world performance. LLM Stats is the most complete LLM leaderboard: we maintain the most complete archive of LLM benchmark results and also run independent evaluations that go beyond the classic benchmarks already present in the training data of most models. Our mission: become the biggest community dedicated to AI transparency.
The company shifted from offering an API for AI phone calls (a telephony product) to building independent LLM benchmarks and leaderboards (an AI evaluation lab). The product, the problem, and the market are all entirely different, which makes this an unmistakable full pivot.
Auto-optimizer for AI agents