Pivot Archive
All pivots
Benchspan logo

Benchspan

Sp26
2 people|Active|Website
AIOpsAnalyticsInfrastructureAI
132°Near Reinvention
Before

Institutional data layer for prediction markets

After

Run agent benchmarks in minutes, not hours

Full description — before

Oddpool is probability infrastructure for prediction markets. We capture full orderbook microstructure across venues and normalize it into structured, queryable data for quantitative and institutional workflows. Parquet-native, with full provenance. A Federal Reserve research paper found that prediction markets provide a statistically significant improvement over traditional forecasts for major macro events. But the data is fragmented across venues, unstandardized, and has no institutional-grade historical record. Oddpool provides the normalized cross-venue data layer, the canonical event mapping that links the same outcome across exchanges, and the historical depth that compounds over time and can't be rebuilt retroactively.

Full description — after

Benchspan is a benchmarking platform for AI agents. If you're building an agent, you need to know if it's getting better. But running benchmarks is slow, expensive, and fragile. You spend days writing glue code every time you want to run a new benchmark, runs take forever on your laptop, and when they fail halfway through you burn hundreds of dollars in tokens with nothing to show for it. Benchspan fixes all of it. Onboard your agent once, and it works with every benchmark on the platform. We onboarded Claude Code in 37 lines of code. Running a benchmark becomes a single command, executed in parallel in the cloud. Every result goes to one place your whole team can see, with full trajectories, token usage, latency, and custom metrics. When runs partially fail, rerun just the subset that errored instead of starting from scratch. Compare runs side by side to see exactly where your agent is improving and where it's regressing.

Category shift
AI Investment & ResearchEnterprise AI Agents
Summary

Oddpool pivoted from building data infrastructure for prediction markets to creating a benchmarking platform for AI agents - completely unrelated products in different industries.

Detected today · 2026-03-26