
Datacurve
Winter 2024Frontier coding data for training and evaluating LLMs
About Company
We generate expert quality coding data at scale for fine-tuning LLMs
Active Founders
Started building software in high school - built a climbing training app with Team Canada athletes. Studied at Waterloo CS for a year then dropped out. Worked with the Cohere CTO on LLM reasoning and coding capabilities through synthetic data. Went to YC W24, pivoted 3 times until Datacurve. Now scaling high quality coding data production pipelines at Datacurve to enable next generation coding models
Hacking on things since middle school. Went to Waterloo CS, interned at Google, then dove into AI research on multi-modal RL and training browser-use agents. Went through YC W24, pivoted a few times, and landed on Datacurve – now providing the data infrastructure for frontier LLMs.

