
Besimple AI
Spring 2025Voice data for AI
About Company
We are building the data layer for AI, starting with audio. We start with data collection, curating our own proprietary set of diverse conversational data covering a wide range of languages, dialects and accents. We then leverage human expert audio annotators and our own annotation platform to process audio data for Automatic Speech Recognition. With human level transcription and diarization, our data help push the audio model frontier. Today we have over millions of hours of conversational data, and growing. If you need audio data for training or evaluating your voice models or voice agents, reach out! We offer flexible licensing deals that work for startups and enterprises, with minimal process. Audio data should besimple :)
Active Founders
AI product leader at top-tier tech companies including Meta, Microsoft, and Dropbox, specializing in deploying large scale AI systems to realize business value
Shenzhen-born, Rhode Island-brewed, Bay Area-domesticated. While at Meta, launched 7 products and killed 2. Most recently, led the GenAI Annotation team, developing an in-house annotation platform for training LLaMa. Previously, managed an engineering organization responsible for improving connectivity for over 300 million users and optimizing 70%+ of Meta's annual SMS spend.

