Over the past decade, I have had the privilege of developing ML products at several companies. My most recent position at Apple involved overseeing the gathering of training data for the team, which required a significant investment of hundreds of millions of dollars. While working on this project, we utilized a range of tools, including spreadsheets, open-source resources, and custom internal tools. Despite the company’s resources, I found myself wishing for a labeling interface that didn't exist - one that would have made the process more efficient and streamlined.It was this realization that led me to create Datasaur, a solution that would address this gap in the market once and for all. Our goal is to provide an intuitive and effective labeling interface that can benefit the broader NLP ecosystem across all industries and languages.
We create smart and efficient data labeling tools.
Behind every AI algorithm are vast amounts of human-labeled training examples. Organizing and labeling such data today is tedious, time-consuming and expensive. We started Datasaur to create intelligent, efficient and productive data labeling tools to save companies time and improve labeling quality.