Hey everyone 👋 We’re Anna and Tyler from OmniAI, and we help clean up data using AI.
Here we started with a database of all the public demo day videos and extracted some of the most interesting business metrics — without needing to watch hundreds of videos.
A company’s most valuable data set is often its least accessible. Only 20% of corporate data lives in SQL, with the remaining 80% locked away in unstructured formats (reviews, chat logs, transcripts, pdfs). It takes some serious engineering work to answer questions like:
And this problem will only get worse as AI interfaces become more ubiquitous and more user data is locked up in chat logs. 🤖
.
Even YC founders don’t have a unified strategy here. From the hundreds of companies we’ve interviewed, it usually boils down to:
🌎 Outsourced data entry - Slow and surprisingly inaccurate
🔎 Spot checking - Slow and very prone to bias (i.e. read 10 user reviews and make sweeping business decisions)
💸 Hire some ML engineers - Best results, but it’s going to be slow and expensive
.
Use OmniAI to turn that unstructured data into tabular data. OmniAI makes it simple to transform and enhance unstructured data, all within your existing data warehouse.
Run ML models across your data. Categorize, extract, summarize, translate, and more.
Anna comes from building ETLs and data enrichment tools at Hightouch. While Tyler has a background in ML and healthcare applications (most recently at Fair Square). We’ve dealt with everything from syncing CRMs to fax machine APIs.
We’re building Omni because we hate working with messy data, and we can make this problem a little better for everyone.
Email us at founders@getomni.ai