HomeCompaniesSepal AI

Sepal AI

Data Development for Advanced AI

Sepal AI works with the world’s leading experts for developing advanced capability training data, expert annotations, and bespoke model evaluations. Our team comes from Turing, Vercel, McKinsey, and Bain. At Turing, we built the LLM training business and products to support $100+M revenue for companies like OpenAI, Google, and Anthropic. We learned that data development for advanced AI use cases is more complex and has unique needs different from traditional label generation Foundational AI companies and large enterprises that we worked with, like OpenAI, PepsiCo, and Johnson &Johnson, don't have the data they need to train models to produce value for highly technical use cases. Which means they’re not going to unlock the value from AI without a partner. We are working with the most influential companies of our time to develop models and systems that extend human knowledge and capabilities.
Sepal AI
Founded:2024
Team Size:6
Status:
Active
Location:San Francisco
Group Partner:Michael Seibel
Active Founders

Robi Lin, Founder

Co-Founder @ Sepal AI Built the enterprise workflow products and fulfillment strategy at Turing.com. Scaled Turing’s LLM trainer business line from 50 to 800+ onboarded developers in 5 months for foundational LLM and enterprise customers. Previously was at Bain & Co.
Robi Lin
Robi Lin
Sepal AI

Kat Hu, Founder

Cofounder @ Sepal AI Built Turing’s Foundational LLM trainer business GTM. Ran orgs of 500+ AI trainers & built corresponding operations for scale. Previously was at McKinsey.

Fedor Paretsky, Founder

I'm the co-founder and CTO at Sepal AI. Previously, I built platforms and infrastructure to bill users at Vercel while it went through hyper growth ($20M -> $100M ARR). Before that, I worked on FP&A software at Mosaic and on platforms and infra at Newfront.
Fedor Paretsky
Fedor Paretsky
Sepal AI
Company Launches
🌱 Sepal AI - Confidently deploy your AI models
See original launch post ›

Tl;dr: Sepal provides frontier data and tooling for advancing responsible AI development.

__________________________________________________________

Sepal AI is on a mission to advance human knowledge and capabilities with the responsible development of artificial intelligence.

🧐 Responsibly advance human knowledge with AI? What does that mean?

We believe in a world where AI advances scientific research and empowers economic growth.

To achieve that future, AI product & model builders need:

  1. Golden Datasets and Frontier Benchmarking: To iteratively measure model performance on specific use cases.
  2. Training Data: To improve model capabilities using fine-tuning and RLHF.
  3. Safety / Red-teaming: To measure and forecast the safety of LLMs before putting them out in the wild.

__________________________________________________________

⚠️ Okay, well why does it matter?

Frontier data for AI development is vital for safe deployment & scaling. However, developing this data is difficult.

Most frontier data requires domain knowledge that can be hard to source and curate (e.g., finance, medical, physics, biology, etc.). Publicly available benchmarks (e.g., MMLU, GPQA, MATH, etc.) are contaminated and too general to be useful to actual product & model builders.

__________________________________________________________

🌱 How do we do this?

We’ve built Sepal AI - the data development platform that enables you to curate useful datasets.

The Platform: We bring data generation tooling, human experts, synthetic data augmentation, and rigorous quality control into one platform so you can manage the production of high-quality datasets.

Our Expert Network: We’ve built a network of 20k+ experts across STEM and professional services (think academic PhDs, business analysts, medical professionals, marketing and finance consultants) to support campaign design & data development.

Sample engagements we’ve run:

  • 🧬 Cell and Molecular Biology Benchmark: An original benchmark to evaluate complex reasoning across models. Produced by a team of PhD biologists from top institutions in the US.
  • 💼 Finance Q&A + SQL Eval: A Golden Dataset to test the ability of an AI agent to query a database and produce human-expert-level answers to complex finance questions.
  • 📏 Uplift Trials & Human Baselining: End to end support for conducting secure in-person evaluations on model performance.
  • …. [insert your custom use case next?]

__________________________________________________________

🙏 Asks:

  1. If you are building an AI application and need to measure or improve your model, or
  2. If you are a researcher at an AI lab building or evaluating models for new capabilities / risk areas, or
  3. If you’re passionate about the development of AI, AI safety, or evals in general…

Let’s chat — please check us out at www.sepalai.com

__________________________________________________________

👪 Our team:

Meet Kat (on the left), Robi (in the middle), Fedor (on the right)!

Robi and Kat previously built the technical LLM training business for Turing. Kat on the go-to-market & operations side. Robi on the product & fulfillment side. Fedor is a long-time close friend - he was an early engineer at Vercel & Newfront where he built out foundational infrastructure.

Say hi: founders@sepalai.com.