HomeCompaniesHalluminate

Halluminate

Data and RL environments to automate knowledge work

We help foundation model labs and enterprises train computer use AI with data and sandboxes. A knowledge worker spends 90% of their day on a computer. A productive AI agent must similarly learn how to use our computers, browsers, and software to deliver real world value. Today, browser and computer use AI is inaccurate, slow, and expensive. Model labs and enterprises who want to improve these agents are bottlenecked by two resources: 1) High quality datasets for benchmarking and evaluations 2) Realistic sandbox environments for safe and accurate testing/training (ex. a simulated version of Salesforce) Halluminate offers a suite of products and services to address both these issues. Our evaluation service combines proprietary datasets with high quality annotations to help our customers identify and prioritize the biggest failure modes of their computer- and browser- use AI. Our platform provides a catalog of fully managed sandboxes, empowering our customers to safely and accurately test/train at scale, resulting in improved performance. Our customers see - Improved browser- and computer-use agent performance - New and emergent frontier agent capabilities - Data driven prioritization leading to exponentially faster development speed - Increased revenue/sales via marketing from public benchmarks Our paying customers already include leading computer use model labs and the two largest browser agent companies in the space. Wyatt and Jerry are friends that met their first week of school studying CS at Cornell University. They’ve been working and studying together for 7+ years.
Active Founders
Jerry Wu
Jerry Wu
Founder
Jerry Wu is co-founder and CEO of Halluminate. Previously, he led product and research at Capital One Labs, launching one of the first AI agents in financial services and co-authoring three patents. Jerry studied Computer Science and Economics at Cornell, where he researched model quantization methods and served as VP of the Cornell Consulting Group. He was also class speaker at Acton-Boxborough High School.
Wyatt Marshall
Wyatt Marshall
Founder
2x early startup data/sw eng Building a platform for environments, evals, and benchmarks to train and test computer agents AMA about LLM/agent benchmarking and evals! https://halluminate.ai/
Company Launches
Halluminate: Data and sandboxes to train computer use AI 🖥️
See original launch post

Tldr

AI workers must learn how to use our computers, browsers, and software interfaces to deliver real-world value. Today, computer use agents are unreliable and inaccurate. Halluminate is building realistic sandboxes and datasets to train better computer/browser use AI.

Ask

Looking to chat with researchers or founders training computer/browser use agents!

❌ Problem

OpenAI’s Operator and Claude’s Computer Use give us a glimpse into the future where AI can take control of digital interfaces and do real work.

Performance today is inaccurate/unreliable. There are two bottlenecks to performance improvements.

First, reliance on real-world testing: researchers today train/test their browser- and computer-use agents on real-world sites. This is

  • Unsafe - agent actions have real consequences and data impact
  • Slow - challenging to parallelize
  • Difficult to reproduce - the real world is dynamic; the starting conditions cannot be “reset” easily, and data changes
  • Lots of noise - proxy, captcha, auth/login, ads, etc,. make it difficult to do clean testing/training

Second, lack of high-quality data: High-quality data provides the basis for evaluations and benchmarking. Producing this at scale is expensive, time-consuming, and logistically exhausting.

💡 Our Solution

Launch YC - Halluminate

At Halluminate, we’re building a suite of products and services to address both these issues.

  • Realistic sandboxes – Fully managed, parallelizable environments modeled after popular systems (e.g., Salesforce, Slack, Ticketing Software, websites) for safe and accurate computer/browser use training and testing.
  • Datasets – proprietary benchmarks and datasets
  • Evaluations - high-quality error analysis powered by expert annotators to identify and prioritize the biggest failure modes for our customers

Our customers see

  • Improved browser- and computer-use agent performance
  • New and emergent frontier agent capabilities
  • Data-driven prioritization leading to exponentially faster development speed
  • Increased revenue/sales via marketing from public benchmarks

🌐 Our Mission

Unlock significant advancements in browser and computer use AI capabilities. We believe this is necessary to usher in a new generation of use cases, startups, and productive AI workers.

📜 Backstory

Wyatt and I met while studying CS at Cornell and have been living and working together for over 7 years.

I previously led product/research at Capital One Labs, where I launched one of the first AI agents in banking. Wyatt previously was a Cornell Milstein scholar and did large-scale data engineering for 2 early-stage startups in NYC.

We faced these problems first-hand while building evals for browser/computer use agent companies. We didn’t see any good solutions, so we’re building one ourselves.

We’re in SF for the foreseeable future. Contact us if you wanna grab a coffee, go for a walk, or play pick-up basketball!

Emails: jerry@halluminate.ai, wyatt@halluminate.ai

🙏 Our Asks

  • Follow us on socials (Jerry: LinkedIn, Twitter Wyatt: LinkedIn, Twitter)
  • Introductions to researchers and founders building computer/browser use agents
  • Introductions to experts in RL & post-training
  • Introductions to companies that sell training data to large labs

uploaded image

YC Photos
Jobs at Halluminate
San Francisco, CA, US
$150K - $250K
0.50% - 0.75%
3+ years
San Francisco, CA, US
$150K - $250K
0.25% - 0.75%
3+ years
Halluminate
Founded:2024
Batch:Summer 2025
Team Size:2
Status:
Active
Location:San Francisco
Primary Partner:Jon Xu