HomeCompaniesFoundry

Foundry

Pre-Production Testing and Observability for Browser Agents

Foundry is the testing and observability platform for browser agents, empowering teams to evaluate, optimize, and productionize agents that automate entire workflows—like customer support, recruiting, and compliance. With Foundry, teams can quickly create browser agents that actually work, simulate real-world scenarios, track performance, and ensure reliability across their business-critical workflows. Pranav and I built AI systems at Scale AI for companies like OpenAI. Now, we’re bringing that expertise to teams everywhere to make browser agents reliable, scalable, and accessible for every business.
Foundry
Founded:2024
Team Size:2
Location:San Francisco
Group Partner:David Lieb

Active Founders

Manil Lakabi, Founder

Manil was an operator on the Gen AI team at Scale AI, where he led language data projects that generated millions in revenue and enabled top AI labs to evaluate and improve their AI models. Before that, he worked at Meta.
Manil Lakabi
Manil Lakabi
Foundry

Pranav Raja, Founder

ex Machine Learning Research Engineer on the Gen AI team as Scale. Building the world model for you agent.
Pranav Raja
Pranav Raja
Foundry

Company Launches

TLDR

Foundry is a platform to build, evaluate, and improve AI agents that can automate key parts of your business—customer support, hiring, sales, and more.

For businesses starting from scratch, we help design agents tailored to your workflows, capable of taking on entire processes autonomously. For those with existing agents, we provide a systematic way to measure performance, identify gaps, and make improvements—before customers complain or workflows break down. Watch our demo here:

Why is this important?

Every company will be an AI agent company. But building agents is tough, and knowing if they’re actually working well is even harder.

Can your agent give accurate answers? Is it handling tasks like generating code or analyzing contracts the way you’d expect? Most businesses don’t have a clear way to figure this out—they only realize something’s wrong when customers complain or workflows start breaking.

Without the right tools, it’s a guessing game. Foundry takes out the guesswork, helping you build agents that work as they should and keep getting better, so you can trust them to run key parts of your business.

Our solution

Foundry is a platform that helps businesses build, evaluate, and improve AI agents:

  • For New Agents: We help companies design modular agents tailored to their workflows, capable of automating tasks like analyzing contracts, generating code, or resolving customer queries across languages.
  • For Existing Agents: Foundry provides tools to systematically evaluate and improve agents, measuring dimensions like accuracy, relevance, task completion rates, and code execution reliability.

Using a SOTA factuality checker, internal knowledge bases, historical data, and evaluation datasets, we identify where agents fall short. We then improve their performance through auto-prompting, fine-tuning, or steering—automating over 90% of what businesses would otherwise handle manually.

Moving forward

We’re building more than just tools for evaluating and improving AI agents—our vision is to create the operating system for AI agents.

Imagine a marketplace where businesses can discover, compare, and instantly deploy the best AI agents for any task, complete with performance leaderboards and seamless drag-and-drop integration into workflows. Foundry will become the App Store for AI agents, enabling companies to choose top-performing agents to automate everything from customer service to sales to complex operations.

Long-term, Foundry will act as the orchestration layer for all AI agents. Businesses will be able to manage fleets of agents across functions, ensuring they collaborate effectively, improve autonomously, and work as a cohesive ecosystem.

Our ask

AI agents are being built across industries and use cases. If you know anyone working on agents—especially multimodal or multi-lingual agents—we’d be super grateful for an introduction! Email us at founders@thefoundryai.com.

Some examples of where AI agents are being built:

  • Customer support teams building multilingual bots
  • Recruiting teams automating candidate screening and outreach
  • Sales teams creating agents to manage outreach and pipeline tasks
  • Companies working on multi-modal agents for complex workflows
  • AI teams scaling agents for operations, finance, or logistics

We’d love to connect and help them build smarter, more reliable agents!

Team

Manil and Pranav worked on the Gen AI team at Scale AI, where they helped top AI labs build and improve their models.

Manil was an operator that led large-scale data projects. Pranav, an ML Researcher, developed new methods of human supervision, built production ML tools to optimize Scale’s data pipelines, and led the agentic tool use research for the SEAL Leaderboard, the gold standard for benchmarking AI agents.

After working closely with AI labs, we saw the massive potential of Gen AI agents to transform businesses—and the lack of tools to design, evaluate, and improve them effectively.