HomeCompaniesArga Labs
Arga Labs

The validation infrastructure for AI agents

We build environments to test your app and agents against the real world. Deploy your code/agent into a sandbox that runs against replicas of external services (Stripe, Slack, Google Drive, etc.). Twins support the same APIs, MCP tool calls & SDKs as the real software, and you can spin up thousands of instances in parallel and seed them using natural language. Run real workflows, surface edge cases, and observe how your code/agents actually behave before hitting production.
Active Founders
Phillip Li
Phillip Li
Founder
Co-founder & CEO @ Arga Labs. Built an internal dev tool @ Amazon to automate complex engineering workflows, resulted in 10+ recurring weeks/year of engineer hours saved across multiple teams, and prevented VP escalations due to bug issues. Before that, I was a fencer on the Canadian Junior National Team and researched cognitive systems at UBC
Akira Tong
Akira Tong
Founder
Co-founder, CTO @ Arga Labs Prev. SDE @ Stripe, quant @ GS Skipped high school, graduated @ 19 Pro player for Identity V, quit because I had to finish my finals :(
Company Launches
Arga Labs: on-demand, production-like staging for every PR
See original launch post

tl;dr: become a 100x tester with Arga by giving your agents staging environments scoped to a change. We spin up twins of your integrations (Slack, Stripe, etc.) and dependencies (DB, Redis), and deploy only the services that were changed. Everything else is routed to prod.

Launch Video: https://www.youtube.com/watch?v=OzEk5pYTvak

The problem: You’re a 1x tester

  • You don’t have staging :(
  • You have staging, but it’s different from prod, and things still break after testing
    • You mock integrations, which can be inaccurate
    • You use real accounts to test integrations, but you hit rate limits, can’t clear states between tests, and can’t test at scale
  • You have staging, but your agents can’t automatically test changes so you’re stuck at being a 1x tester despite being a 100x engineer
  • You have staging, but it’s difficult to merge concurrent changes without breaking something (for larger teams!)

Our Solution

We spin up a temporary staging of your app every time you open a PR. Only the services you change are redeployed; all other services are routed to prod. Your dependencies are deployed as in-memory sidecars, so nothing corrupts prod data. For example, you can configure the tests to read from the prod database and perform all write commands to the sidecar; you can also configure everything to be routed to the sidecar.

We also spin up twins of services your app uses: Stripe, Slack, GSuite, Hubspot, and more, so you don’t hit the real API. This lets you test more thoroughly: you can catch weird edge cases, simulate failure modes, and run tests in parallel without worrying about rate limits. All twins are fully compatible with any SDK, expose the same API endpoints, and support webhook events. You can seed any twin in natural language, or select from our pre-set scenarios.

uploaded image

Demo of Github, Slack, Stripe twins on the Arga app: https://www.youtube.com/watch?v=3ZPrIdGvPoo

You or your agent can prompt Arga via web, API, CLI, or MCP to automatically generate tests for your PR, run them, and stream the results and logs back. This allows them to know what was tested, what broke, and which calls/functions caused the failure. They can use this information to go back and fix the code, so you’ll have a fully-tested and passing PR before you look at it.

Validation report:

uploaded image

Claude Code using arga CLI to read logs & test results:

uploaded image

Our ask

Try us out! We’re live at www.argalabs.com and you can get access to limited features for free.
If you like our stuff, book a demo here (https://cal.com/phillip-li/30min).
If you know any teams that struggle with testing and use lots of external integrations, send them our way.

The Team

Akira and Phillip met each other in first-year calculus at UBC. At the time, Akira was 14 and Phillip was studying neuroscience to pursue research in human biology. Within a year of pivoting to CS, Phillip built a dev tool at Amazon that saved 10 weeks / year of engineering hours as an intern. While interning at Stripe, Akira realized that a high-fidelity “staging” was the key to unlocking dev productivity and was surprised that most companies don’t have proper staging that actually mirrored prod. Now they’re solving that problem.

uploaded image

YC Photos
Arga Labs
Founded:2025
Batch:Spring 2026
Team Size:3
Status:
Active
Location:San Francisco
Primary Partner:Pete Koomen