HomeCompaniesActiveloop

Database for AI

We provide a simple API for creating, storing, versioning, and collaborating on multi-modal AI datasets of any size. With Activeloop's open-core stack, you can rapidly transform and stream data while training models at scale. Deep Lake powers foundational model training by acting as a vector database with significant benefits, such as (1) the ability to use multi-modal datasets to fine-tune your own LLM models, (2) storing both the embeddings and the original data with automatic version control, so no embedding re-computation is needed (3) truly serverless service with no vendor lock-in. How cool is that? GitHub loves us - we're one of the fastest-growing libraries there, and we're used by little-known companies like Google, Waymo, and Intel. No big deal. Our founding team hails from places like Princeton, Stanford, Google, and Tesla, and we're backed by Y Combinator & other Silicon Valley heavyweights. Activeloop is hiring, and we want you! Check out our open roles on our YC page and join the fun. 10-min demo: https://activeloop.wistia.com/medias/aibvo0dst2 Whitepaper: https://www.deeplake.ai/whitepaper
Active Founders
Davit Buniatyan
Davit Buniatyan
Founder
Founding CEO Activeloop, PhD on leave from Princeton, AI/ML, Data and Infra, Y Combinator S18, UCL 16’ Working on Data 2.0
Company Launches
Activeloop Scientific Discover: The AI Analyst for Scientific Research
See original launch post

Hey 👋 Davit here, founder of Activeloop.

Today we are introducing something we believe will fundamentally reshape how scientific discovery happens.

Activeloop’s Scientific Discover, an intelligence agent built on one of the largest datasets of indexed scientific research. Here are the details:

https://www.youtube.com/watch?v=x8Lv5-C9ntw

The Problem: Scientific Data Is Broken

Every scientist already knows the pain.

  • Critical insights are buried across PDFs, images, tables, and supplementary files.
  • Key figures live inside screenshots, not structured databases.
  • Twenty years of research is trapped in incompatible formats and outdated systems.
  • Every new project begins with weeks of manual data wrangling.

The White House recently launched the Genesis Mission, recognizing that fragmented scientific data is one of the biggest blockers to breakthroughs in drug discovery, materials science, climate modeling, and more.

The Genesis Mission aims to build an integrated AI platform that harnesses Federal scientific datasets. The goal is to train scientific foundation models and create AI agents that test hypotheses, automate research workflows, and accelerate scientific breakthroughs.

The country is trying to unify scientific data for AI.

We decided to show what that future looks like.

Introducing L1: The Scientific Data Agent

To demonstrate what becomes possible when scientific data is indexed natively for AI, we built Activeloop L1, a multimodal scientific research agent running on top of:

  • 25M open access papers
  • 450M+ plus pages
  • 175TB of fully visual indexed scientific data

Unlike traditional NLP or paper search, L1 sees science.

It understands:

  • charts
  • molecules
  • protein structures
  • equations
  • experimental tables
  • clinical graphs
  • and the text that surrounds them

This allows L1 to answer questions no text only model can touch.

The Results: A New State of the Art 48% SOTA on Humanity’s Last Exam with Tools

uploaded image

L1 (Gemini 3 Pro) outperforms top models including Grok Heavy with tools, and GPT 5 Pro with tools.

Multimodal scientific reasoning

L1 synthesizes biochemical structures with clinical findings to answer end-to end research questions.

Example Query

Which compounds show novel synergy with metformin for treating type 2 diabetes?

L1 scans molecular structures, clinical outcomes, and experimental tables at the same time and produces grounded, citation linked answers.

Why This Matters

Multimodal scientific agents unlock capabilities that were not possible before.

This extends far beyond drug discovery. It applies to materials science, climate research, algorithmic innovation, and more.

The applications are enormous.

Try L1 Today with Our OpenAI Compatible API

uploaded image

Get Started

Try the Science Agent: https://chat.activeloop.ai/science

Docs and API: https://docs.activeloop.ai

Build with the OpenAI compatible API today.

Previous Launches
Less Time in CRM, More Sales. Get instant insights across your entire stack from CRMs/ERPs to sales decks and product analytics.
Turn PDFs, images & tables into instant, cited answers. Agentic RAG that just works.
Jobs at Activeloop
Mountain View, CA, US
$120K - $200K
3+ years
California
$200K - $300K
11+ years
Activeloop
Founded:2018
Batch:Summer 2018
Team Size:15
Status:
Active
Location:Mountain View
Primary Partner:Diana Hu