Databases Startups funded by Y Combinator (YC) 2026

May 2026

Browse 21 of the top Databases startups funded by Y Combinator.

We also have a Startup Directory where you can search through over 5,000 companies.

  • Mixpanel
    Y Combinator S2009
    Active • 410 employees • San Francisco, CA, USA
    Mixpanel is analytics for builders who need answers from their data at their fingertips. When everyone in the organization can see, and learn from, the impact of their work, they are poised to make better decisions. Companies like OpenAI, Netflix, Pinterest, sweetgreen, CNN, Samsara, Uber, and Yelp use Mixpanel to understand their customers, measure progress, and make better decisions.
    analytics
    databases
    cloud-computing
    data-visualization
    b2b
  • s2.dev
    Y Combinator F2025
    Active • 5 employees • San Francisco, CA, USA
    S2 is an API for durable streams. We make working with streaming data dead simple and reliable. The more we worked on data systems, the more we felt like there was a missing building block. The seamless experience of object storage simply did not exist for durable streams – so we set out to fix that! S2 reimagines streams as an unlimited and access-controlled web resource. Like if Kafka and S3 had a baby.
    - Building agents? Use granular streams to persist context in real time, make workflows auditable, and coordinate between agents.
    - Publishing fast-moving data? Streams support high fanout and can be accessed directly over the internet, so they are a great fit for broadcasting real-time feeds.
    - Sandboxing? Emit to per-instance streams, and read history or live events just as easily.
    - Local-first and multiplayer experiences? Propagate updates reliably and efficiently.
    Stop dealing with clusters, and start streaming.
    databases
    developer-tools
    api
    infrastructure
  • HelixDB
    Y Combinator P2025
    Active • 6 employees • London, UK
    As AI agents replace traditional software, they need a way to store, recall, and reason over contextual data. HelixDB gives them all of that in one system. We’re building the knowledge infrastructure every AI agent will depend on.
    developer-tools
    open-source
    databases
    infrastructure
    ai
  • Morphik
    Y Combinator P2025
    Active • 2 employees • San Francisco, CA, USA
    Morphik is open-source multimodal search for AI apps. We're used by customers ranging from space-tech teams searching across research papers to developers building brokerage agents.
    developer-tools
    search
    artificial-intelligence
    databases
    open-source
  • PgDog
    Y Combinator P2025
    Active • 2 employees • San Francisco
    PgDog is an application for sharding PostgreSQL. It understands SQL and can distribute queries automatically between databases. It's built for managed databases, like AWS RDS, and doesn't require any changes to application code or schema. In addition to sharding, PgDog is a load balancer and pooler, so it can act as a replacement for PgBouncer, RDS Proxy, and other Postgres scaling products. It brings the simplicity and performance of HTTP load balancing to the database. Built from the experience of sharding Postgres at Instacart during peak growth in 2020, PgDog is the answer to the old question: does Postgres scale? It does now.
    databases
  • Praxim
    Y Combinator W2025
    Active • 2 employees • San Francisco, CA, USA
    Praxim is the agentic AI Word editor that makes edits across your entire Word document with formatting control, file and web context, and your personal preferences in mind. Simply tell Praxim what you'd like to do with your document and Praxim will implement your changes automatically.
    artificial-intelligence
    saas
    b2b
    databases
    consumer
  • ReJot
    Y Combinator W2025
    Active • 2 employees • Amsterdam, Netherlands
    ReJot is building Fragno to let API companies go from simply exposing an API to offering drop-in integrations that work out of the box. Fragno is an open‑source framework for packaging backend endpoints, data storage patterns, and frontend utilities into reusable building blocks. API providers ship full integrations, not just low‑level SDKs and docs. Customers configure rather than write code, so integrations are both much faster to launch and far more consistent across customers.
    infrastructure
    databases
    developer-tools
    api
  • Trellis AI
    Y Combinator W2024
    Active • 25 employees • San Francisco
    Trellis helps healthcare providers treat more patients, faster, while eliminating pre-service paperwork. We automate document intake, prior authorizations, and appeals at scale to streamline operations and accelerate care. Our AI agent is trained on millions of clinical data points and converts messy, unstructured documents into clean, structured data directly in your EHR. With Trellis, leading healthcare providers and pharmaceutical companies were able to:
    1. Reduce time to treatment by over 90%
    2. Improve prior authorization approval and reimbursement rates
    3. Leverage structured data to enhance drug program performance and clinical decision-making
    Administrative costs account for over 20% of U.S. healthcare spending, delaying care, draining revenue, and driving staff burnout, all while visibility into patient care is lower than ever before. We built Trellis to tackle this head on.
    b2b
    data-engineering
    databases
    infrastructure
    ai
  • Ubicloud
    Y Combinator W2024
    Active • 15 employees
    Ubicloud is an open source cloud that can run anywhere. Our cloud services include elastic compute, block storage, virtual networking, managed Postgres, K8s, AI inference, and powerful IAM. Ubicloud provides these services on bare metal providers, such as Hetzner, Leaseweb, or AWS Bare Metal. You can self-host our software or use our managed service to reduce your cloud costs by 3-10x.
    ai
    cloud-computing
    databases
    infrastructure
  • DataShare
    Y Combinator S2023
    Active • 1 employee • Austin, TX, USA
    DataShare is a data-as-a-service platform that lets you embed charts, dashboards and exports directly into your product. For example, if you run an accounting startup, DataShare would enable you to embed a full profit and loss dashboard, with downloadable statements. DataShare is backed by an enterprise-grade data warehouse, and can be implemented in fewer than 20 lines of code.
    data-engineering
    databases
    analytics
  • ParadeDB
    Y Combinator S2023
    Active • 10 employees • San Francisco, CA, USA
    You want better search, not the burden of Elasticsearch. ParadeDB is the modern Elastic alternative built as a Postgres extension.
    open-source
    databases
    infrastructure
    analytics
    developer-tools
  • Epsilla
    Y Combinator S2023
    Active • 3 employees • Sunnyvale, CA, USA
    Epsilla is an all-in-one platform for building AI agents powered by your private data and knowledge. Easy to use for domain professionals, deeply customizable for AI experts, and fully equipped for enterprise customers with security, scalability, and integration.
    databases
    saas
    infrastructure
    ai
  • Blitz
    Y Combinator S2022
    Active • 4 employees • San Francisco, CA, USA
    Blitz is a no-code platform to build internal applications and automate manual tasks. Quickly build a database, integrate your logic, and scale your operations. An alternative to Google Sheets and Airtable, Blitz is not limited by a maximum number of records or by an API rate limit. Create your data model and set up data validation rules. Use our interface builder to create dynamic forms, and quickly onboard customers and partners. Leverage our granular permissions to share data with external users (publicly or privately). Add advanced business logic to your forms and portals without writing a line of code. The experience is similar to creating spreadsheets, but without their limitations in terms of scalability and security. You can use Blitz to build an onboarding flow, an order management system, or validation workflows for compliance purposes. Stop using developer resources or rigid SaaS software. Start building your own tools, adapted to your needs, with Blitz.
    no-code
    saas
    b2b
    databases
    ai
  • Supabase
    Y Combinator S2020
    Active • 120 employees • San Francisco, CA, USA
    Supabase is the easiest way to get started with Postgres. Each project within Supabase is an isolated Postgres cluster, allowing customers to scale independently, while still providing the features that you need to build: instant database setup, auth, row level security, realtime data streams, auto-generating APIs, and a simple to use web interface. We are 100% remote.
    open-source
    databases
    data-engineering
    big-data
    developer-tools
  • InfluxData
    Y Combinator W2013
    Active • 210 employees • New York City
    InfluxData, the creator of InfluxDB, delivers a modern Open Source Platform built from the ground up for analyzing metrics and events (time series data) for DevOps and IoT applications. Whether the data comes from humans, sensors, or machines, InfluxData empowers developers to build next-generation monitoring, analytics, and IoT applications faster and more easily, and to scale them while delivering real business value quickly. Based in San Francisco, InfluxData counts Autodesk, Cisco, eBay, and Coupa among its customers. Visit https://www.influxdata.com.
    time-series
    analytics
    databases
    open-source
  • Deasy Labs
    Y Combinator S2023
    Acquired • 8 employees • New York City
    Deasy Labs was acquired by Collibra (global leader in enterprise data governance) in July 2025. Deasy Labs provides metadata orchestration for AI workflows. Deasie's platform provides the best way for AI teams to create and embed high-quality, customized metadata into their AI workflows (e.g., RAG, Agentic frameworks). Our three founders (from Amazon, McKinsey/QuantumBlack & MIT) previously built an ML data governance tool from 0 to 1 within McKinsey, which we deployed with 11 Fortune 500 companies. We saw in early 2023 that the ability to create high-quality metadata (without reliance on domain experts) would be a key factor in achieving the accuracy & speed that GenAI applications require for production. Our investors include General Catalyst, Y Combinator, RTP Global and world experts in enterprise data. Website: https://deasylabs.com
    ai-assistant
    data-labeling
    databases
    big-data
    artificial-intelligence
  • PeerDB
    Y Combinator S2023
    Acquired • 2 employees
    At PeerDB, we are building a fast, simple, and highly cost-effective way to stream data from Postgres to data warehouses, queues, and storage engines. If you run Postgres at the heart of your data stack and move data at scale from Postgres to any of these targets, PeerDB can provide value. We support different modes of streaming: log-based (CDC), cursor-based (timestamp or integer), and XMIN-based. Performance-wise, we are 10x faster than existing tools. Feature-wise, we support native Postgres features such as a comprehensive set of data types (including jsonb, arrays, and PostGIS), efficient streaming of TOAST columns, schema changes, and so on.
    open-source
    enterprise-software
    data-engineering
    databases
    developer-tools
  • JumpWire
    Y Combinator W2022
    Acquired • 2 employees • New York, NY, USA
    JumpWire is a data protection platform that adds advanced data security controls between APIs, applications and databases. JumpWire automatically identifies sensitive properties inside large data sets and gives developers full control over which people and applications can access or update records containing sensitive info. Example uses include restricting who can read customer PII to members of the customer service team, giving on-call engineers elevated access to production, or splitting user records between regions for GDPR purposes. JumpWire’s approach to securing data in-place minimizes the risk of data leaks exposing sensitive information or mishandling by other applications and vendors. The exact security scheme applied to data is defined by policies that align with an organization’s existing InfoSec program. JumpWire helps companies that maintain information security with compliance programs such as SOC or HIPAA. They process sensitive data, often from their own customers, and exceed security best practices as a competitive advantage. JumpWire provides defense in depth for data and sits alongside access controls and Layer 4 encryption to provide a comprehensive data security solution. JumpWire differs from solutions such as data vaults by installing inside our customers’ own infrastructure and clouds. It is interoperable with existing applications and databases, which eliminates the need for large data migrations or code refactoring. Lower-level approaches to data security, such as encryption at rest, are too blunt and lack the ability to differentiate between properties in the data itself. Their scope is limited to physical storage, and security is lost as soon as an application or query loads the data.
    security
    data-labeling
    databases
  • KeyDB
    Y Combinator S2020
    Acquired • 2 employees • Toronto, ON, Canada
    KeyDB is a fast key-value database that combines in-memory caching and FLASH persistence in a single package.
    databases
    open-source
  • Compose
    Y Combinator S2011
    Acquired • 51 employees • San Mateo, CA, USA
    Compose is a fully-managed platform used by developers to deploy, host, and scale databases to help you conquer the data layer. www.compose.io
    databases
    devops
  • Citus Data
    Y Combinator S2011
    Acquired • 45 employees • San Francisco, CA, USA
    Businesses spend altogether too much time on their databases. Citus is fixing this problem. Citus is worry-free Postgres. Built to scale out, Citus is an extension to Postgres that is available as open source, as enterprise software that can be run on-prem or on any cloud, and as a fully-managed database as a service. Whether you have a multi-tenant application that needs to scale out, or you need performance for your real-time analytics customers, with Citus you can focus on your app, not your database. Founded in 2011, Citus Data is venture backed by Khosla Ventures and Data Collective. Citus is a Y Combinator alumnus and has offices in San Francisco’s SoMa district and Istanbul, Turkey. At Citus, we make it simple to scale out Postgres.
    Citus Data online: www.citusdata.com
    Documentation: docs.citusdata.com
    GitHub: github.com/citusdata/citus
    databases
    big-data
    open-source