
General Analysis

Automated AI Safety and Red Teaming Tools

General Analysis provides a comprehensive suite of AI safety tools, including red-teaming frameworks, interpretability techniques, and more. As AI systems become increasingly capable, their deployment in high-stakes environments poses significant risks—financial, ethical, and otherwise—where errors can lead to substantial consequences. To address these challenges, we offer access to novel tools and methodologies designed to systematically find model failure modes and enhance model robustness.
Founded:2024
Team Size:2
Status:
Active
Location:San Francisco
Group Partner:Jared Friedman
Active Founders

Rez Havaei, Founder

Rez is the Co-Founder and CEO of General Analysis. He previously worked at Jane Street as a trader and at NVIDIA and Cohere as an AI researcher, and was the founding engineer of the LLM evaluation platform and benchmark at vals.ai.
Company Launches
General Analysis: Finding Failure Modes for AI Models
See original launch post ›

TL;DR

General Analysis provides safety and performance reports for enterprise AI models, offering businesses clear insights into model vulnerabilities. Using a growing repository of automated red-teaming, jailbreaking, and interpretability techniques, we uncover and address critical failure modes.

Challenge

As AI systems become increasingly capable, their deployment in high-stakes environments poses significant risks—financial, ethical, and otherwise—where errors can lead to substantial consequences. We predict that a large percentage of the world’s cognitive tasks will soon be performed by AI systems across industries. However, this shift brings critical challenges:

  • Safety and performance efforts are not keeping pace with AI capabilities: Research and tools to evaluate AI systems have not kept up with the complexity and impact of modern models.
  • The field is fragmented: Approaches to AI safety and evaluation are scattered, lacking a unified framework.
  • Methods lack scalability and automation: Many current techniques are labor-intensive and fail to provide consistent, repeatable insights at scale.

Our approach

To address these challenges, we offer access to a unified set of tools and methodologies designed to systematically find model failure modes and enhance model robustness.

  1. Providing Comprehensive Safety and Performance Reports: We deliver detailed reports to our customers, identifying novel failure modes in their models and providing actionable methods to mitigate them.
  2. A Living Knowledge Base: Our repository collects and refines evaluation techniques while keeping pace with emerging exploits and vulnerabilities. This ensures our tools remain effective and relevant across diverse industries and evolving AI applications.

An example of our work: Eliciting legal hallucinations in GPT-4o

In our recent work, we show that GPT-4o is susceptible to hallucinating when asked about certain legal cases or concepts. The report, data, and code are publicly available.

We train an attacker model that causes GPT-4o to hallucinate on more than 35% of prompts across a diverse set of legal questions.
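The attacker-target loop behind this kind of result can be sketched in a few lines. The sketch below is illustrative, not General Analysis's actual pipeline: `attacker_generate`, `target_answer`, and `is_hallucination` are hypothetical stubs (templated perturbations stand in for a learned attacker, and a keyword check stands in for a grader model or citation checker; in practice these would be model calls).

```python
import random

def attacker_generate(seed_question, rng):
    # Hypothetical attacker: perturbs a seed legal question to probe the target.
    # A trained attacker model would generate these adversarially instead.
    templates = [
        "In {q}, what did the dissent argue?",
        "Cite the exact holding of {q}.",
        "What year was {q} overturned?",
    ]
    return rng.choice(templates).format(q=seed_question)

def target_answer(prompt):
    # Stand-in for the model under test (e.g. an API call to GPT-4o in practice).
    return "The case you mention was decided in 1972."

def is_hallucination(prompt, answer):
    # Stand-in judge: flags fabricated specifics. In practice this would be a
    # grader model or a check against a database of real case law.
    return "1972" in answer

def red_team(seed_questions, n_attempts=10, seed=0):
    """Run the attacker against the target and collect flagged failures."""
    rng = random.Random(seed)
    failures = []
    for question in seed_questions:
        for _ in range(n_attempts):
            prompt = attacker_generate(question, rng)
            answer = target_answer(prompt)
            if is_hallucination(prompt, answer):
                failures.append({"prompt": prompt, "answer": answer})
    return failures

failures = red_team(["Smith v. Jones"], n_attempts=3)
print(len(failures))  # prints 3: the toy judge flags every stubbed answer
```

The reported 35%+ hallucination rate corresponds to `len(failures)` divided by the number of prompts attempted, computed over a real legal-question set rather than these stubs.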

Learn more at generalanalysis.com or read the full report here.

We are looking to connect with:

  • Startups building LLMs or AI agents across sectors (customer support, legal tech, medicine, foundation models) for design partnerships.
  • AI Safety, Interpretability, and Evaluation Researchers.

If you are interested in working with us or just want to chat, please email us at founders@generalanalysis.com.
