Open-Source Unit Testing for LLM Applications
Confident AI an open-source company building 1) an open-source package called DeepEval to unit-test LLM applications such as chatbots, agents, and RAG pipelines, and 2) the cloud platform for DeepEval. It's like Next.JS and Vercel. The founding team is a small group of exceptional engineers and researchers from top colleges and companies such as Google, Microsoft, and Princeton.
Things we value:
What you'll be doing:\
The entire process is usually remote and most communication happens over email or via video chat in Google Meet. We know that you may be interviewing elsewhere as well so am respectful of your time and will get back no later than 2 days of each step along the process.
The entire process has 4 steps and takes around 1.5 week in total:
You'll be working with the founders directly throughout the entire process.
Confident AI is the leading LLM evaluation platform that helps teams evaluate, test, benchmark, optimize, monitor, and red-team LLM applications. Powered by DeepEval, the go-to LLM evaluation framework with over 600k monthly downloads, 5.3k GitHub stars, and over 40 million evaluations conducted, Confident AI is trusted by hundreds of companies from leading startups to international corporations.