The Opportunity
As Reducto’s first growth hire, you’ll lead our demand generation efforts across channels. You'll have the autonomy to run creative experiments, build our content engine, and find unconventional ways to reach AI teams. This is a rare opportunity to shape the growth strategy for a rapidly scaling AI infrastructure company.
About Us
The vast majority of enterprise data — from financial statements to health records — are locked in unstructured file formats like PDFs and spreadsheets. Reducto is the most accurate way to parse and extract data from complex documents.
Today we power ingestion pipelines for hundreds of leading AI teams, ranging from popular startups to Fortune 10 enterprises. We’ve grown incredibly quickly (0→7 fig in ARR in 6 months), are loved by customers (>300M pages parsed), and are well funded by tier 1 investors.
We would love to meet you if the following resonates:
- You have a history of scaling projects: You've built growth engines from scratch and know how to deliver measurable results. Whether you learned this at a startup, running your own projects, or growing something from zero - you know what it takes to drive adoption.
- You’re great at talking to technical teams: You can create compelling technical content that resonates with ML teams. You're comfortable being a public face for the company, whether that's creating social content, speaking at events, or building partnerships.
- You're creative, scrappy, and data-driven: You have a proven track record of designing, running, and measuring scalable growth experiments. You’re efficient with trying new approaches, can analyze results, and iterate effectively.
- You hustle and have high agency: This is our first hire for this role and comes with a lot of scope. You won’t be given a playbook, you’ll help create it. The right person is comfortable with ambiguity, and will spot opportunities and execute on them repeatedly.
The core work will include:
80% of time:
- Managing and scaling existing channels such as SEO and content marketing
- Designing and experimenting with growth experiments for new channels
- Engaging with our community on socials
- Highlighting customer case studies, product updates, and other engaging content for a technical audience
20% of time:
- Building systems for analytics and attribution
- Working with sales and eng to design and improve our overall growth strategy
Bonus points if you:
- Are chronically online and have a history of manufacturing virality
- Have helped drive growth for technical products, especially in ML/AI
- Have experience leading both organic and paid campaigns
Nearly 80% of enterprise data is in unstructured formats like PDFs
PDFs are the status quo for enterprise knowledge in nearly every industry. Insurance claims, financial statements, invoices, and health records are all stored in a structure that’s simply impractical for use in digital workflows. This isn’t an inconvenience—it’s a critical bottleneck that leads to dozens of wasted hours every week.
Traditional approaches fail at reliably extracting information in complex PDFs
OCR and even more sophisticated ML approaches work for simple text documents but are unreliable for anything more complex. Text from different columns are jumbled together, figures are ignored, and tables are a nightmare to get right. Overcoming this usually requires a large engineering effort dedicated to building specialized pipelines for every document type you work with.
Reducto breaks document layouts into subsections and then contextually parses each depending on the type of content. This is made possible by a combination of vision models, LLMs, and a suite of heuristics we built over time. Put simply, we can help you:
- Accurately extract text and tables even with nonstandard layouts
- Automatically convert graphs to tabular data and summarize images in documents
- Extract important fields from complex forms with simple, natural language instructions
- Build powerful retrieval pipelines using Reducto’s document metadata
- Intelligently chunk information using the document’s layout data