Homeā€ŗCompaniesā€ŗReworkd

Reworkd

The simplest way to extract web data at scale

At Reworkd, we're working on multimodal LLM agents that serve as the simplest way to extract web data at scale. Customers come to us with lists of 100s to 1000s of websites along with a data schema. Our agents traverse these websites, understand their structure, and generate code to extract data from them. We've been working on LLM agents since their inception and have received over 30k stars on GitHub and 1M+ users across previous agent products. If you're interested in our pilot program, shoot us an email!
Jobs at Reworkd
San Francisco, CA, US
$130K - $180K
3+ years
Reworkd
Founded:2023
Team Size:6
Status:
Active
Location:San Francisco
Group Partner:Dalton Caldwell
Active Founders

Asim Shrestha, Founder

Software engineer and open source enthusiast. Also a co-founder @ Reworkd AI
Asim Shrestha
Asim Shrestha
Reworkd

Srijan Subedi, Founder

Co-founder at Reworkd AI. Combined major in Science at UBC. Previously worked at STEMCELL Technologies and Heart Lung Innovation as a Clinical Researcher.
Srijan Subedi
Srijan Subedi
Reworkd

Adam Watkins, Founder

Co-Founder & CTO of Reworkd AI - Pushing the boundaries of AGI agents. Deeply passionate about open-source, software architecture, engineering leadership, and emojis šŸš€šŸ˜€
Adam Watkins
Adam Watkins
Reworkd
Company Launches
šŸŒ Reworkd - Your new scraping co-pilot
See original launch post ā€ŗ

tl;dr: Reworkd automates your entire web data pipeline, end-to-end. It understands websites, writes code, runs scrapers, and validates results ā€” all from one simple system. Today, we're excited to launch our self-serve tool!

šŸ˜©Ā The Problem

Collecting, monitoring, and maintaining a web data pipeline can be complex and time-consuming, especially at scale. Traditional methods often struggle with issues such as pagination, dynamic content, bot detection, and site changesā€”all of which can compromise data quality and availability.

To address web data needs, businesses are often faced with either building out an internal engineering team or outsourcing to a low-cost country. The former can be expensive, while the latter is often unsustainable and requires significant management oversight.

šŸš€Ā The Solution:

Recognizing the inefficiencies of traditional data collection methods, we have built a platform to provide co-pilot experience for scraping. Simply provide a list of websites along with your unified schema, and our platform automatically generates custom Playwright code for each site. Youā€™re not locked into a black-box solutionā€”you have full control to guide, tweak, or completely rewrite the code in our built-in IDE as needed.

In addition, our platform offers:

  • Real-Time Dashboard: Monitor your scraping projects in real-time. Track outputs, scraper failures, unique results, visited pages, website review status, file downloads, and more.
  • Scheduling and Deduplication: Run scrapers at your desired frequency, choose between full or incremental scraping, and deduplicate data based on a primary key.
  • Bypass Anti-Bots: We manage all proxy and anti-bot measuresā€”including captcha solving and diverse proxy setupsā€”so you never have to worry about managing residential, data center, or other proxy types.
  • Complex Data Types: We take care of downloading and hosting files, ensuring data availability even as source websites evolve.
  • Seamless API Integration: Easily ingest your scraped data through our API.

šŸ™Ā Our Ask

Other Company Launches

šŸŒ Reworkd - Your new end-to-end web scraping platform

Effortlessly extract web data at scale. No code. No maintenance. No worries.
Read Launch ā€ŗ

šŸ¤– Reworkd AI - The open-source Zapier of AI agents

We help automate core business workflows with the help of AI Agents
Read Launch ā€ŗ