Founding Engineer: Large-Scale Web Scraping & Crawling

Building the first end-to-end testing platform for web agents, including a Browser Gym for RL-driven optimization.
$100,000 - $200,000
Backend
Senior Software Engineer
Remote
1 - 10 Employees
3+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:
Sr. Software Development Engineer, Profit Intelligence

Senior Software Engineer role at Amazon focusing on building profit intelligence systems and ML model deployment for retail business analytics.

Senior Software Development Engineer, Denied Party Screening

Senior Software Engineer role at Amazon Security, focusing on denied party screening systems processing billion-scale events daily using ML and algorithms.

Senior Software Engineer

Senior Software Engineer role at Microsoft's Cloud Operations + Innovation team, focusing on datacenter infrastructure automation and planning systems, offering remote work and competitive compensation.

Senior Software Developer

Senior Software Developer role at Oracle in Zapopan, working on cloud native applications and developer tools using Java, Python, and Kubernetes.

Senior Software Engineer

Senior Software Engineer role at Microsoft SCHIE team, focusing on DPU infrastructure development, offering competitive pay and hybrid work model in Santa Clara, CA.

Description For Founding Engineer: Large-Scale Web Scraping & Crawling

Foundry is building the first end-to-end testing platform for web agents, including a Browser Gym for RL-driven optimization. As a Founding Web Scraping Engineer, you'll be instrumental in building internet-scale web crawling infrastructure capable of handling millions of domains and evolving anti-bot defenses.

The role focuses on designing robust, distributed crawling systems that adapt dynamically to web changes, optimize for efficiency, and ensure reliable data extraction. You'll be responsible for building large-scale crawlers, developing adaptive scraping systems, solving captchas at scale, and managing proxy rotations.

We're looking for someone with expert-level experience in web scraping & crawling, using tools like Selenium, Puppeteer, Playwright, and Scrapy. Deep knowledge of anti-bot detection strategies and hands-on expertise with captcha-solving strategies is essential. You should be proficient in Python, Go, or JavaScript and have experience with high-performance, parallelized scraping frameworks.

As a YC-backed team, we're offering a founding role where you'll define and own our web crawling infrastructure from day one. You'll be working at internet scale, building systems that dynamically adapt across millions of domains. This is an opportunity to join a cutting-edge startup that's creating something entirely new in the web agent testing space.

The position offers competitive compensation ($100K - $200K) plus equity (1.00%), and we're open to remote work within the US. If you're passionate about large-scale web automation and want to be part of a founding team setting new standards in web agent testing, this role offers the perfect opportunity to make a significant impact.

Last updated 2 months ago

Responsibilities For Founding Engineer: Large-Scale Web Scraping & Crawling

  • Build large-scale, distributed crawlers for millions of domains
  • Develop adaptive web scraping systems
  • Optimize scraping performance and resilience
  • Solve captchas at scale
  • Manage proxy and identity rotation
  • Structure and clean extracted data

Requirements For Founding Engineer: Large-Scale Web Scraping & Crawling

Python
Go
JavaScript
  • Expert-level experience in large-scale web scraping & crawling
  • Deep knowledge of anti-bot detection strategies
  • Hands-on expertise with captcha-solving strategies
  • Proven experience building efficient proxy management systems
  • Proficiency in Python, Go, or JavaScript
  • Understanding of HTTP/2, HTTP/3, WebSockets, GraphQL
  • Experience designing scalable, fault-tolerant scraping infrastructure

Benefits For Founding Engineer: Large-Scale Web Scraping & Crawling

Equity
  • Founding role with infrastructure ownership
  • Work at internet scale
  • YC-backed startup opportunity

Interested in this job?