Software Engineer, Data

Harvey is a secure AI platform for professionals in law, tax, and finance that augments productivity and automates complex workflows.
$180,000 - $280,000
Data
Mid-Level Software Engineer
In-Person
51 - 100 Employees
3+ years of experience
AI · Enterprise SaaS · Finance

Description For Software Engineer, Data

Harvey is revolutionizing the legal, tax, and finance sectors with its secure AI platform that leverages reasoning-adept LLMs for complex workflow automation. Having raised over $200 million from prestigious investors like Sequoia, Google Ventures, and OpenAI Startup Fund, Harvey has achieved remarkable growth from $0-30M ARR in just 18 months.

The company has established strong partnerships with industry giants including Paul Weiss, A&O Shearman, Ashurst, O'Melveny & Myers, PwC, and KKR. As a Software Engineer, Data, you'll be at the forefront of developing distributed crawlers, data pipelines, and storage infrastructure to handle real-time updates from various sources.

You'll work directly with domain experts to structure complex datasets in Legal, Tax, and Finance domains, while building and scaling the Retrieval platform for RAG products. Key projects include crawling and indexing legal datasets, ingesting international tax codes, and scaling infrastructure to handle millions of documents with millisecond retrieval times.

The role offers an opportunity to work with cutting-edge embedding search technologies and collaborate with OpenAI to shape the future of generative AI. You'll join a world-class team assembled from companies like DeepMind, Google Brain, Stripe, and FAIR, with competitive compensation and comprehensive benefits.

This San Francisco-based position emphasizes in-person collaboration and offers relocation assistance, making it ideal for engineers passionate about applying data engineering to transform professional services.

Last updated 21 minutes ago

Responsibilities For Software Engineer, Data

  • Develop distributed crawlers, data pipelines, and storage infrastructure
  • Handle real-time updates from various data sources
  • Work with domain experts to structure complex datasets
  • Build and scale the Retrieval platform for RAG products
  • Crawl, structure, and index legal datasets
  • Scale data infrastructure to index millions of documents
  • Implement millisecond search and retrieval capabilities
  • Deploy cutting edge embedding search technologies

Requirements For Software Engineer, Data

Python
  • 3+ years of experience (post-BS/MS) in an engineering role
  • Experience with shipping and scaling data-powered products
  • Experience with data pipelines, databases, and backend platforms
  • Track record of shipping reliable products
  • Strong attention to detail
  • Experience with search infrastructure or vector databases is a plus
  • Experience working at early-stage startups is a plus

Benefits For Software Engineer, Data

Medical Insurance
Dental Insurance
Vision Insurance
401k
Equity
Relocation Benefits
  • Comprehensive health coverage
  • Dental coverage
  • Vision coverage
  • 401k match up to 4%
  • Flexible PTO
  • Equity
  • Relocation assistance

Interested in this job?

Jobs Related To Harvey Software Engineer, Data

Data Engineer (UK)

Data Engineer position at Dayshape, working on customer integrations using Azure Data Factory and ETL processes, offering hybrid work and competitive benefits.

Data Scientist / Data Engineer

Join an innovative AI startup as a Data Scientist/Engineer, building cutting-edge AI pipelines and fine-tuning LLMs to revolutionize European business solutions.

Application Engineer

Mid-level Application Engineer position at CompStak, focusing on ML solution implementation using Python and FastAPI, hybrid work in Belgrade.

Data Engineer

Data Engineer position at Mediatech, building and maintaining data infrastructure and pipelines, requiring 3+ years experience in data engineering, Python, and SQL skills.

Data Engineer, WW Installments Science and Engineering

Data Engineer role at Amazon's WW Installments team, building payment solutions and data pipelines for the Amazon Monthly Payment product.