Harvey is revolutionizing the legal, tax, and finance sectors with its secure AI platform that leverages reasoning-adept LLMs for complex workflow automation. Having raised over $200 million from prestigious investors like Sequoia, Google Ventures, and OpenAI Startup Fund, Harvey has achieved remarkable growth from $0-30M ARR in just 18 months.
The company has established strong partnerships with industry giants including Paul Weiss, A&O Shearman, Ashurst, O'Melveny & Myers, PwC, and KKR. As a Software Engineer, Data, you'll be at the forefront of developing distributed crawlers, data pipelines, and storage infrastructure to handle real-time updates from various sources.
You'll work directly with domain experts to structure complex datasets in Legal, Tax, and Finance domains, while building and scaling the Retrieval platform for RAG products. Key projects include crawling and indexing legal datasets, ingesting international tax codes, and scaling infrastructure to handle millions of documents with millisecond retrieval times.
The role offers an opportunity to work with cutting-edge embedding search technologies and collaborate with OpenAI to shape the future of generative AI. You'll join a world-class team assembled from companies like DeepMind, Google Brain, Stripe, and FAIR, with competitive compensation and comprehensive benefits.
This San Francisco-based position emphasizes in-person collaboration and offers relocation assistance, making it ideal for engineers passionate about applying data engineering to transform professional services.