Software Engineer, Data Acquisition

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
$310,000 - $385,000
Data
Senior Software Engineer
In-Person
5+ years of experience

Description For Software Engineer, Data Acquisition

OpenAI, a leading AI research and deployment company, is seeking a Senior Software Engineer to join their Data Acquisition team within the Pre-training organization. This role is crucial in managing web crawling and GPTBot services, working closely with Data Processing, Architecture, and Scaling teams. The ideal candidate will have 5+ years of industry experience, strong expertise in distributed systems, and proficiency in Kubernetes.

As a Software Engineer in Data Acquisition, you'll lead projects in web crawling, data ingestion, and search. You'll develop highly scalable systems handling petabytes of data, implement data indexing algorithms, and maintain backend services for data storage. Collaboration is key, as you'll work with various teams to ensure smooth data flow and system operability, including the legal team for compliance matters.

OpenAI offers a competitive salary range of $310K – $385K, along with generous equity and comprehensive benefits. These include medical, dental, and vision insurance, mental health support, a 401(k) plan with 50% matching, unlimited time off, paid parental leave, and an annual learning stipend.

Join OpenAI in their mission to ensure general-purpose AI benefits all of humanity. They value diversity and are committed to creating an inclusive environment. If you're passionate about pushing the boundaries of AI capabilities and want to work on cutting-edge technology that can shape the future, this role at OpenAI presents an exciting opportunity to make a significant impact in the field of artificial intelligence.

Last updated 5 months ago

Responsibilities For Software Engineer, Data Acquisition

  • Own and lead engineering projects in the area of data acquisition including web crawling, data ingestion, and search.
  • Collaborate with other sub-teams, such as Data Processing, Architecture, and Scaling, to ensure smooth data flow and system operability.
  • Work closely with the legal team to handle any compliance or data privacy-related matters.
  • Develop and deploy highly scalable distributed systems capable of handling petabytes of data.
  • Architect and implement algorithms for data indexing and search capabilities.
  • Build and maintain backend services for data storage, including work with key-value databases and synchronization.
  • Deploy solutions in a Kubernetes Infrastructure-as-Code environment and perform routine system checks.
  • Conduct and analyze experiments on data to provide insights into system performance.

Requirements For Software Engineer, Data Acquisition

Kubernetes
  • BS/MS/PhD in Computer Science or a related field.
  • 5+ years of industry experience in software development.
  • Experience with large web crawlers a plus
  • Strong expertise in large stateful distributed systems and data processing.
  • Proficiency in Kubernetes, and Infrastructure-as-Code concepts.
  • Willingness and enthusiasm for trying new approaches and technologies.
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong communication skills, both written and verbal.

Benefits For Software Engineer, Data Acquisition

Medical Insurance
Dental Insurance
Vision Insurance
401k
Education Budget
Parental Leave
Mental Health Assistance
  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • 401(k) plan with 50% matching
  • Unlimited time off and 13 company holidays per year
  • Paid parental leave (20 weeks) and family-planning support
  • Annual learning & development stipend ($1,500 per year)
  • Generous equity

Interested in this job?

Jobs Related To OpenAI Software Engineer, Data Acquisition

Data Engineer, Safety Systems

Senior Data Engineer role at OpenAI, working on ChatGPT's analytics and growth, offering $245K-$310K plus equity, based in San Francisco.

Data Infrastructure Engineer

Join OpenAI's Applied Data Platform team as a Data Infrastructure Engineer to design, build, and maintain scalable data systems powering critical AI research and products.

Data Scientist, Product

Senior Data Scientist role at OpenAI to drive data-driven product development for AI technologies.

Analytics Data Engineer, Applied Engineering

OpenAI is hiring an Analytics Data Engineer for their Applied Engineering team in San Francisco, offering competitive salary and benefits.

Data Infrastructure Engineer

OpenAI is hiring a Data Infrastructure Engineer to design and implement scalable data systems for AI research and development.