As a critical member of the Applied Engineering Analytics Data team at OpenAI, you will play a crucial role in enhancing our understanding of our infrastructure, particularly our GPU fleet: its allocation, utilization, costs, and optimization opportunities. This role is pivotal to ensuring we optimize infrastructure investments, which is vital for our AI research and deployment activities.
You will work on projects that develop key data sources and dashboards to provide actionable insights into infrastructure performance and efficiency, helping us to optimize our computational resources. This role is based in San Francisco, CA, and follows a hybrid work model of 3 days in the office per week. We offer relocation assistance to new employees.
Key responsibilities include:
- Collaborating with engineering and operations teams to identify high-impact analytics problems related to infrastructure utilization and develop data-driven solutions.
- Fostering a data-centric culture by defining, tracking, and operationalizing key infrastructure performance metrics.
- Leading cross-functional projects involving data collection, cleaning, and analysis from various sources, managing the lifecycle of metrics from prototyping to production.
- Developing and refining dashboards and reports that empower teams to independently extract and analyze infrastructure-related data.
- Designing and implementing data pipelines to consolidate data from multiple sources, ensuring that the resulting datasets are reliable and scalable.
The ideal candidate will have:
- Over 9 years of experience in relevant Data / Analytics Engineering roles.
- Expertise in SQL and experience with ETL workflows, especially in extracting and analyzing large datasets.
- Experience with batch data pipeline development, including tools like Spark or similar technologies.
- Strong data modeling skills, capable of designing OLAP data models using transactional tables to meet analytical requirements.
- Proficiency in quantitative programming languages, with a preference for Python, to create data models and tools for analytics.
- Familiarity with business intelligence tools such as Mode, Tableau, or Looker for data visualization and self-service analytics.
- Meticulous attention to detail and a strong commitment to data accuracy and integrity.
- Proven ability to build effective partnerships across diverse teams.
- Experience designing end-to-end data systems, from data ingestion through consumption.
- A track record of delivering substantial business impact.
Join OpenAI in shaping the future of technology and ensuring that general-purpose artificial intelligence benefits all of humanity.