Join the engineering teams that bring OpenAI's ideas safely to the world! The Applied Engineering team works across research, engineering, product, and design to bring OpenAI's technology to consumers and businesses. We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely.
As an Observability Software Engineer at OpenAI, you will play a crucial role in ensuring the observability, reliability, scalability, and performance of our systems as we continue to expand. You will work in a deeply iterative, collaborative, fast-paced environment to bring our technology to millions of users around the world, ensuring it's delivered with safety and reliability in mind.
Key Responsibilities:
- Develop and maintain systems for effective monitoring, logging, and tracing of software applications
- Integrate tools for logging, monitoring, and alerting to enhance visibility into system performance
- Collaborate with different engineering teams to integrate observability practices into their workflows
- Analyze system performance and identify areas for improvement
- Stay up-to-date with the latest trends in observability, logging, monitoring, and cloud technologies
- Participate in strategic planning for the technology roadmap
- Create comprehensive documentation for observability systems and processes
Ideal Candidate:
- Has a track record of building and operating observability systems at scale
- Enjoys addressing bottlenecks and improving system performance
- Experienced with Infrastructure as Code (IaC) principles
- Collaborates well with cross-functional teams
- Has a humble attitude and eagerness to help colleagues
- Owns problems end-to-end and is willing to learn new skills
Requirements:
- Bachelor's degree in Computer Science or related field (or equivalent experience)
- Proficiency in monitoring tools (e.g., DataDog, Prometheus, Grafana, ELK stack) and cloud platforms
- Strong background in software engineering and relevant programming languages
- Experience with containerization and orchestration technologies
- Excellent communication skills
- Strong understanding of distributed systems, networking, and database technologies
OpenAI offers competitive compensation, including equity, and comprehensive benefits. Join us in shaping the future of AI technology!