SageMaker Training Jobs is seeking a Software Development Engineer to join their core team. This role is central to AWS's machine learning platform, where you'll build and maintain mission-critical systems for the industry-leading SageMaker training platform.
The position involves scaling systems to support training jobs across hundreds of thousands of machines while maintaining a failure rate below 0.1%. You'll be at the forefront of innovation, experimenting with new technologies to ensure SageMaker remains the fastest, easiest, and most cost-effective platform for data scientists.
Amazon SageMaker is a fully managed Machine Learning platform that simplifies the process of building, managing, and integrating ML models with custom applications for online predictions. The platform eliminates the complexity typically associated with large-scale Machine Learning implementations, allowing developers and scientists to focus on creative modeling and solving business challenges.
The role demands high standards of engineering and operational excellence, including:
The team strongly values work-life balance, offering flexible working hours and fostering an environment where both personal and professional life can thrive. They provide robust mentorship opportunities, with senior members offering one-on-one guidance and thorough code reviews.
AWS maintains an inclusive culture with ten employee-led affinity groups spanning 40,000 employees across 190+ chapters globally. They offer innovative benefits and host regular learning experiences, including Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, demonstrating their commitment to diversity and inclusion.