Scribd is a digital content platform with three major products: Everand, Scribd, and Slideshare. As a Software Engineer II on the ML Data Engineering team, you'll help manage and process hundreds of millions of documents and billions of images. The role involves building robust systems for metadata extraction and enrichment, working with technologies including Python, Scala, and AWS services. The team operates at very large scale, handling diverse datasets that range from user-generated content (UGC) documents to ebooks and audiobooks. The company offers a flexible work environment through its Scribd Flex program, which balances remote work with intentional in-person collaboration. The position offers compensation ranging from $103,500 to $196,000 depending on location, comprehensive benefits, and a culture focused on GRIT (Goals, Results, Innovation, Team), along with the opportunity to shape content discovery and structured metadata across multiple platforms. The role requires 4+ years of experience in backend software engineering and expertise in data pipeline development, making it a good fit for engineers who enjoy working with data at scale.