Software Engineer II (Python and Data pipelines)

A platform creating a world of stories and knowledge through Everand, Scribd, and Slideshare, democratizing the exchange of ideas and information.
Salida, CO 81201, USA · Aspen, CO 81611, USA · New York, NY, USA
$103,500 - $196,000
Data
Mid-Level Software Engineer
Remote
501 - 1,000 Employees
4+ years of experience
Enterprise SaaS · Education

Description For Software Engineer II (Python and Data pipelines)

Scribd is a dynamic platform revolutionizing digital content access through its three major products: Everand, Scribd, and Slideshare. As a Software Engineer II on the ML Data Engineering team, you'll manage and process hundreds of millions of documents and billions of images. The role involves building robust systems for metadata extraction and enrichment, working with technologies including Python, Scala, and AWS services. The team handles diverse datasets, from user-generated content (UGC) documents to ebooks and audiobooks. The company offers a flexible work environment through its Scribd Flex program, balancing remote work with intentional in-person collaboration. With competitive compensation ranging from $103,500 to $196,000 depending on location, comprehensive benefits, and a culture focused on GRIT (Goals, Results, Innovation, Team), this position offers an opportunity to impact content discovery and structured metadata across multiple platforms. The role requires 4+ years of experience in backend software engineering and expertise in data pipeline development, making it a good fit for those passionate about working with data at scale.

Last updated 3 days ago

Responsibilities For Software Engineer II (Python and Data pipelines)

  • Design and develop data pipelines to extract, enrich, and process metadata from millions of documents, images, and other content types
  • Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions
  • Build and maintain systems that operate at massive scale, handling hundreds of millions of documents and billions of images
  • Optimize and refactor existing systems for performance, scalability, and reliability
  • Ensure data accuracy, integrity, and quality through automated validation and monitoring
  • Participate in code reviews, ensuring best practices are followed and maintaining high-quality standards in the codebase
  • Manage and maintain data pipelines, security, and infrastructure

Requirements For Software Engineer II (Python and Data pipelines)

Python
MongoDB
Redis
  • 4+ years of experience in backend software engineering, with hands-on work in developing data pipelines and building and deploying your own infrastructure
  • Proficiency in one or more programming languages, such as Python, Ruby, or similar
  • Experience working with a public cloud provider (AWS, Azure, or Google Cloud)
  • Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda
  • Experience with queueing and streaming technologies like SQS, Sidekiq, Kafka, or Kinesis
  • Experience working with systems at scale, such as external APIs and data transformations
  • Proven ability to test and optimize systems for performance and scalability
  • Bachelor's degree in CS or equivalent professional experience

Benefits For Software Engineer II (Python and Data pipelines)

  • Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees
  • 12 weeks paid parental leave
  • Short-term/long-term disability plans
  • 401k/RSP matching
  • Onboarding stipend for home office peripherals + accessories
  • Tuition Reimbursement
  • Learning & Development programs
  • Quarterly stipend for Wellness, Connectivity & Comfort
  • Mental Health support & resources
  • Free subscription to Scribd + gift memberships for friends & family
  • Referral Bonuses
  • Book Benefit
  • Sabbaticals
  • Company wide events
  • Team engagement budgets
  • Vacation & Personal Days
  • Paid Holidays (+ winter break)
  • Flexible Sick Time
  • Volunteer Day
