Software Engineer II (Python and Data pipelines) for Scribd

Software Engineer II (Python and Data pipelines)

Scribd

A platform creating a world of stories and knowledge through Everand, Scribd, and Slideshare, democratizing the exchange of ideas and information.

Salida, CO 81201, USA • Aspen, CO 81611, USA • New York, NY, USA…

$103,500 - $196,000

Data

Mid-Level Software Engineer

Remote

501 - 1,000 Employees

4+ years of experience

Enterprise SaaS · Education

Description For Software Engineer II (Python and Data pipelines)

Scribd is a dynamic platform revolutionizing digital content access through its three major products: Everand, Scribd, and Slideshare. As a Software Engineer II on the ML Data Engineering team, you'll be at the forefront of managing and processing hundreds of millions of documents and billions of images. The role involves building robust systems for metadata extraction and enrichment, working with cutting-edge technologies including Python, Scala, and AWS services. The team operates at an unprecedented scale, handling diverse datasets from UGC documents to ebooks and audiobooks. The company offers a flexible work environment through their Scribd Flex program, balancing remote work with intentional in-person collaboration. With competitive compensation ranging from $103,500 to $196,000 depending on location, comprehensive benefits, and a culture focused on GRIT (Goals, Results, Innovation, Team), this position offers an exciting opportunity to impact content discovery and structured metadata across multiple platforms. The role requires 4+ years of experience in backend software engineering and expertise in data pipeline development, making it perfect for those passionate about working with data at scale.

Last updated 3 days ago

Responsibilities For Software Engineer II (Python and Data pipelines)

Design and develop data pipelines to extract, enrich, and process metadata from millions of documents, images, and other content types
Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions
Build and maintain systems that operate at a massive scale, handling hundreds of millions of documents and billions of images
Optimize and refactor existing systems for performance, scalability, and reliability
Ensure data accuracy, integrity, and quality through automated validation and monitoring
Participate in code reviews, ensuring best practices are followed and maintaining high-quality standards in the codebase
Manage and maintain data pipelines, security and infrastructure

Requirements For Software Engineer II (Python and Data pipelines)

Python

MongoDB

Redis

4+ years of experience in backend software engineering, with hands-on work in developing data pipelines and building and deploying your own infrastructure
Proficient in one or more programming languages, such as Python, Ruby or similar
Experience working with a public cloud provider (AWS, Azure, or Google Cloud)
Hands-on experience with building, deploying, and optimizing solutions using ECS, EKS or AWS Lambdas
Experience with queueing and streaming technologies like SQS, Sidekiq, Kafka or Kinesis
Experience working with systems at scale such as External APIs, and data transformations
Proven ability to test and optimize systems for performance and scalability
Bachelor's in CS or equivalent professional experience

Benefits For Software Engineer II (Python and Data pipelines)

Medical Insurance

Dental Insurance

Vision Insurance

Parental Leave

401k

Education Budget

Mental Health Assistance

Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees
12 weeks paid parental leave
Short-term/long-term disability plans
401k/RSP matching
Onboarding stipend for home office peripherals + accessories
Tuition Reimbursement
Learning & Development programs
Quarterly stipend for Wellness, Connectivity & Comfort
Mental Health support & resources
Free subscription to Scribd + gift memberships for friends & family
Referral Bonuses
Book Benefit
Sabbaticals
Company wide events
Team engagement budgets
Vacation & Personal Days
Paid Holidays (+ winter break)
Flexible Sick Time
Volunteer Day

Scribd

A platform creating a world of stories and knowledge through Everand, Scribd, and Slideshare, democratizing the exchange of ideas and information.

Salida, CO 81201, USA • Aspen, CO 81611, USA • New York, NY, USA…

$103,500 - $196,000

Data

Mid-Level Software Engineer

Remote

501 - 1,000 Employees

4+ years of experience

Enterprise SaaS · Education

Interested in this job?

Jobs Related To Scribd Software Engineer II (Python and Data pipelines)

Data Engineer

2Brains

Remote Data Engineer position at 2Brains, focusing on building and optimizing data infrastructure and analytics solutions using modern cloud technologies.

Data Engineer

Walton Arts Center and Walmart AMP

Data Engineer apprenticeship role at Walton Arts Center and Walmart AMP, focusing on building and maintaining data infrastructure with guaranteed training and mentorship.

Data Engineer

Sword Health

Data Engineer position at Sword Health, developing and maintaining client reports and data warehouse solutions

Product Engineer, Data

Radar

Product Engineer role focusing on building end-to-end data oriented features across backend, frontend, and data infrastructure at Radar.

Analytics Engineer

Google DeepMind

Analytics Engineer role at Google DeepMind, building data foundations and providing insights to guide AI research strategy, 4+ years experience required.