Taro Logo

Staff Software Engineer - Distributed Data Systems

Databricks is the data and AI company that enables organizations to unify and democratize data, analytics and AI, founded by the original creators of Apache Spark™, Delta Lake and MLflow.
$192,000 - $260,000
Staff Software Engineer
In-Person
1,000 - 5,000 Employees
8+ years of experience
AI · Enterprise SaaS · Data

Description For Staff Software Engineer - Distributed Data Systems

At Databricks, we are building the next generation of distributed data systems that power some of the world's most sophisticated data and AI workloads. As a Staff Software Engineer on the Runtime team, you'll be working on cutting-edge distributed systems that process exabytes of data daily across millions of virtual machines. You'll be developing systems that combine the performance of specialized SQL engines with the flexibility to support diverse workloads from ETL to data science.

The role involves working on critical projects like Apache Spark™, Delta Lake, and Delta Pipelines, where you'll build systems that provide ACID transactions, time travel capabilities, and sophisticated orchestration for thousands of data pipelines. You'll be tackling complex challenges in distributed systems, working with cloud storage backends like AWS S3 and Azure Blob Store, and developing high-performance query optimizers.

This is an opportunity to join a team that's pushing the boundaries of data and AI technology while maintaining the resilience and scale necessary for enterprise customers. You'll be working alongside the original creators of Apache Spark™ in a rapidly growing company that serves thousands of organizations, including Fortune 100 companies.

The position offers competitive compensation ($192,000 - $260,000), comprehensive benefits, and the chance to work on technology that's solving real-world problems from security threat detection to cancer drug development. If you're passionate about distributed systems, have strong experience in Java/Scala, and want to work on technology that processes data at massive scale, this role offers the perfect challenge.

Last updated a month ago

Responsibilities For Staff Software Engineer - Distributed Data Systems

  • Build the next generation distributed data storage and processing systems
  • Develop Apache Spark™ framework
  • Deliver reliable and high performance services for cloud storage backends
  • Build next generation query optimizer and execution engine
  • Work on Delta Lake storage management system
  • Develop Delta Pipelines for orchestrating data pipelines

Requirements For Staff Software Engineer - Distributed Data Systems

Java
Scala
  • BS in Computer Science, related technical field or equivalent practical experience
  • 8+ years of production level experience in either Java, Scala or C++
  • Strong foundation in algorithms and data structures and their real-world use cases
  • Experience with distributed systems, databases, and big data systems (Apache Spark™, Hadoop)
  • Comfortable working towards a multi-year vision with incremental deliverables
  • Driven by delivering customer value and impact

Benefits For Staff Software Engineer - Distributed Data Systems

Medical Insurance
Dental Insurance
Vision Insurance
  • Comprehensive benefits package
  • Performance bonus
  • Equity

Interested in this job?

Jobs Related To Databricks Staff Software Engineer - Distributed Data Systems