Software Engineer III, Site Reliability Engineering, Google Cloud

Google

Google is a global technology leader that specializes in internet-related services and products, including cloud computing, software, and hardware.

Warsaw, Poland

Site Reliability

Mid-Level Software Engineer

Contact Company

5,000+ Employees

2+ years of experience

Enterprise SaaS · Cloud

Description For Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while managing system capacity and performance. The role focuses on optimizing existing systems, building infrastructure, and automation. You'll tackle unique scaling challenges specific to Google Cloud, applying expertise in coding, algorithms, complexity analysis, and large-scale system design. The team values diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll collaborate with professionals from diverse backgrounds, taking calculated risks and working on meaningful projects. The role offers strong support and mentorship for continuous learning and growth. Your technical expertise will be crucial in managing project priorities, deadlines, and deliverables, as well as designing, developing, testing, deploying, maintaining, and enhancing software solutions. Join a culture that promotes self-direction while providing the support needed to tackle complex distributed systems challenges.

Last updated 19 hours ago

Responsibilities For Software Engineer III, Site Reliability Engineering, Google Cloud

Write product or system development code
Review code developed by other engineers and provide feedback to ensure best practices
Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback
Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality
Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies

Requirements For Software Engineer III, Site Reliability Engineering, Google Cloud

Linux

Kubernetes

Bachelor's degree in Computer Science, a related field, or equivalent practical experience
2 years of experience with software development in one or more programming languages
2 years of experience with data structures or algorithms
Experience working in computing, distributed systems, storage, or networking
Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
Ability to debug, optimize code, and to automate routine tasks
Systematic problem-solving approach, coupled with effective verbal and written communication skills

Google

Google is a global technology leader that specializes in internet-related services and products, including cloud computing, software, and hardware.

Warsaw, Poland

Site Reliability

Mid-Level Software Engineer

Contact Company

5,000+ Employees

2+ years of experience

Enterprise SaaS · Cloud

Google

Find Winner on a Tic Tac Toe Game

Data Structures & AlgorithmsEasy

Tic-tac-toe is played by two players A and B on a 3 x 3 grid. The rules of Tic-Tac-Toe are: Players take turns placing characters into empty squares ''. The first player A always places 'X' characters, while the second player B always places 'O' characters. 'X' and 'O' characters are always placed into empty squares, never on filled ones. The game ends when there are three of the same (non-empty) character filling any row, column, or diagonal. The game also ends if all squares are non-empty. No more moves can be played if the game is over. Given a 2D integer array moves where moves[i] = [rowi, coli] indicates that the ith move will be played on gridrowi. return the winner of the game if it exists (A or B). In case the game ends in a draw return Draw. If there are still movements to play return Pending. You can assume that moves is valid (i.e., it follows the rules of Tic-Tac-Toe), the grid is initially empty, and A will play first. For example: moves = [[0,0],[2,0],[1,1],[2,1],[2,2]] Output: A moves = [[0,0],[1,1],[0,1],[0,2],[1,0],[2,0]] Output: B moves = [[0,0],[1,1],[2,0],[1,0],[1,2],[2,1],[0,1],[0,2],[2,2]] Output: Draw

Arrays

Google

How would you design a thread-safe LRU cache with O(1) get and put operations?

System DesignHard

Let's design a cache. Consider the following requirements: Basic Functionality: Implement a cache with put(key, value) and get(key) methods. Eviction Policy: Implement an LRU (Least Recently Used) eviction policy. When the cache is full, the least recently accessed item should be evicted to make space for the new item. Thread Safety: Ensure the cache is thread-safe to handle concurrent access from multiple threads. Generics: The cache should be able to store any type of key and value (using generics). Time Complexity: get and put operations should have an average time complexity of O(1). Example Usage: Cache cache = new Cache(5); // Cache with capacity 5 cache.put(1, one); cache.put(2, two); cache.get(1); // Access key 1, moves it to the most recently used cache.put(3, three); cache.put(4, four); cache.put(5, five); cache.put(6, six); // This will evict key 2 (LRU) System.out.println(cache.get(2)); // Output: null (because it was evicted) Discuss the data structures you would use, the implementation details for each method, and how you would address thread safety.

Database Problems

Arrays

Strings

Two Pointers

Stacks

Binary Search

Sliding Windows

Linked Lists

Trees

Recursion

Graphs

Dynamic Programming

Greedy Algorithms

Bit Manipulation

Google

Where do you see yourself in 5 years?

Behavioral

Where do you see yourself in 5 years?

Interested in this job?

Jobs Related To Google Software Engineer III, Site Reliability Engineering, Google Cloud

Software Developer III, Site Reliability Development, Google Cloud

Google

Site Reliability Developer role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and growth opportunities.

Technical Program Manager, Site Reliability Engineering

Google

Technical Program Manager position at Google's SRE team, leading infrastructure and service delivery projects with focus on operational excellence and cross-functional collaboration.

Program Manager, Platforms and Devices Site Reliability Engineering

Google

Lead complex technical programs for Google's Platforms and Devices SRE team, managing cross-functional projects and driving organizational efficiency.

Site Reliability Engineer

Google

Site Reliability Engineer position at Google Dublin, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Software Engineer III, Shopping Build Site Reliability Engineer

Google

Site Reliability Engineer role at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services.