Senior Data Center Operations Engineer, Google Cloud
The AI and Infrastructure team is redefining what’s possible. We empower Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability, and velocity. Our customers include Googlers, Google Cloud customers, and billions of Google users worldwide.
We're the driving force behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware, our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more.
US: $159,000 - $231,000 (USD) + 15% bonus target + equity + benefits
Learn more about benefits at Google.
Responsibilities:
- Help design and build system-level testbeds and maintain lab space, tools, and infrastructure.
- Work with teams to develop software and hardware systems, along with methodologies and tools to support team members.
- Coordinate the interactions between labs and engineering.
- Anticipate and resolve problems by applying knowledge and skills, escalating design, manufacturing, and product testing issues to other people within the organization when appropriate.
- Generate implementation plans and provide technical expertise and guidance during deployment activities.
Minimum qualifications:
- Bachelor’s degree in Electrical Engineering, Computer Engineering, Computer Science, Physics, or a specialized field, or equivalent practical experience.
- 4 years of experience working in a data center or network operations center technical environment.
Preferred qualifications:
- Master's degree or PhD in Electrical Engineering, Computer Engineering, Physics, or a related field.
- Understanding of network design, protocols, and troubleshooting.
- Understanding of computer systems: physical, functional, logical, mechanical, electrical, software, thermal, etc.
- Ability to read and write code in Python or any other scripting language.
- Excellent communication skills with an ability to act as a team player.
- Excellent time management skills with an ability to manage constant priority changes and deadlines.