Lead Site Reliability Engineer (Digibank)India
Grab
This job is no longer accepting applications
See open jobs at Grab.See open jobs similar to "Lead Site Reliability Engineer (Digibank)India" SpaceTalent.Life at Grab
At Grab, every Grabber is guided by The Grab Way, which spells out our mission, how we believe we can achieve it, and our operating principles - the 4Hs: Heart, Hunger, Honour and Humility. These principles guide and help us make decisions as we work to create economic empowerment for the people of Southeast Asia.
Get to know the Team
We are living in dynamic times. Technology is reshaping how we live, and we want to use it to redefine how financial services are offered. Grab is the leading technology company in Southeast Asia offering everyday services to the masses. Singtel is Asia’s leading communications group connecting millions of consumers and enterprises to essential digital services. This is why we are coming together to unlock big dreams, and financial inclusion for people in our region is just one of them. We want to build a digital bank with the right foundation - using data, technology, and trust to solve problems and serve customers. Join us if you have what it takes to help build this new Digibank with us.
Get to know the Role
At Digibank we treat Infrastructure and operations as Software Engineering problems. Our mission is to build and progress software platforms which enables the provisioning and managing of all Digibank services in safe, reliable and scalable ways. We consistently challenge the status quo, use new technologies to build platforms and tooling for engineering teams. In this role you will make significant decisions with a huge impact on building modern banking technology. You would be part of a team, responsible for designing & architecting new solutions, finding creative ways to optimise existing solutions which will improve agility for managing hundreds of microservices infrastructures in a stable & reliable way.
If you are:
A strong believer of automating DevOps & SRE aspects like infrastructure provisioning, deployment, observability, incident lifecycle, uptime SLA etc.
Bold to challenge, open to get challenged, curious to learn & grow
This is the right place for you!
The Day-to-Day Activities:
Working with Kubernetes clusters hosted in AWS
Using InfrastructureAsCode tooling like Terraform, and Ansible to manage AWS, Azure & Kubernetes resources
Engage with the development teams throughout the life cycle to help develop software for reliability and scale. Coaching team's SRE best practices
Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions
Build and drive adoption for greater self-healing and resiliency patterns
Design automated software and product upgrades, change management, and release management solutions
Design, code, test and deliver software to automate manual operational work. Own your tools and services end to end.
Performance and cost optimization for infrastructure
Be part of an on-call rotation for the team’s tooling and 24x7 support coverage as needed
Succeed, fail, and learn together with other talented people. We believe in an environment that provides an opportunity for growth and see education as an outcome of failure that gets us closer to the next breakthrough
The Must-Haves:
Bachelor's degree in information systems, information technology, computer science, or similar.
9+ years of professional experience.
Experience with administering Kubernetes cluster
Experience with managing Infrastructure as code using Terraform
Direct production operations experience in a cloud environment.
Experience contributing to technology and product strategy.
Experience leading capability-building initiatives across diverse areas such as infrastructure and operations automation, observability, incident management, architecting HA systems, and other core engineering.
Demonstrated experience in driving operational efficiency and transparency of a growing engineering organization.
Our Commitment
We are committed to building diverse teams and creating an inclusive workplace that enables all Grabbers to perform at their best, regardless of nationality, ethnicity, religion, age, gender identity or sexual orientation, and other attributes that make each Grabber unique.
Equal opportunity
Grab is an equal opportunity employer. We owe our success to the talents of our globally-diverse team and the varying perspectives they add to our thriving community.
Recruitment agencies
Grab does not accept unsolicited resumes sent by recruiting agencies. Please do not forward resumes to our job postings, Grab employees or other parts of the business. Grab will not be liable to pay any fees to agencies for candidates hired as a result of unrequested resumes.
This job is no longer accepting applications
See open jobs at Grab.See open jobs similar to "Lead Site Reliability Engineer (Digibank)India" SpaceTalent.