Development and Operations Engineer
Trimble
Title: Development and Operations Engineer
Location: Chennai , India
Department: Trimble Cloud Core Platform
We are seeking a self-motivated and enthusiastic Senior Site Reliability Engineer to join the
Trimble Cloud Core Platform's Site Reliability Engineering team, which is responsible for
provisioning and operating our core services in the public cloud.
Key Responsibilities
● Quickly grasp and analyze new or new-to-you systems that are complex and rapidly
changing.
● Root cause analysis for production issues
● Identify problems and opportunities for improvements that are common across many
teams and services.
● Develop automation and monitoring solutions
● Utilize best practices in cloud security and operations
● Optimize application for maximum speed and scalability
● Collaborate with other team members and stakeholder
● Evaluate new tools, technologies, and processes to improve speed, efficiency, and
scalability of continuous integration environments
● Responsible for fixing compliance issues and requirements raised by SecOps tools
● Responsible for optimize cost across cloud platforms , logging and monitoring tools
● Foster collaboration with software product development, architecture, and engineering
team to ensure releases are delivered with repeatable and auditable processes
● Learn and be passionate about cloud computing
Required Skills and Experience
●Two to three year of strong experience with demonstrably deep AWS knowledge,
monitoring, troubleshooting, and related DevOps technologies
● Strong experience with CI/CD pipelines including Jenkins , Github Actions, and Azure DevOps
● Infrastructure automation using Terraform, CloudFormation , Ansible, Packer or
similar
● Experience with Cloud Orchestration frameworks, development and SRE support of
these systems
● OS image build for Linux, Windows and patch automation
● Deep understanding of Linux/Unix operating systems
● Familiarity and experience with architectural design.
● Serverless technology experience.
● Experience with scalability, security and performance engineering for web services.
● Responsible for mentoring and training others on the team
● Be an open team collaborator
● Support and troubleshoot scalability, high availability, performance, monitoring, backup
and restores of different environments
● Work independently across multiple platforms and applications to understand
dependencies
● Experience with scripting and automated process management via scripting, such as
Shell and Python
Desirable Skills and Experience
● Experience with monitoring tools like SumoLogic, DataDog , ELK , InfluxDB , Grafana,
Prometheus
● Experience in Atlassian tools , Bitbucket , Jira and Confluence
● Experience with containers
● Experience with serverless application models
● Experience with microservices
● Experience with NoSQL databases
● Experience with enterprise messaging