Senior Site Reliability Engineer
Semios
Who we are:
Founded in 2010, Semios Group is a leading agricultural technology company helping growers, agronomists, and ag retailers manage over 200 million acres across five countries. Semios pioneered variable-rate pheromone-based mating disruption in orchards and has since expanded into a comprehensive portfolio covering crop protection, water management, frost control, automation, and a leading farm management information system. The Semios Group includes trusted brands such as Semios, Agworld, Altrac, and Greenbook. We continue to drive the next generation of digital agriculture, supporting growers, agronomists and ag retailers in improving sustainability and profitability.
Our innovative work has been recognized with several industry awards, including:
- AgTech Breakthrough – Smart Irrigation Company & Pest Management Solution of the Year
- Thrive Top 50
- Google for Startups Accelerator Cohort
- Global Cleantech Top 100
We know our journey is only achievable by having a great team who shares ideas, tries new things, and learns as we go.
Who you are:
You’re driven by purpose and motivated by work that matters. You’re looking for more than a role; you want to be part of a growing, forward-thinking company solving real-world challenges to improve how farming works, today and for the future.
As a Senior Site Reliability Engineer (SRE), you will play a key role in ensuring the scalability, reliability, and performance of our infrastructure and services. Operating within a high-performing engineering team, this role bridges the gap between development and operations, with a strong focus on automation, observability, and resilience. You will use your technical expertise and leadership skills to drive improvements across systems and processes, ensuring we deliver high-quality, reliable products to our customers. This is a hands-on role where your impact will be felt across both technical execution and team development. You will be part of a team operating across time zones and global regions.
What you will do:
Leadership & Collaboration
- Lead the delivery of infrastructure projects.
- Plan and perform higher-risk maintenance.
- Contribute to resolving incidents and participate in an on-call roster.
- Work with product and software development colleagues to improve the resiliency and reliability of our products.
- Mentor team members in all aspects of SRE work.
- Manage your productivity and workload in a work-from-home environment.
System Reliability & Performance
- Use a data-driven approach to identify changes to the product architecture to improve reliability, performance, and availability.
- Fully understand production environments and the end-to-end delivery process.
- Identify parts of the system that do not scale and drive solutions for these problem areas.
- Maintain and improve Service Level Indicators (SLIs) that align with availability and performance targets.
- Build quality into the team's work by encouraging refactoring, testing, and breaking up the team’s work into small, releasable pieces.
Technical Skills & Expertise
- Have good knowledge of Linux and bash or similar.
- Be versed in the delivery of a SaaS product on AWS, GCP, or Azure.
- Have strong programming skills (Ruby, Python, Go, etc.).
- Be competent with Terraform or similar Infrastructure as Code (IaC) tools.
- Have experience with Docker, Kubernetes, EKS, or similar technologies.
- Have experience building CI/CD delivery pipelines on Buildkite or similar platforms.
- Have strong version control skills with Git.
- Be experienced in improving monitoring and observability using Datadog or similar tools (New Relic, Splunk, etc.).
- Have the desire to document and/or automate to reduce repetitive tasks.
- Enjoy delivering quickly and iterating fast.
We want you to succeed, so you will need:
- 5+ years of relevant experience in DevOps, SRE, or infrastructure engineering roles.
- At least 2–3 years in a senior or lead capacity, with demonstrated ownership of critical systems and mentoring responsibilities.
- Hands-on experience with modern cloud environments (AWS, GCP, or Azure), including deployment, scaling, monitoring, and cost optimization of SaaS applications.
- Proven experience implementing and managing observability stacks (e.g., Datadog, Prometheus, New Relic, Splunk) and driving improvements to SLIs/SLOs.
- Experience in incident management, including participation in on-call rotations and leading post-incident reviews with a focus on continuous improvement.
Salary range: $120,000 to $140,000 per year
We publish a salary range to provide transparency and represent the full growth potential of the role; as a result, offers are made based on demonstrated mastery and experience and generally fall near the midpoint.
Why this is the opportunity for you:
- Purposeful Work: Make a global impact by advancing sustainable food production.
- Our People: Work with a fun, collaborative, and supportive team.
- Recharge: Generous vacation policy, company-paid holidays and year-end winter break.
- Work Flexibility: Hybrid working arrangements and strong work-life balance culture.
- Prioritize Your Well-Being: Access comprehensive health plans designed to support your physical and mental health.
- Group RRSP, which includes a 3% company-paid match after three months of employment.
- Office location that is convenient via transit and bike paths.
At Semios Group, we value the full range of experience and perspectives people bring—not just what’s listed in a job description. If your background is a close match, we encourage you to apply. If you need accommodations during the interview process, please let us know.
We welcome all applicants regardless of race, gender, orientation, sexual identity, economic class, ability, disability, age, religious beliefs or disbeliefs, or status. We believe that different perspectives and backgrounds are what make a company flourish.