Senior Platform/DevOps Engineer (Kubernetes-Linux-Azure Local)
Armada
About the Company
Armada is an edge computing startup that provides computing infrastructure to remote areas where connectivity and cloud infrastructure are limited, as well as areas where data needs to be processed locally for real-time analytics and AI at the edge. We’re looking to bring on the most brilliant minds to help further our mission of bridging the digital divide with advanced technology infrastructure that can be rapidly deployed anywhere.
About Armada.ai
Armada.ai is at the forefront of revolutionizing distributed AI infrastructure by harnessing decentralized computing. We are building a cutting-edge platform that unlocks unprecedented scale and efficiency through our innovative, ruggedized Galleon mobile data centers and the Armada Edge Platform (AEP). Join our team to shape the future of infrastructure and its global impact.
About the Role
We are seeking an experienced, collaborative, and detail-oriented Senior Platform/DevOps Engineer to join our growing Edge team.
You will be a subject matter expert in Microsoft Azure and Kubernetes, responsible for the design, automation, optimization, and operation of our hybrid Azure Local and Kubernetes-based deployments on bare-metal infrastructure. You will support our Galleon mobile data centers and AEP integration, ensuring the reliability and performance of our distributed computing platform.
This is a critical role where you will leverage technical expertise in Azure, bare-metal provisioning, and Linux to build and manage resilient, secure, and scalable hybrid environments across diverse edge locations.
Location: This role is based on-site at our Bellevue, Washington office.
What You'll Do (Key Responsibilities)
Platform Architecture & Management
- Architect, design, deploy, and manage highly available Azure Local environments on bare-metal servers across our edge data centers.
- Architect, design, deploy, configure, and manage Kubernetes clusters in on-premises (Galleon data centers) and cloud (Azure) environments.
- Utilize Azure Arc to centrally manage and monitor distributed Azure Local instances, ensuring consistent governance and security.
- Administer, maintain, and monitor the health, performance, and capacity of Kubernetes clusters and underlying infrastructure.
- Implement and manage Kubernetes networking solutions (CNI plugins, Ingress controllers) and storage solutions (PV/PVC, Storage Classes, CSI drivers).
Automation & Infrastructure as Code (IaC)
- Lead the automation of Azure Local deployment and configuration, including network, storage, and security setup.
- Drive Infrastructure-as-Code (IaC) initiatives using tools like Terraform, Ansible, Helm, and potentially Kubernetes Operators to promote automation, repeatability, and reliability.
- Develop and maintain automation scripts and IaC templates for repeatable and scalable deployments.
- Automate cluster operations, deployment pipelines (CI/CD integration), and infrastructure provisioning.
Security, Optimization & Operations
- Secure Azure Local deployments using best practices and technologies such as Azure Policy, Azure Key Vault, and network security groups.
- Implement and enforce Kubernetes security best practices (RBAC, Network Policies, Secrets Management, Security Contexts, Image Scanning).
- Troubleshoot and resolve complex issues related to Azure Local, bare-metal infrastructure, networking, the Kubernetes platform, and containerized services.
- Optimize Kubernetes clusters for performance, scalability, and resource utilization, particularly in edge environments.
- Contribute to the operational excellence of the platform, including participating in on-call rotations, incident management, and building self-healing capabilities.
Required Qualifications
Experience
- DevSecOps Experience: At least 7 years of experience in DevSecOps/SRE and platform engineering, with a significant focus on building and managing complex production environments.
- Kubernetes Experience: Minimum of 5 years of hands-on experience designing, deploying, and administering production Kubernetes clusters, with experience specifically in on-premises and bare-metal deployments.
- Azure Experience: Minimum of 2 years of hands-on experience designing, deploying, and administering production Azure environments, with experience in Azure cloud or on-premises deployments.
- Linux Experience: Expertise in Linux administration and troubleshooting, demonstrated through at least 2 years of hands-on experience managing complex Linux environments.
Technical Skills
- Azure Services: In-depth knowledge of Azure services, including Azure Arc, Azure Kubernetes Service (AKS), Azure Monitor, and Azure Policy.
- Networking: Strong understanding of networking concepts (TCP/IP, DNS, routing, firewalls, Load Balancing, VPNs) and container networking (CNI).
- IaC: Strong understanding and proven experience with Infrastructure as Code (IaC) solutions, specifically Terraform, Ansible, or Bicep.
- Scripting: Proficiency in scripting languages like Python, Bash, or PowerShell for automation.
- Monitoring: Experience configuring and managing robust monitoring/logging tools (e.g., Prometheus, Grafana, ELK Stack).
- Azure Networking: Strong experience with Virtual Networks, ExpressRoute, VPN Gateway, and network security configurations.
Education
- A bachelor's degree in Computer Science, Engineering, Information Technology, or a related technical field, or equivalent practical experience.
Preferred Qualifications
- Experience deploying and maintaining CI/CD pipelines/solutions for DevSecOps, such as Azure DevOps, GitHub Actions, GitLab CI, or Jenkins.
- Familiarity with Azure Kubernetes Service (AKS), particularly on Azure Stack HCI/Azure Local.
- Experience with Kubernetes operators and Custom Resource Definitions (CRDs).
- Experience with service mesh technologies like Istio or Linkerd.
- Experience managing infrastructure/Kubernetes in edge computing or resource-constrained environments.
- Certifications:
  - Azure certifications such as Azure Administrator Associate, Azure Solutions Architect Expert, or Azure Stack HCI Specialist.
  - Kubernetes certifications (CKA, CKS, CKAD).
Why Join Armada.ai?
- Be part of a team building the future of distributed computing and AI, impacting our innovative Galleon data center deployments and their integration with the Armada Edge Platform (AEP).
- Work with the latest technologies in edge computing, mobile data centers, AI infrastructure, Kubernetes, and hybrid cloud management.
- Join a rapidly growing company with ample opportunities for advancement.
- Collaborate with talented and passionate individuals dedicated to pushing boundaries.
Compensation
For U.S.-based candidates: To ensure fairness and transparency, the starting base salary range for this role for candidates in the U.S. is listed below and varies based on location, experience, skills, and qualifications.
In addition to base salary, this role will also be offered equity and subsidized benefits (details available upon request).
Benefits
- Competitive base salary and equity
- Medical, dental, and vision (subsidized cost)
- Health savings accounts (HSA), flexible spending accounts (FSA), and dependent care FSAs (DCFSA)
- Retirement plan options, including 401(k) and Roth 401(k)
- Unlimited paid time off (PTO)
- 15 paid company holidays per year
You're a Great Fit if You're
- A go-getter with a growth mindset. You're intellectually curious, have strong business acumen, and actively seek opportunities to build relevant skills and knowledge.
- A detail-oriented problem-solver. You can independently gather information, solve problems efficiently, and deliver results with a "get-it-done" attitude.
- Energized by a fast-paced environment. You have an entrepreneurial spirit, work quickly, and are excited to contribute to a growing company.
- A collaborative team player. You focus on business success and are motivated by team accomplishment over personal agenda.
- Highly organized and results-driven. Strong prioritization skills and a dedicated work ethic are essential for you.
Equal Opportunity Statement
At Armada, we are committed to fostering a work environment where everyone is given equal opportunities to thrive. As an equal opportunity employer, we strictly prohibit discrimination or harassment based on race, color, gender, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other characteristic protected by law. This policy applies to all employment decisions, including hiring, promotions, and compensation. Our hiring is guided by qualifications, merit, and the business needs at the time.
Unsolicited Resumes and Candidates
Armada does not accept unsolicited resumes or candidate submissions from external agencies or recruiters. All candidates must apply directly through our careers page. Any resumes submitted by agencies without a prior signed agreement will be considered unsolicited, and Armada will not be obligated to pay any fees.