Description:
Do you love building ⚙️ products people love? As a Site Reliability Engineer, you will "make things scale" which includes supporting delivery and operation of the managed accesso Horizon product in customers’ cloud environments (AWS/Azure/GCP). You will work to deploy, operate, support customer environments, and automate tasks using Site Reliability and Cloud Engineering best practices.
What You’ll Be Working On
- Provisioning and deploying accesso Horizon components to customer cloud accounts using Infrastructure as Code (Terraform) and ArgoCD.
- Maintain, improve, and create CI/CD pipelines (GitHub Actions / ArgoCD) for application and infrastructure deployments.
- Support monitoring, logging and alerting (Prometheus, Grafana, & Coralogix) and respond to alerts, along with acting as level 3 escalation.
- Lead incident triage, root cause investigation, and follow-up tasks.
- Follow security and compliance requirements for customer cloud environments (identity, secrets, network controls).
- Produce and maintain operational runbooks, deployment guides, and change notes.
- Participate in monthly on-call rotation as an L3 responder.
- Normal workdays may require time outside the normal working day.
- Learn and apply accesso Horizon product architecture and configuration.
Technologies 💻 You May Work With
- Configuration management: Terraform, Git, ArgoCD
- Cloud SQL Databases: Azure SQL, RDS
- Containerization and virtualization technologies: Cloud Kubernetes (AKS, GKE, EKS)
- Metrics and monitoring: Coralogix, Grafana and Dynatrace
- General Microservice / Application troubleshooting: Understanding logs and troubleshooting errors
What You Bring To The Role
- Professional exposure to Cloud Platforms (AWS/Azure/GCP)
- Practical Experience with Terraform, Docker, Cloud Managed Kubernetes (EKS/AKS/GKE), and monitoring tools.
- Self-Managed training – learning new concepts, trialing them, and applying them
- Scripting ability using Python or Bash.
- Familiarity with Linux systems and general command–line.
- Understanding Ops and CI/CD concepts.
- Good written and verbal communication; customer-focused approach.
- Ability to work with minimal direction.
- Willingness to learn, take direction, and work within a team.