Description:
As a Platform Engineer at Ben, you will help us ensure stable, secure & scalable operations and an excellent developer experience.
We’re looking for self-starters who are seeking a fast-paced environment where they can make a difference. Our team is small, which means high autonomy, ownership, and responsibility. We love what we do, have fun and, while we ship on time, take life-work balance seriously.
Things you will be working on...
- Infrastructure Automation: Setting up infrastructure components using Terraform (via Terragrunt) with maintainability, scalability & reliability in mind
- Developer Tooling: Building tooling and processes that help product squads increase the speed, reliability and visibility of their development workflows
- Observability: Building tooling and processes to improve monitoring, logging, events and tracing at both the infrastructure and application layers
- Collaboration: Working with product squads to deliver the infrastructure that new services and features need, in a secure and scalable manner
- Resource Optimisation: Right-sizing our services and implementing features such as auto-scaling to increase the cost-effectiveness of our platform and reduce waste
- Security: Collaborating with the security squad on projects to strengthen our security posture, such as improving our least-privilege approach to IAM roles for users and services
- Documentation: Maintaining clear and up-to-date documentation for systems and procedures so that product squads can self-serve more often
You'll thrive here if you...
- Have built cost-effective, secure systems in AWS - you understand the trade-offs between different services and can make pragmatic infrastructure decisions.
- Are comfortable with containerised applications (we run on ECS/Fargate) and have used Infrastructure as Code tooling like Terraform to manage cloud resources.
- Have worked extensively with monitoring and observability tools (we use Datadog and Sentry) and know how to debug issues in production.
- Can write solid shell and Python scripts to automate common tasks and build internal tooling.
- Communicate clearly with both technical and non-technical audiences - you can explain complex infrastructure decisions to product teams and leadership.
Nice-to-haves, but not deal-breakers...
- Experience implementing SLIs/SLOs and helping product teams track service reliability.
- Background working in ISO 27001 or SOC 2 compliant environments.
- Advanced Python development skills (our platform team maintains Python tooling deployed via Lambda).