Description:
We are looking for a Principal Software Engineer to lead technical direction and execution across the Billing Experience domain.
This is a hands-on technical leadership role that combines system design, operational excellence, and cross-team collaboration. You will shape how Elastic’s billing services evolve, improving scalability, reliability, and developer productivity across multiple components and teams.
You will partner closely with Product, SRE, and Cloud Infrastructure teams to ensure our systems are resilient, observable, and efficient at global scale.
What You Will Be Doing:
- Take end-to-end technical ownership of complex billing and account management systems that span multiple services and cloud providers.
- Design and implement large-scale data pipelines for billing data ingestion, transformation, and reconciliation, ensuring high performance and accuracy.
- Drive operational excellence: improve observability, incident response, and reliability of core billing components using modern DevOps practices.
- Shape system architecture for elasticity, scalability, and modularity, guiding service decomposition, API design, and data modeling efforts.
- Partner with SRE and Cloud teams to evolve deployment, monitoring, and CI/CD pipelines for critical billing services.
- Own PostgreSQL data architecture and performance tuning, ensuring query efficiency, schema design quality, and high availability.
- Act as the expert reviewer and go-to person for key parts of the billing experience stack, ensuring technical consistency and high-quality engineering practices.
- Mentor and influence engineers and tech leads across the Billing organization, helping them elevate design, operational standards, and long-term technical thinking.
- Collaborate cross-functionally with Product, Design, and Finance partners to shape roadmap priorities, ensuring technical feasibility and business alignment.
- Participate in on-call rotations and ensure production readiness through automation, resilience testing, and robust alerting.
What You Bring:
- Extensive experience (8+ years) in backend or platform engineering, with strong expertise in Python and PostgreSQL (schema design, performance optimization, HA setups).
- Deep understanding of distributed systems and event-driven architectures in large-scale SaaS environments.
- Operational focus with proven experience improving reliability, observability, and incident response in production systems.
- Strong background in cloud-native infrastructure (AWS, GCP, or Azure), including networking, IAM, monitoring, and deployment patterns.
- Demonstrated ability to design and lead multi-quarter projects that span multiple teams or domains, driving consensus and measurable results.
- Proven record of mentoring engineers, setting technical direction, and elevating engineering quality standards.
- Experience with SaaS billing, metering, or financial systems is a strong plus.
- Excellent written and verbal communication skills; able to influence across organizational boundaries and present clearly to technical and non-technical partners.
- Comfortable working autonomously in a globally distributed, asynchronous environment.