← All Positions
Posted May 25, 2026

Sr. Staff Site Reliability Engineer

Apply Now
Description: • Define and drive the company-wide reliability strategy across services. • Establish end-to-end system visibility frameworks for observability, detection, and resilience. • Partner with DevOps and Platform Engineering leadership to standardize SLI/SLOs and improve reliability practices across teams. • Serve as a technical escalation expert for reliability issues and incident response. • Build intelligent detection systems, including anomaly detection and connector health models. • Enable self-service observability for engineering teams. • Define and evolve a tiered incident communication strategy. • Lead postmortems and improve incident response practices to strengthen customer trust. • Contribute hands-on to system design, monitoring, and debugging across distributed systems and data pipelines. Requirements: • 5+ years of experience in SRE, Production Engineering, or a related role. • 3+ years of experience operating at a senior or technical leadership level, such as Staff scope or equivalent. • Deep expertise with AWS and/or GCP. • Experience with Kubernetes and Helm. • Experience with observability stacks such as Prometheus and Grafana, or equivalent tools. • Experience with CI/CD systems such as GitLab CI/CD and ArgoCD, or similar tools. • Proven experience designing and scaling reliability systems for multi-tenant SaaS platforms. • Strong debugging and systems thinking across distributed microservices and legacy systems. • Demonstrated ability to lead initiatives that improve incident detection, response, and system resilience. • Hands-on engineering approach with a track record of building reliability systems, not just configuring them. • Experience in B2B SaaS serving enterprise or financial customers, preferred. • Familiarity with third-party SaaS connector architectures and ingestion patterns, preferred. • Experience building anomaly detection or intelligent alerting systems, preferred. • Experience designing customer-facing status pages and incident communication frameworks, preferred. Benefits: • Competitive compensation with equity and 401(k). • Comprehensive healthcare with dental and vision coverage. • Flexible paid time off and paid holiday time off. • 12 weeks of new parent or family leave. • Personal and professional development resources. • Base salary range of $232,000 to $263,000 USD. • Eligibility for equity awards and possible sales commission or incentive compensation, depending on role or function. Apply tot his job Apply To this Job