Role: Performance Engineer with SRE and Cloud (AWS) Experience
Location:
Remote (United States and Canada)
Contract:
12+ Months
Job Description:
As a Performance Engineer, Site Reliability Engineering, you will:
• Define performance, scalability, and reliability metrics aligned to business services (e.g., customer journeys, checkout, promotions, event‑driven sales, and holiday readiness).
• Design, automate, and own the end‑to‑end performance testing strategy, including load, stress, spike, soak, and capacity testing.
• Build and maintain capacity metrics and dashboards to support scaling decisions and cost optimization.
• Create clear, actionable performance reports and summaries for technical and non‑technical stakeholders.
• Leverage observability and monitoring platforms (e.g., Datadog, CloudWatch) to analyze system performance and identify bottlenecks.
• Apply SRE metrics and frameworks such as availability, latency, saturation, DORA metrics, and service quality indicators.
• Contribute to automation initiatives that reduce toil and enable proactive or self‑healing responses to performance issues.
• Partner with engineering teams to identify performance risks early in the design and development phases.
• Advocate for performance engineering best practices and help raise performance literacy across the organization.
Qualifications
The Performance Engineer has:
• Hands‑on experience with performance/load testing concepts and methodologies.
• Solid understanding of Site Reliability Engineering fundamentals and principles.
• Strong knowledge of core performance metrics, including how to identify and test latency, throughput, error rates, resource utilization, and threading behavior.
• Ability to write performance test plans, capacity models, and detailed performance reports.
• Experience participating in incident response and driving or contributing to incident reviews.
• Proficiency in one or more programming or scripting languages (e.g., Python, Go, .NET).
• Familiarity with APIs, distributed systems, and cloud‑native architectures.
• Strong analytical thinking and problem‑solving skills.
• Excellent communication and collaboration skills.
Nice to Haves
• Experience with Infrastructure as Code (IaC) tools (Terraform).
• Experience with load testing in production.
• Experience with one or more load testing tools (e.g., k6, JMeter, Gatling, Locust).
• Experience with Salesforce, Cloudflare, or edge/CDN performance considerations.