Job Description:
• Manage complex cloud-based architectures using AWS services like EC2, Lambda, RDS, S3, EFS, and VPC to support scalable, high-performance game development workflows.
• Implement test automation and ensure pipeline efficiency across platforms.
• Develop and enforce best practices for configuration management, infrastructure as code (IaC), and version control.
• Define SLAs, SLOs and SLIs, runbooks, and other monitoring and incident response patterns, including automation and notifications.
• Ensure OSE’s CI/CD cloud infrastructure is scalable, reliable, performant, and secure.
• Work with game teams to build-out, maintain, and evolve the company’s build, test, and deployment systems and pipelines (CI/CD and test automation).
• Architect and optimize continuous integration and deployment pipelines for building, testing, and deploying PC/console games using Jenkins, Git, and Perforce.
• Design and implement custom metrics and dashboards to monitor performance, availability, and reliability of game servers, CI/CD pipelines, and backend services, ensuring operational transparency and actionable insights.
• Profile and optimize system resource usage (CPU, memory, disk I/O, and network) for both game servers and CI/CD systems, ensuring efficient resource allocation and minimizing bottlenecks in the pipeline and production environments.
• Implement log aggregation and analysis systems using AWS CloudWatch Logs to centralize logs across services, making troubleshooting and incident response efficient.
• Beyond gathering functional requirements, work diligently to surface quality attributes and other non functional specifications.
• Develop and maintain concise software and technical design documents.
• Work well as part of a collaborative and flexible team.
• Stay up-to-date with industry trends and emerging technologies in DevOps and gaming, recommending improvements to enhance our development processes.
Requirements:
• 3+ years developing build systems and pipelines for PC/console games in a professional environment
• 6+ years as a software development professional
• 2+ years of AWS cloud services experience
• Strong problem-solving skills, particularly in high-pressure, real-time gaming environments
• Excellent written/verbal communications skills
• Proficient in pipeline design, management, and optimization
• Expertise in Jenkins as both a build tool and a CI/CD orchestrator
• Solid experience with AWS Elastic Container Service (or Kubernetes) and Docker
• Solid experience with Terraform or CloudFormation for provisioning AWS resources and infrastructure automation
• Skilled in setting up server and backend monitoring solutions, runbooks, and multiplayer load tests (AWS System Manager/CloudWatch, DataDog, PagerDuty)
• Experience with PagerDuty or other escalation notification systems
• Have implemented and maintained site reliability runbooks, SLAs, SLIs, and SLOs
• Some familiarity with TeamCity and Unreal Engine’s Horde
• Pro Git source control/repository admin skills
• Solid Perforce admin skills
• Great scripting abilities (Linux Shell, and/or Powershell)
• BS in CS, Math, Physics, or equivalent experience.
Benefits:
• Competitive compensation
• Full suite of benefits available anywhere in the United States