Site Reliability Engineer - 1604567SE

Job ID: 1604567SE
Category: Engineers
Position Type: Contract, Remote

Job Description

As a site reliability engineer, you will focus on maximizing the availability, observability, reliability, security, and performance of Client Digital Experiences.

SREs perform deep problem analysis; detect infrastructure and code defects; define, create, and report on observability processes for Key Performance Indicators (KPIs); and work with product delivery teams to provide long-term solutions to production issues.

We are looking for talented and passionate full-stack developers with knowledge of datacenter infrastructure and cloud platforms who can bring the following:

  • Ability to observe, diagnose, and develop fixes for production issues quickly and efficiently
  • Ability to develop and drive real-time monitoring solutions that provide visibility into site health and key performance indicators
  • Strong written and verbal communication skills, including the ability to clearly articulate issues and their impact
  • Confidence and skill in reporting and communicating high-value metrics to leadership, backed by a deep understanding of the business landscape and how site reliability affects our consumers
  • Working understanding of IT service management (Incident, Problem, Change, and Knowledge management)
  • Ability to work across business and technical teams to continuously analyze system performance in production, troubleshoot consumer-reported issues, and proactively identify areas in need of optimization
  • Practical experience in managing and leading application reliability practices for consumer-facing web and mobile experiences
  • Demonstrated negotiation and influencing skills
  • Passion for coaching, teaching, mentoring and learning


  • Bachelor’s degree in Computer Science, Information Systems, Business, or other relevant subject area
  • 5 years of professional experience in software development, operations, or support
  • Design and development experience with Java, Node.js, or a similar language preferred
  • Proficiency with JavaScript on the frontend (React, Angular, etc.) is a plus
  • Experience in other modern enterprise languages (functional or other – Scala, Python, Golang, etc.) is preferred
  • Basic understanding of DNS, networking, virtualization, and Linux
  • Expertise in designing, building, and supporting scalable cloud-based microservices deployed to AWS preferred
  • Experience with Docker and/or serverless patterns
  • Experience with at least one NoSQL database such as DynamoDB or Cassandra
  • Good understanding of RESTful APIs
  • Basic understanding of common tools for service management, agile delivery, and observability, such as ServiceNow, Jira, Jenkins, Splunk, New Relic, and SignalFx
  • Background with ITIL or Lean is a plus


