Lead Site Reliability Engineer - 1235756N

Job-ID: 1235756N
Category: Engineers
Position Type: Contract & Remote

Job Description

Lead Site Reliability Engineer - Commerce Engineering, India - Tech OPS ICC

Become a Part of the Client, Inc. Team

Client, Inc. does more than outfit the world's best athletes. It is a place to explore potential, obliterate boundaries and push out the edges of what can be. The company looks for people who can grow, think, dream and create. Its culture thrives by embracing diversity and rewarding imagination. The brand seeks achievers, leaders and visionaries. At Client, it’s about each person bringing skills and passion to a challenging and constantly evolving game.

Client is a technology company. From our flagship website and five-star mobile apps to developing products, managing big data and providing leading-edge engineering and systems support, our teams at Client Global Technology exist to revolutionize the future at the confluence of tech and sport. We invest in and develop advances in technology and employ the most creative people in the world, and then give them the support to constantly innovate, iterate and serve consumers more directly and personally. Our teams are innovative, diverse, multidisciplinary and collaborative, taking technology into the future and bringing the world with it.

Reliability within Technology Operations ensures reliable quality and delivery of change. We are committed to measuring delivery against our commitments and continuously improving the ways we work. We cultivate an environment of transparency, accountability and continuous learning, and we strive to provide best-in-class technology services in all that we do.


Within Site Reliability Engineering, our goal is to provide technical solutions to complex production operations problems. We focus on reducing incident and problem toil, speeding detection and recovery of critical incidents through observability, and driving continuous improvement through operational health measurement and sharing. We also enable productivity through the deployment of consumer-grade workplace technologies, ownership of critical infrastructure and enablement of public clouds.

Who are we looking for?

We are looking for an experienced, highly motivated and passionate Lead Site Reliability Engineer to join our global team: someone with a collaborative mindset and the engineering intuition to assess and triage problems. The ideal candidate should be forward-thinking and ready to adapt to the latest technology trends.

What will you work on?

As a Site Reliability Engineer, you will be tasked with identifying problems in a DevOps landscape. While upholding ITIL processes, the Technical Lead's responsibilities include, but are not limited to:

  • Ensuring the scalability of backend AWS services
  • Utilizing strong analytical abilities to evaluate end-to-end customer experience across multiple channels and customer touch points
  • Analyzing incidents in real time and proposing workarounds and resolutions for a seamless consumer experience
  • Identifying trends, deriving performance-based insights, and brainstorming new creative strategies for future opportunities
  • Proposing technical enhancements where applicable, e.g. caching, session time, cookie/device token validations, app design improvements, etc.
  • Identifying redundancy and automating menial tasks for consumer experience teams
  • Developing and driving real-time monitoring solutions that provide visibility into site health and Service Level Indicators
  • Working with various technical/software engineering teams through day-to-day operations and critical incident/problem management processes to restore service, manage root cause analysis and recommend solutions for long-term fixes
  • Standing as the subject matter expert for consumer facing (e-commerce) software applications
  • Communicating projects/launch updates in weekly meetings, reporting and providing input to leadership teams

Who will you work with?

You will be working with engineering and operations teams to deliver and support consistent, reliable and scalable technology functions across Client’s global ecosystem. Working with a team of SREs, you will report to a Service Manager who will provide direction.

What will you bring?

The ideal candidate should possess the following:

  • Experience in AWS – configuring ASGs, EC2 instances, CloudWatch Alarms, DynamoDB, Elasticsearch, DAX caching, Lambda services, etc.
  • Experience utilizing Agile Scrum, ITIL and Lean
  • Familiarity with GitHub, JIRA, ServiceNow, and Scrum methodology
  • Previous experience with developing and driving real-time monitoring solutions that provide visibility into site health and key performance indicators
  • Familiarity with most of the following: Java, ServiceNow, Splunk, New Relic, ScienceLogic, cloud computing, VMs, Windows, Linux and AWS
  • 1-3 years’ technical experience working with consumer-facing (e-commerce) software applications in the cloud using AWS, Azure, or Google Cloud
  • Basic understanding of DNS, networking, virtualization and Linux
  • Basic understanding of most of the following: ServiceNow, Jira, Jenkins, Splunk, New Relic, EM7
  • 1-3 years of experience with one of the following: Java, Scala, Node.js or Python
