Senior Site Reliability Engineer

Job Description


Our Client is looking for a Site Reliability Engineer and must establish and maintain effective working relationships with key stakeholders across the organization.

Who you are:

  • You are an expert with SRE principles and will champion the culture of DevOps required to maintain a frictionless high-quality platform infrastructure
  • You are an expert in monitoring the availability, latency, performance of traditional VMS, cloud services and environments by utilizing SLIs, SLOs and SLAs
  • You have advanced coding experience with a background in languages such as Java, JavaScript, .NET, C#, Node.js, Golang or similar
  • You have a mind-set to script and automate everything utilizing Python, Bash, PowerShell or similar
  • You have cloud administration experience with a deep understanding of Azure, AWS or Google offerings and have a proven track record of setting up and managing these environments
  • You work closely with Development and Operations teams to provide fully automated deployment routines for Production
  • You dive deep into technology and are on the forefront of the latest tools, technologies, and strategies and will help evaluate, prototype, and introduce them to our team
  • You will perform with broad independence and deliver on project milestones and tasks assigned by manager on schedule while communicating progress regularly
  • You have an excellent understanding of how work impacts overall business and how decisions may have a cascading effect in other areas
  • You will build strong relationships with whole team and organization
  • You are professional with client facing skills including good verbal and written communication


Competency in a blend of the following areas, typically achieved in 1-3 or more years of closely related experience:

  • Computer Science degree is required or equivalent combination of education, certifications and relevant work experience
  • Ability to code/script as mid to senior level engineer in a software engineering organization is a must have
  • Cloud Administration experience with Azure, AWS or Google is a must have
  • 3+ years’ experience as a software engineer or system engineer across multiple systems, languages and frameworks
  • Strong build automation experience supporting continuous integration and delivery using tools such as Azure DevOps, Ansible or similar technology
  • Strong DevOps focus and experience building and deploying infrastructure as code with Terraform or similar technology
  • Strong research/troubleshooting skills to resolve complex programming issues and implement longer term solutions to frequently occurring issues
  • Proficient with a variety of monitoring tools such as DataDog, New Relic, AppDynamics with the ability to establish thresholds, dashboards and alerting best practices
  • Experience planning, coordinating, developing and executing all stages of test scripts as needed to make system improvements
  • Experience supporting and securing Windows or Linux systems in 24×7 production environment
  • Experience with containerization and managing kubernetes
  • Experience with common networking & load balancing protocols
  • Good written and verbal communication skills with the ability to document and communicate technical solutions at all levels

Reference Number: 5325