IT Operations Specialist

JOB DESCRIPTION

IT Operations Role:

  • 4 open roles
  • 6-month contract, likely to be extended
  • Reports to supervisor and works closely with the SRE and dev teams
  • It is expected that this person is on an on-call rotation for incident management
  • The candidate should be able to come in, take direction, and get heads down doing the work. There is a lot to catch up on, so the team will be grateful to have this person onsite and ready to help.

Hiring Process:

  • 30 min phone screen
  • 60 to 90 min onsite interview

This candidate needs experience in the following areas: 
Azure:
Needs experience using and automating activities with IaaS resources in Azure. This could range from creating Azure traffic management endpoints, to configuring virtual networks, to provisioning complete IaaS environments using ARM templates. Understands Azure monitoring and logging tools and is familiar with certificate management within Azure. This role ties closely into the VM role: the candidate must understand current workloads (CPU, memory, disk, and networking) well enough to ensure that the correct SKU is used, maximizing utilization while keeping costs low.

SharePoint:
This role needs extensive administrative experience managing large SharePoint farms and a good understanding of performance tuning and optimization, to ensure that each of the SharePoint roles is utilized to its fullest capacity. The candidate must keep the SharePoint environment highly reliable and scalable and be able to troubleshoot when issues or incidents arise. It is expected that this person is on an on-call rotation for incident management. A working understanding of PowerShell and the SharePoint administrative tools is essential.

VM:
This role was mentioned above under Azure experience, but it also needs significant experience running and operating IaaS VM environments. The candidate should have experience creating, operating, and managing multiple VM environments that can be automated with ARM templates and scripted to create immutable VM services. It is expected that this person is on an on-call rotation for incident management.

Working a ticket queue:
Ensures that the ticket queue is properly groomed: tickets receive an appropriate response, are triaged, and are routed to the appropriate personnel within the SRE/IT operations team. This person would ensure that weekly reports of ticket status are provided, reviewed, and acted upon. The role also includes user/group provisioning activities to ensure that correct access to various services is maintained. Experience using systems like JIRA, GitHub Issues, and ServiceNow is essential.
It is expected that this person is on an on-call rotation for incident management.

Incident Management:
This role could be tied to the ticket queue role above. It provides incident management by tracking each incident before, during, and after it occurs, ensuring that it is correctly tracked and resolved and that a root cause analysis is conducted, either by leading the analysis as a facilitator or by working with others. It is expected that this person is on an on-call rotation for incident management. Experience using systems like JIRA, GitHub Issues, and ServiceNow is essential.