Job Summary
Off-Shore Site Reliability Engineer
Job Description
We are looking for an Off-Shore Site Reliability Engineer to join our team and help us maintain and improve the reliability and performance of our cloud-based and on-prem services. You will work closely with other engineers, developers, and product managers to troubleshoot and resolve issues, automate tasks, and optimize processes.
Terraform, SRE concepts (SLI, SLO, Error Budget), Monitoring, Scripting, CI/CD containerization (Docker or Kubernetes) and GCP/Azure.
As an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets:
- Experience with cloud platforms such as Azure, or GCP
- Proficiency in scripting languages such as Python, Bash, or PowerShell
- Knowledge of DevOps tools and methodologies such as CI/CD, Git, GitHub, or Azure DevOps
- Familiarity with monitoring and logging tools such asGrafana , AppDynamics, or Splunk (Datadog, New relic, Dynatrace)
- Ability to analyze and troubleshoot complex systems and applications
Additionally, you will have the following skillsets:
- Good communication and collaboration skills
- Willingness to learn new technologies and best practices
- Problem-solving and critical thinking skills
- Attention to detail and quality
As an Off-Shore Site Reliability Engineer, you will be expected to:
- Willing to work during US overnight hours (normal offshore work hours) to troubleshoot and triage ServiceNow incidents quickly and effectively
- Able to communicate and coordinate with the onshore team and clients
- Leverage monitoring tools to assess platform health