Site Reliability Engineer (SRE)

Job description


  • Monitoring services
  • Extending and Improving current monitoring systems
  • Automate the current monitoring process
  • Deploying services to the production environment
  • Communicating with other teams to resolve incidents
  • Troubleshooting system problems
  • Deploying and changing the production environment

Requirements

  • *Ability to work in any shift pattern within the 24/7/365 operation including days, nights, holidays, and weekends.*
  • Familiar with Linux OS
  • Familiar with Grafana, ELK, Prometheus

  • Familiar with CI/CD

  • Familiar with Docker and Kubernetes