Site Reliability Engineer - SnappBox

Job description

We are looking for experienced, results-driven, and passionate engineers to join the SRE team. The ideal candidate is a self-starter and has excellent communication skills. Our collaborative environment relies heavily on innovation, technical savvy, and problem-solving skills. This is a full-time remote position within the Tehran. As a newest SRE Engineer, you’ll be a major contributor to the company’s success and you’ll have an opportunity to work alongside our wonderful SRE team that supports the Snappbox platform. You will embrace the SRE model, and work with other senior leaders on the team to modernize the tech stack.

  • Lead designs of software components, systems, and features to improve the availability, scalability, latency, and efficiency of SnappBox's services.
  • Lead sustainable incident response, blameless postmortems, and production improvements that result in direct business opportunities for SnappBox.
  • Provide guidance to other team members on managing availability and performance of mission critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions.
  • Mentor and train other team members on design techniques and coding standards, and to cultivate innovation and collaboration across multiple teams.
  • Manage individual projects priorities, deadlines, and deliverables.

Requirements

  • Strong analytical skills for problem-solving
  • communication skills
  • Have a good experience with Grafana and Prometheus
  • Familiar with log shipment / management tools (elk stack)
  • Experience in java/spring as a Developer (More than half of the total professional experiences)
  • Familiar with container docker
  • Familiar with Kubernetes
  • Strong TCP/IP knowledge
  • Familiar with Microservice architecture
  • Have a good experience with Linux (LPIC-2)
  • Familiar with log shipping solutions (ELK, Loki, ...)
  • Familiar with CI/CD
  • Familiar with REST API
  • Attention to details