Site Reliability Engineering Manager

  • On-site
    • Colombo, Western Province, Sri Lanka
  • Site Reliability Engineering

Job description

Cloud Solutions International Pvt Ltd is seeking a highly skilled and motivated Site Reliability Engineering Manager to join our dynamic Site Reliability Engineering department. As the Site Reliability Engineering Manager, you will play a crucial role in ensuring the reliability, scalability, and performance of our cloud infrastructure.

In this role, you will lead a team of talented Site Reliability Engineers and collaborate closely with cross-functional teams to drive continuous improvement and innovation. You will be responsible for overseeing the day-to-day operations of our cloud infrastructure, identifying and resolving technical issues, and implementing best practices to optimize system performance.

Key Responsibilities:

  • Lead and manage a team of Site Reliability Engineers, providing guidance, mentorship, and support.

  • Collaborate with cross-functional teams to define and implement strategies for improving system reliability, scalability, and performance.

  • Monitor and analyze system performance metrics, identifying areas for improvement and implementing proactive solutions.

  • Troubleshoot and resolve complex technical issues, ensuring minimal impact on system availability.

  • Implement and maintain monitoring, alerting, and incident response systems.

  • Develop and maintain documentation for system configurations, processes, and procedures.

  • Stay up to date with industry trends and emerging technologies, recommending and implementing innovative solutions.

Job requirements

Qualifications:

  • Previous experience in a similar role, managing a team of Site Reliability Engineers

  • Strong knowledge of Kubernetes

  • Proficiency in scripting and automation using languages like Python, Bash, or PowerShell

  • Experience with monitoring and logging tools, such as Prometheus, Grafana, or the ELK stack

  • Excellent problem-solving and troubleshooting skills

  • Strong communication and leadership abilities

If you are a highly motivated individual with a passion for ensuring the reliability and performance of cloud infrastructure, we would love to hear from you. Join our team and contribute to the success of Cloud Solutions International Pvt Ltd!
