Site Reliability Engineering - Lead

  • On-site
    • Colombo, Western Province, Sri Lanka
  • Site Reliability Engineering

Job description

Cloud Solutions International Pvt Ltd is seeking a highly skilled and experienced Site Reliability Engineering Lead to join our Site Reliability Engineering department. As the Lead, you will be responsible for overseeing the reliability, scalability, and performance of our cloud-based infrastructure.

Key Responsibilities:

  • Lead a team of Site Reliability Engineers in designing, implementing, and maintaining highly available and scalable systems.

  • Collaborate with cross-functional teams to identify and resolve performance bottlenecks and ensure system reliability.

  • Develop and implement best practices for monitoring, troubleshooting, and resolving incidents.

  • Drive automation initiatives to improve operational efficiency and reduce manual intervention.

  • Participate in capacity planning and performance tuning activities.

  • Stay up-to-date with the latest industry trends and technologies to drive continuous improvement.

Job requirements

Qualifications:

  • Proven experience as a Site Reliability Engineer or similar role.

  • Strong knowledge of cloud infrastructure and distributed systems.

  • Strong knowledge of Linux and of writing and running SQL queries (Oracle, PostgreSQL, SQL).

  • Proficiency in programming languages such as Python, Java, or Go.

  • Experience with containerization technologies like Docker and Kubernetes.

  • Experience with Git hosting platforms (Bitbucket, Gitea, GitHub, GitLab).

  • Strong expertise in at least one major cloud platform (AWS, Azure, or GCP).

  • Strong knowledge of GitOps tools and principles (ArgoCD, Helm, Kustomize).

  • Solid understanding of DevOps tools (Jenkins, Bamboo) and caching technologies (Redis).

  • Solid understanding of networking protocols and security principles.

  • Excellent problem-solving and troubleshooting skills.

  • Ability to lead and mentor a team of engineers.

  • Strong communication and collaboration skills.
