Senior Site Reliability Engineer

4 weeks ago


Lahore, Punjab, Pakistan Programmers Force Pvt. Ltd. Full time

Join to apply for the Senior Site Reliability Engineer (SRE) role at Programmers Force.

We are looking for a highly skilled Senior Site Reliability Engineer (SRE) with expertise in monitoring, performance optimization, and ensuring high availability for SaaS web applications. The ideal candidate will be responsible for building, scaling, and maintaining reliable systems that can handle large traffic loads while ensuring minimal downtime. This role will focus on monitoring application performance, uptime, and reliability, working closely with engineering and DevOps teams to maintain seamless customer experiences. If you have a passion for automating reliability and scalability while maintaining the uptime of critical services, we'd love to have you on our team.

Job Overview

We are looking for a highly skilled Senior Site Reliability Engineer (SRE) with expertise in monitoring, performance optimization, and ensuring high availability for SaaS web applications. The ideal candidate will be responsible for building, scaling, and maintaining reliable systems that can handle large traffic loads while ensuring minimal downtime.

Key Responsibilities
  1. Monitoring and Observability: Design and implement monitoring solutions to ensure the health, performance, and availability of SaaS web applications and infrastructure. Develop and maintain dashboards, alerts, and reporting systems for proactive monitoring of application performance, user experience, and system health. Ensure end-to-end observability by integrating log aggregation, metrics, and tracing tools to identify and resolve issues before they impact customers.
  2. Incident Management & Root Cause Analysis: Lead the response to production incidents, working with cross-functional teams to identify the root cause and implement effective remediation strategies. Drive post-incident reviews and document incidents, identifying areas for improvement in systems, processes, and response strategies.
  3. Reliability & Availability: Collaborate with engineering and DevOps teams to implement strategies for ensuring high availability, scalability, and disaster recovery for critical services. Ensure systems are designed to handle high traffic loads and remain resilient to failures by building and deploying robust monitoring frameworks and automation tools.
  4. Automation & Efficiency: Drive automation efforts to eliminate manual intervention and improve system reliability through automated testing, deployment, and monitoring pipelines.
  5. Capacity Planning & Performance Tuning: Monitor system resource usage and identify potential capacity issues, driving proactive scaling and performance tuning initiatives.
  6. Collaboration & Cross-Functional Engagement: Work closely with developers, product managers, and DevOps engineers to improve application performance and reliability through better code, infrastructure, and operational practices.
  7. Continuous Improvement & Best Practices: Establish and promote best practices for reliability engineering, monitoring standards, incident management, and performance optimization.
Requirements
  • Required Skills and Qualifications:
    • 5+ years of experience as a Site Reliability Engineer (SRE), Systems Engineer, or DevOps Engineer with a focus on monitoring, reliability, and performance for SaaS-based web applications.
    • Proven track record in designing and maintaining monitoring systems for large-scale, high-availability applications.
    • Strong experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, Datadog, ELK Stack (Elasticsearch, Logstash, Kibana), New Relic or similar.
    • Expertise in setting up and managing cloud-based infrastructure monitoring (AWS CloudWatch, Google Cloud Operations, etc.).
    • Experience with containerized applications (Docker, Kubernetes) and orchestrating infrastructure at scale.
    • Proficiency in automation tools (e.g., Terraform, Ansible, Chef, Puppet) and programming/scripting languages (e.g., Python, Go, Shell).
    Seniority level

Mid-Senior level

Employment type

Full-time

Job function

Engineering and Information Technology

Industries

Technology, Information and Internet

#J-18808-Ljbffr

  • Lahore, Punjab, Pakistan beBee Careers Full time

    Senior Site Reliability Engineer - A Key Role in Ensuring System Uptime and PerformanceIn this critical position, you will be responsible for building, scaling, and maintaining reliable systems that can handle large traffic loads while ensuring minimal downtime. The ideal candidate will have expertise in monitoring, performance optimization, and high...


  • Lahore, Punjab, Pakistan ibex Full time

    Join to apply for the Site Reliability Engineer role at ibex1 day ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at ibexGet AI-powered advice on this job and more exclusive features.ibex. is looking for a Site Reliability Engineer, to join our team. This role offers the opportunity to work with cutting-edge...


  • Lahore, Punjab, Pakistan Sabre Corporation Full time

    Senior Site Reliability EngineerApplyLocations: Pakistan - Lahore-FerozepurTime Type: Full timePosted on: Posted 9 Days AgoJob Requisition ID: JR104987Sabre is a technology company that powers the global travel industry. By leveraging next-generation technology, we create global technology solutions that take on the biggest opportunities and solve the most...


  • Lahore, Punjab, Pakistan Taraki Full time

    We are hiring for an international startup. The role is on-site in Lahore (hybrid, i.e 1-2 days of remote work after probation period).About the CompanyA fast-growing fintech infrastructure company is building the backbone of institutional access to the digital asset ecosystem. Their end-to-end platform provides seamless connectivity to global crypto...


  • Lahore, Punjab, Pakistan beBee Careers Full time

    System Reliability EngineerWe are seeking a highly skilled System Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for enhancing system reliability, scalability, and resilience.


  • Lahore, Punjab, Pakistan beBee Careers Full time

    Site Reliability Engineer Leader">">We are seeking a highly experienced Site Reliability Engineer Leader to join our team and lead our Engineering team in designing, developing, and maintaining the systems and technologies that drive our solutions.You will work closely with other departments to ensure our products and services meet the needs of our...


  • Lahore, Punjab, Pakistan beBee Careers Full time

    Key ResponsibilitiesAs a Senior Site Reliability Engineer, you will be responsible for:Developing and maintaining dashboards, alerts, and reporting systems for proactive monitoring of application performance, user experience, and system health.Ensuring end-to-end observability by integrating log aggregation, metrics, and tracing tools to identify and resolve...


  • Lahore, Punjab, Pakistan beBee Careers Full time

    Cloud Reliability Expert">">We are looking for a highly skilled Cloud Reliability Expert to join our team and help us design, develop, and maintain the systems and technologies that drive our solutions.You will work closely with other departments to ensure our products and services meet the needs of our customers.Key responsibilities include:">Owning the...


  • Lahore, Punjab, Pakistan beBee Careers Full time

    Elevate System Reliability and PerformanceIn this critical role, you will be responsible for designing, building, and maintaining reliable systems that can handle high traffic loads and minimize downtime. You will work closely with engineering and DevOps teams to implement strategies for ensuring high availability, scalability, and disaster recovery for...

  • Site Civil Engineer

    3 weeks ago


    Lahore, Punjab, Pakistan Alfanar Projects Full time

    We are seeking a skilled and experienced Civil Site Engineer to oversee and manage civil works related to high voltage substations and transmission line infrastructure. The ideal candidate will have a strong background in civil engineering, site supervision, and construction management within the power transmission and distribution sector.Key...