Senior Site Reliability Engineer

6 days ago


Lahore, Punjab, Pakistan | Programmers Force Pvt. Ltd. | Full time

Job Overview

We are looking for a highly skilled Senior Site Reliability Engineer (SRE) with expertise in monitoring, performance optimization, and ensuring high availability for SaaS web applications. The ideal candidate will be responsible for building, scaling, and maintaining reliable systems that can handle large traffic loads while ensuring minimal downtime. This role will focus on monitoring application performance, uptime, and reliability, working closely with engineering and DevOps teams to maintain seamless customer experiences. If you have a passion for automating reliability and scalability while maintaining the uptime of critical services, we'd love to have you on our team.

Key Responsibilities
  1. Monitoring and Observability: Design and implement monitoring solutions to ensure the health, performance, and availability of SaaS web applications and infrastructure. Develop and maintain dashboards, alerts, and reporting systems for proactive monitoring of application performance, user experience, and system health. Ensure end-to-end observability by integrating log aggregation, metrics, and tracing tools to identify and resolve issues before they impact customers.
  2. Incident Management & Root Cause Analysis: Lead the response to production incidents, working with cross-functional teams to identify the root cause and implement effective remediation strategies. Drive post-incident reviews and document incidents, identifying areas for improvement in systems, processes, and response strategies.
  3. Reliability & Availability: Collaborate with engineering and DevOps teams to implement strategies for ensuring high availability, scalability, and disaster recovery for critical services. Ensure systems are designed to handle high traffic loads and remain resilient to failures by building and deploying robust monitoring frameworks and automation tools.
  4. Automation & Efficiency: Drive automation efforts to eliminate manual intervention and improve system reliability through automated testing, deployment, and monitoring pipelines.
  5. Capacity Planning & Performance Tuning: Monitor system resource usage and identify potential capacity issues, driving proactive scaling and performance tuning initiatives.
  6. Collaboration & Cross-Functional Engagement: Work closely with developers, product managers, and DevOps engineers to improve application performance and reliability through better code, infrastructure, and operational practices.
  7. Continuous Improvement & Best Practices: Establish and promote best practices for reliability engineering, monitoring standards, incident management, and performance optimization.
Requirements
  • Required Skills and Qualifications:
    • 5+ years of experience as a Site Reliability Engineer (SRE), Systems Engineer, or DevOps Engineer with a focus on monitoring, reliability, and performance for SaaS-based web applications.
    • Proven track record in designing and maintaining monitoring systems for large-scale, high-availability applications.
    • Strong experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, Datadog, the ELK Stack (Elasticsearch, Logstash, Kibana), New Relic, or similar.
    • Expertise in setting up and managing cloud-based infrastructure monitoring (AWS CloudWatch, Google Cloud Operations, etc.).
    • Experience with containerized applications (Docker, Kubernetes) and orchestrating infrastructure at scale.
    • Proficiency in automation tools (e.g., Terraform, Ansible, Chef, Puppet) and programming/scripting languages (e.g., Python, Go, Shell).

Seniority level

Mid-Senior level

Employment type

Full-time

Job function

Engineering and Information Technology

Industries

Technology, Information and Internet
