Site Reliability Engineer

4 weeks ago


Lahore, Punjab, Pakistan HR POD Careers Full time
Requirements:
  • 5+ years of experience in an SRE, DevOps, or infrastructure engineering role.
  • Strong experience with AWS or GCP, including services like EC2, Lambda, S3, and RDS (AWS) and GKE (GCP).
  • Experience with automation tools like Terraform.
  • Proficient in at least one scripting language (Python, Bash, Go, etc.).
  • Solid understanding of Linux systems, networking, and cloud-based architectures.
  • Experience working with container orchestration platforms like Kubernetes.
  • Proficient with CI/CD pipelines, preferably with cloud-native tools (e.g., GitHub).
  • Ability to troubleshoot complex, distributed systems and provide solutions in high-pressure environments.
  • Ability to communicate effectively with both technical and non-technical stakeholders.

Nice to have:
  • Exposure to Execution Management Systems (EMS) / Portfolio Management Systems (PMS).
  • Experience with client-impact triage, working cross-functionally with account managers or product teams.
  • Proficiency with Datadog or similar observability platforms.
  • Knowledge of serverless architectures (e.g., AWS Lambda, GCP Cloud Functions).
  • Familiarity with RDBMS and NoSQL databases, such as RDS, CloudSQL, and DynamoDB.
  • Prior experience in fintech, trading platforms, or 24/7 financial infrastructure.
  • Strong understanding of API integrations and how infrastructure issues might manifest in client environments.
  • Excellent problem-solving and communication skills, with the ability to translate technical incidents into clear client updates.
  • Experience working with client-facing teams.

Responsibilities:
  • Ensure the reliability, availability, and performance of production systems, particularly during weekends.
  • Take ownership of monitoring, troubleshooting, and incident response during weekends and off-hours.
  • Troubleshoot and resolve critical issues in a fast-paced, high-availability environment.
  • Automate manual processes and workflows, reducing operational overhead.
  • Work closely with engineering teams to design and deploy scalable, fault-tolerant infrastructure solutions on AWS or GCP.
  • Improve observability by utilizing monitoring, logging, and alerting systems (e.g., CloudWatch, Datadog).
  • Lead post-incident reviews, contribute to the continuous improvement of system reliability, and follow up on strategic fixes.
  • Develop and update runbooks, incident response playbooks, and documentation.
  • Work closely with Engineering, Product, and Client teams to proactively identify infrastructure pain points that could affect the user experience.
  • Monitor alert channels, logs, and infrastructure load for the entire stack.
  • Set up automation for alerting.

  • Lahore, Punjab, Pakistan Tkxel Full time

    Site Reliability Engineer (SRE) (Azure). 2 days ago. We are looking for a seasoned Site Reliability Engineer (SRE) to join our team and lead the charge in designing, implementing,...


  • Lahore, Punjab, Pakistan HR POD Careers Full time

    Site Reliability Engineer (Hybrid, Lahore, Remittance Salary). 5+ years of experience in...


  • Lahore, Punjab, Pakistan Soliton Technologies (Pvt) Ltd. Full time

    We are seeking a talented and experienced Site Reliability Engineer with a strong background in Java Software Engineering. In this role, you will be responsible for maintaining and improving the reliability, performance, and scalability of our systems. You will collaborate with cross-functional teams to ensure our applications are robust, efficient, and...


  • Lahore, Punjab, Pakistan ibex Full time

    Site Reliability Engineer. ibex is looking for a Site Reliability Engineer to join our team. This role offers the opportunity to work with cutting-edge technologies, drive automation, and enhance...


  • Lahore, Punjab, Pakistan Taraki Full time

    Site Reliability Engineer (SRE). We are hiring for an international startup. The role is on-site in Lahore (hybrid, i.e., 1-2 days of remote work after the probation period). About the company: A fast-growing fintech infrastructure company is building the backbone of institutional access to the digital asset ecosystem. Their...


  • Lahore, Punjab, Pakistan Taraki Full time

    We are hiring for an international startup. The role is on-site in Lahore (hybrid, i.e., 1-2 days of remote work after the probation period). About the company: A fast-growing fintech infrastructure company is building the backbone of institutional access to the digital asset ecosystem. Their end-to-end platform provides seamless connectivity to global crypto...


  • Lahore, Punjab, Pakistan Sabre Corporation Full time

    Senior Site Reliability Engineer. 3 days ago. The Site Reliability Engineer is responsible for supporting the development organization to seamlessly...

  • Reliability Engineer

    2 weeks ago


    Lahore, Punjab, Pakistan Velosi Asset Integrity Limited Full time

    Contact us at the Velosi office nearest to you or submit a business inquiry online. The strength of our approach comes from offering a multiregional service while meeting local needs. Velosi achieves this because we operate in a selection of the world's major established and emerging markets. (MD, Velosi Asset Integrity Limited) Velosi is always looking for talented people...


  • Lahore, Punjab, Pakistan beBeeEngineering Full time

    Job Description: A highly skilled and experienced engineer is required to ensure the reliability, availability, and performance of production systems. The ideal candidate will have a strong background in SRE, DevOps, or infrastructure engineering with a proven track record of ensuring high uptime and performance in fast-paced environments. Required Skills and...