Current jobs related to Senior Site Reliability Engineer - Lahore, Punjab - Unifonic, Inc.
-
Senior Site Reliability Engineer
4 weeks ago
Lahore, Punjab, Pakistan Programmers Force Pvt. Ltd. Full timeJoin to apply for the Senior Site Reliability Engineer (SRE) role at Programmers Force.We are looking for a highly skilled Senior Site Reliability Engineer (SRE) with expertise in monitoring, performance optimization, and ensuring high availability for SaaS web applications. The ideal candidate will be responsible for building, scaling, and maintaining...
-
Senior Site Reliability Engineer
1 week ago
Lahore, Punjab, Pakistan beBee Careers Full timeSenior Site Reliability Engineer - A Key Role in Ensuring System Uptime and PerformanceIn this critical position, you will be responsible for building, scaling, and maintaining reliable systems that can handle large traffic loads while ensuring minimal downtime. The ideal candidate will have expertise in monitoring, performance optimization, and high...
-
Site Reliability Engineer
1 week ago
Lahore, Punjab, Pakistan ibex Full timeJoin to apply for the Site Reliability Engineer role at ibex1 day ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at ibexGet AI-powered advice on this job and more exclusive features.ibex. is looking for a Site Reliability Engineer, to join our team. This role offers the opportunity to work with cutting-edge...
-
Senior Site Reliability Engineer
4 weeks ago
Lahore, Punjab, Pakistan Sabre Corporation Full timeSenior Site Reliability EngineerApplyLocations: Pakistan - Lahore-FerozepurTime Type: Full timePosted on: Posted 9 Days AgoJob Requisition ID: JR104987Sabre is a technology company that powers the global travel industry. By leveraging next-generation technology, we create global technology solutions that take on the biggest opportunities and solve the most...
-
Site Reliability Engineer
1 week ago
Lahore, Punjab, Pakistan Taraki Full timeWe are hiring for an international startup. The role is on-site in Lahore (hybrid, i.e 1-2 days of remote work after probation period).About the CompanyA fast-growing fintech infrastructure company is building the backbone of institutional access to the digital asset ecosystem. Their end-to-end platform provides seamless connectivity to global crypto...
-
Site Reliability Engineer Professional
1 week ago
Lahore, Punjab, Pakistan beBee Careers Full timeSystem Reliability EngineerWe are seeking a highly skilled System Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for enhancing system reliability, scalability, and resilience.
-
Reliability Engineer
3 days ago
Lahore, Punjab, Pakistan beBee Careers Full timeSite Reliability Engineer Leader">">We are seeking a highly experienced Site Reliability Engineer Leader to join our team and lead our Engineering team in designing, developing, and maintaining the systems and technologies that drive our solutions.You will work closely with other departments to ensure our products and services meet the needs of our...
-
Reliable Infrastructure Specialist
3 days ago
Lahore, Punjab, Pakistan beBee Careers Full timeKey ResponsibilitiesAs a Senior Site Reliability Engineer, you will be responsible for:Developing and maintaining dashboards, alerts, and reporting systems for proactive monitoring of application performance, user experience, and system health.Ensuring end-to-end observability by integrating log aggregation, metrics, and tracing tools to identify and resolve...
-
Senior Site Reliability Lead
3 days ago
Lahore, Punjab, Pakistan beBee Careers Full timeCloud Reliability Expert">">We are looking for a highly skilled Cloud Reliability Expert to join our team and help us design, develop, and maintain the systems and technologies that drive our solutions.You will work closely with other departments to ensure our products and services meet the needs of our customers.Key responsibilities include:">Owning the...
-
System Reliability Engineer
1 week ago
Lahore, Punjab, Pakistan beBee Careers Full timeElevate System Reliability and PerformanceIn this critical role, you will be responsible for designing, building, and maintaining reliable systems that can handle high traffic loads and minimize downtime. You will work closely with engineering and DevOps teams to implement strategies for ensuring high availability, scalability, and disaster recovery for...

Senior Site Reliability Engineer
4 weeks ago
Proudly voted a Great Place to Work, we are a dynamic startup in the SaaS space that is revolutionizing the way businesses communicate. Our team is made up of 500 energetic and passionate Unifones who are dedicated to delivering the best possible experience to 5000+ customer-centric companies.
We pride ourselves on our fun and collaborative work environment, where creativity and new ideas are constantly encouraged. As shareholders in the business, we're so much more than a group of passionate communicators. We are Unifones. Join our team and be a part of something big
Meet the team
Our Engineering team is responsible for designing, developing, and maintaining the systems and technologies that drive Unifonic's solutions. We work closely with other departments to ensure our products and services meet the needs of our customers. If you are passionate about technology and are excited about working on cutting-edge communication and engagement solutions, we want you on our team.
As a Senior Site Reliability Engineer you will be responsible for enhancing system reliability, scalability, and resilience. As part of our elite SRE team, you'll drive continuous improvement across our cloud infrastructure and ensure the consistent high performance of our distributed messaging platforms.
Help us shape the future of communication by:
Production Operations and Incident Management:
Owning the reliability, uptime, and scalability of critical production services.
Participating in the on-call rotation to respond to incidents, troubleshoot live production issues, and lead post-incident analysis.
Building robust operational playbooks, escalation paths, and improve Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR).
Ensuring operational excellence by proactively detecting and addressing reliability risks through SLO monitoring, chaos testing, and capacity planning.
Automating operational tasks to minimize human intervention.
Cloud Architecture & Management:
Architecting, implementing, and managing infrastructure across AWS, Oracle Cloud Infrastructure (OCI), and OpenStack environments.
Optimizing cloud resources to balance performance, security, and cost-efficiency.
Kubernetes & Container Orchestration:
Managing Kubernetes clusters (EKS, OKE, Rancher RKE2), ensuring scalability, availability, and robust performance.
Deploying advanced containerization strategies and troubleshooting.
Messaging, Caching & Queuing Systems:
Managing and optimizing high-performance messaging and caching systems including Kafka, RabbitMQ, and Redis.
Ensuring efficient, reliable message and data delivery critical to Unifonic's SMS and distributed systems.
Database Reliability Engineering:
Managing and optimizing production-grade MySQL and PostgreSQL databases.
Ensuring high availability, performance tuning, backups, and recovery processes for critical databases.
Disaster Recovery & Business Continuity:
Leading the planning and execution of comprehensive disaster recovery strategies.
Developing and maintaining robust business continuity plans.
Monitoring, Observability & Incident Management:
Implementing advanced observability solutions (Prometheus, Grafana, CloudWatch).
Defining, measuring, and enforcing Service Level Objectives (SLOs) and Service Level Indicators (SLIs) in alignment with SRE best practices.
Proactively identifying issues, minimizing downtime, and enhancing system transparency.
Automation, CI/CD, and Infrastructure-as-Code:
Driving automation initiatives using Terraform, Helm, Jenkins, Tekton or GitLab CI/CD.
Streamlining deployment pipelines and reduce manual intervention through innovative automation.
Security & Compliance:
Integrating security best practices into infrastructure and application layers.
Performing regular audits ensuring compliance and robust security posture.
Team Collaboration & Technical Leadership:
Collaborating with cross-functional teams (engineering, product, QA) to foster SRE culture.
Mentoring junior engineers, enhancing team capabilities and promoting knowledge sharing.
What you'll bring:
Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.
8+ years of hands-on production experience in SRE, DevOps, or cloud engineering roles.
Strong expertise in AWS, OCI, OpenStack environments.
Deep understanding of Kubernetes ecosystems (EKS, OKE, Rancher RKE2).
Proven experience with Kafka, RabbitMQ, Redis, and distributed messaging and caching systems.
Solid experience managing MySQL and PostgreSQL in production environments.
Expert-level scripting and automation skills (Python, Bash, Go).
Advanced proficiency with Helm, Terraform, and modern CI/CD toolchains.
Demonstrable experience with Linux system administration and troubleshooting.
As a Unifone you'll receive a range of benefits:
Competitive salary and bonus
Unifonic share scheme (we are all owners)
30 holiday days after the first anniversary
Your Birthday off
Spend up to 25 days per year working from anywhere in the world
Paid leave and assistance for new parents
LinkedIn learning license