Site Reliability Engineer I
3 days ago
- System Reliability: Ensuring the reliability of software systems by designing, implementing, and maintaining scalable and reliable infrastructure.
- Automation: Developing automation tools and scripts to streamline operational tasks, reduce manual intervention, and improve overall system efficiency.
- Incident Response and Resolution: Monitoring system performance and responding to incidents promptly to minimize downtime and ensure high availability.
- Capacity Planning: Analyzing system usage patterns and forecasting future capacity needs to ensure that the infrastructure can handle current and future demands.
- Performance Optimization: Identifying and addressing performance bottlenecks in software systems through optimization and tuning.
- Infrastructure as Code (IaC): Implementing infrastructure as code practices, using tools like Terraform or Ansible, to define and manage infrastructure in a version-controlled and automated manner.
- Monitoring and Logging: Implementing and maintaining monitoring and logging solutions to gain insights into system behavior, troubleshoot issues, and proactively address potential problems.
- On-Call Support: Participating in an on-call rotation to respond to incidents outside of regular working hours and ensure 24/7 system availability
- Security: Collaborating with security teams to implement and maintain security best practices in infrastructure and application
- Disaster Recovery Planning: Developing and maintaining disaster recovery plans to ensure that systems can quickly recover from major outages or failures
- Continuous Improvement: Continuously analyzing system performance, reliability, and incidents to identify areas for improvement and implementing changes to enhance overall system resilience.
- Programming Languages: Proficiency in one or more programming languages, commonly Python, Go, Shell, Bash.
- Automation and Scripting: Strong automation skills using tools like Ansible, Puppet, Chef, or custom scripts. Knowledge of Infrastructure as Code (IaC) tools like Terraform
- Containerization and Orchestration: Experience with containerization technologies like Docker and container orchestration platforms like Kubernetes.
- Cloud Computing: Proficiency in any of the cloud platforms such as AWS, Azure, or Google Cloud Platform, and knowledge of managing infrastructure in the cloud.
- Monitoring and Logging: Familiarity with monitoring tools (e.g., Prometheus, Grafana, ELK stack) and logging frameworks to track system performance and troubleshoot issues.
- Networking: Understanding of networking concepts, protocols, and troubleshooting skills.
- Security: Knowledge of security best practices, including encryption, access controls, and vulnerability management.
- Continuous Integration/Continuous Deployment (CI/CD): Understanding and implementation of CI/CD pipelines for automated testing and deployment.
- Load Balancing: Experience in incident response, troubleshooting, and resolution.
- Version Control: Proficient use of version control systems like Git.
- 2-4 years of experience in site reliability engineering.
- B.Tech/M.Tech in computer science, information technology or a related field.
- Having experience working for a product organization is a plus.
- Certifications from cloud service providers like AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or Microsoft Certified is a plus
#LifeAtZeta is adventurous and exhilarating at the same time. You get to work with some of the best minds in the industry and experience a culture that values the diversity of thoughts. If you want to push boundaries, learn continuously and grow to be the best version of yourself, Zeta is the place to be Explore the life at zeta
Zeta is an equal opportunity employer. At Zeta, we are committed to equal employment opportunities regardless of job history, disability, gender identity, religion, race, marital/parental status, or another special status. We are proud to be an equitable workplace that welcomes individuals from all walks of life if they fit the roles and responsibilities.
-
Principal Site Reliability Engineer I
3 days ago
Hyderābād, Sindh, Pakistan Zeta Full time $80,000 - $120,000 per yearAbout Zeta Zeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015. Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing,...
-
Principal Engineer I
3 days ago
Hyderābād, Sindh, Pakistan Zeta Full time 15,000,000 - 25,000,000 per yearAbout Zeta Zeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015. Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing,...
-
Senior Site Reliability Engineer
3 days ago
Hyderābād, Sindh, Pakistan Zeta Full time $120,000 - $240,000 per yearZeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015. Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing, lending, core...
-
Site Reliability Engineer
2 weeks ago
Hyderābād, Sindh, Pakistan Capgemini Full time 1,200,000 - 2,400,000 per yearChoosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. Join us and help the world's leading organizations unlock the value of technology and...
-
Site Reliability Engineer
1 day ago
Hyderābād, Sindh, Pakistan Capgemini Engineering Full time 1,200,000 - 3,600,000 per yearChoosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. Join us and help the world's leading organizations unlock the value of technology and...
-
Site Reliability Engineering
3 days ago
Hyderābād, Sindh, Pakistan Tata Consultancy Services (TCS) Full time 500,000 - 1,500,000 per yearMust Have:Hands on Experience in PostgreSQLHands on Experience in SQLworked in Terraform, AnsibleMust have Very good Knowledge in CloudGood to Have:Linux KnowledgeJava KnowledgeLocationHyderabadJob FunctionIT INFRASTRUCTURE SERVICESRoleDatabase AdministratorJob Id381153Desired SkillsPostgreSQL | SQL | AnsibleDesired Candidate ProfileQualifications : BACHELOR...
-
Data Reliability Engineer I
3 days ago
Hyderābād, Sindh, Pakistan Zeta Full time $60,000 - $120,000 per yearAbout Zeta Zeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015. Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance,...
-
Lead Site Reliability Engineer
3 days ago
Hyderābād, Sindh, Pakistan Zeta Full time $120,000 - $240,000 per yearZeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015. Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing, lending, core...
-
Senior Site Reliability Engineer
3 days ago
Hyderābād, Sindh, Pakistan Lloyds Technology Centre Full time 600,000 - 1,200,000 per yearEnd DateWednesday 12 November 2025We Support Flexible Working – Click here for more information on flexible working optionsFlexible Working OptionsHybrid WorkingJob Description SummaryA Senior SRE is accountable for one or more areas of the cloud infrastructure resources and supervises the work of the SREs in that area. They will focus on observability of...
-
Lead I
1 week ago
Hyderābād, Sindh, Pakistan UST Full time 900,000 - 1,200,000 per year9 - 12 Years2 OpeningsHyderabadRole descriptionWho we are:At UST, we help the world's best organizations grow and succeed through transformation. Bringing together the right talent, tools, and ideas, we work with our client to co-create lasting change. Together, with over 26,000 employees in 25 countries, we build for boundless impact—touching billions of...