Software Reliability Specialist

22 hours ago


Hyderabad City Taluka, Pakistan beBee Careers Full time

**Job Title:** Software Reliability Specialist

We are seeking a highly skilled Software Reliability Specialist to ensure the high availability and performance of our critical systems. The role involves designing and implementing monitoring systems, analyzing system performance, and optimizing efficiency.

The ideal candidate will have hands-on experience in Python 3, excellent knowledge of databases, including relational and NoSQL, and proficiency in unit testing and debugging using Python testing frameworks.

This is a full-time position requiring 8+ years of experience as a site reliability engineer (SRE). The successful candidate will collaborate with development and operations teams to improve system reliability and performance, communicate effectively with stakeholders, and adhere to best practices in Python development.

Key Responsibilities:

  • Ensure the high availability and performance of critical systems
  • Design and implement monitoring systems using tools like Prometheus or Nagios
  • Analyze system performance and optimize efficiency
  • Develop reliable Python code, manage APIs, and optimize system efficiency across teams
  • Follow best practices in Python development, including when working with frameworks like Django, Flask, and FastAPI
  • Manage and optimize relational (like MySQL) and NoSQL (like MongoDB) databases
  • Implement and manage CI/CD pipelines to automate deployment processes
  • Write and maintain unit tests using Python testing frameworks like pytest and unittest (PyUnit)
  • Debug and resolve software issues to ensure smooth application operation
  • Collaborate with development and operations teams to improve system reliability and performance

Requirements:

  • 8+ years of experience as an SRE
  • Hands-on experience in Python 3
  • Excellent knowledge of databases, including relational and NoSQL
  • Proficiency in unit testing and debugging using Python testing frameworks
  • Good communication skills


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    About the Role: Take on a critical leadership position, defining the future of a global organization and driving significant impact in site reliability. Key Responsibilities: Demonstrate and champion site reliability culture and practices, exerting technical influence across your team. Lead initiatives to improve application and platform reliability, leveraging...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    About the Role: We are seeking a skilled Principal Site Reliability Engineer to join our team. This role offers an exceptional opportunity for professionals with expertise in site reliability, software engineering, and leadership to elevate their careers and contribute significantly to our community.


  • Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time

    Site Reliability Engineering (SRE), Python, Django, FastAPI, Flask, SQL, RESTful, pytest. Description: GSPANN is hiring a Site Reliability Engineer to ensure high availability and performance of critical systems using tools like Prometheus and Nagios. The role involves developing reliable Python code, managing APIs, and optimizing system efficiency across...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    Cloud Engineering Specialist: We are seeking an experienced Cloud Engineering Specialist to join our team. The successful candidate will have 3+ years of experience in cloud infrastructure, automation, and software development. The ideal candidate will have hands-on expertise in software development, infrastructure, automation, and container orchestration. They...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    Senior Software Engineer: We are looking for a seasoned software engineer to join our team as a Consultant Specialist. Your key responsibilities will include: Developing robust and scalable back-end services using Spring Boot. Implementing RESTful APIs and gRPC services for seamless communication between microservices. Ensuring data management and persistence...


  • Hyderabad City Taluka, Pakistan JP Morgan Chase Full time

    Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability. As a Principal Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, you will work with your stakeholders to define non-functional requirements (NFRs)...


  • Hyderabad City Taluka, Pakistan JP Morgan Chase Full time

    Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, you hold a leadership role in your team, demonstrate strong knowledge across multiple...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    About Us: Behind a vast portfolio of iconic content and beloved brands are the storytellers bringing characters to life. We offer career-defining opportunities and thoughtfully curated benefits. The DTC Global Tech Organization: The organization has many software engineering teams building applications for various devices. SRE Roles and Responsibilities: Drive...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    SRE Architect: Enhancing System Reliability and Performance. We are seeking a highly skilled SRE Architect to join our Platform and Tooling team. The ideal candidate will be responsible for developing scalable, secure, and resilient solutions to enhance reliability and performance across cloud, on-prem, and private cloud environments. This role requires...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    About the Position: We are seeking a Site Reliability Engineering Lead with extensive experience in IT Service Management and Application Performance Monitoring. The ideal candidate will have hands-on expertise with Datadog, a strong grasp of IT operations, and the ability to implement workflow automation and drive operational excellence through data and...