Reliability Engineer for Operational Excellence

7 days ago


Hyderabad City Taluka, Pakistan beBeeOperations Full time
Job Description

A Site Reliability Engineering (SRE) Lead is sought after to drive operational excellence through data and KPIs. The ideal candidate will have hands-on expertise with Datadog, a strong grasp of IT operations, and the ability to implement workflow automation.

">
  • Apply deep knowledge of Information Technology Infrastructure Library (ITIL v4) and IT Service Management platforms.
  • Use Datadog to monitor performance, infrastructure, and digital experience.
  • Implement complex process workflows and track performance using metrics-driven reporting.
  • Demonstrate a strong understanding of IT Operations and its impact on application reliability.
  • Communicate technical concepts clearly and concisely to both technical teams and executive leadership.
  • Build strategic relationships across teams, departments, business stakeholders, and external partners.
  • Translate business requirements into measurable KPIs that reflect application stability and provide business insights.
  • Troubleshoot recurring issues with a focus on incident reduction and operational automation.
  • Identify Toil (manual, repetitive work) and propose automation opportunities.
  • React quickly to time-sensitive issues with strong problem-solving and decision-making skills.

Required Skills and Qualifications

  • 7+ years of experience in ITIL/ITSM management.
  • 3+ years working with Datadog APM tools, including infrastructure monitoring, logs, and digital experience components.
  • Proven experience in administering the Datadog platform across its various features.
  • Prior experience in a similar application support or SRE leadership role.
  • Familiarity with additional monitoring tools and modern observability technologies.
  • Excellent analytical, troubleshooting, and problem-solving skills.
  • Strong communication and organizational capabilities.
  • Ability to manage multiple tasks while prioritizing effectively.


  • Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time

    Join to apply for the Site Reliability Engineer (SRE) role at GSPANN Technologies, IncContinue with Google Continue with Google2 months ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer (SRE) role at GSPANN Technologies, IncSplunk, Information Technology Infrastructure Library (ITIL), IT Service Management...


  • Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time

    Workflows, Information Technology Infrastructure Library (ITIL), IT Service Management (ITSM), Splunk, IT Operations Management (ITOM)DescriptionGSPANN is hiring a Site Reliability Engineering (SRE) Lead with 10+ years of experience in IT Service Management (ITSM) and Application Performance Monitoring (APM). The ideal candidate will have hands-on expertise...


  • Hyderabad City Taluka, Pakistan beBeeReliability Full time 1,800,000 - 2,400,000

    Job Title: Lead SREWe are seeking a talented individual to fill the role of Lead Site Reliability Engineer. In this position, you will play a critical role in shaping the future of our organization and driving innovation.Demonstrate and champion site reliability culture and practicesLead initiatives to improve the reliability and stability of applications...

  • Technical Lead

    2 days ago


    Hyderabad City Taluka, Pakistan beBeeReliability Full time

    As a technical leader, you have the opportunity to shape the future of employee experience technology. Your role will involve leading a critical team, driving site reliability, and contributing significantly to the success of top achievers.Job DescriptionYou will be responsible for conducting resiliency design reviews, breaking down complex problems into...


  • Hyderabad City Taluka, Pakistan beBeeSite Full time

    Elevate your engineering expertise to new heights by leading a team of highly skilled professionals.As a Senior Lead Site Reliability Engineer, you will collaborate with stakeholders to define non-functional requirements and availability targets for applications and product lines.You will ensure these NFRs are integrated into product design and testing...


  • Hyderabad City Taluka, Pakistan JP Morgan Chase Full time

    Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, you hold a leadership role in your team, demonstrate strong knowledge across multiple...


  • Hyderabad City Taluka, Pakistan JP Morgan Chase Full time

    Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.As a Principal Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, you will work with your stakeholders to define non-functional requirements (NFRs)...


  • Hyderabad City Taluka, Pakistan JP Morgan Chase Full time

    When you mentor and advise multiple technical teams and move financial technologies forward, it's a big challenge with big impact. You were made for this.As a Senior Manager of Software Engineering at JPMorgan Chase within the Consumer and Community Banking, you serve in a leadership role by providing technical coaching and advisory to multiple technical...


  • Hyderabad City Taluka, Pakistan beBeeReliability Full time $150,000 - $170,000

    Job OpportunityElevate your engineering expertise by taking on a senior role in site reliability. You will join a team of skilled professionals and contribute to the development of high-quality systems.Key Responsibilities:Ensure the reliability and performance of IT systemsCollaborate with cross-functional teams to drive business outcomesDevelop and...


  • Hyderabad City Taluka, Pakistan Astronomer Full time

    Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow. Astro accelerates building reliable data products that unlock insights, unleash AI value, and powers data-driven applications. Trusted by more than 700 of the...