
Reliability Engineer for Operational Excellence
7 days ago
We are seeking a Site Reliability Engineering (SRE) Lead to drive operational excellence through data and KPIs. The ideal candidate will have hands-on expertise with Datadog, a strong grasp of IT operations, and the ability to implement workflow automation.
- Apply deep knowledge of Information Technology Infrastructure Library (ITIL v4) and IT Service Management platforms.
- Use Datadog to monitor performance, infrastructure, and digital experience.
- Implement complex process workflows and track performance using metrics-driven reporting.
- Demonstrate a strong understanding of IT Operations and its impact on application reliability.
- Communicate technical concepts clearly and concisely to both technical teams and executive leadership.
- Build strategic relationships across teams, departments, business stakeholders, and external partners.
- Translate business requirements into measurable KPIs that reflect application stability and provide business insights.
- Troubleshoot recurring issues with a focus on incident reduction and operational automation.
- Identify toil (manual, repetitive work) and propose automation opportunities.
- React quickly to time-sensitive issues with strong problem-solving and decision-making skills.
Required Skills and Qualifications
- 7+ years of experience in ITIL/ITSM management.
- 3+ years working with Datadog APM tools, including infrastructure monitoring, logs, and digital experience components.
- Proven experience in administering the Datadog platform across its various features.
- Prior experience in a similar application support or SRE leadership role.
- Familiarity with additional monitoring tools and modern observability technologies.
- Excellent analytical, troubleshooting, and problem-solving skills.
- Strong communication and organizational capabilities.
- Ability to manage multiple tasks while prioritizing effectively.
-
Site Reliability Engineer
2 weeks ago
Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time
Splunk, Information Technology Infrastructure Library (ITIL), IT Service Management...
-
Site Reliability Engineering Lead
1 week ago
Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time
Workflows, Information Technology Infrastructure Library (ITIL), IT Service Management (ITSM), Splunk, IT Operations Management (ITOM)
Description: GSPANN is hiring a Site Reliability Engineering (SRE) Lead with 10+ years of experience in IT Service Management (ITSM) and Application Performance Monitoring (APM). The ideal candidate will have hands-on expertise...
-
Site Reliability Engineering Leader
6 hours ago
Hyderabad City Taluka, Pakistan beBeeReliability Full time 1,800,000 - 2,400,000
Job Title: Lead SRE
We are seeking a talented individual to fill the role of Lead Site Reliability Engineer. In this position, you will play a critical role in shaping the future of our organization and driving innovation.
- Demonstrate and champion site reliability culture and practices
- Lead initiatives to improve the reliability and stability of applications...
-
Technical Lead
2 days ago
Hyderabad City Taluka, Pakistan beBeeReliability Full time
As a technical leader, you have the opportunity to shape the future of employee experience technology. Your role will involve leading a critical team, driving site reliability, and contributing significantly to the success of top achievers.
Job Description: You will be responsible for conducting resiliency design reviews, breaking down complex problems into...
-
Senior Leader in Engineering Excellence
2 days ago
Hyderabad City Taluka, Pakistan beBeeSite Full time
Elevate your engineering expertise to new heights by leading a team of highly skilled professionals. As a Senior Lead Site Reliability Engineer, you will collaborate with stakeholders to define non-functional requirements and availability targets for applications and product lines. You will ensure these NFRs are integrated into product design and testing...
-
Lead Site Reliability Engineer
7 days ago
Hyderabad City Taluka, Pakistan JP Morgan Chase Full time
Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, you hold a leadership role in your team, demonstrate strong knowledge across multiple...
-
Senior Lead Site Reliability Engineer
1 day ago
Hyderabad City Taluka, Pakistan JP Morgan Chase Full time
Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability. As a Principal Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, you will work with your stakeholders to define non-functional requirements (NFRs)...
-
Senior Manager of Site Reliability Engineer
3 weeks ago
Hyderabad City Taluka, Pakistan JP Morgan Chase Full time
When you mentor and advise multiple technical teams and move financial technologies forward, it's a big challenge with big impact. You were made for this. As a Senior Manager of Software Engineering at JPMorgan Chase within the Consumer and Community Banking, you serve in a leadership role by providing technical coaching and advisory to multiple technical...
-
Senior Site Reliability Specialist
1 day ago
Hyderabad City Taluka, Pakistan beBeeReliability Full time $150,000 - $170,000
Job Opportunity
Elevate your engineering expertise by taking on a senior role in site reliability. You will join a team of skilled professionals and contribute to the development of high-quality systems.
Key Responsibilities:
- Ensure the reliability and performance of IT systems
- Collaborate with cross-functional teams to drive business outcomes
- Develop and...
-
Senior Airflow Reliability Engineer
3 days ago
Hyderabad City Taluka, Pakistan Astronomer Full time
Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow. Astro accelerates building reliable data products that unlock insights, unleash AI value, and power data-driven applications. Trusted by more than 700 of the...