
Site Reliability Engineering Lead
3 days ago
Workflows, Information Technology Infrastructure Library (ITIL), IT Service Management (ITSM), Splunk, IT Operations Management (ITOM)
Description
GSPANN is hiring a Site Reliability Engineering (SRE) Lead with 10+ years of experience in IT Service Management (ITSM) and Application Performance Monitoring (APM). The ideal candidate will have hands-on expertise with Datadog, a strong grasp of IT operations, and the ability to implement workflow automation and drive operational excellence through data and KPIs.
Location: Hyderabad / Any Offshore Location
Role Type: Full Time
Published On: 28 March 2025
Experience: 10+ Years
Role and Responsibilities
- Apply deep knowledge of Information Technology Infrastructure Library (ITIL v4) and ITSM platforms (certification preferred).
- Use Datadog to monitor performance, infrastructure, and digital experience (RUM, Synthetic Monitoring, etc.).
- Implement complex process workflows and track performance using metrics-driven reporting.
- Demonstrate a strong understanding of IT Operations and its impact on application reliability.
- Communicate technical concepts clearly and concisely to both technical teams and executive leadership.
- Build strategic relationships across teams, departments, business stakeholders, and external partners.
- Translate business requirements into measurable KPIs that reflect application stability and provide business insights.
- Troubleshoot recurring issues with a focus on incident reduction and operational automation.
- Identify toil (manual, repetitive work) and propose automation opportunities.
- React quickly to time-sensitive issues with strong problem-solving and decision-making skills.
Skills and Experience
- 7+ years of experience in ITIL/ITSM management.
- 3+ years working with Datadog APM tools, including infrastructure monitoring, logs, and digital experience components.
- Proven experience in administering the Datadog platform across its various features.
- Prior experience in a similar application support or SRE leadership role.
- Familiarity with additional monitoring tools and modern observability technologies.
- Excellent analytical, troubleshooting, and problem-solving skills.
- Strong communication and organizational capabilities.
- Ability to manage multiple tasks while prioritizing effectively.