Site Reliability Engineering Lead

4 weeks ago


Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time

Workflows, Information Technology Infrastructure Library (ITIL), IT Service Management (ITSM), Splunk, IT Operations Management (ITOM)

Description

GSPANN is hiring a Site Reliability Engineering (SRE) Lead with 10+ years of experience in IT Service Management (ITSM) and Application Performance Monitoring (APM). The ideal candidate will have hands-on expertise with Datadog, a strong grasp of IT operations, and the ability to implement workflow automation and drive operational excellence through data and KPIs.

Location: Hyderabad / Any Offshore Location

Role Type: Full Time

Published On: 28 March 2025

Experience: 10+ Years

Share this job

Description

GSPANN is hiring a Site Reliability Engineering (SRE) Lead with 10+ years of experience in IT Service Management (ITSM) and Application Performance Monitoring (APM). The ideal candidate will have hands-on expertise with Datadog, a strong grasp of IT operations, and the ability to implement workflow automation and drive operational excellence through data and KPIs.

Role and Responsibilities

  • Apply deep knowledge of Information Technology Infrastructure Library (ITIL v4) and ITSM platforms. (Certification is preferred).
  • Use Datadog to monitor performance, infrastructure, and digital experience (RUM, Synthetic Monitoring, etc.).
  • Implement complex process workflows and track performance using metrics-driven reporting.
  • Demonstrate a strong understanding of IT Operations and its impact on application reliability.
  • Communicate technical concepts clearly and concisely to both technical teams and executive leadership.
  • Build strategic relationships across teams, departments, business stakeholders, and external partners.
  • Translate business requirements into measurable KPIs that reflect application stability and provide business insights.
  • Troubleshoot recurring issues with a focus on incident reduction and operational automation.
  • Identify Toil (manual, repetitive work) and propose automation opportunities.
  • React quickly to time-sensitive issues with strong problem-solving and decision-making skills.

Skills And Experience
  • 7+ years of experience in ITIL/ITSM management.
  • 3+ years working with Datadog APM tools, including infrastructure monitoring, logs, and digital experience components.
  • Proven experience in administering the Datadog platform across its various features.
  • Prior experience in a similar application support or SRE leadership role.
  • Familiarity with additional monitoring tools and modern observability technologies.
  • Excellent analytical, troubleshooting, and problem-solving skills.
  • Strong communication and organizational capabilities.
  • Ability to manage multiple tasks while prioritizing effectively.
#J-18808-Ljbffr

  • Hyderabad City Taluka, Pakistan JP Morgan Chase Full time

    Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, you hold a leadership role in your team, demonstrate strong knowledge across multiple...


  • Hyderabad City Taluka, Pakistan JP Morgan Chase Full time

    Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.As a Principal Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, you will work with your stakeholders to define non-functional requirements (NFRs)...


  • Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time

    Site Reliability Engineering (SRE), Python, Django, FastAPI, Flask, SQL, RESTful, pytestDescriptionGSPANN is hiring a Site Reliability Engineer with to ensure high availability and performance of critical systems using tools like Prometheus and Nagios. The role involves developing reliable Python code, managing APIs, and optimizing system efficiency across...


  • Hyderabad City Taluka, Pakistan Zscaler Full time

    About ZscalerServing thousands of enterprise customers around the world including 40% of Fortune 500 companies, Zscaler (NASDAQ: ZS) was founded in 2007 with a mission to make the cloud a safe place to do business and a more enjoyable experience for enterprise users. As the operator of the world's largest security cloud, Zscaler accelerates digital...

  • Reliable IT Manager

    4 days ago


    Hyderabad City Taluka, Pakistan beBeeMonitoring Full time

    Job TitleSRE Role for IT Service Management and MonitoringJob DescriptionWe are seeking a seasoned professional with 8+ years of experience in IT Service Management to join our team as a Site Reliability Engineer. The successful candidate will be responsible for building resilient monitoring systems, improving application observability, and ensuring...

  • Sr Lead Engineer

    2 weeks ago


    Hyderabad City Taluka, Pakistan Qualcomm Technologies, Inc Full time

    Company:Qualcomm India Private LimitedJob Area:Engineering Group, Engineering Group > Software EngineeringGeneral Summary:Job Description:Position Overview: As a Senior Embedded Systems Engineer, you will play a critical role in the design, development, and maintenance of embedded systems and software. You will work closely with cross-functional teams to...

  • Sr Lead Engineer

    4 weeks ago


    Hyderabad City Taluka, Pakistan Qualcomm Technologies, Inc Full time

    Company:Qualcomm India Private LimitedJob Area:Engineering Group, Engineering Group > Hardware EngineeringGeneral Summary:As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Hardware...

  • Azure Data Engineers

    4 weeks ago


    Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time

    Azure Data Factory, Azure, Azure API Management, Azure Databricks, Azure DevOps, Azure Data Lake, SQL, Synapse Data Warehouse, Azure CosmosDB, Power BI, Site Reliability Engineering (SRE)DescriptionGSPANN is hiring Azure Data Engineers with expertise in Site Reliability Engineering (SRE) to optimize and automate large-scale data applications. The role...

  • Lead Engineer

    2 weeks ago


    Hyderabad City Taluka, Pakistan Qualcomm Technologies, Inc Full time

    Company:Qualcomm India Private LimitedJob Area:Engineering Group, Engineering Group > Software EngineeringGeneral Summary:As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Software...

  • Site Engineer

    3 weeks ago


    Gujujranwala City Tehsil, Pakistan Empire Lifestyle5 Full time

    At Empire Lifestyle, we are passionate about crafting exceptional living experiences through innovative design and construction. As a forward-thinking brand in the lifestyle and real estate sector, we blend modern aesthetics with functionality to create spaces that inspire. With a strong presence in Pakistan and growing ambitions, we're building more than...