Senior AI/ML Operations Engineer

3 weeks ago


Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time

AIOps, MLOps, DevOps, Python, Docker, Kubernetes, CI/CD Pipelines, AWS, GCP, Azure, MLflow, Terraform, Prometheus/Grafana

Description

GSPANN is hiring a Senior AI/ML Operations Engineer. The role focuses on building AIOps/MLOps systems and automating ML pipelines.

Location: Hyderabad / Pune / Gurugram

Role Type: Full Time

Published On: 1 July 2025

Experience: 6+ Years

Share this job

Description

GSPANN is hiring a Senior AI/ML Operations Engineer. The role focuses on building AIOps/MLOps systems and automating ML pipelines.

Role and Responsibilities

  • Architect and drive the implementation of scalable Artificial Intelligence for IT Operations (AIOps) and Machine Learning Operations (MLOps) frameworks.
  • Mentor junior engineers and data scientists by sharing best practices in model deployment and operational excellence.
  • Align technical strategies with business objectives through close collaboration with product managers, Site Reliability Engineers (SREs), and other key stakeholders.
  • Establish and uphold engineering standards, including Service-Level Agreements (SLAs), Service-Level Indicators (SLIs), and Service-Level Objectives (SLOs) for machine learning and AIOps services.
  • Design and manage Machine Learning (ML) CI/CD (Continuous Integration/Continuous Deployment) pipelines for model training, testing, deployment, and monitoring using tools such as Kubeflow, MLflow, and Apache Airflow.
  • Implement robust monitoring systems to track model performance metrics like drift, latency, and accuracy, and automate retraining workflows where necessary.
  • Lead model governance efforts by ensuring reproducibility, traceability, and compliance with frameworks such as FAIR (Findable, Accessible, Interoperable, Reusable), and maintaining audit logs.
  • Build AI/ML-powered solutions for proactive infrastructure monitoring, predictive alerting, and intelligent incident resolution.
  • Enhance anomaly detection and root cause analysis by integrating and optimizing observability tools such as Prometheus, Grafana, ELK (Elasticsearch, Logstash, Kibana), Dynatrace, Splunk, and Datadog.
  • Automate response workflows using predefined playbooks, runbooks, and self-healing systems.
  • Apply statistical techniques and machine learning models to analyze logs, metrics, and distributed traces at scale.

Skills And Experience
  • Bachelor's or Master's degree in Computer Science, Data Engineering, Artificial Intelligence, Machine Learning, or a related field.
  • Certifications in AWS/GCP DevOps, Kubernetes, or MLOps is desirable.
  • 6+ years of hands-on experience in DevOps, MLOps, or AIOps, including at least 2 years in a leadership or senior engineering capacity.
  • Demonstrate expert-level coding skills in Python and Bash, with working knowledge of Go or Java.
  • Use Docker for containerization and Kubernetes for orchestration across major cloud providers like Amazon Web Services (AWS), Google Cloud Platform (GCP), and Microsoft Azure.
  • Work with CI/CD tools and infrastructure-as-code technologies like Terraform, Ansible, and Helm.
  • Possess in-depth knowledge of ML lifecycle management, performance monitoring, and pipeline orchestration.
  • Maintain large-scale observability and telemetry platforms effectively.
  • Work with streaming data technologies including Apache Kafka, Apache Spark, and Apache Flink.
  • Manage service mesh architectures such as Istio or Linkerd to ensure secure and efficient service communication.
  • Understand data privacy and regulatory standards including the General Data Protection Regulation (GDPR) and Health Insurance Portability and Accountability Act (HIPAA).
#J-18808-Ljbffr

  • Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time

    OverviewGSPANN Technologies, Inc is hiring an AI Operations Engineer. The role focuses on deploying ML models, automating CI/CD pipelines, and implementing AIOps solutions.Location: Hyderabad / Pune / GurugramRole Type: Full TimeExperience: 3+ YearsResponsibilitiesBuild, automate, and manage continuous integration and continuous deployment (CI/CD) pipelines...


  • Hyderabad City Taluka, Pakistan DigitalOcean LLC Full time

    Dive in and do the best work of your career at DigitalOcean. Journey alongside a strong community of top talent who are relentless in their drive to build the simplest scalable cloud. If you have a growth mindset, naturally like to think big and bold, and are energized by the fast-paced environment of a true industry disruptor, you'll find your place here....


  • Hyderabad City Taluka, Pakistan JP Morgan Chase Full time

    When you mentor and advise multiple technical teams and move financial technologies forward, it's a big challenge with big impact. You were made for this.As a Senior Manager of Software Engineering at JPMorganChase within the Commercial & Investment Bank - Payments Technology team , you serve in a leadership role by providing technical coaching and advisory...

  • Principal AI Engineer

    2 weeks ago


    Hyderabad City Taluka, Pakistan Backbase Full time

    Principal AI EngineerBackbase has ushered in a new era of digital banking with the global launch of its AI-powered Banking Platform, recently lighting up Times Square. This milestone marks a bold step in reshaping the digital banking landscape—empowering banks to move beyond generative AI experiments and into full-scale execution. By automating critical...


  • Hyderabad City Taluka, Pakistan Warner Bros. Discovery, Inc. Full time

    Welcome to Warner Bros. Discovery… the stuff dreams are made of.Who We Are…When we say, "the stuff dreams are made of," we're not just referring to the world of wizards, dragons and superheroes, or even to the wonders of Planet Earth. Behind WBD's vast portfolio of iconic content and beloved brands, are the storytellers bringing our characters to life,...


  • Hyderabad City Taluka, Pakistan GSPANN Technologies, Inc Full time

    OverviewGSPANN Technologies, Inc is hiring Azure Data Engineers with expertise in Site Reliability Engineering (SRE) to optimize and automate large-scale data applications. The role focuses on ensuring system reliability and performance using Azure data services such as Azure Data Factory, Azure Databricks, Azure Cosmos DB, and Power...


  • Hyderabad City Taluka, Pakistan DigitalOcean LLC Full time

    Dive in and do the best work of your career at DigitalOcean. Journey alongside a strong community of top talent who are relentless in their drive to build the simplest scalable cloud. If you have a growth mindset, naturally like to think big and bold, and are energized by the fast-paced environment of a true industry disruptor, you'll find your place here....


  • Hyderabad City Taluka, Pakistan beBeeAIops Full time 900,000 - 1,200,000

    Job Title: AI Operations Engineer">The role focuses on deploying machine learning models, automating CI/CD pipelines, and implementing AIOps solutions.">This position involves working with data scientists to transition machine learning models from experimentation to production environments.">You will use tools such as Docker, Kubernetes, MLflow, or Kubeflow...


  • Faisalabad City Tehsil, Pakistan Velocity AI Full time

    Job Title: Full Stack Developer (AI/Machine Learning Specialist)Location: RemoteWe are a startup seeking a talented Full Stack Developer to join our team and help build a revolutionary new product. This is a unique opportunity for a self-starter who is excited about being the first technical hire and working on a cutting-edge project from...


  • Hyderabad City Taluka, Pakistan DigitalOcean LLC Full time

    Dive in and do the best work of your career at DigitalOcean. Journey alongside a strong community of top talent who are relentless in their drive to build the simplest scalable cloud. If you have a growth mindset, naturally like to think big and bold, and are energized by the fast-paced environment of a true industry disruptor, you'll find your place here....