AI Inferencing Expert

1 day ago


Hyderabad City Taluka, Pakistan beBee Careers Full time

**Role Summary:**

We are seeking a highly skilled and experienced AI inferencing expert to spearhead the development and commercialization of a cutting-edge AI Runtime SDK. This individual will push performance boundaries with large models, stay abreast of GenAI advancements, and drive the work forward with a passion for the role of the edge in AI's evolution.

**Responsibilities:**

The selected candidate will lead the development and deployment of large C/C++ software stacks using best practices, drawing on expertise in deploying LLMs/Transformers and in edge-based GenAI deployment. They will also collaborate with a globally diverse team, manage tasks independently, and communicate and present clearly.

**Requirements:**

  • Strong grasp of Generative AI models (LLMs, LVMs, LMMs) and their building blocks (self-attention, cross-attention, KV caching, etc.).
  • Understanding of floating-point and fixed-point representations and quantization concepts.
  • Experience optimizing algorithms for AI hardware targets such as CPUs, GPUs, and NPUs.
  • Hands-on experience in C/C++ programming, design patterns, and OS concepts.
  • Excellent analytical and debugging skills.
  • Exposure to shell scripts, Python scripts, Linux/Windows systems, and automation scripts and environments.
  • Good communication and presentation skills, and the ability to manage tasks independently.
  • Ability to collaborate across a globally diverse team with multiple interests.

**Preferred Qualifications:**

  • Strong understanding of SIMD processor architecture and system design.
  • Proficiency in object-oriented software development.
  • Familiarity with Linux and Windows environments.
  • Strong background in kernel development for SIMD architectures.
  • Familiarity with frameworks like llama.cpp, MLX, and MLC is a plus.
  • Good knowledge of PyTorch, TFLite, and ONNX Runtime is preferred.
  • Experience with parallel computing systems and languages like OpenCL and CUDA is a plus.


  • Faisalabad City Tehsil, Pakistan Velocity AI Full time

    AI and Machine Learning Expert. Velocity AI, Pakistan. Job Title: Full Stack Developer (AI/Machine Learning Specialist). Location: Remote. We are a startup seeking a talented Full Stack Developer to join our team and help build a revolutionary new product. This is a unique opportunity for a self-starter who is excited about the prospect of being the first...


  • Hyderabad City Taluka, Pakistan Qualcomm Technologies, Inc Full time

    Company: Qualcomm India Private Limited. Job Area: Engineering Group, Engineering Group > Software Engineering. General Summary: Join the exciting Generative AI team at Qualcomm focused on integrating cutting-edge GenAI models on Qualcomm chipsets. The team uses Qualcomm chips' extensive heterogeneous computing capabilities to allow inference of GenAI models...

  • GenAI Engineer

    1 day ago


    Hyderabad City Taluka, Pakistan beBee Careers Full time

    **Job Description:** We are looking for a highly motivated and experienced AI engineer to join our team. As an AI inferencing expert, you will be responsible for developing and commercializing cutting-edge AI technology. Your primary focus will be on pushing performance boundaries with large models and staying up-to-date with the latest advancements in...

  • AI/ML Specialist

    1 day ago


    Hyderabad City Taluka, Pakistan beBee Careers Full time

    AI/ML Expert. We are seeking an AI/ML expert to join our team in a role that requires technical expertise and leadership skills. In this position, you will be responsible for designing, developing, and implementing AI/ML initiatives and proofs of concept. Key Responsibilities: Act as a senior technical developer in the AI ML pod covering DCOO, procurement, and...

  • AI/ML Expert

    1 day ago


    Hyderabad City Taluka, Pakistan beBee Careers Full time

    Senior AI/ML Developer. This is an exciting opportunity to join our team as a senior AI/ML developer. In this role, you will be responsible for designing, developing, and implementing AI/ML initiatives and proofs of concept. Job Description: We are looking for a highly skilled individual with experience in AI/ML technologies to join our team. The ideal candidate...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    Key Responsibilities: As an AI Solutions Architect, your key responsibilities will include: Designing and implementing scalable AI-driven workflows improving IT operations. Developing techniques for large language models (LLMs) to reliably author complex behaviors. Providing technical advice, troubleshooting, and analysis as a technical solutions...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    We are seeking an experienced Architect to join our team. As a key member, you will be responsible for designing, developing, and deploying advanced AI solutions with autonomous, proactive, and adaptive behaviors. This role demands a deep understanding of AI/ML technologies, integration, automation, and solution architecture, along with strong problem-solving...

  • Edge AI Developer

    1 day ago


    Hyderabad City Taluka, Pakistan beBee Careers Full time

    **GenAI Expert Role:** We are seeking a seasoned GenAI expert to spearhead the development and commercialization of our cutting-edge AI Runtime SDK. The selected candidate will be responsible for driving the work forward with a passion for the role of edge in AI's evolution, pushing performance boundaries with large models, and staying abreast of GenAI...


  • Hyderabad City Taluka, Pakistan Workato Full time

    About Workato: Workato transforms technology complexity into business opportunity. As the leader in enterprise orchestration, Workato helps businesses globally streamline operations by connecting data, processes, applications, and experiences. Its AI-powered platform enables teams to navigate complex workflows in real-time, driving efficiency and...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    Role Summary: The Generative AI team focuses on integrating cutting-edge GenAI models on Qualcomm chipsets. Leveraging Qualcomm chips' heterogeneous computing capabilities, the team enables inference of GenAI models on-device without cloud connectivity. Main Responsibilities: