AI Model Optimization

1 day ago


Lahore, Punjab, Pakistan Sublime Wireless Inc Full time

AI Model Optimization & Fine-Tuning Engineer

(On-Device, Fully Offline)

About the Role

We are seeking a hands-on AI Model Optimization Engineer with proven experience in taking large base models, fine-tuning, distilling, and quantizing them for fully offline mobile deployment. This role requires real-world experience with model compression, dataset preparation, and mobile inference optimization for Android/iOS devices.

Responsibilities

  • End-to-end pipeline: data prep → fine-tuning → distillation → quantization → mobile packaging → benchmarking.
  • Apply PTQ/QAT quantization and distillation to deploy LLMs and multimodal models onto devices with limited memory/thermal budgets.
  • Format and prepare datasets for fine-tuning (tokenization, tagging, deduplication, versioning).
  • Optimize models for battery efficiency, low latency, and minimal RAM usage.
  • Benchmark and debug inference performance with Perfetto, Battery Historian, Instruments, etc.
  • Collaborate with app teams to integrate optimized models.

Mandatory Skills Checklist (Applicants must demonstrate experience in ALL of the following)

Quantization & Distillation

  • Post-Training Quantization (PTQ) and Quantization-Aware Training (QAT).
  • Methods like AWQ, GPTQ, SmoothQuant, RPTQ.
  • Knowledge of 4-bit/8-bit schemes (INT4, INT8, FP4, NF4).
  • Distillation methods (teacher–student, logit matching, feature distillation).

Fine-Tuning & Data Handling

  • LoRA/QLoRA/DoRA/AdaLoRA fine-tuning.
  • Instruction-tuning pipelines with PyTorch + Hugging Face.
  • Dataset formatting: JSONL, multi-turn dialogs, tagging, tokenization (SentencePiece/BPE).
  • Deduplication, stratified sampling, and eval set creation.

On-Device Deployment

  • Hands-on with at least two runtimes: / GGUF, MLC LLM, ExecuTorch, ONNX Runtime Mobile, TensorFlow Lite, Core ML.
  • Experience with hardware acceleration: Metal (iOS), NNAPI (Android), GPU/Vulkan, Qualcomm DSP/NPU, XNNPACK.
  • Real-world deployment: must provide examples of models running fully offline on mobile (tokens/s, RAM usage, device specs).

Performance & Benchmarking

  • Tools: Perfetto, systrace, Battery Historian, adb stats (Android); Xcode Instruments, Energy Log (iOS).
  • Profiling decode speed, cold start vs. warm start latency, RAM usage, and energy consumption.

General

  • Strong PyTorch and Hugging Face experience.
  • Clear documentation and ability to explain optimization trade-offs.

Nice to Have

  • Open-source contributions to LLM quantization/edge-AI frameworks.
  • Prior deployment of Qwen, LLaMA, Gemma, or Mistral families onto mobile devices.
  • Multilingual or low-resource dataset experience (Urdu, Arabic, Hindi, etc.), including tokenization, script handling, and fine-tuning.
  • Familiarity with multimodal (ASR/TTS/VAD) integration on device.

Application Requirements

Applicants must include in their application:

  • A short case study of a model they have fine-tuned (dataset + method + results).
  • A short case study of a model they have quantized/distilled for mobile (framework + bit-depth + device + performance metrics).
  • Links to GitHub repos, papers, or APK/TestFlight builds if available.

Job Type: Full-time

Pay: Rs250, Rs400,000.00 per month

Work Location: In person


  • AI Engineer

    1 day ago


    Lahore, Punjab, Pakistan Greetly AI Full time

    Job Title:AI EngineerExperience Level:2–3 YearsLocation:RemoteEmployment Type:Full-timeAbout the RoleWe are seeking an AI Engineer with 2–3 years of hands-on industry experience. The ideal candidate will have strong proficiency in Python, experience with modern AI frameworks, and a solid understanding of cloud-based AI infrastructure. You will build,...


  • Lahore, Punjab, Pakistan Liztek Full time 1,200,000 - 3,600,000 per year

    Senior AI Solutions Architect – Data Modeling & Compliance IntelligenceJoin our LinkedIn Liztek communityJob Description:We are seeking a highly skilled AI Solutions Architect with deep expertise in AI-driven data modeling, validation, and compliance integration. This role will lead the design and deployment of intelligent systems that unify diverse data...

  • AI Content

    7 days ago


    Lahore, Punjab, Pakistan Evren AI Full time $80,000 - $120,000 per year

    Company DescriptionEvren AI is committed to bridging the gap between advanced AI technology and real-world business needs by offering bespoke solutions tailored to each client. With expertise in AI-powered automation, chatbots, personalized recommendations, and predictive analytics, we enable businesses to streamline operations, enhance customer experiences,...

  • AI Automation

    2 weeks ago


    Lahore, Punjab, Pakistan Vertex Flow AI Full time 1,200,000 - 3,600,000 per year

    We are seeking asmart, reliable, and creative AI Automation & Marketing Specialistwho can seamlessly blend technical execution with marketing strategy. The ideal candidate will help us build and optimize scalable systems forcoaches,consultants, andservice-based businesseswhile focusing onAI-driven automation,lead generation, andsales funnel optimization.You...

  • AI Engineer

    1 day ago


    Lahore, Punjab, Pakistan Cube Discipline Full time 800,000 - 1,200,000 per year

    We are looking for a skilled AI Engineer to design, develop, and implement artificial intelligence and machine learning solutions. The ideal candidate will have strong expertise in ML/DL frameworks, data processing, and deploying AI models into production environments.Key ResponsibilitiesDesign and develop AI/ML models to solve business problems.Preprocess,...

  • Senior AI Developer

    2 weeks ago


    Lahore, Punjab, Pakistan Aegasis Labs Full time 900,000 - 1,200,000 per year

    We are currently seeking a talented Python AI engineer with strong expertise in AI and Python to join our dynamic team. This is an exciting opportunity to work on groundbreaking projects and make a significant impact in the field of artificial intelligence and Ai agents.The ideal candidate will have a strong background in Python, expertise in Generative AI,...

  • AI Engineer

    1 day ago


    Lahore, Punjab, Pakistan Mobeevo Full time

     About UsMobeevo empowers businesses to thrive in today's digital landscape by driving sustainable growth and unlocking the transformative power of data and AI. From strategic consulting to seamless AI integration, our services enable organizations to navigate their digital and AI journey with confidence, focusing on management consulting, program...


  • Lahore, Punjab, Pakistan Alethea AI Full time 900,000 - 1,200,000 per year

    DevOps Engineer (Full-Time, Onsite)Are you a DevOps engineer who loves building reliable foundations for fast-moving teams? Alethea AI Labs is hiring a DevOps Engineer to own cloud, containers, and CI/CD, enabling rapid, secure delivery of AI and Web3 products at scale.Alethea AI Labs is leading the Agentic AI movement across industries. Through partnerships...


  • Lahore, Punjab, Pakistan Alethea AI Full time 900,000 - 1,200,000 per year

    Python Engineer (Full-Time, Onsite )Are you a backend engineer who loves building scalable services that power intelligent products? Alethea AI Labs is hiring a Python/Backend Engineer to ship APIs, integrate models, and optimize performance for AI and Web3 experiences used by communities.Alethea AI Labs is leading the Agentic AI movement across industries....

  • AI Engineer

    2 weeks ago


    Lahore, Punjab, Pakistan Invowork Full time 900,000 - 1,200,000 per year

    AI Engineer (1–2 Years Experience)Lahore, Pakistan (On-site)Full-timeAbout the Role:Invowork is looking for a passionate and innovativeAI Engineerwith1–2 years of hands-on experiencein developing, training, and deploying AI/ML models. You'll be working on exciting real-world projects that combine data, automation, and intelligence to build scalable...