Senior Machine Learning

6 days ago


Lahore, Punjab, Pakistan | Tek Headquarters | Full time | 3,500,000 - 7,000,000 per year

About the Role

We at TEKHQS are seeking an elite ML/AI Engineer with deep theoretical and engineering mastery of deep learning, LLMs, and generative architectures. You must have trained, fine-tuned, benchmarked, and deployed transformer-based models at scale, understand the research behind every paper you cite, and have shipped production-grade ML systems across distributed GPU environments.

This is a zero-to-one role: no prebuilt dataset, no precleaned labels, no off-the-shelf pipelines. Just research, code, and compute.

Key Responsibilities

  • Designing and training foundation models (e.g., GPT, T5, Mistral, LLaMA) from scratch on multi-node GPU clusters.
  • Leading full-stack GenAI architecture: tokenizer design, attention variants, pretraining schemes, alignment (RLHF), quantization, serving.
  • Conducting frontier research in model optimization, mixture-of-experts (MoE), context length extrapolation, prompt tuning, and memory efficiency.
  • Implementing and extending SOTA methods from arXiv (e.g., Phi-3, Gemini, FlashAttention-2, GQA, LLaVA, Orca, DPO, ZeRO-Infinity).
  • Managing the model lifecycle: data curation → pretraining → fine-tuning → evaluation → quantization → deployment → postmortems.
  • Working with multi-modal inputs (text, image, video, audio, embeddings) for cross-domain GenAI systems.
  • Driving production-grade optimization: low-latency inference, batching strategies, CUDA kernel debugging, memory offloading, model sharding.

Requirements

Core ML/DL:

  • Transformers, self-attention, residual connections, GELUs, normalization strategies (RMSNorm, LayerNorm, etc.)
  • LLM scaling laws, curriculum learning, token sampling strategies (top-k, top-p/nucleus, temperature, Mirostat)
  • Contrastive learning, masked modeling, autoregressive generation, denoising diffusion

Training Infrastructure:

  • PyTorch Lightning, DeepSpeed, HuggingFace Accelerate, Fully Sharded Data Parallel (FSDP), ZeRO-3
  • GPU/TPU cluster management, distributed checkpointing, mixed precision (fp16, bfloat16), quantization-aware training
  • Data streaming at scale with WebDataset, TFRecord, Parquet

Model Optimization & Serving:

  • Quantization: GPTQ, AWQ, SmoothQuant, LLM.int8(), QLoRA
  • Compilers, runtimes, and formats: TensorRT, Torch-TensorRT, TVM, ONNX Runtime, XLA, GGUF
  • Model serving: Triton Inference Server, vLLM, HuggingFace Text Generation Inference (TGI)

GenAI Systems:

  • RLHF pipelines: reward modeling, PPO, DPO, ORPO, RLAIF
  • Retrieval-Augmented Generation (RAG): hybrid semantic search, vectorDB integration, prompt composition
  • Tokenizer development: SentencePiece, tiktoken, Byte Pair Encoding (BPE), Unigram LM

Software Engineering:

  • Clean, modular, testable Python code with CI/CD pipelines
  • Profiling tools: PyTorch Profiler, Nsight, nvtop, nvprof, memory profiler
  • Containerization: Docker, NVIDIA Container Toolkit, Kubernetes for distributed training

Mathematical Rigor:

  • Optimization: AdamW, Lion, RMSProp, gradient clipping, learning rate schedulers
  • Loss functions: Cross-entropy, KL divergence, cosine similarity, contrastive losses
  • Strong command of linear algebra, probability, statistics, and numerical methods

Experience: 4+ years hands-on (minimum 3 years in GenAI and transformers)

Job Type: Hybrid

Job Time: 3 pm to 9 pm from office and 11 pm to 1 am from home

Location: DHA Phase 6 Lahore

About Us:

TEKHQS is a global technology solutions provider headquartered in Lake Forest, California, with an offshore team of 300+ experts based in Pakistan. We specialize in Web 2.0 (Web & Mobile App Development), Web 3.0 (Blockchain & Crypto Platform Development), AI/ML Solutions, and ERP services as a certified partner of SAP S/4HANA, Oracle NetSuite, and Microsoft Dynamics 365 Business Central. Our expertise includes implementation, training, customization, integration, support, IT staff augmentation, and certified ERP consultancy.

Job Type: Full-time

Work Location: In person


