AI Speech Engineer – Custom TTS

2 weeks ago


Lahore, Punjab, Pakistan | Sublime Wireless Inc | Full time

AI Speech Engineer – Custom TTS (On-Device, Multilingual)

About the Role

We are looking for an experienced AI Speech Engineer with deep expertise in fine-tuning open-source TTS engines using custom voice datasets. The role requires the ability to build high-quality, multilingual, expressive TTS voices (with cues, non-verbal expressions, and prosody variations), and to optimize them for fully offline use on mobile devices.

You will be responsible for creating a world-class TTS pipeline: from dataset preparation → fine-tuning → evaluation → on-device deployment (Android/iOS).

Responsibilities

  • Fine-tune open-source TTS models (e.g., VITS, Glow-TTS, Tacotron2, FastSpeech2, Coqui TTS, Fairseq S2T, Bark-like models).
  • Build custom voices from multi-actor recordings, including prosody cues, NVEs (non-verbal expressions), laughter, sighs, whispers, emphasis, and emotional tones.
  • Format and preprocess multilingual datasets (e.g., English, Urdu, Arabic, Hindi).
  • Implement voice cloning and speaker adaptation methods (speaker embeddings, x-vectors, HuBERT/ContentVec conditioning).
  • Apply the latest fine-tuning techniques (gradual unfreezing, adversarial training, vocoder adaptation, multi-speaker conditioning).
  • Deploy optimized TTS engines on-device using ONNX Runtime, TensorFlow Lite, Core ML, or custom inference runtimes.
  • Optimize for low-latency, low-memory, and battery-efficient speech generation on mobile CPUs/NPUs/GPUs.
  • Evaluate output quality (MOS, prosody accuracy, multilingual pronunciation consistency).
  • Collaborate with engineers to integrate TTS modules into apps for real-time, offline speech synthesis.

Mandatory Skills Checklist (Applicants must demonstrate experience in ALL of the following)

TTS Model Fine-Tuning

  • Hands-on fine-tuning of open-source TTS engines (Coqui TTS, VITS, Glow-TTS, Tacotron, FastSpeech).
  • Building multilingual and multi-speaker models.
  • Dataset alignment: phoneme extraction, grapheme-to-phoneme (G2P), forced alignment (Montreal Forced Aligner (MFA) or equivalent).
  • Handling prosody, cues, and NVEs in dataset labeling.

Voice Dataset Engineering

  • Turning raw actor recordings into a cleaned, labeled dataset.
  • Handling multilingual phoneme sets (IPA, G2P for Urdu, Arabic, Hindi, English).
  • Speaker embedding extraction (d-vectors, x-vectors, ECAPA, HuBERT units).
  • Noise reduction, augmentation, silence trimming, forced alignment.
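
As an illustration of the silence-trimming step listed above, here is a minimal sketch in plain Python. The function name `trim_silence`, the amplitude threshold, and the frame length are all assumptions chosen for the example; a production pipeline would more likely work on real audio arrays with an audio library and a tuned threshold.

```python
from typing import List


def trim_silence(samples: List[float], threshold: float = 0.01,
                 frame_len: int = 4) -> List[float]:
    """Drop leading and trailing frames whose peak amplitude is below threshold.

    All parameter values here are illustrative assumptions, not recommendations.
    """
    # Split the signal into fixed-size frames (the last frame may be shorter).
    frames = [samples[i:i + frame_len] for i in range(0, len(samples), frame_len)]
    # Indices of frames that contain at least one sample above the threshold.
    voiced = [i for i, frame in enumerate(frames)
              if max(abs(x) for x in frame) >= threshold]
    if not voiced:
        return []  # the whole clip is silence
    start = voiced[0] * frame_len
    end = min((voiced[-1] + 1) * frame_len, len(samples))
    return samples[start:end]


# Toy clip: 8 silent samples, 4 voiced samples, 8 silent samples.
clip = [0.0] * 8 + [0.5, -0.4, 0.3, 0.2] + [0.0] * 8
trimmed = trim_silence(clip)
print(len(clip), len(trimmed))
```

Only the edges are trimmed; pauses inside the utterance are kept, which matters when prosody and non-verbal cues are part of the labels.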

On-Device Deployment

  • Exporting TTS models to ONNX/TFLite/Core ML.
  • Running inference with optimized vocoders (HiFi-GAN, WaveGlow, Parallel WaveGAN).
  • Experience with quantization/pruning of speech models for mobile.
  • Benchmarking real-time inference: latency, RAM usage, and energy efficiency.
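
The benchmarking item above can be sketched as follows. This is a hedged, desktop-only illustration: `fake_synthesize` is a hypothetical stand-in for a real TTS call, the sample rate is assumed, and `tracemalloc` only sees Python-level allocations, so real on-device RAM and energy figures would have to come from the platform's own profiling tools.

```python
import time
import tracemalloc
from typing import Callable, Dict, List

SAMPLE_RATE = 22050  # assumed output sample rate in Hz


def fake_synthesize(text: str) -> List[float]:
    """Hypothetical stand-in for a TTS engine: 0.05 s of silence per character."""
    return [0.0] * int(0.05 * SAMPLE_RATE * len(text))


def benchmark(synthesize: Callable[[str], List[float]], text: str,
              runs: int = 5) -> Dict[str, float]:
    """Measure mean latency, real-time factor, and peak Python memory."""
    latencies = []
    audio_seconds = 0.0
    tracemalloc.start()
    for _ in range(runs):
        t0 = time.perf_counter()
        audio = synthesize(text)
        latencies.append(time.perf_counter() - t0)
        audio_seconds = len(audio) / SAMPLE_RATE
    _, peak_bytes = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    mean_latency = sum(latencies) / runs
    return {
        "mean_latency_s": mean_latency,
        # Real-time factor: processing time divided by audio duration.
        # Values below 1.0 mean synthesis runs faster than playback.
        "rtf": mean_latency / audio_seconds,
        "peak_python_mem_mb": peak_bytes / 1e6,
    }


stats = benchmark(fake_synthesize, "Hello from an offline voice.")
print(stats)
```

Reporting the real-time factor alongside raw latency makes results comparable across utterances of different lengths.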

Latest Techniques Knowledge

  • Expressive/controllable TTS (prosody embeddings, global style tokens (GST), variational prosody models).
  • Speaker adaptation & cross-lingual voice transfer.
  • Handling low-resource languages (Urdu, Arabic).
  • Evaluation frameworks (MOS testing, AB preference tests, WER for intelligibility).
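
To make the evaluation item above concrete, here is a minimal sketch of two of the listed measures. It assumes WER is computed between the input text and an ASR transcript of the synthesized audio (the transcript here is typed in by hand), and that MOS ratings are summarized with a normal-approximation 95% confidence interval; the function names are illustrative.

```python
import math
from statistics import mean, stdev
from typing import Sequence, Tuple


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] = edit distance between the ref words seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            curr[j] = min(prev[j] + 1,      # deletion
                          curr[j - 1] + 1,  # insertion
                          substitution)
        prev = curr
    return prev[-1] / len(ref)


def mos_with_ci(ratings: Sequence[float]) -> Tuple[float, float]:
    """Mean opinion score and half-width of an approximate 95% interval."""
    half_width = 1.96 * stdev(ratings) / math.sqrt(len(ratings))
    return mean(ratings), half_width


wer = word_error_rate("the cat sat on the mat", "the cat sat on mat")
mos, ci = mos_with_ci([4, 5, 4, 3, 4])
print(f"WER={wer:.3f}  MOS={mos:.2f} +/- {ci:.2f}")
```

With only a handful of raters the interval is wide, which is one reason MOS results are usually paired with AB preference tests.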

Nice to Have

  • Contributions to open-source TTS projects (Coqui, ESPnet, Fairseq, Bark, etc.).
  • Experience with speech-to-speech systems or multimodal pipelines.
  • Familiarity with distillation/quantization of TTS models for edge devices.
  • Experience with custom vocoder design for emotional or non-verbal cues.

Application Requirements

Applicants must include:

  • A short case study of a TTS model they fine-tuned (dataset type, model used, output samples).
  • A short case study of deploying a TTS model on-device (framework, device, latency, memory usage).
  • Links to audio samples, demos, GitHub repos, or production apps showing custom voices.

Job Type: Full-time

Pay: Rs250,000.00 - Rs400,000.00 per month

Work Location: In person

