AI Speech Engineer – Custom TTS
2 weeks ago
AI Speech Engineer – Custom TTS (On-Device, Multilingual)
About the Role
We are looking for an experienced AI Speech Engineer with deep expertise in fine-tuning open-source TTS engines using custom voice datasets. The role requires the ability to build high-quality, multilingual, expressive TTS voices (with cues, non-verbal expressions, and prosody variations), and to optimize them for fully offline use on mobile devices.
You will be responsible for creating a world-class TTS pipeline: from dataset preparation → fine-tuning → evaluation → on-device deployment (Android/iOS).
Responsibilities
- Fine-tune open-source TTS models (e.g., VITS, Glow-TTS, Tacotron2, FastSpeech2, Coqui TTS, Fairseq S2T, Bark-like models).
- Build custom voices from multi-actor recordings, including prosody cues, NVEs (non-verbal expressions), laughter, sighs, whispers, emphasis, and emotional tones.
- Format and preprocess multilingual datasets (e.g., English, Urdu, Arabic, Hindi).
- Implement voice cloning and speaker adaptation methods (speaker embeddings, x-vectors, HuBERT/ContentVec conditioning).
- Apply latest fine-tuning techniques (gradual unfreezing, adversarial training, vocoder adaptation, multi-speaker conditioning).
- Deploy optimized TTS engines on-device using ONNX Runtime, TensorFlow Lite, Core ML, or custom inference runtimes.
- Optimize for low-latency, low-memory, and battery-efficient speech generation on mobile CPUs/NPUs/GPUs.
- Evaluate output quality (MOS, prosody accuracy, multilingual pronunciation consistency).
- Collaborate with engineers to integrate TTS modules into apps for real-time, offline speech synthesis.
Mandatory Skills Checklist (Applicants must demonstrate experience in ALL of the following)
TTS Model Fine-Tuning
- Hands-on fine-tuning of open-source TTS engines (Coqui TTS, VITS, Glow-TTS, Tacotron, FastSpeech).
- Building multilingual and multi-speaker models.
- Dataset alignment: phoneme extraction, grapheme-to-phoneme (G2P), forced alignment (Montreal Forced Aligner, MFA, or equivalent).
- Handling prosody, cues, and NVEs in dataset labeling.
Voice Dataset Engineering
- Preparing raw actor recordings → cleaned, labeled dataset.
- Handling multilingual phoneme sets (IPA, G2P for Urdu, Arabic, Hindi, English).
- Speaker embedding extraction (d-vectors, x-vectors, ECAPA, HuBERT units).
- Noise reduction, augmentation, silence trimming, forced alignment.
On-Device Deployment
- Exporting TTS models to ONNX/TFLite/Core ML.
- Running inference with optimized vocoders (HiFi-GAN, WaveGlow, Parallel WaveGAN).
- Experience with quantization/pruning of speech models for mobile.
- Benchmarking real-time inference: latency, RAM usage, and energy efficiency.
Latest Techniques Knowledge
- Expressive/controllable TTS (prosody embeddings, style tokens, GST, variational prosody models).
- Speaker adaptation & cross-lingual voice transfer.
- Handling low-resource languages (Urdu, Arabic).
- Evaluation frameworks (MOS testing, AB preference tests, WER for intelligibility).
Nice to Have
- Contributions to open-source TTS projects (Coqui, ESPnet, Fairseq, Bark, etc.).
- Experience with speech-to-speech systems or multimodal pipelines.
- Familiarity with distillation/quantization of TTS models for edge devices.
- Worked on custom vocoder design for emotional or non-verbal cues.
Application Requirements
Applicants must include:
- A short case study of a TTS model they fine-tuned (dataset type, model used, output samples).
- A short case study of deploying a TTS model on-device (framework, device, latency, memory usage).
- Links to audio samples, demos, GitHub repos, or production apps showing custom voices.
Job Type: Full-time
Pay: Rs250, Rs400,000.00 per month
Work Location: In person
-
AI/ML Engineer
2 weeks ago
Lahore, Punjab, Pakistan SAARZ Int. Full timeAre you passionate about building intelligent systems, deploying real-world AI agents, and working on cutting-edge LLM-based automation?SAARZ Int.is looking for a skilled AI Engineer (or AI Full-Stack Developer) with1 year of experienceto join our growing team. You'll work on a range of exciting projects — from AI chat/voice agents to autonomous workflows...
-
AI ML Engineer
2 weeks ago
Lahore, Punjab, Pakistan Add Hype Full time 1,200,000 - 3,600,000 per year𝗪𝗲'𝗿𝗲 𝗛𝗶𝗿𝗶𝗻𝗴: 𝗚𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝘃𝗲 𝗔𝗜 𝗘𝗻𝗴𝗶𝗻𝗲𝗲𝗿 (2–4 𝗬𝗲𝗮𝗿𝘀 𝗘𝘅𝗽𝗲𝗿𝗶𝗲𝗻𝗰𝗲) Location: On-site – DHA Phase 4, Lahore Department: Technology / AI/MLWe're looking for a Generative AI Engineer to join our team and help build next-gen AI...
-
AI/ML Engineer
1 week ago
Lahore, Punjab, Pakistan Webbuggs Full time 1,200,000 - 3,600,000 per yearWe are seeking a highly skilledAI/ML Engineerwith expertise across traditional machine learning, deep learning, generative AI, natural language processing (NLP), computer vision, and workflow automation. The ideal candidate should be hands-on with state-of-the-art frameworks and tools, capable of building production-ready AI systems, and comfortable working...
-
AI/ML Engineer
2 days ago
Lahore, Punjab, Pakistan Swift Studioz Full time 900,000 - 1,200,000 per yearAI Engineer (NLP & Voice AI)Role OverviewWe are seeking a talented AI Engineer with strong expertise in NLP, speech recognition, and conversational AI. You will be part of a small, elite team developing scalable AI solutions for real-world customer interactions.Key ResponsibilitiesDevelop, train, and fine-tune NLP and speech recognition models.Build...
-
Generative AI Engineer – Voice Calling Systems
2 weeks ago
Lahore, Punjab, Pakistan Techknock Full time 900,000 - 1,200,000 per yearGenerative AI Engineer – Voice Calling Systems (Full-Time)Type: Onsite, Full TimeLocation: Lake City, Lahore, Pakistan*Please review the screening questions caredully and dont apply if you dont have the required experience*About the RoleWe are seeking AI engineer with expertise in Gen-AI, voice calling systems, conversational AI, and AWS. You'll be...
-
AI Engineer
2 weeks ago
Lahore, Punjab, Pakistan muSharp Full time 1,200,000 - 3,600,000 per yearCompany OverviewmuSharp is a global software development partner specializing in delivering expert-driven solutions tailored to meet the unique needs of clients. With a focus on complex, non-standard e-commerce and a commitment to driving digital transformation, muSharp combines strategic consulting with comprehensive software development services. Our team...
-
AI Engineer
2 weeks ago
Lahore, Punjab, Pakistan Greetly AI Full timeJob Title:AI EngineerExperience Level:2–3 YearsLocation:RemoteEmployment Type:Full-timeAbout the RoleWe are seeking an AI Engineer with 2–3 years of hands-on industry experience. The ideal candidate will have strong proficiency in Python, experience with modern AI frameworks, and a solid understanding of cloud-based AI infrastructure. You will build,...
-
Generative AI Engineer
2 weeks ago
Lahore, Punjab, Pakistan Tron AI Full time 1,200,000 - 3,600,000 per yearFull Stack Developer (Generative AI Engineer)Location: Hybrid (Lahore)Experience Level: 1-2 YearsEducation: Bachelor's Degree in Computer Science or related field (Graduates from FAST and PUCIT are a plus)About the RoleWe are looking for a passionate and driven Full Stack Developer (Generative AI & Backend Engineer) to join our growing team. This role is...
-
AI Engineer
40 minutes ago
Lahore, Punjab, Pakistan Zepto Systems Limited Full time 900,000 - 1,200,000 per yearAbout the RoleWe're creating an AI Receptionist System that automates patient calls, appointment scheduling, and FAQs using LLM-powered voice AI. You'll build the core AI backend — the "brain" that understands and responds like a human.Key ResponsibilitiesDevelop & maintain FastAPI-based AI microservicesImplement LLM workflows (LangChain,...
-
Mid Level AI/ML Engineer
2 days ago
Lahore, Punjab, Pakistan Techtix Full time 900,000 - 1,200,000 per yearCompany DescriptionTechtix stands apart in the tech world by emphasizing action and fearless execution with a mission to harness technology to overcome significant challenges and elevate business operations. We prioritize building long-term relationships by immersing ourselves in client visions and co-creating solutions that future-proof businesses. Our...