Lead Data Engineer – Python

3 days ago


Hyderābād, Sindh, Pakistan Egen Full time 1,200,000 - 1,800,000 per year
Job Overview: We are looking for a skilled and motivated Lead Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data engineering team. The ideal candidate will be responsible for gathering requirements, designing and architecting solutions, and developing and maintaining robust, scalable ETL (Extract, Transform, Load) and ELT data pipelines. The role involves working directly with customers through requirements gathering and discovery, designing and architecting solutions with various GCP services, implementing data ingestion and transformations, ensuring data quality and consistency across systems, and providing post-delivery support.
Experience Level: 10 to 12 years of relevant IT experience
Key Responsibilities:
  • Design, develop, test, and maintain scalable ETL data pipelines using Python.
  • Architect enterprise solutions using technologies such as Kafka, multi-cloud services, auto-scaling on GKE, load balancers, Apigee API proxy management, dbt, LLMs where the solution needs them, redaction of sensitive information, and DLP (Data Loss Prevention).
  • Work extensively on Google Cloud Platform (GCP) services such as:
      • Dataflow for real-time and batch data processing
      • Cloud Functions for lightweight serverless compute
      • BigQuery for data warehousing and analytics
      • Cloud Composer for orchestration of data workflows (on Apache Airflow)
      • Google Cloud Storage (GCS) for managing data at scale
      • IAM for access control and security
      • Cloud Run for containerized applications
  • Should have experience in the following areas:
      • API framework: Python FastAPI
      • Processing engine: Apache Spark
      • Messaging and streaming data processing: Kafka
      • Storage: MongoDB, Redis/Bigtable
      • Orchestration: Airflow
      • Experience in deployments in GKE, Cloud Run
  • Perform data ingestion from various sources and apply transformation and cleansing logic to ensure high-quality data delivery.
  • Implement and enforce data quality checks, validation rules, and monitoring.
  • Collaborate with data scientists, analysts, and other engineering teams to understand data needs and deliver efficient data solutions.
  • Manage version control using GitHub and participate in CI/CD pipeline deployments for data projects.
  • Write complex SQL queries for data extraction and validation from relational databases such as SQL Server, Oracle, or PostgreSQL.
  • Document pipeline designs, data flow diagrams, and operational support procedures.
Required Skills:
  • 10 to 12 years of hands-on experience in Python for backend or data engineering projects.
  • Strong understanding and working experience with GCP cloud services (especially Dataflow, BigQuery, Cloud Functions, Cloud Composer, etc.).
  • Solid understanding of data pipeline architecture, data integration, and transformation techniques.
  • Experience in working with version control systems like GitHub and knowledge of CI/CD practices.
  • Experience with Apache Spark, Kafka, Redis, FastAPI, Airflow, and Cloud Composer DAGs.
  • Strong experience in SQL with at least one enterprise database (SQL Server, Oracle, PostgreSQL, etc.).
  • Experience in data migrations from on-premise data sources to Cloud platforms.
Good to Have (Optional Skills):
  • Experience working with the Snowflake cloud data platform.
  • Hands-on knowledge of Databricks for big data processing and analytics.
  • Familiarity with Azure Data Factory (ADF) and other Azure data engineering tools.
Additional Details:
  • Excellent problem-solving and analytical skills.
  • Strong communication skills and ability to collaborate in a team environment.
Education:
  • Bachelor's degree in Computer Science, a related field, or equivalent experience.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

  • Hyderābād, Sindh, Pakistan Capgemini Full time 900,000 - 1,200,000 per year

    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. Join us and help the world's leading organizations unlock the value of technology and...

  • Data Engineer

    3 days ago


    Hyderābād, Sindh, Pakistan Egen Full time $60,000 - $120,000 per year

    Job Overview: We are looking for a skilled and motivated Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data engineering team. The ideal candidate will be responsible for designing, developing, and maintaining robust and scalable ETL (Extract, Transform, Load) data pipelines. The role involves...


  • Hyderābād, Sindh, Pakistan Capgemini Engineering Full time 60,000 - 120,000 per year

    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. Join us and help the world's leading organizations unlock the value of technology and...


  • Hyderābād, Sindh, Pakistan UST Full time 900,000 - 1,200,000 per year

    5 - 7 Years. 3 Openings. Bengaluru, Chennai, Hyderabad, Kochi, Trivandrum. Role description: AI/ML Engineer. Required Skills: Hands-on experience with Agentic Layer A2A frameworks and MCP Protocol. Expertise in AI/ML engineering, specifically vector embeddings, prompt engineering, and context engineering. Strong programming skills in at least two of the following:...


  • Hyderābād, Sindh, Pakistan Egen Full time $90,000 - $120,000 per year

    Job Overview: We are looking for a skilled and motivated Senior Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data engineering team. The ideal candidate will be responsible for designing, developing, and maintaining robust and scalable ETL (Extract, Transform, Load) data pipelines. The role...

  • Lead Data Scientist

    2 weeks ago


    Hyderābād, Sindh, Pakistan Opella Full time 6,000,000 - 12,000,000 per year

    Job title: Lead Data Scientist. Location: Hyderabad. Opella is the self-care challenger with the purest and third-largest portfolio in the Over-The-Counter (OTC) & Vitamins, Minerals & Supplements (VMS) market globally. Our mission is to bring health in people's hands by making self-care as simple as it should be. For half a billion consumers worldwide – and...


  • Hyderābād, Sindh, Pakistan TTEC Digital Full time 1,200,000 - 3,600,000 per year

    Position Purpose: The position of Consultant is within the TTEC Digital Analytics team. The Analytics group is responsible for Data Science and Engineering projects that include designing and validating data models and building systems to collect, manage, and convert raw transactional data into usable data structures that generate insights for decision making. Our Data...


  • Hyderābād, Sindh, Pakistan Incedo Full time 1,200,000 - 2,400,000 per year

    Company Overview: Incedo is a US-based consulting, data science and technology services firm with over 3000 people helping clients from our six offices across US, Mexico and India. We help our clients achieve competitive advantage through end-to-end digital transformation. Our uniqueness lies in bringing together strong engineering, data science, and design...


  • Hyderābād, Sindh, Pakistan Appen Full time 1,500,000 - 3,000,000 per year

    About Appen Appen is a leader in AI enablement for critical tasks such as model improvement, supervision, and evaluation. To do this we leverage our global crowd of over one million skilled contractors, speaking over 180 languages and dialects, representing 130 countries. In addition, we utilize the industry's most advanced AI-assisted data annotation...

  • GenAI + Python

    1 week ago


    Hyderābād, Sindh, Pakistan Tata Consultancy Services (TCS) Full time 500,000 - 1,500,000 per year

    Python: 4 to 8 yrs of hands-on Python experience with modules like NumPy, pandas, SciPy, scikit-learn, Matplotlib, and seaborn. Experience in writing and debugging object-oriented code. SQL: Intermediate knowledge of SQL, with experience in writing views, joins, and aggregate functions. Data Science/ML experience: 4 to 8 yrs of professional experience of using...