Data Engineer – Python
4 days ago
We are seeking a highly skilled Data Engineer with strong expertise in AWS cloud services and Python programming. The ideal candidate will be responsible for designing, building, and maintaining scalable data pipelines, ensuring data availability, quality, and performance across enterprise systems. You will collaborate closely with data analysts, data scientists, and business stakeholders to deliver reliable, high-quality data solutions.
Key Responsibilities
- Design, develop, and maintain ETL/ELT data pipelines using Python and AWS native services (Glue, Lambda, EMR, Step Functions, etc.)
- Build and manage data lakes and data warehouses using Amazon S3, Redshift, Athena, and Lake Formation
- Implement data ingestion from diverse sources (RDBMS, APIs, streaming data, on-premise systems)
- Optimize data workflows for performance, cost, and reliability using AWS tools like Glue Jobs, Athena, and Redshift Spectrum
- Develop reusable, modular Python-based frameworks for data ingestion, transformation, and validation
- Work with stakeholders to understand data requirements, model data structures, and ensure data consistency and governance
- Deploy and manage data infrastructure using Infrastructure as Code (IaC) tools such as Terraform or AWS CloudFormation
- Implement data quality, monitoring, and alerting using CloudWatch, Glue Data Catalog, or third-party tools
- Support data security and compliance (IAM roles, encryption, data masking, GDPR policies, etc.)
- Collaborate with DevOps and ML teams to integrate data pipelines into analytics and AI workflows
- Bachelor's or Master's degree in Computer Science, Information Technology, or related field.
- Minimum 5 to 8 years of experience as a Data Engineer or similar role.
- Strong programming experience in Python (pandas, boto3, PySpark, SQLAlchemy, etc.)
- Deep hands-on experience with AWS services, including:
- AWS Glue, Lambda, EMR, Redshift, Athena, S3, Step Functions
- IAM, CloudWatch, Kinesis (for streaming), and ECS/EKS (for containerized workloads)
- Experience with SQL and NoSQL databases (e.g., PostgreSQL, DynamoDB, MongoDB)
- Strong knowledge of data modeling, schema design, and ETL orchestration.
- Familiarity with version control (Git) and CI/CD pipelines for data projects.
- Understanding of data governance, lineage, and cataloging principles.
- Excellent problem-solving, debugging, and performance-tuning skills.
- Experience with Apache Spark or PySpark on AWS EMR.
- Exposure to Airflow, dbt, or similar workflow orchestration tools.
- Knowledge of containerization (Docker, Kubernetes) and DevOps practices.
- Experience with machine learning data pipelines or real-time streaming (Kafka, Kinesis).
- Familiarity with AWS Glue Studio, AWS DataBrew, or AWS Lake Formation.
- Strong analytical and communication skills.
- Ability to work independently and in cross-functional teams.
- Passion for automation and continuous improvement.
- Adaptability in fast-paced, evolving cloud environments.
-
Data Engineer – Python
4 days ago
Hyderābād, Sindh, Pakistan Egen Solutions Inc Full timeJob Overview:We are seeking a highly skilled Data Engineer with strong expertise in AWS cloud services and Python programming. The ideal candidate will be responsible for designing, building, and maintaining scalable data pipelines, ensuring data availability, quality, and performance across enterprise systems. You will collaborate closely with data...
-
Lead Data Engineer – Python
1 week ago
Hyderābād, Sindh, Pakistan Egen Full time 2,000,000 - 2,500,000 per yearJob Overview: We are looking for a skilled and motivated Lead Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data engineering team. The ideal candidate will be responsible for requirements gathering, designing, architecting the solution, developing, and maintaining robust and scalable ETL (Extract,...
-
Senior Data Engineer – Python
4 days ago
Hyderābād, Sindh, Pakistan Egen Solutions Inc Full timeJob Overview:We are looking for an experienced Senior Data Engineer to design, build, and optimize scalable, high-performance data platforms using AWS cloud services and Python. The ideal candidate will play a key role in architecting end-to-end data pipelines, driving automation, ensuring data quality, and enabling analytics and AI workloads across the...
-
Senior Data Engineer – Python
4 days ago
Hyderābād, Sindh, Pakistan Egen Full timeJob Overview: We are looking for an experienced Senior Data Engineer to design, build, and optimize scalable, high-performance data platforms using AWS cloud services and Python. The ideal candidate will play a key role in architecting end-to-end data pipelines, driving automation, ensuring data quality, and enabling analytics and AI workloads across the...
-
Data Engineer(Python, Pyspark, Oracle, ETL)
2 weeks ago
Hyderābād, Sindh, Pakistan UST Full time $1,200,000 - $1,600,000 per year3 - 5 Years1 OpeningBangalore, HyderabadRole descriptionRole: Data Engineering & AI Specialist We are seeking an experienced Data Engineering and AI Data Science and professional with deep expertise across the full data stack, and cutting-edge Agentic AI systems. Required Technical Expertise: Programming & Scripting Languages:Advanced proficiency in Python,...
-
Data Engineer
1 week ago
Hyderābād, Sindh, Pakistan Egen Full time $60,000 - $120,000 per yearJob Overview: We are looking for a skilled and motivated Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data engineering team. The ideal candidate will be responsible for designing, developing, and maintaining robust and scalable ETL (Extract, Transform, Load) data pipelines. The role involves...
-
Data Engineer(Python with Gen AI)
4 days ago
Hyderābād, Sindh, Pakistan Capgemini Full timeHyderabad, Pune, BangaloreData Engineer(Python with Gen AI)At Capgemini Invent, we believe difference drives change. As inventive transformation consultants, we blend our strategic, creative and scientific capabilities, collaborating closely with clients to deliver cutting-edge solutions. Join us to drive transformation tailored to our client's challenges of...
-
Senior Data Engineer
1 week ago
Hyderābād, Sindh, Pakistan Egen Full time 900,000 - 1,200,000 per yearJob Overview: We are looking for a skilled and motivated Senior Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data engineering team. The ideal candidate will be responsible for designing, developing, and maintaining robust and scalable ETL (Extract, Transform, Load) data pipelines. The role...
-
Data Analytics Engineer
2 weeks ago
Hyderābād, Sindh, Pakistan Capgemini Full time 900,000 - 1,200,000 per yearChoosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. Join us and help the world's leading organizations unlock the value of technology and...
-
Data Engineer
2 weeks ago
Hyderābād, Sindh, Pakistan Backbase Full time ₹800,000 - ₹2,400,000 per yearAs a Data EngineerYou design, build, and optimize large-scale data pipelines and platforms across cloud environments. You manage data integration from multiple business systems, ensuring high data quality, performance, and governance. You collaborate with cross-functional teams to deliver trusted, scalable, and secure data solutions that enable analytics,...