Senior Data Engineer
3 days ago
Appen is a leader in AI enablement for critical tasks such as model improvement, supervision, and evaluation. To do this we leverage our global crowd of over one million skilled contractors, speaking over 180 languages and dialects, representing 130 countries. In addition, we utilize the industry's most advanced AI-assisted data annotation platform to collect and label various types of data like images, text, speech, audio, and video.
Our data is crucial for building and continuously improving the world's most innovative artificial intelligence systems and Appen is already trusted by the world's largest technology companies. Now with the explosion of interest in generative AI, Appen is helping leaders in automotive, financial services, retail, healthcare, and governments the confidence to deploy world-class AI products.
At Appen, we are purpose driven. Our fundamental role in AI is to ensure all models are helpful, honest, and harmless, so we firmly believe in unlocking the power of AI to build a better world. We have a learn-it-all culture that values perspective, growth, and innovation. We are customer-obsessed, action-oriented, and celebrate winning together.
At Appen, we are committed to creating an inclusive and diverse workplace. We are an equal opportunity employer that does not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Position Summary We're hiring a Senior Data Engineer with strong experience in AWS and Databricks to build scalable data solutions that power next-gen AI and machine learning. Join our fast-growing team to work on impactful projects, collaborate with top talent, and drive innovation at scale. Key Responsibilities:
- Design, build, and manage large-scale data infrastructures using a variety of AWS technologies such as Amazon Redshift, AWS Glue, Amazon Athena, AWS Data Pipeline, Amazon Kinesis, Amazon EMR, and Amazon RDS.
- Design, develop, and maintain scalable data pipelines and architectures on Databricks using tools such as Delta Lake, Unity Catalog, and Apache Spark (Python or Scala), or similar technologies.
- Integrate Databricks with cloud platforms like AWS to ensure smooth and secure data flow across systems.
- Build and automate CI/CD pipelines for deploying, testing, and monitoring Databricks workflows and data jobs.
- Continuously optimize data workflows for performance, reliability, and security, applying Databricks best practices around data governance and quality.
- Ensure the performance, availability, and security of datasets across the organization, utilizing AWS's robust suite of tools for data management.
- Collaborate with data scientists, software engineers, product managers, and other key stakeholders to develop data-driven solutions and models.
- Translate complex functional and technical requirements into detailed design proposals and implement them.
- Mentor junior and mid-level data engineers, fostering a culture of continuous learning and improvement within the team.
- Identify, troubleshoot, and resolve complex data-related issues.
- Champion best practices in data management, ensuring the cleanliness, integrity, and accessibility of our data.
- Optimize and fine-tune data queries and processes for performance. Evaluate and advise on technological components, such as software, hardware, and networking capabilities, for database management systems and infrastructure.
- Stay informed on the latest industry trends and technologies to ensure our data infrastructure is modern and robust.
- 5-7 years of hands-on experience with AWS data engineering technologies, such as Amazon Redshift, AWS Glue, AWS Data Pipeline, Amazon Kinesis, Amazon RDS, and Apache Airflow.
- Hands-on experience working with Databricks, including Delta Lake, Apache Spark (Python or Scala), and Unity Catalog.
- Demonstrated proficiency in SQL and NoSQL databases, ETL tools, and data pipeline workflows.
- Experience with Python, and/or Java.
- Deep understanding of data structures, data modeling, and software architecture.
- Experience with AI and machine learning technologies is highly desirable.
- Strong problem-solving skills and attention to detail.
- Self-motivated and able to work independently, with excellent organizational and multitasking skills.
- Exceptional communication skills, with the ability to explain complex data concepts to non-technical stakeholders.
- Bachelor's Degree in Computer Science, Information Systems, or a related field. A Master's Degree is preferred.
-
Senior Data Engineer
3 days ago
Hyderābād, Sindh, Pakistan Egen Full time $90,000 - $120,000 per yearJob Overview: We are looking for a skilled and motivated Senior Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data engineering team. The ideal candidate will be responsible for designing, developing, and maintaining robust and scalable ETL (Extract, Transform, Load) data pipelines. The role...
-
Senior Data Engineer
3 days ago
Hyderābād, Sindh, Pakistan Incedo Full time 1,200,000 - 2,400,000 per yearCompany Overview Incedo is a US-based consulting, data science and technology services firm with over 3000 people helping clientsfrom our six offices across US, Mexico and India. We help our clients achieve competitive advantage throughend-to-end digital transformation. Our uniqueness lies in bringing together strong engineering, data science, anddesign...
-
Senior Data Engineer
3 days ago
Hyderābād, Sindh, Pakistan Egen Full time 120,000 - 180,000 per yearJob Overview: We are seeking a highly skilled Senior Data Engineer with deep expertise in Microsoft Azure data ecosystem to design, develop, and maintain scalable data pipelines and architectures. The ideal candidate will play a key role in building robust data solutions that support advanced analytics, BI, and AI workloads across the organization. This...
-
Senior Data Engineer
3 days ago
Hyderābād, Sindh, Pakistan Egen Solutions Inc Full timeJob Overview:We are seeking a highly skilled Senior Data Engineer with deep expertise in Microsoft Azure data ecosystem to design, develop, and maintain scalable data pipelines and architectures. The ideal candidate will play a key role in building robust data solutions that support advanced analytics, BI, and AI workloads across the organization. This role...
-
Data Engineer
1 week ago
Hyderābād, Sindh, Pakistan Pragma Edge Full time 1,200,000 - 3,600,000 per yearJob Category - ITJob Type - Full TimeNo of openings: 01Experience : 6 to 8 YearsJob Location - HyderabadSenior IT Data EngineerThe Senior IT Data Engineers are experts in data streaming, building data pipelines that support real-time data refreshes and are cost-optimized for computing resources. They deeply understand data security, implementing row-level...
-
Senior Data Engineer
1 week ago
Hyderābād, Sindh, Pakistan Turvo Inc Full time 1,200,000 - 3,600,000 per yearAbout TurvoTurvo provides a collaborative Transportation Management System (TMS) application designed specifically for the supply chain. Turvo Collaboration Cloud connects freight brokers, 3PLs, shippers, and carriers to unite supply chain ecosystems, delivering outstanding customer experiences, real-time collaboration, and accelerated growth. The technology...
-
Senior Data Engineer
3 days ago
Hyderābād, Sindh, Pakistan Turvo Full time 1,500,000 - 3,000,000 per yearAbout Turvo Turvo provides a collaborative Transportation Management System (TMS) application designed specifically for the supply chain. Turvo Collaboration Cloud connects freight brokers, 3PLs, shippers, and carriers to unite supply chain ecosystems, delivering outstanding customer experiences, real-time collaboration, and accelerated growth. The...
-
Data Engineer
3 days ago
Hyderābād, Sindh, Pakistan Egen Full time $60,000 - $120,000 per yearJob Overview: We are looking for a skilled and motivated Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data engineering team. The ideal candidate will be responsible for designing, developing, and maintaining robust and scalable ETL (Extract, Transform, Load) data pipelines. The role involves...
-
Senior Data Modeler
3 days ago
Hyderābād, Sindh, Pakistan Incedo Full time $120,000 - $180,000 per yearCompany Overview Incedo is a US-based consulting, data science and technology services firm with over 3000 people helping clientsfrom our six offices across US, Mexico and India. We help our clients achieve competitive advantage throughend-to-end digital transformation. Our uniqueness lies in bringing together strong engineering, data science, anddesign...
-
Senior Quality Engineer
3 days ago
Hyderābād, Sindh, Pakistan Matillion Full time $90,000 - $120,000 per yearReady to shape the future of data? Matillion is the intelligent data integration platform. We're changing how the world works with data – and we need driven, curious people who think big and move fast. We built the Data Productivity Cloud to supercharge data productivity, and now we're shaping the future of data engineering with Maia – our...