
Data Engineer with PySpark and Databricks Expertise
4 weeks ago
Get AI-powered advice on this job and more exclusive features.
Direct message the job poster from Cloud Enterprise Business Solutions (CEBS)
As a Data Engineer, you will play a pivotal role in designing, building, and maintaining large-scale data pipelines using PySpark and Databricks. You will collaborate closely with stakeholders to clarify requirements and suggest improvements, ensuring our data infrastructure is robust, scalable, and efficient. Your interest in AI will contribute to innovative solutions and data products for our 5 million members.
Key Responsibilities:
- Data Pipeline Development: Deliver efficient, large-scale data pipelines using PySpark to process and analyse big data.
- Data Orchestration in Databricks: Build and maintain data orchestration pipelines within the Databricks platform.
- Stakeholder Collaboration: Communicate with stakeholders to clarify requirements, provide insights, and suggest improvements to existing data processes.
- Data Product Migration: Oversee the migration and integration of existing data products from one company to another, ensuring seamless functionality.
- Customer Journey Analytics: Develop and maintain data pipelines for customer event journeys, supporting analytics for our 5 million members.
- Performance Optimisation: Optimise data processing and storage for efficiency and scalability.
- Monitoring and Logging: Implement monitoring and logging solutions to ensure data pipelines run smoothly and issues are promptly identified.
- Security and Compliance: Ensure all data processing complies with security policies and data protection regulations.
- Innovation and AI Integration: Leverage your interest in AI to incorporate advanced analytics and machine learning where appropriate.
- Documentation: Create and maintain comprehensive documentation for data architectures, processes, and procedures.
Requirements:
- Qualification: Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field (or equivalent work experience).
- Experience: 3+ years of experience in data engineering, focusing on building large-scale data pipelines.
- Databricks Expertise: At least 2 years of hands-on experience working with Databricks.
- Big Data Technologies: Proficiency with big data technologies and frameworks (e.g., Hadoop, Spark).
- Cloud Platforms: Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
- Data Modelling: Solid understanding of data modelling, ETL processes, and data warehousing concepts.
- Programming Skills: Proficiency in Python and SQL; experience with other programming languages is a plus.
- Stakeholder Engagement: Proven ability to communicate effectively with stakeholders to gather requirements and provide solutions.
- AI Interest: Demonstrated interest in AI, machine learning, or data science.
- Problem-Solving Skills: Excellent analytical and troubleshooting abilities.
- Communication: Strong verbal and written communication skills.
- Agile Methodologies: Familiarity with Agile/Scrum development processes.
- Security Awareness: Understanding of data security practices and compliance requirements.
- Continuous Learning: A proactive attitude towards continuous learning and staying updated with industry trends.
What's in it for you
- Employees' Provident Fund, medical and other incentives
- Unique working environment where you communicate and work directly with international clients
- Seniority levelMid-Senior level
- Employment typeFull-time
- Job functionInformation Technology
- IndustriesIT Services and IT Consulting
Referrals increase your chances of interviewing at Cloud Enterprise Business Solutions (CEBS) by 2x
Sign in to set job alerts for "Data Engineer" roles.Associate Software Engineer -React NativeFull Stack Engineer- Node.js, React,js and FirebaseSolutions Engineer (Onsite, Islamabad, USD salary)Associate Software Engineer at FoomotionSoftware Developer at iSmile Dental SoftwareBlockchain Developer - Next.js & Bitcoin EcosystemFull Stack Developer - Python & Angular (Onsite, Islamabad, Remittance Salary)We're unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr-
Senior Data Engineer
4 days ago
Islamabad, Islamabad, Pakistan beBeeDataEngineering Full time 900,000 - 1,200,000Data Engineering Expertise WantedWe are seeking a seasoned data engineering professional to join our team in transforming and ensuring the accuracy and consistency of our data. Our company is undergoing a significant overhaul of its data transformation processes, and we require an expert who can write PySpark and SQL scripts to validate data pipelines,...
-
Senior Business Intelligence Analyst
5 days ago
Islamabad, Islamabad, Pakistan beBeeDataScientist Full timeJob Description:We are seeking an experienced Data Scientist to spearhead our data-driven initiatives. As a key member of our team, you will be responsible for architecting scalable solutions, applying advanced analytical techniques to solve complex business problems, and transforming raw data into actionable insights that drive strategy and operational...
-
Data Engineer in Test
4 days ago
Islamabad, Islamabad, Pakistan Confiz Full timeWe are undergoing a significant data transformation to ensure that accurate and consistent data is available precisely when and where it's needed.ResponsibilitiesWrite PySpark and SQL scripts to validate data pipelines, transformations, and integrations.Design and run tests for data validation, storage, and retrieval using Azure services like Data Lake,...
-
Data Scientist/ ML Engineer
4 weeks ago
Islamabad, Islamabad, Pakistan LMK Resources Ltd. Full timeJob Summary:The Data Scientist / ML Engineer will support the development of machine learning models and data pipelines. The role focuses on data analysis, feature engineering, and scalable processing using modern data platforms.Responsibilities:Perform data analysis and build machine learning models using Python, Spark, and PySpark.Develop and maintain...
-
Data Scientist
3 weeks ago
Islamabad, Islamabad, Pakistan North Eastern Services Full timeAbout FusemachinesFusemachines is a 10+ year old AI company, dedicated to delivering state-of-the-art AI products and solutions to a diverse range of industries. Founded by Sameer Maskey, Ph.D., an Adjunct Associate Professor at Columbia University, our company is on a steadfast mission to democratize AI and harness the power of global AI talent from...
-
Lead Data Engineer
2 days ago
Islamabad, Islamabad, Pakistan Fusemachines Full timeAbout FusemachinesFusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic) and more than 450 employees,...
-
Lead Data Engineer
1 day ago
Islamabad, Islamabad, Pakistan North Eastern Services Full timeAbout FusemachinesFusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic and more than 450 employees)....
-
Senior ETL Data Engineer
6 days ago
Islamabad, Islamabad, Pakistan NorthBay - Pakistan Full time2 months ago Be among the first 25 applicantsJob Title: Senior ETL Data Engineer (Ab Initio / Informatica)Experience: 4+ YearsLocation: Lahore / Karachi / Islamabad (Hybrid)Job Type: Full-TimeAbout The RoleWe are looking for a highly skilled Senior ETL Data Engineer (Ab Initio / Informatica) with strong experience in building robust data pipelines, working...
-
AI/ML Engineer
6 days ago
Islamabad, Islamabad, Pakistan Nisum Full timeWe are looking for a highly skilled AI/ML Engineer with strong expertise in the Azure ecosystem (Databricks, Data Lake, Data Factory, Synapse), along with hands-on experience in Generative AI (LLMs, RAG, agent frameworks). The role requires solid knowledge of Python, ML/DL frameworks, and modern orchestration tools to build scalable AI-driven solutions.What...
-
Senior Data Systems Architect
4 days ago
Islamabad, Islamabad, Pakistan beBeeData Full time $160,000 - $200,000Lead Data Engineer RoleThe Lead Data Engineer role is a critical position that requires technical expertise and leadership skills. The successful candidate will be responsible for designing, developing, and implementing large-scale data systems, as well as leading cross-functional teams to deliver high-quality solutions.Key Responsibilities:Architect and...