Sr. Data Engineer AWS Snowflake

2 weeks ago


Islamabad, Islamabad, Pakistan FuseMachines Full time

About Fusemachines
Fusemachines is a 10+ year old AI company, dedicated to delivering state-of-the-art AI products and solutions to a diverse range of industries. Founded by Sameer Maskey, Ph.D., an Adjunct Associate Professor at Columbia University, our company is on a steadfast mission to democratize AI and harness the power of global AI talent from underserved communities. With a robust presence in four countries and a dedicated team of over 400 full-time employees, we are committed to fostering AI transformation journeys for businesses worldwide. At Fusemachines, we not only bridge the gap between AI advancement and its global impact but also strive to deliver the most advanced technology solutions to the world.

About the role:
This is a remote, full time consulting position (contract) responsible for designing, building, and maintaining the infrastructure required for data integration, storage, processing, and analytics (BI, visualization and Advanced Analytics) to optimize digital channels and technology innovations with the end goal of creating competitive advantages for food services industry around the globe. We're looking for a solid lead engineer who brings fresh ideas from past experiences and is eager to tackle new challenges.

We're in search of a candidate who is knowledgeable about and loves working with modern data integration frameworks, big data and cloud technologies. Candidates must also be proficient with data programming languages (Python and SQL), AWS cloud and Snowflake Data Platform. The data engineer will build a variety of data pipelines and models to support advanced AI/ML analytics projects, with the intent of elevating the customer experience and driving revenue and profit growth globally.

Qualification & Experience:

  • Must have a full-time Bachelor's degree in Computer Science or similar from an accredited institution.
  • At least 3 years of experience as a data engineer with strong expertise in Python, Snowflake, PySpark, and AWS.
  • Proven experience delivering large-scale projects and products for Data and Analytics, as a data engineer.

Skill Set Requirement:

  • Vast background in all things data-related.
  • 3+ years of real-world data engineering development experience in Snowflake and AWS (certifications preferred).
  • Highly skilled in one or more programming languages, must have Python, and proficient in writing efficient and optimized code for data integration, storage, processing, manipulation and automation.
  • Strong experience in working with ELT and ETL tools and being able to develop custom integration solutions as needed, from different sources such as APIs, databases, flat files, and event streaming. Including experience with modern ETL tools such as Informatica, Matillion, or DBT; Informatica CDI is a plus.
  • Strong experience with scalable and distributed Data Technologies such as Spark/PySpark, DBT and Kafka, to be able to handle large volumes of data.
  • Strong programming skills in SQL, with proficiency in writing efficient and optimized code for data integration, storage, processing, and manipulation.
  • Strong experience in designing and implementing Data Warehousing solutions in AWS with Snowflake.
  • Good understanding of Data Modelling and Database Design Principles. Being able to design and implement efficient database schemas that meet the requirements of the data architecture to support data solutions.
  • Proven experience as a Snowflake Developer, with a strong understanding of Snowflake architecture and concepts.
  • Proficient in Snowflake services such as Snowpipe, stages, stored procedures, views, materialized views, tasks and streams.
  • Robust understanding of data partitioning and other optimization techniques in Snowflake.
  • Knowledge of data security measures in Snowflake, including role-based access control (RBAC) and data encryption.
  • Experience with Kafka, Pulsar, or other streaming technologies.
  • Experience orchestrating complex task flows across a variety of technologies, Apache Airflow preferred.
  • Expert in Cloud Computing in AWS, including deep knowledge of a variety of AWS services like Lambda, Kinesis, S3, Lake Formation, EC2, ECS/ECR, IAM, CloudWatch, EKS, API Gateway, etc
  • Good understanding of Data Quality and Governance, including implementation of data quality checks and monitoring processes to ensure that data is accurate, complete, and consistent.
  • Good Problem-Solving skills: being able to troubleshoot data processing pipelines and identify performance bottlenecks and other issues.

Responsibilities:

  • Follow established design and constructed data architectures. Developing and maintaining data pipelines (streaming and batch), ensuring data flows smoothly from source (point-of-sale, back of house, operational platforms and more of a Global Data Hub) to destination. Handle ETL/ELT processes, including data extraction, loading, transformation and loading data from various sources into Snowflake to enable best-in-class technology solutions.
  • Play a key role in the Data Operations team - developing data solutions responsible for driving Growth.
  • Contribute to standardizing and developing a framework to extend these pipelines globally, across markets and business areas.
  • Develop on a data platform by building applications using a mix of open-source frameworks (PySpark, Kubernetes, Airflow, etc.) and best-in-breed SaaS tools (Informatica Cloud, Snowflake, Domo, etc.).
  • Implement and manage production support processes around data lifecycle, data quality, coding utilities, storage, reporting and other data integration points.
  • Ensure the reliability, scalability, and efficiency of data systems are maintained at all times.
  • Assist in the configuration and management of Snowflake data warehousing and data lake solutions, working under the guidance of senior team members.
  • Work with cross-functional teams, including Product, Engineering, Data Science, and Analytics teams to understand and fulfill data requirements.
  • Contribute to data quality assurance through validation checks and support data governance initiatives, including cataloging and lineage tracking.
  • Takes ownership of storage layer, SQL database management tasks, including schema design, indexing, and performance tuning.
  • Continuously evaluate and integrate new technologies to enhance data engineering capabilities and actively participate in our Agile team meetings and improvement activities.
Fusemachines is an Equal opportunity employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, colour, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws.
#J-18808-Ljbffr
  • Sr. Data Engineer

    3 weeks ago


    Islamabad, Islamabad, Pakistan Tekrowe Full time

    Direct message the job poster from TekroweTalent Acquisition Specialist @ Tekrowe Digital | Technical Recruiter | HR Operations | Head Hunter | Employee Engagement | Corporate CommunicationsThe Company Tekrowe Digital, a rapidly growing Software Development and Technology Innovation company in Pakistan, specializes in cutting-edge Web and Mobile...

  • Sr. Data Engineer

    3 weeks ago


    Islamabad, Islamabad, Pakistan Tekrowe Digital Full time

    Get AI-powered advice on this job and more exclusive features.Sign in to access AI-powered advicesContinue with Google Continue with GoogleContinue with Google Continue with GoogleContinue with Google Continue with GoogleContinue with Google Continue with GoogleContinue with Google Continue with GoogleContinue with Google Continue with GoogleDirect message...

  • Data Engineer

    3 days ago


    Islamabad, Islamabad, Pakistan NorthBay Solutions LLC Full time

    Job Title: Data EngineerLocation:Karachi, Lahore , Islamabad (Hybrid)Experience:5+ YearsJob Type: Full-TimeJob Overview:We are looking for a highly skilled and experienced Data Engineer with a strong foundation in Big Data, distributed computing, and cloud-based data solutions. This role demands a strong understanding of end-to-end Data pipelines, data...

  • Data Engineer

    2 weeks ago


    Islamabad, Islamabad, Pakistan Nisum Full time

    Job DescriptionNisum is a leading global digital commerce firm headquartered in California, with services spanning digital strategy and transformation, insights and analytics, blockchain, business agility, and custom software development. Founded in 2000 with the customer-centric motto "Building Success Together," Nisum has grown to over 1,800 professionals...


  • Islamabad, Islamabad, Pakistan SwipBox Full time

    Job Summary: We are looking for a skilled Data Engineer/Data Scientist with strong Python programming and analytical skills, who can work across the data pipeline from ingestion and processing to modelling and reporting. The ideal candidate has experience handling large-scale (big) datasets, building predictive models, working with live and runtime data...

  • Data Engineer

    2 weeks ago


    Islamabad, Islamabad, Pakistan Nisum Full time

    Get AI-powered advice on this job and more exclusive features.Job DescriptionNisum is a leading global digital commerce firm headquartered in California, with services spanning digital strategy and transformation, insights and analytics, blockchain, business agility, and custom software development. Founded in 2000 with the customer-centric mottoJob...


  • Islamabad, Islamabad, Pakistan beBeeData Full time 1,200,000 - 1,500,000

    Senior Data EngineerWe are seeking an experienced Senior Data Engineer to join our team.Design and develop complex data pipelines using Big Data and cloud-native technologies.Leverage tools such as Ab Initio, Informatica, DBT, and Apache Spark to build scalable data workflows.Implement distributed data processing using Hadoop, Hive, Kafka, and Spark.Build...


  • Islamabad, Islamabad, Pakistan beBeeData Full time

    Job Opportunity:We are seeking an experienced Data Architect to lead the design and implementation of scalable data platforms, ensuring seamless integration with enterprise systems. As a key member of our team, you will be responsible for architecting data infrastructure, supporting high-performance data pipelines, and driving continuous improvement.Key...


  • Islamabad, Islamabad, Pakistan beBeeDataEngineer Full time

    About Our MissionWe strive to create a better world by connecting people who don't speak the local language with medical professionals and other essential services.Our goal is to empower individuals in need through technology. We do this by arranging translators and interpreters for patients, crime victims, and others who require assistance navigating their...

  • AWS Data Architect

    3 weeks ago


    Islamabad, Islamabad, Pakistan NorthBay Solutions Full time

    Position OverviewWe are seeking an experienced AWS Data Architect to design and implement an AWS Data Lake solution for managing data from our SAP ERP systems to support FP&A processes in Anaplan. This hybrid role allows the candidate to work in Pakistan (Lahore, Karachi, Islamabad) with the flexibility to work remotely and on-site as needed. Additionally,...