Sr. Data Engineer AWS

1 week ago


Islamabad, Islamabad, Pakistan FuseMachines Full time

About Fusemachines

Fusemachines is a 10+ year old AI company, dedicated to delivering state-of-the-art AI products and solutions to a diverse range of industries. Founded by Sameer Maskey, Ph.D., an Adjunct Associate Professor at Columbia University, our company is on a steadfast mission to democratize AI and harness the power of global AI talent from underserved communities. With a robust presence in four countries and a dedicated team of over 400 full-time employees, we are committed to fostering AI transformation journeys for businesses worldwide. At Fusemachines, we not only bridge the gap between AI advancement and its global impact but also strive to deliver the most advanced technology solutions to the world.

About the role

This is a remote full-time contractual position, working in the Travel & Hospitality Industry, responsible for designing, building, testing, optimizing and maintaining the infrastructure and code required for data integration, storage, processing, pipelines and analytics (BI, visualization and Advanced Analytics) from ingestion to consumption, implementing data flow controls, and ensuring high data quality and accessibility for analytics and business intelligence purposes. This role requires a strong foundation in programming and a keen understanding of how to integrate and manage data effectively across various storage systems and technologies.

We're looking for someone who can quickly ramp up, contribute right away and work independently as well as with junior team members with minimal oversight.

We are looking for a skilled Sr. Data Engineer with a strong background in Python, SQL, Pyspark, Redshift, and AWS cloud-based large-scale data solutions with a passion for data quality, performance and cost optimization. The ideal candidate will develop in an Agile environment.

This role is perfect for an individual passionate about leveraging data to drive insights, improve decision-making, and support the strategic goals of the organization through innovative data engineering solutions.

Qualification / Skill Set Requirement:

  • Must have a full-time Bachelor's degree in Computer Science, Information Systems, Engineering, or a related field.
  • 5+ years of real-world data engineering development experience in AWS (certifications preferred). Strong expertise in Python, SQL, PySpark and AWS in an Agile environment, with a proven track record of building and optimizing data pipelines, architectures, and datasets, and proven experience in data storage, modelling, management, lake, warehousing, processing/transformation, integration, cleansing, validation and analytics.
  • A senior person who can understand requirements and design end-to-end solutions with minimal oversight.
  • Strong programming Skills in one or more languages such as Python, Scala, and proficient in writing efficient and optimized code for data integration, storage, processing and manipulation.
  • Strong knowledge SDLC tools and technologies, including project management software (Jira or similar), source code management (GitHub or similar), CI/CD system (GitHub actions, AWS CodeBuild or similar) and binary repository manager (AWS CodeArtifact or similar).
  • Good understanding of Data Modelling and Database Design Principles. Being able to design and implement efficient database schemas that meet the requirements of the data architecture to support data solutions.
  • Strong SQL skills and experience working with complex data sets, Enterprise Data Warehouse and writing advanced SQL queries. Proficient with Relational Databases (RDS, MySQL, Postgres, or similar) and NonSQL Databases (Cassandra, MongoDB, Neo4j, etc.).
  • Skilled in Data Integration from different sources such as APIs, databases, flat files, and event streaming.
  • Strong experience in implementing data pipelines and efficient ELT/ETL processes, batch and real-time, in AWS and using open source solutions, being able to develop custom integration solutions as needed, including Data Integration from different sources such as APIs (PoS integrations is a plus), ERP (Oracle and Allegra are a plus), databases, flat files, Apache Parquet, event streaming, including cleansing, transformation and validation of the data.
  • Strong experience with scalable and distributed Data Technologies such as Spark/PySpark, DBT and Kafka, to be able to handle large volumes of data.
  • Experience with stream-processing systems: Storm, Spark-Streaming, etc. is a plus.
  • Strong experience in designing and implementing Data Warehousing solutions in AWS with Redshift. Demonstrated experience in designing and implementing efficient ELT/ETL processes that extract data from source systems, transform it (DBT), and load it into the data warehouse.
  • Strong experience in Orchestration using Apache Airflow.
  • Expert in Cloud Computing in AWS, including deep knowledge of a variety of AWS services like Lambda, Kinesis, S3, Lake Formation, EC2, EMR, ECS/ECR, IAM, CloudWatch, etc
  • Good understanding of Data Quality and Governance, including implementation of data quality checks and monitoring processes to ensure that data is accurate, complete, and consistent.
  • Good understanding of BI solutions, including Looker and LookML (Looker Modelling Language).
  • Strong knowledge and hands-on experience of DevOps principles, tools and technologies (GitHub and AWS DevOps), including continuous integration, continuous delivery (CI/CD), infrastructure as code (IaC – Terraform), configuration management, automated testing, performance tuning and cost management and optimization.
  • Good Problem-Solving skills: being able to troubleshoot data processing pipelines and identify performance bottlenecks and other issues.
  • Possesses strong leadership skills with a willingness to lead, create Ideas, and be assertive.
  • Strong project management and organizational skills.
  • Excellent communication skills to collaborate with cross-functional teams, including business users, data architects, DevOps/DataOps/MLOps engineers, data analysts, data scientists, developers, and operations teams. Essential to convey complex technical concepts and insights to non-technical stakeholders effectively.
  • Ability to document processes, procedures, and deployment configurations.

Responsibilities:

  • Design, implement, deploy, test and maintain highly scalable and efficient data architectures, defining and maintaining standards and best practices for data management independently with minimal guidance.
  • Ensuring the scalability, reliability, quality and performance of data systems.
  • Mentoring and guiding junior/mid-level data engineers.
  • Collaborating with Product, Engineering, Data Scientists and Analysts to understand data requirements and develop data solutions, including reusable components.
  • Evaluating and implementing new technologies and tools to improve data integration, data processing and analysis.
  • Design architecture, observability and testing strategies, and build reliable infrastructure and data pipelines.
  • Takes ownership of storage layer, data management tasks, including schema design, indexing, and performance tuning.
  • Swiftly address and resolve complex data engineering issues, incidents and resolve bottlenecks in SQL queries and database operations.
  • Conduct a Discovery on the existing Data Infrastructure and Proposed Architecture.
  • Evaluate and implement cutting-edge technologies and methodologies, and continue learning and expanding skills in data engineering and cloud platforms, to improve and modernize existing data systems.
  • Evaluate, design, and implement data governance solutions: cataloguing, lineage, quality and data governance frameworks that are suitable for a modern analytics solution, considering industry-standard best practices and patterns.
  • Define and document data engineering architectures, processes and data flows.
  • Assess best practices and design schemas that match business needs for delivering a modern analytics solution (descriptive, diagnostic, predictive, prescriptive).
  • Be an active member of our Agile team, participating in all ceremonies and continuous improvement activities.

Fusemachines is an Equal opportunity employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws.

#J-18808-Ljbffr
  • AWS Data Engineer

    6 days ago


    Islamabad, Islamabad, Pakistan beBee Careers Full time

    Job OverviewWe are seeking a skilled AWS Data Engineer to work in a rotational shift environment. This role involves monitoring and troubleshooting AWS-based data pipelines while ensuring data integrity and pipeline continuity.You will work with AWS Glue, Lambda, Step Functions, and other AWS data services to investigate job failures, re-run ETL jobs, and...

  • Sr. Data Engineer

    2 days ago


    Islamabad, Islamabad, Pakistan Tekrowe Full time

    Direct message the job poster from TekroweTalent Acquisition Specialist @ Tekrowe Digital | Technical Recruiter | HR Operations | Head Hunter | Employee Engagement | Corporate CommunicationsThe Company Tekrowe Digital, a rapidly growing Software Development and Technology Innovation company in Pakistan, specializes in cutting-edge Web and Mobile...

  • Sr. Data Engineer

    1 day ago


    Islamabad, Islamabad, Pakistan Tekrowe Digital Full time

    Get AI-powered advice on this job and more exclusive features.Sign in to access AI-powered advicesContinue with Google Continue with GoogleContinue with Google Continue with GoogleContinue with Google Continue with GoogleContinue with Google Continue with GoogleContinue with Google Continue with GoogleContinue with Google Continue with GoogleDirect message...


  • Islamabad, Islamabad, Pakistan beBeeDataEngineer Full time

    Job TitleAWS Data Engineer - L2 Support RoleAbout the JobWe are seeking a highly skilled AWS Data Engineer to join our team. This role involves monitoring, troubleshooting, and optimizing AWS-based data pipelines while ensuring data integrity and pipeline continuity.You will work with AWS Glue, Lambda, Step Functions, and other AWS data services to...

  • AWS Data Architect

    3 days ago


    Islamabad, Islamabad, Pakistan NorthBay Solutions Full time

    Position OverviewWe are seeking an experienced AWS Data Architect to design and implement an AWS Data Lake solution for managing data from our SAP ERP systems to support FP&A processes in Anaplan. This hybrid role allows the candidate to work in Pakistan (Lahore, Karachi, Islamabad) with the flexibility to work remotely and on-site as needed. Additionally,...


  • Islamabad, Islamabad, Pakistan Cloudelligent Full time

    Position Title: AWS Database EngineerJob Timings: 8:00 AM – 5:00 PM CST (6:00 PM- 3:00 AM PKT)Location: IslamabadAbout CloudelligentCloudelligent is Cloud-native consultancy and AWS Advanced consulting partner We specialize in providing bespoke cloud solutions to Startups & SMBs. Being a next-gen cloud service provider, Cloudelligent helps businesses to...

  • AWS AI Engineer

    2 weeks ago


    Islamabad, Islamabad, Pakistan Cloudelligent Full time

    If you're ready to innovate and create with cutting-edge AI, Cloudelligent is looking for talented engineers like you.Position Title: AWS AI EngineerJob Timings: 6pm-3am PKTLocation: IslamabadAbout CloudelligentCloudelligent is Cloud-native consultancy and AWS Advanced consulting partner We specialize in providing bespoke cloud solutions to Startups & SMBs....


  • Islamabad, Islamabad, Pakistan beBeeData Full time

    Job DescriptionSeeking an experienced professional to design and implement a scalable data architecture for managing large volumes of data from SAP ERP systems. This role involves building robust data pipelines, enabling seamless integration with Anaplan for financial planning and analysis.Key ResponsibilitiesDesign and implement AWS Data Lake solution for...


  • Islamabad, Islamabad, Pakistan beBeeBackend Full time

    Sr. Backend Software EngineerWe are seeking a highly skilled and experienced Sr. Backend Software Engineer to join our team.

  • Data Engineer

    7 days ago


    Islamabad, Islamabad, Pakistan VisionX Technologies, Inc. Full time

    About us: VisionX works with world-leading brands, Fortune 1000 as their innovation partner, providing product strategy and custom application development leveraging agile methodologies, technology accelerators, and by creating Intellectual Property.VisionX has been listed in the Top 10 Most Innovative Companies of 2020 by Fast Company – ranked alongside...