Senior Customer Reliability Engineer, Infrastructure

3 weeks ago


Hyderabad City Taluka, Pakistan Astronomer Full time

Astronomer designed Astro, an industry-leading, orchestration-first DataOps platform for data teams. Powered by Airflow, Astro accelerates building reliable data products that unlock insights, unleash AI value, and drive data-driven applications.

We're a globally-distributed and rapidly growing venture-backed team of learners, innovators and collaborators. Our mission is to empower data teams to bring mission-critical analytics, AI, and software to life. As a member of our team, you will be at the forefront of the industry as we strive to deliver the world's data.

Your background may be unconventional; as long as you have the essential qualifications, we encourage you to apply. While having "bonus" qualifications makes for a strong candidate, Astronomer values diverse experiences. Many of us at Astronomer haven't followed traditional career paths, and we welcome it if yours hasn't either.

About this role:

The Astronomer Customer Reliability Engineering (CRE) team is responsible for the success of our customers' usage of our managed Airflow service.

The CRE are responsible for operating, monitoring, and maintaining the platform to ensure availability, predictability, and reliable operations.

As an infrastructure specialist within the team, you will focus on the reliability of the underlying cloud infrastructure and Kubernetes clusters. This entails responding to incidents either raised by a customer, or from our monitoring system and then taking further steps to ensure problems are permanently resolved or monitored. As owners of the observability platform, CRE has unlimited potential to improve the reliability of the product and deliver the best possible outcome for our customers.

This role is directly customer-facing and gives exposure to very diverse problems and requirements. The CRE get the opportunity to interface with customers from a variety of industries across different cloud providers, and all with different expectations. Your contributions will directly impact customers' success with using the Astronomer products, and you will be able to help make meaningful improvements to the customer experience.

What you get to do:
  1. Provide solutions to customers to make them successful using our products.

  2. Troubleshoot Customer environments and engage in active triaging with customers.

  3. Provide feedback to the product development teams on customer needs and pain points.

  4. Build out our monitoring and alerting systems.

  5. Build and maintain automation to ensure daily operational tasks are handled as efficiently as possible.

  6. Help direct the architecture of the products and contribute where possible.

  7. Own the customer experience, working directly with customers to prioritize and solve issues, meet SLAs, and provide "white glove" guidance on the path to production.

  8. Participate remotely within a fully distributed team.

  9. Enhance and enrich customer documentation.

  10. Work on a modern, sophisticated, cloud-native product that customers use to connect to dozens of other systems.

  11. Help maintain 24x7 coverage through a specified 6-hour pager period during your work day.

  12. Participate in paid on-call rotation for weekend coverage.

What you bring to the role:
  1. 5+ years of experience, preferably with large, complex cloud infrastructures operating at scale.

  2. 3+ years of experience with Kubernetes.

  3. Experience managing a Production distributed system with at least one major cloud provider (one or all: AWS, GCP, Azure).

  4. Strong Network Experience with one of the major Clouds.

  5. Strong Linux experience.

  6. Knowledge of how to operate and monitor issues for distributed systems.

  7. Experience with Observability tools.

  8. Previous experience in handling customer issues (internal and external).

  9. Strong Communication Skills.

  10. DevOps or CI/CD experience.

  11. Python scripting.

  12. Good troubleshooting Skills.

Bonus points if you have:
  1. Experience as a Site Reliability Engineer.

  2. Worked with Kubernetes Custom Resources.

  3. Depth of knowledge with Azure.

  4. Airflow/Big Data Orchestration experience.

  5. IaC experience.

#LI-Remote

At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Astronomer is a remote-first company.

#J-18808-Ljbffr

  • Hyderabad City Taluka, Pakistan Astronomer Full time

    Astronomer is a venture-backed team of learners, innovators, and collaborators. Our mission is to empower data teams to bring mission-critical analytics, AI, and software to life. We strive to deliver the world's data through our industry-leading orchestration-first DataOps platform.We're looking for a highly skilled engineer to join our Customer Reliability...


  • Hyderabad City Taluka, Pakistan Astronomer Full time

    Astronomer is a remote-first company that values diversity and is committed to making a positive impact. Our mission is to empower data teams to bring mission-critical analytics, AI, and software to life.About the Role:Join our Customer Reliability Engineering team as an infrastructure specialist and focus on the reliability of our cloud infrastructure and...


  • Hyderabad City Taluka, Pakistan Qualcomm Technologies, Inc Full time

    Job DescriptionWe are seeking a highly skilled Senior IT Infrastructure Specialist to join our team at Qualcomm Technologies, Inc. This role will be responsible for managing operational and support responsibilities for critical services supporting software engineering environments globally.The ideal candidate will have experience in handling operations roles...

  • Senior Cloud Engineer

    24 hours ago


    Hyderabad City Taluka, Pakistan FANATICS INC Full time

    Job Description:We are seeking a highly skilled and driven Senior Cloud Engineer with 3+ years of experience in cloud infrastructure, automation, and software development. This role focuses on building and maintaining secure, scalable, and efficient cloud systems. The ideal candidate will have hands-on expertise in software development, infrastructure,...


  • Hyderabad City Taluka, Pakistan Zscaler Full time

    Job OverviewZscaler is a leader in cloud security, protecting thousands of enterprise customers from cyberattacks and data loss. We are seeking an experienced Architect, Site Reliability Engineer to join our SRE Platform and Tooling team. As a key member of the team, you will be responsible for developing scalable, secure, and resilient SRE platform and...


  • Hyderabad City Taluka, Pakistan Oracle - Egypt Full time

    **Cloud Infrastructure Position**We are hiring a Cloud Infrastructure Specialist to join our team at Oracle - Egypt. As a key member of the Network Team, you will be responsible for supporting reliable and secure connectivity solutions for customers worldwide.Your primary responsibilities will include solving and resolving complex issues faced by customers...


  • Hyderabad City Taluka, Pakistan Warner Bros. Discovery, Inc. Full time

    Job DescriptionAs a Senior Site Reliability Engineer at Warner Bros. Discovery, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.You will drive improvements in operational efficiency and proactive monitoring, automate workflows, and develop self-sustainable tools.


  • Hyderabad City Taluka, Pakistan Astronomer Full time

    Astronomer's globally distributed team is growing rapidly, and we're seeking a skilled engineer to join our CRE team. As an infrastructure specialist, you'll focus on the reliability of our cloud infrastructure and Kubernetes clusters.Job Description:Operate, monitor, and maintain our platform to ensure availability, predictability, and reliable...


  • Hyderabad City Taluka, Pakistan DigitalOcean LLC Full time

    About the Company:">DigitalOcean is a leading provider of cloud infrastructure and services.Our Mission:">To make cloud computing accessible to developers and businesses of all sizes.To provide a scalable and reliable infrastructure platform for applications.The Role:">Provide technical support to customers via phone, email, and chat.Collaborate with...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    Job Summary:We are seeking a skilled Senior Cloud Engineer to design and implement solutions for complex, large-scale systems.The successful candidate will have extensive experience in cloud infrastructure, automation, and software development.As a Senior Cloud Engineer, you will collaborate across teams to deliver innovative, reliable cloud infrastructure...


  • Hyderabad City Taluka, Pakistan DigitalOcean LLC Full time

    About the Role:">We are seeking a highly skilled and motivated Technical Support Specialist to join our team. As a Technical Support Specialist, you will be responsible for providing exceptional support to our customers through various communication channels.Responsibilities:">Provide timely and effective technical support to customers via phone, email,...


  • Hyderabad City Taluka, Pakistan Qualcomm Technologies, Inc Full time

    Company: Qualcomm India Private LimitedJob Area: Engineering Group, Engineering Group > Software EngineeringGeneral Summary:We are looking for a talented, motivated, and experienced Cloud DevOps and Site Reliability Engineer (SRE). As part of the IoT (Internet of Things) team you will be working on the next generation of IoT products. This role includes...

  • Sr Engineer Cloud

    1 hour ago


    Hyderabad City Taluka, Pakistan FANATICS INC Full time

    Senior Cloud EngineerJob Description:We are seeking a highly skilled and driven Senior Cloud Engineer with 3+ years of experience in cloud infrastructure, automation, and software development. This role focuses on building and maintaining secure, scalable, and efficient cloud systems. The ideal candidate will have hands-on expertise in software development,...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    Job Description:We are seeking a highly skilled and driven Senior Cloud Engineer with 3+ years of experience in cloud infrastructure, automation, and software development. This role focuses on building and maintaining secure, scalable, and efficient cloud systems.The ideal candidate will have hands-on expertise in software development, infrastructure,...


  • Hyderabad City Taluka, Pakistan DigitalOcean LLC Full time

    Simplifying Cloud ComputingDigitalOcean is committed to simplifying the complexities of cloud computing for developers and businesses. Our cloud platform provides a simple, robust, and cost-effective way for users to build, deploy, and scale applications.About the RoleWe're seeking a talented Senior Software Engineer to join our team. As a key member of our...


  • Hyderabad City Taluka, Pakistan Backbase Full time

    About the RoleWe are seeking a highly skilled Senior System Engineer to join our team and contribute to building an enterprise-grade "Data as a Service" platform from scratch.This role involves designing, implementing, and maintaining the infrastructure of our Data platform on public cloud (Azure). You will work closely with cross-functional teams to ensure...


  • Hyderabad City Taluka, Pakistan Zscaler Full time

    About ZscalerServing thousands of enterprise customers around the world including 40% of Fortune 500 companies, Zscaler (NASDAQ: ZS) was founded in 2007 with a mission to make the cloud a safe place to do business and a more enjoyable experience for enterprise users. As the operator of the world's largest security cloud, Zscaler accelerates digital...


  • Hyderabad City Taluka, Pakistan Oracle - Egypt Full time

    **Overview of Oracle Cloud Infrastructure Team**We are constructing a new technology organization with an entrepreneurial spirit that promotes a creative and energetic environment. Our team is focused on delivering exceptional services to our customers.The Network Team within Oracle's Cloud infrastructure organization requires technical engineers to support...


  • Hyderabad City Taluka, Pakistan beBee Careers Full time

    We're looking for a Senior Cloud Security Engineer who can support infrastructure teams from a security engineering perspective.About the Position:The successful candidate will establish and maintain security best practices for our mobile, on-premises and cloud-based platforms.Key Responsibilities:Establish and maintain infrastructure vulnerability...


  • Hyderabad City Taluka, Pakistan Oracle - Egypt Full time

    Job DescriptionAs part of the Network Team within Oracle's Cloud infrastructure organization, you and your team will be responsible for supporting reliable and secure connectivity solutions for customers. You will be supporting our OCI customers on networking that enable customers in developing, selling, and delivering Oracle products worldwide.Overview: The...