About the company:
RapidFire AI is a cutting-edge deep tech startup specializing in scaling Machine Learning solutions. We are dedicated to empowering customers to effortlessly scale their AI workloads, ensuring they stay at the forefront of innovation in their industries.
About the Role:
We are seeking a highly motivated and skilled Cloud DevOps Engineer to join our growing team. In this role, you will be responsible for deploying and managing customer applications on Amazon Web Services (AWS), as well as integrating a custom application that directly manages operations on AWS instances. You will collaborate closely with developers and customer-facing personnel to ensure seamless execution and high availability.
Responsibilities:
Design, develop, and deploy applications on AWS infrastructure, following best practices and security guidelines. Potentially extend these services to other public cloud service providersUse containerization and orchestration tools like Docker and Kubernetes to quickly deploy and maintain applications on clustersUse Infrastructure-as-Code (IaC) tools like Terraform or CloudFormation for automated infrastructure provisioning and managementConfigure and manage AWS infrastructure services, including EC2, S3, EBS, EFS, EKS, and LambdaMonitor and troubleshoot AWS resources to ensure optimal performance and availabilityCollaborate with developers to define and implement cloud infrastructure strategies for applicationsImplement continuous integration and continuous delivery (CI/CD) pipelines for efficient application deploymentsDeliver and maintain applications for customer accounts via a virtual private cloud (VPC) on AWSDesign, develop, deploy, and maintain spend aggregation and billing strategies for different applicationsIntegrate custom applications that manage and configure customer AWS instances based on specific needsAutomate routine tasks and implement monitoring solutions for proactive issue identification and resolutionCommunicate effectively with customers to understand their requirements and ensure successful application deploymentsStay up-to-date with the latest advancements in cloud technologies and best practices,
Qualifications:
4+ years of experience as a Cloud Engineer, Cloud DevOps, or similar role using AWS infrastructure servicesProven experience with AWS deployments and architectures, including VPC, security, authentication, and billing practices4+ programming experience with Python and shell scripting like Bash4+ years deploying production applications with Docker and KubernetesFamiliarity with IaC tools like Terraform or CloudFormationUnderstanding of CI/CD methodologies and toolsBasic experience with version control systems (e.g., Git) and collaborative development workflowsExcellent communication and collaboration skillsAbility to work independently and as part of a teamStrong problem-solving and analytical skillsA passion for learning and staying updated with the latest cloud technologies,
Nice to have:
AWS certifications (e.g. AWS Certified Solutions Architect - Associate)Kubernetes certifications (eg. CKA - Certified Kubernetes Administrator)Demonstrated experience deploying Deep Learning tools like PyTorch or TensorFlow on GPU clusters in productionKnowledge of tools like MLFlow or similarKnowledge of equivalent infrastructure services on other public cloud providers