POSTED Dec 22

Platform Engineer, Cloud Infrastructure

at Stability AIUnited States

Share:

About Stability: 

Stability AI is on the forefront of artificial intelligence development, building brand new technologies that push the boundaries of what is possible. As an open source company, we are committed to our community, and care deeply about the real-world implications and applications of our technology. Our most considerable advances grow from working across multiple teams and disciplines. We are not afraid to go against established norms or buck industry expectations. We are motivated to generate breakthrough ideas and convert them into tangible solutions. Our vibrant communities consist of experts, leaders, and partners across the globe who are developing cutting-edge open AI models for Image, Language, Audio, Video, 3D, and Biology. 

About role: 

We are currently looking for a skilled Cloud Infrastructure Engineer with a specialized focus on API development to facilitate seamless integration and interaction between cloud-based services and High-Performance Computing (HPC) environments. The successful candidate will play a pivotal role in designing and implementing APIs that enable efficient communication and data exchange between cloud platforms and HPC systems.

Responsibilities:

  • Design, develop, and maintain robust APIs that facilitate communication and data exchange between cloud-based services, particularly AWS, and HPC environments.
  • Collaborate with cross-functional teams to understand the unique requirements of both cloud based services and HPC systems, ensuring that the APIs developed meet the specific needs of these environments.
  • Implement best practices for API design, including security, scalability, and performance optimization to ensure efficient interaction between cloud services and HPC clusters.
  • Utilize services such as Cloudflare to enhance API performance, security, and reliability in the cloud-to-HPC communication, optimizing for speed and resilience.
  • Work closely with HPC engineers to identify and address integration challenges, striving for seamless connectivity between diverse systems and cloud-based platforms.
  • Drive innovation by proposing and implementing new API strategies, enhancing the efficiency and functionality of data exchange between AWS, GCP, Cloudflare, and on-premise HPC environments.
  • Create comprehensive documentation and provide training to internal teams on the use and integration of developed APIs, focusing on AWS and Cloudflare environments.
  • Monitor API performance and address issues related to data transfer, ensuring reliability and consistent operation between AWS, Cloudflare, and HPC systems.
  • Collaborate with the security team to ensure that the APIs comply with industry standards and best practices for data privacy and protection, especially in AWS and Cloudflare environments.

Requirements:

  • Strong experience in cloud computing, API development, and a deep understanding of High-Performance Computing environments, particularly in an AWS setting.
  • Proficiency in programming languages such as Python and Typescript, essential for API development and integration within AWS and/or Cloudflare environments.
  • Demonstrated expertise in API design, implementation, and maintenance, ensuring security and performance best practices within AWS and Cloudflare.
  • Knowledge of containerization technologies (e.g., Docker, Kubernetes) for deployment of APIs within AWS, Cloudflare, and HPC systems.
  • Familiarity with authentication and authorization protocols (e.g., OAuth, JWT) to ensure secure data exchange between AWS, Cloudflare, and HPC environments.
  • Strong problem-solving skills and the ability to troubleshoot complex issues related to API integrations in a hybrid cloud-HPC setup, particularly in AWS and Cloudflare environments.
  • Excellent communication and collaboration skills to work effectively with diverse teams and stakeholders in AWS and Cloudflare ecosystems.

Equal Employment Opportunity:

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses. 

 

Please mention that you found this job on Moaijobs, this helps us get more companies to post here, thanks!

Related Jobs

Groq
Principal Software Engineer, Infrastructure Platform
Mountain View, CA (Remote)
Groq
Principal Site Reliability Engineer, Infrastructure Platform
Mountain View, CA (Remote)
Shield AI
Senior Engineer, Software Infrastructure (R2973)
San Diego Metro Area
Helsing
Systems Engineer
Munich
X AI
Infrastructure Engineer - Supercomputing
San Francisco & Palo Alto, CA