POSTED Apr 2

System Development Engineer (ML Ops), Amazon EKS

at Amazon
US, WA, Seattle

At Amazon Elastic Kubernetes Service (EKS), we are building a core set of services that allow our customers to create and use Kubernetes at scale. You will be part of an exceptional team moving the needle toward making containers the next-generation compute platform. This is an opportunity to operate and engineer at massive scale and to gain top-notch experience in cloud computing.

As an ML Ops Engineer on the Amazon Elastic Kubernetes Service (EKS) compute team, you will help make EKS the most reliable place to run AI/ML workloads on Kubernetes at massive scale (10,000 or more nodes per cluster). We are looking for engineers who can help build our strong product roadmap and who have, or want to develop, deep expertise in the Kubernetes data plane ecosystem.



Key job responsibilities
- Tune Amazon EKS accelerated machine images (AMIs) to be best in class for AI/ML workloads
- Write test suites that accurately represent Kubernetes workloads
- Design and build CI/CD pipelines that run functional tests, load tests, and security scans for EKS GPU machine images


A day in the life
- Collaborate with peers over design approaches
- Write critical-path code and review your peers' code
- Investigate issues and improve SLOs when on call
- Attend daily standups

About the team
About AWS

Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.

Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Work/Life Balance
We value work-life harmony. Success at work should never come at the expense of life at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

We are open to hiring candidates to work out of one of the following locations:

Seattle, WA, USA

