2 days ago
Principal, Model Optimization, Generative AI Innovation Center
US, WA, Seattle
The Generative AI Innovation Center (GenAIIC) team helps AWS customers accelerate the use of Generative AI to solve business and operational challenges and promote innovation in their organization. AWS GenAIIC customers are maturing on their GenAI journey, and increasingly looking for ways to optimize their solutions around latency, cost, and model performance.
In this role you will help drive GenAIIC strategy and delivery approach to tackle this demand through optimizations using purpose-built solutions like SageMaker HyperPod and Amazon Trainium and Inferentia accelerators. As a GenAI Optimization Specialist, you are customer obsessed and understand concepts of low-level computer hardware and large language models. You are also proficient in designing and optimizing ML solutions that work backward from customer business outcomes. You will be expected to invent on behalf of our customers and create new and innovative ideas that accelerate the development and migration of customer GenAI models on the current and future Trainium/Inferentia hardware.
You will possess a strong technical background as well as ability to develop and drive business strategy and go to market motions. You must excel at written and verbal communications. You will have experience with developer tools and software development processes. You must be able to thrive and succeed in an entrepreneurial environment, and not be hindered by ambiguity or competing priorities. This means anticipating bottlenecks, making tradeoffs and balancing the needs of the business versus technical constraints. Cross-team coordination, project management and an ability to learn and understand new technology are essential.
Key job responsibilities
- Use ML and Generative AI tools, such as Amazon SageMaker HyperPod and Neuron chips to provide a scalable cloud solutions for our customers to build, train, tune and deploy their models.
- Interact with customer directly to understand the business problem and requirements, and help scope technical projects that will drive business outcomes and ROI.
- Collaborate with our scientists, engineers, and architects to deliver technical solutions to customers and develop reusable solutions and artifacts.
- Dive deep into custom LLM model architectures to migrate and optimize for performance and cost on Neuron chips.
- Work closely with account team, research scientist teams and product engineering teams including Annapurna labs and SageMaker to identify force-multiplier opportunities and drive new capabilities and model support.
- Write monthly and weekly business reviews and participate in strategic planning documents such as 3 Year plans, OP1 and Yearly Goals and present them to senior leaders across Amazon.
About the team
ABOUT AWS:
Diverse Experiences
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
In this role you will help drive GenAIIC strategy and delivery approach to tackle this demand through optimizations using purpose-built solutions like SageMaker HyperPod and Amazon Trainium and Inferentia accelerators. As a GenAI Optimization Specialist, you are customer obsessed and understand concepts of low-level computer hardware and large language models. You are also proficient in designing and optimizing ML solutions that work backward from customer business outcomes. You will be expected to invent on behalf of our customers and create new and innovative ideas that accelerate the development and migration of customer GenAI models on the current and future Trainium/Inferentia hardware.
You will possess a strong technical background as well as ability to develop and drive business strategy and go to market motions. You must excel at written and verbal communications. You will have experience with developer tools and software development processes. You must be able to thrive and succeed in an entrepreneurial environment, and not be hindered by ambiguity or competing priorities. This means anticipating bottlenecks, making tradeoffs and balancing the needs of the business versus technical constraints. Cross-team coordination, project management and an ability to learn and understand new technology are essential.
Key job responsibilities
- Use ML and Generative AI tools, such as Amazon SageMaker HyperPod and Neuron chips to provide a scalable cloud solutions for our customers to build, train, tune and deploy their models.
- Interact with customer directly to understand the business problem and requirements, and help scope technical projects that will drive business outcomes and ROI.
- Collaborate with our scientists, engineers, and architects to deliver technical solutions to customers and develop reusable solutions and artifacts.
- Dive deep into custom LLM model architectures to migrate and optimize for performance and cost on Neuron chips.
- Work closely with account team, research scientist teams and product engineering teams including Annapurna labs and SageMaker to identify force-multiplier opportunities and drive new capabilities and model support.
- Write monthly and weekly business reviews and participate in strategic planning documents such as 3 Year plans, OP1 and Yearly Goals and present them to senior leaders across Amazon.
About the team
ABOUT AWS:
Diverse Experiences
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Related Jobs
Amazon
1 month ago
Machine Learning Engineer, Generative AI Innovation Center - Model Customization
US, NY, New York