OpenAI
4 days ago

Technical Program Manager, GPU Fleet

San Francisco

Share:

About the Team

The fleet team runs the GPU fleet that serves the models backing ChatGPT and API while also supporting training workloads for our next generation models. We manage one of the largest cutting edge GPU fleets in the world, exposing it as a singular platform for other OpenAI teams to seamlessly run production Applied AI and training workloads. 

We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth.

About the Role

As a Technical Program Manager for the GPU Fleet, your role is to help make our future compute plans become a reality by coordinating with engineers to bring up, maintain, and serve capacity to all of OpenAIs training and inference workloads. You will be responsible for managing & coordinating the overall body of work across many parallel programs/projects, ensuring cohesive communication and consistent alignment across all teams in platform, to all cross functional teams, and up to leadership.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

  • Guide the roadmap for automation for a fleet that can grow an order of magnitude in size or more.

  • Ensure that incoming clusters are tracked and delivered on-time while providing a stable supply signal for the OpenAI fleet.

  • Support ongoing maintenance windows, healthy fleet management, and the deployment of new compute architectures and data centers across the fleet. 

  • Work with Fleet Turnup engineers on executing on tight timelines and iteratively improving the process, tooling, and automation.

  • Work with external partners to unlock bleeding edge compute and making it available as a turnkey resource for scheduling workloads

  • Collaborate closely with a broad set of stakeholders, including product engineering, inference, security, research and finance

You might thrive in this role if you:

  • Possess a degree in a hard science, or have a demonstrated track record of engineering expertise.

  • Have 5+ years of experience in program management for major projects including capital projects or hyperscaler infrastructure deployment 

  • Demonstrated ability to serve as the go-to person solely responsible for driving and delivering complex projects.

  • Comfortable in managing cross-functional and cross-company teams; experience driving information and decision hygiene 

  • Have an extensive track record of successfully delivering high-profile, technical projects against tight deadlines.

  • Are technically adept and have effectively partnered with engineering or fundamental research teams of the highest caliber.

  • Interfacing and leading external vendors including: engineering firms, equipment suppliers, and/or construction firms

  • Expertise in designing and implementing simple, scalable processes that solve complex problems.

  • Experience managing complicated dependencies such as logistics and or supply chains

  • Are relentlessly resourceful and thrive in ambiguous, fast-paced environments.

  • Are interested in and thoughtful about the impacts of AGI.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. 

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. 

OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Please mention that you found this job on MoAIJobs, this helps us grow, thanks!

Related Jobs

AMD

2 weeks ago

Technical Program Manager

MARKHAM, Canada

Tenstorrent

2 weeks ago

Technical Program Manager

United States

OpenAI

4 days ago

Technical Program Manager, Human Data (2025)

San Francisco

DeepMind

4 weeks ago

Technical Program Manager, Research

London, UK

DeepMind

4 weeks ago

Technical Program Manager, Robotics

London, UK