About the Team
OpenAI's Human Data Team creates custom data solutions driving groundbreaking research. Our work enhances and evaluates our flagship models and products like ChatGPT, GPT-4, and Sora, and contributes to safety initiatives through collaboration with our Preparedness and Safety Systems teams.
We work with AI trainers to gather specialized data for training and evaluating our models across modalities such as video, audio, text, and tool actions. Our goal is to develop scalable methods, tools, and platforms to generate and evaluate high-quality data from both synthetic sources and human experts in various fields, including mathematics, sciences, creative writing, programming, art, and safety. We leverage OpenAI models to improve and streamline our data collection and quality processes.
About the Role
The role, simply put, is to accelerate the pace of research at OpenAI. As our researchers and partners require data collection to support new model capabilities, this role provides the execution, experience, and rigor to successfully design, kick-off, orchestrate, and ultimately own the outcomes of a data collection campaign.
This role is the critical link between our research roadmap, our vendors and AI trainers, and the Human Data engineering team building the annotation platform used to collect all data.
We have a team of Technical Program Managers assigned to different research areas. Collectively as a team, we are writing the playbook for how data collection efforts are accomplished, not just following one. Members of this team are expected to develop the systems, processes, and tools to run data collection efforts in a way that takes advantage of OpenAI’s incredible product capabilities and unique research insights.
This role is based in our San Francisco HQ and will report to the manager of Human Data Operations. We offer relocation support to new employees.
In this role, you will:
Work directly with researchers (they sit next to us) to scope their needs, including timeline and budget, as well as how quality will be measured and what campaign success looks like.
Design our most challenging data collection campaigns, including
Collaborating with the researcher(s) to understand their needs
Designing the onboarding process for the campaign
Writing and maintaining instructions / taxonomy / training documentation
Setting up and executing a quality management system
Reporting on quality, throughput, spend and insightful trends to cross-functional teams
Ensuring our AI trainers and vendors have the materials and insights needed for success
Ultimately, you will be responsible for the quality and success campaign
Work with our vendors and internal Operations Specialists to scale your impact
Build and enhance the systems we’ll need as a team, including
Designing the overall training and onboarding strategy and materials for our vendors
Creating dashboards, command centers, and queries that highlight campaign-specific and team-wide progress on facets such as quality, throughput, and spend
Building the frameworks, tools, and processes to understand our data, using SQL, Python, or other technical means when appropriate
Partnering with the engineers building our Human Data annotation platform to track and implement feature requests and triage any issues blocking progress.
Leverage the amazing product and research capabilities OpenAI has developed to do data collection in a way that exponentially increases our ability to stay ahead of competition
You’ll thrive in this role if you:
Can be an effective thought partner to our Researchers, educating on how best to achieve their goals
Value using both your technical and non-technical skills to do the job
Have experience creating or managing data labeling operations
Communicate clearly and concisely, anticipating the needs of our vendors and partners
Want to get your hands dirty, grit and creative problem solving will be required daily
Architect scalable processes, knowing that building is by far the easiest part of a change
Have an action-oriented and deeply curious mind, often leading you to become a power user of software and tools
Operate with high horsepower, are adept at frequent context switching and working on multiple projects at once with expansive ownership, and ruthlessly prioritize.
Thrive in dynamic environments and can navigate ambiguity with ease
Nice to have: The ability to code enough to debug or even contribute to our annotation platform when appropriate, though this will not be a main focus of the role
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.
OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement
For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.