Our team released the Seamless Communication models at the end of 2023, the very first massively multilingual, streaming, and expressive multimodal translation systems. We are looking for a Research Engineer with expertise in speech generation to take these models to the next level by making them production-ready.
Over time, this project will be fully transitioned to an infrastructure team, and the role will support our next research vision: building a personalizable, controllable foundation model for synchronous, multimodal, and expressive behavior generation.
Meta Fundamental AI Research (FAIR) is a research organization committed to advancing open AI research, and we will push the boundaries of human-centric understanding and generation. Our team's technology will enable next-generation human-to-human and human-to-machine communication.
- Collaborate on and execute research that pushes forward the state of the art in human-centric understanding and generation.
- Directly contribute to experiments, including designing experimental details, writing reusable code, running evaluations, and organizing results.
- Develop methodology and benchmarks to evaluate different approaches.
- Work with a large and globally distributed team.
Minimum Qualifications
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
- Master's degree in Computer Science or relevant technical field
- 3+ years of industry, academic or government laboratory experience
- Experience holding an industry, faculty, or government researcher position
- Experience developing machine learning algorithms or machine learning infrastructure in Python
- Experience writing software and executing complex experiments involving large AI models and datasets
- Experience in speech generation and text-to-speech
Preferred Qualifications
- A PhD in AI, computer science, data science, or related technical fields.
- Direct experience in generative AI and LLM research.
- First-author publication experience at peer-reviewed AI conferences (NeurIPS, CVPR, ICML, ICLR, ICCV, ACL, EMNLP, Interspeech, etc.)
- Experience in multimodal generation modeling, in particular human motion generation modeling.
For those who live in or expect to work from California if hired for this position, please click here for additional information.
Locations
About Meta
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.
$70.67/hour to $208,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Equal Employment Opportunity and Affirmative Action
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long-term conditions, mental health conditions, or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com.
Related Jobs
Meta
Research Engineer, Speech Generation - FAIR
Pittsburgh, PA, Menlo Park, CA, Seattle, WA, New York, NY, San Francisco, CA