Mistral AI is hiring an expert in pre-training and fine-tuning large language models.
The role will involve:
- modifying pre-trained large language models to make them able to interact with humans
- equipping large language models with the ability to call external tools
- aligning large language models using feedback obtained during their deployment or through an ad-hoc annotation process
- designing the ad-hoc annotation processes themselves
The role will also involve participating in the pre-training effort.
The successful candidate will have:
- a strong scientific understanding of the field of generative AI. This means a broad knowledge of the field of AI, and specific knowledge of, or interest in, fine-tuning and using language models for applications.
- strong technical engineering competence. This means being able to design complex software and make it usable in production. The candidate is able to navigate the full MLOps technical stack, with a focus on architecture development and model evaluation and usage. The candidate will occasionally do front-end development, and will have to use complex HPC infrastructure with full autonomy.
We're a small team of seasoned researchers and engineers in the AI field. We like to work hard and be at the edge of science. We are creative, low-ego, team-spirited, and have been passionate about AI for years. We hire people who thrive in competitive environments, because they find them more fun to work in. We hire passionate women and men from all over the world.
Developers are using our API via la Plateforme to build incredible AI-first applications powered by our models that can understand and generate natural language text and code. We are multilingual at our core. More recently, we released le Chat, as a demonstrator of our models.