Location - Remote Spain, UK, Germany
About Stability:
Stability AI is a community and mission driven, open artificial intelligence company that cares deeply about real-world implications and applications. Our most considerable advances grow from our diversity in working across multiple teams and disciplines. We are unafraid to go against established norms and explore creativity. We are motivated to generate breakthrough ideas and convert them into tangible solutions. Our vibrant communities consist of experts, leaders and partners across the globe who are developing cutting-edge open AI models for Image, Language, Audio, Video, 3D and Biology.
About the role:
We are looking for Research Engineers who are passionate about generative models and creative applications of AI. In particular, we are looking to expand our language modeling team that researches, trains and deploys some of the most advanced models in the market. The role has a strong research component, as it involves the development of new efficient architectures, in-context learning and retrieval techniques, and improved multilingual fine-tuning (instruction tuning, RLHF, etc). The ideal candidate will have a strong understanding of large language models, natural language processing, and machine learning. They will be able to develop, train, and deploy large language models for a variety of applications, with a focus on fine-tuning existing large language models. You will have access to state-of-the-art high performance computing resources and you will be able to work alongside top researchers and engineers to truly make an impact in the fast growing world of generative AI.
Responsibilities:
- Develop, train, and deploy large language models for a variety of applications.
- Fine-tune language models on specific tasks, languages, and domains, ensuring they meet use-case requirements.
- Develop new algorithms to produce data mixtures that are optimal for model training.
- Conduct rigorous testing and evaluation of models to assess their accuracy, efficiency, and robustness.
- Stay up-to-date on the latest research in large language models and natural language processing.
- Work with the research team on developing the next generation of models, where you may directly assist with areas such as optimization of model training, model tuning, dataset engineering, HPC clusters, tooling, and work on open efforts.
- Contribute to the development of our company's AI strategy by working directly with customers and partners to understand and define model requirements and use-cases
Qualifications:
- 2+ years working on machine learning projects involving language models.
- Solid understanding of natural language processing (NLP) concepts and techniques, including data cleaning and filtering.
- Experience with Python scientific stack, PyTorch, creating Jupyter/Colab notebooks.
- Experience with JAX / TPUs / CUDA-level / JavaScript (TensorFlow.js etc) is a plus.
- Experience with instruction fine-tuning, reinforcement learning from human feedback (RLHF) and similar techniques is a plus.
- Experience with training and/or deploying ML models with Amazon AWS (Sagemaker) or Google Cloud is a plus.
- Experience with the open-source ML ecosystem (HuggingFace, W&B, etc.).
- Ability to communicate machine learning concepts and results effectively through writing and visualization.
- Ability to work in a fast-paced, remote and collaborative startup environment.
Equal Employment Opportunity:
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.