Together AI is seeking exceptional Research Interns to join our research team for Summer 2025. You will work on cutting-edge research in foundation model architectures and efficiency, contributing to our mission of advancing open and transparent AI systems.
This application will close on February 7, 2025, at 5:00 pm PT.
About Together AITogether AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Mamba, FlexGen, Petals, Mixture of Agents, and RedPajama.
Research AreasAs a Research Intern, you will work on one or more of the following areas:
- Novel model architectures and architectural adaptations for foundation models
- Inference optimization algorithms and techniques (e.g., speculative decoding, quantization, sparsity, model compression, knowledge distillation)
- High-performance kernel development and optimization
- Advanced post-training optimization and finetuning methods
- New techniques and systems for efficient training of neural networks (e.g., distributed training, algorithmic improvements, optimization methods)
- Robust and reliable evaluation of foundation model capabilities
- Reasoning strategies and inference-time compute techniques
- Research and implement novel techniques in one or more of our focus areas
- Design and conduct rigorous experiments to validate hypotheses
- Document findings in scientific publications and blog posts
- Integrate the research results into Together products
- Communicate the plans, progress, and results of projects to the broader team
- Currently pursuing a Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or a related field
- Strong knowledge of Machine Learning and Deep Learning fundamentals
- Experience with deep learning frameworks (PyTorch, JAX, etc.)
- Strong programming skills in Python
- Strong programming skills in C++ (for kernel development)
- Familiarity with Transformer architectures and recent developments in foundation models
- Prior research experience with foundation models or efficient machine learning
- Publications at leading ML conferences (such as NeurIPS, ICML, or ICLR)
- Experience with CUDA programming (for kernel development)
- Understanding of model optimization techniques and hardware acceleration approaches
- Contributions to open-source machine learning projects
- Duration: ~12 weeks (Summer 2025)
- Location: San Francisco, Amsterdam and London
- Opportunity to work with leading researchers in AI
- Access to significant computational resources
- Exposure to real-world, large-scale Machine Learning problems
- Possibility to contribute to influential open-source projects
Please submit your application with:
- Resume/CV
- A cover letter that includes your preferred research areas of interest, academic transcript (unofficial is acceptable), and links to relevant projects or publication
Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
For more information about our privacy policy, please visit: https://www.together.ai/privacy