Work on State of the Art runtime systems hosting Large Language Models (LLM). We work in a fast paced dynamic environment to rapidly experiment and deliver scaled runtime solutions based on experiments and research in the LLM space.
Key job responsibilities
- Design, develop, test and deploy inference solutions for high-end LLMs
- Explore emerging inference optimization techniques
- Collaborate with cross-functional teams of engineers and scientists to identify and solve complex problems
- Mentor and guide junior engineers, and contribute to the overall growth and development of the team
Key job responsibilities
- Design, develop, test and deploy inference solutions for high-end LLMs
- Explore emerging inference optimization techniques
- Collaborate with cross-functional teams of engineers and scientists to identify and solve complex problems
- Mentor and guide junior engineers, and contribute to the overall growth and development of the team