TT-Distributed Software Engineer
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
As our TT-Distributed Software Engineer, you will develop and optimize distributed software systems that power the most efficient and highest-performing AI and HPC clusters. In this role, you'll work on distributed programming across multiple nodes, utilizing systems programming, inter-node communication, and Tenstorrent’s scalable architectures to advance the state-of-the-art distributed inference and training infrastructure.
This role is hybrid, based out of Santa Clara, CA; Austin, TX; or Toronto, ON.
We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.
Responsibilities:
- Architect, implement, and optimize distributed software systems (TT-Distributed) to efficiently manage communications and computations across clusters of AI accelerators and CPUs.
- Design robust systems leveraging inter-process communication (IPC), inter-node sockets, and distributed programming frameworks like MPI to ensure reliability, scalability, and high performance.
- Dive deep into system-level optimizations, cluster resource management (e.g., Slurm), and performance tuning to maximize efficiency in distributed AI workloads.
- Collaborate closely with AI researchers and hardware engineers to integrate distributed inference and training frameworks into Tenstorrent’s broader software ecosystem.
Experience & Qualifications:
- Bachelor's degree or higher in Computer Science, Electrical/Computer Engineering, or a related field.
- Solid proficiency in C/C++ and a foundational understanding of systems programming, operating systems, and distributed system principles.
- Enthusiasm for distributed computing, including inter-process communication (IPC), socket programming, cluster resource management, and distributed inference and training.
- Willingness to think from first principles, consider out-of-the-box solutions, discover where industry norms and current state-of-the-art fall short, and drive to surpass them.
- Desire to learn, grow, and become an expert in distributed systems, bringing curiosity and innovative thinking to solve challenging problems in large-scale computing.
- Experience or familiarity with high-performance networking, MPI, RDMA, or cluster computing frameworks is advantageous but not required.
Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
Due to U.S. Export Control laws and regulations, Tenstorrent is required to ensure compliance with licensing regulations when transferring technology to nationals of certain countries that have been licensing conditions set by the U.S. government.
Our engineering positions and certain engineering support positions require access to information, systems, or technologies that are subject to U.S. Export Control laws and regulations, please note that citizenship/permanent residency, asylee and refugee information and/or documentation will be required and considered as Tenstorrent moves through the employment process.
If a U.S. export license is required, employment will not begin until a license with acceptable conditions is granted by the U.S. government. If a U.S. export license with acceptable conditions is not granted by the U.S. government, then the offer of employment will be rescinded.