logo inner

Fabric SOC Architect


Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
At Tenstorrent, we’re building cutting-edge hardware and software solutions that power AI, HPC, and general-purpose workloads. As a Performance Architect on our Platform Architecture team, you’ll work across ML software stacks, compilers, CPU design, cache coherency protocols, and interconnect fabrics to shape the future of high-performance systems. This role is all about bridging software execution and silicon design—making data-driven decisions that directly influence our SoC performance.This role isremote, based out of The United States.We welcome candidates at various experience levels for this role.

During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

Who You Are


  • Passionate about solving complex system-level performance problems.
  • Comfortable working across hardware and software boundaries.
  • Analytical and data-driven, with a talent for turning workloads into architectural insights.
  • Collaborative, thriving in cross-functional teams spanning compilers, CPU, and interconnect.
  • Excited to shape the future of AI/HPC platforms through performance architecture.

What We Need


  • BS/MS/PhD in EE, ECE, CE, or CS
  • Deep understanding of NoC topologies, routing algorithms, QoS, and traffic scheduling.
  • Expertise in cache coherency protocols (AMBA CHI/AXI) and modern memory/IO technologies (DDR, LPDDR, GDDR, PCIe, CCIX, CXL).
  • Proficiency in C/C++ programming, with experience in building efficient performance models.
  • Familiarity with ML/AI traffic patterns or formal verification of cache coherence protocols is a strong plus.

What You Will Learn


  • How real ML/AI traffic patterns influence SoC interconnect and cache design.
  • The art of balancing performance vs. complexity in coherence and memory hierarchies.
  • How performance models feed into CPU and accelerator microarchitecture decisions.
  • Best practices for correlating pre-silicon and post-silicon performance.
  • Cutting-edge approaches to integrating heterogeneous compute systems at scale.

Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology.  Due to U.S. export laws, including those codified in the U.S.

Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2).   These requirements apply to persons located in the U.S. and all countries outside the U.S.  As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S.

Commerce Department or applicable federal agency.  If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.

Life at Tenstorrent

At Tenstorrent, we are creating the next generation of high-performance processor ASICs, specifically engineered for deep learning and smart hardware. Our processor is designed to excel at both learning and inference, while being software-programmable to support future innovations in the field of machine learning. The processor's architecture easily scales from battery-powered IoT devices to large cloud servers, and surpasses today's solutions by several orders of magnitude in raw performance and energy efficiency. Our team, made up of alumni from hardware industry leaders like NVIDIA and AMD, is committed to providing the core hardware necessary to increase the pace of deep learning research and enable smart devices to live untethered from the power grid and the Internet. We are based in Toronto and proudly backed by Real Ventures, the Canadian VC of the Year two years running.
Thrive Here & What We Value* Innovation, collaboration, problem-solving* Competitive compensation package* Diverse team with varying seniorities* Hybrid work arrangement (Santa Clara, CA; Austin, TX)* Equal opportunity employer* Cutting-edge AI technology leadership* Passionate technologists in diverse teams* High performance RISCV CPU development
Your tracker settings

We use cookies and similar methods to recognize visitors and remember their preferences. We also use them to measure ad campaign effectiveness, target ads and analyze site traffic. To learn more about these methods, including how to disable them, view our Cookie Policy or Privacy Policy.

By tapping `Accept`, you consent to the use of these methods by us and third parties. You can always change your tracker preferences by visiting our Cookie Policy.

logo innerThatStartupJob
Discover the best startup and their job positions, all in one place.
Copyright © 2025