logo inner

Director of Engineering, SRE

CrusoeSan Francisco, California, United States | Sunnyvale, California, United StatesOnsite

Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated,  purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.
Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About the Role:


We are hiring a Director of Engineering to lead our Crusoe Cloud SRE Organization responsible for Infrastructure and Product SRE, Incident Management, Platform and Tooling and overall Site Resiliency and Operational Excellence This leader will drive initiatives to improve availability, reliability, and efficiency at scale, and shape the future of how Crusoe builds and operates cloud platforms.

Core Responsibilities:


  • Build and lead a high-performing Embedded SRE organization, deeply partnered with product and platform teams.
  • Design and scale Crusoe’s incident response and management programs, including root cause analysis, blameless postmortems, and follow-through.
  • Champion reliability best practices across development and infrastructure teams.
  • Evolve SLOs, alerting, and observability standards to support business-critical systems.
  • Partner with engineering leadership to embed SREs in key services and drive org-wide alignment on reliability goals.
  • Mentor and grow engineering leads and ICs within Embedded SRE and Incident Ops.
  • Foster a culture of operational excellence through documentation, knowledge sharing, and tooling.

Key Challenges or Interesting Projects


  • Building Crusoe’s first org-wide Incident Management program with scalable, developer-friendly workflows.
  • Designing embedded SRE operating models that support multi-tenant, high-availability compute services.
  • Driving adoption of standardized observability tooling and proactive reliability engineering.
  • Collaborating across orgs to reduce incident frequency, MTTR, and long-tail production risks.

Growth Opportunities


  • Define and scale the Embedded SRE function at a pivotal growth moment.
  • Own cross-org incident culture and reliability frameworks.
  • Potential to grow into VP-level ownership over SRE, Infra, and Operations.

Benefits:


  • Industry competitive pay
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Subscription to the Calm app
  • MetLife Legal
  • Company paid Commuter FSA benefit of $200 per month

Compensation Range


Compensation will be paid in the range of $320,000 - $360,000. Restricted Stock Units are included in all offers. Compensation to be determined by the applicants knowledge, education, and abilities, as well as internal equity and alignment with market data.Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.

Life at Crusoe

Crusoe is on a mission to help the oil industry eliminate routine flaring of natural gas and reduce the cost of cloud computing. We are passionate about our goals to help the oil industry operate more efficiently, achieve better relationships with communities and regulators, and improve environmental performance. Crusoe repurposes otherwise wasted energy to fuel the growing demand for computational power in the expanding digital economy.
Thrive Here & What We Value- Innovative Mission- Carbon-negative solutions- Inclusive work environment- Continuous learning opportunities- Collaboration and innovation- Positive impact on global emissions- Competitive compensation packages- Comprehensive benefits package- Remote work support- Equal opportunity employer
Your tracker settings

We use cookies and similar methods to recognize visitors and remember their preferences. We also use them to measure ad campaign effectiveness, target ads and analyze site traffic. To learn more about these methods, including how to disable them, view our Cookie Policy or Privacy Policy.

By tapping `Accept`, you consent to the use of these methods by us and third parties. You can always change your tracker preferences by visiting our Cookie Policy.

logo innerThatStartupJob
Discover the best startup and their job positions, all in one place.
Copyright © 2025