About Handshake AI
Handshake is building the career network for the AI economy. Our three-sided marketplace connects 18 million students and alumni, 1,500+ academic institutions across the U.S. and Europe, and 1 million employers to power how the next generation explores careers, builds skills, and gets hired.
Handshake AI is a human data labeling business that leverages the scale of the largest early career network. We work directly with the world’s leading AI research labs to build a new generation of human data products. From PhDs in physics to undergrads fluent in LLMs, Handshake AI is the trusted partner for domain-specific data and evaluation at scale.
This is a unique opportunity to join a fast-growing team shaping the future of AI through better data, better tools, and better systems: for experts, by experts.
Now’s a great time to join Handshake. Here’s why:
- Leading the AI Career Revolution: Be part of the team redefining work in the AI economy for millions worldwide.
- Proven Market Demand: Deep employer partnerships across Fortune 500s and the world’s leading AI research labs.
- World-Class Team: Leadership from Scale AI, Meta, xAI, Notion, Coinbase, and Palantir, just to name a few.
- Capitalized & Scaling: $3.5B valuation from top investors including Kleiner Perkins, True Ventures, Notable Capital, and more.
About the Role
- Design and implement post-training systems and methodologies in close partnership with research scientists and domain experts
- Build and maintain infrastructure that supports large-scale model training, specialized data processing, and benchmark evaluation
- Develop robust frameworks for verifying the quality and integrity of highly specialized domain datasets
- Create next-generation LLM benchmarks that push the boundaries of model evaluation and capabilities assessment
- Optimize performance across software and hardware layers to accelerate post-training experimentation and deployment
- Collaborate across disciplines to ensure rigorous validation of model improvements and benchmark reliability
Desired Capabilities
- Strong Python programming skills with attention to clean, efficient, and scalable code
- Experience building and operating large-scale systems for model post-training, specialized data processing, or benchmark evaluation
- Deep familiarity with PyTorch and modern post-training techniques (RLHF, constitutional AI, etc.)
- A background in applied machine learning, model evaluation, or large-scale data quality assessment
- Experience with benchmark design, evaluation methodologies, and performance measurement frameworks
- Clear communication skills and a collaborative mindset for cross-functional research teams
Extra Credit
- Experience optimizing deep learning models for performance (e.g., memory usage, training speed)
- Interest in the societal and ethical impacts of AI technologies
- Contributions to open-source ML infrastructure or tools
Perks
Handshake delivers benefits that help you feel supported—and thrive at work and in life.
The benefits below are for full-time US employees.
🎯 Ownership: Equity in a fast-growing company
💰 Financial Wellness: 401(k) match, competitive compensation, financial coaching
🍼 Family Support: Paid parental leave, fertility benefits, parental coaching
💝 Wellbeing: Medical, dental, and vision, mental health support, $500 wellness stipend
📚 Growth: $2,000 learning stipend, ongoing development
💻 Remote & Office: Stipends for home office setup, internet, commuting, and free lunch/gym in our SF office
🏝 Time Off: Flexible PTO, 15 holidays + 2 flex days
🤝 Connection: Team outings & referral bonuses
Explore our mission, values, and comprehensive US benefits at joinhandshake.com/careers.
Compensation Range: $200K - $270K