
Machine Learning Engineer, ML Runtime & Optimization

pony.ai · Onsite
This job is no longer open

Description


Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and a pioneer in extending autonomous mobility technologies and services across a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai’s leading position has been recognized by CNBC, which ranked Pony.ai #10 on its Disruptor list of the 50 most innovative and disruptive tech companies of 2022.

In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai had accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public on NASDAQ in November 2024.

Responsibilities


The ML Infrastructure team at Pony.ai provides a set of tools to support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will develop technologies to accelerate the training and inference of the AI models in autonomous driving systems. This includes:

  • Identifying key applications for current and future autonomous driving problems and performing in-depth analysis and optimization to ensure the best possible performance on current and next-generation compute architectures.
  • Collaborating closely with diverse groups at Pony.ai, including both hardware and software teams, to optimize and craft core parallel algorithms and to influence the next-generation compute platform architecture design and software infrastructure.
  • Applying model optimization and efficient deep learning techniques to models and optimized ML operator libraries.
  • Working across the entire ML framework/compiler stack (e.g. Torch, CUDA and TensorRT) and on system-efficient deep learning models.

Requirements


  • BS/MS or Ph.D. in computer science, electrical engineering or a related discipline.
  • Strong programming skills in C/C++ or Python.
  • Experience with model optimization, quantization or other efficient deep learning techniques.
  • Good understanding of hardware performance, including CPU or GPU execution models, threads, registers, caches and cost/performance trade-offs.
  • Experience with profiling, benchmarking and validating performance for complex computing architectures.
  • Experience in optimizing the utilization of compute resources and in identifying and resolving compute and data flow bottlenecks.
  • Strong communication skills and the ability to work cross-functionally between software and hardware teams.

Preferred Qualifications


Experience in one or more of the following areas is preferred:

  • Experience with parallel programming, ideally CUDA, OpenCL or OpenACC.
  • Experience in computer vision, machine learning and deep learning.
  • Strong knowledge of software design, programming techniques and algorithms.
  • Good knowledge of common deep learning frameworks and libraries.
  • Deep knowledge of system performance, GPU optimization or ML compilers.

Compensation and Benefits


Base Salary Range: $140,000 - $250,000 Annually

Compensation may vary outside of this range depending on many factors, including the candidate’s qualifications, skills, competencies, experience, and location. Base pay is one part of the Total Compensation, and this role may be eligible for bonuses/incentives and restricted stock units. We also provide the following benefits to eligible employees:

  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (Traditional and Roth 401k)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Short Term & Long Term Disability
  • Free Food & Snacks


Life at pony.ai

Pony.ai is a start-up with the goal of creating the best AI solutions for autonomous driving. We aim to develop Level 4 fully autonomous vehicles that will revolutionize the transportation system. We are convinced that these vehicles must be safe, reliable and cost-effective. In addition, our AI technology makes our vehicles intelligent, so that they can comfortably handle complicated city driving conditions with other vehicles, bicycles, and pedestrians. At Pony.ai, we believe that a practical engineering approach is the most feasible path to bringing the first generation of self-driving vehicles to market.

Thrive Here & What We Value

  • Innovative and disruptive tech company
  • Global leader in autonomous mobility
  • Emphasis on safest autonomous driving capabilities
  • Recognition by reputable organizations (CNBC, XPRIZE, Bessemer Venture Partners)
  • Competitive compensation package
  • Comprehensive benefits package (health care plan, retirement plan, life insurance, paid time off, family leave, free food and snacks)
