logo inner

Senior Site Reliability Engineer (SRE)

Viz.aiIsraelHybrid

About Viz.ai


Viz.ai is the pioneer in the use of AI algorithms and machine learning to increase the speed of diagnosis and care across 1,700+ hospitals and health systems in the U.S. and Europe. The AI-powered Viz.ai OneTM is an intelligent care coordination solution that identifies more patients with a suspected disease, informs critical decisions at the point of care, and optimizes care pathways and helps improve outcomes. Backed by real-world clinical evidence, Viz.ai One delivers significant value to patients, providers, and pharmaceutical and medical device companies.

For more information visit Viz.ai.

About the role:


We are seeking a skilled Site Reliability Engineer (SRE) to join our team and help build, maintain, and improve the reliability, scalability, and performance of our systems. As an SRE, you will be responsible for owning observability tools, driving incident management processes, and implementing automation to enhance our infrastructure. This role involves collaborating across teams to ensure a robust and efficient technology stack supporting mission-critical systems.

You will:


  • Proactively enhance system reliability, scalability, and performance through automation, monitoring, and capacity planning.
  • Develop and maintain observability systems, including distributed tracing, logging, and metrics platforms.
  • Establish and maintain organizational standards for monitoring, leveraging tools like Prometheus, Grafana, and OpenTelemetry.
  • Drive incident management, root cause analysis, and continuous improvement initiatives.
  • Partner with development teams to integrate reliability best practices into the software development lifecycle.
  • Manage infrastructure at scale in cloud services (AWS advantage) and  platforms  like Kubernetes or ECS.
  • Optimize resource utilization to reduce costs while maintaining service quality.

What success looks like: 


  • You will have reduced the frequency and impact of production incidents by building resilient systems and improving incident response processes.
  • You will have improved observability: Key metrics, logs, and traces are available and actionable for all critical services, empowering teams to quickly detect and resolve issues.
  • You will be actively engaged in proactive problem solving: You identify and resolve systemic issues before they impact customers, and continuously refine SLOs/SLIs to reflect evolving business needs.
  • Leadership & Mentorship: You are seen as a reliable thought leader within the organization, mentoring others and helping shape the future of our SRE practices.

We are looking for:


  • At least 5 years of experience as a SRE.
  • Strong experience with Observability Tools: Proficiency with OpenTelemetry, Grafana, Prometheus, and ELK stack (Elasticsearch, Logstash, Kibana).
  • Experience with Cloud Platforms: In-depth knowledge of AWS services, including EC2, S3, RDS, and CloudFormation/Terraform for infrastructure-as-code.
  • Proficiency in scripting and/or development languages like Bash or Python.
  • Thorough understanding of CI/CD pipelines and automation tools.
  • Understanding of Infrastructure as Code, and strong experience with automation tools like Terraform and/or Ansible.
  • Solid troubleshooting and debugging skills.
  • A team player with a strong can-do mentality.

Why should you join us? 


  • If you are looking to make an impact, join our mission to develop life-saving products.
  • If you want to be part of an amazing team, our people are at the heart of everything we do.
  • If you are a self-starter and naturally motivated.
  • You have a passion for innovative technologies in the healthcare sector, this may be the place for you!.

Location: 


We are located in San Francisco, Tel Aviv,  This position is based in Tel Aviv.Our office in Tel-Aviv is located in Menachem Begin 150, within walking distance of Arlozorov and Ha'Shalom train stations.

Life at Viz.ai

Viz.ai, Inc is emerging as the leader in applied artificial intelligence in healthcare. Our mission is to fundamentally improve how healthcare is delivered in the world, through intelligent software that promises to reduce time to treatment and improve access to care. Our flagship product, Viz LVO, leverages advanced deep learning to communicate time-sensitive information about stroke patients straight to a specialist who can intervene and treat. In February 2018, the U.S. Food and Drug Administration (FDA) granted a De Novo clearance for Viz LVO, the first-ever computer-aided triage and notification platform. Most recently, Viz.ai announced its second FDA clearance for Viz CTP through the 510(k) pathway, offering healthcare providers an important tool for automated cerebral image analysis. We are located in San Francisco and Tel Aviv and backed by leading Silicon Valley investors, including Kleiner Perkins, Google Ventures, Innovation Endeavors and DHVC.
Thrive Here & What We Value1. Collaborative Environment2. Dynamic Work Culture3. Healthcare Impact Focus4. Professional Growth & Mentorship5. Competitive Employee Benefits6. Hybrid Work Model (Two Days In-Office)7. Dog-Friendly Office

Related Sub

This job belongs to these sub. Explore related roles here:
Machine learning jobs
Your tracker settings

We use cookies and similar methods to recognize visitors and remember their preferences. We also use them to measure ad campaign effectiveness, target ads and analyze site traffic. To learn more about these methods, including how to disable them, view our Cookie Policy or Privacy Policy.

By tapping `Accept`, you consent to the use of these methods by us and third parties. You can always change your tracker preferences by visiting our Cookie Policy.

logo innerThatStartupJob
Discover the best startup and their job positions, all in one place.
Copyright © 2025