CentralReach is a leading provider of autism and IDD care software for Applied Behavior Analysis (ABA), multidisciplinary therapy, and special education. Trusted by more than 200,000 users, we enable therapy providers, educators, and employers to scale the way they deliver ABA and related therapies with innovative technology, market-leading industry expertise, and world-class customer satisfaction.
The Engineering Operations group at CentralReach builds the underlying technologies that power our Public and Private Cloud Platforms worldwide. The group is responsible for storage, data infrastructure, IT, observability systems, DevOps, SRE, provisioning, compute, orchestration platform, internal tools, internal platforms (laptops, networks, systems, etc.) and services - all the components that make up the CentralReach Platform. If you have a passion for the future, enjoy and thrive in an agile, fast-moving, ever-changing startup environment, welcome and take on technical challenges of all shapes and sizes, have excellent interpersonal skills and a sense of humor, and enjoy rolling up your sleeves and jumping in, then read on!
As a Sr. SRE, you will work closely with the key stakeholders in Software Engineering to drive adoption of modern reliability practices like SLOs, error budget policies, actionable alerts, incident retrospectives, chaos testing, and end-to-end ownership.
Key Accountabilities:
- Responsible for availability, latency, performance, efficiency, monitoring/observability, emergency response, capacity planning, setting and maintaining SLOs, SLIs, and error budgets, and creating dashboards.
- Analyze, troubleshoot, and resolve operational challenges contributing to defined SLOs.
- Manage site stability, performance, reliability, and maintain uptime for production environments.
- Develop a fully automated multi-environment observability stack based on the existing system and extend it to predict capacity needs based on usage patterns.
- Strive for automation to reduce toil and increase development velocity.
- Perform application-specific production support, incident management, change management, problem management, RCAs, and service restoration as needed.
- Identify changes to the product architecture from a reliability, performance, and availability perspective, using a data-driven approach.
- Document resolution runbooks and standard operating procedures.
- Actively look for opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation.
- Collaborate with software development teams on the release management process, to shape the future roadmap, and to establish strong operational readiness across teams.
- Implement reliability and observability tools (such as New Relic, Prometheus, and Grafana).
Desired Skills and Experience:
- Strong background as an SRE supporting a 24x7 highly available production environment for a SaaS or cloud service provider.
- Strong experience with AWS and infrastructure as code (Terraform, CloudFormation).
- Understanding of High Availability best practices in AWS.
- Solid experience with monitoring/APM/observability tools (Splunk, New Relic, etc.).
- Solid experience with Prometheus and Grafana.
- Experience implementing observability plans around logs, metrics, and traces.
- Extensive experience with Kubernetes, Helm, CI/CD, and configuration management tools like Ansible and Chef.
- Experience with release automation, system administration, and configuration management.
- Experience with programming languages (Java, Python, Go, etc.).
- Experience with scripting languages (Bash, PowerShell).
- Strong understanding of Linux, Windows, software development, systems, networking, and cloud concepts.
Backed by Roper Technologies, Inc. (Nasdaq: ROP), and led by award-winning CEO Chris Sullens, CentralReach is entering an exciting phase of growth, innovation, and scale. Recognized as one of the best places to work over 10 times by organizations such as Inc, Built In, and NJBIZ, our culture is centered around impact, inclusion, and flexibility. As a hybrid company with collaborative offices in Ft. Lauderdale, FL; Holmdel, NJ; and Verona, Italy, we foster a workplace where top talent can thrive and make a real difference in the lives of those we serve.
We offer competitive compensation, comprehensive health benefits, generous PTO, 401(k) matching, and paid parental leave. Our team members also enjoy hybrid work schedules, career development support, wellness programs, and opportunities to give back through CR Cares™, our community engagement initiative.
Be part of a market leader driving the future of care. Explore opportunities at centralreach.com/careers.
The expected salary range for this position is $140,000 - $180,000. Compensation will vary based on a number of factors, including education, experience, skills, and location. The range listed is a good faith estimate of base pay for the role, and final compensation will be determined based on the qualifications of the selected candidate.
This role may also be eligible for additional incentive compensation, such as bonuses or commissions, where applicable. In addition to base pay, we offer a comprehensive benefits package.