Job FunctionsMaintaining and improving the reliability and performance of a cloud-based lending platformOverseeing systems to ensure reliability for customersMonitoring distribution systems and notifying appropriate persons of any potential issuesRunning the production environment by monitoring availability and taking a holistic view of system healthBuilding software and systems to manage platform infrastructure and applicationsImproving reliability, quality, and time-to-market of our suite of software solutionsPartnering with development teams to improve services through rigorous testing and release procedures
Job RequirementsBachelor's Degree (B.A.) in Computer Science or Design or equivalent four-year degree, or equivalent related experience.5-7 years of proven experience in a Site Reliability role or similar experience.
SkillsAWS experienceSystem monitoringInfrastructure automationSystems engineeringSoftware developmentContainerization technologiesAutomated deployment frameworksMonitoring, logging, alerting, system internals, networking, databases, distributed systems, service-oriented architectureExperience with Infrastructure and Application Monitoring tools such as New Relic, SumoLogic, Uptime monitoring (Pingdom), CloudTrail, CloudWatch Insights, CodePipeline, CodeDeployExtensive working knowledge of managing AWS and Linux OSExperience working with MSSQL, MySQL, in cloud-based environments, as well as demonstrable knowledge and experience of AWS service technologies, i.e., Aurora, MySQLExperience of working with NoSQL database technologies (ideally DynamoDB)Experience of working with pipeline automation scripting and tooling, i.e., Jenkins, TerraformKnowledge and experience utilizing coding languages (e.g., C++, Java, PHP) and frameworks/systems (e.g., AWS)Strong sense of ownershipExcellent written and verbal communication and interpersonal skillsAble to effectively collaborate with technical and business partnersCan take on full projects from beginning to endProblem solverTeam Player
We are looking for a Senior Site Reliability Engineer with strong experience in AWS, system monitoring, and infrastructure automation. The role involves maintaining and improving the reliability and performance of a cloud-based lending platform used by mid-market and large financial institutions.
The ideal candidate will have a solid background in systems engineering and software development, be comfortable working across teams, and take ownership of operational stability and tooling improvements.
Responsibilities:
- Own your deep learning about the software, its functions, and how it fulfills the clients’ needs, and how they use the product.
- Oversee systems to ensure reliability for customers.
- Monitor distribution systems and notify appropriate persons of any potential issues.
- Run the production environment by monitoring availability and taking a holistic view of system health.
- Build software and systems to manage platform infrastructure and applications.
- Improve reliability, quality, and time-to-market of our suite of software solutions.
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve.
- Partner with development teams to improve services through rigorous testing and release procedures.
Technical Skills:
- Bachelor's Degree (B.A.) in Computer Science or Design or equivalent four-year degree, or equivalent related experience.
- 5-7 years of proven experience in a Site Reliability role or similar experience.
- Excellent oral and written communication skills, including facilitation of group presentations, and consulting skills in the English language.
- Possess deep technical experience with AWS, containerization technologies, automated deployment frameworks, monitoring, logging, alerting, system internals, networking, databases, distributed systems, and service-oriented architecture.
- Demonstrate hands-on technical leadership and business impact in combining software engineering skills with systems engineering skills to solve complex automation and reliability challenges.
- Experience working with Infrastructure and Application Monitoring tools such as: New Relic, SumoLogic, Uptime monitoring (Pingdom), CloudTrail, CloudWatch Insights, CloudFormation, CodePipeline, CodeDeploy.
- Extensive working knowledge of managing AWS and Linux OS.
- Experience working with MSSQL, MySQL, in cloud-based environments, as well as demonstrable knowledge and experience of AWS service technologies, i.e., Aurora, MySQL.
- Experience of working with NoSQL database technologies (ideally DynamoDB).
- Experience of working with pipeline automation scripting and tooling, i.e., Jenkins, Terraform.
- Knowledge and experience utilizing coding languages (e.g., C++, Java, PHP) and frameworks/systems (e.g., AWS).
- Ability to learn new languages and technologies strongly preferred.
- Broad understanding of the lending industry, with the ability to become a subject matter expert on the job.
Soft Skills:
- A strong sense of ownership.
- Excellent written and verbal communication and interpersonal skills.
- Able to effectively collaborate with technical and business partners.
- Can take on full projects from beginning to end.
- Problem solver.
- Team Player.
- Advanced English level.
Apply for this job
Life at NTD software
Thrive Here & What We Value- Peoplefirst culture that attracts, develops, and retains top talent- Prioritizes employee wellbeeing, empowering teams with the best tools and opportunities for growth in technology, healthcare, and organizational development- Highgrowth trajectory, each team member plays a crucial role in transforming healthcare- Culture of Excellence- Innovative Projects- TeamFocused Environment- HighQuality, Secure Software Solutions- Collaborative Environment- Continuous Learning and Improvement- Passionate Team Members- Emphasis on collaboration between product, development, and cross-platform teams- Commitment to staying updated with the latest tools, best practices, and QA methods for continuous improvement- NTD Software is an equal opportunity employer- All employment decisions at NTD Software are based on a person’s merit, business needs, and role requirements- Emphasis on staying updated with emerging technologies and applying them to improve existing solutions- Empowering individuals through innovative financial solutions- Leveraging proprietary data and analytics for tailored solutions- Cuttingedge technologies and stringent quality standards- Collaborative mindset in a continuous integration environment- Emphasis on building effective crossfunctional relationships- Dynamic and Inclusive Work Environment- Passionate about innovation