We are looking for an experienced
Sr. Director, Network Operations Modernization (AI/ML) to oversee and enhance the reliability and performance of our IT systems through strategic Artificial Intelligence and Machine learning initiatives. This role involves leading a team of engineers, collaborating with cross-functional teams, and implementing best practices to ensure system resilience and efficiency.
PRIMARY RESPONSIBILITIES
- Lead and mentor a team of IT reliability and automation engineers.
- Develop an AI/ML solution for Operational Center activities
- Develop and implement strategies for automating repetitive tasks and improving system reliability.
- Oversee the design, development, and maintenance of automation tools and scripts.
- Collaborate with development, operations, and product teams to ensure seamless integration and deployment of new systems and features.
- Monitor system performance and reliability, proactively identifying and addressing potential issues.
- Establish and enforce best practices for system monitoring, incident response, and disaster recovery.
- Analyze system failures and develop comprehensive solutions to prevent recurrence.
- Maintain detailed documentation of system configurations, processes, and procedures.
REQUIRED KNOWLEDGE/SKILLS/ABILITIES
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- Extensive experience in solving problems with AI & ML
- Proven leadership and team management skills.
- Strong programming skills in languages such as Python, Go, or Java.
- Experience with automation tools like Ansible, Puppet, or Chef.
- Familiarity with monitoring tools such as Prometheus, Grafana, or Nagios.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration skills.
PREFERRED KNOWLEDGE/SKILLS/ABILITIES
- Experience with containerization technologies like Docker and Kubernetes.
- Knowledge of cloud platforms such as AWS, Azure, or Google Cloud.
- Understanding of CI/CD pipelines and tools like Jenkins or GitLab CI. #LI-REMOTE #LI-JL1