Senior DevOps Engineer (GCP)

Callsign · Onsite
This job is no longer open

Description


When Russian hacker Vladimir Leonidovitch Levin attempted the biggest bank heist the world had ever seen via dial-up internet in 1994, Zia Hayat, Callsign CEO and founder, was hooked: armchair fraud had become a real possibility. From that moment, Zia knew he wanted to play a part in stopping the bad guys and securing the internet for all. Since its founding in 2012, Callsign's mission has been to make Digital Identity simple and secure for everyone and everything. In that time, we've grown to over 200 employees, opened offices in Singapore and Abu Dhabi, and been recognised as a WEF Global Innovator, and our technology is being used by many of the world's leading financial institutions to keep millions of consumers safe. But we aren't stopping here.

Callsign is embarking on a multi-cloud strategy and is looking for exceptional talent with experience of Google Cloud Platform. As a GCP Senior DevOps Engineer here at Callsign, you will play a critical role, working with our engineering team to design, architect, develop, implement, optimize, and maintain cloud-native solutions on Google Cloud Platform. To be successful in this role, you should be able to identify the most suitable cloud-based solutions and maintain cloud infrastructure in accordance with best practices and company security policies.

Responsibilities


  • Architect, implement, and manage scalable and secure infrastructure on Google Cloud Platform (GCP), leveraging core services such as GKE Enterprise, Cloud Run, Managed Apache Kafka, CloudSQL, and MemoryStore to support high-availability and performance-critical applications.
  • Design and automate infrastructure provisioning using Infrastructure as Code (IaC) tools like Terraform, ensuring repeatability and consistency across environments.
  • Develop and maintain CI/CD pipelines using GitLab CI and GitOps tools like FluxCD or ArgoCD to enable seamless and reliable application delivery.
  • Operate and optimize Kubernetes clusters with deep expertise in cluster management, networking, and workload orchestration. Experience with custom Kubernetes operators for application lifecycle management is a strong plus.
  • Implement and enhance observability using tools such as Prometheus, Grafana, NewRelic, ElasticSearch, and Google Cloud Logging and Monitoring to ensure system health, performance, and reliability.
  • Lead disaster recovery (DR) planning and operations, including hands-on experience with cross-region and multi-cloud (GCP/AWS) failover, automated DR workflows using Terraform and CI/CD, and validation of RTO/RPO objectives.
  • Continuously improve system performance, scalability, and security, staying current with GCP’s latest offerings and best practices.
  • Design and implement cloud-native solutions on GCP by leveraging industry-standard architectural patterns, Google’s Cloud Architecture Framework, and proven design principles. Proactively identify and resolve technical blockers through structured root cause analysis and collaboration with cross-functional teams.
  • Mentor junior engineers and contribute to the evolution of DevOps practices within the organization.

Requirements


  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 8-10 years of experience in DevOps or cloud infrastructure roles, including at least 5 years working with GCP in production environments.
  • Proven experience with GCP-native services, especially GKE, CloudSQL, MemoryStore and Managed Apache Kafka.
  • Hands-on experience with stateful systems such as MySQL/PostgreSQL, Redis, and Kafka, with a focus on deployment, scaling, and data durability in cloud-native environments.
  • Strong proficiency in Terraform for infrastructure automation.
  • Hands-on experience with CI/CD tools like GitLab CI and GitOps workflows using FluxCD or ArgoCD.
  • Advanced scripting skills in Golang and Python.
  • Deep understanding of Kubernetes internals, including networking, security, and custom operator development.
  • Experience with observability stacks including Prometheus, Grafana, NewRelic, ElasticSearch, and Google’s operations suite.
  • Practical experience in disaster recovery operations, with a preference for multi-cloud DR strategies involving AWS.
  • Google Cloud Professional Cloud Architect certification or equivalent is a plus.
  • Strong analytical, troubleshooting, and communication skills.

Nice to have


  • Hands-on experience with GCP’s data analytics and AI/ML services—BigQuery, Dataproc, Dataplex, and Vertex AI—to support MLOps pipelines, data governance, and scalable machine learning model deployment in production environments.
  • Familiarity with GCP-native and third-party security tooling, including identity and access management (IAM), workload identity federation, vulnerability scanning, policy enforcement, and secrets management.
  • Experience in building internal tools and reusable infrastructure components that streamline development workflows, improve deployment efficiency, and enhance team productivity.

Benefits


  • Relocation assistance to Abu Dhabi, including flights, accommodation & visa support for you and your dependents
  • Annual airfare allowance for a return flight home
  • Comprehensive medical insurance for you and your dependents
  • 3 months full pay maternity leave & 2 weeks full pay paternity leave
  • 25 days of annual leave + Callsign Bank Holiday (not included in holiday allowance)

Life at Callsign

Friction-free Identification and Authentication

By using all of the thousands of data points available such as typing or swiping techniques, location, online habits, face recognition, devices, and yes even passwords, we can determine someone is who they say they are; we even know the Monday person can behave differently to the Friday person. Most of these data points are friction-free for the user, and so we use these to determine that someone's behaviour is within their normal pattern. Where there is a veering from the norm we then intelligently introduce further tests, avoiding a rules-based approach that can be replicated by the bad guys. We have the lowest false positive rates in the industry and zero breaches thanks to our inbuilt malware detector. As a result, users can get on with their digital lives whilst businesses improve customer engagement, increase productivity and reduce the risk of fraud. Callsign enables customers and employees to #GetOn with their digital lives with friction-free identification and authentication.

Thrive Here & What We Value

  • Collaborative and fun team
  • High standards for self and peers
  • Importance of balancing fun with hard work
  • Teamwork and mutual respect
  • Continuous improvement mindset
  • Positive attitude towards challenges
  • Open communication channels
  • Encouragement of creativity and innovation
  • Supportive environment for growth
  • Commitment to excellence
