Recrutement Thales

Site Reliability Engineer W - M H/F - Thales

  • Nice - 06
  • CDI
  • Thales
Publié le 4 mai 2026
Postuler sur le site du recruteur

Les missions du poste

Lieu : Sophia Antipolis, France
Construisons ensemble un avenir de confiance

Thales est un leader mondial des hautes technologies spécialisé dans trois secteurs d'activité : Défense & Sécurité, Aéronautique & Spatial, et Cyber & Digital. Il développe des produits et solutions qui contribuent à un monde plus sûr, plus respectueux de l'environnement et plus inclusif. Le Groupe investit près de 4 milliards d'euros par an en Recherche & Développement, notamment dans des domaines clés de l'innovation tels que l'IA, la cybersécurité, le quantique, les technologies du cloud et la 6G. Thales compte près de 81 000 collaborateurs dans 68 pays.

Nos engagements, vos avantages

- Une réussite portée par notre excellence technologique, votre expérience et notre ambition partagée
- Un package de rémunération attractif
- Un développement des compétences en continu: parcours de formation, académies et communautés internes
- Un environnement inclusif, bienveillant et respectant l'équilibre des collaborateurs
- Un engagement sociétal et environnemental reconnu

Votre quotidien
Au coeur de la Silicon Valley de la région PACA, notre site regroupe nos activités développe des sonars de pointe équipant les sous-marins et les bâtiments de surface ainsi que des activités de services numériques. Pionnier dans le domaine des produits de simulation, le site mobilise une expertise approfondie en acoustique et en traitement du signal.
We are seeking a Site Reliability Engineer to ensure the high level of service and operation excellence for the development of the innovative and ambitious Telecommunication solution (high availability, strong performance constraints) deployed in the public cloud. This product requires the establishment of a product specific SRE team.

Essential Functions

- Automation & Infrastructure as Code: Design, build, and maintain scalable infrastructure using tools such as Terraform, Ansible, and Kubernetes. Develop automated CI/CD pipelines via GitLab to reduce manual toil.
- Availability & Reliability Engineering: Define and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs). Manage "Error Budgets" to balance the velocity of new features with the stability of the platform.
- Incident Management & On-Call Support: Participate in 24/7 on-call rotations to provide emergency response and perform deep-dive troubleshooting for production issues.
- Performance & Capacity Planning: Conduct system performance analysis, identify bottlenecks, and perform capacity planning to ensure the infrastructure can handle growth and peak loads.
- Observability & Monitoring: Implement and refine symptom-based alerting and comprehensive monitoring strategies using platforms like Datadog to ensure high visibility into system health.
- Continuous Improvement & Postmortems: Lead blameless postmortems after incidents to identify root causes and implement long-term technical fixes to prevent recurrence.
- Security & Compliance Collaboration: Partner with Cloud Security teams to implement security best practices, manage access controls, and respond to security breaches or vulnerabilities.
- Support customer relationship
- Interface with other stakeholders to define solution improvement plan
- You will have the ownership of solution service availability.

Minimum Requirements

Education:

- Engineer or equivalent

Experience:

- at least 1 year experience

Skills and Abilities:

- Java development skill is required.
- You are familiar with Public Cloud (GCP, AWS), containers and microservices (Docker, Kubernetes, Java), CI/CD and automation (Jenkins, Gitlab, Helm), NoSQL database.

Certification

- GCP cloud architect certification is a plus

Preferred Qualifications

- You have already set up product monitoring and the underlying infrastructure
- You have development experience in a distributed systems and/or high availability context
- You are familiar with microservices development
- You participated in the definition of architectures, data structures, algorithms with performance, security, reliability constraints, etc.
- Public cloud architect certification
- You are interested in aspects of Site Reliability Engineer: CI/CD, automation, monitoring and observability, and continuous improvement.
- You are an accomplished, versatile and multi-tasking developer engineer.

Thales, entreprise Handi-Engagée, reconnait tous les talents. La diversité est notre meilleur atout. Postulez et rejoignez nous !

Postuler sur le site du recruteur

Parcourir plus d'offres d'emploi