Cloud infrastructure architect who has built platforms serving 1.3M concurrent users and 5M banking customers with 12+ years of hands-on engineering across banking, aerospace, media, and public safety. Recognized expert in AWS/Azure multi-cloud, Kubernetes orchestration at scale, and zero-downtime delivery. Consistent track record: 99.95%+ uptime on production systems, 84% deployment time reductions, and enterprise-wide standards adopted across 8+ development teams. Combines deep technical execution with strategic architecture: the engineer who both designs the system and ships it.
Core Differentiators: Not just an architect, a hands-on engineer who writes Terraform, debugs Kubernetes clusters, and ships production deployments. Migrated 150+ containerized workloads across 3 live EKS clusters with zero downtime. ⭐ Employee of the Month, March 2024 for exceptional technical delivery on complex enterprise platform.
🔗 To learn more about me, visit www.badrjbai.sh | portfolio & projects.
Initiated and owned the full project concept: proposed SelfHealingKubernetesClusters as the PFE subject, defined scope, and gained academic approval.
Engineered the complete tech stack selection and produced a sequential architecture diagram covering auto-detection, self-remediation, and observability layers.
Designed and communicated a 6-month engineering roadmap with clearly defined milestones, dependencies, and delivery phases.
Distributed all engineering tasks via Jira: created epics, user stories, and sub-tasks aligned to Atlassian ticketing standards, with each story linked to corresponding GitHub branches and pull requests.
Served as Scrum Master: facilitated sprint planning, daily standups, retrospectives, and sprint reviews; kept the team aligned and unblocked.
Leadership Role: Technical architect and hands-on engineer across two mission-critical banking platform trains (Train Prety & Train Snowpiercer) at BPI France. Drove infrastructure automation, Kubernetes migrations, and observability initiatives from design to production. ⭐ Employee of the Month, March 2024
Deployed and maintained dual cloud platforms (Train Prety & Train Snowpiercer) across Staging, Master-Staging, and Production on AWS EKS and Tanzu, ensuring zero-downtime releases throughout.
Spearheaded Kubernetes EKS migration across multiple environments; received commendations from tech leads, business analysts, and development teams.
Integrated Kafka ecosystem (Kafka Connect, Confluent, AKHQ) with Keycloak authentication replacing IBM ISAM, deployed a centralized AKHQ portal, and industrialized Kafka Connect on Tanzu with 25% efficiency improvement.
Engineered standardized Helm release chart framework and centralized Terraform IaC modules, reducing deployment complexity by 40% and improving consistency across environments.
Automated credential delivery to SAFE vault via AWS Lambda, implemented IAM credential rotation for AWS environments, and configured HSTS security — eliminating manual secret handling.
Implemented Datadog monitoring dashboards and proactive alerting across Tomcat/Tomee and all platform services, reducing incident response time by 35%.
Migrated Logstash VMs to Kubernetes containers (CaaS) across DEV, PPD, and PRD environments and authored full implementation documentation.
Led Chaos Engineering initiatives, resolved FluxCD GitOps challenges, and coordinated production MEPs (Mise en Production) at each sprint conclusion.
Technical ManagerKomutel
May 2022 - Mar 2023
Tunis, Tunisia | Public Safety & Emergency Systems
Expertise: AWS infrastructure, Terraform IaC, CI/CD automation, test automation, team leadership, mission-critical systems
Led QA engineering team of 6 engineers. Implemented comprehensive test automation frameworks achieving 30% productivity increase and 45% reduction in bug escape rate.
Architected production AWS infrastructure using Terraform infrastructure-as-code. Implemented automated provisioning and infrastructure drift detection reducing manual errors by 90%.
Engineered CI/CD pipeline automation with Jenkins declarative pipelines reducing release cycle from 2 weeks to 3 days.
Developed comprehensive testing strategy for mission-critical emergency systems ensuring 99.99% availability and regulatory compliance.
Engineered and operated enterprise DevOps platform serving 200+ developers. Maintained 99.9% SLA through automated monitoring and self-healing infrastructure.
Automated Software Factory upgrades across Docker Swarm clusters using Portainer orchestration reducing maintenance windows by 60% (from 8h to 3h).
Implemented Jenkins Configuration as Code (JCasC) with GitOps workflow across Dev, Staging, Production environments reducing configuration drift incidents by 85%.
Developed Python automation SDK leveraging REST APIs for RBAC auditing across DevOps toolchain automating permission reconciliation for 200+ users.
Architected complete platform infrastructure and DevOps automation tooling for enterprise banking applications serving 5M+ customers implementing infrastructure-as-code patterns and GitOps workflows.
Enabled 12 development squads to achieve continuous delivery through platform engineering implementing self-service deployment automation reducing deployment time from days to hours.
Engineered reusable Java application blueprints with embedded best practices deployed to Pivotal Cloud Foundry (Tanzu) using Concourse pipeline-as-code adopted by 15+ microservices.
Streamlined customer data ingestion workflows into SSAS platform, reducing processing time while providing expert support to internal stakeholders and clients.
Developed complex SQL queries on SQL Server for large-scale data extraction and ETL processes supporting internal and external reporting systems.
Automated metadata cleansing using Python and Excel, enhancing data quality and accelerating processing for the data operations team.
Designed and implemented data pipelines to optimize throughput and consistently meet client SLA commitments.
Collaborated with project managers on client calls, performing proactive data analysis to identify potential blockers and mitigate delays.
Partnered with offshore QA team in Kenya to identify process bottlenecks and implement operational improvements.
Leveraged Zendesk and Jira for issue tracking; developed JSON parsing solutions using PyCharm for supplier intelligence team.
Tech Stack: Bamboo, JFrog Artifactory, IBM Anthill Pro, Jenkins, Groovy, Gradle, Git, SVN, Ant, Ivy, WebSphere, Atlassian Suite, Crucible, SonarQube, Oracle DB, Java, Windows Server
Led migration of 15+ project repositories from IBM Anthill Pro to Bamboo CI/CD including binary artifact migration to JFrog Artifactory reducing build times by 35%.
Administered Windows-based development infrastructure for Java aerospace applications optimizing build tooling and CI pipelines for mission-critical fault-tolerant systems.
Deployed and validated enterprise media monitoring products for international clients across multiple regions ensuring successful global rollouts.
Administered and customized MediaVantage platform instances on UNIX/Linux systems tailoring configurations to meet specific client requirements.
Built custom BI reports using Pentaho and developed XML/XSLT transformations for automated, personalized email distribution systems serving thousands of users.
Education & Certifications
Bachelor's Degree in Computer Science, Software Engineering & Information Systems
AWS Certified Cloud Practitioner Amazon Web Services
Languages: English (Fluent) | French (Fluent) | Arabic (Native)