Hello, I'm

Ahmed Soria

DevOps Engineer

I build and operate the platforms that keep production running.

8+ years designing and managing scalable cloud and on-premises infrastructure — Kubernetes, AWS, VMware, Terraform, and Ansible.

Ahmed Soria — DevOps Engineer

About Me

I'm a DevOps Engineer with over 8 years of experience designing and managing scalable cloud and on-premises infrastructure. My work spans AWS, Kubernetes, VMware virtualization, and the automation tooling that ties it all together — Terraform, Ansible, and CI/CD pipelines that teams can rely on.

I've worked across telecom, cybersecurity, and technology companies — from managing 200+ Linux servers at Huawei to deploying hardened Kubernetes clusters and VMware environments at Oryxlabs. I'm the person teams call when something breaks, and the one who builds the systems so it doesn't happen again.

My approach is practical: understand the problem deeply, build the right solution, automate what should be automated, and optimize for performance, security, and cost efficiency. I care about reliability, clean infrastructure, and mentoring teams to drive operational excellence.

Core Skills & Expertise

Technologies and platforms I work with daily.

Cloud

  • AWS
  • Azure
  • EKS
  • EC2
  • S3
  • IAM
  • VPC

Virtualization

  • VMware vCenter / vSphere
  • Harvester HCI
  • OpenShift Virtualization
  • FusionSphere OpenStack

Containers & Orchestration

  • Kubernetes
  • OpenShift
  • Tanzu
  • EKS
  • Docker
  • Helm

Infrastructure as Code & Automation

  • Terraform
  • Ansible
  • Shell Scripting
  • GitOps

CI/CD

  • ArgoCD
  • GitHub Actions

Databases & Data Platforms

  • PostgreSQL
  • PostgreSQL Citus
  • OpenSearch
  • ClickHouse
  • Trino
  • Kafka

Observability & Operations

  • Nagios
  • Centreon
  • Monitoring Pipelines
  • Incident Recovery
  • Platform Troubleshooting

Security & Infrastructure

  • Security Hardening
  • Linux/Unix Administration
  • Server Management (Dell R740/R640)
  • Veritas Cluster
  • Red Hat HA Addon

Experience

A timeline of my professional journey in infrastructure and platform engineering.

DevOps Engineer

January 2022 — Present

Oryxlabs

Abu Dhabi, UAE

Leading infrastructure and platform operations across Kubernetes, VMware, and AWS environments. Responsible for cluster lifecycle management, infrastructure automation, CI/CD pipeline architecture, and production operations.

  • Deployed and optimized VMware vCenter environments for secure, scalable virtualization
  • Led hardened Kubernetes cluster deployments (EKS, Tanzu, OpenShift) ensuring high availability and compliance
  • Automated infrastructure provisioning with Terraform and Ansible, reducing deployment time and configuration drift
  • Administered and scaled PostgreSQL with Citus for high-performance distributed databases
  • Managed AWS cloud infrastructure with a focus on security, cost optimization, and hybrid integration
  • Built and maintained secure CI/CD pipelines with ArgoCD and GitHub for rapid, reliable releases
  • Deployed and managed Harvester HCI to consolidate workloads and improve infrastructure resilience
  • Operated OpenShift Virtualization to unify VM and container management in secure environments
  • Upgraded and managed Dell R740/R640 servers, improving performance, stability, and security

DevOps Engineer

October 2020 — December 2021

Malcrove

Dubai, UAE

Built and maintained cloud infrastructure and CI/CD pipelines across Azure and on-premise environments, automating deployments and improving system reliability.

  • Administered and maintained Unix/Linux systems, ensuring smooth daily operations and stability
  • Deployed and managed computational resources on Azure Cloud, supporting scalable workloads
  • Designed and implemented automated deployment pipelines using Ansible to improve efficiency
  • Ensured scalability and reliability of Kubernetes clusters in production environments
  • Installed and configured Dell physical servers (R740, R640, R440, R340) to optimize infrastructure performance
  • Managed VMware clusters to maintain high availability and optimize system performance

Linux/Unix System Engineer

September 2017 — December 2020

Huawei Technologies

Khartoum, Sudan

Managed large-scale Linux server infrastructure, performed system administration, and supported critical telecom systems with automation and monitoring.

  • Performed daily administration, configuration, and troubleshooting of Unix/Linux infrastructure and servers
  • Automated configuration management for 200+ Linux servers using Ansible, improving efficiency and consistency
  • Managed virtual environments and Linux projects on VMware and FusionSphere OpenStack platforms
  • Supported multiple critical systems (Billing, EVD, Remedy, ESB, ERP) by resolving complex tickets and performing root cause analysis
  • Administered server clusters for high availability using Veritas Cluster and Red Hat HA Addon
  • Deployed and maintained monitoring solutions including Nagios, Centreon, and MX for proactive system oversight
  • Conducted UNIX/Linux security hardening, auditing, and ensured compliance with best practices

Teaching Assistant

January 2018 — May 2018

University of Bahri

Khartoum, Sudan

Supported academic programs in networking and Linux systems, delivering practical lab sessions and training.

  • Assisted in configuring Asterisk, an open-source VoIP solution
  • Conducted practical lab sessions and a Linux boot camp
  • Provided training on deploying and troubleshooting VoIP systems

Education

Bachelor of Information Technology

University of Bahri

Khartoum, Sudan
May 2017GPA: 3.42

Featured Projects & Technical Work

Selected infrastructure and platform engineering work.

Hardened Kubernetes Platform Deployment

Problem

The organization needed hardened, compliant Kubernetes clusters across multiple platforms to support production workloads with high availability.

Solution

Led deployment of hardened Kubernetes clusters on EKS, Tanzu, and OpenShift with ArgoCD for GitOps-based delivery, ensuring compliance and high availability across environments.

Impact

Achieved stable multi-cluster operations with automated deployments, compliance enforcement, and reduced incident response time.

KubernetesEKSTanzuOpenShiftArgoCDHelm

VMware vCenter & Harvester HCI Infrastructure

Problem

Infrastructure needed consolidation and modernization to improve resilience, scalability, and management of virtualized workloads.

Solution

Deployed and optimized VMware vCenter environments alongside Harvester HCI to consolidate workloads, and operated OpenShift Virtualization to unify VM and container management.

Impact

Improved infrastructure resilience, simplified workload management, and unified VM and container operations.

VMware vCenterHarvester HCIOpenShift VirtualizationDell R740/R640

Infrastructure Automation with Terraform & Ansible

Problem

Manual infrastructure provisioning was slow, error-prone, and led to configuration drift across environments.

Solution

Built modular Terraform configurations for AWS infrastructure and Ansible playbooks for configuration management, reducing deployment time and eliminating drift.

Impact

Reduced environment provisioning from days to under an hour with consistent, repeatable infrastructure.

TerraformAnsibleAWSEKSIAMVPC

CI/CD Pipeline Architecture with ArgoCD & GitHub

Problem

Legacy deployment processes were manual, slow, and risky — lacking proper environment promotion and rollback capabilities.

Solution

Built secure CI/CD pipelines using GitHub Actions for build and ArgoCD for GitOps-based deployment, with proper environment promotion and rollback workflows.

Impact

Deployment frequency increased while rollback time dropped from hours to minutes, enabling rapid and reliable releases.

GitHub ActionsArgoCDDockerHelmKubernetes

Large-Scale Linux Automation & Monitoring

Problem

Managing 200+ Linux servers manually was inefficient and led to inconsistencies across the fleet.

Solution

Automated configuration management with Ansible across the entire server fleet and deployed monitoring solutions (Nagios, Centreon) for proactive system oversight.

Impact

Improved fleet consistency, reduced manual effort significantly, and enabled proactive incident detection.

AnsibleLinuxNagiosCentreonVeritas ClusterRed Hat HA

PostgreSQL Citus Distributed Database Operations

Problem

Growing data volumes required a high-performance distributed database solution that could scale horizontally.

Solution

Administered and scaled PostgreSQL with Citus extension for distributed query processing, ensuring high performance and reliability for production workloads.

Impact

Achieved horizontal scalability for database workloads with improved query performance and operational reliability.

PostgreSQLCitusLinuxMonitoring

Community & Knowledge Sharing

I'm an active member of the technical community in Sudan and the broader region. From running the technical operations behind conferences and workshops to mentoring engineers — community work is a core part of who I am.

SDNOG — Sudan Network Operators Group

Active member and technical office lead at SDNOG. Responsible for maintaining all technical infrastructure during conferences and workshops, as well as organizing community events.

Visit Website

IEEE Sudan Subsection

Served as technical support for the IEEE Sudan Subsection, handling all technical requirements during community events, workshops, and activities.

Visit Website

Teaching & Mentoring

Conducted Linux boot camps and practical lab sessions at University of Bahri. Mentored junior engineers on infrastructure best practices and troubleshooting methodologies.

Get in Touch

Whether you have a question, an opportunity, or just want to connect — I'd be happy to hear from you.