About
DevOps & Site Reliability Engineer with 4 years of experience managing Linux-based production environments across AWS and Azure cloud platforms. Expertise in CI/CD automation, Infrastructure as Code (Terraform), Kubernetes orchestration, and cloud-native architecture design. Strong background in incident management, root cause analysis (RCA), system performance optimization, and high-availability implementations. Microsoft Certified (AZ-900, AZ-400) with proven success in improving deployment efficiency, reducing MTTR, and enhancing system reliability and security.
Skills & Expertise
Work Experience
Azure DevOps & Cloud Modernization
Microsoft
Present - Present
Built Azure DevOps YAML pipelines to automate build, test, and multi-stage deployments across environments. Deployed enterprise applications to Azure App Service and AKS using CI/CD automation. Executed blue-green and rolling deployment strategies to achieve zero-downtime releases. Monitored AKS and App Service workloads using Azure Monitor, Application Insights, and Log Analytics to optimize CPU and memory utilization. Configured alerts and dashboards to proactively identify performance bottlenecks and deployment failures. Managed Linux-based container workloads and performed system-level troubleshooting. Conducted root cause analysis (RCA) for recurring deployment and performance issues and automated remediation fixes. Collaborated with development and infrastructure teams to improve release reliability and reduce deployment failures by 35%. Provided L2/L3 production support in a 24/7 environment: monitored system health, responded to alerts, and resolved priority incidents, maintaining 99.9% uptime and meeting SLA targets.
Software Engineer
LTIMindtree
May 2022 - Present
Designed and provisioned AWS infrastructure using Terraform, including VPC, subnets, route tables, NAT Gateway, EC2, IAM roles, S3, ALB, and Auto Scaling groups. Automated end-to-end infrastructure provisioning and CI/CD workflows, reducing manual deployment effort by 99%. Administered Linux-based EC2 instances, including user management, patching, service configuration, and log analysis. Troubleshot production incidents involving CPU spikes, memory leaks, disk space issues, and application crashes, reducing MTTR by 30%. Established a high-availability architecture using Auto Scaling and load balancers, achieving 99.9% service uptime. Built reusable Terraform modules to standardize infrastructure across Dev, QA, and Production environments. Engineered CI/CD pipelines using Jenkins integrated with GitHub and Maven for automated builds and deployments. Containerized applications using Docker and managed the image lifecycle with AWS ECR. Deployed and managed workloads on Kubernetes clusters using YAML manifests with rolling updates and health checks. Implemented configuration management using Ansible playbooks for server and application automation. Established proactive monitoring and alerting using Prometheus, Grafana, and AWS CloudWatch, improving incident detection efficiency by 40%. Centralized log aggregation and troubleshooting using Kibana dashboards.
Education
Bachelor of Engineering – Electronics Engineering
2018 - 2021 · Afghanistan