About
Results-driven Data Engineer with hands-on experience designing and deploying scalable ETL/ELT pipelines on the Microsoft Azure Data Platform. Proficient in PySpark, Azure Databricks, Azure Data Factory (ADF), ADLS Gen2, and Delta Lake, with demonstrated expertise processing 250GB+ datasets. Skilled in Medallion Architecture, Delta Live Tables (DLT), Star Schema dimensional modeling, and incremental batch processing. Delivers analytics-ready, high-quality data assets integrated with Power BI for business intelligence and KPI reporting.
Skills & Expertise (31)
Work Experience
Data Engineer Intern
CNH Industrial
Dec 2025 - Present
Designed and deployed scalable ETL pipelines using PySpark on Azure Databricks to process 250GB+ datasets; applied window functions for deduplication and time-based analytics, improving pipeline performance by ~25%. Implemented Medallion Architecture (Bronze → Silver → Gold) with Delta Lake, enabling data reliability, schema evolution, and incremental loading across all pipeline stages. Automated data quality validation using Delta Live Tables (DLT); applied forward-fill and backward-fill imputation strategies, improving downstream data quality by 25%+. Designed Star Schema dimensional models (fact and dimension tables) integrated with Power BI dashboards for real-time KPI tracking and operational trend reporting for cross-functional stakeholders. Orchestrated end-to-end pipeline runs via Azure Data Factory (ADF) for reliable data ingestion from source systems into the Azure Lakehouse environment.
Education
Bachelor of Technology — Information Technology - Ambalika Institute of Management and Technology
2021 - 2025 · Afghanistan
Certifications
No certifications added yet
Interested in this developer?
Profile Score Breakdown
Profile Overview
Availability Details
Visa Status
Need Sponsorship
Relocation
Depends on Offer
Skills (31)
Click a skill to find developers with the same skill