About
Detail-oriented Data Analyst with 3+ years of experience working with large tabular and time-series datasets using Python (NumPy, Pandas) and applied statistics. Strong background in data cleaning, reconciliation, validation, and exploratory analysis to ensure data accuracy and reliability before reporting or modeling. Experienced in turning raw multi-source datasets into consistent, analysis-ready formats and communicating insights clearly to stakeholders.
Skills & Expertise
Work Experience
Data & Analytics Analyst
Accenture
Jul 2021 - Aug 2022
Processed and analyzed more than 1 million rows of structured sales and inventory data using Pandas, NumPy, and SQL, converting raw multi-source datasets into clean, well-structured formats ready for analysis and reporting.
Applied statistical techniques, including distribution analysis, correlation evaluation, and trend identification, to understand performance drivers and detect unusual behavior across large operational datasets.
Reconciled data across multiple reporting systems, identified mismatches in totals and metrics, and prepared detailed data validation and QA reports to ensure consistency and accuracy.
Conducted exploratory data analysis (EDA) on messy datasets, systematically handling missing values, duplicates, outliers, and formatting inconsistencies to improve data quality.
Developed reusable, well-documented Python analysis notebooks with built-in validation checks and sanity tests, maintained under Git version control for reproducibility and traceability.
Collaborated with business and technical stakeholders to define data definitions, metric calculations, and reporting standards, ensuring consistency across dashboards, reports, and analytical outputs.
Data Analyst / Applied ML Support
CVS Health
Apr 2024 - Jan 2026
Cleaned, reshaped, and validated large time-series healthcare operational datasets (patient, claims, and workflow data) using Pandas and NumPy before they were used for analysis and reporting.
Reconciled data across multiple healthcare sources, including operational reports, database extracts, and reporting systems, identifying inconsistencies and producing QA reports that reduced reporting errors by 45%.
Applied statistical analysis to identify trends, anomalies, and abnormal patterns in weekly datasets (10k+ records), helping teams detect irregular operational behavior early.
Built reusable, well-documented Python notebooks for continuous data validation, consistency checks, and automated sanity testing of incoming healthcare data.
Used SQL alongside Python to cross-verify aggregates, totals, and calculated metrics before results were shared with stakeholders.
Communicated data findings, discrepancies, and validation outcomes to business and technical teams, improving trust in and reliability of analytical outputs.
Education
Master of Science in Management Information Systems - Stevens Institute of Technology
2022 - 2023 · Afghanistan
Bachelor of Technology in Electrical and Electronics Engineering - Vardhaman College of Engineering
2017 - 2021 · Afghanistan
Certifications
Oracle Cloud Infrastructure 2025 Machine Learning Professional
Oracle · 2025
Oracle Cloud Infrastructure 2024 Generative AI Certified Professional
Oracle · 2024