Back to Developers
Rafi Sonu

Rafi Sonu

Data Analyst

Hyderabad, Telangana
85
Profile Score

About

Detail-oriented Data Analyst with 3+ years of experience working extensively with large tabular and time-series datasets using Python (NumPy, Pandas) and applied statistics. Strong background in data cleaning, reconciliation, validation, and exploratory analysis to ensure high data accuracy and reliability before reporting or modeling. Experienced in turning raw multi-source datasets into consistent, analysis-ready formats and communicating insights clearly to stakeholders.

Skills & Expertise (25)

Pandas Advanced
8.8/10
4
Years Exp
NumPy Advanced
8.3/10
4
Years Exp
SQL Advanced
8.0/10
4
Years Exp
Jupyter Notebook Advanced
7.9/10
4
Years Exp
Excel Command Line Git Validation queries Aggregation Data Extraction Data integrity reporting Consistency Checks Cross-source validation Error metrics Distribution analysis correlation analysis Regression Analysis Hypothesis Testing Data Normalization Outlier Detection Missing value treatment Time-Series Analysis Large tabular data processing Seaborn Matplotlib

Work Experience

Data & Analytics Analyst

Accenture

Jul 2021 - Aug 2022

Processed and analyzed more than 1 million rows of structured sales and inventory data using Pandas, NumPy, and SQL, converting raw multi-source datasets into clean, well-structured formats ready for reliable analysis and reporting. Applied practical statistical techniques including distribution analysis, correlation evaluation, and trend identification to understand performance drivers and detect unusual data behavior across large operational datasets. Performed thorough data reconciliation across multiple reporting systems, identifying mismatches in totals and metrics, and preparing detailed data validation and QA reports to ensure consistency and accuracy. Conducted comprehensive exploratory data analysis (EDA) on messy datasets by systematically handling missing values, duplicates, outliers, and formatting inconsistencies to improve data quality. Developed reusable and well-documented Python analysis notebooks with built-in validation checks and sanity tests, all maintained under Git version control for reproducibility and traceability. Collaborated closely with business and technical stakeholders to define clear data definitions, metric calculations, and reporting standards, ensuring consistency across dashboards, reports, and analytical outputs.

Data Analyst / Applied ML Support

CVS Health

Apr 2024 - Jan 2026

Worked extensively with large time-series healthcare operational datasets using Pandas and NumPy to clean, reshape, and validate patient, claims, and workflow data before it was used for analysis and reporting. Performed detailed data reconciliation across multiple healthcare data sources such as operational reports, database extracts, and reporting systems, identifying inconsistencies and producing QA reports that reduced reporting errors by 45%. Applied statistical analysis to identify trends, anomalies, and abnormal patterns in weekly datasets (10k+ records), helping teams detect irregular operational behavior early. Built reusable and well-documented Python notebooks for continuous data validation, consistency checks, and automated sanity testing of newly incoming healthcare data. Used SQL alongside Python to cross-verify aggregates, totals, and calculated metrics, ensuring high data accuracy before results were shared with stakeholders. Clearly communicated data findings, discrepancies, and validation outcomes to business and technical teams, improving overall trust and reliability of analytical outputs.

Education

Master of Science in Management Information Systems - Stevens Institute of Technology

2022 - 2023 · Afghanistan

Bachelor of Technology in Electrical and Electronics Engineering - Vardhaman College of Engineering

2017 - 2021 · Afghanistan

Certifications

Oracle Cloud Infrastructure 2025 Machine Learning Professional

Oracle · 2025

Oracle Cloud Infrastructure 2024 Generative AI Certified Professional

Oracle · 2024

Interested in this developer?

Profile Score Breakdown

📷 Photo 10/10
📄 Resume 10/10
💼 Job Title 10/10
✍️ Bio 10/10
🛠️ Skills 20/20
🎓 Education 10/10
⏱️ Experience 5/15
💰 Rate 0/5
🏆 Certs 5/5
Verified 5/5
Total Score 85/100

Profile Overview

Member sinceMar 2026

Skills (25)

Pandas NumPy SQL Jupyter Notebook Excel Command Line Git Validation queries Aggregation Data Extraction +15 more