Skills & Expertise (20)
Work Experience
Data Pipeline & Statistical Analysis
Government Job Analytics
Present - Present
Built an end-to-end data pipeline that scrapes 1,000+ job listings with Python (Requests, BeautifulSoup), processing 15+ fields per listing across paginated results. Engineered 6+ features, including salary normalization, qualification standardization, and regex-based role/sector extraction from unstructured job titles. Processed 650+ valid salary records (handling ~35% missing values) across 8 sectors and 10+ role categories. Showed through multivariate EDA that role hierarchy, not education level, is the primary driver of salary variation, with PSU and Banking sectors dominating the high-pay bands. Applied 99th-percentile trimming to separate the core market from elite salary segments, and reported medians rather than means because the salary distribution is strongly right-skewed.
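The regex-based role extraction and the 99th-percentile split described above could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the role keywords, the function names, and the linear-interpolation percentile are all assumptions made for the example.

```python
import re
import statistics

# Hypothetical role keywords; the project's real patterns are not shown in the profile.
ROLE_PATTERN = re.compile(r"\b(manager|officer|engineer|clerk|assistant)\b", re.IGNORECASE)


def extract_role(title):
    """Return the first matching role keyword in lowercase, or 'other'."""
    match = ROLE_PATTERN.search(title)
    return match.group(1).lower() if match else "other"


def percentile(values, q):
    """Linear-interpolation percentile (q in [0, 100]) of a non-empty list."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (pos - lower)


def split_core_and_elite(salaries, q=99):
    """Drop missing salaries (None), then split the rest at the q-th percentile."""
    valid = [s for s in salaries if s is not None]
    cutoff = percentile(valid, q)
    core = [s for s in valid if s <= cutoff]
    elite = [s for s in valid if s > cutoff]
    return core, elite


def summarize(salaries):
    """Median of the core segment; the median is robust to right-skewed outliers."""
    core, elite = split_core_and_elite(salaries)
    return {"core_median": statistics.median(core), "elite_count": len(elite)}
```

With a handful of extreme salaries in the data, the mean of the full set is pulled far above a typical salary, while the median of the trimmed core segment stays close to it.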
SQL Business Insights
Chinook Database
Present - Present
Analyzed 8 relational tables covering 1,000+ transactions, 50+ customers, 3,500+ tracks across 20+ genres and 15+ countries to answer 11 business questions. Identified top revenue-generating city and country using multi-table joins across Customer, Invoice, Track, and Artist tables with SUM and COUNT aggregations. Applied window functions (RANK) with CTEs to determine top-spending customer per country and most popular genre per region. Uncovered a Pareto-style revenue pattern where a small customer segment contributed a disproportionate share of total sales. Delivered 11 business recommendations covering customer retention, regional marketing, and genre-based targeting.
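A top-spending-customer-per-country query of the kind described above can be sketched with a CTE and RANK, run here through Python's built-in sqlite3 module (window functions need SQLite 3.25 or newer). The two-table schema and the sample rows are simplified stand-ins for the Chinook Customer and Invoice tables, and the helper names are hypothetical.

```python
import sqlite3

# CTE 1 aggregates spend per customer; CTE 2 ranks customers within each country.
QUERY = """
WITH customer_spend AS (
    SELECT c.Country, c.CustomerId, c.FirstName, SUM(i.Total) AS total_spent
    FROM Customer AS c
    JOIN Invoice AS i ON i.CustomerId = c.CustomerId
    GROUP BY c.Country, c.CustomerId, c.FirstName
),
ranked AS (
    SELECT *,
           RANK() OVER (PARTITION BY Country ORDER BY total_spent DESC) AS spend_rank
    FROM customer_spend
)
SELECT Country, FirstName, total_spent
FROM ranked
WHERE spend_rank = 1
ORDER BY Country;
"""


def build_demo_db():
    """Create an in-memory database with a tiny, made-up sample."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE Customer (CustomerId INTEGER PRIMARY KEY, FirstName TEXT, Country TEXT);
        CREATE TABLE Invoice (InvoiceId INTEGER PRIMARY KEY, CustomerId INTEGER, Total REAL);
        INSERT INTO Customer VALUES (1, 'Asha', 'India'), (2, 'Ravi', 'India'), (3, 'Lena', 'Germany');
        INSERT INTO Invoice (CustomerId, Total) VALUES (1, 10.0), (1, 5.0), (2, 12.0), (3, 7.5);
        """
    )
    return conn


def top_customers(conn):
    """Return (country, first_name, total_spent) for the top spender(s) per country."""
    return conn.execute(QUERY).fetchall()
```

RANK keeps ties, so two customers with the same total in one country would both be returned; ROW_NUMBER would pick exactly one instead.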
Power BI Dashboard
Telangana Weather Data Analysis
Present - Present
Integrated and cleaned 10 source files (9 CSV + 1 XLSX) using Python (pandas) and Power Query, resolving data type errors across 7 numeric weather parameters. Analyzed 2,25,527 records across 35 districts and 588 mandals covering January 2021 – December 2024. Built 15+ DAX measures, including TOPN/MAXX patterns for dynamic KPI cards, seasonal filters, and year-on-year comparisons that account for partial 2024 data. Found that 86.6% of annual rainfall is concentrated in 4 monsoon months, with a 6.4× gap between the highest-rainfall district (Warangal Rural) and the lowest (Hanumakonda). Designed a district-level risk classification framework identifying flood-prone, drought-prone, and heat-stressed zones across 8 interactive dashboard pages.
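The two headline figures above (monsoon share of rainfall and the highest-to-lowest district ratio) reduce to simple aggregations. The sketch below shows the arithmetic in plain Python rather than DAX; the row layout, the function names, and the choice of June to September as the four monsoon months are assumptions, since the profile does not name the months.

```python
from collections import defaultdict

# Assumption: the four monsoon months are June-September.
MONSOON_MONTHS = {6, 7, 8, 9}


def monsoon_share(rows):
    """Percentage of total rainfall that falls in the monsoon months.

    Each row is a (district, month_number, rainfall_mm) tuple.
    """
    total = sum(rain for _, _, rain in rows)
    monsoon = sum(rain for _, month, rain in rows if month in MONSOON_MONTHS)
    return 100 * monsoon / total if total else 0.0


def district_gap(rows):
    """Ratio of the wettest district's total rainfall to the driest district's."""
    totals = defaultdict(float)
    for district, _, rain in rows:
        totals[district] += rain
    positive = [value for value in totals.values() if value > 0]
    return max(positive) / min(positive)
```

For example, with rows for two made-up districts where one receives twice the rainfall of the other, `district_gap` returns 2.0.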
Education
Data Science With GenAI - Innomatics Research Institute
2025 - · Afghanistan
B.Tech in CSE (AI & ML) - Presidency University, Bangalore
2021 - 2025 · Afghanistan
Certifications
SQL For Data Analysis
Innomatics Research Labs · 2026
Python For Data Science
Innomatics Research Labs · 2025