About
Data Engineer with 1.5 years of experience in building scalable data pipelines and analytics solutions using Python and SQL. Skilled in cloud-based architectures using AWS. Experienced in data visualization using Power BI and Tableau, with strong expertise in DAX, Power Query, and dashboard development. Adept at data cleaning, modeling, and transforming complex datasets into actionable insights to support business decisions.
Skills & Expertise (31)
Work Experience
Data Engineer
Dataeaze System
Jan 2021 - Jan 2023
Developed advanced DAX measures and calculated columns to enhance KPI tracking and reporting, resulting in improved data insights and better reporting efficiency. Cleaned and transformed raw datasets by resolving missing values and inconsistencies, enhancing overall data quality and improving dataset reliability across large volumes of data. Designed interactive and user-friendly dashboards using Power BI. Optimized SQL queries and data workflows to improve report performance.
Data Engineer
Lemma Project
Jan 2021 - Jan 2023
Built and maintained scalable ETL pipelines using Python, Spark, and SQL for integrating multi-source data. Integrated third-party data into MySQL by developing connectors for Beeswax, OpenMarket, and PubMatic platforms. Automated API-based data ingestion and daily report processing workflows. Managed database schema for tracking key metrics such as impressions, clicks, and revenue. Implemented CI/CD pipelines using AWS (CodePipeline, ECS, EC2) to automate deployment and reduce processing time. Created CI workflows and developed parallel CI/CD pipelines for Spark views on EC2, improving deployment efficiency. Integrated cloud and on-premise data sources using AWS services, improving data availability for analytics teams. Contributed to CiteData tool by developing features for compliance tracking and metadata management.
Data Engineer
50 Hertz Project
Jan 2021 - Jan 2023
Developed Python automation scripts using Selenium to extract data, download reports, and process files. Loaded and managed data in PostgreSQL ensuring accuracy and consistency. Applied web scraping and data transformation techniques to maintain seamless data flow. Built and deployed a Streamlit dashboard integrated with BigQuery and Google Cloud Storage (GCS) for visualization.
Education
Master in Computer Applications (MCA) - Fergusson College, Pune
- · Afghanistan