Back to Developers
Anish Joshi

Anish Joshi

Data Scientist

Pune
80
Profile Score

About

Detail-oriented Data Science & Engineering fresher with practical experience in Big Data processing, predictive modeling, and cloud-based analytics. Proficient in EDA, ETL, SQL, and machine learning workflows, with a track record of building dashboards and analytical reports that enhance business understanding. Worked on 10+ data science projects, including cleaning and transforming real-world datasets and handling datasets of 100k+ rows, optimizing pipelines, and delivering actionable insights. Skill in Problem solving through data interpretation, with a strong foundation in statistics and a continuous learning mindset.

Skills & Expertise (46)

AWS Expert
9.2/10
7
Years Exp
Python Expert
9.0/10
7
Years Exp
Apache Spark Advanced
8.8/10
5
Years Exp
SQL Expert
8.7/10
7
Years Exp
Machine Learning Algorithms Advanced
8.5/10
5
Years Exp
Statistical Modeling Communication Skills Data Warehousing Data Pipelines Data Wrangling Data Visualization Feature Engineering Regression Classification A/B testing Business Problem Solving Requirements Gathering Data Storytelling Dashboarding ETL Workflow Understanding Data Quality Assessment Hypothesis Testing Model Evaluation Analytical Thinking Problem-solving IAM MySql NoSQL HBase PySpark Hadoop HDFS MapReduce S3 RDS EC2 Exploratory data analysis Power BI Excel Matplotlib Pandas NumPy Data Cleaning Data Ingestion Data Transformation Predictive Analytics

Work Experience

Data Scientist

AWS EMR

Present - Present

Developed a real-time fraud detection pipeline using Spark, Kafka, Hive, and HBase on AWS EMR, reducing lookup latency by ~40% through optimized NoSQL key-based retrievals. Automated end-to-end batch and streaming ingestion from MySQL (RDS) via Sqoop, increasing data throughput by 30% and enabling continuous fraud rule validation at scale.

Data Engineer

Python–Pandas

Present - Present

Engineered a Python–Pandas ETL pipeline that cleaned and standardized messy collision data, improving data quality by 95% and enabling reliable trend analysis. Conducted EDA to uncover high-risk zones and peak collision patterns, driving data-backed safety recommendations that improved insight accuracy by 30%.

Data Analyst

IMDB

Present - Present

Analyzed IMDB data using advanced SQL (CTEs, window functions) to identify genre-rating trends, improving content decision insights by 35%. Cleaned and validated multi-table datasets to raise reporting accuracy by 90%, enabling reliable dashboards and production-level business analytics.

Education

Executive PG Programme in Data Science & AI - IIIT-Bangalore

- 2025 · Afghanistan

B.Com - Modern College of Arts, Science & Commerce, Pune

- 2024 · Afghanistan

Interested in this developer?

Profile Score Breakdown

📷 Photo 10/10
📄 Resume 10/10
💼 Job Title 10/10
✍️ Bio 10/10
🛠️ Skills 20/20
🎓 Education 10/10
⏱️ Experience 5/15
💰 Rate 0/5
🏆 Certs 0/5
Verified 5/5
Total Score 80/100

Profile Overview

Member sinceFeb 2026

Availability Details

Visa Status

Citizen

Relocation

Depends on Offer

Skills (46)

AWS Python Apache Spark SQL Machine Learning Algorithms Statistical Modeling Communication Skills Data Warehousing Data Pipelines Data Wrangling +36 more