About
Data Scientist with over 4 years of experience in data-driven roles, machine learning NLP, Retrieval-Augmented Generation (RAG), and Computer Vision. Specialized in retrieval optimization and end-to-end ML pipelines from data ingestion to lightweight deployment.
Skills & Expertise (28)
Work Experience
Academic Assistant
SMEG Edulabs
Oct 2020 - Mar 2022
Designed and implemented a facial recognition-based attendance system using OpenCV, enabling automated image capture and CSV-based attendance logging for classroom environments. Built a SMS spam detection model using TF-IDF and Scikit-learn, achieving ~96% precision in filtering malicious messages during offline evaluation. Mentored students on academic and mini-projects, guiding data preprocessing, EDA, logic building, and result interpretation using Python.
Trainer
EduBridge India
Jan 2020 - Jun 2020
Trained 100+ students in data analysis and Excel-based numerical reasoning. Delivered hands-on sessions using VLOOKUP, PivotTables, and conditional formulas.
Data Scientist
Scipy Technologies
Apr 2025 - Present
Achieved 90%+ faithfulness and context recall in RAG pipelines by implementing RAGAS-based validation and systematic retrieval tuning. Improved document retrieval quality by identifying and eliminating near-duplicate content across 400+ PDFs using TF-IDF and SBERT embeddings, leading to more consistent grounding and reduced retrieval noise. Designed and validated a CPU-only real-time vision pipeline under edge deployment constraints, achieving ~8.9 FPS while preserving stable multi-person tracking in live streams. Diagnosed data leakage and class imbalance during exploratory analysis and introduced corrective feature engineering strategies, improving model robustness as measured by Precision, Recall, and F1-score.
Enterprise Development Executive
District Industries Centre
Apr 2022 - Mar 2025
Analyzed a ~10,000-row enterprise dataset to support identification, evaluation, and onboarding of 500+ enterprises, contributing to full target achievement across multiple development schemes. Built Excel-based dashboards (Pivot Tables, formulas) to track enterprise status, fund utilization, and scheme-wise performance for district-level monitoring.
Education
M.Tech, Signal Processing - Government Engineering College Barton Hill, Thiruvananthapuram
2017 - 2019 · Afghanistan
B.Tech, Electronics and Communication Engineering - College of Engineering and Management Punnapra, Alappuzha
2013 - 2017 · Afghanistan