Back to Developers
Rihas Raheem

Rihas Raheem

Data Scientist

Alappuzha, Kerala, India
80
Profile Score

About

Data Scientist with over 4 years of experience in data-driven roles, machine learning NLP, Retrieval-Augmented Generation (RAG), and Computer Vision. Specialized in retrieval optimization and end-to-end ML pipelines from data ingestion to lightweight deployment.

Skills & Expertise (28)

NLP Advanced
9.0/10
4
Years Exp
Python Advanced
9.0/10
4
Years Exp
Semantic Search Advanced
8.8/10
4
Years Exp
Feature Engineering Advanced
8.6/10
4
Years Exp
RAG Advanced
8.5/10
1
Years Exp
NumPy Advanced
8.5/10
4
Years Exp
Pandas Advanced
8.5/10
4
Years Exp
EDA Advanced
8.5/10
4
Years Exp
Exploratory data analysis Advanced
8.5/10
4
Years Exp
TensorFlow Advanced
8.5/10
4
Years Exp
scikit-learn Advanced
8.5/10
4
Years Exp
Model Evaluation Advanced
8.4/10
4
Years Exp
SBERT Advanced
8.2/10
1
Years Exp
Google Colab Advanced
8.0/10
4
Years Exp
OpenCV Advanced
8.0/10
4
Years Exp
SQL Advanced
8.0/10
4
Years Exp
FAISS Intermediate
8.0/10
1
Years Exp
FastAPI Intermediate
7.8/10
1
Years Exp
BM25 Intermediate
7.8/10
1
Years Exp
Advanced Excel Intermediate
7.8/10
4
Years Exp
SQLAlchemy Intermediate
7.5/10
1
Years Exp
Streamlit Intermediate
7.5/10
1
Years Exp
YOLOv8 Intermediate
7.5/10
1
Years Exp
ByteTrack Intermediate
7.5/10
1
Years Exp
Deepsort Intermediate
7.5/10
1
Years Exp
RAGAS Intermediate
7.5/10
1
Years Exp
ChromaDB Intermediate
7.5/10
1
Years Exp
LlamaIndex Intermediate
7.5/10
1
Years Exp

Work Experience

Academic Assistant

SMEG Edulabs

Oct 2020 - Mar 2022

Designed and implemented a facial recognition-based attendance system using OpenCV, enabling automated image capture and CSV-based attendance logging for classroom environments. Built a SMS spam detection model using TF-IDF and Scikit-learn, achieving ~96% precision in filtering malicious messages during offline evaluation. Mentored students on academic and mini-projects, guiding data preprocessing, EDA, logic building, and result interpretation using Python.

Trainer

EduBridge India

Jan 2020 - Jun 2020

Trained 100+ students in data analysis and Excel-based numerical reasoning. Delivered hands-on sessions using VLOOKUP, PivotTables, and conditional formulas.

Data Scientist

Scipy Technologies

Apr 2025 - Present

Achieved 90%+ faithfulness and context recall in RAG pipelines by implementing RAGAS-based validation and systematic retrieval tuning. Improved document retrieval quality by identifying and eliminating near-duplicate content across 400+ PDFs using TF-IDF and SBERT embeddings, leading to more consistent grounding and reduced retrieval noise. Designed and validated a CPU-only real-time vision pipeline under edge deployment constraints, achieving ~8.9 FPS while preserving stable multi-person tracking in live streams. Diagnosed data leakage and class imbalance during exploratory analysis and introduced corrective feature engineering strategies, improving model robustness as measured by Precision, Recall, and F1-score.

Enterprise Development Executive

District Industries Centre

Apr 2022 - Mar 2025

Analyzed a ~10,000-row enterprise dataset to support identification, evaluation, and onboarding of 500+ enterprises, contributing to full target achievement across multiple development schemes. Built Excel-based dashboards (Pivot Tables, formulas) to track enterprise status, fund utilization, and scheme-wise performance for district-level monitoring.

Education

M.Tech, Signal Processing - Government Engineering College Barton Hill, Thiruvananthapuram

2017 - 2019 · Afghanistan

B.Tech, Electronics and Communication Engineering - College of Engineering and Management Punnapra, Alappuzha

2013 - 2017 · Afghanistan

Interested in this developer?

Profile Score Breakdown

📷 Photo 10/10
📄 Resume 10/10
💼 Job Title 10/10
✍️ Bio 10/10
🛠️ Skills 20/20
🎓 Education 10/10
⏱️ Experience 5/15
💰 Rate 0/5
🏆 Certs 0/5
Verified 5/5
Total Score 80/100

Profile Overview

Member sinceDec 2025

Skills (28)

NLP Python Semantic Search Feature Engineering RAG NumPy Pandas EDA Exploratory data analysis TensorFlow +18 more