Back to Developers
Rebecca Datta

Rebecca Datta

Junior Data Science Engineer

Howrah
80
Profile Score

About

Analytical thinker with a strong foundation in data science, passionate about transforming raw data into impactful business insights. Proven expertise in delivering end-to-end data solutions that turn complex problems into smart outcomes.

Skills & Expertise (25)

Python Intermediate
7.1/10
1
Years Exp
Data Visualization Intermediate
6.8/10
1
Years Exp
SQL Intermediate
6.5/10
1
Years Exp
Machine Learning Intermediate
6.3/10
1
Years Exp
Linear Regression OCR n8n BeautifulSoup Playwright Selenium Zyte Scrapy Positive Attitude Adaptability Team Player Storytelling Business Analysis Data Modeling Data Processing Office Package Jupyter Notebook MySql Tableau Power BI Excel

Work Experience

Data Science Intern

SkyQuest Technology Consulting Pvt. Ltd.

Jun 2025 - Jan 2026

Collected, cleaned, and processed large volumes of structured data from news portals using Scrapy, Zyte, Selenium, Playwright, and BeautifulSoup to ensure reliable and accurate datasets. Built automated n8n workflows to fetch, transform, and store organizational data into structured Excel reports, enabling faster and more efficient data reporting. Implemented OCR-based automation systems to extract product information from images and perform product comparison using COA (Certificate of Analysis) for quality assessment. Worked on a Retrieval-Augmented Generation (RAG) model where a dedicated database of earnings call transcripts was created using data scraped through Scrapy with Zyte. The system enables automatic summarization of complete transcripts, extraction of key insights, and intelligent question-answering based on user queries over the transcript data.

Junior Data Science Engineer

SkyQuest Technology Consulting Pvt. Ltd.

Jan 2026 - Present

Engineered scalable web-scraping pipelines using Scrapy, Zyte, Selenium, Playwright, and BeautifulSoup to collect, clean, and normalize large volumes of structured news data, improving data accuracy and downstream analysis reliability. Automated ETL workflows in n8n to fetch, transform, validate, and export organizational data into structured Excel dashboards, reducing manual reporting effort and turnaround time. Designed and deployed OCR-based automation systems to extract product attributes from images and perform COA (Certificate of Analysis)-driven product comparison, enabling faster and more consistent quality assessment. Developed a Retrieval-Augmented Generation (RAG) pipeline backed by a custom earnings call transcript database scraped via Scrapy and Zyte, enabling automated transcript summarization, key insight extraction, and contextual Q&A, significantly improving financial research efficiency.

Data Science Intern

Codsoft

Apr 2025 - May 2025

Acquired practical experience in data preprocessing by cleaning and transforming datasets to ensure accuracy and consistency for analysis. Applied feature engineering techniques to enhance model performance by selecting and transforming key variables. Created data visualizations using Python to effectively communicate insights and trends to stakeholders. Led end-to-end data analysis projects, including building, training, and evaluating machine learning models to drive actionable business outcomes.

Education

PGP + MBA – Business Analytics & Data Science - Bengal Institute of Business Studies – Vidyasagar University

2024 - Present · Afghanistan

B.TECH – Electrical Engineering - Calcutta Institute of Engineering and Management – MAKAUT

- 2022 · Afghanistan

WBCHSE – Science - Santragachi Kedarnath Institution for Girls’

- 2018 · Afghanistan

WBBSE - Bantra Tarasundari Balika Vidyabhaban

- 2016 · Afghanistan

Interested in this developer?

Profile Score Breakdown

📷 Photo 10/10
📄 Resume 10/10
💼 Job Title 10/10
✍️ Bio 10/10
🛠️ Skills 20/20
🎓 Education 10/10
⏱️ Experience 5/15
💰 Rate 0/5
🏆 Certs 0/5
Verified 5/5
Total Score 80/100

Profile Overview

Member sinceMar 2026

Skills (25)

Python Data Visualization SQL Machine Learning Linear Regression OCR n8n BeautifulSoup Playwright Selenium +15 more