Back to Developers
Prerna Gupta

Prerna Gupta

Data Engineer

Hyderabad, India 2+ yrs exp 85 · Excellent

About

Data engineer with expertise in ETL development, data transformation and cloud based data engineering using AWS and Python. Experienced in handling large multisource datasets building analytical dashboards, and deriving business insights using SQL and Power BI. Adept to improving data quality and pipeline efficiency, and decision making through scalable solutions.

Skills & Expertise (40)

Python Advanced
8.1/10
3
Years Exp
ETL Pipeline Development Advanced
8.0/10
3
Years Exp
AWS Advanced
7.8/10
2
Years Exp
Apache Spark Intermediate
7.5/10
2
Years Exp
Data Modelling Intermediate
7.3/10
3
Years Exp
NoSQL SQL Data Warehousing Vector Databases Amazon S3 AWS Glue AWS Lambda Amazon Redshift Distributed Data Processing Pandas NumPy scikit-learn Power BI Tableau Looker EDA KPI Monitoring Webhooks AI Agent Prompt Engineering LangChain ChromaDB Hugging Face OpenAI Workflow Automation Apache Airflow SQL Server Make.com Git GitHub AWS Management Console Excel tally Document Intelligence MySql

Work Experience

Data Annotator (Contract)

Innodata Inc.

Nov 2025 - Mar 2026

Annotated large volumes of both structured and unstructured datasets to create high-quality labeled data used for training NLP, computer vision and speech recognition models. Performed rigorous data quality assurance and annotation validation, reviewing in-production datasets against detailed labeling guidelines to maintain consistency, accuracy, and reliability of machine learning training data. Collaborated closely with data scientists and machine learning engineers to refine annotation schemas, resolving ambiguous labeling scenarios, and improve dataset preparation workflows for AI model development and evaluation.

Data Analyst

Community Dreams Foundation

Aug 2024 - Jul 2025

Built scalable ETL pipelines using Python, SQL, and AWS (Amazon S3, AWS Lambda AWS Glue, Amazon Redshift) to ingest and standardize sustainability datasets provided by multiple contractors and eco-conscious business operations. Transformed large operational datasets using Apache Spark via AWS Glue, performing data cleaning, schema standardization, and data transformation to create structured datasets suitable for downstream analytics and reporting. Developed interactive PowerBI dashboards and analytical reports to monitor 15+ operational and sustainability KPIs, identifying anomalies in energy consumption and delivering insights, resulting in a 20% reduction in environmental footprint across monitored sites.

Graduate Teaching Assistant

New Jersey Institute of Technology

Jan 2023 - May 2024

Facilitated coursework delivery on core and advanced machine learning algorithms, including linear regression, support vector machines, neural network, ensemble learning and dimensionality reduction. Evaluated and graded programming assignments, quizzes and exams ensuring compliance with grading rubrics and providing analytical feedback to enhance student comprehension of model evaluation, hyperparameter tuning and optimization techniques, and feature engineering. Supervised students’ projects involving real world machine learning applications guiding the implementation of algorithms such as neural networks, decision trees, and ensemble methods while ensuring best practices in secure model development and data integrity.

Education

Master’s Degree: Computer Science - New Jersey Institute of Technology

2022 - 2024 · Afghanistan

Bachelor’s Degree: Information Technology - KIET Group of Institutions

2015 - 2019 · Afghanistan

Certifications

No certifications added yet

Interested in this developer?

Profile Score Breakdown

📷 Photo 10/10
📄 Resume 10/10
💼 Job Title 10/10
✍️ Bio 10/10
🛠️ Skills 20/20
🎓 Education 10/10
⏱️ Experience 10/15
💰 Rate 0/5
🏆 Certs 0/5
Verified 5/5
Total Score 85/100

Profile Overview

Member sinceMay 2026