Professional Experience


Research Assistant

University at Buffalo
Mar 2024 - Present

  • Performed experiments on wireless sensors, collected empirical data and migrated it to Azure Cosmos DB using Python scripts.
  • Generated data based on empirical data using Generative Adversarial Network for more comprehensive analytics.

Skills: RFID, Support Vector Machine (SVM), k-Nearest Neighbors, Generative Adverserial Network


Data Engineer Intern

Jerry DeFalco Advertisement
June 2024 - Dec 2023

  • Utilized normalization and data modeling techniques to design a production-grade database on a PostgreSQL server using Google Cloud's Alloy DB service ensuring data integrity.
  • Independently developed an ETL pipeline in Python (Pandas), scheduled on GCP Virtual Machines, to streamline data ingestion from Google Analytics 4 and other CRM sources into GCP AlloyDB, Improving data management efficiency.
  • Developed a StreamLit web application with Power BI dashboards to visualize marketing data for interactive analytical reports.
  • Automated email chains to send regular analytics to clients reducing the significant load and rework on a weekly basis.

Skills:Python (Pandas, Numpy, StreamLit), Google Analytics 4, PowerBI, Google Cloud Platform services like AlloyDB and Virtual Machines, ETL, PostgreSQL, Data Analytics, Data Modeling


Data Engineer

Impetus Technologies
Jan 2020 - Aug 2022

  • Enabled expansion of a finance project to include automobile financing by creating ETL pipelines as per the business needs.
  • Established automatic data ingestion utilizing Apache Nifi, AWS S3 as datalake, and AWS Glue jobs, to pick up auto-financial data in parquet format through APIs for various clients and ingest it to Snowflake.
  • Optimized prototype pipeline logic innovatively by employing parallelization, achieving a 20% runtime reduction.
  • Enhanced pipeline efficiency by appending disk cleanup steps, resulting in a 33% decrease in storage space usage per execution.
  • Formulated AWS CloudFormation scripts to automate cluster infrastructure provisioning, minimizing manual effort by 80%.
  • Led the development of a company-wide knowledge sharing platform using SharePoint, PowerApps, and PowerAutomate. This platform facilitated communication and idea exchange among 4000+ users.

Skills:Python (Pandas, Numpy), JavaScript, Java, Exploratory Data Analytics, Apache Nifi, Restful APIs, Springboot, Maven, Amaxon Web Services - S3, EC2, AWS Glue, AWS CloudFormation, AWS EMR, Git, CI/CD (Jenkins), Docker, ReactJS, Angular