Professional Experience
Research Assistant
University at Buffalo
Mar 2024 - Present
- Performed experiments on wireless sensors, collected empirical data and migrated it to Azure Cosmos DB using Python scripts.
- Generated data based on empirical data using Generative Adversarial Network for more comprehensive analytics.
Skills: RFID, Support Vector Machine (SVM), k-Nearest Neighbors, Generative Adverserial Network
Data Engineer Intern
Jerry DeFalco Advertisement
June 2024 - Dec 2023
- Utilized normalization and data modeling techniques to design a production-grade database on a PostgreSQL server using Google Cloud's Alloy DB service ensuring data integrity.
- Independently developed an ETL pipeline in Python (Pandas), scheduled on GCP Virtual Machines, to streamline data ingestion from Google Analytics 4 and other CRM sources into GCP AlloyDB, Improving data management efficiency.
- Developed a StreamLit web application with Power BI dashboards to visualize marketing data for interactive analytical reports.
- Automated email chains to send regular analytics to clients reducing the significant load and rework on a weekly basis.
Skills:Python (Pandas, Numpy, StreamLit), Google Analytics 4, PowerBI, Google Cloud Platform services like AlloyDB and Virtual Machines, ETL, PostgreSQL, Data Analytics, Data Modeling
Data Engineer
Impetus Technologies
Jan 2020 - Aug 2022
- Enabled expansion of a finance project to include automobile financing by creating ETL pipelines as per the business needs.
- Established automatic data ingestion utilizing Apache Nifi, AWS S3 as datalake, and AWS Glue jobs, to pick up auto-financial data in parquet format through APIs for various clients and ingest it to Snowflake.
- Optimized prototype pipeline logic innovatively by employing parallelization, achieving a 20% runtime reduction.
- Enhanced pipeline efficiency by appending disk cleanup steps, resulting in a 33% decrease in storage space usage per execution.
- Formulated AWS CloudFormation scripts to automate cluster infrastructure provisioning, minimizing manual effort by 80%.
- Led the development of a company-wide knowledge sharing platform using SharePoint, PowerApps, and PowerAutomate. This platform facilitated communication and idea exchange among 4000+ users.
Skills:Python (Pandas, Numpy), JavaScript, Java, Exploratory Data Analytics, Apache Nifi, Restful APIs, Springboot, Maven, Amaxon Web Services - S3, EC2, AWS Glue, AWS CloudFormation, AWS EMR, Git, CI/CD (Jenkins), Docker, ReactJS, Angular