  • Dublin, Ireland

AtharvaPatil-Data/README.md

LinkedIn Email


📌 About Me

🎓 MSc in Data Analytics (Dublin City University)
🧰 Building Data Analytics and Data Engineering projects that deliver measurable business impact
🇮🇪 Based in Ireland, targeting Data Analyst & Data Engineer roles
🧭 Currently learning Airflow • Spark • Azure Databricks


🛠 Skills

Programming Languages


Python

SQL

Data Analysis & Machine Learning


Pandas

NumPy

TensorFlow

Scikit-Learn

Data Engineering & Cloud


Azure

Databricks

Airflow

Data Visualisation


Power BI
Power Automate
Tableau
Excel

Other Tools


GitHub

Google Colab

VS Code

🚀 Projects

📊 Diabetic Retinopathy Cascade Classification

A two-stage cascaded deep learning framework using ResNet50 for accurate early and advanced diabetic retinopathy detection, trained on APTOS 2019 and Diabetic Retinopathy Resized datasets.
Tech: Python, TensorFlow, Pandas, NumPy
🔗 View Repository
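
As a rough illustration of the cascade idea, the sketch below routes one image through two decisions in sequence. It assumes stage one separates "no DR" from "DR" and stage two separates "early" from "advanced"; that split, the 0.5 thresholds, and the name `cascade_predict` are assumptions for illustration only and are not taken from the repository. The ResNet50 models are stood in for by plain probability inputs.

```python
def cascade_predict(p_disease: float, p_advanced: float,
                    t1: float = 0.5, t2: float = 0.5) -> str:
    """Route one image through a hypothetical two-stage cascade.

    p_disease:  stage-one probability that any retinopathy is present.
    p_advanced: stage-two probability that the case is advanced.
    t1, t2:     illustrative decision thresholds (assumed, not tuned).
    """
    # Stage 1: screen for any sign of retinopathy.
    if p_disease < t1:
        return "no_dr"
    # Stage 2: reached only by stage-one positives; grade severity.
    return "advanced" if p_advanced >= t2 else "early"


print(cascade_predict(0.2, 0.9))  # stage one rejects, stage two is ignored
print(cascade_predict(0.8, 0.3))
print(cascade_predict(0.8, 0.7))
```

The point of the structure is that the second model only ever sees cases the first model has already flagged, so each stage can specialise on a simpler decision.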


💳 Loan Defaulter Risk Model

Machine learning model to predict loan default risk using borrower profiles, credit history, and financial features.
Tech: Python, scikit-learn, imbalanced-learn, EDA
🔗 View Repository


📈 Flight Traffic Visualization

Visualizing the busiest airline routes (2015–2019) using Python and Tableau.
Tech: Tableau, Pandas, Matplotlib
🔗 View Repository


🛒 E-commerce Product Categorization

Hierarchical e-commerce product categorization using TF-IDF, SMOTE, and an LR/RF/LightGBM ensemble.
Tech: Python, scikit-learn, LightGBM
🔗 View Repository


📈 Impact Snapshots

  • 📌 0.99 recall on defaulters → fewer missed high-risk customers
  • ⚡ Automated cleaning scripts → ~40% faster preprocessing
  • 📊 Executive-ready dashboards for actionable decisions
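
For readers unfamiliar with the recall metric quoted above, here is a minimal sketch of how it is computed from confusion-matrix counts. The counts in the example are hypothetical and chosen only to produce 0.99; they are not the project's actual results.

```python
def recall(true_positives: int, false_negatives: int) -> float:
    """Share of actual positives (here: defaulters) that the model flags."""
    actual_positives = true_positives + false_negatives
    if actual_positives == 0:
        raise ValueError("recall is undefined when there are no actual positives")
    return true_positives / actual_positives


# Hypothetical counts: 200 actual defaulters, 198 flagged, 2 missed.
print(recall(true_positives=198, false_negatives=2))
```

High recall on the defaulter class means few false negatives, which is why it maps to "fewer missed high-risk customers"; it says nothing by itself about how many good borrowers are wrongly flagged.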

Pinned

  1. AtharvaPatil-Data Public

  2. Azure-Databricks-ETL-Loan-Pipeline Public

    Cloud ETL pipeline for LendingClub 2018Q4 loan data using Azure Databricks (Spark), ADLS Gen2, and Azure SQL. Includes notebooks, PySpark modules, and SQL scripts.

  3. Flight-Traffic-Visualization Public

    Visualizing the busiest airline routes (2015–2019) using Python and Tableau.

    Jupyter Notebook

  4. Loan-defaulter-risk-model Public

    Machine learning model to predict loan default risk using borrower profiles, credit history, and financial features.

    Jupyter Notebook

  5. Ecommerce-Product-Categorization Public

    Hierarchical e-commerce product categorization using TF-IDF, SMOTE, and an LR/RF/LightGBM ensemble (top-level) and Ridge (bottom-level).

    Jupyter Notebook

  6. Diabetic-Retinopathy-Cascade-Classification Public

    A two-stage cascaded deep learning framework using ResNet50 for accurate early and advanced diabetic retinopathy detection, trained on APTOS 2019 and Diabetic Retinopathy Resized datasets.

    Jupyter Notebook