TMDB End-to-End ETL Pipeline
Automated ingestion from TMDB API, data transformation using Spark, and regression modeling for movie ratings.
Amazon Products End-to-End ETL Pipeline
End-to-end ETL pipeline and machine learning model to predict product prices using Spark and API ingestion.
Formula 1 End-to-End ETL Pipeline
ETL pipeline using Azure Databricks, Delta Lake, and ADF for ingestion and transformation with PySpark.
Steel Energy Spark-MLLib
Distributed ML model using PySpark to predict energy consumption with an R2 score of 99.5%.
Bank Churn Spark-MLLib
Large-scale churn classification using PySpark Spark-MLLib with 86%+ test accuracy.