TMDB ETL

TMDB End-to-End ETL Pipeline

Automated ingestion from TMDB API, data transformation using Spark, and regression modeling for movie ratings.

Amazon ETL

Amazon Products End-to-End ETL Pipeline

End-to-end ETL pipeline and machine learning model to predict product prices using Spark and API ingestion.

F1 ETL

Formula 1 End-to-End ETL Pipeline

ETL pipeline using Azure Databricks, Delta Lake, and ADF for ingestion and transformation with PySpark.

Steel Spark

Steel Energy Spark-MLLib

Distributed ML model using PySpark to predict energy consumption with an R2 score of 99.5%.

Bank Spark

Bank Churn Spark-MLLib

Large-scale churn classification using PySpark Spark-MLLib with 86%+ test accuracy.