Project Overview
The project began as a single Jupyter notebook used for experimentation. Once the modeling approach was validated, the notebook was refactored into a production-style pipeline composed of independent Python modules.
Key ideas demonstrated:
-
Separation of concerns
-
Script-based execution instead of notebooks
-
Deterministic, repeatable pipelines using DVC
-
Parameter-driven experimentation
Tools used:
-
Python
-
DVC
-
Git & GitHub
-
Cookiecutter Data Science