In this project a simple pipeline is created using Luigi. The pipeline consists of three tasks:
- Extracting the data from
data.txt
file in thedata
folder and store it inextract.csv
file. - Transform and load the data in
tranform.csv
file.
I've added 10seconds delay in the tasks to see the pipeline visualization.
Python version 3.12.0
First install the dependencies using requirements.txt
file.
pip install -r requirements.txt
Then run the pipeline using the following command:
PYTHONPATH='.' luigi --module etl_helper StartETL --local-scheduler
First run this command:
luigid
Then open http://localhost:8082
in your browser.
After that run the previous command without --local-scheduler
option.
PYTHONPATH='.' luigi --module etl_helper StartETL