luftdatencrawler

The luftdatencrawler is designed to cyclicly crawl particulate matter (PM) data from luftdaten.info

API: http://api.luftdaten.info/static/v1/data.json

It is supposed to run on a AWS cloud VM (EC2) instance and save data to the dynamoDB.

How to run

First instantiate a dynamoDB and a Amazon EC2 VM on AWS.
Edit app.env.sample with your credentials and rename to app.env.
Connect to the VM via ssh and edit crontab -e. Enter */30 * * * * bash -c '(cd /home/ubuntu/luftdatencrawler && source ./app.env && python3 ./luftdatencrawler.py) >/tmp/out.err 2>&1 and the crawler runs every 30 min and saves data to the dynamoDB instance.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.gitignore		.gitignore
README.md		README.md
app.env.sample		app.env.sample
delete_data.py		delete_data.py
luftdatencrawler.py		luftdatencrawler.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

luftdatencrawler

How to run

About

Releases

Packages

Languages

clellmann/luftdatencrawler

Folders and files

Latest commit

History

Repository files navigation

luftdatencrawler

How to run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages