pip install -r requirements.txt
scrapy crawl <site_name>
scrapy crawl sephora -o sephora.json
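
A file written with `-o sephora.json` is a single JSON array of scraped items, so it can be inspected with a few lines of Python. The following is a minimal sketch rather than part of the project: `load_items` and `count_fields` are hypothetical helper names, and the fields in each item depend on the spider.

```python
import io
import json


def load_items(path):
    """Read a Scrapy JSON feed export (one JSON array) into a list of dicts."""
    with io.open(path, encoding="utf-8") as handle:
        return json.load(handle)


def count_fields(items):
    """Count how many items carry each field name."""
    counts = {}
    for item in items:
        for key in item:
            counts[key] = counts.get(key, 0) + 1
    return counts
```

For example, `count_fields(load_items("sephora.json"))` gives a quick view of which fields the spider filled in and how often. The helpers use only the standard library and avoid syntax that is specific to Python 3.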
- Sample data from these spiders can be found in the all_scraped_data folder.
- maccosmetics.com
- beautybay.com
- cultbeauty.co.uk
- sephora.com
- maybelline.co.uk
- selfridges.com
- polyvore.com
- net-a-porter.com
- shopstyle.co.uk
- beautylish.com
The project runs on Python 2.7.
1. Scrapy
2. Pillow for saving images.
3. scrapy-fake-useragent for rotating browser headers.
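
The last two dependencies are normally wired in through the Scrapy project's `settings.py`. The fragment below is a sketch of a typical configuration, not a copy of this project's settings: the priority values and the `IMAGES_STORE` directory are assumptions.

```python
# settings.py (sketch): the values below are assumptions, not this project's actual settings.

# Disable Scrapy's default User-Agent middleware and let
# scrapy-fake-useragent pick a random browser User-Agent per request.
DOWNLOADER_MIDDLEWARES = {
    "scrapy.downloadermiddlewares.useragent.UserAgentMiddleware": None,
    "scrapy_fake_useragent.middleware.RandomUserAgentMiddleware": 400,
}

# Enable Scrapy's built-in image pipeline, which needs Pillow installed.
ITEM_PIPELINES = {
    "scrapy.pipelines.images.ImagesPipeline": 1,
}

# Hypothetical directory where downloaded images are written.
IMAGES_STORE = "images"
```

With the image pipeline enabled, Scrapy downloads the URLs listed in each item's `image_urls` field and records the results in its `images` field, so the spiders' items need both fields.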