Skip to content

Scaping of Beauty Products from different UK based beauty sites.

Notifications You must be signed in to change notification settings

kaflesudip/BeautyProductsScraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Beautyscraper

Scraping top 10 beauty e-commerce sites.

How do I get set up?

  • pip install requirements.txt
  • scrapy crawl <site_name>

Example of scraping:

scrapy crawl sephora -o sephora.json

View Sample Data

  1. Sample Data for this can be found on all_scraped_data folders.

List of sites to crawl

  • maccosmetics.com
  • beautybay.com
  • cultbeauty.co.uk
  • sephora.com
  • maybelline.co.uk
  • selfridges.com
  • Polyvore.com
  • net-a-porter.com
  • shopstyle.co.uk
  • beautylish.com

Libraries used:

The project runs on Python 2.7.

1. Scrapy
2. Pillow for saving images.
3. scrapy-fake-useragent for rotating browser headers.

About

Scaping of Beauty Products from different UK based beauty sites.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages