Skip to content

Scraping and transforming job listings data from reed.co.uk

Notifications You must be signed in to change notification settings

Chris-Larkin/reed-job-listings

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 

Repository files navigation

reed-job-listings

This program scrapes and cleans job listings data from reed.co.uk for a given day. It crawls with a delay of 1 second between hits for primary data extraction exercise, so can take several hours to obtain all data. It extracts as columns:

applications_ten -> whether there have been ten or fewer applicants
job_country -> country where the job is located
job_description -> free text description of the job provided by the poster
job_locality -> city/town where the job is located
job_postcode -> postcode for where the job is located
job_region -> region where the job is located
job_type -> full-time, part-time, temporary, contract, etc.
job_type_disp -> job type as displayed to applicants
link -> URL to the job listing
salary_disp -> salary as displayed to the applicant
salary_max -> maximum salary
salary_min -> minimum salary
salary_time -> time unit over which salary is reported (hourly, monthly, yearly)

It outputs a pandas data frame called reed_data.

About

Scraping and transforming job listings data from reed.co.uk

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages