Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Write an ethical scraper to get IMDB IDs for each episode of a show #3

Open
havanagrawal opened this issue Jul 29, 2019 · 0 comments
Open
Assignees

Comments

@havanagrawal
Copy link
Owner

IMDB IDs are a great way to connect the Wikidata entries to another dataset. A potential design is something that scrapes the IMDB page for the main show (for any show that has an IMDB title on Wikidata), looking for episodes of different season, for example:

https://www.imdb.com/title/tt1898069/episodes?season=1

We then join this on the episode list from Wikidata using title/label on a season basis, and then make an update.

@havanagrawal havanagrawal self-assigned this Aug 24, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant