VK Scraper

vk-scraper is a command-line application written in Python that scrapes and downloads VK user's / community's data. Use responsibly.

To get a better understanding of how it works, head to the docs.

Features

Scrape photos
Scrape videos (both uploaded and external)
Scrape saved photos
Scrape stories

Install

Arch GNU/Linux

For the stable version (vk-scraper):

git clone https://aur.archlinux.org/vk-scraper.git vk-scraper

For the git version (vk-scraper-git):

git clone https://aur.archlinux.org/vk-scraper-git.git vk-scraper

Then build & install:

cd vk-scraper 
makepkg -sic

Or use an AUR helper of your choice.

Other distros

For the stable version:

$ pip3 install vk-scraper --upgrade --user

For the git version:

$ pip3 install git+https://github.com/vanyasem/VK-Scraper.git --upgrade --user

Usage

To scrape media:

vk-scraper <username/community> -u <your username> -p <your password>

By default, downloaded media will be placed in <current working directory>/<username>.

To specify multiple users/communities, pass a comma separated list of users:

vk-scraper username1,community1,username2,username3,community2

You can also supply a file containing a list of users/communities:

vk-scraper -f scrape_list.txt

$ cat vk_users.txt
username1
community1
username2
username3
community2
...

Usernames may be separated by newlines, commas, semicolons, or whitespace.

Arguments

--help -h             Show help message and exit

--login-user  -u      Your VK username

--login-pass  -p      Your VK password

--filename    -f      Path to a file containing a list of users/communities to scrape

--destination -d      Specify destination folder. By default, media will
                      be downloaded to <current working directory>/<username>

--retain-username -n  Creates a subdirectory for each scraped name when the flag is set

--media-types -t      Specify media types to scrape. Enter as space separated values.
                      Valid values are image, saved, video, story, wall, or none
                      (defaults to image)

--latest              Scrape only new media since the last scrape. Uses the last modified
                      time of the latest media item in the destination directory for comparasion

--quiet       -q      Be quiet while scraping

--maximum     -m      Maximum number of items to scrape

--offset      -o      Offset from which the scrape starts. 0 is from the oldest. (Defaults to 0)

Contribution

Check open issues, or open a new one to start a discussion around your idea or a bug you found
Fork the repository and make your changes
Send a pull request

Futurelog

Scrape by hashtag
Scrape by location
Save metadata to a file (likes, comments, etc)
Sort photos by their albums

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
vk_scraper		vk_scraper
.gitignore		.gitignore
DOCS.md		DOCS.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VK Scraper

Features

Install

Arch GNU/Linux

Other distros

Usage

Arguments

Contribution

Futurelog

About

Releases 5

Packages

Contributors 2

Languages

License

vanyasem/VK-Scraper

Folders and files

Latest commit

History

Repository files navigation

VK Scraper

Features

Install

Arch GNU/Linux

Other distros

Usage

Arguments

Contribution

Futurelog

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 5

Packages 0

Contributors 2

Languages

Packages