Releases: josecannete/spanish-corpora
Releases · josecannete/spanish-corpora
More corpora!
Added corpora to sum up to 3 Billion words!
First release
First release is a compilation of about 2.6B tokens of Spanish corpora from Wikis, ParaCrawl, EUBookshop, MultiUN, OpenSubtitles.