Skip to content

Releases: josecannete/spanish-corpora

More corpora!

18 Jun 03:48
f057fc4
Compare
Choose a tag to compare

Added corpora to sum up to 3 Billion words!

First release

17 Jun 02:59
2dc650f
Compare
Choose a tag to compare

First release is a compilation of about 2.6B tokens of Spanish corpora from Wikis, ParaCrawl, EUBookshop, MultiUN, OpenSubtitles.