diff --git a/README.md b/README.md
index 9a29776..bd5407c 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,5 @@
 # WikiExtractor
-[WikiExtractor.py](http://medialab.di.unipi.it/wiki/Wikipedia_Extractor) is a Python script that extracts and cleans text from a [Wikipedia database backup dump](https://dumps.wikimedia.org/).
+[WikiExtractor.py](http://medialab.di.unipi.it/wiki/Wikipedia_Extractor) is a Python script that extracts and cleans text from a [Wikipedia database backup dump](https://dumps.wikimedia.org/), e.g. https://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2 for English.
 
 The tool is written in Python and requires Python 3 but no additional library.
 **Warning**: problems have been reported on Windows due to poor support for `StringIO` in the Python implementation on Windows.
diff --git a/wikiextractor/WikiExtractor.py b/wikiextractor/WikiExtractor.py
index feab143..c195a19 100755
--- a/wikiextractor/WikiExtractor.py
+++ b/wikiextractor/WikiExtractor.py
@@ -68,7 +68,7 @@
 # ===========================================================================
 
 # Program version
-__version__ = '3.0.5'
+__version__ = '3.0.6'
 
 ##
 # Defined in