Commit

attardi committed Mar 7, 2022
2 parents 05b5cc7 + f4f9d2d commit f0ca16c
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion README.md
@@ -1,5 +1,5 @@
# WikiExtractor
-[WikiExtractor.py](http://medialab.di.unipi.it/wiki/Wikipedia_Extractor) is a Python script that extracts and cleans text from a [Wikipedia database backup dump](https://dumps.wikimedia.org/).
+[WikiExtractor.py](http://medialab.di.unipi.it/wiki/Wikipedia_Extractor) is a Python script that extracts and cleans text from a [Wikipedia database backup dump](https://dumps.wikimedia.org/), e.g. https://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2 for English.

The tool is written in Python and requires Python 3 but no additional library.
**Warning**: problems have been reported on Windows due to poor support for `StringIO` in the Python implementation on Windows.
2 changes: 1 addition & 1 deletion wikiextractor/WikiExtractor.py
@@ -68,7 +68,7 @@
# ===========================================================================

# Program version
-__version__ = '3.0.5'
+__version__ = '3.0.6'

##
# Defined in <siteinfo>
