-
Notifications
You must be signed in to change notification settings - Fork 0
Converts the Wikipedia Data Dump from XML format to a tab delimited (or comma delimited, etc.) file format.
bcollier/XML-2-Delimiter
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Still under development, probably not ready for public use. Also not well documented. But, take a look at the example config file wpdatawork.cfg. The only one that needs to exist is the stub-meta-history file, the rest are created with the names of your choosing. Run the program with python xml2delimiter.py See progress at the log file (that you configured in wpdatawork.cfg). REQUIREMENTS: BeautifulSoup Progress Bar
About
Converts the Wikipedia Data Dump from XML format to a tab delimited (or comma delimited, etc.) file format.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published