Parser for Wikipedia

Boolean picture Boolean · Oct 8, 2010 · Viewed 8.8k times · Source

I downloaded a Wikipedia dump and I want to convert the wiki format into my object format. Is there a wiki parser available that converts the object into XML?

Answer

dogbane picture dogbane · Oct 8, 2010

See java-wikipedia-parser. I have never used it but according to the docs :

The parser comes with an HTML generator. You can however control the output that is being generated by passing your own implementation of the be.devijver.wikipedia.Visitor interface.