Top "Wikipedia" questions

Consider the tags wikipedia-api (or the more general mediawiki-api) and mediawiki.

Where to find a "bug-free" HTML-to-wiki converter

While googling for it, I stumbled upon html2wiki, which seems to do the job (will try after done posting …

wiki wikipedia
Wikipedia Category Hierarchy from dumps

Using Wikipedia's dumps, I want to build a hierarchy for its categories. I have downloaded the main dump (enwiki-latest-pages-articles) and …

mediawiki categories dump wikipedia wikipedia-api
How to extract information from a Wikipedia infobox?

There is this fancy infobox in <some Wikipedia article>. How do I get the value of <this …

wikipedia dbpedia wikipedia-api structured-data wikidata
Wikipedia: Java library to remove Wikipedia text markup

I downloaded a Wikipedia dump and now want to remove the Wikipedia markup from the contents of each page. I tried …

java parsing wiki wikipedia
Content of infobox of Wikipedia

I need to get the content of an infobox of any movie. I know the name of the movie. One …

python mediawiki wikipedia pywikibot
Wikipedia API full-text search to return articles with title, snippet and image

I've been looking for a way to query the Wikipedia API based on a search string for a list of …

mediawiki wikipedia wikipedia-api mediawiki-api
API to retrieve info about famous people

I'm looking for some callable way to get information about famous people and celebrities. Given a string, I'd like to …

api wikipedia wikipedia-api
Am I allowed to use Wikipedia content?

I'm always confused when reading licenses... I want to implement (to be honest, the implementation is already done) a commercial iPad …

licensing wikipedia
How can I get a Wikipedia article's text using Python 3 with Beautiful Soup?

I have this script made in Python 3: response = simple_get("https://en.wikipedia.org/wiki/Mathematics") result = {} result["url"] = url …

python html web-scraping beautifulsoup wikipedia
Extract the main article text from a Wikipedia page using Python

I've been searching for hours on how to extract the main text of a Wikipedia article, without all the links …

python parsing extract wikipedia