Top "Html-parsing" questions

HTML parsing is the process of consuming a serialization of an HTML document and producing a representation that you can work with programmatically — e.g., in order to extract data from it.

How to normalize HTML in JavaScript or jQuery?

Tags can have multiple attributes. The order in which attributes appear in the code does not matter. For example: <…

javascript jquery html html-parsing
BeautifulSoup findAll() given multiple classes?

I would like to scrape a list of items from a website, and preserve the order that they are presented …

python html beautifulsoup html-parsing
Parse html table using file_get_contents to php array

I am trying to parse the table shown here into a multi-dimensional php array. I am using the following code …

php html-parsing file-get-contents
Using regular expressions to parse HTML: why not?

It seems like every question on stackoverflow where the asker is using regex to grab some information from HTML will …

regex html-parsing
How to get HTML from a beautiful soup object

I have the following bs4 object listing: >>> listing <div class="listingHeader"> <h2> .... >&…

python html beautifulsoup html-parsing
Get text content of an HTML element using XPath?

See this html <div> <p> <span class="abc">Monitor</span> <b&…

html xml xpath html-parsing
Python BeautifulSoup scrape tables

I am trying to create a table scrape with BeautifulSoup. I wrote this Python code: import urllib2 from bs4 import …

python html web-scraping beautifulsoup html-parsing
How to convert the html object to string type?

I use jQuery method to get some type of html object: var content = $('#cke_ckeditor iframe').contents().find('.…

string html-parsing
ItextSharp Error on trying to parse html for pdf conversion

I was using the ItextSharp module to convert the below listed html in to a pdf page. <div style="…

c# itext html-parsing html-agility-pack xmlworker
Robust and Mature HTML Parser for PHP

How can one parse HTML/XML and extract information from it?

php html html-parsing