Top "Html-parsing" questions

HTML parsing is the process of consuming a serialization of an HTML document and producing a representation that you can work with programmatically — e.g., in order to extract data from it.

Android HTML ImageGetter as AsyncTask

Okay, I'm losing my mind over this one. I have a method in my program which parses HTML. I want …

android html-parsing drawable android-asynctask
Cleaning HTML by removing extra/redundant formatting tags

I have been using CKEditor wysiwyg editor for a website where users are allowed to use the HTML editor to …

php html dom html-parsing bbcode
DOMDocument in php

I have just started reading documentation and examples about DOM, in order to crawl and parse the document. For example …

php xml-parsing html-parsing domdocument
How can I get at the matches when using preg_replace in PHP?

I am trying to grab the capital letters of a couple of words and wrap them in span tags. I …

php regex html-parsing preg-replace
JavaScript DOM childNodes.length also returning number of text nodes

In JavaScript DOM, childNodes.length returns the number of both element and text nodes. Is there any way to count …

javascript html dom html-parsing
How to save back changes made to a HTML file using BeautifulSoup in Python?

I have the script below, which modifies href attributes in an HTML file (in the future, it will be a …

python html-parsing beautifulsoup
web scraping java beginner

I am new to Java, I would like to become really good in web scraping and parsing data Are there …

java web-scraping html-parsing webharvest htmlcleaner
Using HTMLParser in Python 3.2

I have been using HTML Parser to scrapping data from websites and stripping html coding whilst doing so. I'm aware …

python-3.x html-parsing arguments web-scraping stripping
HTML Agility Pack strip tags NOT IN whitelist

I'm trying to create a function which removes html tags and attributes which are not in a white list. I …

c# tags html-parsing html-agility-pack sanitize
How can I clean HTML tags out of a ColdFusion string?

I am looking for a quick way to parse HTML tags out of a ColdFusion string. We are pulling in …

regex coldfusion rss html-parsing coldfusion-8