Top "Html-parsing" questions

HTML parsing is the process of consuming a serialized HTML document and producing a representation that you can work with programmatically, for example to extract data from it.
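As a rough illustration of that definition, the sketch below feeds a small HTML string to Python's standard-library `html.parser.HTMLParser` and collects the `href` of every link. The class name `LinkCollector`, the helper `extract_links`, and the sample markup are hypothetical names chosen for this example; they do not come from any of the questions listed here.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href of every <a> start tag seen while parsing (illustrative helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag and attribute names lowercased;
        # attrs is a list of (name, value) pairs.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html_text):
    """Parse html_text and return the list of link targets in document order."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links


# Hypothetical sample input, including an upper-case tag to show case handling.
sample = '<p>See <a href="/docs">the docs</a> and <A HREF="https://example.com">this site</A>.</p>'
print(extract_links(sample))  # ['/docs', 'https://example.com']
```

This event-based parser is only one possible representation. Libraries such as BeautifulSoup or lxml build a full tree instead, which is usually more convenient for anything beyond simple extraction.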

Parsing HTML in Python

What's my best bet for parsing HTML if I can't use BeautifulSoup or lxml? I've got some code that uses …

python html-parsing
Parse the JavaScript returned from BeautifulSoup

I would like to parse the webpage http://dcsd.nutrislice.com/menu/meadow-view/lunch/ to grab today's lunch menu. (I've …

javascript python beautifulsoup html-parsing
Insert and parse HTML into view using AngularJS

What I know is when I want to insert HTML into view, I use 'ng-bind-html' or 'ng-bind-html-unsafe'. What I don't …

angularjs html-parsing
How can I use regular expression to grab an 'img' tag?

I want to grab an img tag from text returned from JSON data like that. I want to grab this …

regex image html-parsing
Simple html dom file_get_html not working - is there any workaround?

<?php // Report all PHP errors (see changelog) error_reporting(E_ALL); include('inc/simple_html_dom.php'); //base url $…

php html-parsing file-get-contents simple-html-dom
What does HTML Parsing mean?

I have heard of HTML Parser libraries like Simple HTML DOM and HTML Parser. I have also heard of questions …

html parsing html-parsing
Symfony DomCrawler: Find element with specific attribute value

I'm using the DomCrawler component: http://symfony.com/doc/current/components/dom_crawler.html I'd like to, using the CSS …

php dom symfony html-parsing
beautifulsoup: find_all on bs4.element.ResultSet object or list?

Hi, so I apply find_all on a beautifulsoup object and find something, which is a bs4.element.ResultSet object …

python html beautifulsoup html-parsing
Extracting information from a web page by machine learning

I would like to extract a specific type of information from web pages in Python. Let's say a postal address. It …

python machine-learning html-parsing web-scraping extract
PHP DOMDocument::loadHTML() [domdocument.loadhtml]: htmlParseEntityRef: no name in Entity

I'm trying to get the "link" elements from certain webpages. I can't figure out what I'm doing wrong though. I'm …

php html-parsing domdocument