Popular "beautifulsoup" questions | Page 12

I'm trying to remove all the html/javascript using bs4, however, it doesn't get rid of javascript. I still see …

python beautifulsoup nltk

I'm trying to scrape data from the public site asx.com.au The page http://www.asx.com.au/asx/…

python angularjs web-scraping beautifulsoup urllib2

I'm trying to decode HTML entries from here NYTimes.com and I cannot figure out what I am doing wrong. …

python unicode character-encoding content-type beautifulsoup

I am trying to load a html-page and output the text, even though i am getting the webpage correctly, BeautifulSoup …

python encoding utf-8 beautifulsoup mojibake

I noticed something odd about when working with BeautifulSoup and couldn't find any documentation to support this so I wanted …

python beautifulsoup

I have this: dates = soup.findAll("div", {"id" : "date"}) However, I need id to be a wildcard search since the …

python beautifulsoup

I want to select all the divs which have BOTH A and B as class attributes. The following selection soup.…

python beautifulsoup

The webpage is something like this: <h2>section1</h2> <p>article</p> &…

python find beautifulsoup scrape siblings

I'm using BeautifulSoup. I have to find any reference to the <div> tags with id like: post-#. For …

python beautifulsoup

I have the script below, which modifies href attributes in an HTML file (in the future, it will be a …

python html-parsing beautifulsoup

Top "Beautifulsoup" questions