Top "Beautifulsoup" questions

Beautiful Soup is a Python package for parsing HTML/XML.

python - PipeMapRed.waitOutputThreads(): subprocess failed with code 1

Recently, I have been trying to parse websites, use BeautifulSoup to filter what I want, and write the result to a CSV file …

mapreduce beautifulsoup hadoop-streaming
Scrape the absolute URL instead of a relative path in python

I'm trying to get all the hrefs from some HTML code and store them in a list for future processing …

python beautifulsoup mechanize
Scraping a website with python 3 that requires login

Just a question regarding some scraping authentication. Using BeautifulSoup: #importing the requests lib import requests from bs4 import BeautifulSoup #specifying …

python python-3.x web-scraping beautifulsoup mechanicalsoup
How to change tag name with BeautifulSoup?

I am using python + BeautifulSoup to parse an HTML document. Now I need to replace all <h2 class="someclass"…

python html-parsing beautifulsoup
BeautifulSoup - lxml and html5lib parsers scraping differences

I am using BeautifulSoup 4 with Python 2.7. I would like to extract certain elements from a website (Quantities, see the example …

python web-scraping beautifulsoup lxml html5lib
How to return html of a page using robobrowser

I'm experimenting with http://robobrowser.readthedocs.org/en/latest/readme.html, a new python library based on the beautiful soup …

python django beautifulsoup robobrowser
Don't put html, head and body tags automatically, beautifulsoup

Using BeautifulSoup with html5lib, the html, head and body tags are added automatically: BeautifulSoup('<h1>FOO…

python beautifulsoup html5lib
XML to CSV Python

The XML data (file.xml) for the state looks like this: <?xml version="1.0" encoding="UTF-8" standalone="true"?> …

python-3.x pandas csv beautifulsoup xml.etree
Is there an OrderedDict comprehension?

I don't know if there is such a thing - but I'm trying to do an ordered dict comprehension. However …

python beautifulsoup
Getting BeautifulSoup to find a specific <p>

I'm trying to put together a basic HTML scraper for a variety of scientific journal websites, specifically trying to get …

python beautifulsoup html-content-extraction