Top "Screen-scraping" questions

Screen-scraping, also known as web-scraping or data-scraping, is a software technique used to collect and parse information from user interfaces.

How can I read and parse the contents of a webpage in R

I'd like to read the contents of a URL (e.q., http://www.haaretz.com/) in R. I am wondering …

html r screen-scraping html-content-extraction
Website scraping using jquery and ajax

I want to be able to manipulate the html of a given url. Something like html scraping. I know this …

javascript jquery ajax screen-scraping
file_get_contents() give me 403 Forbidden

I have a partner that has created some content for me to scrape. I can access the page with my …

php html http-headers screen-scraping
Screen-scraping a windows application in c#

I need to scrape data from a windows application to run a query in another program. Does anyone know of …

c# .net screen-scraping
scrape websites with infinite scrolling

I have written many scrapers but I am not really sure how to handle infinite scrollers. These days most website …

python screen-scraping scraper
Python regular expression for HTML parsing (BeautifulSoup)

I want to grab the value of a hidden input field in HTML. <input type="hidden" name="fooId" value="12…

python regex screen-scraping
What's a good tool to screen-scrape with Javascript support?

Is there a good test suite or tool set that can automate website navigation -- with Javascript support -- and …

javascript screen-scraping
Best way for a beginner to learn screen scraping by Python

This might be one of those questions that are difficult to answer, but here goes: I don't consider my self …

python screen-scraping beautifulsoup lxml scrapy
Using Python and Mechanize to submit form data and authenticate

I want to submit login to the website Reddit.com, navigate to a particular area of the page, and submit …

python networking screen-scraping mechanize