Scrape web pages in real time with Node.js

Avishai · Mar 6, 2011 · Viewed 44.2k times

What's a good way to scrape website content using Node.js? I'd like to build something very, very fast that can execute searches in the style of kayak.com, where one query is dispatched to several different sites, the results are scraped, and they are returned to the client as they become available.

Let's assume that this script should just provide the results in JSON format, and we can process them either directly in the browser or in another web application.
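To make the idea concrete, here is a rough sketch of the fan-out I have in mind, not a finished implementation. The names `fakeScraper` and `dispatchQuery` are made up for illustration, and the scrapers only simulate latency with timers; real ones would fetch a page and parse its HTML. Each site's result is handed back as a JSON string as soon as that site finishes, and one site failing does not hold up the others.

```javascript
// Hypothetical stand-in for a real scraper: takes a query and resolves to an
// array of results after a simulated delay. A real one would fetch and parse HTML.
function fakeScraper(site, delayMs) {
  return (query) =>
    new Promise((resolve) =>
      setTimeout(
        () => resolve([{ site, title: `${query} result from ${site}` }]),
        delayMs
      )
    );
}

// Run every scraper concurrently and report each outcome as soon as it settles.
// onResult receives one JSON string per site, in completion order.
function dispatchQuery(query, scrapers, onResult) {
  const tasks = Object.entries(scrapers).map(([site, scrape]) =>
    Promise.resolve()
      .then(() => scrape(query))
      .then(
        // Success: pass along whatever the scraper found.
        (results) => onResult(JSON.stringify({ site, ok: true, results })),
        // Failure: report the error for this site only.
        (err) =>
          onResult(
            JSON.stringify({
              site,
              ok: false,
              error: String((err && err.message) || err),
            })
          )
      )
  );
  // Resolves once every site has reported, whether it succeeded or failed.
  return Promise.all(tasks);
}
```

In a server, `onResult` could write each JSON string to the open response (one per line, for example) so the browser can render results as they arrive; the returned promise tells you when to end the response.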

A few starting points:

Using node.js and jquery to scrape websites

Anybody have any ideas?

Answer

Avishai · Mar 12, 2011

Node.io seems to take the cake :-)