Scrape web pages in real time with Node.js


What's a good way to scrape website content using Node.js? I'd like to build something very, very fast that can execute searches in the style of, where one query is dispatched to several different sites, the results scraped, and returned to the client as they become available.

Let's assume that this script should just provide the results in JSON format, and we can process them either directly in the browser or in another web application.
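
To make the idea concrete, here is a rough sketch of the fan-out described above: one query goes to several sources in parallel, and each source's results are handed back as a JSON string as soon as that source finishes. Everything named here (makeSource, searchAll, parseHeadings, the injected fetchHtml function) is hypothetical and only for illustration. The regular-expression parser is a stand-in, and a real scraper would likely want a proper HTML parser.

```javascript
// Sketch only: all names below are hypothetical, not from any library.

// Build a source from a URL template and a parser. `fetchHtml` is injected so
// the sketch runs without network access; on Node 18+ it could be
// (url) => fetch(url).then((res) => res.text()).
function makeSource(name, urlFor, parse, fetchHtml) {
  return {
    name,
    search: async (query) => parse(await fetchHtml(urlFor(query))),
  };
}

// Dispatch `query` to every source in parallel. `onResult` is called once per
// source, in completion order, with a JSON string. A failing source produces
// an error payload instead of aborting the others. The returned promise
// resolves once every source has reported.
function searchAll(query, sources, onResult) {
  const tasks = sources.map((source) =>
    source
      .search(query)
      .then((results) => ({ source: source.name, ok: true, results }))
      .catch((err) => ({
        source: source.name,
        ok: false,
        error: String(err.message || err),
      }))
      .then((payload) => onResult(JSON.stringify(payload)))
  );
  return Promise.all(tasks);
}

// Naive parser for illustration: collects the trimmed text of <h2> elements.
function parseHeadings(html) {
  return Array.from(
    html.matchAll(/<h2[^>]*>(.*?)<\/h2>/gis),
    (match) => match[1].trim()
  );
}
```

In a server, onResult could write each string plus a newline to the HTTP response, so the client receives newline-delimited JSON and can render each site's results as they arrive.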

A few starting points:

Using node.js and jquery to scrape websites

Anybody have any ideas?

6/18/2012 5:00:20 PM seems to take the cake :-)

2/26/2014 4:28:24 AM

Licensed under: CC-BY-SA with attribution