Top "Scrapy" questions

Scrapy is a fast open-source high-level screen scraping and web crawling framework written in Python used to crawl websites and extract structured data from their pages.

How to drop a collection with pymongo?

I use scarpy to crawl data and save it to cloud hosting mLab successfully with MongoDB. My collection name is …

python scrapy pymongo
How to set different scrapy-settings for different spiders?

I want to enable some http-proxy for some spiders, and disable them for other spiders. Can I do something like …

scrapy
scrapy: convert html string to HtmlResponse object

I have a raw html string that I want to convert to scrapy HTML response object so that I can …

python web-scraping scrapy
scrapy xpath select elements by classname

I have followed How can I find an element by CSS class with XPath? which gives the selector to use …

xpath web-scraping scrapy scrapy-shell
Scrapy - parse a page to extract items - then follow and store item url contents

I have a question on how to do this thing in scrapy. I have a spider that crawls for listing …

python scrapy
How to disable or change the path of ghostdriver.log?

Question is straightfoward, but some context may help. I'm trying to deploy scrapy while using selenium and phantomjs as downloader. …

scrapy phantomjs ghostdriver
get lastweek dates using python?

I am trying to get the date of the last week with python. if date is : 10 OCT 2014 means It should …

python datetime python-3.x scrapy python-datetime
Can Scrapy work on PHP?

Can I use Scrapy on PHP or are there similar tools that work with PHP? I am not a technical …

php web-scraping scrapy bookmarklet
How to bypass Incapsula with Python

I use Scrapy and I try to scrape this site that uses Incapsula <meta name="robots" content="noindex,nofollow"&…

python scrapy recaptcha incapsula
How to use CrawlSpider from scrapy to click a link with javascript onclick?

I want scrapy to crawl pages where going on to the next link looks like this: <a href="#" onclick="…

javascript python onclick scrapy web-scraping