Scrapy is a web crawling and scraping tool written in Python.
(I had blogged about Pholcidae, another Python web crawling library, recently.)
http://scrapy.org/
It is from Insophia, a South American software company, in Montevideo, Uruguay, that works mainly with Python.
I had come across Scrapy and Insophia some time ago. Saw them again recently.
http://insophia.com/
They have also created a site, ScrapingHub, with a couple more related services:
http://scrapinghub.com/
http://scrapinghub.com/scrapy-cloud.html
http://scrapinghub.com/autoscraping.html
, that facilitates the use of their web scraping products and services.
Also see:
http://en.m.wikipedia.org/wiki/Scrapy
https://readthedocs.org/projects/scrapy/
http://pravin.insanitybegins.com/posts/writing-a-spider-in-10-mins-using-scrapy
http://milinda.pathirage.org/2012/03/13/recursively_scraping_blog_with_scrapy/
- Vasudev Ram
www.dancingbison.com
Python, Linux and open source consulting.