
Monday, January 7, 2013

Scrapy and ScrapingHub from Insophia, a Python software company

Scrapy is a web crawling and scraping tool written in Python.
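
To give a rough idea of what the "scraping" part involves, here is a minimal sketch that pulls the links out of a page. It uses only Python's standard library and is not Scrapy's own API. The names LinkExtractor, extract_links and SAMPLE_HTML are my own, made up for illustration, and the sample HTML is hypothetical. A crawler would then go on to fetch each extracted link in turn.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collect absolute URLs from the href attributes of <a> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the URL of the page.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return all link targets found in the given HTML text."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


# Hypothetical page content, used here instead of a live HTTP request.
SAMPLE_HTML = '<p><a href="/doc/">Docs</a> <a href="http://example.com/">Ex</a></p>'

if __name__ == "__main__":
    for link in extract_links(SAMPLE_HTML, "http://scrapy.org/"):
        print(link)
```

A tool like Scrapy wraps this kind of extraction together with the fetching, scheduling and output handling, so you do not have to write those parts by hand.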

(I had blogged about Pholcidae, another Python web crawling library, recently.)

http://scrapy.org/

It is from Insophia, a South American software company based in Montevideo, Uruguay, that works mainly with Python.

I had come across Scrapy and Insophia some time ago, and saw them again recently.

http://insophia.com/

They have also created a site, ScrapingHub, that facilitates the use of their web scraping products and services, and that offers a couple more related services:

http://scrapinghub.com/

http://scrapinghub.com/scrapy-cloud.html

http://scrapinghub.com/autoscraping.html

Also see:

http://en.m.wikipedia.org/wiki/Scrapy

https://readthedocs.org/projects/scrapy/

http://pravin.insanitybegins.com/posts/writing-a-spider-in-10-mins-using-scrapy

http://milinda.pathirage.org/2012/03/13/recursively_scraping_blog_with_scrapy/

- Vasudev Ram
www.dancingbison.com
Python, Linux and open source consulting.