20 Mar 11
An overview of different techniques to extract actual content from web pages.
27 Aug 10
Time Explorer is an application designed for analyzing how news changes over time. Time Explorer is designed to help users discover how entities such as people and locations associated with a query change over time. Second, by searching on time ex-pressions extracted automatically from text, the application allows the user to explore not only how topics evolved in the past, but also how they will continue to evolve in the future.
Pattern matching algorithm used in GNU Grep (see: http://lists.freebsd.org/pipermail/freebsd-current/2010-August/019310.html )