Information retrieval Python Example November 06, 2011
I often find businesses hide their contact details behind layers of navigation. I guess they want to cut down their support costs.
This wastes my time, so I use this snippet to automate extracting the available emails:
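The snippet itself is not reproduced in this excerpt. As a rough illustration of the idea, here is a minimal sketch that assumes the page HTML has already been downloaded into a string. The function name `extract_emails` and the regular expression are my own assumptions for this example, and the pattern is deliberately loose rather than a full address validator.

```python
import re
from html import unescape

# Loose pattern for typical addresses; an assumption for this sketch,
# not a complete RFC 5322 validator.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(html):
    """Return unique email addresses found in an HTML string.

    Addresses are lowercased and returned in order of first appearance.
    """
    # Decode entities first so lightly obfuscated addresses
    # such as "info&#64;example.com" are also matched.
    text = unescape(html)
    found = []
    for match in EMAIL_RE.findall(text):
        email = match.lower()
        if email not in found:
            found.append(email)
    return found


if __name__ == "__main__":
    sample = '<a href="mailto:Sales@Example.com">Sales</a> or support&#64;example.com.'
    print(extract_emails(sample))
```

To cover contact details buried behind navigation, the same function could be applied to each page reached by following a site's internal links.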
Information retrieval October 11, 2011
In a previous post I showed a tool for automatically extracting article summaries. Recently I came across a free online service from instapaper.com that does an even better job.
Here is one of my blog articles:
Information retrieval October 06, 2010
I made my own version of this technique to extract article summaries.
Source code can be found here.
The idea is simple: extract the biggest text block. Despite its simplicity, it performs well.
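The linked source is not reproduced here, so the following is only a minimal sketch of the biggest-text-block idea using the standard library's `html.parser`. The names `BlockCollector` and `biggest_text_block`, and the choice of which tags count as block boundaries, are assumptions made for this example.

```python
from html.parser import HTMLParser

# Tags treated as block boundaries (an assumption for this sketch).
BLOCK_TAGS = {"p", "div", "td", "li", "article", "section", "blockquote", "pre"}
# Tags whose contents are never article text.
SKIP_TAGS = {"script", "style"}


class BlockCollector(HTMLParser):
    """Collect the text of each block-level element as a separate string."""

    def __init__(self):
        super().__init__()
        self.blocks = []
        self._current = []
        self._skip_depth = 0

    def flush(self):
        # Join the buffered fragments and collapse runs of whitespace.
        text = " ".join("".join(self._current).split())
        if text:
            self.blocks.append(text)
        self._current = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self.flush()

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth:
            self._skip_depth -= 1
        elif tag in BLOCK_TAGS:
            self.flush()

    def handle_data(self, data):
        if not self._skip_depth:
            self._current.append(data)


def biggest_text_block(html):
    """Return the longest block of text in the page, or "" if there is none."""
    parser = BlockCollector()
    parser.feed(html)
    parser.close()
    parser.flush()
    return max(parser.blocks, key=len, default="")
```

Inline tags such as `<b>` or `<a>` do not split a block, so a paragraph with links still counts as one candidate. Measuring blocks by character count is the simplest choice; a word count or a link-density penalty would be easy variations.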
Here are some test results:
http://www.nytimes.com/2010/03/23/technology/23google.html?_r=1