I’ve just updated SproutSearch’s blog spider. It now tries to fetch the RSS feed for a blog before parsing it with my HTML parser. This should save some CPU time and give me better results. It also gets the date and time of the blog’s latest post and some statistics about the number of words used. This new data will allow me to generate better pages in the future. I am also brainstorming some methods of data mining I can use to make SproutSearch a bit more interesting. With over 8 million blogs in the database there are lots of possibilities.
http://www.sproutsearch.com
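To give a rough idea of what pulling the latest post date and word statistics out of a feed can look like, here is a minimal sketch in Python. It is not the actual SproutSearch spider: the function name `summarize_feed` and the sample feed are made up for illustration, it assumes a plain RSS 2.0 document that has already been downloaded, and it only counts words in each item's title and description.

```python
import re
import xml.etree.ElementTree as ET
from datetime import timezone
from email.utils import parsedate_to_datetime


def summarize_feed(rss_xml):
    """Return (latest_post_datetime, total_word_count) for an RSS 2.0 string.

    Hypothetical helper for illustration; not SproutSearch's real code.
    """
    root = ET.fromstring(rss_xml)
    latest = None
    words = 0
    for item in root.iter("item"):
        pub = item.findtext("pubDate")
        if pub:
            try:
                when = parsedate_to_datetime(pub)
            except (TypeError, ValueError):
                when = None  # unparseable date: skip it, keep counting words
            if when is not None:
                if when.tzinfo is None:
                    # Assumption: treat dates without a zone as UTC
                    when = when.replace(tzinfo=timezone.utc)
                if latest is None or when > latest:
                    latest = when
        text = (item.findtext("title") or "") + " " + (item.findtext("description") or "")
        text = re.sub(r"<[^>]+>", " ", text)  # drop embedded HTML tags
        words += len(re.findall(r"\w+", text))
    return latest, words


# Made-up sample feed, used only to exercise the function above
SAMPLE = """<rss version="2.0"><channel><title>Example</title>
<item><title>First post</title><description>Hello &lt;b&gt;world&lt;/b&gt;</description><pubDate>Fri, 15 Jun 2007 10:00:00 +0000</pubDate></item>
<item><title>Second post</title><description>More words here</description><pubDate>Sat, 16 Jun 2007 02:02:00 +0000</pubDate></item>
</channel></rss>"""

latest, words = summarize_feed(SAMPLE)
```

If the feed is missing or fails to parse (`ET.fromstring` raises `ET.ParseError`), a spider along these lines could fall back to parsing the blog's HTML instead.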
This entry was posted on Saturday, June 16th, 2007 at 2:02 am and is filed under news.