-
Niall Kennedy does a little reverse engineering and documents the current state of MyYahoo RSS feeds (not quite valid RSS, but then it’s not released for public use, either).
-
extended discussion on merits and issues of tag entry formats - spaces, commas, semicolons, quotes, etc as used on del.icio.us, flickr, and other tagging applications
-
Instructions for using the Yahoo shortcuts system, which appears to be a little like YubNub.
-
Microsoft Research social software projects on a range of topics including events, photos, collaboration, virtual worlds, etc.
-
Description of a programming language and implementation approach for describing and distributing parallellized tasks across commodity hardware, e.g. search analysis on the googleplex. PDF and HTML slides.
-
Interpreting the Data: Scientific Programming with Sawzall - Rob Pike, et al. Sawzall is a scripting language built on MapReduce to provide parallel processing services on very large data sets such as telephone directories, server logs, or collections of
-
Luiz Barroso, Jeffrey Dean, and Urs Hoelzle paper in IEEE Micro describing the hardware and software implementation approach at Google
-
Raymie Stata, Krishna Bharat, Farzin Maghoul paper at WWW9 describing an approach for extracting relevant terms from web page content and generating a term vector which can be used by other applications instead of scanning the full page to determine topi
-
Official home page, by Martin Porter. Porter stemming is used to remove assorted endings and variations between English language words that are mostly the same, i.e. search, searching, searches.
This entry was posted
on Thursday, December 22nd, 2005 at 12:19 am and is filed under Links.
You can follow any responses to this entry through the RSS 2.0 feed.
You can leave a response, or trackback from your own site.
Tags: none