-
Article by Raul Valdes-Perez, CEO of Vivisimo on why personalized search won’t work, at least for now.
-
Nice list of services and resources to consider if you’re putting together a small startup today
-
Good summary of advice for prospective independent consultants
-
Long thread discussing various issues around search engine indexing, duplicate detection, and more
-
by Junghoo Cho, Hector Garcia-Molina, Lawrence Page. A Stanford paper on some of what later became Google.
-
by Page, Lawrence; Brin, Sergey; Motwani, Rajeev; Winograd, Terry - 1998 Stanford paper on Google
-
2002 paper by Taher H. Haveliwala at Stanford extending PageRank approach to multiple dimensions
-
Internet Mathematics Vol. 1, No. 3: 335-380 Amy N. Langville and Carl D. Meyer. Detailed 46-page overview of PageRank and search analysis.
-
Andras A. Benczur, Karoly Csalogany, Tamas Sarlos, Mate Uher - Proposes a SpamRank metric based on personalized pagerank and local pagerank distribution of linking sites.
-
Franco Scarselli, Ah Chung Tsoi, Markus Hagenbuchner (2004)
-
(Extended abstract) Paolo Boldi, Massimo Santini, Sebastiano Vigna. Discussion of tradeoffs between various site crawling and page discovery approaches.
-
A detailed look at the value and costs of reputation and some speculation on how much it costs to purchase higher ranking through spam, link brokering, etc
-
When one iteration is sufficient K.Avrachenkov, N. Litvak, D. Nemirovsky, N. Osipova. Proposes an alternative single pass statistical method rather than deterministic pagerank calculation.
-
William Pugh presentation slides on US patent 6,658,423 (assigned to Google) for an approach using shingles (text fragments) to compare content similarity.
This entry was posted
on Thursday, December 1st, 2005 at 12:19 am and is filed under Links.
You can follow any responses to this entry through the RSS 2.0 feed.
You can leave a response, or trackback from your own site.
Tags: none
December 1st, 2005 at 1:38 pm
A reading list on PageRank and search algorithms
If you’re subscribed to the full feed, you’ll notice I collected some background reading on PageRank, search crawlers, search personalization, and spam detection in the daily links section yesterday. Here are some references that are worth…
December 11th, 2005 at 12:53 am
[…] Ho John Lee’s Weblog » A reading list on PageRank and search algorithms If you’re subscribed to the full feed, you’ll notice I collected some background reading on PageRank, search crawlers, search personalization, and spam detection in the daily links section yesterday. […]