links for 2005-12-01
-
Article by Raul Valdes-Perez, CEO of Vivisimo, on why personalized search won’t work, at least for now.
-
Nice list of services and resources to consider if you’re putting together a small startup today
-
Good summary of advice for prospective independent consultants
-
Long thread discussing various issues around search engine indexing, duplicate detection, and more
-
by Junghoo Cho, Hector Garcia-Molina, Lawrence Page. A Stanford paper on some of what later became Google.
-
by Page, Lawrence; Brin, Sergey; Motwani, Rajeev; Winograd, Terry – 1998 Stanford paper on Google
-
2002 paper by Taher H. Haveliwala at Stanford extending the PageRank approach to multiple dimensions
-
Internet Mathematics Vol. 1, No. 3: 335-380. Amy N. Langville and Carl D. Meyer. Detailed 46-page overview of PageRank and search analysis.
-
Andras A. Benczur, Karoly Csalogany, Tamas Sarlos, Mate Uher – Proposes a SpamRank metric based on personalized PageRank and the local PageRank distribution of linking sites.
-
Franco Scarselli, Ah Chung Tsoi, Markus Hagenbuchner (2004)
-
(Extended abstract) Paolo Boldi, Massimo Santini, Sebastiano Vigna. Discussion of tradeoffs between various site crawling and page discovery approaches.
-
A detailed look at the value and costs of reputation, with some speculation on how much it costs to purchase higher ranking through spam, link brokering, etc.
-
When one iteration is sufficient. K. Avrachenkov, N. Litvak, D. Nemirovsky, N. Osipova. Proposes an alternative single-pass statistical method rather than deterministic PageRank calculation.
-
William Pugh presentation slides on US patent 6,658,423 (assigned to Google) for an approach using shingles (text fragments) to compare content similarity.
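
For readers who haven't run into shingles before, here is a rough sketch of the general idea behind the last link: break each document into overlapping fragments and compare the resulting sets. This is not the method from the patent or the slides. The word-level shingles, the window size `k`, the Jaccard overlap score, and the function names are all assumptions picked for illustration.

```python
def shingles(text, k=3):
    """Return the set of k-word shingles (overlapping word windows) in text.

    Word-level shingles and k=3 are illustrative assumptions.
    """
    words = text.lower().split()
    if not words:
        return set()
    if len(words) < k:
        # Short text: treat the whole thing as a single shingle.
        return {tuple(words)}
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets: size of intersection over size of union."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def similarity(doc1, doc2, k=3):
    """Score two documents between 0.0 (no shared shingles) and 1.0 (identical sets)."""
    return jaccard(shingles(doc1, k), shingles(doc2, k))


if __name__ == "__main__":
    print(similarity("the quick brown fox jumps",
                     "the quick brown fox sleeps"))
```

Two documents that differ by a single trailing word still share most of their shingles, which is what makes the approach useful for spotting near-duplicates rather than only exact copies.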

[...] Ho John Lee’s Weblog » A reading list on PageRank and search algorithms If you’re subscribed to the full feed, you’ll notice I collected some background reading on PageRank, search crawlers, search personalization, and spam detection in the daily links section yesterday. [...]