Sign In

Communications of the ACM

ACM News

How Google Ranks Tweets


View as: Print Mobile App Share: Send by email Share on reddit Share on StumbleUpon Share on Hacker News Share on Tweeter Share on Facebook

To deliver useful search returns from the so-called real-time Web--such as seconds-old Twitter "tweets" reporting traffic jams--Google has adapted its page-ranking technology and developed new algorithmic tricks and filters to keep returns relevant, according to a leading Google engineer.

Google rolled out real-time search technology last month to offer searchers access to brand-new blog posts and news items far faster than the five to 15 minutes it previously took Google's Web crawlers to discover newly created items.

Bing, Cuil, and other search engines also provide various kinds of real-time results. Both Google and Bing have also forged major deals with Twitter to get real-time access to tweets, those 140-character microblog posts sent out by Twitter members.

But Google claims to offer the most comprehensive real-time results by scanning news headlines, blogs, and feeds from Facebook, MySpace, Twitter, and other sources.

The tweets are a mainstay of Google's real-time results, but Google has not previously discussed how it ranks them.

A fundamental Google strategy for identifying tweet relevance is analogous to that used by Google's PageRank technology, which helps find relevant Web pages with traditional Web search. Under PageRank, Google judges the importance of pages containing a given search keyword in part by looking at the pages' link structure. The more pages that link to a page--and the more pages linking to the linkers--the more relevant the original page.

From Technology Review
View Full Article


 


 

No entries found