Research and Advances
Computing Applications The consumer side of search

Bias on the Web

Posted
  1. Introduction
  2. What is Bias?
  3. A Measure of Bias
  4. Experimental Results
  5. Conclusion
  6. References
  7. Authors
  8. Figures
  9. Tables



Biased search results on product information illustrate a general problem of social importance. The Web is replacing traditional repositories that individuals and organizations turn to for the information needed to solve problems and make decisions.


Indexical bias in a set or list of URLs retrieved in response to a query is a function of emphasis and prominence. It is related to other "quality of information" issues, but captures an aspect of quality that differs from relevance, accuracy, timeliness, and so on. A collection of items retrieved from a database may exhibit bias whether or not the items are relevant to a user’s query. The purport of the "whether or not" caveat is clear from some extreme cases. If the items retrieved are all deemed relevant by a user, there may be others—not retrieved—that would also be considered relevant by that user. On the other hand, a set of items irrelevant to one user might be relevant to another user for the very same query.

Given a norm prescribing expected frequency or prominence of items retrieved in response to a query, a set exhibits bias when some items occur more frequently or prominently with respect to the norm, while others occur less frequently or prominently with respect to the norm [2, 8]. The absence of certain brand names in the refrigerator example signifies bias in the results of a particular engine because other engines do retrieve those brand names. Prominence is reflected in the position a URL occupies in the list of items retrieved for a given search term. The norm used in the research reported here is based on the idea of pooling the results of a basket of search engines. This norm lends itself to a practical measurement scheme.



The only realistic way to counter the ill effects of search engine bias on the ever-expanding Web is to make sure a number of alternative engines are available.








Join the Discussion (0)

Become a Member or Sign In to Post a Comment

The Latest from CACM

Shape the Future of Computing

ACM encourages its members to take a direct hand in shaping the future of the association. There are more ways than ever to get involved.

Get Involved

Communications of the ACM (CACM) is now a fully Open Access publication.

By opening CACM to the world, we hope to increase engagement among the broader computer science community and encourage non-members to discover the rich resources ACM has to offer.

Learn More