Sign In

Communications of the ACM

ACM TechNews

Microsoft Researchers Say Anonymized Data Isn't So Anonymous


View as: Print Mobile App Share: Send by email Share on reddit Share on StumbleUpon Share on Hacker News Share on Tweeter Share on Facebook
Microsoft logo

Data routinely gathered in Web logs, such as Internet Protocol (IP) address, cookies, operating systems, browser type, and user-agent strings can threaten online privacy because they can be used to identify the activity of individual machines, according to Microsoft researchers.  '

However, they say an analysis of such data when anonymized can help detect malicious activity and improve overall Internet security. The researchers found that HTTP user-agent information can accurately tag a host with an accuracy of 92.8 percent when more than one user ID was linked to a single host, such as with a family that shares a single computer.

The researchers also found that even anonymized data can leak information. "[C]oarse-grained IP prefixes achieve similar host-tracking accuracy to that of precise IP address information when they are combined with hashed [user-agent] strings," the researchers say.

They aimed to determine how much identifying information gets revealed by common identifiers and to understand the patterns of aggregated activities and explore their implications. "Our analysis suggests that users who do not wish to be tracked should do much more than clear cookies," the researchers note.

From Network World
View Full Article

Abstracts Copyright © 2012 Information Inc. External Link, Bethesda, Maryland, USA 


 

No entries found

Sign In for Full Access
» Forgot Password? » Create an ACM Web Account