October 2009 – Communications of the ACM

News Oct 1 2009

Dealing with terabytes of data is not the monumental task it once was. The difficult part is presenting enormous amounts of information in ways that are most useful to a wide variety of users.

David Lindley

Computing Applications

News Oct 1 2009

Debating Net Neutrality

Advocates seek to protect users from potential business practices, but defenders of the status quo say that concerns are overblown.

Alan Joch

Computing Applications

News Oct 1 2009

Shaping the Future

To create shape-shifting robotic ensembles, researchers need to teach micro-machines to work together.

Tom Geller

Architecture and Hardware

Opinion Viewpoints Oct 1 2009

Contagious Craziness, Spreading Sanity

Some examples of the upward or downward spiral of behaviors in the workplace.

Phillip G. Armour

Computing Applications

Opinion Viewpoints Oct 1 2009

Computing in the Depression Era

Since its beginning, the computer industry has been through several major recessions, each occurring approximately five years after the establishment of a new computing paradigm. These new computing modes created massive opportunities that the entrepreneurial economy rapidly supplied and then oversupplied.

Martin Campbell-Kelly

Computing Applications

Opinion Viewpoints Oct 1 2009

Reflections on Conficker

Conficker's alarming growth rate in early 2009 along with the apparent mystery surrounding its ultimate purpose had raised concern among whitehat security researchers. Here is an insider's view of the analysis and implications of the Conficker conundrum.

Phillip Porras

Computing Applications

Opinion Viewpoints Oct 1 2009

Dealing with the Venture Capital Crisis

The venture capital industry, like financial services in general, has fallen on hard times. Part of the problem is that large payoffs have become increasingly scarce. But perhaps the biggest future challenge for VC firms will be geography. What really might jump-start the industry is more creative globalization, with an eye toward using some overseas markets as "natural incubators."

Michael Cusumano

Computing Applications

Opinion Viewpoints Oct 1 2009

Kode Reviews 101

A review of code review do's and don'ts.

George V. Neville-Neil

Computing Applications

Opinion Viewpoints Oct 1 2009

Retrospective: An Axiomatic Basis For Computer Programming

C.A.R. Hoare revisits his past Communications article on the axiomatic approach to programming and uses it as a touchstone for the future.

C. A. R. Hoare

Computing Applications

Practice Oct 1 2009

Probing Biomolecular Machines with Graphics Processors

GPU acceleration and other computer performance increases will offer critical benefits to biomedical science.

James C. Phillips and John E. Stone

Architecture and Hardware

Practice Oct 1 2009

Unifying Biological Image Formats with HDF5

The biosciences need an image format capable of high performance and long-term maintenance. Is HDF5 the answer?

Artificial Intelligence and Machine Learning

Opinion Oct 1 2009

A Conversation with David E. Shaw

Stanford professor Pat Hanrahan sits down with the noted hedge fund founder, computational biochemist, and (above all) computer scientist.

CACM Staff

Architecture and Hardware

Research and Advances Contributed articles Oct 1 2009

A View of the Parallel Computing Landscape

Writing programs that scale with increasing numbers of cores should be as easy as writing programs for sequential computers. Here as a concrete example of a coordinated attack on the problem of parallelism.

Architecture and Hardware

Research and Advances Contributed articles Oct 1 2009

Automated Support For Managing Feature Requests in Open Forums

The result is stable, focused, dynamic discussion threads that avoid redundant ideas and engage thousands of stakeholders.

Jane Cleland-Huang, Horatiu Dumitru, Chuan Duan, and Carlos Castro-Herrera

Architecture and Hardware

Research and Advances Review articles Oct 1 2009

Smoothed Analysis: An Attempt to Explain the Behavior of Algorithms in Practice

This Gödel Prize-winning work traces the steps toward modeling real data.

Daniel A. Spielman and Shang-Hua Teng

Systems and Networking

Research and Advances Research highlights Oct 1 2009

Technical Perspective: Relational Query Optimization: Data Management Meets Statistical Estimation

Relational systems have made it possible to query large collections of data in a declarative style through languages such as SQL. There is a key component that is needed to support this declarative style of programming and that is the query optimizer.

Surajit Chaudhuri

Computing Applications

Research and Advances Research highlights Oct 1 2009

Distinct-Value Synopses For Multiset Operations

The task of estimating the number of distinct values (DVs) in a large dataset arises in a wide variety of settings in computer science and elsewhere. We provide DV estimation techniques for the case in which the dataset of interest is split into partitions. We create for each partition a synopsis that can be used to estimate the number of DVs in the partition. By combining and extending a number of results in the literature, we obtain both suitable synopses and DV estimators. The synopses can be created in parallel, and can be easily combined to yield synopses and DV estimates for "compound" partitions that are created from the base partitions via arbitrary multiset union, intersection, or difference operations. Our synopses can also handle deletions of individual partition elements. We prove that our DV estimators are unbiased, provide error bounds, and show how to select synopsis sizes in order to achieve a desired estimation accuracy. Experiments and theory indicate that our synopses and estimators lead to lower computational costs and more accurate DV estimates than previous approaches.

Computing Applications

Research and Advances Research highlights Oct 1 2009

Technical Perspective: Data Stream Processing – When You Only Get One Look

The database and systems communities have made great progress in developing database systems that allow us to store and query huge amounts of data. Real-time analysis is becoming mandatory. Here is where data stream processing comes to the rescue.

Johannes Gehrke

Computing Applications

Research and Advances Research highlights Oct 1 2009

Finding the Frequent Items in Streams of Data

Many data generation processes can be modeled as data streams. While this data may be archived and indexed within a data warehouse, it is also important to process the data "as it happens," to provide up to the minute analysis and statistics on current trends.

Graham Cormode and Marios Hadjieleftheriou

Computing Applications

Opinion Last byte Oct 1 2009

Q&A: The Networker

Jon Kleinberg talks about algorithms, information flow, and the connections between Web search and social networks.

Leah Hoffmann

Artificial Intelligence and Machine Learning