April 1995 - Vol. 38 No. 4

April 1995 issue cover image

Features

Opinion

Copyright and digital libraries

This issue of Communications highlights some of the many projects underway for the creation or enhancement of digital libraries. At the moment, no one seems to think there will be only one gargantuan digital library to sate the public's appetite for information. Rather, the expectation is that there will be many digital libraries, most of which will have specialized collections and will be internetworked together in a way loosely resembling today's Internet. Most digital library project planners are aware there are intellectual property issues that must be resolved in order to successfully deploy their libraries. Some proposals for digital library projects express an intent to resolve intellectual property issues as part of the overall plan for the library, albeit without much specificity about how this would be achieved in their systems [2, 4].
Research and Advances

Rich interaction in the digital library

Effective information access involves rich interactions between users and information residing in diverse locations. Users seek and retrieve information from the sources—for example, file serves, databases, and digital libraries—and use various tools to browse, manipulate, reuse, and generally process the information. We have developed a number of techniques that support various aspects of the process of user/information interaction. These techniques can be considered attempts to increase the bandwidth and quality of the interactions between users and information in an information workspace—an environment designed to support information work (see Figure 1).
Research and Advances

NSF Center for Intelligent Information Retrieval

The Center for Intelligent Information Retrieval (CIIR) was started at the University of Massachusetts two years ago to do research and technology transfer in the area of distributed, text-based information systems. The CIIR is part of the NSF State/Industry University Collaborative Research Centers program and relies heavily on government and industrial partners, in addition to federal funding. There are currently 24 members from a variety of industries and government agencies, and many of these are involved in both research and development projects.
Research and Advances

World-Wide Web and computer science reports

With the advent of the World-Wide Web, computing professionals have eagerly pursued the idea of moving from a paper-based technical report service to one that employs networked information systems. Many departments keep some version of their reports on an FTP server to help with this process. To facilitate access to CS reports a number of sites have set up lists of these archives (see sidebar).
Research and Advances

Wide Area Technical Report Service: technical reports online

Wide Area Technical Report Service (WATERS) is a distributed database of computer science technical reports. Contributors are departments of computer science that make their reports, stored locally at their sites, available through World-Wide Web and a WAIS search engine. By using WATERS, anyone with access to the Internet can, using a client such as Mosaic browse, search, obtain abstract and bibliographic information, and retrieve technical reports online. The latter assumes that the user's WWW client is set up to launch viewers such as ghostview for Postscript files, xdvi for dvi files, and xtiff for TIFF page images.
Research and Advances

The information zone system

Spatial data is complex data composed of hundreds or thousands of attribute/value pairs that define logical, physical, and abstract geographic features. It has traditionally been available to researchers and other users as paper maps, but it is now commonplace to collect and distribute spatial data as structured graphical elements (e.g., vectors) and their associated relational data sets, or as scanned images (when no analysis is necessary). Even though vast stores of spatial data exist, awareness of and access to these data sets, images, and related documents remain tremendous problems [2]. This situation is due in part to the lack of integration between geographic information systems (GIS), storage management, and network tools. It also is due to the dependence on text-based menus and queries to support access. Text-based access is inconsistent for many spatial data users since users conceptualize and data is organized by geographic location [1].
Research and Advances

Dienst: an architecture for distributed document libraries

As one of the five universities participating in the ARPA-sponsored Computer Science Technical Report project, we at Cornell have developed a digital library architecture called Dienst. Dienst is a protocol and implementation that provides Internet access to a distributed, decentralized multi-format document collection. The collection is managed by a set of interoperating Dienst servers distributed over the Internet. These servers provide three digital library services: repositories of multi-format documents; indexes into the document collection and search engines for these indexes; and user interfaces for browsing, searching, and accessing the collection.
Research and Advances

Knowledge-based editing and visualization for hypermedia encyclopedias

One of the main goals in developing digital libraries is to provide users with opportunities for accessing and using information in highly flexible and user-oriented ways not available in current information repositories. This implies a focus on supporting the intellectual access to information and to consider the context of the information request. Systems for digital libraries should be able to select the appropriate type and amount of information from a comprehensive pool and compose it on the fly for a meaningful and coherent presentation which might have never occurred before or never will again after this event because it is customized to the current situation.
Research and Advances

The University of California CD-ROM Information System

The University of California CD-ROM Information System replaces the equivalent of 260,000 books of published federal statistics with a CD-ROM-based online information system. The size of this database is currently 270 CD-ROMs (135GB). It contains 1990 U.S. census data (approximately 3,000 items of socio-economic and demographic information, including race-ethnicity, employment, income, educational level, and poverty) for every block and census tract in the U.S., as well as U.S. foreign trade data by commodity from every city in the U.S. to every country in the world. It also contains the digitized map outline boundary data for city blocks for the entire U.S. (census TIGER files).
Research and Advances

Envision: a user-centered database of computer science literature

Project Envision is an early NSF-funded digital library effort to develop a multimedia collection of computer science literature with full-text searching and full-content retrieval capabilities. Envision was launched in 1991 in accordance with the ACM Publications Board's plans for encouraging research studies to develop an electronic archive for computer science.
Research and Advances

Making a digital library: the chemistry online retrieval experiment

The CORE project is an electronic library of primary journal articles in chemistry, containing about five years of twenty primary journals published by the American Chemical Society (about 425,000 pages). Unlike many digital library projects, CORE includes both a scanned image and a marked-up ASCII version (represented in Standard Generalized Markup Language, or SGML) for each page of the publisher's database. Each page was scanned and segmented, with graphical units isolated and linked to figure references in the articles. The original machine-readable typography was converted to SGML format and the results were used to build databases with indexes for full-text Boolean searching; a single search engine served data for each of three X-Window interfaces.
Research and Advances

An appreciation of Laurence Rosenberg

It takes many different kinds of contributions to create a viable research community, and many dedicated people have made major contributions in the past decade to make things like the NSF Digital Libraries initiative happen. I'd like to take a few moments to honor one such individual who left our community when he died this past spring following a brave fight with cancer. Laurence Rosenberg affected many of our lives and careers through his activities and his vision as a program manager and a deputy director in the Division of Information, Robotics, and Intelligent Systems at the National Science Foundation.
Research and Advances

Informedia Digital Video Library

The Informedia Digital Video Library Project is developing new technologies for creating full-content search and retrieval digital video libraries. Working in collaboration with WQED Pittsburgh, the project is creating a testbed that will enable K-12 students to access, explore, and retrieve science and mathematics materials from the digital video library. The library will initially contain 1,000 hours of video from the archives of project partners: WQED, Fairfax Co. VA Schools' Electronic BBC-produced video courses. (Industrial partners include Digital Equipment Corp., Bell Atlantic, Intel Corp., and Microsoft, Inc.) This library will be installed at Winchester Thurston School, an independent K-12 school in Pittsburgh.
Research and Advances

The Stanford Digital Library Project

The Stanford Integrated Digital Library Project will develop enabling technologies for an integrated “virtual” library to provide an array of new services and uniform access to networked information collections. The Integrated Digital Library will create a shared environment linking everything from personal information collections, to collections of conventional libraries, to large data collections shared by scientists.
Research and Advances

UC Berkeley’s Digital Library project

For digital libraries to succeed, we must abandon the traditional notion of “library” altogether. The reason is as follows: The digital “library” will be a collection of distributed information services; producers of material will make it available, and consumers will find it and use it, perhaps through the help of automated agents. Libraries in the traditional sense are nowhere to be found in this model (i. e., the notion of a limited intermediary containing some small fraction of preselected material available only to local patrons is replaced by a system providing to users everywhere direct access to the full contents of all available material).
Research and Advances

Alexandria Digital Library

The goal of the Alexandria Digital Library Project is to develop a distributed system that provides a comprehensive range of library services for collections of spatially indexed and graphical information. While such collections include digitized maps and images as important special components, the Alexandria Digital Library will involve a very wide range of graphical materials and will include textual materials. Users of the Alexandria Digital Library will range from school children to academic researchers to members of the general public. They will be able to retrieve materials from the library on the basis of information content as well by reference to spatial location.
Research and Advances

Building the interspace: the Illinois Digital Library Project

The University of Illinois is building a large-scale digital library testbed, planned to grow to thousands of users and thousands of documents, with the goal of bringing professional quality search and display to Internet information services. Concurrently, research is designing and implementing a prototype of the Interspace, a vision of what the Internet will evolve into where the distributed network of interconnected machines is replaced by a distributed space of interlinked information.
Research and Advances

University of Michigan Digital Library Project

The University of Michigan Digital Library Project (NSF-UMDL) is a multidisciplinary collaboration among faculty and staff throughout the University of Michigan. Initial use and evaluation of the NSF-UMDL will take place in science classes in Ann Arbor, Mich. high schools and the Stuyvesant High School in New York City, as well as instruction and research at the university. Access will also be available through Ann Arbor and New York City public libraries. External partners to the project include: IBM, Elsevier Science, Apple Computer, Bellcore, UMI International, McGraw-Hall, Encyclopaedia Britannica Educational Corporation, and Kodak.
Research and Advances

Library of Congress Digital Library effort

The Library of Congress has been actively seeking gift funds to continue the digitization efforts begun five years ago by its American Memory pilot project, which has successfully digitized more than 210,000 items. Many of these items have been made available to 44 test sites around the United States, and selected items from this project are also available via the Library of Congress's World-Wide Web server (http://www.loc.gov). Last October, the library received a total of $13 million in grant and gift funds to digitize its American historical collections, many of which are unique, and to help it identify instructional uses of digitized library materials.
Research and Advances

The roles of digital libraries in teaching and learning

Libraries have long served crucial roles in learning. The first great library, in Alexandria 2,000 years ago, was really the first university. It consisted of a zoo and various cultural artifacts in addition to much of the ancient world's written knowledge and attracted scholars from around the Mediterranean, who lived and worked in a scholarly community for years at a time. Today, the rhetoric associated with the National/Global Information Infrastructure (N/GII) always includes examples of how the vast quantities of information that global networks provide (i.e., digital libraries) will be used in educational settings [16].
Research and Advances

Going digital: a look at assumptions underlying digital libraries

What are digital libraries, how should they be designed, how will they be used, and what relationship will they bear to what we now call “libraries”? Although we cannot hope to answer all these crucial questions in this short article, we do hope to encourage, and in some small measure to shape, the dialog among computer scientists, librarians, and other interested parties out of which answers may arise. Our contribution here is to make explicit, and to question, certain assumptions that underlie current digital library efforts. We will argue that current efforts are limited by a largely unexamined and unintended allegiance to an idealized view of what libraries have been, rather than what they actually are or could be. Since these limits come from current ways of thinking about the problem, rather than being inherent in the technology or in social practice, expanding our conception of digital libraries should serve to expand the scope and the utility of development efforts.
Research and Advances

Digital libraries, value, and productivity

A digital library is popularly viewed an electronic version of a public library. But replacing paper by electronic storage leads to three major differences: storage in digital form, direct communication to obtain material, and copying from a master version. These differences in turn lead to a plethora of further differences, so that eventually the digital library no longer mimics the traditional library. Furthermore, a library is only element in the process of creating, storing, culling, accessing, selecting, and distributing information to customers. While the technical focus of digital library research is on the central functions of storage and access, major changes will occur in the interaction within the new systems.
News

The ACM electronic publishing plan

For the past three years, the ACM Publications Board has been developing its vision for the future of publication in the electronic age and a program to achieve it. I am pleased to present that plan here. We envisage a diminishing role for print journals and exciting new programs around an ACM digital library. The plan raises important policy questions regarding the treatment of copyright. Our interim answers to these policy questions are discussed in two additional documents—the ACM Interim Copyright Policies and the ACM Author's Guide to the Interim Copyright Policies. Our overall intention is to move aggressively into electronic publishing and to preserve the traditional openness of ACM publications in the new media. Authors and readers should find these policies at least as hospitable as the traditional ones.

Recent Issues

  1. November 2024 CACM cover
    November 2024 Vol. 67 No. 11
  2. October 2024 CACM cover
    October 2024 Vol. 67 No. 10
  3. September 2024 CACM cover
    September 2024 Vol. 67 No. 9
  4. August 2024 CACM cover
    August 2024 Vol. 67 No. 8