Research and Advances
Computing Applications

Making a digital library: the chemistry online retrieval experiment

Posted

The CORE project is an electronic library of primary journal articles in chemistry, containing about five years of twenty primary journals published by the American Chemical Society (about 425,000 pages). Unlike many digital library projects, CORE includes both a scanned image and a marked-up ASCII version (represented in Standard Generalized Markup Language, or SGML) for each page of the publisher's database. Each page was scanned and segmented, with graphical units isolated and linked to figure references in the articles. The original machine-readable typography was converted to SGML format and the results were used to build databases with indexes for full-text Boolean searching; a single search engine served data for each of three X-Window interfaces.

View this article in the ACM Digital Library.

Join the Discussion (0)

Become a Member or Sign In to Post a Comment

The Latest from CACM

Shape the Future of Computing

ACM encourages its members to take a direct hand in shaping the future of the association. There are more ways than ever to get involved.

Get Involved

Communications of the ACM (CACM) is now a fully Open Access publication.

By opening CACM to the world, we hope to increase engagement among the broader computer science community and encourage non-members to discover the rich resources ACM has to offer.

Learn More