Societies often create smaller subsets or communities that connect with one another for commerce and intellectual exchange over mutual interests. In science and engineering, the need for communication among researchers is often hampered by artificial barriers of university politics, economic market forces, and the sheer momentum of an academic reward structure that values individual discovery over joint development. Recent initiatives have attempted to reduce some of these barriers, encouraging collaborative multidisciplinary research programs. Through this effort, we have studied the processes that lead to the successful foundation of new communities.
Our current focus has led to the creation of Insight, a project for open source image processing software development, along with the Insight Software Consortium (ISC), which includes more than 17 participating universities and commercial institutions. The initial emphasis of this effort is to provide public software tools for 3D segmentation and deformable and rigid registration, capable of analyzing the head and neck anatomy of the Visible Human Project data. The eventual goal is to provide the cornerstone of a self-sustaining software community in 3D, 4D, and higher-dimensional data analysis. Ultimately, we intend this to be a public software resource that will serve as a foundation for future medical image research.
The Visible Human Project was initially formed to collect data from human subjects to serve as a guidebook and baseline data set in modern anatomy research and education [1]. The intent of the current effort is to amplify the investment being made through the Visible Human Project and future programs for medical image analysis by reducing the reinvention of basic algorithms. We are also hoping to empower young researchers and small research laboratories with the kernel of an image-analysis system in the public domain. We are committed to open source public software, including open interfaces supporting connections to a broad range of visualization and graphical user-interface platforms.
Community through Scientific Rendezvous
There have been several notable examples of accelerating computer science research through what we describe as a scientific rendezvous. Generally, a scientific rendezvous is a meeting of scientific minds: shared data, a shared conference, or more essentially a shared vision that promotes dialogue throughout a community. It is most likely to be an open and welcoming group, admitting free dialogue across many subjects, but since it primarily advances a small emerging but significant field, the topics will be narrow in scope and tightly focused.
A scientific rendezvous is not likely to be closed, proprietary, nor a standard. Some of the most important scientific rendezvous in computer science have been open source software projects that have stimulated education, research, and training. The creation of the Unix operating system along with the long-term support of the University of California at Berkeley did more to disseminate advanced operating system design elements among academic and research institutions than any particular text or curriculum [5]. The open hardware architecture of the IBM PC also introduced a common basis over which a community dialogue could be conducted. The Apple Macintosh graphical user interface did much to codify conventions in user-interface design, leading to a new jargon, a protracted discussion on pull-down vs. pop-up menus, and the promotion of a wide-ranging commercial effort in mouse and trackball design.
Scientific rendezvous sometimes emerge from funded research projects rather than commercial ventures. Perhaps the most successful sponsored project that led to the formation of a community dialogue and later to an entire industry was the development of Internet tools. HTML serves as the core for the most successful of these tools, and the open nature of HTML opened the way for derivative products such as Netscape Communicator and Internet Explorer Web browsers. The National Library of Medicine (NLM) has participated in multiple efforts of this nature, including GenBank, the public gene sequence repository. The Visible Human Project has produced a pair of studies of human anatomy that have provided common ground from which to extend multidisciplinary research in the fields of anatomy, medical imaging, visualization, and other areas of computer science [1].
What distinguishes a successful scientific rendezvous? The fundamental strength is its ability to promote and sustain dialogue for whatever field it supports. It provides a foundation for greater research across institutions, demanding interoperability rather than proprietary functions. As such it should encourage communication and stimulate development, often through a new lexicon of technical terms that form around the emerging technology. This concept is based on widely adopted conventions rather than formal standards.
The demand for common ground usually leads to the development of standards; we submit that before a field is mature enough to require standards, it can benefit from some nucleus around which scientific and engineering conversations can revolve. Principal attributes of the most successful scientific rendezvous in IT are open architecture (software or hardware), extensible infrastructure and control languages, and explicit support for the substance of the core material including distribution mechanisms, open licensing, tutorials and documentation, tools, and bug fixes.
The Need for Public Image Analysis Tools
There are pressing needs for automated techniques in image analysis. Medical processes such as early cancer detection, monitoring the progress of medical treatment, response to cardiovascular disease, and analysis of neurological disorders such as stroke all benefit from advanced imaging. Public exchange of software tools accelerates the development and improvement of medical care through imaging. This need creates a focus for the community and a basis for a scientific rendezvous.
After conducting a joint National Institutes of Health (NIH) workshop in 2000 [2], the NLM, in a multipartner arrangement, supported the creation of the Insight Toolkit (ITK), an application programming interface for high-dimension image processing. This work has been the focus over which both research and government institutions have come together.1
Planning the Consortium
Most of the systems, conferences, or data sets that have become the common basis for information exchange have evolved spontaneously as need and opportunity converged. They seldom represent good business models. Indeed, the idea of making the components open and accessible limits individual control of any element, inhibits the exclusive generation of profit, and invites competition. Anyone planning a potential scientific rendezvous should attempt to include as many of the elements of previous successful projects and recognize that neither control nor profit are useful motives or goals in such an endeavor.
We sought a consortium of software developers, rather than a single award, in the hopes that the natural process of building consensus would generate a broad versatile software base for a wider audience. Since “design by committee” efforts are often unsuccessful, we empowered a core group of software architects and systems integrators to make primary design choices. The ISC is working to deliver a software toolkit to improve and enable research in volume imaging for all areas of health care. The successful collaboration among such disparate groups as engineering schools, medical schools, small businesses, and large corporate R&D centers participating in the project is a measure of the success of this effort.2
Design Principles. The project evolved along certain principles, eventually reinforced and extended during the first meeting of the ISC in November 1999. The fundamental guiding principals include:
- Open source (see the sidebar “Open Source Software Support”);
- No cost licensing;
- Portable (runs on Windows and Unix systems);
- Compact (encumbers no cascades of licenses);
- Designed to be easily maintained;
- Emphasis on 3D; and
- Validation metrics included as part of the tools.
Target Uses. Several target uses for the toolkit were identified during the planning stages, including:
- Repository of image-processing algorithms;
- Standard implementations for comparison;
- Prototype methods to encourage variations;
- A common format for writing new methods;
- Communication for imaging software research;
- Software foundation for young researchers; and
- Metrics with which to compare methods.
Algorithms and Validation
The Insight Project was formed around the need for public medical image segmentation and registration software, including the archiving of working versions of often-cited common algorithms enabling comparison of existing techniques with improved variations. ITK also emphasizes threaded, pipelined, and streaming implementations suitable for execution in multiprocessor environments and for use on large data sets that do not fit within core memory.
Template-based generic programming has permitted the flexible combination of solvers, optimizers, and other methods with typical approaches for image segmentation and registration. For example, a statistical affine normalized correlation image registration tool can be implemented with gradient descent optimizer and a regular-step gradient descent optimizer with a simple reorganization of existing templates, simply by recombining software building blocks. Thus, a single technique may be represented multiple times in the toolkit through significant variations. Meshes and images represent the core classes of ITK, with filters used as operations. A wide variety of algorithms are represented in ITK (see the sidebar “Algorithms Available in ITK”).
Some of the most important scientific rendezvous in computer science have been open source software projects that have stimulated education, research, and training.
In order to measure the efficacy of a segmentation or registration method, data with known truth must be available [7]. Performance of an algorithm is also measured along many different dimensions including efficiency, accuracy, and precision [11]. Team members are working to publish test metrics as well as public data, including the software framework necessary to execute their planned validation pilot studies. The Insight Project is designed with the concept of academic transparency at its heart. All implementations are available in a working form, and all performance reports will be accompanied by the tests performed to measure them. Researchers following in the wake of the original implementers are encouraged to use the same tests to evaluate the performance of their improved or alternate segmentation or registration methods. Furthermore, all software developers in this area are strongly encouraged to contribute to this growing corpus of work in image processing.
Conclusion
A non-profit association, the ISC was incorporated in early 2004, and is an indication of the future of this effort. In time, the ISC may become the trustees of the open source software and serve as a focal point for the community. At the time this article was written in late 2004, the project had produced its fifth release of ITK (1.8). NLM is committed to supporting this effort, and regular software releases are intended. The software, which is publicly available in source-code form, builds and runs on Windows systems as well as on a variety of Unix systems including Solaris, Linux, SGI Irix, and Macintosh OS-X, all on a wide variety of compilers; interested software developers should visit the Insight Web site: www.itk.org. An open mailing list, insight-users, is currently active and open to all subscribers. Support tools including CMake, Cable, and DART, the dashboard support infrastructure, are all part of the public software associated with this project and available as source code.
Time, performance, and the broad user community will determine the ultimate success of the Insight project as a scientific rendezvous. To date, it has been a rare privilege to observe and participate in the growth and evolution of this noble test of group dynamics and collaborative software development.
Figure. Examples of medical image segmentation and registration algorithms available in ITK: a) Functional MRI fused with MR angiography using landmark initialized mutual information registration (courtesy of UNC); b) 2D to 3D registration of angiography data (courtesy of the Imperial College of London); c) Inner-ear segmentation of the cochlea and vestibular system using fast-marching level-set methods (courtesy of Kitware); d) Eyes, muscles, and optic nerves of the Visible Human Project data using interactive color watershed segmentation (courtesy of the University of Utah).
Join the Discussion (0)
Become a Member or Sign In to Post a Comment