India boasts of one of the youngest populations globally with an average age of 29.a A report published by the Federation of Indian Chambers of Commerce and Industry (FICCI) in 2013 estimated that to train this large young population, the country must build six new universities and 270 new colleges every month in the next 20 years.7 It was an impossible target! Or so it seemed until 2014 when IIT Kharagpur conceptualized the National Digital Library of India (NDLI; https://ndl.iitkgp.ac.in/) with an aim to bring equity of access to educational resources for every Indian through a single window access mechanism. NDLI, a project funded by the Ministry of Education, Government of India, is a meta-library—a portal that connects users to hundreds of libraries in India and abroad and provides more than 81 million forms of educational content, including books, lecture videos, research articles, and more in over 100 languages, including several vernaculars used throughout the country.
In pre-NDLI era, there were many isolated initiatives on digital education in the country, including the growing number of institutional repositories with their disparate metadata schema, archives of video lectures such as NPTEL,b and national thesis repositories like Sodhganga.c NDLI provided the long-awaited integration among them by becoming the common portal through which each of them is seamlessly accessible.
In the international scenario, while many major digital libraries such as Europeana, DPLA, Canadiana, Trove, World Digital Library, and Digital NZ focus on culture and heritage, NDLI, respecting national need, has been designed to focus on the educational space. Unlike general-purpose search engines, NDLI indexes only educational resources based on their corresponding metadata. NDLI also exposes several filters that allow the user to refine search results based on educational metadata (for example, subject domain, educational level, educational degree, and learning resource type). This helps users to express and iteratively refine their search intent using precise metadata values—a feature missing in general-purpose search engines. Figure 1 shows the NDLI interface: it offers both browse and search options; the user can customize the retrieved results using metadata-based filters visible on either side of the interface. Many of the educational resources in NDLI are organized into collections, for example, schoolbooks are organized into curriculum, thereby allowing users to easily discover related resources. One of the other exclusive features of NDLI is free access to special resources like South Asia Archive, the World eBook Library, and the digitized 'red books' of the Oscar-winning film-maker Satyajit Ray, that are not otherwise publicly accessible.
Figure 1. NDLI interface showing browse and search options at the top, retrieved results (for search string 'Magnetism') on the central pane, and metadata-based filters on either side.
Designing a digital library of this scale posed several technical challenges related to:
Figure 2. High-level NDLI architecture consisting of major components including ingestion, services, integration, and outreach.
NDLI users comprise primary, secondary, and higher-secondary school students, undergraduates, postgraduates, research scholars, and lifelong learners. Typical use cases include faceted search and browse of the repository through multiple modalities (for example, by subject domain). Access is also provided to specially curated collections like Exam Preparatory, IIT-JEE Preparatory and News Archives. Presently, around 6.4 million active users access NDLI resources. Most traffic includes users from higher education institutions. Figure 3 presents some statistics on repository growth and usage of NDLI.d
Figure 3. NDLI content growth, user, and usage statistics.
During COVID-19, NDLI created a virtual collection Study at Home, which includes specially curated resources on all subjects for students appearing for Class-X and Class-XII examinations. Not surprisingly, there was a sharp rise in daily document views after this feature was introduced. Another valuable addition has been the COVID-19 Research Resources Repository containing COVID-19 research articles, blogs, and preprints. NDLI's contributions in digital learning during COVID-19 have been recognized through several international awardse and recognition.f In this context, it is also worthwhile to quote Sundaram "Sundy" Srinivasan, President, PanIIT USA:
"The Indian diaspora as well as the rest of the world can now avail knowledge from this multilingual, multi-subject, multimedia digital repository [NDLI] that any individual learner or a lifelong learner can avail."
Despite the enormous repository size and user-friendly discovery services in NDLI, it is not an easy task to carry its benefits to every learner in the country. Therefore, virtual NDLI Club service (see Figure 4) has been designed to facilitate and coordinate activities around NDLI resources at different institutions. This service aims at improving user engagement with NDLI by organizing competitions, training sessions, and workshops at the institution level, regional level where multiple institutions participate, and global level. Presently, more than 2,600 NDLI clubs have been established across the country.
Figure 4. Conceptual structure of NDLI Club governance. NDLI Core refers to NDLI repository, thematic activities include hackathon and math circle.
The design and implementation of NDLI services have spawned several research challenges, some of which have been deliberated under the ambit of NDLI:
NDLI has seen enormous growth in user engagement in the last few years, especially in the post-pandemic era. It has been largely able to take digital education to the remote corners of the country. Presently, NDLI is in the process of transforming into a community-driven service where the collections can be managed by relevant communities, and quality learning resources can be created in vernaculars using semiautomatic translations. All these lead to the future prospect of crowdsource-based user community contributions in NDLI.
1. Banerjee, S. Sanyal, D.K., Chattopadhyay, S. Bhowmick, P.K., and Das, P.P. Segmenting scientific abstracts into discourse categories: a deep learning-based approach for sparse labeled data. In Proceedings of the ACM/IEEE Joint Conf. Digital Libraries, 2020.
2. Das, A. and Das, P.P. Automatic semantic segmentation and annotation of MOOC lecture videos. In Proceedings of the 21st Intern. Conf. Asian Digital Libraries. Springer, 2019, 181–188.
3. Das, A. and Das, P.P. Semantic segmentation of MOOC lecture videos by analyzing concept change in domain knowledge graph. In Proceedings of the 22nd Intern. Conf. Asian Digital Libraries. Springer, 2020, 55–70.
4. Das, P.P. et al. National Digital Library: A platform for paradigm shift in education and research in India. Science and Culture 82, 1–2 (2016), 4–11.
5. Ghosh, K., Nangi, S.R., Kanchugantla, Y, Rayapati, P.G., Bhowmick, P.K., and Goyal, P. Augmenting video lectures: Identifying off-topic concepts and linking to relevant video lecture segments. Intern. J. Artificial Intelligence in Education (2021), 1–31.
6. Kundu, S. et al. Design and development of a medical image databank for assisting studies in radiomics. J, Digital Imaging (2022), 1–16.
7. Lopes, N.V., Baguma R. (Eds). Developing Knowledge Societies for Distinct Country Contexts. IGT Global, 2021
8. Sadhu, S. and Bhowmick, P.K. Automatic segmentation and semantic annotation of verbose queries in digital library. In Proceedings of the 22nd Intern. Conf. Theory and Practice of Digital Libraries. Springer, 2018, 270–276.
9. Santosh, T.Y.S.S., Sanyal, D.K., Bhowmick, P.K., and Das., P.P. Surrogator: A tool to enrich a digital library with open access surrogate resources. In Proceedings of the 18th ACM/IEEE-CS Joint Conf. Digital Libraries, 2018, 379–380.
10. Santosh, T.Y.S.S., Sanyal, D.K., Bhowmick, P.K., and Das., P.P DAKE: Document-level attention for keyphrase extraction. In Proceedings of the 42nd European Conf. Information Retrieval, 2020, 392–401.
11. Sanyal, D.K., Bhowmick, P.K., Das, P.P., Chattopadhyay, S., and Santosh, T.Y.S.S. Enhancing access to scholarly publications with surrogate resources. Scientometrics 121, 2 (2019), 1129–1164.
d. 'Education level' field is not mandatory during user registration. Hence, many of the users did not choose any education level.
©2022 ACM 0001-0782/22/11
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and full citation on the first page. Copyright for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or fee. Request permission to publish from firstname.lastname@example.org or fax (212) 869-0481.
The Digital Library is published by the Association for Computing Machinery. Copyright © 2022 ACM, Inc.
No entries found