Project Report
Leverage Natural Language Processing (NLP) to improve the discoverability of academic resources
Charlene Chou, Shravan Khunti, Harshit Bhargava
Abstract
This interdisciplinary project is a collaboration among metadata librarians, data scientists, digital library technologists, university IT, and the university press. Its goal is to improve the discoverability of academic resources by enhancing metadata through Natural Language Processing (NLP) and embedding-based semantic search, which addresses the limitations of traditional keyword-based retrieval. To support this pilot, a library NLP system architecture has been designed; it includes a vector database that enables semantic search within discovery platforms.
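To illustrate the contrast the abstract draws between keyword-based retrieval and embedding-based semantic search, the following is a minimal sketch, not the project's implementation. It assumes a toy in-memory index: the record titles, the three-dimensional vectors, and the function names are all hypothetical, and the vectors are hand-assigned stand-ins for what an embedding model would produce.

```python
import math

# Toy 3-dimensional "embeddings", hand-assigned for illustration only.
# In a real pipeline these vectors would come from an embedding model
# and be stored in a vector database; both are assumed here.
RECORDS = {
    "Myocardial infarction outcomes in older adults": [0.9, 0.1, 0.0],
    "Medieval Catalan manuscripts": [0.0, 0.1, 0.9],
    "Cardiac rehabilitation programs": [0.8, 0.3, 0.1],
}


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def keyword_search(query, records):
    """Return titles that share at least one whole word with the query."""
    terms = set(query.lower().split())
    return [title for title in records if terms & set(title.lower().split())]


def semantic_search(query_vec, records, k=2):
    """Return the k titles whose vectors are closest to the query vector."""
    ranked = sorted(
        records,
        key=lambda title: cosine(query_vec, records[title]),
        reverse=True,
    )
    return ranked[:k]


QUERY = "heart attack"
QUERY_VEC = [0.85, 0.15, 0.05]  # hypothetical embedding of the query text

keyword_hits = keyword_search(QUERY, RECORDS)
semantic_hits = semantic_search(QUERY_VEC, RECORDS)
```

In this sketch the keyword search finds nothing, because no title contains "heart" or "attack", while the vector comparison ranks the two cardiology records first. That gap is the kind of limitation of keyword matching that the abstract refers to.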
Author information
Shravan Khunti
Center for Data Science, New York University, US
Harshit Bhargava
Center for Data Science, New York University, US
Cite this article
Chou, C., Khunti, S., & Bhargava, H. (2025). Leverage Natural Language Processing (NLP) to improve the discoverability of academic resources. Proceedings of the International Conference on Dublin Core and Metadata Applications, 2025. https://doi.org/10.23106/dcmi.952586098
- Published
Issue: DCMI 2025 Conference Proceedings
Location: University of Barcelona, Barcelona, Spain
Dates: October 22-25, 2025