4:00pm - 4:15pmID: 427
/ PS-06: 1
Short Papers
Confirmation 1: I/we acknowledge that all session authors/presenters have read and agreed to the ASIS&T Annual Meeting PoliciesTopics: Data Science; Analytics; and Visualization (data science; data analytics; data mining; decision analytics; social analytics; information visualization)Keywords: Textual analysis tool; Open science; Text Mining; Natural Language Processing (NLP); Visualization; Bibliometrics
Coconut Libtool: Bridging Textual Analysis Gaps for Non-Programmers
Faizhal Arif Santosa1, Manika Lamba2, Crissandra George3, J. Stephen Downie2
1National Research and Innovation Agency, Indonesia; 2University of Illinois at Urbana-Champaign, USA; 3Case Western Reserve University, USA
In the era of big and ubiquitous data, professionals and students alike are finding themselves needing to perform a number of textual analysis tasks. Historically, the general lack of statistical expertise and programming skills has stopped many with humanities or social sciences backgrounds from performing and fully benefiting from such analyses. Thus, we introduce Coconut Libtool (www.coconut-libtool.com/), an open-source, web-based application that utilizes state-of-the-art natural language processing (NLP) technologies. Coconut Libtool analyzes text data from customized files and bibliographic databases such as Web of Science, Scopus, and Lens. Users can verify which functions can be performed with the data they have. Coconut Libtool deploys multiple algorithmic NLP techniques at the backend, including topic modeling (LDA, Biterm, and BERTopic algorithms), network graph visualization, keyword lemmatization, and sunburst visualization. Coconut Libtool is the people-first web application designed to be used by professionals, researchers, and students in the information sciences, digital humanities, and computational social sciences domains to promote transparency, reproducibility, accessibility, reciprocity, and responsibility in research practices.
4:15pm - 4:45pmID: 296
/ PS-06: 2
Long Papers
Confirmation 1: I/we acknowledge that all session authors/presenters have read and agreed to the ASIS&T Annual Meeting PoliciesTopics: Knowledge Organization (information knowledge organization; knowledge representation; metadata; classification; thesauri; and ontology construction; indexing and abstracting; indexing languages; terminology & standards; information architecture & design)Keywords: Multimodal Dataset, Academic Text Mining, Keyword Extraction, Information Extraction
Building a Multimodal Dataset of Academic Paper for Keyword Extraction
JIngyu Zhang1, Xinyi Yan1, Yi Xiang1, Yingyi Zhang2, Chengzhi Zhang1
1Nanjing University of Science and Technology, People's Republic of China; 2Suzhou University, People's Republic of China
Up to this point, keyword extraction task typically relies solely on textual data. Neglecting visual details and audio features from image and audio modalities leads to deficiencies in information richness and overlooks potential correlations, thereby constraining the model's ability to learn representations of the data and the accuracy of model predictions. Furthermore, the currently available multimodal datasets for keyword extraction task are particularly scarce, further hindering the progress of research on multimodal keyword extraction task. Therefore, this study constructs a multimodal dataset of academic paper consisting of 1000 samples, with each sample containing paper text, images, audios and keywords. Based on unsupervised and supervised methods of keyword extraction, experiments are conducted using textual data from papers, as well as text extracted from images and audio. The aim is to investigate the differences in performance in keyword extraction task with respect to different modal information and the fusion of multimodal information. The experimental results indicate that text from different modalities exhibits distinct characteristics in the model. The concatenation of paper text, image text and audio text can effectively enhance the keyword extraction performance of academic papers.
4:45pm - 5:00pmID: 306
/ PS-06: 3
Short Papers
Confirmation 1: I/we acknowledge that all session authors/presenters have read and agreed to the ASIS&T Annual Meeting PoliciesTopics: Archives; Data Curation; and Preservation (archives; records; cultural heritage materials; data curation; digital libraries; digital humanities)Keywords: Data curation, Digital Curation, Bibliometric Analysis, Visual Analysis, Topic Modeling
Thematic Trends in Data Curation Literature (Best Short Paper Honorable Mention)
Angela Murillo, Ayoung Yoon
Indiana University Indianapolis, USA
The field of data curation is rapidly changing due to new developments in technologies and techniques for conducting data work. As the field of data curation evolves, researchers, practitioners, and educators need to be able to respond to these developments. One way to understand trends in a field is by examining published literature. This study first gathered data curation literature through a modified systematic literature review with the framing question, ‘What competencies, skill sets, and proficiencies are needed to conduct data curation activities?’. These literatures were then analyzed using bibliometric analysis, visual analysis of the citation data, and topic modeling to understand trends in the data curation field.
|