Controlled vocabularies in scientific literature’s indexing: The case of the 1918 pandemic
DOI:
https://doi.org/10.47909/978-9916-9974-8-2.87Keywords:
controlled vocabularies in health field, scientific information retrieval, 1918 pandemic, scientific edition, scientific authorshipAbstract
The scientific interest in the 1918 flu pandemic has been reinforced by the emergence in the early 21st century of epidemic pneumonia diseases caused by a virus, and more recently, the emergence of the SARS-CoV-2 virus, which caused the global pandemic known as “COVID-19,” in 2020. This paper presents the findings of an exploratory study on the use of controlled languages in the scientific community, with the aim of identifying the knowledge generated and needed. This research has two objectives. The first is to identify the relevant controlled languages used by the scientific community to label the knowledge produced. The second is to ascertain the role played by controlled vocabularies in the recovery of scientific production. The research is centered on the production of literature concerning the 1918 pandemic, which has been indexed in two widely utilized databases: Web of Science and Scopus. Additionally, the investigation encompasses the controlled vocabularies pertinent to medical and health sciences subjects. Following the identification of articles pertaining to the subject matter, the scientific journals from which the articles have been retrieved are selected. Subsequently, the paper examines the instructions and guidance provided to authors by the journals in question, with the objective of analyzing the role played by keywords and controlled vocabularies in the scientific literature with regard to indexing and recovering knowledge in scientific databases. The preliminary results indicate that controlled vocabularies are infrequently utilized by journal publishers, as they are not included in the instructions provided to authors.
Downloads
References
Ahmad, A., Justo, J. L. B., Feng, C., & Khan, A. A. (2020). The impact of controlled vocabularies on requirements engineering activities: a systematic mapping study. Applied Sciences, 10(21), Article 7749. https://doi.org/10.3390/app10217749
Anderson, J. D., & Perez-Carballo, J. (2001). The nature of indexing: How humans and machines analyze messages and texts for retrieval. Part I: Research, and the nature of human indexing. Information Processing & Management, 37(2), 231. https://doi.org/10.1016/S0306-4573(00)00026-1
Baeza-Yates, R., & Ribeiro-Neto, B. (2011). Modern information retrieval. The concepts and technology behind search. Pearson.
Barry, J. M. (2004). The site of origin of the 1918 influenza pandemic and its public health implications. Journal of Translational Medicine, 2(3), 1-4. https://doi.org/10.1186/1479-5876-2-3
Garcia-Alsina, M., & Cobarsí, J. (2022). Controlled vocabularies and information retrieval: 1918 Pandemic’s scientific literature as an example. International Journal of Computer and Information Engineering, 16(8), 286-293.
Ghanbarpour, A., & Naderi, H. (2019). A model-based method to improve the quality of ranking in keyword search systems using pseudo-relevance feedback. Journal of Information Science, 45(4), 473-487. https://doi.org/10.1177/0165551518799637
Golub, K. (2021). Automated subject indexing: An overview. Cataloging & Classification Quarterly, 59(8), 702-719. https://doi.org/10.1080/01639374.2021.2012311
Harter, S. P. (1975a). A probabilistic approach to automatic keyword indexing. Part I. On the distribution of specialty words in a technical literature. Journal of the American Society for Information Science, 26(4), 197-206. https://doi.org/10.1002/asi.4630260402
Harter, S. P. (1975b). A probabilistic approach to automatic keyword indexing. Part II. An algorithm for probabilistic indexing. Journal of the American Society for Information Science, 26(5), 280-289. https://doi.org/10.1002/asi.4630260504
Hong, J.-Y., Suh, E., & Kim, S.-J. (2009). Context-aware systems: A literature review and classification. Expert Systems with Applications, 36(4), 8509-8522. https://doi.org/10.1016/j.eswa.2008.10.071
Ishida, Y., Shimizu, T., & Yoshikawa, M. (2020). An analysis and comparison of keyword recommendation methods for scientific data. International Journal on Digital Libraries, 21(3), 307-327. https://doi.org/10.1007/s00799-020-00279-3
Jahoda, G. (1970). Information storage and retrieval systems for individual researchers. Wiley-Interscience.
Keyser, P. (2012). Indexing: from thesauri to the Semantic web. Chandos Publishing.
Knobler, S., Mack, A., Mahmoud, A., & Lemon, S. (2005) “1: The story of influenza.” The threat of pandemic influenza: Are we ready? In Workshop Summary (pp. 60-61). The National Academies Press.
Kwon, S. (2018). Characteristics of interdisciplinary research in author keywords appearing in Korean journals. Malaysian Journal of Library & Information Science, 23(2), 77-93. https://doi.org/10.22452/mjlis.vol23no2.5
Lancaster, F. W. (1968). Information retrieval systems: Characteristics, testing, and evaluation. John Wiley.
Leise, F. (2008). Controlled vocabularies: An introduction. Indexer, 26(3). https://doi.org/10.3828/indexer.2008.37
Lu, W., Liu, Z., Huang, Y., Bu, Y., Li, X., & Cheng, Q. (2020). How do authors select keywords? A preliminary study of author keyword selection behavior. Journal of Informetrics, 14(4), Article 101066. https://doi.org/10.1016/j.joi.2020.101066
Veyette, J. H., Jr. (1961). Information retrieval: The general nature of IR and indexing Dewey Decimal System Universal Decimal System. Two new systems regional IR centers related developments. The American Behavioral Scientist (Pre-1986), 4(10), 15.
White, H. (2013). Examining scientific vocabulary: Mapping controlled vocabularies with free text keywords. Cataloging & Classification Quarterly, 51(6), 655–674. https://doi.org/10.1080/01639374.2013.777004
White, H., Willis, C., & Greenberg, J. (2012). The HIVE impact: Contributing to consistency via automatic indexing. In Proceedings of the 2012 iConference (pp. 582-584). Association for Computing Machinery. https://doi.org/10.1145/2132176.213229
World Health Organization. (2015, May). World Health Organization best practices for the naming of new infectious diseases. https://www.who.int/topics/infectious_diseases/naming-new-diseases/en/
Zhang, C. (2008). Automatic keyword extraction from documents using conditional random fields. Journal of Computational Information Systems, 4(3), 1169-1180.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Montserrat Garcia Alsina, Josep Cobarsi-Morales

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0) which permits copying and redistributing the material in any medium or format, adapting, transforming and building upon the material as long as the license terms are followed.