Controlled vocabularies in scientific literature’s indexing: The case of the 1918 pandemic

Authors

Garcia Alsina, M., & Cobarsi-Morales, J.

DOI:

https://doi.org/10.47909/978-9916-9974-8-2.87

Keywords:

controlled vocabularies in health field, scientific information retrieval, 1918 pandemic, scientific edition, scientific authorship

Abstract

Scientific interest in the 1918 influenza pandemic has been renewed by the emergence in the early 21st century of epidemics of viral pneumonia and, more recently, by the emergence of SARS-CoV-2, the virus that caused the global COVID-19 pandemic in 2020. This paper presents the findings of an exploratory study on the use of controlled vocabularies in the scientific community, with the aim of identifying the knowledge generated and needed. The research has two objectives. The first is to identify the relevant controlled vocabularies that the scientific community uses to label the knowledge it produces. The second is to ascertain the role that controlled vocabularies play in the retrieval of scientific output. The study focuses on the literature on the 1918 pandemic indexed in two widely used databases, Web of Science and Scopus, and on the controlled vocabularies pertinent to the medical and health sciences. After the articles on the subject were identified, the scientific journals that published them were selected. The instructions and guidance that these journals provide to authors were then examined in order to analyze the role that keywords and controlled vocabularies play in indexing and retrieving knowledge in scientific databases. The preliminary results indicate that journal publishers rarely use controlled vocabularies, since these are not included in the instructions provided to authors.
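
The final step described above, examining author instructions for references to controlled vocabularies, could be approximated with a simple text scan. The following is a minimal sketch under stated assumptions, not the procedure used in the study: the vocabulary list, the function name `vocabularies_mentioned`, and the two sample guideline texts are all hypothetical and serve only to illustrate the idea.

```python
import re

# Hypothetical list of health-science controlled vocabularies to look for.
# The abstract does not say which vocabularies the study considered.
# Acronyms are matched case-sensitively (so the common word "mesh" is not
# a hit); full names are matched case-insensitively via a scoped flag.
VOCABULARIES = {
    "MeSH": r"\bMeSH\b|(?i:medical subject headings)",
    "DeCS": r"\bDeCS\b|(?i:health sciences descriptors)",
    "Emtree": r"\bEmtree\b",
}


def vocabularies_mentioned(instructions: str) -> list[str]:
    """Return the names of the vocabularies that a guidelines text mentions."""
    return [
        name
        for name, pattern in VOCABULARIES.items()
        if re.search(pattern, instructions)
    ]


# Invented sample texts, for illustration only; not data from the study.
guidelines = {
    "Journal A": "Provide 3 to 6 keywords, preferably taken from "
                 "Medical Subject Headings (MeSH).",
    "Journal B": "Provide up to five keywords that describe the article.",
}

for journal, text in guidelines.items():
    print(journal, vocabularies_mentioned(text))
```

A scan of this kind only flags explicit mentions; guidelines that point authors to a vocabulary indirectly, for example through a link, would still need to be read manually.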

Published

13-12-2024

How to Cite

Garcia Alsina, M., & Cobarsi-Morales, J. (2024). Controlled vocabularies in scientific literature’s indexing: The case of the 1918 pandemic. Advanced Notes in Information Science, 7, 109–123. https://doi.org/10.47909/978-9916-9974-8-2.87