Abstract
Background: The acquisition and exchange of meaningful, integrated, and accurate
information are at the forefront of the combat against COVID-19; still, there are many countries
whose health systems are disrupted. Moreover, no one is adequately equipped for COVID-19
contingencies. Many organizations have established static information systems to manage the information.
This fact presents numerous issues, including delays, inconsistencies, and inaccuracies
in COVID-19 information collected for pandemic control and monitoring.
Objective: This paper presents a semantic representation of COVID-19 data, a domain ontology
to facilitate measurement, clarification, linking, and sharing. We automatically generate a computer-
intelligible knowledge base from COVID-19 case information, which contains machineunderstandable
information. Furthermore, we have anticipated an ontology population algorithm
from tabular data that delivers interoperable, consistent, and accurate content with COVID-19 information.
Methods: We utilized the tabula package to extract the tables from PDF files and user NLP libraries
to sort and rearrange tables. The proposed algorithm was then applied to all instances to
automatically add to the input ontology using the Owlready Python module. Moreover, to evaluate
the performance, SPARQL queries were used to retrieve answers to competency questions.
Results: When there is an equivalence relationship, the suggested algorithm consistently finds the
right alignments and performs at its best or very close to it in terms of precision. Moreover, a
demonstration of algorithm performance and a case study on COVID-19 data to information
management and visualization of the populated data are also presented.
Conclusion: This paper presents an ontology learning/matching tool for ontology and populating
instances automatically to ontology by emphasizing the importance of a unit's distinguishing features
by unit matching.
Keywords:
Information extraction, ontology population, COVID-19 ontology, semantic web, ontology engineering, ontology reuse, ontology publishing.
Graphical Abstract
[1]
Maedche A, Pekar V, Staab S. Ontology learning part one—on discovering taxonomic relations from the web InWeb Intelligence. Berlin, Heidelberg: Springer 2003; pp. 301-19.
[2]
Cimiano P. Ontology learning and population from text: algorithms, evaluation and applications. Springer Science & Business Media 2006.
[3]
Wohlgenannt G, Belk S, Schett M. A Prototype for Automating Ontology Learning and Ontology Evolution 2013.
[4]
Li G, Ooi BC, Feng J, Wang J, Zhou L. Ease: an effective 3-in-1 keyword search method for unstructured, semistructured and structured data. In Proceedings of the 2008 ACM SIGMOD international conference on Management of data. 903-14.
[6]
He Y, Yu H, Ong E, et al. Cido: The community-based coronavirus infectious disease ontology. 2021.
[7]
Xiao Y, Zheng X, Song W, et al. CIDO-COVID-19: An Ontology for COVID-19 Based on CIDO. In 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). 2119-22.
[9]
Beverley J, Babcock S, Cowell L, Smith B. The COVID 19 Infectious Disease Ontology. 2021.
[13]
Farquhar A, Fikes R, Pratt W, Rice J. Collaborative ontology construction for information integration In: Technical Report KSL-95-63 Stanford University Knowledge Systems Laboratory 1995.
[14]
Pinto HS, Staab S, Tempich C. DILIGENT: Towards a fine-grained methodology for DIstributed, Looselycontrolled and evolvInG Engineering of oNTologies. In: ECAI. 2004; 16: p. 393.
[15]
Fernández-López M, Gómez-Pérez A, Juristo N. Methontology: from ontological art towards ontological engineering 1997; 33-40.
[18]
Freitag D, McCallum A. Information extraction with HMMs and shrinkage. In Proceedings of the AAAI-99 workshop on machine learning for information extraction. 31-6.
[24]
Noy NF, Crubézy M, Fergerson RW, et al. Protégé-2000: an open-source ontology-development and knowledge-acquisition environ-ment. AMIA Symposium. 953-3.
[30]
Aussenac-Gilles N, Despres S, Szulman S. The TERMINAE Method and Platform for Ontology Engineering from Texts 2008; pp. 199-23.
[33]
Finkelstein-Landau M, Morin E. Extracting semantic relationships between terms: Supervised vs. unsupervised methods. Interna-tional Workshop on Ontological Engineering on the Global Information Infrastructure. 71-80.
[34]
Makki J, Alquier AM, Prince Vs. Int J Humanit Soc Sci 2009; 3(3): 212-7.
[38]
Celjuska D, Vargas-Vera M. Ontosophie: A semi-automatic system for ontology population from text. In International Conference on Natural Language Processing (ICON). 60
[40]
Tanev H, Magnini B. Weakly supervised approaches for ontology population. In 11th Conference of the European Chapter of the Association for Computational Linguistics. 2006; pp 17-24.
[42]
Magnini B, Pianta E, Popescu O, Speranza M. Ontology population from textual mentions: Task definition and benchmark. In Proceedings of the 2nd Workshop on Ontology Learning and Population: Bridging the Gap between Text and Knowledge. 26-32.
[44]
Maynard D, Li Y, Peters W. NLP Techniques for Term Extraction and Ontology Population
[45]
Patel C, Supekar K, Lee Y. OntoGenie: Extracting ontology instances from WWW Human Language Technology for the Semantic Web and Web Services. ISWC 2003.
[48]
Buitelaar P, Cimiano P, Frank A, Racioppa S. SOBA: Smartweb ontology-based annotation. In Demo Session at the International Semantic Web Conference (ISWC).
[49]
Hurst M. Layout and language: Challenges for table understanding on the web. In Proceedings of the International Workshop on Web Document Analysis. 27-30.
[51]
Bauer F, Kaltenböck M. Linked open data: The essentials Edition mono/monochrom. Vienna 2011; 710.
[53]
Sharma S, Jain S. Comprehensive Study of Semantic Annotation: Variant and Praxis. In: Advances in Computational Intelligence, its Concepts & Applications (ACI). 2021; pp. 102-16.
[54]
Miller S, Guinness J, Zamanian A. Name tagging with word clusters and discriminative training. In: In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: HLT-NAACL 2004. 2004; pp. 337- 42.
[58]
Ghawi R, Cullot N. Database-to-ontology mapping generation for semantic interoperability. Third international workshop on database interoperability (InterDB 2007). 91