Semantic Web

A rule-based decision support system for aiding iron deficiency management

Wed, 2021-12-15 06:00

Health Informatics J. 2021 Oct-Dec;27(4):14604582211066054. doi: 10.1177/14604582211066054.

ABSTRACT

Iron is a vital mineral for the proper function of hemoglobin which is also a protein needed to transport oxygen in the blood. The lack of iron in human blood causes a range of serious health problems including "anemia." In this article, the COntAneRS (Clinical ONTology-based Iron Deficiency-ANEmia- Recommendation System) is proposed as a clinical decision support system to diagnose iron deficiency and manage its treatment. The applied methodologies and main technical contributions of this study are discussed in four aspects: (1) Iron Deficiency Domain Ontology (IDDOnt), (2) Semantic Web Rule Knowledgebase (SWRL), (3) Inference Engine, and (4) Physician Portal of the system. Experimental studies of the proposed system have been applied on a population of 200 people, consisting of real anemia patients and healthy individuals. First, a decision tree classifier is used to diagnose iron deficiency condition based on the patients' demographic information and certain medical data, as well as recently measured hemoglobin CBC levels of the patients. To check the effectiveness of the system, the data of 50 anonymous patients randomly selected from 200 patients are entered manually in the IDDOnt and the system is then verified according to the inferencing results. After inferencing step, the recommendations related to appropriate iron drugs, daily consumption dose, drug consumption periods, and additional medical suggestions about drug interactions are provided by the system to the responsible physician through system ontology, SWRL rules, and web services. As a result of experimental studies, our system has provided very good accuracy (99.5%) and robust results in producing patient-suitable suggestions. In addition, the applicability of the system on the cases is discussed as case studies in this paper. The results reported from the applied case studies are promising in demonstrating the applicability, effectiveness, and efficiency of the proposed approach.

PMID:34910611 | DOI:10.1177/14604582211066054

Categories: Literature Watch

Identifying opportunities for timely diagnosis of bladder and renal cancer via abnormal blood tests: a longitudinal linked data study

Tue, 2021-12-14 06:00

Br J Gen Pract. 2021 Dec 31;72(714):e19-e25. doi: 10.3399/BJGP.2021.0282. Print 2022 Jan.

ABSTRACT

BACKGROUND: Understanding pre-diagnostic test use could reveal diagnostic windows where more timely evaluation for cancer may be indicated.

AIM: To examine pre-diagnostic patterns of results of abnormal blood tests in patients with bladder and renal cancer.

DESIGN AND SETTING: A retrospective cohort study using primary care and cancer registry data on patients with bladder and renal cancer who were diagnosed between April 2012 and December 2015 in England.

METHOD: The rates of patients with a first abnormal result in the year before cancer diagnosis, for 'generic' (full blood count components, inflammatory markers, and calcium) and 'organ-specific' blood tests (creatinine and liver function test components) that may lead to subsequent detection of incidental cancers, were examined. Poisson regression was used to detect the month during which the cohort's rate of each abnormal test started to increase from baseline. The proportion of patients with a test found in the first half of the diagnostic window was examined, as these 'early' tests might represent opportunities where further evaluation could be initiated.

RESULTS: Data from 4533 patients with bladder and renal cancer were analysed. The monthly rate of patients with a first abnormal test increased towards the time of cancer diagnosis. Abnormalities of both generic (for example, high inflammatory markers) and organ-specific tests (for example, high creatinine) started to increase from 6-8 months pre-diagnosis, with 25%-40% of these patients having an abnormal test in the 'early half' of the diagnostic window.

CONCLUSION: Population-level signals of bladder and renal cancer can be observed in abnormalities in commonly performed primary care blood tests up to 8 months before diagnosis, indicating the potential for earlier diagnosis in some patients.

PMID:34903517 | PMC:PMC8714503 | DOI:10.3399/BJGP.2021.0282

Categories: Literature Watch

A botanical demonstration of the potential of linking data using unique identifiers for people

Tue, 2021-12-14 06:00

PLoS One. 2021 Dec 14;16(12):e0261130. doi: 10.1371/journal.pone.0261130. eCollection 2021.

ABSTRACT

Natural history collection data available digitally on the web have so far only made limited use of the potential of semantic links among themselves and with cross-disciplinary resources. In a pilot study, botanical collections of the Consortium of European Taxonomic Facilities (CETAF) have therefore begun to semantically annotate their collection data, starting with data on people, and to link them via a central index system. As a result, it is now possible to query data on collectors across different collections and automatically link them to a variety of external resources. The system is being continuously developed and is already in production use in an international collection portal.

PMID:34905557 | DOI:10.1371/journal.pone.0261130

Categories: Literature Watch

Semantic Network Analysis Using Construction Accident Cases to Understand Workers' Unsafe Acts

Fri, 2021-12-10 06:00

Int J Environ Res Public Health. 2021 Dec 1;18(23):12660. doi: 10.3390/ijerph182312660.

ABSTRACT

Unsafe acts by workers are a direct cause of accidents in the labor-intensive construction industry. Previous studies have reviewed past accidents and analyzed their causes to understand the nature of the human error involved. However, these studies focused their investigations on only a small number of construction accidents, even though a large number of them have been collected from various countries. Consequently, this study developed a semantic network analysis (SNA) model that uses approximately 60,000 construction accident cases to understand the nature of the human error that affects safety in the construction industry. A modified human factor analysis and classification system (HFACS) framework was used to classify major human error factors-that is, the causes of the accidents in each of the accident summaries in the accident case data-and an SNA analysis was conducted on all of the classified data to analyze correlations between the major factors that lead to unsafe acts. The results show that an overwhelming number of accidents occurred due to unintended acts such as perceptual errors (PERs) and skill-based errors (SBEs). Moreover, this study visualized the relationships between factors that affected unsafe acts based on actual construction accident case data, allowing for an intuitive understanding of the major keywords for each of the factors that lead to accidents.

PMID:34886388 | PMC:PMC8656935 | DOI:10.3390/ijerph182312660

Categories: Literature Watch

LIO-CSI: LiDAR inertial odometry with loop closure combined with semantic information

Wed, 2021-12-08 06:00

PLoS One. 2021 Dec 8;16(12):e0261053. doi: 10.1371/journal.pone.0261053. eCollection 2021.

ABSTRACT

Accurate and reliable state estimation and mapping are the foundation of most autonomous driving systems. In recent years, researchers have focused on pose estimation through geometric feature matching. However, most of the works in the literature assume a static scenario. Moreover, a registration based on a geometric feature is vulnerable to the interference of a dynamic object, resulting in a decline of accuracy. With the development of a deep semantic segmentation network, we can conveniently obtain the semantic information from the point cloud in addition to geometric information. Semantic features can be used as an accessory to geometric features that can improve the performance of odometry and loop closure detection. In a more realistic environment, semantic information can filter out dynamic objects in the data, such as pedestrians and vehicles, which lead to information redundancy in generated map and map-based localization failure. In this paper, we propose a method called LiDAR inertial odometry (LIO) with loop closure combined with semantic information (LIO-CSI), which integrates semantic information to facilitate the front-end process as well as loop closure detection. First, we made a local optimization on the semantic labels provided by the Sparse Point-Voxel Neural Architecture Search (SPVNAS) network. The optimized semantic information is combined into the front-end process of tightly-coupled light detection and ranging (LiDAR) inertial odometry via smoothing and mapping (LIO-SAM), which allows us to filter dynamic objects and improve the accuracy of the point cloud registration. Then, we proposed a semantic assisted scan-context method to improve the accuracy and robustness of loop closure detection. The experiments were conducted on an extensively used dataset KITTI and a self-collected dataset on the Jilin University (JLU) campus. The experimental results demonstrate that our method is better than the purely geometric method, especially in dynamic scenarios, and it has a good generalization ability.

PMID:34879118 | PMC:PMC8654169 | DOI:10.1371/journal.pone.0261053

Categories: Literature Watch

Changes of the Public Attitudes of China to Domestic COVID-19 Vaccination After the Vaccines Were Approved: A Semantic Network and Sentiment Analysis Based on Sina Weibo Texts

Fri, 2021-12-03 06:00

Front Public Health. 2021 Nov 11;9:723015. doi: 10.3389/fpubh.2021.723015. eCollection 2021.

ABSTRACT

Introduction: On December 31, 2020, the Chinese government announced that the domestic coronavirus disease-2019 (COVID-19) vaccines have obtained approval for conditional marketing and are free for vaccination. This release drove the attention of the public and intense debates on social media, which reflected public attitudes to the domestic vaccine. This study examines whether the public concerns and public attitudes to domestic COVID-19 vaccines changed after the official announcement. Methods: This article used big data analytics in the research, including semantic network and sentiment analysis. The purpose of the semantic network is to obtain the public concerns about domestic vaccines. Sentiment analysis reflects the sentiments of the public to the domestic vaccines and the emotional changes by comparing the specific sentiments shown on the posts before and after the official announcement. Results: There exists a correlation between the public concerns about domestic vaccines before and after the official announcement. According to the semantic network analysis, the public concerns about vaccines have changed after the official announcement. The public focused on the performance issues of the vaccines before the official approval, but they cared more about the practical issues of vaccination after that. The sentiment analysis showed that both positive and negative emotions increased among the public after the official announcement. "Good" was the most increased positive emotion and indicated great public appreciation for the production capacity and free vaccination. "Fear" was the significantly increased negative emotion and reflected the public concern about the safety of the vaccines. Conclusion: The official announcement of the approval for marketing improved the Chinese public acceptance of the domestic COVID-19 vaccines. In addition, safety and effectiveness are vital factors influencing public vaccine hesitancy.

PMID:34858918 | PMC:PMC8632040 | DOI:10.3389/fpubh.2021.723015

Categories: Literature Watch

Investigating the breast cancer screening-treatment-mortality pathway of women diagnosed with invasive breast cancer: Results from linked health data

Wed, 2021-12-01 06:00

Eur J Cancer Care (Engl). 2022 Jan;31(1):e13539. doi: 10.1111/ecc.13539. Epub 2021 Nov 30.

ABSTRACT

OBJECTIVE: To examine the screening-treatment-mortality pathway among women with invasive breast cancer in 2006-2014 using linked data.

METHODS: BreastScreen histories of South Australian women diagnosed with breast cancer (n = 8453) were investigated. Treatments recorded within 12 months from diagnosis were obtained from linked registry and administrative data. Associations of screening history with treatment were investigated using logistic regression and with cancer mortality outcomes using competing risk analyses, adjusting for socio-demographic, cancer and comorbidity characteristics.

RESULTS AND CONCLUSION: For screening ages of 50-69 years, 70% had participated in BreastScreen SA ≤ 5 years and 53% ≤ 2 years of diagnosis. Five-year disease-specific survival post-diagnosis was 90%. Compared with those not screened ≤5 years, women screened ≤2 years had higher odds, adjusted for socio-demographic, cancer and comorbidity characteristics, and diagnostic period, of breast-conserving surgery (aOR 2.5, 95% CI 1.9-3.2) and radiotherapy (aOR 1.2, 95% CI 1.1-1.3). These women had a lower unadjusted risk of post-diagnostic cancer mortality (SHR 0.33, 95% CI 0.27-0.41), partly mediated by stage (aSHR 0.65, 95% CI 0.51-0.81), and less breast surgery (aSHR 0.78, 95% CI 0.62-0.99). Screening ≤2 years and conserving surgery appeared to have a greater than additive association with lower post-diagnostic mortality (interaction term SHR 0.42, 95% CI 0.23-0.78). The screening-treatment-mortality pathway was investigated using linked data.

PMID:34850484 | DOI:10.1111/ecc.13539

Categories: Literature Watch

Fully automatic image colorization based on semantic segmentation technology

Tue, 2021-11-30 06:00

PLoS One. 2021 Nov 30;16(11):e0259953. doi: 10.1371/journal.pone.0259953. eCollection 2021.

ABSTRACT

Aiming at these problems of image colorization algorithms based on deep learning, such as color bleeding and insufficient color, this paper converts the study of image colorization to the optimization of image semantic segmentation, and proposes a fully automatic image colorization model based on semantic segmentation technology. Firstly, we use the encoder as the local feature extraction network and use VGG-16 as the global feature extraction network. These two parts do not interfere with each other, but they share the low-level feature. Then, the first fusion module is constructed to merge local features and global features, and the fusion results are input into semantic segmentation network and color prediction network respectively. Finally, the color prediction network obtains the semantic segmentation information of the image through the second fusion module, and predicts the chrominance of the image based on it. Through several sets of experiments, it is proved that the performance of our model becomes stronger and stronger under the nourishment of the data. Even in some complex scenes, our model can predict reasonable colors and color correctly, and the output effect is very real and natural.

PMID:34847177 | PMC:PMC8631650 | DOI:10.1371/journal.pone.0259953

Categories: Literature Watch

The Collaborative Metadata Repository (CoMetaR) Web App: Quantitative and Qualitative Usability Evaluation

Tue, 2021-11-30 06:00

JMIR Med Inform. 2021 Nov 29;9(11):e30308. doi: 10.2196/30308.

ABSTRACT

BACKGROUND: In the field of medicine and medical informatics, the importance of comprehensive metadata has long been recognized, and the composition of metadata has become its own field of profession and research. To ensure sustainable and meaningful metadata are maintained, standards and guidelines such as the FAIR (Findability, Accessibility, Interoperability, Reusability) principles have been published. The compilation and maintenance of metadata is performed by field experts supported by metadata management apps. The usability of these apps, for example, in terms of ease of use, efficiency, and error tolerance, crucially determines their benefit to those interested in the data.

OBJECTIVE: This study aims to provide a metadata management app with high usability that assists scientists in compiling and using rich metadata. We aim to evaluate our recently developed interactive web app for our collaborative metadata repository (CoMetaR). This study reflects how real users perceive the app by assessing usability scores and explicit usability issues.

METHODS: We evaluated the CoMetaR web app by measuring the usability of 3 modules: core module, provenance module, and data integration module. We defined 10 tasks in which users must acquire information specific to their user role. The participants were asked to complete the tasks in a live web meeting. We used the System Usability Scale questionnaire to measure the usability of the app. For qualitative analysis, we applied a modified think aloud method with the following thematic analysis and categorization into the ISO 9241-110 usability categories.

RESULTS: A total of 12 individuals participated in the study. We found that over 97% (85/88) of all the tasks were completed successfully. We measured usability scores of 81, 81, and 72 for the 3 evaluated modules. The qualitative analysis resulted in 24 issues with the app.

CONCLUSIONS: A usability score of 81 implies very good usability for the 2 modules, whereas a usability score of 72 still indicates acceptable usability for the third module. We identified 24 issues that serve as starting points for further development. Our method proved to be effective and efficient in terms of effort and outcome. It can be adapted to evaluate apps within the medical informatics field and potentially beyond.

PMID:34847059 | DOI:10.2196/30308

Categories: Literature Watch

Orchestrating Heterogeneous Devices and AI Services as Virtual Sensors for Secure Cloud-Based IoT Applications

Sat, 2021-11-27 06:00

Sensors (Basel). 2021 Nov 12;21(22):7509. doi: 10.3390/s21227509.

ABSTRACT

The concept of the cloud-to-thing continuum addresses advancements made possible by the widespread adoption of cloud, edge, and IoT resources. It opens the possibility of combining classical symbolic AI with advanced machine learning approaches in a meaningful way. In this paper, we present a thing registry and an agent-based orchestration framework, which we combine to support semantic orchestration of IoT use cases across several federated cloud environments. We use the concept of virtual sensors based on machine learning (ML) services as abstraction, mediating between the instance level and the semantic level. We present examples of virtual sensors based on ML models for activity recognition and describe an approach to remedy the problem of missing or scarce training data. We illustrate the approach with a use case from an assisted living scenario.

PMID:34833585 | DOI:10.3390/s21227509

Categories: Literature Watch

FAIR data representation in times of eScience: a comparison of instance-based and class-based semantic representations of empirical data using phenotype descriptions as example

Fri, 2021-11-26 06:00

J Biomed Semantics. 2021 Nov 25;12(1):20. doi: 10.1186/s13326-021-00254-0.

ABSTRACT

BACKGROUND: The size, velocity, and heterogeneity of Big Data outclasses conventional data management tools and requires data and metadata to be fully machine-actionable (i.e., eScience-compliant) and thus findable, accessible, interoperable, and reusable (FAIR). This can be achieved by using ontologies and through representing them as semantic graphs. Here, we discuss two different semantic graph approaches of representing empirical data and metadata in a knowledge graph, with phenotype descriptions as an example. Almost all phenotype descriptions are still being published as unstructured natural language texts, with far-reaching consequences for their FAIRness, substantially impeding their overall usability within the life sciences. However, with an increasing amount of anatomy ontologies becoming available and semantic applications emerging, a solution to this problem becomes available. Researchers are starting to document and communicate phenotype descriptions through the Web in the form of highly formalized and structured semantic graphs that use ontology terms and Uniform Resource Identifiers (URIs) to circumvent the problems connected with unstructured texts.

RESULTS: Using phenotype descriptions as an example, we compare and evaluate two basic representations of empirical data and their accompanying metadata in the form of semantic graphs: the class-based TBox semantic graph approach called Semantic Phenotype and the instance-based ABox semantic graph approach called Phenotype Knowledge Graph. Their main difference is that only the ABox approach allows for identifying every individual part and property mentioned in the description in a knowledge graph. This technical difference results in substantial practical consequences that significantly affect the overall usability of empirical data. The consequences affect findability, accessibility, and explorability of empirical data as well as their comparability, expandability, universal usability and reusability, and overall machine-actionability. Moreover, TBox semantic graphs often require querying under entailment regimes, which is computationally more complex.

CONCLUSIONS: We conclude that, from a conceptual point of view, the advantages of the instance-based ABox semantic graph approach outweigh its shortcomings and outweigh the advantages of the class-based TBox semantic graph approach. Therefore, we recommend the instance-based ABox approach as a FAIR approach for documenting and communicating empirical data and metadata in a knowledge graph.

PMID:34823588 | DOI:10.1186/s13326-021-00254-0

Categories: Literature Watch

RDFizing the biosynthetic pathway of E.coli O-antigen to enable semantic sharing of microbiology data

Tue, 2021-11-23 06:00

BMC Microbiol. 2021 Nov 22;21(1):325. doi: 10.1186/s12866-021-02384-y.

ABSTRACT

BACKGROUND: The abundance of glycomics data that have accumulated has led to the development of many useful databases to aid in the understanding of the function of the glycans and their impact on cellular activity. At the same time, the endeavor for data sharing between glycomics databases with other biological databases have contributed to the creation of new knowledgebases. However, different data types in data description have impeded the data sharing for knowledge integration. To solve this matter, Semantic Web techniques including Resource Description Framework (RDF) and ontology development have been adopted by various groups to standardize the format for data exchange. These semantic data have contributed to the expansion of knowledgebases and hold promises of providing data that can be intelligently processed. On the other hand, bench biologists who are experts in experimental finding are end users and data producers. Therefore, it is indispensable to reduce the technical barrier required for bench biologists to manipulate their experimental data to be compatible with standard formats for data sharing.

RESULTS: There are many essential concepts and practical techniques for data integration but there is no method to enable researchers to easily apply Semantic Web techniques to their experimental data. We implemented our procedure on unformatted information of E.coli O-antigen structures collected from the web and show how this information can be expressed as formatted data applicable to Semantic Web standards. In particular, we described the E-coli O-antigen biosynthesis pathway using the BioPAX ontology developed to support data exchange between pathway databases.

CONCLUSIONS: The method we implemented to semantically describe O-antigen biosynthesis should be helpful for biologists to understand how glycan information, including relevant pathway reaction data, can be easily shared. We hope this method can contribute to lower the technical barrier that is required when experimental findings are formulated into formal representations and can lead bench scientists to readily participate in the construction of new knowledgebases that are integrated with existing ones. Such integration over the Semantic Web will enable future work in artificial intelligence and machine learning to enable computers to infer new relationships and hypotheses in the life sciences.

PMID:34809564 | DOI:10.1186/s12866-021-02384-y

Categories: Literature Watch

Knowledge Engineering Framework for IoT Robotics Applied to Smart Healthcare and Emotional Well-Being

Mon, 2021-11-22 06:00

Int J Soc Robot. 2021 Nov 16:1-28. doi: 10.1007/s12369-021-00821-6. Online ahead of print.

ABSTRACT

Social companion robots are getting more attention to assist elderly people to stay independent at home and to decrease their social isolation. When developing solutions, one remaining challenge is to design the right applications that are usable by elderly people. For this purpose, co-creation methodologies involving multiple stakeholders and a multidisciplinary researcher team (e.g., elderly people, medical professionals, and computer scientists such as roboticists or IoT engineers) are designed within the ACCRA (Agile Co-Creation of Robots for Ageing) project. This paper will address this research question: How can Internet of Robotic Things (IoRT) technology and co-creation methodologies help to design emotional-based robotic applications? This is supported by the ACCRA project that develops advanced social robots to support active and healthy ageing, co-created by various stakeholders such as ageing people and physicians. We demonstra this with three robots, Buddy, ASTRO, and RoboHon, used for daily life, mobility, and conversation. The three robots understand and convey emotions in real-time using the Internet of Things and Artificial Intelligence technologies (e.g., knowledge-based reasoning).

PMID:34804257 | PMC:PMC8594653 | DOI:10.1007/s12369-021-00821-6

Categories: Literature Watch

FAIR data for prehistoric mining archaeology

Mon, 2021-11-22 06:00

Int J Digit Libr. 2021;22(3):267-277. doi: 10.1007/s00799-020-00282-8. Epub 2020 Jan 23.

ABSTRACT

This paper presents an approach how to create FAIR data for prehistoric mining archaeology, based on the CIDOC CRM ontology and semantic web standards. The interdisciplinary Research Centre HiMAT (History of mining activities in the Tyrol and adjacent areas, University of Innsbruck) investigates mining history from prehistoric to modern times with an interdisciplinary approach. One of the projects carried out at the research centre is the multinational DACH project "Prehistoric copper production in the eastern and central Alps". For a specific geographical region of the project, the data transformation to open and re-usable data is investigated in a separate Open Research Data pilot project. The methodological approach will use the FAIR principles to make data Findable, Accessible, Interoperable and Re-usable. Every archaeological investigation in Austria has to be documented according to the requirements of the Austrian Federal Monuments Office. This documentation is deposited in the CERN-based EU supported research data repository ZENODO. For each deposited file, metadata are created through the application of the conceptual metadata schema CIDOC CRM, an ISO standard for Cultural Heritage Information, which was adopted by ARIADNE, the European Union Research Infrastructure for archaeological resources. Concepts specific to mining archaeology research are organized with the DARIAH Back Bone Thesaurus, a model for sustainable interoperable thesauri maintenance, developed in the European Union Digital Research Infrastructure for the Arts and Humanities. Metadata are created through the extraction of information from the documentation and the transformation to a knowledge graph using semantic web standards. To facilitate usage, graph data are exported to hierarchical and tabular formats representing sites and objects with their geographic locations, temporal and typological assignments and links to the research activities and documents. Metadata are deposited together with the documentation into the repository.

PMID:34803481 | PMC:PMC8591667 | DOI:10.1007/s00799-020-00282-8

Categories: Literature Watch

Internet-based language production research with overt articulation: Proof of concept, challenges, and practical advice

Sat, 2021-11-20 06:00

Behav Res Methods. 2021 Nov 19. doi: 10.3758/s13428-021-01686-3. Online ahead of print.

ABSTRACT

Language production experiments with overt articulation have thus far only scarcely been conducted online, mostly due to technical difficulties related to measuring voice onset latencies. Especially the poor audiovisual synchrony in web experiments (Bridges et al. 2020) is a challenge to time-locking stimuli and participants' spoken responses. We tested the viability of conducting language production experiments with overt articulation in online settings using the picture-word interference paradigm - a classic task in language production research. In three pre-registered experiments (N = 48 each), participants named object pictures while ignoring visually superimposed distractor words. We implemented a custom voice recording option in two different web experiment builders and recorded naming responses in audio files. From these stimulus-locked audio files, we extracted voice onset latencies offline. In a control task, participants classified the last letter of a picture name as a vowel or consonant via button-press, a task that shows comparable semantic interference effects. We expected slower responses when picture and distractor word were semantically related compared to unrelated, independently of task. This semantic interference effect is robust, but relatively small. It should therefore crucially depend on precise timing. We replicated this effect in an online setting, both for button-press and overt naming responses, providing a proof of concept that naming latency - a key dependent variable in language production research - can be reliably measured in online experiments. We discuss challenges for online language production research and suggestions of how to overcome them. The scripts for the online implementation are made available.

PMID:34799842 | DOI:10.3758/s13428-021-01686-3

Categories: Literature Watch

A Methodology for an Auto-Generated and Auto-Maintained HL7 FHIR OWL Ontology for Health Data Management

Fri, 2021-11-19 06:00

Stud Health Technol Inform. 2021 Nov 18;287:99-103. doi: 10.3233/SHTI210824.

ABSTRACT

The process of maintenance of an underlying semantic model that supports data management and addresses the interoperability challenges in the domain of telemedicine and integrated care is not a trivial task when performed manually. We present a methodology that leverages the provided serializations of the Health Level Seven International (HL7) Fast Health Interoperability Resources (FHIR) specification to generate a fully functional OWL ontology along with the semantic provisions for maintaining functionality upon future changes of the standard. The developed software makes a complete conversion of the HL7 FHIR Resources along with their properties and their semantics and restrictions. It covers all FHIR data types (primitive and complex) along with all defined resource types. It can operate to build an ontology from scratch or to update an existing ontology, providing the semantics that are needed, to preserve information described using previous versions of the standard. All the results based on the latest version of HL7 FHIR as a Web Ontology Language (OWL-DL) ontology are publicly available for reuse and extension.

PMID:34795090 | DOI:10.3233/SHTI210824

Categories: Literature Watch

Epione application: An integrated web-toolkit of clinical genomics and personalized medicine in systemic lupus erythematosus

Thu, 2021-11-18 06:00

Int J Mol Med. 2022 Jan;49(1):8. doi: 10.3892/ijmm.2021.5063. Epub 2021 Nov 18.

ABSTRACT

Genome wide association studies (GWAS) have identified autoimmune disease‑associated loci, a number of which are involved in numerous disease‑associated pathways. However, much of the underlying genetic and pathophysiological mechanisms remain to be elucidated. Systemic lupus erythematosus (SLE) is a chronic, highly heterogeneous autoimmune disease, characterized by differences in autoantibody profile, serum cytokines and a multi‑system involvement. This study presents the Epione application, an integrated bioinformatics web‑toolkit, designed to assist medical experts and researchers in more accurately diagnosing SLE. The application aims to identify the most credible gene variants and single nucleotide polymorphisms (SNPs) associated with SLE susceptibility, by using patient's genomic data to aid the medical expert in SLE diagnosis. The application contains useful knowledge of >70,000 SLE‑related publications that have been analyzed, using data mining and semantic techniques, towards extracting the SLE‑related genes and the corresponding SNPs. Probable genes associated with the patient's genomic profile are visualized with several graphs, including chromosome ideograms, statistic bars and regulatory networks through data mining studies with relative publications, to obtain a representative number of the most credible candidate genes and biological pathways associated with the SLE. Furthermore, an evaluation study was performed on a patient diagnosed with SLE and is presented herein. Epione has also been expanded in family‑related candidate patients to evaluate its predictive power. All the recognized gene variants that were previously considered to be associated with SLE were accurately identified in the output profile of the patient, and by comparing the results, novel findings have emerged. The Epione application may assist and facilitate in early stage diagnosis by using the patients' genomic profile to compare against the list of the most predictable candidate gene variants related to SLE. Its diagnosis‑oriented output presents the user with a structured set of results on variant association, position in genome and links to specific bibliography and gene network associations. The overall aim of the present study was to provide a reliable tool for the most effective study of SLE. This novel and accessible webserver tool of SLE is available at http://geneticslab.aua.gr/epione/.

PMID:34791504 | DOI:10.3892/ijmm.2021.5063

Categories: Literature Watch

PheneBank: a literature-based database of phenotypes

Wed, 2021-11-17 06:00

Bioinformatics. 2021 Nov 12:btab740. doi: 10.1093/bioinformatics/btab740. Online ahead of print.

ABSTRACT

MOTIVATION: Significant effort has been spent by curators to create coding systems for phenotypes such as the Human Phenotype Ontology (HPO), as well as disease-phenotype annotations. We aim to support the discovery of literature-based phenotypes and integrate them into the knowledge discovery process.

RESULTS: PheneBank is a Web-portal for retrieving human phenotype-disease associations that have been text-mined from the whole of Medline. Our approach exploits state-of-the-art machine learning for concept identification by utilising an expert annotated rare disease corpus from the PMC Text Mining subset. Evaluation of the system for entities is conducted on a gold-standard corpus of rare disease sentences and for associations against the Monarch initiative data.

AVAILABILITY: The PheneBank Web-portal freely available at http://www.phenebank.org. Annotated Medline data is available from Zenodo at DOI: 10.5281/zenodo.1408800. Semantic annotation software is freely available for non-commercial use at GitHub: https://github.com/pilehvar/phenebank.

SUPPLEMENTARY INFORMATION: Supplementary data is available at Bioinformatics online.

PMID:34788791 | DOI:10.1093/bioinformatics/btab740

Categories: Literature Watch

The ontology of fast food facts: conceptualization of nutritional fast food data for consumers and semantic web applications

Wed, 2021-11-10 06:00

BMC Med Inform Decis Mak. 2021 Nov 9;21(Suppl 7):275. doi: 10.1186/s12911-021-01636-1.

ABSTRACT

BACKGROUND: Fast food with its abundance and availability to consumers may have health consequences due to the high calorie intake which is a major contributor to life threatening diseases. Providing nutritional information has some impact on consumer decisions to self regulate and promote healthier diets, and thus, government regulations have mandated the publishing of nutritional content to assist consumers, including for fast food. However, fast food nutritional information is fragmented, and we realize a benefit to collate nutritional data to synthesize knowledge for individuals.

METHODS: We developed the ontology of fast food facts as an opportunity to standardize knowledge of fast food and link nutritional data that could be analyzed and aggregated for the information needs of consumers and experts. The ontology is based on metadata from 21 fast food establishment nutritional resources and authored in OWL2 using Protégé.

RESULTS: Three evaluators reviewed the logical structure of the ontology through natural language translation of the axioms. While there is majority agreement (76.1% pairwise agreement) of the veracity of the ontology, we identified 103 out of the 430 statements that were erroneous. We revised the ontology and publicably published the initial release of the ontology. The ontology has 413 classes, 21 object properties, 13 data properties, and 494 logical axioms.

CONCLUSION: With the initial release of the ontology of fast food facts we discuss some future visions with the continued evolution of this knowledge base, and the challenges we plan to address, like the management and publication of voluminous amount of semantically linked fast food nutritional data.

PMID:34753474 | DOI:10.1186/s12911-021-01636-1

Categories: Literature Watch

A Virtual Community for Disability Advocacy: Development of a Searchable Artificial Intelligence-Supported Platform

Fri, 2021-11-05 06:00

JMIR Form Res. 2021 Nov 5;5(11):e33335. doi: 10.2196/33335.

ABSTRACT

BACKGROUND: The lack of availability of disability data has been identified as a major challenge hindering continuous disability equity monitoring. It is important to develop a platform that enables searching for disability data to expose systemic discrimination and social exclusion, which increase vulnerability to inequitable social conditions.

OBJECTIVE: Our project aims to create an accessible and multilingual pilot disability website that structures and integrates data about people with disabilities and provides data for national and international disability advocacy communities. The platform will be endowed with a document upload function with hybrid (automated and manual) paragraph tagging, while the querying function will involve an intelligent natural language search in the supported languages.

METHODS: We have designed and implemented a virtual community platform using Wikibase, Semantic Web, machine learning, and web programming tools to enable disability communities to upload and search for disability documents. The platform data model is based on an ontology we have designed following the United Nations Convention on the Rights of Persons with Disabilities (CRPD). The virtual community facilitates the uploading and sharing of validated information, and supports disability rights advocacy by enabling dissemination of knowledge.

RESULTS: Using health informatics and artificial intelligence techniques (namely Semantic Web, machine learning, and natural language processing techniques), we were able to develop a pilot virtual community that supports disability rights advocacy by facilitating uploading, sharing, and accessing disability data. The system consists of a website on top of a Wikibase (a Semantic Web-based datastore). The virtual community accepts 4 types of users: information producers, information consumers, validators, and administrators. The virtual community enables the uploading of documents, semiautomatic tagging of their paragraphs with meaningful keywords, and validation of the process before uploading the data to the disability Wikibase. Once uploaded, public users (information consumers) can perform a semantic search using an intelligent and multilingual search engine (QAnswer). Further enhancements of the platform are planned.

CONCLUSIONS: The platform ontology is flexible and can accommodate advocacy reports and disability policy and legislation from specific jurisdictions, which can be accessed in relation to the CRPD articles. The platform ontology can be expanded to fit international contexts. The virtual community supports information upload and search. Semiautomatic tagging and intelligent multilingual semantic search using natural language are enabled using artificial intelligence techniques, namely Semantic Web, machine learning, and natural language processing.

PMID:34738910 | DOI:10.2196/33335

Categories: Literature Watch

Pages