Semantic Web
Automatically Assessing Quality of Online Health Articles.
Automatically Assessing Quality of Online Health Articles.
IEEE J Biomed Health Inform. 2020 Oct 20;PP:
Authors: Afsana F, Kabir MA, Hassan N, Paul M
Abstract
Today information in the world wide web is overwhelmed by unprecedented quantity of data on versatile topics with varied quality. However, the quality of information disseminated in the field of medicine has been questioned as the negative health consequences of health misinformation can be life-threatening.There is currently no generic automated tool for evaluating the quality of online health information spanned over broad range. To address this gap, in this paper, we applied data mining approach to automatically assess the quality of online health articles based on 10 quality criteria. We have prepared a labelled dataset with 53012 features and applied different feature selection methods to identify the best feature subset with which our trained classifier achieved an accuracy of 84%-90% varied over 10 criteria. Our semantic analysis of features shows the underpinning associations between the selected features and assessment criteria and further rationalize our assessment approach. Our findings will help in identifying high quality health articles and thus aiding users in shaping their opinion to make right choice while picking health related help from online.
PMID: 33079686 [PubMed - as supplied by publisher]
Integrating Unified Medical Language System and Kleinberg's Burst Detection Algorithm into Research Topics of Medications for Post-Traumatic Stress Disorder.
Integrating Unified Medical Language System and Kleinberg's Burst Detection Algorithm into Research Topics of Medications for Post-Traumatic Stress Disorder.
Drug Des Devel Ther. 2020;14:3899-3913
Authors: Xu S, Xu D, Wen L, Zhu C, Yang Y, Han S, Guan P
Abstract
Background: The treatment of post-traumatic stress disorder (PTSD) has long been a challenge because the symptoms of PTSD are multifaceted. PTSD is primarily treated with psychotherapy and medication, or a combination of psychotherapy and medication. The present study was designed to analyze the literature on medications for PTSD and explore high-frequency common drugs and low-frequency burst drugs by burst detection algorithm combined with Unified Medical Language System (UMLS) and provide references for developing new drugs for PTSD.
Methods: Publications related to medications for PTSD from 2010 to 2019 were identified through PubMed, Web of Science Core Collection, and BIOSIS Previews. SemRep and SemRep semantic result processing system were performed to extract the set of drug concepts with therapeutic relationship according to the semantic relationship of UMLS. Kleinberg's burst detection algorithm was applied to calculate the burst weight index of drug concepts by a Java-based program. These concepts were sorted according to the frequency and the burst weight index.
Results: Four hundred and fifty-nine treatment-related drug concepts were extracted. The drug with the highest burst weight index was "Psilocybine", a hallucinogen, which was more likely to be a hotspot for the pharmacotherapy of PTSD. The highest frequency concept was "prazosin", which was more likely to be the focus of research in the medications for PTSD.
Conclusion: The present study assessed the medication-related literature on PTSD treatment, providing a framework of burst words detection-based method, a baseline of information for future research and the new attempt for the discovery of textual knowledge. The bibliometric analysis based on the burst detection algorithm combined with UMLS has shown certain feasibility in amplifying the microscopic changes of a specific research direction in a field, it can also be used in other aspects of disease and to explore the trends of various disciplines.
PMID: 33061296 [PubMed - in process]
Creativity in temporal social networks: how divergent thinking is impacted by one's choice of peers.
Creativity in temporal social networks: how divergent thinking is impacted by one's choice of peers.
J R Soc Interface. 2020 Oct;17(171):20200667
Authors: Baten RA, Bagley D, Tenesaca A, Clark F, Bagrow JP, Ghoshal G, Hoque E
Abstract
Creativity is viewed as one of the most important skills in the context of future-of-work. In this paper, we explore how the dynamic (self-organizing) nature of social networks impacts the fostering of creative ideas. We run six trials (N = 288) of a web-based experiment involving divergent ideation tasks. We find that network connections gradually adapt to individual creative performances, as the participants predominantly seek to follow high-performing peers for creative inspirations. We unearth both opportunities and bottlenecks afforded by such self-organization. While exposure to high-performing peers is associated with better creative performances of the followers, we see a counter-effect that choosing to follow the same peers introduces semantic similarities in the followers' ideas. We formulate an agent-based simulation model to capture these intuitions in a tractable manner, and experiment with corner cases of various simulation parameters to assess the generality of the findings. Our findings may help design large-scale interventions to improve the creative aptitude of people interacting in a social network.
PMID: 33050776 [PubMed - in process]
Protein ontology on the semantic web for knowledge discovery.
Protein ontology on the semantic web for knowledge discovery.
Sci Data. 2020 Oct 12;7(1):337
Authors: Chen C, Huang H, Ross KE, Cowart JE, Arighi CN, Wu CH, Natale DA
Abstract
The Protein Ontology (PRO) provides an ontological representation of protein-related entities, ranging from protein families to proteoforms to complexes. Protein Ontology Linked Open Data (LOD) exposes, shares, and connects knowledge about protein-related entities on the Semantic Web using Resource Description Framework (RDF), thus enabling integration with other Linked Open Data for biological knowledge discovery. For example, proteins (or variants thereof) can be retrieved on the basis of specific disease associations. As a community resource, we strive to follow the Findability, Accessibility, Interoperability, and Reusability (FAIR) principles, disseminate regular updates of our data, support multiple methods for accessing, querying and downloading data in various formats, and provide documentation both for scientists and programmers. PRO Linked Open Data can be browsed via faceted browser interface and queried using SPARQL via YASGUI. RDF data dumps are also available for download. Additionally, we developed RESTful APIs to support programmatic data access. We also provide W3C HCLS specification compliant metadata description for our data. The PRO Linked Open Data is available at https://lod.proconsortium.org/ .
PMID: 33046717 [PubMed - in process]
The Synthetic Biology Open Language (SBOL) Version 3: Simplified Data Exchange for Bioengineering.
The Synthetic Biology Open Language (SBOL) Version 3: Simplified Data Exchange for Bioengineering.
Front Bioeng Biotechnol. 2020;8:1009
Authors: McLaughlin JA, Beal J, Mısırlı G, Grünberg R, Bartley BA, Scott-Brown J, Vaidyanathan P, Fontanarrosa P, Oberortner E, Wipat A, Gorochowski TE, Myers CJ
Abstract
The Synthetic Biology Open Language (SBOL) is a community-developed data standard that allows knowledge about biological designs to be captured using a machine-tractable, ontology-backed representation that is built using Semantic Web technologies. While early versions of SBOL focused only on the description of DNA-based components and their sub-components, SBOL can now be used to represent knowledge across multiple scales and throughout the entire synthetic biology workflow, from the specification of a single molecule or DNA fragment through to multicellular systems containing multiple interacting genetic circuits. The third major iteration of the SBOL standard, SBOL3, is an effort to streamline and simplify the underlying data model with a focus on real-world applications, based on experience from the deployment of SBOL in a variety of scientific and industrial settings. Here, we introduce the SBOL3 specification both in comparison to previous versions of SBOL and through practical examples of its use.
PMID: 33015004 [PubMed]
A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases.
A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases.
J Indian Inst Sci. 2020 Sep 29;:1-7
Authors: Collard J, Bhat T, Subrahmanian E, Monarch I, Tash J, Sriram R, Elliot J
Abstract
This short paper describes a web resource-the NIST CORD-19 Web Resource-for community explorations of the COVID-19 Open Research Dataset (CORD-19). The tools for exploration in the web resource make use of the NIST-developed Root- and Rule-based method, which exploits underlying linguistic structures to create terms that represent phrases in a corpus. The method allows for auto-suggesting-related terms to discover terms to refine the search of a COVID-19 heterogenous document base. The method also produces taxonomic structures in the target domain as well as providing semantic information about the relationships between terms. This term structure can serve as a basis for creating topic modeling and trend analysis tools. In this paper, we describe use of a novel search engine to demonstrate some of the capabilities above.
PMID: 33013023 [PubMed - as supplied by publisher]
Microblog topic identification using Linked Open Data.
Microblog topic identification using Linked Open Data.
PLoS One. 2020;15(8):e0236863
Authors: Yıldırım A, Uskudarli S
Abstract
Much valuable information is embedded in social media posts (microposts) which are contributed by a great variety of persons about subjects that of interest to others. The automated utilization of this information is challenging due to the overwhelming quantity of posts and the distributed nature of the information related to subjects across several posts. Numerous approaches have been proposed to detect topics from collections of microposts, where the topics are represented by lists of terms such as words, phrases, or word embeddings. Such topics are used in tasks like classification and recommendations. The interpretation of topics is considered a separate task in such methods, albeit they are becoming increasingly human-interpretable. This work proposes an approach for identifying machine-interpretable topics of collective interest. We define topics as a set of related elements that are associated by having posted in the same contexts. To represent topics, we introduce an ontology specified according to the W3C recommended standards. The elements of the topics are identified via linking entities to resources published on Linked Open Data (LOD). Such representation enables processing topics to provide insights that go beyond what is explicitly expressed in the microposts. The feasibility of the proposed approach is examined by generating topics from more than one million tweets collected from Twitter during various events. The utility of these topics is demonstrated with a variety of topic-related tasks along with a comparison of the effort required to perform the same tasks with words-list-based representations. Manual evaluation of randomly selected 36 sets of topics yielded 81.0% and 93.3% for the precision and F1 scores respectively.
PMID: 32780736 [PubMed - indexed for MEDLINE]
Developing an ontology for representing the domain knowledge specific to non-pharmacological treatment for agitation in dementia.
Developing an ontology for representing the domain knowledge specific to non-pharmacological treatment for agitation in dementia.
Alzheimers Dement (N Y). 2020;6(1):e12061
Authors: Zhang Z, Yu P, Chang HCR, Lau SK, Tao C, Wang N, Yin M, Deng C
Abstract
Introduction: A large volume of clinical care data has been generated for managing agitation in dementia. However, the valuable information in these data has not been used effectively to generate insights for improving the quality of care. Application of artificial intelligence technologies offers us enormous opportunities to reuse these data. For health data science to achieve this, this study focuses on using ontology to coding clinical knowledge for non-pharmacological treatment of agitation in a machine-readable format.
Methods: The resultant ontology-Dementia-Related Agitation Non-Pharmacological Treatment Ontology (DRANPTO)-was developed using a method adopted from the NeOn methodology.
Results: DRANPTO consisted of 569 concepts and 48 object properties. It meets the standards for biomedical ontology.
Discussion: DRANPTO is the first comprehensive semantic representation of non-pharmacological management for agitation in dementia in the long-term care setting. As a knowledge base, it will play a vital role to facilitate the development of intelligent systems for managing agitation in dementia.
PMID: 32995470 [PubMed]
Intersection of the Web-Based Vaping Narrative With COVID-19: Topic Modeling Study
J Med Internet Res. 2020 Oct 30;22(10):e21743. doi: 10.2196/21743.
ABSTRACT
BACKGROUND: The COVID-19 outbreak was designated a global pandemic on March 11, 2020. The relationship between vaping and contracting COVID-19 is unclear, and information on the internet is conflicting. There is some scientific evidence that vaping cannabidiol (CBD), an active ingredient in cannabis that is obtained from the hemp plant, or other substances is associated with more severe manifestations of COVID-19. However, there is also inaccurate information that vaping can aid COVID-19 treatment, as well as expert opinion that CBD, possibly administered through vaping, can mitigate COVID-19 symptoms. Thus, it is necessary to study the spread of inaccurate information to better understand how to promote scientific knowledge and curb inaccurate information, which is critical to the health of vapers. Inaccurate information about vaping and COVID-19 may affect COVID-19 treatment outcomes.
OBJECTIVE: Using structural topic modeling, we aimed to map temporal trends in the web-based vaping narrative (a large data set comprising web-based vaping chatter from several sources) to indicate how the narrative changed from before to during the COVID-19 pandemic.
METHODS: We obtained data using a textual query that scanned a data pool of approximately 200,000 different domains (4,027,172 documents and 361,100,284 words) such as public internet forums, blogs, and social media, from August 1, 2019, to April 21, 2020. We then used structural topic modeling to understand changes in word prevalence and semantic structures within topics around vaping before and after December 31, 2019, when COVID-19 was reported to the World Health Organization.
RESULTS: Broadly, the web-based vaping narrative can be organized into the following groups or archetypes: harms from vaping; Vaping Regulation; Vaping as Harm Reduction or Treatment; and Vaping Lifestyle. Three archetypes were observed prior to the emergence of COVID-19; however, four archetypes were identified post-COVID-19 (Vaping as Harm Reduction or Treatment was the additional archetype). A topic related to CBD product preference emerged after COVID-19 was first reported, which may be related to the use of CBD by vapers as a COVID-19 treatment.
CONCLUSIONS: Our main finding is the emergence of a vape-administered CBD treatment narrative around COVID-19 when comparing the web-based vaping narratives before and during the COVID-19 pandemic. These results are key to understanding how vapers respond to inaccurate information about COVID-19, optimizing treatment of vapers who contract COVID-19, and possibly minimizing instances of inaccurate information. The findings have implications for the management of COVID-19 among vapers and the monitoring of web-based content pertinent to tobacco to develop targeted interventions to manage COVID-19 among vapers.
PMID:33001829 | PMC:PMC7641646 | DOI:10.2196/21743
PMO: A knowledge representation model towards precision medicine.
PMO: A knowledge representation model towards precision medicine.
Math Biosci Eng. 2020 Jun 08;17(4):4098-4114
Authors: Hou L, Wu M, Kang HY, Zheng S, Shen L, Qian Q, Li J
Abstract
With the rapid development of biomedical technology, amounts of data in the field of precision medicine (PM) are growing exponentially. Valuable knowledge is included in scattered data in which meaningful biomedical entities and their semantic relationships are buried. Therefore, it is necessary to develop a knowledge representation model like ontology to formally represent the relationships among diseases, phenotypes, genes, mutations, drugs, etc. and achieve effective integration of heterogeneous data. On basis of existing work, our study focus on solving the following issues: (i) Selecting the primary entities in PM domain; (ii) collecting and integrating biomedical vocabularies related to the above entities; (iii) defining and normalizing semantic relationships among these entities. We proposed a semi-automated method which improved the original Ontology Development 101 method to build the Precision Medicine Ontology (PMO), including defining the scope of the PMO according to the definition of PM, collecting terms from different biomedical resources, integrating and normalizing the terms by a combination of machine and manual work, defining the annotation properties, reusing existing ontologies and taxonomies, defining semantic relationships, evaluating PMO and creating the PMO website. Finally, the Precision Medicine Vocabulary (PMV) contains 4.53 million terms collected from 62 biomedical vocabularies, and the PMO includes eleven branches of PM concepts such as disease, chemical and drug, phenotype, gene, mutation, gene product and cell, described by 93 semantic relationships among them. PMO is an open, extensible ontology of PM, all of the terms and relationships in which could be obtained from the PMO website (http://www.phoc.org.cn/pmo/). Compared to existing project, our work has brought a broader and deeper coverage of mutation, gene and gene product, which enriches the semantic type and vocabulary in PM domain and benefits all users in terms of medical literature annotation, text mining and knowledge base construction.
PMID: 32987570 [PubMed - in process]
Semantic strategies in ubiquitous music: Deploying the sound sphere ecology in transitional settings.
Semantic strategies in ubiquitous music: Deploying the sound sphere ecology in transitional settings.
Heliyon. 2020 Sep;6(9):e04843
Authors: Keller D, Freitas B, Bessa WRB, Simurra I, Farias FM
Abstract
We report the results of a study involving twenty subjects doing musical activities in transitional settings, supported by an ecology of tools based on the metaphor for creative action Sound Sphere. The Sound Sphere Ecology (SFS) is a set of web-based tools, loosely organized around audio mixing and processing tasks. It employs verbal strategies for knowledge transfer to provide support for lay participants and specialists. To understand how the stakeholders influence and are influenced by this design strategy, we carried out a series of experiments involving assessments of the participants' behaviours and of the sonic products during various creative musical tasks with SFS. The overall results were positive, indicating that the proposed metaphor provides effective support for casual interaction, highlighting the participants' level of engagement. As a downside, the assessments pointed to ease of use as the lowest and less consistent item among the rated creative factors. We discuss the implications of these results and propose various design enhancements to enable the usage of a larger pool of resources. Considering the heterogeneous profiles of casual stakeholders, methodological refinements are also proposed to assess the knowledge gained by the participants during the exploratory activities, while augmenting their ability to share knowledge. This is one of the first studies on creativity-action metaphors for casual interaction.
PMID: 32984585 [PubMed]
Indoor location identification of patients for directing virtual care: An AI approach using machine learning and knowledge-based methods.
Indoor location identification of patients for directing virtual care: An AI approach using machine learning and knowledge-based methods.
Artif Intell Med. 2020 Aug;108:101931
Authors: Van Woensel W, Roy PC, Abidi SSR, Abidi SR
Abstract
In a digitally enabled healthcare setting, we posit that an individual's current location is pivotal for supporting many virtual care services-such as tailoring educational content towards an individual's current location, and, hence, current stage in an acute care process; improving activity recognition for supporting self-management in a home-based setting; and guiding individuals with cognitive decline through daily activities in their home. However, unobtrusively estimating an individual's indoor location in real-world care settings is still a challenging problem. Moreover, the needs of location-specific care interventions go beyond absolute coordinates and require the individual's discrete semantic location; i.e., it is the concrete type of an individual's location (e.g., exam vs. waiting room; bathroom vs. kitchen) that will drive the tailoring of educational content or recognition of activities. We utilized Machine Learning methods to accurately identify an individual's discrete location, together with knowledge-based models and tools to supply the associated semantics of identified locations. We considered clustering solutions to improve localization accuracy at the expense of granularity; and investigate sensor fusion-based heuristics to rule out false location estimates. We present an AI-driven indoor localization approach that integrates both data-driven and knowledge-based processes and artifacts. We illustrate the application of our approach in two compelling healthcare use cases, and empirically validated our localization approach at the emergency unit of a large Canadian pediatric hospital.
PMID: 32972660 [PubMed - in process]
Supporting Topic Modeling and Trends Analysis in Biomedical Literature.
Supporting Topic Modeling and Trends Analysis in Biomedical Literature.
J Biomed Inform. 2020 Sep 21;:103574
Authors: Kavvadias S, Drosatos G, Kaldoudi E
Abstract
Topic modeling refers to a suite of probabilistic algorithms for extracting popular topics from a collection of documents. A common approach involves the use of the Latent Dirichlet Allocation (LDA) algorithm, and, although free implementations are available, their deployment in general requires a certain degree of programming expertise. This paper presents a user-friendly web-based application, specifically designed for the biomedical professional, that supports the entire process of topic modeling and comparative trends analysis of scientific literature. The application was evaluated for its efficacy and usability by intended users with no programming expertise (15 biomedical professionals). Results of evaluation showed a positive acceptance of system functionalities and an overall usability score of 76/100 in the System Usability Score (SUS) scale. This suggests that literature topic modeling can become more popular amongst biomedical professionals via the use of a user-friendly application that fully supports the entire workflow, thus opening new perspectives for literature review and scientific research.
PMID: 32971274 [PubMed - as supplied by publisher]
User-Centered Design of a Web-Based Crowdsourcing-Integrated Semantic Text Annotation Tool for Building a Mental Health Knowledge Base.
User-Centered Design of a Web-Based Crowdsourcing-Integrated Semantic Text Annotation Tool for Building a Mental Health Knowledge Base.
J Biomed Inform. 2020 Sep 19;:103571
Authors: He X, Zhang H, Bian J
Abstract
BACKGROUND: One in five U.S. adults lives with some kind of mental health condition and 4.6% of all U.S. adults have a serious mental illness. The Internet has become the first place for these people to seek online mental health information for help. However, online mental health information is not well-organized and often of low quality. There have been efforts in building evidence-based mental health knowledgebases curated with information manually extracted from the high-quality scientific literature. Manual extraction is inefficient. Crowdsourcing can potentially be a low-cost mechanism to collect labeled data from non-expert laypeople. However, there is not an existing annotation tool integrated with popular crowdsourcing platforms to perform the information extraction tasks. In our previous work, we prototyped a Semantic Text Annotation Tool (STAT) to address this gap.
OBJECTIVE: We aimed to refine the STAT prototype (1) to improve its usability and (2) to enhance the crowdsourcing workflow efficiency to facilitate the construction of evidence-based mental health knowledgebase, following a user-centered design (UCD) approach.
METHODS: Following UCD principles, we conducted four design iterations to improve the initial STAT prototype. In the first two iterations, usability testing focus groups were conducted internally with 8 participants recruited from a convenient sample, and the usability was evaluated with a modified System Usability Scale (SUS). In the following two iterations, usability testing was conducted externally using the Amazon Mechanical Turk (MTurk) platform. In each iteration, we summarized the usability testing results through thematic analysis, identified usability issues, and conducted a heuristic evaluation to map identified usability issues to Jakob Nielsen's usability heuristics. We collected suggested improvements in the usability testing sessions and enhanced STAT accordingly in the next UCD iteration. After four UCD iterations, we conducted a case study of the system on MTurk using mental health related scientific literature. We compared the performance of crowdsourcing workers with two expert annotators from two aspects: efficiency and quality.
RESULTS: The SUS score increased from 70.3 ± 12.5 to 81.1 ± 9.8 after the two internal UCD iterations as we improved STAT's functionality based on the suggested improvements. We then evaluated STAT externally through MTurk in the following two iterations. The SUS score decreased to 55.7 ± 20.1 in the third iteration, probably because of the complexity of the tasks. After further simplification of STAT and the annotation tasks with an improved annotation guideline, the SUS score increased to 73.8 ± 13.8 in the fourth iteration of UCD. In the evaluation case study, on average, the workers spent 125.5 ± 69.2 seconds on the onboarding tutorial and the crowdsourcing workers spent significantly less time on the annotation tasks compared to the two experts. In terms of annotation quality, the workers' annotation results achieved average F1-scores ranged from 0.62 to 0.84 for the different sentences.
CONCLUSIONS: We successfully developed a web-based semantic text annotation tool, STAT, to facilitate the curation of semantic web knowledgebases through four UCD iterations. The lessons learned from the UCD process could serve as a guide to further enhance STAT and the development and design of other crowdsourcing-based semantic text annotation tasks. Our study also showed that a well-organized, informative annotation guideline is as important as the annotation tool itself. Further, we learned that a crowdsourcing task should consist of multiple simple microtasks rather than a complicated task.
PMID: 32961307 [PubMed - as supplied by publisher]
An interactive retrieval system for clinical trial studies with context-dependent protocol elements.
An interactive retrieval system for clinical trial studies with context-dependent protocol elements.
PLoS One. 2020;15(9):e0238290
Authors: Park J, Park S, Kim K, Hwang W, Yoo S, Yi GS, Lee D
Abstract
A well-defined protocol for a clinical trial guarantees a successful outcome report. When designing the protocol, most researchers refer to electronic databases and extract protocol elements using a keyword search. However, state-of-the-art database systems only offer text-based searches for user-entered keywords. In this study, we present a database system with a context-dependent and protocol-element-selection function for successfully designing a clinical trial protocol. To do this, we first introduce a database for a protocol retrieval system constructed from individual protocol data extracted from 184,634 clinical trials and 13,210 frame structures of clinical trial protocols. The database contains a variety of semantic information that allows the filtering of protocols during the search operation. Based on the database, we developed a web application called the clinical trial protocol database system (CLIPS; available at https://corus.kaist.edu/clips). This system enables an interactive search by utilizing protocol elements. To enable an interactive search for combinations of protocol elements, CLIPS provides optional next element selection according to the previous element in the form of a connected tree. The validation results show that our method achieves better performance than that of existing databases in predicting phenotypic features.
PMID: 32946464 [PubMed - as supplied by publisher]
A systematic literature review of automatic Alzheimer's disease detection from speech and language.
A systematic literature review of automatic Alzheimer's disease detection from speech and language.
J Am Med Inform Assoc. 2020 Sep 14;:
Authors: Petti U, Baker S, Korhonen A
Abstract
OBJECTIVE: In recent years numerous studies have achieved promising results in Alzheimer's Disease (AD) detection using automatic language processing. We systematically review these articles to understand the effectiveness of this approach, identify any issues and report the main findings that can guide further research.
MATERIALS AND METHODS: We searched PubMed, Ovid, and Web of Science for articles published in English between 2013 and 2019. We performed a systematic literature review to answer 5 key questions: (1) What were the characteristics of participant groups? (2) What language data were collected? (3) What features of speech and language were the most informative? (4) What methods were used to classify between groups? (5) What classification performance was achieved?
RESULTS AND DISCUSSION: We identified 33 eligible studies and 5 main findings: participants' demographic variables (especially age ) were often unbalanced between AD and control group; spontaneous speech data were collected most often; informative language features were related to word retrieval and semantic, syntactic, and acoustic impairment; neural nets, support vector machines, and decision trees performed well in AD detection, and support vector machines and decision trees performed well in decline detection; and average classification accuracy was 89% in AD and 82% in mild cognitive impairment detection versus healthy control groups.
CONCLUSION: The systematic literature review supported the argument that language and speech could successfully be used to detect dementia automatically. Future studies should aim for larger and more balanced datasets, combine data collection methods and the type of information analyzed, focus on the early stages of the disease, and report performance using standardized metrics.
PMID: 32929494 [PubMed - as supplied by publisher]
ANDDigest: a new web-based module of ANDSystem for the search of knowledge in the scientific literature.
ANDDigest: a new web-based module of ANDSystem for the search of knowledge in the scientific literature.
BMC Bioinformatics. 2020 Sep 14;21(Suppl 11):228
Authors: Ivanisenko TV, Saik OV, Demenkov PS, Ivanisenko NV, Savostianov AN, Ivanisenko VA
Abstract
BACKGROUND: The rapid growth of scientific literature has rendered the task of finding relevant information one of the critical problems in almost any research. Search engines, like Google Scholar, Web of Knowledge, PubMed, Scopus, and others, are highly effective in document search; however, they do not allow knowledge extraction. In contrast to the search engines, text-mining systems provide extraction of knowledge with representations in the form of semantic networks. Of particular interest are tools performing a full cycle of knowledge management and engineering, including automated retrieval, integration, and representation of knowledge in the form of semantic networks, their visualization, and analysis. STRING, Pathway Studio, MetaCore, and others are well-known examples of such products. Previously, we developed the Associative Network Discovery System (ANDSystem), which also implements such a cycle. However, the drawback of these systems is dependence on the employed ontologies describing the subject area, which limits their functionality in searching information based on user-specified queries.
RESULTS: The ANDDigest system is a new web-based module of the ANDSystem tool, permitting searching within PubMed by using dictionaries from the ANDSystem tool and sets of user-defined keywords. ANDDigest allows performing the search based on complex queries simultaneously, taking into account many types of objects from the ANDSystem's ontology. The system has a user-friendly interface, providing sorting, visualization, and filtering of the found information, including mapping of mentioned objects in text, linking to external databases, sorting of data by publication date, citations number, journal H-indices, etc. The system provides data on trends for identified entities based on dynamics of interest according to the frequency of their mentions in PubMed by years.
CONCLUSIONS: The main feature of ANDDigest is its functionality, serving as a specialized search for information about multiple associative relationships of objects from the ANDSystem's ontology vocabularies, taking into account user-specified keywords. The tool can be applied to the interpretation of experimental genetics data, the search for associations between molecular genetics objects, and the preparation of scientific and analytical reviews. It is presently available at https://anddigest.sysbio.ru/ .
PMID: 32921303 [PubMed - in process]
End-to-end semantic segmentation of personalized deep brain structures for non-invasive brain stimulation.
End-to-end semantic segmentation of personalized deep brain structures for non-invasive brain stimulation.
Neural Netw. 2020 May;125:233-244
Authors: Rashed EA, Gomez-Tames J, Hirata A
Abstract
Electro-stimulation or modulation of deep brain regions is commonly used in clinical procedures for the treatment of several nervous system disorders. In particular, transcranial direct current stimulation (tDCS) is widely used as an affordable clinical application that is applied through electrodes attached to the scalp. However, it is difficult to determine the amount and distribution of the electric field (EF) in the different brain regions due to anatomical complexity and high inter-subject variability. Personalized tDCS is an emerging clinical procedure that is used to tolerate electrode montage for accurate targeting. This procedure is guided by computational head models generated from anatomical images such as MRI. Distribution of the EF in segmented head models can be calculated through simulation studies. Therefore, fast, accurate, and feasible segmentation of different brain structures would lead to a better adjustment for customized tDCS studies. In this study, a single-encoder multi-decoders convolutional neural network is proposed for deep brain segmentation. The proposed architecture is trained to segment seven deep brain structures using T1-weighted MRI. Network generated models are compared with a reference model constructed using a semi-automatic method, and it presents a high matching especially in Thalamus (Dice Coefficient (DC) = 94.70%), Caudate (DC = 91.98%) and Putamen (DC = 90.31%) structures. Electric field distribution during tDCS in generated and reference models matched well each other, suggesting its potential usefulness in clinical practice.
PMID: 32151914 [PubMed - indexed for MEDLINE]
A decision support system on the obesity management and consultation during childhood and adolescence using ontology and semantic rules.
A decision support system on the obesity management and consultation during childhood and adolescence using ontology and semantic rules.
J Biomed Inform. 2020 Sep 07;:103554
Authors: Taçyıldız Ö, Çelik Ertuğrul D
Abstract
BACKGROUND: Obesity is defined as abnormal or excessive fat accumulation that presents a risk to health according to the World Health Organization (WHO). Pediatric or childhood obesity is the most prevalent nutritional disorder among children and adolescents worldwide. In pediatric or childhood obesity, constant monitoring of the pediatric patients by health experts is required to provide efficient obesity management and treatment. Therefore, the patients are examined on a regular basis, the measurements are compared against predefined percentile values and the development of the pediatric patient is examined.
RESULTS: This study discusses the design, implementation, and potential use of an ontology-based obesity management and consultation system which is a decision support system for health experts during treatments of the children and adolescent patients. The system does not only share instant gathered medical data to health experts but also examines the data as a smart medical assistant. The system includes an ontology-based inference engine module, which is a decision support module, and used to infer certain personalized suggestions for patients. Suggestions in four categories emerged as a result: (1) Development Feedback Suggestions, (2) Calorie Intake Suggestions and Physical Activities, (3) Mom Suggestions, and (4) Obesity Treatment Stage Suggestions. The methodologies applied and main technical contributions are discussed in three aspects: (1) Obesity Tracking Ontology, (2) Semantic Web Rule Knowledge base, and (3) Inference Engine Module. In this study, unlike other similar studies, ontology and rule based smart medical assistant which have different functionalities from adults' obesity management is considered especially for obesity management of children and adolescents. The system also includes intensive pediatric health care expert involvement. Eighty case studies from real anonymous pediatric patients are analyzed and discussed in this experimental study.
CONCLUSIONS: The results retrieved from 80 case studies are promising in demonstrating the applicability, effectiveness and efficiency of the proposed approach. The inference engine module of the proposed system can be integrated semantically into intelligent and distributed decision support systems, and the system ontology can be used as a knowledge base in similar systems.
PMID: 32911081 [PubMed - as supplied by publisher]
SCALEUS-FD: A FAIR Data Tool for Biomedical Applications.
SCALEUS-FD: A FAIR Data Tool for Biomedical Applications.
Biomed Res Int. 2020;2020:3041498
Authors: Pereira A, Lopes RP, Oliveira JL
Abstract
The Semantic Web and Linked Data concepts and technologies have empowered the scientific community with solutions to take full advantage of the increasingly available distributed and heterogeneous data in distinct silos. Additionally, FAIR Data principles established guidelines for data to be Findable, Accessible, Interoperable, and Reusable, and they are gaining traction in data stewardship. However, to explore their full potential, we must be able to transform legacy solutions smoothly into the FAIR Data ecosystem. In this paper, we introduce SCALEUS-FD, a FAIR Data extension of a legacy semantic web tool successfully used for data integration and semantic annotation and enrichment. The core functionalities of the solution follow the Semantic Web and Linked Data principles, offering a FAIR REST API for machine-to-machine operations. We applied a set of metrics to evaluate its "FAIRness" and created an application scenario in the rare diseases domain.
PMID: 32908882 [PubMed - in process]