Semantic Web

Towards a reproducible interactome: semantic-based detection of redundancies to unify protein-protein interaction databases

Tue, 2022-01-11 06:00

Bioinformatics. 2022 Jan 7:btac013. doi: 10.1093/bioinformatics/btac013. Online ahead of print.

ABSTRACT

MOTIVATION: Information on protein-protein interactions is collected in numerous primary databases with their own curation process. Several meta-databases aggregate primary databases to provide more exhaustive datasets. In addition to exhaustivity, aggregation contributes to reliability by providing an overview of the various studies and detection methods supporting an interaction. However, interactions listed in different primary databases are partly redundant because some publications reporting protein-protein interactions have been curated by multiple primary databases. Mere aggregation can thus introduce a bias if these redundancies are not identified and eliminated. To overcome this bias, meta-databases rely on the Molecular Interaction ontology that describes interaction detection methods, but they do not fully take advantage of the ontology's rich semantics, which leads to systematically overestimating interaction reproducibility.

RESULTS: We propose a precise definition of explicit and implicit redundancy, and show that both can be easily detected using Semantic Web technologies. We apply this process to a dataset from the APID meta-database and show that while explicit redundancies were detected by the APID aggregation process, about 15% of APID entries are implicitly redundant and should not be taken into account when presenting confidence-related metrics. More than 90% of implicit redundancies result from the aggregation of distinct primary databases, while the remaining occurs between entries of a single database. Finally, we build a" reproducible interactome" with interactions that have been reproduced by multiple methods or publications. The size of the reproducible interactome is drastically impacted by removing redundancies for both yeast (-59%) and human (-56%), and we show that this is largely due to implicit redundancies.

AVAILABILITY: Software, data and results are available at https://gitlab.com/nnet56/reproducible-interactome, https://reproducible-interactome.genouest.org/, Zenodo (doi : 10.5281/zenodo.5595037) and NDEx (doi : 10.18119/N94302, doi : 10.18119/N97S4D.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

PMID:35015827 | DOI:10.1093/bioinformatics/btac013

Categories: Literature Watch

Dealing with the Ambiguity of Glycan Substructure Search

Tue, 2022-01-11 06:00

Molecules. 2021 Dec 23;27(1):65. doi: 10.3390/molecules27010065.

ABSTRACT

The level of ambiguity in describing glycan structure has significantly increased with the upsurge of large-scale glycomics and glycoproteomics experiments. Consequently, an ontology-based model appears as an appropriate solution for navigating these data. However, navigation is not sufficient and the model should also enable advanced search and comparison. A new ontology with a tree logical structure is introduced to represent glycan structures irrespective of the precision of molecular details. The model heavily relies on the GlycoCT encoding of glycan structures. Its implementation in the GlySTreeM knowledge base was validated with GlyConnect data and benchmarked with the Glycowork library. GlySTreeM is shown to be fast, consistent, reliable and more flexible than existing solutions for matching parts of or whole glycan structures. The model is also well suited for painless future expansion.

PMID:35011294 | DOI:10.3390/molecules27010065

Categories: Literature Watch

Algebra dissociates from arithmetic in the brain semantic network

Sat, 2022-01-08 06:00

Behav Brain Funct. 2022 Jan 7;18(1):1. doi: 10.1186/s12993-022-00186-4.

ABSTRACT

BACKGROUND: Mathematical expressions mainly include arithmetic (such as 8 - (1 + 3)) and algebra (such as a - (b + c)). Previous studies have shown that both algebraic processing and arithmetic involved the bilateral parietal brain regions. Although previous studies have revealed that algebra was dissociated from arithmetic, the neural bases of the dissociation between algebraic processing and arithmetic is still unclear. The present study uses functional magnetic resonance imaging (fMRI) to identify the specific brain networks for algebraic and arithmetic processing.

METHODS: Using fMRI, this study scanned 30 undergraduates and directly compared the brain activation during algebra and arithmetic. Brain activations, single-trial (item-wise) interindividual correlation and mean-trial interindividual correlation related to algebra processing were compared with those related to arithmetic. The functional connectivity was analyzed by a seed-based region of interest (ROI)-to-ROI analysis.

RESULTS: Brain activation analyses showed that algebra elicited greater activation in the angular gyrus and arithmetic elicited greater activation in the bilateral supplementary motor area, left insula, and left inferior parietal lobule. Interindividual single-trial brain-behavior correlation revealed significant brain-behavior correlations in the semantic network, including the middle temporal gyri, inferior frontal gyri, dorsomedial prefrontal cortices, and left angular gyrus, for algebra. For arithmetic, the significant brain-behavior correlations were located in the phonological network, including the precentral gyrus and supplementary motor area, and in the visuospatial network, including the bilateral superior parietal lobules. For algebra, significant positive functional connectivity was observed between the visuospatial network and semantic network, whereas for arithmetic, significant positive functional connectivity was observed only between the visuospatial network and phonological network.

CONCLUSION: These findings suggest that algebra relies on the semantic network and conversely, arithmetic relies on the phonological and visuospatial networks.

PMID:34996499 | PMC:PMC8740448 | DOI:10.1186/s12993-022-00186-4

Categories: Literature Watch

Enabling Research and Clinical Use of Patient-Generated Health Data (the mindLAMP Platform): Digital Phenotyping Study

Fri, 2022-01-07 06:00

JMIR Mhealth Uhealth. 2022 Jan 7;10(1):e30557. doi: 10.2196/30557.

ABSTRACT

BACKGROUND: There is a growing need for the integration of patient-generated health data (PGHD) into research and clinical care to enable personalized, preventive, and interactive care, but technical and organizational challenges, such as the lack of standards and easy-to-use tools, preclude the effective use of PGHD generated from consumer devices, such as smartphones and wearables.

OBJECTIVE: This study outlines how we used mobile apps and semantic web standards such as HTTP 2.0, Representational State Transfer, JSON (JavaScript Object Notation), JSON Schema, Transport Layer Security (version 1.3), Advanced Encryption Standard-256, OpenAPI, HTML5, and Vega, in conjunction with patient and provider feedback to completely update a previous version of mindLAMP.

METHODS: The Learn, Assess, Manage, and Prevent (LAMP) platform addresses the abovementioned challenges in enhancing clinical insight by supporting research, data analysis, and implementation efforts around PGHD as an open-source solution with freely accessible and shared code.

RESULTS: With a simplified programming interface and novel data representation that captures additional metadata, the LAMP platform enables interoperability with existing Fast Healthcare Interoperability Resources-based health care systems as well as consumer wearables and services such as Apple HealthKit and Google Fit. The companion Cortex data analysis and machine learning toolkit offer robust support for artificial intelligence, behavioral feature extraction, interactive visualizations, and high-performance data processing through parallelization and vectorization techniques.

CONCLUSIONS: The LAMP platform incorporates feedback from patients and clinicians alongside a standards-based approach to address these needs and functions across a wide range of use cases through its customizable and flexible components. These range from simple survey-based research to international consortiums capturing multimodal data to simple delivery of mindfulness exercises through personalized, just-in-time adaptive interventions.

PMID:34994710 | DOI:10.2196/30557

Categories: Literature Watch

End-to-End provenance representation for the understandability and reproducibility of scientific experiments using a semantic approach

Fri, 2022-01-07 06:00

J Biomed Semantics. 2022 Jan 6;13(1):1. doi: 10.1186/s13326-021-00253-1.

ABSTRACT

BACKGROUND: The advancement of science and technologies play an immense role in the way scientific experiments are being conducted. Understanding how experiments are performed and how results are derived has become significantly more complex with the recent explosive growth of heterogeneous research data and methods. Therefore, it is important that the provenance of results is tracked, described, and managed throughout the research lifecycle starting from the beginning of an experiment to its end to ensure reproducibility of results described in publications. However, there is a lack of interoperable representation of end-to-end provenance of scientific experiments that interlinks data, processing steps, and results from an experiment's computational and non-computational processes.

RESULTS: We present the "REPRODUCE-ME" data model and ontology to describe the end-to-end provenance of scientific experiments by extending existing standards in the semantic web. The ontology brings together different aspects of the provenance of scientific studies by interlinking non-computational data and steps with computational data and steps to achieve understandability and reproducibility. We explain the important classes and properties of the ontology and how they are mapped to existing ontologies like PROV-O and P-Plan. The ontology is evaluated by answering competency questions over the knowledge base of scientific experiments consisting of computational and non-computational data and steps.

CONCLUSION: We have designed and developed an interoperable way to represent the complete path of a scientific experiment consisting of computational and non-computational steps. We have applied and evaluated our approach to a set of scientific experiments in different subject domains like computational science, biological imaging, and microscopy.

PMID:34991705 | DOI:10.1186/s13326-021-00253-1

Categories: Literature Watch

Patients need emotional support: Managing physician disclosure information to attract more patients

Thu, 2021-12-30 06:00

Int J Med Inform. 2021 Dec 25;158:104674. doi: 10.1016/j.ijmedinf.2021.104674. Online ahead of print.

ABSTRACT

BACKGROUND: Information asymmetry causes barriers for the patient's decision-making in the online health community. Patients can rely on the physician's self-disclosed information to alleviate it. However, the impact of physician's self-disclosed information on the patient's decision has rarely been discussed.

OBJECTIVES: To investigate the impact of the physician's self-disclosed information on the patient's decision in the online health community and to examine the moderating effect of the physician's online reputation.

METHODS: Drawing on the limited-capacity model of attention, we develop a theoretical model to estimate the impact of physician's self-disclosure information on patient's decision and the contingent roles of physician's online reputation in online healthcare community by econometric methods. We designed a web crawler based on R language program to collect more than 20,000 physicians' data from their homepage in Haodf-a leading online healthcare community platform in China. The attributes of the physician's information disclosure are measured by the following variables: emotion orientation, the quantity of information and the semantic topics diversity.

RESULTS: The empirical analysis derives the following findings: (1) The emotion orientation in physician's self-disclosure information is positively associated with patient's decision; (2) Both excessive quantity of information and semantic topics diversity can raise barriers for patient's decision; (3) When the level of physician's online reputation is high, the negative effect of the quantity of information and semantic topics diversity are all strengthened while the positive effect of the emotion orientation is not strengthened.

CONCLUSIONS: This study has a profound importance for a deep understanding of the impact of physician's self-disclosure information and contributes to the literature on information disclosure, the limited capacity model of attention, patient's decision. Also, this study provides implications for practice.

PMID:34968960 | DOI:10.1016/j.ijmedinf.2021.104674

Categories: Literature Watch

PADI-web 3.0: A new framework for extracting and disseminating fine-grained information from the news for animal disease surveillance

Fri, 2021-12-24 06:00

One Health. 2021 Dec 3;13:100357. doi: 10.1016/j.onehlt.2021.100357. eCollection 2021 Dec.

ABSTRACT

PADI-web (Platform for Automated extraction of animal Disease Information from the web) is a biosurveillance system dedicated to monitoring online news sources for the detection of emerging animal infectious diseases. PADI-web has collected more than 380,000 news articles since 2016. Compared to other existing biosurveillance tools, PADI-web focuses specifically on animal health and has a fully automated pipeline based on machine-learning methods. This paper presents the new functionalities of PADI-web based on the integration of: (i) a new fine-grained classification system, (ii) automatic methods to extract terms and named entities with text-mining approaches, (iii) semantic resources for indexing keywords and (iv) a notification system for end-users. Compared to other biosurveillance tools, PADI-web, which is integrated in the French Platform for Animal Health Surveillance (ESA Platform), offers strong coverage of the animal sector, a multilingual approach, an automated information extraction module and a notification tool configurable according to end-user needs.

PMID:34950760 | PMC:PMC8671119 | DOI:10.1016/j.onehlt.2021.100357

Categories: Literature Watch

Ontology-Enabled Emotional Sentiment Analysis on COVID-19 Pandemic-Related Twitter Streams

Thu, 2021-12-23 06:00

Front Public Health. 2021 Dec 6;9:798905. doi: 10.3389/fpubh.2021.798905. eCollection 2021.

ABSTRACT

The exponential growth of social media users has changed the dynamics of retrieving the potential information from user-generated content and transformed the paradigm of information-retrieval mechanism with the novel developments on the concept of "web of data". In this regard, our proposed Ontology-Based Sentiment Analysis provides two novel approaches: First, the emotion extraction on tweets related to COVID-19 is carried out by a well-formed taxonomy that comprises possible emotional concepts with fine-grained properties and polarized values. Second, the potential entities present in the tweet can be analyzed for semantic associativity. The extraction of emotions can be performed in two cases: (i) words directly associated with the emotional concepts present in the taxonomy and (ii) words indirectly present in the emotional concepts. Though the latter case is very challenging in processing the tweets to find the hidden patterns and extract the meaningful facts associated with it, our proposed work is able to extract and detect almost 81% of true positives and considerably able to detect the false negatives. Finally, the proposed approach's superior performance is witnessed from its comparison with other peer-level approaches.

PMID:34938715 | PMC:PMC8685242 | DOI:10.3389/fpubh.2021.798905

Categories: Literature Watch

Development of RIKEN Plant Metabolonome MetaDatabase

Fri, 2021-12-17 06:00

Plant Cell Physiol. 2021 Dec 17:pcab173. doi: 10.1093/pcp/pcab173. Online ahead of print.

ABSTRACT

The advancement of metabolomics in terms of techniques for measuring small molecules has enabled the rapid detection and quantification of numerous cellular metabolites. Metabolomic data provide new opportunities to gain a deeper understanding of plant metabolism that can improve the health of both plants and humans that consume them. Although major public repositories for general metabolomic data have been established, the community still has shortcomings related to data sharing, especially in terms of data reanalysis, reusability, and reproducibility. To address these issues, we developed the RIKEN Plant Metabolome MetaDatabase (RIKEN PMM, http://metabobank.riken.jp/pmm/db/plantMetabolomics), which stores mass spectrometry-based (e.g. GC-MS-based) metabolite profiling data of plants together with their detailed, structured experimental metadata, including sampling and experimental procedures. Our metadata are described as Linked Open Data (LOD) based on the Resource Description Framework (RDF) using standardized and controlled vocabularies, such as the Metabolomics Standards Initiative Ontology (MSIO), which are to be integrated with various life and biomedical science data using the World Wide Web. RIKEN PMM implements intuitive and interactive operations for plant metabolome data, including raw data (netCDF format), mass spectra (NIST MSP format), and metabolite annotations. The feature is suitable not only for biologists who are interested in metabolomic phenotypes, but also for researchers who would like to investigate life science in general through plant metabolomic approaches.

PMID:34918130 | DOI:10.1093/pcp/pcab173

Categories: Literature Watch

A rule-based decision support system for aiding iron deficiency management

Wed, 2021-12-15 06:00

Health Informatics J. 2021 Oct-Dec;27(4):14604582211066054. doi: 10.1177/14604582211066054.

ABSTRACT

Iron is a vital mineral for the proper function of hemoglobin which is also a protein needed to transport oxygen in the blood. The lack of iron in human blood causes a range of serious health problems including "anemia." In this article, the COntAneRS (Clinical ONTology-based Iron Deficiency-ANEmia- Recommendation System) is proposed as a clinical decision support system to diagnose iron deficiency and manage its treatment. The applied methodologies and main technical contributions of this study are discussed in four aspects: (1) Iron Deficiency Domain Ontology (IDDOnt), (2) Semantic Web Rule Knowledgebase (SWRL), (3) Inference Engine, and (4) Physician Portal of the system. Experimental studies of the proposed system have been applied on a population of 200 people, consisting of real anemia patients and healthy individuals. First, a decision tree classifier is used to diagnose iron deficiency condition based on the patients' demographic information and certain medical data, as well as recently measured hemoglobin CBC levels of the patients. To check the effectiveness of the system, the data of 50 anonymous patients randomly selected from 200 patients are entered manually in the IDDOnt and the system is then verified according to the inferencing results. After inferencing step, the recommendations related to appropriate iron drugs, daily consumption dose, drug consumption periods, and additional medical suggestions about drug interactions are provided by the system to the responsible physician through system ontology, SWRL rules, and web services. As a result of experimental studies, our system has provided very good accuracy (99.5%) and robust results in producing patient-suitable suggestions. In addition, the applicability of the system on the cases is discussed as case studies in this paper. The results reported from the applied case studies are promising in demonstrating the applicability, effectiveness, and efficiency of the proposed approach.

PMID:34910611 | DOI:10.1177/14604582211066054

Categories: Literature Watch

Identifying opportunities for timely diagnosis of bladder and renal cancer via abnormal blood tests: a longitudinal linked data study

Tue, 2021-12-14 06:00

Br J Gen Pract. 2021 Dec 31;72(714):e19-e25. doi: 10.3399/BJGP.2021.0282. Print 2022 Jan.

ABSTRACT

BACKGROUND: Understanding pre-diagnostic test use could reveal diagnostic windows where more timely evaluation for cancer may be indicated.

AIM: To examine pre-diagnostic patterns of results of abnormal blood tests in patients with bladder and renal cancer.

DESIGN AND SETTING: A retrospective cohort study using primary care and cancer registry data on patients with bladder and renal cancer who were diagnosed between April 2012 and December 2015 in England.

METHOD: The rates of patients with a first abnormal result in the year before cancer diagnosis, for 'generic' (full blood count components, inflammatory markers, and calcium) and 'organ-specific' blood tests (creatinine and liver function test components) that may lead to subsequent detection of incidental cancers, were examined. Poisson regression was used to detect the month during which the cohort's rate of each abnormal test started to increase from baseline. The proportion of patients with a test found in the first half of the diagnostic window was examined, as these 'early' tests might represent opportunities where further evaluation could be initiated.

RESULTS: Data from 4533 patients with bladder and renal cancer were analysed. The monthly rate of patients with a first abnormal test increased towards the time of cancer diagnosis. Abnormalities of both generic (for example, high inflammatory markers) and organ-specific tests (for example, high creatinine) started to increase from 6-8 months pre-diagnosis, with 25%-40% of these patients having an abnormal test in the 'early half' of the diagnostic window.

CONCLUSION: Population-level signals of bladder and renal cancer can be observed in abnormalities in commonly performed primary care blood tests up to 8 months before diagnosis, indicating the potential for earlier diagnosis in some patients.

PMID:34903517 | PMC:PMC8714503 | DOI:10.3399/BJGP.2021.0282

Categories: Literature Watch

A botanical demonstration of the potential of linking data using unique identifiers for people

Tue, 2021-12-14 06:00

PLoS One. 2021 Dec 14;16(12):e0261130. doi: 10.1371/journal.pone.0261130. eCollection 2021.

ABSTRACT

Natural history collection data available digitally on the web have so far only made limited use of the potential of semantic links among themselves and with cross-disciplinary resources. In a pilot study, botanical collections of the Consortium of European Taxonomic Facilities (CETAF) have therefore begun to semantically annotate their collection data, starting with data on people, and to link them via a central index system. As a result, it is now possible to query data on collectors across different collections and automatically link them to a variety of external resources. The system is being continuously developed and is already in production use in an international collection portal.

PMID:34905557 | DOI:10.1371/journal.pone.0261130

Categories: Literature Watch

Semantic Network Analysis Using Construction Accident Cases to Understand Workers' Unsafe Acts

Fri, 2021-12-10 06:00

Int J Environ Res Public Health. 2021 Dec 1;18(23):12660. doi: 10.3390/ijerph182312660.

ABSTRACT

Unsafe acts by workers are a direct cause of accidents in the labor-intensive construction industry. Previous studies have reviewed past accidents and analyzed their causes to understand the nature of the human error involved. However, these studies focused their investigations on only a small number of construction accidents, even though a large number of them have been collected from various countries. Consequently, this study developed a semantic network analysis (SNA) model that uses approximately 60,000 construction accident cases to understand the nature of the human error that affects safety in the construction industry. A modified human factor analysis and classification system (HFACS) framework was used to classify major human error factors-that is, the causes of the accidents in each of the accident summaries in the accident case data-and an SNA analysis was conducted on all of the classified data to analyze correlations between the major factors that lead to unsafe acts. The results show that an overwhelming number of accidents occurred due to unintended acts such as perceptual errors (PERs) and skill-based errors (SBEs). Moreover, this study visualized the relationships between factors that affected unsafe acts based on actual construction accident case data, allowing for an intuitive understanding of the major keywords for each of the factors that lead to accidents.

PMID:34886388 | PMC:PMC8656935 | DOI:10.3390/ijerph182312660

Categories: Literature Watch

LIO-CSI: LiDAR inertial odometry with loop closure combined with semantic information

Wed, 2021-12-08 06:00

PLoS One. 2021 Dec 8;16(12):e0261053. doi: 10.1371/journal.pone.0261053. eCollection 2021.

ABSTRACT

Accurate and reliable state estimation and mapping are the foundation of most autonomous driving systems. In recent years, researchers have focused on pose estimation through geometric feature matching. However, most of the works in the literature assume a static scenario. Moreover, a registration based on a geometric feature is vulnerable to the interference of a dynamic object, resulting in a decline of accuracy. With the development of a deep semantic segmentation network, we can conveniently obtain the semantic information from the point cloud in addition to geometric information. Semantic features can be used as an accessory to geometric features that can improve the performance of odometry and loop closure detection. In a more realistic environment, semantic information can filter out dynamic objects in the data, such as pedestrians and vehicles, which lead to information redundancy in generated map and map-based localization failure. In this paper, we propose a method called LiDAR inertial odometry (LIO) with loop closure combined with semantic information (LIO-CSI), which integrates semantic information to facilitate the front-end process as well as loop closure detection. First, we made a local optimization on the semantic labels provided by the Sparse Point-Voxel Neural Architecture Search (SPVNAS) network. The optimized semantic information is combined into the front-end process of tightly-coupled light detection and ranging (LiDAR) inertial odometry via smoothing and mapping (LIO-SAM), which allows us to filter dynamic objects and improve the accuracy of the point cloud registration. Then, we proposed a semantic assisted scan-context method to improve the accuracy and robustness of loop closure detection. The experiments were conducted on an extensively used dataset KITTI and a self-collected dataset on the Jilin University (JLU) campus. The experimental results demonstrate that our method is better than the purely geometric method, especially in dynamic scenarios, and it has a good generalization ability.

PMID:34879118 | PMC:PMC8654169 | DOI:10.1371/journal.pone.0261053

Categories: Literature Watch

Changes of the Public Attitudes of China to Domestic COVID-19 Vaccination After the Vaccines Were Approved: A Semantic Network and Sentiment Analysis Based on Sina Weibo Texts

Fri, 2021-12-03 06:00

Front Public Health. 2021 Nov 11;9:723015. doi: 10.3389/fpubh.2021.723015. eCollection 2021.

ABSTRACT

Introduction: On December 31, 2020, the Chinese government announced that the domestic coronavirus disease-2019 (COVID-19) vaccines have obtained approval for conditional marketing and are free for vaccination. This release drove the attention of the public and intense debates on social media, which reflected public attitudes to the domestic vaccine. This study examines whether the public concerns and public attitudes to domestic COVID-19 vaccines changed after the official announcement. Methods: This article used big data analytics in the research, including semantic network and sentiment analysis. The purpose of the semantic network is to obtain the public concerns about domestic vaccines. Sentiment analysis reflects the sentiments of the public to the domestic vaccines and the emotional changes by comparing the specific sentiments shown on the posts before and after the official announcement. Results: There exists a correlation between the public concerns about domestic vaccines before and after the official announcement. According to the semantic network analysis, the public concerns about vaccines have changed after the official announcement. The public focused on the performance issues of the vaccines before the official approval, but they cared more about the practical issues of vaccination after that. The sentiment analysis showed that both positive and negative emotions increased among the public after the official announcement. "Good" was the most increased positive emotion and indicated great public appreciation for the production capacity and free vaccination. "Fear" was the significantly increased negative emotion and reflected the public concern about the safety of the vaccines. Conclusion: The official announcement of the approval for marketing improved the Chinese public acceptance of the domestic COVID-19 vaccines. In addition, safety and effectiveness are vital factors influencing public vaccine hesitancy.

PMID:34858918 | PMC:PMC8632040 | DOI:10.3389/fpubh.2021.723015

Categories: Literature Watch

Investigating the breast cancer screening-treatment-mortality pathway of women diagnosed with invasive breast cancer: Results from linked health data

Wed, 2021-12-01 06:00

Eur J Cancer Care (Engl). 2022 Jan;31(1):e13539. doi: 10.1111/ecc.13539. Epub 2021 Nov 30.

ABSTRACT

OBJECTIVE: To examine the screening-treatment-mortality pathway among women with invasive breast cancer in 2006-2014 using linked data.

METHODS: BreastScreen histories of South Australian women diagnosed with breast cancer (n = 8453) were investigated. Treatments recorded within 12 months from diagnosis were obtained from linked registry and administrative data. Associations of screening history with treatment were investigated using logistic regression and with cancer mortality outcomes using competing risk analyses, adjusting for socio-demographic, cancer and comorbidity characteristics.

RESULTS AND CONCLUSION: For screening ages of 50-69 years, 70% had participated in BreastScreen SA ≤ 5 years and 53% ≤ 2 years of diagnosis. Five-year disease-specific survival post-diagnosis was 90%. Compared with those not screened ≤5 years, women screened ≤2 years had higher odds, adjusted for socio-demographic, cancer and comorbidity characteristics, and diagnostic period, of breast-conserving surgery (aOR 2.5, 95% CI 1.9-3.2) and radiotherapy (aOR 1.2, 95% CI 1.1-1.3). These women had a lower unadjusted risk of post-diagnostic cancer mortality (SHR 0.33, 95% CI 0.27-0.41), partly mediated by stage (aSHR 0.65, 95% CI 0.51-0.81), and less breast surgery (aSHR 0.78, 95% CI 0.62-0.99). Screening ≤2 years and conserving surgery appeared to have a greater than additive association with lower post-diagnostic mortality (interaction term SHR 0.42, 95% CI 0.23-0.78). The screening-treatment-mortality pathway was investigated using linked data.

PMID:34850484 | DOI:10.1111/ecc.13539

Categories: Literature Watch

Fully automatic image colorization based on semantic segmentation technology

Tue, 2021-11-30 06:00

PLoS One. 2021 Nov 30;16(11):e0259953. doi: 10.1371/journal.pone.0259953. eCollection 2021.

ABSTRACT

Aiming at these problems of image colorization algorithms based on deep learning, such as color bleeding and insufficient color, this paper converts the study of image colorization to the optimization of image semantic segmentation, and proposes a fully automatic image colorization model based on semantic segmentation technology. Firstly, we use the encoder as the local feature extraction network and use VGG-16 as the global feature extraction network. These two parts do not interfere with each other, but they share the low-level feature. Then, the first fusion module is constructed to merge local features and global features, and the fusion results are input into semantic segmentation network and color prediction network respectively. Finally, the color prediction network obtains the semantic segmentation information of the image through the second fusion module, and predicts the chrominance of the image based on it. Through several sets of experiments, it is proved that the performance of our model becomes stronger and stronger under the nourishment of the data. Even in some complex scenes, our model can predict reasonable colors and color correctly, and the output effect is very real and natural.

PMID:34847177 | PMC:PMC8631650 | DOI:10.1371/journal.pone.0259953

Categories: Literature Watch

The Collaborative Metadata Repository (CoMetaR) Web App: Quantitative and Qualitative Usability Evaluation

Tue, 2021-11-30 06:00

JMIR Med Inform. 2021 Nov 29;9(11):e30308. doi: 10.2196/30308.

ABSTRACT

BACKGROUND: In the field of medicine and medical informatics, the importance of comprehensive metadata has long been recognized, and the composition of metadata has become its own field of profession and research. To ensure sustainable and meaningful metadata are maintained, standards and guidelines such as the FAIR (Findability, Accessibility, Interoperability, Reusability) principles have been published. The compilation and maintenance of metadata is performed by field experts supported by metadata management apps. The usability of these apps, for example, in terms of ease of use, efficiency, and error tolerance, crucially determines their benefit to those interested in the data.

OBJECTIVE: This study aims to provide a metadata management app with high usability that assists scientists in compiling and using rich metadata. We aim to evaluate our recently developed interactive web app for our collaborative metadata repository (CoMetaR). This study reflects how real users perceive the app by assessing usability scores and explicit usability issues.

METHODS: We evaluated the CoMetaR web app by measuring the usability of 3 modules: core module, provenance module, and data integration module. We defined 10 tasks in which users must acquire information specific to their user role. The participants were asked to complete the tasks in a live web meeting. We used the System Usability Scale questionnaire to measure the usability of the app. For qualitative analysis, we applied a modified think aloud method with the following thematic analysis and categorization into the ISO 9241-110 usability categories.

RESULTS: A total of 12 individuals participated in the study. We found that over 97% (85/88) of all the tasks were completed successfully. We measured usability scores of 81, 81, and 72 for the 3 evaluated modules. The qualitative analysis resulted in 24 issues with the app.

CONCLUSIONS: A usability score of 81 implies very good usability for the 2 modules, whereas a usability score of 72 still indicates acceptable usability for the third module. We identified 24 issues that serve as starting points for further development. Our method proved to be effective and efficient in terms of effort and outcome. It can be adapted to evaluate apps within the medical informatics field and potentially beyond.

PMID:34847059 | DOI:10.2196/30308

Categories: Literature Watch

Orchestrating Heterogeneous Devices and AI Services as Virtual Sensors for Secure Cloud-Based IoT Applications

Sat, 2021-11-27 06:00

Sensors (Basel). 2021 Nov 12;21(22):7509. doi: 10.3390/s21227509.

ABSTRACT

The concept of the cloud-to-thing continuum addresses advancements made possible by the widespread adoption of cloud, edge, and IoT resources. It opens the possibility of combining classical symbolic AI with advanced machine learning approaches in a meaningful way. In this paper, we present a thing registry and an agent-based orchestration framework, which we combine to support semantic orchestration of IoT use cases across several federated cloud environments. We use the concept of virtual sensors based on machine learning (ML) services as abstraction, mediating between the instance level and the semantic level. We present examples of virtual sensors based on ML models for activity recognition and describe an approach to remedy the problem of missing or scarce training data. We illustrate the approach with a use case from an assisted living scenario.

PMID:34833585 | DOI:10.3390/s21227509

Categories: Literature Watch

FAIR data representation in times of eScience: a comparison of instance-based and class-based semantic representations of empirical data using phenotype descriptions as example

Fri, 2021-11-26 06:00

J Biomed Semantics. 2021 Nov 25;12(1):20. doi: 10.1186/s13326-021-00254-0.

ABSTRACT

BACKGROUND: The size, velocity, and heterogeneity of Big Data outclasses conventional data management tools and requires data and metadata to be fully machine-actionable (i.e., eScience-compliant) and thus findable, accessible, interoperable, and reusable (FAIR). This can be achieved by using ontologies and through representing them as semantic graphs. Here, we discuss two different semantic graph approaches of representing empirical data and metadata in a knowledge graph, with phenotype descriptions as an example. Almost all phenotype descriptions are still being published as unstructured natural language texts, with far-reaching consequences for their FAIRness, substantially impeding their overall usability within the life sciences. However, with an increasing amount of anatomy ontologies becoming available and semantic applications emerging, a solution to this problem becomes available. Researchers are starting to document and communicate phenotype descriptions through the Web in the form of highly formalized and structured semantic graphs that use ontology terms and Uniform Resource Identifiers (URIs) to circumvent the problems connected with unstructured texts.

RESULTS: Using phenotype descriptions as an example, we compare and evaluate two basic representations of empirical data and their accompanying metadata in the form of semantic graphs: the class-based TBox semantic graph approach called Semantic Phenotype and the instance-based ABox semantic graph approach called Phenotype Knowledge Graph. Their main difference is that only the ABox approach allows for identifying every individual part and property mentioned in the description in a knowledge graph. This technical difference results in substantial practical consequences that significantly affect the overall usability of empirical data. The consequences affect findability, accessibility, and explorability of empirical data as well as their comparability, expandability, universal usability and reusability, and overall machine-actionability. Moreover, TBox semantic graphs often require querying under entailment regimes, which is computationally more complex.

CONCLUSIONS: We conclude that, from a conceptual point of view, the advantages of the instance-based ABox semantic graph approach outweigh its shortcomings and outweigh the advantages of the class-based TBox semantic graph approach. Therefore, we recommend the instance-based ABox approach as a FAIR approach for documenting and communicating empirical data and metadata in a knowledge graph.

PMID:34823588 | DOI:10.1186/s13326-021-00254-0

Categories: Literature Watch

Pages