Semantic Web

semantic web: Latest results from PubMed

URL: https://pubmed.ncbi.nlm.nih.gov/rss-feed/?feed_id=1JEkdnYNUQx_r7xNn8x36OtSBbORVE88SJ4HzAiB058dOQT5D1&utm_content=1JEkdnYNUQx_r7xNn8x36OtSBbORVE88SJ4HzAiB058dOQT5D1&v=2.18.0.post9+e462414&utm_medium=rss&ff=20250810165645&utm_source=Other

Updated: 1 hour 49 min ago

Exploring biodiversity and ethnobotanical significance of <em>Solanum</em> species in Uzbekistan: unveiling the cultural wealth and ethnopharmacological uses

Fri, 2024-02-09 06:00

Front Pharmacol. 2024 Jan 24;14:1287793. doi: 10.3389/fphar.2023.1287793. eCollection 2023.

ABSTRACT

Despite its millennial existence and empirical documentation, the ethnological knowledge of herbs is a more recent phenomenon. The knowledge of their historical uses as food, medicine, source of income and small-scale businesses, and the sociological impacts are threatened due to the slow ethnobotanical research drive. Species of the genus Solanum have long been extensively used in folk medicine to treat various illnesses of humans since the dawn of civilization. All data were systematically obtained from papers, monographs, and books written in Uzbek, Russian, and English through various scientific online databases, including Google, Google Scholar, PubMed, Scopus, Semantic Scholar, Science Direct, and Web of Science using specific keywords focused on eight Solanum species. Eight native and non-native Solanum species as S. dulcamara L., S. lycopersicum L., S. melongena L., S. nigrum L., S. rostratum Dunal., S. sisymbriifolium Lam., S. tuberosum L., and S. villosum Mill. have been recorded in Uzbekistan of Central Asia. In this article we presented recently obtained data on the diversity, morphological characteristics, global distribution, habitat, population status, phenology, reproduction, pharmacology and phytochemistry of these Solanum species in Uzbekistan. Furthermore, relying on a combination of literature reviews and analyses from various scientific papers, we focus on food consumption coupled with global ethnobotanical and ethnopharmacological uses in human diseases of the Solanum species growing in Uzbekistan. Since the dawn of civilization, these eight cultivated and non-cultivated species of Solanum have provided sustainable resources of medicinal plants in Uzbekistan to prevent and treat various human diseases. Based on the collected data, it was shown that Solanum species have not been studied ethnobotanically and ethnomedicinally in Uzbekistan and it is necessary to conduct phytochemical and biotechnological research on them in the future. Traditional uses and scientific evaluation of Solanum indicate that S. nigrum, S. sisymbriifolium and S. tuberosum are one of the most widely used species in some parts of the world. Although considerable progress has been made to comprehend the chemical and biological properties of S. nigrum and S. tuberosum species, more research on the pharmacology and toxicology of these species is needed to ensure the safety, efficacy, and quality of their biologically active extracts and isolated bioactive compounds. Additionally, conducting additional research on the structure-activity relationship of certain isolated phytochemicals has the potential to enhance their biological efficacy and advance the scientific utilization of traditional applications of Solanum taxa.

PMID:38333226 | PMC:PMC10851437 | DOI:10.3389/fphar.2023.1287793

Categories: Literature Watch

Application of an ontology for model cards to generate computable artifacts for linking machine learning information from biomedical research

Thu, 2024-02-08 06:00

Proc Int World Wide Web Conf. 2023 Apr;2023(Companion):820-825. doi: 10.1145/3543873.3587601. Epub 2023 Apr 30.

ABSTRACT

Model card reports provide a transparent description of machine learning models which includes information about their evaluation, limitations, intended use, etc. Federal health agencies have expressed an interest in model cards report for research studies using machine-learning based AI. Previously, we have developed an ontology model for model card reports to structure and formalize these reports. In this paper, we demonstrate a Java-based library (OWL API, FaCT++) that leverages our ontology to publish computable model card reports. We discuss future directions and other use cases that highlight applicability and feasibility of ontology-driven systems to support FAIR challenges.

PMID:38327770 | PMC:PMC10848146 | DOI:10.1145/3543873.3587601

Categories: Literature Watch

Ontologies4Cat: investigating the landscape of ontologies for catalysis research data management

Wed, 2024-02-07 06:00

J Cheminform. 2024 Feb 7;16(1):16. doi: 10.1186/s13321-024-00807-2.

ABSTRACT

As scientific digitization advances it is imperative ensuring data is Findable, Accessible, Interoperable, and Reusable (FAIR) for machine-processable data. Ontologies play a vital role in enhancing data FAIRness by explicitly representing knowledge in a machine-understandable format. Research data in catalysis research often exhibits complexity and diversity, necessitating a respectively broad collection of ontologies. While ontology portals such as EBI OLS and BioPortal aid in ontology discovery, they lack deep classification, while quality metrics for ontology reusability and domains are absent for the domain of catalysis research. Thus, this work provides an approach for systematic collection of ontology metadata with focus on the catalysis research data value chain. By classifying ontologies by subdomains of catalysis research, the approach is offering efficient comparison across ontologies. Furthermore, a workflow and codebase is presented, facilitating representation of the metadata on GitHub. Finally, a method is presented to automatically map the classes contained in the ontologies of the metadata collection against each other, providing further insights on relatedness of the ontologies listed. The presented methodology is designed for its reusability, enabling its adaptation to other ontology collections or domains of knowledge. The ontology metadata taken up for this work and the code developed and described in this work are available in a GitHub repository at: https://github.com/nfdi4cat/Ontology-Overview-of-NFDI4Cat .

PMID:38326906 | PMC:PMC10851519 | DOI:10.1186/s13321-024-00807-2

Categories: Literature Watch

The organization of the semantic network as reflected by the neural correlates of six semantic dimensions

Wed, 2024-01-31 06:00

Brain Lang. 2024 Mar;250:105388. doi: 10.1016/j.bandl.2024.105388. Epub 2024 Jan 31.

ABSTRACT

Multiple sensory-motor and non-sensory-motor dimensions have been proposed for semantic representation, but it remains unclear how the semantic system is organized along them in the human brain. Using naturalistic fMRI data and large-scale semantic ratings, we investigated the overlaps and dissociations between the neural correlates of six semantic dimensions: vision, motor, socialness, emotion, space, and time. Our findings revealed a more complex semantic atlas than what is predicted by current neurobiological models of semantic representation. Brain regions that are selectively sensitive to specific semantic dimensions were found both within and outside the brain networks assumed to represent multimodal general and/or abstract semantics. Overlaps between the neural correlates of different semantic dimensions were mainly found inside the default mode network, concentrated in the left anterior superior temporal gyrus and angular gyrus, which have been proposed as two connector hubs that bridge the multimodal experiential semantic system and the language-supported semantic system.

PMID:38295716 | DOI:10.1016/j.bandl.2024.105388

Categories: Literature Watch

Electronic Health Record and Semantic Issues Using Fast Healthcare Interoperability Resources: Systematic Mapping Review

Tue, 2024-01-30 06:00

J Med Internet Res. 2024 Jan 30;26:e45209. doi: 10.2196/45209.

ABSTRACT

BACKGROUND: The increasing use of electronic health records and the Internet of Things has led to interoperability issues at different levels (structural and semantic). Standards are important not only for successfully exchanging data but also for appropriately interpreting them (semantic interoperability). Thus, to facilitate the semantic interoperability of data exchanged in health care, considerable resources have been deployed to improve the quality of shared clinical data by structuring and mapping them to the Fast Healthcare Interoperability Resources (FHIR) standard.

OBJECTIVE: The aims of this study are 2-fold: to inventory the studies on FHIR semantic interoperability resources and terminologies and to identify and classify the approaches and contributions proposed in these studies.

METHODS: A systematic mapping review (SMR) was conducted using 10 electronic databases as sources of information for inventory and review studies published during 2012 to 2022 on the development and improvement of semantic interoperability using the FHIR standard.

RESULTS: A total of 70 FHIR studies were selected and analyzed to identify FHIR resource types and terminologies from a semantic perspective. The proposed semantic approaches were classified into 6 categories, namely mapping (31/126, 24.6%), terminology services (18/126, 14.3%), resource description framework or web ontology language-based proposals (24/126, 19%), annotation proposals (18/126, 14.3%), machine learning (ML) and natural language processing (NLP) proposals (20/126, 15.9%), and ontology-based proposals (15/126, 11.9%). From 2012 to 2022, there has been continued research in 6 categories of approaches as well as in new and emerging annotations and ML and NLP proposals. This SMR also classifies the contributions of the selected studies into 5 categories: framework or architecture proposals, model proposals, technique proposals, comparison services, and tool proposals. The most frequent type of contribution is the proposal of a framework or architecture to enable semantic interoperability.

CONCLUSIONS: This SMR provides a classification of the different solutions proposed to address semantic interoperability using FHIR at different levels: collecting, extracting and annotating data, modeling electronic health record data from legacy systems, and applying transformation and mapping to FHIR models and terminologies. The use of ML and NLP for unstructured data is promising and has been applied to specific use case scenarios. In addition, terminology services are needed to accelerate their use and adoption; furthermore, techniques and tools to automate annotation and ontology comparison should help reduce human interaction.

PMID:38289660 | PMC:PMC10865191 | DOI:10.2196/45209

Categories: Literature Watch

Case-Reported Data Management Methodology Using an RDF Data Model for Building a Multicenter Clinical Registry

Thu, 2024-01-25 06:00

Stud Health Technol Inform. 2024 Jan 25;310:184-188. doi: 10.3233/SHTI230952.

ABSTRACT

In multicenter clinical research, case-reported clinical data are managed for each research project. Participating institutions manage the mapping between standardized codes and in-house codes. To use the data extracted from electronic medical records in case report forms, it is necessary to pay attention to the gap in the semantic hierarchy. Managing mapping information between in-house and standardized codes is centralized in Resource Description Framework (RDF) stores. The relationship between standardized and in-house codes is described in RDF and stored in RDF stores. RESTful APIs for accessing RDF stores in SPARQL was developed and verified. The relationship between standardized codes and in-house codes of pharmaceuticals was expressed in RDF triples. As a +result of the operational verification of the implemented APIs, it was confirmed that data management with knowledge bases expressed in RDF graphs is possible. The ability to dynamically modify mapping definitions enables flexible data management and ease of operational restrictions.

PMID:38269790 | DOI:10.3233/SHTI230952

Categories: Literature Watch

Are We Where We Want to Be in Undergraduate Pathology Education?

Wed, 2024-01-24 06:00

Turk Patoloji Derg. 2024 Jan 24. doi: 10.5146/tjpath.2023.13048. Online ahead of print.

ABSTRACT

OBJECTIVE: This review which aims to examine the recent and current status of pathology education in medical schools, and covers the publications related to undergraduate pathology education published between 2010 January and June 2023.

MATERIAL AND METHOD: A search was performed through PubMed, Google Scholar, Semantic Scholar, and Ulakbim search engines for the Science Citation Index, Science Citation Index Expanded, Emerging Sources Citation Index, Directory of Open Access Journals, Scopus, PubMed as well as TR Dizin indexed articles. The findings are categorized into two periods as 2010 January - 2020 April (pre-COVID-19 pandemic) and May 2020 - 2023 June. A total of 24 reviews/editorials/letters to the editor and 63 research articles in the pre-pandemic period and 11 reviews/ editorials/ letters to the editor and 35 research articles between 2020 May and 2023 June are included in the analysis.

RESULTS: Currently, medical education generally depends on core education programs with defined learning objectives and outcomes. Moreover, problem-based, case-based, and team-based interactive learning are being used along with traditional didactic courses. Additionally, digital/ web-based/remote education methods have gained prominence after the COVID-19 pandemic. The virtual or augmented reality and 3D drawing applications are offered as a solution for the autopsy and macroscopy courses. A scarce number of publications are found on measuring and evaluating the effectiveness of learning.

CONCLUSION: Artificial intelligence in pathology education is a topic that looks likely to become important in the near future. National and international comprehensive standardization is a necessity. A joint effort and collective intelligence are needed to achieve the desired goals in undergraduate pathology education.

PMID:38265100 | DOI:10.5146/tjpath.2023.13048

Categories: Literature Watch

Development and quality appraisal of a new English breast screening linked data set as part of the age, test threshold, and frequency of mammography screening (ATHENA-M) study

Wed, 2024-01-24 06:00

Br J Radiol. 2024 Jan 23;97(1153):98-112. doi: 10.1093/bjr/tqad023.

ABSTRACT

OBJECTIVES: To build a data set capturing the whole breast cancer screening journey from individual breast cancer screening records to outcomes and assess data quality.

METHODS: Routine screening records (invitation, attendance, test results) from all 79 English NHS breast screening centres between January 1, 1988 and March 31, 2018 were linked to cancer registry (cancer characteristics and treatment) and national mortality data. Data quality was assessed using comparability, validity, timeliness, and completeness.

RESULTS: Screening records were extracted from 76/79 English breast screening centres, 3/79 were not possible due to software issues. Data linkage was successful from 1997 after introduction of a universal identifier for women (NHS number). Prior to 1997 outcome data are incomplete due to linkage issues, reducing validity. Between January 1, 1997 and March 31, 2018, a total of 11 262 730 women were offered screening of whom 9 371 973 attended at least one appointment, with 139 million person-years of follow-up (a median of 12.4 person years for each woman included) with 73 810 breast cancer deaths and 1 111 139 any-cause deaths. Comparability to reference data sets and internal validity were demonstrated. Data completeness was high for core screening variables (>99%) and main cancer outcomes (>95%).

CONCLUSIONS: The ATHENA-M project has created a large high-quality and representative data set of individual women's screening trajectories and outcomes in England from 1997 to 2018, data before 1997 are lower quality.

ADVANCES IN KNOWLEDGE: This is the most complete data set of English breast screening records and outcomes constructed to date, which can be used to evaluate and optimize screening.

PMID:38263823 | DOI:10.1093/bjr/tqad023

Categories: Literature Watch

Development and application of Chinese medical ontology for diabetes mellitus

Fri, 2024-01-19 06:00

BMC Med Inform Decis Mak. 2024 Jan 19;24(1):18. doi: 10.1186/s12911-023-02405-y.

ABSTRACT

OBJECTIVE: To develop a Chinese Diabetes Mellitus Ontology (CDMO) and explore methods for constructing high-quality Chinese biomedical ontologies.

MATERIALS AND METHODS: We used various data sources, including Chinese clinical practice guidelines, expert consensus, literature, and hospital information system database schema, to build the CDMO. We combined top-down and bottom-up strategies and integrated text mining and cross-lingual ontology mapping. The ontology was validated by clinical experts and ontology development tools, and its application was validated through clinical decision support and Chinese natural language medical question answering.

RESULTS: The current CDMO consists of 3,752 classes, 182 fine-grained object properties with hierarchical relationships, 108 annotation properties, and over 12,000 mappings to other well-known medical ontologies in English. Based on the CDMO and clinical practice guidelines, we developed 200 rules for diabetes diagnosis, treatment, diet, and medication recommendations using the Semantic Web Rule Language. By injecting ontology knowledge, CDMO enhances the performance of the T5 model on a real-world Chinese medical question answering dataset related to diabetes.

CONCLUSION: CDMO has fine-grained semantic relationships and extensive annotation information, providing a foundation for medical artificial intelligence applications in Chinese contexts, including the construction of medical knowledge graphs, clinical decision support systems, and automated medical question answering. Furthermore, the development process incorporated natural language processing and cross-lingual ontology mapping to improve the quality of the ontology and improved development efficiency. This workflow offers a methodological reference for the efficient development of other high-quality Chinese as well as non-English medical ontologies.

PMID:38243204 | PMC:PMC10799385 | DOI:10.1186/s12911-023-02405-y

Categories: Literature Watch

Channel semantic mutual learning for visible-thermal person re-identification

Fri, 2024-01-19 06:00

PLoS One. 2024 Jan 19;19(1):e0293498. doi: 10.1371/journal.pone.0293498. eCollection 2024.

ABSTRACT

Visible-infrared person re-identification (VI-ReID) is a cross-modality retrieval issue aiming to match the same pedestrian between visible and infrared cameras. Thus, the modality discrepancy presents a significant challenge for this task. Most methods employ different networks to extract features that are invariant between modalities. While we propose a novel channel semantic mutual learning network (CSMN), which attributes the difference in semantics between modalities to the difference at the channel level, it optimises the semantic consistency between channels from two perspectives: the local inter-channel semantics and the global inter-modal semantics. Meanwhile, we design a channel-level auto-guided double metric loss (CADM) to learn modality-invariant features and the sample distribution in a fine-grained manner. We conducted experiments on RegDB and SYSU-MM01, and the experimental results validate the superiority of CSMN. Especially on RegDB datasets, CSMN improves the current best performance by 3.43% and 0.5% on the Rank-1 score and mINP value, respectively. The code is available at https://github.com/013zyj/CSMN.

PMID:38241236 | PMC:PMC10798514 | DOI:10.1371/journal.pone.0293498

Categories: Literature Watch

Voxel representation of brain images inpainting via Regional Pixel Semantic Network and pyramidal attention AE - Quantile differential mechanism model

Fri, 2024-01-12 06:00

Comput Biol Med. 2024 Mar;170:107767. doi: 10.1016/j.compbiomed.2023.107767. Epub 2023 Nov 28.

ABSTRACT

Medical image inpainting holds significant importance in enhancing the quality of medical images by restoring missing areas, thereby rendering them suitable for diagnostic purposes. While several techniques have been previously proposed for medical image inpainting, they are not suitable for distorted images containing metallic implants due to their limited consideration of known shaped masking. To overcome this limitation, a novel Vectorized Box Interpolation with Arbitrary Auto-Rand Augment Masking technique has been proposed which involves scaling and vectorizing images to expand their details and generating asymmetrically shaped masking in an automatic random format. One of the challenging tasks in this regard is the precise detection of lost regions, which is addressed through the introduction of the Regional Pixel Semantic Network. This technique employs the locally shared features (LSF) based region sensing with FCN (fully convolutional network) segmentation, which performs automatic segmentation based on neighboring pixel local dependency and regional features to determine the location of masked regions. During the reconstruction of missing parts, a significant challenge posed is the inability to recognize proximity in encoding owing to the generation of shadow-like regions on the feature map. To address this issue, a novel Multilayered DRC Regularized Pyramidal Attention AE Model has been proposed which employs dilated convolution with coherent pyramidal attention for feature extraction and improves image resolution using a Laplacian convolutional layer. Moreover, the realness of the generated image is determined using the Quantile Differential Mechanism model, where in the Quantile Differential Partial Convolutional Discriminator utilizes the hyperbolic tangent activation function in the partial convolutional layer to calculate recognition accuracy. As a result, the proposed method achieves high percentages for accuracy (98 %), precision (97 %), sensitivity (96 %), recall (95 %), and F-measure (96 %) thereby outperforming existing methods. Overall, this proposed method effectively handles distorted images with metallic implants, accurately detects lost regions, and improves the reconstructed image quality.

PMID:38215616 | DOI:10.1016/j.compbiomed.2023.107767

Categories: Literature Watch

Methods and datasets for segmentation of minimally invasive surgical instruments in endoscopic images and videos: A review of the state of the art

Sun, 2024-01-07 06:00

Comput Biol Med. 2024 Jan 4;169:107929. doi: 10.1016/j.compbiomed.2024.107929. Online ahead of print.

ABSTRACT

In the field of computer- and robot-assisted minimally invasive surgery, enormous progress has been made in recent years based on the recognition of surgical instruments in endoscopic images and videos. In particular, the determination of the position and type of instruments is of great interest. Current work involves both spatial and temporal information, with the idea that predicting the movement of surgical tools over time may improve the quality of the final segmentations. The provision of publicly available datasets has recently encouraged the development of new methods, mainly based on deep learning. In this review, we identify and characterize datasets used for method development and evaluation and quantify their frequency of use in the literature. We further present an overview of the current state of research regarding the segmentation and tracking of minimally invasive surgical instruments in endoscopic images and videos. The paper focuses on methods that work purely visually, without markers of any kind attached to the instruments, considering both single-frame semantic and instance segmentation approaches, as well as those that incorporate temporal information. The publications analyzed were identified through the platforms Google Scholar, Web of Science, and PubMed. The search terms used were "instrument segmentation", "instrument tracking", "surgical tool segmentation", and "surgical tool tracking", resulting in a total of 741 articles published between 01/2015 and 07/2023, of which 123 were included using systematic selection criteria. A discussion of the reviewed literature is provided, highlighting existing shortcomings and emphasizing the available potential for future developments.

PMID:38184862 | DOI:10.1016/j.compbiomed.2024.107929

Categories: Literature Watch

Does use of GP and specialist services vary across areas and according to individual socioeconomic position? A multilevel analysis using linked data in Australia

Sat, 2024-01-06 06:00

BMJ Open. 2024 Jan 6;14(1):e074624. doi: 10.1136/bmjopen-2023-074624.

ABSTRACT

OBJECTIVE: Timely access to primary care and supporting specialist care relative to need is essential for health equity. However, use of services can vary according to an individual's socioeconomic circumstances or where they live. This study aimed to quantify individual socioeconomic variation in general practitioner (GP) and specialist use in New South Wales (NSW), accounting for area-level variation in use.

DESIGN: Outcomes were GP use and quality-of-care and specialist use. Multilevel logistic regression was used to estimate: (1) median ORs (MORs) to quantify small area variation in outcomes, which gives the median increased risk of moving to an area of higher risk of an outcome, and (2) ORs to quantify associations between outcomes and individual education level, our main exposure variable. Analyses were adjusted for individual sociodemographic and health characteristics and performed separately by remoteness categories.

SETTING: Baseline data (2006-2009) from the 45 and Up Study, NSW, Australia, linked to Medicare Benefits Schedule and death data (to December 2012).

PARTICIPANTS: 267 153 adults aged 45 years and older.

RESULTS: GP (MOR=1.32-1.35) and specialist use (1.16-1.18) varied between areas, accounting for individual characteristics. For a given level of need and accounting for area variation, low education-level individuals were more likely to be frequent users of GP services (no school certificate vs university, OR=1.63-1.91, depending on remoteness category) and have continuity of care (OR=1.14-1.24), but were less likely to see a specialist (OR=0.85-0.95).

CONCLUSION: GP and specialist use varied across small areas in NSW, independent of individual characteristics. Use of GP care was equitable, but specialist care was not. Failure to address inequitable specialist use may undermine equity gains within the primary care system. Policies should also focus on local variation.

PMID:38184309 | DOI:10.1136/bmjopen-2023-074624

Categories: Literature Watch

A patient safety knowledge graph supporting vaccine product development

Thu, 2024-01-04 06:00

BMC Med Inform Decis Mak. 2024 Jan 4;24(1):10. doi: 10.1186/s12911-023-02409-8.

ABSTRACT

BACKGROUND: Knowledge graphs are well-suited for modeling complex, unstructured, and multi-source data and facilitating their analysis. During the COVID-19 pandemic, adverse event data were integrated into a knowledge graph to support vaccine safety surveillance and nimbly respond to urgent health authority questions. Here, we provide details of this post-marketing safety system using public data sources. In addition to challenges with varied data representations, adverse event reporting on the COVID-19 vaccines generated an unprecedented volume of data; an order of magnitude larger than adverse events for all previous vaccines. The Patient Safety Knowledge Graph (PSKG) is a robust data store to accommodate the volume of adverse event data and harmonize primary surveillance data sources.

METHODS: We designed a semantic model to represent key safety concepts. We built an extract-transform-load (ETL) data pipeline to parse and import primary public data sources; align key elements such as vaccine names; integrated the Medical Dictionary for Regulatory Activities (MedDRA); and applied quality metrics. PSKG is deployed in a Neo4J graph database, and made available via a web interface and Application Programming Interfaces (APIs).

RESULTS: We import and align adverse event data and vaccine exposure data from 250 countries on a weekly basis, producing a graph with 4,340,980 nodes and 30,544,475 edges as of July 1, 2022. PSKG is used for ad-hoc analyses and periodic reporting for several widely available COVID-19 vaccines. Analysis code using the knowledge graph is 80% shorter than an equivalent implementation written entirely in Python, and runs over 200 times faster.

CONCLUSIONS: Organizing safety data into a concise model of nodes, properties, and edge relationships has greatly simplified analysis code by removing complex parsing and transformation algorithms from individual analyses and instead managing these centrally. The adoption of the knowledge graph transformed how the team answers key scientific and medical questions. Whereas previously an analysis would involve aggregating and transforming primary datasets from scratch to answer a specific question, the team can now iterate easily and respond as quickly as requests evolve (e.g., "Produce vaccine-X safety profile for adverse event-Y by country instead of age-range").

PMID:38178113 | DOI:10.1186/s12911-023-02409-8

Categories: Literature Watch

DisoFLAG: accurate prediction of protein intrinsic disorder and its functions using graph-based interaction protein language model

Wed, 2024-01-03 06:00

BMC Biol. 2024 Jan 2;22(1):3. doi: 10.1186/s12915-023-01803-y.

ABSTRACT

Intrinsically disordered proteins and regions (IDPs/IDRs) are functionally important proteins and regions that exist as highly dynamic conformations under natural physiological conditions. IDPs/IDRs exhibit a broad range of molecular functions, and their functions involve binding interactions with partners and remaining native structural flexibility. The rapid increase in the number of proteins in sequence databases and the diversity of disordered functions challenge existing computational methods for predicting protein intrinsic disorder and disordered functions. A disordered region interacts with different partners to perform multiple functions, and these disordered functions exhibit different dependencies and correlations. In this study, we introduce DisoFLAG, a computational method that leverages a graph-based interaction protein language model (GiPLM) for jointly predicting disorder and its multiple potential functions. GiPLM integrates protein semantic information based on pre-trained protein language models into graph-based interaction units to enhance the correlation of the semantic representation of multiple disordered functions. The DisoFLAG predictor takes amino acid sequences as the only inputs and provides predictions of intrinsic disorder and six disordered functions for proteins, including protein-binding, DNA-binding, RNA-binding, ion-binding, lipid-binding, and flexible linker. We evaluated the predictive performance of DisoFLAG following the Critical Assessment of protein Intrinsic Disorder (CAID) experiments, and the results demonstrated that DisoFLAG offers accurate and comprehensive predictions of disordered functions, extending the current coverage of computationally predicted disordered function categories. The standalone package and web server of DisoFLAG have been established to provide accurate prediction tools for intrinsic disorders and their associated functions.

PMID:38166858 | DOI:10.1186/s12915-023-01803-y

Categories: Literature Watch

Concreteness ratings for 36,000 Estonian words

Fri, 2023-12-22 06:00

Behav Res Methods. 2023 Dec 21. doi: 10.3758/s13428-023-02257-4. Online ahead of print.

ABSTRACT

We present a collection of concreteness ratings for 35,979 words in Estonian. The data were collected via a web application from 2278 native Estonian speakers. Human ratings of concreteness have not been collected for Estonian beforehand. We compare our results to Aedmaa et al. (2018), who assigned concreteness ratings to 240,000 Estonian words by means of machine learning. We show that while these two datasets show reasonable correlation (R = 0.71), there are considerable differences in the distribution of the ratings, which we discuss in this paper. Furthermore, the results also raise questions about the importance of the type of scale used for collecting ratings. While most other datasets have been compiled based on questionnaires entailing five- or seven-point Likert scales, we used a continuous 0-10 scale. Comparing our rating distribution to those of other studies, we found that it is most similar to the distribution in Lahl et al. (Behavior Research Methods, 41(1), 13-19, 2009), who also used a 0-10 scale. Concreteness ratings for Estonian words are available at OSF .

PMID:38129738 | DOI:10.3758/s13428-023-02257-4

Categories: Literature Watch

Postnatal growth in small vulnerable newborns: a longitudinal study of 2 million Brazilians using routine register-based linked data

Thu, 2023-12-21 06:00

Am J Clin Nutr. 2024 Feb;119(2):444-455. doi: 10.1016/j.ajcnut.2023.12.009. Epub 2023 Dec 20.

ABSTRACT

BACKGROUND: Preterm, low-birth weight (LBW) and small-for-gestational age (SGA) newborns have a higher frequency of adverse health outcomes, including linear and ponderal growth impairment.

OBJECTIVE: To describe the growth trajectories and to estimate catch-up growth during the first 5 y of life of small newborns according to 3 vulnerability phenotypes (preterm, LBW, SGA).

METHODS: Longitudinal study using linked data from the 100 Million Brazilian Cohort baseline, the Brazilian National Live Birth System (SINASC), and the Food and Nutrition Surveillance System (SISVAN) from 2011 to 2017. We estimated the length/height-for-age (L/HAZ) and weight-for-age z-score (WAZ) trajectories from children of 6-59 mo using the linear mixed model for each vulnerable newborn phenotype. Growth velocity for both L/HAZ and WAZ was calculated considering the change (Δ) in the mean z-score between 2 time points. Catch-up growth was defined as a change in z-score > 0.67 at any time during follow-up.

RESULTS: We analyzed 2,021,998 live born children and 8,726,599 observations. The prevalence of at least one of the vulnerable phenotypes was 16.7% and 0.6% were simultaneously preterm, LBW, and SGA. For those born at term, all phenotypes had a period of growth recovery from 12 mo. For preterm infants, the onset of L/HAZ growth recovery started later at 24 mo and the growth trajectories appear to be lower than those born at term, a condition aggravated among children with the 3 phenotypes. Preterm and female infants seem to experience slower growth recovery than those born at term and males. The catch-up growth occurs at 24-59 mo for males preterm: preterm + AGA + NBW (Δ = 0.80), preterm + AGA + LBW (Δ = 0.88), and preterm + SGA + LBW (Δ = 1.08); and among females: term + SGA + NBW (Δ = 0.69), term + AGA + LBW (Δ = 0.72), term + SGA + LBW (Δ = 0.77), preterm + AGA + LBW (Δ = 0.68), and preterm + SGA + LBW (Δ = 0.83).

CONCLUSIONS: Children born preterm seem to reach L/HAZ and WAZ growth trajectories lower than those attained by children born at term, a condition aggravated among the most vulnerable.

PMID:38128734 | DOI:10.1016/j.ajcnut.2023.12.009

Categories: Literature Watch

Designing an Ontology for Emotion-driven Visual Representations

Thu, 2023-12-21 06:00

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2017 Nov;2017:1280-1283. doi: 10.1109/bibm.2017.8217844. Epub 2017 Dec 18.

ABSTRACT

Emotions influence our perceptions and decisions and are often felt more strongly in situations related to healthcare. Therefore, it is important to understand how both providers and patients express their emotions in face-to-face scenarios. An ontology is a way to represent domain concepts and the relationships between them in a polyarchical manner. We have created an ontological model called the Visualized Emotion Ontology (VEO) that expresses the semantic definitions and visualizations of 25 emotions based on published research. With VEO, we can augment patient-facing software tools, like embodied conversational agents, to improve patient-provider interaction in clinical environments.

PMID:38125740 | PMC:PMC10732717 | DOI:10.1109/bibm.2017.8217844

Categories: Literature Watch

Ontology of Consumer Health Vocabulary: providing a formal and interoperable semantic resource for linking lay language and medical terminology

Thu, 2023-12-21 06:00

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2019 Nov;2019:1177-1178. doi: 10.1109/bibm47256.2019.8983220. Epub 2020 Feb 6.

ABSTRACT

The Consumer Health Vocabulary has been an important contribution to the health informatics field since its introduction in 2006. Many studies have utilized the vocabulary for various scientific research to bridge the gap between consumers and health experts. Given the flat file format of the Consumer Health Vocabulary dataset, we developed a SKOS-based ontology of the dataset. As an ontology, this dataset can be semantically linked to other resources to provide consumer-level meaning. In addition with this artifact, we plan to further expand the terminology.

PMID:38125584 | PMC:PMC10732699 | DOI:10.1109/bibm47256.2019.8983220

Categories: Literature Watch

Predicting risk of suicidal behavior from insurance claims data vs. linked data from insurance claims and electronic health records

Tue, 2023-12-19 06:00

Pharmacoepidemiol Drug Saf. 2024 Jan;33(1):e5734. doi: 10.1002/pds.5734. Epub 2023 Dec 19.

ABSTRACT

PURPOSE: Observational studies assessing effects of medical products on suicidal behavior often rely on health record data to account for pre-existing risk. We assess whether high-dimensional models predicting suicide risk using data derived from insurance claims and electronic health records (EHRs) are superior to models using data from insurance claims alone.

METHODS: Data were from seven large health systems identified outpatient mental health visits by patients aged 11 or older between 1/1/2009 and 9/30/2017. Data for the 5 years prior to each visit identified potential predictors of suicidal behavior typically available from insurance claims (e.g., mental health diagnoses, procedure codes, medication dispensings) and additional potential predictors available from EHRs (self-reported race and ethnicity, responses to Patient Health Questionnaire or PHQ-9 depression questionnaires). Nonfatal self-harm events following each visit were identified from insurance claims data and fatal self-harm events were identified by linkage to state mortality records. Random forest models predicting nonfatal or fatal self-harm over 90 days following each visit were developed in a 70% random sample of visits and validated in a held-out sample of 30%. Performance of models using linked claims and EHR data was compared to models using claims data only.

RESULTS: Among 15 845 047 encounters by 1 574 612 patients, 99 098 (0.6%) were followed by a self-harm event within 90 days. Overall classification performance did not differ between the best-fitting model using all data (area under the receiver operating curve or AUC = 0.846, 95% CI 0.839-0.854) and the best-fitting model limited to data available from insurance claims (AUC = 0.846, 95% CI 0.838-0.853). Competing models showed similar classification performance across a range of cut-points and similar calibration performance across a range of risk strata. Results were similar when the sample was limited to health systems and time periods where PHQ-9 depression questionnaires were recorded more frequently.

CONCLUSION: Investigators using health record data to account for pre-existing risk in observational studies of suicidal behavior need not limit that research to databases including linked EHR data.

PMID:38112287 | PMC:PMC10843611 | DOI:10.1002/pds.5734

Categories: Literature Watch

Anil Jegga

Semantic Web

Exploring biodiversity and ethnobotanical significance of <em>Solanum</em> species in Uzbekistan: unveiling the cultural wealth and ethnopharmacological uses

Application of an ontology for model cards to generate computable artifacts for linking machine learning information from biomedical research

Ontologies4Cat: investigating the landscape of ontologies for catalysis research data management

The organization of the semantic network as reflected by the neural correlates of six semantic dimensions

Electronic Health Record and Semantic Issues Using Fast Healthcare Interoperability Resources: Systematic Mapping Review

Case-Reported Data Management Methodology Using an RDF Data Model for Building a Multicenter Clinical Registry

Are We Where We Want to Be in Undergraduate Pathology Education?

Development and quality appraisal of a new English breast screening linked data set as part of the age, test threshold, and frequency of mammography screening (ATHENA-M) study

Development and application of Chinese medical ontology for diabetes mellitus

Channel semantic mutual learning for visible-thermal person re-identification

Voxel representation of brain images inpainting via Regional Pixel Semantic Network and pyramidal attention AE - Quantile differential mechanism model

Methods and datasets for segmentation of minimally invasive surgical instruments in endoscopic images and videos: A review of the state of the art

Does use of GP and specialist services vary across areas and according to individual socioeconomic position? A multilevel analysis using linked data in Australia

A patient safety knowledge graph supporting vaccine product development

DisoFLAG: accurate prediction of protein intrinsic disorder and its functions using graph-based interaction protein language model

Concreteness ratings for 36,000 Estonian words

Postnatal growth in small vulnerable newborns: a longitudinal study of 2 million Brazilians using routine register-based linked data

Designing an Ontology for Emotion-driven Visual Representations

Ontology of Consumer Health Vocabulary: providing a formal and interoperable semantic resource for linking lay language and medical terminology

Predicting risk of suicidal behavior from insurance claims data vs. linked data from insurance claims and electronic health records

Pages