Semantic Web

MetaTron: advancing biomedical annotation empowering relation annotation and collaboration

Fri, 2024-03-15 06:00

BMC Bioinformatics. 2024 Mar 14;25(1):112. doi: 10.1186/s12859-024-05730-9.

ABSTRACT

BACKGROUND: The constant growth of biomedical data is accompanied by the need for new methodologies to effectively and efficiently extract machine-readable knowledge for training and testing purposes. A crucial aspect in this regard is creating large, often manually or semi-manually, annotated corpora vital for developing effective and efficient methods for tasks like relation extraction, topic recognition, and entity linking. However, manual annotation is expensive and time-consuming especially if not assisted by interactive, intuitive, and collaborative computer-aided tools. To support healthcare experts in the annotation process and foster annotated corpora creation, we present MetaTron. MetaTron is an open-source and free-to-use web-based annotation tool to annotate biomedical data interactively and collaboratively; it supports both mention-level and document-level annotations also integrating automatic built-in predictions. Moreover, MetaTron enables relation annotation with the support of ontologies, functionalities often overlooked by off-the-shelf annotation tools.

RESULTS: We conducted a qualitative analysis to compare MetaTron with a set of manual annotation tools including TeamTat, INCEpTION, LightTag, MedTAG, and brat, on three sets of criteria: technical, data, and functional. A quantitative evaluation allowed us to assess MetaTron performances in terms of time and number of clicks to annotate a set of documents. The results indicated that MetaTron fulfills almost all the selected criteria and achieves the best performances.

CONCLUSIONS: MetaTron stands out as one of the few annotation tools targeting the biomedical domain supporting the annotation of relations, and fully customizable with documents in several formats-PDF included, as well as abstracts retrieved from PubMed, Semantic Scholar, and OpenAIRE. To meet any user need, we released MetaTron both as an online instance and as a Docker image locally deployable.

PMID:38486137 | DOI:10.1186/s12859-024-05730-9

Categories: Literature Watch

A Semantic Approach to Describe Social and Economic Characteristics That Impact Health Outcomes (Social Determinants of Health): Ontology Development Study

Wed, 2024-03-13 06:00

Online J Public Health Inform. 2024 Mar 13;16:e52845. doi: 10.2196/52845.

ABSTRACT

BACKGROUND: Social determinants of health (SDoH) have been described by the World Health Organization as the conditions in which individuals are born, live, work, and age. These conditions can be grouped into 3 interrelated levels known as macrolevel (societal), mesolevel (community), and microlevel (individual) determinants. The scope of SDoH expands beyond the biomedical level, and there remains a need to connect other areas such as economics, public policy, and social factors.

OBJECTIVE: Providing a computable artifact that can link health data to concepts involving the different levels of determinants may improve our understanding of the impact SDoH have on human populations. Modeling SDoH may help to reduce existing gaps in the literature through explicit links between the determinants and biological factors. This in turn can allow researchers and clinicians to make better sense of data and discover new knowledge through the use of semantic links.

METHODS: An experimental ontology was developed to represent knowledge of the social and economic characteristics of SDoH. Information from 27 literature sources was analyzed to gather concepts and encoded using Web Ontology Language, version 2 (OWL2) and Protégé. Four evaluators independently reviewed the ontology axioms using natural language translation. The analyses from the evaluations and selected terminologies from the Basic Formal Ontology were used to create a revised ontology with a broad spectrum of knowledge concepts ranging from the macrolevel to the microlevel determinants.

RESULTS: The literature search identified several topics of discussion for each determinant level. Publications for the macrolevel determinants centered around health policy, income inequality, welfare, and the environment. Articles relating to the mesolevel determinants discussed work, work conditions, psychosocial factors, socioeconomic position, outcomes, food, poverty, housing, and crime. Finally, sources found for the microlevel determinants examined gender, ethnicity, race, and behavior. Concepts were gathered from the literature and used to produce an ontology consisting of 383 classes, 109 object properties, and 748 logical axioms. A reasoning test revealed no inconsistent axioms.

CONCLUSIONS: This ontology models heterogeneous social and economic concepts to represent aspects of SDoH. The scope of SDoH is expansive, and although the ontology is broad, it is still in its early stages. To our current understanding, this ontology represents the first attempt to concentrate on knowledge concepts that are currently not covered by existing ontologies. Future direction will include further expanding the ontology to link with other biomedical ontologies, including alignment for granular semantics.

PMID:38477963 | DOI:10.2196/52845

Categories: Literature Watch

Sharing Digital Health Educational Resources in a One-Stop Shop Portal: Tutorial on the Catalog and Index of Digital Health Teaching Resources (CIDHR) Semantic Search Engine

Mon, 2024-03-04 06:00

JMIR Med Educ. 2024 Mar 4;10:e48393. doi: 10.2196/48393.

ABSTRACT

BACKGROUND: Access to reliable and accurate digital health web-based resources is crucial. However, the lack of dedicated search engines for non-English languages, such as French, is a significant obstacle in this field. Thus, we developed and implemented a multilingual, multiterminology semantic search engine called Catalog and Index of Digital Health Teaching Resources (CIDHR). CIDHR is freely accessible to everyone, with a focus on French-speaking resources. CIDHR has been initiated to provide validated, high-quality content tailored to the specific needs of each user profile, be it students or professionals.

OBJECTIVE: This study's primary aim in developing and implementing the CIDHR is to improve knowledge sharing and spreading in digital health and health informatics and expand the health-related educational community, primarily French speaking but also in other languages. We intend to support the continuous development of initial (ie, bachelor level), advanced (ie, master and doctoral levels), and continuing training (ie, professionals and postgraduate levels) in digital health for health and social work fields. The main objective is to describe the development and implementation of CIDHR. The hypothesis guiding this research is that controlled vocabularies dedicated to medical informatics and digital health, such as the Medical Informatics Multilingual Ontology (MIMO) and the concepts structuring the French National Referential on Digital Health (FNRDH), to index digital health teaching and learning resources, are effectively increasing the availability and accessibility of these resources to medical students and other health care professionals.

METHODS: First, resource identification is processed by medical librarians from websites and scientific sources preselected and validated by domain experts and surveyed every week. Then, based on MIMO and FNRDH, the educational resources are indexed for each related knowledge domain. The same resources are also tagged with relevant academic and professional experience levels. Afterward, the indexed resources are shared with the digital health teaching and learning community. The last step consists of assessing CIDHR by obtaining informal feedback from users.

RESULTS: Resource identification and evaluation processes were executed by a dedicated team of medical librarians, aiming to collect and curate an extensive collection of digital health teaching and learning resources. The resources that successfully passed the evaluation process were promptly included in CIDHR. These resources were diligently indexed (with MIMO and FNRDH) and tagged for the study field and degree level. By October 2023, a total of 371 indexed resources were available on a dedicated portal.

CONCLUSIONS: CIDHR is a multilingual digital health education semantic search engine and platform that aims to increase the accessibility of educational resources to the broader health care-related community. It focuses on making resources "findable," "accessible," "interoperable," and "reusable" by using a one-stop shop portal approach. CIDHR has and will have an essential role in increasing digital health literacy.

PMID:38437007 | DOI:10.2196/48393

Categories: Literature Watch

Data resource profile: the Edinburgh Child Protection Dataset - a new linked administrative data source of children referred to Child Protection paediatric services in Edinburgh, Scotland

Fri, 2024-03-01 06:00

Int J Popul Data Sci. 2023 Dec 14;8(3):2173. doi: 10.23889/ijpds.v8i6.2173. eCollection 2023.

ABSTRACT

INTRODUCTION: Child maltreatment affects a substantial number of children. However current evidence relies on either longitudinal studies, which are complex and resource-intensive, or linked data studies based on social services data, which is arguably the tip of the iceberg in terms of children who are maltreated. Reliable, linked, population-level data on children referred to services due to suspected abuse or neglect will increase our ability to examine risk factors for, and outcomes following, abuse and neglect.

OBJECTIVE: The objective of this project was to create a linkable population level dataset, The Edinburgh Child Protection Dataset (ECPD), comprising all children referred to the Edinburgh Child Protection Paediatric healthcare team due to a concern about their welfare between 1995 and 2015.

METHODS: The paper presents the process for creating the dataset. The analyses provide examples of available data from the main referrals dataset between 1995 and 2011 (where data quality was highest).

RESULTS: 19,969 referrals were captured, relating to 11,653 children. Of the 19,969 referrals, a higher proportion were girls (54%), although boys were referred for physical abuse more often than girls (41% versus 30%). Younger children were more likely to be referred for physical abuse (35% of 0-4 year olds vs. 27% 15+): older children were more likely to be referred for sexual abuse (48% of 15+ years vs. 18% of 0-4 years). Most referrals came from social workers (46%) or police (31%).

CONCLUSIONS: The ECPD offers a unique insight into the characteristics of referrals to child protection paediatric services over a key period in the history of child protection in Scotland. It is hoped that by making these data available to researchers, and able to be easily linked with both mother and child current and future health records, evidence will be created to better support maltreated children and monitor changes over time.

PMID:38425374 | PMC:PMC10900286 | DOI:10.23889/ijpds.v8i6.2173

Categories: Literature Watch

Toward Robust Graph Semi-Supervised Learning Against Extreme Data Scarcity

Thu, 2024-02-29 06:00

IEEE Trans Neural Netw Learn Syst. 2024 Feb 29;PP. doi: 10.1109/TNNLS.2024.3351938. Online ahead of print.

ABSTRACT

The success of graph neural networks (GNNs) in graph-based web mining highly relies on abundant human-annotated data, which is laborious to obtain in practice. When only a few labeled nodes are available, how to improve their robustness is key to achieving replicable and sustainable graph semi-supervised learning. Though self-training is powerful for semi-supervised learning, its application on graph-structured data may fail because 1) larger receptive fields are not leveraged to capture long-range node interactions, which exacerbates the difficulty of propagating feature-label patterns from labeled nodes to unlabeled nodes and 2) limited labeled data makes it challenging to learn well-separated decision boundaries for different node classes without explicitly capturing the underlying semantic structure. To address the challenges of capturing informative structural and semantic knowledge, we propose a new graph data augmentation framework, augmented graph self-training (AGST), which is built with two new (i.e., structural and semantic) augmentation modules on top of a decoupled GST backbone. In this work, we investigate whether this novel framework can learn a robust graph predictive model under the low-data context. We conduct comprehensive evaluations on semi-supervised node classification under different scenarios of limited labeled-node data. The experimental results demonstrate the unique contributions of the novel data augmentation framework for node classification with few labeled data.

PMID:38421848 | DOI:10.1109/TNNLS.2024.3351938

Categories: Literature Watch

Impact of low birth weight on academic attainment during adolescence: A comprehensive retrospective cohort study using linked data

Wed, 2024-02-28 06:00

Early Hum Dev. 2024 Apr;191:105974. doi: 10.1016/j.earlhumdev.2024.105974. Epub 2024 Feb 27.

ABSTRACT

BACKGROUND: This study addresses a critical knowledge gap by exploring the intricate relationship between low birth weight (LBW) and the heightened risk of suboptimal academic achievement during adolescence through a comprehensive retrospective cohort design.

METHODS: In this registry-based cohort study, meticulously linked health and curriculum-based test data for individuals born in New South Wales (NSW), Australia, between 2003 and 2005 were employed. Birth weight data were carefully sourced from the NSW perinatal data collection (PDC). The educational performance of offspring was thoroughly evaluated using the National Assessment Program for Literacy and Numeracy (NAPLAN) during grade 9, approximately at 14 years of age.

RESULTS: After rigorous adjustments for potential confounders, findings revealed a compelling narrative: LBW adolescents demonstrated an elevated susceptibility to not meeting national minimum standards across all domains, encompassing spelling [OR, 1.59 (95%CI 1.48-1.69)], writing [OR, 1.51 (95%CI 1.41-1.61)], reading [OR, 1.38 (95%CI 1.29-1.48)], and numeracy [OR, 1.52 (95%CI 1.40-1.63)]. Notably, LBW boys exhibited a more pronounced inclination towards diminished academic performance compared to their female counterparts.

CONCLUSIONS: This comprehensive retrospective cohort study, based on linked data, unequivocally establishes LBW as significantly associated with an increased vulnerability to substandard educational achievement during adolescence. Particularly robust effects were observed in females across all outcomes. Aimed at investigating whether LBW serves as a predictive factor for later academic difficulties, this study underscores the imperative for the adoption and fortification of preventative and early intervention strategies to curtail the prevalence of LBW-associated academic underachievement in later adolescence.

PMID:38417379 | DOI:10.1016/j.earlhumdev.2024.105974

Categories: Literature Watch

How to Identify e-Cigarette Brands Available in the United States During 2020-2022: Development and Usability Study

Wed, 2024-02-28 06:00

JMIR Form Res. 2024 Feb 28;8:e47570. doi: 10.2196/47570.

ABSTRACT

BACKGROUND: Prior studies have demonstrated that the e-cigarette market contains a large number of brands. Identifying these existing e-cigarette brands is a key element of market surveillance, which will further assist in policy making and compliance checks.

OBJECTIVE: To facilitate the surveillance of the diverse product landscape in the e-cigarette market, we constructed a semantic database of e-cigarette brands that have appeared in the US market as of 2020-2022.

METHODS: In order to build the brand database, we searched and compiled e-cigarette brands from a comprehensive list of retail channels and sources, including (1) e-liquid and disposable brands sold in web-based stores, (2) e-cigarette brands sold in brick-and-mortar stores and collected by the Nielsen Retail Scanner Data, (3) e-cigarette brands compiled by Wikipedia, (4) self-reported e-cigarette brands from the 2020 International Tobacco Control Four-Country Smoking and Vaping (ITC 4CV) US survey, and (5) e-cigarette brands on Twitter. We also estimated the top 5 e-cigarette brands by sales volume in brick-and-mortar stores, by the frequency and variety of offerings in web-based shops, and by the frequency of self-reported brands from the 2020 ITC 4CV US survey.

RESULTS: As of 2020-2022, a total of 912 e-cigarette brands have been sold by various retail channels. During 2020-2022, the top 5 brands are JUUL, vuse, njoy, blu, and logic in brick-and-mortar stores; blu, king, monster, twist, and air factory for e-liquids in web-based stores; hyde, pod mesh, suorin, vaporlax, and xtra for disposables sold in web-based stores; and smok, aspire, vaporesso, innokin, and eleaf based on self-reported survey data.

CONCLUSIONS: As the US Food and Drug Administration enforces the premarket tobacco market authorization, many e-cigarette brands may become illegal in the US market. In this context, how e-cigarette brands evolve and consolidate in different retail channels will be critical for understanding the regulatory impacts on product availability. Our semantic database of e-cigarette brands can serve as a useful tool to monitor product and marketplace development, conduct compliance checks, assess manufacturers' marketing behaviors, and identify regulatory impacts.

PMID:38416562 | DOI:10.2196/47570

Categories: Literature Watch

Integration of Patient-Reported Outcome Data Collected Via Web Applications and Mobile Apps Into a Nation-Wide COVID-19 Research Platform Using Fast Healthcare Interoperability Resources: Development Study

Tue, 2024-02-27 06:00

J Med Internet Res. 2024 Feb 27;26:e47846. doi: 10.2196/47846.

ABSTRACT

BACKGROUND: The Network University Medicine projects are an important part of the German COVID-19 research infrastructure. They comprise 2 subprojects: COVID-19 Data Exchange (CODEX) and Coordination on Mobile Pandemic Apps Best Practice and Solution Sharing (COMPASS). CODEX provides a centralized and secure data storage platform for research data, whereas in COMPASS, expert panels were gathered to develop a reference app framework for capturing patient-reported outcomes (PROs) that can be used by any researcher.

OBJECTIVE: Our study aims to integrate the data collected with the COMPASS reference app framework into the central CODEX platform, so that they can be used by secondary researchers. Although both projects used the Fast Healthcare Interoperability Resources (FHIR) standard, it was not used in a way that data could be shared directly. Given the short time frame and the parallel developments within the CODEX platform, a pragmatic and robust solution for an interface component was required.

METHODS: We have developed a means to facilitate and promote the use of the German Corona Consensus (GECCO) data set, a core data set for COVID-19 research in Germany. In this way, we ensured semantic interoperability for the app-collected PRO data with the COMPASS app. We also developed an interface component to sustain syntactic interoperability.

RESULTS: The use of different FHIR types by the COMPASS reference app framework (the general-purpose FHIR Questionnaire) and the CODEX platform (eg, Patient, Condition, and Observation) was found to be the most significant obstacle. Therefore, we developed an interface component that realigns the Questionnaire items with the corresponding items in the GECCO data set and provides the correct resources for the CODEX platform. We extended the existing COMPASS questionnaire editor with an import function for GECCO items, which also tags them for the interface component. This ensures syntactic interoperability and eases the reuse of the GECCO data set for researchers.

CONCLUSIONS: This paper shows how PRO data, which are collected across various studies conducted by different researchers, can be captured in a research-compatible way. This means that the data can be shared with a central research infrastructure and be reused by other researchers to gain more insights about COVID-19 and its sequelae.

PMID:38411999 | DOI:10.2196/47846

Categories: Literature Watch

A Semantic Framework to Detect Problems in Activities of Daily Living Monitored through Smart Home Sensors

Sat, 2024-02-24 06:00

Sensors (Basel). 2024 Feb 8;24(4):1107. doi: 10.3390/s24041107.

ABSTRACT

Activities of daily living (ADLs) are fundamental routine tasks that the majority of physically and mentally healthy people can independently execute. In this paper, we present a semantic framework for detecting problems in ADLs execution, monitored through smart home sensors. In the context of this work, we conducted a pilot study, gathering raw data from various sensors and devices installed in a smart home environment. The proposed framework combines multiple Semantic Web technologies (i.e., ontology, RDF, triplestore) to handle and transform these raw data into meaningful representations, forming a knowledge graph. Subsequently, SPARQL queries are used to define and construct explicit rules to detect problematic behaviors in ADL execution, a procedure that leads to generating new implicit knowledge. Finally, all available results are visualized in a clinician dashboard. The proposed framework can monitor the deterioration of ADLs performance for people across the dementia spectrum by offering a comprehensive way for clinicians to describe problematic behaviors in the everyday life of an individual.

PMID:38400265 | DOI:10.3390/s24041107

Categories: Literature Watch

Exploring #MentholBan on TikTok: a Thematic and Semantic Network Analysis

Wed, 2024-02-21 06:00

Nicotine Tob Res. 2024 Feb 21:ntae036. doi: 10.1093/ntr/ntae036. Online ahead of print.

ABSTRACT

INTRODUCTION: In April 2021, the U.S. Food and Drug Administration (FDA) announced its intention to ban the sale of menthol cigarettes and cigars. Decades of research support the premise a menthol ban will reduce initiation and disparities in tobacco-related disease among menthol smokers. The tobacco industry opposed such a policy and worked for decades to shape public opposition. Social media discourse can inform our understanding of public opinion about the proposed ban and guide communication strategies and policy implementation.

METHODS: This research employed a mixed-methods design to explore TikTok posts discussing the announced menthol ban. Using a TikTok web scraper to extract all content in the #mentholban hashtag (n=171), we coded for 11 themes, characterized content with descriptive statistics, and created a semantic network of co-occurring hashtags.

RESULTS: We found primarily negative attitudes towards the US ban announcement and a large volume of menthol "hacks" to circumvent the bans. Our semantic network analysis revealed strong co-occurrences between #mentholban and popularity-seeking hashtags. The metadata associated with each TikTok demonstrated that most posters in #mentholban are not "influencers" in the sense of having many followers, aside from a few niche organizations with multiple posts. We found that perceived political and racial motivations shaped posters' assessments of the menthol ban. Furthermore, we uncovered how individuals and organizational actors shaped menthol ban content on TikTok.

CONCLUSION: Our study indicates targeted marketing from alternative menthol product companies and advocacy organizations. The latter of these organizations is more likely to saturate the TikTok landscape with multiple posts and strategic hashtags.

IMPLICATIONS: This study pursued an exploration of tobacco policy discussion on TikTok, specifically related to the FDA proposed menthol ban. TikTok is newer platform and our study provides early evidence of policy discussion emerging there, including the types of accounts creating the content and their valence toward the policy.

PMID:38381598 | DOI:10.1093/ntr/ntae036

Categories: Literature Watch

Neurocognitive changes at different follow-up times after bilateral subthalamic nucleus deep brain stimulation in patients with Parkinson's disease

Wed, 2024-02-21 06:00

Heliyon. 2024 Feb 10;10(4):e26303. doi: 10.1016/j.heliyon.2024.e26303. eCollection 2024 Feb 29.

ABSTRACT

BACKGROUND: Bilateral deep thalamic nucleus brain stimulation (STN-DBS) surgery is often used to treat the motor symptoms of patients with Parkinson's disease. The change of neurocognitive symptoms in patients is, however, still unclear.

OBJECTIVE: We aimed at analyzing the deterioration of neurocognitive symptoms in patients with Parkinson's disease after deep brain stimulation surgery under different follow-up times.

METHODS: A comprehensive literature review was conducted using Pubmed, Cochrane Library, and Web of Science to screen eligible study records, the meta-analysis was performed using an inverse variance method and a random-effects model. Additionally, the areas of analysis include five: cognition, executive function, memory capacity, and verbal fluency (phonetic fluency and semantic fluency). They were analyzed for changes at six and twelve months postoperatively compared to baseline. The Meta-analysis has been registered with PROSPERO under the registration number: CRD42022308786.

RESULTS: In terms of overall cognitive performance, executive function, and memory capacity, the original studies show a trend of improvement in these areas at 12 months postoperatively compared with 6 months, at variance, patients did not improve or deteriorated in phonetic fluency(d = -0.42 at both 6-month and 12-month follow-up) and semantic fluency from 6 to 12 months postoperatively.

CONCLUSION: In terms of most neurocognitive symptoms, including cognitive ability, executive function, and learning memory capacity, bilateral STN-DBS surgery appears to be safe at relatively long follow-up times. However, postoperative phonetic and semantic fluency changes should still not be underestimated, and clinicians should pay more attention to patients' changes in both.

PMID:38379975 | PMC:PMC10877422 | DOI:10.1016/j.heliyon.2024.e26303

Categories: Literature Watch

Detecting and Grounding Multi-Modal Media Manipulation and Beyond

Tue, 2024-02-20 06:00

IEEE Trans Pattern Anal Mach Intell. 2024 Feb 20;PP. doi: 10.1109/TPAMI.2024.3367749. Online ahead of print.

ABSTRACT

Misinformation has become a pressing issue. Fake media, in both visual and textual forms, is widespread on the web. Whilevarious deepfake detection and text fake news detection methods have been proposed, they are only designed for single-modality forgery based on binary classification, let alone analyzing and reasoning subtle forgery traces across different modalities. In this paper, we highlight a new research problem for multi-modal fake media, namely Detecting and Grounding Multi-Modal Media Manipulation (DGM4). DGM4 aims to not only detect the authenticity of multi-modal media, but also ground the manipulated content, which requires deeper reasoning of multi-modal media manipulation. To support a large-scale investigation, we construct the first DGM4 dataset. Moreover, we propose a novel HierArchical Multi-modal Manipulation rEasoning tRansformer (HAMMER) to fully capture the fine-grained interaction between different modalities. HAMMER performs 1) manipulation-aware contrastive learning between two uni-modal encoders as shallow manipulation reasoning, and 2) modality-aware cross-attention by multi-modal aggregator as deep manipulation reasoning. Dedicated manipulation detection and grounding heads are integrated from shallow to deep levels based on the interacted multi-modal information. To exploit more fine-grained contrastive learning for cross-modal semantic alignment, we further integrate Manipulation-Aware Contrastive Loss with Local View and construct a more advanced model HAMMER++ Finally, we build an extensive benchmark and set up rigorous evaluation metrics for this new research problem. Comprehensive experiments demonstrate the superiority of HAMMER and HAMMER++; several valuable observations are also revealed to facilitate future research in multi-modal media manipulation..

PMID:38376967 | DOI:10.1109/TPAMI.2024.3367749

Categories: Literature Watch

Chinese Title Generation for Short Videos: Dataset, Metric and Algorithm

Wed, 2024-02-14 06:00

IEEE Trans Pattern Anal Mach Intell. 2024 Feb 14;PP. doi: 10.1109/TPAMI.2024.3365739. Online ahead of print.

ABSTRACT

Previous work for video captioning aims to objectively describe the video content but the captions lack human interest and attractiveness, limiting its practical application scenarios. The intention of video title generation (video titling) is to produce attractive titles, but there is a lack of benchmarks. This work offers CREATE, the first large-scale Chinese shoRt vidEo retrievAl and Title gEneration dataset, to assist research and applications in video titling, video captioning, and video retrieval in Chinese. CREATE comprises a high-quality labeled 210K dataset and two web-scale 3M and 10M pre-training datasets, covering 51 categories, 50K+ tags, 537K+ manually annotated titles and captions, and 10M+ short videos with original video information. This work presents ACTEr, a unique Attractiveness-Consensus-based Title Evaluation, to objectively evaluate the quality of video title generation. This metric measures the semantic correlation between the candidate (model-generated title) and references (manual-labeled titles) and introduces attractive consensus weights to assess the attractiveness and relevance of the video title. Accordingly, this work proposes a novel multi-modal ALignment WIth Generation model, ALWIG, as one strong baseline to aid future model development. With the help of a tag-driven video-text alignment module and a GPT-based generation module, this model achieves video titling, captioning, and retrieval simultaneously. We believe that the release of the CREATE dataset, ACTEr metric, and ALWIG model will encourage in-depth research on the analysis and creation of Chinese short videos. Project webpage: https://createbenchmark.github.io/.

PMID:38354071 | DOI:10.1109/TPAMI.2024.3365739

Categories: Literature Watch

Analysis of the impact of COVID-19 on Scotland's care-homes from March 2020 to October 2021: national linked data cohort analysis

Sun, 2024-02-11 06:00

Age Ageing. 2024 Feb 1;53(2):afae015. doi: 10.1093/ageing/afae015.

ABSTRACT

BACKGROUND: The impact of the COVID-19 pandemic on long-term care residents remains of wide interest, but most analyses focus on the initial wave of infections.

OBJECTIVE: To examine change over time in: (i) The size, duration, classification and pattern of care-home outbreaks of COVID-19 and associated mortality and (ii) characteristics associated with an outbreak.

DESIGN: Retrospective observational cohort study using routinely-collected data.

SETTING: All adult care-homes in Scotland (1,092 homes, 41,299 places).

METHODS: Analysis was undertaken at care-home level, over three periods. Period (P)1 01/03/2020-31/08/2020; P2 01/09/2020-31/05/2021 and P3 01/06/2021-31/10/2021. Outcomes were the presence and characteristics of outbreaks and mortality within the care-home. Cluster analysis was used to compare the pattern of outbreaks. Logistic regression examined care-home characteristics associated with outbreaks.

RESULTS: In total 296 (27.1%) care-homes had one outbreak, 220 (20.1%) had two, 91 (8.3%) had three, and 68 (6.2%) had four or more. There were 1,313 outbreaks involving residents: 431 outbreaks in P1, 559 in P2 and 323 in P3. The COVID-19 mortality rate per 1,000 beds fell from 45.8 in P1, to 29.3 in P2, and 3.5 in P3. Larger care-homes were much more likely to have an outbreak, but associations between size and outbreaks were weaker in later periods.

CONCLUSIONS: COVID-19 mitigation measures appear to have been beneficial, although the impact on residents remained severe until early 2021. Care-home residents, staff, relatives and providers are critical groups for consideration and involvement in future pandemic planning.

PMID:38342752 | PMC:PMC10859243 | DOI:10.1093/ageing/afae015

Categories: Literature Watch

Sniffing out meaning: Chemosensory and semantic neural network changes in sommeliers

Sat, 2024-02-10 06:00

Hum Brain Mapp. 2024 Feb 1;45(2):e26564. doi: 10.1002/hbm.26564.

ABSTRACT

Wine tasting is a very complex process that integrates a combination of sensation, language, and memory. Taste and smell provide perceptual information that, together with the semantic narrative that converts flavor into words, seem to be processed differently between sommeliers and naïve wine consumers. We investigate whether sommeliers' wine experience shapes only chemosensory processing, as has been previously demonstrated, or if it also modulates the way in which the taste and olfactory circuits interact with the semantic network. Combining diffusion-weighted images and fMRI (activation and connectivity) we investigated whether brain response to tasting wine differs between sommeliers and nonexperts (1) in the sensory neural circuits representing flavor and/or (2) in the neural circuits for language and memory. We demonstrate that training in wine tasting shapes the microstructure of the left and right superior longitudinal fasciculus. Using mediation analysis, we showed that the experience modulates the relationship between fractional anisotropy and behavior: the higher the fractional anisotropy the higher the capacity to recognize wine complexity. In addition, we found functional differences between sommeliers and naïve consumers affecting the flavor sensory circuit, but also regions involved in semantic operations. The former reflects a capacity for differential sensory processing, while the latter reflects sommeliers' ability to attend to relevant sensory inputs and translate them into complex verbal descriptions. The enhanced synchronization between these apparently independent circuits suggests that sommeliers integrated these descriptions with previous semantic knowledge to optimize their capacity to distinguish between subtle differences in the qualitative character of the wine.

PMID:38339911 | PMC:PMC10823763 | DOI:10.1002/hbm.26564

Categories: Literature Watch

Exploring biodiversity and ethnobotanical significance of <em>Solanum</em> species in Uzbekistan: unveiling the cultural wealth and ethnopharmacological uses

Fri, 2024-02-09 06:00

Front Pharmacol. 2024 Jan 24;14:1287793. doi: 10.3389/fphar.2023.1287793. eCollection 2023.

ABSTRACT

Despite its millennial existence and empirical documentation, the ethnological knowledge of herbs is a more recent phenomenon. The knowledge of their historical uses as food, medicine, source of income and small-scale businesses, and the sociological impacts are threatened due to the slow ethnobotanical research drive. Species of the genus Solanum have long been extensively used in folk medicine to treat various illnesses of humans since the dawn of civilization. All data were systematically obtained from papers, monographs, and books written in Uzbek, Russian, and English through various scientific online databases, including Google, Google Scholar, PubMed, Scopus, Semantic Scholar, Science Direct, and Web of Science using specific keywords focused on eight Solanum species. Eight native and non-native Solanum species as S. dulcamara L., S. lycopersicum L., S. melongena L., S. nigrum L., S. rostratum Dunal., S. sisymbriifolium Lam., S. tuberosum L., and S. villosum Mill. have been recorded in Uzbekistan of Central Asia. In this article we presented recently obtained data on the diversity, morphological characteristics, global distribution, habitat, population status, phenology, reproduction, pharmacology and phytochemistry of these Solanum species in Uzbekistan. Furthermore, relying on a combination of literature reviews and analyses from various scientific papers, we focus on food consumption coupled with global ethnobotanical and ethnopharmacological uses in human diseases of the Solanum species growing in Uzbekistan. Since the dawn of civilization, these eight cultivated and non-cultivated species of Solanum have provided sustainable resources of medicinal plants in Uzbekistan to prevent and treat various human diseases. Based on the collected data, it was shown that Solanum species have not been studied ethnobotanically and ethnomedicinally in Uzbekistan and it is necessary to conduct phytochemical and biotechnological research on them in the future. Traditional uses and scientific evaluation of Solanum indicate that S. nigrum, S. sisymbriifolium and S. tuberosum are one of the most widely used species in some parts of the world. Although considerable progress has been made to comprehend the chemical and biological properties of S. nigrum and S. tuberosum species, more research on the pharmacology and toxicology of these species is needed to ensure the safety, efficacy, and quality of their biologically active extracts and isolated bioactive compounds. Additionally, conducting additional research on the structure-activity relationship of certain isolated phytochemicals has the potential to enhance their biological efficacy and advance the scientific utilization of traditional applications of Solanum taxa.

PMID:38333226 | PMC:PMC10851437 | DOI:10.3389/fphar.2023.1287793

Categories: Literature Watch

Application of an ontology for model cards to generate computable artifacts for linking machine learning information from biomedical research

Thu, 2024-02-08 06:00

Proc Int World Wide Web Conf. 2023 Apr;2023(Companion):820-825. doi: 10.1145/3543873.3587601. Epub 2023 Apr 30.

ABSTRACT

Model card reports provide a transparent description of machine learning models which includes information about their evaluation, limitations, intended use, etc. Federal health agencies have expressed an interest in model cards report for research studies using machine-learning based AI. Previously, we have developed an ontology model for model card reports to structure and formalize these reports. In this paper, we demonstrate a Java-based library (OWL API, FaCT++) that leverages our ontology to publish computable model card reports. We discuss future directions and other use cases that highlight applicability and feasibility of ontology-driven systems to support FAIR challenges.

PMID:38327770 | PMC:PMC10848146 | DOI:10.1145/3543873.3587601

Categories: Literature Watch

Ontologies4Cat: investigating the landscape of ontologies for catalysis research data management

Wed, 2024-02-07 06:00

J Cheminform. 2024 Feb 7;16(1):16. doi: 10.1186/s13321-024-00807-2.

ABSTRACT

As scientific digitization advances it is imperative ensuring data is Findable, Accessible, Interoperable, and Reusable (FAIR) for machine-processable data. Ontologies play a vital role in enhancing data FAIRness by explicitly representing knowledge in a machine-understandable format. Research data in catalysis research often exhibits complexity and diversity, necessitating a respectively broad collection of ontologies. While ontology portals such as EBI OLS and BioPortal aid in ontology discovery, they lack deep classification, while quality metrics for ontology reusability and domains are absent for the domain of catalysis research. Thus, this work provides an approach for systematic collection of ontology metadata with focus on the catalysis research data value chain. By classifying ontologies by subdomains of catalysis research, the approach is offering efficient comparison across ontologies. Furthermore, a workflow and codebase is presented, facilitating representation of the metadata on GitHub. Finally, a method is presented to automatically map the classes contained in the ontologies of the metadata collection against each other, providing further insights on relatedness of the ontologies listed. The presented methodology is designed for its reusability, enabling its adaptation to other ontology collections or domains of knowledge. The ontology metadata taken up for this work and the code developed and described in this work are available in a GitHub repository at: https://github.com/nfdi4cat/Ontology-Overview-of-NFDI4Cat .

PMID:38326906 | PMC:PMC10851519 | DOI:10.1186/s13321-024-00807-2

Categories: Literature Watch

The organization of the semantic network as reflected by the neural correlates of six semantic dimensions

Wed, 2024-01-31 06:00

Brain Lang. 2024 Mar;250:105388. doi: 10.1016/j.bandl.2024.105388. Epub 2024 Jan 31.

ABSTRACT

Multiple sensory-motor and non-sensory-motor dimensions have been proposed for semantic representation, but it remains unclear how the semantic system is organized along them in the human brain. Using naturalistic fMRI data and large-scale semantic ratings, we investigated the overlaps and dissociations between the neural correlates of six semantic dimensions: vision, motor, socialness, emotion, space, and time. Our findings revealed a more complex semantic atlas than what is predicted by current neurobiological models of semantic representation. Brain regions that are selectively sensitive to specific semantic dimensions were found both within and outside the brain networks assumed to represent multimodal general and/or abstract semantics. Overlaps between the neural correlates of different semantic dimensions were mainly found inside the default mode network, concentrated in the left anterior superior temporal gyrus and angular gyrus, which have been proposed as two connector hubs that bridge the multimodal experiential semantic system and the language-supported semantic system.

PMID:38295716 | DOI:10.1016/j.bandl.2024.105388

Categories: Literature Watch

Pages