Deep learning
Higher blood biochemistry-based biological age developed by advanced deep learning techniques is associated with frailty in geriatric rehabilitation inpatients: RESORT
Exp Gerontol. 2024 Apr 6:112421. doi: 10.1016/j.exger.2024.112421. Online ahead of print.
ABSTRACT
BACKGROUND: Accelerated biological ageing is a major underlying mechanism of frailty development. This study aimed to investigate if the biological age measured by a blood biochemistry-based ageing clock is associated with frailty in geriatric rehabilitation inpatients.
METHODS: Within the REStORing health of acutely unwell adulTs (RESORT) cohort, patients' biological age was measured by an ageing clock based on completed data of 30 routine blood test variables measured at rehabilitation admission. The delta of biological age minus chronological age (years) was calculated. Ordinal logistic regression and multinomial logistic regression were performed to evaluate the association of the delta of ages with frailty assessed by the Clinical Frailty Scale. Effect modification of Cumulative Illness Rating Scale (CIRS) score was tested.
RESULTS: A total of 1187 geriatric rehabilitation patients were included (median age: 83.4 years, IQR: 77.7-88.5; 57.4 % female). The biochemistry-based biological age was strongly correlated with chronological age (Spearman r = 0.883). After adjustment for age, sex and primary reasons for acute admission, higher biological age (per 1 year higher in delta of ages) was associated with more severe frailty at admission (OR: 1.053, 95 % CI:1.012-1.096) in patients who had a CIRS score of <12 not in patients with a CIRS score >12. The delta of ages was not associated with frailty change from admission to discharge. The specific frailty manifestations as cardiac, hematological, respiratory, renal, and endocrine conditions were associated with higher biological age.
CONCLUSION: Higher biological age was associated with severe frailty in geriatric rehabilitation inpatients with less comorbidity burden.
PMID:38588752 | DOI:10.1016/j.exger.2024.112421
CBCT-DRRs superior to CT-DRRs for target-tracking applications for pancreatic SBRT
Biomed Phys Eng Express. 2024 Apr 8. doi: 10.1088/2057-1976/ad3bb9. Online ahead of print.
ABSTRACT
In current radiograph-based intra-fraction markerless target-tracking, digitally reconstructed radiographs (DRRs) from planning CTs (CT-DRRs) are often used to train deep learning models that extract information from the intra-fraction radiographs acquired during treatment. Traditional DRR algorithms were designed for patient alignment (i.e. bone matching) and may not replicate the radiographic image quality of intra-fraction radiographs at treatment. Hypothetically, generating DRRs from pre-treatment Cone-Beam CTs (CBCT-DRRs) with DRR algorithms incorporating physical modelling of on-board-imagers (OBIs) could improve the similarity between intra-fraction radiographs and DRRs by eliminating inter-fraction variation and reducing image-quality mismatches between radiographs and DRRs. In this study, we test the two hypotheses that intra-fraction radiographs are more similar to CBCT-DRRs than CT-DRRs, and that intra-fraction radiographs are more similar to DRRs from algorithms incorporating physical models of OBI components than DRRs from algorithms omitting these models.

Main results: Intra-fraction radiographs were more similar to CBCT-DRRs than CT-DRRs for both metrics across all algorithms, with all p<0.007. Source-spectrum modelling improved radiograph-DRR similarity for both metrics, with all p<1E-6. OBI detector modelling and patient material modelling did not influence radiograph-DRR similarity for either metric.

Significance: Generating DRRs from pre-treatment CBCT-DRRs is feasible, and incorporating CBCT-DRRs into markerless target-tracking methods may promote improved target-tracking accuracies. Incorporating source-spectrum modelling into a treatment planning system's DRR algorithms may reinforce the safe treatment of cancer patients by aiding in patient alignment.
PMID:38588646 | DOI:10.1088/2057-1976/ad3bb9
Prediction of protein N-terminal acetylation modification sites based on CNN-BiLSTM-attention model
Comput Biol Med. 2024 Mar 21;174:108330. doi: 10.1016/j.compbiomed.2024.108330. Online ahead of print.
ABSTRACT
N-terminal acetylation is one of the most common and important post-translational modifications (PTM) of eukaryotic proteins. PTM plays a crucial role in various cellular processes and disease pathogenesis. Thus, the accurate identification of N-terminal acetylation modifications is important to gain insight into cellular processes and other possible functional mechanisms. Although some algorithmic models have been proposed, most have been developed based on traditional machine learning algorithms and small training datasets. Their practical applications are limited. Nevertheless, deep learning algorithmic models are better at handling high-throughput and complex data. In this study, DeepCBA, a model based on the hybrid framework of convolutional neural network (CNN), bidirectional long short-term memory network (BiLSTM), and attention mechanism deep learning, was constructed to detect the N-terminal acetylation sites. The DeepCBA was built as follows: First, a benchmark dataset was generated by selecting low-redundant protein sequences from the Uniport database and further reducing the redundancy of the protein sequences using the CD-HIT tool. Subsequently, based on the skip-gram model in the word2vec algorithm, tripeptide word vector features were generated on the benchmark dataset. Finally, the CNN, BiLSTM, and attention mechanism were combined, and the tripeptide word vector features were fed into the stacked model for multiple rounds of training. The model performed excellently on independent dataset test, with accuracy and area under the curve of 80.51% and 87.36%, respectively. Altogether, DeepCBA achieved superior performance compared with the baseline model, and significantly outperformed most existing predictors. Additionally, our model can be used to identify disease loci and drug targets.
PMID:38588617 | DOI:10.1016/j.compbiomed.2024.108330
AI-based support for optical coherence tomography in age-related macular degeneration
Int J Retina Vitreous. 2024 Apr 8;10(1):31. doi: 10.1186/s40942-024-00549-1.
ABSTRACT
Artificial intelligence (AI) has emerged as a transformative technology across various fields, and its applications in the medical domain, particularly in ophthalmology, has gained significant attention. The vast amount of high-resolution image data, such as optical coherence tomography (OCT) images, has been a driving force behind AI growth in this field. Age-related macular degeneration (AMD) is one of the leading causes for blindness in the world, affecting approximately 196 million people worldwide in 2020. Multimodal imaging has been for a long time the gold standard for diagnosing patients with AMD, however, currently treatment and follow-up in routine disease management are mainly driven by OCT imaging. AI-based algorithms have by their precision, reproducibility and speed, the potential to reliably quantify biomarkers, predict disease progression and assist treatment decisions in clinical routine as well as academic studies. This review paper aims to provide a summary of the current state of AI in AMD, focusing on its applications, challenges, and prospects.
PMID:38589936 | DOI:10.1186/s40942-024-00549-1
Deep transfer learning with fuzzy ensemble approach for the early detection of breast cancer
BMC Med Imaging. 2024 Apr 8;24(1):82. doi: 10.1186/s12880-024-01267-8.
ABSTRACT
Breast Cancer is a significant global health challenge, particularly affecting women with higher mortality compared with other cancer types. Timely detection of such cancer types is crucial, and recent research, employing deep learning techniques, shows promise in earlier detection. The research focuses on the early detection of such tumors using mammogram images with deep-learning models. The paper utilized four public databases where a similar amount of 986 mammograms each for three classes (normal, benign, malignant) are taken for evaluation. Herein, three deep CNN models such as VGG-11, Inception v3, and ResNet50 are employed as base classifiers. The research adopts an ensemble method where the proposed approach makes use of the modified Gompertz function for building a fuzzy ranking of the base classification models and their decision scores are integrated in an adaptive manner for constructing the final prediction of results. The classification results of the proposed fuzzy ensemble approach outperform transfer learning models and other ensemble approaches such as weighted average and Sugeno integral techniques. The proposed ResNet50 ensemble network using the modified Gompertz function-based fuzzy ranking approach provides a superior classification accuracy of 98.986%.
PMID:38589813 | DOI:10.1186/s12880-024-01267-8
Improving the performance of supervised deep learning for regulatory genomics using phylogenetic augmentation
Bioinformatics. 2024 Apr 8:btae190. doi: 10.1093/bioinformatics/btae190. Online ahead of print.
ABSTRACT
MOTIVATION: Supervised deep learning is used to model the complex relationship between genomic sequence and regulatory function. Understanding how these models make predictions can provide biological insight into regulatory functions. Given the complexity of the sequence to regulatory function mapping (the cis-regulatory code), it has been suggested that the genome contains insufficient sequence variation to train models with suitable complexity. Data augmentation is a widely used approach to increase the data variation available for model training, however current data augmentation methods for genomic sequence data are limited.
RESULTS: Inspired by the success of comparative genomics, we show that augmenting genomic sequences with evolutionarily related sequences from other species, which we term phylogenetic augmentation, improves the performance of deep learning models trained on regulatory genomic sequences to predict high-throughput functional assay measurements. Additionally, we show that phylogenetic augmentation can rescue model performance when the training set is down-sampled and permits deep learning on a real-world small dataset, demonstrating that this approach improves data efficiency. Overall, this data augmentation method represents a solution for improving model performance that is applicable to many supervised deep learning problems in genomics.
AVAILABILITY: The open-source GitHub repository agduncan94/phylogenetic_augmentation_paper includes the code for rerunning the analyses here and recreating the figures.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
PMID:38588559 | DOI:10.1093/bioinformatics/btae190
Association Between Body Composition and Survival in Patients With Gastroesophageal Adenocarcinoma: An Automated Deep Learning Approach
JCO Clin Cancer Inform. 2024 Apr;8:e2300231. doi: 10.1200/CCI.23.00231.
ABSTRACT
PURPOSE: Body composition (BC) may play a role in outcome prognostication in patients with gastroesophageal adenocarcinoma (GEAC). Artificial intelligence provides new possibilities to opportunistically quantify BC from computed tomography (CT) scans. We developed a deep learning (DL) model for fully automatic BC quantification on routine staging CTs and determined its prognostic role in a clinical cohort of patients with GEAC.
MATERIALS AND METHODS: We developed and tested a DL model to quantify BC measures defined as subcutaneous and visceral adipose tissue (VAT) and skeletal muscle on routine CT and investigated their prognostic value in a cohort of patients with GEAC using baseline, 3-6-month, and 6-12-month postoperative CTs. Primary outcome was all-cause mortality, and secondary outcome was disease-free survival (DFS). Cox regression assessed the association between (1) BC at baseline and mortality and (2) the decrease in BC between baseline and follow-up scans and mortality/DFS.
RESULTS: Model performance was high with Dice coefficients ≥0.94 ± 0.06. Among 299 patients with GEAC (age 63.0 ± 10.7 years; 19.4% female), 140 deaths (47%) occurred over a median follow-up of 31.3 months. At baseline, no BC measure was associated with DFS. Only a substantial decrease in VAT >70% after a 6- to 12-month follow-up was associated with mortality (hazard ratio [HR], 1.99 [95% CI, 1.18 to 3.34]; P = .009) and DFS (HR, 1.73 [95% CI, 1.01 to 2.95]; P = .045) independent of age, sex, BMI, Union for International Cancer Control stage, histologic grading, resection status, neoadjuvant therapy, and time between surgery and follow-up CT.
CONCLUSION: DL enables opportunistic estimation of BC from routine staging CT to quantify prognostic information. In patients with GEAC, only a substantial decrease of VAT 6-12 months postsurgery was an independent predictor for DFS beyond traditional risk factors, which may help to identify individuals at high risk who go otherwise unnoticed.
PMID:38588476 | DOI:10.1200/CCI.23.00231
Artificial intelligence and natural product research
Nat Prod Res. 2024 Apr 8:1-3. doi: 10.1080/14786419.2024.2333048. Online ahead of print.
NO ABSTRACT
PMID:38588438 | DOI:10.1080/14786419.2024.2333048
How Artificial Intelligence Unravels the Complex Web of Cancer Drug Response
Cancer Res. 2024 Apr 8. doi: 10.1158/0008-5472.CAN-24-1123. Online ahead of print.
ABSTRACT
The intersection of precision medicine and artificial intelligence (AI) holds profound implications for cancer treatment, with the potential to significantly advance our understanding of drug responses based on the intricate architecture of tumor cells. A recent study by Park and colleagues titled "A deep learning model of tumor cell architecture elucidates response and resistance to CDK4/6 inhibitors," epitomizes this intersection by leveraging an interpretable deep learning model grounded in a comprehensive map of multiprotein assemblies in cancer, known as Nested Systems in Tumors (NeST). This study not only elucidates mechanisms underlying the response to CDK4/6 inhibitors in breast cancer therapy but also highlights the critical role of model interpretability leading to new mechanistic insights.
PMID:38588311 | DOI:10.1158/0008-5472.CAN-24-1123
Statistical Machine Learning for Power Flow Analysis Considering the Influence of Weather Factors on Photovoltaic Power Generation
IEEE Trans Neural Netw Learn Syst. 2024 Apr 8;PP. doi: 10.1109/TNNLS.2024.3382763. Online ahead of print.
ABSTRACT
It is generally accepted that the impact of weather variation is gradually increasing in modern distribution networks with the integration of high-proportion photovoltaic (PV) power generation and weather-sensitive loads. This article analyzes power flow using a novel stochastic weather generator (SWG) based on statistical machine learning (SML). The proposed SML model, which incorporates generative adversarial networks (GANs), probability theory, and information theory, enables the generation and evaluation of simulated hourly weather data throughout the year. The GAN model captures various weather variation characteristics, including weather uncertainties, diurnal variations, and seasonal patterns. Compared to shallow learning models, the proposed deep learning model exhibits significant advantages in stochastic weather simulation. The simulated data generated by the proposed model closely resemble real data in terms of time-series regularity, integrity, and stochasticity. The SWG is applied to model PV power generation and weather-sensitive loads. Then, we actively conduct a power flow analysis (PFA) on a real distribution network in Guangdong, China, using simulated data for an entire year. The results provide evidence that the GAN-based SWG surpasses the shallow machine learning approach in terms of accuracy. The proposed model ensures accurate analysis of weather-related power flow and provides valuable insights for the analysis, planning, and design of distribution networks.
PMID:38587954 | DOI:10.1109/TNNLS.2024.3382763
Dual-Channel Prototype Network for Few-Shot Pathology Image Classification
IEEE J Biomed Health Inform. 2024 Apr 8;PP. doi: 10.1109/JBHI.2024.3386197. Online ahead of print.
ABSTRACT
In the field of pathology, the scarcity of certain diseases and the difficulty of annotating images hinder the development of large, high-quality datasets, which in turn affects the advancement of deep learning-assisted diagnostics. Few-shot learning has demonstrated unique advantages in modeling tasks with limited data, yet explorations of this method in the field of pathology remain in the early stages. To address this issue, we present a dual-channel prototype network (DCPN), a novel few-shot learning approach for efficiently classifying pathology images with limited data. The DCPN leverages self-supervised learning to extend the pyramid vision transformer (PVT) to few-shot classification tasks and combines it with a convolutional neural network to construct a dual-channel network for extracting multi-scale, high-precision pathological features, thereby substantially enhancing the generalizability of prototype representations. Additionally, we design a soft voting classifier based on multi-scale features to further augment the discriminative power of the model in complex pathology image classification tasks. We constructed three few-shot classification tasks with varying degrees of domain shift using three publicly available pathological datasets-CRCTP, NCTCRC, and LC25000-to emulate real-world clinical scenarios. The results demonstrated that the DCPN outperformed the prototypical network across all metrics, achieving the highest accuracies in same-domain tasks-70.86% for 1-shot, 82.57% for 5-shot, and 85.2% for 10-shot setups-corresponding to improvements of 5.51%, 5.72%, and 6.81%, respectively, over the prototypical network. Notably, in the same-domain 10-shot setting, the accuracy of the DCPN (85.2%) surpassed that of the PVT-based supervised learning model (85.15%), confirming its potential to diagnose rare diseases within few-shot learning frameworks.
PMID:38587946 | DOI:10.1109/JBHI.2024.3386197
DeepP450: Predicting Human P450 Activities of Small Molecules by Integrating Pretrained Protein Language Model and Molecular Representation
J Chem Inf Model. 2024 Apr 8. doi: 10.1021/acs.jcim.4c00115. Online ahead of print.
ABSTRACT
Cytochrome P450 enzymes (CYPs) play a crucial role in Phase I drug metabolism in the human body, and CYP activity toward compounds can significantly affect druggability, making early prediction of CYP activity and substrate identification essential for therapeutic development. Here, we established a deep learning model for assessing potential CYP substrates, DeepP450, by fine-tuning protein and molecule pretrained models through feature integration with cross-attention and self-attention layers. This model exhibited high prediction accuracy (0.92) on the test set, with area under the receiver operating characteristic curve (AUROC) values ranging from 0.89 to 0.98 in substrate/nonsubstrate predictions across the nine major human CYPs, surpassing current benchmarks for CYP activity prediction. Notably, DeepP450 uses only one model to predict substrates/nonsubstrates for any of the nine CYPs and exhibits certain generalizability on novel compounds and different categories of human CYPs, which could greatly facilitate early stage drug design by avoiding CYP-reactive compounds.
PMID:38587937 | DOI:10.1021/acs.jcim.4c00115
Automated stratification of trauma injury severity across multiple body regions using multi-modal, multi-class machine learning models
J Am Med Inform Assoc. 2024 Apr 8:ocae071. doi: 10.1093/jamia/ocae071. Online ahead of print.
ABSTRACT
OBJECTIVE: The timely stratification of trauma injury severity can enhance the quality of trauma care but it requires intense manual annotation from certified trauma coders. The objective of this study is to develop machine learning models for the stratification of trauma injury severity across various body regions using clinical text and structured electronic health records (EHRs) data.
MATERIALS AND METHODS: Our study utilized clinical documents and structured EHR variables linked with the trauma registry data to create 2 machine learning models with different approaches to representing text. The first one fuses concept unique identifiers (CUIs) extracted from free text with structured EHR variables, while the second one integrates free text with structured EHR variables. Temporal validation was undertaken to ensure the models' temporal generalizability. Additionally, analyses to assess the variable importance were conducted.
RESULTS: Both models demonstrated impressive performance in categorizing leg injuries, achieving high accuracy with macro-F1 scores of over 0.8. Additionally, they showed considerable accuracy, with macro-F1 scores exceeding or near 0.7, in assessing injuries in the areas of the chest and head. We showed in our variable importance analysis that the most important features in the model have strong face validity in determining clinically relevant trauma injuries.
DISCUSSION: The CUI-based model achieves comparable performance, if not higher, compared to the free-text-based model, with reduced complexity. Furthermore, integrating structured EHR data improves performance, particularly when the text modalities are insufficiently indicative.
CONCLUSIONS: Our multi-modal, multiclass models can provide accurate stratification of trauma injury severity and clinically relevant interpretations.
PMID:38587875 | DOI:10.1093/jamia/ocae071
An Automated Deep Learning-Based Framework for Uptake Segmentation and Classification on PSMA PET/CT Imaging of Patients with Prostate Cancer
J Imaging Inform Med. 2024 Apr 8. doi: 10.1007/s10278-024-01104-y. Online ahead of print.
ABSTRACT
Uptake segmentation and classification on PSMA PET/CT are important for automating whole-body tumor burden determinations. We developed and evaluated an automated deep learning (DL)-based framework that segments and classifies uptake on PSMA PET/CT. We identified 193 [18F] DCFPyL PET/CT scans of patients with biochemically recurrent prostate cancer from two institutions, including 137 [18F] DCFPyL PET/CT scans for training and internally testing, and 56 scans from another institution for external testing. Two radiologists segmented and labelled foci as suspicious or non-suspicious for malignancy. A DL-based segmentation was developed with two independent CNNs. An anatomical prior guidance was applied to make the DL framework focus on PSMA-avid lesions. Segmentation performance was evaluated by Dice, IoU, precision, and recall. Classification model was constructed with multi-modal decision fusion framework evaluated by accuracy, AUC, F1 score, precision, and recall. Automatic segmentation of suspicious lesions was improved under prior guidance, with mean Dice, IoU, precision, and recall of 0.700, 0.566, 0.809, and 0.660 on the internal test set and 0.680, 0.548, 0.749, and 0.740 on the external test set. Our multi-modal decision fusion framework outperformed single-modal and multi-modal CNNs with accuracy, AUC, F1 score, precision, and recall of 0.764, 0.863, 0.844, 0.841, and 0.847 in distinguishing suspicious and non-suspicious foci on the internal test set and 0.796, 0.851, 0.865, 0.814, and 0.923 on the external test set. DL-based lesion segmentation on PSMA PET is facilitated through our anatomical prior guidance strategy. Our classification framework differentiates suspicious foci from those not suspicious for cancer with good accuracy.
PMID:38587770 | DOI:10.1007/s10278-024-01104-y
Automated Pulmonary Tuberculosis Severity Assessment on Chest X-rays
J Imaging Inform Med. 2024 Apr 8. doi: 10.1007/s10278-024-01052-7. Online ahead of print.
ABSTRACT
According to the 2022 World Health Organization's Global Tuberculosis (TB) report, an estimated 10.6 million people fell ill with TB, and 1.6 million died from the disease in 2021. In addition, 2021 saw a reversal of a decades-long trend of declining TB infections and deaths, with an estimated increase of 4.5% in the number of people who fell ill with TB compared to 2020, and an estimated yearly increase of 450,000 cases of drug resistant TB. Estimating the severity of pulmonary TB using frontal chest X-rays (CXR) can enable better resource allocation in resource constrained settings and monitoring of treatment response, enabling prompt treatment modifications if disease severity does not decrease over time. The Timika score is a clinically used TB severity score based on a CXR reading. This work proposes and evaluates three deep learning-based approaches for predicting the Timika score with varying levels of explainability. The first approach uses two deep learning-based models, one to explicitly detect lesion regions using YOLOV5n and another to predict the presence of cavitation using DenseNet121, which are then utilized in score calculation. The second approach uses a DenseNet121-based regression model to directly predict the affected lung percentage and another to predict cavitation presence using a DenseNet121-based classification model. Finally, the third approach directly predicts the Timika score using a DenseNet121-based regression model. The best performance is achieved by the second approach with a mean absolute error of 13-14% and a Pearson correlation of 0.7-0.84 using three held-out datasets for evaluating generalization.
PMID:38587769 | DOI:10.1007/s10278-024-01052-7
A Classification-Based Adaptive Segmentation Pipeline: Feasibility Study Using Polycystic Liver Disease and Metastases from Colorectal Cancer CT Images
J Imaging Inform Med. 2024 Apr 8. doi: 10.1007/s10278-024-01072-3. Online ahead of print.
ABSTRACT
Automated segmentation tools often encounter accuracy and adaptability issues when applied to images of different pathology. The purpose of this study is to explore the feasibility of building a workflow to efficiently route images to specifically trained segmentation models. By implementing a deep learning classifier to automatically classify the images and route them to appropriate segmentation models, we hope that our workflow can segment the images with different pathology accurately. The data we used in this study are 350 CT images from patients affected by polycystic liver disease and 350 CT images from patients presenting with liver metastases from colorectal cancer. All images had the liver manually segmented by trained imaging analysts. Our proposed adaptive segmentation workflow achieved a statistically significant improvement for the task of total liver segmentation compared to the generic single-segmentation model (non-parametric Wilcoxon signed rank test, n = 100, p-value << 0.001). This approach is applicable in a wide range of scenarios and should prove useful in clinical implementations of segmentation pipelines.
PMID:38587766 | DOI:10.1007/s10278-024-01072-3
Modeling Zinc Complexes Using Neural Networks
J Chem Inf Model. 2024 Apr 8. doi: 10.1021/acs.jcim.4c00095. Online ahead of print.
ABSTRACT
Understanding the energetic landscapes of large molecules is necessary for the study of chemical and biological systems. Recently, deep learning has greatly accelerated the development of models based on quantum chemistry, making it possible to build potential energy surfaces and explore chemical space. However, most of this work has focused on organic molecules due to the simplicity of their electronic structures as well as the availability of data sets. In this work, we build a deep learning architecture to model the energetics of zinc organometallic complexes. To achieve this, we have compiled a configurationally and conformationally diverse data set of zinc complexes using metadynamics to overcome the limitations of traditional sampling methods. In terms of the neural network potentials, our results indicate that for zinc complexes, partial charges play an important role in modeling the long-range interactions with a neural network. Our developed model outperforms semiempirical methods in predicting the relative energy of zinc conformers, yielding a mean absolute error (MAE) of 1.32 kcal/mol with reference to the double-hybrid PWPB95 method.
PMID:38587510 | DOI:10.1021/acs.jcim.4c00095
Influence of training and expertise on deep neural network attention and human attention during a medical image classification task
J Vis. 2024 Apr 1;24(4):6. doi: 10.1167/jov.24.4.6.
ABSTRACT
In many different domains, experts can make complex decisions after glancing very briefly at an image. However, the perceptual mechanisms underlying expert performance are still largely unknown. Recently, several machine learning algorithms have been shown to outperform human experts in specific tasks. But these algorithms often behave as black boxes and their information processing pipeline remains unknown. This lack of transparency and interpretability is highly problematic in applications involving human lives, such as health care. One way to "open the black box" is to compute an artificial attention map from the model, which highlights the pixels of the input image that contributed the most to the model decision. In this work, we directly compare human visual attention to machine visual attention when performing the same visual task. We have designed a medical diagnosis task involving the detection of lesions in small bowel endoscopic images. We collected eye movements from novices and gastroenterologist experts while they classified medical images according to their relevance for Crohn's disease diagnosis. We trained three state-of-the-art deep learning models on our carefully labeled dataset. Both humans and machine performed the same task. We extracted artificial attention with six different post hoc methods. We show that the model attention maps are significantly closer to human expert attention maps than to novices', especially for pathological images. As the model gets trained and its performance gets closer to the human experts, the similarity between model and human attention increases. Through the understanding of the similarities between the visual decision-making process of human experts and deep neural networks, we hope to inform both the training of new doctors and the architecture of new algorithms.
PMID:38587421 | DOI:10.1167/jov.24.4.6
Associations of intra-pancreatic fat deposition with incident diseases of the exocrine and endocrine pancreas: A UK Biobank prospective cohort study
Am J Gastroenterol. 2024 Apr 8. doi: 10.14309/ajg.0000000000002792. Online ahead of print.
ABSTRACT
OBJECTIVE: Investigate whether increased IPFD heightens the risk of diseases of the exocrine and endocrine pancreas.
METHODS: A prospective cohort study was conducted using data from the UK Biobank. IPFD was quantified using MRI and a deep learning-based framework called nnUNet. The prevalence of fatty change of the pancreas (FP) was determined using gender- and age-specific thresholds. Associations between IPFD and pancreatic diseases were assessed with multivariate Cox proportional hazard model adjusted for age, sex, ethnicity, body mass index, smoking and drinking status, central obesity, hypertension, dyslipidemia, liver fat content, and spleen fat content.
RESULTS: Of the 42,599 participants included in the analysis, the prevalence of FP was 17.86%. Elevated IPFD levels were associated with an increased risk of acute pancreatitis (AP) (HR per one quintile change [95%CI]: 1.513 [1.179-1.941]), pancreatic cancer (PC) (HR per one quintile change [95%CI]: 1.365 [1.058-1.762]) and diabetes mellitus (DM) (HR per one quintile change [95%CI]: 1.221 [1.132-1.318]). FP was also associated with a higher risk of AP (HR [95%CI]: 3.982 [2.192-7.234]), PC (HR [95%CI]: 1.976 [1.054-3.704]), and DM (HR [95%CI]: 1.337 [1.122-1.593], P=0.001).
CONCLUSIONS: FP is a common pancreatic disorder. Fat in the pancreas is an independent risk factor for diseases of both the exocrine pancreas and endocrine pancreas.
PMID:38587286 | DOI:10.14309/ajg.0000000000002792
A QSAR study for predicting malformation in Zebrafish embryo
Toxicol Mech Methods. 2024 Apr 8:1-24. doi: 10.1080/15376516.2024.2338907. Online ahead of print.
ABSTRACT
BackgroundDevelopmental toxicity tests are extremely expensive, require a large number of animals, and are time-consuming. It is necessary to develop a new approach to simplify the analysis of developmental endpoints. One of these endpoints is malformation, and one group of ongoing methods for simplifying is in silico models. In this study, we aim to develop a Quantitive Structure- Activity Relationship (QSAR) model and identify the best algorithm for predicting malformations, as well as the most important and effective physicochemical properties associated with malformation.MethodsThe dataset was extracted from a reliable database called COMPTOX. Physicochemical properties (descriptors) were calculated using Mordred and RDKit chemoinformatic software. The data were cleaned, preprocessed, and then split into training and testing sets. Machine learning algorithms, such as Gradient Boosting (GBM) and logistic regression (LR), as well as deep learning models, including multilayer perceptron (MLP) and neural networks (NN) trained with train set data and different sets of descriptors. The models were then validated with test set and various statistical parameters, such as Matthew's correlation coefficient (MCC) and balanced accuracy score, were used to compare the modelsResultsA set of descriptors containing with 78% AUC was identified as the best set of descriptors. Gradient Boosting was determined to be the best algorithm with 78% predictive power.ConclusionThe descriptors that were the most effective for developing models directly impact the mechanism of malformation, and gradient boosting is the best model due to its Matthews correlation coefficient (MCC) and balanced accuracy (BAC).
PMID:38586962 | DOI:10.1080/15376516.2024.2338907