Deep learning

The impact of partner interaction on brief social buffering in adolescent female rats as analyzed by deep learning-based object detection algorithms

Thu, 2025-05-01 06:00

Physiol Behav. 2025 Apr 29:114934. doi: 10.1016/j.physbeh.2025.114934. Online ahead of print.

ABSTRACT

Social buffering is a phenomenon whereby the stress response of an individual exposed to a distressing stimulus is alleviated by the presence of conspecific(s). In this study, we aimed to determine whether a brief buffering period (only 3 min) with a conspecific immediately after fear conditioning can produce social buffering in adolescent Sprague-Dawley rats (4-5 weeks, male and female) and whether close partner interaction can affect brief social buffering in adolescent female rats. The rats received an electric shock in the black room of a shuttle box, followed by a 3 min buffering period. After two such learning trials, the rats individually performed a passive avoidance test, both immediately and 24 hr later. To reduce human bias and analyze variables not accessible to human observers, data were analyzed using YOLOv8 and BoT-SORT, deep learning-based object detection and tracking algorithms. The Toy group, tested with an object resembling a rat, showed a significant increase in fear-related behavior in both sexes. The Pair group, tested with a partner, showed a significant decrease in fear-related behavior in both sexes during the learning check, but only females maintained this decrease in the retention test. In the Pair female group, black-room preference correlated significantly with the time the rat and its partner spent in the same room and with the time they stayed close together. Therefore, we demonstrated that immediate, brief social contact is sufficient to induce social buffering, especially in female rats. In addition, close social contact appears to be a key factor increasing the efficiency of social buffering.
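The proximity-preference relationship reported above can be illustrated with a plain Pearson correlation; the per-rat values below are hypothetical, not the study's data.

```python
import math

def pearson_r(xs, ys):
    # Pearson correlation coefficient between two equal-length sequences.
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Hypothetical per-rat measurements: seconds spent close to the partner
# during buffering vs. later black-room preference (fraction of test time).
time_close = [30, 55, 80, 100, 140, 160]
preference = [0.20, 0.25, 0.35, 0.40, 0.55, 0.60]
r = pearson_r(time_close, preference)  # strongly positive here
```

A tracking pipeline such as YOLOv8 + BoT-SORT would supply the proximity durations automatically; the correlation step itself is this simple.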

PMID:40311725 | DOI:10.1016/j.physbeh.2025.114934

Categories: Literature Watch

A deep learning algorithm for automated adrenal gland segmentation on non-contrast CT images

Thu, 2025-05-01 06:00

BMC Med Imaging. 2025 May 1;25(1):142. doi: 10.1186/s12880-025-01682-5.

ABSTRACT

BACKGROUND: The adrenal glands are small retroperitoneal organs, and few reference standards exist for adrenal CT measurements in clinical practice. This study aims to develop a deep learning (DL) model for automated adrenal gland segmentation on non-contrast CT images and to conduct a preliminary large-scale study of age-related volume changes in normal adrenal glands using the model's output values.

METHODS: The model was trained and evaluated on a development dataset of annotated non-contrast CT scans of bilateral adrenal glands, utilizing nnU-Net for the segmentation task. The ground truth was manually established by two experienced radiologists, and model performance was assessed using the Dice similarity coefficient (DSC). Additionally, five radiologists provided annotations on a subset of 20 randomly selected cases to measure inter-observer variability. Following validation, the model was applied to a large-scale normal adrenal gland dataset to segment the adrenal glands.
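The Dice similarity coefficient used to evaluate the model is straightforward to compute; a minimal sketch over toy flattened binary masks (not the paper's code):

```python
def dice_coefficient(mask_a, mask_b):
    # Dice similarity coefficient: 2|A ∩ B| / (|A| + |B|) over binary masks.
    inter = sum(a and b for a, b in zip(mask_a, mask_b))
    total = sum(mask_a) + sum(mask_b)
    return 2 * inter / total if total else 1.0

# Toy flattened binary masks (1 = adrenal gland voxel).
pred  = [0, 1, 1, 1, 0, 1, 0, 0]
truth = [0, 1, 1, 0, 0, 1, 1, 0]
dsc = dice_coefficient(pred, truth)  # 2*3 / (4 + 4) = 0.75
```

In practice the masks are 3D volumes and the same formula is applied per gland, giving the ~0.90 median scores reported below.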

RESULTS: The DL model development dataset contained 1301 CT examinations. In the test set, the median DSC scores of the segmentation model for the left and right adrenal glands were 0.899 and 0.904, respectively; in the independent test set, they were 0.900 and 0.896. Inter-observer DSC for radiologist manual segmentation did not differ from automated machine segmentation (P = 0.541). The large-scale normal adrenal gland dataset contained 2000 CT examinations, and the resulting curve shows that adrenal gland volume first increases and then decreases with age.

CONCLUSION: The developed DL model demonstrates accurate adrenal gland segmentation, and enables a comprehensive study of age-related adrenal gland volume variations.

PMID:40312690 | DOI:10.1186/s12880-025-01682-5

Categories: Literature Watch

Artifact estimation network for MR images: effectiveness of batch normalization and dropout layers

Thu, 2025-05-01 06:00

BMC Med Imaging. 2025 May 1;25(1):144. doi: 10.1186/s12880-025-01663-8.

ABSTRACT

BACKGROUND: Magnetic resonance imaging (MRI) is an essential tool for medical diagnosis. However, artifacts may degrade images obtained through MRI, especially owing to patient movement. Existing methods that mitigate the artifact problem are subject to limitations including extended scan times. Deep learning architectures, such as U-Net, may be able to address these limitations. Optimizing deep learning networks with batch normalization (BN) and dropout layers enhances their convergence and accuracy. However, the influence of this strategy on U-Net has not been explored for artifact removal.

METHODS: This study developed a U-Net-based regression network for the removal of motion artifacts and investigated the impact of combining BN and dropout layers as a strategy for this purpose. A Transformer-based network from a previous study was also adopted for comparison. In total, 1200 images (with and without motion artifacts) were used to train and test three variations of U-Net.

RESULTS: The evaluation results demonstrated a significant improvement in network accuracy when BN and dropout layers were implemented. The peak signal-to-noise ratio of the reconstructed images was approximately doubled and the structural similarity index was improved by approximately 10% compared with those of the artifact images.
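The peak signal-to-noise ratio cited above is derived directly from the mean squared error between a reference image and a reconstruction; a minimal sketch on hypothetical pixel values:

```python
import math

def psnr(reference, test, peak=255.0):
    # Peak signal-to-noise ratio in dB, computed from the mean squared error.
    mse = sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)
    return float("inf") if mse == 0 else 10 * math.log10(peak ** 2 / mse)

# Hypothetical flattened pixel rows: clean image, artifact-corrupted input,
# and the network's reconstruction after artifact removal.
clean    = [100, 120, 130, 140]
noisy    = [110, 115, 128, 150]
denoised = [101, 119, 131, 141]
before = psnr(clean, noisy)
after = psnr(clean, denoised)  # higher dB after artifact removal
```

A "doubled" PSNR, as reported, corresponds to a very large reduction in MSE, since PSNR is logarithmic in the error.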

CONCLUSIONS: Although this study was limited to phantom images, the same strategy may be applied to more complex tasks, such as those directed at improving the quality of MR and CT images. We conclude that the accuracy of motion artifact removal can be improved by integrating BN and dropout layers into a U-Net-based network, with due consideration of the correct location and dropout rate.

PMID:40312665 | DOI:10.1186/s12880-025-01663-8

Categories: Literature Watch

Assessing English language teachers' pedagogical effectiveness using convolutional neural networks optimized by a modified virus colony search algorithm

Thu, 2025-05-01 06:00

Sci Rep. 2025 May 1;15(1):15295. doi: 10.1038/s41598-025-98033-9.

ABSTRACT

Effective teacher performance evaluation is important for enhancing the quality of educational systems. This study presents a novel approach that integrates deep learning and metaheuristics to assess the pedagogical quality of English as a foreign language (EFL) instruction in a classroom setting. A comprehensive index framework is developed, comprising five primary dimensions: instructional design, instructional materials, teaching methods and approaches, teaching effectiveness, and classroom management. Each dimension is further divided into secondary indicators that capture specific aspects of teaching quality, including pronunciation, content coverage, lesson objectives, and student engagement. The proposed approach uses a convolutional neural network (CNN) architecture optimized by a modified virus colony search (VCS) algorithm to analyze audio and video recordings of classroom interactions. The results demonstrate that the VCS/CNN algorithm can accurately evaluate EFL instruction based on multiple criteria and indicators, outperforming existing methods in terms of accuracy, robustness, flexibility, and efficiency. This study contributes to the development of a reliable and efficient teacher evaluation framework that can provide timely feedback, identify teacher strengths and weaknesses, and inform areas for professional development. The proposed approach has the potential to improve the quality of EFL instruction and administration by enhancing teacher performance and student learning outcomes.

PMID:40312557 | DOI:10.1038/s41598-025-98033-9

Categories: Literature Watch

Deep learning HRNet FCN for blood vessel identification in laparoscopic pancreatic surgery

Thu, 2025-05-01 06:00

NPJ Digit Med. 2025 May 1;8(1):235. doi: 10.1038/s41746-025-01663-6.

ABSTRACT

Laparoscopic pancreatic surgery remains highly challenging due to the complexity of the pancreas and surrounding vascular structures, with risk of injuring critical blood vessels such as the Superior Mesenteric Vein (SMV)-Portal Vein (PV) axis and splenic vein. Here, we evaluated the High Resolution Network (HRNet)-Full Convolutional Network (FCN) model for its ability to accurately identify vascular contours and improve surgical safety. Using 12,694 images from 126 laparoscopic distal pancreatectomy (LDP) videos and 35,986 images from 138 Whipple procedure videos, the model demonstrated robust performance, achieving a mean Dice coefficient of 0.754, a recall of 85.00%, and a precision of 91.10%. By combining datasets from LDP and Whipple procedures, the model showed strong generalization across different surgical contexts and achieved real-time processing speeds of 11 frames per second during surgery. These findings highlight HRNet-FCN's potential to recognize anatomical landmarks, enhance surgical precision, reduce complications, and improve outcomes of laparoscopic pancreatic surgery.

PMID:40312536 | DOI:10.1038/s41746-025-01663-6

Categories: Literature Watch

A human pose estimation network based on YOLOv8 framework with efficient multi-scale receptive field and expanded feature pyramid network

Thu, 2025-05-01 06:00

Sci Rep. 2025 May 1;15(1):15284. doi: 10.1038/s41598-025-00259-0.

ABSTRACT

Deep neural networks are used to accurately detect, estimate, and predict human body poses in images or videos through deep learning-based human pose estimation. However, traditional multi-person pose estimation methods face challenges due to partial occlusions and overlaps between multiple human bodies and body parts. To address these issues, we propose EE-YOLOv8, a human pose estimation network based on the YOLOv8 framework, which integrates an Efficient Multi-scale Receptive Field (EMRF) and an Expanded Feature Pyramid Network (EFPN). First, the EMRF module is employed to further enhance the model's feature representation capability. Second, the EFPN optimizes cross-level information exchange and improves multi-scale data integration. Finally, Wise-IoU replaces the traditional Intersection over Union (IoU) to improve detection accuracy through precise overlap measurement between predicted and ground-truth bounding boxes. We evaluate EE-YOLOv8 on the MS COCO 2017 dataset. Compared to YOLOv8-Pose, EE-YOLOv8 achieves an AP of 89.0% at an IoU threshold of 0.5 (an improvement of 3.3%) and an AP of 65.6% over the IoU range of 0.5-0.95 (an improvement of 5.8%). Moreover, EE-YOLOv8 achieves the highest accuracy while maintaining the lowest parameter count and computational complexity among all analyzed algorithms. These results demonstrate that EE-YOLOv8 exhibits superior competitiveness compared to other mainstream methods.
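The IoU measure underlying both the AP thresholds and the Wise-IoU loss is the ratio of box overlap to box union; a minimal sketch (Wise-IoU additionally applies a dynamic focusing weight on top of this base measure, which is not shown here):

```python
def iou(box_a, box_b):
    # Intersection over Union for axis-aligned boxes given as (x1, y1, x2, y2).
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

pred  = (0, 0, 10, 10)
truth = (5, 0, 15, 10)
score = iou(pred, truth)  # 50 / (100 + 100 - 50) ≈ 0.333
```

A detection counts as correct at the 0.5 threshold only if this score reaches 0.5; the 0.5-0.95 AP averages over ten such thresholds.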

PMID:40312474 | DOI:10.1038/s41598-025-00259-0

Categories: Literature Watch

Criminal emotion detection framework using convolutional neural network for public safety

Thu, 2025-05-01 06:00

Sci Rep. 2025 May 1;15(1):15279. doi: 10.1038/s41598-025-97879-3.

ABSTRACT

In the era of rapid societal modernization, crime remains an intrinsic facet of society that demands attention and consideration. As communities evolve and adopt technological advancements, the changing landscape of criminal activity requires careful examination and proactive approaches for public safety applications. In this paper, we propose a collaborative approach to detect crime patterns and criminal emotions with the aim of enhancing judicial decision-making. To this end, we utilized two standard datasets: a crime dataset comprising various crime-related features, and an emotion dataset with 135 emotion classes that helps the AI model identify criminal emotions efficiently. A convolutional neural network (CNN) is first trained on the crime dataset to separate crime from non-crime images. Once a crime is detected, criminal faces are extracted using the region of interest and stored in a directory. Different CNN architectures, such as LeNet-5, VGGNet, ResNet-50, and a basic CNN, are used to detect different facial emotions. The trained CNN models are used to detect criminal emotion and support judicial decision-making. The proposed framework is evaluated with different metrics, such as training accuracy, loss, optimizer performance, precision-recall curve, model complexity, training time, and inference time. In crime detection, the CNN model achieves a remarkable accuracy of 92.45%, and in criminal emotion detection, LeNet-5 outperforms the other CNN architectures with an accuracy of 98.6%.

PMID:40312470 | DOI:10.1038/s41598-025-97879-3

Categories: Literature Watch

RaGeoSense for smart home gesture recognition using sparse millimeter wave radar point clouds

Thu, 2025-05-01 06:00

Sci Rep. 2025 May 1;15(1):15267. doi: 10.1038/s41598-025-00065-8.

ABSTRACT

With the growing demand for contactless human-computer interaction in the smart home field, gesture recognition technology shows great market potential. In this paper, a sparse millimeter wave point cloud-based gesture recognition system, RaGeoSense, is proposed for smart home scenarios. RaGeoSense effectively improves recognition performance and system robustness by combining multiple advanced signal processing and deep learning methods. First, the system adopts three denoising methods, namely K-means clustering-based straight-through filtering, frame-difference filtering, and median filtering, to reduce the noise of the raw millimeter wave data, which significantly improves the quality of the point cloud data. Subsequently, the generated point cloud data are processed with sliding sequence sampling and point cloud tiling to extract the spatio-temporal features of each action. To further improve classification performance, the system uses an integrated model architecture that combines GBDT and XGBoost for efficient extraction of nonlinear features and utilizes LSTM gated recurrent units to classify the gesture sequences, thus realizing accurate recognition of eight different one-arm gestures. The experimental results show that RaGeoSense performs well at different distances, angles, and movement speeds, with an average recognition rate of 95.2%; it is almost unaffected by differences between users and has a certain degree of interference resistance.
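Of the three denoising steps, median filtering is the simplest to sketch: a sliding-window median suppresses impulsive radar noise while preserving the underlying trajectory. The sample values below are hypothetical range readings, not RaGeoSense data.

```python
def median_filter(signal, window=3):
    # Sliding-window median; endpoints keep their original values.
    half = window // 2
    out = list(signal)
    for i in range(half, len(signal) - half):
        out[i] = sorted(signal[i - half:i + half + 1])[half]
    return out

# Hypothetical range samples with an impulsive noise spike at index 3.
ranges = [1.0, 1.1, 1.2, 9.0, 1.3, 1.4, 1.5]
filtered = median_filter(ranges)  # spike replaced by a neighborhood median
```

Frame-difference filtering plays a complementary role, removing static background points so that only moving (gesture) points remain in the cloud.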

PMID:40312411 | DOI:10.1038/s41598-025-00065-8

Categories: Literature Watch

A hybrid approach for binary and multi-class classification of voice disorders using a pre-trained model and ensemble classifiers

Thu, 2025-05-01 06:00

BMC Med Inform Decis Mak. 2025 May 1;25(1):177. doi: 10.1186/s12911-025-02978-w.

ABSTRACT

Recent advances in artificial intelligence-based audio and speech processing have increasingly focused on the binary and multi-class classification of voice disorders. Despite progress, achieving high accuracy in multi-class classification remains challenging. This paper proposes a novel hybrid approach using a two-stage framework to enhance voice disorder classification performance and achieve state-of-the-art accuracy in multi-class classification. Our hybrid approach combines deep learning features with several powerful classifiers. In the first stage, high-level feature embeddings are extracted from voice data spectrograms using a pre-trained VGGish model. In the second stage, these embeddings are used as input to four different classifiers: Support Vector Machine (SVM), Logistic Regression (LR), Multi-Layer Perceptron (MLP), and an Ensemble Classifier (EC). Experiments are conducted on a subset of the Saarbruecken Voice Database (SVD) for male, female, and combined speakers. For binary classification, VGGish-SVM achieved the highest accuracy for male speakers (82.45% for healthy vs. disordered; 75.45% for hyperfunctional dysphonia vs. vocal fold paresis), while VGGish-EC performed best for female speakers (71.54% for healthy vs. disordered; 68.42% for hyperfunctional dysphonia vs. vocal fold paresis). In multi-class classification, VGGish-SVM outperformed other models, achieving mean accuracies of 77.81% for male speakers, 63.11% for female speakers, and 70.53% for combined genders. We conducted a comparative analysis against related works, including Mel frequency cepstral coefficients (MFCC), MFCC-glottal features, and features extracted using the wav2vec and HuBERT models with an SVM classifier. Results demonstrate that our hybrid approach consistently outperforms these models, especially in multi-class classification tasks.
The results show the feasibility of a hybrid framework for voice disorder classification, offering a foundation for refining automated tools that could support clinical assessments with further validation.
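The two-stage structure (pre-trained embedding, then a conventional classifier) can be sketched in a few lines. Everything here is a stand-in: the paper uses VGGish embeddings and SVM/ensemble classifiers, whereas this sketch substitutes crude signal statistics and a nearest-centroid rule on invented waveforms, purely to show the data flow.

```python
import math

def embed(waveform):
    # Stand-in for a pre-trained embedding model (the paper uses VGGish):
    # here, just crude energy/extent statistics of the raw signal.
    n = len(waveform)
    energy = sum(x * x for x in waveform) / n
    extent = max(waveform) - min(waveform)
    return (energy, extent)

def train_centroids(samples, labels):
    # Stage-two stand-in for the SVM/ensemble: a nearest-centroid classifier.
    sums = {}
    for s, y in zip(samples, labels):
        e = embed(s)
        acc = sums.setdefault(y, [0.0, 0.0, 0])
        acc[0] += e[0]; acc[1] += e[1]; acc[2] += 1
    return {y: (a / n, b / n) for y, (a, b, n) in sums.items()}

def classify(sample, centroids):
    e = embed(sample)
    return min(centroids, key=lambda y: math.dist(e, centroids[y]))

# Invented low-amplitude ("healthy") and high-amplitude ("disordered") signals.
healthy = [[0.1, -0.1, 0.1, -0.1], [0.2, -0.2, 0.1, -0.1]]
disordered = [[1.0, -1.0, 0.9, -0.9], [0.8, -0.9, 1.0, -0.8]]
model = train_centroids(healthy + disordered,
                        ["healthy"] * 2 + ["disordered"] * 2)
```

The appeal of the design is that the expensive representation learning is done once (pre-training), and only the lightweight second stage is fitted per task.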

PMID:40312383 | DOI:10.1186/s12911-025-02978-w

Categories: Literature Watch

Ge-SAND: an explainable deep learning-driven framework for disease risk prediction by uncovering complex genetic interactions in parallel

Thu, 2025-05-01 06:00

BMC Genomics. 2025 May 1;26(1):432. doi: 10.1186/s12864-025-11588-9.

ABSTRACT

BACKGROUND: Accurate genetic risk prediction and understanding the mechanisms underlying complex diseases are essential for effective intervention and precision medicine. However, current methods often struggle to capture the intricate and subtle genetic interactions contributing to disease risk. This challenge may be further exacerbated by the curse of dimensionality when considering large-scale pairwise genetic combinations with limited samples. Overcoming these limitations could transform biomedicine by providing deeper insights into disease mechanisms, moving beyond black-box models and single-locus analyses, and enabling a more comprehensive understanding of cross-disease patterns.

RESULTS: We developed Ge-SAND (Genomic Embedding Self-Attention Neurodynamic Decoder), an explainable deep learning-driven framework designed to uncover complex genetic interactions at scales exceeding 10⁶ in parallel for accurate disease risk prediction. Ge-SAND leverages genotype and genomic positional information to identify both intra- and interchromosomal interactions associated with disease phenotypes, providing comprehensive insights into pathogenic mechanisms crucial for disease risk prediction. Applied to simulated datasets and UK Biobank cohorts for Crohn's disease, schizophrenia, and Alzheimer's disease, Ge-SAND achieved up to a 20% improvement in AUC-ROC compared to mainstream methods. Beyond its predictive accuracy, through self-attention-based interaction networks, Ge-SAND provided insights into large-scale genotype relationships and revealed genetic mechanisms underlying these complex diseases. For instance, Ge-SAND identified potential genetic interaction pairs, including novel relationships such as ISOC1 and HOMER2, potentially implicating the brain-gut axis in Crohn's and Alzheimer's diseases.

CONCLUSION: Ge-SAND is a novel deep-learning approach designed to address the challenges of capturing large-scale genetic interactions. By integrating disease risk prediction with interpretable insights into genetic mechanisms, Ge-SAND offers a valuable tool for advancing genomic research and precision medicine.

PMID:40312319 | DOI:10.1186/s12864-025-11588-9

Categories: Literature Watch

Machine learning in prediction of epidermal growth factor receptor status in non-small cell lung cancer brain metastases: a systematic review and meta-analysis

Thu, 2025-05-01 06:00

BMC Cancer. 2025 May 1;25(1):818. doi: 10.1186/s12885-025-14221-w.

ABSTRACT

BACKGROUND: Epidermal growth factor receptor (EGFR) mutations are present in 10-60% of all non-small cell lung cancer (NSCLC) patients and are associated with dismal prognosis. Lung cancer brain metastases (LCBM) are a common complication of lung cancer. Prediction of EGFR status can support physicians' decision-making and, by optimizing treatment strategies, lead to more favorable outcomes. This systematic review and meta-analysis evaluated the predictive performance of machine learning (ML)-based models for EGFR status in NSCLC patients with brain metastasis.

METHODS: On December 20, 2024, the four electronic databases, Pubmed, Embase, Scopus, and Web of Science, were systematically searched. Studies that evaluated EGFR status in patients with brain metastasis from NSCLC were included.

RESULTS: Twenty studies with 3517 patients with 6205 NSCLC brain metastatic lesions were included. The majority of the best-performing models were ML-based (70%, 7/10), and deep learning (DL)-based models comprised 30% (6/20) of models. The area under the curve (AUC) and accuracy (ACC) of the best-performing models ranged from 0.765 to 1 and 0.69 to 0.93, respectively. The meta-analysis of the best-performing models revealed a pooled AUC of 0.91 (95%CI: 0.88-0.93) and ACC of 0.82 (95%CI: 0.79-0.86), along with a pooled sensitivity of 0.87 (95%CI: 0.83-0.9), specificity of 0.86 (95%CI: 0.79-0.9), and diagnostic odds ratio (DOR) of 35.2 (95%CI: 21.2-58.4). The subgroup analysis did not show significant differences between ML and DL models.
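The diagnostic odds ratio follows directly from sensitivity and specificity. Note that the pooled DOR of 35.2 is estimated across studies, so it need not equal the value implied by the pooled sensitivity and specificity, which the sketch below computes (≈41):

```python
def diagnostic_odds_ratio(sensitivity, specificity):
    # DOR = odds of a positive test in the diseased divided by the
    # odds of a positive test in the non-diseased:
    # (sens / (1 - sens)) / ((1 - spec) / spec)
    return (sensitivity / (1 - sensitivity)) / ((1 - specificity) / specificity)

# Point estimates from the meta-analysis above.
dor = diagnostic_odds_ratio(0.87, 0.86)  # ≈ 41 from these point estimates
```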

CONCLUSION: ML-based models demonstrated promising predictive outcomes in predicting EGFR status. Applying ML-based models in daily clinical practice can optimize treatment strategies and enhance clinical and radiological outcomes.

PMID:40312289 | DOI:10.1186/s12885-025-14221-w

Categories: Literature Watch

Real-Time, Dual-Physical-Layer Encryption Directly within an Optical Sensor on a Silicon Platform

Thu, 2025-05-01 06:00

ACS Appl Mater Interfaces. 2025 May 1. doi: 10.1021/acsami.5c00535. Online ahead of print.

ABSTRACT

Today, data breaches pose a significant risk, especially those involving image data. Ideally, for ultimate security, image encryption should occur at the moment the image is captured, directly within the sensor. Nonetheless, such optical sensors have not yet been achieved, limited by the physical properties of existing devices. Herein, we demonstrate a pioneering optical sensor that allows real-time, dual-physical-layer encryption directly within the sensor, enabled by the merits of III-nitride nanowires and careful engineering of the photocarrier dynamics within the nanowire heterojunctions. The robustness of the encryption is further tested against deep-learning-assisted cyber-attacks. Self-powered operation is also possible for such devices, representing a reduced energy cost for encryption. Moreover, the sensors are built directly on silicon (Si), making the technology compatible with existing Si electronics platforms. The simple epitaxy process for fabricating such sensors also means reduced time and production costs. This study represents a paradigm shift in image encryption research.

PMID:40310757 | DOI:10.1021/acsami.5c00535

Categories: Literature Watch

LLM-guided Decoupled Probabilistic Prompt for Continual Learning in Medical Image Diagnosis

Thu, 2025-05-01 06:00

IEEE Trans Med Imaging. 2025 May 1;PP. doi: 10.1109/TMI.2025.3566105. Online ahead of print.

ABSTRACT

Traditional deep learning-based diagnostic models typically exhibit limitations when applied to dynamic clinical environments that require handling the emergence of new diseases. Continual learning (CL) offers a promising solution, aiming to learn new knowledge while preserving previously learned knowledge. Though recent rehearsal-free CL methods employing prompt tuning (PT) have shown promise, they rely on deterministic prompts that struggle to handle diverse fine-grained knowledge. Moreover, existing PT methods utilize randomly initialized prompts that are trained under standard classification constraints, impeding expert knowledge integration and optimal performance acquisition. In this paper, we propose an LLM-guided Decoupled Probabilistic Prompt (LDPP) for continual learning in medical image diagnosis. Specifically, we develop an Expert Knowledge Generation (EKG) module that leverages an LLM to acquire decoupled expert knowledge and comprehensive category descriptions. Then, we introduce a Decoupled Probabilistic Prompt pool (DePP), a shared pool of probabilistic prompts derived from the expert knowledge set. These prompts dynamically provide diverse and flexible descriptions for input images. Finally, we design a Steering Prompt Pool (SPP) to enhance intra-class compactness and promote model performance by learning non-shared prompts. With extensive experimental validation, LDPP consistently sets state-of-the-art performance under the challenging class-incremental setting in CL. Code is available at: https://github.com/CUHK-AIM-Group/LDPP.

PMID:40310742 | DOI:10.1109/TMI.2025.3566105

Categories: Literature Watch

Digital Staining with Knowledge Distillation: A Unified Framework for Unpaired and Paired-But-Misaligned Data

Thu, 2025-05-01 06:00

IEEE Trans Med Imaging. 2025 May 1;PP. doi: 10.1109/TMI.2025.3565329. Online ahead of print.

ABSTRACT

Staining is essential in cell imaging and medical diagnostics but poses significant challenges, including high cost, time consumption, labor intensity, and irreversible tissue alterations. Recent advances in deep learning have enabled digital staining through supervised model training. However, collecting large-scale, perfectly aligned pairs of stained and unstained images remains difficult. In this work, we propose a novel unsupervised deep learning framework for digital cell staining that reduces the need for extensive paired data using knowledge distillation. We explore two training schemes: (1) unpaired and (2) paired-but-misaligned settings. For the unpaired case, we introduce a two-stage pipeline, comprising light enhancement followed by colorization, as a teacher model. Subsequently, we obtain a student staining generator through knowledge distillation with hybrid non-reference losses. To leverage the pixel-wise information between adjacent sections, we further extend to the paired-but-misaligned setting, adding a Learning to Align module to utilize pixel-level information. Experimental results on our dataset demonstrate that our proposed unsupervised deep staining method can generate stained images with more accurate positions and shapes of the cell targets in both settings. Compared with competing methods, our method achieves improved results both qualitatively and quantitatively (e.g., NIQE and PSNR). We applied our digital staining method to the White Blood Cell (WBC) dataset, investigating its potential for medical applications.

PMID:40310741 | DOI:10.1109/TMI.2025.3565329

Categories: Literature Watch

Reconstructing and predicting stochastic dynamical systems using probabilistic deep learning

Thu, 2025-05-01 06:00

Chaos. 2025 May 1;35(5):053102. doi: 10.1063/5.0248312.

ABSTRACT

Stochastic effects introduce significant uncertainty into dynamical systems, making the data-driven reconstruction and prediction of these systems highly complex. This study incorporates uncertainty learning into a deep learning model for time-series prediction, proposing a deep stochastic time-delay embedding model to improve prediction accuracy and robustness. First, this model constructs a deep probabilistic catcher to capture uncertainty information in the reconstructed mappings. These uncertainty representations are then integrated as meta-information into the reconstruction process of time-delay embedding, enabling it to fully capture system stochasticity and predict target variables over multiple time steps. Finally, the model is validated on both the Lorenz system and real-world datasets, demonstrating superior performance compared to existing methods, with robust results under noisy conditions.
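The time-delay embedding at the core of the model is a standard construction: each state vector stacks the current observation with lagged copies of itself. A minimal sketch on a toy series:

```python
def delay_embed(series, dim, tau):
    # Time-delay embedding: each vector is
    # (x[t], x[t - tau], ..., x[t - (dim - 1) * tau]).
    start = (dim - 1) * tau
    return [[series[t - k * tau] for k in range(dim)]
            for t in range(start, len(series))]

x = [0, 1, 2, 3, 4, 5, 6, 7]
vectors = delay_embed(x, dim=3, tau=2)
# first embedded vector: [4, 2, 0]
```

The paper's contribution is to attach learned uncertainty representations (the "deep probabilistic catcher") to these embedded vectors before multi-step prediction, which this sketch does not attempt.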

PMID:40310707 | DOI:10.1063/5.0248312

Categories: Literature Watch

Transformer-based Koopman autoencoder for linearizing Fisher's equation

Thu, 2025-05-01 06:00

Chaos. 2025 May 1;35(5):053101. doi: 10.1063/5.0244221.

ABSTRACT

A transformer-based Koopman autoencoder is proposed for linearizing Fisher's reaction-diffusion equation. The primary focus of this study is on using deep learning techniques to find complex spatiotemporal patterns in the reaction-diffusion system. The emphasis is on not just solving the equation but also transforming the system's dynamics into a more comprehensible, linear form. Global coordinate transformations are achieved through the autoencoder, which learns to capture the underlying dynamics by training on a dataset with 60,000 initial conditions. Extensive testing on multiple datasets was used to assess the efficacy of the proposed model, demonstrating its ability to accurately predict the system's evolution as well as to generalize. We provide a thorough comparison study, comparing our suggested design to several comparable methods using experiments on various PDEs, such as the Kuramoto-Sivashinsky equation and Burgers' equation. Results show improved accuracy, highlighting the capabilities of the transformer-based Koopman autoencoder. The proposed architecture is significantly ahead of other architectures in terms of solving different types of PDEs using a single architecture. Our method relies entirely on the data, without requiring any knowledge of the underlying equations. This makes it applicable even to datasets where the governing equations are not known.
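The Koopman idea is that, in the right learned coordinates, nonlinear dynamics advance by a fixed linear operator. The one-dimensional toy below fits such an operator by least squares from trajectory data alone (a scalar analogue of the Koopman/DMD fit; the paper's autoencoder learns the coordinate transformation as well, which is omitted here):

```python
def fit_linear_operator(states, next_states):
    # Least-squares fit of a scalar linear map x' ≈ a * x:
    # a = <x, x'> / <x, x>, a one-dimensional Koopman/DMD-style fit.
    num = sum(x * y for x, y in zip(states, next_states))
    den = sum(x * x for x in states)
    return num / den

# Trajectory generated by the linear map x' = 0.5 * x.
traj = [8.0, 4.0, 2.0, 1.0, 0.5]
a = fit_linear_operator(traj[:-1], traj[1:])  # recovers 0.5 exactly
```

Once the operator is known, multi-step prediction reduces to repeated multiplication, which is what makes the linearized representation attractive.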

PMID:40310706 | DOI:10.1063/5.0244221

Categories: Literature Watch

Deep Learning Model of Primary Tumor and Metastatic Cervical Lymph Nodes From CT for Outcome Predictions in Oropharyngeal Cancer

Thu, 2025-05-01 06:00

JAMA Netw Open. 2025 May 1;8(5):e258094. doi: 10.1001/jamanetworkopen.2025.8094.

ABSTRACT

IMPORTANCE: Primary tumor (PT) and metastatic cervical lymph node (LN) characteristics are highly associated with oropharyngeal squamous cell carcinoma (OPSCC) prognosis. Currently, there is a lack of studies to combine imaging characteristics of both regions for predictions of p16+ OPSCC outcomes.

OBJECTIVES: To develop and validate a computed tomography (CT)-based deep learning classifier that integrates PT and LN features to predict outcomes in p16+ OPSCC and to identify patients with stage I disease who may derive added benefit associated with chemotherapy.

DESIGN, SETTING, AND PARTICIPANTS: In this retrospective prognostic study, radiographic CT scans were analyzed of 811 patients with p16+ OPSCC treated with definitive radiotherapy or chemoradiotherapy from 3 independent cohorts. One cohort from the Cancer Imaging Archive (1998-2013) was used for model development and validation and the 2 remaining cohorts (2002-2015) were used to externally test the model performance. The Swin Transformer architecture was applied to fuse the features from both PT and LN into a multiregion imaging risk score (SwinScore) to predict survival outcomes across and within subpopulations at various stages. Data analysis was performed between February and July 2024.

EXPOSURES: Definitive radiotherapy or chemoradiotherapy treatment for patients with p16+ OPSCC.

MAIN OUTCOMES AND MEASURES: Hazard ratios (HRs), log-rank tests, concordance index (C index), and net benefit were used to evaluate the associations between multiregion imaging risk score and disease-free survival (DFS), overall survival (OS), and locoregional failure (LRF). Interaction tests were conducted to assess whether the association of chemotherapy with outcome significantly differs across dichotomized multiregion imaging risk score subgroups.

RESULTS: The total patient cohort comprised 811 patients with p16+ OPSCC (median age, 59.0 years [IQR, 47.4-70.6 years]; 683 men [84.2%]). In the external test set, the multiregion imaging risk score was found to be prognostic of DFS (HR, 3.76 [95% CI, 1.99-7.10]; P < .001), OS (HR, 4.80 [95% CI, 2.22-10.40]; P < .001), and LRF (HR, 4.47 [95% CI, 1.43-14.00]; P = .01) among all patients with p16+ OPSCC. The multiregion imaging risk score, integrating both PT and LN information, demonstrated a higher C index (0.63) compared with models focusing solely on PT (0.61) or LN (0.58). Chemotherapy was associated with improved DFS only among patients with high scores (HR, 0.09 [95% CI, 0.02-0.47]; P = .004) but not those with low scores (HR, 0.83 [95% CI, 0.32-2.10]; P = .69).
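The C index values reported above measure how often a higher risk score corresponds to an earlier event among comparable patient pairs. A minimal sketch ignoring censoring (survival analyses like this one must also handle censored follow-up, which is omitted here); the scores and times are invented:

```python
def concordance_index(risk_scores, event_times):
    # Fraction of comparable pairs in which the higher-risk subject fails
    # earlier; tied risk scores count as half-concordant. No censoring.
    concordant, ties, total = 0, 0, 0
    n = len(risk_scores)
    for i in range(n):
        for j in range(i + 1, n):
            if event_times[i] == event_times[j]:
                continue  # not a comparable pair
            total += 1
            early, late = (i, j) if event_times[i] < event_times[j] else (j, i)
            if risk_scores[early] > risk_scores[late]:
                concordant += 1
            elif risk_scores[early] == risk_scores[late]:
                ties += 1
    return (concordant + 0.5 * ties) / total

scores = [2.0, 1.5, 1.0, 0.5]   # hypothetical SwinScore-style risk scores
times = [1, 2, 3, 4]            # event times: higher risk fails earlier
c = concordance_index(scores, times)  # perfectly concordant: 1.0
```

A C index of 0.5 is chance-level ranking, so the reported 0.63 for the combined PT + LN model reflects a modest but real gain over the single-region models (0.61 and 0.58).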

CONCLUSIONS AND RELEVANCE: This prognostic study of p16+ OPSCC describes the development of a CT-based imaging risk score integrating PT and metastatic cervical LN features to predict recurrence risk and identify suitable candidates for treatment tailoring. This tool could support treatment modulation of p16+ OPSCC at a highly granular level.

PMID:40310642 | DOI:10.1001/jamanetworkopen.2025.8094

Categories: Literature Watch

PhacoTrainer: Automatic Artificial Intelligence-Generated Performance Ratings for Cataract Surgery

Thu, 2025-05-01 06:00

Transl Vis Sci Technol. 2025 May 1;14(5):2. doi: 10.1167/tvst.14.5.2.

ABSTRACT

PURPOSE: To investigate whether cataract surgical skill performance metrics automatically generated by artificial intelligence (AI) models can differentiate between trainee and faculty surgeons and the correlation between AI metrics and expert-rated skills.

METHODS: Routine cataract surgical videos from residents (N = 28) and attendings (N = 29) were collected. Three video-level metrics were generated by deep learning models: phacoemulsification probe decentration, eye decentration, and zoom level change. Three types of instrument- and landmark-specific metrics were generated for the limbus, pupil, and various surgical instruments: total path length, maximum velocity, and area. Expert human judges assessed the surgical videos using the Objective Structured Assessment of Cataract Surgical Skill (OSACSS). Differences in AI and human-rated scores between attending surgeons and trainees were assessed using t-tests, and correlations between them were examined with Pearson correlation coefficients.

RESULTS: In attending videos, the phacoemulsification probe showed significantly lower total path length, maximum velocity, and area metrics. Attending surgeons also demonstrated better phacoemulsification centration and eye centration. Most AI metrics correlated negatively with OSACSS scores, including phacoemulsification decentration (r = -0.369) and eye decentration (r = -0.394). OSACSS subitems related to eye centration and different steps of surgery also showed significant negative correlations with the corresponding AI metrics (r ranging from -0.77 to -0.49).
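
The negative r values above are ordinary Pearson correlation coefficients between an AI metric (e.g., decentration) and the OSACSS score; a negative r means more decentration accompanies lower rated skill. A minimal stdlib sketch of the computation (not the study's code, which presumably used a statistics package):

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation: covariance of xs and ys divided by the
    product of their standard deviations. Ranges from -1 to 1."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)
```

For example, a metric that rises exactly as ratings fall gives r = -1.0, the extreme of the -0.77 to -0.49 range reported for the OSACSS subitems.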

CONCLUSIONS: Automatically generated AI metrics can differentiate between attending and trainee surgeries and correlate with expert human evaluations of surgical performance.

TRANSLATIONAL RELEVANCE: AI-generated metrics that correlate with surgeon skill may be useful for improving cataract surgical education.

PMID:40310637 | DOI:10.1167/tvst.14.5.2

Categories: Literature Watch

Age-Related Regional Changes in Choroidal Vascularity in Healthy Emmetropic Eyes

Thu, 2025-05-01 06:00

Transl Vis Sci Technol. 2025 May 1;14(5):3. doi: 10.1167/tvst.14.5.3.

ABSTRACT

PURPOSE: This retrospective cross-sectional study examined regional changes in choroidal vascularity index (CVI) with physiological aging in healthy emmetropes.

METHODS: Deep learning methods were used for segmentation and binarization of enhanced depth imaging optical coherence tomography images of the choroid collected from 280 healthy emmetropic subjects (mean spherical equivalent refraction: +0.39 ± 0.38 D), including 83 children (5-12 years), 77 adolescents (13-17 years), and 120 adults (18-41 years). The CVI, calculated as the ratio of luminal to total choroidal area (in percent), and luminal and stromal choroidal thickness were measured across the 5-mm horizontal macular region centered on the fovea. Linear mixed models were used to examine age-related regional changes in the choroid while controlling for gender and imaging time of day.
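
Since the CVI is defined above as the luminal fraction of the total choroidal area, it reduces to a simple ratio once the segmentation has split the binarized choroid into luminal and stromal components. A minimal sketch of that final step (the area values and function name are illustrative, not from the study):

```python
def choroidal_vascularity_index(luminal_area, stromal_area):
    """CVI in percent: the share of the total choroidal area
    (luminal + stromal) occupied by the luminal (vascular) component."""
    total_area = luminal_area + stromal_area
    return 100.0 * luminal_area / total_area
```

With luminal and stromal areas of 65 and 35 (arbitrary units), this returns 65.0, matching the childhood macular CVI reported in the results.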

RESULTS: The macular CVI decreased significantly from childhood (65% ± 0.5%) and adolescence (63% ± 0.5%) to adulthood (59% ± 0.4%) (P < 0.001). Significant regional variation was observed (P < 0.001), with the CVI increasing from the fovea (61% ± 0.3%) toward the perifovea (64% ± 0.3%) and from the temporal (61.4% ± 0.3%) toward the nasal hemiretina (63% ± 0.3%). The age-related decrease in the CVI was greater in the nasal (-7% ± 0.7%) than the temporal (-6% ± 0.7%) macula (P = 0.014) and was associated with significant nasal stromal thickening (45 ± 5 µm; P < 0.001) and temporal luminal thinning (-16 ± 6 µm; P = 0.033) from childhood to adulthood.

CONCLUSIONS: Physiological aging was associated with a significant region-dependent decline in the CVI, driven primarily by stromal thickening in the nasal macula and luminal thinning in the temporal macula.

TRANSLATIONAL RELEVANCE: These age-related changes in the CVI provide new insights into the physiological morphology of the choroid during aging and may aid clinicians in understanding the spatial and age-associated predilections of certain chorioretinal diseases.

PMID:40310636 | DOI:10.1167/tvst.14.5.3

Categories: Literature Watch

Reirradiation for recurrent glioblastoma: the significance of the residual tumor volume

Thu, 2025-05-01 06:00

J Neurooncol. 2025 May 1. doi: 10.1007/s11060-025-05042-9. Online ahead of print.

ABSTRACT

PURPOSE: Recurrent glioblastoma has a poor prognosis, and its optimal management remains unclear. Reirradiation (re-RT) is a promising treatment option, but long-term outcomes and optimal patient selection criteria are not well established.

METHODS: This study analyzed 71 patients with recurrent CNS WHO grade 4, IDHwt glioblastoma (GBM) who underwent re-RT at the University of Erlangen-Nuremberg between January 2009 and June 2019. Imaging follow-ups were conducted every 3 months. Progression-free survival (PFS) was defined using RANO criteria. Outcomes, feasibility, and toxicity of re-RT were evaluated. Contrast-enhancing tumor volume was measured using a deep learning auto-segmentation pipeline with expert validation and jointly evaluated with clinical and molecular-pathologic factors.

RESULTS: Most patients were prescribed conventionally fractionated re-RT (84.5%) with 45 Gy in 1.8 Gy fractions, combined with temozolomide (TMZ, 49.3%) or lomustine (CCNU, 12.7%). Re-RT was completed as planned in 94.4% of patients. After a median follow-up of 73.8 months, 88.7% of patients had died. The median overall survival was 9.6 months, and the median progression-free survival was 5.3 months. Multivariate analysis identified residual contrast-enhancing tumor volume at re-RT (HR 1.040 per cm3, p < 0.001) as the single dominant predictor of overall survival.
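
The reported HR of 1.040 per cm3 is a per-unit hazard ratio from a Cox model, so the hazard scales multiplicatively with residual tumor volume: each additional cm3 multiplies the hazard by 1.040. A minimal sketch of that interpretation (illustrative only, not the study's analysis code):

```python
def hazard_ratio_for_volume(per_cm3_hr, volume_cm3):
    """Hazard ratio implied by a residual tumor `volume_cm3` larger
    than the reference, given a per-cm3 Cox hazard ratio. In a Cox
    model the per-unit HR compounds multiplicatively: HR ** volume."""
    return per_cm3_hr ** volume_cm3
```

Under this reading, a 10 cm3 larger residual volume would correspond to roughly a 1.48-fold hazard (1.040 ** 10), which illustrates why residual contrast-enhancing volume dominated the multivariate analysis.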

CONCLUSION: Conventional fractionated re-RT is a feasible and effective treatment for recurrent high-grade glioma. The significant prognostic impact of residual tumor volume highlights the importance of combining maximum-safe resection with re-RT for improved outcomes.

PMID:40310485 | DOI:10.1007/s11060-025-05042-9

Categories: Literature Watch
