Deep learning
Forecasting carbon dioxide emissions in Chongming: a novel hybrid forecasting model coupling gray correlation analysis and deep learning method
Environ Monit Assess. 2024 Sep 17;196(10):941. doi: 10.1007/s10661-024-13092-1.
ABSTRACT
Predicting regional carbon dioxide (CO2) emissions is essential for advancing toward global carbon neutrality. This study introduces a novel CO2 emissions prediction model tailored to the unique environmental, economic, and energy consumption of Shanghai Chongming. Utilizing an innovative hybrid approach, the study first applies grey relational analysis to evaluate the influence of economic activity, natural conditions, and energy consumption on CO2 emissions. This is followed by the implementation of a dual-channel pooled convolutional neural network (DCNN) that captures both local and global features of the data, enhanced through feature stacking. Gated recurrent unit (GRU) network then assesses the temporal aspects of these features, culminating in precise CO2 emission predictions for the region. The results indicate: (1) The proposed hybrid model achieves accurate predictions based on accounting data, with high precision, low error, and good stability. (2) The study found an overall increase in Chongming's carbon emissions from 2000 to 2022, with the prediction results being generally consistent with existing research findings. (3) The proposed method, based on Chongming's CO2 emission predictions, addresses issues such as the scarcity of effective accounting data and inaccuracies in traditional calculation methods. The results can provide effective technical support for local government policies on carbon reduction and promote sustainable development.
PMID:39287717 | DOI:10.1007/s10661-024-13092-1
Deep learning classification method for boar sperm morphology analysis
Andrology. 2024 Sep 17. doi: 10.1111/andr.13758. Online ahead of print.
ABSTRACT
BACKGROUND: Boar semen quality emphasizes three major criteria: sperm concentration, motility, and morphology. Methods to analyze concentration and motility quickly and objectively readily exist, but few exist for analyzing morphology outside of subjective manual counting. Other vital factors for fertilization, like acrosome health, lack efficient detection methods due to limitations in detection by the human eye and costly biomarker analysis, which is rarely used in semen diagnostics.
OBJECTIVE: To overcome these challenges, we propose a novel approach integrating deep-learning technology with high-throughput image-based flow cytometry (IBFC) for objective and accurate analysis of both morphology and label-free acrosome health of thousands of individual spermatozoa at once, as opposed to manually counting on a microscope slide.
MATERIALS AND METHODS: Images of 10,000 spermatozoa were captured using an IBFC and manually annotated based on the primary morphological defect or acrosome health status for the training of the convolutional neural network (CNN). The CNN used these images to train and then applied that training to unannotated images to predict the model accuracy.
RESULTS: Using the CNNs, high F1 scores of 96.73%, 98.55%, and 99.31% for 20x, 40x, and 60x magnifications, respectively, for morphological classification were attained. Additionally, the model demonstrates an F1 score of 99.8% in detecting subtle acrosome health variations at the 60x magnification.
DISCUSSION AND CONCLUSIONS: We have established an integrated approach to rapidly collect and classify morphological defects and acrosome health status, without the use of manual counting or biomarker labeling. Our study underscores the potential of artificial intelligence in semen diagnostics, reducing technician variability, streamlining assays, and facilitating the development of additional label-free detection methods. This innovative approach addresses the barriers hindering biomarker adoption in semen analysis, offering a promising avenue for enhancing reproductive health assessments.
PMID:39287620 | DOI:10.1111/andr.13758
Progression of Bone Marrow Lesions and the Development of Knee Osteoarthritis: Osteoarthritis Initiative Data
Radiology. 2024 Sep;312(3):e240470. doi: 10.1148/radiol.240470.
ABSTRACT
Background Bone marrow lesions (BMLs) are a known risk factor for incident knee osteoarthritis (OA), and deep learning (DL) methods can assist in automated segmentation and risk prediction. Purpose To develop and validate a DL model for quantifying tibiofemoral BML volume on MRI scans in knees without radiographic OA and to assess the association between longitudinal BML changes and incident knee OA. Materials and Methods This retrospective study included knee MRI scans from the Osteoarthritis Initiative prospective cohort (February 2004-October 2015). The DL model, developed between August and October 2023, segmented the tibiofemoral joint into 10 subregions and measured BML volume in each subregion. Baseline and 4-year follow-up MRI scans were analyzed. Knees without OA at baseline were categorized into three groups based on 4-year BML volume changes: BML-free, BML regression, and BML progression. The risk of developing radiographic and symptomatic OA over 9 years was compared among these groups. Results Included were 3869 non-OA knees in 2430 participants (mean age, 59.5 years ± 9.0 [SD]; female-to-male ratio, 1.3:1). At 4-year follow-up, 2216 knees remained BML-free, 1106 showed an increase in BML volume, and 547 showed a decrease in BML volume. BML progression was associated with a higher risk of developing radiographic knee OA compared with remaining BML-free (hazard ratio [HR] = 3.0; P < .001) or BML regression (HR = 2.0; P < .001). Knees with BML progression also had a higher risk of developing symptomatic OA compared with BML-free knees (HR = 1.3; P < .001). Larger volume changes in BML progression were associated with a higher risk of developing both radiographic OA (HR = 2.0; P < .001) and symptomatic OA (HR = 1.7; P < .001). In almost all subchondral plates, especially the medial femur and tibia, BML progression was associated with a higher risk of developing both radiographic and symptomatic OA compared with remaining BML-free. Conclusion Knees with BML progression, according to subregion and extent of volume changes, were associated with an increased risk of OA compared with BML-free knees and knees with BML regression, highlighting the potential utility of monitoring BML volume changes in evaluating interventions to prevent OA development. ClinicalTrials.gov Identifier: NCT00080171 © RSNA, 2024 Supplemental material is available for this article. See also the editorial by Said and Sakly in this issue.
PMID:39287521 | DOI:10.1148/radiol.240470
Benchmarking deep learning-based low-dose CT image denoising algorithms
Med Phys. 2024 Sep 17. doi: 10.1002/mp.17379. Online ahead of print.
ABSTRACT
BACKGROUND: Long-lasting efforts have been made to reduce radiation dose and thus the potential radiation risk to the patient for computed tomography (CT) acquisitions without severe deterioration of image quality. To this end, various techniques have been employed over the years including iterative reconstruction methods and noise reduction algorithms.
PURPOSE: Recently, deep learning-based methods for noise reduction became increasingly popular and a multitude of papers claim ever improving performance both quantitatively and qualitatively. However, the lack of a standardized benchmark setup and inconsistencies in experimental design across studies hinder the verifiability and reproducibility of reported results.
METHODS: In this study, we propose a benchmark setup to overcome those flaws and improve reproducibility and verifiability of experimental results in the field. We perform a comprehensive and fair evaluation of several state-of-the-art methods using this standardized setup.
RESULTS: Our evaluation reveals that most deep learning-based methods show statistically similar performance, and improvements over the past years have been marginal at best.
CONCLUSIONS: This study highlights the need for a more rigorous and fair evaluation of novel deep learning-based methods for low-dose CT image denoising. Our benchmark setup is a first and important step towards this direction and can be used by future researchers to evaluate their algorithms.
PMID:39287517 | DOI:10.1002/mp.17379
A systematic review of the application of machine learning techniques to ultrasound tongue imaging analysis
J Acoust Soc Am. 2024 Sep 1;156(3):1796-1819. doi: 10.1121/10.0028610.
ABSTRACT
B-mode ultrasound has emerged as a prevalent tool for observing tongue motion in speech production, gaining traction in speech therapy applications. However, the effective analysis of ultrasound tongue image frame sequences (UTIFs) encounters many challenges, such as the presence of high levels of speckle noise and obscured views. Recently, the application of machine learning, especially deep learning techniques, to UTIF interpretation has shown promise in overcoming these hurdles. This paper presents a thorough examination of the existing literature, focusing on UTIF analysis. The scope of our work encompasses four key areas: a foundational introduction to deep learning principles, an exploration of motion tracking methodologies, a discussion of feature extraction techniques, and an examination of cross-modality mapping. The paper concludes with a detailed discussion of insights gleaned from the comprehensive literature review, outlining potential trends and challenges that lie ahead in the field.
PMID:39287468 | DOI:10.1121/10.0028610
Tailoring nonsurgical therapy for elderly patients with head and neck squamous cell carcinoma: A deep learning-based approach
Medicine (Baltimore). 2024 Sep 13;103(37):e39659. doi: 10.1097/MD.0000000000039659.
ABSTRACT
To assess deep learning models for personalized chemotherapy selection and quantify the impact of baseline characteristics on treatment efficacy for elderly head and neck squamous cell carcinoma (HNSCC) patients who are not surgery candidates. A comparison was made between patients whose treatments aligned with model recommendations and those whose did not, using overall survival as the primary metric. Bias was addressed through inverse probability treatment weighting (IPTW), and the impact of patient characteristics on treatment choice was analyzed via mixed-effects regression. Four thousand two hundred seventy-six elderly HNSCC patients in total met the inclusion criteria. Self-Normalizing Balanced individual treatment effect for survival data model performed best in treatment recommendation (IPTW-adjusted hazard ratio: 0.74, 95% confidence interval [CI], 0.63-0.87; IPTW-adjusted risk difference: 9.92%, 95% CI, 4.96-14.90; IPTW-adjusted the difference in restricted mean survival time: 16.42 months, 95% CI, 10.83-21.22), which surpassed other models and National Comprehensive Cancer Network guidelines. No survival benefit for chemoradiotherapy was seen for patients not recommended to receive this treatment. Self-Normalizing Balanced individual treatment effect for survival data model effectively identifies elderly HNSCC patients who could benefit from chemoradiotherapy, offering personalized survival predictions and treatment recommendations. The practical application will become a reality with further validation in clinical settings.
PMID:39287264 | DOI:10.1097/MD.0000000000039659
A deep learning-based method enables the automatic and accurate assembly of chromosome-level genomes
Nucleic Acids Res. 2024 Sep 17:gkae789. doi: 10.1093/nar/gkae789. Online ahead of print.
ABSTRACT
The application of high-throughput chromosome conformation capture (Hi-C) technology enables the construction of chromosome-level assemblies. However, the correction of errors and the anchoring of sequences to chromosomes in the assembly remain significant challenges. In this study, we developed a deep learning-based method, AutoHiC, to address the challenges in chromosome-level genome assembly by enhancing contiguity and accuracy. Conventional Hi-C-aided scaffolding often requires manual refinement, but AutoHiC instead utilizes Hi-C data for automated workflows and iterative error correction. When trained on data from 300+ species, AutoHiC demonstrated a robust average error detection accuracy exceeding 90%. The benchmarking results confirmed its significant impact on genome contiguity and error correction. The innovative approach and comprehensive results of AutoHiC constitute a breakthrough in automated error detection, promising more accurate genome assemblies for advancing genomics research.
PMID:39287126 | DOI:10.1093/nar/gkae789
Regional, rural and remote medicine attracts students with a similar approach to learning in both the Northern and Southern hemisphere
Int J Circumpolar Health. 2024 Dec;83(1):2404274. doi: 10.1080/22423982.2024.2404274. Epub 2024 Sep 16.
ABSTRACT
Doctors who work in areas of workforce shortage, such as regional, rural and remote areas or areas of low socioeconomic means need to be more self-motivated, adaptable and self-directed than their metropolitan counterparts. This study aimed to examine the goal orientation and learning characteristics of students recruited into two medical programmes, one from the Northern hemisphere and one from the Southern hemisphere; both with a commitment to producing doctors to practice medicine in rural locations. Three survey tools were administered to 263 medical students: 1. achievement goal orientation survey; 2. learning characteristics survey and 3. the study process questionnaire. Medical students from both cohorts showed a learning goal orientation, which significantly increased with age (P0.007). In terms of learning characteristics, the students from the south had significantly higher scores for curiosity (P0.003), while the northern students had significantly higher scores for methodical (p < 0.001). Both cohorts were similar for adaptability and consciousness. Across the entire student cohort, three of the four learning disposition characteristics were also seen to correlate with learning goal orientation. In both cohorts of medical students deep learning scores exceeded surface learning scores. Selection of students with a learning goal orientation and learning characteristics of curiosity, adaptability and conscientiousness could potentially help students to flourish in rural placement environments.
PMID:39285655 | DOI:10.1080/22423982.2024.2404274
External validation of a deep learning model for automatic segmentation of skeletal muscle and adipose tissue on abdominal computed tomography images
Br J Radiol. 2024 Sep 16:tqae191. doi: 10.1093/bjr/tqae191. Online ahead of print.
ABSTRACT
BACKGROUND: Body composition assessment using computed tomography (CT) images at the L3-level is increasingly applied in cancer research. Robust high-throughput automated segmentation is key to assess large patient cohorts and to support implementation of body composition analysis into routine clinical practice. We trained and externally validated a deep learning neural network (DLNN) to automatically segment L3-CT images.
METHODS: Expert-drawn segmentations of visceral and subcutaneous adipose tissue (VAT/SAT) and skeletal muscle (SM) of L3-CT-images of 3,187 patients undergoing abdominal surgery were used to train a DLNN. The external validation cohort was comprised of 2,535 patients with abdominal cancer. DLNN performance was evaluated with (geometric) Dice Similarity (DS) and Lin's Concordance Correlation Coefficient.
RESULTS: There was a strong concordance between automatic and manual segmentations with median DS for SM, VAT, and SAT of 0.97 (interquartile range, IQR: 0.95-0.98), 0.98 (IQR: 0.95-0.98), and 0.95 (IQR: 0.92-0.97), respectively. Concordance correlations were excellent: SM 0.964 (0.959-0.968), VAT 0.998 (0.998-0.998), and SAT 0.992 (0.991-0.993). Bland-Altman metrics indicated only small and clinically insignificant systematic offsets; SM radiodensity: 0.23 hounsfield units (0.5%), SM: 1.26 cm2.m-2 (2.8%), VAT: -1.02 cm2.m-2 (1.7%), and SAT: 3.24 cm2.m-2 (4.6%).
CONCLUSION: A robustly-performing and independently externally validated DLNN for automated body composition analysis was developed.
ADVANCES IN KNOWLEDGE: CT-based body composition analysis is highly prognostic for long-term overall survival in oncology. This DLNN was succesfully trained and externally validated on several large patient cohorts and will therefore enable large scale population studies and implementation of body composition analysis into clinical practice.
PMID:39286936 | DOI:10.1093/bjr/tqae191
AlphaCRV: a pipeline for identifying accurate binder topologies in mass-modeling with AlphaFold
Bioinform Adv. 2024 Sep 6;4(1):vbae131. doi: 10.1093/bioadv/vbae131. eCollection 2024.
ABSTRACT
MOTIVATION: The speed and accuracy of deep learning-based structure prediction algorithms make it now possible to perform in silico "pull-downs" to identify protein-protein interactions on a proteome-wide scale. However, on such a large scale, existing scoring algorithms are often insufficient to discriminate biologically relevant interactions from false positives.
RESULTS: Here, we introduce AlphaCRV, a Python package that helps identify correct interactors in a one-against-many AlphaFold screen by clustering, ranking, and visualizing conserved binding topologies, based on protein sequence and fold.
AVAILABILITY AND IMPLEMENTATION: AlphaCRV is a Python package for Linux, freely available at https://github.com/strubelab/AlphaCRV.
PMID:39286602 | PMC:PMC11405088 | DOI:10.1093/bioadv/vbae131
Accuracy optimized neural networks do not effectively model optic flow tuning in brain area MSTd
Front Neurosci. 2024 Sep 2;18:1441285. doi: 10.3389/fnins.2024.1441285. eCollection 2024.
ABSTRACT
Accuracy-optimized convolutional neural networks (CNNs) have emerged as highly effective models at predicting neural responses in brain areas along the primate ventral stream, but it is largely unknown whether they effectively model neurons in the complementary primate dorsal stream. We explored how well CNNs model the optic flow tuning properties of neurons in dorsal area MSTd and we compared our results with the Non-Negative Matrix Factorization (NNMF) model, which successfully models many tuning properties of MSTd neurons. To better understand the role of computational properties in the NNMF model that give rise to optic flow tuning that resembles that of MSTd neurons, we created additional CNN model variants that implement key NNMF constraints - non-negative weights and sparse coding of optic flow. While the CNNs and NNMF models both accurately estimate the observer's self-motion from purely translational or rotational optic flow, NNMF and the CNNs with nonnegative weights yield substantially less accurate estimates than the other CNNs when tested on more complex optic flow that combines observer translation and rotation. Despite its poor accuracy, NNMF gives rise to tuning properties that align more closely with those observed in primate MSTd than any of the accuracy-optimized CNNs. This work offers a step toward a deeper understanding of the computational properties and constraints that describe the optic flow tuning of primate area MSTd.
PMID:39286477 | PMC:PMC11403719 | DOI:10.3389/fnins.2024.1441285
AHD: Arabic healthcare dataset
Data Brief. 2024 Aug 22;56:110855. doi: 10.1016/j.dib.2024.110855. eCollection 2024 Oct.
ABSTRACT
With the soaring demand for healthcare systems, chatbots are gaining tremendous popularity and research attention. Numerous language-centric research on healthcare is conducted day by day. Despite significant advances in Arabic Natural Language Processing (NLP), challenges remain in natural language classification and generation due to the lack of suitable datasets. The primary shortcoming of these models is the lack of suitable Arabic datasets for training. To address this, authors introduce a large Arabic Healthcare Dataset (AHD) of textual data. The dataset consists of over 808k questions and answers across 90 categories, offered to the research community for Arabic computational linguistics. Authors anticipate that this rich dataset would make a great aid for a variety of NLP tasks on Arabic textual data, especially for text classification and generation purposes. Authors present the data in raw form. AHD is composed of main dataset scraped from medical website, which is Altibbi website. AHD is made public and freely available at http://data.mendeley.com/datasets/mgj29ndgrk/5.
PMID:39286413 | PMC:PMC11403399 | DOI:10.1016/j.dib.2024.110855
Deep learning-based scoring method of the three-chamber social behaviour test in a mouse model of alcohol intoxication. A comparative analysis of DeepLabCut, commercial automatic tracking and manual scoring
Heliyon. 2024 Aug 28;10(17):e36352. doi: 10.1016/j.heliyon.2024.e36352. eCollection 2024 Sep 15.
ABSTRACT
BACKGROUND: Alcohol consumption and withdrawal alter social behaviour in humans in a sex-dependent manner. The three-chamber test is a widely used paradigm to assess rodents' social behaviour, including sociability and social novelty. Automatic tracking systems are commonly used to score time spent with conspecifics, despite failing to score direct interaction time with conspecifics rather than time in the nearby zone. Thereby, the automatically scored results are usually inaccurate and need manual corrections.
NEW METHOD: New advances in artificial intelligence (AI) have been used recently to analyze complex behaviours. DeepLabCat is a pose-estimation toolkit that allows the tracking of animal body parts. Thus, we used DeepLabCut, to introduce a scoring model of the three-chamber test to investigate alcohol withdrawal effects on social behaviour in mice considering sex and withdrawal periods. We have compared the results of two automatic pose estimation methods: automatic tracking (AnyMaze) and DeepLabCut considering the manual scoring method, the current gold standard.
RESULTS: We have found that the automatic tracking method (AnyMaze) has failed to detect the significance of social deficits in female mice during acute withdrawal. However, tracking the animal's nose using DeepLabCut showed a significant social deficit in agreement with manual scoring. Interestingly, this social deficit was shown only in females during acute and recovered by the protracted withdrawal. DLC and manually scored results showed a higher Spearman correlation coefficient and a lower bias in the Bland-Altman analysis.
CONCLUSION: our approach helps improve the accuracy of scoring the three-chamber test while outperforming commercial automatic tracking systems.
PMID:39286202 | PMC:PMC11403434 | DOI:10.1016/j.heliyon.2024.e36352
Nonrigid registration method for longitudinal chest CT images in COVID-19
Heliyon. 2024 Aug 31;10(17):e37272. doi: 10.1016/j.heliyon.2024.e37272. eCollection 2024 Sep 15.
ABSTRACT
RATIONALE AND OBJECTIVES: To analyze morphological changes in patients with COVID-19-associated pneumonia over time, a nonrigid registration technique is required that reduces differences in respiratory phase and imaging position and does not excessively deform the lesion region. A nonrigid registration method using deep learning was applied for lung field alignment, and its practicality was verified through quantitative evaluation, such as image similarity of whole lung region and image similarity of lesion region, as well as visual evaluation by a physician.
MATERIALS AND METHODS: First, the lung field positions and sizes of the first and second CT images were roughly matched using a classical registration method based on iterative calculations as a preprocessing step. Then, voxel-by-voxel transformation was performed using VoxelMorph, a nonrigid deep learning registration method. As an objective evaluation, the similarity of the images was calculated. To evaluate the invariance of image features in the lesion site, primary statistics and 3D shape features were calculated and statistically analyzed. Furthermore, as a subjective evaluation, the similarity of images and whether nonrigid transformation caused unnatural changes in the shape and size of the lesion region were visually evaluated by a pulmonologist.
RESULTS: The proposed method was applied to 509 patient data points with high image similarity. The variances in histogram characteristics before and after image deformation were confirmed. Visual evaluation confirmed the agreement between the shape and internal structure of the lung field and the natural deformation of the lesion region.
CONCLUSION: The developed nonrigid registration method was shown to be effective for quantitative time series analysis of the lungs.
PMID:39286087 | PMC:PMC11403531 | DOI:10.1016/j.heliyon.2024.e37272
A machine learning approach to predicting dry eye-related signs, symptoms and diagnoses from meibography images
Heliyon. 2024 Aug 13;10(17):e36021. doi: 10.1016/j.heliyon.2024.e36021. eCollection 2024 Sep 15.
ABSTRACT
PURPOSE: To use artificial intelligence to identify relationships between morphological characteristics of the Meibomian glands (MGs), subject factors, clinical outcomes, and subjective symptoms of dry eye.
METHODS: A total of 562 infrared meibography images were collected from 363 subjects (170 contact lens wearers, 193 non-wearers). Subjects were 67.2 % female and were 54.8 % Caucasian. Subjects were 18 years of age or older. A deep learning model was trained to take meibography as input, segment the individual MG in the images, and learn their detailed morphological features. Morphological characteristics were then combined with clinical and symptom data in prediction models of MG function, tear film stability, ocular surface health, and subjective discomfort and dryness. The models were analyzed to identify the most heavily weighted features used by the algorithm for predictions.
RESULTS: MG morphological characteristics were heavily weighted predictors for eyelid notching and vascularization, MG expressate quality and quantity, tear film stability, corneal staining, and comfort and dryness ratings, with accuracies ranging from 65 % to 99 %. Number of visible MG, along with other clinical parameters, were able to predict MG dysfunction, aqueous deficiency and blepharitis with accuracies ranging from 74 % to 85 %.
CONCLUSIONS: Machine learning-derived MG morphological characteristics were found to be important in predicting multiple signs, symptoms, and diagnoses related to MG dysfunction and dry eye. This deep learning method illustrates the rich clinical information that detailed morphological analysis of the MGs can provide, and shows promise in advancing our understanding of the role of MG morphology in ocular surface health.
PMID:39286076 | PMC:PMC11403426 | DOI:10.1016/j.heliyon.2024.e36021
Pushing the Boundaries of Molecular Property Prediction for Drug Discovery with Multitask Learning BERT Enhanced by SMILES Enumeration
Research (Wash D C). 2022 Dec 15;2022:0004. doi: 10.34133/research.0004. eCollection 2022.
ABSTRACT
Accurate prediction of pharmacological properties of small molecules is becoming increasingly important in drug discovery. Traditional feature-engineering approaches heavily rely on handcrafted descriptors and/or fingerprints, which need extensive human expert knowledge. With the rapid progress of artificial intelligence technology, data-driven deep learning methods have shown unparalleled advantages over feature-engineering-based methods. However, existing deep learning methods usually suffer from the scarcity of labeled data and the inability to share information between different tasks when applied to predicting molecular properties, thus resulting in poor generalization capability. Here, we proposed a novel multitask learning BERT (Bidirectional Encoder Representations from Transformer) framework, named MTL-BERT, which leverages large-scale pre-training, multitask learning, and SMILES (simplified molecular input line entry specification) enumeration to alleviate the data scarcity problem. MTL-BERT first exploits a large amount of unlabeled data through self-supervised pretraining to mine the rich contextual information in SMILES strings and then fine-tunes the pretrained model for multiple downstream tasks simultaneously by leveraging their shared information. Meanwhile, SMILES enumeration is used as a data enhancement strategy during the pretraining, fine-tuning, and test phases to substantially increase data diversity and help to learn the key relevant patterns from complex SMILES strings. The experimental results showed that the pretrained MTL-BERT model with few additional fine-tuning can achieve much better performance than the state-of-the-art methods on most of the 60 practical molecular datasets. Additionally, the MTL-BERT model leverages attention mechanisms to focus on SMILES character features essential to target properties for model interpretability.
PMID:39285949 | PMC:PMC11404312 | DOI:10.34133/research.0004
Artificial intelligence for geoscience: Progress, challenges, and perspectives
Innovation (Camb). 2024 Aug 22;5(5):100691. doi: 10.1016/j.xinn.2024.100691. eCollection 2024 Sep 9.
ABSTRACT
This paper explores the evolution of geoscientific inquiry, tracing the progression from traditional physics-based models to modern data-driven approaches facilitated by significant advancements in artificial intelligence (AI) and data collection techniques. Traditional models, which are grounded in physical and numerical frameworks, provide robust explanations by explicitly reconstructing underlying physical processes. However, their limitations in comprehensively capturing Earth's complexities and uncertainties pose challenges in optimization and real-world applicability. In contrast, contemporary data-driven models, particularly those utilizing machine learning (ML) and deep learning (DL), leverage extensive geoscience data to glean insights without requiring exhaustive theoretical knowledge. ML techniques have shown promise in addressing Earth science-related questions. Nevertheless, challenges such as data scarcity, computational demands, data privacy concerns, and the "black-box" nature of AI models hinder their seamless integration into geoscience. The integration of physics-based and data-driven methodologies into hybrid models presents an alternative paradigm. These models, which incorporate domain knowledge to guide AI methodologies, demonstrate enhanced efficiency and performance with reduced training data requirements. This review provides a comprehensive overview of geoscientific research paradigms, emphasizing untapped opportunities at the intersection of advanced AI techniques and geoscience. It examines major methodologies, showcases advances in large-scale models, and discusses the challenges and prospects that will shape the future landscape of AI in geoscience. The paper outlines a dynamic field ripe with possibilities, poised to unlock new understandings of Earth's complexities and further advance geoscience exploration.
PMID:39285902 | PMC:PMC11404188 | DOI:10.1016/j.xinn.2024.100691
A Nanoparticle-Based Artificial Ear for Personalized Classification of Emotions in the Human Voice Using Deep Learning
ACS Appl Mater Interfaces. 2024 Sep 16. doi: 10.1021/acsami.4c13223. Online ahead of print.
ABSTRACT
Artificial intelligence and human-computer interaction advances demand bioinspired sensing modalities capable of comprehending human affective states and speech. However, endowing skin-like interfaces with such intricate perception abilities remains challenging. Here, we have developed a flexible piezoresistive artificial ear (AE) sensor based on gold nanoparticles, which can convert sound signals into electrical signals through changes in resistance. By testing the sensor's performance at both frequency and sound pressure level (SPL), the AE has a frequency response range of 20 Hz to 12 kHz and can sense sound signals from up to 5 m away at a frequency of 1 kHz and an SPL of 126 dB. Furthermore, through deep learning, the device achieves up to 96.9% and 95.0% accuracy in classification and recognition applications for seven emotional and eight urban environmental noises, respectively. Hence, on one hand, our device can monitor the patient's emotional state by their speech, such as sudden yelling and screaming, which can help healthcare workers understand patients' condition in time. On the other hand, the device could also be used for real-time monitoring of noise levels in aircraft, ships, factories, and other high-decibel equipment and environments.
PMID:39285705 | DOI:10.1021/acsami.4c13223
The impact of deep learning image reconstruction of spectral CTU virtual non contrast images for patients with renal stones
Eur J Radiol Open. 2024 Aug 31;13:100599. doi: 10.1016/j.ejro.2024.100599. eCollection 2024 Dec.
ABSTRACT
PURPOSE: To compare image quality and detection accuracy of renal stones between deep learning image reconstruction (DLIR) and Adaptive Statistical Iterative Reconstruction-Veo (ASIR-V) reconstructed virtual non-contrast (VNC) images and true non-contrast (TNC) images in spectral CT Urography (CTU).
METHODS: A retrospective analysis was conducted on images of 70 patients who underwent abdominal-pelvic CTU in TNC phase using non-contrast scan and contrast-enhanced corticomedullary phase (CP) and excretory phase (EP) using spectral scan. The TNC scan was reconstructed using ASIR-V70 % (TNC-AR70), contrast-enhanced scans were reconstructed using AR70, DLIR medium-level (DM), and high-level (DH) to obtain CP-VNC-AR70/DM/DH and EP-VNC-AR70/DM/DH image groups, respectively. CT value, image quality and kidney stones quantification accuracy were measured and compared among groups. The subjective evaluation was independently assessed by two senior radiologists using the 5-point Likert scale for image quality and lesion visibility.
RESULTS: DH images were superior to AR70 and DM images in objective image quality evaluation. There was no statistical difference in the liver and spleen (both P > 0.05), or within 6HU in renal and fat in CT value between VNC and TNC images. EP-VNC-DH had the lowest image noise, highest SNR, and CNR, and VNC-AR70 images had better noise and SNR performance than TNC-AR70 images (all p < 0.05). EP-VNC-DH had the highest subjective image quality, and CP-VNC-DH performed the best in lesion visibility. In stone CT value and volume measurements, there was no statistical difference between VNC and TNC (P > 0.05).
CONCLUSION: The DLIR-reconstructed VNC images in CTU provide better image quality than the ASIR-V reconstructed TNC images and similar quantification accuracy for kidney stones for potential dose savings.The study highlights that deep learning image reconstruction (DLIR)-reconstructed virtual non-contrast (VNC) images in spectral CT Urography (CTU) offer improved image quality compared to traditional true non-contrast (TNC) images, while maintaining similar accuracy in kidney stone detection, suggesting potential dose savings in clinical practice.
PMID:39280122 | PMC:PMC11402413 | DOI:10.1016/j.ejro.2024.100599
Using machine learning to predict carotid artery symptoms from CT angiography: A radiomics and deep learning approach
Eur J Radiol Open. 2024 Aug 31;13:100594. doi: 10.1016/j.ejro.2024.100594. eCollection 2024 Dec.
ABSTRACT
PURPOSE: To assess radiomics and deep learning (DL) methods in identifying symptomatic Carotid Artery Disease (CAD) from carotid CT angiography (CTA) images. We further compare the performance of these novel methods to the conventional calcium score.
METHODS: Carotid CT angiography (CTA) images from symptomatic patients (ischaemic stroke/transient ischaemic attack within the last 3 months) and asymptomatic patients were analysed. Carotid arteries were classified into culprit, non-culprit and asymptomatic. The calcium score was assessed using the Agatston method. 93 radiomic features were extracted from regions-of-interest drawn on 14 consecutive CTA slices. For DL, convolutional neural networks (CNNs) with and without transfer learning were trained directly on CTA slices. Predictive performance was assessed over 5-fold cross validated AUC scores. SHAP and GRAD-CAM algorithms were used for explainability.
RESULTS: 132 carotid arteries were analysed (41 culprit, 41 non-culprit, and 50 asymptomatic). For asymptomatic vs symptomatic arteries, radiomics attained a mean AUC of 0.96(± 0.02), followed by DL 0.86(± 0.06) and then calcium 0.79(± 0.08). For culprit vs non-culprit arteries, radiomics achieved a mean AUC of 0.75(± 0.09), followed by DL 0.67(± 0.10) and then calcium 0.60(± 0.02). For multi-class classification, the mean AUCs were 0.95(± 0.07), 0.79(± 0.05), and 0.71(± 0.07) for radiomics, DL and calcium, respectively. Explainability revealed consistent patterns in the most important radiomic features.
CONCLUSIONS: Our study highlights the potential of novel image analysis techniques in extracting quantitative information beyond calcification in the identification of CAD. Though further work is required, the transition of these novel techniques into clinical practice may eventually facilitate better stroke risk stratification.
PMID:39280120 | PMC:PMC11402422 | DOI:10.1016/j.ejro.2024.100594