Deep learning
Deep learning-based metastasis detection in patients with lung cancer to enhance reproducibility and reduce workload in brain metastasis screening with MRI: a multi-center study
Cancer Imaging. 2024 Mar 1;24(1):32. doi: 10.1186/s40644-024-00669-9.
ABSTRACT
OBJECTIVES: To assess whether a deep learning-based system (DLS) with black-blood imaging for brain metastasis (BM) improves the diagnostic workflow in a multi-center setting.
MATERIALS AND METHODS: In this retrospective study, a DLS was developed in 101 patients and validated on 264 consecutive patients (with lung cancer) having newly developed BM from two tertiary university hospitals, which performed black-blood imaging between January 2020 and April 2021. Four neuroradiologists independently evaluated BM either with segmented masks and BM counts provided (with DLS) or not provided (without DLS) on a clinical trial imaging management system (CTIMS). To assess reading reproducibility, BM count agreement between the readers and the reference standard were calculated using limits of agreement (LoA). Readers' workload was assessed with reading time, which was automatically measured on CTIMS, and were compared between with and without DLS using linear mixed models considering the imaging center.
RESULTS: In the validation cohort, the detection sensitivity and positive predictive value of the DLS were 90.2% (95% confidence interval [CI]: 88.1-92.2) and 88.2% (95% CI: 85.7-90.4), respectively. The difference between the readers and the reference counts was larger without DLS (LoA: -0.281, 95% CI: -2.888, 2.325) than with DLS (LoA: -0.163, 95% CI: -2.692, 2.367). The reading time was reduced from mean 66.9 s (interquartile range: 43.2-90.6) to 57.3 s (interquartile range: 33.6-81.0) (P <.001) in the with DLS group, regardless of the imaging center.
CONCLUSION: Deep learning-based BM detection and counting with black-blood imaging improved reproducibility and reduced reading time, on multi-center validation.
PMID:38429843 | DOI:10.1186/s40644-024-00669-9
Improving chemical reaction yield prediction using pre-trained graph neural networks
J Cheminform. 2024 Mar 1;16(1):25. doi: 10.1186/s13321-024-00818-z.
ABSTRACT
Graph neural networks (GNNs) have proven to be effective in the prediction of chemical reaction yields. However, their performance tends to deteriorate when they are trained using an insufficient training dataset in terms of quantity or diversity. A promising solution to alleviate this issue is to pre-train a GNN on a large-scale molecular database. In this study, we investigate the effectiveness of GNN pre-training in chemical reaction yield prediction. We present a novel GNN pre-training method for performance improvement.Given a molecular database consisting of a large number of molecules, we calculate molecular descriptors for each molecule and reduce the dimensionality of these descriptors by applying principal component analysis. We define a pre-text task by assigning a vector of principal component scores as the pseudo-label to each molecule in the database. A GNN is then pre-trained to perform the pre-text task of predicting the pseudo-label for the input molecule. For chemical reaction yield prediction, a prediction model is initialized using the pre-trained GNN and then fine-tuned with the training dataset containing chemical reactions and their yields. We demonstrate the effectiveness of the proposed method through experimental evaluation on benchmark datasets.
PMID:38429787 | DOI:10.1186/s13321-024-00818-z
A comparative study of CNN-capsule-net, CNN-transformer encoder, and Traditional machine learning algorithms to classify epileptic seizure
BMC Med Inform Decis Mak. 2024 Mar 1;24(1):60. doi: 10.1186/s12911-024-02460-z.
ABSTRACT
INTRODUCTION: Epilepsy is a disease characterized by an excessive discharge in neurons generally provoked without any external stimulus, known as convulsions. About 2 million people are diagnosed each year in the world. This process is carried out by a neurological doctor using an electroencephalogram (EEG), which is lengthy.
METHOD: To optimize these processes and make them more efficient, we have resorted to innovative artificial intelligence methods essential in classifying EEG signals. For this, comparing traditional models, such as machine learning or deep learning, with cutting-edge models, in this case, using Capsule-Net architectures and Transformer Encoder, has a crucial role in finding the most accurate model and helping the doctor to have a faster diagnosis.
RESULT: In this paper, a comparison was made between different models for binary and multiclass classification of the epileptic seizure detection database, achieving a binary accuracy of 99.92% with the Capsule-Net model and a multiclass accuracy with the Transformer Encoder model of 87.30%.
CONCLUSION: Artificial intelligence is essential in diagnosing pathology. The comparison between models is helpful as it helps to discard those that are not efficient. State-of-the-art models overshadow conventional models, but data processing also plays an essential role in evaluating the higher accuracy of the models.
PMID:38429718 | DOI:10.1186/s12911-024-02460-z
DeepCG: A cell graph model for predicting prognosis in lung adenocarcinoma
Int J Cancer. 2024 Mar 1. doi: 10.1002/ijc.34901. Online ahead of print.
ABSTRACT
Lung cancer is the first leading cause of cancer-related death in the United States, with lung adenocarcinoma as the major subtype accounting for 40% of all cases. To improve patient survival, image-based prognostic models were developed due to the ready availability of pathological images at diagnosis. However, the application of these models is hampered by two main challenges: the lack of publicly available image datasets with high-quality survival information and the poor interpretability of conventional convolutional neural network models. Here, we integrated matched transcriptomic and H&E staining data from TCGA (The Cancer Genome Atlas) to develop an image-based prognostic model, termed Deep-learning based Cell Graph (DeepCG) model. Instead of survival data, we used a gene signature to predict patient prognostic risks, which was then used as labels for training DeepCG. Importantly, by employing graph structures to capture cell patterns, DeepCG can provide cell-level interpretation, which was more biologically relevant than previous region-level insights. We validated the prognostic values of DeepCG in independent datasets and demonstrated its ability to identify prognostically informative cells in images.
PMID:38429627 | DOI:10.1002/ijc.34901
Auto-BCS: A Hybrid System for Real-Time Breast Cancer Screening from Pathological Images
J Imaging Inform Med. 2024 Mar 1. doi: 10.1007/s10278-024-01056-3. Online ahead of print.
ABSTRACT
Breast cancer is recognized as a prominent cause of cancer-related mortality among women globally, emphasizing the critical need for early diagnosis resulting improvement in survival rates. Current breast cancer diagnostic procedures depend on manual assessments of pathological images by medical professionals. However, in remote or underserved regions, the scarcity of expert healthcare resources often compromised the diagnostic accuracy. Machine learning holds great promise for early detection, yet existing breast cancer screening algorithms are frequently characterized by significant computational demands, rendering them unsuitable for deployment on low-processing-power mobile devices. In this paper, a real-time automated system "Auto-BCS" is introduced that significantly enhances the efficiency of early breast cancer screening. The system is structured into three distinct phases. In the initial phase, images undergo a pre-processing stage aimed at noise reduction. Subsequently, feature extraction is carried out using a lightweight and optimized deep learning model followed by extreme gradient boosting classifier, strategically employed to optimize the overall performance and prevent overfitting in the deep learning model. The system's performance is gauged through essential metrics, including accuracy, precision, recall, F1 score, and inference time. Comparative evaluations against state-of-the-art algorithms affirm that Auto-BCS outperforms existing models, excelling in both efficiency and processing speed. Computational efficiency is prioritized by Auto-BCS, making it particularly adaptable to low-processing-power mobile devices. Comparative assessments confirm the superior performance of Auto-BCS, signifying its potential to advance breast cancer screening technology.
PMID:38429562 | DOI:10.1007/s10278-024-01056-3
Deep Learning Imaging Reconstruction Algorithm for Carotid Dual Energy CT Angiography: Opportunistic Evaluation of Cervical Intervertebral Discs-A Preliminary Study
J Imaging Inform Med. 2024 Mar 1. doi: 10.1007/s10278-024-01016-x. Online ahead of print.
ABSTRACT
Thus, the aim of this study is to evaluate the performance of deep learning imaging reconstruction (DLIR) algorithm in different image sets derived from carotid dual-energy computed tomography angiography (DECTA) for evaluating cervical intervertebral discs (IVDs) and compare them with those reconstructed using adaptive statistical iterative reconstruction-Veo (ASiR-V). Forty-two patients who underwent carotid DECTA were included in this retrospective analysis. Three types of image sets (70 keV, water-iodine, and water-calcium) were reconstructed using 50% ASiR-V and DLIR at medium and high levels (DLIR-M and DLIR-H). The diagnostic acceptability and conspicuity of IVDs were assessed using a 5-point scale. Hounsfield Units (HU) and water concentration (WC) values of the IVDs; standard deviation (SD); and coefficient of variation (CV) were calculated. Measurement parameters of the 50% ASIR-V, DLIR-M, and DLIR-H groups were compared. The DLIR-H group showed higher scores for diagnostic acceptability and conspicuity, as well as lower SD values for HU and WC than the ASiR-V and DLIR-M groups for the 70 keV and water-iodine image sets (all p < .001). However, there was no significant difference in scores and SD among the three groups for the water-calcium image set (all p > .005). The water-calcium image set showed better diagnostic accuracy for evaluating IVDs compared to the other image sets. The inter-rater agreement using ASiR-V, DLIR-M, and DLIR-H was good for the 70 keV image set, excellent for the water-iodine and water-calcium image sets. DLIR improved the visualization of IVDs in the 70 keV and water-iodine image sets. However, its improvement on color-coded water-calcium image set was limited.
PMID:38429560 | DOI:10.1007/s10278-024-01016-x
A Systematic Review on Caries Detection, Classification, and Segmentation from X-Ray Images: Methods, Datasets, Evaluation, and Open Opportunities
J Imaging Inform Med. 2024 Mar 1. doi: 10.1007/s10278-024-01054-5. Online ahead of print.
ABSTRACT
Dental caries occurs from the interaction between oral bacteria and sugars, generating acids that damage teeth over time. The importance of X-ray images for detecting oral problems is undeniable in dentistry. With technological advances, it is feasible to identify these lesions using techniques such as deep learning, machine learning, and image processing. Therefore, the survey and systematization of these methods are essential to determining the main computational approaches for identifying caries in X-ray images. In this systematic review, we investigated the primary computational methods used for classifying, detecting, and segmenting caries in X-ray images. Following the PRISMA methodology, we selected relevant studies and analyzed their methods, strengths, limitations, imaging modalities, evaluation metrics, datasets, and classification techniques. The review encompassed 42 studies retrieved from the Science Direct, IEEExplore, ACM Digital, and PubMed databases from the Computer Science and Health areas. The results indicate that 12% of the included articles utilized public datasets, with deep learning being the predominant approach, accounting for 69% of the studies. The majority of these studies (76%) focused on classifying dental caries, either in binary or multiclass classification. Panoramic imaging was the most commonly used radiographic modality, representing 29% of the cases studied. Overall, our systematic review provides a comprehensive overview of the computational methods employed in identifying caries in radiographic images and highlights trends, patterns, and challenges in this research field.
PMID:38429559 | DOI:10.1007/s10278-024-01054-5
LGDNet: local feature coupling global representations network for pulmonary nodules detection
Med Biol Eng Comput. 2024 Mar 2. doi: 10.1007/s11517-024-03043-w. Online ahead of print.
ABSTRACT
Detection of suspicious pulmonary nodules from lung CT scans is a crucial task in computer-aided diagnosis (CAD) systems. In recent years, various deep learning-based approaches have been proposed and demonstrated significant potential for addressing this task. However, existing deep convolutional neural networks exhibit limited long-range dependency capabilities and neglect crucial contextual information, resulting in reduced performance on detecting small-size nodules in CT scans. In this work, we propose a novel end-to-end framework called LGDNet for the detection of suspicious pulmonary nodules in lung CT scans by fusing local features and global representations. To overcome the limited long-range dependency capabilities inherent in convolutional operations, a dual-branch module is designed to integrate the convolutional neural network (CNN) branch that extracts local features with the transformer branch that captures global representations. To further address the issue of misalignment between local features and global representations, an attention gate module is proposed in the up-sampling stage to selectively combine misaligned semantic data from both branches, resulting in more accurate detection of small-size nodules. Our experiments on the large-scale LIDC dataset demonstrate that the proposed LGDNet with the dual-branch module and attention gate module could significantly improve the nodule detection sensitivity by achieving a final competition performance metric (CPM) score of 89.49%, outperforming the state-of-the-art nodule detection methods, indicating its potential for clinical applications in the early diagnosis of lung diseases.
PMID:38429443 | DOI:10.1007/s11517-024-03043-w
Design and testing of ultrasound probe adapters for a robotic imaging platform
Sci Rep. 2024 Mar 1;14(1):5102. doi: 10.1038/s41598-024-55480-0.
ABSTRACT
Medical imaging-based triage is a critical tool for emergency medicine in both civilian and military settings. Ultrasound imaging can be used to rapidly identify free fluid in abdominal and thoracic cavities which could necessitate immediate surgical intervention. However, proper ultrasound image capture requires a skilled ultrasonography technician who is likely unavailable at the point of injury where resources are limited. Instead, robotics and computer vision technology can simplify image acquisition. As a first step towards this larger goal, here, we focus on the development of prototypes for ultrasound probe securement using a robotics platform. The ability of four probe adapter technologies to precisely capture images at anatomical locations, repeatedly, and with different ultrasound transducer types were evaluated across more than five scoring criteria. Testing demonstrated two of the adapters outperformed the traditional robot gripper and manual image capture, with a compact, rotating design compatible with wireless imaging technology being most suitable for use at the point of injury. Next steps will integrate the robotic platform with computer vision and deep learning image interpretation models to automate image capture and diagnosis. This will lower the skill threshold needed for medical imaging-based triage, enabling this procedure to be available at or near the point of injury.
PMID:38429442 | DOI:10.1038/s41598-024-55480-0
A novel difficult-to-segment samples focusing network for oral CBCT image segmentation
Sci Rep. 2024 Mar 1;14(1):5068. doi: 10.1038/s41598-024-55522-7.
ABSTRACT
Using deep learning technology to segment oral CBCT images for clinical diagnosis and treatment is one of the important research directions in the field of clinical dentistry. However, the blurred contour and the scale difference limit the segmentation accuracy of the crown edge and the root part of the current methods, making these regions become difficult-to-segment samples in the oral CBCT segmentation task. Aiming at the above problems, this work proposed a Difficult-to-Segment Focus Network (DSFNet) for segmenting oral CBCT images. The network utilizes a Feature Capturing Module (FCM) to efficiently capture local and long-range features, enhancing the feature extraction performance. Additionally, a Multi-Scale Feature Fusion Module (MFFM) is employed to merge multiscale feature information. To further improve the loss ratio for difficult-to-segment samples, a hybrid loss function is proposed, combining Focal Loss and Dice Loss. By utilizing the hybrid loss function, DSFNet achieves 91.85% Dice Similarity Coefficient (DSC) and 0.216 mm Average Symmetric Surface Distance (ASSD) performance in oral CBCT segmentation tasks. Experimental results show that the proposed method is superior to current dental CBCT image segmentation techniques and has real-world applicability.
PMID:38429362 | DOI:10.1038/s41598-024-55522-7
Physics-Informed Deep Learning Approach for Reintroducing Atomic Detail in Coarse-Grained Configurations of Multiple Poly(lactic acid) Stereoisomers
J Chem Inf Model. 2024 Mar 1. doi: 10.1021/acs.jcim.3c01870. Online ahead of print.
ABSTRACT
Multiscale modeling of complex molecular systems, such as macromolecules, encompasses methods that combine information from fine and coarse representations of molecules to capture material properties over a wide range of spatiotemporal scales. Being able to exchange information between different levels of resolution is essential for the effective transfer of this information. The inverse problem of reintroducing atomistic degrees of freedom in coarse-grained (CG) molecular configurations is particularly challenging as, from a mathematical point of view, it is an ill-posed problem; the forward mapping from the atomistic to the CG description is typically defined via a deterministic operator ("one-to-one" problem), whereas the reversed mapping from the CG to the atomistic model refers to creating one representative configuration out of many possible ones ("one-to-many" problem). Most of the backmapping methods proposed so far balance accuracy, efficiency, and general applicability. This is particularly important for macromolecular systems with different types of isomers, i.e., molecules that have the same molecular formula and sequence of bonded atoms (constitution) but differ in the three-dimensional configurations of their atoms in space. Here, we introduce a versatile deep learning approach for backmapping multicomponent CG macromolecules with chiral centers, trained to learn structural correlations between polymer configurations at the atomistic level and their corresponding CG descriptions. This method is intended to be simple and flexible while presenting a generic solution for resolution transformation. In addition, the method is aimed to respect the structural features of the molecule, such as local packing, capturing therefore the physical properties of the material. As an illustrative example, we apply the model on linear poly(lactic acid) (PLA) in melt, which is one of the most popular biodegradable polymers. The framework is tested on a number of model systems starting from homopolymer stereoisomers of PLA to copolymers with randomly placed chiral centers. The results demonstrate the efficiency and efficacy of the new approach.
PMID:38427962 | DOI:10.1021/acs.jcim.3c01870
Exponential Capacity of Dense Associative Memories
Phys Rev Lett. 2024 Feb 16;132(7):077301. doi: 10.1103/PhysRevLett.132.077301.
ABSTRACT
Recent generalizations of the Hopfield model of associative memories are able to store a number P of random patterns that grows exponentially with the number N of neurons, P=exp(αN). Besides the huge storage capacity, another interesting feature of these networks is their connection to the attention mechanism which is part of the Transformer architecture widely applied in deep learning. In this work, we study a generic family of pattern ensembles using a statistical mechanics analysis which gives exact asymptotic thresholds for the retrieval of a typical pattern, α_{1}, and lower bounds for the maximum of the load α for which all patterns can be retrieved, α_{c}, as well as sizes of attraction basins. We discuss in detail the cases of Gaussian and spherical patterns, and show that they display rich and qualitatively different phase diagrams.
PMID:38427855 | DOI:10.1103/PhysRevLett.132.077301
Advancing crop classification in smallholder agriculture: A multifaceted approach combining frequency-domain image co-registration, transformer-based parcel segmentation, and Bi-LSTM for crop classification
PLoS One. 2024 Mar 1;19(3):e0299350. doi: 10.1371/journal.pone.0299350. eCollection 2024.
ABSTRACT
Agricultural Remote Sensing has the potential to enhance agricultural monitoring in smallholder economies to mitigate losses. However, its widespread adoption faces challenges, such as diminishing farm sizes, lack of reliable data-sets and high cost related to commercial satellite imagery. This research focuses on opportunities, practices and novel approaches for effective utilization of remote sensing in agriculture applications for smallholder economies. The work entails insights from experiments using datasets representative of major crops during different growing seasons. We propose an optimized solution for addressing challenges associated with remote sensing-based crop mapping in smallholder agriculture farms. Open source tools and data are used for inter and intra-sensor image registration, with a root mean square error of 0.3 or less. We also propose and emphasize on the use of delineated vegetation parcels through Segment Anything Model for Geospatial (SAM-GEOs). Furthermore a Bidirectional-Long Short-Term Memory-based (Bi-LSTM) deep learning model is developed and trained for crop classification, achieving results with accuracy of more than 94% and 96% for validation sets of two data sets collected in the field, during 2 growing seasons.
PMID:38427638 | DOI:10.1371/journal.pone.0299350
Ten simple rules for pushing boundaries of inclusion at academic events
PLoS Comput Biol. 2024 Mar 1;20(3):e1011797. doi: 10.1371/journal.pcbi.1011797. eCollection 2024 Mar.
ABSTRACT
Inclusion at academic events is facing increased scrutiny as the communities these events serve raise their expectations for who can practically attend. Active efforts in recent years to bring more diversity to academic events have brought progress and created momentum. However, we must reflect on these efforts and determine which underrepresented groups are being disadvantaged. Inclusion at academic events is important to ensure diversity of discourse and opinion, to help build networks, and to avoid academic siloing. All of these contribute to the development of a robust and resilient academic field. We have developed these Ten Simple Rules both to amplify the voices that have been speaking out and to celebrate the progress of many Equity, Diversity, and Inclusivity practices that continue to drive the organisation of academic events. The Rules aim to raise awareness as well as provide actionable suggestions and tools to support these initiatives further. This aims to support academic organisations such as the Deep Learning Indaba, Neuromatch Academy, the IBRO-Simons Computational Neuroscience Imbizo, Biodiversity Information Standards (TDWG), Arabs in Neuroscience, FAIRPoints, and OLS (formerly Open Life Science). This article is a call to action for organisers to reevaluate the impact and reach of their inclusive practices.
PMID:38427633 | DOI:10.1371/journal.pcbi.1011797
SAAN: Similarity-aware attention flow network for change detection with VHR remote sensing images
IEEE Trans Image Process. 2024 Mar 1;PP. doi: 10.1109/TIP.2024.3349868. Online ahead of print.
ABSTRACT
Change detection (CD) is a fundamental and important task for monitoring the land surface dynamics in the earth observation field. Existing deep learning-based CD methods typically extract bi-temporal image features using a weight-sharing Siamese encoder network and identify change regions using a decoder network. These CD methods, however, still perform far from satisfactorily as we observe that 1) deep encoder layers focus on irrelevant background regions and 2) the models' confidence in the change regions is inconsistent at different decoder stages. The first problem is because deep encoder layers cannot effectively learn from imbalanced change categories using the sole output supervision, while the second problem is attributed to the lack of explicit semantic consistency preservation. To address these issues, we design a novel similarity-aware attention flow network (SAAN). SAAN incorporates a similarity-guided attention flow module with deeply supervised similarity optimization to achieve effective change detection. Specifically, we counter the first issue by explicitly guiding deep encoder layers to discover semantic relations from bi-temporal input images using deeply supervised similarity optimization. The extracted features are optimized to be semantically similar in the unchanged regions and dissimilar in the changing regions. The second drawback can be alleviated by the proposed similarity-guided attention flow module, which incorporates similarity-guided attention modules and attention flow mechanisms to guide the model to focus on discriminative channels and regions. We evaluated the effectiveness and generalization ability of the proposed method by conducting experiments on a wide range of CD tasks. The experimental results demonstrate that our method achieves excellent performance on several CD tasks, with discriminative features and semantic consistency preserved.
PMID:38427550 | DOI:10.1109/TIP.2024.3349868
Applications of artificial intelligence in biliary tract cancers
Indian J Gastroenterol. 2024 Mar 1. doi: 10.1007/s12664-024-01518-0. Online ahead of print.
ABSTRACT
Biliary tract cancers are malignant neoplasms arising from bile duct epithelial cells. They include cholangiocarcinomas and gallbladder cancer. Gallbladder cancer has a marked geographical preference and is one of the most common cancers in women in northern India. Biliary tract cancers are usually diagnosed at an advanced, unresectable stage. Hence, the prognosis is extremely dismal. The five-year survival rate in advanced gallbladder cancer is < 5%. Hence, early detection and radical surgery are critical to improving biliary tract cancer prognoses. Radiological imaging plays an essential role in diagnosing and managing biliary tract cancers. However, the diagnosis is challenging because the biliary tract is affected by many diseases that may have radiological appearances similar to cancer. Artificial intelligence (AI) can improve radiologists' performance in various tasks. Deep learning (DL)-based approaches are increasingly incorporated into medical imaging to improve diagnostic performance. This paper reviews the AI-based strategies in biliary tract cancers to improve the diagnosis and prognosis.
PMID:38427281 | DOI:10.1007/s12664-024-01518-0
FSMN-Net: a free space matching network based on manifold convolution for optical molecular tomography
Opt Lett. 2024 Mar 1;49(5):1161-1164. doi: 10.1364/OL.512235.
ABSTRACT
Optical molecular tomography (OMT) can monitor glioblastomas in small animals non-invasively. Although deep learning (DL) methods have made remarkable achievements in this field, improving its generalization against diverse reconstruction systems remains a formidable challenge. In this Letter, a free space matching network (FSMN-Net) was presented to overcome the parameter mismatch problem in different reconstruction systems. Specifically, a novel, to the best of our knowledge, manifold convolution operator was designed by considering the mathematical model of OMT as a space matching process. Based on the dynamic domain expansion concept, an end-to-end fully convolutional codec further integrates this operator to realize robust reconstruction with voxel-level accuracy. The results of numerical simulations and in vivo experiments demonstrate that the FSMN-Net can stably generate high-resolution reconstruction volumetric images under different reconstruction systems.
PMID:38426963 | DOI:10.1364/OL.512235
A systematic review on deep learning-based automated cancer diagnosis models
J Cell Mol Med. 2024 Mar;28(6):e18144. doi: 10.1111/jcmm.18144.
ABSTRACT
Deep learning is gaining importance due to its wide range of applications. Many researchers have utilized deep learning (DL) models for the automated diagnosis of cancer patients. This paper provides a systematic review of DL models for automated diagnosis of cancer patients. Initially, various DL models for cancer diagnosis are presented. Five major categories of cancers such as breast, lung, liver, brain and cervical cancer are considered. As these categories of cancers have a very high percentage of occurrences with high mortality rate. The comparative analysis of different types of DL models is drawn for the diagnosis of cancer at early stages by considering the latest research articles from 2016 to 2022. After comprehensive comparative analysis, it is found that most of the researchers achieved appreciable accuracy with implementation of the convolutional neural network model. These utilized the pretrained models for automated diagnosis of cancer patients. Various shortcomings with the existing DL-based automated cancer diagnosis models are also been presented. Finally, future directions are discussed to facilitate further research for automated diagnosis of cancer patients.
PMID:38426930 | DOI:10.1111/jcmm.18144
Effects of Language Differences on Inpatient Fall Detection Using Deep Learning
Stud Health Technol Inform. 2024 Mar 1;310:1584-1585. doi: 10.3233/SHTI231329.
ABSTRACT
This study examined the effects of language differences between Korean and English on the performance of natural language processing in the classification task of identifying inpatient falls from unstructured nursing notes.
PMID:38426881 | DOI:10.3233/SHTI231329
Enhancing music recognition using deep learning-powered source separation technology for cochlear implant users
J Acoust Soc Am. 2024 Mar 1;155(3):1694-1703. doi: 10.1121/10.0025057.
ABSTRACT
Cochlear implant (CI) is currently the vital technological device for assisting deaf patients in hearing sounds and greatly enhances their sound listening appreciation. Unfortunately, it performs poorly for music listening because of the insufficient number of electrodes and inaccurate identification of music features. Therefore, this study applied source separation technology with a self-adjustment function to enhance the music listening benefits for CI users. In the objective analysis method, this study showed that the results of the source-to-distortion, source-to-interference, and source-to-artifact ratios were 4.88, 5.92, and 15.28 dB, respectively, and significantly better than the Demucs baseline model. For the subjective analysis method, it scored higher than the traditional baseline method VIR6 (vocal to instrument ratio, 6 dB) by approximately 28.1 and 26.4 (out of 100) in the multi-stimulus test with hidden reference and anchor test, respectively. The experimental results showed that the proposed method can benefit CI users in identifying music in a live concert, and the personal self-fitting signal separation method had better results than any other default baselines (vocal to instrument ratio of 6 dB or vocal to instrument ratio of 0 dB) did. This finding suggests that the proposed system is a potential method for enhancing the music listening benefits for CI users.
PMID:38426839 | DOI:10.1121/10.0025057