Deep learning

Performance evaluation of deep learning image reconstruction algorithm for dual-energy spectral CT imaging: A phantom study

Fri, 2024-02-23 06:00

J Xray Sci Technol. 2024 Feb 21. doi: 10.3233/XST-230333. Online ahead of print.

ABSTRACT

OBJECTIVES: To evaluate the performance of deep learning image reconstruction (DLIR) algorithm in dual-energy spectral CT (DEsCT) as a function of radiation dose and image energy level, in comparison with filtered-back-projection (FBP) and adaptive statistical iterative reconstruction-V (ASIR-V) algorithms.

METHODS: An ACR464 phantom was scanned with DEsCT at four dose levels (3.5 mGy, 5 mGy, 7.5 mGy, and 10 mGy). Virtual monochromatic images were reconstructed at five energy levels (40 keV, 50 keV, 68 keV, 74 keV, and 140 keV) using FBP, 50% and 100% ASIR-V, and DLIR at low (DLIR-L), medium (DLIR-M), and high (DLIR-H) settings. The noise power spectrum (NPS), task-based transfer function (TTF), and detectability index (d') were computed and compared among reconstructions.
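The NPS metric used here can be illustrated with a minimal NumPy sketch: assuming an ensemble of noise-only ROIs extracted from repeated phantom scans, the 2D NPS is the ensemble-averaged squared DFT magnitude of the mean-subtracted ROIs, scaled to physical units. This is the generic textbook estimator, not the authors' exact implementation.

```python
import numpy as np

def estimate_nps_2d(noise_rois, pixel_size_mm):
    """Estimate the 2D noise power spectrum from noise-only ROIs
    of shape (n_rois, ny, nx)."""
    n, ny, nx = noise_rois.shape
    # Remove each ROI's mean so only the noise fluctuations remain.
    centered = noise_rois - noise_rois.mean(axis=(1, 2), keepdims=True)
    # Ensemble-average the squared DFT magnitude, normalised so that
    # integrating the NPS over frequency recovers the noise variance.
    spectra = np.abs(np.fft.fft2(centered)) ** 2
    nps = (pixel_size_mm ** 2 / (nx * ny)) * spectra.mean(axis=0)
    return np.fft.fftshift(nps)  # zero frequency at the center
```

By Parseval's theorem, summing this NPS over all frequency bins (times the squared bin width) returns the noise variance, which is a convenient sanity check on any implementation.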

RESULTS: NPS area and noise increased as keV decreased, with DLIR showing a slower increase than FBP and ASIR-V and DLIR-H having the lowest values. DLIR had the best 40 keV/140 keV noise ratio at the various energy levels; DLIR showed higher TTF(50%) than ASIR-V for all materials, especially for the soft-tissue-like polystyrene insert; and DLIR-M and DLIR-H provided higher d' than DLIR-L, ASIR-V, and FBP at all dose and energy levels. As keV increased, d' increased for the acrylic insert, and the d' values of the 50 keV DLIR-M and DLIR-H images at 3.5 mGy (7.39 and 8.79, respectively) were higher than that (7.20) of the 50 keV ASIR-V50% images at 10 mGy.

CONCLUSIONS: DLIR provides better noise containment for low-keV images in DEsCT and higher TTF(50%) for the polystyrene insert than ASIR-V. DLIR-H has the lowest image noise and highest detectability at all dose and energy levels. DEsCT 50 keV images with DLIR-M and DLIR-H show potential for a 65% dose reduction over ASIR-V50% with higher d'.

PMID:38393883 | DOI:10.3233/XST-230333

Categories: Literature Watch

Lossless compression-based detection of osteoporosis using bone X-ray imaging

Fri, 2024-02-23 06:00

J Xray Sci Technol. 2024 Feb 20. doi: 10.3233/XST-230238. Online ahead of print.

ABSTRACT

BACKGROUND: Digital X-ray imaging is essential for diagnosing osteoporosis, but distinguishing affected patients from healthy individuals using these images remains challenging.

OBJECTIVE: This study introduces a novel method using deep learning to improve osteoporosis diagnosis from bone X-ray images.

METHODS: A dataset of bone X-ray images was analyzed using a newly proposed procedure. This procedure involves segregating the images into regions of interest (ROI) and non-ROI, thereby reducing data redundancy. The images were then processed to enhance both spatial and statistical features. For classification, a Support Vector Machine (SVM) classifier was employed to distinguish between osteoporotic and non-osteoporotic cases.

RESULTS: The proposed method demonstrated a promising Area under the Curve (AUC) of 90.8% in diagnosing osteoporosis, benchmarking favorably against existing techniques. This signifies a high level of accuracy in distinguishing osteoporosis patients from healthy controls.
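The reported AUC can be computed without plotting an ROC curve at all, since the area under the curve equals the Mann-Whitney probability that a randomly chosen positive case outscores a randomly chosen negative one. A minimal, generic sketch (not tied to this paper's pipeline):

```python
import numpy as np

def roc_auc(labels, scores):
    """Area under the ROC curve via the Mann-Whitney statistic:
    the probability that a random positive outscores a random
    negative, counting ties as 1/2."""
    labels = np.asarray(labels, dtype=bool)
    scores = np.asarray(scores, dtype=float)
    pos, neg = scores[labels], scores[~labels]
    greater = (pos[:, None] > neg[None, :]).sum()
    ties = (pos[:, None] == neg[None, :]).sum()
    return (greater + 0.5 * ties) / (len(pos) * len(neg))
```

For example, perfectly separated scores give an AUC of 1.0, and uninformative (identical) scores give 0.5.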

CONCLUSIONS: The proposed method effectively distinguishes between osteoporotic and non-osteoporotic cases using bone X-ray images. By enhancing image features and employing SVM classification, the technique offers a promising tool for efficient and accurate osteoporosis diagnosis.

PMID:38393881 | DOI:10.3233/XST-230238

Categories: Literature Watch

Implemented classification techniques for osteoporosis using deep learning from the perspective of healthcare analytics

Fri, 2024-02-23 06:00

Technol Health Care. 2024 Feb 4. doi: 10.3233/THC-231517. Online ahead of print.

ABSTRACT

BACKGROUND: Osteoporosis is a medical disorder that causes bone tissue to deteriorate and lose density, increasing the risk of fractures. Applying Neural Networks (NN) to analyze medical imaging data and detect the presence or severity of osteoporosis in patients is known as osteoporosis classification using Deep Learning (DL) algorithms. DL algorithms can extract relevant information from bone images and discover intricate patterns that could indicate osteoporosis.

OBJECTIVE: In this research, Deep Convolutional Neural Networks (DCNNs) are used, which have several benefits over conventional ML techniques for image processing. Like their weights, DCNN biases must be initialized carefully: incorrectly initialized biases can affect the network's learning dynamics and hinder the model's ability to converge to an ideal solution.

METHOD: One of the key benefits of DCNNs is the ability to perform Feature Extraction (FE) automatically from raw data, whereas feature learning is a time-consuming procedure in conventional ML algorithms. During the training phase of DCNNs, the network learns to recognize relevant characteristics straight from the data. The Squirrel Search Algorithm (SSA) makes use of a combination of Local Search (LS) and Random Search (RS) techniques inspired by the foraging habits of squirrels.
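The abstract does not give SSA's update rules, but the exploration/exploitation balance it describes can be sketched as a generic random-search plus local-search hybrid. All parameters below (agent count, step decay, perturbation scale) are illustrative assumptions, not the authors' configuration:

```python
import numpy as np

def hybrid_search(objective, bounds, n_iters=300, n_agents=10, seed=0):
    """Minimise `objective` with a simple RS/LS hybrid: agents sample
    the space globally (exploration), then perturb the incumbent best
    solution with a decaying step size (exploitation)."""
    rng = np.random.default_rng(seed)
    lo, hi = np.asarray(bounds[0], float), np.asarray(bounds[1], float)
    best_x = rng.uniform(lo, hi)
    best_f = objective(best_x)
    for t in range(n_iters):
        step = (hi - lo) * (1.0 - t / n_iters)  # shrink local moves over time
        # Global exploration: fresh uniform samples ...
        candidates = [rng.uniform(lo, hi) for _ in range(n_agents)]
        # ... plus local exploitation around the current best.
        candidates += [best_x + rng.normal(0.0, 0.1, size=lo.shape) * step
                       for _ in range(n_agents)]
        for x in candidates:
            x = np.clip(x, lo, hi)
            f = objective(x)
            if f < best_f:
                best_x, best_f = x, f
    return best_x, best_f
```

In the paper's setting, `objective` would score a DCNN under a candidate weight configuration; here a toy sphere function stands in for it.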

RESULTS: The method made it possible to efficiently explore the search space to find prospective values while using promising areas to refine and improve the solutions. Effectively recognizing optimum or nearly optimal solutions depends on balancing exploration and exploitation. The weight in the DCNN is optimized with the help of SSA, which enhances the performance of the classification.

CONCLUSION: The comparative analysis with state-of-the-art techniques shows that the proposed SSA-based DCNN is highly accurate, with 96.57% accuracy.

PMID:38393861 | DOI:10.3233/THC-231517

Categories: Literature Watch

Software that combines deep learning, 3D reconstruction and CFD to analyze the state of carotid arteries from ultrasound imaging

Fri, 2024-02-23 06:00

Technol Health Care. 2024 Feb 7. doi: 10.3233/THC-231306. Online ahead of print.

ABSTRACT

BACKGROUND: Ultrasound is one of the non-invasive techniques that are used in clinical diagnostics of carotid artery disease.

OBJECTIVE: This paper presents software methodology that can be used in combination with this imaging technique to provide additional information about the state of patient-specific artery.

METHODS: Overall three modules are combined within the proposed methodology. A clinical dataset is used within the deep learning module to extract the contours of the carotid artery. This data is then used within the second module to perform the three-dimensional reconstruction of the geometry of the carotid bifurcation and ultimately this geometry is used within the third module, where the hemodynamic analysis is performed. The obtained distributions of hemodynamic quantities enable a more detailed analysis of the blood flow and state of the arterial wall and could be useful to predict further progress of present abnormalities in the carotid bifurcation.

RESULTS: The performance of the deep learning module was demonstrated through the high values of relevant common classification metric parameters. Also, the accuracy of the proposed methodology was shown through the validation of results for the reconstructed parameters against the clinically measured values.

CONCLUSION: The presented methodology could be used in combination with standard clinical ultrasound examination to quickly provide additional quantitative and qualitative information about the state of the patient's carotid bifurcation and thus ensure a treatment that is more adapted to the specific patient.

PMID:38393860 | DOI:10.3233/THC-231306

Categories: Literature Watch

HBNET: A blended ensemble model for the detection of cardiovascular anomalies using phonocardiogram

Fri, 2024-02-23 06:00

Technol Health Care. 2024 Feb 1. doi: 10.3233/THC-231290. Online ahead of print.

ABSTRACT

BACKGROUND: Cardiac diseases are highly detrimental illnesses, responsible for approximately 32% of global mortality [1]. Early diagnosis and prompt treatment can reduce deaths caused by cardiac diseases. In paediatric patients, it is challenging for paediatricians to identify functional murmurs and pathological murmurs from heart sounds.

OBJECTIVE: The study intends to develop a novel blended ensemble model using hybrid deep learning models and softmax regression to classify adult and paediatric heart sounds into five distinct classes, distinguishing itself as a groundbreaking work in this domain. Furthermore, the research aims to create a comprehensive 5-class paediatric phonocardiogram (PCG) dataset. The dataset includes two critical pathological classes, namely atrial septal defects and ventricular septal defects, along with functional murmurs, pathological and normal heart sounds.

METHODS: The work proposes a blended ensemble model (HbNet-Heartbeat Network) comprising two hybrid models, CNN-BiLSTM and CNN-LSTM, as base models and Softmax regression as meta-learner. HbNet leverages the strengths of base models and improves the overall PCG classification accuracy. Mel Frequency Cepstral Coefficients (MFCC) capture the crucial audio signal characteristics relevant to the classification. The amalgamation of these two deep learning structures enhances the precision and reliability of PCG classification, leading to improved diagnostic results.
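The blending step described above, where base-model outputs feed a softmax-regression meta-learner, can be sketched in NumPy. The shapes and training hyperparameters below are illustrative assumptions; the base models here are represented only by their stacked class-probability outputs:

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def fit_softmax_meta_learner(base_probs, labels, n_classes,
                             lr=0.5, epochs=300):
    """Fit a softmax-regression meta-learner on the concatenated
    class-probability outputs of the base models
    (base_probs: n_samples x n_features)."""
    n, d = base_probs.shape
    W, b = np.zeros((d, n_classes)), np.zeros(n_classes)
    Y = np.eye(n_classes)[labels]          # one-hot targets
    for _ in range(epochs):
        P = softmax(base_probs @ W + b)    # blended class probabilities
        grad = (P - Y) / n                 # cross-entropy gradient
        W -= lr * base_probs.T @ grad
        b -= lr * grad.sum(axis=0)
    return W, b

def predict(base_probs, W, b):
    return softmax(base_probs @ W + b).argmax(axis=1)
```

In the full HbNet pipeline the `base_probs` columns would come from the CNN-BiLSTM and CNN-LSTM base models evaluated on MFCC features.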

RESULTS: The HbNet model exhibited excellent results with an average accuracy of 99.72% and sensitivity of 99.3% on an adult dataset, surpassing all the existing state-of-the-art works. The researchers have validated the reliability of the HbNet model by testing it on a real-time paediatric dataset. The paediatric model's accuracy is 86.5%. HbNet detected functional murmur with 100% precision.

CONCLUSION: The results indicate that the HbNet model exhibits a high level of efficacy in the early detection of cardiac disorders. Results also imply that HbNet has the potential to serve as a valuable tool for the development of decision-support systems that aid medical practitioners in confirming their diagnoses. This method makes it easier for medical professionals to diagnose and initiate prompt treatment while performing preliminary auscultation and reduces unnecessary echocardiograms.

PMID:38393859 | DOI:10.3233/THC-231290

Categories: Literature Watch

A Soft Sensor for Multirate Quality Variables Based on MC-CNN

Fri, 2024-02-23 06:00

IEEE Trans Neural Netw Learn Syst. 2024 Feb 23;PP. doi: 10.1109/TNNLS.2024.3360030. Online ahead of print.

ABSTRACT

In recent years, data-driven soft sensor modeling methods have been widely used in industrial production, chemistry, and biochemistry. In industrial processes, the sampling rates of quality variables are always lower than those of process variables. Meanwhile, the sampling rates also differ among the quality variables themselves. However, few multi-input multi-output (MIMO) soft sensors take this temporal factor into consideration. To solve this problem, a deep-learning (DL) model based on a multitemporal channels convolutional neural network (MC-CNN) is proposed. In the MC-CNN, the network consists of two parts: a shared network used to extract temporal features and a parallel prediction network used to predict each quality variable. A modified backpropagation (BP) algorithm excludes the blank values generated at unsampled moments from the BP process during training. The effectiveness of the proposed method is verified by predicting multiple quality variables in two industrial cases.
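The modified-BP idea, excluding blank values at unsampled moments from both the loss and its gradient, can be sketched with a NaN-masked MSE. This is a generic formulation of the masking trick, not the authors' network code:

```python
import numpy as np

def masked_mse_and_grad(y_pred, y_true):
    """MSE over sampled targets only; NaN marks unsampled moments.
    The returned gradient is zero at unsampled positions, so those
    entries contribute nothing to backpropagation."""
    mask = ~np.isnan(y_true)
    diff = np.where(mask, y_pred - np.nan_to_num(y_true), 0.0)
    n_obs = mask.sum()
    loss = (diff ** 2).sum() / n_obs
    grad = 2.0 * diff / n_obs  # exactly zero where the target is blank
    return loss, grad
```

In a deep-learning framework the same effect is obtained by multiplying the elementwise loss by the sampling mask before reduction.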

PMID:38393838 | DOI:10.1109/TNNLS.2024.3360030

Categories: Literature Watch

NeuroIGN: Explainable Multimodal Image-Guided System for Precise Brain Tumor Surgery

Fri, 2024-02-23 06:00

J Med Syst. 2024 Feb 23;48(1):25. doi: 10.1007/s10916-024-02037-3.

ABSTRACT

Precise neurosurgical guidance is critical for successful brain surgeries and plays a vital role in all phases of image-guided neurosurgery (IGN). Neuronavigation software enables real-time tracking of surgical tools, ensuring their presentation with high precision in relation to a virtual patient model. Therefore, this work focuses on the development of a novel multimodal IGN system, leveraging deep learning and explainable AI to enhance brain tumor surgery outcomes. The study establishes the clinical and technical requirements of the system for brain tumor surgeries. NeuroIGN adopts a modular architecture, including brain tumor segmentation, patient registration, and explainable output prediction, and integrates open-source packages into an interactive neuronavigational display. The NeuroIGN system components underwent validation and evaluation in both laboratory and simulated operating room (OR) settings. Experimental results demonstrated its accuracy in tumor segmentation and the success of ExplainAI in increasing the trust of medical professionals in deep learning. The proposed system was successfully assembled and set up within 11 min in a pre-clinical OR setting with a tracking accuracy of 0.5 (± 0.1) mm. NeuroIGN was also evaluated as highly useful, with a high frame rate (19 FPS) and real-time ultrasound imaging capabilities. In conclusion, this paper describes not only the development of an open-source multimodal IGN system but also demonstrates the innovative application of deep learning and explainable AI algorithms in enhancing neuronavigation for brain tumor surgeries. By seamlessly integrating pre- and intra-operative patient image data with cutting-edge interventional devices, our experiments underscore the potential for deep learning models to improve the surgical treatment of brain tumors and long-term post-operative outcomes.

PMID:38393660 | DOI:10.1007/s10916-024-02037-3

Categories: Literature Watch

Convolutional neural networks combined with classification algorithms for the diagnosis of periodontitis

Fri, 2024-02-23 06:00

Oral Radiol. 2024 Feb 23. doi: 10.1007/s11282-024-00739-5. Online ahead of print.

ABSTRACT

OBJECTIVES: We aim to develop a deep learning model based on a convolutional neural network (CNN) combined with a classification algorithm (CA) to assist dentists in quickly and accurately diagnosing the stage of periodontitis.

MATERIALS AND METHODS: Periapical radiographs (PERs) and clinical data were collected. CNNs including Alexnet, VGG16, and ResNet18 were trained on the PERs to establish PER-CNN models for no periodontal bone loss (PBL) and PBL. CAs including random forest (RF), support vector machine (SVM), naive Bayes (NB), logistic regression (LR), and k-nearest neighbor (KNN) were added to the PER-CNN models to classify control, stage I, stage II, and stage III/IV periodontitis. Heat maps were produced using a gradient-weighted class activation mapping method to visualize the regions of interest of the PER-Alexnet model. Clustering analysis was performed based on the ten PER-CNN scores and the clinical characteristics.

RESULTS: The accuracies of the PER-Alexnet and PER-VGG16 models, which had the higher performance, were 0.872 and 0.853, respectively. The accuracy of the best-performing PER-Alexnet + RF model for control, stage I, stage II, and stage III/IV was 0.968, 0.960, 0.835, and 0.842, respectively. The heat maps showed that the regions of interest predicted by the model were periodontal bone lesions. We found that age and smoking were significantly related to periodontitis based on the PER-Alexnet scores.

CONCLUSION: The PER-Alexnet + RF model has reached high performance for whole-case periodontal diagnosis. The CNN models combined with CA can assist dentists in quickly and accurately diagnosing the stage of periodontitis.

PMID:38393548 | DOI:10.1007/s11282-024-00739-5

Categories: Literature Watch

Integrated machine learning and deep learning for predicting diabetic nephropathy model construction, validation, and interpretability

Fri, 2024-02-23 06:00

Endocrine. 2024 Feb 23. doi: 10.1007/s12020-024-03735-1. Online ahead of print.

ABSTRACT

OBJECTIVE: To construct a risk prediction model for assisted diagnosis of Diabetic Nephropathy (DN) using machine learning algorithms, and to validate it internally and externally.

METHODS: First, the data were cleaned and augmented, then divided into training and test sets in a 7:3 ratio. Variables related to DN were then selected by difference analysis, Least Absolute Shrinkage and Selection Operator (LASSO), Recursive Feature Elimination (RFE), and Max-relevance and Min-redundancy (MRMR) algorithms. Ten machine learning models were constructed based on the key variables. The best model was selected using the Receiver Operating Characteristic (ROC) curve, Precision-Recall (PR) curve, Accuracy, Matthews Correlation Coefficient (MCC), and Kappa, and was validated internally and externally. Based on the best model, an online platform was constructed.
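As a rough illustration of the MRMR step, here is a greedy selector that uses absolute Pearson correlation as a stand-in for both the relevance and redundancy measures. Real MRMR implementations typically use mutual information; the correlation proxy is an assumption made for brevity:

```python
import numpy as np

def mrmr_select(X, y, k):
    """Greedy Max-relevance Min-redundancy feature selection using
    absolute Pearson correlation as the relevance/redundancy proxy."""
    n_features = X.shape[1]
    # Relevance of each feature to the target.
    corr_y = np.array([abs(np.corrcoef(X[:, j], y)[0, 1])
                       for j in range(n_features)])
    selected = [int(np.argmax(corr_y))]
    while len(selected) < k:
        best_j, best_score = None, -np.inf
        for j in range(n_features):
            if j in selected:
                continue
            # Penalise redundancy with already-selected features.
            redundancy = np.mean([abs(np.corrcoef(X[:, j], X[:, s])[0, 1])
                                  for s in selected])
            score = corr_y[j] - redundancy
            if score > best_score:
                best_j, best_score = j, score
        selected.append(best_j)
    return selected
```

On synthetic data the redundancy penalty visibly matters: a duplicate of an already-chosen feature is skipped in favour of a less relevant but independent one.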

RESULTS: Fifteen key variables were selected, and among the ten machine learning models, the Random Forest model achieved the best predictive performance. In the test set, the area under the ROC curve was 0.912, and in two external validation cohorts, the areas under the ROC curve were 0.828 and 0.863, indicating excellent predictive and generalization abilities.

CONCLUSION: The model has a good predictive value and is expected to help in the early diagnosis and screening of clinical DN.

PMID:38393509 | DOI:10.1007/s12020-024-03735-1

Categories: Literature Watch

Improved detection of cholesterol gallstones using quasi-material decomposition images generated from single-energy computed tomography images via deep learning

Fri, 2024-02-23 06:00

Radiol Phys Technol. 2024 Feb 23. doi: 10.1007/s12194-024-00783-0. Online ahead of print.

ABSTRACT

In this study, we developed a method for generating quasi-material decomposition (quasi-MD) images from single-energy computed tomography (SECT) images using a deep convolutional neural network (DCNN). Our aim was to improve the detection of cholesterol gallstones and to determine the clinical utility of quasi-MD images. Four thousand pairs of virtual monochromatic images (70 keV) and MD images (fat/water) of the same section, obtained via dual-energy computed tomography (DECT), were used to train the DCNN. The trained DCNN can automatically generate quasi-MD images from SECT images. Additional SECT images were obtained from 70 patients (40 with and 30 without cholesterol gallstones) to generate quasi-MD images for testing. The presence of gallstones in this dataset was confirmed by ultrasonography. We conducted a receiver operating characteristic (ROC) observer study with three radiologists to validate the clinical utility of the quasi-MD images for detecting cholesterol gallstones. The mean area under the ROC curve for the detection of cholesterol gallstones improved from 0.867 to 0.921 (p = 0.001) when quasi-MD images were added to SECT images, demonstrating the clinical utility of quasi-MD imaging for this task. This study showed that the lesion detection capability of images obtained from SECT can be improved using a DCNN trained with DECT images from high-end computed tomography systems.

PMID:38393491 | DOI:10.1007/s12194-024-00783-0

Categories: Literature Watch

Utilizing fully-automated 3D organ segmentation for hepatic steatosis assessment with CT attenuation-based parameters

Fri, 2024-02-23 06:00

Eur Radiol. 2024 Feb 23. doi: 10.1007/s00330-024-10660-4. Online ahead of print.

ABSTRACT

OBJECTIVES: To investigate the clinical utility of fully-automated 3D organ segmentation in assessing hepatic steatosis on pre-contrast and post-contrast CT images using magnetic resonance spectroscopy (MRS)-proton density fat fraction (PDFF) as reference standard.

MATERIALS AND METHODS: This retrospective study analyzed 362 adult potential living liver donors with abdominal CT scans and MRS-PDFF. Using a deep learning-based tool, mean volumetric CT attenuation of the liver and spleen were measured on pre-contrast (liver(L)_pre and spleen(S)_pre) and post-contrast (L_post and S_post) images. Agreements between volumetric and manual region-of-interest (ROI)-based measurements were assessed using the intraclass correlation coefficient (ICC) and Bland-Altman analysis. Diagnostic performances of volumetric parameters (L_pre, liver-minus-spleen (L-S)_pre, L_post, and L-S_post) were evaluated for detecting MRS-PDFF ≥ 5% and ≥ 10% using receiver operating characteristic (ROC) curve analysis and compared with those of ROI-based parameters.
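The Bland-Altman analysis used above reduces to the bias (mean difference) between two measurement methods and its 95% limits of agreement, bias ± 1.96 × SD of the differences. A minimal generic sketch:

```python
import numpy as np

def bland_altman(a, b):
    """Bland-Altman agreement statistics between two measurement
    methods: mean difference (bias) and 95% limits of agreement."""
    a, b = np.asarray(a, float), np.asarray(b, float)
    diff = a - b
    bias = diff.mean()
    sd = diff.std(ddof=1)  # sample SD of the paired differences
    return bias, (bias - 1.96 * sd, bias + 1.96 * sd)
```

Applied to volumetric versus ROI-based attenuation measurements, a narrow interval around a small bias (as reported here, e.g. -4.2 HU for L_pre) indicates the two methods can be used interchangeably.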

RESULTS: Among the 362 subjects, 105 and 35 had hepatic steatosis with MRS-PDFF ≥ 5% and ≥ 10%, respectively. Volumetric and ROI-based measurements revealed ICCs of 0.974, 0.825, 0.992, and 0.962, with mean differences of -4.2 HU, -3.4 HU, -1.2 HU, and -7.7 HU for L_pre, S_pre, L_post, and S_post, respectively. Volumetric L_pre, L-S_pre, L_post, and L-S_post yielded areas under the ROC curve of 0.813, 0.813, 0.734, and 0.817 for MRS-PDFF ≥ 5%, and 0.901, 0.915, 0.818, and 0.868 for MRS-PDFF ≥ 10%, comparable with those of ROI-based parameters (0.735-0.818 and 0.816-0.895, respectively; Ps = 0.228-0.911).

CONCLUSION: Automated 3D segmentation of the liver and spleen in CT scans can provide volumetric CT attenuation-based parameters to detect and grade hepatic steatosis, applicable to pre-contrast and post-contrast images.

CLINICAL RELEVANCE STATEMENT: Volumetric CT attenuation-based parameters of the liver and spleen, obtained through automated segmentation tools from pre-contrast or post-contrast CT scans, can efficiently detect and grade hepatic steatosis, making them applicable for large population data collection.

KEY POINTS: • Automated organ segmentation enables the extraction of CT attenuation-based parameters for the target organ. • Volumetric liver and spleen CT attenuation-based parameters are highly accurate in hepatic steatosis assessment. • Automated CT measurements from pre- or post-contrast imaging show promise for hepatic steatosis screening in large cohorts.

PMID:38393403 | DOI:10.1007/s00330-024-10660-4

Categories: Literature Watch

Identifying Diabetic Retinopathy in the Human Eye: A Hybrid Approach Based on a Computer-Aided Diagnosis System Combined with Deep Learning

Fri, 2024-02-23 06:00

Tomography. 2024 Feb 5;10(2):215-230. doi: 10.3390/tomography10020017.

ABSTRACT

Diagnosing and screening for diabetic retinopathy (DR) is a well-known issue in the biomedical field. One component of computer-aided diagnosis, which has advanced significantly over the past few years as a result of the development and effectiveness of deep learning, is the use of medical imagery of a patient's eye to identify damage to the blood vessels. Issues with unbalanced datasets, incorrect annotations, a lack of sample images, and improper performance evaluation measures have negatively impacted the performance of deep learning models. Using three benchmark diabetic retinopathy datasets, we conducted a detailed comparison of various state-of-the-art approaches for addressing the effects of class imbalance, with precision scores of 93%, 89%, 81%, 76%, and 96% for the normal, mild, moderate, severe, and DR phases, respectively. The analyses of the hybrid modeling, including CNN analysis and SHAP model derivation results, are compared at the end of the paper, and ideal hybrid modeling strategies for deep learning classification models for automated DR detection are identified.

PMID:38393285 | DOI:10.3390/tomography10020017

Categories: Literature Watch

Ref-MEF: Reference-Guided Flexible Gated Image Reconstruction Network for Multi-Exposure Image Fusion

Fri, 2024-02-23 06:00

Entropy (Basel). 2024 Feb 3;26(2):139. doi: 10.3390/e26020139.

ABSTRACT

Multi-exposure image fusion (MEF) is a computational approach that amalgamates multiple images, each captured at varying exposure levels, into a single high-quality image that faithfully encapsulates the visual information from all the contributing images. Deep learning-based MEF methodologies often confront obstacles due to the inherent inflexibility of neural network structures, presenting difficulties in dynamically handling an unpredictable number of exposure inputs. In response to this challenge, we introduce Ref-MEF, a method for color image multi-exposure fusion guided by a reference image and designed to deal with an uncertain number of inputs. We establish a reference-guided exposure correction (REC) module based on channel attention and spatial attention, which can correct input features and enhance pre-extraction features. The exposure-guided feature fusion (EGFF) module combines original image information and uses Gaussian filter weights for feature fusion while keeping the feature dimensions constant. The image reconstruction is completed through a gated context aggregation network (GCAN) and global residual learning (GRL). Our refined loss function incorporates gradient fidelity, producing high-dynamic-range images that are rich in detail and demonstrate superior visual quality. In evaluation metrics focused on image features, our method exhibits significant superiority and leads in holistic assessments as well. It is worth emphasizing that as the number of input images increases, our algorithm exhibits notable computational efficiency.

PMID:38392394 | DOI:10.3390/e26020139

Categories: Literature Watch

Adversarial Robustness with Partial Isometry

Fri, 2024-02-23 06:00

Entropy (Basel). 2024 Jan 24;26(2):103. doi: 10.3390/e26020103.

ABSTRACT

Despite their remarkable performance, deep learning models still lack robustness guarantees, particularly in the presence of adversarial examples. This significant vulnerability raises concerns about their trustworthiness and hinders their deployment in critical domains that require certified levels of robustness. In this paper, we introduce an information geometric framework to establish precise robustness criteria for l2 white-box attacks in a multi-class classification setting. We endow the output space with the Fisher information metric and derive criteria on the input-output Jacobian to ensure robustness. We show that model robustness can be achieved by constraining the model to be partially isometric around the training points. We evaluate our approach using MNIST and CIFAR-10 datasets against adversarial attacks, revealing its substantial improvements over defensive distillation and Jacobian regularization for medium-sized perturbations and its superior robustness performance to adversarial training for large perturbations, all while maintaining the desired accuracy.
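Constraining the input-output Jacobian, as described above, presupposes a way to measure its largest singular value at a given input. A generic finite-difference power-iteration estimator is sketched below; this is an illustrative tool, not the paper's information-geometric criterion:

```python
import numpy as np

def jacobian_spectral_norm(f, x, n_iters=50, eps=1e-6, seed=0):
    """Estimate the largest singular value of the Jacobian of `f` at `x`
    via power iteration on J^T J, using finite-difference
    Jacobian-vector products (no autodiff required)."""
    rng = np.random.default_rng(seed)
    v = rng.normal(size=x.shape)
    v /= np.linalg.norm(v)
    f0 = f(x)
    for _ in range(n_iters):
        # J v via a forward finite difference.
        jv = (f(x + eps * v) - f0) / eps
        # J^T (J v), built column-by-column for clarity.
        jt_jv = np.array([
            (f(x + eps * e) - f0) @ jv / eps
            for e in np.eye(x.size)
        ])
        v = jt_jv / np.linalg.norm(jt_jv)
    jv = (f(x + eps * v) - f0) / eps
    return np.linalg.norm(jv)
```

For a linear map the estimate matches the top singular value of its matrix, which makes the routine easy to validate before applying it to a trained network around its training points.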

PMID:38392358 | DOI:10.3390/e26020103

Categories: Literature Watch

Semantic Communication: A Survey of Its Theoretical Development

Fri, 2024-02-23 06:00

Entropy (Basel). 2024 Jan 24;26(2):102. doi: 10.3390/e26020102.

ABSTRACT

In recent years, semantic communication has received significant attention from both academia and industry, driven by the growing demands for ultra-low latency and high-throughput capabilities in emerging intelligent services. Nonetheless, a comprehensive and effective theoretical framework for semantic communication has yet to be established. In particular, finding the fundamental limits of semantic communication, exploring the capabilities of semantic-aware networks, or utilizing theoretical guidance for deep learning in semantic communication are very important yet still unresolved issues. In general, the mathematical theory of semantic communication and the mathematical representation of semantics are referred to as semantic information theory. In this paper, we introduce the pertinent advancements in semantic information theory. Grounded in the foundational work of Claude Shannon, we present the latest developments in semantic entropy, semantic rate-distortion, and semantic channel capacity. Additionally, we analyze some open problems in semantic information measurement and semantic coding, providing a theoretical basis for the design of a semantic communication system. Furthermore, we carefully review several mathematical theories and tools and evaluate their applicability in the context of semantic communication. Finally, we shed light on the challenges encountered in both semantic communication and semantic information theory.

PMID:38392357 | DOI:10.3390/e26020102

Categories: Literature Watch

Automatic Vertebral Rotation Angle Measurement of 3D Vertebrae Based on an Improved Transformer Network

Fri, 2024-02-23 06:00

Entropy (Basel). 2024 Jan 23;26(2):97. doi: 10.3390/e26020097.

ABSTRACT

The measurement of vertebral rotation angles serves as a crucial parameter in spinal assessments, particularly in understanding conditions such as idiopathic scoliosis. Historically, these angles were calculated from 2D CT images. However, such 2D techniques fail to comprehensively capture the intricate three-dimensional deformities inherent in spinal curvatures. To overcome the limitations of manual measurements and 2D imaging, we introduce an entirely automated approach for quantifying vertebral rotation angles using a three-dimensional vertebral model. Our method involves refining a point cloud segmentation network based on a transformer architecture. This enhanced network segments the three-dimensional vertebral point cloud, allowing for accurate measurement of vertebral rotation angles. In contrast to conventional network methodologies, our approach exhibits notable improvements in segmenting vertebral datasets. To validate our approach, we compare our automated measurements with angles derived from prevalent manual labeling techniques. The analysis, conducted through Bland-Altman plots and the corresponding intraclass correlation coefficient results, indicates significant agreement between our automated measurement method and manual measurements. The observed high intraclass correlation coefficients (ranging from 0.980 to 0.993) further underscore the reliability of our automated measurement process. Consequently, our proposed method demonstrates substantial potential for clinical applications, showcasing its capacity to provide accurate and efficient vertebral rotation angle measurements.

PMID:38392353 | DOI:10.3390/e26020097

Categories: Literature Watch

NIR spectroscopy-CNN-enabled chemometrics for multianalyte monitoring in microbial fermentation

Fri, 2024-02-23 06:00

Biotechnol Bioeng. 2024 Feb 23. doi: 10.1002/bit.28681. Online ahead of print.

ABSTRACT

As the biopharmaceutical industry looks to implement Industry 4.0, the need for rapid and robust analytical characterization of analytes has become a pressing priority. Spectroscopic tools, like near-infrared (NIR) spectroscopy, are finding increasing use for real-time quantitative analysis. Yet detection of multiple low-concentration analytes in microbial and mammalian cell cultures remains an ongoing challenge, requiring the selection of carefully calibrated, resilient chemometrics for each analyte. The convolutional neural network (CNN) is a powerful tool for processing complex data, making it a potential approach for automatic multivariate spectral processing. This work proposes an inception-module-based two-dimensional (2D) CNN approach (I-CNN) for calibrating multiple analytes using NIR spectral data. The I-CNN model, coupled with orthogonal partial least squares (PLS) preprocessing, converts the NIR spectral data into a 2D data matrix, after which the critical features are extracted, leading to model development for multiple analytes. Escherichia coli fermentation broth was taken as a case study, and calibration models were developed for 23 analytes, including 20 amino acids, glucose, lactose, and acetate. The I-CNN model statistics showed average R2 values of 0.90 for prediction and 0.86 for the external validation data set, and significantly lower root mean square error of prediction values (∼0.52) compared to conventional regression models such as PLS. Preprocessing steps were applied to the I-CNN models to evaluate any improvement in prediction performance. Finally, the model's reliability was assessed via real-time process monitoring and comparison with offline analytics. The proposed I-CNN method is systematic and novel in extracting distinctive spectral features from a multianalyte bioprocess data set and could be adapted to other complex cell culture systems requiring rapid quantification using spectroscopy.

PMID:38390805 | DOI:10.1002/bit.28681

Categories: Literature Watch

Privacy-proof Live Surgery Streaming: Development and Validation of a low-cost, Real-time Robotic Surgery Anonymization Algorithm

Fri, 2024-02-23 06:00

Ann Surg. 2024 Feb 23. doi: 10.1097/SLA.0000000000006245. Online ahead of print.

ABSTRACT

OBJECTIVE: Develop a pioneer surgical anonymization algorithm for reliable and accurate real-time removal of out-of-body images, validated across various robotic platforms.

SUMMARY BACKGROUND DATA / BACKGROUND: The use of surgical video data has become common practice in enhancing research and training. Video sharing requires complete anonymization, which, in the case of endoscopic surgery, entails the removal of all nonsurgical video frames where the endoscope can record the patient or operating room staff. To date, no openly available algorithmic solution for surgical anonymization offers reliable real-time anonymization for video streaming, which is also robotic-platform- and procedure-independent.

METHODS: A dataset of 63 surgical videos of six procedures performed on four robotic systems was annotated for out-of-body sequences. The resulting 496,828 images were used to develop a deep learning algorithm that automatically detects out-of-body frames. Our solution was subsequently benchmarked against existing anonymization methods. In addition, we offer a post-processing step to enhance the performance and test a low-cost setup for real-time anonymization during live surgery streaming.

RESULTS: Framewise anonymization yielded an ROC AUC score of 99.46% on unseen procedures, increasing to 99.89% after post-processing. Our Robotic Anonymization Network (ROBAN) outperforms previous state-of-the-art algorithms even on unseen procedure types, despite those alternative solutions being explicitly trained on such procedures.
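The abstract reports that a post-processing step lifts the framewise AUC from 99.46% to 99.89% but does not describe it. A minimal sketch of one common choice for this kind of cleanup, a temporal median filter over per-frame out-of-body probabilities that suppresses single-frame flicker (an assumed technique, not necessarily ROBAN's exact step):

```python
def smooth_predictions(probs, window=5):
    """Temporal median filter over per-frame out-of-body probabilities.

    Isolated single-frame spikes (a common failure mode of framewise
    classifiers on video) are replaced by the median of their neighbourhood,
    so brief misclassifications do not interrupt the anonymized stream.
    """
    half = window // 2
    smoothed = []
    for i in range(len(probs)):
        lo, hi = max(0, i - half), min(len(probs), i + half + 1)
        neighbourhood = sorted(probs[lo:hi])
        smoothed.append(neighbourhood[len(neighbourhood) // 2])
    return smoothed

# An isolated spike at index 2 is damped by its four in-body neighbours
cleaned = smooth_predictions([0.1, 0.1, 0.9, 0.1, 0.1])
```

For live streaming, a causal variant (window ending at the current frame) would avoid adding lookahead latency, at the cost of reacting a few frames late.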

CONCLUSIONS: Our deep learning model ROBAN offers reliable, accurate, and safe real-time anonymization during complex and lengthy surgical procedures regardless of the robotic platform. The model can be used in real-time for surgical live streaming and is openly available.

PMID:38390732 | DOI:10.1097/SLA.0000000000006245

Categories: Literature Watch

Modularity-Constrained Dynamic Representation Learning for Interpretable Brain Disorder Analysis with Functional MRI

Fri, 2024-02-23 06:00

Med Image Comput Comput Assist Interv. 2023 Oct;14220:46-56. doi: 10.1007/978-3-031-43907-0_5. Epub 2023 Oct 1.

ABSTRACT

Resting-state functional MRI (rs-fMRI) is increasingly used to detect altered functional connectivity patterns caused by brain disorders, thereby facilitating objective quantification of brain pathology. Existing studies typically extract fMRI features using various machine/deep learning methods, but the generated imaging biomarkers are often challenging to interpret. Besides, the brain operates as a modular system with many cognitive/topological modules, where each module contains subsets of densely inter-connected regions-of-interest (ROIs) that are sparsely connected to ROIs in other modules. However, current methods cannot effectively characterize brain modularity. This paper proposes a modularity-constrained dynamic representation learning (MDRL) framework for interpretable brain disorder analysis with rs-fMRI. The MDRL consists of 3 parts: (1) dynamic graph construction, (2) modularity-constrained spatiotemporal graph neural network (MSGNN) for dynamic feature learning, and (3) prediction and biomarker detection. In particular, the MSGNN is designed to learn spatiotemporal dynamic representations of fMRI, constrained by 3 functional modules (i.e., central executive network, salience network, and default mode network). To enhance the discriminative ability of learned features, we encourage the MSGNN to reconstruct the network topology of input graphs. Experimental results on two public datasets and one private dataset with a total of 1,155 subjects validate that our MDRL outperforms several state-of-the-art methods in fMRI-based brain disorder analysis. The detected fMRI biomarkers have good explainability and can potentially be used to improve clinical diagnosis.
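The first MDRL stage is dynamic graph construction from rs-fMRI. A minimal sketch of the standard approach, sliding-window Pearson correlation over ROI time series (the window and stride values are illustrative assumptions; the paper's exact parameters are not given in the abstract):

```python
import numpy as np

def dynamic_graphs(bold, window=30, stride=10):
    """Build a sequence of functional-connectivity graphs from ROI signals.

    bold: (T, R) array of BOLD time series for R regions-of-interest.
    Each window yields an (R, R) correlation matrix serving as the
    adjacency of one graph in the dynamic sequence.
    """
    T, R = bold.shape
    graphs = []
    for start in range(0, T - window + 1, stride):
        segment = bold[start:start + window]          # (window, R)
        graphs.append(np.corrcoef(segment.T))         # (R, R) adjacency
    return np.stack(graphs)                           # (num_windows, R, R)

# Example: 130 time points over 90 ROIs -> 11 connectivity graphs
rng = np.random.default_rng(0)
g = dynamic_graphs(rng.standard_normal((130, 90)))
```

The resulting graph sequence is what a spatiotemporal GNN such as the MSGNN would consume; the modularity constraint would then act on how ROI embeddings group within the three named functional modules.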

PMID:38390374 | PMC:PMC10883232 | DOI:10.1007/978-3-031-43907-0_5

Categories: Literature Watch

Research on the morphological structure of partial fracture healing process in diabetic mice based on synchrotron radiation phase-contrast imaging computed tomography and deep learning

Fri, 2024-02-23 06:00

Bone Rep. 2024 Feb 11;20:101743. doi: 10.1016/j.bonr.2024.101743. eCollection 2024 Mar.

ABSTRACT

The prevalence of diabetes mellitus has surged in recent years, increasing susceptibility to fractures and impeding fracture healing. The primary objective of this investigation is to employ synchrotron radiation phase-contrast imaging computed tomography (SR-PCI-CT) to examine the morphological and structural attributes of different types of callus in a murine model of diabetic partial fractures. Additionally, a deep learning image segmentation model was utilized to facilitate both qualitative and quantitative analysis of callus at various time intervals. Forty male Kunming mice, aged five weeks, were randomly allocated into two groups of twenty: a simple fracture group (SF) and a diabetic fracture group (DF). Mice in the DF group were intraperitoneally injected with 60 mg/kg of 1 % streptozotocin (STZ) solution for 5 consecutive days; the criterion for successful modeling was a fasting blood glucose level ≥11.1 mmol/l one week after the last injection of STZ. Oblique fractures that did not traverse the entire bone were created in the right tibias of all mice. At three, seven, ten and fourteen days after the fracture occurred, the fractured tibias were extracted for SR-PCI-CT imaging and histological analysis. Furthermore, a deep learning image segmentation model was devised to automatically detect, categorize and quantitatively examine different types of callus. Image J software was utilized to measure the grayscale values of different types of callus and perform quantitative analysis. The findings demonstrated that: 1) SR-PCI-CT imaging effectively depicted the morphological attributes of different types of callus during fracture healing, and the grayscale values of different callus types differed significantly (P < 0.01). 2) Compared to the SF group, the DF group exhibited a significant reduction in the total amount of callus during the same period (P < 0.01), and the peak of cartilage callus in the hypertrophic phase was delayed. 3) Histology provided the ground truth for training the deep learning image segmentation models, which achieved accuracies of 0.69, 0.81 and 0.733 for reserve/proliferative cartilage, hypertrophic cartilage and mineralized cartilage, respectively, on the test set, with corresponding Dice values of 0.72, 0.83 and 0.76. In summary, SR-PCI-CT images approach the histological level: multiple cartilage types can be identified on synchrotron radiation CT images in comparison with histological examination, while the deep learning image segmentation model enables automatic analysis and data generation, further supporting the objectivity and accuracy of SR-PCI-CT in identifying various cartilage tissues. Therefore, this imaging technique combined with a deep learning image segmentation model can effectively evaluate the effect of diabetes on the morphological and structural changes of callus during fracture healing in mice.
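The study scores its cartilage segmentation with Dice values (0.72, 0.83 and 0.76). A minimal sketch of the standard Dice coefficient on binary masks, as commonly computed for such evaluations (the function name and empty-mask convention here are illustrative choices):

```python
import numpy as np

def dice(pred, target):
    """Dice coefficient between two binary segmentation masks.

    Dice = 2|P ∩ T| / (|P| + |T|); 1.0 means perfect overlap, 0.0 none.
    Two empty masks are treated as a perfect match by convention.
    """
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    intersection = np.logical_and(pred, target).sum()
    denom = pred.sum() + target.sum()
    return 2.0 * intersection / denom if denom else 1.0

# Example: predicted mask covers two pixels, ground truth one of them
score = dice([[1, 1], [0, 0]], [[1, 0], [0, 0]])  # 2*1 / (2+1) = 2/3
```

In a multi-class setting like the three cartilage types here, Dice would be computed one class at a time by binarizing each label against the rest.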

PMID:38390284 | PMC:PMC10882109 | DOI:10.1016/j.bonr.2024.101743

Categories: Literature Watch

Pages