Semantic Web
Electronic Health Record and Semantic Issues Using Fast Healthcare Interoperability Resources: Systematic Mapping Review
J Med Internet Res. 2024 Jan 30;26:e45209. doi: 10.2196/45209.
ABSTRACT
BACKGROUND: The increasing use of electronic health records and the Internet of Things has led to interoperability issues at different levels (structural and semantic). Standards are important not only for successfully exchanging data but also for appropriately interpreting them (semantic interoperability). Thus, to facilitate the semantic interoperability of data exchanged in health care, considerable resources have been deployed to improve the quality of shared clinical data by structuring and mapping them to the Fast Healthcare Interoperability Resources (FHIR) standard.
OBJECTIVE: The aims of this study are 2-fold: to inventory the studies on FHIR semantic interoperability resources and terminologies and to identify and classify the approaches and contributions proposed in these studies.
METHODS: A systematic mapping review (SMR) was conducted using 10 electronic databases as sources of information for inventory and review studies published during 2012 to 2022 on the development and improvement of semantic interoperability using the FHIR standard.
RESULTS: A total of 70 FHIR studies were selected and analyzed to identify FHIR resource types and terminologies from a semantic perspective. The proposed semantic approaches were classified into 6 categories, namely mapping (31/126, 24.6%), terminology services (18/126, 14.3%), resource description framework or web ontology language-based proposals (24/126, 19%), annotation proposals (18/126, 14.3%), machine learning (ML) and natural language processing (NLP) proposals (20/126, 15.9%), and ontology-based proposals (15/126, 11.9%). From 2012 to 2022, there has been continued research in 6 categories of approaches as well as in new and emerging annotations and ML and NLP proposals. This SMR also classifies the contributions of the selected studies into 5 categories: framework or architecture proposals, model proposals, technique proposals, comparison services, and tool proposals. The most frequent type of contribution is the proposal of a framework or architecture to enable semantic interoperability.
CONCLUSIONS: This SMR provides a classification of the different solutions proposed to address semantic interoperability using FHIR at different levels: collecting, extracting and annotating data, modeling electronic health record data from legacy systems, and applying transformation and mapping to FHIR models and terminologies. The use of ML and NLP for unstructured data is promising and has been applied to specific use case scenarios. In addition, terminology services are needed to accelerate their use and adoption; furthermore, techniques and tools to automate annotation and ontology comparison should help reduce human interaction.
PMID:38289660 | PMC:PMC10865191 | DOI:10.2196/45209
Case-Reported Data Management Methodology Using an RDF Data Model for Building a Multicenter Clinical Registry
Stud Health Technol Inform. 2024 Jan 25;310:184-188. doi: 10.3233/SHTI230952.
ABSTRACT
In multicenter clinical research, case-reported clinical data are managed for each research project. Participating institutions manage the mapping between standardized codes and in-house codes. To use the data extracted from electronic medical records in case report forms, it is necessary to pay attention to the gap in the semantic hierarchy. Managing mapping information between in-house and standardized codes is centralized in Resource Description Framework (RDF) stores. The relationship between standardized and in-house codes is described in RDF and stored in RDF stores. RESTful APIs for accessing RDF stores in SPARQL was developed and verified. The relationship between standardized codes and in-house codes of pharmaceuticals was expressed in RDF triples. As a +result of the operational verification of the implemented APIs, it was confirmed that data management with knowledge bases expressed in RDF graphs is possible. The ability to dynamically modify mapping definitions enables flexible data management and ease of operational restrictions.
PMID:38269790 | DOI:10.3233/SHTI230952
Are We Where We Want to Be in Undergraduate Pathology Education?
Turk Patoloji Derg. 2024 Jan 24. doi: 10.5146/tjpath.2023.13048. Online ahead of print.
ABSTRACT
OBJECTIVE: This review which aims to examine the recent and current status of pathology education in medical schools, and covers the publications related to undergraduate pathology education published between 2010 January and June 2023.
MATERIAL AND METHOD: A search was performed through PubMed, Google Scholar, Semantic Scholar, and Ulakbim search engines for the Science Citation Index, Science Citation Index Expanded, Emerging Sources Citation Index, Directory of Open Access Journals, Scopus, PubMed as well as TR Dizin indexed articles. The findings are categorized into two periods as 2010 January - 2020 April (pre-COVID-19 pandemic) and May 2020 - 2023 June. A total of 24 reviews/editorials/letters to the editor and 63 research articles in the pre-pandemic period and 11 reviews/ editorials/ letters to the editor and 35 research articles between 2020 May and 2023 June are included in the analysis.
RESULTS: Currently, medical education generally depends on core education programs with defined learning objectives and outcomes. Moreover, problem-based, case-based, and team-based interactive learning are being used along with traditional didactic courses. Additionally, digital/ web-based/remote education methods have gained prominence after the COVID-19 pandemic. The virtual or augmented reality and 3D drawing applications are offered as a solution for the autopsy and macroscopy courses. A scarce number of publications are found on measuring and evaluating the effectiveness of learning.
CONCLUSION: Artificial intelligence in pathology education is a topic that looks likely to become important in the near future. National and international comprehensive standardization is a necessity. A joint effort and collective intelligence are needed to achieve the desired goals in undergraduate pathology education.
PMID:38265100 | DOI:10.5146/tjpath.2023.13048
Development and quality appraisal of a new English breast screening linked data set as part of the age, test threshold, and frequency of mammography screening (ATHENA-M) study
Br J Radiol. 2024 Jan 23;97(1153):98-112. doi: 10.1093/bjr/tqad023.
ABSTRACT
OBJECTIVES: To build a data set capturing the whole breast cancer screening journey from individual breast cancer screening records to outcomes and assess data quality.
METHODS: Routine screening records (invitation, attendance, test results) from all 79 English NHS breast screening centres between January 1, 1988 and March 31, 2018 were linked to cancer registry (cancer characteristics and treatment) and national mortality data. Data quality was assessed using comparability, validity, timeliness, and completeness.
RESULTS: Screening records were extracted from 76/79 English breast screening centres, 3/79 were not possible due to software issues. Data linkage was successful from 1997 after introduction of a universal identifier for women (NHS number). Prior to 1997 outcome data are incomplete due to linkage issues, reducing validity. Between January 1, 1997 and March 31, 2018, a total of 11 262 730 women were offered screening of whom 9 371 973 attended at least one appointment, with 139 million person-years of follow-up (a median of 12.4 person years for each woman included) with 73 810 breast cancer deaths and 1 111 139 any-cause deaths. Comparability to reference data sets and internal validity were demonstrated. Data completeness was high for core screening variables (>99%) and main cancer outcomes (>95%).
CONCLUSIONS: The ATHENA-M project has created a large high-quality and representative data set of individual women's screening trajectories and outcomes in England from 1997 to 2018, data before 1997 are lower quality.
ADVANCES IN KNOWLEDGE: This is the most complete data set of English breast screening records and outcomes constructed to date, which can be used to evaluate and optimize screening.
PMID:38263823 | DOI:10.1093/bjr/tqad023
Development and application of Chinese medical ontology for diabetes mellitus
BMC Med Inform Decis Mak. 2024 Jan 19;24(1):18. doi: 10.1186/s12911-023-02405-y.
ABSTRACT
OBJECTIVE: To develop a Chinese Diabetes Mellitus Ontology (CDMO) and explore methods for constructing high-quality Chinese biomedical ontologies.
MATERIALS AND METHODS: We used various data sources, including Chinese clinical practice guidelines, expert consensus, literature, and hospital information system database schema, to build the CDMO. We combined top-down and bottom-up strategies and integrated text mining and cross-lingual ontology mapping. The ontology was validated by clinical experts and ontology development tools, and its application was validated through clinical decision support and Chinese natural language medical question answering.
RESULTS: The current CDMO consists of 3,752 classes, 182 fine-grained object properties with hierarchical relationships, 108 annotation properties, and over 12,000 mappings to other well-known medical ontologies in English. Based on the CDMO and clinical practice guidelines, we developed 200 rules for diabetes diagnosis, treatment, diet, and medication recommendations using the Semantic Web Rule Language. By injecting ontology knowledge, CDMO enhances the performance of the T5 model on a real-world Chinese medical question answering dataset related to diabetes.
CONCLUSION: CDMO has fine-grained semantic relationships and extensive annotation information, providing a foundation for medical artificial intelligence applications in Chinese contexts, including the construction of medical knowledge graphs, clinical decision support systems, and automated medical question answering. Furthermore, the development process incorporated natural language processing and cross-lingual ontology mapping to improve the quality of the ontology and improved development efficiency. This workflow offers a methodological reference for the efficient development of other high-quality Chinese as well as non-English medical ontologies.
PMID:38243204 | PMC:PMC10799385 | DOI:10.1186/s12911-023-02405-y
Channel semantic mutual learning for visible-thermal person re-identification
PLoS One. 2024 Jan 19;19(1):e0293498. doi: 10.1371/journal.pone.0293498. eCollection 2024.
ABSTRACT
Visible-infrared person re-identification (VI-ReID) is a cross-modality retrieval issue aiming to match the same pedestrian between visible and infrared cameras. Thus, the modality discrepancy presents a significant challenge for this task. Most methods employ different networks to extract features that are invariant between modalities. While we propose a novel channel semantic mutual learning network (CSMN), which attributes the difference in semantics between modalities to the difference at the channel level, it optimises the semantic consistency between channels from two perspectives: the local inter-channel semantics and the global inter-modal semantics. Meanwhile, we design a channel-level auto-guided double metric loss (CADM) to learn modality-invariant features and the sample distribution in a fine-grained manner. We conducted experiments on RegDB and SYSU-MM01, and the experimental results validate the superiority of CSMN. Especially on RegDB datasets, CSMN improves the current best performance by 3.43% and 0.5% on the Rank-1 score and mINP value, respectively. The code is available at https://github.com/013zyj/CSMN.
PMID:38241236 | PMC:PMC10798514 | DOI:10.1371/journal.pone.0293498
Voxel representation of brain images inpainting via Regional Pixel Semantic Network and pyramidal attention AE - Quantile differential mechanism model
Comput Biol Med. 2024 Mar;170:107767. doi: 10.1016/j.compbiomed.2023.107767. Epub 2023 Nov 28.
ABSTRACT
Medical image inpainting holds significant importance in enhancing the quality of medical images by restoring missing areas, thereby rendering them suitable for diagnostic purposes. While several techniques have been previously proposed for medical image inpainting, they are not suitable for distorted images containing metallic implants due to their limited consideration of known shaped masking. To overcome this limitation, a novel Vectorized Box Interpolation with Arbitrary Auto-Rand Augment Masking technique has been proposed which involves scaling and vectorizing images to expand their details and generating asymmetrically shaped masking in an automatic random format. One of the challenging tasks in this regard is the precise detection of lost regions, which is addressed through the introduction of the Regional Pixel Semantic Network. This technique employs the locally shared features (LSF) based region sensing with FCN (fully convolutional network) segmentation, which performs automatic segmentation based on neighboring pixel local dependency and regional features to determine the location of masked regions. During the reconstruction of missing parts, a significant challenge posed is the inability to recognize proximity in encoding owing to the generation of shadow-like regions on the feature map. To address this issue, a novel Multilayered DRC Regularized Pyramidal Attention AE Model has been proposed which employs dilated convolution with coherent pyramidal attention for feature extraction and improves image resolution using a Laplacian convolutional layer. Moreover, the realness of the generated image is determined using the Quantile Differential Mechanism model, where in the Quantile Differential Partial Convolutional Discriminator utilizes the hyperbolic tangent activation function in the partial convolutional layer to calculate recognition accuracy. As a result, the proposed method achieves high percentages for accuracy (98 %), precision (97 %), sensitivity (96 %), recall (95 %), and F-measure (96 %) thereby outperforming existing methods. Overall, this proposed method effectively handles distorted images with metallic implants, accurately detects lost regions, and improves the reconstructed image quality.
PMID:38215616 | DOI:10.1016/j.compbiomed.2023.107767
Methods and datasets for segmentation of minimally invasive surgical instruments in endoscopic images and videos: A review of the state of the art
Comput Biol Med. 2024 Jan 4;169:107929. doi: 10.1016/j.compbiomed.2024.107929. Online ahead of print.
ABSTRACT
In the field of computer- and robot-assisted minimally invasive surgery, enormous progress has been made in recent years based on the recognition of surgical instruments in endoscopic images and videos. In particular, the determination of the position and type of instruments is of great interest. Current work involves both spatial and temporal information, with the idea that predicting the movement of surgical tools over time may improve the quality of the final segmentations. The provision of publicly available datasets has recently encouraged the development of new methods, mainly based on deep learning. In this review, we identify and characterize datasets used for method development and evaluation and quantify their frequency of use in the literature. We further present an overview of the current state of research regarding the segmentation and tracking of minimally invasive surgical instruments in endoscopic images and videos. The paper focuses on methods that work purely visually, without markers of any kind attached to the instruments, considering both single-frame semantic and instance segmentation approaches, as well as those that incorporate temporal information. The publications analyzed were identified through the platforms Google Scholar, Web of Science, and PubMed. The search terms used were "instrument segmentation", "instrument tracking", "surgical tool segmentation", and "surgical tool tracking", resulting in a total of 741 articles published between 01/2015 and 07/2023, of which 123 were included using systematic selection criteria. A discussion of the reviewed literature is provided, highlighting existing shortcomings and emphasizing the available potential for future developments.
PMID:38184862 | DOI:10.1016/j.compbiomed.2024.107929
Does use of GP and specialist services vary across areas and according to individual socioeconomic position? A multilevel analysis using linked data in Australia
BMJ Open. 2024 Jan 6;14(1):e074624. doi: 10.1136/bmjopen-2023-074624.
ABSTRACT
OBJECTIVE: Timely access to primary care and supporting specialist care relative to need is essential for health equity. However, use of services can vary according to an individual's socioeconomic circumstances or where they live. This study aimed to quantify individual socioeconomic variation in general practitioner (GP) and specialist use in New South Wales (NSW), accounting for area-level variation in use.
DESIGN: Outcomes were GP use and quality-of-care and specialist use. Multilevel logistic regression was used to estimate: (1) median ORs (MORs) to quantify small area variation in outcomes, which gives the median increased risk of moving to an area of higher risk of an outcome, and (2) ORs to quantify associations between outcomes and individual education level, our main exposure variable. Analyses were adjusted for individual sociodemographic and health characteristics and performed separately by remoteness categories.
SETTING: Baseline data (2006-2009) from the 45 and Up Study, NSW, Australia, linked to Medicare Benefits Schedule and death data (to December 2012).
PARTICIPANTS: 267 153 adults aged 45 years and older.
RESULTS: GP (MOR=1.32-1.35) and specialist use (1.16-1.18) varied between areas, accounting for individual characteristics. For a given level of need and accounting for area variation, low education-level individuals were more likely to be frequent users of GP services (no school certificate vs university, OR=1.63-1.91, depending on remoteness category) and have continuity of care (OR=1.14-1.24), but were less likely to see a specialist (OR=0.85-0.95).
CONCLUSION: GP and specialist use varied across small areas in NSW, independent of individual characteristics. Use of GP care was equitable, but specialist care was not. Failure to address inequitable specialist use may undermine equity gains within the primary care system. Policies should also focus on local variation.
PMID:38184309 | DOI:10.1136/bmjopen-2023-074624
A patient safety knowledge graph supporting vaccine product development
BMC Med Inform Decis Mak. 2024 Jan 4;24(1):10. doi: 10.1186/s12911-023-02409-8.
ABSTRACT
BACKGROUND: Knowledge graphs are well-suited for modeling complex, unstructured, and multi-source data and facilitating their analysis. During the COVID-19 pandemic, adverse event data were integrated into a knowledge graph to support vaccine safety surveillance and nimbly respond to urgent health authority questions. Here, we provide details of this post-marketing safety system using public data sources. In addition to challenges with varied data representations, adverse event reporting on the COVID-19 vaccines generated an unprecedented volume of data; an order of magnitude larger than adverse events for all previous vaccines. The Patient Safety Knowledge Graph (PSKG) is a robust data store to accommodate the volume of adverse event data and harmonize primary surveillance data sources.
METHODS: We designed a semantic model to represent key safety concepts. We built an extract-transform-load (ETL) data pipeline to parse and import primary public data sources; align key elements such as vaccine names; integrated the Medical Dictionary for Regulatory Activities (MedDRA); and applied quality metrics. PSKG is deployed in a Neo4J graph database, and made available via a web interface and Application Programming Interfaces (APIs).
RESULTS: We import and align adverse event data and vaccine exposure data from 250 countries on a weekly basis, producing a graph with 4,340,980 nodes and 30,544,475 edges as of July 1, 2022. PSKG is used for ad-hoc analyses and periodic reporting for several widely available COVID-19 vaccines. Analysis code using the knowledge graph is 80% shorter than an equivalent implementation written entirely in Python, and runs over 200 times faster.
CONCLUSIONS: Organizing safety data into a concise model of nodes, properties, and edge relationships has greatly simplified analysis code by removing complex parsing and transformation algorithms from individual analyses and instead managing these centrally. The adoption of the knowledge graph transformed how the team answers key scientific and medical questions. Whereas previously an analysis would involve aggregating and transforming primary datasets from scratch to answer a specific question, the team can now iterate easily and respond as quickly as requests evolve (e.g., "Produce vaccine-X safety profile for adverse event-Y by country instead of age-range").
PMID:38178113 | DOI:10.1186/s12911-023-02409-8
DisoFLAG: accurate prediction of protein intrinsic disorder and its functions using graph-based interaction protein language model
BMC Biol. 2024 Jan 2;22(1):3. doi: 10.1186/s12915-023-01803-y.
ABSTRACT
Intrinsically disordered proteins and regions (IDPs/IDRs) are functionally important proteins and regions that exist as highly dynamic conformations under natural physiological conditions. IDPs/IDRs exhibit a broad range of molecular functions, and their functions involve binding interactions with partners and remaining native structural flexibility. The rapid increase in the number of proteins in sequence databases and the diversity of disordered functions challenge existing computational methods for predicting protein intrinsic disorder and disordered functions. A disordered region interacts with different partners to perform multiple functions, and these disordered functions exhibit different dependencies and correlations. In this study, we introduce DisoFLAG, a computational method that leverages a graph-based interaction protein language model (GiPLM) for jointly predicting disorder and its multiple potential functions. GiPLM integrates protein semantic information based on pre-trained protein language models into graph-based interaction units to enhance the correlation of the semantic representation of multiple disordered functions. The DisoFLAG predictor takes amino acid sequences as the only inputs and provides predictions of intrinsic disorder and six disordered functions for proteins, including protein-binding, DNA-binding, RNA-binding, ion-binding, lipid-binding, and flexible linker. We evaluated the predictive performance of DisoFLAG following the Critical Assessment of protein Intrinsic Disorder (CAID) experiments, and the results demonstrated that DisoFLAG offers accurate and comprehensive predictions of disordered functions, extending the current coverage of computationally predicted disordered function categories. The standalone package and web server of DisoFLAG have been established to provide accurate prediction tools for intrinsic disorders and their associated functions.
PMID:38166858 | DOI:10.1186/s12915-023-01803-y
Concreteness ratings for 36,000 Estonian words
Behav Res Methods. 2023 Dec 21. doi: 10.3758/s13428-023-02257-4. Online ahead of print.
ABSTRACT
We present a collection of concreteness ratings for 35,979 words in Estonian. The data were collected via a web application from 2278 native Estonian speakers. Human ratings of concreteness have not been collected for Estonian beforehand. We compare our results to Aedmaa et al. (2018), who assigned concreteness ratings to 240,000 Estonian words by means of machine learning. We show that while these two datasets show reasonable correlation (R = 0.71), there are considerable differences in the distribution of the ratings, which we discuss in this paper. Furthermore, the results also raise questions about the importance of the type of scale used for collecting ratings. While most other datasets have been compiled based on questionnaires entailing five- or seven-point Likert scales, we used a continuous 0-10 scale. Comparing our rating distribution to those of other studies, we found that it is most similar to the distribution in Lahl et al. (Behavior Research Methods, 41(1), 13-19, 2009), who also used a 0-10 scale. Concreteness ratings for Estonian words are available at OSF .
PMID:38129738 | DOI:10.3758/s13428-023-02257-4
Postnatal growth in small vulnerable newborns: a longitudinal study of 2 million Brazilians using routine register-based linked data
Am J Clin Nutr. 2024 Feb;119(2):444-455. doi: 10.1016/j.ajcnut.2023.12.009. Epub 2023 Dec 20.
ABSTRACT
BACKGROUND: Preterm, low-birth weight (LBW) and small-for-gestational age (SGA) newborns have a higher frequency of adverse health outcomes, including linear and ponderal growth impairment.
OBJECTIVE: To describe the growth trajectories and to estimate catch-up growth during the first 5 y of life of small newborns according to 3 vulnerability phenotypes (preterm, LBW, SGA).
METHODS: Longitudinal study using linked data from the 100 Million Brazilian Cohort baseline, the Brazilian National Live Birth System (SINASC), and the Food and Nutrition Surveillance System (SISVAN) from 2011 to 2017. We estimated the length/height-for-age (L/HAZ) and weight-for-age z-score (WAZ) trajectories from children of 6-59 mo using the linear mixed model for each vulnerable newborn phenotype. Growth velocity for both L/HAZ and WAZ was calculated considering the change (Δ) in the mean z-score between 2 time points. Catch-up growth was defined as a change in z-score > 0.67 at any time during follow-up.
RESULTS: We analyzed 2,021,998 live born children and 8,726,599 observations. The prevalence of at least one of the vulnerable phenotypes was 16.7% and 0.6% were simultaneously preterm, LBW, and SGA. For those born at term, all phenotypes had a period of growth recovery from 12 mo. For preterm infants, the onset of L/HAZ growth recovery started later at 24 mo and the growth trajectories appear to be lower than those born at term, a condition aggravated among children with the 3 phenotypes. Preterm and female infants seem to experience slower growth recovery than those born at term and males. The catch-up growth occurs at 24-59 mo for males preterm: preterm + AGA + NBW (Δ = 0.80), preterm + AGA + LBW (Δ = 0.88), and preterm + SGA + LBW (Δ = 1.08); and among females: term + SGA + NBW (Δ = 0.69), term + AGA + LBW (Δ = 0.72), term + SGA + LBW (Δ = 0.77), preterm + AGA + LBW (Δ = 0.68), and preterm + SGA + LBW (Δ = 0.83).
CONCLUSIONS: Children born preterm seem to reach L/HAZ and WAZ growth trajectories lower than those attained by children born at term, a condition aggravated among the most vulnerable.
PMID:38128734 | DOI:10.1016/j.ajcnut.2023.12.009
Designing an Ontology for Emotion-driven Visual Representations
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2017 Nov;2017:1280-1283. doi: 10.1109/bibm.2017.8217844. Epub 2017 Dec 18.
ABSTRACT
Emotions influence our perceptions and decisions and are often felt more strongly in situations related to healthcare. Therefore, it is important to understand how both providers and patients express their emotions in face-to-face scenarios. An ontology is a way to represent domain concepts and the relationships between them in a polyarchical manner. We have created an ontological model called the Visualized Emotion Ontology (VEO) that expresses the semantic definitions and visualizations of 25 emotions based on published research. With VEO, we can augment patient-facing software tools, like embodied conversational agents, to improve patient-provider interaction in clinical environments.
PMID:38125740 | PMC:PMC10732717 | DOI:10.1109/bibm.2017.8217844
Ontology of Consumer Health Vocabulary: providing a formal and interoperable semantic resource for linking lay language and medical terminology
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2019 Nov;2019:1177-1178. doi: 10.1109/bibm47256.2019.8983220. Epub 2020 Feb 6.
ABSTRACT
The Consumer Health Vocabulary has been an important contribution to the health informatics field since its introduction in 2006. Many studies have utilized the vocabulary for various scientific research to bridge the gap between consumers and health experts. Given the flat file format of the Consumer Health Vocabulary dataset, we developed a SKOS-based ontology of the dataset. As an ontology, this dataset can be semantically linked to other resources to provide consumer-level meaning. In addition with this artifact, we plan to further expand the terminology.
PMID:38125584 | PMC:PMC10732699 | DOI:10.1109/bibm47256.2019.8983220
Predicting risk of suicidal behavior from insurance claims data vs. linked data from insurance claims and electronic health records
Pharmacoepidemiol Drug Saf. 2024 Jan;33(1):e5734. doi: 10.1002/pds.5734. Epub 2023 Dec 19.
ABSTRACT
PURPOSE: Observational studies assessing effects of medical products on suicidal behavior often rely on health record data to account for pre-existing risk. We assess whether high-dimensional models predicting suicide risk using data derived from insurance claims and electronic health records (EHRs) are superior to models using data from insurance claims alone.
METHODS: Data were from seven large health systems identified outpatient mental health visits by patients aged 11 or older between 1/1/2009 and 9/30/2017. Data for the 5 years prior to each visit identified potential predictors of suicidal behavior typically available from insurance claims (e.g., mental health diagnoses, procedure codes, medication dispensings) and additional potential predictors available from EHRs (self-reported race and ethnicity, responses to Patient Health Questionnaire or PHQ-9 depression questionnaires). Nonfatal self-harm events following each visit were identified from insurance claims data and fatal self-harm events were identified by linkage to state mortality records. Random forest models predicting nonfatal or fatal self-harm over 90 days following each visit were developed in a 70% random sample of visits and validated in a held-out sample of 30%. Performance of models using linked claims and EHR data was compared to models using claims data only.
RESULTS: Among 15 845 047 encounters by 1 574 612 patients, 99 098 (0.6%) were followed by a self-harm event within 90 days. Overall classification performance did not differ between the best-fitting model using all data (area under the receiver operating curve or AUC = 0.846, 95% CI 0.839-0.854) and the best-fitting model limited to data available from insurance claims (AUC = 0.846, 95% CI 0.838-0.853). Competing models showed similar classification performance across a range of cut-points and similar calibration performance across a range of risk strata. Results were similar when the sample was limited to health systems and time periods where PHQ-9 depression questionnaires were recorded more frequently.
CONCLUSION: Investigators using health record data to account for pre-existing risk in observational studies of suicidal behavior need not limit that research to databases including linked EHR data.
PMID:38112287 | PMC:PMC10843611 | DOI:10.1002/pds.5734
Deep Learning Artificial Intelligence Tool for Automated Radiographic Determination of Posterior Tibial Slope in Patients With ACL Injury
Orthop J Sports Med. 2023 Dec 15;11(12):23259671231215820. doi: 10.1177/23259671231215820. eCollection 2023 Dec.
ABSTRACT
BACKGROUND: An increased posterior tibial slope (PTS) corresponds with an increased risk of graft failure after anterior cruciate ligament (ACL) reconstruction (ACLR). Validated methods of manual PTS measurements are subject to potential interobserver variability and can be inefficient on large datasets.
PURPOSE/HYPOTHESIS: To develop a deep learning artificial intelligence technique for automated PTS measurement from standard lateral knee radiographs. It was hypothesized that this deep learning tool would be able to measure the PTS on a high volume of radiographs expeditiously and that these measurements would be similar to previously validated manual measurements.
STUDY DESIGN: Cohort study (diagnosis); Level of evidence, 2.
METHODS: A deep learning U-Net model was developed on a cohort of 300 postoperative short-leg lateral radiographs from patients who underwent ACLR to segment the tibial shaft, tibial joint surface, and tibial tuberosity. The model was trained via a random split after an 80 to 20 train-validation scheme. Masks for training images were manually segmented, and the model was trained for 400 epochs. An image processing pipeline was then deployed to annotate and measure the PTS using the predicted segmentation masks. Finally, the performance of this combined pipeline was compared with human measurements performed by 2 study personnel using a previously validated manual technique for measuring the PTS on short-leg lateral radiographs on an independent test set consisting of both pre- and postoperative images.
RESULTS: The U-Net semantic segmentation model achieved a mean Dice similarity coefficient of 0.885 on the validation cohort. The mean difference between the human-made and computer-vision measurements was 1.92° (σ = 2.81° [P = .24]). Extreme disagreements between the human and machine measurements, as defined by ≥5° differences, occurred <5% of the time. The model was incorporated into a web-based digital application front-end for demonstration purposes, which can measure a single uploaded image in Portable Network Graphics format in a mean time of 5 seconds.
CONCLUSION: We developed an efficient and reliable deep learning computer vision algorithm to automate the PTS measurement on short-leg lateral knee radiographs. This tool, which demonstrated good agreement with human annotations, represents an effective clinical adjunct for measuring the PTS as part of the preoperative assessment of patients with ACL injuries.
PMID:38107846 | PMC:PMC10725654 | DOI:10.1177/23259671231215820
Fertility treatment pathways and births for women with and without polycystic ovary syndrome-a retrospective population linked data study
Fertil Steril. 2024 Feb;121(2):314-322. doi: 10.1016/j.fertnstert.2023.11.008. Epub 2023 Dec 12.
ABSTRACT
OBJECTIVE: To study the fertility treatment pathways used by women with and without polycystic ovary syndrome (PCOS) and which pathways were more likely to result in a birth.
DESIGN: This retrospective national community-based cohort study used longitudinal self-report survey data (collected 1996-2022; aged 18-49 years) from women born in 1973-1978 who are participants in the Australian Longitudinal Study on Women's Health. The study also used linked administrative data on fertility treatments (1996-2021).
PATIENTS: Of the 8,463 eligible women, 1,109 accessed fertility treatment and were included.
EXPOSURE: Polycystic ovary syndrome diagnosis was self-reported.
MAIN OUTCOME MEASURE: use of ovulation induction (OI), intrauterine insemination, and/or in vitro fertilization (IVF) was established through linked administrative data. Births were self-reported.
RESULTS: One in 10 of the eligible participants had PCOS (783/7,987, 10%) and 1 in 4 of the women who used fertility treatment had PCOS (274/1,109, 25%). Women with PCOS were 3 years younger on average at first fertility treatment (M = 31.4 years, SD = 4.18) than women without PCOS (M = 34.2 years, SD = 4.56). Seven treatment pathways were identified and use differed by PCOS status. Women with PCOS were more likely to start with OI (71%; odds ratio [OR] 4.20, 95% confidence interval [CI]: 2.91, 6.07) than women without PCOS (36%). Of the women with PCOS who started with OI, 46% required additional types of treatment. More women without PCOS ended up in IVF (72% vs. 51%). Overall, 63% (701/1,109) had an attributed birth, and in adjusted regressions births did not vary by last type of treatment (IVF: 67%, reference; intrauterine insemination: 67%, OR 0.94 95% CI: 0.56, 1.58; OI: 61%, OR 0.71, 95% CI: 0.52, 0.98), or by PCOS status (OR 1.27, 95% CI: 0.91, 1.77). By age, 74% of women under 35 years (471/639) and 49% of women 35 years or older had a birth.
CONCLUSION: More women with PCOS used fertility treatment but births were equivalent to women without PCOS. Most women followed clinical recommendations. Births did not differ between pathways, so there was no disadvantage in starting with less invasive treatments (although there may be financial or emotional disadvantages).
PMID:38099868 | DOI:10.1016/j.fertnstert.2023.11.008
Predicting the cause of seizures using features extracted from interactions with a virtual agent
Seizure. 2023 Dec 7;114:84-89. doi: 10.1016/j.seizure.2023.11.022. Online ahead of print.
ABSTRACT
OBJECTIVE: A clinical decision tool for Transient Loss of Consciousness (TLOC) could reduce currently high misdiagnosis rates and waiting times for specialist assessments. Most clinical decision tools based on patient-reported symptom inventories only distinguish between two of the three most common causes of TLOC (epilepsy, functional /dissociative seizures, and syncope) or struggle with the particularly challenging differentiation between epilepsy and FDS. Based on previous research describing differences in spoken accounts of epileptic seizures and FDS seizures, this study explored the feasibility of predicting the cause of TLOC by combining the automated analysis of patient-reported symptoms and spoken TLOC descriptions.
METHOD: Participants completed an online web application that consisted of a 34-item medical history and symptom questionnaire (iPEP) and spoken interaction with a virtual agent (VA) that asked eight questions about the most recent experience of TLOC. Support Vector Machines (SVM) were trained using different combinations of features and nested leave-one-out cross validation. The iPEP provided a baseline performance. Inspired by previous qualitative research three spoken language based feature sets were designed to assess: (1) formulation effort, (2) the proportion of words from different semantic categories, and (3) verb, adverb, and adjective usage.
RESULTS: 76 participants completed the application (Epilepsy = 24, FDS = 36, syncope = 16). Only 61 participants also completed the VA interaction (Epilepsy = 20, FDS = 29, syncope = 12). The iPEP model accurately predicted 65.8 % of all diagnoses, but the inclusion of the language features increased the accuracy to 85.5 % by improving the differential diagnosis between epilepsy and FDS.
CONCLUSION: These findings suggest that an automated analysis of TLOC descriptions collected using an online web application and VA could improve the accuracy of current clinical decisions tools for TLOC and facilitate clinical stratification processes (such as ensuring appropriate referral to cardiological versus neurological investigation and management pathways).
PMID:38091849 | DOI:10.1016/j.seizure.2023.11.022
Transformation and Articulation of Clinical Data to Understand Students' and Health Professionals' Clinical Reasoning: Protocol for a Scoping Review
JMIR Res Protoc. 2023 Dec 13;12:e50797. doi: 10.2196/50797.
ABSTRACT
BACKGROUND: There are still unanswered questions regarding effective educational strategies to promote the transformation and articulation of clinical data while teaching and learning clinical reasoning. Additionally, understanding how this process can be analyzed and assessed is crucial, particularly considering the rapid growth of natural language processing in artificial intelligence.
OBJECTIVE: The aim of this study is to map educational strategies to promote the transformation and articulation of clinical data among students and health care professionals and to explore the methods used to assess these individuals' transformation and articulation of clinical data.
METHODS: This scoping review follows the Joanna Briggs Institute framework for scoping reviews and the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews) checklist for the analysis. A literature search was performed in November 2022 using 5 databases: CINAHL (EBSCOhost), MEDLINE (Ovid), Embase (Ovid), PsycINFO (Ovid), and Web of Science (Clarivate). The protocol was registered on the Open Science Framework in November 2023. The scoping review will follow the 9-step framework proposed by Peters and colleagues of the Joanna Briggs Institute. A data extraction form has been developed using key themes from the research questions.
RESULTS: After removing duplicates, the initial search yielded 6656 results, and study selection is underway. The extracted data will be qualitatively analyzed and presented in a diagrammatic or tabular form alongside a narrative summary. The review will be completed by February 2024.
CONCLUSIONS: By synthesizing the evidence on semantic transformation and articulation of clinical data during clinical reasoning education, this review aims to contribute to the refinement of educational strategies and assessment methods used in academic and continuing education programs. The insights gained from this review will help educators develop more effective semantic approaches for teaching or learning clinical reasoning, as opposed to fragmented, purely symptom-based or probabilistic approaches. Besides, the results may suggest some ways to address challenges related to the assessment of clinical reasoning and ensure that the assessment tasks accurately reflect learners' developing competencies and educational progress.
INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): DERR1-10.2196/50797.
PMID:38090795 | DOI:10.2196/50797