Drug-induced Adverse Events

Classification of clinically useful sentences in clinical evidence resources.
Classification of clinically useful sentences in clinical evidence resources.
J Biomed Inform. 2016 Apr;60:14-22
Authors: Morid MA, Fiszman M, Raja K, Jonnalagadda SR, Del Fiol G
Abstract
UNLABELLED: Most patient care questions raised by clinicians can be answered by online clinical knowledge resources. However, important barriers still challenge the use of these resources at the point of care.
OBJECTIVE: To design and assess a method for extracting clinically useful sentences from synthesized online clinical resources that represent the most clinically useful information for directly answering clinicians' information needs.
MATERIALS AND METHODS: We developed a Kernel-based Bayesian Network classification model based on different domain-specific feature types extracted from sentences in a gold standard composed of 18 UpToDate documents. These features included UMLS concepts and their semantic groups, semantic predications extracted by SemRep, patient population identified by a pattern-based natural language processing (NLP) algorithm, and cue words extracted by a feature selection technique. Algorithm performance was measured in terms of precision, recall, and F-measure.
RESULTS: The feature-rich approach yielded an F-measure of 74% versus 37% for a feature co-occurrence method (p<0.001). Excluding predication, population, semantic concept or text-based features reduced the F-measure to 62%, 66%, 58% and 69% respectively (p<0.01). The classifier applied to Medline sentences reached an F-measure of 73%, which is equivalent to the performance of the classifier on UpToDate sentences (p=0.62).
CONCLUSIONS: The feature-rich approach significantly outperformed general baseline methods. This approach significantly outperformed classifiers based on a single type of feature. Different types of semantic features provided a unique contribution to overall classification performance. The classifier's model and features used for UpToDate generalized well to Medline abstracts.
PMID: 26774763 [PubMed - indexed for MEDLINE]
Investigation of the cutaneous penetration behavior of dexamethasone loaded to nano-sized lipid particles by EPR spectroscopy, and confocal Raman and laser scanning microscopy.
Investigation of the cutaneous penetration behavior of dexamethasone loaded to nano-sized lipid particles by EPR spectroscopy, and confocal Raman and laser scanning microscopy.
Eur J Pharm Biopharm. 2016 Dec 30;:
Authors: Lohan SB, Saeidpour S, Solik A, Schanzer S, Richter H, Dong P, Darvin ME, Bodmeier R, Patzelt A, Zoubari G, Unbehauen M, Haag R, Lademann J, Teutloff C, Bittl R, Meinke MC
Abstract
An improvement of the penetration efficiency combined with the controlled release of actives in the skin can facilitate the medical treatment of skin diseases immensely. Dexamethasone (Dx), a synthetic glucocorticoid, is frequently used for the treatment of inflammatory skin diseases. To investigate the penetration of nano-sized lipid particles (NLP) loaded with Dx in comparison to a commercially available base cream, different techniques were applied. Electron paramagnetic resonance (EPR) spectroscopy was used to monitor the penetration of Dx, which was covalently labeled with the spin probe 3-(Carboxy)-2,2,5,5-tetramethyl-1-pyrrolidinyloxy (PCA). The penetration into hair follicles was studied using confocal laser scanning microscopy (CLSM) with curcumin-loaded NLP. The penetration of the vehicle was followed by confocal Raman microscopy (CRM). Penetration studies using excised porcine skin revealed a more than twofold higher penetration efficiency for DxPCA into the stratum corneum (SC) after 24 h incubation compared to 4 h incubation when loaded to the NLP, whereas when applied in the base cream, almost no further penetration was observed beyond 4 h. The distribution of DxPCA within the SC was investigated by consecutive tape stripping. The release of DxPCA from the base cream after 24 h in deeper SC layers and the viable epidermis was shown by EPR. For NLP, no release from the carrier was observed, although DxPCA was detectable in the skin after the complete SC was removed. This phenomenon can be explained by the penetration of the NLP into the hair follicles. However, penetration profiles measured by CRM indicate that NLP did not penetrate as deep into the SC as the base cream formulation. In conclusion, NLP can improve the accumulation of Dx in the skin and provide a reservoir within the SC and in the follicular infundibula.
PMID: 28043865 [PubMed - as supplied by publisher]
Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification.
Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification.
Biomed Res Int. 2016;2016:6598307
Authors: Wang Y, Li R, Zhou Y, Ling Z, Guo X, Xie L, Liu L
Abstract
BACKGROUND: Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty.
RESULTS: Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods.
CONCLUSIONS: We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.
PMID: 27057545 [PubMed - indexed for MEDLINE]
Enhancing Risk Assessment in Patients Receiving Chronic Opioid Analgesic Therapy Using Natural Language Processing.
Enhancing Risk Assessment in Patients Receiving Chronic Opioid Analgesic Therapy Using Natural Language Processing.
Pain Med. 2016 Dec 29;:
Authors: Haller IV, Renier CM, Juusola M, Hitz P, Steffen W, Asmus MJ, Craig T, Mardekian J, Masters ET, Elliott TE
Abstract
OBJECTIVES: Clinical guidelines for the use of opioids in chronic noncancer pain recommend assessing risk for aberrant drug-related behaviors prior to initiating opioid therapy. Despite recent dramatic increases in prescription opioid misuse and abuse, use of screening tools by clinicians continues to be underutilized. This research evaluated natural language processing (NLP) together with other data extraction techniques for risk assessment of patients considered for opioid therapy as a means of predicting opioid abuse.
DESIGN: Using a retrospective cohort of 3,668 chronic noncancer pain patients with at least one opioid agreement between January 1, 2007, and December 31, 2012, we examined the availability of electronic health record structured and unstructured data to populate the Opioid Risk Tool (ORT) and other selected outcomes. Clinician-documented opioid agreement violations in the clinical notes were determined using NLP techniques followed by manual review of the notes.
RESULTS: Confirmed through manual review, the NLP algorithm had 96.1% sensitivity, 92.8% specificity, and 92.6% positive predictive value in identifying opioid agreement violation. At the time of most recent opioid agreement, automated ORT identified 42.8% of patients as at low risk, 28.2% as at moderate risk, and 29.0% as at high risk for opioid abuse. During a year following the agreement, 22.5% of patients had opioid agreement violations. Patients classified as high risk were three times more likely to violate opioid agreements compared with those with low/moderate risk. CONCLUSION : Our findings suggest that NLP techniques have potential utility to support clinicians in screening chronic noncancer pain patients considered for long-term opioid therapy.
PMID: 28034982 [PubMed - as supplied by publisher]
Temporal data representation, normalization, extraction, and reasoning: A review from clinical domain.
Temporal data representation, normalization, extraction, and reasoning: A review from clinical domain.
Comput Methods Programs Biomed. 2016 May;128:52-68
Authors: Madkour M, Benhaddou D, Tao C
Abstract
BACKGROUND AND OBJECTIVE: We live our lives by the calendar and the clock, but time is also an abstraction, even an illusion. The sense of time can be both domain-specific and complex, and is often left implicit, requiring significant domain knowledge to accurately recognize and harness. In the clinical domain, the momentum gained from recent advances in infrastructure and governance practices has enabled the collection of tremendous amount of data at each moment in time. Electronic health records (EHRs) have paved the way to making these data available for practitioners and researchers. However, temporal data representation, normalization, extraction and reasoning are very important in order to mine such massive data and therefore for constructing the clinical timeline. The objective of this work is to provide an overview of the problem of constructing a timeline at the clinical point of care and to summarize the state-of-the-art in processing temporal information of clinical narratives.
METHODS: This review surveys the methods used in three important area: modeling and representing of time, medical NLP methods for extracting time, and methods of time reasoning and processing. The review emphasis on the current existing gap between present methods and the semantic web technologies and catch up with the possible combinations.
RESULTS: The main findings of this review are revealing the importance of time processing not only in constructing timelines and clinical decision support systems but also as a vital component of EHR data models and operations.
CONCLUSIONS: Extracting temporal information in clinical narratives is a challenging task. The inclusion of ontologies and semantic web will lead to better assessment of the annotation task and, together with medical NLP techniques, will help resolving granularity and co-reference resolution problems.
PMID: 27040831 [PubMed - indexed for MEDLINE]
Blending water- and nutrient-source wastewaters for cost-effective cultivation of high lipid content microalgal species Micractinium inermum NLP-F014.
Blending water- and nutrient-source wastewaters for cost-effective cultivation of high lipid content microalgal species Micractinium inermum NLP-F014.
Bioresour Technol. 2015 Dec;198:388-94
Authors: Park S, Kim J, Yoon Y, Park Y, Lee T
Abstract
The possibility of utilizing blended wastewaters from different streams was investigated for cost-efficient microalgal cultivation. The influent of a domestic wastewater treatment plant and the liquid fertilizer from a swine wastewater treatment plant were selected as water- and nutrient-source wastewaters, respectively. The growth of Micractinium inermum NLP-F014 in the blended wastewater medium without any pretreatment was comparable to that in Bold's Basal Medium. The optimum blending ratio of 5-15% (vv(-1)) facilitated biomass production up to 5.7 g-dry cell weight (DCW) L(-1), and the maximum biomass productivity (1.03 g-DCWL(-1)d(-1)) was achieved after three days of cultivation. Nutrient depletion induced lipid accumulation in the cell up to 39.1% (ww(-1)) and the maximum lipid productivity was 0.19 g-FAMEL(-1)d(-1). These results suggest that blending water- and nutrient-source wastewaters at a proper ratio without pretreatment can significantly cut costs in microalgae cultivation for biodiesel production.
PMID: 26409109 [PubMed - indexed for MEDLINE]
Public Understanding of Science in turbulent times III: Deficit to dialogue, champions to critics.
Public Understanding of Science in turbulent times III: Deficit to dialogue, champions to critics.
Public Underst Sci. 2016 Feb;25(2):186-97
Authors: Smallman M
Abstract
As part of the 20th Anniversary of the Public Understanding of Science journal, the journal has been reflecting on how the field and journal have developed. This research note takes a closer look at some of the trends, considering the journal's 50 most cited papers and using IRaMuTeQ, an open-source computer text analysis technique. The research note presents data that show that the move within public engagement from deficit to dialogue has been followed by a further shift from championing dialogue to criticising its practice. This shift has taken place alongside a continued, but changing, interest in media coverage, surveys and models of public understanding.
PMID: 25234052 [PubMed - indexed for MEDLINE]
Detection of clinically important colorectal surgical site infection using Bayesian network.
Detection of clinically important colorectal surgical site infection using Bayesian network.
J Surg Res. 2016 Oct 05;209:168-173
Authors: Sohn S, Larson DW, Habermann EB, Naessens JM, Alabbad JY, Liu H
Abstract
BACKGROUND: Despite extensive efforts to monitor and prevent surgical site infections (SSIs), real-time surveillance of clinical practice has been sparse and expensive or nonexistent. However, natural language processing (NLP) and machine learning (i.e., Bayesian network analysis) may provide the methodology necessary to approach this issue in a new way. We investigated the ability to identify SSIs after colorectal surgery (CRS) through an automated detection system using a Bayesian network.
MATERIALS AND METHODS: Patients who underwent CRS from 2010 to 2012 and were captured in our institutional American College of Surgeons National Surgical Quality Improvement Program (ACS-NSQIP) comprised our cohort. A Bayesian network was applied to detect SSIs using risk factors from ACS-NSQIP data and keywords extracted from clinical notes by NLP. Two surgeons provided expertise informing the Bayesian network to identify clinically meaningful SSIs (CM-SSIs) occurring within 30 d after surgery.
RESULTS: We used data from 751 CRS cases experiencing 67 (8.9%) SSIs and 78 (10.4%) CM-SSIs. Our Bayesian network detected ACS-NSQIP-captured SSIs with a receiver operating characteristic area under the curve of 0.827, but this value increased to 0.892 when using surgeon-identified CM-SSIs.
CONCLUSIONS: A Bayesian network coupled with NLP has the potential to be used in real-time SSI surveillance. Moreover, surgeons identified CM-SSI not captured under current NSQIP definitions. Future efforts to expand CM-SSI identification may lead to improved and potentially automated approaches to survey for postoperative SSI in clinical practice.
PMID: 28032554 [PubMed - as supplied by publisher]
Seqenv: linking sequences to environments through text mining.
Seqenv: linking sequences to environments through text mining.
PeerJ. 2016;4:e2690
Authors: Sinclair L, Ijaz UZ, Jensen LJ, Coolen MJ, Gubry-Rangin C, Chroňáková A, Oulas A, Pavloudi C, Schnetzer J, Weimann A, Ijaz A, Eiler A, Quince C, Pafilis E
Abstract
Understanding the distribution of taxa and associated traits across different environments is one of the central questions in microbial ecology. High-throughput sequencing (HTS) studies are presently generating huge volumes of data to address this biogeographical topic. However, these studies are often focused on specific environment types or processes leading to the production of individual, unconnected datasets. The large amounts of legacy sequence data with associated metadata that exist can be harnessed to better place the genetic information found in these surveys into a wider environmental context. Here we introduce a software program, seqenv, to carry out precisely such a task. It automatically performs similarity searches of short sequences against the "nt" nucleotide database provided by NCBI and, out of every hit, extracts-if it is available-the textual metadata field. After collecting all the isolation sources from all the search results, we run a text mining algorithm to identify and parse words that are associated with the Environmental Ontology (EnvO) controlled vocabulary. This, in turn, enables us to determine both in which environments individual sequences or taxa have previously been observed and, by weighted summation of those results, to summarize complete samples. We present two demonstrative applications of seqenv to a survey of ammonia oxidizing archaea as well as to a plankton paleome dataset from the Black Sea. These demonstrate the ability of the tool to reveal novel patterns in HTS and its utility in the fields of environmental source tracking, paleontology, and studies of microbial biogeography. To install seqenv, go to: https://github.com/xapple/seqenv.
PMID: 28028456 [PubMed]
Machine learning to assist risk-of-bias assessments in systematic reviews.
Machine learning to assist risk-of-bias assessments in systematic reviews.
Int J Epidemiol. 2016 Feb;45(1):266-77
Authors: Millard LA, Flach PA, Higgins JP
Abstract
BACKGROUND: Risk-of-bias assessments are now a standard component of systematic reviews. At present, reviewers need to manually identify relevant parts of research articles for a set of methodological elements that affect the risk of bias, in order to make a risk-of-bias judgement for each of these elements. We investigate the use of text mining methods to automate risk-of-bias assessments in systematic reviews. We aim to identify relevant sentences within the text of included articles, to rank articles by risk of bias and to reduce the number of risk-of-bias assessments that the reviewers need to perform by hand.
METHODS: We use supervised machine learning to train two types of models, for each of the three risk-of-bias properties of sequence generation, allocation concealment and blinding. The first model predicts whether a sentence in a research article contains relevant information. The second model predicts a risk-of-bias value for each research article. We use logistic regression, where each independent variable is the frequency of a word in a sentence or article, respectively.
RESULTS: We found that sentences can be successfully ranked by relevance with area under the receiver operating characteristic (ROC) curve (AUC) > 0.98. Articles can be ranked by risk of bias with AUC > 0.72. We estimate that more than 33% of articles can be assessed by just one reviewer, where two reviewers are normally required.
CONCLUSIONS: We show that text mining can be used to assist risk-of-bias assessments.
PMID: 26659355 [PubMed - indexed for MEDLINE]
Scaling-up NLP Pipelines to Process Large Corpora of Clinical Notes.
Scaling-up NLP Pipelines to Process Large Corpora of Clinical Notes.
Methods Inf Med. 2015;54(6):548-52
Authors: Divita G, Carter M, Redd A, Zeng Q, Gupta K, Trautner B, Samore M, Gundlapalli A
Abstract
INTRODUCTION: This article is part of the Focus Theme of Methods of Information in Medicine on "Big Data and Analytics in Healthcare".
OBJECTIVES: This paper describes the scale-up efforts at the VA Salt Lake City Health Care System to address processing large corpora of clinical notes through a natural language processing (NLP) pipeline. The use case described is a current project focused on detecting the presence of an indwelling urinary catheter in hospitalized patients and subsequent catheter-associated urinary tract infections.
METHODS: An NLP algorithm using v3NLP was developed to detect the presence of an indwelling urinary catheter in hospitalized patients. The algorithm was tested on a small corpus of notes on patients for whom the presence or absence of a catheter was already known (reference standard). In planning for a scale-up, we estimated that the original algorithm would have taken 2.4 days to run on a larger corpus of notes for this project (550,000 notes), and 27 days for a corpus of 6 million records representative of a national sample of notes. We approached scaling-up NLP pipelines through three techniques: pipeline replication via multi-threading, intra-annotator threading for tasks that can be further decomposed, and remote annotator services which enable annotator scale-out.
RESULTS: The scale-up resulted in reducing the average time to process a record from 206 milliseconds to 17 milliseconds or a 12- fold increase in performance when applied to a corpus of 550,000 notes.
CONCLUSIONS: Purposely simplistic in nature, these scale-up efforts are the straight forward evolution from small scale NLP processing to larger scale extraction without incurring associated complexities that are inherited by the use of the underlying UIMA framework. These efforts represent generalizable and widely applicable techniques that will aid other computationally complex NLP pipelines that are of need to be scaled out for processing and analyzing big data.
PMID: 26534722 [PubMed - indexed for MEDLINE]
Understanding the Relationship between Social Cognition and Word Difficulty. A Language Based Analysis of Individuals with Autism Spectrum Disorder.
Understanding the Relationship between Social Cognition and Word Difficulty. A Language Based Analysis of Individuals with Autism Spectrum Disorder.
Methods Inf Med. 2015;54(6):522-9
Authors: Aramaki E, Shikata S, Miyabe M, Usuda Y, Asada K, Ayaya S, Kumagaya S
Abstract
BACKGROUND: Few quantitative studies have been conducted on the relationship between society and its languages. Individuals with autistic spectrum disorder (ASD) are known to experience social hardships, and a wide range of clinical information about their quality of life has been provided through numerous narrative analyses. However, the narratives of ASD patients have thus far been examined mainly through qualitative approaches.
OBJECTIVES: In this study, we analyzed adults with ASD to quantitatively examine the relationship between language abilities and ASD severity scores.
METHODS: We generated phonetic transcriptions of speeches by 16 ASD adults at an ASD workshop, and divided the participants into 2 groups according to their Social Responsiveness Scale(TM), 2nd Edition (SRS(TM)-2) scores (where higher scores represent more severe ASD): Group A comprised high-scoring ASD adults (SRS(TM)-2 score: ≥ 76) and Group B comprised low- and intermediate-scoring ASD adults (SRS(TM)-2 score: < 76). Using natural language processing (NLP)-based analytical methods, the narratives were converted into numerical data according to four language ability indicators, and the relationships between the language ability scores and ASD severity scores were compared.
RESULTS AND DISCUSSION: Group A showed a marginally negative correlation with the level of Japanese word difficulty (p < .10), while the "social cognition" subscale of the SRS(TM)-2 score showed a significantly negative correlation (p < .05) with word difficulty. When comparing only male participants, Group A demonstrated a significantly lower correlation with word difficulty level than Group B (p < .10).
CONCLUSION: Social communication was found to be strongly associated with the level of word difficulty in speech. The clinical applications of these findings may be available in the near future, and there is a need for further detailed study on language metrics designed for ASD adults.
PMID: 26391807 [PubMed - indexed for MEDLINE]
Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges.
Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges.
Database (Oxford). 2016;2016:
Authors: Singhal A, Leaman R, Catlett N, Lemberger T, McEntyre J, Polson S, Xenarios I, Arighi C, Lu Z
Abstract
Text mining in the biomedical sciences is rapidly transitioning from small-scale evaluation to large-scale application. In this article, we argue that text-mining technologies have become essential tools in real-world biomedical research. We describe four large scale applications of text mining, as showcased during a recent panel discussion at the BioCreative V Challenge Workshop. We draw on these applications as case studies to characterize common requirements for successfully applying text-mining techniques to practical biocuration needs. We note that system 'accuracy' remains a challenge and identify several additional common difficulties and potential research directions including (i) the 'scalability' issue due to the increasing need of mining information from millions of full-text articles, (ii) the 'interoperability' issue of integrating various text-mining systems into existing curation workflows and (iii) the 'reusability' issue on the difficulty of applying trained systems to text genres that are not seen previously during development. We then describe related efforts within the text-mining community, with a special focus on the BioCreative series of challenge workshops. We believe that focusing on the near-term challenges identified in this work will amplify the opportunities afforded by the continued adoption of text-mining tools. Finally, in order to sustain the curation ecosystem and have text-mining systems adopted for practical benefits, we call for increased collaboration between text-mining researchers and various stakeholders, including researchers, publishers and biocurators.
PMID: 28025348 [PubMed - in process]
Direct transcriptional activation of BT genes by NLP transcription factors is a key component of the nitrate response in Arabidopsis.
Direct transcriptional activation of BT genes by NLP transcription factors is a key component of the nitrate response in Arabidopsis.
Biochem Biophys Res Commun. 2016 Dec 23;:
Authors: Sato T, Maekawa S, Konishi M, Yoshioka N, Sasaki Y, Maeda H, Ishida T, Kato Y, Yamaguchi J, Yanagisawa S
Abstract
Nitrate modulates growth and development, functioning as a nutrient signal in plants. Although many changes in physiological processes in response to nitrate have been well characterized as nitrate responses, the molecular mechanisms underlying the nitrate response are not yet fully understood. Here, we show that NLP transcription factors, which are key regulators of the nitrate response, directly activate the nitrate-inducible expression of BT1 and BT2 encoding putative scaffold proteins with a plant-specific domain structure in Arabidopsis. Interestingly, the 35S promoter-driven expression of BT2 partially rescued growth inhibition caused by reductions in NLP activity in Arabidopsis. Furthermore, simultaneous disruption of BT1 and BT2 affected nitrate-dependent lateral root development. These results suggest that direct activation of BT1 and BT2 by NLP transcriptional activators is a key component of the molecular mechanism underlying the nitrate response in Arabidopsis.
PMID: 28025145 [PubMed - as supplied by publisher]
[Pharmacists' Behavior in Clinical Practice: Results from a Questionnaire Survey of Pharmacy Students].
[Pharmacists' Behavior in Clinical Practice: Results from a Questionnaire Survey of Pharmacy Students].
Yakugaku Zasshi. 2016;136(2):351-8
Authors: Nakada A, Akagawa K, Yamamoto H, Kato Y, Yamamoto T
Abstract
A questionnaire survey was performed to obtain pharmacy students' impressions of pharmacists' behavior, to classify these based on professionalism, and to analyze the relationship between these experiences and students' satisfaction with their clinical practice in Japan. The questionnaire was answered by 327 5th-year pharmacy school students upon completing clinical practice at community pharmacies from 2011 to 2012. They rated their satisfaction with their clinical practice using a 6-point Likert scale, and provided descriptions of their experience such as, "This health provider is professional", or "What a great person he/she is as a health provider". We counted the words and then categorized the responses into 10 traits, as defined by the American Pharmaceutical Association Academy of Students of Pharmacy-American Association of Colleges of Pharmacy, Council of Deans Task Force on Professionalism 1999, using text mining. We analyzed the relationship between their experiences with respectful persons, and satisfaction, using the Mann-Whitney U-test (significance level<0.05). Most students (337 of 364, 92.6%) reported experiences with respectful health providers. These students experienced significantly more satisfaction than did other students (p<0.001). We analyzed 343 sentences written by 261 students, using text mining analysis after excluding unsuitable responses. The word most used was "patient" (121 times). Many students noted their impression that the pharmacists had answered patients' questions. Of the 10 trait categories, "professional knowledge and skills" was mentioned most often (151 students).
PMID: 26831812 [PubMed - indexed for MEDLINE]
cBinderDB: a covalent binding agent database.
cBinderDB: a covalent binding agent database.
Bioinformatics. 2016 Dec 23;:
Authors: Du J, Yan X, Liu Z, Cui L, Ding P, Tan X, Li X, Zhou H, Gu Q, Xu J
Abstract
MOTIVATION: Small molecule drug candidates with attractive toxicity profiles that modulate target proteins through noncovalent interactions are usually favored by scientists and pharmaceutical industry. In the past decades, many noncovalent binding agents have been developed for different diseases. However, an increasing attention has been paid to covalent binding agents in pharmaceutical fields during recent years. Many covalent binding agents entered clinical trials and exerted significant advantages for diseases such as infection, cancers, gastrointestinal disorders, central nervous system or cardiovascular diseases. It has been recognized that covalent binding ligands can be attractive drug candidates. But, there is lack of resource to support covalent ligand discovery.
RESULTS: Hence, we initiated a covalent binder database (cBinderDB). To our best knowledge, it is the first online database that provides information on covalent binding compound structures, chemotypes, targets, covalent binding types, and other biological properties. The covalent binding targets are annotated with biological functions, protein family and domains, gene information, modulators, and receptor-ligand complex structure. The data in the database were collected from scientific publications by combining a text mining method and manual inspection processes. cBinderDB covers covalent binder's data up to September 2016.
AVAILABILITY AND IMPLEMENTATION: cBinderDB is freely available at www.rcdd.org.cn/cbinderdb/ CONTACT: junxu@biochemomes.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
PMID: 28011781 [PubMed - as supplied by publisher]
Disease and pharmacologic risk factors for first and subsequent episodes of equine laminitis: A cohort study of free-text electronic medical records.
Disease and pharmacologic risk factors for first and subsequent episodes of equine laminitis: A cohort study of free-text electronic medical records.
Prev Vet Med. 2017 Jan 01;136:11-18
Authors: Welsh CE, Duz M, Parkin TD, Marshall JF
Abstract
Electronic medical records from first opinion equine veterinary practice may represent a unique resource for epidemiologic research. The appropriateness of this resource for risk factor analyses was explored as part of an investigation into clinical and pharmacologic risk factors for laminitis. Amalgamated medical records from seven UK practices were subjected to text mining to identify laminitis episodes, systemic or intra-synovial corticosteroid prescription, diseases known to affect laminitis risk and clinical signs or syndromes likely to lead to corticosteroid use. Cox proportional hazard models and Prentice, Williams, Peterson models for repeated events were used to estimate associations with time to first, or subsequent laminitis episodes, respectively. Over seventy percent of horses that were diagnosed with laminitis suffered at least one recurrence. Risk factors for first and subsequent laminitis episodes were found to vary. Corticosteroid use (prednisolone only) was only significantly associated with subsequent, and not initial laminitis episodes. Electronic medical record use for such analyses is plausible and offers important advantages over more traditional data sources. It does, however, pose challenges and limitations that must be taken into account, and requires a conceptual change to disease diagnosis which should be considered carefully.
PMID: 28010903 [PubMed - in process]
Impact of late presentation of HIV infection on short-, mid- and long-term mortality and causes of death in a multicenter national cohort: 2004-2013.
Impact of late presentation of HIV infection on short-, mid- and long-term mortality and causes of death in a multicenter national cohort: 2004-2013.
J Infect. 2016 May;72(5):587-96
Authors: Sobrino-Vegas P, Moreno S, Rubio R, Viciana P, Bernardino JI, Blanco JR, Bernal E, Asensi V, Pulido F, del Amo J, Hernando V, Cohorte de la Red de Investigación en Sida, Spain
Abstract
OBJECTIVES: To analyze the impact of late presentation (LP) on overall mortality and causes of death and describe LP trends and risk factors (2004-2013).
METHODS: Cox models and logistic regression were used to analyze data from a nation-wide cohort in Spain. LP is defined as being diagnosed when CD4 < 350 cells/ml or AIDS.
RESULTS: Of 7165 new HIV diagnoses, 46.9% (CI95%:45.7-48.0) were LP, 240 patients died. First-year mortality was the highest (aHRLP.vs.nLP = 10.3[CI95%:5.5-19.3]); between 1 and 4 years post-diagnosis, aHRLP.vs.nLP = 1.9(1.2-3.0); and >4 years, aHRLP.vs.nLP = 1.5(0.7-3.1). First-year's main cause of death was HIV/AIDS (73%); and malignancies among those surviving >4 years (32%). HIV/AIDS-related deaths were more likely in LP (59.2% vs. 25.0%; p < 0.001). LP declined from 55.9% (2004-05) to 39.4% (2012-13), and reduced in 46.1% in men who have sex with men (MSM) and 37.6% in heterosexual men, but increased in 22.6% in heterosexual women. Factors associated with LP: sex (ORMEN.vs.WOMEN = 1.4[1.2-1.7]); age (OR31-40.vs.<30 = 1.6[1.4-1.8], OR41-50.vs.<30 = 2.2[1.8-2.6], OR>50.vs.<30 = 3.6[2.9-4.4]); behavior (ORInjectedDrugUse.vs.MSM = 2.8[2.0-3.8]; ORHeterosexual.vs.MSM = 2.2[1.7-3.0]); education (ORPrimaryEducation.vs.University = 1.5[1.1-2.0], ORLowerSecondary.vs.University = 1.3[1.1-1.5]); and geographical origin (ORSub-Saharan.vs.Spain = 1.6[1.3-2.0], ORLatin-American.vs.Spain = 1.4[1.2-1.8]).
CONCLUSIONS: LP is associated with higher mortality, especially short-term- and HIV/AIDS-related mortality. Mid-term-, but not long-term mortality, remained also higher in LP than nLP. LP decreased in MSM and heterosexual men, not in heterosexual women. The groups most affected by LP are low educated, non-Spanish and heterosexual women.
PMID: 26920789 [PubMed - indexed for MEDLINE]
FLP-4 neuropeptide and its receptor in a neuronal circuit regulate preference choice through functions of ASH-2 trithorax complex in Caenorhabditis elegans.
FLP-4 neuropeptide and its receptor in a neuronal circuit regulate preference choice through functions of ASH-2 trithorax complex in Caenorhabditis elegans.
Sci Rep. 2016 Feb 18;6:21485
Authors: Yu Y, Zhi L, Guan X, Wang D, Wang D
Abstract
Preference choice on food is an important response strategy for animals living in the environment. Using assay system of preference choice on bacterial foods, OP50 and PA14, we identified the involvement of ADL sensory neurons in the control of preference choice in Caenorhabditis elegans. Both genetically silencing and ChR2-mediated activation of ADL sensory neurons significantly affected preference choice. ADL regulated preference choice by inhibiting function of G protein-coupled receptor (GPCR)/SRH-220. ADL sensory neurons might regulate preference choice through peptidergic signals of FLP-4 and NLP-10, and function of FLP-4 or NLP-10 in regulating preference choice was regulated by SRH-220. FLP-4 released from ADL sensory neurons further regulated preference choice through its receptor of NPR-4 in AIB interneurons. In AIB interneurons, NPR-4 was involved in the control of preference choice by activating the functions of ASH-2 trithorax complex consisting of SET-2, ASH-2, and WDR-5, implying the crucial role of molecular machinery of trimethylation of histone H3K4 in the preference choice control. The identified novel neuronal circuit and the underlying molecular mechanisms will strengthen our understanding neuronal basis of preference choice in animals.
PMID: 26887501 [PubMed - indexed for MEDLINE]
Building an Evaluation Scale using Item Response Theory.
Building an Evaluation Scale using Item Response Theory.
Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:648-657
Authors: Lalor JP, Wu H, Yu H
Abstract
Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1). The current assumption is that all items in a given test set are equal with regards to difficulty and discriminating power. We propose Item Response Theory (IRT) from psychometrics as an alternative means for gold-standard test-set generation and NLP system evaluation. IRT is able to describe characteristics of individual items - their difficulty and discriminating power - and can account for these characteristics in its estimation of human intelligence or ability for an NLP task. In this paper, we demonstrate IRT by generating a gold-standard test set for Recognizing Textual Entailment. By collecting a large number of human responses and fitting our IRT model, we show that our IRT model compares NLP systems with the performance in a human population and is able to provide more insight into system performance than standard evaluation metrics. We show that a high accuracy score does not always imply a high IRT score, which depends on the item characteristics and the response pattern.
PMID: 28004039 [PubMed]