NIH Extramural Nexus News
Predicting genome-wide tissue-specific enhancers via combinatorial transcription factor genomic occupancy analysis
FEBS Lett. 2024 Oct 4. doi: 10.1002/1873-3468.15030. Online ahead of print.
ABSTRACT
Enhancers are non-coding cis-regulatory elements crucial for transcriptional regulation. Mutations in enhancers can disrupt gene regulation, leading to disease phenotypes. Identifying enhancers and their tissue-specific activity is challenging due to their lack of stereotyped sequences. This study presents a sequence-based computational model that uses combinatorial transcription factor (TF) genomic occupancy to predict tissue-specific enhancers. Trained on diverse datasets, including ENCODE and Vista enhancer browser data, the model predicted 25 000 forebrain-specific cis-regulatory modules (CRMs) in the human genome. Validation using biochemical features, disease-associated SNPs, and in vivo zebrafish analysis confirmed its effectiveness. This model aids in predicting enhancers lacking well-characterized chromatin features, complementing experimental approaches in tissue-specific enhancer discovery.
PMID:39367524 | DOI:10.1002/1873-3468.15030
Mechanisms of transcriptional regulation in <em>Anopheles gambiae</em> revealed by allele-specific expression
Proc Biol Sci. 2024 Sep;291(2031):20241142. doi: 10.1098/rspb.2024.1142. Epub 2024 Sep 18.
ABSTRACT
Malaria control relies on insecticides targeting the mosquito vector, but this is increasingly compromised by insecticide resistance, which can be achieved by elevated expression of detoxifying enzymes that metabolize the insecticide. In diploid organisms, gene expression is regulated both in cis, by regulatory sequences on the same chromosome, and by trans acting factors, affecting both alleles equally. Differing levels of transcription can be caused by mutations in cis-regulatory modules (CRM), but few of these have been identified in mosquitoes. We crossed bendiocarb-resistant and susceptible Anopheles gambiae strains to identify cis-regulated genes that might be responsible for the resistant phenotype using RNAseq, and CRM sequences controlling gene expression in insecticide resistance relevant tissues were predicted using machine learning. We found 115 genes showing allele-specific expression (ASE) in hybrids of insecticide susceptible and resistant strains, suggesting cis-regulation is an important mechanism of gene expression regulation in A. gambiae. The genes showing ASE included a higher proportion of Anopheles-specific genes on average younger than genes with balanced allelic expression.
PMID:39288798 | PMC:PMC11407855 | DOI:10.1098/rspb.2024.1142
Epigenomic landscapes during prefrontal cortex development and aging in rhesus
Natl Sci Rev. 2024 Jun 18;11(8):nwae213. doi: 10.1093/nsr/nwae213. eCollection 2024 Aug.
ABSTRACT
The prefrontal cortex (PFC) is essential for higher-level cognitive functions. How epigenetic dynamics participates in PFC development and aging is largely unknown. Here, we profiled epigenomic landscapes of rhesus monkey PFCs from prenatal to aging stages. The dynamics of chromatin states, including higher-order chromatin structure, chromatin interaction and histone modifications are coordinated to regulate stage-specific gene transcription, participating in distinct processes of neurodevelopment. Dramatic changes of epigenetic signals occur around the birth stage. Notably, genes involved in neuronal cell differentiation and layer specification are pre-configured by bivalent promoters. We identified a cis-regulatory module and the transcription factors (TFs) associated with basal radial glia development, which was associated with large brain size in primates. These TFs include GLI3, CREB5 and SOX9. Interestingly, the genes associated with the basal radial glia (bRG)-associated cis-element module, such as SRY and SOX9, are enriched in sex differentiation. Schizophrenia-associated single nucleotide polymorphisms are more enriched in super enhancers (SEs) than typical enhancers, suggesting that SEs play an important role in neural network wiring. A cis-regulatory element of DBN1 is identified, which is critical for neuronal cell proliferation and synaptic neuron differentiation. Notably, the loss of distal chromatin interaction and H3K27me3 signal are hallmarks of PFC aging, which are associated with abnormal expression of aging-related genes and transposon activation, respectively. Collectively, our findings shed light on epigenetic mechanisms underlying primate brain development and aging.
PMID:39183748 | PMC:PMC11342245 | DOI:10.1093/nsr/nwae213
Integrative multi-omics increase resolution of the sea urchin posterior gut gene regulatory network at single cell level
Development. 2024 Jul 26:dev.202278. doi: 10.1242/dev.202278. Online ahead of print.
ABSTRACT
Drafting gene regulatory networks (GRNs) requires embryological knowledge pertaining to the cell type families, information on the regulatory genes, causal data from gene knockdown experiments and validations of the identified interactions by cis-regulatory analysis. We use multi-omics involving next-generation sequencing (-seq) to obtain the necessary information for drafting Strongylocentrotus purpuratus posterior gut GRN. Here we present an update to the GRN using i) a single cell RNA-seq derived cell atlas highlighting the 2 day post fertilization (dpf) sea urchin gastrula cell type families, as well as the genes expressed at single cell level, ii) a set of putative cis-regulatory modules and transcription factor (TF) binding sites obtained from chromatin accessibility ATAC-seq data, and iii) interactions directionality obtained from differential bulk RNA-seq following knockdown of the TF Sp-Pdx1, a key regulator of gut patterning in sea urchins. Combining these datasets, we draft the GRN for the hindgut Sp-Pdx1 positive cells in the 2 dpf gastrula embryo. Overall, our data suggests the complex connectivity of the posterior gut GRN and increases the resolution of gene regulatory cascades operating within it.
PMID:39058236 | DOI:10.1242/dev.202278
Integration of chromosome locations and functional aspects of enhancers and topologically associating domains in knowledge graphs enables versatile queries about gene regulation
Nucleic Acids Res. 2024 Jul 5:gkae566. doi: 10.1093/nar/gkae566. Online ahead of print.
ABSTRACT
Knowledge about transcription factor binding and regulation, target genes, cis-regulatory modules and topologically associating domains is not only defined by functional associations like biological processes or diseases but also has a determinative genome location aspect. Here, we exploit these location and functional aspects together to develop new strategies to enable advanced data querying. Many databases have been developed to provide information about enhancers, but a schema that allows the standardized representation of data, securing interoperability between resources, has been lacking. In this work, we use knowledge graphs for the standardized representation of enhancers and topologically associating domains, together with data about their target genes, transcription factors, location on the human genome, and functional data about diseases and gene ontology annotations. We used this schema to integrate twenty-five enhancer datasets and two domain datasets, creating the most powerful integrative resource in this field to date. The knowledge graphs have been implemented using the Resource Description Framework and integrated within the open-access BioGateway knowledge network, generating a resource that contains an interoperable set of knowledge graphs (enhancers, TADs, genes, proteins, diseases, GO terms, and interactions between domains). We show how advanced queries, which combine functional and location restrictions, can be used to develop new hypotheses about functional aspects of gene expression regulation.
PMID:38967009 | DOI:10.1093/nar/gkae566
DMLS: an automated pipeline to extract the Drosophila modular transcription regulators and targets from massive literature articles
Database (Oxford). 2024 Jun 20;2024:0. doi: 10.1093/database/baae049.
ABSTRACT
Transcription regulation in multicellular species is mediated by modular transcription factor (TF) binding site combinations termed cis-regulatory modules (CRMs). Such CRM-mediated transcription regulation determines the gene expression patterns during development. Biologists frequently investigate CRM transcription regulation on gene expressions. However, the knowledge of the target genes and regulatory TFs participating in the CRMs under study is mostly fragmentary throughout the literature. Researchers need to afford tremendous human resources to fully surf through the articles deposited in biomedical literature databases in order to obtain the information. Although several novel text-mining systems are now available for literature triaging, these tools do not specifically focus on CRM-related literature prescreening, failing to correctly extract the information of the CRM target genes and regulatory TFs from the literature. For this reason, we constructed a supportive auto-literature prescreener called Drosophila Modular transcription-regulation Literature Screener (DMLS) that achieves the following: (i) prescreens articles describing experiments on modular transcription regulation, (ii) identifies the described target genes and TFs of the CRMs under study for each modular transcription-regulation-describing article and (iii) features an automated and extendable pipeline to perform the task. We demonstrated that the final performance of DMLS in extracting the described target gene and regulatory TF lists of CRMs under study for given articles achieved test macro area under the ROC curve (auROC) = 89.7% and area under the precision-recall curve (auPRC) = 77.6%, outperforming the intuitive gene name-occurrence-counting method by at least 19.9% in auROC and 30.5% in auPRC. The web service and the command line versions of DMLS are available at https://cobis.bme.ncku.edu.tw/DMLS/ and https://github.com/cobisLab/DMLS/, respectively. Database Tool URL: https://cobis.bme.ncku.edu.tw/DMLS/.
PMID:38900628 | DOI:10.1093/database/baae049
Validated Negative Regions (VNRs) in the VISTA Database might be Truncated Forms of Bona Fide Enhancers
Adv Genet (Hoboken). 2024 May 16;5(2):2300209. doi: 10.1002/ggn2.202300209. eCollection 2024 Jun.
ABSTRACT
The VISTA enhancer database is a valuable resource for evaluating predicted enhancers in humans and mice. In addition to thousands of validated positive regions (VPRs) in the human and mouse genomes, the database also contains similar numbers of validated negative regions (VNRs). It is previously shown that the VPRs are on average half as long as predicted overlapping enhancers that are highly conserved and hypothesize that the VPRs may be truncated forms of long bona fide enhancers. Here, it is shown that like the VPRs, the VNRs also are under strong evolutionary constraints and overlap predicted enhancers in the genomes. The VNRs are also on average half as long as predicted overlapping enhancers that are highly conserved. Moreover, the VNRs and the VPRs display similar cell/tissue-specific modification patterns of key epigenetic marks of active enhancers. Furthermore, the VNRs and the VPRs show similar impact score spectra of in silico mutagenesis. These highly similar properties between the VPRs and the VNRs suggest that like the VPRs, the VNRs may also be truncated forms of long bona fide enhancers.
PMID:38884049 | PMC:PMC11170074 | DOI:10.1002/ggn2.202300209
A cis-regulatory module underlies retinal ganglion cell genesis and axonogenesis
Cell Rep. 2024 May 31;43(6):114291. doi: 10.1016/j.celrep.2024.114291. Online ahead of print.
ABSTRACT
Atoh7 is transiently expressed in retinal progenitor cells (RPCs) and is required for retinal ganglion cell (RGC) differentiation. In humans, a deletion in a distal non-coding regulatory region upstream of ATOH7 is associated with optic nerve atrophy and blindness. Here, we functionally interrogate the significance of the Atoh7 regulatory landscape to retinogenesis in mice. Deletion of the Atoh7 enhancer structure leads to RGC deficiency, optic nerve hypoplasia, and retinal blood vascular abnormalities, phenocopying inactivation of Atoh7. Further, loss of the Atoh7 remote enhancer impacts ipsilaterally projecting RGCs and disrupts proper axonal projections to the visual thalamus. Deletion of the Atoh7 remote enhancer is also associated with the dysregulation of axonogenesis genes, including the derepression of the axon repulsive cue Robo3. Our data provide insights into how Atoh7 enhancer elements function to promote RGC development and optic nerve formation and highlight a key role of Atoh7 in the transcriptional control of axon guidance molecules.
PMID:38823017 | DOI:10.1016/j.celrep.2024.114291
A cell cycle regulator, E2F2, and glucocorticoid receptor cooperatively transactivate the bovine alphaherpesvirus 1 immediate early transcription unit 1 promoter
J Virol. 2024 May 21:e0042324. doi: 10.1128/jvi.00423-24. Online ahead of print.
ABSTRACT
Bovine alphaherpesvirus 1 (BoHV-1) infection causes respiratory tract disorders and immune suppression and may induce bacterial pneumonia. BoHV-1 establishes lifelong latency in sensory neurons after acute infection. Reactivation from latency consistently occurs following stress or intravenous injection of the synthetic corticosteroid dexamethasone (DEX), which mimics stress. The immediate early transcription unit 1 (IEtu1) promoter drives expression of infected cell protein 0 (bICP0) and bICP4, two viral transcriptional regulators necessary for productive infection and reactivation from latency. The IEtu1 promoter contains two glucocorticoid receptor (GR) responsive elements (GREs) that are transactivated by activated GR. GC-rich motifs, including consensus binding sites for specificity protein 1 (Sp1), are in the IEtu1 promoter sequences. E2F family members bind a consensus sequence (TTTCCCGC) and certain specificity protein 1 (Sp1) sites. Consequently, we hypothesized that certain E2F family members activate IEtu1 promoter activity. DEX treatment of latently infected calves increased the number of E2F2+ TG neurons. GR and E2F2, but not E2F1, E2F3a, or E2F3b, cooperatively transactivate a 436-bp cis-regulatory module in the IEtu1 promoter that contains both GREs. A luciferase reporter construct containing a 222-bp fragment downstream of the GREs was transactivated by E2F2 unless two adjacent Sp1 binding sites were mutated. Chromatin immunoprecipitation studies revealed that E2F2 occupied IEtu1 promoter sequences when the BoHV-1 genome was transfected into mouse neuroblastoma (Neuro-2A) or monkey kidney (CV-1) cells. In summary, these findings revealed that GR and E2F2 cooperatively transactivate IEtu1 promoter activity, which is predicted to influence the early stages of BoHV-1 reactivation from latency.
IMPORTANCE: Bovine alpha-herpesvirus 1 (BoHV-1) acute infection in cattle leads to establishment of latency in sensory neurons in the trigeminal ganglia (TG). A synthetic corticosteroid dexamethasone consistently initiates BoHV-1 reactivation in latently infected calves. The BoHV-1 immediate early transcription unit 1 (IEtu1) promoter regulates expression of infected cell protein 0 (bICP0) and bICP4, two viral transcriptional regulators. Hence, the IEtu1 promoter must be activated for the reactivation to occur. The number of TG neurons expressing E2F2, a transcription factor and cell cycle regulator, increased during early stages of reactivation from latency. The glucocorticoid receptor (GR) and E2F2, but not E2F1, E2F3a, or E2F3b, cooperatively transactivated a 436-bp cis-regulatory module (CRM) in the IEtu1 promoter that contains two GR responsive elements. Chromatin immunoprecipitation studies revealed that E2F2 occupies IEtu1 promoter sequences in cultured cells. GR and E2F2 mediate cooperative transactivation of IEtu1 promoter activity, which is predicted to stimulate viral replication following stressful stimuli.
PMID:38771044 | DOI:10.1128/jvi.00423-24
Characterization of activating cis-regulatory elements from the histone genes of Chlamydomonas reinhardtii
Plant J. 2024 May 1. doi: 10.1111/tpj.16781. Online ahead of print.
ABSTRACT
Regulation of gene expression in eukaryotes is controlled by cis-regulatory modules (CRMs). A major class of CRMs are enhancers which are composed of activating cis-regulatory elements (CREs) responsible for upregulating transcription. To date, most enhancers and activating CREs have been studied in angiosperms; in contrast, our knowledge about these key regulators of gene expression in green algae is limited. In this study, we aimed at characterizing putative activating CREs/CRMs from the histone genes of the unicellular model alga Chlamydomonas reinhardtii. To test the activity of four candidates, reporter constructs consisting of a tetramerized CRE, an established promoter, and a gene for the mCerulean3 fluorescent protein were incorporated into the nuclear genome of C. reinhardtii, and their activity was quantified by flow cytometry. Two tested candidates, Eupstr and Ehist cons, significantly upregulated gene expression and were characterized in detail. Eupstr, which originates from highly expressed genes of C. reinhardtii, is an orientation-independent CRE capable of activating both the RBCS2 and β2-tubulin promoters. Ehist cons, which is a CRM from histone genes of angiosperms, upregulates the β2-tubulin promoter in C. reinhardtii over a distance of at least 1.5 kb. The octamer motif present in Ehist cons was identified in C. reinhardtii and the related green algae Chlamydomonas incerta, Chlamydomonas schloesseri, and Edaphochlamys debaryana, demonstrating its high evolutionary conservation. The results of this investigation expand our knowledge about the regulation of gene expression in green algae. Furthermore, the characterized activating CREs/CRMs can be applied as valuable genetic tools.
PMID:38693717 | DOI:10.1111/tpj.16781
Conserved and novel enhancers in the Aedes aegypti single-minded locus recapitulate embryonic ventral midline gene expression
PLoS Genet. 2024 Apr 29;20(4):e1010891. doi: 10.1371/journal.pgen.1010891. Online ahead of print.
ABSTRACT
Transcriptional cis-regulatory modules, e.g., enhancers, control the time and location of metazoan gene expression. While changes in enhancers can provide a powerful force for evolution, there is also significant deep conservation of enhancers for developmentally important genes, with function and sequence characteristics maintained over hundreds of millions of years of divergence. Not well understood, however, is how the overall regulatory composition of a locus evolves, with important outstanding questions such as how many enhancers are conserved vs. novel, and to what extent are the locations of conserved enhancers within a locus maintained? We begin here to address these questions with a comparison of the respective single-minded (sim) loci in the two dipteran species Drosophila melanogaster (fruit fly) and Aedes aegypti (mosquito). sim encodes a highly conserved transcription factor that mediates development of the arthropod embryonic ventral midline. We identify two enhancers in the A. aegypti sim locus and demonstrate that they function equivalently in both transgenic flies and transgenic mosquitoes. One A. aegypti enhancer is highly similar to known Drosophila counterparts in its activity, location, and autoregulatory capability. The other differs from any known Drosophila sim enhancers with a novel location, failure to autoregulate, and regulation of expression in a unique subset of midline cells. Our results suggest that the conserved pattern of sim expression in the two species is the result of both conserved and novel regulatory sequences. Further examination of this locus will help to illuminate how the overall regulatory landscape of a conserved developmental gene evolves.
PMID:38683842 | DOI:10.1371/journal.pgen.1010891
Unraveling the evolutionary origin of the complex Nuclear Receptor Element (cNRE), a cis-regulatory module required for preferential expression in the atrial chamber
Commun Biol. 2024 Apr 4;7(1):371. doi: 10.1038/s42003-024-05972-6.
ABSTRACT
Cardiac function requires appropriate proteins in each chamber. Atria requires slow myosin to act as reservoirs, while ventricles demand fast myosin for swift pumping. Myosins are thus under chamber-biased cis-regulation, with myosin gene expression imbalances leading to congenital heart dysfunction. To identify regulatory inputs leading to cardiac chamber-biased expression, we computationally and molecularly dissected the quail Slow Myosin Heavy Chain III (SMyHC III) promoter that drives preferential expression to the atria. We show that SMyHC III gene states are orchestrated by a complex Nuclear Receptor Element (cNRE) of 32 base pairs. Using transgenesis in zebrafish and mice, we demonstrate that preferential atrial expression is achieved by a combinatorial regulatory input composed of atrial activation motifs and ventricular repression motifs. Using comparative genomics, we show that the cNRE might have emerged from an endogenous viral element through infection of an ancestral host germline, revealing an evolutionary pathway to cardiac chamber-specific expression.
PMID:38575811 | DOI:10.1038/s42003-024-05972-6
Glucocorticoid receptor and specificity protein 1 (Sp1) or Sp3, but not the antibiotic Mithramycin A, stimulates human alphaherpesvirus 1 (HSV-1) replication
Antiviral Res. 2024 Mar 29:105870. doi: 10.1016/j.antiviral.2024.105870. Online ahead of print.
ABSTRACT
Following acute human alphaherpesvirus 1 (HSV-1) infection of oral-facial mucosal surfaces, sensory neurons in trigeminal ganglia (TG) are important sites for life-long latency. Neurons in the central nervous system, including brainstem, also harbor viral genomes during latency. Periodically, certain cellular stressors trigger reactivation from latency, which can lead to recurrent HSV-1 disease: herpes labialis, herpes stromal keratitis, and encephalitis for example. Activation of the glucocorticoid receptor (GR) by stressful stimuli enhances HSV-1 gene expression, replication, and explant-induced reactivation. GR and certain stress-induced Krüppel like factors (KLF) cooperatively transactivate cis-regulatory modules (CRM) that drive expression of viral transcriptional regulatory proteins (ICP0, ICP4, and ICP27). These CRMs lack GR response elements (GRE); however, specificity protein 1 (Sp1) binding sites are crucial for GR and KLF15 or KLF4 mediated transactivation. Hence, we tested whether Sp1 or Sp3 regulate viral replication and transactivation of the ICP0 promoter. During early stages of explant-induced reactivation from latency, the number of Sp3+ TG neurons were significantly higher relative to TG from latently infected mice. Conversely, Sp1+ TG neurons were only increased in females, but not male mice, during explant-induced reactivation. Sp1 siRNA significantly reduced HSV-1 replication in cultured mouse (Neuro-2A) and monkey (CV-1) cells. Mithramycin A, an antibiotic that has anti-tumor activity preferentially interacts with GC-rich DNA, including Sp1 binding sites, significantly reduced HSV-1 replication indicating it has antiviral activity. GR and Sp1 or Sp3 transactivated the HSV-1 ICP0 promoter in Neuro-2A and CV-1 cells confirming these transcription factors enhance viral replication and gene expression.
PMID:38556059 | DOI:10.1016/j.antiviral.2024.105870
From sequence to consequence: Deciphering the complex cisregulatory landscape
J Biosci. 2024;49:46.
ABSTRACT
Cell type-specific expression of genes plays a pivotal role in the development and evolution of multicellular organisms over millions of years. The majority of regulatory control resides within the non-coding regions of the genome, referred to as 'dark matter', which contains cis-regulatory modules. These cis-regulatory modules function collectively and can impact gene expression even when located far from the target gene, exhibiting context-specific behaviour. Consequently, the cis-regulatory code governing gene expression patterns is intricate, in contrast to the universally understood genetic code. This overview centres on the current knowledge regarding cis-regulatory elements, primarily enhancers and their role in governing the spatiotemporal gene expression patterns, and how they have evolved and adapted across different species.
PMID:38516913
Aberrant non-canonical NF-κB signalling reprograms the epigenome landscape to drive oncogenic transcriptomes in multiple myeloma
Nat Commun. 2024 Mar 21;15(1):2513. doi: 10.1038/s41467-024-46728-4.
ABSTRACT
In multiple myeloma, abnormal plasma cells establish oncogenic niches within the bone marrow by engaging the NF-κB pathway to nurture their survival while they accumulate pro-proliferative mutations. Under these conditions, many cases eventually develop genetic abnormalities endowing them with constitutive NF-κB activation. Here, we find that sustained NF-κB/p52 levels resulting from such mutations favours the recruitment of enhancers beyond the normal B-cell repertoire. Furthermore, through targeted disruption of p52, we characterise how such enhancers are complicit in the formation of super-enhancers and the establishment of cis-regulatory interactions with myeloma dependencies during constitutive activation of p52. Finally, we functionally validate the pathological impact of these cis-regulatory modules on cell and tumour phenotypes using in vitro and in vivo models, confirming RGS1 as a p52-dependent myeloma driver. We conclude that the divergent epigenomic reprogramming enforced by aberrant non-canonical NF-κB signalling potentiates transcriptional programs beneficial for multiple myeloma progression.
PMID:38514625 | DOI:10.1038/s41467-024-46728-4
Cacao pod transcriptome profiling of seven genotypes identifies features associated with post-penetration resistance to Phytophthora palmivora
Sci Rep. 2024 Feb 20;14(1):4175. doi: 10.1038/s41598-024-54355-8.
ABSTRACT
The oomycete Phytophthora palmivora infects the fruit of cacao trees (Theobroma cacao) causing black pod rot and reducing yields. Cacao genotypes vary in their resistance levels to P. palmivora, yet our understanding of how cacao fruit respond to the pathogen at the molecular level during disease establishment is limited. To address this issue, disease development and RNA-Seq studies were conducted on pods of seven cacao genotypes (ICS1, WFT, Gu133, Spa9, CCN51, Sca6 and Pound7) to better understand their reactions to the post-penetration stage of P. palmivora infection. The pod tissue-P. palmivora pathogen assay resulted in the genotypes being classified as susceptible (ICS1, WFT, Gu133 and Spa9) or resistant (CCN51, Sca6 and Pound7). The number of differentially expressed genes (DEGs) ranged from 1625 to 6957 depending on genotype. A custom gene correlation approach identified 34 correlation groups. De novo motif analysis was conducted on upstream promoter sequences of differentially expressed genes, identifying 76 novel motifs, 31 of which were over-represented in the upstream sequences of correlation groups and associated with gene ontology terms related to oxidative stress response, defense against fungal pathogens, general metabolism and cell function. Genes in one correlation group (Group 6) were strongly induced in all genotypes and enriched in genes annotated with defense-responsive terms. Expression pattern profiling revealed that genes in Group 6 were induced to higher levels in the resistant genotypes. An additional analysis allowed the identification of 17 candidate cis-regulatory modules likely to be involved in cacao defense against P. palmivora. This study is a comprehensive exploration of the cacao pod transcriptional response to P. palmivora spread after infection. We identified cacao genes, promoter motifs, and promoter motif combinations associated with post-penetration resistance to P. palmivora in cacao pods and provide this information as a resource to support future and ongoing efforts to breed P. palmivora-resistant cacao.
PMID:38378988 | DOI:10.1038/s41598-024-54355-8
BestCRM: An Exhaustive Search for Optimal Cis-Regulatory Modules in Promoters Accelerated by the Multidimensional Hash Function
Int J Mol Sci. 2024 Feb 5;25(3):1903. doi: 10.3390/ijms25031903.
ABSTRACT
The concept of cis-regulatory modules located in gene promoters represents today's vision of the organization of gene transcriptional regulation. Such modules are a combination of two or more single, short DNA motifs. The bioinformatic identification of such modules belongs to so-called NP-hard problems with extreme computational complexity, and therefore, simplifications, assumptions, and heuristics are usually deployed to tackle the problem. In practice, this requires, first, many parameters to be set before the search, and second, it leads to the identification of locally optimal results. Here, a novel method is presented, aimed at identifying the cis-regulatory elements in gene promoters based on an exhaustive search of all the feasible modules' configurations. All required parameters are automatically estimated using positive and negative datasets. To be computationally efficient, the search is accelerated using a multidimensional hash function, allowing the search to complete in a few hours on a regular laptop (for example, a CPU Intel i7, 3.2 GH, 32 Gb RAM). Tests on an established benchmark and real data show better performance of BestCRM compared to the available methods according to several metrics like specificity, sensitivity, AUC, etc. A great practical advantage of the method is its minimum number of input parameters-apart from positive and negative promoters, only a desired level of module presence in promoters is required.
PMID:38339181 | PMC:PMC10856692 | DOI:10.3390/ijms25031903
<em>Cis</em>-regulatory modes of <em>Ultrabithorax</em> inactivation in butterfly forewings
Elife. 2024 Jan 23;12:RP90846. doi: 10.7554/eLife.90846.
ABSTRACT
Hox gene clusters encode transcription factors that drive regional specialization during animal development: for example the Hox factor Ubx is expressed in the insect metathoracic (T3) wing appendages and differentiates them from T2 mesothoracic identities. Hox transcriptional regulation requires silencing activities that prevent spurious activation and regulatory crosstalks in the wrong tissues, but this has seldom been studied in insects other than Drosophila, which shows a derived Hox dislocation into two genomic clusters that disjoined Antennapedia (Antp) and Ultrabithorax (Ubx). Here, we investigated how Ubx is restricted to the hindwing in butterflies, amidst a contiguous Hox cluster. By analysing Hi-C and ATAC-seq data in the butterfly Junonia coenia, we show that a Topologically Associated Domain (TAD) maintains a hindwing-enriched profile of chromatin opening around Ubx. This TAD is bordered by a Boundary Element (BE) that separates it from a region of joined wing activity around the Antp locus. CRISPR mutational perturbation of this BE releases ectopic Ubx expression in forewings, inducing homeotic clones with hindwing identities. Further mutational interrogation of two non-coding RNA encoding regions and one putative cis-regulatory module within the Ubx TAD cause rare homeotic transformations in both directions, indicating the presence of both activating and repressing chromatin features. We also describe a series of spontaneous forewing homeotic phenotypes obtained in Heliconius butterflies, and discuss their possible mutational basis. By leveraging the extensive wing specialization found in butterflies, our initial exploration of Ubx regulation demonstrates the existence of silencing and insulating sequences that prevent its spurious expression in forewings.
PMID:38261357 | DOI:10.7554/eLife.90846
Deep molecular learning of transcriptional control of a synthetic CRE enhancer and its variants
iScience. 2023 Dec 15;27(1):108747. doi: 10.1016/j.isci.2023.108747. eCollection 2024 Jan 19.
ABSTRACT
Massively parallel reporter assay measures transcriptional activities of various cis-regulatory modules (CRMs) in a single experiment. We developed a thermodynamic computational model framework that calculates quantitative levels of gene expression directly from regulatory DNA sequences. Using the framework, we investigated the molecular mechanisms of cis-regulatory mutations of a synthetic enhancer that cause abnormal gene expression. We found that, in a human cell line, competitive binding between family transcription factors (TFs) with slightly different binding preferences significantly increases the accuracy of recapitulating the transcriptional effects of thousands of single- or multi-mutations. We also discovered that even if various harmful mutations occurred in an activator binding site, CRM could stably maintain or even increase gene expression through a certain form of competitive binding between family TFs. These findings enhance understanding the effect of SNPs and indels on CRMs and would help building robust custom-designed CRMs for biologics production and gene therapy.
PMID:38222110 | PMC:PMC10784702 | DOI:10.1016/j.isci.2023.108747
Mechanisms of transcriptional regulation in <em>Anopheles gambiae</em> revealed by allele specific expression
bioRxiv. 2023 Nov 22:2023.11.22.568226. doi: 10.1101/2023.11.22.568226. Preprint.
ABSTRACT
Malaria control relies on insecticides targeting the mosquito vector, but this is increasingly compromised by insecticide resistance, which can be achieved by elevated expression of detoxifying enzymes that metabolize the insecticide. In diploid organisms, gene expression is regulated both in cis , by regulatory sequences on the same chromosome, and by trans acting factors, affecting both alleles equally. Differing levels of transcription can be caused by mutations in cis -regulatory modules (CRM), but few of these have been identified in mosquitoes. We crossed bendiocarb resistant and susceptible Anopheles gambiae strains to identify cis -regulated genes that might be responsible for the resistant phenotype using RNAseq, and cis -regulatory module sequences controlling gene expression in insecticide resistance relevant tissues were predicted using machine learning. We found 115 genes showing allele specific expression in hybrids of insecticide susceptible and resistant strains, suggesting cis regulation is an important mechanism of gene expression regulation in Anopheles gambiae . The genes showing allele specific expression included a higher proportion of Anopheles specific genes on average younger than genes those with balanced allelic expression.
PMID:38045426 | PMC:PMC10690255 | DOI:10.1101/2023.11.22.568226