Nucleic Acids Research

Syndicate content
Nucleic Acids Research - RSS feed of current issue
Updated: 8 years 21 weeks ago

The interplay between DNA methylation and sequence divergence in recent human evolution

Tue, 09/29/2015 - 01:41

Despite the increasing knowledge about DNA methylation, the understanding of human epigenome evolution is in its infancy. Using whole genome bisulfite sequencing we identified hundreds of differentially methylated regions (DMRs) in humans compared to non-human primates and estimated that ~25% of these regions were detectable throughout several human tissues. Human DMRs were enriched for specific histone modifications and the majority were located distal to transcription start sites, highlighting the importance of regions outside the direct regulatory context. We also found a significant excess of endogenous retrovirus elements in human-specific hypomethylated.

We reported for the first time a close interplay between inter-species genetic and epigenetic variation in regions of incomplete lineage sorting, transcription factor binding sites and human differentially hypermethylated regions. Specifically, we observed an excess of human-specific substitutions in transcription factor binding sites located within human DMRs, suggesting that alteration of regulatory motifs underlies some human-specific methylation patterns. We also found that the acquisition of DNA hypermethylation in the human lineage is frequently coupled with a rapid evolution at nucleotide level in the neighborhood of these CpG sites. Taken together, our results reveal new insights into the mechanistic basis of human-specific DNA methylation patterns and the interpretation of inter-species non-coding variation.

Categories: Journal Articles

Chromosomal position shift of a regulatory gene alters the bacterial phenotype

Tue, 09/29/2015 - 01:41

Recent studies strongly suggest that in bacterial cells the order of genes along the chromosomal origin-to-terminus axis is determinative for regulation of the growth phase-dependent gene expression. The prediction from this observation is that positional displacement of pleiotropic genes will affect the genetic regulation and hence, the cellular phenotype. To test this prediction we inserted the origin-proximal dusB-fis operon encoding the global regulator FIS in the vicinity of replication terminus on both arms of the Escherichia coli chromosome. We found that the lower fis gene dosage in the strains with terminus-proximal dusB-fis operons was compensated by increased fis expression such that the intracellular concentration of FIS was homeostatically adjusted. Nevertheless, despite unchanged FIS levels the positional displacement of dusB-fis impaired the competitive growth fitness of cells and altered the state of the overarching network regulating DNA topology, as well as the cellular response to environmental stress, hazardous substances and antibiotics. Our finding that the chromosomal repositioning of a regulatory gene can determine the cellular phenotype unveils an important yet unexplored facet of the genetic control mechanisms and paves the way for novel approaches to manipulate bacterial physiology.

Categories: Journal Articles

ZNF555 protein binds to transcriptional activator site of 4qA allele and ANT1: potential implication in Facioscapulohumeral dystrophy

Tue, 09/29/2015 - 01:41

Facioscapulohumeral dystrophy (FSHD) is an epi/genetic satellite disease associated with at least two satellite sequences in 4q35: (i) D4Z4 macrosatellite and (ii) β-satellite repeats (BSR), a prevalent part of the 4qA allele. Most of the recent FSHD studies have been focused on a DUX4 transcript inside D4Z4 and its tandem contraction in FSHD patients. However, the D4Z4-contraction alone is not pathological, which would also require the 4qA allele. Since little is known about BSR, we investigated the 4qA BSR functional role in the transcriptional control of the FSHD region 4q35. We have shown that an individual BSR possesses enhancer activity leading to activation of the Adenine Nucleotide Translocator 1 gene (ANT1), a major FSHD candidate gene. We have identified ZNF555, a previously uncharacterized protein, as a putative transcriptional factor highly expressed in human primary myoblasts that interacts with the BSR enhancer site and impacts the ANT1 promoter activity in FSHD myoblasts. The discovery of the functional role of the 4qA allele and ZNF555 in the transcriptional control of ANT1 advances our understanding of FSHD pathogenesis and provides potential therapeutic targets.

Categories: Journal Articles

Deciphering the principles that govern mutually exclusive expression of Plasmodium falciparum clag3 genes

Tue, 09/29/2015 - 01:41

The product of the Plasmodium falciparum genes clag3.1 and clag3.2 plays a fundamental role in malaria parasite biology by determining solute transport into infected erythrocytes. Expression of the two clag3 genes is mutually exclusive, such that a single parasite expresses only one of the two genes at a time. Here we investigated the properties and mechanisms of clag3 mutual exclusion using transgenic parasite lines with extra copies of clag3 promoters located either in stable episomes or integrated in the parasite genome. We found that the additional clag3 promoters in these transgenic lines are silenced by default, but under strong selective pressure parasites with more than one clag3 promoter simultaneously active are observed, demonstrating that clag3 mutual exclusion is strongly favored but it is not strict. We show that silencing of clag3 genes is associated with the repressive histone mark H3K9me3 even in parasites with unusual clag3 expression patterns, and we provide direct evidence for heterochromatin spreading in P. falciparum. We also found that expression of a neighbor ncRNA correlates with clag3.1 expression. Altogether, our results reveal a scenario where fitness costs and non-deterministic molecular processes that favor mutual exclusion shape the expression patterns of this important gene family.

Categories: Journal Articles

Splicing inhibition decreases phosphorylation level of Ser2 in Pol II CTD

Tue, 09/29/2015 - 01:41

Phosphorylation of the C-terminal domain of the largest subunit of RNA polymerase II (Pol II), especially Ser2 and Ser5 residues, plays important roles in transcription and mRNA processing, including 5' end capping, splicing and 3' end processing. These phosphorylation events stimulate mRNA processing, however, it is not clear whether splicing activity affects the phosphorylation status of Pol II. In this study, we found that splicing inhibition by potent splicing inhibitors spliceostatin A (SSA) and pladienolide B or by antisense oligos against snRNAs decreased phospho-Ser2 level, but had little or no effects on phospho-Ser5 level. In contrast, transcription and translation inhibitors did not decrease phospho-Ser2 level, therefore inhibition of not all the gene expression processes cause the decrease of phospho-Ser2. SSA treatment caused early dissociation of Pol II and decrease in phospho-Ser2 level of chromatin-bound Pol II, suggesting that splicing inhibition causes downregulation of phospho-Ser2 through at least these two mechanisms.

Categories: Journal Articles

ChIP-seq reveals the global regulator AlgR mediating cyclic di-GMP synthesis in Pseudomonas aeruginosa

Tue, 09/29/2015 - 01:41

AlgR is a key transcriptional regulator required for the expression of multiple virulence factors, including type IV pili and alginate in Pseudomonas aeruginosa. However, the regulon and molecular regulatory mechanism of AlgR have yet to be fully elucidated. Here, among 157 loci that were identified by a ChIP-seq assay, we characterized a gene, mucR, which encodes an enzyme that synthesizes the intracellular second messenger cyclic diguanylate (c-di-GMP). A algR strain produced lesser biofilm than did the wild-type strain, which is consistent with a phenotype controlled by c-di-GMP. AlgR positively regulates mucR via direct binding to its promoter. A algRmucR double mutant produced lesser biofilm than did the single algR mutant, demonstrating that c-di-GMP is a positive regulator of biofilm formation. AlgR controls the levels of c-di-GMP synthesis via direct regulation of mucR. In addition, the cognate sensor of AlgR, FimS/AlgZ, also plays an important role in P. aeruginosa virulence. Taken together, this study provides new insights into the AlgR regulon and reveals the involvement of c-di-GMP in the mechanism underlying AlgR regulation.

Categories: Journal Articles

The RNA-binding protein HOS5 and serine/arginine-rich proteins RS40 and RS41 participate in miRNA biogenesis in Arabidopsis

Tue, 09/29/2015 - 01:41

MicroRNAs are a class of small regulatory RNAs that are generated from primary miRNA (pri-miRNA) transcripts with a stem-loop structure. Accuracy of the processing of pri-miRNA into mature miRNA in plants can be enhanced by SERRATE (SE) and HYPONASTIC LEAVES 1 (HYL1). HYL1 activity is regulated by the FIERY2 (FRY2)/RNA polymerase II C-terminal domain phosphatase-like 1 (CPL1). Here, we discover that HIGH OSMOTIC STRESS GENE EXPRESSION 5 (HOS5) and two serine/arginine-rich splicing factors RS40 and RS41, previously shown to be involved in pre-mRNA splicing, affect the biogenesis of a subset of miRNA. These proteins are required for correct miRNA strand selection and the maintenance of miRNA levels. FRY2 dephosphorylates HOS5 whose phosphorylation status affects its subnuclear localization. HOS5 and the RS proteins bind both intronless and intron-containing pri-miRNAs. Importantly, all of these splicing-related factors directly interact with both HYL1 and SE in nuclear splicing speckles. Our results indicate that these splicing factors are directly involved in the biogenesis of a group of miRNA.

Categories: Journal Articles

The yeast genome undergoes significant topological reorganization in quiescence

Tue, 09/29/2015 - 01:41

We have examined the three-dimensional organization of the yeast genome during quiescence by a chromosome capture technique as a means of understanding how genome organization changes during development. For exponentially growing cells we observe high levels of inter-centromeric interaction but otherwise a predominance of intrachromosomal interactions over interchromosomal interactions, consistent with aggregation of centromeres at the spindle pole body and compartmentalization of individual chromosomes within the nucleoplasm. Three major changes occur in the organization of the quiescent cell genome. First, intrachromosomal associations increase at longer distances in quiescence as compared to growing cells. This suggests that chromosomes undergo condensation in quiescence, which we confirmed by microscopy by measurement of the intrachromosomal distances between two sites on one chromosome. This compaction in quiescence requires the condensin complex. Second, inter-centromeric interactions decrease, consistent with prior data indicating that centromeres disperse along an array of microtubules during quiescence. Third, inter-telomeric interactions significantly increase in quiescence, an observation also confirmed by direct measurement. Thus, survival during quiescence is associated with substantial topological reorganization of the genome.

Categories: Journal Articles

In vivo detection and replication studies of {alpha}-anomeric lesions of 2'-deoxyribonucleosides

Tue, 09/29/2015 - 01:41

DNA damage, arising from endogenous metabolism or exposure to environmental agents, may perturb the transmission of genetic information by blocking DNA replication and/or inducing mutations, which contribute to the development of cancer and likely other human diseases. Hydroxyl radical attack on the C1', C3' and C4' of 2-deoxyribose can give rise to epimeric 2-deoxyribose lesions, for which the in vivo occurrence and biological consequences remain largely unexplored. Through independent chemical syntheses of all three epimeric lesions of 2'-deoxyguanosine (dG) and liquid chromatography-tandem mass spectrometry analysis, we demonstrated unambiguously the presence of substantial levels of the α-anomer of dG (α-dG) in calf thymus DNA and in DNA isolated from mouse pancreatic tissues. We further assessed quantitatively the impact of all four α-dN lesions on DNA replication in Escherichia coli by employing a shuttle-vector method. We found that, without SOS induction, all α-dN lesions except α-dA strongly blocked DNA replication and, while replication across α-dA was error-free, replicative bypass of α-dC and α-dG yielded mainly C->A and G->A mutations. In addition, SOS induction could lead to markedly elevated bypass efficiencies for the four α-dN lesions, abolished the G->A mutation for α-dG, pronouncedly reduced the C->A mutation for α-dC and triggered T->A mutation for α-dT. The preferential misincorporation of dTMP opposite the α-dNs could be attributed to the unique base-pairing properties of the nucleobases elicited by the inversion of the configuration of the N-glycosidic linkage. Our results also revealed that Pol V played a major role in bypassing α-dC, α-dG and α-dT in vivo. The abundance of α-dG in mammalian tissue and the impact of the α-dNs on DNA replication demonstrate for the first time the biological significance of this family of DNA lesions.

Categories: Journal Articles

FANCD2 and REV1 cooperate in the protection of nascent DNA strands in response to replication stress

Tue, 09/29/2015 - 01:41

REV1 is a eukaryotic member of the Y-family of DNA polymerases involved in translesion DNA synthesis and genome mutagenesis. Recently, REV1 is also found to function in homologous recombination. However, it remains unclear how REV1 is recruited to the sites where homologous recombination is processed. Here, we report that loss of mammalian REV1 results in a specific defect in replication-associated gene conversion. We found that REV1 is targeted to laser-induced DNA damage stripes in a manner dependent on its ubiquitin-binding motifs, on RAD18, and on monoubiquitinated FANCD2 (FANCD2-mUb) that associates with REV1. Expression of a FANCD2-Ub chimeric protein in RAD18-depleted cells enhances REV1 assembly at laser-damaged sites, suggesting that FANCD2-mUb functions downstream of RAD18 to recruit REV1 to DNA breaks. Consistent with this suggestion we found that REV1 and FANCD2 are epistatic with respect to sensitivity to the double-strand break-inducer camptothecin. REV1 enrichment at DNA damage stripes also partially depends on BRCA1 and BRCA2, components of the FANCD2/BRCA supercomplex. Intriguingly, analogous to FANCD2-mUb and BRCA1/BRCA2, REV1 plays an unexpected role in protecting nascent replication tracts from degradation by stabilizing RAD51 filaments. Collectively these data suggest that REV1 plays multiple roles at stalled replication forks in response to replication stress.

Categories: Journal Articles

DNA polymerases {kappa} and {zeta} cooperatively perform mutagenic translesion synthesis of the C8-2'-deoxyguanosine adduct of the dietary mutagen IQ in human cells

Tue, 09/29/2015 - 01:41

The roles of translesion synthesis (TLS) DNA polymerases in bypassing the C8–2'-deoxyguanosine adduct (dG-C8-IQ) formed by 2-amino-3-methylimidazo[4,5-f]quinoline (IQ), a highly mutagenic and carcinogenic heterocyclic amine found in cooked meats, were investigated. Three plasmid vectors containing the dG-C8-IQ adduct at the G1-, G2- or G3-positions of the NarI site (5'-G1G2CG3CC-3') were replicated in HEK293T cells. Fifty percent of the progeny from the G3 construct were mutants, largely G->T, compared to 18% and 24% from the G1 and G2 constructs, respectively. Mutation frequency (MF) of dG-C8-IQ was reduced by 38–67% upon siRNA knockdown of pol , whereas it was increased by 10–24% in pol knockdown cells. When pol and pol were simultaneously knocked down, MF of the G1 and G3 constructs was reduced from 18% and 50%, respectively, to <3%, whereas it was reduced from 24% to <1% in the G2 construct. In vitro TLS using yeast pol showed that it can extend G3*:A pair more efficiently than G3*:C pair, but it is inefficient at nucleotide incorporation opposite dG-C8-IQ. We conclude that pol and pol cooperatively carry out the majority of the error-prone TLS of dG-C8-IQ, whereas pol is involved primarily in its error-free bypass.

Categories: Journal Articles

RAX2: a genome-wide detection method of condition-associated transcription variation

Fri, 08/28/2015 - 01:15

Most mammalian genes have mRNA variants due to alternative promoter usage, alternative splicing, and alternative cleavage and polyadenylation. Expression of alternative RNA isoforms has been found to be associated with tumorigenesis, proliferation and differentiation. Detection of condition-associated transcription variation requires association methods. Traditional association methods such as Pearson chi-square test and Fisher Exact test are single test methods and do not work on count data with replicates. Although the Cochran Mantel Haenszel (CMH) approach can handle replicated count data, our simulations showed that multiple CMH tests still had very low power. To identify condition-associated variation of transcription, we here proposed a ranking analysis of chi-squares (RAX2) for large-scale association analysis. RAX2 is a nonparametric method and has accurate and conservative estimation of FDR profile. Simulations demonstrated that RAX2 performs well in finding condition-associated transcription variants. We applied RAX2 to primary T-cell transcriptomic data and identified 1610 (16.3%) tags associated in transcription with immune stimulation at FDR < 0.05. Most of these tags also had differential expression. Analysis of two and three tags within genes revealed that under immune stimulation short RNA isoforms were preferably used.

Categories: Journal Articles

Why weight? Modelling sample and observational level variability improves power in RNA-seq analyses

Fri, 08/28/2015 - 01:15

Variations in sample quality are frequently encountered in small RNA-sequencing experiments, and pose a major challenge in a differential expression analysis. Removal of high variation samples reduces noise, but at a cost of reducing power, thus limiting our ability to detect biologically meaningful changes. Similarly, retaining these samples in the analysis may not reveal any statistically significant changes due to the higher noise level. A compromise is to use all available data, but to down-weight the observations from more variable samples. We describe a statistical approach that facilitates this by modelling heterogeneity at both the sample and observational levels as part of the differential expression analysis. At the sample level this is achieved by fitting a log-linear variance model that includes common sample-specific or group-specific parameters that are shared between genes. The estimated sample variance factors are then converted to weights and combined with observational level weights obtained from the mean–variance relationship of the log-counts-per-million using ‘voom’. A comprehensive analysis involving both simulations and experimental RNA-sequencing data demonstrates that this strategy leads to a universally more powerful analysis and fewer false discoveries when compared to conventional approaches. This methodology has wide application and is implemented in the open-source ‘limma’ package.

Categories: Journal Articles

Efficient exploration of pan-cancer networks by generalized covariance selection and interactive web content

Fri, 08/28/2015 - 01:15

Statistical network modeling techniques are increasingly important tools to analyze cancer genomics data. However, current tools and resources are not designed to work across multiple diagnoses and technical platforms, thus limiting their applicability to comprehensive pan-cancer datasets such as The Cancer Genome Atlas (TCGA). To address this, we describe a new data driven modeling method, based on generalized Sparse Inverse Covariance Selection (SICS). The method integrates genetic, epigenetic and transcriptional data from multiple cancers, to define links that are present in multiple cancers, a subset of cancers, or a single cancer. It is shown to be statistically robust and effective at detecting direct pathway links in data from TCGA. To facilitate interpretation of the results, we introduce a publicly accessible tool (cancerlandscapes.org), in which the derived networks are explored as interactive web content, linked to several pathway and pharmacological databases. To evaluate the performance of the method, we constructed a model for eight TCGA cancers, using data from 3900 patients. The model rediscovered known mechanisms and contained interesting predictions. Possible applications include prediction of regulatory relationships, comparison of network modules across multiple forms of cancer and identification of drug targets.

Categories: Journal Articles

Identification of human telomerase assembly inhibitors enabled by a novel method to produce hTERT

Fri, 08/28/2015 - 01:15

Telomerase is the enzyme that maintains the length of telomeres. It is minimally constituted of two components: a core reverse transcriptase protein (hTERT) and an RNA (hTR). Despite its significance as an almost universal cancer target, the understanding of the structure of telomerase and the optimization of specific inhibitors have been hampered by the limited amount of enzyme available. Here, we present a breakthrough method to produce unprecedented amounts of recombinant hTERT and to reconstitute human telomerase with purified components. This system provides a decisive tool to identify regulators of the assembly of this ribonucleoprotein complex. It also enables the large-scale screening of small-molecules capable to interfere with telomerase assembly. Indeed, it has allowed us to identify a compound that inhibits telomerase activity when added prior to the assembly of the enzyme, while it has no effect on an already assembled telomerase. Therefore, the novel system presented here may accelerate the understanding of human telomerase assembly and facilitate the discovery of potent and mechanistically unique inhibitors.

Categories: Journal Articles

Longitudinal epigenetic and gene expression profiles analyzed by three-component analysis reveal down-regulation of genes involved in protein translation in human aging

Fri, 08/28/2015 - 01:15

Data on biological mechanisms of aging are mostly obtained from cross-sectional study designs. An inherent disadvantage of this design is that inter-individual differences can mask small but biologically significant age-dependent changes. A serially sampled design (same individual at different time points) would overcome this problem but is often limited by the relatively small numbers of available paired samples and the statistics being used. To overcome these limitations, we have developed a new vector-based approach, termed three-component analysis, which incorporates temporal distance, signal intensity and variance into one single score for gene ranking and is combined with gene set enrichment analysis. We tested our method on a unique age-based sample set of human skin fibroblasts and combined genome-wide transcription, DNA methylation and histone methylation (H3K4me3 and H3K27me3) data. Importantly, our method can now for the first time demonstrate a clear age-dependent decrease in expression of genes coding for proteins involved in translation and ribosome function. Using analogies with data from lower organisms, we propose a model where age-dependent down-regulation of protein translation-related components contributes to extend human lifespan.

Categories: Journal Articles

Haploinsufficiency predictions without study bias

Fri, 08/28/2015 - 01:15

Any given human individual carries multiple genetic variants that disrupt protein-coding genes, through structural variation, as well as nucleotide variants and indels. Predicting the phenotypic consequences of a gene disruption remains a significant challenge. Current approaches employ information from a range of biological networks to predict which human genes are haploinsufficient (meaning two copies are required for normal function) or essential (meaning at least one copy is required for viability). Using recently available study gene sets, we show that these approaches are strongly biased towards providing accurate predictions for well-studied genes. By contrast, we derive a haploinsufficiency score from a combination of unbiased large-scale high-throughput datasets, including gene co-expression and genetic variation in over 6000 human exomes. Our approach provides a haploinsufficiency prediction for over twice as many genes currently unassociated with papers listed in Pubmed as three commonly-used approaches, and outperforms these approaches for predicting haploinsufficiency for less-studied genes. We also show that fine-tuning the predictor on a set of well-studied ‘gold standard’ haploinsufficient genes does not improve the prediction for less-studied genes. This new score can readily be used to prioritize gene disruptions resulting from any genetic variant, including copy number variants, indels and single-nucleotide variants.

Categories: Journal Articles

Elastic network models for RNA: a comparative assessment with molecular dynamics and SHAPE experiments

Fri, 08/28/2015 - 01:15

Elastic network models (ENMs) are valuable and efficient tools for characterizing the collective internal dynamics of proteins based on the knowledge of their native structures. The increasing evidence that the biological functionality of RNAs is often linked to their innate internal motions poses the question of whether ENM approaches can be successfully extended to this class of biomolecules. This issue is tackled here by considering various families of elastic networks of increasing complexity applied to a representative set of RNAs. The fluctuations predicted by the alternative ENMs are stringently validated by comparison against extensive molecular dynamics simulations and SHAPE experiments. We find that simulations and experimental data are systematically best reproduced by either an all-atom or a three-beads-per-nucleotide representation (sugar-base-phosphate), with the latter arguably providing the best balance of accuracy and computational complexity.

Categories: Journal Articles

RefSeq curation and annotation of antizyme and antizyme inhibitor genes in vertebrates

Fri, 08/28/2015 - 01:15

Polyamines are ubiquitous cations that are involved in regulating fundamental cellular processes such as cell growth and proliferation; hence, their intracellular concentration is tightly regulated. Antizyme and antizyme inhibitor have a central role in maintaining cellular polyamine levels. Antizyme is unique in that it is expressed via a novel programmed ribosomal frameshifting mechanism. Conventional computational tools are unable to predict a programmed frameshift, resulting in misannotation of antizyme transcripts and proteins on transcript and genomic sequences. Correct annotation of a programmed frameshifting event requires manual evaluation. Our goal was to provide an accurately curated and annotated Reference Sequence (RefSeq) data set of antizyme transcript and protein records across a broad taxonomic scope that would serve as standards for accurate representation of these gene products. As antizyme and antizyme inhibitor proteins are functionally connected, we also curated antizyme inhibitor genes to more fully represent the elegant biology of polyamine regulation. Manual review of genes for three members of the antizyme family and two members of the antizyme inhibitor family in 91 vertebrate organisms resulted in a total of 461 curated RefSeq records.

Categories: Journal Articles

Genome wide interactions of wild-type and activator bypass forms of {sigma}54

Fri, 08/28/2015 - 01:15

Enhancer-dependent transcription involving the promoter specificity factor 54 is widely distributed amongst bacteria and commonly associated with cell envelope function. For transcription initiation, 54-RNA polymerase yields open promoter complexes through its remodelling by cognate AAA+ ATPase activators. Since activators can be bypassed in vitro, bypass transcription in vivo could be a source of emergent gene expression along evolutionary pathways yielding new control networks and transcription patterns. At a single test promoter in vivo bypass transcription was not observed. We now use genome-wide transcription profiling, genome-wide mutagenesis and gene over-expression strategies in Escherichia coli, to (i) scope the range of bypass transcription in vivo and (ii) identify genes which might alter bypass transcription in vivo. We find little evidence for pervasive bypass transcription in vivo with only a small subset of 54 promoters functioning without activators. Results also suggest no one gene limits bypass transcription in vivo, arguing bypass transcription is strongly kept in check. Promoter sequences subject to repression by 54 were evident, indicating loss of rpoN (encoding 54) rather than creating rpoN bypass alleles would be one evolutionary route for new gene expression patterns. Finally, cold-shock promoters showed unusual 54-dependence in vivo not readily correlated with conventional 54 binding-sites.

Categories: Journal Articles