Feed aggregator
Estimating first-passage time distributions from weighted ensemble simulations and non-Markovian analyses
First-passage times (FPTs) are widely used to characterize stochastic processes such as chemical reactions, protein folding, diffusion processes or triggering a stock option. In previous work (Suarez et al., JCTC 2014;10:2658-2667), we demonstrated a non-Markovian analysis approach that, with a sufficient subset of history information, yields unbiased mean first-passage times from weighted-ensemble (WE) simulations. The estimation of the distribution of the first-passage times is, however, a more ambitious goal since it cannot be obtained by direct observation in WE trajectories. Likewise, a large number of events would be required to make a good estimation of the distribution from a regular “brute force” simulation. Here, we show how the previously developed non-Markovian analysis can generate approximate, but highly accurate, FPT distributions from WE data. The analysis can also be applied to any other unbiased trajectories, such as from standard molecular dynamics simulations. The present study employs a range of systems with independent verification of the distributions to demonstrate the success and limitations of the approach. By comparison to a standard Markov analysis, the non-Markovian approach is less sensitive to the user-defined discretization of configuration space.
Development of electron spin echo envelope modulation spectroscopy to probe the secondary structure of recombinant membrane proteins in a lipid bilayer
Membrane proteins conduct many important biological functions essential to the survival of organisms. However, due to their inherent hydrophobic nature, it is very difficult to obtain structural information on membrane-bound proteins using traditional biophysical techniques. We are developing a new approach to probe the secondary structure of membrane proteins using the pulsed EPR technique of Electron Spin Echo Envelope Modulation (ESEEM) Spectroscopy. This method has been successfully applied to model peptides made synthetically. However, in order for this ESEEM technique to be widely applicable to larger membrane protein systems with no size limitations, protein samples with deuterated residues need to be prepared via protein expression methods. For the first time, this study shows that the ESEEM approach can be used to probe the local secondary structure of a 2H-labeled d8-Val overexpressed membrane protein in a membrane mimetic environment. The membrane-bound human KCNE1 protein was used with a known solution NMR structure to demonstrate the applicability of this methodology. Three different α-helical regions of KCNE1 were probed: the extracellular domain (Val21), transmembrane domain (Val50), and cytoplasmic domain (Val95). These results indicated α-helical structures in all three segments, consistent with the micelle structure of KCNE1. Furthermore, KCNE1 was incorporated into a lipid bilayer and the secondary structure of the transmembrane domain (Val50) was shown to be α-helical in a more native-like environment. This study extends the application of this ESEEM approach to much larger membrane protein systems that are difficult to study with X-ray crystallography and/or NMR spectroscopy.
Word vs. Class-Based Word Sense Disambiguation
Solving #SAT and MAXSAT by Dynamic Programming
Knowledge-Based Textual Inference via Parse-Tree Transformations
A Gene Gravity Model for the Evolution of Cancer Genomes: A Study of 3,000 Cancer Genomes across 9 Cancer Types
by Feixiong Cheng, Chuang Liu, Chen-Ching Lin, Junfei Zhao, Peilin Jia, Wen-Hsiung Li, Zhongming Zhao
Cancer development and progression result from somatic evolution by an accumulation of genomic alterations. The effects of those alterations on the fitness of somatic cells lead to evolutionary adaptations such as increased cell proliferation, angiogenesis, and altered anticancer drug responses. However, there are few general mathematical models to quantitatively examine how perturbations of a single gene shape subsequent evolution of the cancer genome. In this study, we proposed the gene gravity model to study the evolution of cancer genomes by incorporating the genome-wide transcription and somatic mutation profiles of ~3,000 tumors across 9 cancer types from The Cancer Genome Atlas into a broad gene network. We found that somatic mutations of a cancer driver gene may drive cancer genome evolution by inducing mutations in other genes. This functional consequence is often generated by the combined effect of genetic and epigenetic (e.g., chromatin regulation) alterations. By quantifying cancer genome evolution using the gene gravity model, we identified six putative cancer genes (AHNAK, COL11A1, DDX3X, FAT4, STAG2, and SYNE1). The tumor genomes harboring the nonsynonymous somatic mutations in these genes had a higher mutation density at the genome level compared to the wild-type groups. Furthermore, we provided statistical evidence that hypermutation of cancer driver genes on inactive X chromosomes is a general feature in female cancer genomes. In summary, this study sheds light on the functional consequences and evolutionary characteristics of somatic mutations during tumorigenesis by propelling adaptive cancer genome evolution, which would provide new perspectives for cancer research and therapeutics.Sequential unfolding of the hemolysin two-partner secretion domain from Proteus mirabilis
Protein secretion is a major contributor to Gram-negative bacterial virulence. Type Vb or two-partner secretion (TPS) pathways utilize a membrane bound β-barrel B component (TpsB) to translocate large and predominantly virulent exoproteins (TpsA) through a nucleotide independent mechanism. We focused our studies on a truncated TpsA member termed hemolysin A (HpmA265), a structurally and functionally characterized TPS domain from Proteus mirabilis. Contrary to the expectation that the TPS domain of HpmA265 would denature in a single cooperative transition, we found that the unfolding follows a sequential model with three distinct transitions linking four states. The solvent inaccessible core of HpmA265 can be divided into two different regions. The C-proximal region contains nonpolar residues and forms a prototypical hydrophobic core as found in globular proteins. The N-proximal region of the solvent inaccessible core, however, contains polar residues. To understand the contributions of the hydrophobic and polar interiors to overall TPS domain stability, we conducted unfolding studies on HpmA265 and site-specific mutants of HpmA265. By correlating the effect of individual site-specific mutations with the sequential unfolding results we were able to divide the HpmA265 TPS domain into polar core, nonpolar core, and C-terminal subdomains. Moreover, the unfolding studies provide quantitative evidence that the folding free energy for the polar core subdomain is more favorable than for the nonpolar core and C-terminal subdomains. This study implicates the hydrogen bonds shared among these conserved internal residues as a primary means for stabilizing the N-proximal polar core subdomain.
Insights into electron leakage in the reaction cycle of cytochrome P450 BM3 revealed by kinetic modeling and mutagenesis
As a single polypeptide, cytochrome P450 BM3 fuses oxidase and reductase domains and couples each domain's function to perform catalysis with exceptional activity upon binding of substrate for hydroxylation. Mutations introduced into the enzyme to change its substrate specificity often decrease coupling efficiency between the two domains, resulting in unproductive consumption of cofactors and formation of water and/or reactive species. This phenomenon can correlate with leakage, in which P450 BM3 uses electrons from NADPH to reduce oxygen to water and/or reactive species even without bound substrate. The physical basis for leakage is not yet well understood in this particular member of the cytochrome P450 family. To clarify the relationship between leakage and coupling, we used simulations to illustrate how different combinations of kinetic parameters related to substrate-free consumption of NADPH and substrate hydroxylation can lead to either minimal effects on coupling or a dramatic decrease in coupling as a result of leakage. We explored leakage in P450 BM3 by introducing leakage-enhancing mutations and combining these mutations to assess whether doing so increases leakage further. The variants in this study provide evidence that while a transition to high spin may be vital for coupled hydroxylation, it is not required for enhanced leakage; substrate binding and the consequent shift in spin state are not necessary as a redox switch for catalytic oxidation of NADPH. Additionally, the variants in this study suggest a tradeoff between leakage and stability and thus evolvability, as the mutations we investigated were far more deleterious than other mutations that have been used to change substrate specificity.
Parkinson's disease: Crystals of a toxic core
Parkinson's disease: Crystals of a toxic core
Nature 525, 7570 (2015). doi:10.1038/nature15630
Authors: Michel Goedert & Yifan Cheng
An ultra-high-resolution structure of the core segment of assembled α-synuclein — the protein that aggregates in the brains of patients with Parkinson's disease — has been determined. A neurobiologist and a structural biologist discuss the implications of this advance. See Article p.486
Epigenetics: The karma of oil palms
Epigenetics: The karma of oil palms
Nature 525, 7570 (2015). doi:10.1038/nature15216
Authors: Jerzy Paszkowski
Despite their clonal origin, some oil palm trees develop fruits that give almost no oil. It emerges that the number of methyl groups attached to a DNA region called Karma determine which plants are defective. See Letter p.533
Structure of the toxic core of α-synuclein from invisible crystals
Structure of the toxic core of α-synuclein from invisible crystals
Nature 525, 7570 (2015). doi:10.1038/nature15368
Authors: Jose A. Rodriguez, Magdalena I. Ivanova, Michael R. Sawaya, Duilio Cascio, Francis E. Reyes, Dan Shi, Smriti Sangwan, Elizabeth L. Guenther, Lisa M. Johnson, Meng Zhang, Lin Jiang, Mark A. Arbing, Brent L. Nannenga, Johan Hattne, Julian Whitelegge, Aaron S. Brewster, Marc Messerschmidt, Sébastien Boutet, Nicholas K. Sauter, Tamir Gonen & David S. Eisenberg
The protein α-synuclein is the main component of Lewy bodies, the neuron-associated aggregates seen in Parkinson disease and other neurodegenerative pathologies. An 11-residue segment, which we term NACore, appears to be responsible for amyloid formation and cytotoxicity of human α-synuclein. Here we describe crystals of
Loss of Karma transposon methylation underlies the mantled somaclonal variant of oil palm
Loss of Karma transposon methylation underlies the mantled somaclonal variant of oil palm
Nature 525, 7570 (2015). doi:10.1038/nature15365
Authors: Meilina Ong-Abdullah, Jared M. Ordway, Nan Jiang, Siew-Eng Ooi, Sau-Yee Kok, Norashikin Sarpan, Nuraziyan Azimi, Ahmad Tarmizi Hashim, Zamzuri Ishak, Samsul Kamal Rosli, Fadila Ahmad Malike, Nor Azwani Abu Bakar, Marhalil Marjuni, Norziha Abdullah, Zulkifli Yaakub, Mohd Din Amiruddin, Rajanaidu Nookiah, Rajinder Singh, Eng-Ti Leslie Low, Kuang-Lim Chan, Norazah Azizi, Steven W. Smith, Blaire Bacher, Muhammad A. Budiman, Andrew Van Brunt, Corey Wischmeyer, Melissa Beil, Michael Hogan, Nathan Lakey, Chin-Ching Lim, Xaviar Arulandoo, Choo-Kien Wong, Chin-Nee Choo, Wei-Chee Wong, Yen-Yen Kwan, Sharifah Shahrul Rabiah Syed Alwee, Ravigadevi Sambanthamurthi & Robert A. Martienssen
Somaclonal variation arises in plants and animals when differentiated somatic cells are induced into a pluripotent state, but the resulting clones differ from each other and from their parents. In agriculture, somaclonal variation has hindered the micropropagation of elite hybrids and genetically modified crops, but the mechanism responsible remains unknown. The oil palm fruit ‘mantled’ abnormality is a somaclonal variant arising from tissue culture that drastically reduces yield, and has largely halted efforts to clone elite hybrids for oil production. Widely regarded as an epigenetic phenomenon, ‘mantling’ has defied explanation, but here we identify the MANTLED locus using epigenome-wide association studies of the African oil palm Elaeis guineensis. DNA hypomethylation of a LINE retrotransposon related to rice Karma, in the intron of the homeotic gene DEFICIENS, is common to all mantled clones and is associated with alternative splicing and premature termination. Dense methylation near the Karma splice site (termed the Good Karma epiallele) predicts normal fruit set, whereas hypomethylation (the Bad Karma epiallele) predicts homeotic transformation, parthenocarpy and marked loss of yield. Loss of Karma methylation and of small RNA in tissue culture contributes to the origin of mantled, while restoration in spontaneous revertants accounts for non-Mendelian inheritance. The ability to predict and cull mantling at the plantlet stage will facilitate the introduction of higher performing clones and optimize environmentally sensitive land resources.
Probing protease sensitivity of recombinant human erythropoietin reveals α3–α4 inter-helical loop as a stability determinant
Although unglycosylated HuEpo is fully functional, it has very short serum half-life. However, the mechanism of in vivo clearance of human Epo (HuEpo) remains largely unknown. In this study, the relative importance of protease-sensitive sites of recombinant HuEpo (rHuEpo) has been investigated by analysis of structural data coupled with in vivo half-life measurements. Our results identify α3-α4 inter-helical loop region as a target site of lysosomal protease Cathepsin L. Consistent with previously-reported lysosomal degradation of HuEpo, these results for the first time identify cleavage sites of rHuEpo by specific lysosomal proteases. Furthermore, in agreement with the lowered exposure of the peptide backbone around the cleavage site, remarkably substitutions of residues with bulkier amino acids result in significantly improved in vivo stability. Together, these results have implications for the mechanism of in vivo clearance of the protein in humans. Proteins 2015; 83:1813–1822. © 2015 Wiley Periodicals, Inc.
Weak conservation of structural features in the interfaces of homologous transient protein–protein complexes
Residue types at the interface of protein–protein complexes (PPCs) are known to be reasonably well conserved. However, we show, using a dataset of known 3-D structures of homologous transient PPCs, that the 3-D location of interfacial residues and their interaction patterns are only moderately and poorly conserved, respectively. Another surprising observation is that a residue at the interface that is conserved is not necessarily in the interface in the homolog. Such differences in homologous complexes are manifested by substitution of the residues that are spatially proximal to the conserved residue and structural differences at the interfaces as well as differences in spatial orientations of the interacting proteins. Conservation of interface location and the interaction pattern at the core of the interfaces is higher than at the periphery of the interface patch. Extents of variability of various structural features reported here for homologous transient PPCs are higher than the variation in homologous permanent homomers. Our findings suggest that straightforward extrapolation of interfacial nature and inter-residue interaction patterns from template to target could lead to serious errors in the modeled complex structure. Understanding the evolution of interfaces provides insights to improve comparative modeling of PPC structures.
Electrostatic steering enhances the rate of cAMP binding to phosphodiesterase: Brownian dynamics modeling
Signaling in cells often involves co-localization of the signaling molecules. Most experimental evidence has shown that intracellular compartmentalization restricts the range of action of the second messenger, 3'-5'-cyclic adenosine monophosphate (cAMP), which is degraded by phosphodiesterases (PDEs). The objective of this study is to understand the details of molecular encounter that may play a role in efficient operation of the cAMP signaling apparatus. The results from electrostatic potential calculations and Brownian dynamics simulations suggest that positive potential of the active site from PDE enhances capture of diffusing cAMP molecules. This electrostatic steering between cAMP and the active site of a PDE plays a major role in the enzyme-substrate encounter, an effect that may be of significance in sequestering cAMP released from a nearby binding site or in attracting more freely diffusing cAMP molecules.
A Reminiscence about Early Times of Vitreous Water in Electron Cryomicroscopy
Free fatty acid receptors: structural models and elucidation of ligand binding interactions
Mutations can cause light chains to be too stable or too unstable to form amyloid fibrils
Light chain (AL) amyloidosis is an incurable human disease, where the amyloid precursor is a misfolding-prone immunoglobulin light-chain. Here, we identify the role of somatic mutations in the structure, stability and in vitro fibril formation for an amyloidogenic AL-12 protein by restoring four nonconservative mutations to their germline (wild-type) sequence. The single restorative mutations do not affect significantly the native structure, the unfolding pathway, and the reversibility of the protein. However, certain mutations either decrease (H32Y and H70D) or increase (R65S and Q96Y) the protein thermal stability. Interestingly, the most and the least stable mutants, Q96Y and H32Y, do not form amyloid fibrils under physiological conditions. Thus, Q96 and H32 are key residues for AL-12 stability and fibril formation and restoring them to the wild-type residues preclude amyloid formation. The mutants whose equilibrium is shifted to either the native or unfolded states barely sample transient partially folded states, and therefore do not form fibrils. These results agree with previous observations by our laboratory and others that amyloid formation occurs because of the sampling of partially folded states found within the unfolding transition (Blancas-Mejia and Ramirez-Alvarado, Ann Rev Biochem 2013;82:745–774). Here we provide a new insight on the AL amyloidosis mechanism by demonstrating that AL-12 does not follow the established thermodynamic hypothesis of amyloid formation. In this hypothesis, thermodynamically unstable proteins are more prone to amyloid formation. Here we show that within a thermal stability range, the most stable protein in this study is the most amyloidogenic protein.
Bisulfite-free, base-resolution analysis of 5-formylcytosine at the genome scale
Nature Methods 12, 1047 (2015). doi:10.1038/nmeth.3569
Authors: Bo Xia, Dali Han, Xingyu Lu, Zhaozhu Sun, Ankun Zhou, Qiangzong Yin, Hu Zeng, Menghao Liu, Xiang Jiang, Wei Xie, Chuan He & Chengqi Yi
Active DNA demethylation in mammals involves oxidation of 5-methylcytosine (5mC) to 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). However, genome-wide detection of 5fC at single-base resolution remains challenging. Here we present fC-CET, a bisulfite-free method for whole-genome analysis of 5fC based on selective chemical labeling of 5fC and subsequent C-to-T transition during PCR. Base-resolution 5fC maps showed limited overlap with 5hmC, with 5fC-marked regions more active than 5hmC-marked ones.
Cas9 gRNA engineering for genome editing, activation and repression
Nature Methods 12, 1051 (2015). doi:10.1038/nmeth.3580
Authors: Samira Kiani, Alejandro Chavez, Marcelle Tuttle, Richard N Hall, Raj Chari, Dmitry Ter-Ovanesyan, Jason Qian, Benjamin W Pruitt, Jacob Beal, Suhani Vora, Joanna Buchthal, Emma J K Kowal, Mohammad R Ebrahimkhani, James J Collins, Ron Weiss & George Church
We demonstrate that by altering the length of Cas9-associated guide RNA (gRNA) we were able to control Cas9 nuclease activity and simultaneously perform genome editing and transcriptional regulation with a single Cas9 protein. We exploited these principles to engineer mammalian synthetic circuits with combined transcriptional regulation and kill functions governed by a single multifunctional Cas9 protein.