Co-reporter:Chen Zhao, Anna Marie Pyle
Trends in Biochemical Sciences 2017 Volume 42, Issue 6(Issue 6) pp:
Publication Date(Web):1 June 2017
DOI:10.1016/j.tibs.2017.03.007
While the major architectural features and active-site components of group II introns have been known for almost a decade, information on the individual stages of splicing has been lacking. Recent advances in crystallography and cryo-electron microscopy (cryo-EM) have provided major new insights into the structure of intact lariat introns. Conformational changes that mediate the steps of splicing and retrotransposition are being elucidated, revealing the dynamic, highly coordinated motions that are required for group II intron activity. Finally, these ribozymes can now be viewed in their larger, more natural context as components of holoenzymes that include encoded maturase proteins. These studies expand our understanding of group II intron structural diversity and evolution, while setting the stage for rigorous mechanistic analysis of RNA splicing machines.
Co-reporter:Rebecca L Adams, Nathan Pirakitikulr, Anna Marie Pyle
Current Opinion in Virology 2017 Volume 24(Volume 24) pp:
Publication Date(Web):1 June 2017
DOI:10.1016/j.coviro.2017.04.007
•Important RNA structures are found throughout the HCV genome, including the ORF.•The HCV IRES adopts a dynamic structure that recruits the cellular translation machinery.•RNA structural elements can be targeted for antiviral therapy.The single-stranded Hepatitis C Virus (HCV) genome adopts a set of elaborate RNA structures that are involved in every stage of the viral lifecycle. Recent advances in chemical probing, sequencing, and structural biology have facilitated analysis of RNA folding on a genome-wide scale, revealing novel structures and networks of interactions. These studies have underscored the active role played by RNA in every function of HCV and they open the door to new types of RNA-targeted therapeutics.
Co-reporter:Tamar Schlick, Anna Marie Pyle
Biophysical Journal 2017 Volume 113, Issue 2(Volume 113, Issue 2) pp:
Publication Date(Web):25 July 2017
DOI:10.1016/j.bpj.2016.12.037
We describe opportunities and challenges in RNA structural modeling and design, as recently discussed during the second Telluride Science Research Center workshop organized in June 2016. Topics include fundamental processes of RNA, such as structural assemblies (hierarchical folding, multiple conformational states and their clustering), RNA motifs, and chemical reactivity of RNA, as used for structural prediction and functional inference. We also highlight the software and database issues associated with RNA structures, such as the multiple approaches for motif annotation, the need for frequent database updating, and the importance of quality control of RNA structures. We discuss various modeling approaches for structure prediction, mechanistic analysis of RNA reactions, and RNA design, and the complementary roles that both atomistic and coarse-grained approaches play in such simulations. Collectively, as scientists from varied disciplines become familiar and drawn into these unique challenges, new approaches and collaborative efforts will undoubtedly be catalyzed.
Co-reporter:Chen Zhao, Anna Marie Pyle
Current Opinion in Structural Biology 2017 Volume 47(Volume 47) pp:
Publication Date(Web):1 December 2017
DOI:10.1016/j.sbi.2017.05.002
•The intron encoded protein (IEP) is a reverse transcriptase that functions in splicing.•The IEP has a canonical polymerase scaffold that is elaborated by insertions.•As a splicing factor, the finger and thumb domains recruit the IEP to its intron target.•As a splicing factor, the thumb domain stabilizes exon-intron interaction.•The intron-specific IEP transformed into a general splicing factor during evolution.The splicing of group II introns in vivo requires the assistance of a multifunctional intron encoded protein (IEP, or maturase). Each IEP is also a reverse-transcriptase enzyme that enables group II introns to behave as mobile genetic elements. During splicing or retro-transposition, each group II intron forms a tight, specific complex with its own encoded IEP, resulting in a highly reactive holoenzyme. This review focuses on the structural basis for IEP function, as revealed by recent crystal structures of an IEP reverse transcriptase domain and cryo-EM structures of an IEP-intron complex. These structures explain how the same IEP scaffold is utilized for intron recognition, splicing and reverse transcription, while providing a physical basis for understanding the evolutionary transformation of the IEP into the eukaryotic splicing factor Prp8.
Co-reporter:Samuel E. Butcher and Anna Marie Pyle
Accounts of Chemical Research 2011 Volume 44(Issue 12) pp:1302
Publication Date(Web):September 7, 2011
DOI:10.1021/ar200098t
RNA molecules adopt specific three-dimensional structures critical to their function. Many essential metabolic processes, including protein synthesis and RNA splicing, are carried out by RNA molecules with elaborate tertiary structures (e.g. 3QIQ, right). Indeed, the ribosome and self-splicing introns are complex RNA machines. But even the coding regions in messenger RNAs and viral RNAs are flanked by highly structured untranslated regions, which provide regulatory information necessary for gene expression.RNA tertiary structure is defined as the three-dimensional arrangement of RNA building blocks, which include helical duplexes, triple-stranded structures, and other components that are held together through connections collectively termed RNA tertiary interactions. The structural diversity of these interactions is now a subject of intense investigation, involving the techniques of NMR, X-ray crystallography, chemical genetics, and phylogenetic analysis. At the same time, many investigators are using biophysical techniques to elucidate the driving forces for tertiary structure formation and the mechanisms for its stabilization. RNA tertiary folding is promoted by maximization of base stacking, much like the hydrophobic effect that drives protein folding. RNA folding also requires electrostatic stabilization, both through charge screening and site binding of metals, and it is enhanced by desolvation of the phosphate backbone. In this Account, we provide an overview of the features that specify and stabilize RNA tertiary structure.A major determinant for overall tertiary RNA architecture is local conformation in secondary-structure junctions, which are regions from which two or more duplexes project. At junctions and other structures, such as pseudoknots and kissing loops, adjacent helices stack on one another, and these coaxial stacks play a major role in dictating the overall architectural form of an RNA molecule. In addition to RNA junction topology, a second determinant for RNA tertiary structure is the formation of sequence-specific interactions. Networks of triple helices, tetraloop–receptor interactions, and other sequence-specific contacts establish the framework for the overall tertiary fold. The third determinant of tertiary structure is the formation of stabilizing stacking and backbone interactions, and many are not sequence specific. For example, ribose zippers allow 2'-hydroxyl groups on different RNA strands to form networks of interdigitated hydrogen bonds, serving to seal strands together and thereby stabilize adjacent substructures. These motifs often require monovalent and divalent cations, which can interact diffusely or through chelation to specific RNA functional groups.As we learn more about the components of RNA tertiary structure, we will be able to predict the structures of RNA molecules from their sequences, thereby obtaining key information about biological function. Understanding and predicting RNA structure is particularly important given the recent discovery that although most of our genome is transcribed into RNA molecules, few of them have a known function. The prevalence of RNA viruses and pathogens with RNA genomes makes RNA drug discovery an active area of research. Finally, knowledge of RNA structure will facilitate the engineering of supramolecular RNA structures, which can be used as nanomechanical components for new materials. But all of this promise depends on a better understanding of the RNA parts list, and how the pieces fit together.
Co-reporter:Anna Marie Pyle
Current Opinion in Chemical Biology 2011 Volume 15(Issue 5) pp:636-642
Publication Date(Web):October 2011
DOI:10.1016/j.cbpa.2011.07.019
It is becoming increasingly clear that RNA molecules play a major role in all aspects of metabolism. The conformational state and stability of RNA are controlled by RNA remodeling proteins, which are ubiquitous motor proteins in the cell. Here, we review advances in our understanding of the structure and function of three major structural families of RNA remodeling proteins, the hexameric ring proteins, the processive monomeric RNA translocase/helicases, and the functionally diverse DEAD-box remodeling proteins. New studies have revealed molecular mechanisms for coupling between ATP hydrolysis and unwinding, the physical basis for regulatory control by cofactors, and novel functions for RNA remodeling proteins.Highlights► RNA molecules play a major role in all aspects of metabolism and their conformational state is controlled by RNA remodeling motor proteins. ► RNA remodeling proteins can be described in three major structural categories: the hexameric ring proteins, the processive monomeric RNA translocase/helicases, and the functionally diverse DEAD-box remodeling proteins. ► Biophysical studies have revealed new molecular mechanisms and functions for RNA remodeling proteins.
Co-reporter:Kevin S. Keating
PNAS 2010 Volume 107 (Issue 18 ) pp:8177-8182
Publication Date(Web):2010-05-04
DOI:10.1073/pnas.0911888107
Structured RNA molecules play essential roles in a variety of cellular processes; however, crystallographic studies of such
RNA molecules present a large number of challenges. One notable complication arises from the low resolutions typical of RNA
crystallography, which results in electron density maps that are imprecise and difficult to interpret. This problem is exacerbated
by the lack of computational tools for RNA modeling, as many of the techniques commonly used in protein crystallography have
no equivalents for RNA structure. This leads to difficulty and errors in the model building process, particularly in modeling
of the RNA backbone, which is highly error prone due to the large number of variable torsion angles per nucleotide. To address
this, we have developed a method for accurately building the RNA backbone into maps of intermediate or low resolution. This
method is semiautomated, as it requires a crystallographer to first locate phosphates and bases in the electron density map.
After this initial trace of the molecule, however, an accurate backbone structure can be built without further user intervention.
To accomplish this, backbone conformers are first predicted using RNA pseudotorsions and the base-phosphate perpendicular
distance. Detailed backbone coordinates are then calculated to conform both to the predicted conformer and to the previously
located phosphates and bases. This technique is shown to produce accurate backbone structure even when starting from imprecise
phosphate and base coordinates. A program implementing this methodology is currently available, and a plugin for the Coot model building program is under development.
Co-reporter:Kevin S. Keating;Sean D. Taylor;Navtej Toor
Science 2008 Volume 320(Issue 5872) pp:77-82
Publication Date(Web):04 Apr 2008
DOI:10.1126/science.1153803
Abstract
Group II introns are self-splicing ribozymes that catalyze their own excision from precursor transcripts and insertion into new genetic locations. Here we report the crystal structure of an intact, self-spliced group II intron from Oceanobacillus iheyensis at 3.1 angstrom resolution. An extensive network of tertiary interactions facilitates the ordered packing of intron subdomains around a ribozyme core that includes catalytic domain V. The bulge of domain V adopts an unusual helical structure that is located adjacent to a major groove triple helix (catalytic triplex). The bulge and catalytic triplex jointly coordinate two divalent metal ions in a configuration that is consistent with a two–metal ion mechanism for catalysis. Structural and functional analogies support the hypothesis that group II introns and the spliceosome share a common ancestor.
Co-reporter:Victor Serebrov
and
Anna Marie Pyle
Nature 2004 430(6998) pp:476
Publication Date(Web):
DOI:10.1038/nature02704
Co-reporter:
Nature Structural and Molecular Biology 2004 11(6) pp:526-530
Publication Date(Web):16 May 2004
DOI:10.1038/nsmb771
Members of the DExH/D family of proteins, a subset of helicase superfamily 2 (SF2), are involved in virtually all aspects of RNA metabolism. NPH-II, a prototypical member of this protein family, exhibits robust RNA helicase activity in vitro. Using a series of modified substrates to explore the unwinding mechanism of NPH-II, we observed that the helicase tracks exclusively on the loading strand, where it requires covalent continuity and specifically recognizes the ribose-phosphate backbone. NPH-II unwinding was unaffected by lesions and nicks on the top strand, which has a minimal role in substrate recognition. NPH-II required physical continuity of phosphodiester linkages on the loading strand, although abasic regions were tolerated. These findings suggest that SF2 helicases are mechanistically distinct from other helicase families that can tolerate breaks in the loading strand and for which bases are the primary recognition determinant.
Co-reporter:
Nature Structural and Molecular Biology 2004 11(2) pp:187-192
Publication Date(Web):25 January 2004
DOI:10.1038/nsmb717
Domain 5 (D5) is the central core of group II intron ribozymes. Many base and backbone substituents of this highly conserved hairpin participate in catalysis and are crucial for binding to other intron domains. We report the solution structures of the 34-nucleotide D5 hairpin from the group II intron ai5 in the absence and presence of divalent metal ions. The bulge region of D5 adopts a novel fold, where G26 adopts a syn conformation and flips down into the major groove of helix 1, close to the major groove face of the catalytic AGC triad. The backbone near G26 is kinked, exposing the base plane of the adjacent A-U pair to the solvent and causing bases of the bulge to stack intercalatively. Metal ion titrations reveal strong Mg2+ binding to a minor groove shelf in the D5 bulge. Another distinct metal ion−binding site is observed along the minor groove side of the catalytic triad, in a manner consistent with metal ion binding in the ribozyme active site.
Co-reporter:David MJ Lilley, Anna Marie Pyle
Current Opinion in Structural Biology (February 2016) Volume 36() pp:vii-viii
Publication Date(Web):1 February 2016
DOI:10.1016/j.sbi.2016.02.002
Co-reporter:Marina Ostankovitch, Anna Marie Pyle
Journal of Molecular Biology (9 October 2013) Volume 425(Issue 19) pp:3577-3581
Publication Date(Web):9 October 2013
DOI:10.1016/j.jmb.2013.07.032
Co-reporter:Dahai Luo, Andrew Kohlway, Adriana Vela, Anna Marie Pyle
Structure (7 November 2012) Volume 20(Issue 11) pp:1983-1988
Publication Date(Web):7 November 2012
DOI:10.1016/j.str.2012.08.029
Retinoic acid inducible gene-I (RIG-I) is a key intracellular immune receptor for pathogenic RNAs, particularly from RNA viruses. Here, we report the crystal structure of human RIG-I bound to a 5′ triphosphorylated RNA hairpin and ADP nucleotide at 2.8 Å resolution. The RNA ligand contains all structural features that are essential for optimal recognition by RIG-I, as it mimics the panhandle-like signatures within the genome of negative-stranded RNA viruses. RIG-I adopts an intermediate, semiclosed conformation in this product state of ATP hydrolysis. The structure of this complex allows us to visualize the first steps in RIG-I recognition and activation upon viral infection.Graphical AbstractDownload high-res image (685KB)Download full-size imageHighlights► Crystallographic structure of RIG-I with 5′ triphosphorylated RNA duplex and ADP ► Solution biophysical analysis of ATP-dependent RIG-I conformational change ► The structural basis for ATP-stimulated RIG-I activation
Co-reporter:Nathan Pirakitikulr, Andrew Kohlway, Brett D. Lindenbach, Anna M. Pyle
Molecular Cell (7 April 2016) Volume 62(Issue 1) pp:111-120
Publication Date(Web):7 April 2016
DOI:10.1016/j.molcel.2016.01.024
•The genome of HCV is folded into specific RNA structures•HCV genomic structures are conserved across multiple genotypes•Elaborate RNA structures are present within protein-coding sequences•Genetic manipulation of these structures affects replication and infectivityRNA is a versatile macromolecule that accommodates functional information in primary sequence and secondary and tertiary structure. We use a combination of chemical probing, RNA structure modeling, comparative sequence analysis, and functional assays to examine the role of RNA structure in the hepatitis C virus (HCV) genome. We describe a set of conserved but functionally diverse structural RNA motifs that occur in multiple coding regions of the HCV genome, and we demonstrate that conformational changes in these motifs influence specific stages in the virus’ life cycle. Our study shows that these types of structures can pervade a genome, where they play specific mechanistic and regulatory roles, constituting a “code within the code” for controlling biological processes.Download high-res image (256KB)Download full-size image
Co-reporter:Leven M. Wadley, Kevin S. Keating, Carlos M. Duarte, Anna Marie Pyle
Journal of Molecular Biology (28 September 2007) Volume 372(Issue 4) pp:942-957
Publication Date(Web):28 September 2007
DOI:10.1016/j.jmb.2007.06.058
Quantitatively describing RNA structure and conformational elements remains a formidable problem. Seven standard torsion angles and the sugar pucker are necessary to characterize the conformation of an RNA nucleotide completely. Progress has been made toward understanding the discrete nature of RNA structure, but classifying simple and ubiquitous structural elements such as helices and motifs remains a difficult task. One approach for describing RNA structure in a simple, mathematically consistent, and computationally accessible manner involves the invocation of two pseudotorsions, η (C4′n-1, Pn, C4′n, Pn+1) and θ (Pn, C4′n, Pn+1, C4′n+1), which can be used to describe RNA conformation in much the same way that ϕ and ψ are used to describe backbone configuration of proteins. Here, we conduct an exploration and statistical evaluation of pseudotorsional space and of the Ramachandran-like η–θ plot. We show that, through the rigorous quantitative analysis of the η–θ plot, the pseudotorsional descriptors η and θ, together with sugar pucker, are sufficient to describe RNA backbone conformation fully in most cases. These descriptors are also shown to contain considerable information about nucleotide base conformation, revealing a previously uncharacterized interplay between backbone and base orientation. A window function analysis is used to discern statistically relevant regions of density in the η–θ scatter plot and then nucleotides in colocalized clusters in the η–θ plane are shown to have similar 3-D structures through RMSD analysis of the RNA structural constituents. We find that major clusters in the η–θ plot are few, underscoring the discrete nature of RNA backbone conformation. Like the Ramachandran plot, the η–θ plot is a valuable system for conceptualizing biomolecular conformation, it is a useful tool for analyzing RNA tertiary structures, and it is a vital component of new approaches for solving the 3-D structures of large RNA molecules and RNA assemblies.
Co-reporter:Anna Marie Pyle, Tamar Schlick
Journal of Molecular Biology (27 February 2016) Volume 428(Issue 5) pp:733-735
Publication Date(Web):27 February 2016
DOI:10.1016/j.jmb.2016.02.012
Co-reporter:Anna Marie Pyle
Molecular Cell (21 November 2013) Volume 52(Issue 4) pp:469-470
Publication Date(Web):21 November 2013
DOI:10.1016/j.molcel.2013.11.007
Using hydroxyl-radical footprinting to map the structures of RNA molecules in whole cells, Soper et al. (2013) determine the specific role of assembly factors during the final stages of ribosomal subunit assembly and visualize structural features of intermediate states.
Co-reporter:Anna Marie Pyle
Molecular Cell (2 October 2014) Volume 56(Issue 1) pp:13-17
Publication Date(Web):2 October 2014
DOI:10.1016/j.molcel.2014.08.025
A diverse population of large RNA molecules controls every aspect of cellular function, and yet we know very little about their molecular structures. However, robust technologies developed for visualizing ribozymes and riboswitches, together with new approaches for mapping RNA inside cells, provide the foundation for visualizing the structures of long noncoding RNAs, mRNAs, and viral RNAs, thereby facilitating new mechanistic insights.
Co-reporter:Olga Fedorova, Christina Waldsich, Anna Marie Pyle
Journal of Molecular Biology (2 March 2007) Volume 366(Issue 4) pp:1099-1114
Publication Date(Web):2 March 2007
DOI:10.1016/j.jmb.2006.12.003
The folding of group II intron ribozymes has been studied extensively under optimal conditions for self-splicing in vitro (42 °C and high magnesium ion concentrations). In these cases, the ribozymes fold directly to the native state by an apparent two-state mechanism involving the formation of an obligate intermediate within intron domain 1. We have now characterized the folding pathway under near-physiological conditions. We observe that compaction of the RNA proceeds slowly to completion, even at low magnesium concentration (3 mM). Kinetic analysis shows that this compact species is a “near-native” intermediate state that is readily chased into the native state by the addition of high salt. Structural probing reveals that the near-native state represents a compact domain 1 scaffold that is not yet docked with the catalytic domains (D3 and D5). Interestingly, native ribozyme reverts to the near-native state upon reduction in magnesium concentration. Therefore, while the intron can sustain the intermediate state under physiological conditions, the native structure is not maintained and is likely to require stabilization by protein cofactors in vivo.
Co-reporter:Srinivas Somarowthu, Michal Legiewicz, Isabel Chillón, Marco Marcia, ... Anna Marie Pyle
Molecular Cell (16 April 2015) Volume 58(Issue 2) pp:353-361
Publication Date(Web):16 April 2015
DOI:10.1016/j.molcel.2015.03.006
•Natively purified HOTAIR adopts a single, well-defined conformation•HOTAIR has an elaborate and well-defined secondary structure•HOTAIR forms independent structural domains•HOTAIR shows a significant degree of phylogenetic covariationLong noncoding RNAs (lncRNAs) have recently emerged as key players in fundamental cellular processes and diseases, but their functions are poorly understood. HOTAIR is a 2,148-nt-long lncRNA molecule involved in physiological epidermal development and in pathogenic cancer progression, where it has been demonstrated to repress tumor and metastasis suppressor genes. To gain insights into the molecular mechanisms of HOTAIR, we purified it in a stable and homogenous form in vitro, and we determined its functional secondary structure through chemical probing and phylogenetic analysis. The HOTAIR structure reveals a degree of structural organization comparable to well-folded RNAs, like the group II intron, rRNA, or lncRNA steroid receptor activator. It is composed of four independently folding modules, two of which correspond to predicted protein-binding domains. Secondary structure elements that surround protein-binding motifs are evolutionarily conserved. Our work serves as a guide for “navigating” through the lncRNA HOTAIR and ultimately for understanding its function.Download high-res image (153KB)Download full-size image
Co-reporter:Kevin S. Keating, Navtej Toor, Anna Marie Pyle
Journal of Molecular Biology (14 November 2008) Volume 383(Issue 3) pp:475-481
Publication Date(Web):14 November 2008
DOI:10.1016/j.jmb.2008.08.043
Tetraloops are a common building block for RNA tertiary structure, and most tetraloops fall into one of three well-characterized classes: GNRA, UNCG, and CUYG. Here, we present the sequence and structure of a fourth highly conserved class of tetraloop that occurs only within the ζ–ζ′ interaction of group IIC introns. This GANC tetraloop was identified, along with an unusual cognate receptor, in the crystal structure of the group IIC intron and through phylogenetic analysis of intron RNA sequence alignments. Unlike conventional tetraloop–receptor interactions, which are stabilized by extensive hydrogen-bonding interactions, the GANC–receptor interaction is limited to a single base stack between the conserved adenosine of the tetraloop and a single purine of the receptor, which consists of a one- to three-nucleotide bulge and does not contain an A-platform. Unlike GNRA tetraloops, the GANC tetraloop forms a sharp angle relative to the adjacent helix, bending by approximately 45° toward the major groove side of the helix. These structural attributes allow GANC tetraloops to fit precisely within the group IIC intron core, thereby demonstrating that structural motifs can adapt to function in a specific niche.