File Download
Supplementary

postgraduate thesis: Genomic and transcriptomic analysis of plant PPR genes and organellar RNA editing

TitleGenomic and transcriptomic analysis of plant PPR genes and organellar RNA editing
Authors
Issue Date2016
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Cheng, S. [程时锋]. (2016). Genomic and transcriptomic analysis of plant PPR genes and organellar RNA editing. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
AbstractThe precise and perplexing coordination of plant organellar gene expression through nucleus-encoded trans-acting factors is puzzling from the genomic and evolutionary perspectives. Pentatricopeptide repeat (PPR) proteins are such players that are targeted exclusively to plant chloroplasts and mitochondria to participate nearly all of the post-transcriptional processes during RNA maturation. As a typical helix repeat modular protein, PPR constitutes one of the largest gene family in land plants, with more than 400 members estimated in most seed plants or larger than 2,000 copies in some genomes in bryophytes and ferns. The accurate recognition and regulation of target organellar RNA molecules by PPR proteins follow a modular manner with high specificity (one-motif-one-base), which has been described as a ―PPR code‖ to predict PPR-RNA interactions. However, this code is still poor in prediction capability due to the low-quality of the architectural annotation of the RNA-binding motifs and structural array arrangement. Thus, the accurate definition of PPR motif and the consistent annotation of PPR protein structure across species is a prerequisite for many downstream analyses, like the illumination of the molecular functions for diverse PPRs and the comparative analysis of PPR gene family evolution. On the other hand, the PLS-type PPRs are particularly interesting as key players in RNA editing activity in plant chloroplast and mitochondria, though their co-evolutionary patterns are still elusive. In this study, I performed a systematic study on PPR gene family in a wide range of plant genomes and transcriptomes, and further explored the co-evolution between PPR and RNA editing events in several model species. Firstly, structural modeling and contact prediction approach was used to redefine and confirm ten different variants of PPR motifs in plant proteins. Based on this, a robust computational pipeline for PPR gene annotation was developed through the combination of protein-based annotation results and genome-based predictions working on directly from the six-frame translation of genome assembly. I further applied my computational pipeline to analyze 109 published genomes, resulting in an unprecedented large PPR datasets, with high accuracy and consistency. A user-friendly PPR database searching and on-line annotation portal was built in this study: http://www.plantppr.com. This consistently annotated dataset is a valuable resource for comparative genomics studies and for large-scale prediction of PPR function. Secondly, I expanded the analysis by investigation of the 1KP data (1000 plant transcriptomes) that covers a broader species across evolution, revealing tens of thousands of new PPR proteins that were never identified before. Interestingly, a spectacular expansion of PPR gene family in land plants was con-firmed, especially during the colonization of land and evolutionary transition from a gametophyte- dominant to a sporophyte-dominant lifestyle. Thirdly, I explored the evolutionary characteristics of PPR gene family and the correlated RNA editing events in the newly sequenced ferns (Azolla filiculoides and Salvinia cucullata), by whole genome and transcriptome sequencing, RNA-seq alignment and variation calling, as well as comparative analysis of these two ferns with other closely-related plants. In summary, by comparative and computational genomic and transcriptomic studies on a broad wide of species including the newly sequenced model ferns, I redefined and reannotated PPR motifs and proteins followed by an extensive comparative analysis for PPR gene family and RNA editing events in plant organellar transcripts. These unprecedented studies revealed a remarkable journey of the evolution of PPR gene family and its co-evolution with the entire RNA editing machinery.
DegreeDoctor of Philosophy
SubjectPlant molecular genetics - Research
Dept/ProgramComputer Science
Persistent Identifierhttp://hdl.handle.net/10722/238887
HKU Library Item IDb5824307

 

DC FieldValueLanguage
dc.contributor.authorCheng, Shifeng-
dc.contributor.author程时锋-
dc.date.accessioned2017-02-20T02:06:47Z-
dc.date.available2017-02-20T02:06:47Z-
dc.date.issued2016-
dc.identifier.citationCheng, S. [程时锋]. (2016). Genomic and transcriptomic analysis of plant PPR genes and organellar RNA editing. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.-
dc.identifier.urihttp://hdl.handle.net/10722/238887-
dc.description.abstractThe precise and perplexing coordination of plant organellar gene expression through nucleus-encoded trans-acting factors is puzzling from the genomic and evolutionary perspectives. Pentatricopeptide repeat (PPR) proteins are such players that are targeted exclusively to plant chloroplasts and mitochondria to participate nearly all of the post-transcriptional processes during RNA maturation. As a typical helix repeat modular protein, PPR constitutes one of the largest gene family in land plants, with more than 400 members estimated in most seed plants or larger than 2,000 copies in some genomes in bryophytes and ferns. The accurate recognition and regulation of target organellar RNA molecules by PPR proteins follow a modular manner with high specificity (one-motif-one-base), which has been described as a ―PPR code‖ to predict PPR-RNA interactions. However, this code is still poor in prediction capability due to the low-quality of the architectural annotation of the RNA-binding motifs and structural array arrangement. Thus, the accurate definition of PPR motif and the consistent annotation of PPR protein structure across species is a prerequisite for many downstream analyses, like the illumination of the molecular functions for diverse PPRs and the comparative analysis of PPR gene family evolution. On the other hand, the PLS-type PPRs are particularly interesting as key players in RNA editing activity in plant chloroplast and mitochondria, though their co-evolutionary patterns are still elusive. In this study, I performed a systematic study on PPR gene family in a wide range of plant genomes and transcriptomes, and further explored the co-evolution between PPR and RNA editing events in several model species. Firstly, structural modeling and contact prediction approach was used to redefine and confirm ten different variants of PPR motifs in plant proteins. Based on this, a robust computational pipeline for PPR gene annotation was developed through the combination of protein-based annotation results and genome-based predictions working on directly from the six-frame translation of genome assembly. I further applied my computational pipeline to analyze 109 published genomes, resulting in an unprecedented large PPR datasets, with high accuracy and consistency. A user-friendly PPR database searching and on-line annotation portal was built in this study: http://www.plantppr.com. This consistently annotated dataset is a valuable resource for comparative genomics studies and for large-scale prediction of PPR function. Secondly, I expanded the analysis by investigation of the 1KP data (1000 plant transcriptomes) that covers a broader species across evolution, revealing tens of thousands of new PPR proteins that were never identified before. Interestingly, a spectacular expansion of PPR gene family in land plants was con-firmed, especially during the colonization of land and evolutionary transition from a gametophyte- dominant to a sporophyte-dominant lifestyle. Thirdly, I explored the evolutionary characteristics of PPR gene family and the correlated RNA editing events in the newly sequenced ferns (Azolla filiculoides and Salvinia cucullata), by whole genome and transcriptome sequencing, RNA-seq alignment and variation calling, as well as comparative analysis of these two ferns with other closely-related plants. In summary, by comparative and computational genomic and transcriptomic studies on a broad wide of species including the newly sequenced model ferns, I redefined and reannotated PPR motifs and proteins followed by an extensive comparative analysis for PPR gene family and RNA editing events in plant organellar transcripts. These unprecedented studies revealed a remarkable journey of the evolution of PPR gene family and its co-evolution with the entire RNA editing machinery.-
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.subject.lcshPlant molecular genetics - Research-
dc.titleGenomic and transcriptomic analysis of plant PPR genes and organellar RNA editing-
dc.typePG_Thesis-
dc.identifier.hkulb5824307-
dc.description.thesisnameDoctor of Philosophy-
dc.description.thesislevelDoctoral-
dc.description.thesisdisciplineComputer Science-
dc.description.naturepublished_or_final_version-
dc.identifier.mmsid991044001142203414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats