Characterizing Gene Networks and RNA-Mediated Gene Regulation in Maize
Huang, Ji (author)
McGinnis, Karen M. (professor directing dissertation)
Lemmon, Alan R (university representative)
Jones, Kathryn M. (committee member)
Chadwick, Brian P. (committee member)
Dennis, Jonathan Hancock (committee member)
Florida State University (degree granting institution)
College of Arts and Sciences (degree granting college)
Department of Biological Science (degree granting department)
Controlling spatial-temporal gene expression patterns is a fundamental task for maize growth and development. With the emergence of massively parallel sequencing, genome-wide expression data production has reached an unprecedented level. This abundance of data has greatly facilitated maize research, but may not be amenable to traditional analysis techniques that were optimized for other data types. In one project, using publicly available data, a Gene Co-expression Network (GCN) was constructed and used for gene function prediction, candidate gene selection and improving understanding of regulatory pathways. To build an optimal GCN from plant materials RNA-Seq data, parameters for expression data normalization and network inference were evaluated. A comprehensive evaluation of these two parameters and ranked aggregation strategy on network performance using libraries from 1266 maize samples was conducted. Three normalization methods (VST, CPM, RPKM) and ten inference methods, including six correlation and four mutual information (MI) methods, were tested. The three normalization methods had very similar performance. For network inference, correlation methods performed better than MI methods at some genes. Increasing sample size also had a positive effect on GCN. Aggregating single networks together resulted in improved performance compared to single networks. In another project, a maize mutant, transgene reactivated 9-1 (tgr9-1) in the transcriptional gene silencing (TGS) pathway, was cloned. The B-A translocation lines were used to map tgr9-1 on chromosome 3 and this result was consistent with molecular markers. To further locate tgr9-1, next-generation sequencing (NGS) combined with bulk segregant analysis was applied to the tgr9-1 mapping population. Using coexpression analysis, our result indicates a maize dicer-like3a (Zmdcl3a) gene is a high-confidence candidate gene for tgr9. Zmdcl3a is involved in the RNA-directed DNA methylation (RdDM) pathway. This pathway is driven by two plant-specific DNA-dependent RNA polymerases, Polymerase IV (Pol IV) and Polymerase V (Pol V). Several kinds of non-coding RNAs are involved, including long single-stranded RNAs, double-stranded RNAs, and small interfering RNAs. The identification of tgr9-1 uncovered the role of non-coding RNAs in TGS and revealed the diversity of TGS pathways in maize. One primary focus of gene regulation study is by studying transcription factors (TFs). Transcription factors (TFs) are proteins that can bind to DNA sequences and regulate gene expression. Many TFs are master regulators in cells that contribute to tissue-specific and cell-type-specific gene expression patterns in eukaryotes. Little is known about tissue-specific gene regulation through TFs in maize. In this project, a network approach was applied to elucidate gene regulatory networks (GRNs) in four tissues (leaf, root, shoot apical meristem and seed) in maize. We used GENIE3 machine-learning algorithm combined with the large quantity of RNA-Seq expression data to construct four tissue-specific GRNs. Although many TFs were expressed across multiple tissues, a multi-tiered analysis predicted tissue-specific regulatory functions for many transcription factors. Some well-studied TFs emerged within the four tissue-specific GRNs, and the GRN predictions matched expectations based upon published results for many of these examples. The GRNs were also validated by ChIP-Seq datasets (KN1, FEA4, and O2). Key TFs were identified for each tissue and matched expectations for key regulators in each tissue, including GO enrichment and identity with known regulatory factors for that tissue.
1 online resource (170 pages)
2018_Sp_Huang_fsu_0071E_14421_comp
monographic
Florida State University
Tallahassee, Florida
A Dissertation submitted to the Department of Biological Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy.
Spring Semester 2018.
April 2, 2018.
GENE EXPRESSION, MAIZE, NETWORK, RDDM, SMALL RNA, TRANSCRIPTION FACTOR
Includes bibliographical references.
Karen M. McGinnis, Professor Directing Dissertation; Alan R. Lemmon, University Representative; Kathryn M. Jones, Committee Member; Brian P. Chadwick, Committee Member; Jonathan H. Dennis, Committee Member.
GENE EXPRESSION, MAIZE, NETWORK, RDDM, SMALL RNA, TRANSCRIPTION FACTOR
April 2, 2018.
A Dissertation submitted to the Department of Biological Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy.
Includes bibliographical references.
Karen M. McGinnis, Professor Directing Dissertation; Alan R. Lemmon, University Representative; Kathryn M. Jones, Committee Member; Brian P. Chadwick, Committee Member; Jonathan H. Dennis, Committee Member.
Characterizing Gene Networks and RNA-Mediated Gene Regulation in Maize
Huang, Ji (author)
McGinnis, Karen M. (professor directing dissertation)
Lemmon, Alan R (university representative)
Jones, Kathryn M. (committee member)
Chadwick, Brian P. (committee member)
Dennis, Jonathan Hancock (committee member)
Florida State University (degree granting institution)
College of Arts and Sciences (degree granting college)
Department of Biological Science (degree granting department)
2018
text
doctoral thesis
Controlling spatial-temporal gene expression patterns is a fundamental task for maize growth and development. With the emergence of massively parallel sequencing, genome-wide expression data production has reached an unprecedented level. This abundance of data has greatly facilitated maize research, but may not be amenable to traditional analysis techniques that were optimized for other data types. In one project, using publicly available data, a Gene Co-expression Network (GCN) was constructed and used for gene function prediction, candidate gene selection and improving understanding of regulatory pathways. To build an optimal GCN from plant materials RNA-Seq data, parameters for expression data normalization and network inference were evaluated. A comprehensive evaluation of these two parameters and ranked aggregation strategy on network performance using libraries from 1266 maize samples was conducted. Three normalization methods (VST, CPM, RPKM) and ten inference methods, including six correlation and four mutual information (MI) methods, were tested. The three normalization methods had very similar performance. For network inference, correlation methods performed better than MI methods at some genes. Increasing sample size also had a positive effect on GCN. Aggregating single networks together resulted in improved performance compared to single networks. In another project, a maize mutant, transgene reactivated 9-1 (tgr9-1) in the transcriptional gene silencing (TGS) pathway, was cloned. The B-A translocation lines were used to map tgr9-1 on chromosome 3 and this result was consistent with molecular markers. To further locate tgr9-1, next-generation sequencing (NGS) combined with bulk segregant analysis was applied to the tgr9-1 mapping population. Using coexpression analysis, our result indicates a maize dicer-like3a (Zmdcl3a) gene is a high-confidence candidate gene for tgr9. Zmdcl3a is involved in the RNA-directed DNA methylation (RdDM) pathway. This pathway is driven by two plant-specific DNA-dependent RNA polymerases, Polymerase IV (Pol IV) and Polymerase V (Pol V). Several kinds of non-coding RNAs are involved, including long single-stranded RNAs, double-stranded RNAs, and small interfering RNAs. The identification of tgr9-1 uncovered the role of non-coding RNAs in TGS and revealed the diversity of TGS pathways in maize. One primary focus of gene regulation study is by studying transcription factors (TFs). Transcription factors (TFs) are proteins that can bind to DNA sequences and regulate gene expression. Many TFs are master regulators in cells that contribute to tissue-specific and cell-type-specific gene expression patterns in eukaryotes. Little is known about tissue-specific gene regulation through TFs in maize. In this project, a network approach was applied to elucidate gene regulatory networks (GRNs) in four tissues (leaf, root, shoot apical meristem and seed) in maize. We used GENIE3 machine-learning algorithm combined with the large quantity of RNA-Seq expression data to construct four tissue-specific GRNs. Although many TFs were expressed across multiple tissues, a multi-tiered analysis predicted tissue-specific regulatory functions for many transcription factors. Some well-studied TFs emerged within the four tissue-specific GRNs, and the GRN predictions matched expectations based upon published results for many of these examples. The GRNs were also validated by ChIP-Seq datasets (KN1, FEA4, and O2). Key TFs were identified for each tissue and matched expectations for key regulators in each tissue, including GO enrichment and identity with known regulatory factors for that tissue.
GENE EXPRESSION, MAIZE, NETWORK, RDDM, SMALL RNA, TRANSCRIPTION FACTOR
April 2, 2018.
A Dissertation submitted to the Department of Biological Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy.
Includes bibliographical references.
Karen M. McGinnis, Professor Directing Dissertation; Alan R. Lemmon, University Representative; Kathryn M. Jones, Committee Member; Brian P. Chadwick, Committee Member; Jonathan H. Dennis, Committee Member.
Florida State University
2018_Sp_Huang_fsu_0071E_14421