dc.contributor.advisor | Shyu, Chi-Ren | eng |
dc.contributor.author | Reneker, Jeff | eng |
dc.date.issued | 2010 | eng |
dc.date.submitted | 2010 Fall | eng |
dc.description.abstract | All of the information necessary to reproduce a living organism is contained in the DNA of its genome. Within the genomic sequence, there are subsequences called genes that are transcribed into RNA and translated into proteins which control cellular function. The genomes of many different organisms, including human, have already been sequenced and many more are currently in progress. Also, there are concurrent projects to annotate the genes from these genomes. Once annotated, cross-species comparisons of both gene sequence and gene annotation terms are possible, which can facilitate knowledge discovery. This dissertation introduces our work in this area. For example, our newly-developed refined repetitive sequence searches were used to identify a potentially new phase-variable site in Haemophilus influenzae based on information previously reported from Helicobacter pylori. The genome of an organism is inherited from its parents and is, in turn, passed on to its descendants. Determinative DNA sequences, both intragenic and intergenic, are often very highly conserved between diverging species over time. In fact, many of these sequences have been exactly conserved in multiple species throughout evolution. Once these sequences can be comprehensively identified in a set of genomes, they can be studied in more detail. This dissertation introduces our work in this area as well. For instance, several previously unreported long identical multi-species elements (LIMEs) were identified and studied in mammals. Also, the comprehensive set of LIMEs in six large plant genomes are identified and extensively characterized in terms structure, function and evolution. | eng |
dc.format.extent | xi, 100 pages | eng |
dc.identifier.uri | https://hdl.handle.net/10355/41915 | |
dc.identifier.uri | https://doi.org/10.32469/10355/41915 | eng |
dc.language | English | eng |
dc.publisher | University of Missouri--Columbia | eng |
dc.relation.ispartofcommunity | University of Missouri--Columbia. Graduate School. Theses and Dissertations | eng |
dc.rights | OpenAccess. | eng |
dc.rights.license | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License. | |
dc.subject | Cross-species searches | eng |
dc.subject | Hashmap indexing of DNA sequence | eng |
dc.subject | Refined repetitive searches | eng |
dc.subject | Comparative genomics | eng |
dc.subject | Long identical multi-species elements (LIMES) | eng |
dc.subject.FAST | Computational biology | eng |
dc.subject.FAST | Genomics | eng |
dc.subject.FAST | Nucleotide sequence | eng |
dc.title | Refined repetitive searches and long identical multi-species elements in mammals and plants : Insights into structure, function and evolution | eng |
dc.type | Thesis | eng |
thesis.degree.discipline | Computer science (MU) | eng |
thesis.degree.grantor | University of Missouri--Columbia | eng |
thesis.degree.level | Doctoral | eng |
thesis.degree.name | Ph. D. | eng |