[-] Show simple item record

dc.contributor.advisorXu, Dong, 1965-eng
dc.contributor.authorAlazmi, Meshari Saudeng
dc.date.issued2012eng
dc.date.submitted2012 Springeng
dc.descriptionTitle from PDF of title page (University of Missouri--Columbia, viewed on September 10, 2012).eng
dc.descriptionThe entire thesis text is included in the research.pdf file; the official abstract appears in the short.pdf file; a non-technical public abstract appears in the public.pdf file.eng
dc.descriptionThesis advisor: Dr. Dong Xueng
dc.descriptionIncludes bibliographical references.eng
dc.descriptionM. S. University of Missouri--Columbia 2012.eng
dc.description"May 2012"eng
dc.description.abstractQuality assessment for protein structure models is an important issue in protein structure prediction. Consensus methods assess each model based on its structural similarity to all the other models in a model set, while single scoring methods, such as Opus-ca and RW, evaluate each model based on its structural properties. In this work, a novel method proposed and developed to effectively combine consensus methods and single scoring methods for better quality assessment. At first, a new method called Single Position Specific Probability (SPSP) Score is proposed based on consensus method using 4-mer sequence. Specifically, every letter in the 4-mer sequence represents a state for a local region consisting of four amino acids. A machine learning method (Neural Network) helped to combine several single scoring methods, RW, DDFire, and OPusCa with consensus methods, SPSP and Consensus Global Distance Test-Total Score (CGDT-TS) to achieve a good combination of all the terms. The method was tested on two benchmark datasets and achieved improvements over the state-of-the-art methods. The first benchmark was on Yang Zhang's data containing 56 targets. The second benchmark was from Rosetta data containing 35 targets. For Zhang's data, the CGDT score is 0.6058, while combined method achieved 0.6105. For Rosetta data, the CGDT score achieved 0.4255, while combined method achieved 0.4529.eng
dc.format.extentxiii, 98 pageseng
dc.identifier.urihttp://hdl.handle.net/10355/15238
dc.languageEnglisheng
dc.publisherUniversity of Missouri--Columbiaeng
dc.relation.ispartofcommunityUniversity of Missouri--Columbia. Graduate School. Theses and Dissertationseng
dc.rightsOpenAccess.eng
dc.rights.licenseThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License.
dc.subject4-mer sequenceeng
dc.subjectprotein structure predictioneng
dc.subjectprotein structure modeleng
dc.titleProtein structural models selection using 4-mer sequence and combined single and consensus scoreseng
dc.typeThesiseng
thesis.degree.disciplineComputer science (MU)eng
thesis.degree.grantorUniversity of Missouri--Columbiaeng
thesis.degree.levelMasterseng
thesis.degree.nameM.S.eng


Files in this item

[PDF]
[PDF]
[PDF]

This item appears in the following Collection(s)

[-] Show simple item record