dc.contributor.advisor | Zhao, Yunxin | eng |
dc.contributor.author | Hu, Rusheng, 1971- | eng |
dc.date.issued | 2006 | eng |
dc.date.submitted | 2006 Fall | eng |
dc.description | The entire dissertation/thesis text is included in the research.pdf file; the official abstract appears in the short.pdf file (which also appears in the research.pdf); a non-technical general description, or public abstract, appears in the public.pdf file. | eng |
dc.description | Title from title screen of research.pdf file (viewed on August 2, 2007) | eng |
dc.description | Vita. | eng |
dc.description | Thesis (Ph. D.) University of Missouri-Columbia 2006. | eng |
dc.description.abstract | This dissertation investigates optimization of acoustic models in speech recognition. Two new optimization methods are proposed for phonetic decision tree (PDT) search and Hidden Markov modeling (HMM)-- the knowledge-based adaptive PDT algorithm and the HMM gradient boosting algorithm. Investigations are conducted to applying both methods to improve word error rate of the state-of-the-art speech recognition system. However, these two methods are developed in a general machine learning background and their applications are not limited to speech recognition. The HMM gradient boosting method is based on a function approximation scheme from the perspective of optimization in function space rather than the parameter space, based on the fact that the Gaussian mixture model in each HMM state is an additive model of homogeneous functions (Gaussians). It provides a new scheme which can jointly optimize model structure and parameters. Experiments are conducted on the World Street Journal (WSJ) task and good improvements on word error rate are observed. The knowledge-based adaptive PDT algorithm is developed under a trend toward knowledge-based systems and aims at optimizing the mapping from contextual phones to articulatory states by maximizing implicit usage of the phonological and phonetic information, which is presumed to be contained in large data corpus. A computational efficient algorithm is developed to incorporate this prior knowledge in PDT construction. This algorithm is evaluated on the Telehealth conversational speech recognition and significant improvement on system performance is achieved. | eng |
dc.description.bibref | Includes bibliographical references. | eng |
dc.identifier.merlin | b59269194 | eng |
dc.identifier.oclc | 162129635 | eng |
dc.identifier.uri | https://doi.org/10.32469/10355/4329 | eng |
dc.identifier.uri | https://hdl.handle.net/10355/4329 | |
dc.language | English | eng |
dc.publisher | University of Missouri--Columbia | eng |
dc.relation.ispartofcommunity | University of Missouri--Columbia. Graduate School. Theses and Dissertations | eng |
dc.rights | OpenAccess. | eng |
dc.rights.license | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License. Copyright held by author. | |
dc.subject | phonetic decision tree. | eng |
dc.subject | phonetic decision tree | eng |
dc.subject.lcsh | Speech perception | eng |
dc.subject.lcsh | Pattern recognition systems | eng |
dc.subject.lcsh | Hidden Markov models | eng |
dc.title | Statistical optimization of acoustic models for large vocabulary speech recognition | eng |
dc.type | Thesis | eng |
thesis.degree.discipline | Computer science (MU) | eng |
thesis.degree.grantor | University of Missouri--Columbia | eng |
thesis.degree.level | Doctoral | eng |
thesis.degree.name | Ph. D. | eng |