dc.contributor.advisor | Zhao, Yunxin | eng |
dc.contributor.author | Xie, Xiaolin | eng |
dc.date.issued | 2013 | eng |
dc.date.submitted | 2013 Spring | eng |
dc.description | Title from PDF of title page (University of Missouri--Columbia, viewed on September 12, 2013). | eng |
dc.description | The entire thesis text is included in the research.pdf file; the official abstract appears in the short.pdf file; a non-technical public abstract appears in the public.pdf file. | eng |
dc.description | Thesis advisor: Dr. Yunxin Zhao | eng |
dc.description | Includes bibliographical references. | eng |
dc.description | M.S. University of Missouri--Columbia 2013. | eng |
dc.description | Dissertations, Academic -- University of Missouri--Columbia -- Computer science. | eng |
dc.description | "May 2013" | eng |
dc.description.abstract | Language models are one of the most critical knowledge sources of automatic speech recognition (ASR) systems. In the past decades, many language models have been developed, and some have proved useful and successful in speech recognition systems. However, almost all language models only capture one or two aspects of natural language. This study aims to investigate the effects of a syntactic, semantic, and lexical language model on speech recognition. In this study, we refer this language model as the composite language model (CLM). The parameters of the CLM in our study are distributed among hundreds of computer nodes in a supercomputer because they are too large to be stored in just one computer node. A distributed application has been developed to implement two speech rescoring techniques by using the CLM: lattice rescoring and confusion network rescoring. Experiments on a Wall Street Journal task have shown that using CLM to rescore word lattices and confusion networks have led to improvements in word accuracy over the commonly used trigram language model, with the latter offering a larger performance gain. | eng |
dc.format.extent | vii, 67 pages | eng |
dc.identifier.uri | http://hdl.handle.net/10355/38527 | |
dc.language | English | eng |
dc.publisher | University of Missouri--Columbia | eng |
dc.relation.ispartofcommunity | University of Missouri--Columbia. Graduate School. Theses and Dissertations | eng |
dc.source | Submitted by the University of Missouri--Columbia Graduate School | eng |
dc.subject | language modeling | eng |
dc.subject | confusion network | eng |
dc.subject | speech recognition | eng |
dc.subject | word lattice | eng |
dc.title | An exploration of composite language modeling for speech recognition | eng |
dc.type | Thesis | eng |
thesis.degree.discipline | Computer science (MU) | eng |
thesis.degree.grantor | University of Missouri--Columbia | eng |
thesis.degree.level | Masters | eng |
thesis.degree.name | M.S. | eng |