[-] Show simple item record

dc.contributor.advisorZhao, Yunxineng
dc.contributor.authorXie, Xiaolineng
dc.date.issued2013eng
dc.date.submitted2013 Springeng
dc.descriptionTitle from PDF of title page (University of Missouri--Columbia, viewed on September 12, 2013).eng
dc.descriptionThe entire thesis text is included in the research.pdf file; the official abstract appears in the short.pdf file; a non-technical public abstract appears in the public.pdf file.eng
dc.descriptionThesis advisor: Dr. Yunxin Zhaoeng
dc.descriptionIncludes bibliographical references.eng
dc.descriptionM.S. University of Missouri--Columbia 2013.eng
dc.descriptionDissertations, Academic -- University of Missouri--Columbia -- Computer science.eng
dc.description"May 2013"eng
dc.description.abstractLanguage models are one of the most critical knowledge sources of automatic speech recognition (ASR) systems. In the past decades, many language models have been developed, and some have proved useful and successful in speech recognition systems. However, almost all language models only capture one or two aspects of natural language. This study aims to investigate the effects of a syntactic, semantic, and lexical language model on speech recognition. In this study, we refer this language model as the composite language model (CLM). The parameters of the CLM in our study are distributed among hundreds of computer nodes in a supercomputer because they are too large to be stored in just one computer node. A distributed application has been developed to implement two speech rescoring techniques by using the CLM: lattice rescoring and confusion network rescoring. Experiments on a Wall Street Journal task have shown that using CLM to rescore word lattices and confusion networks have led to improvements in word accuracy over the commonly used trigram language model, with the latter offering a larger performance gain.eng
dc.format.extentvii, 67 pageseng
dc.identifier.urihttp://hdl.handle.net/10355/38527
dc.languageEnglisheng
dc.publisherUniversity of Missouri--Columbiaeng
dc.relation.ispartofcommunityUniversity of Missouri--Columbia. Graduate School. Theses and Dissertationseng
dc.sourceSubmitted by the University of Missouri--Columbia Graduate Schooleng
dc.subjectlanguage modelingeng
dc.subjectconfusion networkeng
dc.subjectspeech recognitioneng
dc.subjectword latticeeng
dc.titleAn exploration of composite language modeling for speech recognitioneng
dc.typeThesiseng
thesis.degree.disciplineComputer science (MU)eng
thesis.degree.grantorUniversity of Missouri--Columbiaeng
thesis.degree.levelMasterseng
thesis.degree.nameM.S.eng


Files in this item

[PDF]
[PDF]
[PDF]

This item appears in the following Collection(s)

[-] Show simple item record