An exploration of composite language modeling for speech recognition

Xie, Xiaolin

URI

http://hdl.handle.net/10355/38527

dc.contributor.advisor	Zhao, Yunxin	eng
dc.contributor.author	Xie, Xiaolin	eng
dc.date.issued	2013	eng
dc.date.submitted	2013 Spring	eng
dc.description	Title from PDF of title page (University of Missouri--Columbia, viewed on September 12, 2013).	eng
dc.description	The entire thesis text is included in the research.pdf file; the official abstract appears in the short.pdf file; a non-technical public abstract appears in the public.pdf file.	eng
dc.description	Thesis advisor: Dr. Yunxin Zhao	eng
dc.description	Includes bibliographical references.	eng
dc.description	M.S. University of Missouri--Columbia 2013.	eng
dc.description	Dissertations, Academic -- University of Missouri--Columbia -- Computer science.	eng
dc.description	"May 2013"	eng
dc.description.abstract	Language models are one of the most critical knowledge sources of automatic speech recognition (ASR) systems. Over the past decades, many language models have been developed, and some have proved useful and successful in speech recognition systems. However, almost all language models capture only one or two aspects of natural language. This study investigates the effects on speech recognition of a language model that combines syntactic, semantic, and lexical information. In this study, we refer to this language model as the composite language model (CLM). The parameters of the CLM in our study are distributed among hundreds of computer nodes in a supercomputer because they are too large to be stored on a single node. A distributed application has been developed to implement two speech rescoring techniques using the CLM: lattice rescoring and confusion network rescoring. Experiments on a Wall Street Journal task have shown that using the CLM to rescore word lattices and confusion networks has led to improvements in word accuracy over the commonly used trigram language model, with confusion network rescoring offering the larger performance gain.	eng
dc.format.extent	vii, 67 pages	eng
dc.identifier.uri	http://hdl.handle.net/10355/38527
dc.language	English	eng
dc.publisher	University of Missouri--Columbia	eng
dc.relation.ispartofcommunity	University of Missouri--Columbia. Graduate School. Theses and Dissertations	eng
dc.source	Submitted by the University of Missouri--Columbia Graduate School	eng
dc.subject	language modeling	eng
dc.subject	confusion network	eng
dc.subject	speech recognition	eng
dc.subject	word lattice	eng
dc.title	An exploration of composite language modeling for speech recognition	eng
dc.type	Thesis	eng
thesis.degree.discipline	Computer science (MU)	eng
thesis.degree.grantor	University of Missouri--Columbia	eng
thesis.degree.level	Masters	eng
thesis.degree.name	M.S.	eng

Files in this item

Name: public.pdf
Size: 14.97 KB
Format: PDF

Name: research.pdf
Size: 27.65 MB
Format: PDF

Name: short.pdf
Size: 52.65 KB
Format: PDF

This item appears in the following Collection(s)

2013 MU theses - Freely available online
Computer Science electronic theses and dissertations (MU)
The electronic theses and dissertations of the Department of Computer Science.