An exploration of composite language modeling for speech recognition

Xie, Xiaolin

An exploration of composite language modeling for speech recognition

Files

public.pdf (14.98 KB)

research.pdf (27.65 MB)

short.pdf (52.65 KB)

Authors

Xie, Xiaolin

Date

2013

Format

Thesis

Subject

language modeling, confusion network, speech recognition, word lattice

Abstract

Language models are one of the most critical knowledge sources of automatic speech recognition (ASR) systems. In the past decades, many language models have been developed, and some have proved useful and successful in speech recognition systems. However, almost all language models only capture one or two aspects of natural language. This study aims to investigate the effects of a syntactic, semantic, and lexical language model on speech recognition. In this study, we refer this language model as the composite language model (CLM). The parameters of the CLM in our study are distributed among hundreds of computer nodes in a supercomputer because they are too large to be stored in just one computer node. A distributed application has been developed to implement two speech rescoring techniques by using the CLM: lattice rescoring and confusion network rescoring. Experiments on a Wall Street Journal task have shown that using CLM to rescore word lattices and confusion networks have led to improvements in word accuracy over the commonly used trigram language model, with the latter offering a larger performance gain.

URI

http://hdl.handle.net/10355/38527

Degree

M.S.

Thesis Department

Computer science (MU)

Collections

2013 MU theses - Freely available online
Computer Science electronic theses and dissertations (MU)

Full item page

An exploration of composite language modeling for speech recognition

Files

Authors

Meeting name

Sponsors

Date

Journal Title

Format

Subject

Research Projects

Organizational Units

Journal Issue

Abstract

Table of Contents

URI

DOI

PubMed ID

Degree

Thesis Department

Rights

License

Collections