Language modeling for automatic speech recognition in telehealth

Please use this identifier to cite or link to this item: http://hdl.handle.net/10355/4245


Title: Language modeling for automatic speech recognition in telehealth
Author: Zhang, Xiaojia, 1977-
Date: 2005
Publisher: University of Missouri--Columbia
Abstract: Standard statistical n-gram language models play a critical role in automatic speech recognition (ASR) applications. Though helpful to ASR, they suffer from a practical problem when there is insufficient in-domain training data, that is, data drawn from the same or similar sources as the task text. To improve language model performance, various datasets must be used to supplement the in-domain training data. This thesis investigates effective approaches to language modeling for telehealth, which consists of doctor-patient conversational speech in a medical specialty domain. Efforts were made to collect and analyze various datasets for training and to find a method for modeling the target language. By effectively defining word classes, and by combining class and word trigram language models trained separately on in-domain and out-of-domain datasets, large improvements in perplexity reduction were achieved over a baseline word trigram language model that simply interpolates word trigram models trained on different data sources.
URI: http://hdl.handle.net/10355/4245
Other Identifiers: ZhangX-051706-T3786
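
The abstract names three ingredients: class-based trigram probabilities, interpolation of component models, and perplexity as the evaluation measure. The Python sketch below illustrates each under stated assumptions. It assumes the common class-based factorization P(w | h) = P(c(w) | classes of h) * P(w | c(w)) and plain linear interpolation with fixed weights; the abstract does not state the exact formulation or weights used in the thesis, so the function names and all numbers here are hypothetical.

```python
import math
from typing import Sequence


def class_trigram_prob(p_class_given_history: float, p_word_given_class: float) -> float:
    """Class-based probability of a word (assumed factorization, not taken from the thesis).

    P(w | h) = P(c(w) | classes of the two preceding words) * P(w | c(w))
    """
    return p_class_given_history * p_word_given_class


def interpolate(probs: Sequence[float], weights: Sequence[float]) -> float:
    """Linearly interpolate the probabilities that several models assign to one word."""
    if len(probs) != len(weights):
        raise ValueError("need exactly one weight per component model")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("interpolation weights must sum to 1")
    return sum(w * p for w, p in zip(weights, probs))


def perplexity(token_probs: Sequence[float]) -> float:
    """Perplexity of a test text, given the model probability of each of its tokens."""
    if not token_probs:
        raise ValueError("need at least one token probability")
    log_sum = sum(math.log(p) for p in token_probs)
    return math.exp(-log_sum / len(token_probs))


if __name__ == "__main__":
    # Hypothetical probabilities for a single word in context, one per component model.
    p_word_in_domain = 0.020      # word trigram, in-domain data
    p_word_out_domain = 0.005     # word trigram, out-of-domain data
    p_class_in_domain = class_trigram_prob(0.30, 0.10)  # class trigram, in-domain data

    # Hypothetical weights; in practice they would be tuned on held-out in-domain text.
    p_combined = interpolate(
        [p_word_in_domain, p_word_out_domain, p_class_in_domain],
        [0.5, 0.2, 0.3],
    )
    print(f"combined probability: {p_combined:.4f}")
```

Computing `perplexity` over the per-token probabilities of a held-out in-domain transcript, once for a word-only interpolation and once for a combination that includes class models, gives the kind of comparison the abstract reports; lower perplexity means the model predicts the text better.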
