Ensemble methods in large vocabulary continuous speech recognition

Chen, Xin, 1983-

Chen, Xin, 1983-

View/Open

public.pdf (7.216Kb)

short.pdf (10.86Kb)

research.pdf (480.2Kb)

Date

2008

Format

Thesis

Metadata

[+] Show full item record

Abstract

Combining a group of classifiers and therefore improving the overall classification performance is a young and promising direction in Large Vocabulary Continuous Speech Recognition (LVCSR). Previous works on acoustic modeling of speech signals such as Random Forests (RFs) of Phonetic Decision Trees (PDTs) has produced significant improvements in word recognition accuracy. In this thesis, several new ensemble approaches are proposed for LVCSR and experimental evaluations have shown absolute accuracy gains up to 2.3% over the conventional PDT-based acoustic models in our telehealth conversational speech recognition task. The word accuracy performance improvement achieved in this thesis work is significant and the techniques have been integrated in the telemedicine automatic captioning system developed by the SLIPL group of the University of Missouri--Columbia.

URI

https://doi.org/10.32469/10355/5797
https://hdl.handle.net/10355/5797

Degree

M.S.

Thesis Department

Computer science (MU)

Rights

OpenAccess.

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License.