Towards deep learning on speech recognition for Khmer language

Lim, Chanmann

Towards deep learning on speech recognition for Khmer language

Files

public.pdf (39.53 KB)

research.pdf (1 MB)

short.pdf (20.65 KB)

Authors

Lim, Chanmann

Date

2016

Format

Thesis

Abstract

In order to perform speech recognition well, a huge amount of transcribed speech and textual data in the target language must be available for system training. The high demand for language resources constrains the development of speech recognition systems for new languages. In this thesis the development of a low-resourced isolated-word recognition system for "Khmer" language is investigated. Speech data, collected via mobile phone, containing 194 vocabulary words is used in our experiments. Data pre-processing based on Voice Activity Detection (VAD) is discussed. As by-products of this work, phoneme based pronunciation lexicon and state tying questions set for Khmer speech recognizer are built from scratch. In addition to the conventional statistical acoustic modeling using Gaussian Mixture Model and hidden Markov Model (GMMHMM), a hybrid acoustic model based on Deep Neural Network (DNN-HMM) trained to predict contextdependent triphone states is evaluated. Dropout is used to improve the robustness of the DNN, and crosslingual transfer learning that makes use of auxiliary training data in English is also investigated. As the first effort in using DNN-HMM for low-resourced isolated-word recognition for Khmer language, the system currently performs at 93.31% word accuracy in speaker-independent mode on our test set.

URI

https://hdl.handle.net/10355/56110

Degree

M.S.

Thesis Department

Computer science (MU)

Rights

OpenAccess.

License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License.

Collections

2016 MU theses - Freely available online
Computer Science electronic theses and dissertations (MU)

Full item page

Towards deep learning on speech recognition for Khmer language

Files

Authors

Meeting name

Sponsors

Date

Journal Title

Format

Subject

Research Projects

Organizational Units

Journal Issue

Abstract

Table of Contents

URI

DOI

PubMed ID

Degree

Thesis Department

Rights

License

Collections