Modulation domain processing and speech phase spectrum in speech enhancement

Zhang, Yi

Zhang, Yi

View/Open

public.pdf (1.944Kb)

research.pdf (988.8Kb)

short.pdf (9.847Kb)

Date

2012

Format

Thesis

Metadata

[+] Show full item record

Abstract

Clean speech signal is often accompanied by various kinds of interferences, such as background noise, reverberation, and competing speech. These interferences degrade speech perceptual quality and intelligibility, and hamper speech technology applications. Conventional speech enhancement methods enhance the acoustic magnitude spectrum and use the corrupted speech phase spectrum for signal recovery. Besides, acoustic frequency domain subtraction methods often introduce large speech distortions, which degrade the enhancement performance. We propose a novel spectral subtraction method for noisy speech enhancement (MRISS) to enhance magnitude as well as phase through spectral subtraction. We investigate applying the MRISS algorithm to the speech dereverberation task to recover the reverberant speech. We investigate DOA based blind speech separation method under clean, noisy and reverberant conditions. We propose using ALMM to fit the subband IPD data to improve the DOA estimation, and propose using a log likelihood criterion to estimate the source numbers. Both subjective and objective measurements proved that the proposed methods obtained better results over state-of-art techniques on TIMIT dataset.

URI

https://hdl.handle.net/10355/33117
https://doi.org/10.32469/10355/33117

Degree

Ph. D.

Thesis Department

Computer science (MU)

Rights

OpenAccess.

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License.