Robust speech recognition based on a Bayesian prediction approach

Jiang, H; Hirose, K; Huo, Q

Publisher Website: 10.1109/89.771309
Scopus: eid_2-s2.0-0032685060
WOS: WOS:000080961100007
Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Information Technology Services: Journal/Magazine Articles

Article: Robust speech recognition based on a Bayesian prediction approach

Title	Robust speech recognition based on a Bayesian prediction approach
Authors	Jiang, H; Hirose, K; Huo, Q
Issue Date	1999
Publisher	IEEE.
Citation	IEEE Transactions on Speech and Audio Processing, 1999, v. 7 n. 4, p. 426-440. DOI: http://dx.doi.org/10.1109/89.771309
Abstract	We study a category of robust speech recognition problems in which mismatches exist between training and testing conditions, and no accurate knowledge of the mismatch mechanism is available. The only available information is the test data along with a set of pretrained Gaussian mixture continuous density hidden Markov models (CDHMMs). We investigate the problem from the viewpoint of Bayesian prediction. A simple prior distribution, namely a constrained uniform distribution, is adopted to characterize the uncertainty of the mean vectors of the CDHMMs. Two methods, namely a model compensation technique based on Bayesian predictive density and a robust decision strategy called Viterbi Bayesian predictive classification, are studied. The proposed methods are compared with the conventional Viterbi decoding algorithm in speaker-independent recognition experiments on isolated digits and TI connected digit strings (TIDIGITS), where the mismatches between training and testing conditions are caused by: (1) additive Gaussian white noise, (2) each of 25 types of actual additive ambient noises, and (3) gender difference. The experimental results show that the adopted prior distribution and the proposed techniques help to improve the performance robustness under the examined mismatch conditions.
Persistent Identifier	http://hdl.handle.net/10722/43648
ISSN	1063-6676
ISI Accession Number ID	WOS:000080961100007

DC Field	Value	Language
dc.contributor.author	Jiang, H	en_HK
dc.contributor.author	Hirose, K	en_HK
dc.contributor.author	Huo, Q	en_HK
dc.date.accessioned	2007-03-23T04:51:13Z	-
dc.date.available	2007-03-23T04:51:13Z	-
dc.date.issued	1999	en_HK
dc.identifier.citation	IEEE Transactions on Speech and Audio Processing, 1999, v. 7 n. 4, p. 426-440	en_HK
dc.identifier.issn	1063-6676	en_HK
dc.identifier.uri	http://hdl.handle.net/10722/43648	-
dc.description.abstract	We study a category of robust speech recognition problems in which mismatches exist between training and testing conditions, and no accurate knowledge of the mismatch mechanism is available. The only available information is the test data along with a set of pretrained Gaussian mixture continuous density hidden Markov models (CDHMMs). We investigate the problem from the viewpoint of Bayesian prediction. A simple prior distribution, namely a constrained uniform distribution, is adopted to characterize the uncertainty of the mean vectors of the CDHMMs. Two methods, namely a model compensation technique based on Bayesian predictive density and a robust decision strategy called Viterbi Bayesian predictive classification, are studied. The proposed methods are compared with the conventional Viterbi decoding algorithm in speaker-independent recognition experiments on isolated digits and TI connected digit strings (TIDIGITS), where the mismatches between training and testing conditions are caused by: (1) additive Gaussian white noise, (2) each of 25 types of actual additive ambient noises, and (3) gender difference. The experimental results show that the adopted prior distribution and the proposed techniques help to improve the performance robustness under the examined mismatch conditions.	en_HK
dc.format.extent	464112 bytes	-
dc.format.extent	27136 bytes	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/msword	-
dc.language	eng	en_HK
dc.publisher	IEEE.	en_HK
dc.relation.ispartof	IEEE Transactions on Speech and Audio Processing	-
dc.rights	©1999 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.	-
dc.title	Robust speech recognition based on a Bayesian prediction approach	en_HK
dc.type	Article	en_HK
dc.identifier.openurl	http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=1063-6676&volume=7&issue=4&spage=426&epage=440&date=1999&atitle=Robust+speech+recognition+based+on+a+Bayesian+prediction+approach	en_HK
dc.description.nature	published_or_final_version	en_HK
dc.identifier.doi	10.1109/89.771309	en_HK
dc.identifier.scopus	eid_2-s2.0-0032685060	-
dc.identifier.hkuros	47887	-
dc.identifier.isi	WOS:000080961100007	-
dc.identifier.issnl	1063-6676	-
