The Johns Hopkins University

Whiting School of Engineering

Department of Electrical and Computer Engineering

 

Multiple Heteroscedastic Linear Discriminant Analysis



Seminar By

 

Haolang Zhou

Graduate Research Assistant

Electrical and Computer

 

 

 

Abstract:

Current automatic speech recognition systems employ dimension reduction schemes to project high dimensional features to a lower dimensional space. However, the resulting lower dimensional features used during recognition and decoding are the same for all phones. At the same time, for related tasks such as phone recognition, phoneticians have long relied on phone-specific cues to verify the identity of different phones.

In this proposal, we present a novel feature reduction mechanism that allows different transforms for each phone, class of phones, or even acoustic state. The approach adopted is to provide a rich and redundant representation of the input feature space which is then reduced using class-specific linear transforms. Care needs to be taken to ensure that the resulting scores are comparable across different phones. Performance is reported on Rich Transcription 04 Arabic speech recognition test data. Preliminary results are promising.

 

 

Friday, October 12, 2007

11:00 AM

Barton 114

 

 

Refreshments will be served at 10:45 AM

 

 

 

FOR DISABILITY INFORMATION

CONTACT:  Candace Abel (410) 516-7031 cabel@jhu.edu