Dear guest, welcome to this publication database. As an anonymous user, you will probably not have edit rights. Also, the collapse status of the topic tree will not be persistent. If you like to have these and other options enabled, you might ask Admin (Ivan Eggel) for a login account.
 [BibTeX] [RIS]
Coping with Two Different Transmission Channels in Language Recognition
Type of publication: Inproceedings
Citation: verdet10:odyssey
Booktitle: Odyssey 2010, The Speaker and Language Recognition Workshop
Year: 2010
Month: June
Pages: 230-237
Location: Brno, Czech Republic
URL: http://www.hennebert.org/downl...
Abstract: This paper confirms the huge benefits of Factor Analysis over Maximum A-Posteriori adaptation for language recognition (up to 87% relative gain). We investigate ways to cope with the particularity of NIST’s LRE 2009, containing Conversational Telephone Speech (CTS) and phone bandwidth segments of radio broadcasts (Voice Of America, VOA). We analyze GMM systems using all data pooled together, eigensession matrices estimated on a per condition basis and systems using a concatenation of these matrices. Results are presented on all LRE 2009 test segments, as well as only on the CTS or only on the VOA test utterances. Since performances on all 23 languages are not trivial to compare, due to lacking language–channel combinations in the training and also in the testing data, all systems are also evaluated in the context of the subset of 8 common languages. Addressing the question if a fusion of two channel specific systems may be more beneficial than putting all data together, we study an oracle based system selector. On the 8 language subset, a pure CTS system performs at a minimal average cost of 2.7% and pure VOA at 1.9% minCavg on their respective test conditions. The fusion of these two systems runs at 2.0% minCavg. As main observation, we see that the way we estimate the session compensation matrix has not a big influence, as long as the language–channel combinations cover those used for training the language models. Far more crucial is the kind of data used for model estimation.
Keywords: Benchmarking, Language Identification, machine learning
Authors Verdet, Florian
Matrouf, Driss
Bonastre, Jean-François
Hennebert, Jean
Added by: []
Total mark: 0
Attachments
  • odyssey10_Coping-with-Two-Diff...
Notes
    Topics