UWEETR-2006-0013 Author(s): Keywords: Abstract This paper describes our efforts in porting the SRI Decipher English system into Mandarin for transcribing telephone conversations. This includes all aspects of the system: the pronunciation phone set and lexicon, word segmentation, pitch features, discriminatively trained acoustic models with parameter sharing determined by decision trees, and web-data augmented language models. |