Now showing items 1-3 of 3

    • Broadcast news story segmentation using conditional random fields and multimodal features 

      Wang, Xiaoxuan; Xie, Lei; Lu, Mimi; Ma, Bin; Chng, Eng Siong; Li, Haizhou (2012)
      In this paper, we propose integration of multimodal features using conditional random fields (CRFs) for the segmentation of broadcast news stories. We study story boundary cues from lexical, audio and video modalities, ...
    • Discriminative feature extraction for speech recognition using continuous output codes 

      Dehzangi, Omid; Ma, Bin; Chng, Eng Siong; Li, Haizhou (2012)
      Feature transformation techniques have been widely investigated to reduce feature redundancy and to introduce additional discriminative information with the aim to improve the performance of automatic speech recognition ...
    • The NNi Vietnamese speech recognition system for mediaeval 2016 

      Wang, Lei; Ni, Chongjia; Leung, Cheung-Chi; You, Changhuai; Xie, Lei; Xu, Haihua; Xiao, Xiong; Nwe, Tin Lay; Chng, Eng Siong; Ma, Bin; Li, Haizhou (2016)
      This paper provides an overall description of the Vietnamese speech recognition system developed by the joint team for MediaEval 2016. The submitted system consisted of 3 subsystems, and adopted different deep neural ...