Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/48504
Title: Speaker recognition system
Authors: Song, Liyan.
Keywords: DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Issue Date: 2012
Abstract: This report givens an overview of a Gaussian Mixture Model – Universal Background Model (GMM-UBM) system which focusing on speaker identification. In this report we will be focusing on the traditional FFT-based Mel-Frequency Cepstral Coefficients (MFCCs) method to extract feature from wav file and GMM-UBM to create speaker model. The detail information of MFCC and GMM-UBM will be explained in the report. The program is build based using GMM-UBM and MFCC, the likelihood ratio of the testing speech are the output of the program. The experiment is carry out to evaluate the effects on accuracy when different mixture and file of MFC are used.
URI: http://hdl.handle.net/10356/48504
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
SCE0053.pdf
  Restricted Access
1.16 MBAdobe PDFView/Open

Page view(s) 50

162
checked on Oct 26, 2020

Download(s) 50

9
checked on Oct 26, 2020

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.