Speaker Line-up Calibration of the i-vector Based Speaker Recognition System for Forensic Application
نویسندگان
چکیده
An automatic speaker recognition (ASR) system must produce reliable likelihood ratios (LR) in order to be used for evaluating and presenting speech evidence to court. The LR is only reliable if it produced from a well-calibrated ASR. A study by Rodriguez (2007) showed that the LR calculated from the un-calibrated system was often misleading, while the calibrated system produced more reliable LRs. These results illustrate the necessity of performing LR calibration when using the ASR system for forensic applications. Several calibration techniques for ASR systems, such as the linear calibration method as presented in (Brümmer, 2006) and the novel line-up calibration method which was developed by van Leeuwen and Brümmer (2011) have been proposed in order to calibrate the LR produced by ASR.
منابع مشابه
Evaluation of i-vector Speaker Recognition Systems for Forensic Application
This paper contributes a study on i-vector based speaker recognition systems and their application to forensics. The sensitivity of i-vector based speaker recognition is analyzed with respect to the effects of speech duration. This approach is motivated by the potentially limited speech available in a recording for a forensic case. In this context, the classification performance and calibration...
متن کاملProceedings of Biometric Technologies in Forensic Science
This paper presents the performance evaluation of i-vector and PLDA based speaker recognition system which incorporate quality measures function (QMF) in linear calibration. Evaluated on the recent NIST SRE’12 corpus, the linear calibration with QMF as the additional term shows a positive gain in the system performance compared to the conventional linear calibration with only two terms. Based o...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملSource normalization for language-independent speaker recognition using i-vectors
Source-normalization (SN) is an effective means of improving the robustness of i-vector-based speaker recognition for under-resourced and unseen cross-speech-source evaluation conditions. The technique of source-normalization estimates directions of undesired within-speaker variation more accurately than traditional methods when cross-source variation is not explicitly observed from each speake...
متن کامل