Frame pruning for automatic speaker identification
Author(s)
Besacier, Laurent
Bonastre, J. F.
Date issued
September 8, 1998
In
IEEE European Signal Processing Conference EUSIPCO, Institute of Electrical and Electronics Engineers (IEEE), 1998///367-370
Abstract
In this paper, we propose a frame selection procedure for text-independent speaker identification. Instead of averaging the frame likelihoods along the whole test utterance, some of these are rejected (pruning) and the final score is computed with a limited number of frames. This pruning stage requires a prior frame level likelihood normalization in order to make comparison between frames meaningful. This normalization procedure alone leads to a significative performance enhancement. As far as pruning is concerned, the optimal number of frames pruned is learned on a tuning data set for normal and telephone speech. Validation of the pruning procedure on 567 speakers leads to a significative improvement on TIMIT and NTIMIT (up to 30% error rate reduction on TIMIT).
Publication type
journal article
File(s)![Thumbnail Image]()
Loading...
Name
Besacier_Laurent_-_Frame_pruning_for_automatic_speaker_20070115.pdf
Type
Main Article
Size
320.85 KB
Format
Adobe PDF
