Variance Normalised Features for Language and Dialect Discrimination

Miao, Xiaoxiao; McLoughlin, Ian; Song, Yan

doi:10.1007/s00034-020-01641-1

journal contribution

posted on 2024-04-03, 05:22 authored by Xiaoxiao Miao, Ian McLoughlin, Yan Song

This paper proposes novel features for automated language and dialect identification that aim to improve discriminative power by ensuring that each element of the feature vector makes a normalised contribution to inter-class variance. The method first computes inter- and intra-class frequency variance statistics, then distributes the overall spectral variance across spectral regions sized so that each contains a near-equal variance difference. Spectral features are average-pooled within regions to obtain variance normalised features (VNFs). The proposed VNFs are low-complexity drop-in replacements for MFCC, SDC, PLP or other input features used for speech-related tasks. In this paper, they are evaluated against MFCCs in three types of system on two data-constrained language and dialect identification tasks. VNFs demonstrate good results, comfortably outperforming MFCCs at most dimension sizes, and performing particularly well on the most challenging data-constrained condition, the 3 s utterance length in the LID task.
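The abstract does not give the exact statistics or the partitioning rule, so the following is only a minimal NumPy sketch of the three steps it names (variance statistics, near-equal-variance regions, average pooling), not the authors' implementation. The function names, the score definition (inter-class variance minus mean intra-class variance, floored at zero) and the cumulative-sum partitioning are all assumptions made for illustration.

```python
import numpy as np


def variance_difference(spectra, labels, eps=1e-8):
    """Per-bin score: inter-class minus mean intra-class variance.

    ASSUMPTION: the abstract only says "inter- and intra-class frequency
    variance statistics"; this particular definition is illustrative.
    spectra: (n_frames, n_bins) array, labels: (n_frames,) class ids.
    """
    classes = np.unique(labels)
    class_means = np.stack([spectra[labels == c].mean(axis=0) for c in classes])
    inter = class_means.var(axis=0)  # variance of class means per bin
    intra = np.mean([spectra[labels == c].var(axis=0) for c in classes], axis=0)
    # Floor at zero and add eps so the score is strictly positive.
    return np.maximum(inter - intra, 0.0) + eps


def equal_mass_edges(score, n_regions):
    """Split bins into contiguous regions holding near-equal total score.

    ASSUMPTION: boundaries are placed where the normalised cumulative
    score crosses k / n_regions; every region keeps at least one bin.
    """
    n_bins = len(score)
    if n_regions > n_bins:
        raise ValueError("n_regions must not exceed the number of bins")
    cum = np.cumsum(score) / np.sum(score)
    edges = [0]
    for k in range(1, n_regions):
        e = int(np.searchsorted(cum, k / n_regions)) + 1
        e = max(e, edges[-1] + 1)             # at least one bin per region
        e = min(e, n_bins - (n_regions - k))  # leave bins for later regions
        edges.append(e)
    edges.append(n_bins)
    return np.array(edges)


def pool_regions(spectra, edges):
    """Average-pool spectral bins within each region, one feature per region."""
    return np.stack(
        [spectra[:, a:b].mean(axis=1) for a, b in zip(edges[:-1], edges[1:])],
        axis=1,
    )
```

In this sketch the region edges would be estimated once from labelled training spectra and then reused unchanged to pool any new utterance, which is what would let the pooled output stand in for another fixed-dimension input feature.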

History

Journal/Conference/Book title

Circuits, Systems, and Signal Processing

Publication date

2021-01-11

Keywords

Language/dialect identification; DNN x-vector; CLSTM x-vector; Variance normalised features

Licence

In Copyright
