An improved mean teacher based method for large scale weakly labeled semi-supervised sound event detection

Zheng, Xu; Song, Yan; McLoughlin, Ian; Liu, Lin; Dai, Li-Rong

doi:10.1109/ICASSP39728.2021.9414931

File(s) not publicly available

conference contribution

posted on 2024-04-03, 04:30 authored by Xu Zheng, Yan Song, Ian McLoughlin, Lin Liu, Li-Rong Dai

This paper presents an improved mean teacher (MT) based method for large-scale weakly labeled semi-supervised sound event detection (SED), focusing on learning a better student model. Two main improvements are proposed over the authors’ previous perturbation-based MT method. Firstly, an event-aware module is designed to allow multiple branches with different kernel sizes to be fused via an attention mechanism. By inserting this module after the convolutional layer, each neuron can adaptively adjust its receptive field to suit different sound events. Secondly, instead of using the teacher model to provide a consistency cost term, we propose using stochastic inference on unlabeled examples to generate high-quality pseudo-targets by averaging multiple predictions from the perturbed student model. MixUp of both labeled and unlabeled data is further exploited to improve the effectiveness of the student model. Finally, the teacher model is obtained via an exponential moving average (EMA) of the student model and generates the final SED predictions during inference. Experiments on the DCASE2018 Task 4 dataset demonstrate the effectiveness of the proposed method. Specifically, an F1-score of 42.1% is achieved, significantly outperforming the 32.4% achieved by the winning system and the 39.3% achieved by the previous perturbation-based method.
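The three training ingredients named in the abstract (pseudo-targets averaged over several perturbed student predictions, MixUp, and an EMA teacher) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the Gaussian-noise stand-in for the perturbed student (`noisy_student`), the number of passes, the decay value, and the mixing weight `lam` are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import random


def ema_update(teacher, student, decay=0.999):
    """Move each teacher weight towards the matching student weight (EMA)."""
    return [decay * t + (1.0 - decay) * s for t, s in zip(teacher, student)]


def noisy_student(x, rng, noise=0.1):
    """Hypothetical perturbed student: adds Gaussian noise and clips to [0, 1].

    A real student would be a neural network with its own perturbations;
    this stand-in only makes repeated predictions differ from each other.
    """
    return [min(1.0, max(0.0, v + rng.gauss(0.0, noise))) for v in x]


def pseudo_target(x, rng, passes=8):
    """Average several stochastic student predictions for one unlabeled example."""
    preds = [noisy_student(x, rng) for _ in range(passes)]
    return [sum(col) / passes for col in zip(*preds)]


def mixup(x_a, y_a, x_b, y_b, lam):
    """Convex combination of two examples and of their (pseudo-)targets."""
    def mix(a, b):
        return [lam * p + (1.0 - lam) * q for p, q in zip(a, b)]
    return mix(x_a, x_b), mix(y_a, y_b)


# Toy usage: one labeled example mixed with one pseudo-labeled example.
rng = random.Random(0)                      # fixed seed for repeatability
unlabeled_x = [0.2, 0.9]
target = pseudo_target(unlabeled_x, rng)    # averaged student predictions
x_mix, y_mix = mixup([1.0, 0.0], [1.0, 0.0], unlabeled_x, target, lam=0.7)
teacher = ema_update([1.0, 1.0], [0.0, 2.0], decay=0.9)
```

Averaging several perturbed predictions reduces the variance of any single stochastic pass, which is the intuition behind using the average as a training target for unlabeled clips. In this scheme the EMA teacher is used only to produce the final predictions at inference time.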

History

Journal/Conference/Book title

2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 06-11 June 2021, Toronto, Ontario, Canada.

Publication date

2021-05-13

Keywords

sound event detection, event-aware, mean-teacher, semi-supervised learning, perturbation

Licence

In Copyright
