Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection

Zeng, Xiao-Min; Song, Yan; Zhuo, Zhu; Zhou, Yu; Li, Yu-Hong; Xue, Hui; Dai, Li-Rong; McLoughlin, Ian

doi:10.1109/ICASSP49357.2023.10095568

File(s) stored somewhere else

https://doi.org/10.1109/ICASSP49357.2023.10095568

Please note: Linked content is NOT stored on Singapore Institute of Technology and we can't guarantee its availability, quality, security or accept any liability.

Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection

conference contribution

posted on 2023-10-01, 00:58 authored by Xiao-Min Zeng, Yan Song, Zhu Zhuo, Yu Zhou, Yu-Hong Li, Hui Xue, Li-Rong Dai, Ian McLoughlinIan McLoughlin

In this paper, we propose a joint generative and contrastive representation learning method (GeCo) for anomalous sound detection (ASD). GeCo exploits a Predictive AutoEncoder (PAE) equipped with self-attention as a generative model to perform frame-level prediction. The output of the PAE together with original normal samples, are used for supervised contrastive representative learning in a multi-task framework. Besides cross-entropy loss between classes, contrastive loss is used to separate PAE output and original samples within each class. GeCo aims to better capture context information among frames, thanks to the self-attention mechanism for PAE model. Furthermore, GeCo combines generative and contrastive learning from which we aim to yield more effective and informative representations, compared to existing methods. Extensive experiments have been conducted on the DCASE2020 Task2 development dataset, showing that GeCo outperforms state-of-the-art generative and discriminative methods.

History

Journal/Conference/Book title

ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Publication date

2023-06-04

Usage metrics

Keywords

predictive autoencoder contrastive learning representation learning anomalous sound detection

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) stored somewhere else

Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection

History

Journal/Conference/Book title

Publication date

Usage metrics

Categories

Keywords

Licence

Exports