Learning Sound Events From Webly Labeled Data

Published in 28th International Joint Conference on Artificial Intelligence (IJCAI), 2019

In the last couple of years, weakly labeled learning for sound events has turned out to be an exciting approach for audio event detection. In this work, we introduce webly labeled learning for sound events, in which we aim to remove human supervision from the learning process altogether. We first develop a method of obtaining labeled (albeit noisy) audio data from the web, with no manual labeling involved. We then describe deep learning methods to efficiently learn from these webly labeled audio recordings. In our proposed system, WeblyNet, two deep neural networks co-teach each other to robustly learn from webly labeled data, leading to around 17% relative improvement over the baseline method. The method also uses transfer learning to obtain efficient representations.

Recommended citation:
@inproceedings{kumar2019learning,
  title={Learning Sound Events From Webly Labeled Data},
  author={Kumar, Anurag and Shah, Ankit and Hauptmann, Alexander G and Raj, Bhiksha},
  booktitle={Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI)},
  pages={2772--2778},
  year={2019}
}
Download Paper