Posts by Collection

portfolio

publications

Experiments on DCASE Challenge 2016 Acoustic Scene Classification and Sound Event Detection in Real Life Recording

Published in IEEE AASP Challenge: Detection and Classification of Acoustic Scenes and Events, 2016

Paper Link

Citation: Elizalde, Benjamin, Anurag Kumar, Ankit Shah, Rohan Badlani, Emmanuel Vincent, Bhiksha Raj, and Ian Lane. "Experimentation on the DCASE challenge 2016: Task 1—Acoustic scene classification and task 3—Sound event detection in real life audio." IEEE AASP Challenge: Detection and Classification of Acoustic Scenes and Events (2016). http://www.cs.tut.fi/sgn/arg/dcase2016/documents/challenge_technical_reports/DCASE2016_Elizalde_3001.pdf

DCASE 2017 challenge setup: tasks, datasets and baseline system

Published in Detection and Classification of Acoustic Scenes and Events 2017 Workshop, 2017

Paper Link

Citation: A. Mesaros, T. Heittola, A. Diment, B. Elizalde, A. Shah, E. Vincent, B. Raj, and T. Virtanen, “DCASE 2017 challenge setup: Tasks, datasets and baseline system,” in Proceedings of the Detection and Classification of Acoustic Scenes and Events 2017 Workshop (DCASE2017), November 2017. http://www.cs.tut.fi/sgn/arg/dcase2017/documents/dcase-2017-challenge-paper.pdf

talks

A Framework towards Large scale Learning of Sound Events

A significant portion of the internet’s multimedia data is video, which contains sounds that often carry meaning. Hence, automatic analysis of audio content for sound events is crucial. Because annotating audio events is time-consuming, the datasets in the current literature are small-scale and audio-only, and apart from AudioSet none of them contain audio from the web. Web videos also have no tags or labels for sound events at the segment level, which adds to the challenge of evaluating sound recognition on a large scale. We introduce a framework for continuous large-scale sound event recognition on web videos consisting of three modules: Crawl, Hear, and Feedback. The modular design allows our framework to scale and evolve as required. The framework has processed 3.5 million video segments, and humans inspected a subset of the segments to evaluate performance on web audio. Poster

volunteer

Teaching Assistant

Elements of Electronics and Communication - EC 110, National Institute of Technology Karnataka Surathkal, Department of Electronics and Communication, 2013

Teaching assistant for the course as part of the peer mentoring program at NITK. Taught Elements of Electronics and Communication to peers from 2013 to 2014.

Teaching Assistant

Data Structures and Algorithms - EC 232, National Institute of Technology Karnataka Surathkal, Department of Electronics and Communication, 2014

Teaching assistant for the course as part of the peer mentoring program at NITK. Taught Data Structures and Algorithms to peers from 2014 to 2015.

Mentor at Junior Academy

Global STEM Alliance, ARM, 2017

  • Unique opportunity to participate in a fast-paced programme to develop research-driven solutions addressing pressing challenges at a global scale.
  • Mentored a young team of students in the wearables challenge, implementing an innovative water filtration system.
  • Demonstrated and presented the idea with a working prototype, eventually winning the innovation challenge.

Contributor/Organizer for Task 4 DCASE 2017 Challenge

IEEE-DCASE 2017 challenge - Task 4 - Large-scale weakly supervised sound event detection for smart cars, Carnegie Mellon University, 2017

Organizer of Task 4 “Large-scale weakly supervised sound event detection for smart cars”. Accountable for code development, audio annotation, and evaluation of papers and system submissions, as well as providing technical support to participants via email and the DCASE forum.

Organizer for Task 4 DCASE 2018 Challenge

IEEE-DCASE 2018 challenge - Task 4 - Large-scale weakly labeled semi-supervised sound event detection in domestic environments, Carnegie Mellon University, 2018

Organizer of Task 4 “Large-scale weakly labeled semi-supervised sound event detection in domestic environments”. Accountable for code development, audio annotation, and evaluation of papers and system submissions, as well as providing technical support to participants via email and the DCASE forum.