Early detection of continuous and partial audio events using CNN

Ian McLoughlin, Yan Song, Lam Pham, Ramaswamy Palaniappan, Huy Phan, Lang Yue

Publikation: Beitrag in Buch oder TagungsbandVortrag mit Beitrag in TagungsbandBegutachtung

Abstract

Sound event detection is an extension of the static auditory classification task into continuous environments, where performance depends jointly upon the detection of overlapping events and their correct classification. Several approaches have been
published to date which either develop novel classifiers or employ well-trained static classifiers with a detection front-end.
This paper takes the latter approach, by combining a proven CNN classifier acting on spectrogram image features, with
time-frequency shaped energy detection that identifies seed regions within the spectrogram that are characteristic of auditory energy events. Furthermore, the shape detector is optimised to
allow early detection of events as they are developing. Since some sound events naturally have longer durations than others, waiting until completion of entire events before classification
may not be practical in a deployed system. The early detection capability of the system is thus evaluated for the classification of partial events. Performance for continuous event detection is
shown to be good, with accuracy being maintained well when detecting partial events.
OriginalspracheEnglisch
TitelINTERSPEECH, 2018
Seiten3314-3318
PublikationsstatusVeröffentlicht - Sept. 2018
VeranstaltungInterspeech 2018 - Hyderabad, Indien
Dauer: 2 Sept. 20186 Sept. 2018

Konferenz

KonferenzInterspeech 2018
Land/GebietIndien
StadtHyderabad
Zeitraum2/09/186/09/18

Research Field

  • Ehemaliges Research Field - Data Science

Fingerprint

Untersuchen Sie die Forschungsthemen von „Early detection of continuous and partial audio events using CNN“. Zusammen bilden sie einen einzigartigen Fingerprint.

Diese Publikation zitieren