Abstract
Typical sound event detection (SED) applications, employed in
real environments, generate huge amounts of unlabeled data
each day. These data can potentially be used to re-train the
underlying machine learning models. However, as the labeling
budget is usually restricted, active learning plays a vital role in
re-training. Especially for applications with sparse event occurrence,
a data selection process is paramount. In this paper we (i)
introduce a novel application for vandalism SED, and (ii) analyze
an active learning scheme for reduced training and annotation
effort.
In the presented system, the employed machine learning classifier
shall recognize various acts of vandalism, i.e., glass breakage
and graffiti spraying. To this end, we utilize embeddings generated
with a pre-trained network and train a recurrent neural
network for event detection. The applied data selection strategy
is based on a mismatch-first, farthest-traversal approach and is
compared to an upper bound by using all available data. Furthermore,
results for the active learning scheme are evaluated
with respect to different labeling budgets and compared to an
active learning scheme with a random sampling scheme.
real environments, generate huge amounts of unlabeled data
each day. These data can potentially be used to re-train the
underlying machine learning models. However, as the labeling
budget is usually restricted, active learning plays a vital role in
re-training. Especially for applications with sparse event occurrence,
a data selection process is paramount. In this paper we (i)
introduce a novel application for vandalism SED, and (ii) analyze
an active learning scheme for reduced training and annotation
effort.
In the presented system, the employed machine learning classifier
shall recognize various acts of vandalism, i.e., glass breakage
and graffiti spraying. To this end, we utilize embeddings generated
with a pre-trained network and train a recurrent neural
network for event detection. The applied data selection strategy
is based on a mismatch-first, farthest-traversal approach and is
compared to an upper bound by using all available data. Furthermore,
results for the active learning scheme are evaluated
with respect to different labeling budgets and compared to an
active learning scheme with a random sampling scheme.
Original language | English |
---|---|
Title of host publication | Book of peer-reviewed papers 10th Congress of the Alps Adria Acoustics Association |
Pages | 142-146 |
Number of pages | 5 |
ISBN (Electronic) | 978-961-94085-2-0 |
Publication status | Published - 21 Sept 2023 |
Event | 10th Congress of the Alps Adria Acoustics Association - Izola, Izola, Slovenia Duration: 20 Sept 2023 → 21 Sept 2023 |
Conference
Conference | 10th Congress of the Alps Adria Acoustics Association |
---|---|
Country/Territory | Slovenia |
City | Izola |
Period | 20/09/23 → 21/09/23 |
Research Field
- Former Research Field - New Sensor Technologies
Keywords
- sound event detection
- active learning
- dataset selection
- vandalism