Microphone Array Based Surveillance Audio Classification
Dimitri L. O. Silva, Tito Spadini, Ricardo Suyama

DOI: 10.14209/SBRT.2020.1570653439
Event: XXXVIII Simpósio Brasileiro de Telecomunicações e Processamento de Sinais (SBrT2020)
Keywords: Audio classification, Microphone array, Support Vector Machine, Delay-and-Sum
Abstract
This work assessed seven classical classifiers and two beamforming algorithms for detecting surveillance sound events. The tests used additive white Gaussian noise (AWGN) at SNRs ranging from -10 dB to 30 dB. Data Augmentation (DA) was also employed to improve the algorithms' performance. The results showed that the combination of SVM and Delay-and-Sum (DaS) achieved the best accuracy (up to 86.0%) but had a high computational cost (≈ 79 ms), mainly due to DaS and DA. SGD also appears to be a good alternative, since it achieved similarly good accuracy (up to 85.3%) with a shorter processing time (≈ 25 ms).
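To make the two signal-processing ingredients named in the abstract concrete, the following is a minimal Python sketch of adding AWGN at a target SNR and of a basic Delay-and-Sum beamformer. It is an illustration only, not the authors' implementation: the function names `add_awgn` and `delay_and_sum`, the use of integer sample delays that are known in advance, the circular shift, and the 440 Hz test tone are all assumptions made for the example.

```python
import numpy as np


def add_awgn(signal, snr_db, rng):
    """Return `signal` plus white Gaussian noise scaled to the requested SNR (dB)."""
    signal_power = np.mean(signal ** 2)
    noise_power = signal_power / (10 ** (snr_db / 10))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise


def delay_and_sum(channels, delays):
    """Undo each channel's integer sample delay, then average across microphones.

    channels: array of shape (n_mics, n_samples)
    delays:   per-microphone delays in samples (assumed known here)
    """
    n_mics, n_samples = channels.shape
    out = np.zeros(n_samples)
    for channel, delay in zip(channels, delays):
        # Circular shift for simplicity; a real system would pad or use
        # fractional delays instead of wrapping samples around.
        out += np.roll(channel, -delay)
    return out / n_mics


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    t = np.arange(16000) / 16000.0
    source = np.sin(2 * np.pi * 440 * t)  # hypothetical test tone

    # Simulate four microphones: delayed copies with independent noise at 0 dB SNR.
    delays = [0, 3, 5, 8]
    channels = np.stack([add_awgn(np.roll(source, d), 0.0, rng) for d in delays])

    beamformed = delay_and_sum(channels, delays)
    print("single-mic noise power:", np.mean((channels[0] - source) ** 2))
    print("beamformed noise power:", np.mean((beamformed - source) ** 2))
```

Because the noise is independent across microphones while the aligned signal is identical, averaging should reduce the residual noise power roughly in proportion to the number of microphones, which is the basic motivation for combining a beamformer with a classifier at low SNR.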
