In recent years, deep learning has achieved high performance in bioacoustic classification tasks by leveraging automatically processed acoustic features for large datasets. However, few performance evaluations of automatically processed acoustic features have been conducted on small-scale data because deep learning requires large datasets. To test whether mel spectrograms (automatically processed acoustic features) are effective for classifying relatively small acoustic data, we evaluated the performance of two classification machines (random forest and support vector machine) using mel-spectrograms of 651 coo calls of six wild female Japanese macaques on two tasks: 1) individual identification and 2) age-class classification between younger (< 10 yrs) and the older animals (> 20 yrs). For the individual identification task, the mean balanced accuracy was 0.81 for random forest and 0.82 for support vector machine. For the age-class classification task, the mean balanced accuracy was 0.91 for random forest and 0.93 for support vector machine. Considering that of all the calls were recorded in the wild, methods using automatically processed acoustic features, such as mel spectrogram, are effective in classifying small acoustic data for the individual identification task. The high performance in the age-class classification task might be attributable to the potential of mel spectrograms to capture the characteristics of older individuals (e.g. harshness).
扫码关注我们
求助内容:
应助结果提醒方式:

