| Detection/classification | Classification technique | Feature extraction technique | Accuracy (%) | Reference |
|---|---|---|---|---|
| Vocal classification | BP + GA | Short-time energy, Frequency centroid, Formant frequency, MFCC | 93.20 | [ |
| | CNN-MobileNet V3 | Fast Fourier transform (FFT), Log-mel spectrogram | 97.52 | [ |
| | MnasNet | ACAM, VAD | 94.72 | [ |
| | SVM, AdaBoost, BiLSTM | MFCC, PSD, CQT, SqueezeNet | 91.41 | [ |
| | SE-DenseNet-121 | MFCC, ΔMFCC, Δ2MFCC | 93.80 | [ |
| | SVM | RMSE, MFCC, ZCR, Centroid, Flatness, Bandwidth, Chroma | 96.45 | [ |
| | CNN, SVM, KNN | DNS | 96.57 | [ |
| | TransformerCNN | MLMC | 96.05 | [ |