Hybrid Particle Swarm Optimization-Gravitational Search Algorithm based Deep Belief Network: Speech Emotion Recognition
Hybrid PSO-GSA based DBN
Keywords:
DBN, Emotion, Gender, Human-Computer Interface, SERAbstract
One of the most important research areas is the Speech Emotion Recognition (SER) technique, which is applied in many fields such as speech processing and human-computer interaction. In general, it is mainly concentrated on using the techniques of machine learning in order to predict the precise emotional category from speech. In affective computing as well as the human-computer interaction, the developed applications of SER are very effective that are considered as the important module of the computer's next-generation system. It is due to the automatic service provisions are granted by the natural human-machine interface that requires an improved approval of user emotional conditions. Hence, this work proposes a novel SER model which integrated both emotion and gender recognition. Various features are extracted and that is fed for the emotions classifications. In this work, the Deep Belief Network (DBN) is exploited. At last, performance analysis of the developed technique is seen that better accuracy rate while comparing with the conventional models. This work proposes a novel technique for the SER model which helps both emotion as well as gender recognition. Here, the proposed Hybrid Particle Swarm Optimization (PSO) and Gravitational Search Algorithm (GSA) algorithm are introduced to identify the optimal weight of the DBN technique.
References
Zijiang ZhuWeihuang DaiJunshan Li,"Speech emotion recognition model based on Bi-GRU and Focal Loss", Pattern Recognition Letters,11 November 2020.
Wanlu ZhengWenming ZhengYuan Zong,"Multi-scale discrepancy adversarial network for crosscorpus speech emotion recognition",Virtual Reality & Intelligent Hardware, 24 February 2021.
Turker TuncerSengul DoganU. Rajendra Acharya,"Automated accurate speech emotion recognition system using twine shuffle pattern and iterative neighborhood component analysis techniques", Knowledge-Based Systems,31 October 2020.
Dongdong LiYijun ZhouDaqi Gao," Exploiting the potentialities of features for speech emotion recognition", Information Sciences,8 October 2020.
Reinert Yosua RumagitGlenn AlexanderIrfan Fahmi Saputra,"Model Comparison in Speech Emotion Recognition for Indonesian Language",Procedia Computer Science,19 February 2021.
Kasiprasad Mannepalli, Panyam Narahari Sastry and Maloji Suman, "A novel Adaptive Fractional Deep Belief Networks for speaker emotion recognition", Alexandria Engineering Journal, October 2016.
C. Ng, " Principle component analysis to reduce dimension on digital image", Procedia Computer Science, vol. 111, pp. 113-119, 2017.
Lijun ZHANG, Zhengguang CHEN, Miao ZHENG and Xiaofei HE, " Robust non-negative matrix factorization", Front. Electr. Electron., vol.6, no.2, pp. 192-200, 2011.
Sira Gonzalez and Mike Brookes, " A Pitch Estimation Filter Robust To High Levels Of Noise (PEFAC)", 19th European Signal Processing Conference, 2011.
Yogeswaran Mohan, Sia Seng Chee, Donica Kan Pei Xin and Lee Poh Foong, " Artificial Neural Network for Classification of Depressive and Normal in EEG", IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES), 2016.
Mirjalili, S.Z.M. Hashim, A new hybrid PSO-GSA algorithm for function optimization, in: International Conference on Computer and Information Application, IEEE, 2010, pp. 374-377.
Cristin R,Gladiss Merlin N.R,Ramanathan L,Vimala S,"Image Forgery Detection Using Back Propagation Neural Network Model and Particle Swarm Optimization Algorithm", Multimedia Research, vol 3, no. 1, January 2020.
Cristin,Dr.V.Cyril Raj and Ramalatha Marimuthu,"Face Image Forgery Detection by Weight Optimized Neural Network Model",Multimedia Research, vol 2, no. 2, April 2019.