Real Time Human Emotion Recognition Using Artificial Neural Networks

T. Muni Reddy, Dr. R.P. Singh .

Abstract


Now a day’s one of the unsolved problem in computer vision is recognizing or understanding other people's emotions and feelings. Although recent methods achieve close to human accuracy in controlled scenarios, the recognition of emotions in the wild remains a challenging problem. In this paper we proposed MAM Pooling (Mean of Average and Maximum) method with CNN to recognize human emotions. We focus on automatic identification of six emotions in real time: Happiness, Anger, Sadness, Surprise, Fear, and Disgust. Convolutional Neural Network (CNN) is a biologically inspired trainable architecture that can learn invariant features for a number of applications. In general, CNNs consist of alternating convolutional layers, non-linearity layers and feature pooling layers. In this work, a Novel feature pooling method, named as MAM pooling is proposed to regularize CNNs, which replaces the deterministic pooling operations with a stochastic procedure by taking the average of  max pooling and average pooling methods. The advantage of the proposed MAM pooling method lies in its wonderful ability to address the over fitting problem encountered by CNN generation.

Full Text:

PDF

References


Fukushima, K. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics 36(4), 193–202 (1980)

LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W.Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural computation 1(4), 541–551 (1989)

Lawrence, S., Giles, C.L., Tsoi, A.C., Back, A.D.: Face recognition: A convolutional neural-network approach. IEEE Transactions on Neural Networks 8(1), 98–113 (1997)

Fan, J., Xu, W., Wu, Y., Gong, Y.: Human tracking using convolutional neural networks. IEEE Transactions on Neural Networks 21(10), 1610–1623 (2010)

Cire¸san, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: CVPR, pp. 3642–3649 (2012)

Ji, S., Xu, W., Yang, M., Yu, K.: 3D convolutional neural networks for human action recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 35(1), 221–231 (2013)

Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS, vol. 1 (2012)

Deng, J., Dong,W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: A large-scale hierarchical image database. In: CVPR, pp. 248–255 (2012)

Montavon, G., Orr, G.B., M¨uller, K.R. (eds.): Neural networks: tricks of the trade, 2nd edn. Spinger, San Francisco (2012)

Zeiler, M.D.: Hierarchical convolutional deep learning in computer vision. PhD thesis, ch. 6, New York University (2014)

Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.R.:Improving neural networks by reventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580 (2012)

Wan, L., Zeiler, M.D., Zhang, S., LeCun, Y., Fergus, R.: Regularization of neural networks using DropConnect. In: ICML, pp. 1058–1066 (2013)

LeCun, Y., Kavukcuoglu, K., Farabet, C.: Convolutional networks and applications in vision. In: ISCAS, pp. 253–256 (2010)

Nair, V., Hinton, G.E.: Rectified linear units improve restricted boltzmann machines. In: ICML, pp. 807–814 (2010)

Jarrett, K., Kavukcuoglu, K., Ranzato, M., LeCun, Y.:What is the best multi-stage architecture for object recognition? In: ICCV, pp. 2146–2153 (2009)




DOI: https://doi.org/10.23956/ijarcsse.v8i6.741

Refbacks

  • There are currently no refbacks.




© International Journals of Advanced Research in Computer Science and Software Engineering (IJARCSSE)| All Rights Reserved | Powered by Advance Academic Publisher.