This paper presents an approach to automatic visual emotion recognition from two modalities: expressive face and body gesture.
Face and body movements are captured simultaneously using two separate cameras. For each face and body image sequence single
“expressive” frames are selected manually for analysis and recognition of emotions. Firstly, individual classifiers are trained
from individual modalities for mono-modal emotion recognition. Secondly, we fuse facial expression and affective body gesture
information at the feature and at the decision-level. In the experiments performed, the emotion classification using the two
modalities achieved a better recognition accuracy outperforming the classification using the individual facial modality. We
further extend the affect analysis into a whole image sequence by a multi-frame post integration approach over the single
frame recognition results. In our experiments, the post integration based on the fusion of face and body has shown to be more
accurate than the post integration based on the facial modality only.