Detecting head movements in video-recorded dyadic conversations

Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › fagfællebedømt

This paper is about the automatic recognition of head movements in videos of face-to-face dyadic conversations. We present an approach where recognition of head movements is casted as a multimodal frame classification problem based on visual and acoustic features. The visual features include velocity, acceleration, and jerk values associated with head movements, while the acoustic ones are pitch and intensity measurements from the co-occuring speech. We present the results obtained by training and testing a number of classifiers on manually annotated data from two conversations. The best performing classifier, a Multilayer Perceptron trained using all the features, obtains 0.75 accuracy and outperforms the mono-modal baseline classifier.

Originalsprog	Engelsk
Titel	Proceedings of the International Conference on Multimodal Interaction: Adjunct
Antal sider	6
Udgivelsessted	New York
Forlag	Association for Computing Machinery
Publikationsdato	2018
Sider	1-6
ISBN (Trykt)	978-1-4503-6002-9
DOI	https://doi.org/10.1145/3281151.3281152
Status	Udgivet - 2018

ID: 209096029