The 14th International Conference on Auditory-Visual Speech Processing (AVSP2017)
ISSN 2308-975X (Online)
Editors: Slim Ouni, Chris Davis, Alexandra Jesse, Jonas Beskow
Publisher: KTH
August 25-26, 2017, Stockholm, Sweden
Special session: Remembering Eric Vatikiotis-Bateson
- Eric Vatikiotis-Bateson and the Birth of AVSP – Denis Burnham, with input from Laurie Fais, Ruth Campbell, Kevin Munhall, Phillip Rubin, and Editor-in-Chief Chris Davis – D1.SP1
S1 – Gaze and Handedness
- Acoustic cue variability affects eye movement behaviour during non-native speech perception: a GAMM model – Jessie S. Nixon and Catherine T. Best – D1.S1.1
- The effect of age and hearing loss on partner-directed gaze in a communicative task – Chris Davis, Jeesun Kim, Outi Tuomainen and Valerie Hazan – D1.S1.2
- Referential Gaze Makes a Difference in Spoken Language Comprehension: Human Speaker vs. Virtual Agent Listener Gaze – Eva Maria Nunnemann, Kirsten Bergmann, Helene Kreysa and Pia Knoeferle – D1.S1.3
- The influence of handedness and pointing direction on deictic gestures and speech interaction: Evidence from motion capture data on Polish counting-out rhymes – Katarzyna Stoltmann and Susanne Fuchs – D1.S1.4
- The Influence of Familial Sinistrality on Audiovisual Speech Perception – Sandhya Vinay and Dawn Behne – D1.S1.5
S2 – AV by machines
- Using deep neural networks to estimate tongue movements from speech face motion – Christian Kroos, Rikke Bundgaard-Nielsen, Catherine Best and Mark D. Plumbley – D1.S2.1
- End-to-End Audiovisual Fusion with LSTMs – Stavros Petridis, Yujiang Wang, Zuwei Li and Maja Pantic – D1.S2.2
- Using visual speech information and perceptually motivated loss functions for binary mask estimation – Danny Websdale and Ben Milner – D1.S2.3
- Combining Multiple Views for Visual Speech Recognition – Marina Zimmermann, Mostafa Mehdipour Ghazi, Hazim Kemal Ekenel and Jean-Philippe Thiran – D1.S2.4
- On the quality of an expressive audiovisual corpus: a case study of acted speech – Slim Ouni, Sara Dahmani and Vincent Colotte – D1.S2.5
- Thin slicing to predict viewer impressions of TED Talks – Ailbhe Cullen and Naomi Harte – D1.S2.6
S3 – Lip reading by machines
- Exploring ROI size in deep learning based lipreading – Alexandros Koumparoulis, Gerasimos Potamianos, Youssef Mroueh and Steven J. Rennie – D1.S3.1
- Towards Lipreading Sentences with Active Appearance Models – George Sterpu and Naomi Harte – D1.S3.2
- Lipreading using deep bottleneck features for optical and depth images – Satoshi Tamura, Koichi Miyazaki and Satoru Hayamizu – D1.S3.3
- Inner Lips Parameter Estimation based on Adaptive Ellipse Model – Li Liu, Gang Feng and Denis Beautemps – D1.S3.4
S4 – Prosody and Timing
- Processing of visuo-auditory prosodic information in cochlear-implanted deaf patients – Pascal Barone, Mathieu Marx and Anne Lasfargues-Delannoy – D2.S4.1
- Acoustic features of multimodal prominences: Do visual beat gestures affect verbal pitch accent realization? – Gilbert Ambrazaitis and David House – D2.S4.2
- Contribution of visual rhythmic information to speech perception in noise – Vincent Aubanel, Cassandra Masters, Jeesun Kim and Chris Davis – D2.S4.3
- Perceived Audiovisual Simultaneity in Speech by Musicians and Nonmusicians: Preliminary Behavioral and Event-Related Potential (ERP) Findings – Dawn Behne, Marzieh Sorati and Magnus Alm – D2.S4.4
S5 – Emotion & Attitudes
- The developmental path of multisensory perception of emotion and phoneme in Japanese speakers – Hisako W. Yamamoto, Misako Kawahara and Akihiro Tanaka – D2.S5.1
- Impact of Culture on the Development of Multisensory Emotion Perception – Misako Kawahara, Disa Sauter and Akihiro Tanaka – D2.S5.2
- Multisensory Perception of Emotion for Human and Chimpanzee Expressions by Humans – Marina Kawase, Ikuma Adachi and Akihiro Tanaka – D2.S5.3
- Cross-Language Perception of Audio-visual Attitudinal Expressions – Hansjörg Mixdorff, Angelika Hönemann, Albert Rilliard, Tan Lee and Matthew Ma – D2.S5.4
- Facial activity of attitudinal speech in German – Angelika Hönemann and Petra Wagner – D2.S5.5
S6 – AV Perception
- The McGurk Effect: Auditory Visual Speech Perception’s Piltdown Man – Dominic Massaro – D2.S6.1
- Impact of early bilingualism on infants’ ability to process talking and non-talking faces: new data from 9-month-old infants – Mathilde Fort and Núria Sebastián-Gallés – D2.S6.2
- Atypical phonemic discrimination but not audiovisual speech integration in children with autism and the broader autism phenotype – Julia Irwin, Trey Avery, Jacqueline Turcios, Lawrence Brancazio, Barbara Cook and Nicole Landi – D2.S6.3
- Learning to recognize unfamiliar talkers from the word-level dynamics of visual speech – Alexandra Jesse and Paul Saba – D2.S6.4
- Applying the summation model in audiovisual speech perception – Kaisa Tiippana, Ilmari Kurki and Tarja Peromaa – D2.S6.5