HCI09 Program

9.30 - 11.05    Oral Session I: Face Analysis and Gesture Recognition

Incorporating Facial Features into a Multi-Channel Gesture Recognition System for the Interpretation of Irish Sign Language Sequences
   Daniel Kelly, Jane Reilly Delannoy, John Mc Donald and Charles Markham

Feature Selection of Facial Displays for Detection of Non Verbal Communication in Natural Conversation
   Tim Sheerman-Chase, Eng-Jon Ong and Richard Bowden

Automatic Detection of Facial Actions from 3D Data
   Arman Savran and Bulent Sankur

Automatic Sign Segmentation from Continuous Signing via Multiple Sequence Alignment
   Pinar Santemiz, Oya Aran, Murat Saraclar and Lale Akarun

11.05 – 11.30   Coffee Break

11.30 – 12.30   Keynote presentation: Understanding How People Communicate With Each Other: Multimodal Approach for Modeling and Recognizing Multiparty Face-to-Face Conversations.

Kazuhiro Otsuka, NTT Corporation, Japan

This talk overviews the research trends and progress in communication scene analysis, especially that made in NTT CSL(CommunicationScience Labs.).The goal is to develop techniques and systems that can automatically understand how people communicate with each other through the exchange of nonverbal messages in face-to-face situations. To that end, we have focused on and developed multimodal approaches for modeling and recognizing multiparty meetings (more than two participants) by exploiting multimodal data from cameras and microphones. This talk introduces our works from the following perspectives. First, I will present a probabilistic framework that can model the relationship between the structure of conversation and peoples’ behaviors including utterances, head pose, gaze direction, and head gestures. Second, for tracking head pose, we have developed a novel particle filter called Memory-Based particle filter (M-PF); it keeps in memory past state estimates, and predicts prior distributions by temporal sampling according to the recurrent probability of the past states in the future. Its advantages include robustness against complex motion, and quick recovery from track loss. Moreover, I will describe a real-time system for meeting analysis that combines speaker diarization and face pose tracking using an omnidirectional camera and microphone array. Finally, I will describe recent progress on the automatic inference of interpersonal emotions that evolve during meetings; the cue is directed smiling, i.e. “who is smiling at whom”.

12.30 - 14.00   Lunch Break

14.00 - 15.35   Oral Session II: Applications

Learning to Rank Images from Eye Movements
   Kitsuchart Pasupa, Craig J. Saunders, Sandor Szedmak, Arto Klami, Samuel Kaski and Steve R. Gunn

The Use of Tongue Protrusion Gestures for Video-based Communication
   Luis Ricardo Sapaico, Hamid Laga and Masayuki Nakajima

Object Recognition in Service Robots: Conducting Verbal Interaction on Color and Spatial Relationship
   Yoshinori Kuno, Katsutoshi Sakata and Yoshinori Kobayashi

Is Gender Recognition Affected by Age?
   Guodong Guo, Charles Dyer, Yun Fu and Thomas Huang

15.35 – 16.00   Coffee Break

16.00 - 17.15     Oral Session III: Tracking

Tracking Humans Interacting with the Environment Using Efficient Hierarchical Sampling and Layered Observation Models
  Jan Bandouch and Michael Beetz

Shoulder Gesture Interface for Operating Electric Wheelchair
   Nobuhiko Sato, Ikushi Yoda and Takenobu Inoue

Real-Time Motion Control Using Pose Space Probability Density Estimation
   Dumebi Okwechime and Richard Bowden

17.15 - 18.00   Panel