Multimodal AI: Understanding Human Behaviors

6th April 2021

Lecture by Prof. LP Morency Multimodal AI: Understanding Human Behaviors

Abstract

Human face-to-face communication is a little like a dance, in that participants continuously adjust their behaviors based on verbal and nonverbal cues from the social context. Today’s computers and interactive devices are still lacking many of these human-like abilities to hold fluid and natural interactions. Leveraging recent advances in machine learning, audio-visual signal processing and computational linguistic, my research focuses on creating computational technologies able to analyze, recognize and predict human subtle communicative behaviors in social context. Central to this research effort is the introduction of new probabilistic models able to learn the temporal and fine-grained latent dependencies across behaviors, modalities and interlocutors. In this talk, I will present some of our recent achievements in multimodal machine learning, addressing five core challenges: representation, alignment, fusion, translation and co-learning.

Short CV

Multimodal Interaction, Machine Learning, Computer Vision: 16k h-index: 68 Speaker’s bio: Louis-Philippe Morency is Associate Professor in the Language Technology Institute at Carnegie Mellon University where he leads the Multimodal Communication and Machine Learning Laboratory (MultiComp Lab). He was formerly research faculty in the Computer Sciences Department at University of Southern California and received his Ph.D. degree from MIT Computer Science and Artificial Intelligence Laboratory. His research focuses on building the computational foundations to enable computers with the abilities to analyze, recognize and predict subtle human communicative behaviors during social interactions. He received diverse awards including AI’s 10 to Watch by IEEE Intelligent Systems, NetExplo Award in partnership with UNESCO and 10 best paper awards at IEEE and ACM conferences. His research was covered by media outlets such as Wall Street Journal, The Economist and NPR.

Presentation

PDF & VIDEO

More events

2024. 09. 27

Back to List

6th April 2021

Lecture by Prof. LP Morency Multimodal AI: Understanding Human Behaviors

More events

Geometric deep learning and its application for multimedia

VISion Understanding and Machine intelligence (VISUM)

Decentralized DNN Architectures for Visual Data Analysis

About

PHD Studies

Activities

Resources

User Login

<img src="https://www.i-aida.org/wp-content/themes/twentytwentyone-child/images/bach.svg" />Back to List