Louis-Philippe Morency

Orcid: 0000-0001-6376-7696

Affiliations:
  • Carnegie Mellon University


According to our database1, Louis-Philippe Morency authored at least 361 papers between 2001 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Foundations & Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions.
ACM Comput. Surv., October, 2024

Isolated Causal Effects of Natural Language.
CoRR, 2024

IoT-LM: Large Multisensory Language Models for the Internet of Things.
CoRR, 2024

HEMM: Holistic Evaluation of Multimodal Foundation Models.
CoRR, 2024

Improving Dialogue Agents by Decomposing One Global Explicit Annotation with Local Implicit Multimodal Feedback.
CoRR, 2024

Optimizing Language Models for Human Preferences is a Causal Inference Problem.
CoRR, 2024

Let's Dance Together! AI Dancers Can Dance to Your Favorite Music and Style.
Proceedings of the Companion Proceedings of the 26th International Conference on Multimodal Interaction, 2024

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Global Reward to Local Rewards: Multimodal-Guided Decomposition for Improving Dialogue Agents.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Think Twice: Perspective-Taking Improves Large Language Models' Theory-of-Mind Capabilities.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

High-Modality Multimodal Transformer: Quantifying Modality & Interaction Heterogeneity for High-Modality Representation Learning.
Trans. Mach. Learn. Res., 2023

MultiZoo and MultiBench: A Standardized Toolkit for Multimodal Deep Learning.
J. Mach. Learn. Res., 2023

MMOE: Mixture of Multimodal Interaction Experts.
CoRR, 2023

MultiIoT: Towards Large-scale Multisensory Learning for the Internet of Things.
CoRR, 2023

Comparative Knowledge Distillation.
CoRR, 2023

MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning.
CoRR, 2023

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy.
CoRR, 2023

Quantifying & Modeling Feature Interactions: An Information Decomposition Framework.
CoRR, 2023

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Neural Mixed Effects for Nonlinear Personalized Predictions.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Representation Learning for Interpersonal and Multimodal Behavior Dynamics: A Multiview Extension of Latent Change Score Models.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Expanding the Role of Affective Phenomena in Multimodal Interaction Research.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Tutorial on Multimodal Machine Learning: Principles, Challenges, and Open Questions.
Proceedings of the International Conference on Multimodal Interaction, 2023

Multimodal Fusion Interactions: A Study of Human and Automatic Quantification.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

SHAP-based Prediction of Mother's History of Depression to Understand the Influence on Child Behavior.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

MultiViz: Towards Visualizing and Understanding Multimodal Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Continual Learning for Personalized Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Face-to-Face Contrastive Learning for Social Intelligence Question-Answering.
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023

Multimodal Feature Selection for Detecting Mothers' Depression in Dyadic Interactions with their Adolescent Offspring.
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023

Difference-Masking: Choosing What to Mask in Continued Pretraining.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Counterfactual Augmentation for Multimodal Learning Under Presentation Bias.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Text-Transport: Toward Learning Causal Effects of Natural Language.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Understanding Masked Autoencoders via Hierarchical Latent Variable Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MultiViz: Towards User-Centric Visualizations and Interpretations of Multimodal Models.
Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

SenteCon: Leveraging Lexicons to Learn Human-Interpretable Language Representations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
SeedBERT: Recovering Annotator Rating Distributions from an Aggregated Label.
CoRR, 2022

Paraphrasing Is All You Need for Novel Object Captioning.
CoRR, 2022

Foundations and Recent Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions.
CoRR, 2022

Multimodal Lecture Presentations Dataset: Understanding Multimodality in Educational Slides.
CoRR, 2022

Face-to-Face Contrastive Learning for Social Intelligence Question-Answering.
CoRR, 2022

MultiViz: An Analysis Benchmark for Visualizing and Understanding Multimodal Models.
CoRR, 2022

HighMMT: Towards Modality and Task Generalization for High-Modality Representation Learning.
CoRR, 2022

Paraphrasing Is All You Need for Novel Object Captioning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Toward Causal Understanding of Therapist-Client Relationships: A Study of Language Modality and Social Entrainment.
Proceedings of the International Conference on Multimodal Interaction, 2022

What is Multimodal?
Proceedings of the International Conference on Multimodal Interaction, 2022

Conditional Contrastive Learning with Kernel.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Weakly-supervised Contrastive Representations.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Beyond Additive Fusion: Learning Non-Additive Multimodal Interactions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

PACS: A Dataset for Physical Audiovisual CommonSense Reasoning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Low-Resource Adaptation for Personalized Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DIME: Fine-grained Interpretations of Multimodal Models via Disentangled Local Explanations.
Proceedings of the AIES '22: AAAI/ACM Conference on AI, Ethics, and Society, Oxford, United Kingdom, May 19, 2022

HOLM: Hallucinating Objects with Language Models for Referring Expression Recognition in Partially-Observed Scenes.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Language Use in Mother-Adolescent Dyadic Interaction: Preliminary Results.
Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, 2022

2021
Relay Variational Inference: A Method for Accelerated Encoderless VI.
CoRR, 2021

Integrating Auxiliary Information in Self-supervised Learning.
CoRR, 2021

Conditional Contrastive Learning: Removing Undesirable Information in Self-Supervised Representations.
CoRR, 2021

A Note on Connecting Barlow Twins with Negative-Sample-Free Contrastive Learning.
CoRR, 2021

Understanding the Tradeoffs in Client-Side Privacy for Speech Recognition.
CoRR, 2021

StarNet: Gradient-free Training of Deep Generative Models using Determined System of Linear Equations.
CoRR, 2021

MultiBench: Multiscale Benchmarks for Multimodal Representation Learning.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

MTAG: Modal-Temporal Attention Graph for Unaligned Human Multimodal Language Sequences.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Social Signals and Multimedia: Past, Present, Future.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multimodal and Multitask Approach to Listener's Backchannel Prediction: Can Prediction of Turn-changing and Turn-management Willingness Improve Backchannel Modeling?
Proceedings of the IVA '21: ACM International Conference on Intelligent Virtual Agents, 2021

Towards Understanding and Mitigating Social Biases in Language Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

Human-Guided Modality Informativeness for Affective States.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment Analysis.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

M2H2: A Multimodal Multiparty Hindi Dataset For Humor Recognition in Conversations.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

Crossmodal Clustered Contrastive Learning: Grounding of Spoken Language to Gesture.
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021

Self-supervised Representation Learning with Relative Predictive Coding.
Proceedings of the 9th International Conference on Learning Representations, 2021

Self-supervised Learning from a Multi-view Perspective.
Proceedings of the 9th International Conference on Learning Representations, 2021

Goals, Tasks, and Bonds: Toward the Computational Assessment of Therapist Versus Client Perception of Working Alliance.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

Understanding the Tradeoffs in Client-side Privacy for Downstream Speech Tasks.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Humor Knowledge Enriched Transformer for Understanding Multimodal Humor.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Foundations of Multimodal Co-learning.
Inf. Fusion, 2020

Multimodal Privacy-preserving Mood Prediction from Mobile Data: A Preliminary Study.
CoRR, 2020

Unsupervised Domain Adaptation for Visual Navigation.
CoRR, 2020

MTGAT: Multimodal Temporal Graph Attention Networks for Unaligned Human Multimodal Language Sequences.
CoRR, 2020

What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets.
CoRR, 2020

Demystifying Self-Supervised Learning: An Information-Theoretical Framework.
CoRR, 2020

Improving Aspect-Level Sentiment Analysis with Aspect Extraction.
CoRR, 2020

Interpretable Multimodal Routing for Human Multimodal Language.
CoRR, 2020

Learning Not to Learn in the Presence of Noisy Labels.
CoRR, 2020

Think Locally, Act Globally: Federated Learning with Local and Global Representations.
CoRR, 2020

Neural Methods for Point-wise Dependency Estimation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Can Prediction of Turn-management Willingness Improve Turn-changing Modeling?
Proceedings of the IVA '20: ACM International Conference on Intelligent Virtual Agents, 2020

Impact of Personality on Nonverbal Behavior Generation.
Proceedings of the IVA '20: ACM International Conference on Intelligent Virtual Agents, 2020

Depression Severity Assessment for Adolescents at High Risk of Mental Disorders.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

Toward Multimodal Modeling of Emotional Expressiveness.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

Simple and Effective Approaches for Uncertainty Prediction in Facial Action Unit Intensity Regression.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020

CMU-MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform Gestures.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Diverse and Admissible Trajectory Forecasting Through Multimodal Context Understanding.
Proceedings of the Computer Vision - ECCV 2020, 2020

Style Transfer for Co-speech Gesture Animation: A Multi-speaker Conditional-Mixture Approach.
Proceedings of the Computer Vision - ECCV 2020, 2020

On Emergent Communication in Competitive Multi-Agent Teams.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Integrating Multimodal Information in Large Pretrained Transformers.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Towards Debiasing Sentence Representations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Language to Network: Conditional Parameter Adaptation with Natural Language Descriptions.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Refer360$^\circ$: A Referring Expression Recognition Dataset in 360$^\circ$ Images.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Context-Dependent Models for Predicting and Characterizing Facial Expressiveness.
Proceedings of the 3rd Workshop on Affective Content Analysis (AffCon 2020) co-located with Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020), 2020

2019
Learning Pose-Aware Models for Pose-Invariant Face Recognition in the Wild.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Multimodal Machine Learning: A Survey and Taxonomy.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Spontaneous smile intensity estimation by fusing saliency maps and convolutional neural networks.
J. Electronic Imaging, 2019

Pseudo-Encoded Stochastic Variational Inference.
CoRR, 2019

Factorized Multimodal Transformer for Multimodal Sequential Learning.
CoRR, 2019

WildMix Dataset and Spectro-Temporal Transformer Model for Monoaural Audio Source Separation.
CoRR, 2019

M-BERT: Injecting Multimodal Information in the BERT Structure.
CoRR, 2019

Report of 2017 NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps.
CoRR, 2019

Variational Auto-Decoder.
CoRR, 2019

Sensing Affective Response to Visual Narratives.
IEEE Comput. Intell. Mag., 2019

Induced Attention Invariance: Defending VQA Models against Adversarial Attacks.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019

Deep Gamblers: Learning to Abstain with Portfolio Theory.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Strong and Simple Baselines for Multimodal Utterance Embeddings.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

PANEL: Challenges for Multimedia/Multimodal Research in the Next Decade.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Bag-of-Acoustic-Words for Mental Health Assessment: A Deep Autoencoding Approach.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multimodal Behavioral Markers Exploring Suicidal Intent in Social Media Videos.
Proceedings of the International Conference on Multimodal Interaction, 2019

ElderReact: A Multimodal Dataset for Recognizing Emotional Response in Aging Adults.
Proceedings of the International Conference on Multimodal Interaction, 2019

To React or not to React: End-to-End Visual Pose Forecasting for Personalized Avatar during Dyadic Conversations.
Proceedings of the International Conference on Multimodal Interaction, 2019

Learning Factorized Multimodal Representations.
Proceedings of the 7th International Conference on Learning Representations, 2019

Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

UR-FUNNY: A Multimodal Language Dataset for Understanding Humor.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Video Relationship Reasoning Using Gated Spatio-Temporal Energy Graph.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Multimodal Transformer for Unaligned Multimodal Language Sequences.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Reconsidering the Duchenne Smile: Indicator of Positive Emotion or Artifact of Smile Intensity?
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019

Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal Behaviors.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Found in Translation: Learning Robust Joint Representations by Cyclic Translations between Modalities.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Language2Pose: Natural Language Grounded Pose Forecasting.
Proceedings of the 2019 International Conference on 3D Vision, 2019

2018
Multimodal Polynomial Fusion for Detecting Driver Distraction.
CoRR, 2018

GazeDirector: Fully Articulated Eye Gaze Redirection in Video.
Comput. Graph. Forum, 2018

Factorized Convolutional Networks: Unsupervised Fine-Tuning for Image Clustering.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Speaker-Follower Models for Vision-and-Language Navigation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Conversational Memory Network for Emotion Recognition in Dyadic Dialogue Videos.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Visual Referring Expression Recognition: What Do Systems Actually Learn?
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Multimodal Polynomial Fusion for Detecting Driver Distraction.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Toward Objective, Multifaceted Characterization of Psychotic Disorders: Lexical, Structural, and Disfluency Markers of Spoken Language.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Multimodal Local-Global Ranking Fusion for Emotion Recognition.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Edge Convolutional Network for Facial Action Intensity Estimation.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

Toward Visual Behavior Markers of Suicidal Ideation.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

OpenFace 2.0: Facial Behavior Analysis Toolkit.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

Multimodal Language Analysis with Recurrent Multistage Fusion.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Efficient Low-rank Multimodal Fusion With Modality-Specific Factors.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Multimodal Language Analysis in the Wild: CMU-MOSEI Dataset and Interpretable Dynamic Fusion Graph.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Multi-attention Recurrent Network for Human Communication Comprehension.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Memory Fusion Network for Multi-view Sequential Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Using Syntax to Ground Referring Expressions in Natural Images.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Lattice Recurrent Unit: Improving Convergence and Statistical Efficiency for Sequence Modeling.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Perspectives on predictive power of multimodal deep learning: surprises and future directions.
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations, 2018

Challenges and applications in multimodal machine learning.
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations, 2018

2017
Adolescent Suicidal Risk Assessment in Clinician-Patient Interaction.
IEEE Trans. Affect. Comput., 2017

MultiSense - Context-Aware Nonverbal Behavior Analysis Framework: A Psychological Distress Use Case.
IEEE Trans. Affect. Comput., 2017

Reporting Mental Health Symptoms: Breaking Down Barriers to Care with Virtual Human Interviewers.
Frontiers Robotics AI, 2017

Combating Human Trafficking with Deep Multimodal Models.
CoRR, 2017

Deducing the severity of psychiatric symptoms from the human voice.
CoRR, 2017

Preserving Intermediate Objectives: One Simple Trick to Improve Learning for Hierarchical Models.
CoRR, 2017

Temporally Selective Attention Model for Social and Affective State Recognition in Multimedia Content.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Computational Analysis of Acoustic Descriptors in Psychotic Patients.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multimodal sentiment analysis with word-level fusion and reinforcement learning.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Automatically predicting human knowledgeability through non-verbal cues.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Select-additive learning: Improving generalization in multimodal sentiment analysis.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Multi-level Multiple Attentions for Contextual Multimodal Sentiment Analysis.
Proceedings of the 2017 IEEE International Conference on Data Mining, 2017

Convolutional Experts Constrained Local Model for 3D Facial Landmark Detection.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Combining Sequential Geometry and Texture Features for Distinguishing Genuine and Deceptive Emotions.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Constrained Ensemble Initialization for Facial Landmark Tracking in Video.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Investigating Facial Behavior Indicators of Suicidal Ideation.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Local-Global Landmark Confidences for Face Recognition.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Curriculum Learning for Facial Expression Recognition.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Tensor Fusion Network for Multimodal Sentiment Analysis.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Convolutional Experts Constrained Local Model for Facial Landmark Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Temporal Attention-Gated Model for Robust Sequence Classification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Exceptionally Social: Design of an Avatar-Mediated Interactive System for Promoting Social Skills in Children with Autism.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

Combating Human Trafficking with Multimodal Deep Models.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Context-Dependent Sentiment Analysis in User-Generated Videos.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Multimodal Machine Learning: Integrating Language, Vision and Speech.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Affect-LM: A Neural Language Model for Customizable Affective Text Generation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Visual attention in schizophrenia: Eye contact and gaze aversion during clinical interactions.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Hand2Face: Automatic synthesis and recognition of hand over face occlusions.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Local-global ranking for facial expression intensity estimation.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Integrating Verbal and Nonvebval Input into a Dynamic Response Spoken Dialogue System.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Multimodal Analysis and Prediction of Persuasiveness in Online Social Multimedia.
ACM Trans. Interact. Intell. Syst., 2016

Self-Reported Symptoms of Depression and PTSD Are Associated with Reduced Vowel Space in Screening Interviews.
IEEE Trans. Affect. Comput., 2016

Multimodal Sentiment Intensity Analysis in Videos: Facial Gestures and Verbal Messages.
IEEE Intell. Syst., 2016

MOSI: Multimodal Corpus of Sentiment Intensity and Subjectivity Analysis in Online Opinion Videos.
CoRR, 2016

Deep Constrained Local Models for Facial Landmark Detection.
CoRR, 2016

Select-Additive Learning: Improving Cross-individual Generalization in Multimodal Sentiment Analysis.
CoRR, 2016

Video Analysis for Body-worn Cameras in Law Enforcement.
CoRR, 2016

Visualizing and Understanding Curriculum Learning for Long Short-Term Memory Networks.
CoRR, 2016

The Future Belongs to the Curious: Towards Automatic Understanding and Recognition of Curiosity in Children.
Proceedings of the 5th Workshop on Child Computer Interaction, 2016

OpenFace: An open source facial behavior analysis toolkit.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Keynote - Modeling Human Communication Dynamics.
Proceedings of the SIGDIAL 2016 Conference, 2016

Automatic Behavior Analysis During a Clinical Interview with a Virtual Human.
Proceedings of the Medicine Meets Virtual Reality 22 - NextMed, 2016

A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Manipulating the Perception of Virtual Audiences Using Crowdsourced Behaviors.
Proceedings of the Intelligent Virtual Agents - 16th International Conference, 2016

Recognizing Human Actions in the Motion Trajectories of Shapes.
Proceedings of the 21st International Conference on Intelligent User Interfaces, 2016

Representation Learning for Speech Emotion Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Deep multimodal fusion for persuasiveness prediction.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

EmoReact: a multimodal approach and dataset for recognizing emotional responses in children.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

An unsupervised approach to glottal inverse filtering.
Proceedings of the 24th European Signal Processing Conference, 2016

A 3D Morphable Model of the Eye Region.
Proceedings of the 37th Annual Conference of the European Association for Computer Graphics, 2016

Learning an appearance-based gaze estimator from one million synthesised images.
Proceedings of the Ninth Biennial ACM Symposium on Eye Tracking Research & Applications, 2016

Unsupervised Text Recap Extraction for TV Series.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Riding an emotional roller-coaster: A multimodal study of young child's math problem solving activities.
Proceedings of the 9th International Conference on Educational Data Mining, 2016

A 3D Morphable Eye Region Model for Gaze Estimation.
Proceedings of the Computer Vision - ECCV 2016, 2016

Extending Long Short-Term Memory for Multi-View Structured Learning.
Proceedings of the Computer Vision - ECCV 2016, 2016

Holistically Constrained Local Model: Going Beyond Frontal Poses for Facial Landmark Detection.
Proceedings of the British Machine Vision Conference 2016, 2016

2015
I Can Already Guess Your Answer: Predicting Respondent Reactions during Dyadic Negotiation.
IEEE Trans. Affect. Comput., 2015

Preface of pattern recognition in human computer interaction.
Pattern Recognit. Lett., 2015

Variational Infinite Hidden Conditional Random Fields.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Automatic nonverbal behavior indicators of depression and PTSD: the effect of gender.
J. Multimodal User Interfaces, 2015

Learning Representations of Affect from Speech.
CoRR, 2015

NRGsuite: a PyMOL plugin to perform docking simulations in real time using FlexAID.
Bioinform., 2015

Predicting Co-verbal Gestures: A Deep and Temporal Modeling Approach.
Proceedings of the Intelligent Virtual Agents - 15th International Conference, 2015

Multimodal Public Speaking Performance Assessment.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Exploring Behavior Representation for Learning Analytics.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

ERM4CT 2015: Workshop on Emotion Representations and Modelling for Companion Systems.
Proceedings of the International Workshop on Emotion Representations and Modelling for Companion Technologies, 2015

Combining Two Perspectives on Classifying Multimodal Data for Recognizing Speaker Traits.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Acoustic and para-verbal indicators of persuasiveness in social multimedia.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Reduced vowel space is a robust indicator of psychological distress: A cross-corpus analysis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Exploring feedback strategies to improve public speaking: an interactive virtual audience framework.
Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2015

Exploring the Implications of Virtual Human Research for Human-Robot Teams.
Proceedings of the Virtual, Augmented and Mixed Reality, 2015

Time-slice Prediction of Dyadic Human Activities.
Proceedings of the British Machine Vision Conference 2015, 2015

Automatic assessment and analysis of public speaking anxiety: A virtual audience case study.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

A demonstration of the perception system in SimSensei, a virtual human application for healthcare interviews.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

A multi-label convolutional neural network approach to cross-domain action unit detection.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

SimSensei Demonstration: A Perceptive Virtual Human Interviewer for Healthcare Applications.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Automatic audiovisual behavior descriptors for psychological disorder analysis.
Image Vis. Comput., 2014

It's only a computer: Virtual humans increase willingness to disclose.
Comput. Hum. Behav., 2014

Relative facial action unit detection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Adolescent suicidal risk assessment in clinician-patient interaction: A study of verbal and acoustic behaviors.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

A Demonstration of Dialogue Processing in SimSensei Kiosk.
Proceedings of the SIGDIAL 2014 Conference, 2014

Search Strategies for Pattern Identification in Multimodal Data: Three Case Studies.
Proceedings of the International Conference on Multimedia Retrieval, 2014

The Distress Analysis Interview Corpus of human and computer interviews.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Towards Learning Nonverbal Identities from the Web: Automatically Identifying Visually Accentuated Words.
Proceedings of the Intelligent Virtual Agents - 14th International Conference, 2014

Toward crowdsourcing micro-level behavior annotations: the challenges of interface, training, and generalization.
Proceedings of the 19th International Conference on Intelligent User Interfaces, 2014

Dyadic Behavior Analysis in Depression Severity Assessment Interviews.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Computational Analysis of Persuasiveness in Social Multimedia: A Novel Dataset and Multimodal Prediction Approach.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

A Multimodal Context-based Approach for Distress Assessment.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Context-based signal descriptors of heart-rate variability for anxiety assessment.
Proceedings of the IEEE International Conference on Acoustics, 2014

Continuous Conditional Neural Fields for Structured Regression.
Proceedings of the Computer Vision - ECCV 2014, 2014

It's only a computer: the impact of human-agent interaction in clinical interviews.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

SimSensei kiosk: a virtual human interviewer for healthcare decision support.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

An interactive virtual audience platform for public speaking training.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Verbal Behaviors and Persuasiveness in Online Multimedia Content.
Proceedings of the Second Workshop on Natural Language Processing for Social Media, 2014

2013
Infinite Hidden Conditional Random Fields for Human Behavior Analysis.
IEEE Trans. Neural Networks Learn. Syst., 2013

Latent Mixture of Discriminative Experts.
IEEE Trans. Multim., 2013

YouTube Movie Reviews: Sentiment Analysis in an Audio-Visual Context.
IEEE Intell. Syst., 2013

Multimodal Sentiment Analysis of Spanish Online Videos.
IEEE Intell. Syst., 2013

Verbal indicators of psychological distress in interactive dialogue with a virtual human.
Proceedings of the SIGDIAL 2013 Conference, 2013

Variational Hidden Conditional Random Fields with Coupled Dirichlet Process Mixtures.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

User-State Sensing for Virtual Health Agents and TeleHealth Applications.
Proceedings of the Medicine Meets Virtual Reality 20 - NextMed, 2013

The Similar Segments in Social Speech Task.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Prediction of Visual Backchannels in the Absence of Visual Context Using Mutual Influence.
Proceedings of the Intelligent Virtual Agents - 13th International Conference, 2013

All Together Now - Introducing the Virtual Human Toolkit.
Proceedings of the Intelligent Virtual Agents - 13th International Conference, 2013

Cicero - Towards a Multimodal Virtual Audience Platform for Public Speaking Training.
Proceedings of the Intelligent Virtual Agents - 13th International Conference, 2013

Investigating voice quality as a speaker-independent indicator of depression and PTSD.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Prediction of strategy and outcome as negotiation unfolds by using basic verbal and behavioral features.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A comparative study of glottal open quotient estimation techniques.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Learning a sparse codebook of facial and body microexpressions for emotion recognition.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Audiovisual behavior descriptors for depression assessment.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

ICMI 2013 grand challenge workshop on multimodal learning analytics.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Who is persuasive?: the role of perceived personality and communication modality in social multimedia.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Interactive relevance search and modeling: support for expert-driven analysis of multimodal data.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Automatic multimodal descriptors of rhythmic body movement.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Speaker-adaptive multimodal prediction model for listener responses.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Constrained Local Neural Fields for Robust Facial Landmark Detection in the Wild.
Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Speaker trait characterization in web videos: Uniting speech, language, and facial features.
Proceedings of the IEEE International Conference on Acoustics, 2013

Investigating the speech characteristics of suicidal adolescents.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speaker and language independent voice quality classification applied to unlabelled corpora of expressive speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

Distribution-sensitive learning for imbalanced datasets.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Automatic behavior descriptors for psychological disorder analysis.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Sequential emotion recognition using Latent-Dynamic Conditional Neural Fields.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Action Recognition by Hierarchical Sequence Summarization.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Utterance-Level Multimodal Sentiment Analysis.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Automatic Nonverbal Behavior Indicators of Depression and PTSD: Exploring Gender Differences.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

Mutual Behaviors during Dyadic Negotiation: Automatic Prediction of Respondent Reactions.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

Fifth International Workshop on Affective Interaction in Natural Environments (AFFINE 2013): Interacting with Affective Artefacts in the Wild.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

2012
Introduction to the special issue on affective interaction in natural environments.
ACM Trans. Interact. Intell. Syst., 2012

Exploring the effect of illumination on automatic expression recognition using the ICT-3DRFE database.
Image Vis. Comput., 2012

Investigating the influence of virtual peers as dialect models on students' prosodic inventory.
Proceedings of the Third Workshop on Child, Computer and Interaction, 2012

Dialogue Act Recognition using Reweighted Speaker Adaptation.
Proceedings of the SIGDIAL 2012 Conference, 2012

Crowdsourcing micro-level multimedia annotations: the challenges of evaluation and interface.
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia, 2012

Perception Markup Language: Towards a Standardized Representation of Perceived Nonverbal Behaviors.
Proceedings of the Intelligent Virtual Agents - 12th International Conference, 2012

Multimodal human behavior analysis: learning correlation and interaction across modalities.
Proceedings of the International Conference on Multimodal Interaction, 2012

1st international workshop on multimodal learning analytics: extended abstract.
Proceedings of the International Conference on Multimodal Interaction, 2012

I already know your answer: using nonverbal behaviors to predict immediate outcomes in a dyadic negotiation.
Proceedings of the International Conference on Multimodal Interaction, 2012

Step-wise emotion recognition using concatenated-HMM.
Proceedings of the International Conference on Multimodal Interaction, 2012

Structural and temporal inference search (STIS): pattern identification in multimodal data.
Proceedings of the International Conference on Multimodal Interaction, 2012

Towards sensing the influence of visual narratives on human affect.
Proceedings of the International Conference on Multimodal Interaction, 2012

Multi-view latent variable discriminative models for action recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

3D Constrained Local Model for rigid and non-rigid facial tracking.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Gesture-based Object Recognition using Histograms of Guiding Strokes.
Proceedings of the British Machine Vision Conference, 2012

Towards building a virtual counselor: modeling nonverbal behavior during intimate self-disclosure.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

2011
Computational study of human communication dynamic.
Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding, 2011

Modeling Nonverbal Behavior of a Virtual Counselor during Intimate Self-disclosure.
Proceedings of the Intelligent Virtual Agents - 11th International Conference, 2011

Virtual Rapport 2.0.
Proceedings of the Intelligent Virtual Agents - 11th International Conference, 2011

Towards multimodal sentiment analysis: harvesting opinions from the web.
Proceedings of the 13th International Conference on Multimodal Interfaces, 2011

Effect of illumination on automatic expression recognition: A novel 3D relightable facial database.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Modeling hidden dynamics of multimodal cues for spontaneous agreement and disagreement recognition.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Fusing Symbolic and Decision-Theoretic Problem Solving + Perception in a Graphical Cognitive Architecture.
Proceedings of the Biologically Inspired Cognitive Architectures 2011, 2011

A multimodal end-of-turn prediction model: learning from parasocial consensus sampling.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Modeling Wisdom of Crowds Using Latent Mixture of Discriminative Experts.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Modeling Latent Discriminative Dynamic of Multi-dimensional Affective Signals.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

Are You Friendly or Just Polite? - Analysis of Smiles in Spontaneous Face-to-Face Interactions.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

Machine Learning for Affective Computing.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

2010
Modeling Human Communication Dynamics [Social Sciences].
IEEE Signal Process. Mag., 2010

Monocular head pose estimation using generalized adaptive view-based appearance model.
Image Vis. Comput., 2010

A probabilistic multimodal approach for predicting listener backchannels.
Auton. Agents Multi Agent Syst., 2010

3rd international workshop on affective interaction in natural environments (AFFINE).
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Learning Backchannel Prediction Model from Parasocial Consensus Sampling: A Subjective Evaluation.
Proceedings of the Intelligent Virtual Agents, 10th International Conference, 2010

Concensus of Self-features for Nonverbal Behavior Analysis.
Proceedings of the Human Behavior Understanding, First International Workshop, 2010

Learning and evaluating response prediction models using parallel listener consensus.
Proceedings of the 12th International Conference on Multimodal Interfaces / 7. International Workshop on Machine Learning for Multimodal Interaction, 2010

Latent Mixture of Discriminative Experts for Multimodal Prediction Modeling.
Proceedings of the COLING 2010, 2010

Parasocial consensus sampling: combining multiple perspectives to learn virtual human behavior.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

2008
AI's 10 to Watch.
IEEE Intell. Syst., 2008

Reducing drift in differential tracking.
Comput. Vis. Image Underst., 2008

Predicting Listener Backchannels: A Probabilistic Multimodal Approach.
Proceedings of the Intelligent Virtual Agents, 8th International Conference, 2008

Context-based recognition during human interactions: automatic feature selection and encoding dictionary.
Proceedings of the 10th International Conference on Multimodal Interfaces, 2008

Generalized adaptive view-based appearance model: Integrated framework for monocular head pose estimation.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Modeling Latent-Dynamic in Shallow Parsing: A Latent Conditional Model with Imrpoved Inference.
Proceedings of the COLING 2008, 2008

2007
Hidden Conditional Random Fields.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Head gestures for perceptual interfaces: The role of context in improving recognition.
Artif. Intell., 2007

Conditional Sequence Model for Context-Based Recognition of Gaze Aversion.
Proceedings of the Machine Learning for Multimodal Interaction , 2007

Can Virtual Humans Be More Engaging Than Real Ones?
Proceedings of the Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, 2007

Latent-Dynamic Discriminative Models for Continuous Gesture Recognition.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Context-based visual feedback recognition.
PhD thesis, 2006

Non-parametric and light-field deformable models.
Comput. Vis. Image Underst., 2006

Virtual Rapport.
Proceedings of the Intelligent Virtual Agents, 6th International Conference, 2006

Head gesture recognition in intelligent interfaces: the role of context in improving recognition.
Proceedings of the 11th International Conference on Intelligent User Interfaces, 2006

Recognizing gaze aversion gestures in embodied conversational discourse.
Proceedings of the 8th International Conference on Multimodal Interfaces, 2006

Co-Adaptation of audio-visual speech and gesture classifiers.
Proceedings of the 8th International Conference on Multimodal Interfaces, 2006

The effect of head-nod recognition in human-robot conversation.
Proceedings of the 1st ACM SIGCHI/SIGART Conference on Human-Robot Interaction, 2006

Hidden Conditional Random Fields for Gesture Recognition.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

The Role of Context in Head Gesture Recognition.
Proceedings of the Proceedings, 2006

2005
Contextual recognition of head gestures.
Proceedings of the 7th International Conference on Multimodal Interfaces, 2005

2004
From conversational tooltips to grounded discourse: head poseTracking in interactive dialog systems.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

Light Field Appearance Manifolds.
Proceedings of the Computer Vision, 2004

Nodding in conversations with a robot.
Proceedings of the Extended abstracts of the 2004 Conference on Human Factors in Computing Systems, 2004

2003
A multi-modal approach for determining speaker location and focus.
Proceedings of the 5th International Conference on Multimodal Interfaces, 2003

Robust real-time egomotion from stereo images.
Proceedings of the 2003 International Conference on Image Processing, 2003

Adaptive View-Based Appearance Models.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Pose Estimation using 3D View-Based Eigenspaces.
Proceedings of the 2003 IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2003), 2003

2002
Stereo Tracking Using ICP and Normal Flow Constraint.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Face-Responsive Interfaces: From Direct Manipulation to Perceptive Presence.
Proceedings of the UbiComp 2002: Ubiquitous Computing, 4th International Conference, Göteborg, Sweden, September 29, 2002

Fast Stereo-Based Head Tracking for Interactive Environments.
Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2002), 2002

Evaluating look-to-talk: a gaze-aware interface in a collaborative environment.
Proceedings of the Extended abstracts of the 2002 Conference on Human Factors in Computing Systems, 2002

Fast 3D Model Acquisition from Stereo Images.
Proceedings of the 1st International Symposium on 3D Data Processing Visualization and Transmission (3DPVT 2002), 2002

2001
Reducing Drift in Parametric Motion Tracking.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001


  Loading...