DCTM: Dilated Convolutional Transformer Model for Multimodal Engagement Estimation in Conversation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Ensemble Spatial and Temporal Vision Transformer for Action Units Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Multiscale Transformer-Based for Multimodal Affective States Estimation from Physiological Signals.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023