2025

Dynamic prompting class distribution optimization for semi-supervised sound event detection.

[DOI]

,

,

,

,

Frontiers Inf. Technol. Electron. Eng., April, 2025

Infrared-Visible Image Fusion Using Dual-Branch Auto-Encoder With Invertible High-Frequency Encoding.

[DOI]

,

,

,

IEEE Trans. Circuits Syst. Video Technol., March, 2025

Rebalanced Multimodal Learning with Data-aware Unimodal Sampling.

[DOI]

,

,

,

,

,

CoRR, March, 2025

EDSep: An Effective Diffusion-Based Method for Speech Source Separation.

[DOI]

,

,

CoRR, January, 2025

A transformer-based model with feature compensation and local information enhancement for end-to-end pest detection.

[DOI]

,

,

,

,

Comput. Electron. Agric., 2025

2024

Tiny Object Detection via Regional Cross Self-Attention Network.

[DOI]

,

,

Humaira abdul Ghafoor

,

,

,

IEEE Trans. Circuits Syst. Video Technol., October, 2024

Exploring Prototype-Anchor Contrast for Semantic Segmentation.

[DOI]

,

,

,

IEEE Trans. Circuits Syst. Video Technol., August, 2024

Adaptive Density Subgraph Clustering.

[DOI]

,

,

,

,

IEEE Trans. Comput. Soc. Syst., August, 2024

A novel conversational hierarchical attention network for speech emotion recognition in dyadic conversation.

[DOI]

Mohammed Tellai

,

,

,

Mounir Abdelaziz

Multim. Tools Appl., June, 2024

Prototypical Bidirectional Adaptation and Learning for Cross-Domain Semantic Segmentation.

[DOI]

,

,

IEEE Trans. Multim., 2024

On Local Temporal Embedding for Semi-Supervised Sound Event Detection.

[DOI]

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2024

A post-processing framework for class-imbalanced learning in a transductive setting.

[DOI]

,

,

,

,

Expert Syst. Appl., 2024

Leveraging Contrastive Language-Image Pre-Training and Bidirectional Cross-attention for Multimodal Keyword Spotting.

[DOI]

,

,

,

Eng. Appl. Artif. Intell., 2024

A Survey of Deep Learning for Group-level Emotion Recognition.

[DOI]

,

,

,

,

CoRR, 2024

TF-DiffuSE: Time-Frequency Prior-Conditioned Diffusion Model for Speech Enhancement.

[DOI]

,

,

,

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

PL-TTS: A Generalizable Prompt-based Diffusion TTS Augmented by Large Language Model.

[DOI]

,

,

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

On Learning Frequency-Instance Correlations by Model-Agnostic Training for Synthetic Speech Detection.

[DOI]

,

,

,

Proceedings of the Asian Conference on Machine Learning, 2024

2023

CCTG-NET: Contextualized Convolutional Transformer-GRU Network for speech emotion recognition.

[DOI]

Mohammed Tellai

,

Int. J. Speech Technol., December, 2023

Weighted contrastive learning using pseudo labels for facial expression recognition.

[DOI]

,

,

Vis. Comput., October, 2023

Multi-level distance embedding learning for robust acoustic scene classification with unseen devices.

[DOI]

,

,

,

Pattern Anal. Appl., August, 2023

Large-scale non-negative subspace clustering based on Nyström approximation.

[DOI]

,

,

,

,

,

Inf. Sci., August, 2023

A semi-supervised resampling method for class-imbalanced learning.

[DOI]

,

,

,

,

Expert Syst. Appl., July, 2023

Global and local structure preserving nonnegative subspace clustering.

[DOI]

,

,

,

,

,

Pattern Recognit., June, 2023

Semi-Supervised Clustering Under a "Compact-Cluster" Assumption.

[DOI]

,

,

,

IEEE Trans. Knowl. Data Eng., May, 2023

An efficient speech emotion recognition based on a dual-stream CNN-transformer fusion network.

[DOI]

Mohammed Tellai

,

,

Int. J. Speech Technol., 2023

Multi-branch feature aggregation based on multiple weighting for speaker verification.

[DOI]

,

,

,

Comput. Speech Lang., 2023

An Empirical Study of Super-resolution on Low-resolution Micro-expression Recognition.

[DOI]

,

,

,

,

,

CoRR, 2023

Towards A Robust Group-level Emotion Recognition via Uncertainty-Aware Learning.

[DOI]

,

,

,

,

CoRR, 2023

TE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting.

[DOI]

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Joint-Former: Jointly Regularized and Locally Down-sampled Conformer for Semi-supervised Sound Event Detection.

[DOI]

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Objective Class-Based Micro-Expression Recognition Under Partial Occlusion Via Region-Inspired Relation Reasoning Network.

[DOI]

,

,

,

,

IEEE Trans. Affect. Comput., 2022

Feature refinement: An expression-specific feature learning and fusion method for micro-expression recognition.

[DOI]

,

,

,

,

Pattern Recognit., 2022

A context aware-based deep neural network approach for simultaneous speech denoising and dereverberation.

[DOI]

Sidheswar Routray

,

Neural Comput. Appl., 2022

Editor's Note.

[DOI]

,

Joel J. P. C. Rodrigues

,

Anthony G. Cohn

,

,

Int. J. Interact. Multim. Artif. Intell., 2022

Convolutional relation network for facial expression recognition in the wild with few-shot learning.

[DOI]

,

,

,

Ocquaye Elias Nii Noi

,

Expert Syst. Appl., 2022

Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network.

[DOI]

Sidheswar Routray

,

Comput. Speech Lang., 2022

Label Structure Preserving Contrastive Embedding for Multi-Label Learning with Missing Labels.

[DOI]

,

,

,

CoRR, 2022

Weakly Supervised Sentiment-Specific Region Discovery for VSA.

[DOI]

,

,

,

,

Comput. J., 2022

Sparse signal reconstruction via generalized two-stage thresholding.

[DOI]

,

,

,

,

Sci. China Inf. Sci., 2022

Self-supervised transformer-based pre-training method using latent semantic masking auto-encoder for pest and disease classification.

[DOI]

,

,

,

,

Comput. Electron. Agric., 2022

Cross-Scene Speaker Verification Based on Dynamic Convolution for the CNSRC 2022 Challenge.

[DOI]

,

,

,

,

Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Adaptive Hierarchical Pooling for Weakly-supervised Sound Event Detection.

[DOI]

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

DCTCN: Deep Complex Temporal Convolutional Network for Long Time Speech Enhancement.

[DOI]

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Statistical Pyramid Dense Time Delay Neural Network for Speaker Verification.

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

Efficient Monaural Speech Separation with Multiscale Time-Delay Sampling.

[DOI]

Shuang-qing Qian

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Deep face clustering using residual graph convolutional network.

[DOI]

,

,

,

,

,

Knowl. Based Syst., 2021

Erratum to: Latent discriminative representation learning for speaker recognition.

[DOI]

,

,

,

,

Sidheswar Routray

,

Ocquaye Elias Nii Noi

Frontiers Inf. Technol. Electron. Eng., 2021

Latent discriminative representation learning for speaker recognition.

[DOI]

,

,

,

,

Sidheswar Routray

,

Ocquaye Elias Nii Noi

Frontiers Inf. Technol. Electron. Eng., 2021

A survey of micro-expression recognition.

[DOI]

,

,

Image Vis. Comput., 2021

Learning to disentangle emotion factors for facial expression recognition in the wild.

[DOI]

,

,

,

Int. J. Intell. Syst., 2021

Cross lingual speech emotion recognition via triple attentive asymmetric convolutional neural network.

[DOI]

Ocquaye Elias Nii Noi

,

,

,

Int. J. Intell. Syst., 2021

An efficient Nyström spectral clustering algorithm using incomplete Cholesky decomposition.

[DOI]

,

,

,

,

Expert Syst. Appl., 2021

Region attention and graph embedding network for occlusion objective class-based micro-expression recognition.

[DOI]

,

,

,

,

CoRR, 2021

Reproducibility Companion Paper: On Learning Disentangled Representation for Acoustic Event Detection.

[DOI]

,

,

,

,

Ratna Babu Chinnam

,

Lucile Sassatelli

,

Miguel Fabián Romero Rondón

,

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020

A Unified Deep Model for Joint Facial Expression Recognition, Face Synthesis, and Face Alignment.

[DOI]

,

,

,

IEEE Trans. Image Process., 2020

Geometry Guided Pose-Invariant Facial Expression Recognition.

[DOI]

,

,

,

IEEE Trans. Image Process., 2020

Weighted discriminative collaborative competitive representation for robust image classification.

[DOI]

,

,

,

,

,

Neural Networks, 2020

NLWSNet: a weakly supervised network for visual sentiment analysis in mislabeled web images.

[DOI]

,

,

,

Frontiers Inf. Technol. Electron. Eng., 2020

Latent source-specific generative factor learning for monaural speech separation using weighted-factor autoencoder.

[DOI]

,

,

,

Shuang-qing Qian

,

Frontiers Inf. Technol. Electron. Eng., 2020

环境辅助的多任务混合声音事件检测方法 (Environment-assisted Multi-task Learning for Polyphonic Acoustic Event Detection).

[DOI]

,

计算机科学, 2020

Discriminative globality and locality preserving graph embedding for dimensionality reduction.

[DOI]

,

,

,

,

,

Expert Syst. Appl., 2020

Objective Class-based Micro-Expression Recognition through Simultaneous Action Unit Detection and Feature Aggregation.

[DOI]

,

,

CoRR, 2020

Visual Sentiment Analysis With Active Learning.

[DOI]

,

,

IEEE Access, 2020

Joint Attribute Manipulation and Modality Alignment Learning for Composing Text and Image to Image Retrieval.

[DOI]

,

,

,

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Face Aging with Conditional Generative Adversarial Network Guided by Ranking-CNN.

[DOI]

,

,

,

,

Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation.

[DOI]

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

On Synthesis for Supervised Monaural Speech Separation in Time Domain.

[DOI]

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Salient Attention Model and Classes Imbalance Remission for Video Anomaly Analysis with Weak Label.

[DOI]

,

,

,

Proceedings of the Human Centered Computing - 6th International Conference, 2020

2019

An emotion-based responding model for natural language conversation.

[DOI]

,

,

,

,

,

World Wide Web, 2019

A Local Mean Representation-based <i>K</i>-Nearest Neighbor Classifier.

[DOI]

,

,

,

,

,

ACM Trans. Intell. Syst. Technol., 2019

Multimodal shared features learning for emotion recognition by enhanced sparse local discriminative canonical correlation analysis.

[DOI]

,

,

,

Multim. Syst., 2019

Mood-aware visual question answering.

[DOI]

,

,

,

,

Neurocomputing, 2019

Affective question answering on video.

[DOI]

,

,

,

Neurocomputing, 2019

Several robust extensions of collaborative representation for image classification.

[DOI]

,

,

,

,

,

Neurocomputing, 2019

Dictionary-induced least squares framework for multi-view dimensionality reduction with multi-manifold embeddings.

[DOI]

Timothy Apasiba Abeo

,

,

,

,

,

IET Comput. Vis., 2019

Two-phase probabilistic collaborative representation-based classification.

[DOI]

,

,

,

,

,

Expert Syst. Appl., 2019

Triple attention network for sentimental visual question answering.

[DOI]

,

,

,

,

Comput. Vis. Image Underst., 2019

An Emotion-Embedded Visual Attention Model for Dimensional Emotion Context Learning.

[DOI]

,

,

,

,

IEEE Access, 2019

Dual Exclusive Attentive Transfer for Unsupervised Deep Convolutional Domain Adaptation in Speech Emotion Recognition.

[DOI]

Ocquaye Elias Nii Noi

,

,

,

,

IEEE Access, 2019

Learning Hierarchical Emotion Context for Continuous Dimensional Emotion Recognition From Video Sequences.

[DOI]

,

,

,

,

IEEE Access, 2019

On Learning Disentangled Representation for Acoustic Event Detection.

[DOI]

,

,

,

,

Ratna Babu Chinnam

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Cross-Database Micro-Expression Recognition: A Style Aggregated and Attention Transfer Approach.

[DOI]

,

,

Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Discriminative Group Collaborative Competitive Representation for Visual Classification.

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Dual-Inception Network for Cross-Database Micro-Expression Recognition.

[DOI]

,

,

Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

2018

Spatially Coherent Feature Learning for Pose-Invariant Facial Expression Recognition.

[DOI]

,

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., 2018

Discriminative self-adapted locality-sensitive sparse representation for video semantic analysis.

[DOI]

,

,

,

Multim. Tools Appl., 2018

Affective rating ranking based on face images in arousal-valence dimensional space.

[DOI]

,

,

,

Frontiers Inf. Technol. Electron. Eng., 2018

Two-phase linear reconstruction measure-based classification for face recognition.

[DOI]

,

,

,

,

,

Inf. Sci., 2018

Cascaded Multi-level Transformed Dirichlet Process for Multi-pose Facial Expression Recognition.

[DOI]

,

,

,

,

Comput. J., 2018

A New Discriminative Collaborative Neighbor Representation Method for Robust Face Recognition.

[DOI]

,

,

,

,

,

IEEE Access, 2018

Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer Approach.

[DOI]

,

,

,

,

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Affective Visual Question Answering Network.

[DOI]

,

,

,

Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

A K-AP Clustering Algorithm Based on Manifold Similarity Measure.

[DOI]

,

,

,

,

Proceedings of the Intelligent Information Processing IX, 2018

Joint Pose and Expression Modeling for Facial Expression Recognition.

[DOI]

,

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Coupled Unsupervised Deep Convolutional Domain Adaptation for Speech Emotion Recognition.

[DOI]

Ocquaye Elias Nii Noi

,

,

,

Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

2017

Hierarchical Bayesian Theme Models for Multipose Facial Expression Recognition.

[DOI]

,

,

,

IEEE Trans. Multim., 2017

Learning emotion-discriminative and domain-invariant features for domain adaptation in speech emotion recognition.

[DOI]

,

,

,

,

Speech Commun., 2017

Unsupervised domain adaptation for speech emotion recognition using PCANet.

[DOI]

,

,

,

Multim. Tools Appl., 2017

A Multi-local Means Based Nearest Neighbor Classifier.

[DOI]

,

,

,

,

,

Proceedings of the 29th IEEE International Conference on Tools with Artificial Intelligence, 2017

2016

Pose-robust feature learning for facial expression recognition.

[DOI]

,

,

,

,

Frontiers Comput. Sci., 2016

Collaborative Q-Learning Based Routing Control in Unstructured P2P Networks.

[DOI]

,

,

,

,

,

Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Multi-pose Facial Expression Recognition Using Transformed Dirichlet Process.

[DOI]

,

,

,

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Domain adaptation for speech emotion recognition by sharing priors between related source and target classes.

[DOI]

,

,

,

,

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

A semi-supervised incremental learning method based on adaptive probabilistic hypergraph for video semantic detection.

[DOI]

,

,

,

,

Multim. Tools Appl., 2015

Using Kinect for real-time emotion recognition via facial expressions.

[DOI]

,

,

,

Frontiers Inf. Technol. Electron. Eng., 2015

Speech emotion recognition with unsupervised feature learning.

[DOI]

,

,

Frontiers Inf. Technol. Electron. Eng., 2015

面向视频语义分析的局部敏感的可鉴别稀疏表示 (Locality-sensitive Discriminant Sparse Representation for Video Semantic Analysis).

[DOI]

,

,

,

计算机科学, 2015

A Video Semantic Analysis Method Based on Kernel Discriminative Sparse Representation and Weighted KNN.

[DOI]

,

,

,

,

Comput. J., 2015

Two-Phase Representation Based Classification.

[DOI]

,

,

,

,

Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Learning speech emotion features by joint disentangling-discrimination.

[DOI]

,

,

,

Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Multi-pose facial expression recognition based on SURF boosting.

[DOI]

,

,

,

Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014

Learning Salient Features for Speech Emotion Recognition Using Convolutional Neural Networks.

[DOI]

,

,

,

IEEE Trans. Multim., 2014

An SVM-AdaBoost-based face detection system.

[DOI]

,

,

J. Exp. Theor. Artif. Intell., 2014

A neural-AdaBoost based facial expression recognition system.

[DOI]

,

,

Expert Syst. Appl., 2014

An SVM-AdaBoost facial expression recognition system.

[DOI]

,

,

Appl. Intell., 2014

Speech Emotion Recognition Using CNN.

[DOI]

,

,

,

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

2013

Pedestrian Detection Based on Kernel Discriminative Sparse Representation.

[DOI]

,

,

Trans. Edutainment, 2013

Speaker-independent speech emotion recognition by fusion of functional and accompanying paralanguage features.

[DOI]

,

,

,

J. Zhejiang Univ. Sci. C, 2013

Regularized least squares fisher linear discriminant with applications to image recognition.

[DOI]

,

,

,

Neurocomputing, 2013

Improved twin support vector machine using total margin and graph embedding.

[DOI]

,

,

,

Proceedings of the Ninth International Conference on Natural Computation, 2013

A Video Semantic Analysis Method Based on Kernel Discriminative Sparse Representation and Weighted KNN.

[DOI]

,

,

,

Proceedings of the 2013 IEEE International Conference on Green Computing and Communications (GreenCom) and IEEE Internet of Things (iThings) and IEEE Cyber, 2013

2010

Speech Emotion Recognition Method Based on Improved Decision Tree and Layered Feature Selection.

[DOI]

,

,

Int. J. Humanoid Robotics, 2010

Extraction and analysis of the speech emotion features based on multi-fractal spectrum.

[DOI]

,

Int. J. Comput. Appl. Technol., 2010

A novel hierarchical speech emotion recognition method based on improved DDAGSVM.

[DOI]

,

Comput. Sci. Inf. Syst., 2010

Knowledge Preference Based Learning Community Construction and Service Support.

[DOI]

,

,

Proceedings of the Entertainment for Education. Digital Techniques and Systems, 2010

2008

Application Research of Ontology in E-Learning Environment.

[DOI]

,

,

Proceedings of the International Conference on Cyberworlds 2008, 2008

2007

Ontology Based Situation Analysis and Encouragement in E-Learning System.

[DOI]

,

,

Proceedings of the Technologies for E-Learning and Digital Entertainment, 2007

2005

The shared knowledge space model in Web-based cooperative learning coalition.

[DOI]

,

,

,

,

Proceedings of the Ninth International Conference on Computer Supported Cooperative Work in Design, 2005

2004

DRMR: Dynamic-Ring-Based Multicast Routing Protocol for Ad Hoc Networks.

[DOI]

,

,

,

,

J. Comput. Sci. Technol., 2004

Optimistic Locking Concurrency Control Scheme for Collaborative Editing System Based on Relative Position.

[DOI]

,

,

Proceedings of the Computer Supported Cooperative Work in Design I, 2004

Design and Simulation of Multicast Routing Protocol for Mobile Internet.

[DOI]

,

,

,

,

Proceedings of the Advanced Web Technologies and Applications, 2004