2025
Dynamic prompting class distribution optimization for semi-supervised sound event detection.
Frontiers Inf. Technol. Electron. Eng., April, 2025
Infrared-Visible Image Fusion Using Dual-Branch Auto-Encoder With Invertible High-Frequency Encoding.
IEEE Trans. Circuits Syst. Video Technol., March, 2025
Rebalanced Multimodal Learning with Data-aware Unimodal Sampling.
CoRR, March, 2025
EDSep: An Effective Diffusion-Based Method for Speech Source Separation.
CoRR, January, 2025
A transformer-based model with feature compensation and local information enhancement for end-to-end pest detection.
Comput. Electron. Agric., 2025
2024
Tiny Object Detection via Regional Cross Self-Attention Network.
IEEE Trans. Circuits Syst. Video Technol., October, 2024
Exploring Prototype-Anchor Contrast for Semantic Segmentation.
IEEE Trans. Circuits Syst. Video Technol., August, 2024
Adaptive Density Subgraph Clustering.
IEEE Trans. Comput. Soc. Syst., August, 2024
A novel conversational hierarchical attention network for speech emotion recognition in dyadic conversation.
Multim. Tools Appl., June, 2024
Prototypical Bidirectional Adaptation and Learning for Cross-Domain Semantic Segmentation.
IEEE Trans. Multim., 2024
On Local Temporal Embedding for Semi-Supervised Sound Event Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
A post-processing framework for class-imbalanced learning in a transductive setting.
Expert Syst. Appl., 2024
Leveraging Contrastive Language-Image Pre-Training and Bidirectional Cross-attention for Multimodal Keyword Spotting.
Eng. Appl. Artif. Intell., 2024
A Survey of Deep Learning for Group-level Emotion Recognition.
CoRR, 2024
TF-DiffuSE: Time-Frequency Prior-Conditioned Diffusion Model for Speech Enhancement.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024
PL-TTS: A Generalizable Prompt-based Diffusion TTS Augmented by Large Language Model.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
On Learning Frequency-Instance Correlations by Model-Agnostic Training for Synthetic Speech Detection.
Proceedings of the Asian Conference on Machine Learning, 2024
2023
CCTG-NET: Contextualized Convolutional Transformer-GRU Network for speech emotion recognition.
Int. J. Speech Technol., December, 2023
Weighted contrastive learning using pseudo labels for facial expression recognition.
Vis. Comput., October, 2023
Multi-level distance embedding learning for robust acoustic scene classification with unseen devices.
Pattern Anal. Appl., August, 2023
Large-scale non-negative subspace clustering based on Nyström approximation.
Inf. Sci., August, 2023
A semi-supervised resampling method for class-imbalanced learning.
Expert Syst. Appl., July, 2023
Global and local structure preserving nonnegative subspace clustering.
Pattern Recognit., June, 2023
Semi-Supervised Clustering Under a "Compact-Cluster" Assumption.
IEEE Trans. Knowl. Data Eng., May, 2023
An efficient speech emotion recognition based on a dual-stream CNN-transformer fusion network.
Int. J. Speech Technol., 2023
Multi-branch feature aggregation based on multiple weighting for speaker verification.
Comput. Speech Lang., 2023
An Empirical Study of Super-resolution on Low-resolution Micro-expression Recognition.
CoRR, 2023
Towards A Robust Group-level Emotion Recognition via Uncertainty-Aware Learning.
CoRR, 2023
TE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Joint-Former: Jointly Regularized and Locally Down-sampled Conformer for Semi-supervised Sound Event Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
2022
Objective Class-Based Micro-Expression Recognition Under Partial Occlusion Via Region-Inspired Relation Reasoning Network.
IEEE Trans. Affect. Comput., 2022
Feature refinement: An expression-specific feature learning and fusion method for micro-expression recognition.
Pattern Recognit., 2022
A context aware-based deep neural network approach for simultaneous speech denoising and dereverberation.
Neural Comput. Appl., 2022
Int. J. Interact. Multim. Artif. Intell., 2022
Convolutional relation network for facial expression recognition in the wild with few-shot learning.
Expert Syst. Appl., 2022
Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network.
Comput. Speech Lang., 2022
Label Structure Preserving Contrastive Embedding for Multi-Label Learning with Missing Labels.
CoRR, 2022
Weakly Supervised Sentiment-Specific Region Discovery for VSA.
Comput. J., 2022
Sparse signal reconstruction via generalized two-stage thresholding.
Sci. China Inf. Sci., 2022
Self-supervised transformer-based pre-training method using latent semantic masking auto-encoder for pest and disease classification.
Comput. Electron. Agric., 2022
Cross-Scene Speaker Verification Based on Dynamic Convolution for the CNSRC 2022 Challenge.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
Adaptive Hierarchical Pooling for Weakly-supervised Sound Event Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
DCTCN: Deep Complex Temporal Convolutional Network for Long Time Speech Enhancement.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Statistical Pyramid Dense Time Delay Neural Network for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022
Efficient Monaural Speech Separation with Multiscale Time-Delay Sampling.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Deep face clustering using residual graph convolutional network.
Knowl. Based Syst., 2021
Erratum to: Latent discriminative representation learning for speaker recognition.
Frontiers Inf. Technol. Electron. Eng., 2021
Latent discriminative representation learning for speaker recognition.
Frontiers Inf. Technol. Electron. Eng., 2021
A survey of micro-expression recognition.
Image Vis. Comput., 2021
Learning to disentangle emotion factors for facial expression recognition in the wild.
Int. J. Intell. Syst., 2021
Cross lingual speech emotion recognition via triple attentive asymmetric convolutional neural network.
Int. J. Intell. Syst., 2021
An efficient Nyström spectral clustering algorithm using incomplete Cholesky decomposition.
Expert Syst. Appl., 2021
Region attention and graph embedding network for occlusion objective class-based micro-expression recognition.
CoRR, 2021
Reproducibility Companion Paper: On Learning Disentangled Representation for Acoustic Event Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
2020
A Unified Deep Model for Joint Facial Expression Recognition, Face Synthesis, and Face Alignment.
IEEE Trans. Image Process., 2020
Geometry Guided Pose-Invariant Facial Expression Recognition.
IEEE Trans. Image Process., 2020
Weighted discriminative collaborative competitive representation for robust image classification.
Neural Networks, 2020
NLWSNet: a weakly supervised network for visual sentiment analysis in mislabeled web images.
Frontiers Inf. Technol. Electron. Eng., 2020
Latent source-specific generative factor learning for monaural speech separation using weighted-factor autoencoder.
Frontiers Inf. Technol. Electron. Eng., 2020
Environment-assisted Multi-task Learning for Polyphonic Acoustic Event Detection.
计算机科学, 2020
Discriminative globality and locality preserving graph embedding for dimensionality reduction.
Expert Syst. Appl., 2020
Objective Class-based Micro-Expression Recognition through Simultaneous Action Unit Detection and Feature Aggregation.
CoRR, 2020
Visual Sentiment Analysis With Active Learning.
IEEE Access, 2020
Joint Attribute Manipulation and Modality Alignment Learning for Composing Text and Image to Image Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Face Aging with Conditional Generative Adversarial Network Guided by Ranking-CNN.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
On Synthesis for Supervised Monaural Speech Separation in Time Domain.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Salient Attention Model and Classes Imbalance Remission for Video Anomaly Analysis with Weak Label.
Proceedings of the Human Centered Computing - 6th International Conference, 2020
2019
An emotion-based responding model for natural language conversation.
World Wide Web, 2019
A Local Mean Representation-based K-Nearest Neighbor Classifier.
ACM Trans. Intell. Syst. Technol., 2019
Multimodal shared features learning for emotion recognition by enhanced sparse local discriminative canonical correlation analysis.
Multim. Syst., 2019
Mood-aware visual question answering.
Neurocomputing, 2019
Affective question answering on video.
Neurocomputing, 2019
Several robust extensions of collaborative representation for image classification.
Neurocomputing, 2019
Dictionary-induced least squares framework for multi-view dimensionality reduction with multi-manifold embeddings.
IET Comput. Vis., 2019
Two-phase probabilistic collaborative representation-based classification.
Expert Syst. Appl., 2019
Triple attention network for sentimental visual question answering.
Comput. Vis. Image Underst., 2019
An Emotion-Embedded Visual Attention Model for Dimensional Emotion Context Learning.
IEEE Access, 2019
Dual Exclusive Attentive Transfer for Unsupervised Deep Convolutional Domain Adaptation in Speech Emotion Recognition.
IEEE Access, 2019
Learning Hierarchical Emotion Context for Continuous Dimensional Emotion Recognition From Video Sequences.
IEEE Access, 2019
On Learning Disentangled Representation for Acoustic Event Detection.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Cross-Database Micro-Expression Recognition: A Style Aggregated and Attention Transfer Approach.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019
Discriminative Group Collaborative Competitive Representation for Visual Classification.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
Dual-Inception Network for Cross-Database Micro-Expression Recognition.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019
2018
Spatially Coherent Feature Learning for Pose-Invariant Facial Expression Recognition.
ACM Trans. Multim. Comput. Commun. Appl., 2018
Discriminative self-adapted locality-sensitive sparse representation for video semantic analysis.
Multim. Tools Appl., 2018
Affective rating ranking based on face images in arousal-valence dimensional space.
Frontiers Inf. Technol. Electron. Eng., 2018
Two-phase linear reconstruction measure-based classification for face recognition.
Inf. Sci., 2018
Cascaded Multi-level Transformed Dirichlet Process for Multi-pose Facial Expression Recognition.
Comput. J., 2018
A New Discriminative Collaborative Neighbor Representation Method for Robust Face Recognition.
IEEE Access, 2018
Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer Approach.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Affective Visual Question Answering Network.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018
A K-AP Clustering Algorithm Based on Manifold Similarity Measure.
Proceedings of the Intelligent Information Processing IX, 2018
Joint Pose and Expression Modeling for Facial Expression Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Coupled Unsupervised Deep Convolutional Domain Adaptation for Speech Emotion Recognition.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018
2017
Hierarchical Bayesian Theme Models for Multipose Facial Expression Recognition.
IEEE Trans. Multim., 2017
Learning emotion-discriminative and domain-invariant features for domain adaptation in speech emotion recognition.
Speech Commun., 2017
Unsupervised domain adaptation for speech emotion recognition using PCANet.
Multim. Tools Appl., 2017
A Multi-local Means Based Nearest Neighbor Classifier.
Proceedings of the 29th IEEE International Conference on Tools with Artificial Intelligence, 2017
2016
Pose-robust feature learning for facial expression recognition.
Frontiers Comput. Sci., 2016
Collaborative Q-Learning Based Routing Control in Unstructured P2P Networks.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016
Multi-pose Facial Expression Recognition Using Transformed Dirichlet Process.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Domain adaptation for speech emotion recognition by sharing priors between related source and target classes.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
A semi-supervised incremental learning method based on adaptive probabilistic hypergraph for video semantic detection.
Multim. Tools Appl., 2015
Using Kinect for real-time emotion recognition via facial expressions.
Frontiers Inf. Technol. Electron. Eng., 2015
Speech emotion recognition with unsupervised feature learning.
Frontiers Inf. Technol. Electron. Eng., 2015
Locality-sensitive Discriminant Sparse Representation for Video Semantic Analysis.
计算机科学, 2015
A Video Semantic Analysis Method Based on Kernel Discriminative Sparse Representation and Weighted KNN.
Comput. J., 2015
Two-Phase Representation Based Classification.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015
Learning speech emotion features by joint disentangling-discrimination.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Multi-pose facial expression recognition based on SURF boosting.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
Learning Salient Features for Speech Emotion Recognition Using Convolutional Neural Networks.
IEEE Trans. Multim., 2014
An SVM-AdaBoost-based face detection system.
J. Exp. Theor. Artif. Intell., 2014
A neural-AdaBoost based facial expression recognition system.
Expert Syst. Appl., 2014
An SVM-AdaBoost facial expression recognition system.
Appl. Intell., 2014
Speech Emotion Recognition Using CNN.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
2013
Pedestrian Detection Based on Kernel Discriminative Sparse Representation.
Trans. Edutainment, 2013
Speaker-independent speech emotion recognition by fusion of functional and accompanying paralanguage features.
J. Zhejiang Univ. Sci. C, 2013
Regularized least squares fisher linear discriminant with applications to image recognition.
Neurocomputing, 2013
Improved twin support vector machine using total margin and graph embedding.
Proceedings of the Ninth International Conference on Natural Computation, 2013
A Video Semantic Analysis Method Based on Kernel Discriminative Sparse Representation and Weighted KNN.
Proceedings of the 2013 IEEE International Conference on Green Computing and Communications (GreenCom) and IEEE Internet of Things (iThings) and IEEE Cyber, 2013
2010
Speech Emotion Recognition Method Based on Improved Decision Tree and Layered Feature Selection.
Int. J. Humanoid Robotics, 2010
Extraction and analysis of the speech emotion features based on multi-fractal spectrum.
Int. J. Comput. Appl. Technol., 2010
A novel hierarchical speech emotion recognition method based on improved DDAGSVM.
Comput. Sci. Inf. Syst., 2010
Knowledge Preference Based Learning Community Construction and Service Support.
Proceedings of the Entertainment for Education. Digital Techniques and Systems, 2010
2008
Application Research of Ontology in E-Learning Environment.
Proceedings of the International Conference on Cyberworlds 2008, 2008
2007
Ontology Based Situation Analysis and Encouragement in E-Learning System.
Proceedings of the Technologies for E-Learning and Digital Entertainment, 2007
2005
The shared knowledge space model in Web-based cooperative learning coalition.
Proceedings of the Ninth International Conference on Computer Supported Cooperative Work in Design, 2005
2004
DRMR: Dynamic-Ring-Based Multicast Routing Protocol for Ad Hoc Networks.
J. Comput. Sci. Technol., 2004
Optimistic Locking Concurrency Control Scheme for Collaborative Editing System Based on Relative Position.
Proceedings of the Computer Supported Cooperative Work in Design I, 2004
Design and Simulation of Multicast Routing Protocol for Mobile Internet.
Proceedings of the Advanced Web Technologies and Applications, 2004