Jun Yu
Orcid: 0000-0002-3197-8103Affiliations:
- University of Science and Technology of China, Department of Automation, Hefei, China
According to our database1,
Jun Yu
authored at least 197 papers
between 2013 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024
ACM Trans. Multim. Comput. Commun. Appl., July, 2024
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024
Mach. Learn., April, 2024
Eng. Appl. Artif. Intell., January, 2024
IEEE Trans. Multim., 2024
Conditional Consistency Regularization for Semi-Supervised Multi-Label Image Classification.
IEEE Trans. Multim., 2024
IEEE Trans. Pattern Anal. Mach. Intell., 2024
Data and knowledge-driven deep multiview fusion network based on diffusion model for hyperspectral image classification.
Expert Syst. Appl., 2024
Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading.
CoRR, 2024
Learning the Generalizable Manipulation Skills on Soft-body Tasks via Guided Self-attention Behavior Cloning Policy.
CoRR, 2024
Emotional Cues Extraction and Fusion for Multi-modal Emotion Prediction and Recognition in Conversation.
CoRR, 2024
CoRR, 2024
AUD-TGN: Advancing Action Unit Detection with Temporal Convolution and GPT-2 in Wild Audiovisual Contexts.
CoRR, 2024
Multimodal Fusion Method with Spatiotemporal Sequences and Relationship Learning for Valence-Arousal Estimation.
CoRR, 2024
Exploring Facial Expression Recognition through Semi-Supervised Pretraining and Temporal Modeling.
CoRR, 2024
Efficient Feature Extraction and Late Fusion Strategy for Audiovisual Emotional Mimicry Intensity Estimation.
CoRR, 2024
Part-level Reconstruction for Self-Supervised Category-level 6D Object Pose Estimation with Coarse-to-Fine Correspondence Optimization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
RAG-Guided Large Language Models for Visual Spatial Description with Adaptive Hallucination Corrector.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Temporal-Informative Adapters in VideoMAE V2 and Multi-Scale Feature Fusion for Micro-Expression Spotting-then-Recognize.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Building Robust Video-Level Deepfake Detection via Audio-Visual Local-Global Interactions.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Dialogue Cross-Enhanced Central Engagement Attention Model for Real-Time Engagement Estimation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Enhancing Emotion Recognition in Conversation Through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-Light Image Enhancement.
Proceedings of the Computer Vision - ECCV 2024, 2024
Efficient Feature Extraction and Late Fusion Strategy for Audiovisual Emotional Mimicry Intensity Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
AUD-TGN: Advancing Action Unit Detection with Temporal Convolution and GPT-2 in Wild Audiovisual Contexts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Improving Valence-Arousal Estimation with Spatiotemporal Relationship Learning and Multimodal Fusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Exploring Facial Expression Recognition through Semi-Supervised Pre-training and Temporal Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the Web and Big Data - 8th International Joint Conference, 2024
2023
Image Vis. Comput., November, 2023
Multi-Object Tracking: Decoupling Features to Solve the Contradictory Dilemma of Feature Requirements.
IEEE Trans. Circuits Syst. Video Technol., September, 2023
Mach. Learn., June, 2023
Winning Prize Comes from Losing Tickets: Improve Invariant Learning by Exploring Variant Parameters for Out-of-Distribution Generalization.
CoRR, 2023
CoRR, 2023
A Two-stage Fine-tuning Strategy for Generalizable Manipulation Skill of Embodied AI.
CoRR, 2023
CoRR, 2023
Making Binary Classification from Multiple Unlabeled Datasets Almost Free of Supervision.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Local Region Perception and Relationship Learning Combined with Feature Fusion for Facial Action Unit Detection.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
FlatMatch: Bridging Labeled Data and Unlabeled Data with Cross-Sharpness for Semi-Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
MMT-GD: Multi-Modal Transformer with Graph Distillation for Cross-Cultural Humor Detection.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023
Relative Boundary Modeling: A High-Resolution Cricket Bowl Release Detection Framework with I3D Features.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023
Leveraging the Latent Diffusion Models for Offline Facial Multiple Appropriate Reactions Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Efficient Micro-Expression Spotting Based on Main Directional Mean Optical Flow Feature.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, 2023
Proceedings of the International Joint Conference on Neural Networks, 2023
Proceedings of the 19th International Conference on Natural Computation, 2023
Which is Better for Learning with Noisy Labels: The Semi-supervised Method or Modeling Label Noise?
Proceedings of the International Conference on Machine Learning, 2023
SAGE-NDVI: A Stereotype-Breaking Evaluation Metric for Remote Sensing Image Dehazing Using Satellite-to-Ground NDVI Knowledge.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023
Moderate Coreset: A Universal Method of Data Selection for Real-world Data-efficient Deep Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
RatiO R-CNN: An Efficient and Accurate Detection Method for Oriented Object Detection.
Proceedings of the Image and Graphics - 12th International Conference, 2023
Proceedings of the Image and Graphics - 12th International Conference, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Cross-Domain Transformer with Adaptive Thresholding for Domain Adaptive Semantic Segmentation.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Local Region Perception and Relationship Learning Combined with Feature Fusion for Facial Action Unit Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Robust Generalization Against Photon-Limited Corruptions via Worst-Case Sharpness Minimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the AI 2023: Advances in Artificial Intelligence, 2023
2022
ACM Trans. Multim. Comput. Commun. Appl., 2022
IEEE Trans. Multim., 2022
IEEE Trans. Circuits Syst. Video Technol., 2022
J. Mach. Learn. Res., 2022
Image Vis. Comput., 2022
Dual feature fusion network: A dual feature fusion network for point cloud completion.
IET Comput. Vis., 2022
Efficient 6D object pose estimation based on attentive multi-scale contextual information.
IET Comput. Vis., 2022
Scene Clustering Based Pseudo-labeling Strategy for Multi-modal Aerial View Object Classification.
CoRR, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Pseudo-label Generation and Various Data Augmentation for Semi-Supervised Hyperspectral Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022
2021
Learning Face Image Super-Resolution Through Facial Semantic Attribute Transformation and Self-Attentive Structure Enhancement.
IEEE Trans. Multim., 2021
IEEE Trans. Intell. Transp. Syst., 2021
IEEE Trans. Circuits Syst. Video Technol., 2021
IEEE Trans. Cogn. Dev. Syst., 2021
BiSTF: Bilateral-Branch Self-Training Framework for Semi-Supervised Large-scale Fine-Grained Recognition.
CoRR, 2021
Improving White-box Robustness of Pre-processing Defenses via Joint Adversarial Training.
CoRR, 2021
Emotional Deep Learning Programming Controller for Automatic Voltage Control of Power Systems.
IEEE Access, 2021
Facial Expression Recognition With Confidence Guided Refined Horizontal Pyramid Network.
IEEE Access, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021
2020
ACM Trans. Multim. Comput. Commun. Appl., 2020
Deep Convolutional Neural Network with Optical Flow for Facial Micro-Expression Recognition.
J. Circuits Syst. Comput., 2020
Coordinated Complex-Valued Encoding Dragonfly Algorithm and Artificial Emotional Reinforcement Learning for Coordinated Secondary Voltage Control and Automatic Voltage Regulation in Multi-Generator Power Systems.
IEEE Access, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020
2019
BLTRCNN-Based 3-D Articulatory Movement Prediction: Learning Articulatory Synchronicity From Both Text and Audio Inputs.
IEEE Trans. Multim., 2019
IEEE Trans. Multim., 2019
Synthesizing 3D Trump: Predicting and Visualizing the Relationship Between Text, Speech, and Articulatory Movements.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Deep Neural Network Based 3D Articulatory Movement Prediction Using Both Text and Audio Inputs.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019
STDGAN: ResBlock Based Generative Adversarial Nets Using Spectral Normalization and Two Different Discriminators.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
3D Singing Head for Music VR: Learning External and Internal Articulatory Synchronicity from Lyric, Audio and Notes.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Beauty Product Retrieval Based on Regional Maximum Activation of Convolutions with Generalized Attention.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Towards the Gradient Vanishing, Divergence Mismatching and Mode Collapse of Generative Adversarial Nets.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019
2018
IEEE Trans. Circuits Syst. Video Technol., 2018
Probability contour guided depth map inpainting and superresolution using non-local total generalized variation.
Multim. Tools Appl., 2018
J. Vis. Commun. Image Represent., 2018
Synthesizing 3D Acoustic-Articulatory Mapping Trajectories: Predicting Articulatory Movements by Long-Term Recurrent Convolutional Neural Network.
Proceedings of the IEEE Visual Communications and Image Processing, 2018
Proceedings of the SIGGRAPH Asia 2018 Technical Briefs, Tokyo, Japan, December 04-07, 2018, 2018
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018
Simultaneous Facial Landmark and 3D Action Estimation Based on Probabilistic Random Forest.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018
Synthesizing Photo-Realistic 3D Talking Head: Learning Lip Synchronicity and Emotion from Audio and Video.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018
Proceedings of the British Machine Vision Conference 2018, 2018
2017
Multim. Tools Appl., 2017
Multim. Tools Appl., 2017
Realistic emotion visualization by combining facial animation and hairstyle synthesis.
Multim. Tools Appl., 2017
Creating and simulating a realistic physiological tongue model for speech production.
Multim. Tools Appl., 2017
Image classification based on convolutional neural networks with cross-level strategy.
Multim. Tools Appl., 2017
Joint facial landmark detection and action estimation based on deep probabilistic random forest.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017
Multimodal 3D visible articulation system for syllable based Mandarin Chinese training.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
A Unified Framework for Monocular Video-Based Facial Motion Tracking and Expression Recognition.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017
Speech Synchronized Tongue Animation by Combining Physiology Modeling and X-ray Image Fitting.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017
Adaptively Weighted Facial Expression Recognition by Feature Fusion Under Intense Illumination Condition.
Proceedings of the Neural Information Processing - 24th International Conference, 2017
From talking head to singing head: A significant enhancement for more natural human computer interaction.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Depth map super-resolution using non-local higher-order regularization with classified weights.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the IEEE International Symposium on Haptic, 2017
2016
Multim. Tools Appl., 2016
A monocular video-based facial expression recognition system by combining static and dynamic knowledge.
Proceedings of the 9th International Conference on Utility and Cloud Computing, 2016
Proceedings of the 2016 IEEE International Conference on Digital Signal Processing, 2016
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016
A realistic and reliable 3D pronunciation visualization instruction system for computer-assisted language learning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2016
A fast and precise speech-triggered tongue animation system by combining parameterized model and anatomical model.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2016
2015
A Video, Text, and Speech-Driven Realistic 3-D Virtual Head for Human-Machine Interface.
IEEE Trans. Cybern., 2015
IEEE Signal Process. Lett., 2015
Real-Time Robust Video Stabilization Based on Empirical Mode Decomposition and Multiple Evaluation Criteria.
Proceedings of the Image and Graphics - 8th International Conference, 2015
Proceedings of the Image and Graphics - 8th International Conference, 2015
Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015
Electro-Magnetic Articulography data stabilization for speech synchronized articulatory animation.
Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015
A Digital Video Stabilization System Based on Reliable SIFT Feature Matching and Adaptive Low-Pass Filtering.
Proceedings of the Computer Vision - CCF Chinese Conference, 2015
Cross-Level: A Practical Strategy for Convolutional Neural Networks Based Image Classification.
Proceedings of the Computer Vision - CCF Chinese Conference, 2015
2014
3D facial motion tracking by combining online appearance model and cylinder head model in particle filtering.
Sci. China Inf. Sci., 2014
A mass-spring tongue model with efficient collision detection and response during speech.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the Pattern Recognition - 6th Chinese Conference, 2014
2013
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013