Xun Yang

ORCID: 0000-0003-0201-1638

Affiliations:

University of Science and Technology of China, Department of Electronic Engineering and Information Science, China

According to our database, Xun Yang authored at least 100 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

Disentangled Cascaded Graph Convolution Networks for Multi-Behavior Recommendation.

Trans. Recomm. Syst., December, 2024

FedGAMMA: Federated Learning With Global Sharpness-Aware Minimization.

IEEE Trans. Neural Networks Learn. Syst., December, 2024

Efficiently Gluing Pre-Trained Language and Vision Models for Image Captioning.

ACM Trans. Intell. Syst. Technol., December, 2024

Exploring and exploiting model uncertainty for robust visual question answering.

Multim. Syst., December, 2024

Mutual-weighted feature disentanglement for unsupervised domain adaptation.

Multim. Syst., December, 2024

Depth Matters: Spatial Proximity-Based Gaze Cone Generation for Gaze Following in Wild.

ACM Trans. Multim. Comput. Commun. Appl., November, 2024

Cross-Lingual Cross-Modal Retrieval With Noise-Robust Fine-Tuning.

IEEE Trans. Knowl. Data Eng., November, 2024

Learning Hierarchical Visual Transformation for Domain Generalizable Visual Matching and Recognition.

Int. J. Comput. Vis., November, 2024

Mitigating Hidden Confounding Effects for Causal Recommendation.

IEEE Trans. Knowl. Data Eng., September, 2024

Dual-Path TokenLearner for Remote Photoplethysmography-Based Physiological Measurement With Facial Videos.

IEEE Trans. Comput. Soc. Syst., June, 2024

Graph Pooling Inference Network for Text-based VQA.

ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Visual-linguistic-stylistic Triple Reward for Cross-lingual Image Captioning.

ACM Trans. Multim. Comput. Commun. Appl., April, 2024

FaSRnet: a feature and semantics refinement network for human pose estimation.

Frontiers Inf. Technol. Electron. Eng., March, 2024

Decoupled domain-specific and domain-conditional representation learning for cross-domain recommendation.

Inf. Process. Manag., March, 2024

Video Compressed Sensing Reconstruction via an Untrained Network with Low-Rank Regularization.

IEEE Trans. Multim., 2024

Frame-Padded Multiscale Transformer for Monocular 3D Human Pose Estimation.

IEEE Trans. Multim., 2024

Efficient Cross-Modal Video Retrieval With Meta-Optimized Frames.

IEEE Trans. Multim., 2024

Emotional Video Captioning With Vision-Based Emotion Interpretation Network.

IEEE Trans. Image Process., 2024

Complex Power Quality Disturbance Recognition Research Based on Deep Complementary Fusion of 2-D Coding Transition.

IEEE Trans. Instrum. Meas., 2024

Repetitive Action Counting with Hybrid Temporal Relation Modeling.

CoRR, 2024

Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models.

CoRR, 2024

TV-3DG: Mastering Text-to-3D Customized Generation with Visual Prompt.

CoRR, 2024

FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video Dataset.

CoRR, 2024

Grounding is All You Need? Dual Temporal Grounding for Video Dialog.

CoRR, 2024

Scene-Text Grounding for Text-Based Video Question Answering.

CoRR, 2024

GRPose: Learning Graph Relations for Human Image Generation with Pose Priors.

CoRR, 2024

Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks.

CoRR, 2024

TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation.

CoRR, 2024

Gradually Vanishing Gap in Prototypical Network for Unsupervised Domain Adaptation.

CoRR, 2024

Dual-State Personalized Knowledge Tracing with Emotional Incorporation.

CoRR, 2024

Personalized Forgetting Mechanism with Concept-Driven Knowledge Tracing.

CoRR, 2024

Unified Static and Dynamic Network: Efficient Temporal Filtering for Video Grounding.

CoRR, 2024

Hyper-3DG: Text-to-3D Gaussian Generation via Hypergraph.

CoRR, 2024

AMTN: Attention-Enhanced Multimodal Temporal Network for Humor Detection.

Proceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor, 2024

Informative Point cloud Dataset Extraction for Classification via Gradient-based Points Moving.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Reverse2Complete: Unpaired Multimodal Point Cloud Completion via Guided Diffusion.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Maskable Retentive Network for Video Moment Retrieval.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

FedCAFE: Federated Cross-Modal Hashing with Adaptive Feature Enhancement.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Advancing Prompt Learning through an External Layer.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Dual-stream Feature Augmentation for Domain Generalization.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Rethinking Human Motion Prediction with Symplectic Integral.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Boosting Neural Cognitive Diagnosis with Student's Affective State Modeling.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Causality-Inspired Invariant Representation Learning for Text-Based Person Retrieval.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Progressive Localization Networks for Language-Based Moment Localization.

ACM Trans. Multim. Comput. Commun. Appl., 2023

Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA.

IEEE Trans. Image Process., 2023

Finding and Editing Multi-Modal Neurons in Pre-Trained Transformer.

CoRR, 2023

Towards Complex-query Referring Image Segmentation: A Novel Benchmark.

CoRR, 2023

From Region to Patch: Attribute-Aware Foreground-Background Contrastive Learning for Fine-Grained Fashion Retrieval.

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

InstanT: Semi-supervised Learning with Instance-dependent Thresholds.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-Distillation Dual-Memory Online Hashing with Hash Centers for Streaming Data Retrieval.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action Understanding.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Emotion-Prior Awareness Network for Emotional Video Captioning.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Semantics-Enriched Cross-Modal Alignment for Complex-Query Video Moment Retrieval.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Category-Level Articulated Object 9D Pose Estimation via Reinforcement Learning.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Modeling Multi-Relational Connectivity for Personalized Fashion Matching.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning Style-Invariant Robust Representation for Generalizable Visual Instance Retrieval.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Disentangled Representation Learning with Causality for Unsupervised Domain Adaptation.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Redundancy-aware Transformer for Video Question Answering.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Domain Generalized Stereo Matching via Hierarchical Visual Transformation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Self-Supervised Graph Learning for Long-Tailed Cognitive Diagnosis.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Introduction to the Special Section on Learning Representations, Similarity, and Associations in Dynamic Multimedia Environments.

ACM Trans. Multim. Comput. Commun. Appl., 2022

Topic-Guided Conversational Recommender in Multiple Domains.

IEEE Trans. Knowl. Data Eng., 2022

Video Moment Retrieval With Cross-Modal Neural Architecture Search.

IEEE Trans. Image Process., 2022

Dual Encoding for Video Retrieval by Text.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Partially Relevant Video Retrieval.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Modeling Field-Level Factor Interactions for Fashion Recommendation.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

2021

Fine-Grained Fashion Similarity Prediction by Attribute-Specific Embedding Learning.

IEEE Trans. Image Process., 2021

Semantic manifold modularization-based ranking for image recommendation.

Pattern Recognit., 2021

Progressive Localization Networks for Language-based Moment Localization.

CoRR, 2021

Deconfounded Video Moment Retrieval with Causal Intervention.

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Selective Dependency Aggregation for Action Classification.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

ADVM'21: 1st International Workshop on Adversarial Learning for Multimedia.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Interventional Video Relation Detection.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting.

Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

2020

Deep Neighborhood Component Analysis for Visual Similarity Modeling.

ACM Trans. Intell. Syst. Technol., 2020

Introduction to the Special Section on Contextual Object Analysis in Complex Scenes.

IEEE Trans. Circuits Syst. Video Technol., 2020

Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval.

Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Weakly-Supervised Video Object Grounding by Exploring Spatio-Temporal Contexts.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Knowledge Enhanced Neural Fashion Trend Forecasting.

Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Visual Relation Grounding in Videos.

Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Match on Graph for Fashion Compatibility Modeling.

Xun Yang

Xiaoyu Du

Meng Wang

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Person Reidentification via Structural Deep Metric Learning.

Xun Yang

Peicheng Zhou

Meng Wang

IEEE Trans. Neural Networks Learn. Syst., 2019

Deep Conversational Recommender in Travel.

CoRR, 2019

Interpretable Fashion Matching with Rich Attributes.

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Learning Using Privileged Information for Food Recognition.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Who, Where, and What to Wear?: Extracting Fashion Knowledge from Social Media.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Annotating Objects and Relations in User-Generated Videos.

Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Cross-modal Collaborative Manifold Propagation for Image Recommendation.

Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Progressive Image Enhancement under Aesthetic Guidance.

Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Multiple Hypothesis Video Relation Detection.

Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

TransNFCM: Translation-Based Neural Fashion Compatibility Modeling.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Person Re-Identification With Metric Learning Using Privileged Information.

Xun Yang

Meng Wang

Dacheng Tao

IEEE Trans. Image Process., 2018

2017

Saliency Detection on Light Field: A Multi-Cue Approach.

ACM Trans. Multim. Comput. Commun. Appl., 2017

Enhancing Person Re-identification in a Self-Trained Subspace.

ACM Trans. Multim. Comput. Commun. Appl., 2017

2016

An Efficient Tracking System by Orthogonalized Templates.

IEEE Trans. Ind. Electron., 2016

Empirical Risk Minimization for Metric Learning Using Privileged Information.

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015

Robust visual tracking via multi-graph ranking.

Xun Yang

Meng Wang

Dacheng Tao

Neurocomputing, 2015
