Xuxin Cheng

Orcid: 0009-0002-6244-2931

According to our database1, Xuxin Cheng authored at least 73 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Helpful DoggyBot: Open-World Object Fetching using Legged Robots and Vision-Language Models.
CoRR, 2024

DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval.
CoRR, 2024

Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation.
CoRR, 2024

ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation.
CoRR, 2024

MMRA: A Benchmark for Multi-granularity Multi-image Relational Association.
CoRR, 2024

Open-TeleVision: Teleoperation with Immersive Active Visual Feedback.
CoRR, 2024

CLEME2.0: Towards More Interpretable Evaluation by Disentangling Edits for Grammatical Error Correction.
CoRR, 2024

EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction.
CoRR, 2024

WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD.
CoRR, 2024

Visual Whole-Body Control for Legged Loco-Manipulation.
CoRR, 2024

Expressive Whole-Body Control for Humanoid Robots.
CoRR, 2024

Dance with Labels: Dual-Heterogeneous Label Graph Interaction for Multi-intent Spoken Language Understanding.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Towards Multimodal-augmented Pre-trained Language Models via Self-balanced Expectation-Maximization Iteration.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

InMu-Net: Advancing Multi-modal Intent Detection via Information Bottleneck and Multi-sensory Processing.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Multivariate Cooperative Game for Image-Report Pairs: Hierarchical Semantic Alignment for Medical Report Generation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Generating More Audios for End-to-End Spoken Language Understanding.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Extreme Parkour with Legged Robots.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

PolyVoice: Language Models for Speech to Speech Translation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Retrieval is Accurate Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

KC-Prompt: End-To-End Knowledge-Complementary Prompting for Rehearsal-Free Continual Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024

What are the Generator Preferences for End-to-end Task-Oriented Dialog System?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Learning to Match Representations is Better for End-to-End Task-Oriented Dialog System.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

RAG-HAT: A Hallucination-Aware Tuning Pipeline for LLM in Retrieval-Augmented Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval.
Proceedings of the Computer Vision - ECCV 2024, 2024

Uncertainty-Aware Sign Language Video Retrieval with Probability Distribution Modeling.
Proceedings of the Computer Vision - ECCV 2024, 2024

Alignment before Awareness: Towards Visual Question Localized-Answering in Robotic Surgery via Optimal Transport and Answer Semantics.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Zero-Shot Spoken Language Understanding via Large Language Models: A Preliminary Study.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

SaLa: Scenario-aware Label Graph Interaction for Multi-intent Spoken Language Understanding.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Soul-Mix: Enhancing Multimodal Machine Translation with Manifold Mixup.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Aligner²: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Exploiting Auxiliary Caption for Video Grounding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
PolyVoice: Language Models for Speech to Speech Translation.
CoRR, 2023

PoseRAC: Pose Saliency Transformer for Repetitive Action Counting.
CoRR, 2023

Generating Templated Caption for Video Grounding.
CoRR, 2023

Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mix before Align: Towards Zero-shot Cross-lingual Sentiment Analysis via Soft-Mix and Multi-View Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

FC-MTLF: A Fine- and Coarse-grained Multi-Task Learning Framework for Cross-Lingual Spoken Language Understanding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

C²A-SLU: Cross and Contrastive Attention for Improving ASR Robustness in Spoken Language Understanding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Legs as Manipulator: Pushing Quadrupedal Agility Beyond Locomotion.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Dynamic Graph Interactive Framework with Label-Semantic Injection for Spoken Language Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2023

SSVMR: Saliency-Based Self-Training for Video-Music Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

M<sup>3</sup>ST: Mix at Three Levels for Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Enhancing Code-Switching for Cross-lingual SLU: A Unified View of Semantic and Grammatical Coherence.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Syntax Matters: Towards Spoken Language Understanding via Syntax-Aware Attention.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MCLF: A Multi-grained Contrastive Learning Framework for ASR-robust Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Accelerating Multiple Intent Detection and Slot Filling via Targeted Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MRRL: Modifying the Reference via Reinforcement Learning for Non-Autoregressive Joint Multiple Intent Detection and Slot Filling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DAS-CL: Towards Multimodal Machine Translation via Dual-Level Asymmetric Contrastive Learning.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Towards Unified Spoken Language Understanding Decoding via Label-aware Compact Linguistics Representations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
M3ST: Mix at Three Levels for Speech Translation.
CoRR, 2022

Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion.
Proceedings of the Conference on Robot Learning, 2022

2021
Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

2020
Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning.
CoRR, 2020

Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

2019
Driving Decision and Control for Autonomous Lane Change based on Deep Reinforcement Learning.
CoRR, 2019

Driving Decision and Control for Automated Lane Change Behavior based on Deep Reinforcement Learning.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019


  Loading...