Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Towards Multimodal-augmented Pre-trained Language Models via Self-balanced Expectation-Maximization Iteration.

[BibT_eX]

[DOI]

Xianwei Zhuang

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

InMu-Net: Advancing Multi-modal Intent Detection via Information Bottleneck and Multi-sensory Processing.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model.

[BibT_eX]

[DOI]

Ziyu Yao

Xuxin Cheng

Zhiqi Huang

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Multivariate Cooperative Game for Image-Report Pairs: Hierarchical Semantic Alignment for Medical Report Generation.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Generating More Audios for End-to-End Spoken Language Understanding.

[BibT_eX]

[DOI]

Xuxin Cheng

Yuexian Zou

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Extreme Parkour with Legged Robots.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

PolyVoice: Language Models for Speech to Speech Translation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Retrieval is Accurate Generation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

KC-Prompt: End-To-End Knowledge-Complementary Prompting for Rehearsal-Free Continual Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

What are the Generator Preferences for End-to-end Task-Oriented Dialog System?

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Learning to Match Representations is Better for End-to-End Task-Oriented Dialog System.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

RAG-HAT: A Hallucination-Aware Tuning Pipeline for LLM in Retrieval-Augmented Generation.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Uncertainty-Aware Sign Language Video Retrieval with Probability Distribution Modeling.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Alignment before Awareness: Towards Visual Question Localized-Answering in Robotic Surgery via Optimal Transport and Answer Semantics.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Zero-Shot Spoken Language Understanding via Large Language Models: A Preliminary Study.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

SaLa: Scenario-aware Label Graph Interaction for Multi-intent Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Soul-Mix: Enhancing Multimodal Machine Translation with Manifold Mixup.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling.

[BibT_eX]

[DOI]

Xianwei Zhuang

Xuxin Cheng

Yuexian Zou

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Aligner²: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Exploiting Auxiliary Caption for Video Grounding.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

PolyVoice: Language Models for Speech to Speech Translation.

[BibT_eX]

[DOI]

CoRR, 2023

PoseRAC: Pose Saliency Transformer for Repetitive Action Counting.

[BibT_eX]

[DOI]

Ziyu Yao

Xuxin Cheng

Yuexian Zou

CoRR, 2023

Generating Templated Caption for Video Grounding.

[BibT_eX]

[DOI]

CoRR, 2023

Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mix before Align: Towards Zero-shot Cross-lingual Sentiment Analysis via Soft-Mix and Multi-View Learning.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

FC-MTLF: A Fine- and Coarse-grained Multi-Task Learning Framework for Cross-Lingual Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

C²A-SLU: Cross and Contrastive Attention for Improving ASR Robustness in Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Legs as Manipulator: Pushing Quadrupedal Agility Beyond Locomotion.

[BibT_eX]

[DOI]

Xuxin Cheng

Ashish Kumar

Deepak Pathak

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Dynamic Graph Interactive Framework with Label-Semantic Injection for Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

SSVMR: Saliency-Based Self-Training for Video-Music Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

M<sup>3</sup>ST: Mix at Three Levels for Speech Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Enhancing Code-Switching for Cross-lingual SLU: A Unified View of Semantic and Grammatical Coherence.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Syntax Matters: Towards Spoken Language Understanding via Syntax-Aware Attention.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MCLF: A Multi-grained Contrastive Learning Framework for ASR-robust Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Accelerating Multiple Intent Detection and Slot Filling via Targeted Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MRRL: Modifying the Reference via Reinforcement Learning for Non-Autoregressive Joint Multiple Intent Detection and Slot Filling.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DAS-CL: Towards Multimodal Machine Translation via Dual-Level Asymmetric Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Towards Unified Spoken Language Understanding Decoding via Label-aware Compact Linguistics Representations.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

M3ST: Mix at Three Levels for Speech Translation.

[BibT_eX]

[DOI]

CoRR, 2022

Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion.

[BibT_eX]

[DOI]

Zipeng Fu

Xuxin Cheng

Deepak Pathak

Proceedings of the Conference on Robot Learning, 2022

2021

Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

2020

Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

2019

Driving Decision and Control for Autonomous Lane Change based on Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Driving Decision and Control for Automated Lane Change Behavior based on Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Xuxin Cheng

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...