Zhihong Zhu

ORCID: 0009-0007-8954-6737

According to our database, Zhihong Zhu authored at least 77 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval.
CoRR, 2024

Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation.
CoRR, 2024

Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective.
CoRR, 2024

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
CoRR, 2024

D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models.
CoRR, 2024

UniMEEC: Towards Unified Multimodal Emotion Recognition and Emotion Cause.
CoRR, 2024

Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models.
CoRR, 2024

Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models.
CoRR, 2024

AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition.
CoRR, 2024

Dance with Labels: Dual-Heterogeneous Label Graph Interaction for Multi-intent Spoken Language Understanding.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Towards Multimodal-augmented Pre-trained Language Models via Self-balanced Expectation-Maximization Iteration.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

InMu-Net: Advancing Multi-modal Intent Detection via Information Bottleneck and Multi-sensory Processing.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MoBA: Mixture of Bi-directional Adapter for Multi-modal Sarcasm Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Aspects are Anchors: Towards Multimodal Aspect-based Sentiment Analysis via Aspect-driven Alignment and Refinement.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

XMeCap: Meme Caption Generation with Sub-Image Adaptability.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Multivariate Cooperative Game for Image-Report Pairs: Hierarchical Semantic Alignment for Medical Report Generation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Textual Inversion and Self-supervised Refinement for Radiology Report Generation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

TFCD: Towards Multi-modal Sarcasm Detection via Training-Free Counterfactual Debiasing.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Chinese Named Entity Recognition in the Ship News Field Based on Adversarial Transfer Learning.
Proceedings of the 2024 16th International Conference on Machine Learning and Computing, 2024

KC-Prompt: End-To-End Knowledge-Complementary Prompting for Rehearsal-Free Continual Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024

Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm Detection.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Mitigating Hallucinations of Large Language Models in Medical Information Extraction via Contrastive Decoding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

What are the Generator Preferences for End-to-end Task-Oriented Dialog System?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Learning to Match Representations is Better for End-to-End Task-Oriented Dialog System.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

UniMEEC: Towards Unified Multimodal Emotion Recognition and Emotion Cause.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval.
Proceedings of the Computer Vision - ECCV 2024, 2024

Alignment before Awareness: Towards Visual Question Localized-Answering in Robotic Surgery via Optimal Transport and Answer Semantics.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Zero-Shot Spoken Language Understanding via Large Language Models: A Preliminary Study.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

InfoEnh: Towards Multimodal Sentiment Analysis via Information Bottleneck Filter and Optimal Transport Alignment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

SaLa: Scenario-aware Label Graph Interaction for Multi-intent Spoken Language Understanding.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

MOAT: Graph Prompting for 3D Molecular Graphs.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

PIXEL: Prompt-based Zero-shot Hashing via Visual and Textual Semantic Alignment.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Aligner²: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Exploiting Auxiliary Caption for Video Grounding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
An ECG Signal Acquisition and Analysis System Based on Machine Learning with Model Fusion.
Sensors, September, 2023

Implicit smoothed particle hydrodynamics model for simulating incompressible fluid-elastic coupling.
Comput. Animat. Virtual Worlds, 2023

Improve Retrieval-based Dialogue System via Syntax-Informed Attention.
CoRR, 2023

Generating Templated Caption for Video Grounding.
CoRR, 2023

Mix before Align: Towards Zero-shot Cross-lingual Sentiment Analysis via Soft-Mix and Multi-View Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

FC-MTLF: A Fine- and Coarse-grained Multi-Task Learning Framework for Cross-Lingual Spoken Language Understanding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

C²A-SLU: Cross and Contrastive Attention for Improving ASR Robustness in Spoken Language Understanding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improved Notch Filter Method for Vibration Suppression of Flexible Joint Robots with Harmonic Reducers.
Proceedings of the Intelligent Robotics and Applications - 16th International Conference, 2023

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Dynamic Graph Interactive Framework with Label-Semantic Injection for Spoken Language Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Retrieval-Based Dialogue System Via Syntax-Informed Attention.
Proceedings of the IEEE International Conference on Acoustics, 2023

SSVMR: Saliency-Based Self-Training for Video-Music Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Grouping-Based Multi-task Scheduling Strategy with Deadline Constraint on Heterogeneous Edge Computing.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2023

Enhancing Code-Switching for Cross-lingual SLU: A Unified View of Semantic and Grammatical Coherence.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Syntax Matters: Towards Spoken Language Understanding via Syntax-Aware Attention.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MCLF: A Multi-grained Contrastive Learning Framework for ASR-robust Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Accelerating Multiple Intent Detection and Slot Filling via Targeted Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MRRL: Modifying the Reference via Reinforcement Learning for Non-Autoregressive Joint Multiple Intent Detection and Slot Filling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

TLAG: An Informative Trigger and Label-Aware Knowledge Guided Model for Dialogue-based Relation Extraction.
Proceedings of the 26th International Conference on Computer Supported Cooperative Work in Design, 2023

DAS-CL: Towards Multimodal Machine Translation via Dual-Level Asymmetric Contrastive Learning.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Towards Unified Spoken Language Understanding Decoding via Label-aware Compact Linguistics Representations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
An empirical study on the influencing factors of the stability of the coupling symbiosis network of industry-university-research.
J. Comput. Methods Sci. Eng., 2022

An Improved LightGBM Job Running Status Prediction Algorithm Integrating Combinatorial Feature Selection and Bayesian Hyperparameter Optimization on Spark.
Proceedings of the IEEE Smartworld, 2022

2021
Realization path of the stability of university-industry coupling symbiotic network.
J. Comput. Methods Sci. Eng., 2021

Research on Collision Detection and Collision Reaction of Collaborative Robots.
Proceedings of the Intelligent Robotics and Applications - 14th International Conference, 2021

Silicone Oil-Water Interaction and Emulsification Visual Simulation for Intraocular Silicone Oil Tamponade.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

