Hao Fei

ORCID: 0000-0003-3026-6347

Affiliations:
  • National University of Singapore, School of Computing, Sea-NExT Joint Lab, Singapore
  • Wuhan University, School of Cyber Science and Engineering, Key Laboratory of Aerospace Information Security and Trusted Computing, Wuhan, China (former)


According to our database, Hao Fei authored at least 134 papers between 2018 and 2024.

Bibliography

2024
Enhancing Video-Language Representations With Structural Spatio-Temporal Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

TKDP: Threefold Knowledge-Enriched Deep Prompt Tuning for Few-Shot Named Entity Recognition.
IEEE Trans. Knowl. Data Eng., November, 2024

Integrating discourse features and response assessment for advancing empathetic dialogue.
Inf. Process. Manag., 2024

Unified Generative and Discriminative Training for Multi-modal Large Language Models.
CoRR, 2024

Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image.
CoRR, 2024

Grounding is All You Need? Dual Temporal Grounding for Video Dialog.
CoRR, 2024

Grammar Induction from Visual, Speech and Text.
CoRR, 2024

Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration.
CoRR, 2024

ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models.
CoRR, 2024

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding.
CoRR, 2024

EmpathyEar: An Open-source Avatar Multimodal Empathetic Chatbot.
CoRR, 2024

Towards Semantic Equivalence of Tokenization in Multimodal LLM.
CoRR, 2024

Recognizing Everything from All Modalities at Once: Grounded Multimodal Universal Information Extraction.
CoRR, 2024

Modeling Unified Semantic Discourse Structure for High-quality Headline Generation.
CoRR, 2024

MMLSCU: A Dataset for Multi-modal Multi-domain Live Streaming Comment Understanding.
Proceedings of the ACM on Web Conference 2024, 2024

I3: Intent-Introspective Retrieval Conditioned on Instructions.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations.
Proceedings of the 18th International Workshop on Semantic Evaluation, 2024

Overview of the NLPCC 2024 Shared Task 3: Dialogue-Level Coreference Resolution and Relation Extraction.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Overview of the NLPCC 2024 Shared Task 2: Nominal Compound Chain Extraction.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Actively Learn from LLMs with Uncertainty Propagation for Generalized Category Discovery.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

The 2nd International Workshop on Deep Multi-modal Generation and Retrieval.
Proceedings of the 2nd International Workshop on Deep Multimodal Generation and Retrieval, 2024

Self-Adaptive Fine-grained Multi-modal Data Augmentation for Semi-supervised Multi-modal Coreference Resolution.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

The ACM Multimedia 2024 Visual Spatial Description Grand Challenge.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SpeechEE: A Novel Benchmark for Speech Event Extraction.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Multi-view Counterfactual Contrastive Learning for Fact-checking Fake News Detection.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

NExT-GPT: Any-to-Any Multimodal LLM.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

In-Context Learning for Few-Shot Nested Named Entity Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

XFashion: Character Animation Generation via Facial-enhanced and Granularly Controlling.
Proceedings of the 5th International Workshop on Human-centric Multimedia Analysis, 2024

Divide and Conquer: Legal Concept-guided Criminal Court View Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Guided Knowledge Generation with Language Models for Commonsense Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

A Survey of Ontology Expansion for Conversational Understanding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Dysen-VDM: Empowering Dynamics-Aware Text-to-Video Diffusion with LLMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

What Factors Influence LLMs' Judgments? A Case Study on Question Answering.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Revisiting Structured Sentiment Analysis as Latent Dependency Graph Parsing.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Recognizing Everything from All Modalities at Once: Grounded Multimodal Universal Information Extraction.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Faithful Logical Reasoning via Symbolic Chain-of-Thought.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ProtT3: Protein-to-Text Generation for Text-based Protein Understanding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Synergizing Large Language Models and Pre-Trained Smaller Models for Conversational Intent Discovery.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Harnessing Holistic Discourse Features and Triadic Interaction for Sentiment Quadruple Extraction in Dialogues.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Knowledge-enhanced event relation extraction via event ontology prompt.
Inf. Fusion, December, 2023

Nonautoregressive Encoder-Decoder Neural Framework for End-to-End Aspect-Based Sentiment Triplet Extraction.
IEEE Trans. Neural Networks Learn. Syst., September, 2023

Syntax-based dynamic latent graph for event relation extraction.
Inf. Process. Manag., September, 2023

On the Robustness of Aspect-based Sentiment Analysis: Rethinking Model, Data, and Training.
ACM Trans. Inf. Syst., April, 2023

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback.
CoRR, 2023

Towards Complex-query Referring Image Segmentation: A Novel Benchmark.
CoRR, 2023

Empowering Dynamics-aware Text-to-Video Diffusion with Large Language Models.
CoRR, 2023

ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval.
CoRR, 2023

DialogRE^C+: An Extension of DialogRE to Investigate How Much Coreference Helps Relation Extraction in Dialogs.
CoRR, 2023

XNLP: An Interactive Demonstration System for Universal Structured NLP.
CoRR, 2023

Revisiting Conversation Discourse for Dialogue Disentanglement.
CoRR, 2023

ECQED: Emotion-Cause Quadruple Extraction in Dialogs.
CoRR, 2023

Transfer Visual Prompt Generator across LLMs.
CoRR, 2023

DialogRE^C+: An Extension of DialogRE to Investigate How Much Coreference Helps Relation Extraction in Dialogs.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

VPGTrans: Transfer Visual Prompt Generator across LLMs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Imagine That! Abstract-to-Intricate Text-to-Image Synthesis with Scene Graph Hallucination Diffusion.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic Role Labeling.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Deep Multimodal Learning for Information Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Partial Annotation-based Video Moment Retrieval via Iterative Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Generating Visual Spatial Description via Holistic 3D Scene Understanding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

DiaASQ: A Benchmark of Conversational Aspect-based Sentiment Quadruple Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Constructing Code-mixed Universal Dependency Forest for Unbiased Cross-lingual Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Reasoning Implicit Sentiment with Chain-of-Thought Prompting.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Optimizing Attention for Sequence Modeling via Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., 2022

A semantic and syntactic enhanced neural model for financial sentiment analysis.
Inf. Process. Manag., 2022

Pair-wise aspect and opinion terms extraction as graph parsing via a novel mutually-aware interaction mechanism.
Neurocomputing, 2022

OneEE: A One-Stage Framework for Fast Overlapping and Nested Event Extraction.
CoRR, 2022

Subband-based Generative Adversarial Network for Non-parallel Many-to-many Voice Conversion.
CoRR, 2022

Making Decision like Human: Joint Aspect Category Sentiment Analysis and Rating Prediction with Fine-to-Coarse Reasoning.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Mutual Disentanglement Learning for Joint Fine-Grained Sentiment Classification and Controllable Text Generation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Prompt-Based Generative Multi-label Emotion Prediction with Label Contrastive Learning.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Conversational Semantic Role Labeling with Predicate-Oriented Latent Graph.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Global Inference with Explicit Syntactic and Discourse Structures for Dialogue-Level Relation Extraction.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Inheriting the Wisdom of Predecessors: A Multiplex Cascade Framework for Unified Aspect-based Sentiment Analysis.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Matching Structure for Dual Learning.
Proceedings of the International Conference on Machine Learning, 2022

Entity-centered Cross-document Relation Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Conversation Disentanglement with Bi-Level Contrastive Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Joint Alignment of Multi-Task Feature and Label Spaces for Emotion Cause Pair Extraction.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

OneEE: A One-Stage Framework for Fast Overlapping and Nested Event Extraction.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Effective Token Graph Modeling using a Novel Labeling Strategy for Structured Sentiment Analysis.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Mastering the Explicit Opinion-Role Interaction: Syntax-Aided Neural Transition System for Unified Opinion Role Labeling.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Unified Named Entity Recognition as Word-Word Relation Classification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
High-Order Pair-Wise Aspect and Opinion Terms Extraction With Edge-Enhanced Syntactic Graph Convolution.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Second-Order Semantic Role Labeling With Global Structural Refinement.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Adversarial shared-private model for cross-domain clinical text entailment recognition.
Knowl. Based Syst., 2021

Learn from Syntax: Improving Pair-wise Aspect and Opinion Terms Extraction with Rich Syntactic Knowledge.
CoRR, 2021

A span-graph neural model for overlapping entity relation extraction in biomedical texts.
Bioinform., 2021

Enriching contextualized language model from knowledge graph for biomedical information extraction.
Briefings Bioinform., 2021

Latent Target-Opinion as Prior for Document-Level Sentiment Classification: A Variational Approach from Fine-Grained Perspective.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Learn from Syntax: Improving Pair-wise Aspect and Opinion Terms Extraction with Rich Syntactic Knowledge.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

MRN: A Locally and Globally Mention-Based Reasoning Network for Document-Level Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Better Combine Them Together! Integrating Syntactic Constituency and Dependency Representations for Semantic Role Labeling.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

End-to-end Semantic Role Labeling with Neural Transition-based Model.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Encoder-Decoder Based Unified Semantic Role Labeling with Label-Aware Syntax.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Rethinking Boundaries: End-To-End Recognition of Discontinuous Mentions with Pointer Networks.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Cross-Lingual Semantic Role Labeling With Model Transfer.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Topic-Enhanced Capsule Network for Multi-Label Emotion Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Encoding multi-granularity structural information for joint Chinese word segmentation and POS tagging.
Pattern Recognit. Lett., 2020

Electronic word-of-mouth effects on studio performance leveraging attention-based model.
Neural Comput. Appl., 2020

Dispatched attention with multi-task learning for nested mention recognition.
Inf. Sci., 2020

A tree-based neural network model for biomedical event trigger detection.
Inf. Sci., 2020

An end-to-end joint model for evidence information extraction from court record document.
Inf. Process. Manag., 2020

A deep neural network model for speakers coreference resolution in legal texts.
Inf. Process. Manag., 2020

Boundaries and edges rethinking: An end-to-end neural model for overlapping entity relation extraction.
Inf. Process. Manag., 2020

Negation and speculation scope detection using recursive neural conditional random fields.
Neurocomputing, 2020

Aggressive Language Detection with Joint Text Normalization via Adversarial Multi-task Learning.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

Nominal Compound Chain Extraction: A New Task for Semantic-Enriched Lexical Chain.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

High-order Refining for End-to-end Chinese Semantic Role Labeling.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Retrofitting Structure-aware Transformer Language Model for End Tasks.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Mimic and Conquer: Heterogeneous Tree Structure Distillation for Syntactic NLP.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Improving Text Understanding via Deep Syntax-Semantics Communication.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Modeling Local Contexts for Joint Dialogue Act Recognition and Sentiment Classification with Bi-channel Dynamic Convolutions.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Latent Emotion Memory for Multi-Label Emotion Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A hybrid neural network model for predicting kidney disease in hypertension patients based on electronic health records.
BMC Medical Informatics Decis. Mak., 2019

Implicit Objective Network for Emotion Detection.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Drug-Drug Interaction Extraction Using a Span-based Neural Network Model.
Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

Recognizing Nested Named Entity in Biomedical Texts: A Neural Network Model with Multi-Task Learning.
Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

2018
Neural Networks for Bacterial Named Entity Recognition.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

Detecting the Scope of Negation and Speculation in Biomedical Texts by Using Recursive Neural Network.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

