Fei Huang

ORCID: 0000-0002-3709-5053

Affiliations:
  • Alibaba Group, DAMO Academy, Institute of Intelligent Computing, China


According to our database, Fei Huang authored at least 281 papers between 2011 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
LORE++: Logical location regression network for table structure recognition with pre-training.
Pattern Recognit., 2025

2024
A Unified Conditional Diffusion Framework for Dual Protein Targets-Based Bioactive Molecule Generation.
IEEE Trans. Artif. Intell., September, 2024

Multilingual Named Entity Extraction and Translation from Text and Speech.
PhD thesis, 2024

A Survey on Out-of-Distribution Detection in NLP.
Trans. Mach. Learn. Res., 2024

Sequence Labeling as Non-Autoregressive Dual-Query Set Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Unifying Structured Data as Graph for Data-to-Text Pre-Training.
Trans. Assoc. Comput. Linguistics, 2024

Benchmarking Agentic Workflow Generation.
CoRR, 2024

In-Context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks.
CoRR, 2024

The Imperative of Conversation Analysis in the Era of LLMs: A Survey of Tasks, Techniques, and Trends.
CoRR, 2024

Qwen2.5-Coder Technical Report.
CoRR, 2024

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct.
CoRR, 2024

mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding.
CoRR, 2024

MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model.
CoRR, 2024

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models.
CoRR, 2024

Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement.
CoRR, 2024

mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval.
CoRR, 2024

Visual Text Generation in the Wild.
CoRR, 2024

Qwen2 Technical Report.
CoRR, 2024

Retrieved In-Context Principles from Previous Mistakes.
CoRR, 2024

ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions.
CoRR, 2024

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA.
CoRR, 2024

Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario.
CoRR, 2024

Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling.
CoRR, 2024

How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
CoRR, 2024

Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration.
CoRR, 2024

Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment.
CoRR, 2024

WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models.
CoRR, 2024

Agent Planning with World Knowledge Model.
CoRR, 2024

TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning.
CoRR, 2024

A Survey on Self-Evolution of Large Language Models.
CoRR, 2024

Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning.
CoRR, 2024

Fine-Tuning Language Models with Reward Learning on Policy.
CoRR, 2024

ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy.
CoRR, 2024

RoleInteract: Evaluating the Social Interaction of Role-Playing Agents.
CoRR, 2024

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding.
CoRR, 2024

Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection.
CoRR, 2024

Improving Cross-lingual Representation for Semantic Retrieval with Code-switching.
CoRR, 2024

Self-Retrieval: Building an Information Retrieval System with One Large Language Model.
CoRR, 2024

Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark.
CoRR, 2024

Effective Two-Stage Knowledge Transfer for Multi-Entity Cross-Domain Recommendation.
CoRR, 2024

Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement.
CoRR, 2024

AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis.
CoRR, 2024

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception.
CoRR, 2024

A Comprehensive Study of Knowledge Editing for Large Language Models.
CoRR, 2024

Editing Personality For Large Language Models.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

FactCHD: Benchmarking Fact-Conflicting Hallucination Detection.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Training-Free Long-Context Scaling of Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DialCLIP: Empowering Clip As Multi-Modal Dialog Retriever.
Proceedings of the IEEE International Conference on Acoustics, 2024

TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Knowledge Mechanisms in Large Language Models: A Survey and Perspective.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Retrieved In-Context Principles from Previous Mistakes.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Small LLMs Are Weak Tool Learners: A Multi-LLM Agent.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

RaFe: Ranking Feedback Improves Query Rewriting for RAG.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

MIBench: Evaluating Multimodal Large Language Models over Multiple Images.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Improving Factual Consistency of News Summarization by Contrastive Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Platypus: A Generalized Specialist Model for Reading Text in Various Forms.
Proceedings of the Computer Vision - ECCV 2024, 2024

Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-Ranking.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OMNIPARSER: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Hallucination Augmented Contrastive Learning for Multimodal Large Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Tree-Instruct: A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Out-of-Domain Intent Detection Considering Multi-Turn Dialogue Contexts.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Budget-Constrained Tool Learning with Planning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Speculative Contrastive Decoding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Iterative Forward Tuning Boosts In-Context Learning in Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

One-Shot Learning as Instruction Data Prospector for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SocialBench: Sociality Evaluation of Role-Playing Conversational Agents.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Model Composition for Multimodal Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Preference Ranking Optimization for Human Alignment.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Schema dependency-enhanced curriculum pre-training for table semantic parsing.
Knowl. Based Syst., February, 2023

Achieving Human Parity on Visual Question Answering.
ACM Trans. Inf. Syst., 2023

LOGEN: Few-Shot Logical Knowledge-Conditioned Text Generation With Self-Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

EcomGPT-CT: Continual Pre-training of E-commerce Large Language Models with Semi-structured Data.
CoRR, 2023

One Shot Learning as Instruction Data Prospector for Large Language Models.
CoRR, 2023

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration.
CoRR, 2023

Improving Factual Consistency of Text Summarization by Adversarially Decoupling Comprehension and Embellishment Abilities of LLMs.
CoRR, 2023

FactCHD: Benchmarking Fact-Conflicting Hallucination Detection.
CoRR, 2023

Constructive Large Language Models Alignment with Diverse Feedback.
CoRR, 2023

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.
CoRR, 2023

Editing Personality for LLMs.
CoRR, 2023

Qwen Technical Report.
CoRR, 2023

VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue.
CoRR, 2023

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models.
CoRR, 2023

A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment.
CoRR, 2023

Wider and Deeper LLM Networks are Fairer LLM Evaluators.
CoRR, 2023

CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility.
CoRR, 2023

PolyLM: An Open Source Polyglot Large Language Model.
CoRR, 2023

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding.
CoRR, 2023

Improving Text Matching in E-Commerce Search with A Rationalizable, Intervenable and Fast Entity-Based Relevance Model.
CoRR, 2023

DiffDTM: A conditional structure-free framework for bioactive molecules generation targeted for dual proteins.
CoRR, 2023

Bidirectional End-to-End Learning of Retriever-Reader Paradigm for Entity Linking.
CoRR, 2023

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks.
CoRR, 2023

Optimizing Non-Autoregressive Transformers with Contrastive Learning.
CoRR, 2023

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue in Multiple Domains.
CoRR, 2023

Iterative Forward Tuning Boosts In-context Learning in Language Models.
CoRR, 2023

Speech-Text Dialog Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment.
CoRR, 2023

GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark.
CoRR, 2023

Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation.
CoRR, 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition.
CoRR, 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs.
CoRR, 2023

Unsupervised Dialogue Topic Segmentation with Topic-aware Utterance Representation.
CoRR, 2023

Transforming Visual Scene Graphs to Image Captions.
CoRR, 2023

mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality.
CoRR, 2023

API-Bank: A Benchmark for Tool-Augmented LLMs.
CoRR, 2023

VECO 2.0: Cross-lingual Language Model Pre-training with Multi-granularity Contrastive Learning.
CoRR, 2023

ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human.
CoRR, 2023

RRHF: Rank Responses to Align Language Models with Human Feedback without tears.
CoRR, 2023

Revisiting Automatic Question Summarization Evaluation in the Biomedical Domain.
CoRR, 2023

RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training.
CoRR, 2023

Molecular Geometry-aware Transformer for accurate 3D Atomic System modeling.
CoRR, 2023

Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning.
CoRR, 2023

A Multi-Modal Geographic Pre-Training Method.
CoRR, 2023

Adaptively Clustering Neighbor Elements for Image Captioning.
CoRR, 2023

Large Language Models are Versatile Decomposers: Decomposing Evidence and Questions for Table-based Reasoning.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Unsupervised Dialogue Topic Segmentation with Topic-aware Contrastive Learning.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

MGeo: Multi-Modal Geographic Language Model Pre-Training.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition.
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

Knowledgeable Salient Span Mask for Enhancing Language Models as Knowledge Base.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

SPA: A Graph Spectral Alignment Perspective for Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Debiased and Denoised Entity Recognition from Distant Supervision.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

mPLUG-Octopus: The Versatile Assistant Empowered by A Modularized End-to-End Multimodal LLM.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

COPA: Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video.
Proceedings of the International Conference on Machine Learning, 2023

ICDAR 2023 Competition on Born Digital Video Text Question Answering.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Trajectory-Word Alignments for Video-Language Tasks.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

BUS: Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Causal Document-Grounded Dialogue Pre-training.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Knowledge Rumination for Pre-trained Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improving Question Generation with Multi-level Content Planning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Diversify Question Generation with Retrieval-Augmented Style Transfer.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Improving Seq2Seq Grammatical Error Correction via Decoding Interventions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

NaSGEC: a Multi-Domain Chinese Grammatical Error Correction Dataset from Native Speaker Texts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Transforming Visual Scene Graphs to Image Captions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Reasoning with Language Model Prompting: A Survey.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

CATS: A Pragmatic Chinese Answer-to-Sequence Dataset with Large Scale and High Quality.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

PEER: Pre-training ELECTRA Extended by Ranking.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Long-Tailed Question Answering in an Open World.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Domain Incremental Lifelong Learning in an Open World.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Universal Information Extraction with Meta-Pretrained Self-Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Unified Language Representation for Question Answering over Text, Tables, and Images.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Adversarial Self-Attention for Language Understanding.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Graphix-T5: Mixing Pre-trained Transformers with Graph-Aware Layers for Text-to-SQL Parsing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Enhancing Neural Machine Translation With Dual-Side Multimodal Awareness.
IEEE Trans. Multim., 2022

Low-resource extraction with knowledge-aware pairwise prototype learning.
Knowl. Based Syst., 2022

SeqDiffuSeq: Text Diffusion with Encoder-Decoder Transformers.
CoRR, 2022

EnTDA: Entity-to-Text based Data Augmentation Approach for Named Entity Recognition Tasks.
CoRR, 2022

SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation.
CoRR, 2022

A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future Directions.
CoRR, 2022

Proton: Probing Schema Linking Information from Pre-trained Language Models for Text-to-SQL Parsing.
CoRR, 2022

Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue System.
CoRR, 2022

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections.
CoRR, 2022

Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction.
CoRR, 2022

On Effectively Learning of Knowledge in Continual Pre-training.
CoRR, 2022

Image Captioning In the Transformer Age.
CoRR, 2022

Contrastive Demonstration Tuning for Pre-trained Language Models.
CoRR, 2022

DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population.
CoRR, 2022

Disentangled representation for sequential treatment effect estimation.
Comput. Methods Programs Biomed., 2022

KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Relation Extraction as Open-book Examination: Retrieval-enhanced Prompt Tuning.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Good Visual Guidance Make A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Proton: Probing Schema Linking Information from Pre-trained Language Models for Text-to-SQL Parsing.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue Systems.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Meta-Learning Based Knowledge Extrapolation for Knowledge Graphs in the Federated Setting.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners.
Proceedings of the Tenth International Conference on Learning Representations, 2022

AISHELL-NER: Named Entity Recognition from Chinese Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

Forging Multiple Training Objectives for Pre-trained Language Models via Meta-Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Dial2vec: Self-Guided Contrastive Learning of Unsupervised Dialogue Embeddings.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Estimating Soft Labels for Out-of-Domain Intent Detection.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Generalizable and Robust Text-to-SQL Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Doc2Bot: Accessing Heterogeneous Documents via Conversational Bots.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

STAR: SQL Guided Pre-Training for Context-dependent Text-to-SQL Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

SUN: Exploring Intrinsic Uncertainties in Text-to-SQL Parsers.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

SPACE-2: Tree-Structured Semi-Supervised Contrastive Pre-training for Task-Oriented Dialog Understanding.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

S$^4$-Tuning: A Simple Cross-lingual Sub-network Tuning Method.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Fusing Heterogeneous Factors with Triaffine Mechanism for Nested Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Parallel Instance Query Network for Named Entity Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Learning to Ask for Data-Efficient Event Argument Extraction (Student Abstract).
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-supervised Learning and Explicit Policy Injection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Contrastive Information Extraction With Generative Transformer.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition.
CoRR, 2021

Achieving Human Parity on Visual Question Answering.
CoRR, 2021

Learning to Ask for Data-Efficient Event Argument Extraction.
CoRR, 2021

CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark.
CoRR, 2021

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialog State Tracking.
CoRR, 2021

Normal vs. Adversarial: Salience-based Analysis of Adversarial Samples for Relation Extraction.
CoRR, 2021

Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing.
CoRR, 2021

Relational Learning with Gated and Attentive Neighbor Aggregator for Few-Shot Knowledge Graph Completion.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

A Unified Span-Based Approach for Opinion Mining with Syntactic Constituents.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Document-level Relation Extraction as Semantic Segmentation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Automatically Paraphrasing via Sentence Reconstruction and Round-trip Translation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Prototypical Representation Learning for Relation Extraction.
Proceedings of the 9th International Conference on Learning Representations, 2021

Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Rethinking Denoised Auto-Encoding in Language Pre-Training.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Unified Encoding of Structures in Transition Systems.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Word Reordering for Zero-shot Cross-lingual Structured Prediction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Improving Biomedical Pretrained Language Models with Knowledge.
Proceedings of the 20th Workshop on Biomedical Language Processing, 2021

E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Automated Concatenation of Embeddings for Structured Prediction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

VECO: Variable and Flexible Cross-lingual Pre-training for Language Understanding and Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

StructuralLM: Structural Pre-training for Form Understanding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Risk Minimization for Zero-shot Sequence Labeling.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Multi-View Cross-Lingual Structured Prediction with Minimum Supervision.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

OntoED: Low-resource Event Detection with Ontology Embedding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialogue State Tracking.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Contrastive Triple Extraction with Generative Transformer.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Bridging the Domain Gap: Improve Informal Language Translation via Counterfactual Domain Adaptation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Unsupervised Learning of Deterministic Dialogue Structure with Edge-Enhanced Graph Auto-Encoder.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Knowledge-aware Named Entity Recognition with Alleviating Heterogeneity.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Dynamic Hybrid Relation Exploration Network for Cross-Domain Context-Dependent Semantic Parsing.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Nested Named Entity Recognition with Partially-Observed TreeCRFs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
VECO: Variable Encoder-decoder Pre-training for Cross-lingual Understanding and Generation.
CoRR, 2020

Structural Knowledge Distillation.
CoRR, 2020

Fast and Accurate Sequence Labeling with Approximate Inference Network.
CoRR, 2020


OpenUE: An Open Toolkit of Universal Extraction from Text.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

An Investigation of Potential Function Designs for Neural CRF.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

More Embeddings, Better Sequence Labelers?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Structure-Level Knowledge Distillation For Multilingual Sequence Labeling.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Boundary Enhanced Neural Span Classification for Nested Named Entity Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Knowing What, How and Why: A Near Complete Solution for Aspect-Based Sentiment Analysis.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Noisy BiLSTM-Based Models for Disfluency Detection.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unsupervised Multi-Modal Neural Machine Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Alibaba Speech Translation Systems for IWSLT 2018.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

2016
Semi-supervised Convolutional Networks for Translation Adaptation with Tiny Amount of In-domain Data.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

Bilingual Methods for Adaptive Training Data Selection for Machine Translation.
Proceedings of the 12th Conferences of the Association for Machine Translation in the Americas: MT Researchers' Track, 2016

2011
Goodness: A Method for Measuring Machine Translation Confidence.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

