An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation.
CoRR, 2024

Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process.
CoRR, 2024

mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval.
CoRR, 2024

Learning Robust Named Entity Recognizers From Noisy Data With Retrieval Augmentation.
CoRR, 2024

Knowledge Mechanisms in Large Language Models: A Survey and Perspective.
CoRR, 2024

Retrieved In-Context Principles from Previous Mistakes.
CoRR, 2024

ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions.
CoRR, 2024

Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario.
CoRR, 2024

Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling.
CoRR, 2024

WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models.
CoRR, 2024

RaFe: Ranking Feedback Improves Query Rewriting for RAG.
CoRR, 2024

Agent Planning with World Knowledge Model.
CoRR, 2024

Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark.
CoRR, 2024

A Comprehensive Study of Knowledge Editing for Large Language Models.
CoRR, 2024

Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-Ranking.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

A Two-Stage Adaptation of Large Language Models for Text Ranking.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Three Heads Are Better than One: Improving Cross-Domain NER with Progressive Decomposed Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Fine-Grained Domain Adaptation for Chinese Syntactic Processing.
ACM Trans. Asian Low Resour. Lang. Inf. Process., November, 2023

EcomGPT-CT: Continual Pre-training of E-commerce Large Language Models with Semi-structured Data.
CoRR, 2023

RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement.
CoRR, 2023

Language Models are Universal Embedders.
CoRR, 2023

Editing Personality for LLMs.
CoRR, 2023

Hybrid Retrieval and Multi-stage Text Ranking Solution at TREC 2022 Deep Learning Track.
CoRR, 2023

Towards General Text Embeddings with Multi-stage Contrastive Learning.
CoRR, 2023

Improving Text Matching in E-Commerce Search with A Rationalizable, Intervenable and Fast Entity-Based Relevance Model.
CoRR, 2023

Bidirectional End-to-End Learning of Retriever-Reader Paradigm for Entity Linking.
CoRR, 2023

Challenging Decoder helps in Masked Auto-Encoder Pre-training for Dense Passage Retrieval.
CoRR, 2023

GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark.
CoRR, 2023

Zero-Shot Information Extraction via Chatting with ChatGPT.
CoRR, 2023

A Multi-Modal Geographic Pre-Training Method.
CoRR, 2023

MGeo: Multi-Modal Geographic Language Model Pre-Training.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition.
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

Text Representation Distillation via Information Bottleneck Principle.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

COMBO: A Complete Benchmark for Open KG Canonicalization.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Do PLMs Know and Understand Ontological Knowledge?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Recall, Expand, and Multi-Candidate Cross-Encode: Fast and Accurate Ultra-Fine Entity Typing.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Few-shot Classification with Hypersphere Modeling of Prototypes.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Exploring Lottery Prompts for Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Graph Propagation based Data Augmentation for Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Adversarial Self-Attention for Language Understanding.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Retrieval Oriented Masking Pre-training Language Model for Dense Passage Retrieval.
CoRR, 2022

EnTDA: Entity-to-Text based Data Augmentation Approach for Named Entity Recognition Tasks.
CoRR, 2022

HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking.
CoRR, 2022

Hybrid Retrieval and Multi-stage Ranking at TREC 2022 Deep Learning Track.
Proceedings of the Thirty-First Text REtrieval Conference, 2022

Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

DAMO-NLP at NLPCC-2022 Task 2: Knowledge Enhanced Robust NER for Speech Entity Linking.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

Robust Self-Augmentation for Named Entity Recognition with Meta Reweighting.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

AISHELL-NER: Named Entity Recognition from Chinese Speech.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Forging Multiple Training Objectives for Pre-trained Language Models via Meta-Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Named Entity and Relation Extraction with Multi-Modal Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence Labeling.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Modeling Label Correlations for Ultra-Fine Entity Typing with Neural Pairwise Conditional Random Field.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Prompt-learning for Fine-grained Entity Typing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Domain-Specific NER via Retrieving Correlated Samples.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Parallel Instance Query Network for Named Entity Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Prototypical Representation Learning for Relation Extraction.
Proceedings of the 9th International Conference on Learning Representations, 2021

Probing BERT in Hyperbolic Spaces.
Proceedings of the 9th International Conference on Learning Representations, 2021

A Fine-Grained Domain Adaption Model for Joint Word Segmentation and POS Tagging.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Few-NERD: A Few-shot Named Entity Recognition Dataset.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Counterfactual Inference for Text Classification Debiasing.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Knowledge-aware Named Entity Recognition with Alleviating Heterogeneity.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Keyphrase Extraction with Dynamic Graph Convolutional Networks and Diversified Inference.
CoRR, 2020

Learning with Noise: Improving Distantly-Supervised Fine-grained Entity Typing via Automatic Relabeling.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Hierarchy-Aware Global Model for Hierarchical Text Classification.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

DM_NLP at SemEval-2018 Task 12: A Pipeline System for Toponym Resolution.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Neural Chinese Address Parsing.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Better Modeling of Incomplete Annotations for Named Entity Recognition.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A Neural Multi-digraph Model for Chinese NER with Gazetteers.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

DM_NLP at SemEval-2018 Task 8: Neural Sequence Labeling with Linguistic Features.
Proceedings of the 12th International Workshop on Semantic Evaluation, 2018

Alibaba at IJCNLP-2017 Task 1: Embedding Grammatical Features into LSTMs for Chinese Grammatical Error Diagnosis Task.
Proceedings of the IJCNLP 2017, Shared Tasks, Taipei, Taiwan, November 27, 2017
