Zhiyuan Liu

Orcid: 0000-0002-7709-2543

Affiliations:
  • Tsinghua University, Department of Computer Science and Technology, Beijing, China
  • Tencent Inc, AI Platform Department, Shenzhen, China


According to our database1, Zhiyuan Liu authored at least 519 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
D-Bot: Database Diagnosis System using Large Language Models.
Proc. VLDB Endow., June, 2024

Exploring Format Consistency for Instruction Tuning.
Trans. Mach. Learn. Res., 2024

Exploring Universal Intrinsic Task Subspace for Few-Shot Learning via Prompt Tuning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Hyperbolic Pre-Trained Language Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards.
CoRR, 2024

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance.
CoRR, 2024

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents.
CoRR, 2024

LLM⨉MapReduce: Simplified Long-Sequence Processing using Large Language Models.
CoRR, 2024

Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation.
CoRR, 2024

Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System.
CoRR, 2024

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling.
CoRR, 2024

DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models.
CoRR, 2024

Text-guided Diffusion Model for 3D Molecule Generation.
CoRR, 2024

Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads.
CoRR, 2024

Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models.
CoRR, 2024

Enabling Real-Time Conversations with Minimal Training Costs.
CoRR, 2024

Awaking the Slides: A Tuning-free and Knowledge-regulated AI Tutoring System via Language Model Coordination.
CoRR, 2024

From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents.
CoRR, 2024

Configurable Foundation Models: Building LLMs from a Modular Perspective.
CoRR, 2024

Multi-Modal Multi-Granularity Tokenizer for Chu Bamboo Slip Scripts.
CoRR, 2024

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation.
CoRR, 2024

AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems.
CoRR, 2024

Enhancing the Code Debugging Ability of LLMs via Communicative Agent Based Data Refinement.
CoRR, 2024

MiniCPM-V: A GPT-4V Level MLLM on Your Phone.
CoRR, 2024

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework.
CoRR, 2024

PyBench: Evaluating LLM Agent on various real-world coding tasks.
CoRR, 2024

PersLLM: A Personified Training Approach for Large Language Models.
CoRR, 2024

States Hidden in Hidden States: LLMs Emerge Discrete State Representations Implicitly.
CoRR, 2024

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence.
CoRR, 2024

Simulating Classroom Education with LLM-Empowered Agents.
CoRR, 2024

PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes.
CoRR, 2024

Scaling Efficient Masked Autoencoder Learning on Large Remote Sensing Dataset.
CoRR, 2024

Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity.
CoRR, 2024

GUICourse: From General Vision Language Models to Versatile GUI Agents.
CoRR, 2024

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models.
CoRR, 2024

Scaling Large-Language-Model-based Multi-Agent Collaboration.
CoRR, 2024

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis.
CoRR, 2024

UltraMedical: Building Specialized Generalists in Biomedicine.
CoRR, 2024

Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training.
CoRR, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness.
CoRR, 2024

Iterative Experience Refinement of Software-Developing Agents.
CoRR, 2024

LEGENT: Open Platform for Embodied Agents.
CoRR, 2024

UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs.
CoRR, 2024

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies.
CoRR, 2024

Advancing LLM Reasoning Generalists with Preference Trees.
CoRR, 2024

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images.
CoRR, 2024

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences.
CoRR, 2024

Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models.
CoRR, 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment.
CoRR, 2024

RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation.
CoRR, 2024

Unified View of Grokking, Double Descent and Emergent Abilities: A Perspective from Circuits Competition.
CoRR, 2024

Cleaner Pretraining Corpus Curation with Neural Web Scraping.
CoRR, 2024

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting.
CoRR, 2024

∞Bench: Extending Long Context Evaluation Beyond 100K Tokens.
CoRR, 2024

ActiveRAG: Revealing the Treasures of Knowledge via Active Learning.
CoRR, 2024

ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models.
CoRR, 2024

OneBit: Towards Extremely Low-bit Large Language Models.
CoRR, 2024

InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory.
CoRR, 2024

ReLU<sup>2</sup> Wins: Discovering Efficient Activation Functions for Sparse LLMs.
CoRR, 2024

UniMem: Towards a Unified View of Long-Context Large Language Models.
CoRR, 2024

Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution.
CoRR, 2024

DebugBench: Evaluating Debugging Capability of Large Language Models.
CoRR, 2024

CPT: Colorful Prompt Tuning for pre-trained vision-language models.
AI Open, 2024

Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature.
AI Open, 2024

Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Exploring the Benefit of Activation Sparsity in Pre-training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

NExT-Chat: An LMM for Chat, Detection and Segmentation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Towards 3D Molecule-Text Interpretation in Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Predicting Emergent Abilities with Infinite Resolution Evaluation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Chain of Thought Empowers Transformers to Solve Inherently Serial Problems.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Large Language Model-based Human-Agent Collaboration for Complex Task Solving.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

LLaVA-UHD: An LMM Perceiving Any Aspect Ratio and High-Resolution Images.
Proceedings of the Computer Vision - ECCV 2024, 2024

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-Grained Correctional Human Feedback.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ABCF: An Adaptive Balanced Multimodal Website Classification Framework.
Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024

Fusion-in-T5: Unifying Variant Signals for Simple and Effective Document Ranking with Attention Fusion.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Robust and Scalable Model Editing for Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Empowering Private Tutoring by Chaining Large Language Models.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ınftyBench: Extending Long Context Evaluation Beyond 100K Tokens.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

MAVEN-ARG: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

DebugBench: Evaluating Debugging Capability of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

ChatDev: Communicative Agents for Software Development.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Experiential Co-Learning of Software-Developing Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ProtT3: Protein-to-Text Generation for Text-based Protein Understanding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ReactXT: Understanding Molecular "Reaction-ship" via Reaction-Contextualized Molecule-Text Pretraining.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

MolTC: Towards Molecular Relational Modeling In Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Full-Scale Information Diffusion Prediction With Reinforced Recurrent Networks.
IEEE Trans. Neural Networks Learn. Syst., May, 2023

Event Detection With Dynamic Word-Trigger-Argument Graph Neural Networks.
IEEE Trans. Knowl. Data Eng., April, 2023

Red Alarm for Pre-trained Models: Universal Vulnerability to Neuron-level Backdoor Attacks.
Mach. Intell. Res., April, 2023

Parameter-efficient fine-tuning of large-scale pre-trained language models.
Nat. Mac. Intell., March, 2023

Improving task generalization via unified schema prompt.
AI Open, January, 2023

UPRec: User-aware Pre-training for sequential Recommendation.
AI Open, January, 2023

Removing Backdoors in Pre-trained Models by Regularized Continual Pre-training.
Trans. Assoc. Comput. Linguistics, 2023

Sub-Character Tokenization for Chinese Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2023

GitAgent: Facilitating Autonomous Agent with GitHub by Tool Extension.
CoRR, 2023

Experiential Co-Learning of Software-Developing Agents.
CoRR, 2023

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback.
CoRR, 2023

ProAgent: From Robotic Process Automation to Agentic Process Automation.
CoRR, 2023

INTERVENOR: Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing.
CoRR, 2023

Distributionally Robust Unsupervised Dense Retrieval Training on Web Graphs.
CoRR, 2023

Unlock Multi-Modal Capability of Dense Retrieval via Visual Module Plugin.
CoRR, 2023

Thoroughly Modeling Multi-domain Pre-trained Recommendation as Language.
CoRR, 2023

Unlock Predictable Scaling from Emergent Abilities.
CoRR, 2023

UltraFeedback: Boosting Language Models with High-quality Feedback.
CoRR, 2023

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants.
CoRR, 2023

ConPET: Continual Parameter-Efficient Tuning for Large Language Models.
CoRR, 2023

Large Language Model as Autonomous Decision Maker.
CoRR, 2023

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents.
CoRR, 2023

LLM As DBA.
CoRR, 2023

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs.
CoRR, 2023

Communicative Agents for Software Development.
CoRR, 2023

CPET: Effective Parameter-Efficient Tuning for Compressed Large Language Models.
CoRR, 2023

Interactive Molecular Discovery with Natural Language.
CoRR, 2023

Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations.
CoRR, 2023

Arbitrary Few Parameters are Good Enough for Adapting Large-scale Pre-trained Language Models.
CoRR, 2023

Fusion-in-T5: Unifying Document Ranking Signals for Improved Information Retrieval.
CoRR, 2023

CREATOR: Disentangling Abstract and Concrete Reasonings of Large Language Models through Tool Creation.
CoRR, 2023

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations.
CoRR, 2023

Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots.
CoRR, 2023

Transfer Visual Prompt Generator across LLMs.
CoRR, 2023

Tool Learning with Foundation Models.
CoRR, 2023

Rethinking Dense Retrieval's Few-Shot Ability.
CoRR, 2023

Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder.
CoRR, 2023

Human Emotion Knowledge Representation Emerges in Large Language Model and Supports Discrete Emotion Inference.
CoRR, 2023

Knowledge-Adaptive Contrastive Learning for Recommendation.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

OpenMatch-v2: An All-in-one Multi-Modality PLM-based Information Retrieval Toolkit.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

VPGTrans: Transfer Visual Prompt Generator across LLMs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GLM-130B: An Open Bilingual Pre-trained Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Exploring the Impact of Model Scaling on Parameter-Efficient Tuning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Sparse Low-rank Adaptation of Pre-trained Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Text Matching Improves Sequential Recommendation by Reducing Popularity Biases.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

MUSER: A Multi-View Similar Case Retrieval Dataset.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Emergent Modularity in Pre-trained Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Plug-and-Play Knowledge Injection for Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Plug-and-Play Document Modules for Pre-trained Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Recyclable Tuning for Continual Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

WebCPM: Interactive Web Search for Chinese Long-form Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

The Devil is in the Details: On the Pitfalls of Event Extraction Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Parameter-efficient Weight Ensembling Facilitates Task-level Knowledge Transfer.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Won't Get Fooled Again: Answering Questions with False Premises.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

Few-shot Classification with Hypersphere Modeling of Prototypes.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Decoder Tuning: Efficient Language Understanding as Decoding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

A Close Look into the Calibration of Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Stochastic Bridges as Effective Regularizers for Parameter-Efficient Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Exploring Lottery Prompts for Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Visually Grounded Commonsense Knowledge Acquisition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Spatial relation learning in complementary scenarios with deep neural networks.
Frontiers Neurorobotics, September, 2022

PTR: Prompt Tuning with Rules for Text Classification.
AI Open, January, 2022

A Unified Understanding of Deep NLP Models for Text Classification.
IEEE Trans. Vis. Comput. Graph., 2022

COSINE: Compressive Network Embedding on Large-Scale Information Networks.
IEEE Trans. Knowl. Data Eng., 2022

RHINE: Relation Structure-Aware Heterogeneous Information Network Embedding.
IEEE Trans. Knowl. Data Eng., 2022

Guest Editorial: Special Issue on Social Media Computing.
IEEE Trans. Big Data, 2022

Improved Transformer With Multi-Head Dense Collaboration.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Delta Tuning.
CoRR, 2022

Universal Multi-Modality Retrieval with One Unified Embedding Space.
CoRR, 2022

GACT: Activation Compressed Training for General Architectures.
CoRR, 2022

Sparse Structure Search for Parameter-Efficient Tuning.
CoRR, 2022

P<sup>3</sup> Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning.
CoRR, 2022

Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models.
CoRR, 2022

Training Free Graph Neural Networks for Graph Matching.
CoRR, 2022

P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sparse Structure Search for Delta Tuning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ProQA: Structural Prompt-based Pre-training for Unified Question Answering.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

On Transferability of Prompt Tuning for Natural Language Processing.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Knowledge Inheritance for Pre-trained Language Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Exploring the Universal Vulnerability of Prompt-based Learning Paradigm.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

GACT: Activation Compressed Training for Generic Network Architectures.
Proceedings of the International Conference on Machine Learning, 2022

Effective Few-Shot Named Entity Linking by Meta-Learning.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

BMCook: A Task-agnostic Compression Toolkit for Big Models.
Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Parameter-Efficient Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Finding Skill Neurons in Pre-trained Transformer-based Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subevent Relation Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Negatives.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Exploring Mode Connectivity for Pre-trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

COPEN: Probing Conceptual Knowledge in Pre-trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

FPT: Improving Prompt Tuning Efficiency via Progressive Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Textual Backdoor Attacks Can Be More Harmful via Two Simple Tricks.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Prompt-learning for Fine-grained Entity Typing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Fine-Grained Scene Graph Generation with Data Transfer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Pass off Fish Eyes for Pearls: Attacking Model Selection of Pre-trained Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MoEfication: Transformer Feed-forward Layers are Mixtures of Experts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Going "Deeper": Structured Sememe Prediction via Transformer with Tree Attention.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

LEVEN: A Large-Scale Chinese Legal Event Detection Dataset.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Prompt Tuning for Discriminative Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

ELLE: Efficient Lifelong Pre-training for Emerging Data.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

QuoteR: A Benchmark of Quote Recommendation for Writing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Sememe Prediction for BabelNet Synsets using Multilingual and Multimodal Information.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Do Pre-trained Models Benefit Knowledge Graph Completion? A Reliable Evaluation and a Reasonable Approach.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

BMInf: An Efficient Toolkit for Big Model Inference and Tuning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

Cross-Lingual Contrastive Learning for Fine-Grained Entity Typing for Low-Resource Languages.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

PPT: Pre-trained Prompt Tuning for Few-shot Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

OpenPrompt: An Open-source Framework for Prompt-learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

Prototypical Verbalizer for Prompt-based Few-shot Tuning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

bert2BERT: Towards Reusable Pretrained Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Fully Hyperbolic Neural Networks.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Program Transfer for Answering Complex Questions over Knowledge Bases.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Video as Conditional Graph Hierarchy for Multi-Granular Question Answering.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Network Embedding: Theories, Methods, and Applications
Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, ISBN: 978-3-031-01590-8, 2021

Neural Diffusion Model for Microscopic Cascade Study.
IEEE Trans. Knowl. Data Eng., 2021

CED: Credible Early Detection of Social Media Rumors.
IEEE Trans. Knowl. Data Eng., 2021

Country Image in COVID-19 Pandemic: A Case Study of China.
IEEE Trans. Big Data, 2021

CSS-LM: A Contrastive Framework for Semi-Supervised Fine-Tuning of Pre-Trained Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation.
Trans. Assoc. Comput. Linguistics, 2021

From Symbols to Embeddings: A Tale of Two Representations in Computational Social Science.
J. Soc. Comput., 2021

Text-Graph Enhanced Knowledge Graph Representation Learning.
Frontiers Artif. Intell., 2021

Knowledge Transfer via Pre-training for Recommendation: A Review and Prospect.
Frontiers Big Data, 2021

Sememe knowledge computation: a review of recent advances in application and expansion of sememe knowledge bases.
Frontiers Comput. Sci., 2021

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark.
CoRR, 2021

On Transferability of Prompt Tuning for Natural Language Understanding.
CoRR, 2021

Textual Backdoor Attacks Can Be More Harmful via Two Simple Tricks.
CoRR, 2021

Exploring Low-dimensional Intrinsic Task Subspace via Prompt Tuning.
CoRR, 2021

MoEfication: Conditional Computation of Transformer Models for Efficient Inference.
CoRR, 2021

Prompt-Learning for Fine-Grained Entity Typing.
CoRR, 2021

Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification.
CoRR, 2021

Evaluating Modules in Graph Contrastive Learning.
CoRR, 2021

Pre-Trained Models: Past, Present and Future.
CoRR, 2021

SHUOWEN-JIEZI: Linguistically Informed Tokenizers For Chinese Language Model Pretraining.
CoRR, 2021

Equality before the Law: Legal Judgment Consistency Analysis for Fairness.
CoRR, 2021

UPRec: User-Aware Pre-training for Recommender Systems.
CoRR, 2021

CSS-LM: A Contrastive Framework for Semi-supervised Fine-tuning of Pre-trained Language Models.
CoRR, 2021

Representation Learning for Natural Language Processing.
CoRR, 2021

OpenMatch: An Open-Source Package for Information Retrieval.
CoRR, 2021

Red Alarm for Pre-trained Models: Universal Vulnerabilities by Neuron-Level Backdoor Attacks.
CoRR, 2021

Knowledgeable machine learning for natural language processing.
Commun. ACM, 2021

Know what you don't need: Single-Shot Meta-Pruning for attention heads.
AI Open, 2021

CPM: A large-scale generative Chinese Pre-trained language model.
AI Open, 2021

CPM-2: Large-scale cost-effective pre-trained language models.
AI Open, 2021

Lawformer: A pre-trained language model for Chinese legal long documents.
AI Open, 2021

CokeBERT: Contextual knowledge selection and embedding towards enhanced pre-trained language models.
AI Open, 2021

Pre-trained models: Past, present and future.
AI Open, 2021

Few-Shot Conversational Dense Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

OpenMatch: An Open Source Library for Neu-IR Research.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Capturing Global Informativeness in Open Domain Keyphrase Extraction.
Proceedings of the Natural Language Processing and Chinese Computing, 2021

Open Hierarchical Relation Extraction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

EnglishBot: An AI-Powered Conversational System for Second Language Learning.
Proceedings of the IUI '21: 26th International Conference on Intelligent User Interfaces, 2021

Robotic Occlusion Reasoning for Efficient Object Existence Prediction.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

More Robust Dense Retrieval with Contrastive Dual Learning.
Proceedings of the ICTIR '21: The 2021 ACM SIGIR International Conference on the Theory of Information Retrieval, 2021

Visual Distant Supervision for Scene Graph Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

CodRED: A Cross-Document Relation Extraction Dataset for Acquiring Knowledge in the Wild.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

TIAGE: A Benchmark for Topic-Shift Aware Dialog Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ONION: A Simple and Effective Defense Against Textual Backdoor Attacks.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

OpenAttack: An Open-source Textual Adversarial Attack Toolkit.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

CLEVE: Contrastive Pre-training for Event Extraction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Automatic Construction of Sememe Knowledge Bases via Dictionaries.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Manual Evaluation Matters: Reviewing Test Protocols of Distantly Supervised Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Few-NERD: A Few-shot Named Entity Recognition Dataset.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Adversarial Language Games for Advanced Natural Language Intelligence.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Introduction to Graph Neural Networks
Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, ISBN: 978-3-031-01587-8, 2020

Representation Learning for the Semantic Web.
J. Web Semant., 2020

Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Reinforced Zero-Shot Cross-Lingual Neural Headline Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Integrating Image-Based and Knowledge-Based Representation Learning.
IEEE Trans. Cogn. Dev. Syst., 2020

Crossmodal Language Grounding in an Embodied Neurocognitive Model.
Frontiers Neurorobotics, 2020

Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning.
CoRR, 2020

Meta Adaptive Neural Ranking with Contrastive Synthetic Supervision.
CoRR, 2020

ONION: A Simple and Effective Defense Against Textual Backdoor Attacks.
CoRR, 2020

CMT in TREC-COVID Round 2: Mitigating the Generalization Gaps from Web to Special Domain Search.
CoRR, 2020

Exploring and Evaluating Attributes, Values, and Structures for Entity Alignment.
CoRR, 2020

Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models.
CoRR, 2020

Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations.
CoRR, 2020

OpenAttack: An Open-source Textual Adversarial Attack Toolkit.
CoRR, 2020

Joint Keyphrase Chunking and Salience Ranking with BERT.
CoRR, 2020

KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion.
CoRR, 2020

Coreferential Reasoning Learning for Language Representation.
CoRR, 2020

Lexical Sememe Prediction using Dictionary Definitions by Capturing Local Semantic Correspondence.
CoRR, 2020

Topic-sensitive neural headline generation.
Sci. China Inf. Sci., 2020

Graph neural networks: A review of methods and applications.
AI Open, 2020

Selective Weak Supervision for Neural Information Retrieval.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Few-Shot Generative Conversational Query Rewriting.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Enhancing Transformer with Sememe Knowledge.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Graph Policy Network for Transferable Active Learning on Graphs.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Adaptive Graph Encoder for Attributed Graph Embedding.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

ExpanRL: Hierarchical Reinforcement Learning for Course Concept Expansion in MOOCs.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Neural Gibbs Sampling for Joint Event Argument Extraction.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Learning from Explanations with Neural Execution Tree.
Proceedings of the 8th International Conference on Learning Representations, 2020

Coreferential Reasoning Learning for Language Representation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Denoising Relation Extraction from Document-level Distant Supervision.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

MAVEN: A Massive General Domain Event Detection Dataset.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

WantWords: An Open-source Online Reverse Dictionary System.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Learning from Context or Names? An Empirical Study on Neural Relation Extraction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Dynamic Anticipation and Completion for Multi-Hop Reasoning over Sparse Knowledge Graph.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Generalizing Open Domain Fact Extraction and Verification to COVID-FACT thorough In-Domain Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Exploring and Evaluating Attributes, Values, and Structures for Entity Alignment.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

IsOBS: An Information System for Oracle Bone Script.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Train No Evil: Selective Masking for Task-Guided Pre-Training.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Partially-Aligned Data-to-Text Generation with Distant Supervision.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Meta-Information Guided Meta-Learning for Few-Shot Relation Classification.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

How Does NLP Benefit Legal System: A Summary of Legal Artificial Intelligence.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Grounded Conversation Generation as Guided Traverses in Commonsense Knowledge Graphs.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Word-level Textual Adversarial Attacking as Combinatorial Optimization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

MOOCCube: A Large-scale Data Repository for NLP Applications in MOOCs.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Fine-grained Fact Verification with Kernel Graph Attention Network.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Continual Relation Learning via Episodic Memory Activation and Reconsolidation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Expertise Style Transfer: A New Task Towards Better Communication between Experts and Laymen.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

JEC-QA: A Legal-Domain Question Answering Dataset.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Iteratively Questioning and Answering for Interpretable Legal Judgment Prediction.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Multi-Channel Reverse Dictionary Model.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Neural Snowball for Few-Shot Relation Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Representation Learning for Natural Language Processing
Springer, ISBN: 978-981-15-5572-5, 2020

2019
A Unified Framework for Community Detection and Network Representation Learning.
IEEE Trans. Knowl. Data Eng., 2019

Bootstrapping Knowledge Graphs From Images and Text.
Frontiers Neurorobotics, 2019

Multi-channel Reverse Dictionary Model.
CoRR, 2019

CAIL2019-SCM: A Dataset of Similar Case Matching in Legal Domain.
CoRR, 2019

KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation.
CoRR, 2019

Conversation Generation with Concept Flow.
CoRR, 2019

Multi-Paragraph Reasoning with Knowledge-enhanced Graph Neural Network.
CoRR, 2019

Adversarial Language Games for Advanced Natural Language Intelligence.
CoRR, 2019

Learning to Annotate: Modularizing Data Augmentation for Text Classifiers with Natural Language Explanations.
CoRR, 2019

Open the Boxes of Words: Incorporating Sememes into Textual Adversarial Attack.
CoRR, 2019

Enhancing Recurrent Neural Networks with Sememes.
CoRR, 2019

COS960: A Chinese Word Similarity Dataset of 960 Word Pairs.
CoRR, 2019

Understanding the Behaviors of BERT in Ranking.
CoRR, 2019

OpenHowNet: An Open Sememe-based Lexical Knowledge Base.
CoRR, 2019

Adversarial Training for Weakly Supervised Event Detection.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Fact Discovery from Knowledge Base via Facet Decomposition.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Multi-scale Information Diffusion Prediction with Reinforced Recurrent Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Neural Document Expansion with User Feedback.
Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval, 2019

Poster: WatchYouWatch A Web-Cam Based Natural Customer Attention Tracking Shelf.
Proceedings of the 2019 International Conference on Embedded Wireless Systems and Networks, 2019

Open Relation Extraction: Relational Knowledge Transfer from Supervised Data to Unsupervised Data.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

HMEAE: Hierarchical Modular Event Argument Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

NumNet: Machine Reading Comprehension with Numerical Reasoning.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Adapting Meta Knowledge Graph Information for Multi-Hop Reasoning over Few-Shot Relations.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

FewRel 2.0: Towards More Challenging Few-Shot Relation Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Event Detection with Trigger-Aware Lattice Neural Network.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Low-Resource Name Tagging Learned with Weakly Labeled Data.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Automatic Judgment Prediction via Legal Reading Comprehension.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

Explore Entity Embedding Effectiveness in Entity Retrieval.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

CJRC: A Reliable Human-Annotated Benchmark DataSet for Chinese Judicial Reading Comprehension.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

Legal Cause Prediction with Inner Descriptions and Outer Hierarchies.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

Multi-Interest Network with Dynamic Routing for Recommendation at Tmall.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Graph Neural Networks with Generated Parameters for Relation Extraction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

DIAG-NRE: A Neural Pattern Diagnosis Framework for Distantly Supervised Neural Relation Extraction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

ERNIE: Enhanced Language Representation with Informative Entities.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Course Concept Expansion in MOOCs with External Knowledge and Interactive Game.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

DocRED: A Large-Scale Document-Level Relation Extraction Dataset.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Modeling Semantic Compositionality with Sememe Knowledge.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

XQA: A Cross-lingual Open-domain Question Answering Dataset.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Chinese Relation Extraction with Multi-Grained Information and External Linguistic Knowledge.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Quantifying Similarity between Relations with Fact Distribution.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Multi-Channel Graph Neural Network for Entity Alignment.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

DeepChannel: Salience Estimation by Contrastive Learning for Extractive Document Summarization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning to Embed Sentences Using Attentive Recursive Trees.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Relation Structure-Aware Heterogeneous Information Network Embedding.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Hybrid Attention-Based Prototypical Networks for Noisy Few-Shot Relation Classification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Zero-Shot Cross-Lingual Neural Headline Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Knowledge Representation Learning: A Quantitative Review.
CoRR, 2018

Neural Diffusion Model for Microscopic Cascade Prediction.
CoRR, 2018

Graph Neural Networks: A Review of Methods and Applications.
CoRR, 2018

CED: Credible Early Detection of Social Media Rumors.
CoRR, 2018

Overview of CAIL2018: Legal Judgment Prediction Competition.
CoRR, 2018

CAIL2018: A Large-Scale Legal Dataset for Judgment Prediction.
CoRR, 2018

Denoising Distant Supervision for Relation Extraction via Instance-Level Adversarial Training.
CoRR, 2018

Query Suggestion with Feedback Memory Network.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Convolutional Neural Networks for Soft-Matching N-Grams in Ad-hoc Search.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

Consistency and Variation in Kernel Neural Ranking Model.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

GazeLabel: A Cost-free Data Labeling System with Public Displays using Eye-tracking.
Proceedings of the 16th ACM Conference on Embedded Networked Sensor Systems, SenSys 2018, 2018

Legal Judgment Prediction via Topological Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Put It Back: Entity Typing with Language Model Enhancement.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Cross-lingual Lexical Sememe Prediction.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Differentiating Concepts and Instances for Knowledge Graph Embedding.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

FewRel: A Large-Scale Supervised Few-shot Relation Classification Dataset with State-of-the-Art Evaluation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Hierarchical Relation Extraction with Coarse-to-Fine Grained Attention.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

OpenKE: An Open Toolkit for Knowledge Embedding.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

Language Modeling with Sparse Product of Sememe Experts.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Joint Representation Learning of Cross-lingual Words and Entities via Attentive Distant Supervision.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Adversarial Multi-lingual Neural Relation Extraction.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Few-Shot Charge Prediction with Discriminative Legal Attributes.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Neural Collective Entity Linking.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Incorporating Chinese Characters of Words for Lexical Sememe Prediction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Denoising Distantly Supervised Open-Domain Question Answering.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Chinese LIWC Lexicon Expansion via Hierarchical Classification of Word Embeddings with Sememe Attention.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Improving Neural Fine-Grained Entity Typing With Knowledge Attention.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Does William Shakespeare REALLY Write Hamlet? Knowledge Representation Learning With Confidence.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Neural Knowledge Acquisition via Mutual Attention Between Knowledge Graph and Text.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
A Neural Network Approach to Jointly Modeling Social Networks and Mobile Trajectories.
ACM Trans. Inf. Syst., 2017

PRISM: Profession Identification in Social Media.
ACM Trans. Intell. Syst. Technol., 2017

Improving Word Representations with Document Labels.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Recent Advances on Neural Headline Generation.
J. Comput. Sci. Technol., 2017

Does William Shakespeare REALLY Write Hamlet? Knowledge Representation Learning with Confidence.
CoRR, 2017

End-to-End Neural Ad-hoc Ranking with Kernel Pooling.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

On Modeling Sense Relatedness in Multi-prototype Word Embedding.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Iterative Entity Alignment via Joint Knowledge Embeddings.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Fast Network Embedding Enhancement via High Order Proximity Approximation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Lexical Sememe Prediction via Word Embeddings and Matrix Factorization.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Image-embodied Knowledge Representation Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

TransNet: Translation-Based Network Representation Learning for Social Relation Extraction.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Incorporating Relation Paths in Neural Relation Extraction.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

CANE: Context-Aware Network Embedding for Relation Modeling.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Improved Word Representation Learning with Sememes.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Neural Relation Extraction with Multi-lingual Attention.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
A Neural Network Approach to Joint Modeling Social Networks and Mobile Trajectories.
CoRR, 2016

Neural Emoji Recommendation in Dialogue Systems.
CoRR, 2016

Image-embodied Knowledge Representation Learning.
CoRR, 2016

Knowledge Representation via Joint Learning of Sequential Text and Knowledge Graphs.
CoRR, 2016

Community-enhanced Network Representation Learning for Network Analysis.
CoRR, 2016

Joint Representation Learning of Text and Knowledge for Knowledge Graph Completion.
CoRR, 2016

Neural Headline Generation with Minimum Risk Training.
CoRR, 2016

Crossmodal Language Grounding, Learning, and Teaching.
Proceedings of the Workshop on Cognitive Computation: Integrating neural and symbolic approaches 2016 co-located with the 30th Annual Conference on Neural Information Processing Systems (NIPS 2016), 2016

Representation Learning of Knowledge Graphs with Hierarchical Types.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Max-Margin DeepWalk: Discriminative Learning of Network Representation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Knowledge Representation Learning with Entities, Attributes and Relations.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Neural Sentiment Classification with User and Product Attention.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Relation Classification via Multi-Level Attention CNNs.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Neural Relation Extraction with Selective Attention over Instances.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Representation Learning of Knowledge Graphs with Entity Descriptions.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Learning to Appreciate the Aesthetic Effects of Clothing.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Tag Correspondence Model for User Tag Suggestion.
J. Comput. Sci. Technol., 2015

Estimating translation probabilities for social tag suggestion.
Expert Syst. Appl., 2015

A C-LSTM Neural Network for Text Classification.
CoRR, 2015

Category Enhanced Word Embedding.
CoRR, 2015

Comprehend DeepWalk as Matrix Factorization.
CoRR, 2015

Modeling Relation Paths for Representation Learning of Knowledge Bases.
CoRR, 2015

PRISM: Profession Identification in Social Media with Personal Information and Community Structure.
Proceedings of the Social Media Processing - 4th National Conference, 2015

Representation Learning for Measuring Entity Relatedness with Rich Information.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Network Representation Learning with Rich Text Information.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Joint Learning of Character and Word Embeddings.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Online Learning of Interpretable Word Embeddings.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Modeling Relation Paths for Representation Learning of Knowledge Bases.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Learning Cross-lingual Word Embeddings via Matrix Co-factorization.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Phrase Type Sensitive Tensor Indexing Model for Semantic Composition.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Topical Word Embeddings.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Learning Entity and Relation Embeddings for Knowledge Graph Completion.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Linking GloVe with word2vec.
CoRR, 2014

Errata: Distant Supervision for Relation Extraction with Matrix Completion.
CoRR, 2014

Inferring Correspondences from Multiple Sources for Microblog User Tags.
Proceedings of the Social Media Processing - Third National Conference, 2014

Discriminating gender on Chinese microblog: A study of online behaviour, writing style and preferred vocabulary.
Proceedings of the 10th International Conference on Natural Computation, 2014

A Unified Model for Word Sense Representation and Disambiguation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

ImgWordle: image and text visualization for events in microblogging services.
Proceedings of the International Working Conference on Advanced Visual Interfaces, 2014

Distant Supervision for Relation Extraction with Matrix Completion.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Quantifying Chinese Happiness via Large-Scale Microblogging Data.
Proceedings of the 11th Web Information System and Application Conference, 2014

2013
Measuring and Visualizing Interest Similarity between Microblog Users.
Proceedings of the Web-Age Information Management - 14th International Conference, 2013

THUNLP at TAC KBP 2013 in Entity Linking.
Proceedings of the Sixth Text Analysis Conference, 2013

Creating reflections in public emotion visualization: prototype exploration on traffic theme.
Proceedings of the Creativity and Cognition 2013, 2013

2012
Mining the interests of Chinese microbloggers via keyword extraction.
Frontiers Comput. Sci. China, 2012

City Flow: Prototype Exploration for Visualizing Urban Traffic Conversations.
Proceedings of the 2012 International Conference on Privacy, 2012

Tag Dispatch Model with Social Network Regularization for Microblog User Tag Suggestion.
Proceedings of the COLING 2012, 2012

Topical Word Trigger Model for Keyphrase Extraction.
Proceedings of the COLING 2012, 2012

Expert Finding for Microblog Misinformation Identification.
Proceedings of the COLING 2012, 2012

Random Walks on Context-Aware Relation Graphs for Ranking Social Tags.
Proceedings of the COLING 2012, 2012

2011
PLDA+: Parallel latent dirichlet allocation with data placement and pipeline processing.
ACM Trans. Intell. Syst. Technol., 2011

User Behaviors in Related Word Retrieval and New Word Detection: A Collaborative Perspective.
ACM Trans. Asian Lang. Inf. Process., 2011

THUNLP at TAC KBP 2011 in Entity Linking.
Proceedings of the Fourth Text Analysis Conference, 2011

Extracting Chinese abbreviation-definition pairs from anchor texts.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2011

A Simple Word Trigger Method for Social Tag Suggestion.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Automatic Keyphrase Extraction by Bridging Vocabulary Gap.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 2011

Why Press Backspace? Understanding User Input Behaviors in Chinese Pinyin Input Method.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Modeling Social Annotations via Latent Reason Identification.
IEEE Intell. Syst., 2010

Automatic Keyphrase Extraction via Topic Decomposition.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Explore the Structure of Social Tags by Subsumption Relations.
Proceedings of the COLING 2010, 2010

FolkDiffusion: A Graph-Based Tag Suggestion Method for Folksonomies.
Proceedings of the Information Retrieval Technology, 2010

Domain-Specific Term Rankings Using Topic Models.
Proceedings of the Information Retrieval Technology, 2010

Growing Related Words from Seed via User Behaviors: A Re-Ranking Based Approach.
Proceedings of the ACL 2010, 2010

2009
Content-based and Graph-based Tag Suggestion.
Proceedings of ECML PKDD (The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases) Discovery Challenge 2009, 2009

Quantifying Asymmetric Semantic Relations from Query Logs by Resource Allocation.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2009

Incorporating User Behaviors in New Word Detection.
Proceedings of the IJCAI 2009, 2009

Clustering to Find Exemplar Terms for Keyphrase Extraction.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Efficient Text Classification Using Term Projection.
Proceedings of the Information Retrieval Technology, 2009

2008
Asymmetrical query recommendation method based on bipartite network resource allocation.
Proceedings of the 17th International Conference on World Wide Web, 2008

Text Classification Based on Transfer Learning and Self-Training.
Proceedings of the Fourth International Conference on Natural Computation, 2008


  Loading...