Maosong Sun

ORCID: 0000-0002-6011-6115

Affiliations:
  • Tsinghua University, Department of Computer Science and Technology, Beijing, China
  • City University of Hong Kong (PhD 2004)


According to our database, Maosong Sun authored at least 519 papers between 1997 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Generating chord progression from melody with flexible harmonic rhythm and controllable harmonic density.
EURASIP J. Audio Speech Music. Process., December, 2024

Personality-affected Emotion Generation in Dialog Systems.
ACM Trans. Inf. Syst., September, 2024

Exploring Format Consistency for Instruction Tuning.
Trans. Mach. Learn. Res., 2024

Exploring Universal Intrinsic Task Subspace for Few-Shot Learning via Prompt Tuning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Hyperbolic Pre-Trained Language Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Selecting Influential Samples for Long Context Alignment via Homologous Models' Guidance and Contextual Awareness Measurement.
CoRR, 2024

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards.
CoRR, 2024

CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models.
CoRR, 2024

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance.
CoRR, 2024

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents.
CoRR, 2024

LLM⨉MapReduce: Simplified Long-Sequence Processing using Large Language Models.
CoRR, 2024

Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation.
CoRR, 2024

Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System.
CoRR, 2024

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling.
CoRR, 2024

DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models.
CoRR, 2024

Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models.
CoRR, 2024

From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents.
CoRR, 2024

Configurable Foundation Models: Building LLMs from a Modular Perspective.
CoRR, 2024

Multi-Modal Multi-Granularity Tokenizer for Chu Bamboo Slip Scripts.
CoRR, 2024

MiniCPM-V: A GPT-4V Level MLLM on Your Phone.
CoRR, 2024

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework.
CoRR, 2024

PersLLM: A Personified Training Approach for Large Language Models.
CoRR, 2024

States Hidden in Hidden States: LLMs Emerge Discrete State Representations Implicitly.
CoRR, 2024

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence.
CoRR, 2024

MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing.
CoRR, 2024

Scaling Efficient Masked Autoencoder Learning on Large Remote Sensing Dataset.
CoRR, 2024

Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity.
CoRR, 2024

GUICourse: From General Vision Language Models to Versatile GUI Agents.
CoRR, 2024

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models.
CoRR, 2024

Scaling Large-Language-Model-based Multi-Agent Collaboration.
CoRR, 2024

Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training.
CoRR, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness.
CoRR, 2024

Iterative Experience Refinement of Software-Developing Agents.
CoRR, 2024

LEGENT: Open Platform for Embodied Agents.
CoRR, 2024

Beyond Human Norms: Unveiling Unique Values of Large Language Models through Interdisciplinary Approaches.
CoRR, 2024

UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs.
CoRR, 2024

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies.
CoRR, 2024

Advancing LLM Reasoning Generalists with Preference Trees.
CoRR, 2024

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images.
CoRR, 2024

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences.
CoRR, 2024

Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models.
CoRR, 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment.
CoRR, 2024

RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation.
CoRR, 2024

Cross-domain Chinese Sentence Pattern Parsing.
CoRR, 2024

Unified View of Grokking, Double Descent and Emergent Abilities: A Perspective from Circuits Competition.
CoRR, 2024

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting.
CoRR, 2024

∞Bench: Extending Long Context Evaluation Beyond 100K Tokens.
CoRR, 2024

OMGEval: An Open Multilingual Generative Evaluation Benchmark for Large Language Models.
CoRR, 2024

ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models.
CoRR, 2024

Exploring Perceptual Limitation of Multimodal Large Language Models.
CoRR, 2024

InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory.
CoRR, 2024

ReLU² Wins: Discovering Efficient Activation Functions for Sparse LLMs.
CoRR, 2024

UniMem: Towards a Unified View of Long-Context Large Language Models.
CoRR, 2024

Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution.
CoRR, 2024

DebugBench: Evaluating Debugging Capability of Large Language Models.
CoRR, 2024

CPT: Colorful Prompt Tuning for pre-trained vision-language models.
AI Open, 2024

Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature.
AI Open, 2024

scMulan: A Multitask Generative Pre-Trained Language Model for Single-Cell Analysis.
Proceedings of the Research in Computational Molecular Biology, 2024

On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Exploring the Benefit of Activation Sparsity in Pre-training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Predicting Emergent Abilities with Infinite Resolution Evaluation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-Grained Correctional Human Feedback.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Robust and Scalable Model Editing for Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Empowering Private Tutoring by Chaining Large Language Models.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

∞Bench: Extending Long Context Evaluation Beyond 100K Tokens.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

DebugBench: Evaluating Debugging Capability of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

ChatDev: Communicative Agents for Software Development.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Experiential Co-Learning of Software-Developing Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Model Composition for Multimodal Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
CCOM-HuQin: an Annotated Multimodal Chinese Fiddle Performance Dataset.
Dataset, July, 2023

Full-Scale Information Diffusion Prediction With Reinforced Recurrent Networks.
IEEE Trans. Neural Networks Learn. Syst., May, 2023

Red Alarm for Pre-trained Models: Universal Vulnerability to Neuron-level Backdoor Attacks.
Mach. Intell. Res., April, 2023

Parameter-efficient fine-tuning of large-scale pre-trained language models.
Nat. Mac. Intell., March, 2023

CCOM-HuQin: An Annotated Multimodal Chinese Fiddle Performance Dataset.
Trans. Int. Soc. Music. Inf. Retr., January, 2023

AdaDS: Adaptive data selection for accelerating pre-trained language model knowledge distillation.
AI Open, January, 2023

UPRec: User-aware Pre-training for sequential Recommendation.
AI Open, January, 2023

Removing Backdoors in Pre-trained Models by Regularized Continual Pre-training.
Trans. Assoc. Comput. Linguistics, 2023

Sub-Character Tokenization for Chinese Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2023

GitAgent: Facilitating Autonomous Agent with GitHub by Tool Extension.
CoRR, 2023

Experiential Co-Learning of Software-Developing Agents.
CoRR, 2023

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback.
CoRR, 2023

ProAgent: From Robotic Process Automation to Agentic Process Automation.
CoRR, 2023

Unlock Predictable Scaling from Emergent Abilities.
CoRR, 2023

UltraFeedback: Boosting Language Models with High-quality Feedback.
CoRR, 2023

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants.
CoRR, 2023

ConPET: Continual Parameter-Efficient Tuning for Large Language Models.
CoRR, 2023

QASnowball: An Iterative Bootstrapping Framework for High-Quality Question-Answering Data Generation.
CoRR, 2023

Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models.
CoRR, 2023

Large Language Model as Autonomous Decision Maker.
CoRR, 2023

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents.
CoRR, 2023

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs.
CoRR, 2023

Communicative Agents for Software Development.
CoRR, 2023

CPET: Effective Parameter-Efficient Tuning for Compressed Large Language Models.
CoRR, 2023

Interactive Molecular Discovery with Natural Language.
CoRR, 2023

Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations.
CoRR, 2023

Arbitrary Few Parameters are Good Enough for Adapting Large-scale Pre-trained Language Models.
CoRR, 2023

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations.
CoRR, 2023

Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots.
CoRR, 2023

UNTER: A Unified Knowledge Interface for Enhancing Pre-trained Language Models.
CoRR, 2023

Tool Learning with Foundation Models.
CoRR, 2023

Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Chord-Conditioned Melody Harmonization With Controllable Harmonicity.
Proceedings of the IEEE International Conference on Acoustics, 2023

TunesFormer: Forming Irish Tunes with Control Codes by Bar Patching.
Proceedings of the 2nd Workshop on Human-Centric Music Information Retrieval 2023 co-located with the 24th International Society for Music Information Retrieval Conference (ISMIR 2023), 2023

Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Self-Knowledge Guided Retrieval Augmentation for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Exploring the Impact of Model Scaling on Parameter-Efficient Tuning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Sparse Low-rank Adaptation of Pre-trained Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MUSER: A Multi-View Similar Case Retrieval Dataset.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Emergent Modularity in Pre-trained Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Plug-and-Play Knowledge Injection for Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Continual Knowledge Distillation for Neural Machine Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Plug-and-Play Document Modules for Pre-trained Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Recyclable Tuning for Continual Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

WebCPM: Interactive Web Search for Chinese Long-form Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Parameter-efficient Weight Ensembling Facilitates Task-level Knowledge Transfer.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Won't Get Fooled Again: Answering Questions with False Premises.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

Decoder Tuning: Efficient Language Understanding as Decoding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Weakly Supervised Vision-and-Language Pre-training with Relative Representations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Stochastic Bridges as Effective Regularizers for Parameter-Efficient Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Visually Grounded Commonsense Knowledge Acquisition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
PTR: Prompt Tuning with Rules for Text Classification.
AI Open, January, 2022

A Unified Understanding of Deep NLP Models for Text Classification.
IEEE Trans. Vis. Comput. Graph., 2022

COSINE: Compressive Network Embedding on Large-Scale Information Networks.
IEEE Trans. Knowl. Data Eng., 2022

Dynamic Multi-Branch Layers for On-Device Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Data augmentation for low-resource languages NMT guided by constrained sampling.
Int. J. Intell. Syst., 2022

Prompt Gating: A Parameter Efficient Tuning Method for Zero-Shot Multi-Source Translation.
CoRR, 2022

Continually Learning from Existing Models: Knowledge Accumulation for Neural Machine Translation.
CoRR, 2022

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Delta Tuning.
CoRR, 2022

Sparse Structure Search for Parameter-Efficient Tuning.
CoRR, 2022

Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models.
CoRR, 2022

Influence of musical elements on the perception of 'Chinese style' in music.
Cogn. Comput. Syst., 2022

Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sparse Structure Search for Delta Tuning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Transferability of Prompt Tuning for Natural Language Processing.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Knowledge Inheritance for Pre-trained Language Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Symphony Generation with Permutation Invariant Language Model.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

BMCook: A Task-agnostic Compression Toolkit for Big Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Parameter-Efficient Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

A Template-based Method for Constrained Neural Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Exploring Mode Connectivity for Pre-trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

FPT: Improving Prompt Tuning Efficiency via Progressive Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Recurrence Boosts Diversity! Revisiting Recurrent Latent Variable in Transformer-Based Variational AutoEncoder for Diverse Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Textual Backdoor Attacks Can Be More Harmful via Two Simple Tricks.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

End-to-End Unsupervised Vision-and-Language Pre-training with Referring Expression Matching.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Fine-Grained Scene Graph Generation with Data Transfer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Pass off Fish Eyes for Pearls: Attacking Model Selection of Pre-trained Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MoEfication: Transformer Feed-forward Layers are Mixtures of Experts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Going "Deeper": Structured Sememe Prediction via Transformer with Tree Attention.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Packed Levitated Marker for Entity and Relation Extraction.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

LEVEN: A Large-Scale Chinese Legal Event Detection Dataset.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Prompt Tuning for Discriminative Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

ELLE: Efficient Lifelong Pre-training for Emerging Data.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

QuoteR: A Benchmark of Quote Recommendation for Writing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Sememe Prediction for BabelNet Synsets using Multilingual and Multimodal Information.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

BMInf: An Efficient Toolkit for Big Model Inference and Tuning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

Cross-Lingual Contrastive Learning for Fine-Grained Entity Typing for Low-Resource Languages.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

OpenPrompt: An Open-source Framework for Prompt-learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

Fully Hyperbolic Neural Networks.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Network Embedding: Theories, Methods, and Applications
Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, ISBN: 978-3-031-01590-8, 2021

Neural Diffusion Model for Microscopic Cascade Study.
IEEE Trans. Knowl. Data Eng., 2021

CED: Credible Early Detection of Social Media Rumors.
IEEE Trans. Knowl. Data Eng., 2021

Country Image in COVID-19 Pandemic: A Case Study of China.
IEEE Trans. Big Data, 2021

Neural Machine Translation With Explicit Phrase Alignment.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

CSS-LM: A Contrastive Framework for Semi-Supervised Fine-Tuning of Pre-Trained Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Learning to Generate Explainable Plots for Neural Story Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Improving Data Augmentation for Low-Resource NMT Guided by POS-Tagging and Paraphrase Embedding.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021

From Symbols to Embeddings: A Tale of Two Representations in Computational Social Science.
J. Soc. Comput., 2021

Knowledge Transfer via Pre-training for Recommendation: A Review and Prospect.
Frontiers Big Data, 2021

Sememe knowledge computation: a review of recent advances in application and expansion of sememe knowledge bases.
Frontiers Comput. Sci., 2021

YACLC: A Chinese Learner Corpus with Multidimensional Annotation.
CoRR, 2021

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark.
CoRR, 2021

Melody Harmonization with Controllable Harmonic Rhythm.
CoRR, 2021

On Transferability of Prompt Tuning for Natural Language Understanding.
CoRR, 2021

Textual Backdoor Attacks Can Be More Harmful via Two Simple Tricks.
CoRR, 2021

Exploring Low-dimensional Intrinsic Task Subspace via Prompt Tuning.
CoRR, 2021

MoEfication: Conditional Computation of Transformer Models for Efficient Inference.
CoRR, 2021

Pack Together: Entity and Relation Extraction with Levitated Marker.
CoRR, 2021

Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification.
CoRR, 2021

Language Models are Good Translators.
CoRR, 2021

CCPM: A Chinese Classical Poetry Matching Dataset.
CoRR, 2021

SHUOWEN-JIEZI: Linguistically Informed Tokenizers For Chinese Language Model Pretraining.
CoRR, 2021

Dynamic Multi-Branch Layers for On-Device Neural Machine Translation.
CoRR, 2021

Equality before the Law: Legal Judgment Consistency Analysis for Fairness.
CoRR, 2021

UPRec: User-Aware Pre-training for Recommender Systems.
CoRR, 2021

CSS-LM: A Contrastive Framework for Semi-supervised Fine-tuning of Pre-trained Language Models.
CoRR, 2021

Representation Learning for Natural Language Processing.
CoRR, 2021

Red Alarm for Pre-trained Models: Universal Vulnerabilities by Neuron-Level Backdoor Attacks.
CoRR, 2021

Know what you don't need: Single-Shot Meta-Pruning for attention heads.
AI Open, 2021

CPM: A large-scale generative Chinese Pre-trained language model.
AI Open, 2021

CPM-2: Large-scale cost-effective pre-trained language models.
AI Open, 2021

Lawformer: A pre-trained language model for Chinese legal long documents.
AI Open, 2021

CokeBERT: Contextual knowledge selection and embedding towards enhanced pre-trained language models.
AI Open, 2021

Towards a universal continuous knowledge base.
AI Open, 2021

OpenMatch: An Open Source Library for Neu-IR Research.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Open Hierarchical Relation Extraction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Visual Distant Supervision for Scene Graph Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Self-Supervised Quality Estimation for Machine Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CodRED: A Cross-Document Relation Extraction Dataset for Acquiring Knowledge in the Wild.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ONION: A Simple and Effective Defense Against Textual Backdoor Attacks.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Knowledge Representation Learning with Contrastive Completion Coding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Segment, Mask, and Predict: Augmenting Chinese Word Segmentation with Self-Supervision.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

MOOCCubeX: A Large Knowledge-centered Repository for Adaptive Learning in MOOCs.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

OpenAttack: An Open-source Textual Adversarial Attack Toolkit.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

On the Language Coverage Bias for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Automatic Construction of Sememe Knowledge Bases via Dictionaries.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Alternated Training with Synthetic and Authentic Data for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Transfer Learning for Sequence Generation: from Single-source to Multi-source.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Manual Evaluation Matters: Reviewing Test Protocols of Distantly Supervised Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Mask-Align: Self-Supervised Neural Word Alignment.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Adversarial Language Games for Advanced Natural Language Intelligence.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Aspect-Level Sentiment-Controllable Review Generation with Mutual Learning Framework.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Reinforced Zero-Shot Cross-Lingual Neural Headline Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Integrating Image-Based and Knowledge-Based Representation Learning.
IEEE Trans. Cogn. Dev. Syst., 2020

Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning.
CoRR, 2020

ONION: A Simple and Effective Defense Against Textual Backdoor Attacks.
CoRR, 2020

Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models.
CoRR, 2020

Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations.
CoRR, 2020

OpenAttack: An Open-source Textual Adversarial Attack Toolkit.
CoRR, 2020

KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion.
CoRR, 2020

Coreferential Reasoning Learning for Language Representation.
CoRR, 2020

Lexical Sememe Prediction using Dictionary Definitions by Capturing Local Semantic Correspondence.
CoRR, 2020

Topic-sensitive neural headline generation.
Sci. China Inf. Sci., 2020

Graph neural networks: A review of methods and applications.
AI Open, 2020

Neural machine translation: A review of methods, resources, and tools.
AI Open, 2020

Towards Interpretable Natural Language Understanding with Explanations as Latent Variables.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Generating Major Types of Chinese Classical Poetry in a Uniformed Framework.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Text Style Transfer via Learning Style Instance Supported Latent Space.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Modeling Voting for System Combination in Machine Translation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Coreferential Reasoning Learning for Language Representation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Denoising Relation Extraction from Document-level Distant Supervision.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

WantWords: An Open-source Online Reverse Dictionary System.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Learning from Context or Names? An Empirical Study on Neural Relation Extraction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Generalizing Open Domain Fact Extraction and Verification to COVID-FACT through In-Domain Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

IsOBS: An Information System for Oracle Bone Script.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Train No Evil: Selective Masking for Task-Guided Pre-Training.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Meta-Information Guided Meta-Learning for Few-Shot Relation Classification.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Tsinghua University Neural Machine Translation Systems for CCMT 2020.
Proceedings of the Machine Translation - 16th China Conference, 2020

THUMT: An Open-Source Toolkit for Neural Machine Translation.
Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, 2020

How Does NLP Benefit Legal System: A Summary of Legal Artificial Intelligence.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Word-level Textual Adversarial Attacking as Combinatorial Optimization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Fine-grained Fact Verification with Kernel Graph Attention Network.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Continual Relation Learning via Episodic Memory Activation and Reconsolidation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

JEC-QA: A Legal-Domain Question Answering Dataset.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Iteratively Questioning and Answering for Interpretable Legal Judgment Prediction.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Multi-Channel Reverse Dictionary Model.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

MixPoet: Diverse Poetry Generation via Learning Controllable Mixed Latent Space.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Neural Snowball for Few-Shot Relation Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Representation Learning for Natural Language Processing
Springer, ISBN: 978-981-15-5572-5, 2020

2019
A Unified Framework for Community Detection and Network Representation Learning.
IEEE Trans. Knowl. Data Eng., 2019

Multi-Round Transfer Learning for Low-Resource NMT Using Multiple High-Resource Languages.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2019

Bootstrapping Knowledge Graphs From Images and Text.
Frontiers Neurorobotics, 2019

Multi-channel Reverse Dictionary Model.
CoRR, 2019

Learning to Predict Explainable Plots for Neural Story Generation.
CoRR, 2019

CAIL2019-SCM: A Dataset of Similar Case Matching in Legal Domain.
CoRR, 2019

Multi-Paragraph Reasoning with Knowledge-enhanced Graph Neural Network.
CoRR, 2019

Adversarial Language Games for Advanced Natural Language Intelligence.
CoRR, 2019

Open the Boxes of Words: Incorporating Sememes into Textual Adversarial Attack.
CoRR, 2019

Kernel Graph Attention Network for Fact Verification.
CoRR, 2019

Enhancing Recurrent Neural Networks with Sememes.
CoRR, 2019

Using BERT for Word Sense Disambiguation.
CoRR, 2019

COS960: A Chinese Word Similarity Dataset of 960 Word Pairs.
CoRR, 2019

OpenHowNet: An Open Sememe-based Lexical Knowledge Base.
CoRR, 2019

Adversarial Training for Weakly Supervised Event Detection.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Multi-scale Information Diffusion Prediction with Reinforced Recurrent Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Enhancing Stock Movement Prediction with Adversarial Training.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Sentiment-Controllable Chinese Poetry Generation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Open Relation Extraction: Relational Knowledge Transfer from Supervised Data to Unsupervised Data.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

HMEAE: Hierarchical Modular Event Argument Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Improving Back-Translation with Uncertainty-based Confidence Estimation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Learning to Copy for Automatic Post-Editing.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

FewRel 2.0: Towards More Challenging Few-Shot Relation Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Automatic Judgment Prediction via Legal Reading Comprehension.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

Explore Entity Embedding Effectiveness in Entity Retrieval.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

Construction of an English-Uyghur WordNet Dataset.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

Legal Cause Prediction with Inner Descriptions and Outer Hierarchies.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

Graph Neural Networks with Generated Parameters for Relation Extraction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

ERNIE: Enhanced Language Representation with Informative Entities.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

DocRED: A Large-Scale Document-Level Relation Extraction Dataset.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Reducing Word Omission Errors in Neural Machine Translation: A Contrastive Learning Approach.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Modeling Semantic Compositionality with Sememe Knowledge.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

XQA: A Cross-lingual Open-domain Question Answering Dataset.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Jiuge: A Human-Machine Collaborative Chinese Classical Poetry Generation System.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Quantifying Similarity between Relations with Fact Distribution.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Hybrid Attention-Based Prototypical Networks for Noisy Few-Shot Relation Classification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Joint POS Tagging and Dependence Parsing With Transition-Based Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Zero-Shot Cross-Lingual Neural Headline Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Knowledge Representation Learning: A Quantitative Review.
CoRR, 2018

Neural Diffusion Model for Microscopic Cascade Prediction.
CoRR, 2018

Graph Neural Networks: A Review of Methods and Applications.
CoRR, 2018

CED: Credible Early Detection of Social Media Rumors.
CoRR, 2018

Improving Stock Movement Prediction with Adversarial Training.
CoRR, 2018

Overview of CAIL2018: Legal Judgment Prediction Competition.
CoRR, 2018

CAIL2018: A Large-Scale Legal Dataset for Judgment Prediction.
CoRR, 2018

Denoising Distant Supervision for Relation Extraction via Instance-Level Adversarial Training.
CoRR, 2018

Query Suggestion with Feedback Memory Network.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Bandit Learning with Implicit Feedback.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Chinese Poetry Generation with a Working Memory Model.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Legal Judgment Prediction via Topological Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Improving the Transformer Translation Model with Document-Level Context.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Automatic Poetry Generation with Mutual Reinforcement Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Stylistic Chinese Poetry Generation via Unsupervised Style Disentanglement.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Put It Back: Entity Typing with Language Model Enhancement.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Cross-lingual Lexical Sememe Prediction.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

A Multi-answer Multi-task Framework for Real-world Machine Reading Comprehension.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

FewRel: A Large-Scale Supervised Few-shot Relation Classification Dataset with State-of-the-Art Evaluation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Hierarchical Relation Extraction with Coarse-to-Fine Grained Attention.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

OpenKE: An Open Toolkit for Knowledge Embedding.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

Language Modeling with Sparse Product of Sememe Experts.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Chinese Poetry Generation with a Salient-Clue Mechanism.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Adversarial Multi-lingual Neural Relation Extraction.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Few-Shot Charge Prediction with Discriminative Legal Attributes.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Incorporating Chinese Characters of Words for Lexical Sememe Prediction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Denoising Distantly Supervised Open-Domain Question Answering.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Chinese LIWC Lexicon Expansion via Hierarchical Classification of Word Embeddings with Sememe Attention.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Improving Neural Fine-Grained Entity Typing With Knowledge Attention.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Neural Knowledge Acquisition via Mutual Attention Between Knowledge Graph and Text.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
A Neural Network Approach to Jointly Modeling Social Networks and Mobile Trajectories.
ACM Trans. Inf. Syst., 2017

PRISM: Profession Identification in Social Media.
ACM Trans. Intell. Syst. Technol., 2017

Improving Word Representations with Document Labels.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Neural Parse Combination.
J. Comput. Sci. Technol., 2017

Optimizing Non-Decomposable Evaluation Metrics for Neural Machine Translation.
J. Comput. Sci. Technol., 2017

Recent Advances on Neural Headline Generation.
J. Comput. Sci. Technol., 2017

THUMT: An Open Source Toolkit for Neural Machine Translation.
CoRR, 2017

Joint POS Tagging and Dependency Parsing with Transition-based Neural Networks.
CoRR, 2017

Does William Shakespeare REALLY Write Hamlet? Knowledge Representation Learning with Confidence.
CoRR, 2017

Smart Jump: Automated Navigation Suggestion for Videos in MOOCs.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Iterative Entity Alignment via Joint Knowledge Embeddings.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Fast Network Embedding Enhancement via High Order Proximity Approximation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Lexical Sememe Prediction via Word Embeddings and Matrix Factorization.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Image-embodied Knowledge Representation Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

TransNet: Translation-Based Network Representation Learning for Social Relation Extraction.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Joint Training for Pivot-based Neural Machine Translation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Earth Mover's Distance Minimization for Unsupervised Bilingual Lexicon Induction.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Incorporating Relation Paths in Neural Relation Extraction.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Generating Chinese Classical Poems with RNN Encoder-Decoder.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017

Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Adversarial Training for Unsupervised Bilingual Lexicon Induction.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

CANE: Context-Aware Network Embedding for Relation Modeling.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Improved Word Representation Learning with Sememes.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Neural Relation Extraction with Multi-lingual Attention.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Visualizing and Understanding Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Bilingual Lexicon Induction from Non-Parallel Data with Minimal Supervision.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Listwise Ranking Functions for Statistical Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

A Neural Network Approach to Joint Modeling Social Networks and Mobile Trajectories.
CoRR, 2016

Neural Emoji Recommendation in Dialogue Systems.
CoRR, 2016

Image-embodied Knowledge Representation Learning.
CoRR, 2016

Knowledge Representation via Joint Learning of Sequential Text and Knowledge Graphs.
CoRR, 2016

Community-enhanced Network Representation Learning for Network Analysis.
CoRR, 2016

Joint Representation Learning of Text and Knowledge for Knowledge Graph Completion.
CoRR, 2016

Neural Machine Translation with Pivot Languages.
CoRR, 2016

Neural Headline Generation with Minimum Risk Training.
CoRR, 2016

Representation Learning of Knowledge Graphs with Hierarchical Types.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Max-Margin DeepWalk: Discriminative Learning of Network Representation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Knowledge Representation Learning with Entities, Attributes and Relations.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Agreement-Based Joint Training for Bidirectional Attention-Based Neural Machine Translation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Neural Sentiment Classification with User and Product Attention.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Inducing Bilingual Lexica From Non-Parallel Data With Earth Mover's Distance Regularization.
Proceedings of the COLING 2016, 2016

Minimum Risk Training for Neural Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Agreement-based Learning of Parallel Lexicons and Phrases from Non-Parallel Corpora.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Neural Relation Extraction with Selective Attention over Instances.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Semi-Supervised Learning for Neural Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Building Earth Mover's Distance on Bilingual Word Embeddings for Machine Translation.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Representation Learning of Knowledge Graphs with Entity Descriptions.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Tag Correspondence Model for User Tag Suggestion.
J. Comput. Sci. Technol., 2015

Estimating translation probabilities for social tag suggestion.
Expert Syst. Appl., 2015

Modeling Relation Paths for Representation Learning of Knowledge Bases.
CoRR, 2015

PRISM: Profession Identification in Social Media with Personal Information and Community Structure.
Proceedings of the Social Media Processing - 4th National Conference, 2015

Representation Learning for Measuring Entity Relatedness with Rich Information.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Network Representation Learning with Rich Text Information.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Iterative Learning of Parallel Lexicons and Phrases from Non-Parallel Corpora.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Joint Learning of Character and Word Embeddings.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Consistency-Aware Search for Word Alignment.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Online Learning of Interpretable Word Embeddings.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Generalized Agreement for Bidirectional Word Alignment.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Modeling Relation Paths for Representation Learning of Knowledge Bases.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Improved Learning of Chinese Word Embeddings with Semantic Knowledge.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015

Learning Distributed Representations of Uyghur Words and Morphemes.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015

Learning Cross-lingual Word Embeddings via Matrix Co-factorization.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Phrase Type Sensitive Tensor Indexing Model for Semantic Composition.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Contrastive Unsupervised Word Alignment with Non-Local Features.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Topical Word Embeddings.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Learning Entity and Relation Embeddings for Knowledge Graph Completion.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
The design of a live social observatory system.
Proceedings of the 23rd International World Wide Web Conference, 2014

Inferring Correspondences from Multiple Sources for Microblog User Tags.
Proceedings of the Social Media Processing - Third National Conference, 2014

Discriminating gender on Chinese microblog: A study of online behaviour, writing style and preferred vocabulary.
Proceedings of the 10th International Conference on Natural Computation, 2014

A Unified Model for Word Sense Representation and Disambiguation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

A Neural Reordering Model for Phrase-based Translation.
Proceedings of the COLING 2014, 2014

Query Lattice for Translation Retrieval.
Proceedings of the COLING 2014, 2014

ASR-Based Input Method for Postal Address Recognition in Chinese Mandarin.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2014

ImgWordle: image and text visualization for events in microblogging services.
Proceedings of the International Working Conference on Advanced Visual Interfaces, 2014

Quantifying Chinese Happiness via Large-Scale Microblogging Data.
Proceedings of the 11th Web Information System and Application Conference, 2014

2013
Measuring and Visualizing Interest Similarity between Microblog Users.
Proceedings of the Web-Age Information Management - 14th International Conference, 2013

THUNLP at TAC KBP 2013 in Entity Linking.
Proceedings of the Sixth Text Analysis Conference, 2013

HMeanMax: Placing HMAX and HoG into a unified framework.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Recursive Autoencoders for ITG-Based Translation.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Exploiting Lexicalized Statistical Patterns in Chinese Linguistic Analysis.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2013

Minimum Error Rate Training for Bilingual News Alignment.
Proceedings of the Chinese Lexical Semantics - 14th Workshop, 2013

An Extended GHKM Algorithm for Inducing Lambda-SCFG.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
NExT: NUS-Tsinghua Center for Extreme Search of User-Generated Content.
IEEE Multim., 2012

Mining the interests of Chinese microbloggers via keyword extraction.
Frontiers Comput. Sci. China, 2012

Particle Mixed Membership Stochastic Block Model.
Proceedings of the Eighth International Conference on Semantics, Knowledge and Grids, 2012

Tag Dispatch Model with Social Network Regularization for Microblog User Tag Suggestion.
Proceedings of the COLING 2012, 2012

Topical Word Trigger Model for Keyphrase Extraction.
Proceedings of the COLING 2012, 2012

THUTR: A Translation Retrieval System.
Proceedings of the COLING 2012, 2012

Expert Finding for Microblog Misinformation Identification.
Proceedings of the COLING 2012, 2012

A Beam Search Algorithm for ITG Word Alignment.
Proceedings of the COLING 2012, 2012

Random Walks on Context-Aware Relation Graphs for Ranking Social Tags.
Proceedings of the COLING 2012, 2012

2011
PLDA+: Parallel latent dirichlet allocation with data placement and pipeline processing.
ACM Trans. Intell. Syst. Technol., 2011

User Behaviors in Related Word Retrieval and New Word Detection: A Collaborative Perspective.
ACM Trans. Asian Lang. Inf. Process., 2011

Report on the first summer school on NLP and IR in Beijing.
SIGIR Forum, 2011

Exploring the Granularity Level of Social Tags.
Int. J. Comput. Process. Orient. Lang., 2011

THUNLP at TAC KBP 2011 in Entity Linking.
Proceedings of the Fourth Text Analysis Conference, 2011

A comparison study of candidate generation for Chinese word segmentation.
Proceedings of the 7th International Conference on Natural Language Processing and Knowledge Engineering, 2011

Extract Chinese Unknown Words from a Large-scale Corpus Using Morphological and Distributional Evidences.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

CHIME: An Efficient Error-Tolerant Chinese Pinyin Input Method.
Proceedings of the IJCAI 2011, 2011

Extracting Chinese abbreviation-definition pairs from anchor texts.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2011

A Simple Word Trigger Method for Social Tag Suggestion.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Automatic Keyphrase Extraction by Bridging Vocabulary Gap.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 2011

Is Simhash Achilles?
Proceedings of the Information Retrieval Technology, 2011

Why Press Backspace? Understanding User Input Behaviors in Chinese Pinyin Input Method.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Confucius and Its Intelligent Disciples: Integrating Social with Search.
Proc. VLDB Endow., 2010

Chinese Word Frequency Approximation Based on Multitype Corpora.
J. Quant. Linguistics, 2010

Modeling Social Annotations via Latent Reason Identification.
IEEE Intell. Syst., 2010

Tag Allocation Model: Model Noisy Social Annotations by Reason Finding.
Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence, 2010

Automatic Keyphrase Extraction via Topic Decomposition.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Explore the Structure of Social Tags by Subsumption Relations.
Proceedings of the COLING 2010, 2010

Fast-Champollion: A Fast and Robust Sentence Alignment Algorithm.
Proceedings of the COLING 2010, 2010

A Local Generative Model for Chinese Word Segmentation.
Proceedings of the Information Retrieval Technology, 2010

FolkDiffusion: A Graph-Based Tag Suggestion Method for Folksonomies.
Proceedings of the Information Retrieval Technology, 2010

Domain-Specific Term Rankings Using Topic Models.
Proceedings of the Information Retrieval Technology, 2010

Chinese New Word Detection from Query Logs.
Proceedings of the Advanced Data Mining and Applications - 6th International Conference, 2010

2009
Punctuation as Implicit Annotations for Chinese Word Segmentation.
Comput. Linguistics, 2009

Disambiguating Tags in Blogs.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

Content-based and Graph-based Tag Suggestion.
Proceedings of ECML PKDD (The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases) Discovery Challenge 2009, 2009

Quantifying Asymmetric Semantic Relations from Query Logs by Resource Allocation.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2009

Incorporating User Behaviors in New Word Detection.
Proceedings of the IJCAI 2009, 2009

Clustering to Find Exemplar Terms for Keyphrase Extraction.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Efficient Text Classification Using Term Projection.
Proceedings of the Information Retrieval Technology, 2009

Community Detection on Weighted Networks: A Variational Bayesian Method.
Proceedings of the Advances in Machine Learning, 2009

2008
Asymmetrical query recommendation method based on bipartite network resource allocation.
Proceedings of the 17th International Conference on World Wide Web, 2008

Statistical Properties of Overlapping Ambiguities in Chinese Word Segmentation and a Strategy for Their Disambiguation.
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

Disyllabic Chinese Word Extraction Based on Character Thesaurus and Semantic Constraints in Word-Formation.
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

Text Classification Based on Transfer Learning and Self-Training.
Proceedings of the Fourth International Conference on Natural Computation, 2008

Full-reference quality diagnosis for video summary.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Multi-modal Multi-label Semantic Indexing of Images Using Unlabeled Data.
Proceedings of the ALPIT 2008, 2008

2007
Multi-modal Multi-label Semantic Indexing of Images Based on Hybrid Ensemble Learning.
Proceedings of the Advances in Multimedia Information Processing, 2007

Exploiting Category Information and Document Information to Improve Term Weighting for Text Categorization.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2007

2006
An Integrated Approach to Chinese Word Segmentation and Part-of-Speech Tagging.
Proceedings of the Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, 2006

Word Frequency Approximation for Chinese Using Raw, MM-Segmented and Manually Segmented Corpora.
Proceedings of the Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, 2006

Semi-supervised Learning for Image Annotation Based on Conditional Random Fields.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

Word Frequency Approximation for Chinese Without Using Manually-Annotated Corpus.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

Automatic Image Annotation Based on WordNet and Hierarchical Ensembles.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

Incorporating Prior Knowledge into Multi-label Boosting for Cross-Modal Image Annotation and Retrieval.
Proceedings of the Information Retrieval Technology, 2006

A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization.
Proceedings of the ACL 2006, 2006

2005
Automatic Image Annotation Using Maximum Entropy Model.
Proceedings of the Natural Language Processing, 2005

Classifying Chinese Texts in Two Steps.
Proceedings of the Natural Language Processing, 2005

A Method of Recognizing Entity and Relation.
Proceedings of the Natural Language Processing, 2005

Transductive Support Vector Machines Using Simulated Annealing.
Proceedings of the Computational Intelligence and Security, International Conference, 2005

Word Extraction Based on Semantic Constraints in Chinese Word-Formation.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2005

Merging Case Relations into VSM to Improve Information Retrieval Precision.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2005

2003
Two-Character Chinese Word Extraction Based on Hybrid of Internal and Contextual Measures.
Proceedings of the Second Workshop on Chinese Language Processing, 2003

1998
Chinese Word Segmentation without Using Lexicon and Hand-crafted Training Data.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1997
CSeg&Tag1.0: A Practical Word Segmenter and POS Tagger for Chinese Texts.
Proceedings of the 5th Applied Natural Language Processing Conference, 1997

