2025
A Survey of Reasoning with Foundation Models: Concepts, Methodologies, and Outlook.
ACM Comput. Surv., November, 2025
XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs.
CoRR, June, 2025
Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning.
CoRR, June, 2025
World-aware Planning Narratives Enhance Large Vision-Language Model Planner.
CoRR, June, 2025
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems.
CoRR, June, 2025
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs.
CoRR, June, 2025
Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache.
CoRR, June, 2025
Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training.
CoRR, June, 2025
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning.
CoRR, May, 2025
REARANK: Reasoning Re-ranking Agent via Reinforcement Learning.
CoRR, May, 2025
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning.
CoRR, May, 2025
Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning.
CoRR, May, 2025
Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches.
CoRR, May, 2025
Task-Core Memory Management and Consolidation for Long-term Continual Learning.
CoRR, May, 2025
Reinforced Interactive Continual Learning via Real-time Noisy Human Feedback.
CoRR, May, 2025
Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition.
CoRR, April, 2025
Revisiting LLM Evaluation through Mechanism Interpretability: a New Metric and Model Utility Law.
CoRR, April, 2025
FamilyTool: A Multi-hop Personalized Tool Use Benchmark.
CoRR, April, 2025
YuE: Scaling Open Foundation Models for Long-Form Music Generation.
CoRR, March, 2025
Prior-Fitted Networks Scale to Larger Datasets When Treated as Weak Learners.
CoRR, March, 2025
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting.
CoRR, March, 2025
CritiQ: Mining Data Quality Criteria from Human Preferences.
CoRR, February, 2025
Thus Spake Long-Context Large Language Model.
CoRR, February, 2025
AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models.
CoRR, February, 2025
Human2Robot: Learning Robot Actions from Paired Human-Robot Videos.
CoRR, February, 2025
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs.
CoRR, February, 2025
UnitCoder: Scalable Iterative Code Synthesis with Unit Test Guidance.
CoRR, February, 2025
Safety at Scale: A Comprehensive Survey of Large Model Safety.
CoRR, February, 2025
VideoRoPE: What Makes for Good Video Rotary Position Embedding?
CoRR, February, 2025
Efficient Link Prediction via GNN Layers Induced by Negative Sampling.
IEEE Trans. Knowl. Data Eng., January, 2025
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework.
CoRR, January, 2025
Human-like conceptual representations emerge from language prediction.
CoRR, January, 2025
LLMs and Their Applications in Medical Artificial Intelligence.
ACM Trans. Manag. Inf. Syst., 2025
MOSS-MED: A Family of Multimodal Models Serving Medical Image Analysis.
ACM Trans. Manag. Inf. Syst., 2025
The rise and potential of large language model based agents: a survey.
Sci. China Inf. Sci., 2025
MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
Safe Inputs but Unsafe Output: Benchmarking Cross-modality Safety Alignment of Large Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
FiNE: Filtering and Improving Noisy Data Elaborately with Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
CAMIEval: Enhancing NLG Evaluation through Multidimensional Comparative Instruction-Following Analysis.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
ReAttention: Training-Free Infinite Context with Finite Attention Scope.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Case2Code: Scalable Synthetic Data for Code Generation.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Perceive the Passage of Time: A Systematic Evaluation of Large Language Model in Temporal Relativity.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Are LLMs Rational Investors? A Study on the Financial Bias in LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Dynamic and Generalizable Process Reward Modeling.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
AgentGym: Evaluating and Training Large Language Model-based Agents across Diverse Environments.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
How to Mitigate Overfitting in Weak-to-strong Generalization?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
FastMCTS: A Simple Sampling Strategy for Data Synthesis.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
CritiQ: Mining Data Quality Criteria from Human Preferences.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
Correction to: MOSS: An Open Conversational Large Language Model.
Mach. Intell. Res., December, 2024
MOSS: An Open Conversational Large Language Model.
Mach. Intell. Res., October, 2024
$\mathcal{Y}$-Tuning: an efficient tuning paradigm for large-scale pre-trained models via label representation learning.
Frontiers Comput. Sci., August, 2024
ChatGPT: potential, prospects, and limitations.
Frontiers Inf. Technol. Electron. Eng., January, 2024
Deep learning prediction of glycopeptide tandem mass spectra powers glycoproteomics.
Nat. Mac. Intell., 2024
VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks.
CoRR, 2024
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective.
CoRR, 2024
Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning.
CoRR, 2024
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection.
CoRR, 2024
Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision.
CoRR, 2024
LongSafetyBench: Long-Context LLMs Struggle with Safety Issues.
CoRR, 2024
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments.
CoRR, 2024
Multi-Programming Language Sandbox for LLMs.
CoRR, 2024
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders.
CoRR, 2024
Understanding the Role of LLMs in Multimodal Evaluation Benchmarks.
CoRR, 2024
IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities.
CoRR, 2024
DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels.
CoRR, 2024
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey.
CoRR, 2024
Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope.
CoRR, 2024
Case2Code: Learning Inductive Reasoning with Synthetic Data.
CoRR, 2024
What's Wrong with Your Code Generated by Large Language Models? An Extensive Study.
CoRR, 2024
Cross-Modality Safety Alignment.
CoRR, 2024
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments.
CoRR, 2024
Automatically Identifying Local and Global Circuits with Linear Computation Graphs.
CoRR, 2024
A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond.
CoRR, 2024
In-Memory Learning: A Declarative Learning Framework for Large Language Models.
CoRR, 2024
Data-free Weight Compress and Denoise for Large Language Models.
CoRR, 2024
Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge.
CoRR, 2024
Identifying Semantic Induction Heads to Understand In-Context Learning.
CoRR, 2024
Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT.
CoRR, 2024
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.
CoRR, 2024
MouSi: Poly-Visual-Expert Vision-Language Models.
CoRR, 2024
F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods.
CoRR, 2024
Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora.
CoRR, 2024
SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation.
CoRR, 2024
InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance.
CoRR, 2024
Secrets of RLHF in Large Language Models Part II: Reward Modeling.
CoRR, 2024
Agent Alignment in Evolving Social Norms.
CoRR, 2024
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems.
CoRR, 2024
CPT: a pre-trained unbalanced transformer for both Chinese language understanding and generation.
Sci. China Inf. Sci., 2024
SpeechAlign: Aligning Speech Generation to Human Preferences.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Can Language Models Learn to Skip Steps?
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
LLatrieval: LLM-Verified Retrieval for Verifiable Generation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Flames: Benchmarking Value Alignment of LLMs in Chinese.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Can AI Assistants Know What They Don't Know?
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Training-Free Long-Context Scaling of Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Scaling Laws of RoPE-based Extrapolation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
R³-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQL.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Calibrating the Confidence of Large Language Models by Eliciting Fidelity.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Turn Waste into Worth: Rectifying Top-k Router of MoE.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Explicit Memory Learning with Expectation Maximization.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Scaling Laws for Fact Memorization of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
LongWanjuan: Towards Systematic Measurement for Long Text Quality.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Unified Active Retrieval for Retrieval Augmented Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Pixel-Level Semantic Correspondence Through Layout-Aware Representation Learning and Multi-Scale Matching Integration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
The Open-World Lottery Ticket Hypothesis for OOD Intent Classification.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Benchmarking Hallucination in Large Language Models Based on Unanswerable Math Word Problem.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Reasoning in Flux: Enhancing Large Language Models Reasoning through Uncertainty-aware Adaptive Guidance.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Enhancing EEG-to-Text Decoding through Transferable Representations from Pre-trained Contrastive EEG-Text Masked Autoencoder.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
LLM can Achieve Self-Regulation via Hyperparameter Aware Generation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Balanced Data Sampling for Language Model Training with Clustering.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Identifying Semantic Induction Heads to Understand In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Full Parameter Fine-tuning for Large Language Models with Limited Resources.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
AdaLomo: Low-memory Optimization with Adaptive Learning Rate.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
L-Eval: Instituting Standardized Evaluation for Long Context Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
An anchor-guided sequence labeling model for event detection in both data-abundant and data-scarce scenarios.
Inf. Sci., November, 2023
A Composable Generative Framework Based on Prompt Learning for Various Information Extraction Tasks.
IEEE Trans. Big Data, August, 2023
Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation.
J. Comput. Sci. Technol., July, 2023
A Survey of Reasoning with Foundation Models.
CoRR, 2023
Flames: Benchmarking Value Alignment of Chinese Large Language Models.
CoRR, 2023
Scaling Laws of RoPE-based Extrapolation.
CoRR, 2023
Evaluating Hallucinations in Chinese Large Language Models.
CoRR, 2023
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration.
CoRR, 2023
The Rise and Potential of Large Language Model Based Agents: A Survey.
CoRR, 2023
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models.
CoRR, 2023
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education.
CoRR, 2023
Does Correction Remain A Problem For Large Language Models?
CoRR, 2023
L-Eval: Instituting Standardized Evaluation for Long Context Language Models.
CoRR, 2023
Secrets of RLHF in Large Language Models Part I: PPO.
CoRR, 2023
Full Parameter Fine-tuning for Large Language Models with Limited Resources.
CoRR, 2023
Optimizing Non-Autoregressive Transformers with Contrastive Learning.
CoRR, 2023
Evaluating the Performance of Large Language Models on GAOKAO Benchmark.
CoRR, 2023
PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search.
CoRR, 2023
MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts.
CoRR, 2023
Origin Tracing and Detecting of LLMs.
CoRR, 2023
Finding Supporting Examples for In-Context Learning.
CoRR, 2023
MarkBERT: Marking Word Boundaries Improves Chinese BERT.
Proceedings of the Natural Language Processing and Chinese Computing, 2023
From Hypergraph Energy Functions to Hypergraph Neural Networks.
Proceedings of the International Conference on Machine Learning, 2023
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
SeqXGPT: Sentence-Level AI-Generated Text Detection.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Character-LLM: A Trainable Agent for Role-Playing.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Finding Support Examples for In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
MoT: Memory-of-Thought Enables ChatGPT to Self-Improve.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Watermarking LLMs with Weight Quantization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Rethinking Label Smoothing on Multi-Hop Question Answering.
Proceedings of the Chinese Computational Linguistics - 22nd China National Conference, 2023
Graph Structure Learning via Lottery Hypothesis at Scale.
Proceedings of the Asian Conference on Machine Learning, 2023
Two Birds One Stone: Dynamic Ensemble for OOD Intent Classification.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
A Probabilistic Framework for Discovering New Intents.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Towards Open Environment Intent Prediction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Investigating Glyph-Phonetic Information for Chinese Spell Checking: What Works and What's Next?
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Do Large Language Models Know What They Don't Know?
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Distributed Marker Representation for Ambiguous Discourse Markers and Entangled Relations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Multijugate Dual Learning for Low-Resource Task-Oriented Dialogue System.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Text Adversarial Purification as Defense against Adversarial Attacks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Unified Demonstration Retriever for In-Context Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Dual Cache for Long Document Neural Coreference Resolution.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Improving Contrastive Learning of Sentence Embeddings from AI Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
An AMR-based Link Prediction Approach for Document-level Event Argument Extraction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
Mitigating Negative Style Transfer in Hybrid Dialogue System.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
A survey of transformers.
AI Open, January, 2022
Paradigm Shift in Natural Language Processing.
Int. J. Autom. Comput., 2022
DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models.
CoRR, 2022
Word-Level Representation From Bytes For Language Modeling.
CoRR, 2022
SDCL: Self-Distillation Contrastive Learning for Chinese Spell Checking.
CoRR, 2022
Discovering New Intents Using Latent Variables.
CoRR, 2022
Multi-Task Pre-Training of Modular Prompt for Few-Shot Learning.
CoRR, 2022
An Open-World Lottery Ticket for Out-of-Domain Intent Classification.
CoRR, 2022
A Unified Generative Framework based on Prompt Learning for Various Information Extraction Tasks.
CoRR, 2022
What Dense Graph Do You Need for Self-Attention?
CoRR, 2022
BBTv2: Pure Black-Box Optimization Can Be Comparable to Gradient Descent for Few-Shot Learning.
CoRR, 2022
Rebuild and Ensemble: Exploring Defense Against Text Adversaries.
CoRR, 2022
MarkBERT: Marking Word Boundaries Improves Chinese BERT.
CoRR, 2022
Y-Tuning: An Efficient Tuning Paradigm for Large-Scale Pre-Trained Models via Label Representation Learning.
CoRR, 2022
TURNER: The Uncertainty-based Retrieval Framework for Chinese NER.
CoRR, 2022
CodeRetriever: Unimodal and Bimodal Contrastive Learning.
CoRR, 2022
Towards Collaborative Question Answering: A Preliminary Study.
CoRR, 2022
BART-Reader: Predicting Relations Between Entities via Reading Their Document-Level Context Information.
Proceedings of the Natural Language Processing and Chinese Computing, 2022
CoNT: Contrastive Neural Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
What Dense Graph Do You Need for Self-Attention?
Proceedings of the International Conference on Machine Learning, 2022
Black-Box Tuning for Language-Model-as-a-Service.
Proceedings of the International Conference on Machine Learning, 2022
BBTv2: Towards a Gradient-Free Future with Large Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
CodeRetriever: A Large Scale Contrastive Pre-Training Method for Code Search.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Dialogue Meaning Representation for Task-Oriented Dialogue Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
DORE: Document Ordered Relation Extraction based on Generative Framework.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Is MultiWOZ a Solved Task? An Interactive TOD Evaluation Framework with User Simulator.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Improving Abstractive Dialogue Summarization with Speaker-Aware Supervised Contrastive Learning.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
CoLo: A Contrastive Learning Based Re-ranking Framework for One-Stage Summarization.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
KNN-Contrastive Learning for Out-of-Domain Intent Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
"Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
Contrast and Generation Make BART a Good Dialogue Emotion Recognizer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Co-Attention Memory Network for Multimodal Microblog's Hashtag Recommendation.
IEEE Trans. Knowl. Data Eng., 2021
Towards More Effective and Economic Sparsely-Activated Model.
CoRR, 2021
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline.
CoRR, 2021
KNN-BERT: Fine-Tuning Pre-Trained Models with KNN Classifier.
CoRR, 2021
RetrievalSum: A Retrieval Enhanced Framework for Abstractive Summarization.
CoRR, 2021
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation.
CoRR, 2021
Learning to Teach with Student Feedback.
CoRR, 2021
Pre-Trained Models: Past, Present and Future.
CoRR, 2021
Early Exiting with Ensemble Internal Classifiers.
CoRR, 2021
Dual-axial self-attention network for text classification.
Sci. China Inf. Sci., 2021
Syntax-guided text generation via graph neural network.
Sci. China Inf. Sci., 2021
Text information aggregation with centrality attention.
Sci. China Inf. Sci., 2021
Pre-trained models: Past, present and future.
AI Open, 2021
Searching Effective Transformer for Seq2Seq Keyphrase Generation.
Proceedings of the Natural Language Processing and Chinese Computing, 2021
QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Pre-training with Meta Learning for Chinese Word Segmentation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Backdoor Attacks on Pre-trained Models by Layerwise Weight Poisoning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
SpellBERT: A Lightweight Pretrained Model for Chinese Spelling Check.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Are Factuality Checkers Reliable? Adversarial Meta-evaluation of Factuality in Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Fork or Fail: Cycle-Consistent Training with Many-to-One Mappings.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
A Unified Generative Framework for Various NER Subtasks.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
A Unified Generative Framework for Aspect-based Sentiment Analysis.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Contrastive Aligned Joint Learning for Multilingual Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Accelerating BERT Inference for Sequence Labeling via Early-Exit.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
fastHan: A BERT-based Multi-Task Toolkit for Chinese NLP.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Finding Sparse Structures for Domain Specific Neural Machine Translation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Token-Aware Virtual Adversarial Training in Natural Language Understanding.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Enhancing Scientific Papers Summarization with Citation Graph.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
A Graph-based Model for Joint Chinese Word Segmentation and Dependency Parsing.
Trans. Assoc. Comput. Linguistics, 2020
Chinese Word Segmentation via BiLSTM+Semi-CRF with Relay Node.
J. Comput. Sci. Technol., 2020
Generating Adversarial Examples in Chinese Texts Using Sentence-Pieces.
CoRR, 2020
Finding Sparse Structure for Domain Specific Neural Machine Translation.
CoRR, 2020
Text Information Aggregation with Centrality Attention.
CoRR, 2020
Pre-trained Model for Chinese Word Segmentation with Meta Learning.
CoRR, 2020
CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems.
CoRR, 2020
BERT for Monolingual and Cross-Lingual Reverse Dictionary.
CoRR, 2020
AutoRC: Improving BERT Based Relation Classification Models via Architecture Search.
CoRR, 2020
fastHan: A BERT-based Joint Many-Task Toolkit for Chinese NLP.
CoRR, 2020
Improving Image Captioning with Better Use of Captions.
CoRR, 2020
CycleGT: Unsupervised Graph-to-Text and Text-to-Graph Generation via Cycle Training.
CoRR, 2020
Relation of the Relations: A New Paradigm of the Relation Extraction Problem.
CoRR, 2020
TextAT: Adversarial Training for Natural Language Understanding with Token-Level Perturbation.
CoRR, 2020
Unified Multi-Criteria Chinese Word Segmentation with BERT.
CoRR, 2020
Pre-trained Models for Natural Language Processing: A Survey.
CoRR, 2020
BERT for Monolingual and Cross-Lingual Reverse Dictionary.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
A Concise Model for Multi-Criteria Chinese Word Segmentation with Transformer Encoder.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Pre-training Multilingual Neural Machine Translation by Leveraging Alignment Information.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
BERT-ATTACK: Adversarial Attack Against BERT Using BERT.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
CoLAKE: Contextualized Language and Knowledge Embedding.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Extractive Summarization as Text Matching.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Heterogeneous Graph Neural Networks for Extractive Document Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Improving Image Captioning with Better Use of Caption.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
FLAT: Chinese NER Using Flat-Lattice Transformer.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Learning Sparse Sharing Architectures for Multiple Tasks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Joint Parsing and Generation for Abstractive Summarization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Multi-Scale Self-Attention for Text Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Sequence Labeling With Deep Gated Dual Path CNN.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Low-Rank and Locality Constrained Self-Attention for Sequence Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
TENER: Adapting Transformer Encoder for Named Entity Recognition.
CoRR, 2019
BP-Transformer: Modelling Long-Range Context via Binary Partitioning.
CoRR, 2019
A Closer Look at Data Bias in Neural Extractive Summarization Models.
CoRR, 2019
Exploring Domain Shift in Extractive Text Summarization.
CoRR, 2019
DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks.
CoRR, 2019
Multi-Criteria Chinese Word Segmentation with Transformer.
CoRR, 2019
A Unified Model for Joint Chinese Word Segmentation and Dependency Parsing.
CoRR, 2019
Implicit discourse relation detection using concatenated word embeddings and a gated relevance network.
Sci. China Inf. Sci., 2019
VCWE: Visual Character-Enhanced Word Embeddings.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
How to Fine-Tune BERT for Text Classification?
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019
Searching for Effective Neural Extractive Summarization: What Works and What's Next.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Learning Multi-Task Communication with Message Passing for Sequence Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Switch-LSTMs for Multi-Criteria Chinese Word Segmentation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Multi-task Learning over Graph Structures.
CoRR, 2018
U-Net: Machine Reading Comprehension with Unanswerable Questions.
CoRR, 2018
Deformable Stacked Structure for Named Entity Recognition.
CoRR, 2018
Neural Arithmetic Expression Calculator.
CoRR, 2018
Exploring Shared Structures and Hierarchies for Multiple NLP Tasks.
CoRR, 2018
Gaussian Word Embedding with a Wasserstein Distance Loss.
CoRR, 2018
Top-Down Tree Structured Text Generation.
CoRR, 2018
Towards Diverse Text Generation with Inverse Reinforcement Learning.
CoRR, 2018
Same Representation, Different Attentions: Shareable Sentence Representation Learning from Multiple Tasks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Toward Diverse Text Generation with Inverse Reinforcement Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Reinforced Mnemonic Reader for Machine Reading Comprehension.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Convolutional Interaction Network for Natural Language Inference.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
A Simple yet Effective Joint Training Method for Cross-Lingual Universal Dependency Parsing.
Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Brussels, Belgium, October 31, 2018
Information Aggregation via Dynamic Routing for Sequence Encoding.
Proceedings of the 27th International Conference on Computational Linguistics, 2018
Incorporating Discriminator in Sentence Generation: a Gibbs Sampling Method.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Meta Multi-Task Learning for Sequence Modeling.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Mnemonic Reader for Machine Comprehension.
CoRR, 2017
DAG-based Long Short-Term Memory for Neural Word Segmentation.
CoRR, 2017
Hyper-Gated Recurrent Neural Networks for Chinese Word Segmentation.
Proceedings of the Natural Language Processing and Chinese Computing, 2017
Overview of the NLPCC 2017 Shared Task: Chinese News Headline Categorization.
Proceedings of the Natural Language Processing and Chinese Computing, 2017
Knowledge Graph Representation with Jointly Structural and Textual Encoding.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Adaptive Semantic Compositionality for Sentence Modelling.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Dynamic Compositional Neural Networks over Tree Structure.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
A Feature-Enriched Neural Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Idiom-Aware Compositional Distributed Semantics.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
End-to-End Neural Text Classification for Tibetan.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017
Adversarial Multi-task Learning for Text Classification.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
Adversarial Multi-Criteria Learning for Chinese Word Segmentation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
Deep Multi-Task Learning with Shared Memory.
CoRR, 2016
Syntax-based Attention Model for Natural Language Inference.
CoRR, 2016
Modelling Interaction of Sentence Pair with coupled-LSTMs.
CoRR, 2016
End-to-End Neural Sentence Ordering Using Pointer Network.
CoRR, 2016
A Long Dependency Aware Deep Architecture for Joint Chinese Word Segmentation and POS Tagging.
CoRR, 2016
Neural Sentence Ordering.
CoRR, 2016
Learning Word Embeddings from Intrinsic and Extrinsic Views.
CoRR, 2016
Overview of the NLPCC-ICCPOL 2016 Shared Task: Chinese Word Segmentation for Micro-Blog Texts.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016
Convolutional Deep Neural Networks for Document-Based Question Answering.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016
Bridging LSTM Architecture and the Neural Dynamics during Reading.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Recurrent Neural Network for Text Classification with Multi-Task Learning.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Cached Long Short-Term Memory Neural Networks for Document-Level Sentiment Classification.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Analyzing Linguistic Knowledge in Sequential Model of Sentence.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Modelling Interaction of Sentence Pair with Coupled-LSTMs.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Deep Multi-Task Learning with Shared Memory for Text Classification.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
A New Psychometric-inspired Evaluation Metric for Chinese Word Segmentation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
Investigating Language Universal and Specific Properties in Word Embeddings.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
Deep Fusion LSTMs for Text Semantic Matching.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
Implicit Discourse Relation Detection via a Deep Architecture with Gated Relevance Network.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
2015
Overview of the NLPCC 2015 Shared Task: Chinese Word Segmentation and POS Tagging for Micro-blog Texts.
CoRR, 2015
Gaussian Mixture Embeddings for Multiple Word Prototypes.
CoRR, 2015
Transition-Based Dependency Parsing with Long Distance Collocations.
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015
Overview of the NLPCC 2015 Shared Task: Chinese Word Segmentation and POS Tagging for Micro-blog Texts.
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015
Convolutional Neural Tensor Network Architecture for Community-Based Question Answering.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
Learning Context-Sensitive Word Embeddings with Neural Tensor Skip-Gram Model.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
Multi-Timescale Long Short-Term Memory Neural Network for Modelling Sentences and Documents.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Transition-based Dependency Parsing Using Two Heterogeneous Gated Recursive Neural Networks.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Sentence Modeling with Gated Recursive Neural Network.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Long Short-Term Memory Neural Networks for Chinese Word Segmentation.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Learning to Rank Answers for Definitional Question Answering.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015
A Re-ranking Model for Dependency Parser with Recursive Convolutional Neural Network.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
Gated Recursive Neural Network for Chinese Word Segmentation.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
2014
Automatic Corpus Expansion for Chinese Word Segmentation by Exploiting the Redundancy of Web Information.
Proceedings of the COLING 2014, 2014
Improving Multi-pass Transition-Based Dependency Parsing Using Enhanced Shift Actions.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2014
Text Classification with Document Embeddings.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2014
2013
Learning Topical Translation Model for Microblog Hashtag Suggestion.
Proceedings of the IJCAI 2013, 2013
Question identification in Chinese micro-texts.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2013
A Unified Model for Joint Chinese Word Segmentation and POS Tagging with Heterogeneous Annotation Corpora.
Proceedings of the 2013 International Conference on Asian Language Processing, 2013
Feature Abstraction for Lightweight and Accurate Chinese Word Segmentation.
Proceedings of the 2013 International Conference on Asian Language Processing, 2013
Joint Chinese Word Segmentation and POS Tagging on Heterogeneous Annotated Corpora with Multiple Task Learning.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013
Online Distributed Passive-Aggressive Algorithm for Structured Learning.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2013
Chinese Word Segmentation with Character Abstraction.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2013
FudanNLP: A Toolkit for Chinese Natural Language Processing.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013
Latent Semantic Tensor Indexing for Community-based Question Answering.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013
2012
Recognizing Inference in Texts with Markov Logic Networks.
ACM Trans. Asian Lang. Inf. Process., 2012
Part-of-Speech Tagging for Chinese-English Mixed Texts with Dynamic Features.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012
Joint Segmentation and Tagging with Coupled Sequences Labeling.
Proceedings of the COLING 2012, 2012
Discovering logical knowledge for deep question answering.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012
2011
An Effective Feature Selection Method for Text Categorization.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2011
FudanNLP at RITE 2011: a Shallow Semantic Approach to Textual Entailment.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011
A Fast Accurate Two-stage Training Algorithm for L1-regularized CRFs with Heuristic Line Search Strategy.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011
Question Answering for Machine Reading with Lexical Chain.
Proceedings of the CLEF 2011 Labs and Workshop, 2011
Labelwise Margin Maximization for Sequence Labeling.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2011
Hierarchical Text Classification with Latent Concepts.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, Oregon, USA, 19-24 June, 2011
2010
Bagging to find better expansion words.
Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering, 2010
Detecting Hedge Cues and their Scopes with Average Perceptron.
Proceedings of the Fourteenth Conference on Computational Natural Language Learning: Shared Task, 2010
Mining Uncertain Sentences with Multiple Instance Learning.
Proceedings of the Advanced Data Mining and Applications - 6th International Conference, 2010
Triplet-Based Chinese Word Sense Induction.
Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2010
Adaptive Chinese Word Segmentation with Online Passive-Aggressive Algorithm.
Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2010
2009
Info-margin maximization for feature extraction.
Pattern Recognit. Lett., 2009
Face recognition with info-margin maximization.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Hierarchical Multi-Label Text Categorization with Global Margin Maximization.
Proceedings of the ACL 2009, 2009
2008
Feature Selection Based on a New Dependency Measure.
Proceedings of the Fifth International Conference on Fuzzy Systems and Knowledge Discovery, 2008
Answering Definition Question: Ranking for Top-k.
Proceedings of the ECAI 2008, 2008
Domain Adaptation for Conditional Random Fields.
Proceedings of the Information Retrieval Technology, 2008
KLNCC: A new nonlinear correlation clustering algorithm based on KL-divergence.
Proceedings of 8th IEEE International Conference on Computer and Information Technology, 2008
2007
Two-dimensional nearest neighbor discriminant analysis.
Neurocomputing, 2007
FDUQA on TREC 2007 QA Track.
Proceedings of The Sixteenth Text REtrieval Conference, 2007
2006
Nearest Neighbor Discriminant Analysis.
Int. J. Pattern Recognit. Artif. Intell., 2006
2005
Stepwise Nearest Neighbor Discriminant Analysis.
Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence (IJCAI-05), Edinburgh, Scotland, UK, July 30, 2005
Null space-based LDA with weighted dual personal subspaces for face recognition.
Proceedings of the 2005 International Conference on Image Processing, 2005
Nonparametric maximum margin criterion for face recognition.
Proceedings of the 2005 International Conference on Image Processing, 2005
Face Recognition by Stepwise Nonparametric Margin Maximum Criterion.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005
2004
Boosting image classification scheme.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
2003
Fudan University at TRECVID 2003.
Proceedings of the 2003 TREC Video Retrieval Evaluation, 2003