Xu Han

Orcid: 0000-0002-4726-7621

Affiliations:
  • Tsinghua University, Department of Computer Science and Technology, China


According to our database1, Xu Han authored at least 120 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Multi-Modal Multi-Granularity Tokenizer for Chu Bamboo Slips.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
Hyperbolic Pre-Trained Language Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Densing Law of LLMs.
CoRR, 2024

KBAlign: Efficient Self Adaptation on Specific Knowledge Bases.
CoRR, 2024

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity.
CoRR, 2024

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents.
CoRR, 2024

LLM⨉MapReduce: Simplified Long-Sequence Processing using Large Language Models.
CoRR, 2024

Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation.
CoRR, 2024

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling.
CoRR, 2024

Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads.
CoRR, 2024

Enabling Real-Time Conversations with Minimal Training Costs.
CoRR, 2024

From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents.
CoRR, 2024

Configurable Foundation Models: Building LLMs from a Modular Perspective.
CoRR, 2024

Multi-Modal Multi-Granularity Tokenizer for Chu Bamboo Slip Scripts.
CoRR, 2024

MiniCPM-V: A GPT-4V Level MLLM on Your Phone.
CoRR, 2024

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework.
CoRR, 2024

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models.
CoRR, 2024

Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training.
CoRR, 2024

UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs.
CoRR, 2024

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies.
CoRR, 2024

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences.
CoRR, 2024

Unified View of Grokking, Double Descent and Emergent Abilities: A Perspective from Circuits Competition.
CoRR, 2024

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting.
CoRR, 2024

∞Bench: Extending Long Context Evaluation Beyond 100K Tokens.
CoRR, 2024

OneBit: Towards Extremely Low-bit Large Language Models.
CoRR, 2024

InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory.
CoRR, 2024

ReLU<sup>2</sup> Wins: Discovering Efficient Activation Functions for Sparse LLMs.
CoRR, 2024

Exploring the Benefit of Activation Sparsity in Pre-training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Predicting Emergent Abilities with Infinite Resolution Evaluation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Robust and Scalable Model Editing for Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

ınftyBench: Extending Long Context Evaluation Beyond 100K Tokens.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MAVEN-ARG: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
A Systematic Approach for the Calibration of Route Choice Models Based on Stochastic User Equilibrium.
IEEE Intell. Transp. Syst. Mag., 2023

Unlock Predictable Scaling from Emergent Abilities.
CoRR, 2023

ConPET: Continual Parameter-Efficient Tuning for Large Language Models.
CoRR, 2023

QASnowball: An Iterative Bootstrapping Framework for High-Quality Question-Answering Data Generation.
CoRR, 2023

CPET: Effective Parameter-Efficient Tuning for Compressed Large Language Models.
CoRR, 2023

Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots.
CoRR, 2023

Tool Learning with Foundation Models.
CoRR, 2023

H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Emergent Modularity in Pre-trained Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Plug-and-Play Knowledge Injection for Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Plug-and-Play Document Modules for Pre-trained Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Recyclable Tuning for Continual Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

WebCPM: Interactive Web Search for Chinese Long-form Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Stochastic Bridges as Effective Regularizers for Parameter-Efficient Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
PTR: Prompt Tuning with Rules for Text Classification.
AI Open, January, 2022

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Delta Tuning.
CoRR, 2022

GACT: Activation Compressed Training for General Architectures.
CoRR, 2022

Knowledge Inheritance for Pre-trained Language Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

GACT: Activation Compressed Training for Generic Network Architectures.
Proceedings of the International Conference on Machine Learning, 2022

BMCook: A Task-agnostic Compression Toolkit for Big Models.
Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Parameter-Efficient Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subevent Relation Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Exploring Mode Connectivity for Pre-trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Prompt-learning for Fine-grained Entity Typing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

BMInf: An Efficient Toolkit for Big Model Inference and Tuning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

Cross-Lingual Contrastive Learning for Fine-Grained Entity Typing for Low-Resource Languages.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

PPT: Pre-trained Prompt Tuning for Few-shot Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Fully Hyperbolic Neural Networks.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
CSS-LM: A Contrastive Framework for Semi-Supervised Fine-Tuning of Pre-Trained Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Prompt-Learning for Fine-Grained Entity Typing.
CoRR, 2021

Pre-Trained Models: Past, Present and Future.
CoRR, 2021

CSS-LM: A Contrastive Framework for Semi-supervised Fine-tuning of Pre-trained Language Models.
CoRR, 2021

Knowledgeable machine learning for natural language processing.
Commun. ACM, 2021

CPM: A large-scale generative Chinese Pre-trained language model.
AI Open, 2021

CPM-2: Large-scale cost-effective pre-trained language models.
AI Open, 2021

CokeBERT: Contextual knowledge selection and embedding towards enhanced pre-trained language models.
AI Open, 2021

Pre-trained models: Past, present and future.
AI Open, 2021

Open Hierarchical Relation Extraction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Visual Distant Supervision for Scene Graph Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

CLEVE: Contrastive Pre-training for Event Extraction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Manual Evaluation Matters: Reviewing Test Protocols of Distantly Supervised Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Few-NERD: A Few-shot Named Entity Recognition Dataset.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Adversarial Language Games for Advanced Natural Language Intelligence.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models.
CoRR, 2020

Neural Gibbs Sampling for Joint Event Argument Extraction.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Denoising Relation Extraction from Document-level Distant Supervision.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

MAVEN: A Massive General Domain Event Detection Dataset.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Learning from Context or Names? An Empirical Study on Neural Relation Extraction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Dynamic Anticipation and Completion for Multi-Hop Reasoning over Sparse Knowledge Graph.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

IsOBS: An Information System for Oracle Bone Script.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Meta-Information Guided Meta-Learning for Few-Shot Relation Classification.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Continual Relation Learning via Episodic Memory Activation and Reconsolidation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Neural Snowball for Few-Shot Relation Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Adversarial Language Games for Advanced Natural Language Intelligence.
CoRR, 2019

Adversarial Training for Weakly Supervised Event Detection.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Open Relation Extraction: Relational Knowledge Transfer from Supervised Data to Unsupervised Data.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

HMEAE: Hierarchical Modular Event Argument Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Adapting Meta Knowledge Graph Information for Multi-Hop Reasoning over Few-Shot Relations.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

FewRel 2.0: Towards More Challenging Few-Shot Relation Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

DIAG-NRE: A Neural Pattern Diagnosis Framework for Distantly Supervised Neural Relation Extraction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

ERNIE: Enhanced Language Representation with Informative Entities.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

DocRED: A Large-Scale Document-Level Relation Extraction Dataset.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Quantifying Similarity between Relations with Fact Distribution.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Hybrid Attention-Based Prototypical Networks for Noisy Few-Shot Relation Classification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Knowledge Representation Learning: A Quantitative Review.
CoRR, 2018

Denoising Distant Supervision for Relation Extraction via Instance-Level Adversarial Training.
CoRR, 2018

Put It Back: Entity Typing with Language Model Enhancement.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

FewRel: A Large-Scale Supervised Few-shot Relation Classification Dataset with State-of-the-Art Evaluation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Hierarchical Relation Extraction with Coarse-to-Fine Grained Attention.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

OpenKE: An Open Toolkit for Knowledge Embedding.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

Adversarial Multi-lingual Neural Relation Extraction.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Neural Knowledge Acquisition via Mutual Attention Between Knowledge Graph and Text.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2016
Joint Representation Learning of Text and Knowledge for Knowledge Graph Completion.
CoRR, 2016


  Loading...