Hang Yan

Affiliations:
  • Shanghai AI Laboratory, Shanghai, China
  • Fudan University, Key Laboratory of Intelligent Information Processing, Shanghai, China


According to our database1, Hang Yan authored at least 83 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
MOSS: An Open Conversational Large Language Model.
Mach. Intell. Res., October, 2024

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices.
CoRR, 2024

Efficient Training of Large Language Models on Distributed Infrastructures: A Survey.
CoRR, 2024

Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope.
CoRR, 2024

Case2Code: Learning Inductive Reasoning with Synthetic Data.
CoRR, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output.
CoRR, 2024

AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data.
CoRR, 2024

FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models.
CoRR, 2024

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites.
CoRR, 2024

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD.
CoRR, 2024

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models.
CoRR, 2024

In-Memory Learning: A Declarative Learning Framework for Large Language Models.
CoRR, 2024

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset.
CoRR, 2024

Data-freeWeight Compress and Denoise for Large Language Models.
CoRR, 2024

Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge.
CoRR, 2024

Identifying Semantic Induction Heads to Understand In-Context Learning.
CoRR, 2024

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling.
CoRR, 2024

ChemLLM: A Chemical Large Language Model.
CoRR, 2024

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.
CoRR, 2024

MouSi: Poly-Visual-Expert Vision-Language Models.
CoRR, 2024

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model.
CoRR, 2024

F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods.
CoRR, 2024

Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora.
CoRR, 2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.
CoRR, 2024

Secrets of RLHF in Large Language Models Part II: Reward Modeling.
CoRR, 2024

CPT: a pre-trained unbalanced transformer for both Chinese language understanding and generation.
Sci. China Inf. Sci., 2024

Lins: Reducing Communication Overhead of ZeRO for Efficient LLM Training.
Proceedings of the 32nd IEEE/ACM International Symposium on Quality of Service, 2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Scaling Laws of RoPE-based Extrapolation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Turn Waste into Worth: Rectifying Top-k Router of MoE.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LongWanjuan: Towards Systematic Measurement for Long Text Quality.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Unified Active Retrieval for Retrieval Augmented Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Length Generalization of Causal Transformers without Position Encoding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Code Needs Comments: Enhancing Code LLMs with Comment Augmentation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Balanced Data Sampling for Language Model Training with Clustering.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Identifying Semantic Induction Heads to Understand In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

AdaLomo: Low-memory Optimization with Adaptive Learning Rate.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Scaling Laws of RoPE-based Extrapolation.
CoRR, 2023

InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition.
CoRR, 2023

WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models.
CoRR, 2023

Does Correction Remain A Problem For Large Language Models?
CoRR, 2023

Secrets of RLHF in Large Language Models Part I: PPO.
CoRR, 2023

PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search.
CoRR, 2023

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Watermarking LLMs with Weight Quantization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Rethinking Label Smoothing on Multi-Hop Question Answering.
Proceedings of the Chinese Computational Linguistics - 22nd China National Conference, 2023

Investigating Glyph-Phonetic Information for Chinese Spell Checking: What Works and What's Next?
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Unified Demonstration Retriever for In-Context Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
SDCL: Self-Distillation Contrastive Learning for Chinese Spell Checking.
CoRR, 2022

TURNER: The Uncertainty-based Retrieval Framework for Chinese NER.
CoRR, 2022

Towards Collaborative Question Answering: A Preliminary Study.
CoRR, 2022

BART-Reader: Predicting Relations Between Entities via Reading Their Document-Level Context Information.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

Dialogue Meaning Representation for Task-Oriented Dialogue Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

DORE: Document Ordered Relation Extraction based on Generative Framework.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Contrast and Generation Make BART a Good Dialogue Emotion Recognizer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Text information aggregation with centrality attention.
Sci. China Inf. Sci., 2021

Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

SpellBERT: A Lightweight Pretrained Model for Chinese Spelling Check.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Unified Generative Framework for Various NER Subtasks.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

A Unified Generative Framework for Aspect-based Sentiment Analysis.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Accelerating BERT Inference for Sequence Labeling via Early-Exit.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

fastHan: A BERT-based Multi-Task Toolkit for Chinese NLP.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
A Graph-based Model for Joint Chinese Word Segmentation and Dependency Parsing.
Trans. Assoc. Comput. Linguistics, 2020

Chinese Word Segmentation via BiLSTM+Semi-CRF with Relay Node.
J. Comput. Sci. Technol., 2020

Text Information Aggregation with Centrality Attention.
CoRR, 2020

BERT for Monolingual and Cross-Lingual Reverse Dictionary.
CoRR, 2020

fastHan: A BERT-based Joint Many-Task Toolkit for Chinese NLP.
CoRR, 2020

BERT for Monolingual and Cross-Lingual Reverse Dictionary.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

A Concise Model for Multi-Criteria Chinese Word Segmentation with Transformer Encoder.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

FLAT: Chinese NER Using Flat-Lattice Transformer.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning Sparse Sharing Architectures for Multiple Tasks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Constructing Multiple Tasks for Augmentation: Improving Neural Image Classification with <i>K</i>-Means Features.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Constructing Multiple Tasks for Augmentation: Improving Neural Image Classification With K-means Features.
CoRR, 2019

TENER: Adapting Transformer Encoder for Named Entity Recognition.
CoRR, 2019

Multi-Criteria Chinese Word Segmentation with Transformer.
CoRR, 2019

A Unified Model for Joint Chinese Word Segmentation and Dependency Parsing.
CoRR, 2019

2018
Gaussian Word Embedding with a Wasserstein Distance Loss.
CoRR, 2018


  Loading...