Jie Fu

Orcid: 0000-0002-4494-843X

Affiliations:

Shangai AI Lab, China
Hong Kong University of Science and Technology, Department of Computer Science and Engineering, Hong Kong (2023 - 2024)
Beijing Academy of Artificial Intelligence, China (2022 - 2023)
Quebec AI Institute (Mila), Montreal, Canada (2017 - 2022)
National University of Singapore (PhD 2017)

According to our database¹, Jie Fu authored at least 118 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Pre-trained Language Models in Biomedical Domain: A Systematic Survey.

[BibT_eX]

[DOI]

ACM Comput. Surv., March, 2024

Exploring Clean Label Backdoor Attacks and Defense in Language Models.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness.

[BibT_eX]

[DOI]

CoRR, 2024

Layerwise Recurrent Router for Mixture-of-Experts.

[BibT_eX]

[DOI]

CoRR, 2024

A Closer Look into Mixture-of-Experts in Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents.

[BibT_eX]

[DOI]

CoRR, 2024

MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation.

[BibT_eX]

[DOI]

CoRR, 2024

A Survey of Backdoor Attacks and Defenses on Large Language Models: Implications for Security Measures.

[BibT_eX]

[DOI]

CoRR, 2024

VCR: Visual Caption Restoration.

[BibT_eX]

[DOI]

CoRR, 2024

D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training.

[BibT_eX]

[DOI]

CoRR, 2024

ComposerX: Multi-Agent Symbolic Music Composition with LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

MuPT: A Generative Symbolic Music Pretrained Transformer.

[BibT_eX]

[DOI]

CoRR, 2024

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

CodeEditorBench: Evaluating Code Editing Capability of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation.

[BibT_eX]

[DOI]

CoRR, 2024

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning.

[BibT_eX]

[DOI]

CoRR, 2024

Electrocardiogram Instruction Tuning for Report Generation.

[BibT_eX]

[DOI]

CoRR, 2024

DEEP-ICL: Definition-Enriched Experts for Language Model In-Context Learning.

[BibT_eX]

[DOI]

CoRR, 2024

m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers.

[BibT_eX]

[DOI]

CoRR, 2024

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding.

[BibT_eX]

[DOI]

CoRR, 2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM.

[BibT_eX]

[DOI]

CoRR, 2024

CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation.

[BibT_eX]

[DOI]

CoRR, 2024

CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction.

[BibT_eX]

[DOI]

CoRR, 2024

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark.

[BibT_eX]

[DOI]

CoRR, 2024

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Kun: Answer Polishment for Chinese Self-Alignment with Instruction Back-Translation.

[BibT_eX]

[DOI]

CoRR, 2024

Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Unlocking Emergent Modularity in Large Language Models.

[BibT_eX]

[DOI]

Zihan Qiu

Zeyu Huang

Jie Fu

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

AutoAgents: A Framework for Automatic Agent Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Think Before You Act: Decision Transformers with Working Memory.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Massive Editing for Large Language Models via Meta Learning.

[BibT_eX]

[DOI]

Chenmien Tan

Ge Zhang

Jie Fu

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers.

[BibT_eX]

[DOI]

Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Unlocking Continual Learning Abilities in Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

UniIR: Training and Benchmarking Universal Multimodal Information Retrievers.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

E2-LLM: Efficient and Extreme Length Extension of Large Language Models.

[BibT_eX]

[DOI]

JiakaiWang JiakaiWang

Proceedings of the Findings of the Association for Computational Linguistics, 2024

CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Running ahead of evolution - AI-based simulation for predicting future high-risk SARS-CoV-2 variants.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., November, 2023

Align on the Fly: Adapting Chatbot Behavior to Established Norms.

[BibT_eX]

[DOI]

CoRR, 2023

A Survey of Reasoning with Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2023

T3D: Towards 3D Medical Image Understanding through Vision-Language Pre-training.

[BibT_eX]

[DOI]

Che Liu

Cheng Ouyang

Yinda Chen

César Quilodrán Casas

CoRR, 2023

SynFundus: A synthetic fundus images dataset with millions of samples and multi-disease annotations.

[BibT_eX]

[DOI]

CoRR, 2023

AI Alignment: A Comprehensive Survey.

[BibT_eX]

[DOI]

CoRR, 2023

Emergent Mixture-of-Experts: Can Dense Pre-trained Transformers Benefit from Emergent Modular Structures?

[BibT_eX]

[DOI]

Zihan Qiu

Zeyu Huang

Jie Fu

CoRR, 2023

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators.

[BibT_eX]

[DOI]

CoRR, 2023

Deep Reinforcement Learning with Multitask Episodic Memory Based on Task-Conditioned Hypernetwork.

[BibT_eX]

[DOI]

CoRR, 2023

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training.

[BibT_eX]

[DOI]

CoRR, 2023

TPDM: Selectively Removing Positional Information for Zero-shot Translation via Token-Level Position Disentangle Module.

[BibT_eX]

[DOI]

Xingran Chen

Ge Zhang

Jie Fu

CoRR, 2023

Interactive Natural Language Processing.

[BibT_eX]

[DOI]

CoRR, 2023

Huatuo-26M, a Large-scale Chinese Medical QA Dataset.

[BibT_eX]

[DOI]

CoRR, 2023

Chinese Open Instruction Generalist: A Preliminary Release.

[BibT_eX]

[DOI]

CoRR, 2023

Modular Retrieval for Generalization and Interpretation.

[BibT_eX]

[DOI]

CoRR, 2023

A Pathway Towards Responsible AI Generated Content.

[BibT_eX]

[DOI]

Chen Chen

Jie Fu

Lingjuan Lyu

CoRR, 2023

CORGI-PM: A Chinese Corpus For Gender Bias Probing and Mitigation.

[BibT_eX]

[DOI]

CoRR, 2023

MARBLE: Music Audio Representation Benchmark for Universal Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias.

[BibT_eX]

[DOI]

César Quilodrán Casas

Rossella Arcucci

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LyricWhiz: Robust Multilingual Zero-Shot Lyrics Transcription by Whispering to ChatGPT.

[BibT_eX]

[DOI]

Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

On the Effectiveness of Speech Self-Supervised Learning for Music.

[BibT_eX]

[DOI]

Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Unifying Discrete and Continuous Representations for Unsupervised Paraphrase Generation.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Bidirectional Learning for Offline Infinite-width Model-based Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MentalBERT: Publicly Available Pretrained Language Models for Mental Healthcare.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

HERB: Measuring Hierarchical Regional Bias in Pre-trained Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

Biological Sequence Design with GFlowNets.

[BibT_eX]

[DOI]

Moksh Jain

Emmanuel Bengio

Alex Hernández-García

Jarrid Rector-Brooks

Bonaventure F. P. Dossou

Proceedings of the International Conference on Machine Learning, 2022

Unifying Likelihood-free Inference with Black-box Optimization and Beyond.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Reconciliation of Pre-trained Models and Prototypical Neural Networks in Few-shot Named Entity Recognition.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Learning Multi-Objective Curricula for Robotic Policy Learning.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2022

1Cademy @ Causal News Corpus 2022: Leveraging Self-Training in Causality Classification of Socio-Political Event Data.

[BibT_eX]

[DOI]

Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2022

1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position Selector.

[BibT_eX]

[DOI]

Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2022

2021

Pre-trained Language Models in Biomedical Domain: A Systematic Survey.

[BibT_eX]

[DOI]

CoRR, 2021

Unifying Likelihood-free Inference with Black-box Sequence Design and Beyond.

[BibT_eX]

[DOI]

CoRR, 2021

Learning Multi-Objective Curricula for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with 1/n Parameters.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

FloW: A Dataset and Benchmark for Floating Waste Detection in Inland Waters.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

On Orthogonality Constraints for Transformers.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

GLGE: A New General Language Generation Evaluation Benchmark.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Role-Wise Data Augmentation for Knowledge Distillation.

[BibT_eX]

[DOI]

CoRR, 2020

Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation.

[BibT_eX]

[DOI]

CoRR, 2020

Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Interactive Machine Comprehension with Information Seeking Agents.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Would you Rather? A New Benchmark for Learning Machine Alignment with Cultural Values and Social Preferences.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

RikiNet: Reading Wikipedia Pages for Natural Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Revision in Continuous Space: Unsupervised Text Style Transfer without Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

BFGAN: Backward and Forward Generative Adversarial Networks for Lexically Constrained Sentence Generation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Learning Sparse Mixture of Experts for Visual Question Answering.

[BibT_eX]

[DOI]

Vardaan Pahuja

Jie Fu

Christopher J. Pal

CoRR, 2019

Exploring Domain Shift in Extractive Text Summarization.

[BibT_eX]

[DOI]

CoRR, 2019

Conditional Computation for Continual Learning.

[BibT_eX]

[DOI]

Min Lin

Jie Fu

Yoshua Bengio

CoRR, 2019

Revision in Continuous Space: Fine-Grained Control of Text Style Transfer.

[BibT_eX]

[DOI]

CoRR, 2019

Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Interactive Language Learning by Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Structure Learning for Neural Module Networks.

[BibT_eX]

[DOI]

Vardaan Pahuja

Jie Fu

Sarath Chandar

Christopher Joseph Pal

Proceedings of the Beyond Vision and LANguage: inTEgrating Real-world kNowledge, 2019

Dataflow-Based Joint Quantization for Deep Neural Networks.

[BibT_eX]

[DOI]

Xue Geng