Jie Fu

Orcid: 0000-0002-4494-843X

Affiliations:
  • Shangai AI Lab, China
  • Hong Kong University of Science and Technology, Department of Computer Science and Engineering, Hong Kong (2023 - 2024)
  • Beijing Academy of Artificial Intelligence, China (2022 - 2023)
  • Quebec AI Institute (Mila), Montreal, Canada (2017 - 2022)
  • National University of Singapore (PhD 2017)


According to our database1, Jie Fu authored at least 118 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Pre-trained Language Models in Biomedical Domain: A Systematic Survey.
ACM Comput. Surv., March, 2024

Exploring Clean Label Backdoor Attacks and Defense in Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness.
CoRR, 2024

Layerwise Recurrent Router for Mixture-of-Experts.
CoRR, 2024

A Closer Look into Mixture-of-Experts in Large Language Models.
CoRR, 2024

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents.
CoRR, 2024

MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation.
CoRR, 2024

A Survey of Backdoor Attacks and Defenses on Large Language Models: Implications for Security Measures.
CoRR, 2024

VCR: Visual Caption Restoration.
CoRR, 2024

D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models.
CoRR, 2024

Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training.
CoRR, 2024

ComposerX: Multi-Agent Symbolic Music Composition with LLMs.
CoRR, 2024

MuPT: A Generative Symbolic Music Pretrained Transformer.
CoRR, 2024

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model.
CoRR, 2024

CodeEditorBench: Evaluating Code Editing Capability of Large Language Models.
CoRR, 2024

The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis.
CoRR, 2024

RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation.
CoRR, 2024

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning.
CoRR, 2024

Electrocardiogram Instruction Tuning for Report Generation.
CoRR, 2024

DEEP-ICL: Definition-Enriched Experts for Language Model In-Context Learning.
CoRR, 2024

m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers.
CoRR, 2024

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding.
CoRR, 2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM.
CoRR, 2024

CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation.
CoRR, 2024

CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.
CoRR, 2024

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling.
CoRR, 2024

Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction.
CoRR, 2024

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark.
CoRR, 2024

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models.
CoRR, 2024

Kun: Answer Polishment for Chinese Self-Alignment with Instruction Back-Translation.
CoRR, 2024

Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Unlocking Emergent Modularity in Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

AutoAgents: A Framework for Automatic Agent Generation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Think Before You Act: Decision Transformers with Working Memory.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Massive Editing for Large Language Models via Meta Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers.
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Unlocking Continual Learning Abilities in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024


SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

E2-LLM: Efficient and Extreme Length Extension of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Running ahead of evolution - AI-based simulation for predicting future high-risk SARS-CoV-2 variants.
Int. J. High Perform. Comput. Appl., November, 2023

Align on the Fly: Adapting Chatbot Behavior to Established Norms.
CoRR, 2023

A Survey of Reasoning with Foundation Models.
CoRR, 2023

T3D: Towards 3D Medical Image Understanding through Vision-Language Pre-training.
CoRR, 2023

SynFundus: A synthetic fundus images dataset with millions of samples and multi-disease annotations.
CoRR, 2023

UniIR: Training and Benchmarking Universal Multimodal Information Retrievers.
CoRR, 2023

AI Alignment: A Comprehensive Survey.
CoRR, 2023

Emergent Mixture-of-Experts: Can Dense Pre-trained Transformers Benefit from Emergent Modular Structures?
CoRR, 2023

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.
CoRR, 2023

Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators.
CoRR, 2023

Deep Reinforcement Learning with Multitask Episodic Memory Based on Task-Conditioned Hypernetwork.
CoRR, 2023

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training.
CoRR, 2023

TPDM: Selectively Removing Positional Information for Zero-shot Translation via Token-Level Position Disentangle Module.
CoRR, 2023

Interactive Natural Language Processing.
CoRR, 2023

Huatuo-26M, a Large-scale Chinese Medical QA Dataset.
CoRR, 2023

Chinese Open Instruction Generalist: A Preliminary Release.
CoRR, 2023

Modular Retrieval for Generalization and Interpretation.
CoRR, 2023

A Pathway Towards Responsible AI Generated Content.
CoRR, 2023

CORGI-PM: A Chinese Corpus For Gender Bias Probing and Mitigation.
CoRR, 2023

MARBLE: Music Audio Representation Benchmark for Universal Evaluation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LyricWhiz: Robust Multilingual Zero-Shot Lyrics Transcription by Whispering to ChatGPT.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

On the Effectiveness of Speech Self-Supervised Learning for Music.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Unifying Discrete and Continuous Representations for Unsupervised Paraphrase Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning.
CoRR, 2022

Bidirectional Learning for Offline Infinite-width Model-based Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MentalBERT: Publicly Available Pretrained Language Models for Mental Healthcare.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

HERB: Measuring Hierarchical Regional Bias in Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

Biological Sequence Design with GFlowNets.
Proceedings of the International Conference on Machine Learning, 2022

Unifying Likelihood-free Inference with Black-box Optimization and Beyond.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Reconciliation of Pre-trained Models and Prototypical Neural Networks in Few-shot Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Learning Multi-Objective Curricula for Robotic Policy Learning.
Proceedings of the Conference on Robot Learning, 2022

1Cademy @ Causal News Corpus 2022: Leveraging Self-Training in Causality Classification of Socio-Political Event Data.
Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2022

1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position Selector.
Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2022

2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey.
CoRR, 2021

Unifying Likelihood-free Inference with Black-box Sequence Design and Beyond.
CoRR, 2021

Learning Multi-Objective Curricula for Deep Reinforcement Learning.
CoRR, 2021

Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with 1/n Parameters.
Proceedings of the 9th International Conference on Learning Representations, 2021

FloW: A Dataset and Benchmark for Floating Waste Detection in Inland Waters.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

On Orthogonality Constraints for Transformers.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

GLGE: A New General Language Generation Evaluation Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Role-Wise Data Augmentation for Knowledge Distillation.
CoRR, 2020

Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation.
CoRR, 2020

Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Interactive Machine Comprehension with Information Seeking Agents.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Would you Rather? A New Benchmark for Learning Machine Alignment with Cultural Values and Social Preferences.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

RikiNet: Reading Wikipedia Pages for Natural Question Answering.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Revision in Continuous Space: Unsupervised Text Style Transfer without Adversarial Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
BFGAN: Backward and Forward Generative Adversarial Networks for Lexically Constrained Sentence Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Learning Sparse Mixture of Experts for Visual Question Answering.
CoRR, 2019

Exploring Domain Shift in Extractive Text Summarization.
CoRR, 2019

Conditional Computation for Continual Learning.
CoRR, 2019

Revision in Continuous Space: Fine-Grained Control of Text Style Transfer.
CoRR, 2019

Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks.
CoRR, 2019

Interactive Language Learning by Question Answering.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Structure Learning for Neural Module Networks.
Proceedings of the Beyond Vision and LANguage: inTEgrating Real-world kNowledge, 2019

Dataflow-Based Joint Quantization for Deep Neural Networks.
Proceedings of the Data Compression Conference, 2019

Graph Neural Networks with Generated Parameters for Relation Extraction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

TIGS: An Inference Algorithm for Text Infilling with Gradient Search.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Learning Multi-Task Communication with Message Passing for Sequence Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Multi-task Learning over Graph Structures.
CoRR, 2018

2016
DrMAD: Distilling Reverse-Mode Automatic Differentiation for Optimizing Hyperparameters of Deep Neural Networks.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
AffectiveSpace 2: Enabling Affective Intuition for Concept-Level Sentiment Analysis.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015


  Loading...