Lifeng Shang

According to our database1, Lifeng Shang authored at least 119 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis.
CoRR, 2024

Subtle Errors Matter: Preference Learning via Error-injected Self-editing.
CoRR, 2024

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References.
CoRR, 2024

Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape.
CoRR, 2024

ToolACE: Winning the Points of LLM Function Calling.
CoRR, 2024

Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization.
CoRR, 2024

Chain-of-Probe: Examing the Necessity and Accuracy of CoT Step-by-Step.
CoRR, 2024

Evaluating the External and Parametric Knowledge Fusion of Large Language Models.
CoRR, 2024

Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer.
CoRR, 2024

YODA: Teacher-Student Progressive Learning for Language Models.
CoRR, 2024

PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models.
CoRR, 2024

Visually Guided Generative Text-Layout Pre-training for Document Intelligence.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Retrieval-based Disentangled Representation Learning with Natural Language Supervision.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Does the Generator Mind Its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

ProxyQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Prompt-Based Length Controlled Generation with Multiple Control Types.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Learning to Edit: Aligning LLMs with Knowledge Editing.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Preparing Lessons for Progressive Training on Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Cooperative Game Modeling With Weighted Token-Level Alignment for Audio-Text Retrieval.
IEEE Signal Process. Lett., 2023

Data Management For Large Language Models: A Survey.
CoRR, 2023

M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models.
CoRR, 2023

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis.
CoRR, 2023

Exploring the Usage of Chinese Pinyin in Pretraining.
CoRR, 2023

SELF: Language-Driven Self-Evolution for Large Language Model.
CoRR, 2023

Prompt-Based Length Controlled Generation with Reinforcement Learning.
CoRR, 2023

Aligning Large Language Models with Human: A Survey.
CoRR, 2023

Enhancing Coherence of Extractive Summarization with Multitask Learning.
CoRR, 2023

Study of a Random Warranty Model Maintaining Fairness and a Random Replacement Next Model Sustaining Post-Warranty Reliability.
Axioms, 2023

Reusing Pretrained Models by Multi-linear Operators for Efficient Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Gradually Excavating External Knowledge for Implicit Complex Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Retrieval-free Knowledge Injection through Multi-Document Traversal for Dialogue Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

NewsDialogues: Towards Proactive News Grounded Conversation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

mCLIP: Multilingual CLIP via Cross-lingual Transfer.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Self-Supervised Logic Induction for Explainable Fuzzy Temporal Commonsense Reasoning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Retrieval-based Disentanglement with Distant Supervision.
CoRR, 2022

PANGUBOT: Efficient Generative Dialogue Pre-training from Pre-trained Language Model.
CoRR, 2022

Towards Efficient Post-training Quantization of Pre-trained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploring extreme parameter compression for pre-trained language models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Pre-training Language Models with Deterministic Factual Knowledge.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

LiteVL: Efficient Video-Language Learning with Enhanced Spatial-Temporal Modeling.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Compression of Generative Pre-trained Language Models via Quantization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MINER: Multi-Interest Matching Network for News Recommendation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

How Pre-trained Language Models Capture Factual Knowledge? A Causal-Inspired Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Controlled Text Generation Using Dictionary Prior in Variational Autoencoders.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

bert2BERT: Towards Reusable Pretrained Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MTRec: Multi-Task Learning over BERT for News Recommendation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Read before Generate! Faithful Long Form Question Answering with Machine Reading.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Improving task-agnostic BERT distillation with layer mapping search.
Neurocomputing, 2021

Integrating Regular Expressions with Neural Networks via DFA.
CoRR, 2021

Improved OOD Generalization via Adversarial Training and Pre-training.
CoRR, 2021

LightMBERT: A Simple Yet Effective Method for Multilingual BERT Distillation.
CoRR, 2021

Non-invasive Self-attention for Side Information Fusion in Sequential Recommendation.
CoRR, 2021

Dual Sequence Transformer for Query-based Interactive Recommendation.
Proceedings of the 22nd IEEE International Conference on Mobile Data Management, 2021

Improved OOD Generalization via Adversarial Training and Pretraing.
Proceedings of the 38th International Conference on Machine Learning, 2021

Reweighting Augmented Samples by Minimizing the Maximal Expected Loss.
Proceedings of the 9th International Conference on Learning Representations, 2021

On Position Embeddings in BERT.
Proceedings of the 9th International Conference on Learning Representations, 2021

Extract then Distill: Efficient and Effective Task-Agnostic BERT Distillation.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

DyLex: Incorporating Dynamic Lexicons into BERT for Sequence Labeling.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Generate & Rank: A Multi-task Framework for Math Word Problems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Improving Unsupervised Question Answering via Summarization-Informed Question Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

GhostBERT: Generate More Features with Cheap Operations for BERT.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

BinaryBERT: Pushing the Limit of BERT Quantization.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Noninvasive Self-attention for Side Information Fusion in Sequential Recommendation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

HopRetriever: Retrieve Hops over Wikipedia to Answer Complex Questions.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
BinaryBERT: Pushing the Limit of BERT Quantization.
CoRR, 2020

SparTerm: Learning Term-based Sparse Representation for Fast Text Retrieval.
CoRR, 2020

DynaBERT: Dynamic BERT with Adaptive Width and Depth.
CoRR, 2020

DynaBERT: Dynamic BERT with Adaptive Width and Depth.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Neural Subgraph Isomorphism Counting.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

An Investigation of Few-Shot Learning in Spoken Term Classification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

TernaryBERT: Distillation-aware Ultra-low Bit BERT.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TinyBERT: Distilling BERT for Natural Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Enriching Large-Scale Eventuality Knowledge Graph with Entailment Relations.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Dialog State Tracking with Reinforced Data Augmentation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Decomposable Neural Paraphrase Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Towards Automatic Evaluation of Customer-Helpdesk Dialogues.
J. Inf. Process., 2018

Meta Learning for Few-shot Keyword Spotting.
CoRR, 2018

Paraphrase Generation with Deep Reinforcement Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Test Collections and Measures for Evaluating Customer-Helpdesk Dialogues.
Proceedings of the 8th International Workshop on Evaluating Information Access co-located with the 13th NTCIR Conference on the Evaluation of Information Access Technologies (NTCIR 2017), 2017

Overview of the NTCIR-13 Short Text Conversation Task.
Proceedings of the 13th NTCIR Conference, 2017

Neural Machine Translation with Reconstruction.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Overview of the NTCIR-12 Short Text Conversation Task.
Proceedings of the 12th NTCIR Conference on Evaluation of Information Access Technologies, 2016

On Estimating Variances for Topic Set Size Design.
Proceedings of the Seventh International Workshop on Evaluating Information Access, 2016

Neural Generative Question Answering.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
Multimodal Convolutional Neural Networks for Matching Image and Sentence.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Reliability concerns on time-to-digital converter due to bias temperature instability in nanometer era.
Proceedings of the 2015 IEEE 11th International Conference on ASIC, 2015

Topic Set Size Design with the Evaluation Measures for Short Text Conversation.
Proceedings of the Information Retrieval Technology, 2015

Neural Responding Machine for Short-Text Conversation.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Spatial temporal pyramid matching using temporal sparse representation for human motion retrieval.
Vis. Comput., 2014

A Robust Likelihood Function for 3D Human Pose Tracking.
IEEE Trans. Image Process., 2014

Human motion variation synthesis with multivariate Gaussian processes.
Comput. Animat. Virtual Worlds, 2014

2013
On Approximate Inference for Generalized Gaussian Process Models.
CoRR, 2013

2012
Facial expression analysis with graphical models
PhD thesis, 2012

Mode Seeking with an Adaptive Distance Measure.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

2011
DTTM: A Discriminative Temporal Topic Model for Facial Expression Recognition.
Proceedings of the Advances in Visual Computing - 7th International Symposium, 2011

2010
Real-time large scale near-duplicate web video retrieval.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

A Temporal Latent Topic Model for Facial Expression Recognition.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
Constrained ZIP code segmentation by a PCNN-based thinning algorithm.
Neurocomputing, 2009

Nonparametric discriminant HMM and application to facial expression recognition.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Collaborative resource discovery in social tagging systems.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
An improved pulse coupled neural network for image processing.
Neural Comput. Appl., 2008

Temporal Exemplar-Based Bayesian Networks for Facial Expression Recognition.
Proceedings of the Seventh International Conference on Machine Learning and Applications, 2008

2007
Binary Fingerprint Image Thinning Using Template-Based PCNNs.
IEEE Trans. Syst. Man Cybern. Part B, 2007

Binary Image Thinning Using Autowaves Generated by PCNN.
Neural Process. Lett., 2007

A class of binary images thinning using two PCNNs.
Neurocomputing, 2007

2006
Rigid medical image registration using PCA neural network.
Neurocomputing, 2006


  Loading...