Liang Li

ORCID: 0000-0002-1943-8219

  • Chinese Academy of Sciences, Institute of Computing Technology, Key Laboratory of Intelligent Information Processing, Beijing, China
  • University of Chinese Academy of Sciences, School of Computer and Control Engineering, Beijing, China (2013 - 2015)

According to our database, Liang Li authored at least 130 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.




Context Disentangling and Prototype Inheriting for Robust Visual Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Progressive Depth Decoupling and Modulating for Flexible Depth Completion.
IEEE Trans. Instrum. Meas., 2024

SMART: Syntax-Calibrated Multi-Aspect Relation Transformer for Change Captioning.
IEEE Trans. Pattern Anal. Mach. Intell., 2024

Generating High-quality Symbolic Music Using Fine-grained Discriminators.
CoRR, 2024

Cool-Fusion: Fuse Large Language Models without Training.
CoRR, 2024

Downstream-Pretext Domain Knowledge Traceback for Active Learning.
CoRR, 2024

Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning.
CoRR, 2024

Technique Report of CVPR 2024 PBDL Challenges.
CoRR, 2024

Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization.
CoRR, 2024

LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Context-aware Difference Distilling for Multi-change Captioning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Norm Tweaking: High-Performance Low-Bit Quantization of Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Make RepVGG Greater Again: A Quantization-Aware Approach.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Semantic and Relation Modulation for Audio-Visual Event Localization.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Entity-Enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Weakly Supervised Text-based Actor-Action Video Segmentation by Clip-level Multi-instance Learning.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

I3N: Intra- and Inter-Representation Interaction Network for Change Captioning.
IEEE Trans. Multim., 2023

Neighborhood Contrastive Transformer for Change Captioning.
IEEE Trans. Multim., 2023

Viewpoint Alignment and Discriminative Parts Enhancement in 3D Space for Vehicle ReID.
IEEE Trans. Multim., 2023

Viewpoint-Adaptive Representation Disentanglement Network for Change Captioning.
IEEE Trans. Image Process., 2023

A Speed Odyssey for Deployable Quantization of LLMs.
CoRR, 2023

FPTQ: Fine-grained Post-Training Quantization for Large Language Models.
CoRR, 2023

Reducing Intrinsic and Extrinsic Data Biases for Moment Localization with Natural Language.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MaTCR: Modality-Aligned Thought Chain Reasoning for Multimodal Task-Oriented Dialogue Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Dynamic Contrastive Learning with Pseudo-samples Intervention for Weakly Supervised Joint Video MR and HD.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Self-supervised Cross-view Representation Reconstruction for Change Captioning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Text-Driven Generative Domain Adaptation with Spectral Consistency Regularization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Decoupling-and-Aggregating for Image Exposure Correction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Dub Movies via Hierarchical Prosody Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Age-Invariant Face Recognition by Multi-Feature Fusion and Decomposition with Self-attention.
ACM Trans. Multim. Comput. Commun. Appl., 2022

I²Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning.
IEEE Trans. Image Process., 2022

Long Short-Term Relation Transformer With Global Gating for Video Captioning.
IEEE Trans. Image Process., 2022

CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement.
IEEE Trans. Circuits Syst. Video Technol., 2022

Task-Adaptive Attention for Image Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2022

Syntax-Guided Hierarchical Attention Network for Video Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2022

Bidirectional difference locating and semantic consistency reasoning for change captioning.
Int. J. Intell. Syst., 2022

Learning Degradation-Invariant Representation for Robust Real-World Person Re-Identification.
Int. J. Comput. Vis., 2022

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications.
CoRR, 2022

LS-GAN: Iterative Language-based Image Manipulation via Long and Short Term Consistency Reasoning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Think Beyond Words: Exploring Context-Relevant Visual Commonsense for Diverse Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Few Shot Generative Model Adaption via Relaxed Spatial Structural Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Automatic Relation-aware Graph Network Proliferation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unsupervised Coherent Video Cartoonization with Perceptual Motion Consistency.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Debiased Batch Normalization via Gaussian Process for Generalizable Person Re-identification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-identification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Graph Regularized Encoder-Decoder Networks for Image Representation Learning.
IEEE Trans. Multim., 2021

Local-binarized very deep residual network for visual categorization.
Neurocomputing, 2021

Cross-modal semantic correlation learning by Bi-CNN network.
IET Image Process., 2021

Calibrated Feature Decomposition for Generalizable Person Re-Identification.
CoRR, 2021

R³Net: Relation-embedded Representation Reconstruction Network for Change Captioning.
CoRR, 2021

Edge-featured Graph Neural Architecture Search.
CoRR, 2021

Multi-Modulation Network for Audio-Visual Event Localization.
CoRR, 2021

Fast Batch Nuclear-norm Maximization and Minimization for Robust Domain Adaptation.
CoRR, 2021

Rethinking Graph Neural Network Search from Message-passing.
CoRR, 2021

Heuristic Depth Estimation with Progressive Depth Reconstruction and Confidence-Aware Loss.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

R³Net: Relation-embedded Representation Reconstruction Network for Change Captioning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Structured Multi-Level Interaction Network for Video Moment Localization via Language Query.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Rethinking Graph Neural Architecture Search From Message-Passing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Semantic Relation-aware Difference Representation Learning for Change Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Enabling 5G: sentimental image dominant graph topic model for cross-modality topic detection.
Wirel. Networks, 2020

Learning salient features to prevent model drift for correlation tracking.
Neurocomputing, 2020

Two-stream deep sparse network for accurate and efficient image restoration.
Comput. Vis. Image Underst., 2020

Structural Semantic Adversarial Active Learning for Image Captioning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multi-Features Fusion and Decomposition for Age-Invariant Face Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Fine-grained Feature Alignment with Part Perspective Transformation for Vehicle ReID.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Transferrable Referring Expression Grounding with Concept Transfer and Context Inheritance.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Diverter-Guider Recurrent Network for Diverse Poems Generation from Image.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Structured Latent Variable Recurrent Network With Stochastic Attention For Generating Weibo Comments.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

State-Relabeling Adversarial Active Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Real-World Person Re-Identification via Degradation Invariance Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Towards Discriminability and Diversity: Batch Nuclear-Norm Maximization Under Label Insufficient Situations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

SkeletonNet: A Hybrid Network With a Skeleton-Embedding Process for Multi-View Image Representation Learning.
IEEE Trans. Multim., 2019

Cross-Modality Bridging and Knowledge Transferring for Image Understanding.
IEEE Trans. Multim., 2019

Image classification base on PCA of multi-view deep representation.
J. Vis. Commun. Image Represent., 2019

Regularized topic-aware latent influence propagation in dynamic relational networks.
GeoInformatica, 2019

Image Classification base on PCA of Multi-view Deep Representation.
CoRR, 2019

Active Perception Network for Salient Object Detection.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019

Training Efficient Saliency Prediction Models with Knowledge Distillation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Structured Stochastic Recurrent Network for Linguistic Video Prediction.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Two-Stream Sparse Network for Accurate Image Super-Resolution.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Object Categorization Using Class-Specific Representations.
IEEE Trans. Neural Networks Learn. Syst., 2018

A two-step approach to describing web topics via probable keywords and prototype images from background-removed similarities.
Neurocomputing, 2018

Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Saliency-Based Spatiotemporal Attention for Video Captioning.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Reverse Densely Connected Feature Pyramid Network for Object Detection.
Proceedings of the Computer Vision - ACCV 2018, 2018

Fine-Grained Image Classification via Low-Rank Sparse Coding With General and Class-Specific Codebooks.
IEEE Trans. Neural Networks Learn. Syst., 2017

Three-dimensional laser scanning under the pinhole camera with lens distortion.
Mach. Vis. Appl., 2017

Guest Editorial: Knowledge-Based Multimedia Computing.
Multim. Tools Appl., 2017

EvoPass: Evolvable graphical password against shoulder-surfing attacks.
Comput. Secur., 2017

Dependency Exploitation: A Unified CNN-RNN Approach for Visual Emotion Recognition.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Cross-media retrieval with semantics clustering and enhancement.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Stereoscopic visualization of 3D model using OpenGL.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Online multi-target tracking via depth range segmentation.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Image classification by principal component analysis of multi-channel deep feature.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Virtual reality realization technology and its application based on augmented reality.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

A panoramic survey method based on gesture recognition.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Multi-view Subspace Learning with Diversity Enforced Skeleton Embedding.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Efficient virtual network transmission using correlated equilibrium on Xen-based platform.
J. Vis. Commun. Image Represent., 2016

Distributed image understanding with semantic dictionary and semantic expansion.
Neurocomputing, 2016

Robust latent poisson deconvolution from multiple imperfect features for web topic detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

A Parallel Algorithm for Game Tree Search Using GPGPU.
IEEE Trans. Parallel Distributed Syst., 2015

Polysemious visual representation based on feature aggregation for large scale image applications.
Multim. Tools Appl., 2015

LSH-based semantic dictionary learning for large scale image understanding.
J. Vis. Commun. Image Represent., 2015

Joint image representation and classification in random semantic spaces.
Neurocomputing, 2015

Cross-media Topic Detection with Refined CNN based Image-Dominant Topic Model.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Image-regulated graph topic model for cross-media topic detection.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

A Highly Parallel Framework for HEVC Coding Unit Partitioning Tree Decision on Many-core Processors.
IEEE Signal Process. Lett., 2014

Fusing multi-cues description for partial-duplicate image retrieval.
J. Vis. Commun. Image Represent., 2014

Optimizing the Join Operation on Hive to Accelerate Cross-Matching in Astronomy.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Large scale image understanding with non-convex multi-task learning.
Proceedings of the 2014 5th International Conference on Game Theory for Networks, 2014

Partial-Duplicate Image Retrieval via Saliency-Guided Visual Matching.
IEEE Multim., 2013

Efficient Parallel Framework for HEVC Deblocking Filter on Many-Core Platform.
Proceedings of the 2013 Data Compression Conference, 2013

Highly Parallel Framework for HEVC Motion Estimation on Many-Core Platform.
Proceedings of the 2013 Data Compression Conference, 2013

Time evolving graphical password for securing mobile devices.
Proceedings of the 8th ACM Symposium on Information, Computer and Communications Security, 2013

Learning Hierarchical Semantic Description Via Mixed-Norm Regularization for Image Understanding.
IEEE Trans. Multim., 2012

Nonlocal Means-Based Denoising for Medical Images.
Comput. Math. Methods Medicine, 2012

A Node-based Parallel Game Tree Algorithm Using GPUs.
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

Matching Content-based Saliency Regions for partial-duplicate image retrieval.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Online Vicept learning for web-scale image understanding.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Learning image Vicept description via mixed-norm regularization for large scale semantic image search.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Vicept: link visual features to concepts for large-scale image understanding.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Adding Affine Invariant Geometric Constraint for Partial-Duplicate Image Retrieval.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Multi-description of local interest point for partial-duplicate image retrieval.
Proceedings of the International Conference on Image Processing, 2010
