Liang Li

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Stochastic Context Consistency Reasoning for Domain Adaptive Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Generating High-Quality Symbolic Music Using Fine-Grained Discriminators.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition - 27th International Conference, 2024

ASQuery: A Query-based Model for Action Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image Generation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Distractors-Immune Representation Learning with Cross-Modal Contrastive Regularization for Change Captioning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SynSP: Synergy of Smoothness and Precision in Pose Sequences Refinement.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Context-aware Difference Distilling for Multi-change Captioning.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Norm Tweaking: High-Performance Low-Bit Quantization of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Make RepVGG Greater Again: A Quantization-Aware Approach.

[BibT_eX]

[DOI]

Xiangxiang Chu

Bo Zhang

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Semantic and Relation Modulation for Audio-Visual Event Localization.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Entity-Enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Weakly Supervised Text-based Actor-Action Video Segmentation by Clip-level Multi-instance Learning.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., January, 2023

I3N: Intra- and Inter-Representation Interaction Network for Change Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Neighborhood Contrastive Transformer for Change Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Viewpoint Alignment and Discriminative Parts Enhancement in 3D Space for Vehicle ReID.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Viewpoint-Adaptive Representation Disentanglement Network for Change Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

A Speed Odyssey for Deployable Quantization of LLMs.

[BibT_eX]

[DOI]

CoRR, 2023

FPTQ: Fine-grained Post-Training Quantization for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Reducing Intrinsic and Extrinsic Data Biases for Moment Localization with Natural Language.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

MaTCR: Modality-Aligned Thought Chain Reasoning for Multimodal Task-Oriented Dialogue Generation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Dynamic Contrastive Learning with Pseudo-samples Intervention for Weakly Supervised Joint Video MR and HD.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Self-supervised Cross-view Representation Reconstruction for Change Captioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Text-Driven Generative Domain Adaptation with Spectral Consistency Regularization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Decoupling-and-Aggregating for Image Exposure Correction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Dub Movies via Hierarchical Prosody Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Age-Invariant Face Recognition by Multi-Feature Fusionand Decomposition with Self-attention.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2022

I<sup>2</sup>Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Long Short-Term Relation Transformer With Global Gating for Video Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Task-Adaptive Attention for Image Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Syntax-Guided Hierarchical Attention Network for Video Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Bidirectional difference locating and semantic consistency reasoning for change captioning.

[BibT_eX]

[DOI]

Int. J. Intell. Syst., 2022

Learning Degradation-Invariant Representation for Robust Real-World Person Re-Identification.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2022

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications.

[BibT_eX]

[DOI]

CoRR, 2022

LS-GAN: Iterative Language-based Image Manipulation via Long and Short Term Consistency Reasoning.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Think Beyond Words: Exploring Context-Relevant Visual Commonsense for Diverse Dialogue Generation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Few Shot Generative Model Adaption via Relaxed Spatial Structural Alignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Automatic Relation-aware Graph Network Proliferation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unsupervised Coherent Video Cartoonization with Perceptual Motion Consistency.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Debiased Batch Normalization via Gaussian Process for Generalizable Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Graph Regularized Encoder-Decoder Networks for Image Representation Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Local-binarized very deep residual network for visual categorization.

[BibT_eX]

[DOI]

Neurocomputing, 2021

Cross-modal semantic correlation learning by Bi-CNN network.

[BibT_eX]

[DOI]

IET Image Process., 2021

Calibrated Feature Decomposition for Generalizable Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2021

R<sup>3</sup>Net: Relation-embedded Representation Reconstruction Network for Change Captioning.

[BibT_eX]

[DOI]

CoRR, 2021

Edge-featured Graph Neural Architecture Search.

[BibT_eX]

[DOI]

CoRR, 2021

Multi-Modulation Network for Audio-Visual Event Localization.

[BibT_eX]

[DOI]

CoRR, 2021

Fast Batch Nuclear-norm Maximization and Minimization for Robust Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2021

Rethinking Graph Neural Network Search from Message-passing.

[BibT_eX]

[DOI]

CoRR, 2021

Heuristic Depth Estimation with Progressive Depth Reconstruction and Confidence-Aware Loss.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

R\^3Net: Relation-embedded Representation Reconstruction Network for Change Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Structured Multi-Level Interaction Network for Video Moment Localization via Language Query.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Rethinking Graph Neural Architecture Search From Message-Passing.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Semantic Relation-aware Difference Representation Learning for Change Captioning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Enabling 5G: sentimental image dominant graph topic model for cross-modality topic detection.

[BibT_eX]

[DOI]

Wirel. Networks, 2020

Learning salient features to prevent model drift for correlation tracking.

[BibT_eX]

[DOI]

Neurocomputing, 2020

Two-stream deep sparse network for accurate and efficient image restoration.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2020

Anchor-Free One-Stage Online Multi-object Tracking.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - Third Chinese Conference, 2020

Structural Semantic Adversarial Active Learning for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multi-Features Fusion and Decomposition for Age-Invariant Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Fine-grained Feature Alignment with Part Perspective Transformation for Vehicle ReID.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Transferrable Referring Expression Grounding with Concept Transfer and Context Inheritance.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Diverter-Guider Recurrent Network for Diverse Poems Generation from Image.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Structured Latent Variable Recurrent Network With Stochastic Attention For Generating Weibo Comments.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

State-Relabeling Adversarial Active Learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Real-World Person Re-Identification via Degradation Invariance Learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Towards Discriminability and Diversity: Batch Nuclear-Norm Maximization Under Label Insufficient Situations.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

SkeletonNet: A Hybrid Network With a Skeleton-Embedding Process for Multi-View Image Representation Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

Cross-Modality Bridging and Knowledge Transferring for Image Understanding.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

Image classification base on PCA of multi-view deep representation.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2019

Regularized topic-aware latent influence propagation in dynamic relational networks.

[BibT_eX]

[DOI]

GeoInformatica, 2019

Image Classification base on PCA of Multi-view Deep Representation.

[BibT_eX]

[DOI]

CoRR, 2019

Active Perception Network for Salient Object Detection.

[BibT_eX]

[DOI]

Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Training Efficient Saliency Prediction Models with Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Structured Stochastic Recurrent Network for Linguistic Video Prediction.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Two-Stream Sparse Network for Accurate Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018

Object Categorization Using Class-Specific Representations.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2018

A two-step approach to describing web topics via probable keywords and prototype images from background-removed similarities.

[BibT_eX]

[DOI]

Neurocomputing, 2018

Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Saliency-Based Spatiotemporal Attention for Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Reverse Densely Connected Feature Pyramid Network for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2018, 2018

2017

Fine-Grained Image Classification via Low-Rank Sparse Coding With General and Class-Specific Codebooks.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2017

Three-dimensional laser scanning under the pinhole camera with lens distortion.

[BibT_eX]

[DOI]

Binbin Lv

Chenggang Yan

Mach. Vis. Appl., 2017

Guest Editorial: Knowledge-Based Multimedia Computing.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2017

EvoPass: Evolvable graphical password against shoulder-surfing attacks.

[BibT_eX]

[DOI]

Comput. Secur., 2017

Dependency Exploitation: A Unified CNN-RNN Approach for Visual Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Cross-media retrieval with semantics clustering and enhancement.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Stereoscopic visualization of 3D model using OpenGL.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Online multi-target tracking via depth range segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Image classification by principal component analysis of multi-channel deep feature.

[BibT_eX]

[DOI]

Ping Wang

Chenggang Yan

Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Virtual reality realization technology and its application based on augmented reality.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

A panoramic survey method based on gesture recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Multi-view Subspace Learning with Diversity Enforced Skeleton Embedding.

[BibT_eX]

[DOI]

Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

2016

Efficient virtual network transmission using correlated equilibrium on Xen-based platform.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2016

Distributed image understanding with semantic dictionary and semantic expansion.

[BibT_eX]

[DOI]

Chenggang Clarence Yan

Neurocomputing, 2016

Robust latent poisson deconvolution from multiple imperfect features for web topic detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

2015

A Parallel Algorithm for Game Tree Search Using GPGPU.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2015

Polysemious visual representation based on feature aggregation for large scale image applications.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2015

LSH-based semantic dictionary learning for large scale image understanding.

[BibT_eX]

[DOI]

Chenggang Clarence Yan

J. Vis. Commun. Image Represent., 2015

Joint image representation and classification in random semantic spaces.

[BibT_eX]

[DOI]

Neurocomputing, 2015

Cross-media Topic Detection with Refined CNN based Image-Dominant Topic Model.

[BibT_eX]

[DOI]

Zhiyi Wang

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Image-regulated graph topic model for cross-media topic detection.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

2014

A Highly Parallel Framework for HEVC Coding Unit Partitioning Tree Decision on Many-core Processors.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2014

Fusing multi-cues description for partial-duplicate image retrieval.

[BibT_eX]

[DOI]

Chenggang Clarence Yan

J. Vis. Commun. Image Represent., 2014

Optimizing the Join Operation on Hive to Accelerate Cross-Matching in Astronomy.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Large scale image understanding with non-convex multi-task learning.

[BibT_eX]

[DOI]

Proceedings of the 2014 5th International Conference on Game Theory for Networks, 2014

2013

Partial-Duplicate Image Retrieval via Saliency-Guided Visual Matching.

[BibT_eX]

[DOI]

IEEE Multim., 2013

Efficient Parallel Framework for HEVC Deblocking Filter on Many-Core Platform.

[BibT_eX]

[DOI]

Proceedings of the 2013 Data Compression Conference, 2013

Highly Parallel Framework for HEVC Motion Estimation on Many-Core Platform.

[BibT_eX]

[DOI]

Proceedings of the 2013 Data Compression Conference, 2013

Time evolving graphical password for securing mobile devices.

[BibT_eX]

[DOI]

Zhan Wang

Jiwu Jing

Proceedings of the 8th ACM Symposium on Information, Computer and Communications Security, 2013

2012

Learning Hierarchical Semantic Description Via Mixed-Norm Regularization for Image Understanding.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2012

Nonlocal Means-Based Denoising for Medical Images.

[BibT_eX]

[DOI]

Ke Lu

Ning He

Comput. Math. Methods Medicine, 2012

A Node-based Parallel Game Tree Algorithm Using GPUs.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

2011

Matching Content-based Saliency Regions for partial-duplicate image retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Online Vicept learning for web-scale image understanding.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Learning image Vicept description via mixed-norm regularization for large scale semantic image search.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

Vicept: link visual features to concepts for large-scale image understanding.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Adding Affine Invariant Geometric Constraint for Partial-Duplicate Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

Multi-description of local interest point for partial-duplicate image retrieval.

[BibT_eX]

[DOI]